Flexible, Free and Open Data-Driven Learning for the Masses
Alannah Fitzgerald
http://maxpixel.freegreatpicture.com/photo-1742679

Upload: alannah-fitzgerald

Post on 11-Apr-2017


TRANSCRIPT

Page 1: Flexible, Free and Open Data-Driven Learning for the Masses

Flexible, Free and Open Data-Driven Learning for the Masses

Alannah Fitzgerald
http://maxpixel.freegreatpicture.com/photo-1742679

Page 2: Flexible, Free and Open Data-Driven Learning for the Masses

MINING & LINKING OPEN CONTENT FOR DATA DRIVEN LEARNING

FLAX Language Digital Library Project, University of Waikato, NZ

Page 3: Flexible, Free and Open Data-Driven Learning for the Masses

Data-Driven Learning

The metaphor that Johns evoked was one where language is treated as empirical data and “every student is a Sherlock Holmes”, investigating the uses of linguistic data directly to assist with language acquisition (Johns, 2002, p. 108).

Page 4: Flexible, Free and Open Data-Driven Learning for the Masses

flax.nzdl.org Powerful yet simple interfaces for Data-Driven Learning

Page 5: Flexible, Free and Open Data-Driven Learning for the Masses

The eBook of FLAX

“FLAX (Flexible Language Acquisition) is both a vision and a tool that you can use for language learning. The Web contains innumerable language activities, quizzes, and games, but they are fixed: the activities are cast in stone and the material is chosen by others. Our vision is to put the control back where it belongs, in the hands of teachers and learners.”

Page 6: Flexible, Free and Open Data-Driven Learning for the Masses

WHO ARE WE IN THIS FLAX RESEARCH & DEVELOPMENT COLLABORATION?

Page 7: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Language at Waikato University

http://flax.nzdl.org
FLAX image reused by permission of Jane Galloway for non-commercial use

Page 8: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Language Project at the Greenstone Digital Library Lab, Waikato University NZ

Professor Ian Witten, FLAX Project Lead

Dr Shaoqun Wu, FLAX Project Lead Researcher & Developer

Page 9: Flexible, Free and Open Data-Driven Learning for the Masses

Research on Open FLAX Collections

http://oerresearchhub.org/

Alannah Fitzgerald, Open Fellow with OERRH, FLAX Language & Open Education Researcher

Page 10: Flexible, Free and Open Data-Driven Learning for the Masses

OPEN SOURCE LANGUAGE TOOLS DEVELOPMENT

Page 11: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Digital Library

Collections

Collocations database

Glossary

Open Educational Resources

Page 12: Flexible, Free and Open Data-Driven Learning for the Masses

Contemporary English (Wikipedia)

Page 13: Flexible, Free and Open Data-Driven Learning for the Masses

Google-esque Interface Designs

Designed for the non-expert corpus user, namely: learners, teachers, subject academics, instructional designers and language resource developers.

http://flax.nzdl.org/greenstone3/flax?a=fp&sa=collAbout&c=collocations&if=flax

Page 15: Flexible, Free and Open Data-Driven Learning for the Masses

Introducing the Wikipedia Miner Toolkit (Milne & Witten, 2013)

Page 16: Flexible, Free and Open Data-Driven Learning for the Masses

Building Interactivity into FLAX Language Collections

Page 17: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Activities Continued

Page 18: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX TEAM Apps for Android via GooglePlay

http://commons.wikimedia.org/wiki/File:Android_robot_skateboarding.svg
http://commons.wikimedia.org/wiki/File:Google_Play_Store.svg

Page 19: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Team on Google Play

https://play.google.com/store/apps/developer?id=FLAX+TEAM&hl=en

Page 20: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Across Platforms

• FLAX Website flax.nzdl.org for hosting open online language collections
• Building directly onto the Web with OER

• FLAX multilingual open-source software for download
• Set up your own FLAX server online, or
• Build collections offline for use on your PC

• FLAX Android app for download
• Interact with game-based FLAX collections while on the go

• FLAX for MOODLE plug-in for download
• FLAX for MOOC Platforms?
• FLAX in conjunction with translation technologies?

Page 21: Flexible, Free and Open Data-Driven Learning for the Masses

DOMAIN-SPECIFIC OPEN LANGUAGE COLLECTIONS BUILDING

Page 22: Flexible, Free and Open Data-Driven Learning for the Masses

The eBook of FLAX

“FLAX enables teachers to build bespoke libraries very easily. It is built upon powerful digital library technology, and provides access to vast linguistic resources containing countless examples of actual, authentic, usage in contemporary text. But teachers can also build collections using their own material, focusing on language learning in a particular domain (e.g., business, law) or motivating students by using text from a particular context (e.g., country or region, common interests).”

Page 23: Flexible, Free and Open Data-Driven Learning for the Masses

RESEARCH INTO MOOC LINGUISTIC SUPPORT

Page 24: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX Academic English Collections

http://flax.nzdl.org/greenstone3/flax?a=fp&sa=library

Page 25: Flexible, Free and Open Data-Driven Learning for the Masses

MOOC Research Participants

• CopyrightX (Harvard University, formerly an edX MOOC, now a networked course)
• ContractsX (Harvard University with edX)
• English Common Law (University of London with Coursera)

Page 26: Flexible, Free and Open Data-Driven Learning for the Masses

Role and courses taken by respondents 2014-2016

Page 27: Flexible, Free and Open Data-Driven Learning for the Masses

Age bands of respondents

Page 28: Flexible, Free and Open Data-Driven Learning for the Masses

Educational background of respondents

Page 29: Flexible, Free and Open Data-Driven Learning for the Masses

Languages spoken by MOOC learners

• English (95.71%), followed by increasingly smaller numbers of participants who identified as being able to speak fluent:

• Spanish (16.56%), French (12.88%), German (8.59%), Italian (7.98%), Catalan (3.0%), Chinese, Finnish, Gujarati, Swahili (1.84%), French Creole, Hindi, Japanese, Korean, Luo, Norwegian, Portuguese, Russian, Serbian (1.23%), Arabic, Georgian, Slovak, Thai, Turkish, Ukrainian, Urdu, Vietnamese (0.61%).

Page 30: Flexible, Free and Open Data-Driven Learning for the Masses

“When you want to find out how to express something in English what resource(s) do you use? You can select more than one.”

Language Resources | Informal learners (N=163) | CopyrightX teachers (N=11)
Paper-based dictionaries | 18.40% | 18.18%
Online dictionaries | 76.07% | 100.00%
Online reference resources (e.g. Wikipedia) | 52.15% | 81.82%
Search engines (e.g. Google, using inverted commas "" and asterisks * to search for keywords/phrases for language use) | 57.67% | 100.00%
Corpora / searchable web-based language collections (e.g. FLAX, WebCorp) | 7.98% | 0.00%
Grammar books | 11.66% | 9.09%
Language course books | 1.84% | 9.09%
Ask someone | 31.90% | 27.27%
Need nothing | 2.45% | 0.00%

Page 31: Flexible, Free and Open Data-Driven Learning for the Masses

Keyword search for creative in CopyrightX collection

Page 32: Flexible, Free and Open Data-Driven Learning for the Masses

Wikify function in the CopyrightX MOOC collection

Page 33: Flexible, Free and Open Data-Driven Learning for the Masses

Preview of some of the top 100 collocations in the CopyrightX collection displaying summary judgment

Page 34: Flexible, Free and Open Data-Driven Learning for the Masses

Learner motivations for using FLAX and other language support resources

Page 35: Flexible, Free and Open Data-Driven Learning for the Masses

FLAX user experience for learners

Page 36: Flexible, Free and Open Data-Driven Learning for the Masses

Learner feedback on the searchability of the FLAX ContractsX MOOC collection

Page 37: Flexible, Free and Open Data-Driven Learning for the Masses

Learner feedback on the collocations and Cherry Basket features in the English Common Law MOOC collection

Page 38: Flexible, Free and Open Data-Driven Learning for the Masses

Negative features of FLAX according to respondents

Page 39: Flexible, Free and Open Data-Driven Learning for the Masses

Positive features of FLAX according to respondents

Page 40: Flexible, Free and Open Data-Driven Learning for the Masses

Extra comments on FLAX from respondents

Page 41: Flexible, Free and Open Data-Driven Learning for the Masses

RESEARCH INTO THE REUSE OF MOOC LINGUISTIC CONTENT

Page 42: Flexible, Free and Open Data-Driven Learning for the Masses

Fitzgerald, A., Marin, M.J., Wu, S. & Witten, I.H. (2017). Evaluating the Efficacy of the Digital Commons for Scaling Data-Driven Learning. In M. Carrier, R. M. Damerow, & K. M. Bailey (Eds.), Digital Language Learning and Teaching: Research, Theory, and Practice (pp. 38–51). New York, NY: Routledge & TIRF.

Page 43: Flexible, Free and Open Data-Driven Learning for the Masses

The Digital Commons

Typically, the digital commons involves the creation and distribution of informational resources and technologies that have been designed to stay in the digital commons using various open licenses, including the GNU General Public License and the Creative Commons suite of licenses (Wikipedia, 2016). One of the most widely used informational resources developed by and for the digital commons is Wikipedia.

Page 44: Flexible, Free and Open Data-Driven Learning for the Masses

Data Collection Procedure

• 52 students in the fourth year of the Translation Degree program at the University of Murcia (Spain) were selected as informants.

• All the students’ linguistic competence met the Common European Framework of Reference for Languages requirements for the B2 level.

Page 45: Flexible, Free and Open Data-Driven Learning for the Masses

Experimental & Control Groups

• The experimental group (16 informants organized into four sub-groups) were requested to only consult the FLAX English Common Law MOOC collection as the single source of information to draft their essays.

• The remaining 36 students (divided into nine sub-groups) acted as the control group and followed the method used before this experiment for designing and drafting essays, that is, using any information source available.

Page 46: Flexible, Free and Open Data-Driven Learning for the Masses

Term Average in each corpus

Measure | FLAX Corpus | Non-FLAX Corpus
Terms Identified by TermoStat (A) (Drouin, 2003) | 226 | 385
Corpus Size After Reduction | 16,939 | 16,264
Number of Topics (B) | 4 | 9
Term Average (A/B) | 56.5 | 42.77
Standardized type/token ratio | 35.3 | 38.63
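The term average in the table is the number of identified terms (A) divided by the number of topics (B) in each corpus. A minimal Python sketch of that arithmetic follows, using only the figures reported in the table; the function and variable names are illustrative and not taken from the study.

```python
def term_average(terms_identified: int, topics: int) -> float:
    """Return terms identified (A) divided by number of topics (B)."""
    if topics <= 0:
        raise ValueError("topics must be a positive integer")
    return terms_identified / topics


# Figures from the table above.
flax_average = term_average(226, 4)
non_flax_average = term_average(385, 9)

print(f"FLAX corpus term average:     {flax_average:.2f}")
print(f"Non-FLAX corpus term average: {non_flax_average:.2f}")
print(f"Difference:                   {flax_average - non_flax_average:.2f}")
```

Note that 385 / 9 is 42.777..., so the 42.77 shown on the slide corresponds to the value truncated, rather than rounded, to two decimal places.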

Page 47: Flexible, Free and Open Data-Driven Learning for the Masses

Findings from Reuse Study

• According to the data, the members of the experimental group appear to have acquired the specialized terminology of the area better than those in the control group, as attested by the higher term average obtained by the texts in the FLAX-based corpus (56.5), which is 13.73 points above that of the non-FLAX-based text collection (42.77).

• However, the standardized type/token ratio assigned to each set of texts, which is often indicative of the richness of the vocabulary (the higher, the richer), is lower for the FLAX-based texts (35.3), standing about 3.3 points below the texts written by the control group (38.63).
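The slides do not say how the standardized type/token ratio was calculated. A common definition takes the plain type/token ratio of each consecutive fixed-size chunk of running text and averages the results as a percentage. The Python sketch below assumes that definition; the tokenizer, the chunk size, and the sample sentence are all illustrative assumptions rather than details from the study.

```python
import re


def tokenize(text: str) -> list[str]:
    # Assumption: lowercase alphabetic word tokens, with internal apostrophes kept.
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def standardized_ttr(tokens: list[str], chunk_size: int = 1000) -> float:
    """Mean type/token ratio, as a percentage, over consecutive full chunks."""
    # Only full chunks are used; a trailing partial chunk is ignored.
    chunks = [
        tokens[i:i + chunk_size]
        for i in range(0, len(tokens) - chunk_size + 1, chunk_size)
    ]
    if not chunks:
        raise ValueError("text is shorter than one chunk")
    ratios = [len(set(chunk)) / len(chunk) for chunk in chunks]
    return 100 * sum(ratios) / len(ratios)


# Hypothetical sample text; a tiny chunk size is used only so the example runs.
sample = ("The court held that the contract was void "
          "and the court dismissed the claim")
sttr = standardized_ttr(tokenize(sample), chunk_size=7)
print(f"Standardized type/token ratio: {sttr:.2f}")
```

Averaging over equal-sized chunks is what makes the measure comparable across texts of different lengths, since a plain type/token ratio tends to fall as a text gets longer.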

Page 48: Flexible, Free and Open Data-Driven Learning for the Masses

References

• Biber, D., Conrad, S., & Cortes, V. (2004). If you look at . . .: Lexical bundles in university teaching and textbooks. Applied Linguistics, 25, 371–405.

• Biber, D. (2006). University Language: A corpus-based study of spoken and written registers. Amsterdam: John Benjamins.

• Biber, D., & Barbieri, F. (2007). Lexical bundles in university spoken and written registers. English for Specific Purposes, 26, 263–286.

• Fitzgerald, A., Marin, M.J., Wu, S. & Witten, I.H. (2017). Evaluating the Efficacy of the Digital Commons for Scaling Data-Driven Learning. In M. Carrier, R. M. Damerow, & K. M. Bailey (Eds.), Digital Language Learning and Teaching: Research, Theory, and Practice (pp. 38–51). New York, NY: Routledge & TIRF.

• Johns, T. (2002). Data-driven learning: the perpetual challenge. In B. Kettemann & G. Marko (Eds.), Teaching and Learning by Doing Corpus Analysis. Proceedings of the Fourth International Conference on Teaching and Language Corpora, Graz 19-24 July, 2000 (pp. 107–117). Amsterdam: Rodopi.

• Milne, D. & Witten, I.H. (2013). An open-source toolkit for mining Wikipedia. Artificial Intelligence, 194, 222–239.

• Wu, S., Li, L., Witten, I.H., & Yu, A. (2016). Constructing a Collocation Learning System from the Wikipedia Corpus. International Journal of Computer-Assisted Language Learning and Teaching (IJCALLT), 6(3), 18–35.

Page 49: Flexible, Free and Open Data-Driven Learning for the Masses

Thank You

Special Thanks:

Ruth Crymes TESOL Fellowship for Graduate Study
The International Research Foundation (TIRF) for English Language Education

FLAX Language Project & Software Downloads: http://flax.nzdl.org/

FLAX Language Project Research: https://www.researchgate.net/project/FLAX-Flexible-Language-Acquisition-flaxnzdlorg

The How-to eBook of FLAX: http://flax-doc.nzdl.org/BOOK_OF_FLAX/BookofFLAX%20fullsize%20with%20links.pdf

FLAX Game-based Apps for Android via Google Play Store (free): https://play.google.com/store/apps/developer?id=FLAX%20TEAM&hl=en

Ian Witten (FLAX Project Lead): [email protected]

Shaoqun Wu (FLAX Research and Development): [email protected]

Alannah Fitzgerald (FLAX Open Language Research): [email protected]

TOETOE Technology for Open English Blog: www.alannahfitzgerald.org

Slideshare: http://www.slideshare.net/AlannahOpenEd/

Twitter: @AlannahFitz