talc 2008 workshop 1 - teaching and language corpora
DESCRIPTION
Annotating pedagogy: implementing language teaching and learning-oriented annotation on corporaTRANSCRIPT
![Page 1: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/1.jpg)
TaLC 08 Workshop 1TaLC 08 Workshop 1
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 2: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/2.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
Aims
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
Aims
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 3: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/3.jpg)
Aims of this workshopAims of this workshop
3
![Page 4: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/4.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
Workshop Structure
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
Workshop Structure
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 5: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/5.jpg)
5
Workshop StructureWorkshop Structure
1. Introducing the workshop background
2. Overview of the annotation tool: SACODEYL Annotator
3. Getting down to annotation: annotating a text (1)
Coffee Break
1. Introducing the workshop background
2. Overview of the annotation tool: SACODEYL Annotator
3. Getting down to annotation: annotating a text (1)
Coffee Break
![Page 6: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/6.jpg)
6
Workshop StructureWorkshop Structure
3. Getting down to annotation: annotating a text (2)
4. Exploiting our Annotations5. Discussion and debate
3. Getting down to annotation: annotating a text (2)
4. Exploiting our Annotations5. Discussion and debate
![Page 7: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/7.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
1. Introduction
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
1. Introduction
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 8: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/8.jpg)
8
IntroductionIntroduction
Braun (2005, 2007): pedagogically motivated corpora
(a) provide a more systematic range of material than individual texts or scattered collections of activities and, if well-designed, (b) offer a wider range of idiolects than the average material.
Braun (2005, 2007): pedagogically motivated corpora
(a) provide a more systematic range of material than individual texts or scattered collections of activities and, if well-designed, (b) offer a wider range of idiolects than the average material.
![Page 9: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/9.jpg)
9
IntroductionIntroduction
Braun (2006) states that thematic annotation, including topic keys and section titles, are particularly useful in the implementation of pedagogically motivated corpora
Braun (2006) states that thematic annotation, including topic keys and section titles, are particularly useful in the implementation of pedagogically motivated corpora
![Page 10: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/10.jpg)
10
IntroductionIntroduction
![Page 11: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/11.jpg)
IntroductionIntroduction
Pérez-Paredes & Alcaraz (2009, forthcoming)
For the time being, the natural corpus playground continues to be tertiary education.
Our proposal:XML >>CL in the language classroom. The resulting annotated corpus can be seen as
being integrative of language data and annotated pedagogy. The former is the liquid, the latter is the container. The interesting aspect here is that pedagogy can be annotated and, subsequently, accessed by corpus users.
Pérez-Paredes & Alcaraz (2009, forthcoming)
For the time being, the natural corpus playground continues to be tertiary education.
Our proposal:XML >>CL in the language classroom. The resulting annotated corpus can be seen as
being integrative of language data and annotated pedagogy. The former is the liquid, the latter is the container. The interesting aspect here is that pedagogy can be annotated and, subsequently, accessed by corpus users. 11
![Page 12: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/12.jpg)
12
IntroductionIntroduction
Corpus
LanguageData
Annotation
Language
Metadata
Pedagogy
![Page 13: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/13.jpg)
IntroductionIntroduction
13
![Page 14: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/14.jpg)
14
IntroductionIntroduction
Linguistic analysis of interest in FLT------>
Linguistics comes first------->
DDL materialsConcordances
and corpus
Researcher/LinguistEnd user The possibilities
Scenario(Alcáraz and
Pérez-Paredes2008)
![Page 15: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/15.jpg)
15
IntroductionIntroduction
Pedagogical analysis (and annotation)
of language corpora------>
Pedagogy comes first------->
Pedagogy-driven DDL
Pedagogical analysis (and annotation)
of language corpora------>
Pedagogy comes first------->
Pedagogy-driven DDL Material
developer/Teacher/ LearnerEnd user
The feasibilityScenario
(Alcáraz and Pérez-Paredes
2008)
![Page 16: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/16.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
2. Overview of the annotation tool: SACODEYL Annotator
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
2. Overview of the annotation tool: SACODEYL Annotator
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 17: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/17.jpg)
17
SummarySummary
What is SACODEYL Annotator?How can I install SACODEYL
Annotator?Overview
Corpus ManagementCategory ManagementAnnotation ManagementMetadata Management
What is SACODEYL Annotator?How can I install SACODEYL
Annotator?Overview
Corpus ManagementCategory ManagementAnnotation ManagementMetadata Management
![Page 18: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/18.jpg)
18
What is SACODEYL Annotator?
What is SACODEYL Annotator?
SACODEYL Annotator …Is a free and open source tool for annotating
textsHas been developed from the pedagogical
perspective (not morpho-sintactic tagging oriented)
Offers support for pedagogical annotation in texts
Supports the customization of the annotated tags according to the necessities of the teacher.
SACODEYL Annotator …Is a free and open source tool for annotating
textsHas been developed from the pedagogical
perspective (not morpho-sintactic tagging oriented)
Offers support for pedagogical annotation in texts
Supports the customization of the annotated tags according to the necessities of the teacher.
![Page 19: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/19.jpg)
19
How can we install it?How can we install it?
In the SACODEYL Web Pagewww.um.es/sacodeyl
Platform-independent
(Remember to download the examples too)
In the SACODEYL Web Pagewww.um.es/sacodeyl
Platform-independent
(Remember to download the examples too)
![Page 20: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/20.jpg)
20
A) In the desktopB) In Program Files SACODEYL Tools
Where has it been installed?Where has it been installed?
![Page 21: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/21.jpg)
21
OverviewOverview
Categories Management
Menus and Corpus Management
Annotation Area
Metadata Management
![Page 22: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/22.jpg)
22
Corpus ManagementCorpus Management
We can …Create a new corpusOpen an existent
corpusSave the current
annotation changes
We can …Create a new corpusOpen an existent
corpusSave the current
annotation changes
![Page 23: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/23.jpg)
23
Creating a new corpus ICreating a new corpus I
![Page 24: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/24.jpg)
24
Creating a new corpus IICreating a new corpus II
A new corpus ALWAYS has a default category tree.
Default tree has been developed by the teachers of SACODEYL Project
A new corpus ALWAYS has a default category tree.
Default tree has been developed by the teachers of SACODEYL Project
![Page 25: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/25.jpg)
25
What categories does this defaultcategory tree contain ?
What categories does this defaultcategory tree contain ?
TopicsGrammaticalLexicalStyleCEF Level….
![Page 26: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/26.jpg)
26
How can I add a new document to the current corpus?
How can I add a new document to the current corpus?
1. Press in …
2. Select the text format/encoding
3. Select the new document
1. Press in …
2. Select the text format/encoding
3. Select the new document
![Page 27: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/27.jpg)
27
What does the text format mean?
What does the text format mean?
This is the manner in which the text is laid out in the file.
Mainly 3 text formats are supported: Plain text (written) .txt Oral text in SACODEYL Transcriptor format XML text in TEI standard format (text in special XML files)
This is the manner in which the text is laid out in the file.
Mainly 3 text formats are supported: Plain text (written) .txt Oral text in SACODEYL Transcriptor format XML text in TEI standard format (text in special XML files)
![Page 28: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/28.jpg)
28
This is the form in which the text is stored (related to the Multilanguage).
(In Windows ANSI by default)
What does the text encoding mean? What does the text encoding mean?
![Page 29: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/29.jpg)
29
Adding our exampleAdding our example
Our example file is en17.txtThis file has a SACODEYL
Transcriptor formatHas been encoded with
multilingual support encoding format (UTF-16)
Save the changes!!!!
Our example file is en17.txtThis file has a SACODEYL
Transcriptor formatHas been encoded with
multilingual support encoding format (UTF-16)
Save the changes!!!!
![Page 30: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/30.jpg)
30
Selecting the text to work with
Selecting the text to work with
Now we can insert a great amount of texts in the same corpus.
We can select one and annotate it.
1. Press …
2. Select the document
Now we can insert a great amount of texts in the same corpus.
We can select one and annotate it.
1. Press …
2. Select the document
![Page 31: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/31.jpg)
31
Friendly AppearanceFriendly Appearance
Pedagogical orientationPedagogical orientation
![Page 32: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/32.jpg)
32
Information showed in the working document
Information showed in the working document
Section Number Applied Categories to this section
(Annotations) Speaker (only in oral text) Transcription
Section Number Applied Categories to this section
(Annotations) Speaker (only in oral text) Transcription
![Page 33: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/33.jpg)
33
What is a section?What is a section?
Is a stretch of text that is pedagogically motivated.
A fragment that could be useful in a classroom
A section can be established in any kind of text (oral and written) with the insertion of the special char (#) for division of texts into sections.
Is a stretch of text that is pedagogically motivated.
A fragment that could be useful in a classroom
A section can be established in any kind of text (oral and written) with the insertion of the special char (#) for division of texts into sections.
![Page 34: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/34.jpg)
34
Intuitive Annotation ProcessIntuitive Annotation Process
Drag and Drop to Annotate a Section
Drag and Drop to Annotate a Section
![Page 35: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/35.jpg)
35
KeywordsKeywords
SACODEYL Annotator supports the annotation of keywords at any kind of level
Just select text and apply a category by right-clicking
SACODEYL Annotator supports the annotation of keywords at any kind of level
Just select text and apply a category by right-clicking
![Page 36: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/36.jpg)
36
What is a Keyword?What is a Keyword?
“… [a] keyword is a stretch of language (a word, more than one word or a whole paragraph) that the annotator associates to a category…”
Pérez-Paredes and Alcaraz, ReCALL, Jan. 2009 Vol 21. (1) (Forthcoming)
“… [a] keyword is a stretch of language (a word, more than one word or a whole paragraph) that the annotator associates to a category…”
Pérez-Paredes and Alcaraz, ReCALL, Jan. 2009 Vol 21. (1) (Forthcoming)
![Page 37: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/37.jpg)
37
Selective ViewSelective View
Offers a selective view of the information in order to facilitate the organization.
Offers a selective view of the information in order to facilitate the organization.
![Page 38: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/38.jpg)
38
Annotating a TitleAnnotating a Title
Drag and Drop the special “Title” category to the desired section.
The title is rendered by a tool tip when placing the
cursor on the section. (No tool tip = No title)
Drag and Drop the special “Title” category to the desired section.
The title is rendered by a tool tip when placing the
cursor on the section. (No tool tip = No title)
![Page 39: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/39.jpg)
39
MultilanguageMultilanguage
Supports a real multilingual annotation
May use texts in any language
Supports a real multilingual annotation
May use texts in any language
![Page 40: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/40.jpg)
40
ExtensibleExtensible
Supports customization of the annotation
User can add his/her own annotation taxonomy or remove any annotation category
Supports customization of the annotation
User can add his/her own annotation taxonomy or remove any annotation category
![Page 41: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/41.jpg)
41
How can I add a new category?
How can I add a new category?
Select the parent category. (i.e. Topics)
Press Add Cat. Button.Fill the dialog
Select the parent category. (i.e. Topics)
Press Add Cat. Button.Fill the dialog
![Page 42: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/42.jpg)
42
How can I remove a category?
How can I remove a category?
Select the category to remove (i.e. Topic)
Be careful …All the associated children
will be removed alsoAll the annotation with the
tags will be removed alsoPress Delete Cat. Button.
![Page 43: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/43.jpg)
43
How can I reorder the categories?
How can I reorder the categories?
Select the category to reorder (i.e. Topic)
Press Up Cat or Down Cat. to move it.
![Page 44: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/44.jpg)
44
How can I customize a category?
Select the category to customize (i.e. Topic)
Press double click
![Page 45: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/45.jpg)
45
Can I manage metadata for the working text?
Can I manage metadata for the working text?
![Page 46: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/46.jpg)
46
What if I find mistakes?What if I find mistakes?
Supports edition of the inserted texts.
Uses XML TEI standard for encoding corpora.
Supports edition of the inserted texts.
Uses XML TEI standard for encoding corpora.
![Page 47: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/47.jpg)
47
Multilingual InterfaceMultilingual Interface
Available in many languagesYou could also add your language
for the interfaces
Available in many languagesYou could also add your language
for the interfaces
![Page 48: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/48.jpg)
48
Has a detailed help tutorialHas a detailed help tutorial
![Page 49: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/49.jpg)
49
IntegrationIntegration
SACODEYL Annotator has been integrated with:SACODEYL TranscriptorSACODEYL Search SACODEYL VRP
SACODEYL Annotator has been integrated with:SACODEYL TranscriptorSACODEYL Search SACODEYL VRP
![Page 50: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/50.jpg)
50
Resource ManagementResource Management
Offers enrichment of text with external resources
i.e. html links, videos, audios, etc.
Offers enrichment of text with external resources
i.e. html links, videos, audios, etc.
![Page 51: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/51.jpg)
51
Where is the information stored?
Where is the information stored?
Remember: All the information is store in one file. The corpus file which you have created.
Remember: All the information is store in one file. The corpus file which you have created.
Corpus
LanguageData
Annotation
Language
Metadata
Pedagogy
![Page 52: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/52.jpg)
52
Advanced FeaturesAdvanced Features
There are more advanced features… How can I establish the default category
tree for a new corpus? When a have a lot of documents in the
same corpus, how can I reorder the documents in the list of the available texts?
….. Visit our web site for updates
www.um.es/sacodeyl
There are more advanced features… How can I establish the default category
tree for a new corpus? When a have a lot of documents in the
same corpus, how can I reorder the documents in the list of the available texts?
….. Visit our web site for updates
www.um.es/sacodeyl
![Page 53: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/53.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
3. Getting down to annotation: annotating a text
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
3. Getting down to annotation: annotating a text
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 54: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/54.jpg)
54
Annotation AimAnnotation Aim
Why are you annotating?What is the purpose of your annotation?What use are you giving to your
annotation?
We assume you are annotating a text for your own students’ language learning purposes
Why are you annotating?What is the purpose of your annotation?What use are you giving to your
annotation?
We assume you are annotating a text for your own students’ language learning purposes
![Page 55: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/55.jpg)
55
Annotation AimAnnotation Aim
We assume you are annotating a text for your own students’ language learning purposes
We assume you are annotating a text for your own students’ language learning purposes
The best way to learn to walk is walk!!!!
Watch the video while you read the transcript of an interview which belongs to the English SACODEYL Corpus.
A talk with Mike, 15, about his interest in football and art, his cooking skills and plans for the future.
Video length: 10:47
![Page 56: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/56.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
4. Exploiting our annotations
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
4. Exploiting our annotations
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 57: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/57.jpg)
57
What is the output of SACODEYL Annotator?
What is the output of SACODEYL Annotator?
A corpus stored in XML TEI format that contains:Texts (one text in our case)Annotations over the textThe category tree (shared by all the
texts)Resources linked to the textsMetadata associated to texts
A corpus stored in XML TEI format that contains:Texts (one text in our case)Annotations over the textThe category tree (shared by all the
texts)Resources linked to the textsMetadata associated to texts
![Page 58: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/58.jpg)
58
What is XML TEI format?What is XML TEI format?
TEI Text Encoding InitiativeThis is a format for storing corporaHas been promoted by OTA
(Oxford Text Archive)Is a continuously growing format
(more than 50 versions released yet, currently TEI P5)
Is rapidly spreading among the available tools
TEI Text Encoding InitiativeThis is a format for storing corporaHas been promoted by OTA
(Oxford Text Archive)Is a continuously growing format
(more than 50 versions released yet, currently TEI P5)
Is rapidly spreading among the available tools
![Page 59: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/59.jpg)
59
TEI Tools (Research)TEI Tools (Research)
TeiPublisher“This tool is a XML-based repository thatallows the publication of TEI corpora to
thepublic community and offers a search
tool.”Dexter
“This is other annotator tool that used TEI as
the format for the annotated files.”
TeiPublisher“This tool is a XML-based repository thatallows the publication of TEI corpora to
thepublic community and offers a search
tool.”Dexter
“This is other annotator tool that used TEI as
the format for the annotated files.”
![Page 60: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/60.jpg)
60
TEI Tools (Research)TEI Tools (Research)
Oxygen XML Editor and XMLSpy“These are XML Editors that allows the
modification of the TEI files without any limitation”
(These are complex for non-advanced users)
Oxygen XML Editor and XMLSpy“These are XML Editors that allows the
modification of the TEI files without any limitation”
(These are complex for non-advanced users)
![Page 61: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/61.jpg)
61
TEI Tools (Research)TEI Tools (Research)
TAPoR (http://portal.tapor.ca/)“The Text Analysis Portal for Research (TAPoR) is a gateway to tools for
sophisticated analysis and retrieval, along with
representative texts for experimentation.”
TAPoR (http://portal.tapor.ca/)“The Text Analysis Portal for Research (TAPoR) is a gateway to tools for
sophisticated analysis and retrieval, along with
representative texts for experimentation.”
![Page 62: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/62.jpg)
62
TEI Tools (Research)TEI Tools (Research)
TokenX http://www.unl.edu/libr/etext/tokenx.shtml
“Is a text visualization, analysis, and play tool” WordHoard
http://wordhoard.northwestern.edu/userman/index.html
“Is a tool for annotating or tagging texts by morphological, lexical, prosodic, and
narratological criteria and for determining frequency
information”
TokenX http://www.unl.edu/libr/etext/tokenx.shtml
“Is a text visualization, analysis, and play tool” WordHoard
http://wordhoard.northwestern.edu/userman/index.html
“Is a tool for annotating or tagging texts by morphological, lexical, prosodic, and
narratological criteria and for determining frequency
information”
![Page 63: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/63.jpg)
63
TEI Tools (Research)TEI Tools (Research)
XAIRAXAIRA (XML Aware Information
Retrieval Architecture) is an open source tool
for constructing high-quality
linguistically-motivated search interfaces to large collections of XML documents.
XAIRAXAIRA (XML Aware Information
Retrieval Architecture) is an open source tool
for constructing high-quality
linguistically-motivated search interfaces to large collections of XML documents.
![Page 64: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/64.jpg)
64
The XAIRA search with our example!!!!
The XAIRA search with our example!!!!
![Page 65: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/65.jpg)
65
TEI Tools (Research)TEI Tools (Research)
CorpusReader http://panini.u-paris10.fr/~sloiseau/CR/
“Is a tool for extracting subcorpora, KWIC and quantitative information from arbitrarily large corpora in the TEI vocabulary. It intends to
provide ways for processing corpora containing
milestoned annotation. It provides mechanism for
merging several XML documents together”.
CorpusReader http://panini.u-paris10.fr/~sloiseau/CR/
“Is a tool for extracting subcorpora, KWIC and quantitative information from arbitrarily large corpora in the TEI vocabulary. It intends to
provide ways for processing corpora containing
milestoned annotation. It provides mechanism for
merging several XML documents together”.
![Page 66: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/66.jpg)
66
TEI Tools (Classroom)TEI Tools (Classroom)
A more interesting orientation.
How I can use the Annotation in the classroom?
SACODEYL Search Toolwww.um.es/sacodeyl
A more interesting orientation.
How I can use the Annotation in the classroom?
SACODEYL Search Toolwww.um.es/sacodeyl
Don`t miss out “The SACODEYL Search Tool:´Exploiting corpora for
language learning purposes.Room A3, 18:00 Friday, 4
![Page 67: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/67.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
6. Discussion and debate
Annotating pedagogy: implementing language
teaching and learning-oriented annotation on corpora
6. Discussion and debate
Pascual Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain
![Page 68: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/68.jpg)
Aims of this workshopre-visited
Aims of this workshopre-visited
68
![Page 69: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/69.jpg)
69
Discussion and debateDiscussion and debate
Pedagogical annotation vs. Morphological tagging paradigm
Learner-centered vs. Researcher-oriented
Indirect applications of language corpora vs. Direct applications
Constraints of traditional CL in the languagge classroom
Pedagogical annotation vs. Morphological tagging paradigm
Learner-centered vs. Researcher-oriented
Indirect applications of language corpora vs. Direct applications
Constraints of traditional CL in the languagge classroom
![Page 70: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/70.jpg)
70
Discussion and debateDiscussion and debate
Cognitive demands of traditional CL in the language classroom: learner as a reseacher and as a traveller
Is CL an extra hassle in language classrooms? (Mauranen 2004)
Customization of language corpus/collection of texts
Mediation role of corpus-based resources in the FLT classroom
Authenticity issues (Widdowson)
Cognitive demands of traditional CL in the language classroom: learner as a reseacher and as a traveller
Is CL an extra hassle in language classrooms? (Mauranen 2004)
Customization of language corpus/collection of texts
Mediation role of corpus-based resources in the FLT classroom
Authenticity issues (Widdowson)
![Page 71: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/71.jpg)
71
References and further reading
References and further reading
Braun, S. 2005. “From pedagogically relevant corpora to authentic language learning contents”, ReCALL 17/1:47-64.
Braun, S. 2006. “ELISA - a pedagogically enriched corpus for language learning purposes”. In Corpus Technology and Language Pedagogy: New Resources, New Tools, New Methods, Frankfurt M: Peter Lang. (eds) 25-47.
Braun, S. 2007. “Integrating corpus work into secondary education: from data-driven learning to needs-driven corpora”. ReCALL 19/3: 307-328.
Mauranen, A. 2004.” Spoken - general: Spoken corpus for an ordinary learner”. In How to Use Corpora in Language Teaching, Sinclair, J. McH. (Ed), 89–105.
Pérez-Paredes, P. and Alcaraz, J.M. 2009. “Developing annotation solutions for online data-driven learning”. ReCALL,21,1, (Forthcoming).
Römer, Ute. (Forthcoming). “Corpora and Language Teaching”. In Corpus Linguistics. An International Handbook, Lüdeling, Anke & Merja Kytö (eds.). Berlin: Mouton de Gruyter.
Widdowson, H.G. 2003. Defining issues in English Language Teaching. Oxford: Oxford University Press.
Braun, S. 2005. “From pedagogically relevant corpora to authentic language learning contents”, ReCALL 17/1:47-64.
Braun, S. 2006. “ELISA - a pedagogically enriched corpus for language learning purposes”. In Corpus Technology and Language Pedagogy: New Resources, New Tools, New Methods, Frankfurt M: Peter Lang. (eds) 25-47.
Braun, S. 2007. “Integrating corpus work into secondary education: from data-driven learning to needs-driven corpora”. ReCALL 19/3: 307-328.
Mauranen, A. 2004.” Spoken - general: Spoken corpus for an ordinary learner”. In How to Use Corpora in Language Teaching, Sinclair, J. McH. (Ed), 89–105.
Pérez-Paredes, P. and Alcaraz, J.M. 2009. “Developing annotation solutions for online data-driven learning”. ReCALL,21,1, (Forthcoming).
Römer, Ute. (Forthcoming). “Corpora and Language Teaching”. In Corpus Linguistics. An International Handbook, Lüdeling, Anke & Merja Kytö (eds.). Berlin: Mouton de Gruyter.
Widdowson, H.G. 2003. Defining issues in English Language Teaching. Oxford: Oxford University Press.
![Page 72: TALC 2008 Workshop 1 - Teaching and Language Corpora](https://reader035.vdocument.in/reader035/viewer/2022062312/5550d8d3b4c90599308b536c/html5/thumbnails/72.jpg)
TaLC 08 WorkshopTaLC 08 Workshop
Thanks for your attention
[email protected]@um.es
Thanks for your attention
[email protected]@um.es
[email protected] Pérez-Paredes José M. AlcarazUniversidad de Murcia, Spain