Standards for digital encoding
Tomaž Erjavec
Karl-Franzens-Universität Graz
Lecture 2: TEI
What can the TEI do for you?
The TEI provides a framework for the definition of multiple schemas:
- it defines and names several hundred useful textual distinctions
- it provides a set of modules that can be used to define schemas making those distinctions
- it provides a customization mechanism for modifying and combining those definitions with new ones using the same conceptual model
Where did the TEI come from?
- Originally, a research project within the humanities
  - Sponsored by three professional associations
  - Funded 1990-1994 by US, EU.
- Major influences:
  - digital libraries and text collections
  - language corpora
  - scholarly datasets
- International consortium established June 1999 (see http://www.tei-c.org/)
Goals of the TEI
- better interchange and integration of scholarly data
- support for all texts, in all languages, from all periods
- guidance for the perplexed: what to encode; hence, a user-driven codification of existing best practice
- assistance for the specialist: how to encode; hence, a loose framework into which unpredictable extensions can be fitted
These apparently incompatible goals result in a flexible and modular environment.
TEI Guidelines
- A set of recommendations for text encoding, covering both generic text structures and some highly specific areas, based on (but not limited by) existing practice
- A very large collection of element definitions with associated declarations for various schema languages
- A modular system for creating personalized schemas or DTDs from the foregoing
For the full picture see http://www.tei-c.org/Guidelines/
Legacy of the TEI
- a way of looking at what ‘text’ really is
- a codification of current scholarly practice
- (crucially) a set of shared assumptions and priorities about the digital agenda:
  - focus on content and function (rather than presentation)
  - identify generic solutions (rather than application-specific ones)
Users of TEI
- Over 100 projects listed on the TEI project page
- Main areas:
  - digital libraries
  - text-critical editions
  - computer corpora
  - dictionaries
Versions of the Guidelines
- TEI P3 (1994): first public version
  - SGML + book (1200pp), and soon also on the Web.
- TEI P4 (2002):
  - provides equal support for XML and SGML applications using the TEI scheme;
  - error correction, while maintaining backward compatibility: documents conforming to TEI P3 will not become illegal when processed with TEI P4.
- TEI P5 (2007):
  - implements more fundamental changes to the schemas, in line with current practice and identified problems, e.g. uses namespaces
  - no longer backward compatible (but an approximate P4-to-P5 migration XSLT exists)
  - Relax NG becomes the main schema language
  - continuous improvement
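The practical effect of P5 using namespaces can be sketched in a few lines of Python. This is only an illustration, not part of the original slides: the sample document and its contents are made up, and the standard-library `xml.etree.ElementTree` module stands in for whatever XML tooling you use. It shows that a lookup written without the TEI namespace finds nothing in a P5 document, while a namespace-aware lookup succeeds.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"  # namespace of TEI P5 elements

# Hypothetical P5 document, trimmed to the bare minimum for illustration.
P5_DOC = f"""<TEI xmlns="{TEI_NS}">
  <teiHeader>
    <fileDesc>
      <titleStmt><title>Sample text</title></titleStmt>
      <publicationStmt><p>Unpublished example</p></publicationStmt>
      <sourceDesc><p>Born digital</p></sourceDesc>
    </fileDesc>
  </teiHeader>
  <text><body><p>Hello, TEI.</p></body></text>
</TEI>"""

root = ET.fromstring(P5_DOC)

# A lookup without a namespace finds nothing in a P5 document.
plain_hit = root.find("teiHeader")

# The same kind of lookup with the TEI namespace bound to a prefix succeeds.
ns = {"tei": TEI_NS}
title = root.find("tei:teiHeader/tei:fileDesc/tei:titleStmt/tei:title", ns)

print(plain_hit)   # None
print(title.text)  # Sample text
```

The same consideration applies to XSLT and XPath: stylesheets written for un-namespaced P4 documents need their match patterns bound to the TEI namespace before they select anything in P5 input.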
The general structure of TEI documents
Burnard, Driscoll, Rahtz, TEI Training Course, Sofia 2005: Slides for TEI overview
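As a rough orientation before consulting those slides, the skeleton of a TEI P5 document can be sketched as follows: a `TEI` root containing a `teiHeader` (metadata about the electronic file) followed by a `text` with a `body`. The helper names `tei` and `make_skeleton` and the placeholder strings are hypothetical, chosen for this sketch only; Python's `xml.etree.ElementTree` is used simply as a convenient way to build and print the XML.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
ET.register_namespace("", TEI_NS)  # serialize TEI as the default namespace


def tei(tag):
    """Return the namespace-qualified name of a TEI element."""
    return f"{{{TEI_NS}}}{tag}"


def make_skeleton(title, paragraphs):
    """Build a minimal TEI document: a header plus a text with a body."""
    root = ET.Element(tei("TEI"))
    # Header: the file description with its three mandatory parts.
    header = ET.SubElement(root, tei("teiHeader"))
    file_desc = ET.SubElement(header, tei("fileDesc"))
    title_stmt = ET.SubElement(file_desc, tei("titleStmt"))
    ET.SubElement(title_stmt, tei("title")).text = title
    pub_stmt = ET.SubElement(file_desc, tei("publicationStmt"))
    ET.SubElement(pub_stmt, tei("p")).text = "Unpublished example"
    source_desc = ET.SubElement(file_desc, tei("sourceDesc"))
    ET.SubElement(source_desc, tei("p")).text = "Born digital"
    # Text: the encoded document itself.
    text = ET.SubElement(root, tei("text"))
    body = ET.SubElement(text, tei("body"))
    for para in paragraphs:
        ET.SubElement(body, tei("p")).text = para
    return root


doc = make_skeleton("Sample text", ["First paragraph.", "Second paragraph."])
print(ET.tostring(doc, encoding="unicode"))
```

A real project would put far more into the header (encoding description, revision history) and would validate the result against its chosen schema; this sketch only shows the header/text split.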
TEI Lite
- TEI Lite is a particular parametrisation of TEI that “provides 90% of the elements needed for 90% of users”
- but better to make your own schema
Lab session
Take personlist data and write XSLT to transform it into teiLite
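The lab itself asks for XSLT, and the personlist format is not shown in this transcript, so the following is only a Python stand-in for the kind of mapping the stylesheet has to perform. Everything about the input is an assumption: the element names `personlist`, `person`, `name`, `born`, the `id` attribute, and the sample people are all hypothetical. The target uses elements available in TEI Lite (`list`, `item`, `name`, `date`); the choice of a `list` of `item`s is one possible encoding, not the required one.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"
XML_NS = "http://www.w3.org/XML/1998/namespace"
ET.register_namespace("", TEI_NS)  # serialize TEI as the default namespace

# Hypothetical input: the real personlist format used in the lab may differ.
PERSONLIST = """<personlist>
  <person id="p1"><name>Ada Lovelace</name><born>1815</born></person>
  <person id="p2"><name>Alan Turing</name><born>1912</born></person>
</personlist>"""


def q(tag):
    """Return the namespace-qualified name of a TEI element."""
    return f"{{{TEI_NS}}}{tag}"


def personlist_to_tei_list(source_xml):
    """Map each source <person> to an <item> inside a TEI <list>."""
    tei_list = ET.Element(q("list"), {"type": "persons"})
    for person in ET.fromstring(source_xml).iter("person"):
        # The source id becomes xml:id on the TEI item.
        item = ET.SubElement(
            tei_list, q("item"), {f"{{{XML_NS}}}id": person.get("id")}
        )
        name = ET.SubElement(item, q("name"), {"type": "person"})
        name.text = person.findtext("name")
        name.tail = ", born "
        year = person.findtext("born")
        ET.SubElement(item, q("date"), {"when": year}).text = year
    return tei_list


result = personlist_to_tei_list(PERSONLIST)
print(ET.tostring(result, encoding="unicode"))
```

In XSLT the loop body corresponds to a template matching `person`, and the resulting `list` would be placed inside the `body` of a complete TEI Lite document.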
XSLT
- Erjavec: Course at ESSLLI 2005, Annotation of Language Resources, Lecture II. XML-Related Recommendations: Formatting and Transforming XML
- ZVON tutorial
- W3Schools
- …