tei רוציקב - bgudhcs162/wiki.files/class16.bgu.dh.tei.pdf · text encoding initiative re-use...

19
tei בקיצור נצר יעל מתוך לקוחים מהשקפים חלקhttp://teibyexample.org Text Encoding Initiative Workshop: Intro to Text Encoding Michelle Dalmau & John Walsh, Indiana University Catapult /

Upload: others

Post on 05-Oct-2020

7 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

tei בקיצור

יעל נצר חלק מהשקפים לקוחים מתוך

http://teibyexample.org

Text Encoding Initiative Workshop: Intro to Text Encoding Michelle Dalmau & John Walsh, Indiana University Catapult /

Page 2: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

Purpose

• Adding semantics

• International consortium http://www.tei-c.org/

Page 3: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

Types of mark-up

• procedural (i.e. italics to indicate word in foreign language)

• descriptive (“this is a word in foreign language)

Page 4: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

SGML the great ancestorStandard Generalized Markup Language (SGML) 1986

Description of markup schemes that satisfied at least seven requirements for an encoding standard:

comprehensiveness;

simplicity;

documents be processable by software of moderate complexity;

standard not be dependent on any particular characteristic set or text-entry devise;

standard not be geared to any particular analytic program or printing system;

standard should describe text in editable form;

standard allow the interchange of encoded texts across communication networks.

metalanguageDocument Type Definition (DTD)

http://teibyexample.org

Page 5: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

XML

• 1998

• platform-, software-, and system-independent

• text based (ascii / utf-)

Page 6: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

Text Encoding Initiative

Re-use and flexibility: build once, use many Presentation and output of text controlled by style sheets.

Generate different views of the same text and different formats: PDF, HTML, ePub (ebooks), plain text (for text analysis), etc.

The document and the markup can serve as an object of analysis and increased discoverability

Every encoded text is a “reading” of the text.

Page 7: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 8: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

TEI ground rules• Guidelines: http://www.tei-c.org/release/doc/tei-p5-

doc/en/html/

• 503 elements and 210 attributes

• teiHeader - metadata on document and source

• General elements

• Specific to genres

Page 9: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 10: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

The metadata header for a TEI document

Page 11: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

The text itself

Page 12: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

Text Encoding Initiative Workshop: Intro to Text Encoding Michelle Dalmau & John Walsh, Indiana University Catapult / Scholars’

Page 13: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 14: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 15: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 16: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 17: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled
Page 18: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

http://oxygenxml.com/download_oxygenxml_editor.html

http://www.wwp.northeastern.edu/outreach/seminars/_current/handouts/elementList.xhtml

Page 19: tei רוציקב - BGUdhcs162/wiki.files/class16.bgu.dh.tei.pdf · Text Encoding Initiative Re-use and flexibility: build once, use many Presentation and output of text controlled

נקודות לגבי תרגיל שני• http://mapoflondon.uvic.ca/agas.htm המפה של לונדון

״סטנדרטיזציה של ייצוג ידע״ - כיצד אפשר לייצג את •המידע במסמך כך שאפשר להציג אותו בדרכים שונות?

איך אפשר ״להראות״ את המכתבים של אבשלום •פיינברג? על מפה, בעזרת ענני מילים, קישור למקורות

מידע אחרים..

כל מכתב בפני עצמו, מכתבים לפי נמען, על ציר זמן, •לפי מקום?