
  • Metadata: The TEI Header and Manuscript Description

    TEI@Oxford

    2010-07

    Oxford TEI Summer School 2010

  • The TEI Header

    The TEI header was designed with two goals in mind:

    the needs of bibliographers and librarians trying to document ‘electronic books’

    the needs of text analysts trying to document ‘coding practices’ within digital resources

    The result is that discussion of the header tends to be pulled in two directions...

  • The Librarian’s Header

    Conforms to a standard bibliographic model, using similar terminology

    Organized as a single source of information for bibliographic description of a digital resource, with established mappings to other such records (e.g. MARC)

    Emerging code of best practice in its use, endorsed by major digital collections

    Pressure for greater and more exact constraints to improve precision of description: preference for structured data over loose prose

  • Everyman’s Header

    Gives a polite nod to common bibliographic practice, but has a far wider scope

    Supports a (potentially) huge range of very miscellaneous information, organized in fairly ad hoc ways

    Many different codes of practice in different user communities

    Unpredictable combinations of narrowly encoded documentation systems and loose prose descriptions

  • TEI Header Structure

    The TEI header has four main components:

    <fileDesc> (file description) contains a full bibliographic description of an electronic file.

    <encodingDesc> (encoding description) documents the relationship between an electronic text and the source or sources from which it was derived.

    <profileDesc> (text-profile description) provides a detailed description of non-bibliographic aspects of a text, specifically the languages and sublanguages used, the situation in which it was produced, the participants and their setting (just about everything not covered in the other header elements).

    <revisionDesc> (revision description) summarizes the revision history for a file.

    Only <fileDesc> is required; the others are optional.

  • Example Header: Minimal required header

    A title?

    Who published?

    Where from?
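
    A minimal sketch of how these three questions could map onto the three required parts of <fileDesc>. Wrapping the publication and source information in plain <p> elements is an assumption made for brevity; more specific child elements would serve equally well.

    ```xml
    <teiHeader>
      <fileDesc>
        <!-- required: at least one title for the electronic file -->
        <titleStmt>
          <title>A title?</title>
        </titleStmt>
        <!-- required: how the file is published or distributed -->
        <publicationStmt>
          <p>Who published?</p>
        </publicationStmt>
        <!-- required: where the text came from -->
        <sourceDesc>
          <p>Where from?</p>
        </sourceDesc>
      </fileDesc>
    </teiHeader>
    ```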

  • The TEI supports two ‘levels’ or types of header

    corpus-level metadata sets default properties for everything in a corpus

    text-level metadata sets specific properties for one component text of a corpus

  • Corpus Header Example
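
    A sketch of how the two levels of header can nest, assuming a <teiCorpus> wrapper around one member text. All titles and prose values here are invented placeholders, not taken from the original slide.

    ```xml
    <teiCorpus xmlns="http://www.tei-c.org/ns/1.0">
      <!-- corpus-level header: defaults for every text in the corpus -->
      <teiHeader>
        <fileDesc>
          <titleStmt><title>A hypothetical corpus</title></titleStmt>
          <publicationStmt><p>Unpublished</p></publicationStmt>
          <sourceDesc><p>Sources are described in each text header.</p></sourceDesc>
        </fileDesc>
      </teiHeader>
      <TEI>
        <!-- text-level header: properties specific to this one text -->
        <teiHeader>
          <fileDesc>
            <titleStmt><title>First text of the hypothetical corpus</title></titleStmt>
            <publicationStmt><p>Unpublished</p></publicationStmt>
            <sourceDesc><p>Born digital.</p></sourceDesc>
          </fileDesc>
        </teiHeader>
        <text>
          <body><p>...</p></body>
        </text>
      </TEI>
    </teiCorpus>
    ```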

  • Types of content in the TEI header

    free prose:

      prose description: series of paragraphs

      phrase: character data, interspersed with phrase-level elements, but not paragraphs

    grouping elements: specialised elements recording some structured information

    declarations: elements whose names end with the suffix Decl (e.g. tagsDecl, refsDecl) enclose information about specific encoding practices applied in the electronic text.

    descriptions: elements whose names end with the suffix Desc (e.g. <settingDesc>, <projectDesc>) contain a prose description, possibly, but not necessarily, organised under some specific headings by suggested sub-elements.

  • File Description

    <fileDesc> has some mandatory parts:

    <titleStmt>: provides a title for the resource and any associated statements of responsibility

    <sourceDesc>: documents the sources from which the encoded text derives (if any)

    <publicationStmt>: documents how the encoded text is published or distributed

    and some optional ones:

    <editionStmt>: yes, electronic texts have editions too

    <seriesStmt>: and they also fit into "series"

    <extent>: how many floppy disks, gigabits, files?

    <notesStmt>: notes of various types

  • The File Description

    <titleStmt>: contains a mandatory <title> which identifies the electronic file (not its source!)

    optionally followed by additional titles, and by ‘statements of responsibility’, as appropriate, using <author>, <editor>, <sponsor>, <funder>, <principal> or the generic <respStmt>

    <publicationStmt>: may contain

    plain text (e.g. to say the text is unpublished)

    one or more <publisher>, <distributor>, <authority>, each followed by <pubPlace>, <address>, <availability>, <idno>

  • A minimal header for Punch

    Punch, or the London Charivari: an electronic edition

    Owen Seaman (1861-1936)

    TEI version

    TEI@Oxford team

    Unpublished

    Recoded from the Project Gutenberg versions
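
    One way the values on this slide might be arranged in a minimal header. Tagging Owen Seaman with <editor> is an assumption (the slide text alone does not show which responsibility element was used), as is the use of <p> inside <publicationStmt> and <sourceDesc>.

    ```xml
    <teiHeader>
      <fileDesc>
        <titleStmt>
          <title>Punch, or the London Charivari: an electronic edition</title>
          <!-- assumption: editor rather than author -->
          <editor>Owen Seaman (1861-1936)</editor>
          <respStmt>
            <resp>TEI version</resp>
            <name>TEI@Oxford team</name>
          </respStmt>
        </titleStmt>
        <publicationStmt>
          <p>Unpublished</p>
        </publicationStmt>
        <sourceDesc>
          <p>Recoded from the Project Gutenberg versions</p>
        </sourceDesc>
      </fileDesc>
    </teiHeader>
    ```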

  • Title and Responsibility Statements...

    There may be many of them:

    Artamene

    Le Grand Cyrus

    Digital Edition

    Amongst the guilty parties:

    Scudery, Madeleine de

    Geffin, Alexandre

    Fonds Nationale Suisse de la Recherche Scientifique

    Encoding check

    Jean Untel

  • <publicationStmt> example

    TEI Consortium

    Oxford Text Archive

    1256

    Available under the terms of a Creative Commons Attribution and Share Alike licence.
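
    A sketch of a structured <publicationStmt> carrying these four values. Which agency is the publisher and which the distributor is an assumption, and the type and status attribute values are illustrative only.

    ```xml
    <publicationStmt>
      <!-- assumption: Consortium as publisher, OTA as distributor -->
      <publisher>TEI Consortium</publisher>
      <distributor>Oxford Text Archive</distributor>
      <!-- hypothetical type value for the archive's own identifier -->
      <idno type="ota">1256</idno>
      <availability status="restricted">
        <p>Available under the terms of a Creative Commons Attribution and
          Share Alike licence.</p>
      </availability>
    </publicationStmt>
    ```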

  • <notesStmt> example

    <notesStmt> can contain notes on almost any aspect:

    Material prepared for the TEI@Oxford Summer School.

  • The Source Description

    All electronic works need to indicate their source, even if it is just to say that it is 'born digital'. There are a variety of ways to do this:

    prose description

    <bibl>: contains free text or any mixture of bibliographic elements such as <author>, <title> etc.

    <biblStruct>: contains effectively the same elements but constrained in various ways according to bibliographic standards

    <biblFull>: special-cases texts which were born TEI by replicating an embedded <fileDesc>

    A <listBibl> may be used for lists of such descriptions

    Specialised elements for spoken texts (<recordingStmt> etc.) and for manuscripts (<msDesc>). Discussed later!

    Authority lists for e.g. people (<listPerson>) or places (<listPlace>) can be included.

  • <sourceDesc> examples

    Born digital.

    Enigma, Punch: or the London Charivari, July 1, 1914, 147, p. 6

  • <bibl> vs. <biblStruct> Example

    Enigma, in Punch: or the London Charivari (July 1, 1914), vol 147, pp. 1-20

    Enigma

    Punch: or the London Charivari

    London

    July 1, 1914

    147

    1-20
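
    The contrast can be sketched as follows: the loose form keeps the citation as running text with optional inline tagging, while the structured form sorts the same values into fixed slots. The attribute choices (level, when, unit) are assumptions; note that releases of the Guidelines current in 2010 used type rather than unit on <biblScope>.

    ```xml
    <!-- loose: free text with a mixture of bibliographic elements -->
    <bibl><title level="a">Enigma</title>, in
      <title level="j">Punch: or the London Charivari</title>
      (<date when="1914-07-01">July 1, 1914</date>),
      vol <biblScope unit="volume">147</biblScope>,
      pp. <biblScope unit="page">1-20</biblScope></bibl>

    <!-- structured: the same values in constrained positions -->
    <biblStruct>
      <analytic>
        <title>Enigma</title>
      </analytic>
      <monogr>
        <title level="j">Punch: or the London Charivari</title>
        <imprint>
          <pubPlace>London</pubPlace>
          <date when="1914-07-01">July 1, 1914</date>
          <biblScope unit="volume">147</biblScope>
          <biblScope unit="page">1-20</biblScope>
        </imprint>
      </monogr>
    </biblStruct>
    ```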

  • Encoding Description

    <encodingDesc> groups notes about the procedures used when the text was encoded, either summarised in prose or within specific elements such as

    <projectDesc>: goals of the project

    <samplingDecl>: sampling principles

    <editorialDecl>: editorial principles, e.g. <correction>, <normalization>, <quotation>, <hyphenation>, <segmentation>, <interpretation>

    <classDecl>: classification system/s used

    <tagsDecl>: specifics about usage of particular elements

    The <encodingDesc> can replace the user manual, or facilitate semi-automatic document management, given agreed codes of practice.

  • <encodingDesc> Example (1)

    The Imaginary Punch Project aims to ....

    All pages containing editorial text have been transcribed in full. Pages containing only advertisements or illustrations have been omitted.

    Original spelling has been retained, except that words hyphenated across line breaks have been silently re-assembled. The hyphen has been retained only where there exist cases of the same word being hyphenated in mid-line position.
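
    A sketch of how these three statements might be distributed across <encodingDesc>. Placing the spelling and hyphenation policy inside <hyphenation> is an assumption; it could equally sit in a plain <p> directly under <editorialDecl>.

    ```xml
    <encodingDesc>
      <projectDesc>
        <p>The Imaginary Punch Project aims to ....</p>
      </projectDesc>
      <samplingDecl>
        <p>All pages containing editorial text have been transcribed in full.
          Pages containing only advertisements or illustrations have been
          omitted.</p>
      </samplingDecl>
      <editorialDecl>
        <!-- assumption: policy recorded under hyphenation -->
        <hyphenation>
          <p>Original spelling has been retained, except that words hyphenated
            across line breaks have been silently re-assembled. The hyphen has
            been retained only where there exist cases of the same word being
            hyphenated in mid-line position.</p>
        </hyphenation>
      </editorialDecl>
    </encodingDesc>
    ```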

  • <encodingDesc> Example (2)

    story occupies more than half a page

    story occupies between a quarter and a half page

    story occupies less than a quarter page

    refers to domestic political events

    refers to foreign political events

    refers to role of women in society

    refers to role of servants in society
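
    These category descriptions would normally live in a <classDecl>. The sketch below groups them into two taxonomies, one for size and one for topic; that grouping and every xml:id value are hypothetical names chosen for illustration.

    ```xml
    <encodingDesc>
      <classDecl>
        <taxonomy xml:id="size">
          <category xml:id="large">
            <catDesc>story occupies more than half a page</catDesc>
          </category>
          <category xml:id="medium">
            <catDesc>story occupies between a quarter and a half page</catDesc>
          </category>
          <category xml:id="small">
            <catDesc>story occupies less than a quarter page</catDesc>
          </category>
        </taxonomy>
        <taxonomy xml:id="topic">
          <category xml:id="politics-domestic">
            <catDesc>refers to domestic political events</catDesc>
          </category>
          <category xml:id="politics-foreign">
            <catDesc>refers to foreign political events</catDesc>
          </category>
          <category xml:id="social-women">
            <catDesc>refers to role of women in society</catDesc>
          </category>
          <category xml:id="social-servants">
            <catDesc>refers to role of servants in society</catDesc>
          </category>
        </taxonomy>
      </classDecl>
    </encodingDesc>
    ```

    A text could then point at a category with, for example, <catRef target="#large #politics-foreign"/> inside its <textClass>.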

  • Profile Description

    A collection of descriptions, categorised only as ‘non-bibliographic’. Default members of the model.profileDescPart class include:

    <creation>: information about the origination of the intellectual content of the text, e.g. time and place

    <langUsage>: information about languages, registers, writing systems etc. used in the text

    <textDesc> and <textClass>: classifications applied to the text by means of a list of specified criteria or by means of a collection of pointers, respectively

    <particDesc> and <settingDesc>: information about the ‘participants’, either real or depicted, in the text

    <handNotes>: information about the hands identified in a manuscript

  • Language and character set usage

    The <langUsage> element is provided to document usage of languages in the text. Languages are identified by their ISO codes:

    English

    French

    Bulgarian in Cyrillic characters

    Romanized Bulgarian
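
    A sketch of a <langUsage> declaring these four languages. The ident values are assumptions: they follow the BCP 47 pattern of an ISO 639 language code optionally followed by an ISO 15924 script code, which is one common way to separate Cyrillic from romanized Bulgarian.

    ```xml
    <profileDesc>
      <langUsage>
        <language ident="en">English</language>
        <language ident="fr">French</language>
        <!-- assumption: plain "bg" implies the default Cyrillic script -->
        <language ident="bg">Bulgarian in Cyrillic characters</language>
        <!-- assumption: script subtag Latn marks the romanized form -->
        <language ident="bg-Latn">Romanized Bulgarian</language>
      </langUsage>
    </profileDesc>
    ```

    The same codes can then be used as values of xml:lang on any element in the text.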

  • Classification Methods

    <textClass> provides a classification (by domain, medium, topic...) for the whole of a text, expressed in one or more of the following ways:

    using <catRef>: direct reference to a locally defined (e.g. in the corpus header) category

    using <classCode>: reference to some commonly agreed and externally defined category (e.g. UDC)

    using <keywords>: arbitrary descriptive terms taken from a bibliographic controlled vocabulary or a tag cloud

  • BNC Example

    W nonAc: humanities arts

    History, Modern - 19th century

    Capitalism - History - 19th century

    World, 1848-1875

    This categorization applies to the whole text. For more fine-grained classification, use @decls on e.g. a <div> element.
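
    A sketch of how these values might sit inside <textClass>, treating the first line as a <classCode> and the remaining three as <keywords> terms. The scheme pointers are hypothetical placeholders, not the identifiers the BNC itself uses.

    ```xml
    <profileDesc>
      <textClass>
        <!-- hypothetical pointer to an externally defined scheme -->
        <classCode scheme="#genre-scheme">W nonAc: humanities arts</classCode>
        <!-- hypothetical pointer to a controlled vocabulary -->
        <keywords scheme="#subject-headings">
          <term>History, Modern - 19th century</term>
          <term>Capitalism - History - 19th century</term>
          <term>World, 1848-1875</term>
        </keywords>
      </textClass>
    </profileDesc>
    ```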

  • Revision Description

    A list of <change> elements, each with @when and @who attributes, indicating significant stages in the evolution of a document.

    Most recent first.

    Can be maintained manually, but better done by means of a CMS (change management system)

    $LastChangedDate: 2010-06-28 09:14:36 +0100 (Mon, 28 Jun 2010) $

    $LastChangedBy: lou $

    $LastChangedRevision: 10346 $
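
    For comparison with the keyword-driven version above, a manually maintained <revisionDesc> might look like the sketch below. The dates, the #LB pointer and the change descriptions are all invented for illustration.

    ```xml
    <revisionDesc>
      <!-- most recent change first -->
      <change when="2010-06-28" who="#LB">Corrected typos in the header</change>
      <change when="2010-06-01" who="#LB">First draft</change>
    </revisionDesc>
    ```

    The who values are pointers, so each is expected to resolve to an element carrying the matching xml:id, for instance a <person> or <name> elsewhere in the header.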

  • Manuscript Description

    Why are manuscripts special?

    Manuscripts are unique objects, often of great cultural or political value.

    Books, by contrast, exist in multiple copies, and can be described adequately by well-established and formalised bibliographic conventions.

    For manuscripts, there are several traditions, often descriptive or belle lettriste, and little consensus.

    Similar concerns apply to other text-bearing objects.

  • Objectives of <msDesc>

    The TEI <msDesc> element is intended for several different kinds of applications:

    standalone database of library records (finding aid)

    discursive text collecting many records (catalogue raisonné)

    metadata component within a digital surrogate (electronic edition)

    tool for ‘quantitative codicology’

  • Catalogue Raisonné

    An <msDesc> can appear anywhere a paragraph can.

    The Arnamagnæan Manuscript Collection

    The Arnamagnæan Collection is widely recognised as one of the most significant collections of early Scandinavian manuscripts in the world…

    Among its more important holdings are:

    In the following manuscript….

  • Having one's cake and eating it

    Two conflicting desires:

    preserve (or perpetuate) existing descriptive prose

    reliable search, retrieval, and analysis of data

    The <msDesc> tries, wherever possible, to do both of these things.

  • Components of a manuscript description

    Within the <msDesc> element comes a required <msIdentifier> element, which groups information identifying the manuscript, followed by an optional <head>, which can be used to provide, in a brief, unstructured way, information on the manuscript's contents etc. These are then followed either by one or more paragraphs (<p>), or one or more of the following specialised elements:

    <msContents>: an itemised list of the intellectual content of the manuscript, with transcriptions of rubrics, incipits, explicits etc., as well as primary bibliographic references

    <physDesc>: groups information concerning all physical aspects of the manuscript, its material, size, format, script, decoration, binding, marginalia etc.

    <history>: provides information on the history of the manuscript, its origin, provenance and acquisition by its holding institution

  • Components of a manuscript description (cont.)

    <additional>: groups other information about the manuscript, in particular, administrative information relating to its availability, custodial history, surrogates etc.

    <msPart>: contains in essence a nested <msDesc>, in cases of composite manuscripts now regarded as constituting a single unit but made up of two or more parts which were originally physically distinct.

    Within each of these elements a number of sub-elements is available; <msContents>, for example, will normally consist of one or more <msItem> elements, each in turn containing specific elements such as <title>, <rubric>, <incipit> and <explicit>, as well as the standard TEI elements <note>, <bibl> and <listBibl> for bibliographic references. As with <msDesc> itself, however, the contents of these first-level and second-level elements need not be this structured, since there is also the option of using paragraphs.

  • Identification (1)

    The <msIdentifier> element

    Traditional three-part specification:

    place (<country>, <region>, <settlement>)

    repository (<institution>, <repository>)

    identifier (<collection>, <idno>)

    Canada

    Ottawa

    Library and Archives Canada

    E.W.B. Morrison

    MG 30 E 81 v. 16
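
    A sketch of the three-part identification applied to the Ottawa example. The assignment of each value to an element is inferred from the place, repository, identifier pattern described on the slide.

    ```xml
    <msIdentifier>
      <!-- place -->
      <country>Canada</country>
      <settlement>Ottawa</settlement>
      <!-- repository -->
      <repository>Library and Archives Canada</repository>
      <!-- identifier -->
      <collection>E.W.B. Morrison</collection>
      <idno>MG 30 E 81 v. 16</idno>
    </msIdentifier>
    ```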

  • Identification (2)

    Alternative or additional names can also be included:

    Danmark

    København

    Det Arnamagnæanske Institut

    AM 45 fol.

    Codex Frisianus

    Fríssbók

  • Intellectual Content

    May simply use paragraphs of text…

    … or a tree of <msItem> elements

    … optionally preceded by a prose summary

    We can describe the content in general terms:

    An extraordinary charivari of heroic deeds and improving tales, including an early version of Guy of Warwick and several hymns.

    or we can provide detail about each distinct item:

    An extraordinary charivari of heroic deeds, improving tales, and hymns.

  • The <msItem> element

    Manuscripts contain identifiable items, usually physically tied to a locus.

    <locus>, if present, must be given first

    then any of the following, in a specified order:

    <author>, <respStmt>, <title>, <rubric>, <incipit>, <explicit>, <finalRubric>, <colophon>, <decoNote>, <listBibl>, <bibl>, …

    … or nested <msItem>s

  • <msContents> with multiple <msItem>s

    fols. 5r-7v: An ABC

    fols. 7v-8v: Lenvoy de Chaucer a Scogan

    fols. 14r-126v: Troilus and Criseyde. Bk. 1:71-Bk. 5:1701, with additional losses due to mutilation throughout
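
    A sketch of these three items as <msItem> elements, each opening with its <locus>. The n attributes and the choice of <note> for the remark on the extent of Troilus are assumptions.

    ```xml
    <msContents>
      <msItem n="1">
        <locus>fols. 5r-7v</locus>
        <title>An ABC</title>
      </msItem>
      <msItem n="2">
        <locus>fols. 7v-8v</locus>
        <title>Lenvoy de Chaucer a Scogan</title>
      </msItem>
      <msItem n="3">
        <locus>fols. 14r-126v</locus>
        <title>Troilus and Criseyde</title>
        <!-- assumption: the extent remark is carried in a note -->
        <note>Bk. 1:71-Bk. 5:1701, with additional losses due to
          mutilation throughout</note>
      </msItem>
    </msContents>
    ```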

  • Physical Description

    An artificial (but helpful) grouping of many distinct items.

    You can simply supply paragraphs of prose, covering such topics as

    <objectDesc>: the physical carrier

    <handDesc>: what is carried on it

    <musicNotation>, <decoDesc>, <additions>

    <bindingDesc> and <sealDesc>

    <accMat>: accompanying material

    Or, group your discussion within the specific elements mentioned above.

    Similarly, within the specific elements, you can supply paragraphs of prose, or further specific elements.

  • The carrier 1

    The <objectDesc> can contain just paragraphs, or <supportDesc> and <layoutDesc>.

    Early modern parchment and paper.

  • The carrier 2

    A more complex substructure with specific elements for <support>, <extent>, <foliation>, <collation>, <condition>. Multiple layouts may also be specified:

    Between 25 and 32 ruled lines.

    Between 34 and 50 ruled lines.
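
    Putting the two carrier slides together, a sketch of an <objectDesc> with a prose support description and two layouts. The form and material attribute values, and the inline <material> tagging, are assumptions; ruledLines takes a minimum and maximum separated by a space.

    ```xml
    <objectDesc form="codex">
      <supportDesc material="mixed">
        <p>Early modern <material>parchment</material> and
          <material>paper</material>.</p>
      </supportDesc>
      <layoutDesc>
        <layout ruledLines="25 32">
          <p>Between 25 and 32 ruled lines.</p>
        </layout>
        <layout ruledLines="34 50">
          <p>Between 34 and 50 ruled lines.</p>
        </layout>
      </layoutDesc>
    </objectDesc>
    ```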

  • <handNote> and <decoNote>

    <handNote> (note on hand) describes a particular style or hand distinguished within a manuscript.

    <decoNote> contains a note describing either a decorative component of a manuscript or a fairly homogeneous class of such components.

  • <handDesc> example (1)

    The manuscript is written in two contemporary hands, otherwise unknown, but clearly those of practised scribes. Hand I writes ff. 1r-22v and hand II ff. 23 and 24. Some scholars, notably Verner Dahlerup and Hreinn Benediktsson, have argued for a third hand on f. 24, but the evidence for this is insubstantial.

  • <handDesc> example (2)

    The first part of the manuscript, fols 1v-72v:4, is written in a practised Icelandic Gothic bookhand. This hand is not found elsewhere.

    The second part of the manuscript, fols 72v:4-194, is written in a hand contemporary with the first; it can also be found in a fragment of Knýtlinga saga, AM 20b II fol.
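
    Where the first example keeps everything in one prose paragraph, this second one can give each hand its own <handNote>. In the sketch below the xml:id values, the hands count and the inline <locus>, <title> and <ref> tagging are assumptions.

    ```xml
    <handDesc hands="2">
      <handNote xml:id="hand-1">
        <p>The first part of the manuscript,
          <locus from="1v" to="72v:4">fols 1v-72v:4</locus>, is written in a
          practised Icelandic Gothic bookhand. This hand is not found
          elsewhere.</p>
      </handNote>
      <handNote xml:id="hand-2">
        <p>The second part of the manuscript,
          <locus from="72v:4" to="194">fols 72v:4-194</locus>, is written in a
          hand contemporary with the first; it can also be found in a fragment
          of <title>Knýtlinga saga</title>, <ref>AM 20b II fol.</ref></p>
      </handNote>
    </handDesc>
    ```

    Giving each hand an identifier makes it possible to point at it from the transcription when the hand changes.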

  • <additions>

    The <additions> element can be used to list or describe any additions to the manuscript, such as marginalia, scribblings, doodles, etc., which are considered to be of interest or importance.

    The text of this manuscript is not interpolated with sentences from Royal decrees promulgated in 1294, 1305 and 1314. In the margins, however, another somewhat later scribe has added the relevant paragraphs of these decrees, see pp. 8, 24, 44, 47 etc.

    As a humorous gesture the scribe in one opening of the manuscript, pp. 36 and 37, has prolonged the lower stems of one letter f and five letters þ and has them drizzle down the margin.

  • <accMat>

    <accMat> (accompanying material) contains details of any significant additional material which may be closely associated with the manuscript being described, such as non-contemporaneous documents or fragments bound in with the manuscript at some earlier historical period.

    A copy of a tax form from 1947 is included in the envelope with the letter. It is not catalogued separately.

  • <history>

    <origin>: where it all began

    <provenance>: everything in between

    <acquisition>: how you acquired it

    <origin> is a datable element and thus has attributes @notBefore and @notAfter, @when etc.

  • <history> Example

    Written in England in the 13th cent.

    On fol. 54v very faint is Iste liber est fratris guillelmi de buria de Roberti ordinis fratrum Predicatorum, 14th cent. (?): hanauilla is written at the foot of the page (15th cent.).

    Bought from the Rev. W. D. Macray on March 17, 1863, for 1 pound 10s.
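
    A sketch of these three statements under <origin>, <provenance> and <acquisition>. The inline tagging and the date attribute values (a whole-century range for the 13th century, an ISO date for the purchase) are assumptions added to show the datable attributes in use.

    ```xml
    <history>
      <origin>
        <p>Written in <origPlace>England</origPlace> in the
          <origDate notBefore="1200" notAfter="1300">13th cent.</origDate></p>
      </origin>
      <provenance>
        <p>On fol. 54v very faint is <quote xml:lang="la">Iste liber est
          fratris guillelmi de buria de Roberti ordinis fratrum
          Predicatorum</quote>, 14th cent. (?): <quote>hanauilla</quote> is
          written at the foot of the page (15th cent.).</p>
      </provenance>
      <acquisition>
        <p>Bought from the Rev. <name type="person">W. D. Macray</name> on
          <date when="1863-03-17">March 17, 1863</date>, for 1 pound 10s.</p>
      </acquisition>
    </history>
    ```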

  • <additional> information

    <adminInfo>: administrative information

    <surrogates>: information about other surrogates, i.e. photographs, digital images etc.

    <accMat>: accompanying material

    <listBibl>: bibliography

  • Administrative information

    record history

    availability

    custodial history

    miscellaneous remarks

    Conserved between March 1961 and February 1963 at Birgitte Dalls Konserveringsværksted.

    Photographed in May 1988 by AMI/FA.

  • And finally

    A <msDesc> can contain <msPart>, essentially a nested <msDesc>, where originally distinct manuscripts or parts of manuscripts have been brought together to form a composite manuscript.

    Amiens

    Bibliothèque Municipale

    MS 3

    Maurdramnus Bible

    MS 6
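
    A sketch of a composite manuscript with one nested part. Identifying the part with its own <msIdentifier> follows current releases of the Guidelines and is an assumption here; releases contemporary with these slides used <altIdentifier> in that position instead.

    ```xml
    <msDesc>
      <msIdentifier>
        <settlement>Amiens</settlement>
        <repository>Bibliothèque Municipale</repository>
        <idno>MS 3</idno>
        <msName>Maurdramnus Bible</msName>
      </msIdentifier>
      <msPart>
        <!-- assumption: the part carries its former shelfmark -->
        <msIdentifier>
          <idno>MS 6</idno>
        </msIdentifier>
        <!-- contents, physical description etc. of this part go here -->
      </msPart>
    </msDesc>
    ```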

  • Conclusions

    The TEI header was originally conceived as something for non-specialist usage but has everything needed for rigorous bibliographic description

    It provides detailed methods for encoding specialist items such as manuscript descriptions or details concerning spoken texts or linguistic corpora

    Standard codes of practice or ways of using it have been developed by particular user communities (e.g. digital librarians, corpus linguists)

    As a ‘primary source of information’ it remains an essential framework for documenting:

      what your text is

      where it came from

      how you encoded it

      how it may be used (technically)

      how it may be used (legally)