1 david nathan eldp training workshop march 2010 archiving
TRANSCRIPT
![Page 1: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/1.jpg)
1
David Nathan
ELDP Training Workshop
March 2010
Archiving
![Page 2: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/2.jpg)
2
Archiving: what do you think of?
![Page 3: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/3.jpg)
3
![Page 4: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/4.jpg)
4
![Page 5: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/5.jpg)
5
![Page 6: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/6.jpg)
6
![Page 7: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/7.jpg)
7
What is a language archive, then?
![Page 8: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/8.jpg)
8
![Page 9: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/9.jpg)
9
What is a digital language archive?
a forum / platform for data providers and data users to negotiate and exchange
a trusted repository created and maintained by an institution with a commitment to the long-term preservation of archived material
has policies and processes for materials acquisition, cataloguing, preservation, dissemination, migration to new digital formats
a collection of managed materials
![Page 10: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/10.jpg)
10
OAIS model
OAIS archives define three types of ‘packages’ingestion, archive, dissemination:
Archive Dissemination
afd_34
dfa dfadf
fds fdafds
afd_34
dfa dfadf
fds fdafds
afd_34
dfa dfadf
fds fdafds
afd_34
dfa dfadf
fds fdafds
afd_34
dfa dfadf
fds fdafds
IngestionProducers Designated communities
![Page 11: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/11.jpg)
11
What is archiving of language materials?
preparing materials in a structured, well-documented, and complete form
building long-term relationships it is not just backup it is not just dissemination/publication it does not define good linguistic practice
![Page 12: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/12.jpg)
12
What can a language archive offer?
Security - keep your electronic materials safe Preservation - store your materials for the long
term Discovery - help others to find out about your
materials, and you to find out about users Protocols - respect and implement sensitivities,
restrictions Sharing - share results of your work, if appropriate Acknowledgement - create citable
acknowledgement Mobilisation - create usable language materials for
communities Quality and standards - advice for assuring your
materials are of the highest quality and robust standards
![Page 13: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/13.jpg)
13
Kinds of language archives
many cross-cutting classifications: Indigenous and local, eg. Squamish Nation,
“language centres” regional, eg. AILLA, Paradisec international, eg. DoBeS, ELAR
associated with research institute, eg. AIATSIS, ANLC
grant-driven deposits, eg. DoBeS, ELAR digital vs physical vs mixed, eg. DoBeS vs
Vienna Sound Archive, ANLC
![Page 14: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/14.jpg)
14
Potential users
depositors – deposit, access or update materials
speakers and their descendants (“majority of users of Berkeley Language Center archive are community members”)
other researchers - comparative/historical linguists, typologists, theoreticians, anthropologists, historians, musicologists etc etc
other “stakeholders”, eg educationalists journalists and the wider public
![Page 15: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/15.jpg)
15
Archives networks and bodies
foundation concepts and technologies from library initiatives, eg. D-LIB http://www.dlib.org/ OAI (Open Archives Initiative) OAIS Open Archival Information Systems
(NASA and space agencies incl JAXA)
Open Language Archives Community (OLAC)
Digital Endangered Languages and Archives Network (DELAMAN) ELAR, DOBES, ANLC, Paradisec, EMELD,
LACITO, AIATSIS, AMPM (Maori)
![Page 16: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/16.jpg)
16
Archives networks and bodies
DELAMAN’s interests and activities include: language archiving training coordination and
syllabus citation of deposits (for academic recognition of
deposited corpora) archive federations (for seamless access to
resources across )
![Page 17: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/17.jpg)
17
Citation examples
Courtesy Heidi Johnson of AILLA
Collection:Sherzer, Joel. "Kuna Collection." The Archive of the Indigenous Languages of Latin America: www.ailla.utexas.org. Media: audio, text, image. Access: 0% restricted.
File/resource:Sherzer, Joel (Researcher). (1970). "Report of a curing specialist." Kuna Collection. Archive of the Indigenous Languages of Latin America: www.ailla.utexas.org. Type: transcription&translation. Media: text. Access: public. Resource ID: CUK001R001.
![Page 18: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/18.jpg)
18
Why is language archiving different?
what is a language? the data is not conventionalised (like $,
age, year of publication etc) – what and how to code?
varying and competing expectations
![Page 19: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/19.jpg)
19
And endangered languages archiving?
extremely diverse context – languages, cultures, communities, individuals, projects
typical source is fieldworkers no established genres difficult for archive staff to manage sensitivities and restrictions extremely high priority
![Page 20: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/20.jpg)
20
Endangered Languages ARchive (ELAR)
one of 3 semi-autonomous programs of the Hans Rausing Endangered Languages Project
staff of 3; archivist, software developer, technician, (research assistants etc)
develop policies, preservation infrastructure, cataloguing and dissemination, facilities, training, advice, materials development and publishing
![Page 21: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/21.jpg)
21
ELAR’s holdings
ELAR currently holds about 50 deposits with a total volume of approx 4 TB.
the average deposit is about 80 GB sizes vary widely, with a small number of huge
deposits. The median size is around 15GB we expect volume to nearly double over the
next 18 months see next slides for distribution of data types
![Page 22: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/22.jpg)
22
ELAR holdings by data type
data types for a 25% sample of holdings (early 2008)
data type by volume (MB) and number of files, sorted by volume
Data typeVolume
(MB) Files
audio 360,411 6,312
video 208,995 895
image 28,592 2,221
msword 223 404
pdf 196 134
eaf 33 176
text 32 781
lex 9 29
trs 5 246
xls 1 19
imdi 1 26
![Page 23: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/23.jpg)
23
The way we were ... ASEDA
Aboriginal Studies Electronic Data Archive, AIATSIS Canberra, founded early 1990s (modelled on Oxford Text Archive)
receive and catalogue electronic materials that were at risk or not accessible lexica grammars texts
![Page 24: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/24.jpg)
24
How things have changed ..
types of data (modalities and genres)now predominantly media / documentation
storage methodsnow “professional”, mass data systems
standardisation and metadatanow various standards for data and metadata
disseminationnow web-based disseminationexpanded influence into practice and workflow
of linguists
![Page 25: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/25.jpg)
25
Why digital?
preservation: digitisation is the only way that media (audio and video) can be preserved for the future
because it can be copied and transmitted with zero loss
cataloguing, sharing, dissemination all facilitated
![Page 26: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/26.jpg)
26
Digital disadvantages
digital data is fragile and ephemeral cost (human, equipment, maintenance) requires strategy and luck to get infrastructure
right preservation depends on file and data formats
depend on tools and software depends on formats (prefer standard, open,
explicit, long-lasting) materials may have to be converted and
migrated some formats require particular software (can
we archive the software?)
![Page 27: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/27.jpg)
27
These issues impact on archive policy
how to balance cost of andling and preservation with value of materials?
how to provide long-term preservation when our funding is time-limited?
![Page 28: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/28.jpg)
28
The archiving process (depositors’ view)
![Page 29: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/29.jpg)
29
Documenter and archive interactions
grant formulation and application communications, questions, advice training archiving services (transfer, conversion
etc) ongoing management of materials
![Page 30: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/30.jpg)
30
Documenter & archive interactions
![Page 31: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/31.jpg)
31
Query/interaction topics
analysis of approx 150 queries from documenters/linguists
![Page 32: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/32.jpg)
![Page 33: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/33.jpg)
33
ELAR Feedback template
ELAR Data Sample EvaluationPrepared for:By:Date:
TEXT - xx fileDocument typeDocument format/layout/data structuresCharacter/language representationLinking/referencesConsistency
![Page 34: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/34.jpg)
34
ELAR Feedback template
AUDIODocument type/formatResolutionQualityEditingLengthAnnotation/transcriptionConsistency
![Page 35: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/35.jpg)
35
ELAR Feedback template
VIDEODocument type/formatResolutionQualityEditingLengthAnnotation/transcriptionConsistency
![Page 36: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/36.jpg)
36
ELAR Feedback template
GENERALFile namingData volumeDeliveryConsistency
![Page 37: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/37.jpg)
37
Example detail (section: Document format)
Use of typography (size, underlining, bold, spaces etc) to make headings and other structures is weak - at least Styles should be used (with complete consistency).
Tables to represent interlinear data is reasonably appropriate, although would need to be converted later.
Is it clear from this document, or somewhere else, where to look up codes etc, such as the speaker initials?
While the language is consistently labelled in the interlinear section, it is identified only by the alternation in font in the first section.
![Page 38: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/38.jpg)
38
Example detail (section: Audio quality)
AD-MD03a 4Noe Song thami miya.wav - quality good.AD-MD04b 33Boa Sr. LongNarrativeOnTsunami.wav -
quality reasonable, but background hiss is too loud in proportion to the signal. Was this was part of your original recording (on what equipment?) or was introduced by digitisation, in which case it would be a good idea to try de-digitising.
AD-MD05b 34Peje Phonetic Variation.wav - quality quite good. Stereo separation of voices is nice.
CIILQ Seasons Contd 699-703.wav - suffers a number of faults, including severe clipping (overmodulation), background noise, microphone physical handling, and poor acoustic representation (probably due to poor microphone and/or recorder?).
![Page 39: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/39.jpg)
39
Audio evaluation using Dobbin
software from Cube-Tec who make Quadriga
audio evaluation, conversion and reporting
![Page 40: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/40.jpg)
40
Dobbin
![Page 41: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/41.jpg)
41
Dobbin
![Page 42: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/42.jpg)
42
Dobbin
![Page 43: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/43.jpg)
43
Dobbin
![Page 44: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/44.jpg)
44
Dobbin
![Page 45: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/45.jpg)
45
Dobbin
![Page 46: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/46.jpg)
46
What can you archive (at ELAR)?
media - sound, video graphics - images, scans text - fieldnotes, grammars, description,
analysis structured data - aligned and annotated
transcriptions, databases, lexica metadata - structured, standardised
contextual information about the materials
![Page 47: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/47.jpg)
47
Archive objects
an “object” could be a file, a set of files, a directory, a “session” or a set of files with relationships between them
these are often called “bundles” like all structures, these should be made
explicit eg through metadata our new catalogue system will provide a facility
to create and label bundles
![Page 48: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/48.jpg)
48
Data “portability” (Bird & Simons 2003)
data should also be “portable” (Bird & Simons “Seven Dimensions ...”) complete explicit documented preservable transferable accessible adaptable not technology-specific (also appropriate, accurate, useful etc!!)
![Page 49: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/49.jpg)
49
Archive material should be selected
example: Depositor’s question: How much video can I archive?
answer: ...
however, unlikely that linguist is in position to plan and
consistently create excellent video, so selection is unavoidable
data has always been edited and selected!
![Page 50: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/50.jpg)
50
(... selection)
in your linguistic work you also: selected labeled transformed/processed/edited added, corrected, expanded made links made or assumed relationships between
“whole” and processed units; invented labels, IDs, scope etc
imposed formats
![Page 51: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/51.jpg)
51
File organisation example 1
IPF10011-Disk3-Story-WulaTuki-LunarEclipse
IMDI_3.0.xsd
WulaTuki_LunarEclipse.eaf
WulaTuki_LunarEclipse.imdi
WulaTuki_LunarEclipse.imdi.backup
WulaTuki_LunarEclipse.pfs
WulaTuki_LunarEclipse.txt
WulaTuki_LunarEclipse.wav
![Page 52: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/52.jpg)
52
File organisation example 2
/labelling-system.doc
AngryD-Bsi
AngryD-Bsi.pdf
AngryD-Bsi.wav
AngryD-Bsi.doc
![Page 53: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/53.jpg)
53
File organisation example 3
/
archivist_notes.txt
ELAN transcription key FTG0025.pdfOverview metadata FTG0025.xls [open]
Kay07-aud
Kay07-aud-jul03a.wav
Kay07-aud-jul03b.wav
Kay07-aud-jul03c.wav
![Page 54: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/54.jpg)
54
Metadata
![Page 55: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/55.jpg)
55
Metadata
Metadata the data about data that enables the
management, identification, retrieval and understanding of that data
reflects the knowledge and practice of data providers
defines and constrains audiences and usages for data
documentation’s goals heighten the importance of metadata
![Page 56: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/56.jpg)
56
Metadata formats
common or standard: IMDI (‘ISLE Metdata Initiative’, from DoBeS) OLAC (Open Language Archives Community) EAD, and others
ELAR: has created its own set, currently in implementation deposit-wide metadata in deposit form file level metadata (will be) by web form also, depositor’s own metadata
![Page 57: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/57.jpg)
57
On metadata formats
each depositor can also have different metadata!
types of metadata are relative to each project, consultants, community ...
our goal: to maximise the amount and quality of metadata
quality and extent is more important than standards and comparability
many depositors are sending extensive metadata in a variety of formats including spreadsheets
![Page 58: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/58.jpg)
58
Types of metadata
depositor's / delegates' details descriptive metadata administrative metadata preservation metadata access protocols metadata for individual files
![Page 59: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/59.jpg)
59
Depositors and delegates
name address contact details (telephone, fax, email, URL) role affiliation date of birth nationality
![Page 60: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/60.jpg)
60
Descriptive metadata
title, description, subject, summary keywords subject language, community location time span
![Page 61: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/61.jpg)
61
Administrative metadata
project details funding and hosting institutions
details of external copies modifications and status details of accession agreement
cf. deposit form
access access protocols (see elsewhere) group membership identification
![Page 62: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/62.jpg)
62
Preservation metadata
carrier media formats, size provenance (source/history)
![Page 63: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/63.jpg)
63
File-level metadata
media files duration, file size MIME type, content type
text files font, character set, encoding format, markup
access protocols
![Page 64: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/64.jpg)
64
Access protocol
sensitivities, restrictions: identification, description and implementation
deposit, file or object-level protocol depositor-oriented change/manage protocol over time delegate other rights holders sunset clause
![Page 65: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/65.jpg)
65
Protocol grows naturally with documentation
focus on recorded data » more people, more genres, less researcher knowledge
community participation » framework for speakers to shape documentation process and products
mobilisation » selecting, juxtaposing; community participation
focus on revitalisation » which language to teach? who to host and teach? who can learn? etc
time » significance and sensitivities change over time
access » increasing scope for dissemination, control of IP
![Page 66: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/66.jpg)
66
Other kinds of metadata
information to make resources accessible to community members genres eg songs languages, eg community language materials for language teaching and learning
types of metadata are relative to the particular project, consultants, community ...
![Page 67: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/67.jpg)
67
Archiving and data management
Most data-related issues are properly part of linguistic data management
There are now few data-related issues that are archive-specific
But teaching curricula, training, and practices need time to catch up
Ultimate goal of documenting languages well means that we must find the optimal “division of labour” in each case
![Page 68: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/68.jpg)
68
ELAR assists depositors
preserve your deposited materials implement your access restrictions etc provide advice, general and specific assistance, eg data conversion provide web-based deposit management allow updates and additions provide some equipment and services on a case by case basis, develop
resources
![Page 69: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/69.jpg)
69
What is required to make a deposit?
resource(s) for an endangered language it could be just one file
inventory / metadata deposit form
an online version will be available soon
deposits can be updated, supplemented, metadata added/modified
![Page 70: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/70.jpg)
70
How do depositors deliver data?
Hard disks we return them we send them out some grant applicants factor them into grants
Email good for samples for evaluation OK for most text materials
Flash cards and USB sticks A web upload facility will be
available later Web download
![Page 71: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/71.jpg)
71
What about CDs and DVDs?
we have found CDs, andespecially DVDs, to bevery unreliable DVD fail rate about 10%
cause confusion as filesare allocated to fit on disks, not according to corpus structure
create a lot of work fordepositors and for ELAR
![Page 72: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/72.jpg)
72
We ask depositors to
manage materials well collect and provide protocol information deliver materials, metadata send trial samples etc (funded grantees) not withhold materials share/manage/delegate custodianship of
materials maintain relationships with language
stakeholders and ELAR
![Page 73: 1 David Nathan ELDP Training Workshop March 2010 Archiving](https://reader035.vdocument.in/reader035/viewer/2022062421/56649dd45503460f94acb6ce/html5/thumbnails/73.jpg)
73
ELAR online
We now have ELAR online archive, although data is only just starting to be released to public view: http://elar.soas.ac.uk/
The archive has been implemented using a Content Management System, in this case Drupal: open-source web software based on PHP, MySQL and JavaScript implements user, role and group-based
access to materials