personal digital archiving 2015 - nyu - workshop
TRANSCRIPT
![Page 1: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/1.jpg)
This work is licensed under a Creative Commons Attribution-NonCommercial 3.0 Unported License.
End-to-end digital preservation for diverse collections
Personal Digital Archiving – 04-26-2015Courtney C. Mumma, MAS/MLIS, US and International Community Development
+
![Page 2: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/2.jpg)
lead developers of Archivematica, Access to Memory (AtoM) and Binder
archivists, librarians, technologists
![Page 3: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/3.jpg)
core values
innovation and smart automation
leverage existing technology
transparency
interoperability and collaboration
grounded in archival practice
open source, including other projects
handshakes / integration
![Page 4: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/4.jpg)
bounty model
![Page 5: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/5.jpg)
hybrid public access and content management
manage accessions, taxonomies, multiple repositories, restrictions and rights, authority records
(ISAAR)
access derivatives including streaming video
multi-lingual description & ISAD(G), RAD, DACS, EAD export, MODS
link to preserved archival packages, sync metadata and PREMIS rights
![Page 6: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/6.jpg)
![Page 7: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/7.jpg)
FOSS digital preservation (AGPLv3)
good practices and standards
no barrier to user groups, community or documentation
consistent, system independent Archival Information Packages (AIPs)
Bagit, Dublin Core, METS, PREMIS
![Page 8: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/8.jpg)
![Page 9: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/9.jpg)
system synthesis
active integrations:
– DSpace
– CONTENTdm
– Islandora/Fedora
– Archivists' Toolkit
– LOCKSS
– DuraCloud
– OpenStack
– TRIM
on-going integrations:
– ArchivesSpace● Bentley● RAC
– Dspace
– Hydra
– Arkivum
– BitCurator
– Dataverse
![Page 10: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/10.jpg)
A flexible open-source application
for standards-based description and access
Access to Memory
![Page 11: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/11.jpg)
What is AtoM?AtoM stands for Access
to Memory.
It is a web-based, open source application for standards-based archival description and access in a multilingual, multi-repository environment.
Web-based
Open source
Standards-based
Multilingual
Multi-repository
![Page 12: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/12.jpg)
Web-based: platform independent
Browser-based user interface.
• Anyone with access to a browser (e.g., Chrome, Internet Explorer, Firefox, Safari etc.) has access to all the features and functionality of the AtoM application.
Platform independent application.
• The application runs on a web server that can be installed and run on many platforms.
![Page 13: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/13.jpg)
Standards-based description: User-friendly content standard edit templates
Templates: ISAD(G), DACS, RAD, DC, MODS ISAAR-CPF, ISDIAH, ISDF
←→
![Page 14: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/14.jpg)
Multi-lingual interfaces
![Page 15: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/15.jpg)
Multi-repository support: per-institution theming
![Page 16: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/16.jpg)
Archivematica integration
![Page 17: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/17.jpg)
Overall Workflow
describe and manage all hybrid content in AtoM
preserve digital content using Archivematica & hand off access copies and metadata to AtoM
provide access (digital copies and descriptions) and links to preserved content in AtoM
![Page 18: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/18.jpg)
A flexible open-source application
for standards-based digital preservation
![Page 19: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/19.jpg)
Archivematica makes OAIS (ISO 14721)
Archival Information Packages (AIPs)
– integrity & virus checks, format identification, characterization & metadata extraction, forensic activities, validation, arrangement, transcription, etc
– normalization to sustainable formats on ingest + preservation of the original file
– include or add metadata, including PREMIS rights and restrictions
– storage agnostic
– bagged AIP with logs and metadata (METS.xml)
![Page 20: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/20.jpg)
the AIP:so much bigger on the inside
value add to storage: metadata, logs, formats and structure to protect against software
obsolescence
![Page 21: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/21.jpg)
the METS.xml file
<dmdSec> (descriptive metadata) Dublin Core XML<amdSec> (administrative metadata) <techMD> PREMIS: object <digiProvMD> PREMIS: events PREMIS: agents <rightsMD> PREMIS: rights<fileSec> (a list of the files and their roles and relationships)<structMap> (a representation of the physical structure of the AIP)
![Page 22: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/22.jpg)
question break .......
then we get knee deep in computers
![Page 23: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/23.jpg)
identify your test content
✔ What✔ Where
✔ How much
![Page 24: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/24.jpg)
what types of digital content?
• born-digital
― government and university records, student artwork, e-theses and dissertations
― diverse formats: audiovisual, textual, geospatial, websites, presentations, images, databases
• digitized
― books, newspapers, images, video from vendors
― pre-made access and preservation copies
• submission documentation & metadata
― permission forms, accession records, pictures of digital media, etc.
― descriptive MD from other systems
![Page 25: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/25.jpg)
where is your digital content?
• stored locally
• in other systems
― ie CONTENTdm, Dspace, DuraCloud, Islandora
• on detached media
― floppies, hard drives, cds, dvds, usb sticks, etc.
• packaged
― Bagged using Library of Congress BagIt specification
― Forensic images
― Zipped or tarballed
![Page 26: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/26.jpg)
how much is there?
• Size: gigabytes, terabytes, petabytes
― Sum total of all material
― Size of distinct content sets
― Biggest single digital objects
• Quantity
― Sum total of all files
― Number of files in distinct content sets
• Resource capacity
― Space allocated to processing and storage locations
― Consider ideal transfer, SIP and AIP sizes
![Page 27: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/27.jpg)
asking questions of your content
• descriptive metadata?
― needs preserved? already existent or need to add? complex or simple objects?
• submission documentation?
― donor agreements, pictures of physical media, licenses, etc
• access copies?
― already have them? what system to send/store?
• generate preservation copies?
― already have them?
• service masters?
![Page 28: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/28.jpg)
asking questions of your content
• directory structure important (Original Order)?
• keep the package AND the content, or just one?
• rights information?
• is content Bagged? in DSpace? a forensic image? (Transfer type)
• how large should my archival packages be?
• will my archival packages have a 1:1 relationship with my transferred digital content? will my content be arranged into multiple packages or combined into one? (Arrangement workflow)
![Page 29: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/29.jpg)
processing in Archivematica
• determine readiness by pilot testing content streams using the methods just described
• prepare content for transfer:
– put it in a folder in a transfer source directory
– prepare a metadata CSV for simple or complex objects
– prepare submission documentation
– identify pre-made access, preservation and/or service copies
– select the right workflow: standard, DSpace, forensic image and pre-configured settings (more on this soon)
![Page 30: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/30.jpg)
now let's see it!
![Page 31: Personal Digital Archiving 2015 - NYU - Workshop](https://reader030.vdocument.in/reader030/viewer/2022032505/55c5ce9dbb61eb33418b4742/html5/thumbnails/31.jpg)
archivematica.org & accesstomemory.org
Questions??
Thank you!