archivesspace-archivematica-dspace workflow integration
Max Eckard (@max_eckard), Digital Preservation 2016
#digipres2016 #r1c.
ArchivesSpace-Archivematica-DSpace
WORKFLOW INTEGRATION
Ye Olde Days
1997-2009
● Highly manual procedures for born-digital content
● Very limited resources
2010-2011
● MeMail project (email preservation grant)
● Additional staff and storage infrastructure
● Developed more robust workflows (still manual)
2011-2014 (and ongoing)
● Automation of key steps in workflow: AutoPro!
● Standardization of metadata creation/collection
Available Community Solutions in 2013 (and today!)
● Archival management system
○ Creates accession records, tracks locations, generates EAD
● Ingest tool
○ Produces AIPs, extensive technical and preservation metadata
● Repository for preservation and access
○ Provides persistent URLs, secure/managed storage, access controls
Goals
Facilitate creation/reuse of metadata
Streamline the ingest and deposit of content in repository
Find solutions that meet Bentley needs but are flexible and scalable for others
● Modular so that institutions may adopt some, none or all
● Employ open standards so that other repository platforms could be used
Share code and documentation with archives and digital preservation communities
Key Development Tasks*
● Appraisal Tab
● ArchivesSpace Integration
● DSpace Integration
*thank you, thank you, !
Appraisal Tab
Search the Backlog
● Similar to searching in the Ingest tab of current version
● Among a number of new features for managing a backlog
Characterize Content (File Formats)
● Entire transfer, folder within a transfer, or individual files
● Toggle between report and visualization
● See format information as table or pie chart
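The format report described above can be pictured with a small sketch. It is not Archivematica's implementation: as a loud assumption, it treats the lowercased file extension as a stand-in for an identified format, and the function name `format_report` is hypothetical. It only shows how counts and percentages for a table or pie chart could be tallied.

```python
from collections import Counter
from pathlib import PurePosixPath


def format_report(paths):
    """Return (format, count, percent) rows, most common format first."""
    # Assumption: the lowercased extension stands in for an identified format.
    formats = Counter(
        PurePosixPath(p).suffix.lower().lstrip(".") or "(none)" for p in paths
    )
    total = sum(formats.values())
    return [
        (fmt, count, round(100 * count / total, 1))
        for fmt, count in formats.most_common()
    ]


if __name__ == "__main__":
    sample = ["a/report.PDF", "a/photo.jpg", "b/scan.jpg", "b/README"]
    for fmt, count, pct in format_report(sample):
        print(f"{fmt:<8}{count:>4}{pct:>7}%")
```

The same rows could feed either view: printed as a table, or passed as slice sizes to a charting library.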
Examine Individual Files
● Apply facets
● Format facet populates File List with files of that format
● Browse and preview content
○ If browser has a viewer, it will appear
○ All files can be downloaded for viewing
Identify Sensitive Data
● Examine Contents tab displays bulk_extractor logs
● Personally Identifiable Information
● Credit Card numbers
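To give a feel for what a credit card hit involves, here is a minimal sketch of one common approach: find digit runs of plausible length, then keep those that pass the Luhn checksum. This is not bulk_extractor's scanner and does not read its logs; the regular expression and the function names `luhn_valid` and `find_card_numbers` are illustrative assumptions.

```python
import re

# Assumption: candidates are 13-19 digits, optionally separated by single
# spaces or hyphens. Real scanners apply many more checks than this.
CANDIDATE = re.compile(r"\b\d(?:[ -]?\d){12,18}\b")


def luhn_valid(digits):
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for i, ch in enumerate(reversed(digits)):
        d = int(ch)
        if i % 2 == 1:  # double every second digit, counting from the right
            d *= 2
            if d > 9:
                d -= 9
        total += d
    return total % 10 == 0


def find_card_numbers(text):
    """Return digit strings in text that look like card numbers and pass Luhn."""
    hits = []
    for match in CANDIDATE.finditer(text):
        digits = re.sub(r"[ -]", "", match.group())
        if luhn_valid(digits):
            hits.append(digits)
    return hits
```

A checksum pass only marks a candidate for human review; it does not prove the number is a real card.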
Tag Content
● Backlog, Analysis or File List pane
● Use cases
○ Tag for arrangement in a specific series or file
○ Tag for sensitive or restricted content
○ Tags as a simple aide-memoire--it’s like a virtual Post-it note!
ArchivesSpace Integration
Search/Browse ArchivesSpace Resources
● ArchivesSpace configuration set in Administration
● Search by title or identifier
● Browse relationships
Create/Update/Delete Archival Objects
● Create/Update Archival Objects with minimal metadata
○ Title
○ Level
○ General note
○ Conditions governing access note
○ Start date, end date, date expression
● Delete Archival Objects
● Written immediately to ArchivesSpace via API
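The minimal metadata listed above might map onto an ArchivesSpace record along these lines. This is a sketch, not the code Archivematica uses: the helper `build_archival_object` is hypothetical, and the field names (`jsonmodel_type`, `note_multipart`, the `odd` and `accessrestrict` note types, the `date` fields) reflect my understanding of the ArchivesSpace JSONModel schema and should be checked against the API documentation for the version in use. It only builds the payload and makes no network call.

```python
import json


def build_archival_object(resource_uri, title, level, general_note=None,
                          access_note=None, begin=None, end=None,
                          expression=None):
    """Build a JSON-serializable archival object record (illustrative only)."""
    def multipart_note(note_type, text):
        # Assumed shape of a multipart note with one text subnote.
        return {
            "jsonmodel_type": "note_multipart",
            "type": note_type,
            "subnotes": [{"jsonmodel_type": "note_text", "content": text}],
        }

    record = {
        "jsonmodel_type": "archival_object",
        "title": title,
        "level": level,
        "resource": {"ref": resource_uri},
        "notes": [],
        "dates": [],
    }
    if general_note:
        record["notes"].append(multipart_note("odd", general_note))
    if access_note:
        record["notes"].append(multipart_note("accessrestrict", access_note))
    if begin or end or expression:
        date = {"jsonmodel_type": "date", "label": "creation",
                "date_type": "inclusive"}
        # Only include the date fields that were actually supplied.
        for key, value in (("begin", begin), ("end", end),
                           ("expression", expression)):
            if value:
                date[key] = value
        record["dates"].append(date)
    return record


if __name__ == "__main__":
    payload = build_archival_object(
        "/repositories/2/resources/1", "Correspondence", "series",
        access_note="Restricted until reviewed.",
        begin="1997", end="2009", expression="1997-2009",
    )
    print(json.dumps(payload, indent=2))
```

The resource URI in the usage example is a placeholder. In a real integration the payload would be sent to the repository's archival objects endpoint in an authenticated session.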
Associate Digital Objects with Archival Description
● Drag and drop functionality
● Folders or files
● Once associated, digital objects are struck through
Add PREMIS Rights Statements
● Create Basis and Acts
● Will be used to set access profile in repository
● Working with developers of ArchivesSpace to expand Rights module
DSpace Integration
Deposit to DSpace
● Tell Archivematica which DSpace and which collection
● AIP Repackaging
○ metadata.7z
○ objects.7z
● Deposits to DSpace
● Applies access restriction to metadata
● Newly minted handle is associated with Digital Object in ArchivesSpace
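The repackaging step splits one AIP into a metadata package and an objects package. A rough sketch of the partitioning logic follows. It assumes, hypothetically, that content files live under `data/objects` inside the AIP and that everything else counts as metadata; the actual rules Archivematica applies may differ. The function name `split_aip` is illustrative, and the sketch stops short of writing the `.7z` files, since Python's standard library has no 7z support.

```python
from pathlib import PurePosixPath


def split_aip(paths, objects_dir="data/objects"):
    """Partition AIP-relative paths into 'objects' and 'metadata' groups."""
    groups = {"objects": [], "metadata": []}
    prefix = PurePosixPath(objects_dir).parts
    for p in paths:
        parts = PurePosixPath(p).parts
        # Compare whole path components so "data/objects2" is not a match.
        key = "objects" if parts[:len(prefix)] == prefix else "metadata"
        groups[key].append(p)
    return groups


if __name__ == "__main__":
    aip = ["bagit.txt", "data/METS.xml", "data/objects/letter.pdf"]
    for name, members in split_aip(aip).items():
        print(name, members)
```

Keeping the two packages separate is what lets the repository apply a different access restriction to each.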
Systems of Record
AKA Letting Each System Do Its Thing
● Administrative, descriptive and rights metadata
● Technical and preservation metadata, reconstructing the AIP
● Manage content and enforce access restrictions
DCC Curation Lifecycle Map and Dataflow Diagram
Lifecycle stages: Create or Receive; Appraisal & Selection; Ingest; Preservation Action; Store; Access, Use & Reuse; Transform
Dataflow steps:
● Archivematica (Transfer)
● ArchivesSpace (Accession)
● Archivematica (Appraisal)
● ArchivesSpace (Resource)
● Archivematica (Ingest)
● Archivematica (Storage Service)
● DSpace (Item)
● ArchivesSpace (Digital Object)
● DLXS (EAD)
Connections:
● via ArchivesSpace REST API, PREMIS Rights Statements (forthcoming)
● via ArchivesSpace REST API
● via SWORD v2, DSpace API
● via Encoded Archival Description (EAD) (export and import)
These Days
● Wrapped up on October 31, 2016--still implementing locally
● Released as part of Archivematica 1.6
● Follow along at archival-integration.blogspot.com
● “...initial foray that will improve as more institutions employ the Appraisal and Arrangement tab and adapt it to local needs or integrate new functionality.”
○ Treemap visualizations
○ Brunnhilde integration
○ Named Entity Recognition (NER), Natural Language Processing (NLP), topic modeling
Thanks! Questions?
archival-integration.blogspot.com
@UMBHLCuration