hello islandora building a digital repository nov 30, 2016 v6
TRANSCRIPT
Hello Islandora: Building a Digital Repository Part One: Panel DiscussionNew York Academy of Medicine (The Academy)November 30, 2016
Introductions
Erin Tripp, discoverygarden inc.Librarian and business development
manager at discoverygarden. Erin has managed more than 40 digital
repository projects.
❏ Currently enthralled with on the research data management applications of Islandora
❏ Particular interest in accommodating funder requirements and large datasets in obscure file formats
❏ Participating in the JISC Research Data Management Shared Service Framework and collaborating on the University of JISC testing instance of Islandora
AgendaIslandora Overview
Introducing our Panelists
Panelist Project Overviews
Panel Discussion - Questions
Top Takeaways
The Technology Islandora
The Technology
Presentation & Collaboration
Drupal is the leading open source content management system with over 30,000 user contributed modules from almost 100,000 active community members.
Drupal serves as the presentation and collaboration layer in Islandora.
Islandora is a set of Drupal modules which allow users to manage and preserve digital assets.
The Technology
Search & Discovery
Solr powers some of the most heavily-trafficked websites and applications in the world.
Key features include:
Full-text search
Search faceting & filtering
Highly scalable/Fault tolerant
Near real-time indexing
The TechnologyStorage & Preservation
Fedora Commons is purpose built for data preservation and long-term data accessibility.
Key features include:
Auditing & Fixity checks
RDF Support
Scales to millions of objects
Support for virtually any filetype
Files are readily accessible (no lock-in)
IslandoraOpen Source Digital
Repository Framework
Organizations can create robust digital repository systems tailored to their specific needs and grow the system to handle virtually limitless amounts of data.
Now, for introductions...
Introducing Martha Tenney
Martha Tenney, Barnard CollegeMartha is the Associate Director of
the Barnard Archives and Special Collections and oversees digital
archives projects.
❏ She manages selection and digitization of archival materials, works on metadata and design for the digital collections, and collaborates with Ben Rosner on development and functionality for the digital collections site.
❏ She holds a Master's in Information Science with a concentration in Digital Archives, and her undergraduate degree is in Sociology.
❏ Martha’s research interests include online communities and Web archiving, artists' archives, and human rights archives.
Barnard College
BarnardA liberal arts college for women. Affiliated with Columbia University.
Our collection includes over 125+ years of Barnard’s incredible history.
About 70k individual archival objects representing: Yearbooks, Oral Histories, Manuscripts, Photographs, Special Collections and Projects.
Involved in the community.
Solr Collections
Curated Collections Pages for Islandora content.
Introducing Chris Stanton
Chris Stanton, METROChris is the Digital Projects and
Metadata Librarian at the Metropolitan New York Library Council
(METRO), where he works on METRO's hosted digital collection
service, Digital Culture of Metropolitan New York (dcmny.org).
❏ Chris is primarily responsible for all aspects of metadata creation and editing as well as facilitating the ingest of partner content into the repository.
❏ At METRO, Chris also facilitates the harvesting and ingestion of metadata records from libraries, archives and cultural heritage institutions throughout New York State into the Empire State Digital Network (ESDN) repository for contribution to the Digital Public Library of America.
Metropolitan New York Library Council
Range of Content
Partners include a variety of institutions throughout New
York and Westchester County
Digitization and Support
Introducing Henry Raine
Henry Raine, NYHSHenry Raine is the Director of Digital
Projects and Library Technical Services at the New-York Historical Society, where he has worked since
1997 managing large-scale cataloging, archival processing, and
digitization projects.
❏ He recently oversaw the migration of the library’s digital collections to Islandora and the development of the N-YHS Islandora site.
❏ Prior to working at the N-YHS, he held positions at the Folger Shakespeare Library, the National Library of New Zealand, and the Library of Congress.
❏ Active in several professional organizations, he was Chair of the Rare Books and Manuscripts Section of the Association of College and Research Libraries in 2010-2011 and served on the Board of Directors of the Ephemera Society of America from 2010 to 2015.
New York Historical Society - Project Objectives
Project Objectives
Build a flexible, customizable digital repository for the entire institution, not just the library
Allow institutional branding
Realize the long-term goal of searching across library and museum collections
Wean from proprietary software and become part of a open-source community
Content Migration
102,773 images total
Migrated primarily historical manuscripts and photographs
CONTENTdm collections (Dublin Core metadata)
Legacy Collections (MARC metadata)
New York Historical Society - Next Steps
Introducing Robin Naughton
Robin Naughton, PhD, The AcademyRobin Naughton, PhD is Head of
Digital for the New York Academy of Medicine Library. She manages the
digital program for the library, including digitization efforts and
library systems.
❏ Since starting in June 2015, she has worked on migrating the Library’s digital collections to Islandora and is in the early phases of development.
❏ Prior to joining the Academy, Dr. Naughton was a Digital Consultant in educational technology and has worked with LearningExpress, an EBSCO company, Oxford University Press, English Language Teaching, and Kaplan Test Prep and Admissions.
❏ She has managed digital library migrations, interactive educational products (web & mobile) created for public and state libraries, the transition of print to eBooks, and eLearning courses.
The New York Academy of Medicine
Building Digital
CollectionsDigitizing and Migrating
Helfand Pharmaceutical Trade Card Collection
Building Digital
CollectionsInternet Archive Book
Viewer
Rare and Historical Books
Q&A Panel Discussion
Question 1Getting started
Henry,
Tell us about the decision to move forward with a new
digital repository.
Why did you go ahead with it? What were the main factors in
your decision?
Question 2Impact
Robin,
Tell us about your experience moving to an Open Source
software.
What were the surprises, challenges/ successes?
Question 3Community
Martha,
Open Source has a lot to do with community. Tell us about
your interactions with the Islandora Community.
What resources were most valuable?
Question 4Customization
Chris & Martha,
Tell us about your decision to customize Islandora.
What were the considerations/ implications/ benefits?
Question 5Data modelling
Henry,
Tell us about the migration process.
What should one take into consideration before starting a
migration?
Question 6Impact
Chris,
Tell us about the impacts of harvesting your repositories.
How has it impacted contributors and users?
Question 7Impact
Martha,
The Barnard College repository has been in production longer than the others. Tell us about the impacts of your repository
project overall.
Have there been engagement, research or other benefits?
Question 8The Future
Panelists,
Tell us where you’ll go next with your repositories.
What are your plans and your goals?
Curious? Open to the Floor for Questions
Top Takeaways
Group Discussion
● What have been the easy parts?
● What have been the most difficult tasks?
● What resources did you need or find helpful?
● What was your AH HA! moment?
Thanks for coming! Contact me at: [email protected]
Lunch Break60 minutes
Hello Islandora: Building a Digital Repository Part Two: WorkshopNew York Academy of Medicine (The Academy)November 30, 2016
Welcome!Agenda
• Islandora Overview
• Solution Packs & cModels
• Ingest Methods
• Configuring Search Tool
• Applying Restrictions
• Harvesting via OAI-PMH
Islandora is used by some of the most prestigious organizations in the world...
IslandoraOpen Source Digital
Repository Framework
Organizations can create robust digital repository systems tailored to their specific needs and grow the system to handle virtually limitless amounts of data.
discoverygarden
Removing Barriers to Using Open Source
• Service Provider
• Launched in 2010
• 92% of Islandora code is written by discoverygarden on behalf of customers
• Uses Drupal Coding Standards, Scrum Process, QA (Selenium, Travis CI)
Case Studies RDM, Archives, IRs
Case StudyUniversity of Prince Edward
IslandResearch Data Management
Repository to manage the entire lifecycle of research data.
Project features: Drupal data management
planning toolsAn open source dropbox-like
utility for “working” data called Pydio
Workflow states for Public, Embargoed, or Archive content
Minting DataCite DOIs for any published materials
https://data.upei.ca/
Case StudyNational Baseball Hall of
Fame and MuseumSpecial Collections
Building the digital presence of the Library and Museum under a unified DAMS with 3,500,000+ Objects
Large-scale architecture
Preservation Strategy
Integration of Library Cataloging Functions
Museum Functions
Workflow Development
Migration and Consolidation
Integration with Existing Portals and Apps
Training & Knowledge Transfer
http://collection.baseballhall.org/
Case StudyBoston College
Institutional Repository
Using Islandora as an Institutional Repository. Launched in 2015. Project features:
Focused on copyright management and embargo
Representing metadata in MODS and embedded ETD-MS
Preserving and exposing assets related to scholarly publications - research data
http://dlib.bc.edu/
The Technology
Drupal + Solr + Fedora
Ingest Methods
Creating, Reading, Updating and Deleting Content
Content Models act as a template for adding digital assets to the repository
E.g. Defines content model, workflow, metadata entry forms, derivative generation, viewer, display, etc.
Solution Packs are modules that apply content models and manage CRUD operations
Create, Read, Update, Delete
Ingest Methods
Over the Web or on the Command Line
Individual ingest available over the web
2 GB file limit
Batch Ingest modules over the web
Zip File Importer
Four Batch Citation Uploaders
2 GB file limit
Scripted ingests on the Command line for large files or large batches
British Columbia Electronic Library Network (BC ELN)Ingest MethodsIngest Methods
Discovery Settings Search Results,
Facets, Advanced Search,
Record Display and more
Basic/ keyword search out of the box
Easy to configure modules for collection search, advanced search, facets, sort, and custom record display
External discovery modules for being indexed by Google, Google Scholar, WorldCat, Summon and more!
British Columbia Electronic Library Network (BC ELN)Ingest MethodsDiscovery Settings
Access Controls
Applying Embargoes or Restrictions
on Repository Content
Islandora supports three methods for applying embargoes:
• Active/ Inactive State
• XACML Security Policy Editor
• Scholar Embargo /IP Embargo
British Columbia Electronic Library Network (BC ELN)Ingest MethodsAccess Controls
HarvestingIncreasing Access to your
Collections
Data Providers are repositories that expose structured metadata via OAI-PMH= Repository
Service Providers then make OAI-PMH service requests to harvest that metadata = Harvester
British Columbia Electronic Library Network (BC ELN)Ingest MethodsHarvesting
Vocabulary
Request/ Verb/ Service The action that the service provider (harvester) is requesting from
the data provider (repository)
Response Size The maximum number of records to issue per response
Resumption Token
When a request returns records greater than the response size a resumptionToken is issued such that the service provider can resume harvesting from where it left off
Vocabulary… continued
IdentifyThis request used to retrieve information about a repository. Some of the information returned is required as part of the OAI-PMH. Example: YourSite/oai2?verb=Identify
ListMetadataFormatsThis request is used to retrieve the metadata formats available from a repository.Example: YourSite/oai2?verb=ListMetadataFormats
Vocabulary… continued
ListRecordsThis request is used to harvest records from a repository. Optional arguments permit selective harvesting of records based on set membership and/or datestamp. Example: YourSite/oai2?verb=ListRecords&metadataPrefix=oai_dc
ListSetsThis request is used to retrieve the set structure of a repository, useful for selective harvesting. Specific Collection Example: YourSite/oai2?verb=ListRecords&metadataPrefix=oai_dc&set=ir_citationCollection
ResourcesWant to Learn More
About Islandora?
Islandora Online Discussion Forumshttps://groups.google.com/forum/#!forum/islandora
Islandora Documentation https://wiki.duraspace.org/display/ISLANDORA715/Start
discoverygarden Case Studies http://www.discoverygarden.ca/case-studies/
Questions?Contact us at: [email protected]