our marathon presentation at dh data curation workshop
TRANSCRIPT
![Page 1: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/1.jpg)
Our Marathon: The Boston Bombing Digital Archive
DH Data Curation Workshop
May 1, 2014
facebook.com/OurMarathon www.northeastern.edu/marathon@OurMarathon
![Page 2: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/2.jpg)
TELL A WIDE RANGE OF STORIES
![Page 3: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/3.jpg)
“NO STORY IS TOO SMALL”
![Page 4: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/4.jpg)
BUILD A LASTING COMMUNITY MEMORIAL
![Page 5: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/5.jpg)
PRESERVE THE HISTORICAL RECORD
![Page 6: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/6.jpg)
AUDIENCES
• Regional: Boston, MA residents directly and indirectly affected by these events
• More broadly, a “general” audience of anyone interested in these events
• Researchers and Scholars: interest in preserving items / files and creating / preserving metadata
![Page 7: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/7.jpg)
BUILDING OUR MARATHON
4,700+ items
Boston City Archives material
289 stories from the Globe Lab
307 memes (image macros)
40 oral histories (WBUR)
raw news footage (WCVB-TV)
![Page 8: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/8.jpg)
WBUR ORAL HISTORY PROJECT
![Page 9: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/9.jpg)
BOSTON CITY ARCHIVES COLLECTION
![Page 11: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/11.jpg)
HOW WE’RE USING OMEKA
• Dublin Core Metadata (currently Simple; transitioning to Extended Dublin Core imminently)
• Modified Contribution Plugin: Crowdsourced contributions submit Item Type Metadata
• Geotagging Items
• Tagging Items (Search Functionality / Organization)
![Page 12: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/12.jpg)
KINDS OF ITEMS IN THE ARCHIVE
• “Born Digital” Material (photos, text, memes, screencaps)
• Scanned / Digitized Items (BCA Items, Boston Medical Center Items)
• Modified Items (redacted files, edited audio files)
![Page 13: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/13.jpg)
SOME ITEMS / FILE TYPES IN THE ARCHIVE
• BCA Items (Hi-Res Scans: TIF files; JPEG Copies)
• Web Sites (Archive-It / The Internet Archive)
• Oral History Audio Files: .wav and .mp3
• Crowdsourced contributions: variety
• Social media files: screencaps
![Page 14: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/14.jpg)
![Page 15: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/15.jpg)
![Page 16: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/16.jpg)
![Page 17: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/17.jpg)
Crowdsourcing
Challenges of “Born Digital” Content
• Perceived Value By Contributor• Copyright Issues and Social Media• Preservation Challenges• Metadata Challenges
![Page 18: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/18.jpg)
DUBLIN CORE METADATA FIELDS
• Title
• Description
• Source
• Date
• Rights
• Language
![Page 19: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/19.jpg)
GEOTAGGING
![Page 20: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/20.jpg)
TAGS
![Page 21: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/21.jpg)
LONG-TERM PRESERVATION PLANS
• Northeastern’s Libraries (Archives & Special Collection) is final home of Our Marathon items
• Items Public Now Will Be Public In The Future
• “Planned Obsolescence” (Home Page / Site)
• Five year position (Basic Monitoring of Archive)
• Oral History Audio Files: .wav and .mp3
• Crowdsourced contributions: variety
• Social media files: screencaps
![Page 22: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/22.jpg)
SHORT-TERM CHALLENGES
• What Metadata Cleanup to Do Now (BCA Items, Public Submissions)
• How To Make Content More Accessible (Tags, Maps)
• Social Media Content (Tweets)
![Page 23: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/23.jpg)
SOME LONG-TERM CHALLENGES
• Institutional Memory of Project (Documentation of Methodologies, Meta-Archive)
• When to phase out web site / “Share Your Story” Plugin
• When to make sensitive material public
• Approval Process for Researchers / Scholars
![Page 24: Our Marathon Presentation at DH Data Curation Workshop](https://reader030.vdocument.in/reader030/viewer/2022032700/55d4ce4abb61ebb10b8b4576/html5/thumbnails/24.jpg)