web archiving collaborations: a presentation for colleagues working in the libraries of the...
DESCRIPTION
These slides were used to support a presentation on web archiving collaborations for colleagues working in the Libraries of the Metropolitan Museum of Art.TRANSCRIPT
![Page 1: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/1.jpg)
Web archiving collabora/ons at Columbia University Libraries
Anna Perricci
Columbia University Libraries
Metropolitan Museum of Art (August 19, 2014)
![Page 2: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/2.jpg)
Web Resources Archiving Collabora/on
Many thanks to the Mellon FoundaFon
Building collaboraFons among • The web archiving community
• Other research libraries • Users and potenFal users of web archives • Website creators
![Page 3: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/3.jpg)
Incen/ves grants to advance web archiving tools
Image source: hNp://imgur.com/gallery/vG7KE48
![Page 4: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/4.jpg)
Incen/ve awards projects
Warcbase: Building a Scalable Web Archiving PlaWorm on HBase and Hadoop. (Jimmy Lin, University of Maryland)
Archiving TransacFons Towards UninterrupFble Web Service (Zhiwu Xie and Edward A. Fox, Virginia Tech University)
![Page 5: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/5.jpg)
Incen/ve awards projects
Visualizing Digital Collections of Web Archives (Michele Weigle, Old Dominion University)
Tools for Managing Seed URLs (Michael Nelson, Old Dominion University)
![Page 6: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/6.jpg)
Incen/ve awards projects
Perma.cc: MiFgaFng the Pervasive Problem of Link Rot in Scholarly Works and Preserving Online Content (Kim Dulin, The Harvard Library InnovaFon Lab)
Free Law Project
Providing free access to primary legal materials, developing legal research tools, and supporFng academic research on legal corpora
![Page 7: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/7.jpg)
Building an efficient and scalable na/onal framework for collec/ng web content
Image source: hNp://imgur.com/gallery/1m5MBKf
![Page 8: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/8.jpg)
Designated space for collabora/ve collec/ng
![Page 9: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/9.jpg)
Collabora/ve Architecture, Urbanism and Sustainability Web Archive (CAUSEWAY)
hNps://archive-‐it.org/collecFons/4638
![Page 10: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/10.jpg)
Collabora/on with music librarians
![Page 11: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/11.jpg)
Contemporary composers—the perfect storm?
![Page 12: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/12.jpg)
Contemporary Composers Web Archive
Selectors
• Borrow Direct Music Librarians Group: music librarians at Brown, Columbia, Cornell, Dartmouth, Harvard, Johns Hopkins, Princeton, and Yale universiFes, MIT, and the universiFes of Chicago and Pennsylvania
Cataloging exper/se
• Russell MerriN (cataloger specializing in music resources) • Kate Harcourt (Director of Original and Special Materials Cataloging)
• Alex Thurman (Web Resources CollecFon Coordinator)
![Page 13: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/13.jpg)
Contemporary Composers Web Archive hNps://archive-‐it.org/collecFons/4019
![Page 14: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/14.jpg)
Quality Assurance
![Page 15: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/15.jpg)
Crea/ng MARC records for web archives
• CreaFng MARC records for archived websites is standard pracFce at CUL – MARC records make web archives discoverable in CLIO (Columbia Libraries InformaFon Online)
• CollecFon level and seed level records
• Will use Archive-‐It interface to make Dublin Core records
![Page 16: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/16.jpg)
Patron view of record in CLIO
![Page 17: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/17.jpg)
Cataloger’s view of record in CLIO
![Page 18: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/18.jpg)
An/cipa/ng wider use of MARC records
• Records have been released to WorldCat
• Collaborators on cataloging were aNenFve to which fields will ordinarily be stripped out when a MARC record is imported to another insFtuFon’s OPAC
![Page 19: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/19.jpg)
CCWA MARC records
• So far sample of 10 records has taught us…
• PosiFve feedback from music librarians
• Next we will add another 44 records for the archived sites in CCWA soon
![Page 20: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/20.jpg)
Project tracking
![Page 21: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/21.jpg)
Use cases
![Page 22: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/22.jpg)
Who are the web archives for? Are they being used? Could we encourage more effec/ve use?
![Page 23: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/23.jpg)
hSp://hrwa.cul.columbia.edu
![Page 24: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/24.jpg)
Using the Human Rights Web Archive & learning from human rights scholars’ work (publica/ons, cita/ons)
![Page 25: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/25.jpg)
Cita/ons scraped from ar/cles published in 2010 in select scholarly journals
![Page 26: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/26.jpg)
Isola/ng URLs from list of cita/ons (approximately 10% of cita/ons scraped have URLs in them)
![Page 27: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/27.jpg)
Best Prac/ces for site creators: working with website creators
Image source: hNp://imgur.com/gallery/NWJ12Pl
![Page 28: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/28.jpg)
Open issues: division and maintenance of coopera/ve efforts
(communica/on, so]ware and more)
![Page 29: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/29.jpg)
Process over next 16 months
• Further planning (revision as needed) and user interviews • Maintain group communicaFon
• Ongoing growth (scale of collecFng and distribuFon of effort) • Present shared costs and sustainability models (currently in
development)
• 3-‐5 year plan for Borrow Direct collaboraFons (collecFons strategy, finances, workflows and governance)
• If collaboraFon persists, idenFfy themes for further collecFng
• Catalog resources to high standards • Quality Assurance and ongoing evaluaFon
![Page 30: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/30.jpg)
Web archiving ini/a/ves focusing on art resources
An iniFaFve designed to address the “urgent need to document the dynamic web-‐based versions of aucFon catalogues, catalogues raisonnés, and scholarly research projects, as well as arFst, gallery, and museum websites” (hNp://www.nyarc.org/content/web-‐archiving)
ArFsts Files Special Interest Group
![Page 31: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/31.jpg)
Ques/ons?
Image source: hNp://imgur.com/gallery/qoCqQoh
![Page 32: Web archiving collaborations: a presentation for colleagues working in the Libraries of the Metropolitan Museum of Art](https://reader033.vdocument.in/reader033/viewer/2022060117/558670c2d8b42a2e278b46b0/html5/thumbnails/32.jpg)
Resources that came up in the Q & A
• Internet Archive "Save a Page" Plug-‐In for Chrome hNps://github.com/lintool/chrome-‐archive-‐this-‐page
• SAA Web Archiving Roundtable hNp://webarchivingrt.wordpress.com/