Upload: rosalielack

Post on 12-Apr-2017

Page 1: SAA 2015 Web Archiving Roundtable

WAS to Archive-It Migration

Visualization of linking between websites of different languages, Babel 2012 project

Rosalie Lack

Page 2: SAA 2015 Web Archiving Roundtable

Who?

Page 3: SAA 2015 Web Archiving Roundtable

WAS is …

• A service of the University of California’s California Digital Library
• 2004: Funded by the National Digital Information Infrastructure and Preservation Program (NDIIPP)
• 2006: Launched with partner institutions
• 2009: Transition to subscription model
• 2015: 21 UC institutions; 12 external

Page 4: SAA 2015 Web Archiving Roundtable

Archive-It

A subscription service from the Internet Archive, which allows institutions to build, manage and search their own web archive

Over 300 partner orgs in the U.S. and worldwide

www.archive-it.org

Page 5: SAA 2015 Web Archiving Roundtable

Why?

Page 7: SAA 2015 Web Archiving Roundtable

Lean Books in Wikimedia Commons

Page 9: SAA 2015 Web Archiving Roundtable

How?

Page 11: SAA 2015 Web Archiving Roundtable

CUL-hosted Web Archiving Policies and Practice in the US summit

“… an articulation of a small number of model programs for web archiving, and development of ‘best practices’ for documenting program elements”

May 2012

Attendees: CDL, Columbia, CRL, Cornell, Duke, Georgetown, Frick, Harvard, Indiana, IA, LC, Michigan, North Texas, NYU, Sloan, Stanford, UC Irvine, UT Austin, Virginia Tech

https://webarch.cul.columbia.edu/

Page 12: SAA 2015 Web Archiving Roundtable

CDL-hosted meeting

“… more robust collaboration was desirable in order to collectively address these challenges [research use, intensive resource requirements, the pace of change, fragmented collection development, etc.] and went so far as to brainstorm the benefits and risks of an all-in, formal association”

June 2014

Attendees: CDL, Columbia, George Washington, Harvard, IA, LC, North Texas, Stanford

http://bit.ly/1N1GgGj

Page 13: SAA 2015 Web Archiving Roundtable

Collections/Access/QA Opportunities

• Federation/aggregation/collocation
• Collaboration on collection development
• Crowd sourced selection and QA
• Education and advocacy
• Create and endorse policies, best practices and standards

Page 14: SAA 2015 Web Archiving Roundtable

Supporting research Opportunities

• Outreach
• Pilot projects
• Computational analysis tools
• Tools, tools, tools

Page 15: SAA 2015 Web Archiving Roundtable

Technology Opportunities

• Shared infrastructure/operations
• Data capture tools
• Collaborate on API development
• Preservation solutions
• Tools, tools, tools

Page 16: SAA 2015 Web Archiving Roundtable

Steps toward collaboration: Community Principles for Web Archiving at Scale

“… a lightweight structure by which web archiving institutions can work collectively in order to achieve significant functional goals and operational efficiencies that they are unlikely to achieve individually”

September 2014

CDL, Columbia, George Washington, Harvard, IA, LC, North Texas, Stanford

http://bit.ly/1NoB2l1

Page 17: SAA 2015 Web Archiving Roundtable

“…rely on external service providers whenever possible, and restrict local efforts to areas in which institutions can uniquely add value.”

Page 18: SAA 2015 Web Archiving Roundtable

Value-added services locally or collaboratively developed

Page 19: SAA 2015 Web Archiving Roundtable

Next Steps

• Complete the migration
• Conduct user research into researcher needs
• Define, build and share APIs to meet specialized needs
• Explore feasibility of a national collaborative model for web archiving
• Continue to look for funding opportunities to help facilitate this effort (IMLS 2016)

Page 21: SAA 2015 Web Archiving Roundtable

Questions?

Rosalie Lack

Page 22: SAA 2015 Web Archiving Roundtable

Potential architecture