ubc library web archiving 2016
TRANSCRIPT
UBC LIBRARY WEB ARCHIVING
Presentation to UBC Library Community
L A R I S S A R I N G H A M , L I B R A R I A N , D I G I TA L P R O J E C T SC A R O L I N A R O M A N A M I G O , S T U D E N T L I B R A R I A N , D I G I TA L P R O J E C T S
2
TODAY …
1. Why web archiving?2. Benchmarking/research
project3. Web archiving at UBC4. How Archive-it works5. What’s next?
3
ephemerality
WHY WEB ARCHIVING?
4
RESEARCH AND BENCHMARKING
5
• 18 University of Victoria• 17 University of Alberta• 10 University of Toronto
• 6 University of British Columbia• 6 Wilfrid Laurier University• 4 University of Winnipeg• 4 University of Manitoba• 3 University of Waterloo• 3 Simon Fraser University• 1 University of Waterloo & Toronto• 1 University of Saskatchewan• 1 Dalhousie University• 1 Carleton University (MacOdrum Library)
WEB ARCHIVING BENCHMARKING – CANADIAN UNIVERSITIES
NUMBER OF COLLECTIONS PER INSTITUTION ON ARCHIVE-IT
12 Canadian
Universities have Web Archiving
Initiatives.
6
COLLECTION SCOPE – TYPE OF CONTENT
Institution owned/
affiliated website
Subject specific relevant websites
Federal/Local governmental
websites
Local relevant events
Local organization
s
Research projects
Local news
Local heritageInternatio
nalevents
7
COLLECTION SCOPE – REASONS FOR ARCHIVING
Public or scholarly interest
Preserving institution produced content
Historical or geographically
local significance
At risk or to beDecommissio
ned
Supplement existing
collection
Born digital
resource
8
ACCESS TO WEB ARCHIVING COLLECTIONS
WHERE ARETHEY AVAILABLE?
DO THEY HAVE A DEDICATED PAGE?
HOW ARE THEY LINKED?
Usually under digital/archives
or special collections
Large majority has a dedicated
page to Web Archives Initiative
Usually linked to Archive-it
institution page
Under subject guides
Under additional resources
Featured on library home page
LESS OFTEN:
Direct link for live webpages
Restricted access link
Direct link for archived webpages
LESS OFTEN:
9
• Ownership remains with website owner and university has no liability.
• Authorization is granted to educational purposes, since observing copyright restrictions from website owners.
OWNERSHIP, LIABILITY, AUTHORIZATION
• Only notifies owners/asks permission in case of technological protected content.
• Accepts takedown requests.
NOTIFICATION AND TAKEDOWN
POLICIES ADOPTED BY TOP 3 WEB ARCHIVES INITIATIVESAMONG CANADIAN UNIVERSITITES
10
WEB ARCHIVING AT UBC
11
* Pilot * Federal Government Websites• Partnered with HSS Library
First Nations and Indigenous Communities Websites• Partnered with Xwi7wxa Library
2015 Metro Vancouver Transportation and Transit Plebiscite• Partnered with HSS Library
UBC WEB ARCHIVING PROJECTS TO DATE
12
UBC Asian Library Historical Websites• Partnered with Asian Library
UBC Conferences and Events
UBC Community and Partners• Partnered with Faculty of Education, UBC Press
13
HOW ARE WE COLLECTING THE WEB CONTENT?
14
HOW ARCHIVE-IT WORKS
http://mayorscouncil.ca/
15
HOW ARCHIVE-IT WORKS
1st add seeds
2nd add scope rules
16
HOW ARCHIVE-IT WORKS
Crawl in progress
17
HOW ARCHIVE-IT WORKS
Crawl Report
18
HOW ARCHIVE-IT WORKS
QualityAssessment
19
HOW ARCHIVE-IT WORKS
Archived Page on WaybackMachine
20
UBC Archive-it collection homepage
https://archive-it.org/organizations/734
HOW ARCHIVE-IT WORKS
21
NEXT STEPS
22
BC Local Government Websites• Collaborative project with UBC / UVic / SFU
UBC.ca Institutional Website• Partnership with UBC Archives
COMING UP NEXT ……
23
• Metadata enhancement
• Access and discoverability
• Assessment and analytics
• Preservation with Archivematica
…. AND ON THE HORIZON
24
HOW DO I PROPOSE A WEB ARCHIVING PROJECT?
25
1. Research, public or governmental interests relevant for teaching or research
2. Historically or geographically local significance
3. Complementarity to relevant existing collections
4. Content produced by the university or affiliated organizations
WEB ARCHIVING: CONTENT PRIORITIES
26
Risk of disappearance
Originality
Frequency of update
Resource constraints
Duplication
Access
WEB ARCHIVING PROJECT PROPOSALS
27
HOW ARE WEB ARCHIVING PROJECTS STRUCTURED?
Source: Bragg, M., & Hanna, K. (2013). The Web Archiving Life Cycle Model (Publication). Internet Archive.
28
Stakeholder / project partner
• proposes the project• identifies the content• performs final QA check
WEB ARCHIVING PROJECT ROLES
Digital Initiatives
• evaluates project against the policy criteria
• scopes project and assesses for resources
• performs the archiving crawls• performs initial QA checks• creates and applies metadata• makes content available
29
Technical limitations with the Archive-it crawler
SOME THINGS TO KEEP IN MIND ….
including but very much not limited to ….
JavaScriptSilverlightDynamic databasesPassword protected
contentStreaming media
30
Web archives team
SOME THINGS TO KEEP IN MIND ….
[image not found]