repositories thru the looking glass andy powell eduserv foundation [email protected]
TRANSCRIPT
![Page 2: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/2.jpg)
There are many methods for predicting the future. For example, you can read horoscopes, tea leaves, tarot cards, or crystal balls. Collectively, these methods are known as “nutty methods.” Or you can put well-researched facts into sophisticated computer models, more commonly referred to as “a complete waste of time”.
Dilbert
![Page 3: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/3.jpg)
Either that wallpaper goes or I do.
Oscar Wilde’s last words
![Page 4: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/4.jpg)
Background
Issues
The future
![Page 5: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/5.jpg)
some background…
![Page 6: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/6.jpg)
![Page 7: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/7.jpg)
The DCMI Abstract Model
• a set of rules defining how DC metadata descriptions are constructed
– A description is made up of one or more statements …
– Each statement instantiates a property/value pair and is made up of …
– …
– Each value string is a simple, human-readable string …
– …
• a set of human-readable statements (as per above)
• also formalised using UML
![Page 8: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/8.jpg)
The DCMI Abstract Model
• independent of particular syntaxes
• but descriptions that comply with the model can be encoded using any of the recognised DCMI encodings
– i.e. XHTML, XML and RDF
• simple– largely based on resource, property, value triple
– formally mapped to the RDF model
• highly extensible
![Page 9: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/9.jpg)
The DCMI Abstract Model
record (encoded as HTML, XML or RDF/XML)
description set
description (about a resource (URI))
statement
property (URI) value (URI)
vocabulary encoding scheme (URI)
value string
language(e.g. en-GB)
syntax encodingscheme (URI)
![Page 10: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/10.jpg)
The DCMI Abstract Model
• relationships between the descriptions in a description set and the resources being described made explicit
• oddly, most metadata standardsdo not do this
• DC application profiles nowstart by defining which set ofresources are being described…
• …then assigning the set ofproperties and so on that willbe used to describe them
![Page 11: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/11.jpg)
E.g. an application profile for CDs• start with the set of entities that we want to
describe
• and the key relationships between those entities
• e.g. a CD collection entity/relationship model…
• then define a set of properties for each
collection CD
artistowner
record label
owned by
contained in
created by
released by
![Page 12: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/12.jpg)
JISC Information Environment
![Page 13: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/13.jpg)
![Page 14: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/14.jpg)
are we headingin the rightdirection?
![Page 15: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/15.jpg)
open access
not ‘if’ but ‘when’
![Page 16: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/16.jpg)
3 issues…
![Page 17: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/17.jpg)
issue #1have we got our
terminology right?
![Page 18: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/18.jpg)
a university-based institutional repository is a set of services that a university offers to the members of its community for the management and dissemination of digital materials created by the institution and its community members. It is most essentially an organizational commitment to the stewardship of these digital materials, including long-term preservation where appropriate, as well as organization and access or distribution. … An institutional repository is not simply a fixed set of software and hardware
(Cliff Lynch, 2003)
![Page 19: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/19.jpg)
a focus on ‘making content available on the Web’ would be more intuitive
to researchers
![Page 20: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/20.jpg)
•a focus on ‘content management’ would change our emphasis
•OAI-PMH out…
• search engine optimisation, usability, accessibility, tagging, information architecture, cool URIs in…
![Page 21: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/21.jpg)
issue #2service oriented
vs.resource oriented
![Page 22: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/22.jpg)
REST = Representational State Transfer
an architectural style with a focus on resources, their identifiers (e.g. URIs),
and a simple uniform set of operations that each resource
supports (e.g. GET, PUT, POST, DELETE)
![Page 23: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/23.jpg)
issue #3national vs. global
![Page 24: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/24.jpg)
The impact of Web 2.0
prosumer
remote apps
social
API
diffusion
concentration
![Page 25: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/25.jpg)
![Page 26: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/26.jpg)
thinking about the future…
![Page 27: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/27.jpg)
1. what would a Web 2.0 repository look like?
2. potential impact of the Semantic Web on repositories
![Page 28: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/28.jpg)
1. what would a Web 2.0 repository look like?
2. potential impact of the Semantic Web on repositories
![Page 29: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/29.jpg)
![Page 30: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/30.jpg)
• high-quality browser-based document viewer (not Acrobat!)
• tagging, commentary, more-like-this, favorites, …
• persistent (cool) URIs to content
• ability to form simple social groups
• ability to embed documents in other Web sites
• high visibility to Google
• offer RSS as primary API
• use of Amazon S3 to cope with scalability
![Page 31: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/31.jpg)
a Web 2.0 repository wouldbe a global service
global concentration is anenabler
of social interaction
![Page 32: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/32.jpg)
But…
• they don’t do preservation
• they don’t handle complex workflows
• they don’t expose rich metadata– yes, scholarly communication has some particular
functional requirements which are not met by Google…
– author searching, citation counting, object complexity
– not handled well by the current Web
– how are these requirements best met? thru richer metadata?
![Page 33: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/33.jpg)
1. what would a Web 2.0 repository look like?
2. potential impact of the Semantic Web on repositories
![Page 34: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/34.jpg)
SWAPThe Scholarly
Works ApplicationProfile
![Page 35: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/35.jpg)
A model based on FRBR
• Functional Requirements for Bibliographic Records
• an application model for the entities that bibliographic records are intended to describe
• FRBR models the world using 4 key entities– Work, Expression, Manifestation and Item
![Page 36: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/36.jpg)
FRBR and scholarly works
• FRBR is a useful model in the context of scholarly works (eprints) because it allows us to answer questions like
– what is the URL of the most appropriate copy (an item) of the PDF format (a manifestation) of the pre-print version (an expression) for this eprint (the work)?
– are these two copies related? if so, how?
![Page 37: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/37.jpg)
FRBR for scholarly works
The eprint as a scholarly work
Author’s Original 1.0 Author’s Original 1.1Version of Record
(French)
html pdf
publisher’s copyinstitutional repository
copy
scholarly work(work)
version(expression)
format(manifestation)
copy(item)
… Version of Record(English)
![Page 38: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/38.jpg)
SWAP application profile model
ScholarlyWork
Expression0..∞
isExpressedAs
Manifestation
isManifestedAs
0..∞
Copy
isAvailableAs
0..∞
0..∞
0..∞
isCreatedBy
isPublishedBy
0..∞isEditedBy
0..∞isFundedBy
isSupervisedBy
AffiliatedInstitution
Agent
![Page 39: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/39.jpg)
SWAP and FRBR
ScholarlyWork
Expression0..∞
isExpressedAs
Manifestation
isManifestedAs
0..∞
Copy
isAvailableAs
0..∞
0..∞
0..∞
isCreatedBy
isPublishedBy
0..∞isEditedBy
0..∞isFundedBy
isSupervisedBy
AffiliatedInstitution
Agent
FRBRWork
FRBRExpression
FRBRManifestation
FRBRItem
![Page 40: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/40.jpg)
SWAP and FRBR
ScholarlyWork
Expression0..∞
isExpressedAs
Manifestation
isManifestedAs
0..∞
Copy
isAvailableAs
0..∞
0..∞
0..∞
isCreatedBy
isPublishedBy
0..∞isEditedBy
0..∞isFundedBy
isSupervisedBy
AffiliatedInstitution
Agent
the eprint (an abstract concept)
the ‘version of record’
orthe ‘french
version’or
‘version 2.1’
the PDF format of the version of
record
the publisher’s copy of the
PDF …
the author or the publisher
![Page 41: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/41.jpg)
Attributes
• the application model defines the entities and relationships
• each entity needs to be described using an agreed set of attributes
![Page 42: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/42.jpg)
Example attributes
ScholarlyWork:titlesubjectabstractaffiliated institutionidentifier
ScholarlyWork:titlesubjectabstractaffiliated institutionidentifier
Agent:nametype of agentdate of birthmailboxhomepageidentifier
Agent:nametype of agentdate of birthmailboxhomepageidentifier
Expression:titledate availablestatusversion numberlanguagegenre / typecopyright holderbibliographic citationidentifier
Expression:titledate availablestatusversion numberlanguagegenre / typecopyright holderbibliographic citationidentifier
Manifestation:formatdate modified
Manifestation:formatdate modified Copy:
date availableaccess rightslicenceidentifier
Copy:date availableaccess rightslicenceidentifier
![Page 43: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/43.jpg)
Final thoughts on the model• this model makes it easier to rationalise ‘traditional’ and
‘modern’ citations– traditional citations tend to be made between eprint ‘expressions’
– hypertext links tend to be made between eprint ‘copies’ (or ‘items’ in FRBR terms)
• adopting a simple underlying model now may be expedient in the short term but costly to interoperability in the long term
– the underlying model need to be as complex as it needs to be, but not more so!
• a complex underlying model may be manifest in relatively simple metadata and/or end-user interfaces
• existing eprint systems may well capture this level of detail currently – but use of simple DC stops them exposing it to others!
![Page 44: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/44.jpg)
time to reflect?
![Page 45: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/45.jpg)
Repositories
• what can we learn learn from Web 2.0?– user interface design matters
– global ‘concentration’ is an enabler of social interaction
• simple DC is both too simple and too complex
• richer DC application profiles such as SWAP may be a way forward
• but need to ensure that their use does not over-complicate user interfaces and workflows
![Page 46: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/46.jpg)
Open Access
• in policy terms - talking about the aim, “making content available on the Web” would be much better than the objective, “putting content in repositories”
![Page 47: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/47.jpg)
more generally…
resource orientation
RESTSemantic Web
Web architecture
…are important
digital libraries ignore them at their peril
![Page 48: Repositories thru the looking glass Andy Powell Eduserv Foundation andy.powell@eduserv.org.uk](https://reader035.vdocument.in/reader035/viewer/2022070306/5516c746550346fc4e8b45f4/html5/thumbnails/48.jpg)
Thank you
images by eNil, Poppyseed Bandits, m o d e, striatic, estherase, Gen Kanai, //bwr - Hieronymus Karl Frederick, dullhunk, Today is a good day (all @Flickr), and yours truly