oai-ore/67531/metadc28342/m2/1/high_res… · object reuse and exchange. brought to you by the open...
TRANSCRIPT
OAI-ORE
Object Reuse and Exchange
Brought to you by the Open Archives Initiative
Haven't I heard of OAI before?
Sure, you might know OAI-PMH
Protocol for Metadata Harvesting
http://digital.library.unt.edu/oai/
http://texashistory.unt.edu/oai/
But we are talking about OAI-ORE
(OAI-ORE) defines standards for the description and exchange of aggregations of Web
resources.
Basic use case
We have a digital object
Which has many ways of interacting with it.
You could grab a citation
Or get the metadata
Lets say in Dublin Core format
This object has one manifestation
With 19 file sets
In this object a file set is a page
Which has multiple sizes to view
Here is a big one
And here is a small one
And of course pages of text have OCR
So about those aggregations
It is hard for a machine to know what constitutes a digital object on the Web
Our idea of a digital objectWhich are comprised of manifestations
Which are comprised of file setsWhich are comprised of files
Can't really be guessed by machines
Even if it could guess our structure
What about everyone else's?
OAI-ORE was developed to solve this problem
We have many aggregations in digital libraries
We have partners
Aggregation
We have collections
Aggregation
Partners/Collections have objects
Aggregation
Objects have object level files(thumbnail, small image, citation, metadata)
Aggregation
Our objects have manifestationsMaybe more than one
(image view, pdf file, html files, gis zip)
Aggregation
Those manifestations have file sets(think pages for book objects)
Aggregation
File sets have files(ocr, thumbnail, square, small, medium, xl, tiles)
Aggregation
Aggregations exist outside of the digital library
List of tracks in a playlist
Aggregation
Flickr Set
Aggregation
All of your bookmarks
Aggregation
We get it, there are aggregations
Some aggregations are abstract
Some aggregations are tangible
An aggregation is given a URI just like anything else on the Web
Maybe
http://digital.library.unt.edu/ark:/67531/metadc1228/#object
In OAI-ORE aggregations are described using a resource map
Which is a machine readable document with its own URI
Maybe
http://digital.library.unt.edu/ark:/67531/metadc1228.rdf
So
http://digital.library.unt.edu/ark:/67531/metadc1228.rdf
Describes
http://digital.library.unt.edu/ark:/67531/metadc1228/#object
Which is the aggregation for
http://digital.library.unt.edu/ark:/67531/metadc1228/
http://digital.library.unt.edu/ark:/67531/metadc1228.rdf
Points to all the objects manifestation aggregations
http://digital.library.unt.edu/ark:/67531/metadc1228/m1/#manifestationhttp://digital.library.unt.edu/ark:/67531/metadc1228/m2/#manifestationhttp://digital.library.unt.edu/ark:/67531/metadc1228/m3/#manifestation
Each has a resource map
http://digital.library.unt.edu/ark:/67531/metadc1228/m1.rdfhttp://digital.library.unt.edu/ark:/67531/metadc1228/m2.rdfhttp://digital.library.unt.edu/ark:/67531/metadc1228/m3.rdf
http://digital.library.unt.edu/ark:/67531/metadc1228/m1.rdf
Points to all the manifestations file set aggregations
http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1/#filesethttp://digital.library.unt.edu/ark:/67531/metadc1228/m1/2/#filesethttp://digital.library.unt.edu/ark:/67531/metadc1228/m1/3/#fileset
…
Each has a resource map
http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1.rdfhttp://digital.library.unt.edu/ark:/67531/metadc1228/m1/2.rdfhttp://digital.library.unt.edu/ark:/67531/metadc1228/m1/3.rdf
…
http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1.rdf
Points to all of the pieces of the file set
http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1/ocr/http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1/thumbnail/
http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1/small/http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1/sizes/http://digital.library.unt.edu/ark:/67531/metadc1228/m1/1/zoom/
Now an OAI-ORE crawler could harvest an entire object
This is useful for search engines
Digital preservation
Web Harvesting
Self archiving
Sounds great, but what was that rdf thing?
OAI-ORE is a specification built with the Web in mind
Relies on concepts and technologies like
HTTPContent Negotiation
Linked DataCool URLsRDF/XML
ATOM XMLRDFa
All of which could be their own TechTalks
And besides, that's why we have
Brandon, Kurt and Lauren
The takeaway
Questions?