Object Re-Use and Exchange
Mellon Retreat, Nassau Inn, Princeton, NJ, March 29, 2007

Herbert Van de Sompel, Carl Lagoze

The OAI Object Re-Use & Exchange (ORE) Initiative

Herbert Van de Sompel (1) & Carl Lagoze (2)

(1) Research Library, Los Alamos National Laboratory
(2) Information Science, Cornell University

OAI-ORE is supported by the Andrew W. Mellon Foundation, with additional support from the National Science Foundation.

OAI Object Re-Use and Exchange

• OAI-ORE is a new effort conducted under the umbrella of the OAI

• International effort, October 2006 - September 2008:
o Coordinators: Carl Lagoze & Herbert Van de Sompel
o ORE Technical Committee: 13 international members
o ORE Liaison Group: 8 international members
o ORE Advisory Committee: 16 international members
o Representing: scholarly publishers and aggregators, eScience, eHumanities, education, search engines, various repository systems, digital library efforts, related standardization efforts, etc.

• See http://www.openarchives.org/ore/

Compound Information Objects

Units of scholarly communication are compound information objects:

Identified, bounded aggregations of related information units that form a logical whole.

[Figure: compound information objects, each with its own identifier (id)]

Examples of Compound Information Objects

• Scholarly publication with an article and supporting information including dataset, video, etc.

• Digitized book with multiple chapters, each chapter containing multiple scanned pages.

• Archaeological assemblies of images, maps, charts, and find lists.

• An ARTstor image object that is the aggregation of various renderings of the same source image.

• …

Compound Information Objects

Units of scholarly communication are compound information objects:

Components of a compound object may vary according to:

• Semantic type:
o Text
o Still image
o Moving image
o Datasets
o Software
o Bibliographic and other types of metadata
o …

• Media type:
o PDF
o HTML
o JPEG
o MP3
o …

Access Repositories

Compound objects are made accessible by a variety of scholarly repositories:

• Institutional repositories
• Discipline-oriented repositories
• Publisher repositories
• Dataset repositories
• Cultural heritage repositories
• Learning object repositories
• Digitized book and manuscript collections
• Research-group and managed personal (ePortfolio) repositories
• …

Access Repositories

Repositories expose compound objects in manners specific to the repository architecture:

• Interfaces (API & user-oriented)
• Identification schemes
• Representation of compound objects
• Mapping of compound objects and components to the Web

Systems that manage digital objects

• Institutional repositories
• Research-group and managed personal (ePortfolio) repositories
• Discipline-oriented repositories
• Publisher repositories
• Dataset repositories
• Cultural heritage repositories
• Learning object repositories
• Digitized book and manuscript collections

Systems that leverage managed digital objects

• All repositories listed above
• Search engines
• Authoring tools
• Citation management tools
• Collaborative environments
• Social network applications
• Graph analysis tools
• Preservation services
• Workflow tools
• …

OAI-ORE Standards Protocols

Working with the Web architecture

• Whatever we do, it must be congruent with the Web architecture:

o Use existing capabilities where they are appropriate
o Cleanly layer capabilities meeting the needs of our problem space

Observation 1
Components of a compound object must be mapped to resources in order to be reference-able.

Observation 2
In the mapping from components to resources, the boundary of the originating compound object is lost.

Observation 3
Map the compound object to a resource with a representation that formally expresses the boundaries of the object.

Machine readable
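As an illustration of Observation 3, here is a minimal sketch of what a machine-readable boundary description could look like. The JSON layout, the field names, and the example.org URIs are assumptions made for this example only; the slides do not specify a serialization.

```python
import json


def describe_compound_object(object_uri, components):
    """Return a JSON string listing the resources that make up one compound object.

    The layout below is hypothetical, not a format defined by OAI-ORE.
    """
    return json.dumps(
        {
            "compound_object": object_uri,
            "components": [
                {"uri": uri, "semantic_type": semantic_type, "media_type": media_type}
                for uri, semantic_type, media_type in components
            ],
        },
        indent=2,
    )


# Placeholder URIs: an article and its dataset forming one logical whole.
doc = describe_compound_object(
    "http://repo.example.org/object/42",
    [
        ("http://repo.example.org/object/42/article.pdf", "text", "application/pdf"),
        ("http://repo.example.org/object/42/data.csv", "dataset", "text/csv"),
    ],
)
print(doc)
```

Anything listed under "components" is inside the boundary; any other resource on the Web is outside it.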

Observation 4
Allow for discovery of that representation (and hence of the compound object) by Web applications.

HTTP LINK HEADER
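A rough sketch of how a Web application might use an HTTP Link header for this kind of discovery. The relation name "compound-object-description" and the URIs are hypothetical placeholders, and the parser is deliberately simplified (it does not handle commas or angle brackets inside quoted parameter values).

```python
import re


def find_link_target(link_header, wanted_rel):
    """Return the first URI in a Link header value whose rel includes wanted_rel."""
    # Each link looks like: <uri>; param=value; param="value"
    for match in re.finditer(r"<([^>]*)>([^<]*)", link_header):
        uri, params = match.group(1), match.group(2)
        rel = re.search(r'\brel\s*=\s*"?([^";,]+)"?', params)
        # A rel parameter may carry several space-separated relation types.
        if rel and wanted_rel in rel.group(1).split():
            return uri
    return None


# Header value as a server might send it alongside a component (assumed example).
header = (
    '<http://repo.example.org/object/42/description>; '
    'rel="compound-object-description"; type="application/json", '
    '<http://repo.example.org/style.css>; rel="stylesheet"'
)
target = find_link_target(header, "compound-object-description")
print(target)
```

An application that fetches any single component can follow the returned URI to the representation describing the whole compound object.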

Observation 4 bis
Allow for discovery of that representation (and hence of the compound object) by Web applications.

Observation 5
This approach reveals compound objects in the Web graph.

OAI Object Re-Use and Exchange

• A core goal of OAI-ORE is to develop standardized, interoperable, and machine-readable mechanisms by which individual repositories can map and thereby expose compound objects to the Web.

• These mechanisms will allow Web applications to reconstruct:

o The boundaries of compound objects
o The relationships among their internal components
o Their relationship to other resources on the Web
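To make the three reconstruction goals concrete, here is a small sketch under stated assumptions: the application already knows which URIs belong to a compound object and has a list of links from the Web graph. The function name, the data, and the URIs are all invented for illustration.

```python
def partition_links(members, links):
    """Split links into those inside the object's boundary and those crossing it."""
    members = set(members)
    internal, external = [], []
    for source, target in links:
        if source in members and target in members:
            internal.append((source, target))  # relationship among components
        elif source in members or target in members:
            external.append((source, target))  # relationship to the rest of the Web
    return internal, external


# Boundary of one compound object (placeholder URIs).
members = [
    "http://repo.example.org/object/42/article.pdf",
    "http://repo.example.org/object/42/data.csv",
]
# A fragment of the Web graph as (source, target) pairs.
links = [
    ("http://repo.example.org/object/42/article.pdf", "http://repo.example.org/object/42/data.csv"),
    ("http://blog.example.com/post", "http://repo.example.org/object/42/article.pdf"),
    ("http://blog.example.com/post", "http://other.example.net/page"),
]
internal, external = partition_links(members, links)
print(len(internal), len(external))
```

Without the declared boundary, all three links would look alike; with it, the first is recognizably internal and the second connects the object to another Web resource.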

OAI Object Re-Use and Exchange

• Develop, identify, and profile extensible standards and protocols to allow repositories, agents, and services to interoperate in the context of use and reuse of compound digital objects beyond the boundaries of the holding repositories.

• Aim for more effective and consistent ways:
o to facilitate discovery of these objects,
o to reference (link to) these objects (and parts thereof),
o to obtain a variety of disseminations of these objects,
o to aggregate and disaggregate these objects,
o to enable processing of these objects by automated agents.

Questions

Compound object from aDORe repository

DC component of compound object

MPEG-21 DIDL component of compound object

Zotero discovers a pointer to a Canonical Representation of the compound object

Zotero parses the Canonical Representation and lets the user select components

Zotero now holds a derived compound object with multiple components

… a DIDL component

… a BIOSIS XML component

… a DC component
