dublin core metadata schema registry at tsukuba shigeo sugimoto, mitsuharu nagamori graduate school...

42
Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and M edia Studies University of Tsukuba Tsukuba, Ibaraki, Japan {sugimoto, nagamori} @ slis.tsukuba.ac.jp

Upload: dwight-harrell

Post on 04-Jan-2016

215 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Dublin Core Metadata Schema Registry at Tsukuba

Shigeo Sugimoto, Mitsuharu NagamoriGraduate School of Library, Information and Media Studies

University of TsukubaTsukuba, Ibaraki, Japan

{sugimoto, nagamori}   @   slis.tsukuba.ac.jp

Page 2: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Agenda/Outline

• Demonstration

• History of DCMI Registry Development

• DCMI Registry at Tsukuba

• Experimental Studies at Tsukuba

Page 3: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

DCMI Metadata Schema Registry

• Provides collection of DCMI terms– DCMI Terms: http://

dublincore.org/documents/dcmi-terms/

• Demonstration– OCLC: http://dublincore.org/registry/– Tsukuba: http://juror.slis.tsukuba.ac.jp/dcregistry

/

Page 4: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

History

• Project started at AIT, Thailand under supervision of Thomas Baker in 1998

• Mitsuharu Nagamori and Shigeo Sugimoto participated in the project from its early phase for the interface in multiple languages

• DCMI Registry WG since 1999– WG home: http://dublincore.org/groups/registry/– Current Version: http://dublincore.org/dcregistry/– The DCMI registry provides definitions of DCMI Term

s

Page 5: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Registry Chronology

1998

Xu Bo and Tom Baker (AIT, Thailand)

“DC in Multiple Languages”

Xu Bo and Tom Baker (AIT, Thailand)

“DC in Multiple Languages”

1999 2000~ 2002 2003

1st Registry Prototype was developed (August)

1st Registry Prototype was developed (August)

Internationalization and Research Requirements

Internationalization and Research Requirements

1st DCMI Registry released (phase 1)

1st DCMI Registry released (phase 1)

2004

Implemented Web Service Interface

(phase 2)

Implemented Web Service Interface

(phase 2)

Registry WG established (December)

Registry WG established (December)

Page 6: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Some Basic Issues in the DCMI Registry- Multi-lingual Issues -

• Translation of the DCMI terms was recognized as an important issue for DCMI from very early stage of the development of DCMI terms.

• Translations of DCMI terms have been done by volunteers– A few national standards– A few official local affiliates

• Multi-linguality issues in the DCMI Registry– Browse– Search– Translation Support

Page 7: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Some Basic Issues in the DCMI Registry- Underlying Model -

• DCMI Schema Registry stores the DCMI terms– Elements– Refined Elements– Encoding Schemes– DCMI Type Vocabulary terms

• The data model for the terms is based on RDF Schema

Page 8: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Some Basic Issues in the DCMI Registry- Underlying Model -

• DCMI Abstract Model– http://dublincore.org/documents/abstract-model/– A Reference Model of DC Metadata– Independent of any particular encoding syntax

• Two parts of the abstract model– DCMI Resource Model– DCMI Description Model

• The abstract model helps users understand the underlying model for the DCMI terms

Page 9: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Some Statistics of the Registry

• DCMI Schemas– Dublin Core Element Set– Dublin Core Qualifiers– DCMI Type Vocabulary

• Translations– Over 25 languages

• Open Source Software

Page 10: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Some Statistics of the Registry

• Main Registry and Collaborating Registries– Main Registry at OCLC, Dublin, Ohio, USA– Collaborating Registries at Goettingen, Tsukuba

• Registries– OCLC: http://dublincore.org/registry/– Tsukuba: http://juror.slis.tsukuba.ac.jp/dcregistry/ – Goettingen (Univ. Goettingen): http://dbclear.sub.uni-

goettingen.de/dcregistry/index.html– Beijing (Library of Chinese Academy of Sciences):

http://159.226.100.140:8080/dcregistry/

Page 11: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Browse the Registry By Classification typeBrowse the Registry By Classification type

• Elements• Element Refinements• Controlled Vocabulary Terms• Vocabulary and Encoding Schemes• Summary of All Terms

• Elements• Element Refinements• Controlled Vocabulary Terms• Vocabulary and Encoding Schemes• Summary of All Terms

Page 12: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Detailed view of the termDetailed view of the term

Page 13: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Search the Registry for TermsSearch the Registry for Terms

Page 14: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Multiple LanguagesMultiple Languages

Page 15: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Resource AdministrationResource Administration

Page 16: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Registry Model

DCMIRegistry

VocabularyManagement

Tool

Web Services

Applications

Metadata Schema (RDF)

DCMI Usage Board

People seeking information about termsFeedback

Page 17: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

I18N Issue: Early Efforts

• Many volunteers create translations, but...

• Difficulties with the range of file formats and character encodings

Volunteers

Translation

HTML, Plain Text,MS Word, etc…

Convert toUTF-8 Registry

Format toRDF

EUC-JP, TIS-630, GB2312, etc…

Administrators

Page 18: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Translation Tool• A simple support tool for volunteer

translators

Volunteers

Registry

TranslationTool

Make translations through Web-based user interfaces.

TranslationDB (RDF)

Administrators

Page 19: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

List of TermsList of Terms

Target LanguageTarget Language

Page 20: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Translation Tool

Page 21: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Metadata Schema Registry at Tsukuba

• Basic function + extended features• Basic function: DCMI Terms

– Basic functions

• Extension of Schema Collection to non-DC Schemas– metadata schemas collected on the Web and

some other resources– Experiments for cross-schema applications

• Extension to Application Profiles

Page 22: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Extending Schema Collection

• Extended the Registry DB to non-DCMI terms collected on the Web and other resources

• Student Monitors for the Tsukuba registry

• Functional extensions– A list of schemas in the ‘Browse’ page, and– A list of defined elements and classes that

relate to each schema.

Page 23: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

拡張機能1• スキーマ一覧表示の画面

Added a List of Schemas in the ‘Browse’ pageAdded a List of Schemas in the ‘Browse’ page

Page 24: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Added a List of Related TermsAdded a List of Related Terms

Page 25: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Lessons Learned

• Need to add relationships between term and schema, or term and term– isDefinedBy, subPropertyOf, dc:type, etc…

• Improve user interfaces displaying terms – Graphical ?

• Support various RDF formats– OWL, FOAF, SKOS, Application Profiles, etc…

• Vocabulary to specify types of schemas

Page 26: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Add Relationships

Schema X

Term A Term B

rdfs:isDefinedBy

Date

DC 1.1

rdfs:isDefinedBy

rdfs:subPropertyOf

Page 27: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Improvement of User Interfaces

• Better UI for the schema collection – 50+ schemas and 2500+ terms– Difficulty to display all of the schemas and the

terms in a table– Need Graphical UI (e.g., IsaViz)

• Need more feedback from users.

Page 28: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

×

Support various RDF formats

• Many schemas use OWL, FOAF, SKOS, etc…

<rdf:Description rdf:about=“http://www.slis…./”> <dc:title>Tsukuba Registry</dc:title> <dc:creator>

<foaf:Person> <foaf:name>Mitsuharu</foaf:name> <foaf:mbox>nagamori@slis…</foaf:mbox></foaf:Person>

</dc:creator></rdf:Description>

Page 29: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Vocabulary to classify and describe schemas

• Need a good scheme to browse and search metadata schemas– Classification of schemas and terms

• A vocabulary to specify the types of schemas is required– e.g., “element set”, “subject classification

vocabulary”, “type classification vocabulary”, “application profile”, etc.

Page 30: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Application Profiles

• An application profile is a schema defined for a specific application domain based on one or more element sets– DC Library Application Profile uses DCMI

terms and MODS terms– Simple Dublin Core is an application profile

Page 31: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Application Profile

termA: Mandatory

termC: Optional

Repeatable

termX: Mandatory

Repeatable

termZ: Mandatory

if applicable

termX termY termZ

Metadata Vocabulary 2(Metadata Element Set)

Metadata Vocabulary 1(Metadata Element Set)

termA termB termC

A structural view of application profile

Page 32: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Abstract Syntax and Concrete Syntax

termA: Mandatory

termC: Optional

Repeatable

termX: Mandatory

Repeatable

termZ: Mandatory

if applicable

Application Profile: Terms used in an application and structural constraints

termX termY termZ

Metadata Vocabulary 2(Metadata Element Set)

Metadata Vocabulary 1(Metadata Element Set)

termA termB termC

<rdf:Description about=”foo”> <mv1:A>an example.</mv1:A> <mv2:X>bar</mv2:X> ...

<meta name=”mv1:A” content=”an example”><meta name=”mv2:X” content=”bar”>

...

Description in a syntax defined in an application

Application Profile

Page 33: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

A Layered Modelsplit semantics and syntax into layers

Layered Model of Metadata Schema

Layer 1Semantics

Layer 2Abstract Syntax

Layer 3Concrete Syntax

DCMES (Elements and

Qualifiers)IEEE-LOM ULIS element

extension

DCMI Library Application

Profile

Open Archives Initiative Schema

IPL Asia Schema

ULIS Core Schema

XML implementation

in an XML Schema

An Oracle implementation

RDF implementation

Page 34: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Layered Modeland Metadata Schema Registry

Layer 1

Layer 2

Layer 3

DCMI Registry

DCMES Terms

(Elements and Qualifiers)

ULIS element extension

ULIS-DL Subject

Vocabulary

Tsukuba Registry

Application Profile A Application Profile B

Page 35: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Layered Modeland Metadata Schema Registry

Layer 1

Layer 2

Layer 3

DCMI Registry

DCMES Terms

(Elements and Qualifiers)

ULIS element extension

ULIS-DL Subject

Vocabulary

Tsukuba Registry

XML Schema for A XML Schema for B

Page 36: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Some Projects at Tsukuba: Registry-based Software Tools

• Metadata Schema Driven Software Tool Generator– A software generator which produces simple software

tools, eg. editors, search interfaces.

• Metadata Schema Search – an Application Profile oriented tool– An on-going project based on the experimental extens

ion of the DCMI registry– Help metadata schema designers find metadata sche

mas - element sets, terms and application profiles

Page 37: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Software Tool Generator

Software Tool Generator

MetadataSchema Registry

Metadata

Software for X:e.g. Editor

Metadata

Software for X:e.g. Editor

Metadata

Software for Y:e.g. Search Tool

Metadata

Software for Y:e.g. Search Tool

Profile XProfile X

Profile YProfile Y

Each metadata software is derived from its application profile.

Page 38: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Software Tool Generator: An Implementation

Metadata Driven Software

ApplicationProfile

Software Tool Generator

MetadataSchema Registry

Metadata Database

Primitive Functions

User Interfaces (XML texts created from A/P)

Page 39: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Application Profile Description Example<grammer> <start><ref name="Metadata"/></start> <define name="Metadata">

<element name="Bibliography information" dc:LanguageSchema="ja-JP"> <element name="title" occurrence="1">

<dc:source rdf:resource="http://purl.org/dc/elements/1.1/title"/> <data type="shortString"><param name="required"/></data> <dc:label priority="registry">Title</dc:label></element><element name="keyword" occurrence="1"> <dc:source rdf:resource="http://purl.org/dc/elements/1.1/subject"/> <data type="shortString“ /> <dc:label priority="registry"> キーワード </dc:label></element><element name="description" occurrence="1"> <dc:source rdf:resource="http://purl.org/dc/elements/1.1/description"/> <data type="longString“ /> <dc:label priority="registry"> 概要 </dc:label></element>

…(snip)…</grammer>

Page 40: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Generated Metadata Software• According to the specified Application Profile• Customizable by XSLT + CSS

Page 41: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Conclusion• Metadata schema registry is an important tool to

enhance interoperability and re-usability of metadata schemas.– Canonical descriptions in human and machine

understandable forms– Internationalization– Cross-domain applications

• “Terms” are the central component for the metadata schema registry. However, “Application Profiles” are also crucial to enhance interoperability and re-usability of metadata and metadata schemas.

Page 42: Dublin Core Metadata Schema Registry at Tsukuba Shigeo Sugimoto, Mitsuharu Nagamori Graduate School of Library, Information and Media Studies University

Thank you!