1
eXtended Metadata Registry (XMDR)
Ecoterm
Rome, Italy
May 17, 2006
Bruce Bargmeyer, Lawrence Berkeley National Laboratory
University of California
Tel: +1 [email protected]
2
XMDR Project Draws Together
Users
ISO/IEC 11179 Metadata Registries
Terminology
CONCEPT
Referent
Refers To, Symbolizes, Stands For
“Rose”, “ClipArt”
Metadata Registry
Terminology, Thesaurus, Taxonomy
Data Standards
Ontology
Structured Metadata
ISO/IEC JTC 1/SC 32
ISO TC 37 & …
3
XMDR Direction
ISO/IEC JTC 1/SC 32
Users
ISO/IEC 11179 Metadata Registries
Terminology
CONCEPT
Referent
Refers To, Symbolizes, Stands For
“Rose”, “ClipArt”
Metadata Registry
Terminology, Thesaurus, Taxonomy
Data Standards
Ontology
Structured Metadata
ISO TC 37 & …
Data Admin.
4
XMDR Direction
ISO/IEC JTC 1/SC 32
Users
ISO/IEC 11179 Metadata Registries
Terminology
CONCEPT
Referent
Refers To, Symbolizes, Stands For
“Rose”, “ClipArt”
Metadata Registry
Terminology, Thesaurus, Taxonomy
Data Standards
Ontology
Structured Metadata
ISO TC 37 & …
Semantic Computing
5
Metadata Registry Extensions
Register (and manage) any semantics that are useful for managing data.
E.g., this may include registering not only permissible values (concepts) and their definitions, but also the full concept systems in which the permissible values are found.
E.g., one may want to register keywords, thesauri, taxonomies, ontologies, axiomatized ontologies, …
Support traditional data management and data administration
Lay the foundation for semantic computing: Semantics Service Oriented Architecture, Semantic Grids, semantics-based workflows, Semantic Web, …
6
XMDR Project Results
Design for next-generation metadata registries, expressed as a standard: the ISO/IEC 11179 family of standards
XMDR Prototype, open source software
Semantic content in prototype
Demonstrations for healthcare and the environment
Ecoinformatics test bed
Demonstration using water data and concept systems
7
XMDR: Register Any Concept System or Knowledge Organization System (KOS)
Keywords
Glossaries
Gazetteers
Thesauri
Taxonomies
Ontologies
Axiomatized Ontologies
8
XMDR Content List (partial)
NBII Biocomplexity Thesaurus
NCI Thesaurus (National Cancer Institute Thesaurus)
NCI Data Elements (National Cancer Institute Data Standards Registry)
UMLS (non-proprietary portions)
GEMET (General Multilingual Environmental Thesaurus)
(New project to get Chinese terms for the GEMET concepts)
EDR Data Elements (Environmental Data Registry)
USGS Geographic Names Information System (GNIS)
HL7 Terminology, Data Elements
Mouse Anatomy
GO (Gene Ontology)
EPA Web Registry Controlled Vocabulary
BioPAX Ontology
NASA SWEET Ontologies
AGROVOC
…
9
XMDR: Register Ontologies
Concept (label repeated four times in the diagram)
Geographic Area
Geographic Sub-Area
Country
Country Identifier
Country Name
Country Code
Short Name
Long Name
Distributor Country Name
Mailing Address Country Name
ISO 3166 2-Character Code
ISO 3166 3-Character Code
ISO 3166 3-Numeric Code
FIPS Code
10
XMDR: Register Graphs
Graph Taxonomy:
Directed Graph
Directed Acyclic Graph
Graph
Undirected Graph
Bipartite Graph
Partial Order Graph
Faceted Classification
Clique
Partial Order Tree
Tree
Lattice
Ordered Tree
Note: not all bipartite graphs are undirected.
11
Include Concept System Semantics in Metadata Registries
Represent concepts and relationships as nodes and edges in formal graph structures, e.g., “is-a” hierarchies.
Duck is-a Waterfowl
Goose is-a Waterfowl
Canvasback is-a Duck
Bufflehead is-a Duck
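The idea of holding a concept system as nodes and edges can be sketched in a few lines of Python. This is only an illustration, not the XMDR prototype's data model: the function name `build_graph` is hypothetical, and the edge list assumes that Canvasback and Bufflehead sit under Duck in the slide's diagram.

```python
from collections import defaultdict

# Directed "is-a" edges (child -> parent) from the waterfowl example.
# Assumption: Canvasback and Bufflehead are attached to Duck.
IS_A_EDGES = [
    ("Duck", "Waterfowl"),
    ("Goose", "Waterfowl"),
    ("Canvasback", "Duck"),
    ("Bufflehead", "Duck"),
]


def build_graph(edges):
    """Return (nodes, parents, children) for a list of (child, parent) edges."""
    nodes = set()
    parents = defaultdict(set)   # concept -> broader concepts
    children = defaultdict(set)  # concept -> narrower concepts
    for child, parent in edges:
        nodes.update((child, parent))
        parents[child].add(parent)
        children[parent].add(child)
    return nodes, parents, children


nodes, parents, children = build_graph(IS_A_EDGES)
print(sorted(children["Duck"]))  # ['Bufflehead', 'Canvasback']
```

Keeping both directions of each edge makes it cheap to ask for broader concepts and for narrower concepts, the two lookups a thesaurus or taxonomy browser typically needs.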
12
Inference
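Polio is-a Infectious Disease
Smallpox is-a Infectious Disease
Infectious Disease is-a Disease
Diabetes is-a Chronic Disease
Heart disease is-a Chronic Disease
Chronic Disease is-a Disease
Legend: signifies inferred is-a relationship (e.g., Polio is-a Disease)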
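The inferred is-a relationships on this slide follow from transitivity of is-a. A minimal sketch, assuming the asserted edges are Polio and Smallpox under Infectious Disease, Diabetes and Heart disease under Chronic Disease, and both of those under Disease; the function names are hypothetical, and a real registry would more likely delegate this to a reasoner.

```python
# Asserted is-a edges: concept -> set of directly broader concepts.
IS_A = {
    "Polio": {"Infectious Disease"},
    "Smallpox": {"Infectious Disease"},
    "Infectious Disease": {"Disease"},
    "Diabetes": {"Chronic Disease"},
    "Heart disease": {"Chronic Disease"},
    "Chronic Disease": {"Disease"},
}


def ancestors(concept, is_a):
    """All concepts reachable from `concept` by following is-a edges."""
    seen = set()
    stack = list(is_a.get(concept, ()))
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(is_a.get(current, ()))
    return seen


def inferred_edges(is_a):
    """Is-a pairs implied by transitivity but not asserted directly."""
    return {
        (child, ancestor)
        for child in is_a
        for ancestor in ancestors(child, is_a) - is_a[child]
    }


for child, ancestor in sorted(inferred_edges(IS_A)):
    print(f"{child} is-a {ancestor} (inferred)")
```

With the edges above, the inferred pairs are the four leaf diseases linked directly to Disease.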
13
ISO/IEC 11179 Metadata Registries + XMDR
Register and manage semantics that are or can be harmonized and vetted by some Community of Interest (COI)
Provide Semantic Services
E.g., the semantics can be referenced by RDF statements (subjects, predicates, objects)
The semantics can be used for Semantic Web and Semantic Computing
A “vocabulary” that is grounded for some COI
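As a rough illustration of RDF statements that reference registered semantics, the sketch below builds two subject-predicate-object triples and prints them in N-Triples form. The registry base URI, the observation URI, and the helper names are assumptions made up for the example; only the `rdf:type` and `rdfs:subClassOf` URIs are the standard W3C ones.

```python
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    obj: str


# Hypothetical base URI for registered concepts; not a real XMDR endpoint.
REGISTRY = "http://registry.example.org/concept/"
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
RDFS_SUBCLASS_OF = "http://www.w3.org/2000/01/rdf-schema#subClassOf"


def concept_uri(name):
    """Build the (hypothetical) registry URI for a concept name."""
    return REGISTRY + name.replace(" ", "_")


triples = [
    # A statement about the concept system itself.
    Triple(concept_uri("Duck"), RDFS_SUBCLASS_OF, concept_uri("Waterfowl")),
    # A data item (hypothetical URI) typed by a registered concept.
    Triple("http://data.example.org/observation/42", RDF_TYPE, concept_uri("Duck")),
]


def to_ntriples(triple):
    """Serialize one all-URI triple as an N-Triples line."""
    return f"<{triple.subject}> <{triple.predicate}> <{triple.obj}> ."


for t in triples:
    print(to_ntriples(t))
```

Because subjects and objects resolve to registry entries, anyone reading the statements can look up the vetted definition behind each term, which is the sense in which the vocabulary is grounded for a COI.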
14
XMDR Prototype: Modular Architecture -- Initial Implemented Modules
Ontology Editor (Protege)
11179 OWL Ontology
MetadataValidator
AuthenticationService
MappingEngine
RegistryExternalInterface
Legend: Generalization, Composition (tight ownership), Aggregation (loose ownership)
Jena, Xerces
Java
RetrievalIndex
FullTextIndex
Lucene
LogicBasedIndex
Jena, OWI KS, Racer
RegistryStore
WritableRegistryStore
Subversion
15
XMDR Project Collaboration
Collaborative, interagency effort
EPA, USGS, NCI, Mayo Clinic, DOD, LBNL, … & others
Draws on and contributes to interagency/international cooperation on ecoinformatics
Interacts with many organizations around the world through ISO/IEC standards committees
16
Where does this fit with Ecoterm?
Ecoterm organizations as sources of content
Ecoterm organizations as collaborators/testers
Ecoterm organizations as potential users of open source software
Potential collaboration on R&D projects
e.g., under European Commission Framework Program 7
WWW.XMDR.ORG
17
Concept System Store
Users
Concept systems:
Keywords
Controlled Vocabularies
Thesauri
Taxonomies
Ontologies
Axiomatized Ontologies
(Essentially graphs: node-relation-node + axioms)
Metadata Registry
Concept System, Thesaurus, Themes
Data Standards
Ontology, GEMET
Structured Metadata
18
Management of Concept Systems
Metadata Registry
Concept System, Thesaurus, Themes
Data Standards
Ontology, GEMET
Structured Metadata
Users
Concept system:
Registration
Harmonization
Standardization
Acceptance (vetting)
Mapping (correspondences)
19
Life Cycle Management
Metadata Registry
Concept System, Thesaurus, Themes
Data Standards
Ontology, GEMET
Structured Metadata
Users
Life cycle management: data and concept systems (ontologies)
20
Grounding Semantics
Metadata Registry
Concept System, Thesaurus, Themes
Data Standards
Ontology, GEMET
Structured Metadata
Users
Metadata Registries
Semantic Web
RDF Triples: Subject, Verb, Object
Ontologies
22
See
www.xmdr.org