lo c 2011-05-18

45
gricultural information management standards and services - dr. johannes keizer CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12 Talk at Library of Congress, 2011-05- 18 Dr. Johannes Keizer Office of Knowledge Exchange, Research and Extension Food and Agriculture Organization of the UN Vocabularies and Linked Open Data

Upload: johannes-keizer

Post on 08-May-2015

393 views

Category:

Documents


2 download

DESCRIPTION

Presentation at Library of Congress

TRANSCRIPT

Page 1: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Talk at Library of Congress, 2011-05-18

Dr. Johannes KeizerOffice of Knowledge Exchange, Research and ExtensionFood and Agriculture Organization of the UN

Vocabularies and Linked Open Data

Page 2: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

We will promote research for food and agriculture, including research to

adapt to, and mitigate climate change, and access to research results and

technologies at national, regional and international levels.

We will reinvigorate national research systems and will share information

and best practices. We will improve access to knowledge.

world food summit 2009

Page 3: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Information Infrastructure for Agricultural Research and Innovation

Page 4: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Vocabularies and Linked Open Data

Page 5: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 6: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

http://aims.fao.org/aos/agrovoc/c_7825

Page 7: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

Page 8: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

Page 9: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

http://aims.fao.org/aos/agrovoc/c_7825

http://eurovoc.europa.eu/218754

http://agclass.nal.usda.gov/nalt/2011.xml#1780

Page 10: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

http://aims.fao.org/aos/agrovoc/c_7825

AGROVOC

http://aims.fao.org/aos/agrovoc/c_12332 owl:sameAs http://eurovoc.europa.eu/219871 skos: exact match UNBIS: Toxic Substances

http://agris.fao.org/agris-search/search/display.do?f=1996/TR/TR96001.xml;TR9600026

Linking data through common URIs

http://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:202:0011:0015:EN:PDF

http://unbisnet.un.org:8080/ipac20/ipac.jsp?session=128F308557F34.283092&profile=bib&uri=full=3100001~!685149~!1&ri=1&aspect=subtab124&menu=search&source=~!horizon

http://eurovoc.europa.eu/218754

Eurovoc TOXIC SUBSTANCES

UNBIS

http://agclass.nal.usda.gov/nalt/2011.xml#1780

NALT

http://www.agnic.org/search/CAT85822953

Page 11: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

If all institutions, which publish about toxic wastes would:- - Index their publications with URIs

from AGROVOC,GEMET, NALT, LCSH or EUROVOC

- (many do – low hanging fruit!)- - Publish their metadata as LOD- (quite easy to do, bibData map well to

RDF

ThenEveryone who knows to write Sparql Qeries could get all these publications with one shot for a new website on toxic wastes

Page 12: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Vocabularies and LOD

Simply publishing your data as RDF does not link them to other data sets

Creating this links by humans is interesting in detail, but unrealistic as mass processing

Linking 2 standard vocabularies can link 200 datasets which use these standard vocabularies

Page 13: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

…just out of the pipele

-----Original Message-----From: Antoine Isaac [mailto:[email protected]] Sent: Thursday, May 12, 2011 7:19 PMTo: UDC SummaryCc: Anibaldi, Stefano (OEKC); Dan BrickleySubject: Re: AGRIS Journals and UDC URIs/ checking

Aida, Stefano,…..Of course the first hints re. URIs is to keep it short. www.udcc.org/udcclass_631.1/50900 seems a bit long.Then it might be interesting to use "class" somewhere, if you're going to release entities with a different type one day.

On the most difficult issue, class numbers vs. DB identifiers. Probably you will have to create both, if you want to intercept these cases where concepts have changed class number.…………

Page 14: Lo c 2011-05-18

AGROVOC

Page 15: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AGROVOC A multilingual agricultural vocabulary

organized as concept scheme in 20 languages

Covers agriculture, forestry, fisheries and related themes (food security, land use, environment, etc.)

Organized in sub-vocabularies, e.g. chemicals, fisheries terms, scientific/common names of organisms

Maintained by a global community (e.g. librarians, terminologists, information managers) using VocBench

Page 16: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AGROVOC - Statistics

Total terms 580,239 Concepts ca. 40,000

Top concepts 25

English concepts / terms ca. 32,000 concepts / 40,737 terms

French terms 38,395

Spanish terms 41,745

Terms in Arabic, Chinese, Czech, German, Hindi, Hungarian, Italian, Japanese, Korean, Lao, Persian (Farsi), Polish, Portuguese, Russian, Slovak, Thai

456,952

Page 17: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AGROVOC - Restructuring Goal: Transform AGROVOC from a traditional

thesaurus into a concept scheme with distinction between conceptual level and terminological level

Overall revision done by FAO in collaboration with KSI (Knowledge Sharing and Innovation) team at ICRISAT, Hyderabad, India

Top concepts reduced from 918 to 25

Around 85,000 term relations revised

Non-hierarchical relationships refined by semantic relations

Ca. 4,000 non-preferred terms changed to preferred terms

Page 18: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Top concepts

Page 19: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Relationships (examples)

Page 20: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 21: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 22: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 23: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 24: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AGROVOC

EUROVOC

RAMEAU

LCSH

NALT

GEMET

STW

18000 outlinks

2000 inlinks

Thesauri into the AGROVOC LOD Cloud

Page 25: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AGROVOC LOD-inlinks

Trusted Links from

AGROVOC

Page 26: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AGROVOC Links after 3 weeks LOD

Outlinks:

GEMET-AGROVOC 1,198

RAMEAU-AGROVOC  :700

Total Outlinks: 1898

Inlinks:

AGROVOC-EUROVOC:1,297

AGROVOC-GEMET:1,198

AGROVOC-LCSH :1,093

AGROVOC-NAL: 13,390

AGROVOC-STW:1136

AGROVOC-RAMEAU:700

Total Inlinks:18,814

Page 27: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Europe:(It is better to use this example during the presentation)http://aims.fao.org/aos/agrovoc/c_2724

From the Top concept:

Ref:  http://aims.fao.org/aos/agrovoc/c_7644

Vocbench (Production)

Ref:   http://agrovoc.mimos.my/vocbenchv1.1i/

VocBench(Sandbox)

Ref:http://agrovoc.mimos.my/vocbenchv1.1i/

Page 28: Lo c 2011-05-18

The VocBench

Page 29: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

The VocBench VocBench

concepts and entities triples

Page 30: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

VocBench Features

Domain independent

Structure independent (i.e. thesauri, Glossaries, etc)

Supports RDF (SKOS, SKOS-XL), OWL

Supports collaborative editing

Supports editorial workflow, with user roles

Simple and advanced search

Supports data export: SKOS, Relational format (MySQL)

Page 31: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 32: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 33: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

LODE - BD

Page 34: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

..what it means

Guidelines how to produce data that easily can be transformed into LOD

Page 35: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

LODE-BD Recommendations 1.1.

What entities and relationships?

What properties?

Page 36: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

And….

What metadata terms?

dcdcterms

biboaglsags

eprintmarcrel

What metadata standards?

Page 37: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Decision TreesSubject

Page 38: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

AgroTaggerAndOpenCalais

Page 39: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 40: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

• Does Concept identification in unstructured texts

• Uses Agrovoc as a controlled vocabulary

• Prototype under testing with excellent results (entire repository of ICARDA indexed)

• Will produce in future Structured RDF files that can be used to link data like “open Calais”

AgroTagger

Page 41: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 42: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 43: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

Page 44: Lo c 2011-05-18

agricultural information management standards and services - dr. johannes keizer

CIARD – Linked Open Data Infrastructurevvvvvvvvv, May 12

RING

routemap to information nodes

and gateways

ToolsLOD

enabled software

VocBench

concepts and entities reference triples

LOD Generator

triplifier, concept and entity

identifier

Data Services

Webservices + APIs to triple stores

Cloud

storage for RDF data triples

agINFRA - the elements

Page 45: Lo c 2011-05-18

Thank You!

http://www.ciard.nethttp://ring.ciard.nethttp://aims.fao.orghttp://agris.fao.org