![Page 1: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/1.jpg)
Copyright 2009 Digital Enterprise Research Institute. All rights reserved.
Digital Enterprise Research Institute www.deri.ie
Re-using Cool URIs:Entity Reconciliation Against LOD Hubs
Fadi Maali, Richard Cyganiak, Vassilios PeristerasLDOW 2011
![Page 2: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/2.jpg)
Digital Enterprise Research Institute www.deri.ie
The Web of Data
![Page 3: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/3.jpg)
Digital Enterprise Research Institute www.deri.ie
The Web of Data
![Page 4: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/4.jpg)
Digital Enterprise Research Institute www.deri.ie
The Web of Data
![Page 5: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/5.jpg)
Digital Enterprise Research Institute www.deri.ie
The Web of Data
![Page 6: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/6.jpg)
Digital Enterprise Research Institute www.deri.ie
The Web of Data
![Page 7: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/7.jpg)
Digital Enterprise Research Institute www.deri.ie
The Web of Data
![Page 8: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/8.jpg)
Digital Enterprise Research Institute www.deri.ie
“LOD Hubs”
LOD Hubs = datasets that attract many inlinks
Music metadata community uses BBC Music identifiers
UK government data community uses Ordnance Survey identifiers
Library data community uses Library of Congress Subject Headings
![Page 9: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/9.jpg)
Digital Enterprise Research Institute www.deri.ie
Standard identifiers
![Page 10: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/10.jpg)
Digital Enterprise Research Institute www.deri.ie
Standard identifiers
![Page 11: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/11.jpg)
Digital Enterprise Research Institute www.deri.ie
For example, government data
![Page 12: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/12.jpg)
Digital Enterprise Research Institute www.deri.ie
For example, government data
5000 datasets about Politicians
Companies
Schools
Administrative areas
Motorways
Government departments
![Page 13: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/13.jpg)
Digital Enterprise Research Institute www.deri.ie
How to Attract Links
![Page 14: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/14.jpg)
Digital Enterprise Research Institute www.deri.ie
How to Attract Links
![Page 15: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/15.jpg)
Digital Enterprise Research Institute www.deri.ie
How to Attract Links
![Page 16: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/16.jpg)
Digital Enterprise Research Institute www.deri.ie
Reconciliation
City State CountryCambridge Massachusetts United States
![Page 17: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/17.jpg)
Digital Enterprise Research Institute www.deri.ie
Reconciliation
City State CountryCambridge Massachusetts United States
label=Cambridge
Cambridge Bay in Canada
![Page 18: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/18.jpg)
Digital Enterprise Research Institute www.deri.ie
Reconciliation
City State CountryCambridge Massachusetts United States
label=Cambridge type = City
Cambridge city in Maryland
Cambridge city in Canada
![Page 19: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/19.jpg)
Digital Enterprise Research Institute www.deri.ie
Reconciliation
City State CountryCambridge Massachusetts United States
label=Cambridge type = CityIn the state of Massachusetts
Cambridge city in Massachusetts
![Page 20: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/20.jpg)
Digital Enterprise Research Institute www.deri.ie
Approaches
SPARQL
SPARQL + full-text search
Silk Server
Semantic Web search engines
![Page 21: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/21.jpg)
Digital Enterprise Research Institute www.deri.ie
SPARQL
Based on regular expressions
Pros Standardised
Zero-effort approach
Cons Slow
Not good at text search
No ranked results
![Page 22: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/22.jpg)
Digital Enterprise Research Institute www.deri.ie
SPARQL + full-text search
Based on full-text extension for SPARQL
Pros More forgiving string matching
Ranking
Zero-effort (depending on your SPARQL store)
Cons Proprietary syntax
![Page 23: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/23.jpg)
Digital Enterprise Research Institute www.deri.ie
Silk Server
Pros Powerful declarative link specification
Variety of similarity functions
Cons Configuration needs to prepared
Silk Server needs to be deployed
Silk Server tightly couples its input and reference data
![Page 24: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/24.jpg)
Digital Enterprise Research Institute www.deri.ie
Semantic Web Search Engine
Based on Sindice API
Pros Zero-effort approach (if your dataset is indexed in Sindice)
Search distributed RDF datasets (e.g. FOAF profiles)
Cons Noisy
![Page 25: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/25.jpg)
Digital Enterprise Research Institute www.deri.ie
Benchmark
Data Interlinking benchmark (part of IM@OAEI2010)
We reconciled DailyMed against: DBpedia SPARQL endpoint (http://dbpedia.org/sparql)
Sider dump file (part of the benchmark)
![Page 26: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/26.jpg)
Digital Enterprise Research Institute www.deri.ie
Results
SPARQL with REGEX is unsuitable (performance)
Except if labels are very consistent
Type restrictions are very effective
Silk has best recall (but requires custom link spec)
Services performance against DBpedia Services performance against Sider RDF dump file
![Page 27: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/27.jpg)
Digital Enterprise Research Institute www.deri.ie
Google Refine + RDF
+ RDF
![Page 28: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/28.jpg)
Digital Enterprise Research Institute www.deri.ie
Example
List of people from DERI
![Page 29: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/29.jpg)
Digital Enterprise Research Institute www.deri.ie
Example
![Page 30: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/30.jpg)
Digital Enterprise Research Institute www.deri.ie
Example
![Page 31: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/31.jpg)
Digital Enterprise Research Institute www.deri.ie
Example
List of people from DERI
Find related RDF datasets
![Page 32: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/32.jpg)
Digital Enterprise Research Institute www.deri.ie
Example
![Page 33: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/33.jpg)
Digital Enterprise Research Institute www.deri.ie
Example
Reconciliation result facets Resource Preview
![Page 34: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/34.jpg)
Digital Enterprise Research Institute www.deri.ie
Reconcile against a SPARQL endpoint
![Page 35: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/35.jpg)
Digital Enterprise Research Institute www.deri.ie
Reconcile against an RDF dump
![Page 36: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/36.jpg)
Digital Enterprise Research Institute www.deri.ie
5-star plan for open data
★ Make your stuff available on the Web
★★ Make it available as structured data(e.g., an Excel sheet instead of image scan of a table)
★★★ Use a non-proprietary format(e.g., a CSV file instead of an Excel sheet)
★★★★ Use linked data format(i.e., URIs to identify things, and RDF to represent data)
★★★★★ Link your data to other people’s data to provide context
![Page 37: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/37.jpg)
Digital Enterprise Research Institute www.deri.ie
Making this easier!
![Page 38: Re-using Cool URIsevents.linkeddata.org/ldow2011/slides/ldow2011-slides... · 2011-04-04 · Re-using Cool URIs: Entity Reconciliation Against LOD Hubs Fadi Maali, Richard Cyganiak,](https://reader033.vdocument.in/reader033/viewer/2022042303/5eced4e7194eb40ca3646196/html5/thumbnails/38.jpg)
Digital Enterprise Research Institute www.deri.ie
RDF Extension for Google Refinehttp://lab.linkeddata.deri.ie/2010/grefine-rdf-extension/
Reconciliation will be in the upcoming next version
Thanks!