link discovery tutorial part v: hands-on

Post on 14-Jan-2017

431 Views

Category:

Science

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Link Discovery TutorialPart V: Hands-On

Axel-Cyrille Ngonga Ngomo(1), Irini Fundulaki(2), Mohamed Ahmed Sherif(1)

(1) Institute for Applied Informatics, Germany(2) FORTH, Greece

October 18th, 2016Kobe, Japan

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 1 / 19

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 2 / 19

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 3 / 19

Test Dataset

Semantic Web Dog Food CorpusData exposed: Metadata (papers, presentations, people) for several semantic webrelated conferences and workshops, including the most recent ISWC, ESWC andWWW events.

Dumps: http://data.semanticweb.org/dumpsEndpoint: http://data.semanticweb.org/sparqlDataHub: https://datahub.io/dataset/semantic-web-dog-foodDownload:http://iswc2016ldtutorial.aksw.org/tutorial-material/

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 4 / 19

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 5 / 19

Task I: DeduplicationExecute the given Configuration File

Decentralized nature of LODData contain duplicatesHow to efficiently detect similar resources?

Task IFind duplicate authors in Semantic Web Dog Food Dataset

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 6 / 19

Task I: DeduplicationExecute the given Configuration File

Decentralized nature of LODData contain duplicatesHow to efficiently detect similar resources?

Task IFind duplicate authors in Semantic Web Dog Food Dataset

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 6 / 19

Limes Configuration FilePREFIXes

<PREFIX ><NAMESPACE >http :// www.w3.org /2000/01/ rdf - schema #</ NAMESPACE ><LABEL >rdfs </ LABEL >

</ PREFIX >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 7 / 19

Limes Configuration FileSOURCE and TARGET datasets

<SOURCE ><ID >SDF1 </ID ><ENDPOINT >semanticDogFood .nt </ ENDPOINT ><VAR >?x</VAR ><PAGESIZE > -1</ PAGESIZE ><RESTRICTION >?x a foaf: Person </ RESTRICTION ><PROPERTY >rdfs: label </ PROPERTY ><TYPE >NT </TYPE >

</ SOURCE >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 8 / 19

Limes Configuration FileMETRIC

<METRIC >Levenshtein (x.rdfs: label , y.rdfs: label )</ METRIC >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 9 / 19

Limes Configuration FileACCEPTANCE and REVIEW

<ACCEPTANCE ><THRESHOLD >0.9 </ THRESHOLD ><FILE >similarAuthor .nt </FILE ><RELATION >ov: similarTo </ RELATION >

</ ACCEPTANCE >

<REVIEW ><THRESHOLD >0.5 </ THRESHOLD ><FILE >similarAuthor_review .nt </FILE ><RELATION >owl: sameAs </ RELATION >

</ REVIEW >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 10 / 19

Limes Configuration FileOUTPUT format

<OUTPUT >TTL </ OUTPUT >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 11 / 19

Task IRun Limes

Run Limesjava -jar limes-core-1.0.0.jar task1.xml

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 12 / 19

Task IUsing Machine Learning

<MLALGORITHM ><NAME >wombat simple </NAME ><TYPE >unsupervised </TYPE >

</ MLALGORITHM >

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 13 / 19

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 14 / 19

Task IIFind similar publications

1 Find publications2 with similar keyword, but do not link any publication to itself

1. Find publications?x a swrc:InProceedings

?y a swrc:InProceedings

2. with similar keyword, but do not link any publication to itselfMINUS(jaccard(x.swrc:listKeyword , y.swrc:listKeyword)|0.3,ExactMatch(x.swrc:listKeyword , y.swrc:listKeyword)|1.0)

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 15 / 19

Task IIFind similar publications

1 Find publications2 with similar keyword, but do not link any publication to itself

1. Find publications?x a swrc:InProceedings

?y a swrc:InProceedings

2. with similar keyword, but do not link any publication to itselfMINUS(jaccard(x.swrc:listKeyword , y.swrc:listKeyword)|0.3,ExactMatch(x.swrc:listKeyword , y.swrc:listKeyword)|1.0)

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 15 / 19

Task IIFind similar publications

1 Find publications2 with similar keyword, but do not link any publication to itself

1. Find publications?x a swrc:InProceedings

?y a swrc:InProceedings

2. with similar keyword, but do not link any publication to itselfMINUS(jaccard(x.swrc:listKeyword , y.swrc:listKeyword)|0.3,ExactMatch(x.swrc:listKeyword , y.swrc:listKeyword)|1.0)

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 15 / 19

Table of Contents

1 Test Dataset

2 Task I: Execute given Limes Configuration

3 Task II: Create your first Limes Configuration

4 Task III: Use Limes GUI

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 16 / 19

Task IIIUse Limes GUI

Run Limes GUIjava -jar limes-core-1.0.0.jar -g

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 17 / 19

Acknowledgment

This work was supported by grants from the EU H2020 Framework Programmeprovided for the project HOBBIT (GA no. 688227).

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 18 / 19

References I

Ngonga Ngomo et al. (InfAI & FORTH) LD Tutorial: Hands-On October 17, 2016 19 / 19

top related