Rhizomer Semantic Content Management System Roberto García, Juan Manuel Gimeno, Ferran Perdrix*, Rosa Gil, Marta Oliva HCI and Data Integration Research Group Universitat de Lleida, Spain *Segre Media, Spain


Page 1: PowerPoint Presentation

Rhizomer Semantic Content Management System

Roberto García, Juan Manuel Gimeno, Ferran Perdrix*, Rosa Gil, Marta Oliva

HCI and Data Integration Research Group

Universitat de Lleida, Spain

*Segre Media, Spain

Page 2: PowerPoint Presentation

Contents

• Introduction

• The Rhizomer Platform

• Use Case: S5T Project

• Conclusions & Future Work


Page 4: PowerPoint Presentation

Introduction

• Difficult to manage Information Architecture with current CMSs/Wikis
• Use Semantic Web technologies
• Semantic CMSs/Wikis don’t fully exploit them:
  – Most semantic wikis focus on page annotation, limited reasoning, no import...
  – Drupal RDF Module: a Drupal extension, though Drupal 7 based on SWT [1]

[1] Buytaert, D.: DrupalCon Boston 2008: State of Drupal presentation. http://buytaert.net/files/state-of-drupal-2008-boston.pdf


Page 6: PowerPoint Presentation

Rhizomer

• Platform for Semantic Content Management and Human-Semantic Web Interaction
  – Based on Web 2.0 technologies
    • Client (AJAX) / Server (REST)
  – Web 3.0 data model
    • Semantic metadata store: RDF(OWL) / SPARQL
  – Generic metadata browsing
    • HTML + Javascript
  – Object-Action Interaction Paradigm
    • Semantic Web Services

Page 7: PowerPoint Presentation

Rhizomer

• Server part:
  – REST interface over the repository
    • Jena, Sesame,…
    • GET: SPARQL end-point
    • PUT/POST/DELETE: RDF metadata management
  – Semantic Web Services
    • OWL-S descriptions
      – Service Profile: Process, hasInput, hasOutput
    • REST adapters for
      – Javascript APIs (e.g. Google Maps), SOAP,…

[Diagram: Rhizomer in front of the Metadata Store. GET: search, PUT: new, POST: edit, DEL: delete]
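The verb mapping on this slide can be sketched in a few lines of Python. This is only an illustration under stated assumptions: `MetadataStore` and `handle` are hypothetical names, the store is an in-memory set of triples rather than Jena or Sesame, and GET is simplified to a lookup by subject instead of a full SPARQL end-point.

```python
class MetadataStore:
    """Hypothetical in-memory stand-in for the RDF metadata store."""

    def __init__(self):
        self.triples = set()  # (subject, predicate, object) tuples

    def describe(self, subject):
        """GET: return every triple about the resource (simplified search)."""
        return {t for t in self.triples if t[0] == subject}

    def put(self, subject, description):
        """PUT: store the description of a new resource."""
        self.triples |= {(subject, p, o) for p, o in description}

    def post(self, subject, description):
        """POST: replace the existing description of a resource."""
        self.delete(subject)
        self.put(subject, description)

    def delete(self, subject):
        """DELETE: remove every triple about the resource."""
        self.triples -= self.describe(subject)


def handle(store, method, subject, description=()):
    """Dispatch an HTTP-style verb to the matching store operation."""
    if method == "GET":
        return store.describe(subject)
    if method == "PUT":
        return store.put(subject, description)
    if method == "POST":
        return store.post(subject, description)
    if method == "DELETE":
        return store.delete(subject)
    raise ValueError(f"unsupported method: {method}")
```

A real deployment would put an HTTP layer in front of `handle` and delegate the four operations to the chosen repository.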

Page 8: PowerPoint Presentation

Rhizomer

• Client part:
  – AJAX (asynchronously send queries and metadata)
  – RDF/XML ─XSLT→ HTML (browsing)
  – “Semantic” forms (edition)
    • RDF/XML ─XSLT→ FORM ─Javascript→ RDF

[Diagram: RDF, HTML, HTML SemFORMS, JavaScript]


Page 10: PowerPoint Presentation

S5T Project

• S5T Spanish research project (TIN2005-06885)
  – Scaleable Semantic personalised Search of Spoken and written contents on the Semantic Web
• Application in the Segre media group
  – Newspaper, TV, radio and web site
  – Internal use (media house): facilitate content production, history tracking, contextualisation,…
  – External use (web users): advanced services (e.g. news personalisation, semantic podcasts,…)

Page 11: PowerPoint Presentation

S5T Project

• Semantic annotation of audiovisual content
  – voice → text (keywords) → semantic annotations
• Rhizomer: user interface for browsing/interacting with content and associated knowledge

[Diagram: Search Engine, Search API, Media, word, Annotation, Ontology, Media Browser, Ontology Browser]

Page 12: PowerPoint Presentation

S5T Project: query

• Queries:
  – Metadata
  – Content (transcript)
    • Words
    • Concepts

[Figure: search form with the fields title, date, genre and transcript, a “Search” button and a “+ Add property” control, submitted with GET. Ontologies shown: MPEG-7 Ontology, NewsML Ontology, S5T Ontology]

s5t:Audio specific properties: P rdfs:domain s5t:Audio, or s5t:Audio owl:Restriction on P (or superclass)

?query=
DESCRIBE ?s
WHERE {
  ?s dc:title ?y0;
     s5t:genre ?y1.
  FILTER (regex(?y0, "…") && ?y1=<…./SubjectsOnto#11000000>)
}

<form method="GET" onsubmit="formToSPARQL">
  <input type="text" name="http://purl.org/dc/elements/1.1/title"/>
  <input type="text" name="http://purl.org/dc/elements/1.1/date"/>
</form>
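The slide names a `formToSPARQL` handler but does not show it. A minimal Python sketch of the same idea follows, assuming each form field name is a property URI and each filled-in value is matched as a regular expression against a literal. The function name `form_to_sparql` and that matching rule are assumptions of this sketch, and the resource-valued genre comparison from the query above is left out.

```python
def form_to_sparql(fields):
    """Build a DESCRIBE query from {property URI: value} form fields.

    Empty fields are skipped. Each remaining value is used as a regex()
    pattern on the property's value; it is not escaped as a regex.
    """
    patterns, filters = [], []
    filled = [(prop, value) for prop, value in fields.items() if value]
    for i, (prop, value) in enumerate(filled):
        var = f"?y{i}"
        patterns.append(f"?s <{prop}> {var} .")
        # Escape backslashes and quotes so the SPARQL string stays valid.
        escaped = value.replace("\\", "\\\\").replace('"', '\\"')
        filters.append(f'regex(str({var}), "{escaped}")')
    query = "DESCRIBE ?s WHERE { " + " ".join(patterns)
    if filters:
        query += " FILTER (" + " && ".join(filters) + ")"
    return query + " }"
```

The resulting string would be URL-encoded and sent as the `query` parameter of the GET request.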

Page 13: PowerPoint Presentation

S5T Project: store

• GET SPARQL endpoint
• Metadata store
  – Jena, Sesame,…
• DESCRIBE queries by default. Facilitate:
  – Browse heterogeneous metadata
  – Without a priori knowledge
• DESCRIBE implementation
  – Concise Bounded Description + rdfs:labels for all selected resources
• Result: RDF/XML
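The DESCRIBE behaviour listed above can be sketched over a plain list of triples. This is an illustrative approximation, not the Jena or Sesame implementation: it assumes blank nodes are strings starting with `_:`, ignores reification, and reads “selected resources” as the objects of the returned triples. The function names are hypothetical.

```python
RDFS_LABEL = "http://www.w3.org/2000/01/rdf-schema#label"


def is_blank(node):
    # Assumption of this sketch: blank nodes are written as "_:name".
    return isinstance(node, str) and node.startswith("_:")


def concise_bounded_description(triples, resource):
    """Triples about `resource`, following blank-node objects recursively."""
    result, pending, seen = set(), [resource], set()
    while pending:
        node = pending.pop()
        if node in seen:
            continue
        seen.add(node)
        for s, p, o in triples:
            if s == node:
                result.add((s, p, o))
                if is_blank(o):
                    pending.append(o)
    return result


def describe(triples, resource):
    """CBD plus the rdfs:label of every resource it points to."""
    cbd = concise_bounded_description(triples, resource)
    mentioned = {o for _, _, o in cbd}
    labels = {(s, p, o) for s, p, o in triples
              if p == RDFS_LABEL and s in mentioned}
    return cbd | labels
```

Adding the labels lets a generic browser show readable names for linked resources without issuing a second query for each one.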

Page 14: PowerPoint Presentation

S5T Project: store

• DESCRIBE http://rhizomik.net/~rosa: CBD(…~rosa) = Fragment A ∪ rdfs:labels

[Figure: RDF graph around http://rhizomik.net/~rosa, with Fragment A marked. Node and edge labels: http://...rdf#type, http://...onto#Professor, http://...rdfs#label “Rosa Gil”, http://...vcard#ADR, http://...vcard#work, http://...vcard#Street “Jaume II, 69”, http://...vcard#Locality “Lleida”, http://...onto#affiliation, http://www.udl.cat, http://...onto#University]

Page 15: PowerPoint Presentation

S5T Project: browse

GET → RDF2HTML

<s5t:Audio rdf:about="http://www.segre.com/audio/20070323-bm.mp3">
  <dc:title>Butlletí Migdia</dc:title>
  <dc:date>2007-03-23</dc:date>
  <s5t:genre rdf:resource="&srs;11000000"/>
  <s5t:transcriptResource>http://www.segre.com/audio/20070323-bm.xml</s5t:transcriptResource>
</s5t:Audio>

http://www.segre.com/audio/20070323-bm.mp3 a Audio
  title: Butlletí Migdia
  date: 2007-03-23
  genre: politics
  transcript: http://www.segre.com/audio/20070323-bm.xml
  Referrers

http://rhizomik.net/.../NewsCodes-SubjectsOnto#11000000 a Subject
  label: politics
  description: Local, regional, national and international exercise of power, or struggle for power, and the relationships between governing bodies and states.
  Referrers
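The browsing step turns a resource description into a linked HTML view. The slides do this with XSLT over RDF/XML; the sketch below swaps in plain Python string building to show only the shape of the output. The function name, the table layout and the rule that HTTP URLs become links are all assumptions of this illustration.

```python
from html import escape


def resource_to_html(uri, rdf_type, properties):
    """Render one resource description as an HTML table.

    `properties` is a list of (label, value) pairs. Values that look like
    HTTP URLs become links so the user can keep browsing.
    """
    rows = []
    for label, value in properties:
        if value.startswith(("http://", "https://")):
            cell = f'<a href="{escape(value, quote=True)}">{escape(value)}</a>'
        else:
            cell = escape(value)
        rows.append(f"<tr><th>{escape(label)}</th><td>{cell}</td></tr>")
    caption = f"<caption>{escape(uri)} a {escape(rdf_type)}</caption>"
    return "<table>" + caption + "".join(rows) + "</table>"
```

Because every linked value leads to another DESCRIBE request, the same renderer can serve any resource type without type-specific templates.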

Page 16: PowerPoint Presentation

S5T Project: edition

[Figure: the resource view offers edit, del and new actions. “edit or new” opens a form for http://www.segre.com/audio/20070323-bm.mp3 with the fields type (http://…/s5t.owl#Audio), title (Butlletí Migdia), date (2007-03-23), genre (http://...SubjectsOnto.owl#11000000) and transcript (http://...com/audio/20070323-bm.xml), plus “Change/Add” and “+ Add property” controls. Requests shown: REST PUT, REST POST, REST DEL]

http://www.segre.com/audio/20070323-bm.mp3 a Audio
  title: Butlletí Migdia
  date: 2007-03-23
  genre: politics
  transcript: http://www.segre.com/audio/20070323-bm.xml
  Referrers

Page 17: PowerPoint Presentation

S5T Project: edition

• Semantic form:

<form method="PUT/POST" onsubmit="formToNTriples">
  <input type="text" name="http://...w3.org/.../rdf-syntax-ns#ID" value="http://www.segre.com/audio/20070323-bm.mp3"/>
  <input type="text" name="http://...w3.org/.../rdf-syntax-ns#type" value="http://rhizomik.net/.../s5t.owl#Audio"/>
  <input type="text" name="http://rhizomik.net/.../s5t.owl#genre" value="http://.../NewsCodes-SubjectsOnto#11000000"/>
  …
</form>

  – PUT: description for new resource
  – POST: replace resource description

[Annotation: the rdf-syntax-ns#ID field gives the Subject; every other field name and value gives a Predicate,Object pair]
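The slide names a `formToNTriples` handler without showing it. A minimal Python sketch of the conversion follows. It assumes the elided `rdf-syntax-ns#` names refer to the standard RDF namespace, that the `ID` field carries the subject, and that values starting with `http://` or `https://` are resources while everything else is a plain literal. Those rules and the name `form_to_ntriples` are assumptions of this sketch.

```python
RDF_NS = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"


def form_to_ntriples(fields):
    """Serialise (name, value) form fields as N-Triples lines.

    The field named rdf:ID supplies the subject. Every other non-empty
    field becomes one triple with the field name as predicate.
    """
    subject = dict(fields)[RDF_NS + "ID"]
    lines = []
    for name, value in fields:
        if name == RDF_NS + "ID" or not value:
            continue
        if value.startswith(("http://", "https://")):
            obj = f"<{value}>"  # treated as a resource
        else:
            # Escape characters that would break an N-Triples literal.
            escaped = (value.replace("\\", "\\\\").replace('"', '\\"')
                       .replace("\n", "\\n").replace("\r", "\\r"))
            obj = f'"{escaped}"'
        lines.append(f"<{subject}> <{name}> {obj} .")
    return "\n".join(lines)
```

The returned text would form the body of the PUT or POST request sent to the server.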

Page 18: PowerPoint Presentation

S5T Project: uploading

• Semantically annotate uploaded content (facilitate metadata creation)
• Content-type specific metadata extraction plugins
  – Audio: speech-to-text
  – Text: keyword detection, disambiguation
    • OpenCalais, Freeling,…: named entities
  – Photo: EXIFtoRDF,…
  – …

Page 19: PowerPoint Presentation

S5T Project: interaction

• Given a resource (object)… determine available services (actions)
• Resource description: type, properties, values,…

http://www.segre.com/audio/20070323-bm.mp3 a Audio
  title: Butlletí Migdia
  date: 2007-03-23
  genre: politics
  transcript: http://www.segre.com/audio/20070323-bm.xml
  Referrers

Page 20: PowerPoint Presentation

S5T Project: interaction

• Services descriptions:
  – ID: Service Adapter URL
  – hasInput: an OWL Class
    • Service applicable to any resource classified in this class
  – hasOutput: output content type
    • text/html, application/rdf+xml,…

http://rhizomik.net/services/play a Process
  label: Play
  hasInput: http://rhizomik.net/.../s5t.owl#Audio
  hasOutput: text/html
  Referrers

Page 21: PowerPoint Presentation

S5T Project: interaction

• For each resource:
  – Processes s.t. resource rdf:type hasInput Class
    • Call: REST PUT/POST RDF resource
    • Adapter: service invocation (Javascript, SOAP,…)

http://www.segre.com/audio/20070323-bm.mp3 a Audio
  title: Butlletí Migdia
  date: 2007-03-23
  genre: politics
  transcript: http://www.segre.com/audio/20070323-bm.xml
  Play Referrers
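The selection rule above (offer every process whose hasInput class the resource belongs to) can be sketched as a lookup over the resource's types. In this illustration the `superclasses` map is a hypothetical stand-in for the class reasoning a real store would perform, and services are reduced to a label plus an input class.

```python
def available_services(resource_types, services, superclasses=None):
    """Return labels of services whose hasInput class the resource belongs to.

    `resource_types` are the resource's rdf:type values, `services` maps a
    service label to its hasInput class, and `superclasses` maps a class
    to its direct superclasses.
    """
    superclasses = superclasses or {}
    closure, pending = set(), list(resource_types)
    while pending:
        cls = pending.pop()
        if cls not in closure:
            closure.add(cls)
            # Walk up the hierarchy so inherited services are found too.
            pending.extend(superclasses.get(cls, ()))
    return sorted(label for label, has_input in services.items()
                  if has_input in closure)
```

For the audio resource shown here, a service registered with the Audio class as input would be returned, which matches the “Play” action appearing next to “Referrers”.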

Page 22: PowerPoint Presentation

S5T Project: interaction

Butlletí Migdia (23/03/2007)

The mobilisation against transgenic crops and in support of Josep Pàmies has also served to introduce other demands. In this case, some of the groups fighting these crops are asking for the Universitat de Lleida to be given a specialisation in Organic Farming. They say these would be pioneering studies that would help the institution regain prestige.

• Search Keyword: describe resources whose transcript contains keyword…
• Browse Term: describe the concept annotating the keyword

http://www.segre.com/audio/20070323-bm.mp3 a Audio
  title: Butlletí Migdia
  date: 2007-03-23
  genre: politics
  transcript: http://www.segre.com/audio/20070323-bm.xml
  Play Referrers

http://www.udl.cat a University
  label: Universitat de Lleida
  lat: 41.37
  long: 0.37
  Map Referrers

Page 23: PowerPoint Presentation

S5T Project: interaction

• Map any resource with lat/long:

<process:Process rdf:about="http://rhizomik.net/services/map">
  <rdfs:label>map</rdfs:label>
  <process:hasInput>
    <owl:Class rdf:ID="GeolocatedEntity">
      <owl:intersectionOf rdf:parseType="Collection">
        <owl:Restriction>
          <owl:onProperty rdf:resource="&pos;lat"/>
          <owl:minCardinality>1</owl:minCardinality>
        </owl:Restriction>
        <owl:Restriction>
          <owl:onProperty rdf:resource="&pos;long"/>
          <owl:minCardinality>1</owl:minCardinality>
        </owl:Restriction>
      </owl:intersectionOf>
    </owl:Class>
  </process:hasInput>
  <process:hasOutput>text/html</process:hasOutput>
</process:Process>

• Service Adapter includes GoogleMap Javascript
• The same for events (Timeline, Calendar,…), etc.
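The GeolocatedEntity class accepts any resource with at least one lat and one long value. A small Python sketch of that test follows. It is a closed-world check over the properties actually present, which is simpler than OWL reasoning, and the names `satisfies_min_cardinality` and `GEOLOCATED` as well as the `pos:` shorthand for the slide's `&pos;` entity are assumptions of this illustration.

```python
def satisfies_min_cardinality(description, required, minimum=1):
    """True if every property in `required` has at least `minimum` values.

    `description` is a list of (property, value) pairs for one resource.
    """
    counts = {}
    for prop, _ in description:
        counts[prop] = counts.get(prop, 0) + 1
    return all(counts.get(prop, 0) >= minimum for prop in required)


# Properties restricted by the GeolocatedEntity class on this slide.
GEOLOCATED = ("pos:lat", "pos:long")
```

A resource passing this check would be offered the map service, whatever its declared type.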


Page 25: PowerPoint Presentation

Conclusions

• Content Management System based on semantic metadata
• Framework for Human-Semantic Web Interaction
• Flexible
  – Projects: S5T, Rhizomik,…
• Extensible
  – Plugins for visualisation, interaction and metadata extraction
• Usable
  – Preliminary usability tests

Page 26: PowerPoint Presentation

Future Work

• Integrate S5T project interface
• Assisted metadata edition
  – Properties and values recommended based on available ontologies
• Create basic services set
  – Map
  – Time line
  – Calendar
  – Plot…
• Complete usability test

Page 27: PowerPoint Presentation

Thank you for your attention

More: http://rhizomik.net

Roberto García

http://rhizomik.net/~roberto