eswc ss 2013 - monday keynote stefan decker: from linked data to networked knowledge

53
© Copyright 2011 Digital Enterprise Research Institute. All rights reserved. Digital Enterprise Research Institute www.deri.ie Enabling Networked Knowledge From Linked Data to Networked Knowledge Stefan Decker [email protected] http://www.StefanDecker.org/

Upload: eswcsummerschool

Post on 28-Jan-2015

107 views

Category:

Education


5 download

DESCRIPTION

 

TRANSCRIPT

Page 1: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

© Copyright 2011 Digital Enterprise Research Institute. All rights reserved.

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

From Linked Data to Networked Knowledge

Stefan Decker

[email protected] http://www.StefanDecker.org/

Page 2: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Cave Drawings 30000 BC

Page 3: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Writing: 3200 BC (Sumerian cuneiform)

Page 4: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Printing Press (Gutenberg 1450)

Page 5: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Photography (Daguerre 1839)

Page 6: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Phonograph (Edison 1877)

Page 7: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Movies (Lumiere 1895)

Page 8: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Bush’s camera on the head

Page 9: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge 9

Memex

“A memex is a device in which an individual stores all his books, records, and communications, and which is mechanized so that it may be consulted with exceeding speed and flexibility”

Posited by Vannevar Bush in “As We May Think” The Atlantic Monthly, July 1945

Page 10: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge 10

Sketch of memex

Page 11: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

oNLine System- NLS, 1968 (Doug Engelbart, SRI)

“By ‘augmenting human intellect’ we mean increasing the capability of a man to approach a complex problem situation, to gain comprehension to suit his particular needs, and to derive solutions to problems.”

The Mouse;

Word Processing;

Data Sharing;

Hypertext;

Page 12: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

ARPANET (1969) (John Postel, David Crocker, Vint Cerf)

Page 13: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Xanadu (Ted Nelson ~1960-???)

13

Page 14: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

World Wide Web (Tim Berners-Lee 1989)

WWW (Tim Berners-Lee) “There was a second part of the dream […] we could then use computers to help us analyse it, make sense of what we re doing, where we individually fit in, and how we can better work together.”

Page 15: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge 15 of 46

Making Progress…

Memex (Vannevar Bush) A memex is “a device in which an individual stores all his books, records, and communications.” Augmenting Human Intellect (Doug Engelbart) “By "augmenting human intellect" we mean increasing the capability of a man to approach a complex problem situation, to gain comprehension to suit his particular needs, and to derive solutions to problems.” WWW (Tim Berners-Lee) “There was a second part of the dream […] we could then use computers to help us analyse it, make sense of what we re doing, where we individually fit in, and how we can better work together.”

Page 16: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

A Network of Data and Knowledge

n  Interconnected n  Universal n  All encompassing

n  assists humans, organisations and systems with problem solving

n  enabling innovation and increased productivity ?

Page 17: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

?

Page 18: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

1.  Scalability: No growth scalability problem (e.g., no back links from HTML pages)

2.  No censorship: no lengthy permission or review process

3.  Positive feedback loop: exploit Metcalf’s Law

What enabled the Web?

Page 19: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Metcalfe's law: The value of a network is proportional to the square of the number of connected members

Page 20: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Metcalfe’s Law 1: Links

Page 21: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Metcalfe’s Law 2

Page 22: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Page 23: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

1. Scalability. No centralized infrastructure (e.g., a central object repository) required. 2. No censorship. It must be possible to publish data without having to ask for prior permission. 3. Positive feedback loop. Capitalize on Metcalfe’s Law.

Requirements for a Data Web

Page 24: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Enabling Metcalfe’s Law

1. Global Object Identity. 2. Composability: The value of data can be increased if it can be combined

with other data. Composability has a number of consequences:

1.  schema-less. ( Combined data originating from difference sources unlikely to conform to a schema)

2.  self-describing

3.  “object centric”. In order to integrate information about different entities data must be related to these entities.

4. graph-based. The composition of multiple object-centric data sources results in a graph in the general case.

Page 25: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Observations

•  The relational model does not fulfill these requirements (not composable, no global object id)

•  XML is not object centric and not composable.

•  Graph based data formats are composable

•  RDF fulfills these requirements.

•  Claim: Any data format that fulfills the requirements is “more or less” isomorphic to RDF.

Page 26: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge 26 of 46

The usual two Ingredients

1.  RDF – Resource Description Framework Graph based Data – nodes and arcs ¨  Identifies objects (URIs)

¨  Interlink information (Relationships)

2.  Vocabularies (Ontologies) ¨  provide shared understanding of a domain

¨  organise knowledge in a machine-comprehensible way

¨  give an exploitable meaning to the data

Page 27: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Linked Open Data cloud - domains

Over 200 open data sets with more than 25 billion facts, interlinked by 400 million typed links, doubling every 10 month!

http://lod-cloud.net/

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.

Media

Government

Geo

Publications

User-generated

Life sciences

Cross-domain

US government UK government

BBC New York Times

LinkedGeoData

27

BestBuy Overstock.com Facebook

Page 28: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Do we really need Ontologies? (as we have them right now)

Provocative?

Page 29: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Anything wrong with this picture?

Page 30: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Wasn’t the Web about sharing?

Page 31: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Could modeling not be easy?

Animal? Mammal? Whale? Class? Instance?

Argh!!!

Page 32: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

n  Differentiation in Classes and Instances is difficult: no single way to abstract the world (observe Upper Ontology wars…..aehm…discussions!)

n  Choices between Instances and Classes done at

design cause usability issues (different treatment in applications) (animal-mammal-whale)

n  Ontologies cement power structures (prevent information sharing)

n  Sharing is only top-down

Issues with Ontologies

Page 33: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

n  Predecessor: Frame Representation Systems n  “Prototypes: KRL, RLL, and JOSIE employ prototype

frames to represent information about a typical instance of a class as opposed to the class itself and as opposed to actual instances of the class.” [Karp, 1993]

n  AFAIK: Formalisation classes as subsets and instances as elements [Hayes, 1979].

n  Formalization of Frame Systems (Description Logic) picked up on [Hayes, 1979] and left out alternatives

How did classes/instances happen?

Page 34: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

n  Stefan Decker, Dieter Fensel, Frank van Harmelen, Ian Horrocks, Sergey Melnik, Michel C. A. Klein, Jeen Broekstra: Knowledge Representation on the Web. Description Logics 2000: 89-97

n  OIL -> DAML+OIL -> OWL -> OWL 2.0

Note: How did DL & Ontologies/Classes happen in the Semantic Web?

Page 35: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge © A. Lienhard, O. Nierstrasz" 3.35

Class- vs. Prototype-based Programming Languages!

Class-based: Prototype-based:

>  No classes, only objects >  Objects define their own

properties and methods >  Objects delegate to their

prototype(s) >  Any object can be the

prototype of another object

Prototype-based languages unify objects and classes

From: A. Lienhard, O. Nierstrasz: Prototype based programming http://www.slidefinder.net/0/03prototypes/03prototypes/10603817

n  Classes share methods and define common properties"

n  Inheritance along class chain"n  Instances have exactly the

properties and behavior defined by their class"

n  Structure typically cannot be changed at runtime"

Page 36: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

> JavaScript, > Self, > NewtonScript, > Omega, Cecil,

Examples

From: A. Lienhard, O. Nierstrasz: Prototype based programming http://www.slidefinder.net/0/03prototypes/03prototypes/10603817

Page 37: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

How it could look like: (Horizontal Information Sharing)

Enabling Metcalfs Law!

Page 38: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

You can have Ontologies as well…

Page 39: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Object foaf:Agent weblog URL twitterID String

Object foaf:Person hasPrototype foaf:Agent fromSource: http://xmlns.com/foaf/spec/#term_Agent knows foaf:Agent

age [0…120] gender {Male, Female} ishuman yes

Object deri:Stefan hasPrototype foaf:person

fromSource: http://xmlns.com/foaf/spec/#term_Person weblog http://blog.stefandecker.org twitterID stefanjdecker age 44

gender Male

FOAF as Prototypes

Page 40: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

n  Delegation vs. Copying (deep, shallow) ¨  When to delegate (freshness)

¨  When to copy (access too slow, unsecure, unreliable)

¨  Caching, update strategies?

n  Determining object boundaries (e.g., via a scope-relationship between URIs). E.g., deri:stefan isScopeOf deri:address. Automatically possible?

n  Object modification – non monotonicity?

n  Referencing an object (URI & Source!) n  Specialization: Exact semantics?

Challenges/Design

Page 41: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

n  Knowledge Representation Constructs (Specialisation)

n  Logic based Formalisation of Prototypes

n  Reasoning (e.g., with Rules)

n  Complexity

n  Large Scale Storage, Querying

n  Collaboration facilities

Research Agenda

Page 42: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Get the car out of the mud…

Page 43: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Linked Open Data cloud - domains

Over 200 open data sets with more than 25 billion facts, interlinked by 400 million typed links, doubling every 10 month!

http://lod-cloud.net/

Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch.

Media

Government

Geo

Publications

User-generated

Life sciences

Cross-domain

US government UK government

BBC New York Times

LinkedGeoData

43

BestBuy Overstock.com Facebook

Page 44: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Problem Statement

Cisplatin

Cisplatin

Cisplatin: ~ €500 per injection High Toxicity Low Quality of Life

Page 45: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Methodology – Combining Open and Private Data

Uniprot

Entrez Gene

BioMart

KEGG Pathway

What are the proteins in the p53 pathway

What is the sequence for those proteins?

What genes encode for those proteins?

How to translate identifiers?

mirBase What is the sequence of those microRNA?

MicroCosm Targets

What microRNA target those genes? p53 17

proteins 7 proteins

155 Genes

817 microRNA

20 000 Genes

Linked Data Filtering

Between 500K and a few billion triples… Schema is at least 10 tables per dataset!

“Classic data integration = 2 months per data source… but we need 100s!”

Page 46: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Networked  Data  Management  

Abstrac4on,  Reasoning,  Analy4cs  

 

Visualisa4on,  Collabora4on,  Exploita4on    

Page 47: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Enabling an iterative exploration process

Transform, Integrate

Gather

Explore Analyse

Understand

Page 48: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Knowledge Pipeline

•  Global Data •  Flexible

Integration Linked Data

•  Liquid Processing •  Flexible •  Extensible

Big Data

HADOOP

•  Visualisation •  Analytics •  Collaboration •  Exploitation

Knowledge Discovery

•  Scalability •  Parallel

Processing •  Fast Access

FGCP

Cloud PaaS

Page 49: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Example

Find all topics in publications relevant to “neoplasm” and clinical trials relevant to the

publications topics!

Page 50: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Workflow

Publications

Find all topics in publications related to Neoplasm (e.g., “Neoplasm Metastasis”)

Topics

Find all Clinical Trials related to the identified topics

Clinical Trials

Page 51: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Page 52: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

Enabling an iterative exploration process

Transform, Integrate

Gather

Explore Analyse

Understand

Page 53: ESWC SS 2013 - Monday Keynote Stefan Decker: From Linked Data to Networked Knowledge

Digital Enterprise Research Institute www.deri.ie

Enabling Networked Knowledge

A Network of Knowledge

n  enabling innovation and increased productivity

n  Interconnected n  Universal n  All encompassing

n  assists humans, organisations and systems with problem solving

Linked Data

• Search • Collaboration • Text Mining

• Science • Commercialization`