linked data in pharma it univ 2 april 2012

19
Kerstin Forsberg AstraZeneca, R&D, Sweden Clinical Information Strategy [email protected] kerfors on Twitter, Google+, LinkedIn, Slideshare, Blogspot, citulike Linked Data in Pharmaceutical R&D PhD Informatics Seminar, IT University

Upload: kerstin-forsberg

Post on 11-May-2015

1.455 views

Category:

Education


0 download

TRANSCRIPT

Page 1: Linked data in pharma it univ 2 april 2012

Kerstin Forsberg

AstraZeneca, R&D, Sweden

Clinical Information Strategy

[email protected]

kerfors on Twitter, Google+, LinkedIn, Slideshare, Blogspot, citulike

Linked Data in Pharmaceutical R&D

PhD Informatics Seminar, IT University

Page 2: Linked data in pharma it univ 2 april 2012

What is this?

DBpedia

WikiData

Schema.org

Page 3: Linked data in pharma it univ 2 april 2012

Some recent headlines!

How DBpedia treats Wikipedia as a Database

Wikipedia’s Next Big Thing: Wikidata, A Machine-

Readable, User-Editable Database Funded By

Google, Paul Allen And Others

Yandex (Russia’s leading search engine) joins

Google, Yahoo! and Bing to collaborate on

Schema.org

Page 5: Linked data in pharma it univ 2 april 2012

Pharmaceutical Research and Development

Page 6: Linked data in pharma it univ 2 april 2012

Complexity

QoL? Outcomes?

Costs?

Pathophysiology?

Biomarkers?

Targets? Phenotypes?

Association and interpretation

of all data needed

has become a too complex task

for individuals, or even teams to handle.

Health care, pharma, academia,

authorities and payers.

Shared datasets

Different decisions and different types

of applications.

Page 9: Linked data in pharma it univ 2 april 2012

Opportunities Organized for associations

Prepared for not yet defined use

Ready for automation where computers can

function alongside us to

Mitigate the complexity in discovering, accessing,

connecting and interpreting information

Improve the productivity in managing

information

Page 10: Linked data in pharma it univ 2 april 2012

Semantic Web Standard Stack

Page 11: Linked data in pharma it univ 2 april 2012

RDF Triples

Resource Description Framework (RDF):

a general model of how any piece of data and

representations of knowledge can be expressed as

so called triples.

subject predicate

Stockholm place

Stockholm Sweden

Stockholm Port cities in Sweden

Stockholm “+46-8”

object (or value)

type

capital

subject

areaCode

“http://en.wikipedia.org/wiki/Stockholm” primaryTopic Stockholm

Page 12: Linked data in pharma it univ 2 april 2012

RDF Triples

Triples can be aggregated into graphs with subject

and objects as nodes, and predicates as arcs.

place

Sweden

Stockholm Port cities in Sweden

“+46-8”

type

capital

subject

areaCode

“http://en.wikipedia.org/wiki/Stockholm” primaryTopic

Page 13: Linked data in pharma it univ 2 april 2012

RDF Triples

Graphs of triples can be extended across different

sources and for different purpose.

place

Sweden

Stockholm Port cities in Sweden

“+46-8”

type

capital

subject

areaCode

Country type

Gothenburg

subject

CDISC

CDISC

Interchange

EU 2012

“http://en.wikipedia.org/wiki/Stockholm” primaryTopic

Page 14: Linked data in pharma it univ 2 april 2012

RDF Triples

RDF Schema and the RDF based Web Ontology

Language (OWL) add a typing mechanism to classify

subjects and objects into hierarchies of types

place

Sweden

Stockholm Port cities in Sweden

“+46-8”

type

capital

subject

areaCode

Country type CDISC

CDISC

Interchange

EU 2012

“http://en.wikipedia.org/wiki/Stockholm” primaryTopic

Adm.Area

Place

subClass

subClass

subClass

Organization

type

Business

Event

type

Event

subClass

Thing subClass subClass

Gothenburg

subject

Page 15: Linked data in pharma it univ 2 april 2012

RDF Triples

Simple Knowledge Organization System (SKOS) is

a thin RDF based vocabulary that can be used to

build terminologies of broader/narrower concepts.

place

Sweden

Stockholm Port cities in Sweden

“+46-8”

type

capital

subject

areaCode

CDISC

CDISC

Interchange

EU 2012

“http://en.wikipedia.org/wiki/Stockholm” primaryTopic

Organization

type

Business

Event

type

Gothenburg

subject

Cities in

Sweden

broader

Populated

places in Europe

broader narrower

narrower

Page 16: Linked data in pharma it univ 2 april 2012

4 Principles for Linked Data … … and 5 stars for Linked Open Data

• Use URIs (Uniform Resource Identifiers) as names for things.

• Use HTTP URIs so that people can look up (dereference) those names.

• When someone looks up a URI, provide useful information.

• Include links to other URIs so that they can discover more things.

Source: Linked Open Data star scheme by example

More resources introducing and describing the Linked Data idea

Page 17: Linked data in pharma it univ 2 april 2012

Linked Open Data cloud

Richard Cyganiak and Anja Jentzsch

http://lod-cloud.net/

Growing Linked Open Data Cloud

http://youtu.be/TXFYSWuEOOw

Page 18: Linked data in pharma it univ 2 april 2012

Linked Enterprise Data

Source: What does Open Data mean for Enterprises?

More resources introducing and describing the Linked Data idea

Page 19: Linked data in pharma it univ 2 april 2012

I’m encouraged by …

• … what actually can be done by applying Linked

Data principles, together with a stepwise

implementation and pragmatic application of

crucial building blocks, to …

• … improve the research and commercial

utility of information

• Organized for associations

• Prepared for not yet defined use

• Ready for automation where computers can

function alongside us to

Mitigate the complexity in discovering,

accessing, connecting and interpreting

information

Improve the productivity in managing

information

1

9

Health Care and Life Sciences (HCLS)

Interest Group

Linking Open Drug Data

EU project The Large Knowledge Collider

Linked Life Data

A 2-page summary of our learnings from

participating in these external projects:

Linked Data in Pharma, 2011, Bo Andersson

and Kerstin Forsberg