intro to linked data: context

Post on 27-Jan-2015

109 Views

Category:

Education

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

 

TRANSCRIPT

An Introduction

toLinked Data

David Hyland-WoodUniversity of Mary Washington

Part 1 of 5Linked Data Context

Monday, May 16, 2011

Agenda

• Setting the scene

• What is Linked Data?

• RDF - A data format for the Web

• Data modeling without context

• URIs - Naming Things on the Web

Monday, May 16, 2011

Agenda

• Turtle - Human-readable RDF

• RDF/XML - RDF for XML pipelines

• Other RDF formats: RDFa, N3

• RDF Schema - Describing structure

Monday, May 16, 2011

Agenda

• Converting existing data into RDF

• Linked Open Data on the Web

• SPARQL - A query language for RDF

• Linked Data publishing options

• Building Linked Data applications

Monday, May 16, 2011

http://linkeddatabook.com/editions/1.0/

http://3roundstones.com/linking-enterprise-data/

Reading

Monday, May 16, 2011

Final Project

• Collect/convert/publish data about Fredericksburg

•   UMW, restaurants, Liberty Town, parks/rec, music/bands, etc, etc.

•   Whatever you want.

• Together, we will build one or more applications using that data.

Monday, May 16, 2011

Grades

• Grades will be based on class participation and completion of the final project.

Monday, May 16, 2011

A Process

PublishConvertDescribeNameModelIdentify

Monday, May 16, 2011

A Process

PublishConvertDescribeNameModelIdentify

Maintain

Monday, May 16, 2011

Settingthe

SceneMonday, May 16, 2011

Monday, May 16, 2011

shared innovation™

1970s 1980s 1990s

$ cat foo.txt | grep blah | sort

A neat little package Client-Server The Early Web

A HISTORY OF SILOS

Monday, May 16, 2011

shared innovation™

Web

Universal Client

Universal Connection

Universal Database

Logic and interlinking

Ubiquitous,reusable applications

URL Curation

of Data

Monday, May 16, 2011

0

10

20

30

40

50

60

70

80

1990 1992 1994 1996 1998 2000 2002 2004 2006 2008

Access per 100 population worldwide

1992

Mobile cellular subscriptionsFixed telephone linesInternet users

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Data Inflation

Megabyte (MB) = 220

Gigabyte (GB) = 230

Terabyte (TB) = 240 1,000GB

Petabyte (PB) = 250 1,000TB

Exabyte (EB) = 260 or 1,000PB

Zettabyte (ZB) = 270 or 1,000EB

Yottabyte (YB) = 280 or1,000ZB

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

March of the Machines

“Information created by machines and used by other machines will probably grow faster than

anything else...

Only 5% of the information that is created is structured but this is changing as content

on the Web is increasing tagged.”

- Roger Bohn of University of California, San DiegoStudy on American Households

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

The painter... does not fit the paints to the world.He fits himself to the paint.

-- Paul Klee

Monday, May 16, 2011

Monday, May 16, 2011

17%

49%

16%

13%

4%

6 months12 months18 months24 monthsMore than 24 months

Acceptable ROI for IT

Monday, May 16, 2011

What isLinkedData?

Monday, May 16, 2011

The Web of Documents• A global network of linked documents

• A place where anyone can say anything about anything

• A vast collection of human-readable knowledge (and opinion)

• Documents are linked, but links are not qualified

Monday, May 16, 2011

The Web of Data• A global network of linked statements

• A place where anyone can say anything about anything

• A vast collection of machine-readable knowledge (and opinion)

• Statements are linked, and links are qualified

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

The Web of Data• Modeled for re-use using simple

statements

• Named using HTTP URIs for things and for documents

• Described using simple schema which can be remixed and extended

• Data published on the web in RDF

Monday, May 16, 2011

The Web of Data• Governments

http://data.gov/, http://data.gov.uk/ and others

• The BBChttp://www.bbc.co.uk/programmes/developers

• The New York Timeshttp://data.nytimes.com/

• The Library of Congresshttp://id.loc.gov/

• Facebookhttp://developers.facebook.com/docs/opengraphhttp://opengraphprotocol.org/

Monday, May 16, 2011

RDF:A data format for

the WebMonday, May 16, 2011

RDF is ...• Resource Description Framework

http://www.w3.org/RDF/

• A simple, powerful, data model

• Used to define relationships between things

• Designed to work with the architecture of the web

• The data model for the Web of Data

Monday, May 16, 2011

Allow anyoneto say anythingabout anything

Monday, May 16, 2011

Monday, May 16, 2011

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

ID Name

1 Cape Canaveral, United States

2 Tyuratam (Baikonur Cosmodrome), U.S.S.R

3 Xichang Space Launch Center

Monday, May 16, 2011

Spacecraft Name Mass Launched Launch Site

1969-018A Apollo 9 26801.0 1969-03-03 1

1969-030A Mariner 7 411.8 1969-03-27 1

1969-043A Apollo 10 28834.0 1969-05-18 1

1969-059A Apollo 11 Command and Service Module (CSM) 28801.0 1969-07-16 1

1969-059C Apollo 11 Lunar Module / EASEP 15065.0 1969-07-16 1

1969-099A Apollo 12 Command and Service Module (CSM) 28790.0 1969-11-14 1

1970-029C Apollo 13 Lunar Module/ALSEP 15196.0 1970-04-11 1

1970-029A Apollo 13 Command and Service Module (CSM) 28945.0 1970-04-11 1

ID Name

1 Cape Canaveral, United States

2 Tyuratam (Baikonur Cosmodrome), U.S.S.R

3 Xichang Space Launch Center

Monday, May 16, 2011

Resources...• Represent things on the web, like web

pages — Information Resources

• Represent things not on the web, like people and places — Non-Information Resources

• Can represent anything at all

• Named using URIs (usually)

• May not have a name — Blank Nodes

Monday, May 16, 2011

Literal Values...• Are values to work with and show

users

• Can be just a string of text — Plain Literals

• Can have a language assigned to the them using ISO codes

• Can have a specific datatype assigned to them — Typed Literals

Monday, May 16, 2011

Predicates are...• Relationships between Resources

• Named using URIs

• Described in Schema(or vocabularies, or ontologies)

Monday, May 16, 2011

Apollo 11 Command and Service Module Cape Canaveral Cape Canaveral,

United States namelaunchsite

Statement or Triple

Monday, May 16, 2011

Apollo 11 Command and Service Module Cape Canaveral Cape Canaveral,

United States namelaunchsite

Subject Predicate Object

Statement or Triple

Monday, May 16, 2011

Apollo 11 Command and Service Module Cape Canaveral Cape Canaveral,

United States namelaunchsite

Subject Predicate Object

Statement or Triple

Statement or Triple

Monday, May 16, 2011

Apollo 11 Command and Service Module Cape Canaveral Cape Canaveral,

United States namelaunchsite

Subject Predicate Object

Statement or Triple

Statement or Triple

Subject Predicate Object

Monday, May 16, 2011

Simple Rules for Merging• URIs name the resources we are

describing

• Two people using the same URI are describing the same thing

• The same URI in two datasets becomes one URI with all of the properties from both datasets

• RDF data can use URIs from many different websites

Monday, May 16, 2011

Simple Rules for Merging• Graphs from several different sources

can be merged

• Resources with the same URI are considered identical

• No limitations on which graphs can be merged

Monday, May 16, 2011

Any RDF can be merged with any

other RDF

Monday, May 16, 2011

RDF...• is a Graph data model

XML is a tree data modelRDBMS is a table data model

• can be easily distributed

• is easily mergeable

• can say anything about anything

• is not newthe spec is from 1999the idea, EAV, stems back to the late 1960s

Monday, May 16, 2011

Thanks to

• Rob Styles, Talis Group Ltd for his CC-BY licensed “Two Day Introduction to the Web of Data” slides and spacecraft example data.

Monday, May 16, 2011

This work is Copyright © 2010 David Hyland-WoodBased partially on CC-BY licensed content Copyright © 2010 Talis Systems Limited by Rob Styles. It is licensed under the Creative Commons Attribution 3.0 Unported License.Full details at: http://creativecommons.org/licenses/by/3.0/

You are free:

to Share — to copy, distribute and transmit the work

to Remix — to adapt the work

Under the following conditions:

Attribution. You must attribute the work in the manner specified by the author or licensor (but not in any way that suggests that they endorse you or your use of the work).

• For any reuse or distribution, you must make clear to others the license terms of this work.• Any of the above conditions can be waived if you get permission from the copyright holder.• Nothing in this license impairs or restricts the author's moral rights.• Some Content in the work may be licensed under different terms, this is noted separately.

Monday, May 16, 2011

top related