linked data: thinking big, starting small

23
Department of Parliamentary Services Parliamentary Library and Information Service Linked Data: thinking big, starting small VALA 6 February 2014 Peter Neish @peterneish

Upload: peter-neish

Post on 09-May-2015

762 views

Category:

Technology


4 download

DESCRIPTION

How we implemented Linked Data in a small project at the Victorian Parliamentary Library.

TRANSCRIPT

Page 1: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Linked Data:thinking big, starting small

VALA 6 February 2014

Peter Neish@peterneish

Page 2: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

What will be covered

• Background

– What is Linked Data?

– Linked Data in Libraries and Government

• What we did

– Linked Data Workflow

• What did we get out of it?

Page 3: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

What is Linked Data?

SubjectDenis Napthine

Predicateparty

ObjectLiberal Party

the triple statementslightly simplified example

http://www.w3.org/ns/org#memberOf

http://parliament.vic.gov.au/members/id/135 http://dbpedia.org/resource/Liberal_Party_of_Australia

1 October 1988

premier

United Australia

Party

Ted Baillieu

elected

hasRole

successorOf party

31 August 1945

formationDate

successorOf

Page 4: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Page 5: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Page 6: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Page 7: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Linked Data in Libraries

• OCLC – 1.2 million resources – 80 million triples

• LOC – Subject headings, authority files

• British Library – 2.8 million records, 93 billion triples

• BIBFRAME

• Schema Bib Extend Community Group

• LODLAM

Page 8: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Linked Data in Parliament and Government

– 6.4 billion triples of open government data

Page 9: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Open Government

Page 10: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Project aims• Is Linked Data useful in a

local context

• Explore the process of using Linked Data – where do you start?

• Being able to interrogate our data in new ways

• Use visualisation to gain new insights into data

Page 11: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Databases at Parliament

People and Organisations

Members of Parliament

government agencies

Media

media releases

newspaper clippings

video and audio clips

Documents

parliamentary debates (Hansard)

parliamentary papers

party policies

Page 12: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Linked Data Workflow

• choose ontology• investigate similar projectsPreparation

• clean data (cluster, facet)• named entity extraction• reconcile with other data

Clean and reconcile data

• output RDF• store data (files, triple store etc)Publish

Page 13: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

• Investigate similar projects

– Don’t reinvent the wheel

– Collaborate

• Choose an ontology (or build your own)

– Linked Data Open Vocabularies (lov.okfn.org)

Preparation

Page 14: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

• developing open government specifications relating to the legislature

• prioritizes reuse over novelty

• attempts to make it easy to represent real-world data

• consensus model – open to contributions (W3C community group, github)

Popolo Ontologypopoloproject.com

Page 15: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Clean and reconcile data

Page 16: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Clean and reconcile data

Page 17: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

• create RDF (Open Refine can do this too)

• store data

– separate files

– embedded in html

– Database mapping using D2RQ

– triple store

Publish

Page 18: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

• Combined approach

– embedded data in catalogue

– Fuseki Triple Store

• Complex queries using SPARQL:

– what have previous speakers being saying about the current issues in parliament?

– find all articles about transport that mention members of the Road Safety Committee

What do we get out of it?

Page 19: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Page 20: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Links to related articles

Page 21: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Federal Preferences 2013 Election

Page 22: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Conclusion

• The process itself is valuable

• Aligning data with standards (Popolo Ontology)

• Cleaning and reconciling adds value to data

• Databases linked internally

• Can now provide Linked Data externally

Page 23: Linked Data: thinking big, starting small

Department of Parliamentary ServicesParliamentary Library and Information Service

Further Information

Linked Data best practise and recipes

• freeyourmetadata.org

• linkeddatabook.com

• euclid-project.eu

@peterneish

github.com/peterneish