linked data presentation for libraries (como)

41
Making the Digital Connection: Linked Data and Libraries Robin Fay, UGA Laura Akerman, Emory Doug Goans, Georgia Tech Georgia COMO 2012 October 4, 2012 Macon, GA

Upload: robin-fay

Post on 16-Jan-2017

680 views

Category:

Education


5 download

TRANSCRIPT

Page 1: Linked data presentation for libraries (COMO)

Making the Digital Connection: Linked Data and Libraries

Robin Fay, UGALaura Akerman, Emory

Doug Goans, Georgia Tech

Georgia COMO 2012 October 4, 2012 Macon, GA

Page 2: Linked data presentation for libraries (COMO)

Head/Database Maintenance

University of Georgia Libraries

Robin Fay@georgiawebgurl

libraries

socialmedia

Page 3: Linked data presentation for libraries (COMO)

Making the Digital Connection: Linked Data and Libraries

Agenda:● Gain a basic understanding of linked data● Discover new library projects● Gain a better understanding of how linked data will drive our future

systems and how it fits into the future of the web -- the semantic web

What we will talk about:● Quick overview of semantic web and linked data (more

terminology in the glossary)● Getting started with Linked data and the Semantic Web● Making some connections with bibliographic data - a

case study

Page 4: Linked data presentation for libraries (COMO)

● What is linked data and open datao Linked data is about reusing datao We already do some linked data in our library

catalogs and even in our daily liveso The link in a bibliographic record (like an authority

record link) is linking datao A link that we share to our friends on facebook is

linked data (of sorts)● Linked data is a link to a record/data/content

that can then be utilized in some way ● Open data is data that available to be used

in some way with no barriers to access (licensing, etc.)

Making the Digital Connection: Linked Data and Libraries

Page 5: Linked data presentation for libraries (COMO)

Basic principles of linked data ● It keeps us from having to re-enter or copy information

o Making our data: reusable easy to correct (correct one record instead of multiples) efficient and potentially useful to others

● It can build relationships in different ways - allowing us to create temporary collections (a user could organize their search results in a way that makes sense to them) or more permanent (collocating ALL works by a particular author more easily; pulling together photographs more easily)

Making the Digital Connection: Linked Data and Libraries

Page 6: Linked data presentation for libraries (COMO)

How it fits in with the semantic webThe Semantic Web is based upon more precise utilization of data and is heavily dependent upon

The code The metadata and its metadata schemas (rules) The ability for machines (including devices and home

appliances) to talk to each other and make sense of that communication

Linking data makes this process easier since we do not have to re-enter data, we can just link to it. Linking data will make the Semantic Web happen.

Making the Digital Connection: Linked Data and Libraries

Page 7: Linked data presentation for libraries (COMO)

● Terminologyo RDF/XML (language - post MARC world) o URIs (Uniform Resource Identifer) linking point to

our data)o Serialization (store data; re-use later)o Triples (Simple semantic structure; Subject-

Predicate-Object)o SPARQL (QL = Query Language)

Making the Digital Connection: Linked Data and Libraries

Page 8: Linked data presentation for libraries (COMO)

● Advantages (reusable data, potential to provide and built relationships, discoverability)

● How library data fits into linked datao FRBR ( a bibliographic FRAMEWORK which is more

semantic by nature) RDA ( metadata rules which are not tied to a programming language such as MARC but can work with semantic web standards like XML); IRs, and CMS like Drupal which have semantic web capabilities

● RDA expressed as RDFa

Making the Digital Connection: Linked Data and Libraries

Page 9: Linked data presentation for libraries (COMO)

Getting Ready for LOD

Doug GoansHead of Library IT and Development

Georgia Institute of Technology

Page 10: Linked data presentation for libraries (COMO)

Readiness: What, Why and HowLinked is an approach to data.

Open is a policy.

Data is a technology and a set of standards.

Source: http://www.scholarslab.org/digital-libraries/introduction-to-linked-open-data-at-rare-books-school/

Making the Digital Connection: Linked Data and Libraries

Page 11: Linked data presentation for libraries (COMO)

Readiness: WhatDiscussions of data and services

Catalog, archives, repositories, library use data

LOD emerging services/data in general● WorldCat● DBpedia● data.gov

Making the Digital Connection: Linked Data and Libraries

Page 12: Linked data presentation for libraries (COMO)

Readiness: Data Records (MARC, etc.)

SOURCE: Getting triples from records: the role of ISBD http://www.slideshare.net/scottishlibraries/isbd-record2triples

Making the Digital Connection: Linked Data and Libraries

Page 13: Linked data presentation for libraries (COMO)

Readiness: Data Records (MARC, etc.)

SOURCE: Getting triples from records: the role of ISBD http://www.slideshare.net/scottishlibraries/isbd-record2triples

Making the Digital Connection: Linked Data and Libraries

Page 14: Linked data presentation for libraries (COMO)

Readiness: WhyDiscovery (Google Semantic Search, Facebook Open Graph)

Empowerment (library development, local and global)

Peripheral issues (important)● Data Use Policies / Licensing your data● System Readiness ● Teaching, Learning, Research (use cases and service

development)

Making the Digital Connection: Linked Data and Libraries

Page 15: Linked data presentation for libraries (COMO)

Readiness: How● Use Cases - http://obd.jisc.ac.uk/

● Integrate LOD into projects (IR, Archives, Web Presence)

● Explore RDF, RDFa, SPARQL end-points

● Framework approach (empower users and developers first)

Making the Digital Connection: Linked Data and Libraries

Page 16: Linked data presentation for libraries (COMO)

Readiness: HowDeveloping Use Cases http://obd.jisc.ac.uk/

Campus Competition

System Readiness

Making the Digital Connection: Linked Data and Libraries

Page 17: Linked data presentation for libraries (COMO)

Readiness: HowCurrent Services

Integrate LOD into projects

Framework and Services Approach

Making the Digital Connection: Linked Data and Libraries

Page 18: Linked data presentation for libraries (COMO)

Making some connections

Linked data at Emory LibrariesLaura Akerman

Page 19: Linked data presentation for libraries (COMO)

First we studiedConnections group formed Nov. 2011

Core group of 8, open to anyone Sponsors: Lars Meyer, John Ellinger Learning and planning for linked data important

Classes taught every other week Based on

Linked Data: Evolving the Web into a Global Data Space (Heath and Bizer)

High level overviewPrograms

Jon Voss, "LOD-LAM Emory", webinars

Making the Digital Connection: Linked Data and Libraries

Page 20: Linked data presentation for libraries (COMO)

Pilot planning● We were asked to develop a 3-month pilot project.● Goal: demonstrate value of linked data for users

and the library● Goal: get feet wet with "triples" and technology● With:

o 1 person, 20+hrs/week; 6 others, 1-3 hrs/week including a couple of "techies"

o Sandbox (repurposed pc/server running Linux)

Making the Digital Connection: Linked Data and Libraries

Page 21: Linked data presentation for libraries (COMO)

Pilot project - approved● After many ideas, choices -

o focus on topic (Civil War)oConvert sample "silo" metadata to linked dataoAdd some "new" metadata as linked dataoLink to some external dataoSome kind of display

● PROJECT BLOG: https://scholarblogs.emory.edu/connections/

Making the Digital Connection: Linked Data and Libraries

Page 22: Linked data presentation for libraries (COMO)

Results:Making the Digital Connection: Linked Data and Libraries

Page 23: Linked data presentation for libraries (COMO)

Things we learned, #1

Start small

Making the Digital Connection: Linked Data and Libraries

Page 24: Linked data presentation for libraries (COMO)

#2 Working with free software is time consuming!

● Even if you're a programmer...● Don't expect it to work the first time!● Patience, inner peace, get enough sleep!● Don't let fear of displaying your ignorance keep you

from learning!● Windows people: learn Linux command line!● When to "let it go", when to keep trying?● Tool evaluation is an important step!● Record what you learn and share!

Making the Digital Connection: Linked Data and Libraries

Page 25: Linked data presentation for libraries (COMO)

Things we are trying:Sesame - useful but web client limited (no batchload...)Callimachus - interesting but not ready to put time into it, and very betaLinkSailor - need to publish your data to use itPubby, Djubby - publish your data ??? (not working yet)Our own scripts to query id.loc.gov - got the basics...ArchivesHub stylesheet - useful as a starting point, but too complicated?Simile MARC to RDF stylesheets - too different from ArchiveHubLC MARC to RDF stylesheet - Dublin Core too simple but a good starting pointSimile Welkin - limited but at least it gave us some visualization!

Making the Digital Connection: Linked Data and Libraries

Page 26: Linked data presentation for libraries (COMO)

#3

We need linked data to find connections!

Making the Digital Connection: Linked Data and Libraries

Page 27: Linked data presentation for libraries (COMO)

Learn SPARQL

#4Making the Digital Connection: Linked Data and Libraries

Page 28: Linked data presentation for libraries (COMO)

Find two resources, name/creator in one is subject of another

SELECT DISTINCT ?resource1 ?name ?resource2 WHERE{{{?resource1 dc:creator ?name}UNION{?resource1 duch:origination ?person. ?person foaf:name ?name.}}{{?resource2 dc:subject ?name.}UNION{?resource2 duch:associatedWith ?concept. ?concept rdfs:label ?name.}}FILTER(?resource1 != ?resource2)}

Making the Digital Connection: Linked Data and Libraries

Page 29: Linked data presentation for libraries (COMO)

Some SPARQL resourcesSPARQL.PRO http://en.sparql.pro/wiki/Main_Pagelist of SPARQL endpoints with sample queries

XQuery/SPARQL Tutorialhttp://en.wikibooks.org/wiki/XQuery/SPARQL_Tutoria

W3C SPARQL Spechttp://www.w3.org/TR/rdf-sparql-query/

Making the Digital Connection: Linked Data and Libraries

Page 30: Linked data presentation for libraries (COMO)

#5

There are many ways to model your RDF - all

different!How do you decide?

Page 31: Linked data presentation for libraries (COMO)

ArchivesHub handles "subjects"<associatedWith><!--About the Concept (Person)--><skos:Concept xmlns:skos="http://www.w3.org/2004/02/skos/core#" rdf:about="http://duchamp.library.emory.edu/resource/id/concept/person/lcnaf/gearyjohnwhite1819-1873">

<rdfs:label xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:lang="en">Geary, John White, 1819-1873.</rdfs:label> <skos:inScheme> <skos:ConceptScheme rdf:about="http://duchamp.library.emory.edu/resource/id/conceptscheme/lcnaf"> <rdfs:label xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:lang="en">lcnaf</rdfs:label> </skos:ConceptScheme> </skos:inScheme> <foaf:focus xmlns:foaf="http://xmlns.com/foaf/0.1/"><!--About the Person--><foaf:Person rdf:about="http://duchamp.library.emory.edu/resource/id/person/lcnaf/gearyjohnwhite1819-1873"> <rdf:type rdf:resource="http://xmlns.com/foaf/0.1/Agent"/> <rdf:type rdf:resource="http://purl.org/dc/terms/Agent"/> <rdf:type rdf:resource="http://erlangen-crm.org/current/E21_Person"/> <rdfs:label xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xml:lang="en">Geary, John White, 1819-1873.</rdfs:label> </foaf:Person> </foaf:focus> </skos:Concept> </associatedWith>

Making the Digital Connection: Linked Data and Libraries

Page 32: Linked data presentation for libraries (COMO)

LC's MARCXML to RDF handles subjects:

dc:subject "Geary, John White, 1819-1873."

Making the Digital Connection: Linked Data and Libraries

Page 33: Linked data presentation for libraries (COMO)

Simile MARCXML to MODS to RDF: <modsrdf:subject rdf:resource= "http://simile.mit.edu/2006/01/Entity#Geary_John_White_18191873"/> <rdf:Description rdf:about= "http://simile.mit.edu/2006/01/Entity#Geary_John_White_18191873"> <rdf:type rdf:resource= "http://simile.mit.edu/2006/01/ontologies/mods3#Person"/> <modsrdf:fullName>Geary, John White</modsrdf:fullName> <modsrdf:dates>1819-1873</modsrdf:dates </rdf:Description>

Making the Digital Connection: Linked Data and Libraries

Page 34: Linked data presentation for libraries (COMO)

Our learning

No perfect models out there.

A common model for all our sources (with maybe some special items depending on type of content) could make things much easier for us and users of our data!

This is a major undertaking..

Making the Digital Connection: Linked Data and Libraries

Page 35: Linked data presentation for libraries (COMO)

#6 To FRBR, or not to FRBR?http://metadataregistry.org/schemaprop/list/page/1/schema_id/4.html

Making the Digital Connection: Linked Data and Libraries

Page 36: Linked data presentation for libraries (COMO)

#7 Matching up with external data is tricky!

● DBPedia: not easy to programmatically match up. We did some "by hand". o Issue: If only part of LC subject matches, should

you link?

● LC Vocabularies - id.loc.gov - straightforward but have to skip many final subdivisions.

● VIAF: another interesting target, but not helpful enough for this pilot.

Making the Digital Connection: Linked Data and Libraries

Page 37: Linked data presentation for libraries (COMO)

question: "Johnson's Island (Ohio)."

Q: Does this mean the island itself or is it referring to the prison that was located on the island?

o Found something for Johnson Island Civil War Prison and Fort Site

A: I used a SPARQL query to find the collection linked to this heading. The Scope and Content Note says letters were written while someone was imprisoned on the island.

http://dbpedia.org/resource/Johnson_Island_Civil_War_Prison_and_Fort_Site

Should we use "owl:sameAs"?

Making the Digital Connection: Linked Data and Libraries

Page 38: Linked data presentation for libraries (COMO)

#8 To have good linked data, you have to have... data

____________|1861_________________

No dates, no timeline(scholars could've found it useful)

Making the Digital Connection: Linked Data and Libraries

Page 39: Linked data presentation for libraries (COMO)

#9

MORE OPEN LINKED DATA!

Once you get started, you don't want to stop!

Making the Digital Connection: Linked Data and Libraries

Page 40: Linked data presentation for libraries (COMO)

Resources● LODLAM: http://lodlam.net/● LODAM CHALLENGE: http://summit2013.lodlam.net/● LODLAM Zotero Group (Webliography of good stuff): https://www.zotero.org/groups/lod-lam ● GLAMLOD: https://groups.google.com/group/glamlod● LC Bibliographic Framework Transition Initiative: http://www.loc.gov/marc/transition/ ● LITA - library linked data interest group: http://connect.ala.org/node/142470● Use Case Tool: http://obd.jisc.ac.uk/navigate

● Getting triples from records: the role of ISBD http://www.slideshare.net/scottishlibraries/isbd-record2triples

● Presentation materials at http://www.delicious.com/georgiawebgurl/metadata_presentation_como

Making the Digital Connection: Linked Data and Libraries

Page 41: Linked data presentation for libraries (COMO)

Thank You - Q&ARobin Fay [email protected]

Laura [email protected]

Doug [email protected]