niso webinar: library linked data: from vision to reality

151
NISO Webinar: Library Linked Data: From Vision to Reality December 11, 2013 Speakers: Jon Voss - Strategic Partnerships Director, We Are What We Do Matt Miller - Front End Developer, NYPL Labs at the New York Public Library Silvia Southwick - Digital Collections Metadata Librarian, UNLV University Libraries Cory Lampert - Head, Digital Collections , UNLV University http://www.niso.org/news/events/2013/webinars/linke

Upload: national-information-standards-organization-niso

Post on 15-Jan-2015

3.636 views

Category:

Education


2 download

DESCRIPTION

About the Webinar The library and cultural institution communities have generally accepted the vision of moving to a Linked Data environment that will align and integrate their resources with those of the greater Semantic Web. But moving from vision to implementation is not easy or well-understood. A number of institutions have begun the needed infrastructure and tools development with pilot projects to provide structured data in support of discovery and navigation services for their collections and resources. Join NISO for this webinar where speakers will highlight actual Linked Data projects within their institutions—from envisioning the model to implementation and lessons learned—and present their thoughts on how linked data benefits research, scholarly communications, and publishing. Speakers: Jon Voss - Strategic Partnerships Director, We Are What We Do LODLAM + Historypin: A Collaborative Global Community Matt Miller - Front End Developer, NYPL Labs at the New York Public Library The Linked Jazz Project: Revealing the Relationships of the Jazz Community Cory Lampert - Head, Digital Collections , UNLV University Libraries Silvia Southwick - Digital Collections Metadata Librarian, UNLV University Libraries Linked Data Demystified: The UNLV Linked Data Project

TRANSCRIPT

Page 1: NISO Webinar: Library Linked Data: From Vision to Reality

NISO Webinar: Library Linked Data:

From Vision to Reality

December 11, 2013

Speakers: Jon Voss - Strategic Partnerships Director, We Are What We Do

Matt Miller - Front End Developer, NYPL Labs at the New York Public Library

Silvia Southwick - Digital Collections Metadata Librarian, UNLV University Libraries

Cory Lampert - Head, Digital Collections , UNLV University Libraries

http://www.niso.org/news/events/2013/webinars/linked_data

Page 2: NISO Webinar: Library Linked Data: From Vision to Reality
Page 3: NISO Webinar: Library Linked Data: From Vision to Reality
Page 4: NISO Webinar: Library Linked Data: From Vision to Reality
Page 5: NISO Webinar: Library Linked Data: From Vision to Reality
Page 6: NISO Webinar: Library Linked Data: From Vision to Reality
Page 7: NISO Webinar: Library Linked Data: From Vision to Reality
Page 8: NISO Webinar: Library Linked Data: From Vision to Reality
Page 9: NISO Webinar: Library Linked Data: From Vision to Reality
Page 10: NISO Webinar: Library Linked Data: From Vision to Reality
Page 11: NISO Webinar: Library Linked Data: From Vision to Reality
Page 12: NISO Webinar: Library Linked Data: From Vision to Reality
Page 13: NISO Webinar: Library Linked Data: From Vision to Reality
Page 14: NISO Webinar: Library Linked Data: From Vision to Reality
Page 15: NISO Webinar: Library Linked Data: From Vision to Reality
Page 16: NISO Webinar: Library Linked Data: From Vision to Reality
Page 17: NISO Webinar: Library Linked Data: From Vision to Reality
Page 18: NISO Webinar: Library Linked Data: From Vision to Reality
Page 19: NISO Webinar: Library Linked Data: From Vision to Reality
Page 20: NISO Webinar: Library Linked Data: From Vision to Reality
Page 21: NISO Webinar: Library Linked Data: From Vision to Reality
Page 22: NISO Webinar: Library Linked Data: From Vision to Reality
Page 23: NISO Webinar: Library Linked Data: From Vision to Reality
Page 24: NISO Webinar: Library Linked Data: From Vision to Reality
Page 25: NISO Webinar: Library Linked Data: From Vision to Reality
Page 26: NISO Webinar: Library Linked Data: From Vision to Reality
Page 27: NISO Webinar: Library Linked Data: From Vision to Reality
Page 28: NISO Webinar: Library Linked Data: From Vision to Reality
Page 29: NISO Webinar: Library Linked Data: From Vision to Reality
Page 30: NISO Webinar: Library Linked Data: From Vision to Reality
Page 31: NISO Webinar: Library Linked Data: From Vision to Reality
Page 32: NISO Webinar: Library Linked Data: From Vision to Reality
Page 33: NISO Webinar: Library Linked Data: From Vision to Reality
Page 34: NISO Webinar: Library Linked Data: From Vision to Reality
Page 35: NISO Webinar: Library Linked Data: From Vision to Reality
Page 36: NISO Webinar: Library Linked Data: From Vision to Reality
Page 37: NISO Webinar: Library Linked Data: From Vision to Reality
Page 38: NISO Webinar: Library Linked Data: From Vision to Reality
Page 39: NISO Webinar: Library Linked Data: From Vision to Reality
Page 40: NISO Webinar: Library Linked Data: From Vision to Reality
Page 41: NISO Webinar: Library Linked Data: From Vision to Reality
Page 42: NISO Webinar: Library Linked Data: From Vision to Reality
Page 43: NISO Webinar: Library Linked Data: From Vision to Reality
Page 44: NISO Webinar: Library Linked Data: From Vision to Reality
Page 45: NISO Webinar: Library Linked Data: From Vision to Reality
Page 46: NISO Webinar: Library Linked Data: From Vision to Reality
Page 47: NISO Webinar: Library Linked Data: From Vision to Reality
Page 48: NISO Webinar: Library Linked Data: From Vision to Reality
Page 49: NISO Webinar: Library Linked Data: From Vision to Reality
Page 50: NISO Webinar: Library Linked Data: From Vision to Reality
Page 51: NISO Webinar: Library Linked Data: From Vision to Reality
Page 52: NISO Webinar: Library Linked Data: From Vision to Reality
Page 53: NISO Webinar: Library Linked Data: From Vision to Reality
Page 54: NISO Webinar: Library Linked Data: From Vision to Reality
Page 55: NISO Webinar: Library Linked Data: From Vision to Reality
Page 56: NISO Webinar: Library Linked Data: From Vision to Reality
Page 57: NISO Webinar: Library Linked Data: From Vision to Reality
Page 58: NISO Webinar: Library Linked Data: From Vision to Reality
Page 59: NISO Webinar: Library Linked Data: From Vision to Reality
Page 60: NISO Webinar: Library Linked Data: From Vision to Reality
Page 61: NISO Webinar: Library Linked Data: From Vision to Reality
Page 62: NISO Webinar: Library Linked Data: From Vision to Reality
Page 63: NISO Webinar: Library Linked Data: From Vision to Reality
Page 64: NISO Webinar: Library Linked Data: From Vision to Reality
Page 65: NISO Webinar: Library Linked Data: From Vision to Reality
Page 66: NISO Webinar: Library Linked Data: From Vision to Reality
Page 67: NISO Webinar: Library Linked Data: From Vision to Reality
Page 68: NISO Webinar: Library Linked Data: From Vision to Reality
Page 69: NISO Webinar: Library Linked Data: From Vision to Reality
Page 70: NISO Webinar: Library Linked Data: From Vision to Reality
Page 71: NISO Webinar: Library Linked Data: From Vision to Reality
Page 72: NISO Webinar: Library Linked Data: From Vision to Reality
Page 73: NISO Webinar: Library Linked Data: From Vision to Reality
Page 74: NISO Webinar: Library Linked Data: From Vision to Reality
Page 75: NISO Webinar: Library Linked Data: From Vision to Reality
Page 76: NISO Webinar: Library Linked Data: From Vision to Reality
Page 77: NISO Webinar: Library Linked Data: From Vision to Reality
Page 78: NISO Webinar: Library Linked Data: From Vision to Reality
Page 79: NISO Webinar: Library Linked Data: From Vision to Reality
Page 80: NISO Webinar: Library Linked Data: From Vision to Reality

Linked JazzRevealing the

relationships of the jazz community

Matt Miller@thisismmillerDecember 2013

Page 81: NISO Webinar: Library Linked Data: From Vision to Reality

Project Overview

• Investigating the application of Linked Open Data to enhance the discovery and visibility of digital cultural heritage materials.

• Build new methods of connecting cultural data.• Uncover meaningful connections between

documents and data related to the personal and professional lives of musicians who often practice in rich and diverse social networks.

Professor Cristina Pattuelli at the Pratt Institute School of Library Information Science is the director of the project which began in 2011.

Page 82: NISO Webinar: Library Linked Data: From Vision to Reality

Linked Data Now!

Why?• Bootstrap your project with existing data.• Highlights knowledge you have created and

knowledge that is missing. • Facilitates sharing, but also growing your own

project.

Page 83: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Identifying

Research QuestionHow can we discover and analyze the rich and diverse network of

relationships between jazz musicians?

Primary SourcesOral history interview transcripts

of jazz musicians.

Page 84: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Identifying

Research QuestionHow can we discover and analyze the rich and diverse network of

relationships between jazz musicians?

Primary SourcesOral history interview transcripts

of jazz musicians.

We need to know the names (and variants) of jazz

musicians in a structured controlled vocabulary.

Page 85: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Identifying

Charlie Parker

Many different LOD datasets contain this information. We need to access, query and link it

for only jazz related individuals.

Page 86: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Querying

Page 87: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Querying

• Processing the DBpedia dataset resulted in around 9,000 URIs.– DBpedia is fluid! After each release (currently 3.9) we

reprocess the files resulting in the addition of 500-700 URIs.

• We now have a name directory, but we want additional forms of personal names. To accomplish this we try mapping to Library of Congress.

• Matching DBpedia and LC URIs is not automatic.

Page 88: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Mapping• We matched identities based on:

• Name• Life Dates• White listed words found in sources

(http://www.loc.gov/mads/rdf/v1#Source)

• Reconciling authorities is difficult!• Use others work: http://viaf.org/viaf/data/

• But don’t discount your own processes.• Using our relatively simple process we

were able to match about 1500 more URIs than VIAF.org.

• This is due to a smaller domain (jazz).

Our name directory creation and authority matching is documented:

https://github.com/thisismattmiller/linked-jazz-name-directory

Page 89: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Curating

http://linkedjazz.org/public_demo_mapping/

Page 90: NISO Webinar: Library Linked Data: From Vision to Reality

Bootstrapping – Review

• Start small, think big.– Specific subject domain.– Large infrastructure not required (triple stores, etc.)

• Can get started with extract files and python scripting.

• Reuse as much as possible, but try new processes leveraging domain specificity.

• Always be curating, use tools to facilitate process but a human hand is often required.

Page 91: NISO Webinar: Library Linked Data: From Vision to Reality

Applying the Data

• Use the name directory to locate individuals in the interview transcript.

• This project phase involves 50 transcripts.• Because the names are tied to URIs we can

infer a relationship triple between two individuals.

<foaf:Person> <rel:knowsOf> <foaf:Person>

Page 92: NISO Webinar: Library Linked Data: From Vision to Reality

Applying the Data

Page 93: NISO Webinar: Library Linked Data: From Vision to Reality

Transcript Analyzer

Page 94: NISO Webinar: Library Linked Data: From Vision to Reality

Transcript Analyzer

• An interface to curate the transcripts and verify detected names.

• Implements off the shelf NLP (NLTK) to attempt to locate additional names not in our directory as well as corporate names and locations.

• Global rule system, as we process more transcripts the system is being trained.

• Using URIs to represent entities we can quickly see where we are discovering new material.– 50 Transcripts

• 1800 person entities tagged.• 250 names tagged without authoritative URI.

– Knowledge Creation

Page 95: NISO Webinar: Library Linked Data: From Vision to Reality

New Dataset

• We have created a new LOD dataset now of jazz musician’s relationships.

• Our next steps are:– Visualize.– Further qualify the rel:knowsOf relationships.– Provide access to the data created.

Page 96: NISO Webinar: Library Linked Data: From Vision to Reality

Visualize

http://linkedjazz.org/network/

Page 97: NISO Webinar: Library Linked Data: From Vision to Reality

Qualify Relationships – 52nd St.

• Recruit jazz experts and enthusiasts to help categorize relationships based on transcript text.

• We use existing vocabularies to build the data set: Foaf, Relationship Vocabulary, Music Ontology

• The interface is critical for crowdsourcing tools, we work with user experience experts and conduct user studies to refine our public facing tools

Page 98: NISO Webinar: Library Linked Data: From Vision to Reality

Qualify Relationships – 52nd St.

http://linkedjazz.org/52ndStreet/

Page 99: NISO Webinar: Library Linked Data: From Vision to Reality

Provide Access

• We provide a SPARQL endpoint.• But also a traditional API:

– http://linkedjazz.org/api/– Can return:

• JSON• N-Triples • Gephi graph files (GXEF)

Page 100: NISO Webinar: Library Linked Data: From Vision to Reality

Learn and Grow as a Team

• Experience through doing.

• Empower graduate students with skills and practical experience working with a LOD project.

• Use the project as a vehicle to make intra- and inter-intuitional collaborations.

Linked Jazz Team July 2013

Page 101: NISO Webinar: Library Linked Data: From Vision to Reality

Next Steps

• Refactor our prototype tools into sustainable open source projects.

• Redesign 52nd St. based on user study groups.• Work on emerging collaborations with Jazz Centers.

Page 102: NISO Webinar: Library Linked Data: From Vision to Reality

http://www.linkedjazz.org

Thanks!

Page 103: NISO Webinar: Library Linked Data: From Vision to Reality

Linked, Exposed Data: UNLV Linked Data Project

NISO Webinar: Library Linked Data: From Vision to RealityDecember 11, 2013

Silvia B. SouthwickDigital Collections Metadata LibrarianUNLV Libraries

Cory K. LampertHead, Digital CollectionsUNLV Libraries

Page 104: NISO Webinar: Library Linked Data: From Vision to Reality

Agenda

• Motivation • Environment• UNLV Linked Data project• Technologies• Transforming metadata into linked data• Next steps

Page 105: NISO Webinar: Library Linked Data: From Vision to Reality

How it Started

• Conferences and “buzz”• Curiousity and professional development• Exploration and pilot project• Compelling results; sharing impact of what

we’ve learned• Assessment • Much more to do...

Page 106: NISO Webinar: Library Linked Data: From Vision to Reality

Current Practice

• Data (or metadata) encapsulated in records• Records contained in collections• Very few links are created within and/or across

collections• Links have to be manually created• Existing links do not specify the nature of the

relationships among recordsThis structure hides potential links within and across collections

Page 107: NISO Webinar: Library Linked Data: From Vision to Reality

What we can do with linked data

• Free data from silos• Expose relationships• Powerful, seamless, interlinking of our data• Users interact or query data in new ways• Search results would be more precise• Data can be easily repurposed

Page 108: NISO Webinar: Library Linked Data: From Vision to Reality

Making the Case for Linked Data in Academic Library Digital Collections

– Problem: Rich metadata is being lost in dumbed down DC records

– Issue: Investment and resource allocation (Item-level philosophy)

– Goal: Increased: exposure, collaboration, and openness

– Outcome: Increased discovery and user-focus

Page 109: NISO Webinar: Library Linked Data: From Vision to Reality

Gaining Buy In

Administration• Innovative project, high impact• Pilot, experiment, learn by doing, share results Staff• We already have the metadata; We need to

transform them into triples• Managing change

Page 110: NISO Webinar: Library Linked Data: From Vision to Reality

Graphical Representation: One Record

Page 111: NISO Webinar: Library Linked Data: From Vision to Reality

Examples of records

Showgirls Menus

Dreaming theSkyline

Page 112: NISO Webinar: Library Linked Data: From Vision to Reality

titleDecember 12, 1915

Page 113: NISO Webinar: Library Linked Data: From Vision to Reality

Implications (Internal)

• Cross-unit collaboration is necessary• Staff expertise will evolve• Staff roles will change to accommodate new /

parallel workflow• Data clean-up will be an investment• Management of data becomes critical• Discovery issues = user interfaces still need

development

Page 114: NISO Webinar: Library Linked Data: From Vision to Reality

Implications (External)

• Publish data from our collections in the Linked Data Cloud to improve discoverability and connections with other related data sets on the Web

• Sharing data in new ways with new partners may raise new issues

• Need to engage with linked data community for technologies, tools, best practices, and to demand library vendor support for LOD.

Page 115: NISO Webinar: Library Linked Data: From Vision to Reality

UNLV Linked Data Project

Goals: • Study the feasibility of developing a common

process that would allow the conversion of our collection records into linked data preserving their original expressivity and richness

• Publish data from our collections in the Linked Data Cloud to improve discoverability and connections with other related data sets on the Web

Page 116: NISO Webinar: Library Linked Data: From Vision to Reality

PROJECT IMPLEMENTATION

Page 117: NISO Webinar: Library Linked Data: From Vision to Reality

Actions Technologies

Prepare dataExport data

Import dataPublish

Open Refine

Mulgara /Virtuoso

CONTENTdm

Import dataClean dataReconcileGenerate triplesExport RDF

Page 118: NISO Webinar: Library Linked Data: From Vision to Reality

Prepare / Export Data

Technology: CONTENTdm

• Increase consistency across collections: – metadata element labels– use of CV, share local CVs– etc.

• Export data as spreadsheet

Create mapping between metadata elements and EDM model predicates

Page 119: NISO Webinar: Library Linked Data: From Vision to Reality

OpenRefine

• Open source

• It is a server – can communicate with other datasets via http

• Open Refine and its RDF extension should be installed

Screenshots to show some of the functions we have used

Page 120: NISO Webinar: Library Linked Data: From Vision to Reality

OpenRefine first screen

Page 121: NISO Webinar: Library Linked Data: From Vision to Reality
Page 122: NISO Webinar: Library Linked Data: From Vision to Reality

Facets

Page 123: NISO Webinar: Library Linked Data: From Vision to Reality
Page 124: NISO Webinar: Library Linked Data: From Vision to Reality

Split multi-value cells

Page 125: NISO Webinar: Library Linked Data: From Vision to Reality
Page 126: NISO Webinar: Library Linked Data: From Vision to Reality
Page 127: NISO Webinar: Library Linked Data: From Vision to Reality

Facet view forGraphic Elementsafter splitting

Page 128: NISO Webinar: Library Linked Data: From Vision to Reality

Reconciliation

Page 129: NISO Webinar: Library Linked Data: From Vision to Reality

Specifying Reconciliation service

Page 130: NISO Webinar: Library Linked Data: From Vision to Reality

Activating Reconciliation

Page 131: NISO Webinar: Library Linked Data: From Vision to Reality
Page 132: NISO Webinar: Library Linked Data: From Vision to Reality
Page 133: NISO Webinar: Library Linked Data: From Vision to Reality
Page 134: NISO Webinar: Library Linked Data: From Vision to Reality

Creating a Skeleton

Page 135: NISO Webinar: Library Linked Data: From Vision to Reality
Page 136: NISO Webinar: Library Linked Data: From Vision to Reality
Page 137: NISO Webinar: Library Linked Data: From Vision to Reality

Exporting RDF files

Page 138: NISO Webinar: Library Linked Data: From Vision to Reality

Actions Technologies

Prepare dataExport data

Import dataPublishQuery

Open Refine

Mulgara /Virtuoso

CONTENTdm

Import dataClean dataReconcileGenerate triplesExport RDF

Page 139: NISO Webinar: Library Linked Data: From Vision to Reality

Mulgara Triple Store: Import

Page 140: NISO Webinar: Library Linked Data: From Vision to Reality

A simple SPARQL query

Select *

where

{ ?s ?p ?o} limit 100

Page 141: NISO Webinar: Library Linked Data: From Vision to Reality
Page 142: NISO Webinar: Library Linked Data: From Vision to Reality

SPARQL: Querying Data

• Using Virtuoso PivotViewer

Page 143: NISO Webinar: Library Linked Data: From Vision to Reality

Query

Costume DesignDrawings

Showgirls

Page 144: NISO Webinar: Library Linked Data: From Vision to Reality
Page 145: NISO Webinar: Library Linked Data: From Vision to Reality
Page 146: NISO Webinar: Library Linked Data: From Vision to Reality
Page 147: NISO Webinar: Library Linked Data: From Vision to Reality
Page 148: NISO Webinar: Library Linked Data: From Vision to Reality

Next steps for the UNLV project

• Transform all digital collections into linked data (parallel structure)

• Increase linkage with other datasets• Design interfaces to access and display our data

and related data from other datasets• Evaluate alternative interfaces from user’s

perspective• Produce a cost benefit analysis to inform future

plans for the development of digital collections

Page 149: NISO Webinar: Library Linked Data: From Vision to Reality

Thank You!

Questions?

Page 150: NISO Webinar: Library Linked Data: From Vision to Reality

NISO Webinar • December 11, 2013

Questions?All questions will be posted with presenter answers on the NISO website following the webinar:

http://www.niso.org/news/events/2013/webinars/linked_data

NISO Webinar: Library Linked Data: From Vision to Reality

Page 151: NISO Webinar: Library Linked Data: From Vision to Reality

Thank you for joining us today. Please take a moment to fill out the brief online survey.

We look forward to hearing from you!

THANK YOU