linked data standards and infrastructure for scientific publishing (w3c ledp 2011 workshop)

9
Linked Data Standards and Infrastructure for Scientific Publishing Bradley P. Allen Elsevier Labs W3C Workshop on Linked Enterprise Data Patterns 6 December 2011

Upload: bradley-allen

Post on 13-Dec-2014

849 views

Category:

Technology


1 download

DESCRIPTION

A presentation describing Elsevier's perspective of linked data in STM publishing, presented 2011-12-06 at the W3C Linked Enterprise Data Patterns Workshop in Cambridge, MA.

TRANSCRIPT

Page 1: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

Linked Data Standards and Infrastructure for Scientific Publishing

Bradley P. Allen Elsevier Labs W3C Workshop on Linked Enterprise Data Patterns 6 December 2011

Page 2: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

The role of linked data in STM publishing

Entities, concepts and relationships

Smart Content Delivery

Better understanding through analysis and visualization •Tag clouds •Heatmaps •Streamgraphs •Scatterplots •Time series •Animations

Better discovery through semantic search & navigation •Faceted search & browse •Ontology-driven navigation •Task-specific results •Personalized/localized results •Question answering

New knowledge through aggregation and synthesis •Topic pages •Social network maps •Geolocation maps •Data mashups •Text mining reports

Images

Text

Tables

Scholarly content

Scholarly knowledge organization systems

Linked data from partners and the Web

2

Page 3: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

3

Scientific publications as linked data

Linked data

Acquire

Transform, Enhance, Index, Analyze,

Compose

Deliver

Document

Entity record

Media object

Page 4: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

4

• Embrace linked data principles while leveraging our existing content production workflow and infrastructure – Find the right balance between production/QA and online

delivery • Leverage partners for content enhancement and

knowledge organization – Reuse Web-standard vocabularies, taxonomies, ontologies

and entity resources where possible • Build out linked data design patterns for application

development • Deliver benefits across the complementary use cases

of researcher and practitioner

Elsevier’s approach

Page 5: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

Elsevier work to date

• Standards – RDF named graphs

conformant with use-specific XML schemas for production/QA

– Taxonomies in SKOS • Infrastructure

– Linked Data Repository with CRUD API, Atom feeds for online delivery services

– Virtual Total Warehouse for content repository federation

• Applications – Semantic search for medical

researchers and practitioners

– Lancet, SciVerse app mashups

5

Page 6: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

6

• Easing technology adoption by enterprise IT staff

• Best practices for knowledge organization systems management

• Infrastructure for scholarly linked data publishing

LEDP2011: what we want to discuss

Page 7: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

7

• Tools and best practices for URL and namespace management and governance

• Best practices for publishing and consuming linked data that address IT concerns rather than legacy RDF issues – 2006 vs. later versions of “Four Principles” – Serialization “impedance mismatch” – RDF APIs vs. SPARQL – HTTP Range-14

Easing technology adoption

Page 8: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

8

• Tools and best practices for global/local knowledge organization systems management

• Standards for named entities and registries crucial to accreditation, provenance and trust – e.g. author identifiers and profiles in ORCID

Best practices for knowledge organization

Page 9: Linked Data Standards and Infrastructure for Scientific Publishing (W3C LEDP 2011 Workshop)

9

• Validators for linked data • Standards supporting scholarly publishing

workflows – Named graphs – Versioning – Access & entitlement

• Standards and best practices for annotation of scholarly content – e.g. CITO, SWAN, SIOC, AO, OAC

• Support for free text search

Infrastructure for scholarly linked data publishing