data in context co- chairs : brigitte jörg, keith jeffery

19
Data in Context Co-chairs: Brigitte Jörg, Keith Jeffery RDA 3rd Plenary, March, 26th - 28th, 2014 Dublin

Upload: nash

Post on 23-Feb-2016

35 views

Category:

Documents


0 download

DESCRIPTION

Data in Context Co- chairs : Brigitte Jörg, Keith Jeffery. RDA 3rd Plenary , March, 26th - 28th, 2014 Dublin. Brief History. 1st Plenary Gothenburg Preparing a WG Proposal /Case Statement „ Contextual Metadata “ A lot of interest Revision of Initial Use Cases - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Data in Context

Co-chairs: Brigitte Jörg, Keith Jeffery

RDA 3rd Plenary, March, 26th - 28th, 2014 Dublin

Page 2: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Brief History• 1st Plenary Gothenburg Preparing a WG Proposal/Case

Statement „Contextual Metadata“• A lot of interest• Revision of Initial Use Cases• Use Cases as specific as possible• Alignment with other WGs / Activities• Four revised use cases:

– Researcher: Find data ..– Manager: Indicate to funder – Provenance: Allow to take segments from streamed data workflows– Interoperability: Exchange of contextual metadata

• Rename Group to „Data in Context“

Page 3: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Data in Context IG Approach• Lifecycle Approach

– Linear Sequence of Elements– Cyclic Repetition of Elements

• Investigate Lifecycle Models– DCC: Conceptualize; Create; Access;

Use; Appraise; Select; Dispose; etc– DDI: Discovery & Planning; Initial

Data Collection; etc.– Research Lifecycle (Jisc): Research

Process: Simulate Experiment; Manage Data; Analyse; etc.

– etc. ??• Investigate contextually or

subcontextually-aware standardization work– OAIS; CASRAI; CERIF; VIVO; PROV;

PREMIS; MARC; CKAN; DCAT; ISO; W3C; OMG; Research Objects, etc.

• Investigate / Prioritize Reusable Requirements

• Deliverables: – M6: Overview of contextually-aware

standardization work– M12: Priority List of Requirements

• Goal: – Set up of a Working Group– Implementation of Standardized

Profiles

• Long-term Goal: – Automated Transformation

Between Standards

Page 4: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Collaboration / Exchange• RDA Foundation and Terminology• RDA Metadata Standards Directory WG• RDA PID Information Types WG• ICSU Open Metadata Catalogue and Knowledge Networks

WG• RDA/WDS Workflows for Publishing Data IG• RDA Data Description Registry Interoperability• RDA Semantic Interoperability Activity• RDA Metadata Interest Group• Various W3C groups (LOD, SW....)

Page 5: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Requirements / Needs

• Stakeholders• Data Producers• Data Consumers

• Standardized Open Vocabularies• Standardized Formal Data Profiles• Standardized Formal Semantics

Template

First Steps taken with developing a Template

Apply

Page 6: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

DCC – The Curation Lifecycle

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

http://www.dcc.ac.uk/digital-curation/what-digital-curation

Page 7: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

DDI Lifecycle

http://www.ddialliance.org/Specification/DDI-CV/

DDI Controlled VocabulariesAnalysis Unit; Character Set; Commonality Type; Lifecycle Event Type; Response Unit; Software Package; Summary Statistic TypeTime Method

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

Page 8: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Data Assets Framework

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semanticshttp://www.data-audit.eu/

Page 9: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Research Lifecycle

DDI Controlled VocabulariesAnalysis Unit; Character Set; Commonality Type; Lifecycle Event Type; Response Unit; Software Package; Summary Statistic TypeTime Method

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

http://www.jisc.ac.uk/whatwedo/campaigns/res3/jischelp.aspx

Page 10: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

RDA Practical Policy WG

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

Src: Slide Extract Rainer Stotzka, Reagan Moore provided for „Data in Context“ session, RDA 3rd Plenary

Page 11: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Data Lifecycle

Stakeholders

Data Producer

Data Consumer

Standardized Open Vocabularies

Standardized Formal Data Profiles

Standardized Formal Semantics

DATA

Collaboration &

Visualisation

Dissemination &

Sharing

Archiving &

Preserving

Analysis&

Data Mining

Acquisition &

Modeling

Src: Keynote Tony Hey at RDA 3rd Plenary

Page 12: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Experimental Context, Publishing and Research Objects

Proposal

Approval

SchedulingExperiment/Investigation

Data storage

Record Publication

Scientist submits application for

beamtime

Facility committee approves

applicationFacility registers,

trains, and schedules

scientist’s visit

Scientists visits facility, run’s experiment

Subsequent publication

registered with facility

Raw data filtered, and stored

Data analysis

Tools for processing made

available

Investigation as a first class object

Src: Slide extract Brian Matthews, STFC provided for „Data in Context“ session, RDA 3rd Plenary

Page 13: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Liberalised Meta-DataIs a network

13

Citation

Coverage(Temporal,

Spatial, Topic)

Use, Caveats, Lineage,

Methods, and Licenses

Publisher

People

Institutions

RDI Outputs/ Online

Resources

Projects

Initiatives

Networks

Funders

Relationships are contributed by (1) meta-data mining (2) information from websites conforming to schema (3) social-media-type sites and VREs (4) existing network contributions (5) scraping existing websites (6) ontologies and vocabularies (…)

Src: Slide Extract Wim Hugo, ICSU WDS provided for „Data in Context“ session, RDA 3rd Plenary

Page 14: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Etc.

• Data Curation Profiles (Purdue University)• ODP Model (ISO Reference Model for Open

Distributed Processing)

Page 15: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Standards

Jeffery et. al. 2013 http://resources.metapress.com/pdf-preview.axd?code=vl5422n2u7112669&size=largest

• e.g. • OAIS• CASRAI• CERIF• VIVO• PROV• PREMIS• MARC

• CKAN• DCAT• ISO• W3C• OMG• ODP• etc.

Page 16: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Emerging e-Infrastructure

Discovery

Contextual

Discovery

Jeffery et. al. 2013 http://resources.metapress.com/pdf-preview.axd?code=vl5422n2u7112669&size=largest

Page 17: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

AgendaSession 1: Thursday, March 27 - 15:30 - 17:00

• Introduction and Overview from Co-Chairs • Contributions from RDA Members

– Data Publishing Workflows, DCC Data Profiles (Angus Whyte) – Data Description Registry Interoperability (Amir Aryani)– Long-tail Data IG, Data Publishing IG (Jochen Schirrwagen)– WDS Knowledge Network activity (Wim Hugo) – Experimental Context, Publishing and Research Objects (Brian Matthews)– Reference Model Proposal (Yin Chen)

• Discussion

Notes Taking: Alessia Bardi, RDA Early Career Researchers Programme recipient.

Page 18: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

AgendaSession 2: Friday, March 28 - 11:00 – 12:30

• Recap and Overview from Co-Chairs • Contributions from RDA Members

– Semantic Interoperability, (Gery Berg-Cross) – Metadata WGs (Keith Jeffery, Rebecca Koskela)– Practical Policy Sessions (Slides Reagan Moore)

• Discussion

Notes Taking: Alessia Bardi, RDA Early Career Researchers Programme recipient.

Page 19: Data in  Context Co- chairs : Brigitte Jörg, Keith  Jeffery

Rough Work Plan

• M6: Overview of contextually aware standardization work

• M12: Priority List of Requirements

From there set up a RDA Working Group Requirements-drivenImplementation of Standards WG Plan