capturing data provenance with a user-driven feedback approach

20
The 21st International Congress on Modelling and Simulation (MODSIM2015) MINERAL RESOURCES FLAGSHIP Anusuriya Devaraju and Jens Klump 4 th December 2015 CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH by Matthew Niederberger

Upload: anusuriya-devaraju

Post on 10-Feb-2017

530 views

Category:

Data & Analytics


2 download

TRANSCRIPT

Page 1: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

The 21st International Congress on Modelling and Simulation (MODSIM2015)

MINERAL RESOURCES FLAGSHIP

Anusuriya Devaraju and Jens Klump

4th December 2015

CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

by Matthew Niederberger

Page 2: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Outline

• Definitions (Provenance, Research Data, User Feedback)

• Motivation

• Goals & Solutions

• Summary

Page 3: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Outline

• Definitions (Provenance, Research Data, User Feedback)

• Motivation

• Goals & Solutions

• Summary

Page 4: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

User Feedback

4 |

• Feedback refers to information about reactions to a product.

• Feedback Types

• General (comment, how-to, suggestion, dissuasion)• Requirements (feature, content, improvement)• Rating• User experience (assessment and usage)

Page 5: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Provenance

• Also known as lineage

• Information about entities and processes involved in producing and delivering a resource.

5 |

Image: http://ajdcreative.com.au

Page 6: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Research Data Provenance

• Research data are facts, observations or experiences on which an argument, theory or test is based.1

• Forward data provenance describes how a data is used/applied after it has been created.

6 |

Data creation and processing

Data assessment and application

DAT

A P

RO

VID

ER

DAT

A C

ON

SUM

ER

1The University of Melbourne draft policy on the Management of Research Data and Records

Page 7: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

How Do Feedback and Provenance Relate?

7 | Image : http://whartonmagazine.com/blogs/women-and-leadership-moving-forward/

Forward provenance information may be gathered via a user feedback approach.

Der

ived

Dat

aset

s

Dat

a A

pp

licat

ion

Dis

cove

red

Issu

es

Forward Data Provenance

Data Creation & Publication

Page 8: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Outline

• Definitions (Provenance, Research Data, User Feedback)

• Motivation

• Goals & Solutions

• Summary

Page 9: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Why does feedback information matter?

Use the feedback information to handle erroneous data and improve existing data collection and processing methods.

9 |

Page 10: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Why does feedback information matter?

Feedback information from data consumers gives a better insight into application and assessment of published data sets

10 |

Page 11: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Existing Feedback Mechanisms

11 |

Research Data Portals Feedback Mechanism

Research Data Australia (RDA) General feedback form, and user contributed tags for datadiscovery

CSIRO Data Access Portal Refer to the email of the data collector in the metadata

TERN Data Discovery Portal General contact form

Australian Ocean Data Network Portal (AODN)

General contact form and portal help forum

Atlas of Living Australia (ALA) UserVoice feedback portal

OzFlux Data Portal Email link (for all inquiries and assistance)

National Marine Mammal Data Portal General feedback form

Urban Research Infrastructure Network Email link for general inquiries, Social media buttons for distribute the link of a data set.

Table 1. Examples of research data portals and their feedback mechanisms

Page 12: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Outline

• Definitions (Provenance, Research Data, User Feedback)

• Motivation

• Goals & Solutions

• Summary

Page 13: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Goals

Develop a systematic and reusable approach to

1. Capture feedback information from data users on research data sets

2. Link feedback information to actual data sets

3. Support discovery of research data using feedback information.

13 |

Page 14: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Feedback Application Server

Da

ta P

ort

al w

ith

Feedback P

lug

in

Linked Data &

SPARQL Clients

Feedback Data Store (MySQL)

RESTFeedback Web

Service

RDF

SPARQLD2R Server

D2R Engine

JSON RDF

User Feedback System

14 |

Feedback from users may be gathered :• Implicit (automated

tracking of data activities)

• Explicit (predefined input templates)

1 Gather feedback

2 Store feedback

3 Publish feedback

Page 15: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

15 |

1. Gather Feedback1

Page 16: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

16 |

A relational data model representing key aspects of user feedback:• Feedback types and

contributors• Target data and context• Supporting documents

2. Store Feedback2

Page 17: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

17 |

3. Publish Feedback

Feedback published as Linked DataEntities and agent involved in an error report

feedback activity

3

Page 18: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Conclusions

• Contribution : A user-centric approach to capture forward provenance information of research datasets.

• What’s Next? Record provenance in python

18 |

Page 19: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Conclusions

19 |

• DataSync from eSciDoc - Synchronizes feedback information

Page 20: CAPTURING DATA PROVENANCE WITH A USER-DRIVEN FEEDBACK APPROACH

Thank You…

20 |

IMPORTANT ASPECTS:VALUE, EASY, FAST..