Slides: organicdatascience.org/papers/eswc15-talk.pdf

Outline: Motivation, Introduction, ODS Framework, ODS Architecture, ODS Usage, Conclusion

Supporting Open Collaboration in Science through Explicit and Linked Semantic Description of Processes

¹ Information Sciences Institute, University of Southern California
² Center for Integrated Data Analytics, U.S. Geological Survey
³ Software Engineering for Business Information Systems, Technical University of Munich
⁴ Civil and Environmental Engineering, Penn State University
⁵ Center for Limnology, University of Wisconsin

Yolanda Gil¹, Felix Michel¹,³, Varun Ratnakar¹, Jordan Read², Matheus Hauder³, Christopher Duffy⁴, Paul Hanson⁵, Hilary Dugan⁵

12th ESWC 2015

Organic Data Science Framework: http://www.organicdatascience.org/

USC INFORMATION SCIENCES INSTITUTE Felix Michel 2


Evolution of the scientific enterprise

Evolution of the scientific enterprise from [Barabasi, 2005] extended with the ATLAS Detector Project at the Large Hadron Collider [The ATLAS Collaboration, 2012].


A New Kind of Collaborative Platform

Need a platform to support science collaborations that require:

– Significant organization and coordination
– Maintaining a community over the longer term
– Growing the community based on unanticipated needs

Taxonomy of Science Communities [Bos et al 2007], with an example of each:

– Shared Instruments: NEON
– Community Data Systems: PDB
– Open Community Contribution Systems: Zooniverse
– Virtual Communities of Practice: GLEON
– Virtual Learning Communities: VIVO
– Distributed Research Centers: ENCODE
– Community Infrastructure Projects: CSDMS


Requirements of Scientific Collaborations

– Multi-disciplinary contributions
– Significant coordination
– Unanticipated participants


The Organic Data Science Framework

Science processes describe the what, who, when, and how of the activities pursued by the collaboration using semantic representations.

The ODS Framework incorporates principles from studies of successful on-line communities [Michel et al. 2015].

It exposes science processes declaratively in order to:

– support the formation of ad-hoc groups to work on tasks of interest,
– enable anyone to contribute to tasks that match their interests,
– advertise ongoing work to potential newcomers.


Representing Collaborative Tasks

– Task based organization
– Unique Task URL
– Hierarchical Task structure

[Screenshot of a task page. Labels: Task Metadata, Subtasks, Top-Level Tasks, Current Task, Parent Task, Text, Context Tasks, Task Filter]


Task Metadata

ESCW2015


Task Status (Progress Estimation)

Semantic properties are used to estimate the progress and status of tasks.

Legend:

– Low-level task: low uncertainty in the estimation; progress is estimated by users
– Medium-level task: medium uncertainty in the estimation; progress is the average of its subtasks
– High-level task: high uncertainty in the estimation; linear progress based on start and target date

Rule: The abstraction level of a parent node/task can be equal or higher but not lower.

[Figure: example task tree (including "PIHM model Documentation" and "Calibrating …") annotated with progress values between 70% and 95%]
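The three estimation rules in the legend, together with the parent/child rule, can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the framework's implementation: the `Task` class, its field names, the numeric level constants, and the clamping of linear progress to the range 0 to 1 are all introduced here for the example.

```python
from dataclasses import dataclass, field
from datetime import date
from typing import List, Optional

# Hypothetical abstraction levels; a higher number means a more abstract task.
LOW, MEDIUM, HIGH = 1, 2, 3


@dataclass
class Task:
    name: str
    level: int
    user_estimate: Optional[float] = None  # low-level: entered by users (0..1)
    start: Optional[date] = None           # high-level: start date
    target: Optional[date] = None          # high-level: target date
    subtasks: List["Task"] = field(default_factory=list)

    def add_subtask(self, child: "Task") -> None:
        # Rule from the slide: a parent's abstraction level may be equal to
        # or higher than its child's, but never lower.
        if self.level < child.level:
            raise ValueError(f"{self.name!r} is less abstract than {child.name!r}")
        self.subtasks.append(child)


def progress(task: Task, today: date) -> float:
    """Estimate progress in [0, 1] according to the task's abstraction level."""
    if task.level == LOW:
        # Low uncertainty: take the estimate provided by users.
        return task.user_estimate or 0.0
    if task.level == MEDIUM:
        # Medium uncertainty: average of the subtasks' progress.
        if not task.subtasks:
            return 0.0
        return sum(progress(t, today) for t in task.subtasks) / len(task.subtasks)
    # High uncertainty: linear in elapsed time between start and target date.
    if task.start is None or task.target is None:
        return 0.0
    total_days = (task.target - task.start).days
    if total_days <= 0:
        return 1.0
    elapsed = (today - task.start).days
    return min(1.0, max(0.0, elapsed / total_days))
```

For instance, a medium-level task whose two low-level subtasks are estimated at 0.5 and 1.0 would report 0.75, while a high-level task halfway between its start and target dates would report 0.5 regardless of its subtasks.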


Task Status (Visualization)


Task Status (Sequences)


Task Cloning

When the same task is done by several users, each with their own data, a clone of the task is created and assigned to each user.

[Screenshot of cloned tasks. Labels: Cloned Task with Subtasks, Task Owner, Task Name, Unique Task Name, Cloned Tasks]
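One way to picture cloning is as a per-user copy of a task together with its whole subtask tree. The sketch below is illustrative only: the `Task` class, the `clone_for` helper, the user names, and the "owner: task name" naming scheme are assumptions made for the example. The slide's labels only suggest that a clone's unique name involves the task owner and the task name.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class Task:
    name: str
    owner: str = ""
    subtasks: List["Task"] = field(default_factory=list)


def clone_for(task: Task, user: str) -> Task:
    """Return a copy of `task` and all of its subtasks, owned by `user`."""
    return Task(
        name=f"{user}: {task.name}",  # assumed scheme for a unique task name
        owner=user,
        subtasks=[clone_for(sub, user) for sub in task.subtasks],
    )


# A shared task with subtasks, cloned once per participating user.
template = Task("Run model", subtasks=[Task("Prepare data"), Task("Calibrate")])
clones = [clone_for(template, user) for user in ["alice", "bob"]]
```

Each clone carries its own copy of the subtasks, so one user's progress on their data does not affect another user's clone or the original task.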


Representing Users

[Screenshot of a user page. Labels: Username, Expertise, Recent Edits, Credits, User-Defined Properties, Current Tasks, Future Tasks, Completed Tasks]


Representing other Entities of Interest

[Screenshots of wiki pages for: Workflows, User-described/provided data, Software component]


Queries

Semantic properties of the task can be used in queries.

Semantic properties are also used to generate wiki page content.

All the semantic properties are stored in the wiki framework as RDF assertions.

[Figure panels: Semantic Properties, Semantic Query, Dynamically Generated Content]

Semantic query shown on the slide:

==List of G16 Workshop Participants==
{{#ask:
[[Category:Person]]
[[Participant Of::G16 Workshop on Modeling the Age of Water]]
| format=ul
}}
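Because the properties are stored as assertions, the `#ask` query on this slide amounts to a conjunctive filter over (subject, property, value) triples. The following sketch mimics that filter in plain Python. The triples and page names are invented placeholders, and the `ask` helper is hypothetical rather than part of Semantic MediaWiki.

```python
from typing import Iterable, List, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, property, value)

WORKSHOP = "G16 Workshop on Modeling the Age of Water"

# Placeholder assertions; real ones would come from the wiki's property store.
TRIPLES: List[Triple] = [
    ("Person A", "Category", "Person"),
    ("Person A", "Participant Of", WORKSHOP),
    ("Person B", "Category", "Person"),
    ("Task X", "Participant Of", WORKSHOP),
]


def ask(triples: Iterable[Triple], conditions: List[Tuple[str, str]]) -> List[str]:
    """Return the subjects that satisfy every (property, value) condition."""
    triples = list(triples)
    matches: Set[str] = {s for s, _, _ in triples}
    for prop, value in conditions:
        matches &= {s for s, p, v in triples if p == prop and v == value}
    return sorted(matches)


participants = ask(TRIPLES, [("Category", "Person"), ("Participant Of", WORKSHOP)])

# Rendered in the spirit of `format=ul`: one bullet per matching page.
print("\n".join(f"* {name}" for name in participants))
```

Only "Person A" satisfies both conditions here, which mirrors how the two bracketed conditions in the wiki query are combined.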


Architecture

[Architecture diagram. Labels: Backend, Frontend; MediaWiki, Semantic MediaWiki, Provenance, Category handling, Property Table, Page Object Model (POM), Facts API, Task API, Completion API; User-Described Data, User-Provided Data, Data Entities, Workflows, Collaboration, Procedures; Person, Project, Location, Repository, Task, Task Navigation, Task Metadata, Executions, Templates, Components, Data Products]


Usage of the Organic Data Science Framework (1)

Evolution of the collaboration graph based on task metadata

Nodes: Users

Edges: Two users have common tasks (thickness indicates how many)
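The graph described here can be sketched as follows: users are nodes, and each edge carries a weight equal to the number of tasks the two users share. The task-to-participants mapping and all names below are hypothetical. In the framework this information would come from task metadata.

```python
from collections import Counter
from itertools import combinations
from typing import Dict, Set


def collaboration_edges(task_users: Dict[str, Set[str]]) -> Counter:
    """Map each user pair (sorted tuple) to the number of tasks they share."""
    edges: Counter = Counter()
    for users in task_users.values():
        # Sorting makes (a, b) a canonical key for the unordered pair.
        for a, b in combinations(sorted(users), 2):
            edges[(a, b)] += 1
    return edges


# Hypothetical task metadata: task name -> users participating in it.
task_users = {
    "Task 1": {"alice", "bob"},
    "Task 2": {"alice", "bob", "carol"},
    "Task 3": {"carol"},
}
edges = collaboration_edges(task_users)
```

The edge weight corresponds to the line thickness in the figure. Recomputing the edges from snapshots of the task metadata at different dates would give one graph per point in time.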


Usage of the Organic Data Science Framework (2)

Several collaborations are using the ODSF:

Age of Water is a community of hydrologists and limnologists who are studying the age of water in an ecosystem.

ENIGMA is a consortium for neuroimaging genetics that includes more than 70 institutions collaborating on joint neuroscience studies.

GPF is a group of geoscientists publishing a special issue of a journal. The articles follow a similar format in that they include all datasets, software, and workflows used to generate the results in the paper.

Community        # Pages   # Tasks   # Tasks with completed metadata   Avg. of task completion rates   # Registered users
Age of Water     759       380       350                               43.95%                          53
ENIGMA           204       80        2                                 2.50%                           6
GPF              239       168       168                               26.19%                          32
ODS Framework    417       77        61                                77.92%                          19
ODS Training     1,235     1,115     1,112                             99.64%                          36


Conclusions

The Organic Data Science Framework (ODSF) is a new approach for scientific collaboration that opens the science process and enables scientists to formulate new tasks and contribute to tasks posed by others. It is built on a semantic wiki platform.

ODSF captures semantic properties about shared tasks, participants, and other relevant entities.

Provenance of assertions is created and used to credit contributors.

The framework is being used by several communities.

Future work:

Organizing tasks using semantic properties

Supporting setting up process for new communities


Thank You

Organic Data Science Framework: http://www.organicdatascience.org/

Development: https://github.com/IKCAP/organicdatascience

Acknowledgments: We gratefully acknowledge funding from the US National Science Foundation under grant IIS-1344272.