supporting open collaboration in science through explicit...
TRANSCRIPT
Supporting Open Collaboration in Science through Explicit and Linked Semantic Description of Processes
1Information Sciences Institute, University of Southern California 2Center for Integrated Data Analytics, U.S. Geological Survey3Software Engineering for Business Information Systems, Technical University of Munich4Civil and Environmental Engineering, Penn State University5Center for Limnology, University of Wisconsin
Yolanda Gil1, Felix Michel13, Varun Ratnakar1, Jordan Read2, Matheus Hauder3,
Christopher Duffy4, Paul Hanson5, Hilary Dugan5
12th ESWC 2015
Organic Data Science Frameworkhttp://www.organicdatascience.org/
USC INFORMATION SCIENCES INSTITUTE Felix Michel 2
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Evolution of the scientific enterprise
Evolution of the scientific enterprise from [Barabasi, 2005] extended with the ATLAS Detector Project at the Large Hadron Collider [The ATLAS Collaboration, 2012].
Motivation
USC INFORMATION SCIENCES INSTITUTE Felix Michel 3
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Need a platform to support science collaborations that require:
– Significant organization and coordination
– Maintaining a community over the longer term
– Growing the community based on unanticipated needs
A New Kind of Collaborative PlatformIntroduction
Shared Instruments NEON
Community Data Systems PDB
Open Community Contribution Systems Zooniverse
Virtual Communities of Practice GLEON
Virtual Learning Communities VIVO
Distributed Research Centers ENCODE
Community Infrastructure Projects CSDMS
Taxonomy of Science Communities [Bos et al 2007]
USC INFORMATION SCIENCES INSTITUTE Felix Michel 4
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Requirements of Scientific Collaborations
Introduction
Multi-disciplinary contributions
Significant coordination
Unanticipated participants
USC INFORMATION SCIENCES INSTITUTE Felix Michel 5
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Science processes describe the what, who, when, and how of the activities pursued by the collaboration using semantic representations.
The ODS Framework incorporates principles from studies of successful on-line communities [Michel et al. 2015]
The Organic Data Science Framework
Exposes science processes declaratively to support:
formation of ad-hoc groups to work on tasks of interest,
to enable anyone to contribute to tasks that match their interests,
to advertise ongoing work to potential newcomers.
ODS Framework
USC INFORMATION SCIENCES INSTITUTE Felix Michel 6
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Representing Collaborative Tasks
Task Metadata
SubtasksTop-Level
Tasks
Current Task
Current Task
Parent Task
Text
Task based organization
Unique Task URL
Hierarchical Task structure
ContextTasks
Task Filter
ODS Framework
USC INFORMATION SCIENCES INSTITUTE Felix Michel 7
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Task Metadata
ESCW2015
ODS Framework
USC INFORMATION SCIENCES INSTITUTE Felix Michel 8
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Task Status (Progress Estimation)
PIHM model Documentation
Calibrating …90%
85%
80%
70% 80% 90%
90%
Low-level TaskLow uncertainty in estimation Estimated by users
90%
90%
90%
Medium-level TaskMedium uncertainty in estimation Average of its Subtasks
High-level TaskHigh uncertainty in the estimation Linear progress based on start and target date
90%
95%
ODS Framework
Semantic properties are used to estimate the progress and status of tasks
Legend
The abstraction level of a parent node/task can be equal or higher but not lower.
Rule:
USC INFORMATION SCIENCES INSTITUTE Felix Michel 9
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Task Status (Visualization)ODS Framework
USC INFORMATION SCIENCES INSTITUTE Felix Michel 10
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Task Status (Sequences)ODS Framework
USC INFORMATION SCIENCES INSTITUTE Felix Michel 11
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Task Cloning
Cloned Taskwith Subtasks
Same task is done by several users with their own data: a clone of the task is created and assigned to each user
Task Owner Task Name
Unique Task Name
ClonedTasks
ODS Framework
USC INFORMATION SCIENCES INSTITUTE Felix Michel 12
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Representing Users
Expertise
Username
Recent Edits
Credits
ODS Framework
User-DefinedProperties
Current Tasks
Future Tasks
Completed Tasks
USC INFORMATION SCIENCES INSTITUTE Felix Michel 13
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Representing other Entities of InterestODS Framework
Wo
rkfl
ow
sUse
r-d
escr
ibed
/pro
vid
ed d
ata
Soft
wa
re c
om
po
nen
t
USC INFORMATION SCIENCES INSTITUTE Felix Michel 14
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Queries
==List of G16 Workshop Participants==
{{#ask:
[[Category:Person]]
[[Participant Of::G16 Workshop
on Modeling the Age of Water]]
| format=ul
}}
Semantic Properties Semantic Query Dynamically Generated Content
ODS Framework
Semantic properties of the task are can be used in queries
Semantic properties also used to generate wiki page content
All the semantic properties are stored in the wiki framework as RDF assertions.
USC INFORMATION SCIENCES INSTITUTE Felix Michel 15
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Architecture ODS Architecture
MediaWiki
Semantic MediaWiki
Provenance
Category handling Property Table
Page Object Model (POM)
Facts API
Task APICompletion API
User-Described
Data
Workflows CollaborationProceduresP
erso
n
Pro
ject
User-Provided
Data
Data Entities
Loca
tio
n
Rep
osi
tory
Per
son
Task
Task
Nav
igat
ion
Task
Met
adat
a
Exec
uti
on
s
Tem
pla
tes
Co
mp
on
ents
Dat
a P
rod
uct
s
…
Bac
ken
dFr
on
ten
d
USC INFORMATION SCIENCES INSTITUTE Felix Michel 16
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Usage of the Organic Data Science Framework (1)
ODS Usage
Evolution of the collaboration graph based on task metadata
Nodes: Users Edges: Two users have common tasks (thickness indicates how many)
USC INFORMATION SCIENCES INSTITUTE Felix Michel 17
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Usage of the Organic Data Science Framework (2)
Age of Water is community of hydrologists and limnologiststhat are studying the age of water in an ecosystem.
ENIGMA a consortium for neuroimaging genetics, it includes more than 70 institutions that collaborate to do joint neuroscience studies.
GPF a group of geoscientists publishing a special issue of a journal. The articles follow a similar format in that they include all datasets, software, and workflows used to generate the results in the paper
Community # Pages # Tasks # Tasks with
completed Metadata
Avg. of Task
Completion Rates
# Registered
Users
Age of Water 759 380 350 43.95% 53
ENIGMA 204 80 2 2.50% 6
GPF 239 168 168 26.19% 32
ODS Framework 417 77 61 77.92% 19
ODS Training 1,235 1115 1112 99.64% 36
ODS Usage
Several collaborations using the ODSF:
USC INFORMATION SCIENCES INSTITUTE Felix Michel 18
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
ConclusionsConclusion
The Organic Data Science Framework (ODSF) is a new approach for scientific collaboration that opens the science process and enables scientists to formulate new tasks and contribute to tasks posed by others. It is built on a semantic wiki platform.
ODSF captures semantic properties about shared tasks, participants, and other relevant entities.
Provenance of assertions is created and used to credit contributors.
The framework is being used by several communities.
Future work:
Organizing tasks using semantic properties
Supporting setting up process for new communities
USC INFORMATION SCIENCES INSTITUTE Felix Michel 19
ODS FrameworkIntroductionMotivation ODS Architecture ODS Usage Conclusion
Thank You
https://github.com/IKCAP/organicdatascience
Organic Data Science Frameworkhttp://www.organicdatascience.org/
Development
AcknowledgmentsWe gratefully acknowledge funding from the US National Science Foundation under grant IIS-1344272.