from data portal to knowledge portal: leveraging semantic technologies to support interdisciplinary...

35
deepcarbon.net Xiaogang Ma, Patrick West , John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox Tetherless World Constellation Rensselaer Polytechnic Institute From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

Upload: xiaogang-marshall-ma

Post on 14-Apr-2017

444 views

Category:

Education


0 download

TRANSCRIPT

Page 1: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

deepcarbon.net

Xiaogang Ma, Patrick West, John Erickson, Stephan Zednik, Yu Chen, Han Wang, Hao Zhong, Peter Fox

Tetherless World ConstellationRensselaer Polytechnic Institute

From data portal to knowledge portal:Leveraging semantic technologies to support interdisciplinary studies

Page 2: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

2

Outline

• Deep Carbon Observatory

• Deep Carbon Virtual Observatory (DCvO)

– Architecture of DCvO

– DCO Ontologies

– Boundary activities

– Discovering information by clicking through

• Summary

Page 3: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

3

A 10-year (2009-2019) initiative to intensify global attention and scientific effort in the burgeoning field of deep carbon science

Page 4: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

4

• Faculty, staff and students from the Tetherless World Constellation (TWC) at Rensselaer Polytechnic Institute (RPI)

• Responsible for– DCO Architecture and technology infrastructure– DCO Computer Cluster– The Deep Carbon Virtual Observatory DCvO

Deep Carbon Observatory – Data Science

Page 5: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

5

Deep Carbon Virtual Observatory

Scientists – actually ANYONE - should be able to access a global, distributed knowledge base of scientific data and information that:• appears to be integrated• appears to be locally available • is in a language (written, programming, or science)

that is understandable and can be sharedData intensive – volume, complexity, mode, scale,

heterogeneity, … in an OPEN WORLD

Page 6: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

6

Deep Carbon Virtual Observatory

• A vision of the DCvO:– A conceptual model of the interplay between data, people,

publication, instruments, models, organizations, etc.– Identify, annotate and link all key entities, agents and activities – A repository for datasets and associated metadata– Unique and powerful data and metadata visualization for

dissemination of information– Facilitates the discovery of potential collaborations– An integrated portal for diverse content and applications

(Fox et al., 2014)

Page 7: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

7

DCvO “Architecture”

Page 8: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

8

vivo.cornell.edu

VIVO - represents academic research

communities

DCO ontology: a model for concept types and relationships

DCO ontologies extend each other and the VIVO ontology

Page 9: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

9

Ontologies and schemas used in the DCO web portal

Name Prefix

Dublin Core Metadata Element Set dc

DCMI Metadata Terms dct

VIVO Core vivo

VIVO Scientific Research Ontology scires

Data Catalog Vocabulary dcat

Bibliographic Ontology bibo

Citation Counting and Context Characterization Ontology c4o

Citation Typing Ontology cito

FRBR-Aligned Bibliographic Ontology fabio

Event Ontology event

Friend of a Friend foaf

vCard Ontology vcard

Geopolitical Ontology geo

Simple Knowledge Organization System skos

DCO Ontology dco

PROV Ontology prov

Page 10: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

10

Ontologies and schemas used in the DCO web portal

DCO Boundary Activities are driving the extensions within the DCO Ontologies

Page 11: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

11

DCO Extension for Project Updates

Page 12: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

12

Dynamically generated list of Grants that are part of the Deep Carbon

Observatory. Users can click through to learn more, and members can create

reports to be sent to funding orgs

Page 13: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

13

Grant page lists all projects and reporting updates for each of the

projects and field studies

Page 14: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

14

DCO Extension for Data Types

Page 15: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

15

A Few Boundary Activities

• Given a DOI pull publication information from CrossRef

and/or Web of Science

• DCO IGSN Allocation Agent to work with the IGSN

Registry

• Integration with existing data portals and repositories

• Data Rescue activities

Page 16: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

16

Modern informatics enables a new scale-free framework approach

• Use cases• Stakeholders• Modeling• Ontologies• Evaluation

Page 17: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

17

What does a DCO data publication look like?

Page 18: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

18

Identification and annotation

Information on the landing page of a dataset

Page 19: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

19

Linking to enable forward and backward tracking

Landing page of Helium Concept

Page 20: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

20

Landing page of a person

Linking to build Collaborations

Page 21: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

21

Landing page of a research area

Linking to build Collaborations

Page 22: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

22

DCO Knowledge Graph Analytics

Page 23: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

23

Thus… progress…

• Integrative – semantics• Transparent – semantics• Collaborative – semantics• Application integration

– Yep – semantics

Page 24: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

24

Thank you!Patrick West, [email protected], https://deepcarbon.net, http://tw.rpi.edu

Page 25: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

25

An integrated portal: deepcarbon.net

Page 26: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

26

Faceted publication

browser

Page 27: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

27

Repository for archiving datasets

Archived datasets of ‘Noble gas isotope abundances in

terrestrial fluids’

Page 28: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

28

Collaboration tools

Group Based CollaborationGroup data deposit and

reporting

Listings of group content

Group management

and messaging

Page 29: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

29

RDA DTR and PIT adoption

The DTR primitives are comparable to a list of BASIC DATA TYPE CLASSES in the DCO ontology, e.g. Dataset, Image, Video, Audio, etc.

A registered DCO dataset is asserted as an instance of one of those basic data type classes.

It is possible to further annotate the dataset with the SPECIFIC DATA TYPES defined within a DTR, and each data type has a unique PID.

A Few Boundary Activities

Page 30: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

Results of data type specification

• Updates to the DCO Ontology:– A new class dco:DataType. Each specific data type is an instance of it– An object property dco:hasDataType linking a dataset and a data type– A collection of other classes and properties associated with dco:DataType

30

Page 31: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

31

• New datasets available via dataset browser• Includes citations to the originating publication• Data files accessible through dataset repository

Thermodynamic Data Rescue

Page 32: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

32

DCO Knowledge Store Analytics

Page 33: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

33

DCO Knowledge Store Visualizations

Page 34: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

34

All information is linked and traceable!

Page 35: From data portal to knowledge portal: Leveraging semantic technologies to support interdisciplinary studies

35

Mediation

From: C. Borgman, 2008, NSF Cyberlearning Report, Illustration by Roy Pea and Jillian C. Wallis

Guess

6th Generation

All these generations of mediation are in effect as we collaborate