research data alliance plenary 9: ddri working group session

44
Data Description Registry Interoperability Working Group Co-Chairs: Amir Aryani, Adrian Burton and Brigitte Hausstein

Upload: amiraryani

Post on 16-Apr-2017

9 views

Category:

Technology


1 download

TRANSCRIPT

Data Description Registry Interoperability Working Group

Co-Chairs: Amir Aryani, Adrian Burton and Brigitte Hausstein

Agenda• Background and Introduction

• Challenge of cross-platform discovery, DDRI WG outcome and Implemntation (Switchboard)

• Adopters

• New Developments

• Distributed research graph

• JSON-LD

• Open Issues: Connecting to organisation and control vocabulalries graphs

BackgroundChallenge of cross-platform discovery

{All started here!}

Research Data Australia Suggested Links

March 2014, Version 12

Data Description Registry Interoperability (DDRI) Working Group

Research Data Alliance

Goal: enabling cross-platform discovery between research data infrastructures

DDRI WG Approach

Connecting datasets on the basis of co-authorship or other collaboration models such as joint funding and grants.

Research Data Alliance

Show 105 morepublications

http://dx.doi.org/10.1371/journal.pone.0079092

One of the 105 articles …

doi:10.5061/dryad.4qq0v

Authors: Wong ESW, Nichol S, Warren WC, Belov K

Dryad Dataset

http://datadryad.org/resource/doi:10.5061/dryad.4qq0v

We have found another dataset from the same author…

Dataset

Researcher

Publication

Dataset

Toward anAutomated Solution

Research Data SwitchboardToward automating the process

Using machines…

Exploring the graph…

Connecting Datasets by Three Degrees of Separation

Adoptors

National Computational Infrastructure

(NCI Australia)

nci.org.au nci.org.au

@NCInews

15#February##2015#

NCI#Overview###

nci.org.au @NCInews

Mission: World-class, high-end computing services for Australian research and innovation

What is NCI:• Australia’s most highly integrated e-infrastructure environment• Petascale supercomputer + highest performance research cloud + highest performance storage in the

southern hemisphere• Comprehensive & integrated expert service — internat. vanguard• National/internationally renowned support team

NCI is national and strategic:• Driven by national research priorities and excellence• Engaged with research institutions/collaborations and industry• A capability beyond the capacity of any single institution• Sustained by a collaboration of agencies/universities ($11+M p.a.)

NCI is important to Australia because it:• Enables research that otherwise would be impossible• Enables delivery of world-class science• Enables interrogation of big data, otherwise impossible• Enables high-impact research that matters; informs public policy• Attracts and retains world-class researchers for Australia• Catalyses development of young researchers’ skills

Research(Outcomes(

Communi1es(and((Ins1tu1ons/(

Access(and(Services(

Exper1se(Support((and((

Development(

HPC(Services(Virtual(Laboratories/(DataBintensive(Services((

(Integra1on(

Compute((HPC/Cloud)((Storage/Network(Infrastructure(

Res

earc

h O

bjec

tives

National Institute of Informatics (NII Japan)

3

Academic SocietyAcademic

SocietyAcademic Society

ReaD(JST) Google

Scholar

Official reports of Grants-in-aidFor scientific

research0.7 million reports

KAKEN

Resolving and linking service for

researchers' names0.15 millionresearchers

ResearcherName

ResolverNII-ELS

Refereed andUn-refereed articles from

societies or bulletins3.9 million articles

Electronic journal repositories for

universities3.5 million articles

NII-REO

Book Catalogs120 million +α

records

Comprehensivemetadata

harvester for Japanese Irs

2 million articles

JAIRO

indexing servicefor academic papers

19 million records

CiNii

InstitutionalRepositoryInstitutional

RepositoryInstitutionalRepositoryInstitutional

RepositoryInstitutionalRepository

UniversityResearcherDatabaseInstitutional

RepositoryInstitutionalRepositoryInstitutional

RepositoryInstitutionalRepositoryInstitutional

Repository

UniversityResearcherDatabase

UniversityResearcherDatabase

UniversityResearcherDatabase

UniversityResearcherDatabase

UniversityResearcherDatabase

Scholarly information services in NII

Universities and Institutes

Other Services

Information on ResearchersInformation on Papers

(650 organizations)(43 organizations)

UniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryInformation on Books

J-Stage(JST) NDL

InformaticsSquare

Society support with publishing and communication in

Informatics areaPowered by

OLCerSpringer

CrossRef

Projects Books

NACSIS CAT

People

WebcatPlus

Researchers profiles and SNS

ResearchMap

Papers

3

May, 2008-October, 2008-

13

14

14

Academic SocietyAcademic

SocietyAcademic Society

ReaD(JST) Google

Scholar

Book Catalogs108 million +α

records

InstitutionalRepositoryInstitutional

RepositoryInstitutionalRepositoryInstitutional

RepositoryInstitutionalRepository

UniversityResearcherDatabaseInstitutional

RepositoryInstitutionalRepositoryInstitutional

RepositoryInstitutionalRepositoryInstitutional

Repository

UniversityResearcherDatabase

UniversityResearcherDatabase

UniversityResearcherDatabase

UniversityResearcherDatabase

UniversityResearcherDatabase

Scholarly information services in NII

Universities and Institutes

Other Services

Information on ResearchersInformation on Papers

(130 organizations)(43 organizations)

UniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryUniversityLibraryUniversity

LibraryInformation on Books

J-Stage(JST) NDL

Official reports of Grants-in-aidFor scientific

research0.61 million reports

KAKEN

Resolving and linking service for

researchers' names0.15 millionresearchers

ResearcherName

ResolverNII-ELS

Refereed andUn-refereed articles from

societies or bulletins3.4 million articles

Electronic journal repositories for

universities3.5 million articles

NII-REO

Comprehensivemetadata

harvester for Japanese IRs

0.5 million articles

JAIRO

indexing servicefor academic papers

12 million records

CiNii

InformaticsSquare

Society support with publishing and communication in

Informatics areaPowered by

OLCerSpringer

CrossRef

Projects Books

NACSIS CAT

People

WebcatPlus

R&D Center for Scientific Information ResourcesScholarly and Academic Information Division

Researchers profiles and SNS

ResearchMap

Papers

Australian National Data Service (ANDS)

RD-Switchboard and integration with Research Data Australia

As Australia’s first university our reputation spans more than 160 years. In 2014, we were ranked in the top 0.3% of universities worldwide. Across 16 Faculties we taught more than 33,000 undergraduate and 19,000 postgraduate students, including 10,800 international students from more than 145 countries.

The Australian Federal Government’s Excellence in Research for Australia (ERA) initiative rated our research at “above and well above world standard across 21 (of 22) broad discipline areas.

These disciplines include earth sciences, agricultural and veterinary sciences, mathematical sciences, engineering, biomedical and clinical health sciences, psychology and cognitive sciences, law and legal studies, language and communication, history and archaeology, and philosophy and religious studies.

The University of Sydney Early Adopter

New Adoptors

LOD Research GraphGESIS – Leibniz-Institute for the Social Sciences is the largest infrastructure institution for the Social Sciences in Germany. With da|ra, GESIS provides access to the data referencing system for social science research data.

• Purpose:

• Making connections between high-value collections (research datasets) and other scholarly works such as publications and grants discoverable in Europe

• Goals:

• Research graph that holds the connections between datasets, publications and grant information from European and Australian research institutions

• Software service that makes the graph accessible to third party software systems, data repositories and infrastructure providers

• Integration of the graph in the ResearchGraph.org website with results accessible using Linked Open Data and JSON-LD

• Challenges:

• Research an efficient way to link to GRID and organization Graph

• Research an efficient way to link to vocabulary Graph

• Access to new data sources

© 2017, RMIT University© 2017, RMIT UniversityThe eResearch Office

RMIT is a global university of technology, design and enterprise.One of Australia's original tertiary institutions, RMIT University enjoys an international reputation for excellence in professional and vocational education, applied research, and engagement with the needs of industry and the community.

RMIT is a world leader in Art and Design; Architecture; Education; Engineering; Development; Computer Science and Information Systems; Business and Management; and Communication and Media Studies.

A pilot project with Research Graph

Deliverables:• identification of differences between data derived from Graph and RM.• identification of “important" data that is not in RM that can be derived from

the larger Graph.• document opportunities to improve the larger Graph from RMIT University’s

data, and vice-versa.

Purpose: • compare the data from Graph with data from RMIT University’s Research

Master (RM) system.• evaluate the technical aspect, such as: ease of configuration, deployment,

usability, etc.• identify the “value add” to existing institutional data through integration with

the larger Graph.• prepare the groundwork for a larger project which will deploy Graph into a

production environment.

Goal: • create a set of RMIT records that contain Researcher, Publication, Research

data and Grants models from the Graph schema. These records would include data about a small number of top-performing researchers at RMIT University.

RMIT University

New Developments

Creating a distributed graph

http://researchgraph.org/schema/

Distributed Graph• Connecting

• Open data infrastrcutures to

• Close Research Management systems

• Reducing the cost and techinical requitement for implementing the Switchboard and creating connected research graphs.

Connecting Infrastructures

Connecting Graphs

Publication

Publication

Publication

Researcher

Grants

ResearchData

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

PublicationPublication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Grants

Publication

Publication

Publication

Publication

Publication

Researcher

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

PublicationPublication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

Publication

PublicationPublication

Publication

Publication

Publication

Publication

Research Data

Research Data

Publication

Publication

Research Data Researcher

PublicationGrant

Legend

Suppoting JSON-LD and schema.org

Open issues / connecting the graph to

• field of research codes e.g. ANZSRC FOR https://vocabs.ands.org.au/anzsrc-seo

• research oganisations information e.g. GRID database https://grid.ac/

Further information• Collaborators: http://researchgraph.org/collaborators/

• Publications: http://researchgraph.org/collaborators/publications

• Technology:

• https://github.com/researchgraph

• https://github.com/rd-switchboard

• http://researchgraph.org/schema/

To get involved in this project please contact [email protected]