establishing a generic research data repository: radar · digital cv / cris. contact: angelina...

20
Angelina Kraft, orcid.org/0000-0002-6454-335X Barcelona, 4 April 2017 ICSTI 2017 Annual Member’s Meeting & Workshops TACC Workshop Establishing a generic Research Data Repository: RADAR

Upload: others

Post on 01-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Angelina Kraft, orcid.org/0000-0002-6454-335X

Barcelona, 4 April 2017

ICSTI 2017 Annual Member’s Meeting & Workshops

TACC Workshop

Establishing a generic

Research Data Repository:

RADAR

Page 2: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 2

Agenda

1. RDM at the German National Library of Science and Technology (TIB)

DOI and DataCite

2. Why another (data) repository?

3. RADAR

Page 3: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 3

Requirements for Research Data

Preservation and Sharing

1. Trustworthy research data repositories

2. Data policies

3. Standards for data citation, metadata, licensing

4. Intellectual property rights and proprietary data

5. Methods and Tools adapted to the scientific workflows

6. Cost recovery strategies

7. Motivation for change

Page 4: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 4

DataCite in a nutshell

• Founded in December 2009

• International Membership organization

• 39 allocating members and 8 non-allocating members in 23 countries

• ~ 1,125 data centers

• DataCite DOIs have been assigned to 9.6 Mio DOIs of research datasets

making them public, citable, traceable.

• German non-profit association

• Business Office at German National Library of Science (TIB)

Page 5: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 5

DOI Service & Data Cite Business Office

DOIs registered via TIB (by March 2017)

• Total 1,165,411

62 % Research data

37 % Grey literature

1 % AV media

Registering data centers

• Total 139 data centers

Major research centers i.e. Pangaea, WDCC and ESO

65 universities/university libraries

RDM requirements at smaller/long-tail institutions?

Page 6: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 6

Challenges of ‘long-tail’ data:

• Heterogeneous

• Unique standards

• Costs: Set-up and maintenance oflong-term research data infrastructure

Approach:

Discipline-specific data repositories

or

Generic data repositories

Practical example: RADAR

German Research Foundation (DFG)

“The majority of datasets

produced through

research are part of the

‘Long Tail of Research

Data’”Source: Humphrey C (2014): OpenAIRE-COAR

Conference, Athens

Which data?

Page 7: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 7

Research Data Repository

Page 8: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 8

What is RADAR? An interdisciplinary repository for

- archival of research data as a generic service

- trustworthy preservation & traceable publication

Focus: Long Tail – Repository for specialized research

disciplines, addition to big data archives

Duration: September 2013 – August 2016,

project funded by German Research Foundation

Live System: Provided by

Support: Provided by

Project: https://www.radar-projekt.org

System: https://www.radar-service.eu

RADAR in a nutshell

Page 9: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 9

Why another data repository?

25 years guaranteed availability

Three copies geographically

distributed: SCC at Karlsruhe Institute of Technology,

City of Karlsruhe

SCC at Eggenstein-Leopoldshafen

Technical University Dresden

German legal framework

BagIt Files (In = Out + Metadata)

RADAR REST-API:

RADAR functionality accessed via HTTP and

JSON Requirements

Page 10: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 10

RADAR - the service

Page 11: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 11

Service & business model

Services:

Basic service: Archival Storage (5-15 years, expendable)

Extended service: Data Publication (25 years)

Features:

• Data Life Cycle support

• REST API for clients (customizable)

• Interoperability & cross-linking of

published datasets via API: DataCite, ORCID & others

• Optional Peer-Review Support

• Statistics on downstream data use

Prices:

Archival: 500 € annual fee + 0,39 € GB data volume per year (net price)

Publication: 6,37 € GB data volume with guaranteed availability for 25

years (net price)

Academic, publically funded institutions (in Germany)

Page 12: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 12

PENDING

Initial mode, editing possible (modification, update, deletion) –

up to 6 months

REVIEW

Dataset ‘frozen’ for duration of peer review process, ‘review-

URL’

ARCHIVED

Dataset is archived and identified via RADAR ID (no further

editing)

PUBLISHED

Dataset is published and identified via DOI (no further editing)

Dataset - status

Page 13: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 13

Scientists/Projects Institutions

– Libraries

– Research institutes

– Museums/Archives

Target groups - customers

Page 14: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 14

Administrator/(Sub)Curator

Administrator/Curator

Administrator Contract

Workspace

Dataset

Folder

File File

File

Dataset

File

Workspace

Roles & rights

Page 15: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 15

Register (ORCID iD)

Users can include their ORCID iD as part of their RADAR profile to ensure

they get credit for their work

Page 16: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 16

Login (Shibboleth)

You can use your institutional portal to login (e.g. Shibboleth)

Page 17: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 17

Landing Page: Metadata, content, download

Page 18: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 18

Downstream data use

Page 19: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Page 19

Summary: RDM @ TIB

Data Repository

Portal

DOI

DOI

DOI

Digital CV / CRIS

Page 20: Establishing a generic Research Data Repository: RADAR · Digital CV / CRIS. Contact: Angelina Kraft T +49 511 762-14238, angelina.kraft@tib.eu Thank you! Title: Hier steht ein Blindtext

Contact:

Angelina Kraft

T +49 511 762-14238, [email protected]

Thank you!