establishing a generic research data repository: radar · digital cv / cris. contact: angelina...
TRANSCRIPT
Angelina Kraft, orcid.org/0000-0002-6454-335X
Barcelona, 4 April 2017
ICSTI 2017 Annual Member’s Meeting & Workshops
TACC Workshop
Establishing a generic
Research Data Repository:
RADAR
Page 2
Agenda
1. RDM at the German National Library of Science and Technology (TIB)
DOI and DataCite
2. Why another (data) repository?
3. RADAR
Page 3
Requirements for Research Data
Preservation and Sharing
1. Trustworthy research data repositories
2. Data policies
3. Standards for data citation, metadata, licensing
4. Intellectual property rights and proprietary data
5. Methods and Tools adapted to the scientific workflows
6. Cost recovery strategies
7. Motivation for change
Page 4
DataCite in a nutshell
• Founded in December 2009
• International Membership organization
• 39 allocating members and 8 non-allocating members in 23 countries
• ~ 1,125 data centers
• DataCite DOIs have been assigned to 9.6 Mio DOIs of research datasets
making them public, citable, traceable.
• German non-profit association
• Business Office at German National Library of Science (TIB)
Page 5
DOI Service & Data Cite Business Office
DOIs registered via TIB (by March 2017)
• Total 1,165,411
62 % Research data
37 % Grey literature
1 % AV media
Registering data centers
• Total 139 data centers
Major research centers i.e. Pangaea, WDCC and ESO
65 universities/university libraries
RDM requirements at smaller/long-tail institutions?
Page 6
Challenges of ‘long-tail’ data:
• Heterogeneous
• Unique standards
• Costs: Set-up and maintenance oflong-term research data infrastructure
Approach:
Discipline-specific data repositories
or
Generic data repositories
Practical example: RADAR
German Research Foundation (DFG)
“The majority of datasets
produced through
research are part of the
‘Long Tail of Research
Data’”Source: Humphrey C (2014): OpenAIRE-COAR
Conference, Athens
Which data?
Page 7
Research Data Repository
Page 8
What is RADAR? An interdisciplinary repository for
- archival of research data as a generic service
- trustworthy preservation & traceable publication
Focus: Long Tail – Repository for specialized research
disciplines, addition to big data archives
Duration: September 2013 – August 2016,
project funded by German Research Foundation
Live System: Provided by
Support: Provided by
Project: https://www.radar-projekt.org
System: https://www.radar-service.eu
RADAR in a nutshell
Page 9
Why another data repository?
25 years guaranteed availability
Three copies geographically
distributed: SCC at Karlsruhe Institute of Technology,
City of Karlsruhe
SCC at Eggenstein-Leopoldshafen
Technical University Dresden
German legal framework
BagIt Files (In = Out + Metadata)
RADAR REST-API:
RADAR functionality accessed via HTTP and
JSON Requirements
Page 10
RADAR - the service
Page 11
Service & business model
Services:
Basic service: Archival Storage (5-15 years, expendable)
Extended service: Data Publication (25 years)
Features:
• Data Life Cycle support
• REST API for clients (customizable)
• Interoperability & cross-linking of
published datasets via API: DataCite, ORCID & others
• Optional Peer-Review Support
• Statistics on downstream data use
Prices:
Archival: 500 € annual fee + 0,39 € GB data volume per year (net price)
Publication: 6,37 € GB data volume with guaranteed availability for 25
years (net price)
Academic, publically funded institutions (in Germany)
Page 12
PENDING
Initial mode, editing possible (modification, update, deletion) –
up to 6 months
REVIEW
Dataset ‘frozen’ for duration of peer review process, ‘review-
URL’
ARCHIVED
Dataset is archived and identified via RADAR ID (no further
editing)
PUBLISHED
Dataset is published and identified via DOI (no further editing)
Dataset - status
Page 13
Scientists/Projects Institutions
– Libraries
– Research institutes
– Museums/Archives
Target groups - customers
Page 14
Administrator/(Sub)Curator
Administrator/Curator
Administrator Contract
Workspace
Dataset
Folder
File File
File
Dataset
File
Workspace
Roles & rights
Page 15
Register (ORCID iD)
Users can include their ORCID iD as part of their RADAR profile to ensure
they get credit for their work
Page 16
Login (Shibboleth)
You can use your institutional portal to login (e.g. Shibboleth)
Page 17
Landing Page: Metadata, content, download
Page 18
Downstream data use
Page 19
Summary: RDM @ TIB
Data Repository
Portal
DOI
DOI
DOI
Digital CV / CRIS