data quality, preservation and access: a dans perspective · the archive of the future . easy:...
Post on 10-Oct-2020
1 Views
Preview:
TRANSCRIPT
www.dans.knaw.nl DANS is an institute of KNAW and NWO
Data quality, preservation and access: a DANS perspective
Ingrid Dillo, DANS
PERICLES reuse of science data Meeting 4, 21 June 2016
EASY: long-term Electronic Archiving System for self-deposit
NARCIS: Gateway to scholarly information In the Netherlands
Core services
DataverseNL for short/intrmediate term storage
Training & Consultancy
Persistent Identifier URN:NBN resolver
Additional services
Common metadata harvester
Cradle of the Data Seal of Approval
Background Archive
Data quality
Data is generally considered high quality if, they are fit for their intended uses in operations, decision making and planning.
Data quality in science
Technical aspects of data quality: • Sustainable file formats • Adequate metadata and documentation • Completeness of the data • Integrity, authenticity, provenance
Data fitness
Scientific quality of the data – assessment of the reusability of the data Criteria: • Peer review • Curation • Annotation • Citability (PID) • Certified TDR
How to make this transparant: tagging system?
DANS data reviews
Data preservation
Data preservation
Data preservation
Data selection Preferred formats Metadata
“Archief van de toekomst”
Werkgroepje met Maarten, Christophe, Herbert: - kerndiensten onderling beter laten aansluiten - overlap zoveel mogelijk wegsnijden - meer fasen van de onderzoeksproces (en de “data-
kringloop”) ondersteunen - systeem voor archiefmanagement loskoppelen van opslag - betere ondersteuning van linked data en software
The archive of the future
EASY: long-term Electronic Archiving System for self-deposit
NARCIS: Gateway to scholarly information In the Netherlands
Core services
DataverseNL for short/intrmediate term storage
Better alignment of core services
EASY 3 archival
management system
NARCIS (more content)
Dataverse (or another
short/medium term archive)
• Registration • Portal for
searching and finding
• Statistics & visualisation
Auto ingest
Metadata harvesting?
Let die away: • data upload • datasearch
Agnostic concerning: • identifier • metadata • licenses
Also for DANS a Dataverse for data upload
Auto ingest from elsewhere
EASY storage
dark storage
Third party storage
Suitalbe for: • linked data • software
Metadata harvesting
Quality of sustainability and content
top related