data quality, preservation and access: a dans perspective · the archive of the future . easy:...

Post on 10-Oct-2020

1 Views

Category:

Documents

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

www.dans.knaw.nl DANS is an institute of KNAW and NWO

Data quality, preservation and access: a DANS perspective

Ingrid Dillo, DANS

PERICLES reuse of science data Meeting 4, 21 June 2016

EASY: long-term Electronic Archiving System for self-deposit

NARCIS: Gateway to scholarly information In the Netherlands

Core services

DataverseNL for short/intrmediate term storage

Training & Consultancy

Persistent Identifier URN:NBN resolver

Additional services

Common metadata harvester

Cradle of the Data Seal of Approval

Background Archive

Data quality

Data is generally considered high quality if, they are fit for their intended uses in operations, decision making and planning.

Data quality in science

Technical aspects of data quality: • Sustainable file formats • Adequate metadata and documentation • Completeness of the data • Integrity, authenticity, provenance

Data fitness

Scientific quality of the data – assessment of the reusability of the data Criteria: • Peer review • Curation • Annotation • Citability (PID) • Certified TDR

How to make this transparant: tagging system?

DANS data reviews

Data preservation

Data preservation

Data preservation

Data selection Preferred formats Metadata

“Archief van de toekomst”

Werkgroepje met Maarten, Christophe, Herbert: - kerndiensten onderling beter laten aansluiten - overlap zoveel mogelijk wegsnijden - meer fasen van de onderzoeksproces (en de “data-

kringloop”) ondersteunen - systeem voor archiefmanagement loskoppelen van opslag - betere ondersteuning van linked data en software

The archive of the future

EASY: long-term Electronic Archiving System for self-deposit

NARCIS: Gateway to scholarly information In the Netherlands

Core services

DataverseNL for short/intrmediate term storage

Better alignment of core services

EASY 3 archival

management system

NARCIS (more content)

Dataverse (or another

short/medium term archive)

• Registration • Portal for

searching and finding

• Statistics & visualisation

Auto ingest

Metadata harvesting?

Let die away: • data upload • datasearch

Agnostic concerning: • identifier • metadata • licenses

Also for DANS a Dataverse for data upload

Auto ingest from elsewhere

EASY storage

dark storage

Third party storage

Suitalbe for: • linked data • software

Metadata harvesting

Quality of sustainability and content

top related