liber webinar: are the fair data principles really fair?

35
WEBINAR: Research Data Services WEBINAR: 23 Things for Research Data Management Join the conversation: #FAIRdata

Upload: liber-europe

Post on 19-Mar-2017

313 views

Category:

Education


1 download

TRANSCRIPT

Page 1: LIBER Webinar: Are the FAIR Data Principles really fair?

WEBINAR: Research Data Services

WEBINAR: 23 Things for Research Data Management

Join the conversation: #FAIRdata

Page 2: LIBER Webinar: Are the FAIR Data Principles really fair?

Speakers

Alastair [email protected]

Moderating

Birgit SchmidtGoettingen State and University [email protected]

Rob GrimErasmus University [email protected]

Page 3: LIBER Webinar: Are the FAIR Data Principles really fair?

• Alastair Dunning@alastairdunning

• Jasmin Böhmer@JasminBoehmer

Technical University of DelftHosts of 4TU.Centre for Research Data

Are the FAIR Data Principles Fair?

• Madeleine de Smaele@MadeleineSmaele

Jasmin Böhmer
I think it is important to emphasize that this presentation is just a fraction of the actual results and serves as teaser / intro. All further information is in the paper and in the spreadsheet.
Alastair Dunning
Yes will do
Page 4: LIBER Webinar: Are the FAIR Data Principles really fair?

Are the FAIR Data Principles Fair?Blog Post with all the information:

http://bit.ly/2lIgc9pLIBER Webinar, 10th March

Page 5: LIBER Webinar: Are the FAIR Data Principles really fair?

Motivation for this Project● H2020 / EU demands on open data and research data

management.

● Providing insight and support for repositories to improve their information architecture and digital infrastructure to comply to H2020 and FAIR demands.

● Own aspiration to offer the best possible service and support for 4TU.Centre for Research Data.

● Working towards practices to improve interoperability and reuse-value of data-sets in research data repositories.

Jasmin Böhmer
haha, I like the children's merry-go-round
Alastair Dunning
You think it works ? I was wondering if we could convert the Delft library cone into a merry go round !
Jasmin Böhmer
Oh, that is smart... but I don't think people will get that. Will you make a witty remark with a fun fair at the beginning?
Page 6: LIBER Webinar: Are the FAIR Data Principles really fair?

Using the FAIR principles and corresponding facets as scoring matrix

Applying a traffic-light rating system:

Use the information available on the web-interface of the repository online to evaluate the FAIR Principles

Methodology

Page 7: LIBER Webinar: Are the FAIR Data Principles really fair?
Jasmin Böhmer
mention re3data and focus for NL + selection of popluar platforms like zenodo and EUDAT
Page 8: LIBER Webinar: Are the FAIR Data Principles really fair?

Our Interpretation of the FAIR Principles

http://bit.ly/2lI2CCJ

Page 9: LIBER Webinar: Are the FAIR Data Principles really fair?

1. Compliance is not high

Page 10: LIBER Webinar: Are the FAIR Data Principles really fair?

https://data.4tu.nl/repository/uuid:5146dd06-98e4-426c-9ae5-dc8fa65c549f / General Overview Charts

N = 37

Findable Accessible

Interoperable Re-Usable

Alastair Dunning
Maximum one graph per slide - choose which statistic you wish to highlight
Page 11: LIBER Webinar: Are the FAIR Data Principles really fair?

F1 (meta)data are assigned a globally unique and eternally persistent identifier.

49% of the repositories do not assign DOI, HANDLE, or URN.

E.g. Subject Based Repositories use project ID’s or subject specific ID-systems. These links do not work in public spheres.

Page 12: LIBER Webinar: Are the FAIR Data Principles really fair?

A4 metadata are accessible, even when the data are no longer available.

97% of the repository do not clearly write about their metadata persistency, if the data is not available (anymore).

The transparency and integrity of the repository is improved by providing metadata-records for closed, restricted, or unavailable data-sets.

Page 13: LIBER Webinar: Are the FAIR Data Principles really fair?

I2 (meta)data use vocabularies that follow FAIR principles.

100% of the repositories do not have visible ontologies or (controlled) vocabulary.

Adding a semantic layer that enables links to unambiguous terms and definitions needs a lot of curation effort.

Is e.g. ORCID (Open Researcher and Contributor ID) a vocabulary?

Page 14: LIBER Webinar: Are the FAIR Data Principles really fair?

R1 meta(data) have a plurality of accurate and relevant attributes.

38% of the repositories do not provide sufficient information that helps to determine the value of reuse for the information seeker.

Specific information are mostly included in the documentation. Displaying those information in appropriate metadata fields would be beneficial.

Jasmin Böhmer
I managed to say all the things I am supposed to say in 5 minutes. I'll try to make it as entertaining as possible, but it's stats after all...
Page 15: LIBER Webinar: Are the FAIR Data Principles really fair?

2. Some principles are easily measured; some are much more subjective

Page 16: LIBER Webinar: Are the FAIR Data Principles really fair?

Pretty Obvious - (meta)data are assigned a globally unique and eternally persistent identifier.

Page 17: LIBER Webinar: Are the FAIR Data Principles really fair?

Vague - data are described with rich metadataWhat makes metadata rich ?

Page 18: LIBER Webinar: Are the FAIR Data Principles really fair?

Subjective - (meta)data meet domain-relevant community standards

Page 19: LIBER Webinar: Are the FAIR Data Principles really fair?

Philosophically dubious - (meta)data use vocabularies that follow FAIR principles

Page 20: LIBER Webinar: Are the FAIR Data Principles really fair?

3. Some principles are narrow; others are broad

Page 21: LIBER Webinar: Are the FAIR Data Principles really fair?

Narrow - (meta)data are retrievable by their identifier using a standardized communications protocol.

the protocol is open, free, and universally implementable.

the protocol allows for an authentication and authorization procedure, where necessary.

Page 22: LIBER Webinar: Are the FAIR Data Principles really fair?

Broad - (meta)data include qualified references to other

(meta)data.

(meta)data meet domain-relevant community standards (takes a long time to figure out)

Page 23: LIBER Webinar: Are the FAIR Data Principles really fair?

Technical vs Policy ● (meta)data are retrievable by their identifier

using a standardized communications protocol. ● the protocol is open, free, and universally

implementable. ● the protocol allows for an authentication and

authorization procedure, where necessary.● metadata are accessible, even when the data

are no longer available.

Page 24: LIBER Webinar: Are the FAIR Data Principles really fair?

4. Some subject areas fare badly

Page 25: LIBER Webinar: Are the FAIR Data Principles really fair?

Compliance of Social Science Data Repositories against FAIR Findable Principles (F1, F2, F3 and F4)

Page 26: LIBER Webinar: Are the FAIR Data Principles really fair?

Practice for Social Science Repositories Analysed● Data only available on request

● Licence not visible / clear

● Plenty of free text documentation on collection of data exists

● No structured metadata per dataset / no machine readable metadata

● But still seem to work well within the discipline

Page 27: LIBER Webinar: Are the FAIR Data Principles really fair?

LASA - Longitudinal Aging Study Amsterdam. Aging research and collecting data on aging in the Netherlands

No global identifierNo structured metadataBut plenty of documentation

Page 28: LIBER Webinar: Are the FAIR Data Principles really fair?

● Licence sometimes clear (no data protection issues)

● Some free text documentation on the overall collection of data exists

● No structured metadata per dataset / sometime the data is dynamically created following query

● No global identifiers per dataset

● Meeting existing disciplinary norms but not fully embedded as machine readable data

Practice for Climate Data Repositories Analysed

Page 29: LIBER Webinar: Are the FAIR Data Principles really fair?

SACA - Southeast Asian Climate Assessment

No structured metadataBut plenty of documentation No global identifier

Page 30: LIBER Webinar: Are the FAIR Data Principles really fair?

5. For repositories, doing some simple(ish) things vastly helps compliance

Page 31: LIBER Webinar: Are the FAIR Data Principles really fair?

● Create a permanent identifier for each dataset

● Always use an open license or clear License

● Make sure each dataset has rich metadata associated with it (Dublin Core good starting place!)

● Make data available via http

Page 32: LIBER Webinar: Are the FAIR Data Principles really fair?

Some Final Points (I)

● FAIR principles are deliberately vague - principles to be interpreted

● Nothing about back-up and preservation. Relationship to Data Seal of Approval?

● Much more work to be done on relationship between FAIR data and FAIR repository

Page 33: LIBER Webinar: Are the FAIR Data Principles really fair?

Some Final Points (II) ● To create FAIR dataset demands alliance

between repository and dataset creator

● Governance? How are principles updated

● FAIR principles derive not from libraries / archives but more from life sciences; but still require good knowledge of metadata / archiving practice

Page 34: LIBER Webinar: Are the FAIR Data Principles really fair?

Questions?

• Type your questions in the chat box.• Rob Grim (moderator) will select and pose

questions to the speakers• Unanswered questions will be addressed

by Alastair in a blog post (to be published following the webinar)

Page 35: LIBER Webinar: Are the FAIR Data Principles really fair?

WEBINAR: Research Data Services

Thank You!Final Notes:

1. Blog post with more information https://openworking.wordpress.com/2017/02/10/fair-principles-connecting-the-dots-for-the-idcc-2017

We’ll email a link to the recording shortly.