data curation issues

Post on 27-Jun-2015

1.128 Views

Category:

Technology

2 Downloads

Preview:

Click to see full reader

DESCRIPTION

A very short, very minimal presentation I prepared for the Yale Libraries' SCOPA event to introduce librarians in diverse disciplines to the concepts and challenges of data curation.

TRANSCRIPT

DATA CURATION ISSUES

Michelle HudsonSCOPA Forum9.21.11

WHAT IS DATA?

WHAT IS DATA?

Definition varies by discipline and can include experimental, observational, and computational data.

WHAT IS DATA?

Definition varies by discipline and can include experimental, observational, and computational data.

In general “research data” refers to raw or processed products of a research project.

WHAT IS DATA?

Definition varies by discipline and can include experimental, observational, and computational data.

In general “research data” refers to raw or processed products of a research project.

These products can be video, images, or numeric files in the form of geographic information, spreadsheets, and other formats.

WHAT IS DATA CURATION?

WHAT IS DATA CURATION?

“Data curation is the active and ongoing management of research data through its lifecycle of interest and usefulness to scholarship, science, and education.” – Carole Palmer, UIUC GSLIS

WHAT IS DATA CURATION?

“Data curation is the active and ongoing management of research data through its lifecycle of interest and usefulness to scholarship, science, and education.” – Carole Palmer, UIUC GSLIS

“Curation” includes selection, appraisal, maintenance, preservation.

WHY IS DATA CURATION IMPORTANT FOR US?

WHY IS DATA CURATION IMPORTANT FOR US?

According to Paul F. Uhlir, Director of the Board on Research Data and Information, researchers are “contributing to a networked information enterprise where data are a fundamental infrastructural component of the modern research system.”

WHY IS DATA CURATION IMPORTANT FOR US?

According to Paul F. Uhlir, Director of the Board on Research Data and Information, researchers are “contributing to a networked information enterprise where data are a fundamental infrastructural component of the modern research system.”

Increasingly, data itself is a product and record of scholarship.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

Lack of interoperability.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

Lack of interoperability.

Controlled vocabularies are missing.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

Lack of interoperability.

Controlled vocabularies are missing.

Storage space is limited.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

Lack of interoperability.

Controlled vocabularies are missing.

Storage space is limited.

Domain of stewardship/responsibility is unclear.

SOME PROBLEMS THAT MAKE CURATION DIFFICULT.

No standards.

Lack of interoperability.

Controlled vocabularies are missing.

Storage space is limited.

Domain of stewardship/responsibility is unclear.

Individual repositories make silos of content.

IDEAS FOR SOLUTIONS!

IDEAS FOR SOLUTIONS!

Experiment tracking software and electronic notebooks.

IDEAS FOR SOLUTIONS!

Experiment tracking software and electronic notebooks.

Automatic metadata.

IDEAS FOR SOLUTIONS!

Experiment tracking software and electronic notebooks.

Automatic metadata.

Integrating curation early into the researcher workflow.

IDEAS FOR SOLUTIONS!

Experiment tracking software and electronic notebooks.

Automatic metadata.

Integrating curation early into the researcher workflow.

Educating graduate students on proper data management.

IDEAS FOR SOLUTIONS!

Experiment tracking software and electronic notebooks.

Automatic metadata.

Integrating curation early into the researcher workflow.

Educating graduate students on proper data management.

DataONE and the Data Conservancy.

OTHER STUFF!

Data citation

Data sharing

Reward models

Identity control (ORCID, EZID)

Semantic web and linked data

Cyberinfrastructure

QUESTIONS?

michelle.hudson@yale.edu203.432.4587@michellehudsonin person for coffee @ kbt cafe

top related