Download - Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs
![Page 1: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/1.jpg)
COMING TO AN UNDERSTANDINGA Cross-institutional Examination of Assessments of Data Curation Needs
Jake Carlson - Purdue UniversityDianne Dietrich - Cornell UniversityGail Steinhart - Cornell UniversityAlison Valk - Georgia Institute of TechnologyStephanie Wright - University of Washington
![Page 2: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/2.jpg)
Dianne Dietrich
Planning & Data Management Plans
![Page 3: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/3.jpg)
Planning and Data Management Plans
May 2010
October 2010
December
2010
January 2011
NSF press release indicating intent to require data management plans with grant proposals.
NSF releases specifics for data management plan requirement.
Cornell survey distributed to PIs and Co-PIs of NSF grants.
NSF requirement goes into effect.
![Page 4: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/4.jpg)
Planning and Data Management Plans
How prepared are researchers to address data management plan requirements?
What is the potential impact of researcher plans on existing Cornell services?
![Page 5: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/5.jpg)
Planning and Data Management Plans
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
Percentage of respondents who answered "I'm not sure" for questions where that was an option
Each bar represents a question where respondents were asked to select "Yes", "No", or "I'm not sure"
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readinessto Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
![Page 6: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/6.jpg)
Planning and Data Management Plans
No data
Up to 1 GB
1 GB - 100 GB
100 GB - 1 TB
1 TB - 100 TB
More than 100 TB
0% 5% 10% 15% 20% 25% 30% 35% 40% 45% 50%
Responses to the question: "Given the NSF ex-pectation to share data ... how much data would
you intend to share?"
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readinessto Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
![Page 7: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/7.jpg)
Planning and Data Management Plans
Yes30%
I'm not sure61%
No: 9%I do not plan to create
metadata26%
I'm not sure if I plan to create metadata
32%
Have you produced or do you anticipateproducing metadata for this project?
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readinessto Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
If you plan on creating metadata, does it conform to known standards in your discipline?
![Page 8: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/8.jpg)
Planning and Data Management Plans
Own
infra
stru
ctur
e
Campu
s so
lutio
n
Comm
ercial
sol
utio
n0
10203040506070
Anticipated Backup Strategy by Size of Data
More than 100 TB1 TB - 100 TB100 GB - 1 TB1 GB - 100 GBUp to 1 GB
Backup Strategy
Nu
mb
er
of
resp
on
ses
Adapted from Steinhart, et al. (2012) Prepared to Plan? A Snapshot of Research Readinessto Address Data Management Planning Requirements. Journal of eScience Librarianship 1(2).
![Page 9: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/9.jpg)
Stephanie Wright
Management
![Page 10: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/10.jpg)
Management: UW
Background
Services Survey &
Interviews
![Page 11: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/11.jpg)
Management: Organization
Survey Guidance on data
organization (file structure, file naming, etc.) ranked 13th out of 14
Tracking updates to data (versioning) ranked 8th
Image Credit: radrice “data cat finds no data” http://blog.looxii.com/wp-content/uploads/2011/06/new-data-cat.jpg
![Page 12: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/12.jpg)
Management: Organization
Interviews Whatever makes
sense to organizer More planning,
better organization Especially true of
larger, well-funded projects
“But that really was sort of something we addressed after the fact, after we started to go, ‘Huh, I’m naming them this way, you’re naming them that way, and I have no idea what your naming conventions mean.’”
(Health)
![Page 13: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/13.jpg)
Management: Description
Survey 1/3 didn’t know of
metadata standard 16% were able to
identify metadata standard
Metadata service ranked 10th out of 14
Image & Quote Credit: NYU Health Sciences Libraries “Data Sharing and Management Snafu in 3 Short Acts” http://www.youtube.com/watch?v=N2zK3sAtr-4
“Everything you need to know about the data is in the article.”
![Page 14: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/14.jpg)
Management: Description
Interviews Documentation is
biggest challenge in data management Recognize role of
metatadata Time consuming, no
immediate benefit Data planning vs.
data forensics
“If I was gonna make (the data) available to other people, I would feel some responsibility in documenting it a little bit better.”
(Social Sciences)
![Page 15: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/15.jpg)
Management: Summary
Services needed: Training on best
practices or general strategies
Tools that integrate description and organization of data into the workflow
“I kind of feel like we’re just making our way through the wilderness. And if there were somebody who could kind of hold our hands and say, ‘Look, data management is important and here are some strategies for going about it…’ That would be great.”
(Health)
![Page 16: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/16.jpg)
Jake Carlson
Sharing
![Page 17: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/17.jpg)
Sharing: Purdue
Background on Purdue’s work:
Primarily Interview Driven
• Data Curation Profiles• Data Management
Plans• Data Information
Literacy
![Page 18: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/18.jpg)
Sharing
Willingness to Share Generally, faculty are open to
sharing their data with others.
There is an “underground economy” of data sharing.
Factors in deciding whether or not to share:What will this person do with my data?
How much time & effort will it take me?
Image Credit: andrew_mc_d “Share” http://www.flickr.com/photos/andrew_mc_d/452728652/
![Page 19: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/19.jpg)
Sharing
![Page 20: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/20.jpg)
Sharing
ControlIssues in sharing data publicly:
Timing over when to release data. Use - If anyone can get the data,
anyone can use it for whatever they want to
Misinterpretation - there’s no guarantee that someone won’t misconstrue the data
![Page 21: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/21.jpg)
Sharing
Attribution Generally expressed as need for
others to cite the data set (though not always)
“So for in my personal opinion, data citations won’t help me too much. Paper citations count for everything. It counts for impact of the paper, it counts for tenure, it counts for the profile of my work.”
- Professor of Biochemistry
![Page 22: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/22.jpg)
Sharing
Documentation and Description
"If you ask someone if you can see their raw data, you might as well be asking if you can look at their underwear. It's really problematic."
- Agronomy Professor
![Page 23: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/23.jpg)
Sharing
Services for Data Sharing at Purdue
Consultation & Collaboration with Data Producers
Support "local" sharing Workflows Documentation Description
Support "external" sharing Workflows Documentation Description
![Page 24: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/24.jpg)
Alison Valk
Preservation
![Page 25: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/25.jpg)
Background
“Develop campus partnerships to collect, manage, share, and preserve Georgia Tech digital research data.”
“Improve and develop new resources & services to assist researchers with data stewardship”
Preservation
![Page 26: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/26.jpg)
IRB-approved research to
determine gaps in data curation services provided to
researchers.
Data assessment surveySeries of campus wide interviewsNSF DMP content analysis
Preservation
![Page 27: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/27.jpg)
By combining information gathered via the survey and the
interviews, we developed a clearer picture of the research
data curation needs on campus. Out of 77 who completed survey-
o 44 agreed to be interviewed
o 26 interviews completed
Preservation
![Page 28: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/28.jpg)
Interview Team
Chris DotySusan ParhamElizabeth RolandoAlison Valk
10 Interview questions
“How important is it for you to archive / preserve your data?”
“How important is it for you or others to have access to your data over the long-term?”
Preservation
Transcribe interviews
Web application for Qualitative & Mixed Methods research Visualize major discussion points or code correlations
Code
![Page 29: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/29.jpg)
Correlation between cost of working with data –
to how strongly participants feel data should be preserved…
Preservation
![Page 30: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/30.jpg)
Storage prices no longer cost prohibitive
Preservation
![Page 31: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/31.jpg)
Lack of metadata or curation = unusable data
Data is often “lost” when project participants
such as grad students leave institution
Computing professor:
“I don’t want to
micromanage my research
assistants”
Preservation
![Page 32: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/32.jpg)
Some
researchers are using
Cloud based tools, such as DropBox etc.
for archiving –
Little concern for security
risks associated.
Preservation
![Page 33: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/33.jpg)
Next Steps:
Select Case studies- oResearchers have volunteered to allow us
to archive their research data.Increased Outreach- New Services
oCustomized DMPtool oDepartmental Data Management Workshops oMore robust web presenceo Proof-of-concept Library hosted Research Data Repository
Preservation
![Page 34: Coming to an Understanding: a Cross-institutional Examination of Assessments of Data Curation Needs](https://reader035.vdocument.in/reader035/viewer/2022081602/554cf98fb4c90513118b532c/html5/thumbnails/34.jpg)
Questions?
Jake Carlson @jrcarlso [email protected]
Dianne Dietrich @nemka [email protected]
Gail Steinhart @gailst [email protected]
Alison Valk @valkcano [email protected]
Stephanie Wright @shefw [email protected]