cal poly - data management: who knew it was a hot topic?
DESCRIPTION
October 17, 2013 @ Robert E. Kennedy Library, Data Studio, California Polytechnic State University. New mandates, announcements, memos, and requirements are emerging that encourage better data management, data sharing, and data preservation. In this presentation, data curation specialist Carly Strasser, PhD, offers a lay of the data management land by discussing recent events, resources, and new directions for data stewardship.TRANSCRIPT
Data Management: Who Knew Could be a Hot Topic?
From Flickr by Velo Steve
Carly Strasser, PhD California Digital Library
@carlystrasser
Cal Poly Oct 2013
From
Flickr by ~sho
rts an
d long
s
Publications & Their Citation
Data
Metadata
Recreated from Klump et al. 2006
blog.order2disorder.com
From Flickr by csessum
s From
Flickr by csessums
From Flickr by diylibrarian
www �
UGLY TRUTH
are not taught data management
are not effectively managing or organizing their data
don’t share data publicly or store it in an archive
aren’t convinced they should share data
don’t know about tools that can help them
btrgroup
.com
Many (most?) researchers…
From Flickr by iowa_spirit_walker
• Cost • Confusion about standards
• Lack of training • Fear of lost rights or benefits
• No incentives
There is a social contract of science: we have an obligation to ensure dissemination, validation, & advancement.
To not do so is science malpractice.
– Brian Hole, Ubiquity Press at UCL
From Flickr by mikerosebery
Data are being recognized as first class products of research
From Flickr by Richard Moross
Data management plans
Data sharing mandates
Data publications
Data citation
From Flickr by torkildr
Data publications Data citation
Data management plans Data sharing mandates
Data
Metadata
Recreated from Klump et al. 2006
www �
From Flickr by torkildr
From Flickr by diylibrarian
www �
Federal Funding Accountability and Transparency Act 2006
2010 2010 –present
First: DMPs
Journal publishers
NSF bio-‐sketches can include data
… “Federal agencies investing in research and development (more than $100 million in annual expenditures) must have clear and coordinated policies for increasing public access to research products.”
Eight months ago…
1. Maximize free public access 2. Ensure researchers create data
management plans
3. Allow costs for data preservation and access 4. Ensure evaluation of data management
plan merits 5. Ensure researchers comply with their data
management plans
6. Promote data deposition into public repositories
7. Develop approaches for identification and attribution of datasets
8. Educate folks about data stewardship
From Flickr by Joe Crimmings Photography
CHORUS
Universities + government = cross-‐institutional repositories of public-‐access publications
How: 1. Establish common, brief set of metadata
requirements 2. Expose metadata to search engines and other
discovery
Data Publication
Data Citation
Example: Sidlauskas, B. 2007. Data from: Testing for unequal rates of morphological diversification in the absence of a detailed phylogeny: a case study from characiform fishes. Dryad Digital Repository. doi:10.5061/dryad.20
Researcher Identification
From Flickr by Skakerman
A word about Metrics…
Articles are the butterfly pinned on the wall. Pretty but not very useful. They are only the advertisements for scholarship. – A. Levi, U. Maryland College of Information Studies
From Flickr by LisaW123
Altmetrics • Article-‐level metrics • altmetrics for alt-‐products
Impact Factors
+ Citation Counts
Metrics
NSF funded DataNet Project Office of Cyberinfrastructure
www.dataone.org
A
B
C
B
C A
Intercept researchers where they already
work
28
Kepler
Investigator Toolkit Support
Plan
Collect
Assure
Describe
Preserve
Discover
Integrate
Analyze
From
Flickr by tw
m1340
Culture Shift Ahead
notebook science source content access data government repository knowledge Fr
om Flickr by cd
sessum
s
Open Science
From Flickr by CapesTreasures.com
Making scientific
data,
research, &
dissemination
available to all
flowingdata.com
Map of Scientific Collaborations
From
Flickr by da
rkun
cle
Reproducibility
Why participate?
0
5
10
15
20
25
Trial 1 Trial 2 Trial 3
Good
Bad
Ugly
From
Flickr by da
rkun
cle
Reproducibility
Credibility
Why participate?
www.brutallyhonest.org
From Flickr by Robert Couse-‐Baker
“Help us identify grants that are wasteful or that you don’t think are a good use of taxpayer dollars.” Rep. Adrian Smith (R-‐Nebraska), a member of the House Committee on Science and Technology
From
Flickr by da
rkun
cle
Reproducibility
Credibility
Faster progress
Why participate?
Success Stories
From
Flickr by da
rkun
cle
Reproducibility
Credibility
Faster progress
Required
Why participate?
Journals Institutions Funders From Flickr by Eva Rinaldi Celebrity and Live Music Photographer
notebook science source content access data government repository knowledge Fr
om Flickr by cd
sessum
s
Digital ¤ Free ¤ Online
No price or permission barriers
Full content
Immediately available
Not just publications
notebook science source content access data government repository knowledge Fr
om Flickr by cd
sessum
s
Open
certain data should be freely available to everyone to use & republish as they wish, without restrictions from copyright, patents or other mechanisms of control
Data
From Flickr by Ninja M.
From Flickr by jackcheng
But…
“Scientists will be overwhelmed”
“It may be used for evil”
“The public will misunderstand it”
notebook science source content access data government repository knowledge Fr
om Flickr by cd
sessum
s
My website Email me Tweet me My slides
carlystrasser.net [email protected] @carlystrasser slideshare.net/carlystrasser