research data sharing: a basic framework
TRANSCRIPT
![Page 1: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/1.jpg)
RESEARCH DATA SHARING:A BASIC FRAMEWORK
Paul Groth @pgroth
pgroth.com
Elsevier Labs @elsevierlabs
LERU Summer School 2016
Data Stewardship for Scientific Discovery and Innovation
![Page 2: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/2.jpg)
WHAT IS DATA?
![Page 3: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/3.jpg)
WHAT IS DATA?
“Data refers to entities used as evidence of phenomena for the purposes of research or scholarship”
[Borgman Big Data, Little Data, No Data 2015 p.29]
![Page 4: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/4.jpg)
WHY COLLECT DATA?
![Page 5: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/5.jpg)
WHY COLLECT DATA?
Borgman, C. L. (2012). The conundrum of sharing research data. Journal of the American Society for Information Science and Technology.
![Page 6: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/6.jpg)
HOW IS DATA OBTAINED
![Page 7: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/7.jpg)
HOW IS DATA OBTAINED
Borgman, C. L. (2012). The conundrum of sharing research data. Journal of the American Society for Information Science and Technology.
![Page 8: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/8.jpg)
WHY SHARE DATA?
![Page 9: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/9.jpg)
WHY SHARE DATA?• R1: reproduce or verify research,
• R2: make results of publicly funded research available to the public
• R3: enable others to ask new questions of extant data
• R4: advance the state of research and innovation.
Borgman, C. L. (2012). The conundrum of sharing research data. Journal of the American Society for Information Science and Technology.
![Page 10: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/10.jpg)
• All empirical papers must archive their data upon acceptance in order to be published unless the authors provide a compelling reason why they cannot (e.g., expense, confidentiality). The action editor will be the final arbiter of whether the reason is sufficiently compelling.
• “Data” refers to an electronic file containing nonidentified responses that are potentially already coded. Normally, the data would
represent an early stage of electronic processing, before individual responses have been aggregated. The data must be in a form that allows all reported statistical analyses to be reproduced while retaining the confidentiality of individual participants. This entails that the data are formatted and documented in a way that makes the structure of the data set readily apparent.
• Archiving consists either of submitting the data to the journal (to be displayed as supplementary material at the end of the article), sending it to some other archive that is accessible to established researchers and maintained by a substantial established institution, or authors making the data available on their own website, assuming that they can assure us the site will be maintained by a recognized institution for a reasonable period of time. Again, action editors will be the final arbiters of the appropriateness of an archive.
• Any publication that reports analyses of or refers to archived data will be expected to cite the original publication in which the data were reported.
• This policy is new and therefore open to modification. Our aim is to implement a policy that maximizes transparency while minimizing the burden on authors.
![Page 11: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/11.jpg)
![Page 12: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/12.jpg)
THE IMPORTANCE OF CITING DATA
Data Citation Synthesis Group: Joint Declaration of Data Citation Principles. Martone M. (ed.) San Diego CA: FORCE11; 2014 [https://www.force11.org/group/joint-declaration-data-citation-principles-final].
1. Importance2. Credit and Attribution3. Evidence4. Unique Identification5. Access6. Persistence7. Specificity and Verifiability8. Interoperability and Flexibility
![Page 13: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/13.jpg)
![Page 14: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/14.jpg)
10 ASPECTS OF HIGHLY EFFECTIVE RESEARCH DATA
https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data
![Page 15: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/15.jpg)
https://storify.com/chenghlee/dataformathellhttp://isps.yale.edu/sites/default/files/files/IDCC14_DQR_PeerGreenStephenson.pdf
ALL DATA ISN’T SUCCESSFUL
![Page 16: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/16.jpg)
BARRIERS TO REACHING SUCCESSFUL DATA?
![Page 17: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/17.jpg)
Common practice: data is very fragmented
Using antibodiesand squishy bits Grad Students experimentand enter details into theirlab notebook. The PI then tries to make sense of their slides,and writes a paper. End of story.
17
![Page 18: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/18.jpg)
ALL DATA ISN’T CURATED
![Page 19: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/19.jpg)
Cost of documentation
http://www.indoition.com/en/services/costs-prices-software-documentation.htm
![Page 20: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/20.jpg)
20Yolanda GilUSC Information Sciences Institute [email protected]
Measuring Time Savings with “Reproducibility Maps” [Garijo et al PLOS CB12]
2 months of effort in reproducing published method (in PLoS’10)
Authors expertise was requiredComparison of ligand binding sites
Comparison of dissimilar protein structures
Graph network generation
Molecular Docking
Work with D. Garijo of UPM and P. Bourne of UCSD
![Page 21: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/21.jpg)
CURRENT STRATEGIES FOR DATA SHARING
![Page 22: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/22.jpg)
SUBJECT SPECIFIC REPOSITORIES
![Page 23: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/23.jpg)
SUBJECT SPECIFIC REPOSITORIES
![Page 24: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/24.jpg)
COMMUNITY SPECIFIC REPOSITORIES
![Page 25: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/25.jpg)
GENERIC REPOSITORIES
http://data.mendeley.com/
Each dataset receives a versioned DOI, so it can be cited
The citation for the associated article is
displayed
![Page 26: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/26.jpg)
DATA PUBLICATION
![Page 27: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/27.jpg)
BENEFITS OF MACHINE READBILITY
![Page 28: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/28.jpg)
HOW DO WE MOVE UP THE PYRAMID
https://www.elsevier.com/connect/10-aspects-of-highly-effective-research-data
![Page 29: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/29.jpg)
60 % OF TIME IS SPENT ON DATA PREPARATION
![Page 30: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/30.jpg)
CURATED DATA SETS
![Page 31: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/31.jpg)
http://ivory.idyll.org/blog/replication-i.html
![Page 32: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/32.jpg)
MORE SEMANTICS
![Page 33: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/33.jpg)
A FRAMEWORK FOR HELPING RESEARCHERS SHARE DATA • What data?
• Determine the context
• Why is data being collected?• How is data obtained?
• What is the researchers’ reason for sharing?
• Document
• Understand Cost/benefit tradeoffs• Target audience• Automation
![Page 34: Research Data Sharing: A Basic Framework](https://reader036.vdocument.in/reader036/viewer/2022062901/58f20b831a28ab7a408b4567/html5/thumbnails/34.jpg)
FURTHER READING• Syllabus for Data Management and Practice, Part I, Winter 2016. Data
Management and Practice, Part I (2016)Christine L Borgmam. https://works.bepress.com/borgman/381/
• Christine L. Borgman. “Big Data, Little Data, No Data”
• Reference list ://www.zotero.org/groups/borgman_big_data_little_data_no_data
• Borgman, C. L. (2012). The conundrum of sharing research data. Journal of the American Society for Information Science and Technology.
• Goodman A, Pepe A, Blocker AW, Borgman CL, Cranmer K, et al. (2014) Ten Simple Rules for the Care and Feeding of Scientific Data. PLoS Comput Biol 10(4): e1003542. doi: 10.1371/journal.pcbi.1003542