Data Citation in The Dataverse Network ® Micah Altman, Institute for Quantitative Social Science, Harvard University Prepared for the Board on Research Data and Information “Developing Data Attribution and Citation Practices and Standards An International Symposium and Workshop” August 22-23, 2011


Data Citation in The Dataverse Network ®

Micah Altman, Institute for Quantitative Social Science, Harvard University

Prepared for the Board on Research Data and Information, “Developing Data Attribution and Citation Practices and Standards: An International Symposium and Workshop”, August 22-23, 2011


Collaborators*


Leonid Andreev, Ed Bachman, Adam Buchbinder, Ken Bollen, Bryan Beecher, Steve Burling, Kevin Condon, Jonathan Crabtree, Merce Crosas, Gary King, Patrick King, Tom Lipkis, Freeman Lo, Jared Lyle, Marc Maynard, Nancy McGovern, Lois Timms-Ferrara, Akio Sone, Bob Treacy

Research Support

Thanks to the Library of Congress (PA#NDP03-1), the National Science Foundation (DMS-0835500, SES 0112072), IMLS (LG-05-09-0041-09), the Harvard University Library, the Institute for Quantitative Social Science, the Harvard-MIT Data Center, and the Murray Research Archive.

* And co-conspirators


Related Work


M. Crosas, 2011, “The Dataverse Network: An Open-Source Application for Sharing, Discovering and Preserving Data”, D-Lib Magazine 17(1/2).

M. Altman, 2008, “A Fingerprint Method for Verification of Scientific Data”, in Advances in Systems, Computing Sciences and Software Engineering (Proceedings of the International Conference on Systems, Computing Sciences and Software Engineering 2007), Springer Verlag.

M. Altman and G. King, 2007, “A Proposed Standard for the Scholarly Citation of Quantitative Data”, D-Lib Magazine 13(3/4).

G. King, 2007, “An Introduction to the Dataverse Network as an Infrastructure for Data Sharing”, Sociological Methods and Research 36(2), pp. 173-199.


Some Terminology


“dataverse” = a virtual archive

“Dataverse Network” = a server

“Study” = a work

An Open-Source Application for Publishing, Citing and Discovering Research Data


Examples


Josh Angrist’s Dataverse



Two-for-one

“Data” Citation = Study Citation

Sorta-Kinda-Meta


Required

Author: Joshua D. Angrist; Eric Bettinger; Erik Bloom; Elizabeth King; Michael Kremer

Date: 2008

Title: “Replication data for: Vouchers for Private Schooling in Colombia: Evidence from a Randomized Natural Experiment”

Persistent ID: http://hdl.handle.net/1902.1/11298

Recommended

UNF: UNF:3:4v7GYq3uSEeCpk8M567ITw==

DDI 2 Extensions

Murray Research Archive [Distributor]

V1 [Version]
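As a rough illustration of how the elements on this slide fit together, the sketch below joins them into a single citation string. The function name `format_citation`, its parameters, and the exact separators are assumptions made for this example; the slide lists the elements but does not prescribe punctuation, and this is not the Dataverse Network's own code.

```python
def format_citation(authors, year, title, persistent_id,
                    unf=None, distributor=None, version=None):
    """Join citation elements into one string (hypothetical layout).

    The first four arguments are the elements the slide marks as required;
    the UNF is recommended, and distributor/version are the DDI 2 extensions.
    """
    parts = ["; ".join(authors), str(year), f'"{title}"', persistent_id]
    citation = ", ".join(parts)
    if unf:
        citation += f" {unf}"
    if distributor:
        citation += f" {distributor} [Distributor]"
    if version:
        citation += f" {version} [Version]"
    return citation


example = format_citation(
    authors=["Joshua D. Angrist", "Eric Bettinger", "Erik Bloom",
             "Elizabeth King", "Michael Kremer"],
    year=2008,
    title=("Replication data for: Vouchers for Private Schooling in Colombia: "
           "Evidence from a Randomized Natural Experiment"),
    persistent_id="http://hdl.handle.net/1902.1/11298",
    unf="UNF:3:4v7GYq3uSEeCpk8M567ITw==",
    distributor="Murray Research Archive",
    version="V1",
)
print(example)
```

Keeping the optional elements as keyword arguments mirrors the required/recommended split: a citation with only the four required elements is still complete.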


What’s a UNF?


UNF = “Universal Numeric Fingerprint”

=~ Semantic Fingerprint
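To make the idea of a "semantic" fingerprint concrete, here is a toy sketch: it normalizes values to a fixed number of significant digits before hashing, so the result depends on what the data say rather than on how a file happens to store them. This is explicitly not the UNF algorithm. The function `toy_fingerprint`, the `TOY:` prefix, the choice of SHA-256, and the 7-digit default are all assumptions made for illustration.

```python
import base64
import hashlib


def toy_fingerprint(values, digits=7):
    """Hash a normalized text form of the values (illustration only, not UNF)."""
    normalized = []
    for v in values:
        if isinstance(v, (int, float)):
            # Fixed number of significant digits in exponential notation,
            # so 1, 1.0 and 1.00000000001 all normalize to the same text.
            normalized.append(format(float(v), f".{digits - 1}e"))
        else:
            normalized.append(str(v))
    canonical = "\n".join(normalized).encode("utf-8")
    digest = hashlib.sha256(canonical).digest()[:16]  # keep 128 bits
    return "TOY:" + base64.b64encode(digest).decode("ascii")


same_a = toy_fingerprint([1, 2.5, "MA"])
same_b = toy_fingerprint([1.0, 2.50000000001, "MA"])
different = toy_fingerprint([1, 2.6, "MA"])
print(same_a == same_b, same_a == different)
```

The two representations of the same values produce the same fingerprint, while a genuinely different value does not.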


Variations


Dataset-specific: same ID, part specified, UNF is for the part

state,year,data_access_who [VarGrp/@var(DDI)]; UNF:5:X4QdWp04aCZntvxZKSHLzQ==

Citation for subset of Variables/columns/measures

(NOT observations!)
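A subset qualifier like the one on this slide can be split into its pieces with a small parser. The sketch below is a minimal example that assumes the layout shown in the example (variable list, bracketed qualifier, semicolon, then `UNF:<version>:<digest>`); the name `parse_subset` and the regular expression are hypothetical and not part of any Dataverse API.

```python
import re

# Assumed layout: "<var>,<var>,... [<qualifier>]; UNF:<version>:<digest>"
SUBSET_RE = re.compile(
    r"^(?P<vars>[^\[\];]+?)\s*\[(?P<qualifier>[^\]]+)\];\s*"
    r"UNF:(?P<version>\d+):(?P<digest>\S+)$"
)


def parse_subset(text):
    """Split a variable-subset citation suffix into its components."""
    match = SUBSET_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized subset citation: {text!r}")
    return {
        "variables": [v.strip() for v in match.group("vars").split(",")],
        "qualifier": match.group("qualifier"),
        "unf_version": int(match.group("version")),
        "unf_digest": match.group("digest"),
    }


parsed = parse_subset(
    "state,year,data_access_who [VarGrp/@var(DDI)]; "
    "UNF:5:X4QdWp04aCZntvxZKSHLzQ=="
)
print(parsed["variables"], parsed["unf_version"])
```

The parsed variable list names columns only, in line with the slide's point that a subset citation covers variables, not observations.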

Proxy Handle


Use Cases


Contact Us


Micah Altman

maltman.hmdc.harvard.edu

The Dataverse Network ®

thedata.org