©2010 thomson reuters the new gold: unlocking hidden data christopher burghardt vp, market &...

36
©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY 2013

Upload: william-tucker

Post on 30-Dec-2015

216 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

THE NEW GOLD: UNLOCKING HIDDEN DATACHRISTOPHER BURGHARDTVP, MARKET & PRODUCT STRATEGYSCIENTIFIC & SCHOLARLY RESEARCH

FEBRUARY 2013

Page 2: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

“Data is the new gold... To be curated, shared, and cited”

Page 3: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Agenda

– The Data Landscape

– Challenges with Research Data

– An A&I Solution (Data Citation Index)

– Emerging Market Opportunities

– Resources for Follow-up

– Questions & Answers

Page 4: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Page 5: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Page 6: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Page 7: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Does your organization create or use research data (e.g. surveys, observations, sensor data)?

1. Yes

2. No

3. Don’t Know

Page 8: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Whenever and wherever there is research, there is research data

The digitization of data has created tremendous opportunities for research data of all varieties, creating a large and growing opportunity

The Ubiquity of Research Data

Page 9: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Data Sharing Rate is Increasing

PLOS ONE STUDY

Proportion of articles with shared data sets, by year

published

Page 10: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

The Increasing Visibility of Data• Data repositories &

registration agencies

• Journal publishers• Publisher websites

• Data journals

Page 11: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Why are Researchers Still Hiding Their Data?

Page 12: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Deposition of Data by Researchers

12

24%

36%

47%

51%

17%

Publisher website

Repository managed by a third party (e.g, domain-…

Department or institutional repository

Personal website

Other

Q16. Where do you place your non-traditional scholarly output to make it available to others? (n=471)

Source: Thomson Reuters Survey

Page 13: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Who should be responsible for the storing and archiving of research data (e.g. surveys, observations, sensor data)?

1. Researchers

2. Universities or Corporations

3. Funding Agencies

4. Government

5. No Opinion

Page 14: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

NIH (2003) Data Sharing Policy that all funding applications of $500,000 or more per year are expected to address data-sharing in their application.

NSF (2011) All funding proposals submitted on or after January 18, 2011, must include a “Data Management Plan” describing how the proposal will conform to NSF policy on the dissemination and sharing of research results.

The Emergence of Funding Mandates

Page 15: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Data Management Requirements Extend Across the Globe

Aug 2011… “expectation that all our funded researchers should maximise access to their research data with as few restrictions as possible. …. submit a data management and sharing plan as part of the application process.”

2007… “Researchers are to retain research data and primary materials, manage storage of research data and primary materials, maintain confidentiality of research data and primary materials.”

Page 16: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Funding Mandates Becoming Stronger

January 14, 2013… “failure to provide the requisite Data Management Plan will result in the application being rejected or terminated.”

Page 17: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Data Elevated to “Article Status”?

January 14, 2013.. Biographical Sketch(es), has been revised to rename the “Publications” section to “Products”…. This change makes clear that products may include, but are not limited to, publications, data sets, software, patents, and copyrights.

Biosketches now include “Products”, not “Publications

Page 18: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

What is the primary publishing/distribution channel for your organization’s research-related output?

1. Journal Articles

2. Patents

3. Conference Proceedings

4. Data Sets or Data Studies

5. None of the above

Page 19: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Challenges with Research Data• Access & discovery

• Citation standards

• Lack of willingness to deposit and cite

• Lack of recognition / credit

Page 20: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Over 500 Data Repositories Established

Page 21: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Research DataDiverse and Disparate Sources

There are many quality repositories maintained for the purpose of providing access to research data.

Repositories are separately maintained, with varying schemes of organization and search capabilities.

Page 22: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Barriers to Researchers Citing Data

Researchers agree that data should be cited, but there are currently no universally accepted standards for citing data

22

“Lack of knowledge about standards for citation and of proper scholarly recognition and/or evaluation of such materials.”…

“…cumbersome citation formats including very long internet addresses.”

“Incomplete citation information available (dates and real author names as distinct from aliases)’”

Page 23: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Data Citation BehaviourCurrent citation style (in full text of article)

Desired/future citation style (as part of cited references)

U.S. Dept. of Justice, Bureau of Justice Statistics (1996): MURDER CASES IN 33 LARGE URBAN COUNTIES IN THE UNITED STATES, 1988. Version 1. Inter-university Consortium for Political and Social Research. http://dx.doi.org/10.3886/ICPSR09907.v1

Lee, Seung-Jae; Lee, He-Jin; Cho, Ji-Hoon; Rho, Sangchul; Hwang, Daehee (2008): GSE11574: The responses of astrocytes stimulated by extracellular a-synuclein. Gene Expression Omnibus. http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE11574

Page 24: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Researchers Are Not Receiving Appropriate Credit

24Source: Thomson Reuters Survey

Page 25: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Where Do We Start?

• Enable the discovery of data repositories, data in the context of traditional literature

• Help researchers find data and track the full impact of their research output

• Establish attribution standards and incentives to make data discoverable

• Provide expanded measurement of research output and assessment

Page 26: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Thomson Reuters Solution

Page 27: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Relevant Content - ensuring that material is desirable to the research community.

Persistence and stability of the repository, with a steady flow of new information.

Thoroughness and detail of descriptive information.

Links from data to research literature.

Repository Selection Considerations

Page 28: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

TR takes descriptive

metadata feed from repository

Repository raw metadata is analyzed by

TR

TR adds metadata

Thomson Reuters Indexing of Research Data Repositories

Page 29: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Data Citation Record Model

Repository

Data Set

Repository: Comprised of data studies, data sets

Data Study: Descriptions of studies or experiments with associated data

Data Set: A single or coherent set of data or a data file provided by the repository

Page 30: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Research Data Repository Coverage

48%

22%

21%

2%

7%

Life Sciences

Physical Science

Social Sciences

Multidisciplinary

Arts & Humanities

Discipline Breakdown of Repositories

Page 31: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Challenges• Metadata availability

– Lack of resources

– Lack of expertise

• Metadata quality– Metadata inconsistencies

• Data repositories are not static

• Partnerships

Page 32: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Expected Outcomes: Data Citation Index• Discovery of data most important to scholarly

research

• Data linked to published research literature

• Measures of data use and reuse

• New metrics for digital scholarship

Page 33: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Emerging Market Opportunities

• Data citation standards and metrics

• Workflow solutions for data management

• Data storage solutions

• Consulting services

Page 34: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Resources• Data Citation Index

http://wokinfo.com/products_tools/multidisciplinary/dci

• DataCite http://datacite.org/

• CODATA http://www.codata.org/

• Board on Research Data and Information http://sites.nationalacademies.org/PGA/brdi/index.htm

• Australian National Data Service http://www.ands.org.au

• Databib http://databib.org/index.php

• DataOne http://www.dataone.org/

• DataVerse Network http://thedata.org/

Page 35: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

“Data is the new gold... To be curated, shared, and cited”

Page 36: ©2010 Thomson Reuters THE NEW GOLD: UNLOCKING HIDDEN DATA CHRISTOPHER BURGHARDT VP, MARKET & PRODUCT STRATEGY SCIENTIFIC & SCHOLARLY RESEARCH FEBRUARY

©20

10 T

hom

son

Reu

ters

Thank you

Christopher Burghardt

[email protected]