can data access combat fake news?

22
Can data access combat fake news? Mercè Crosas Chief Data Science and Technology Officer, Institute for Quantitative Social Science, Harvard University

Upload: merce-crosas

Post on 22-Jan-2018

191 views

Category:

Data & Analytics


3 download

TRANSCRIPT

Page 1: Can data access combat fake news?

Can data access combat fake news?

Mercè Crosas

Chief Data Science and Technology Officer, Institute for Quantitative Social Science, Harvard University

Page 2: Can data access combat fake news?

๏ Science must maintain credibility when moving from the lab to the public domain

๏ Journalistic credibility could be improved by adopting some standards and best practices used in science

Connecting Science with Journalism

Page 3: Can data access combat fake news?

In Science

Page 4: Can data access combat fake news?

What do we need to enable replication?

๏ Provide persistent access to data (and code)

๏ Data should:

๏ be well documented

๏ be accessible in reusable formats

๏ be open, or have a clear license and terms of use

๏ include provenance describing origin and transformations

๏ archived in trusted repositories

Page 5: Can data access combat fake news?

It’s not “Fake”, but can we reproduce it?

In Psychology: 39 out of 100 rated as successful replications

In Biomedicine: 6 out of 53 “landmark” cancer studies were

successfully replicated

Page 6: Can data access combat fake news?

“…data can be narrowed and expanded (p-hacked) to make either hypothesis appear correct. That’s because answering even a simple scientific question requires lots of choices that can shape the results.”

Science isn’t unreliable, but is more challenging than we give it credit for

Aschwanden, 2015, FiveThirtyEight

Page 7: Can data access combat fake news?

Science has a mechanism for self-correction

“Instances in which scientists detect and address flaws in work constitute evidence of success, not failure.”

“Ensuring that the integrity of science is protected is the responsibility of many stakeholders.”

Alberts et al., 2015, Science

Page 8: Can data access combat fake news?

Fraud happens, but is rare

“His lab’s controversial work raised the possibility that the human heart could repair itself, but many

other scientists have been unable to replicate the research.”

Page 9: Can data access combat fake news?

The scientific community is establishing best practices to address replication issues

Page 10: Can data access combat fake news?

Journal Data Policies are being applied across disciplines

Gen

etic

s

Biom

edic

al

Com

puta

tiona

l Sc

ienc

e

Econ

omic

s

Ecol

ogy

“weak”= recommend“strong” = require

Castro, Crosas, Garnett, Sheridan, Altman, 2017, Journal of Scholarly Publishing

Page 11: Can data access combat fake news?

Authors comply with strong data policies

0

10

20

30

40

50

60

70

80

90

100

Percentage of articles with available replication material

Data Sharing in Top Political Science Journals (Key 2016)*

Mandatory Replication Material

or Verification

Replication Material

Expected

No Requirement

Key, 2016, Political Science & Politics; via: Sebastian Karcher

Recently, 27 political science journals adopted the Journals Editors’ Transparency Statement

6 top journalsin Political Science

Page 12: Can data access combat fake news?

Journal guidelines encourage reporting of statistics and open research practices

Giofrè D, Cumming G, Fresc L, Boedker I, Tressoldi P (2017) The influence of journal submission guidelines on authors' reporting of statistics and use of open research practices. PLOS ONE 12(4): e0175583. https://doi.org/10.1371/journal.pone.0175583

NHST = null hypothesis significance testing; CI = confidence intervals; MA = meta-analysis; CI_interp = confidence intervals interpretation; ES_interp = effect size interpretation; Data_excl = exclusion criteria reported; Material = additional materials availability; Prereg = preregistered study.

In Psychology Science:• use of confidence

intervals grows from 28% in 2013 to 70% in 2015

• availability to open data, from 3% to 39%

Page 13: Can data access combat fake news?

From JournalArticle

To Data

Example: Journals integrate with

Data Repositories

Page 14: Can data access combat fake news?

For AJPS: Replication review becomes part of the submission workflow

+ +

0 Resubmits

1 Resubmit

2 Resubmits

3 Resubmits

4 Resubmits

0 10 20 30 40

3

17

31

37

8

Total: 95 completed verifications

via Thu-Mai Lewis Christian, Data Archives, ODUM Institute

Page 15: Can data access combat fake news?

From News Article to Data

1800 bicycle collisions from 2009 to 2012

Page 16: Can data access combat fake news?

In Journalism and Bloggers World

Page 17: Can data access combat fake news?

Connecting Science with Journalism

๏ Science must maintain credibility when moving from the lab to the public domain

๏ Journalistic credibility could be improved by adopting some standards and best practices used in science

๏ Apply data policies that require access to data (and code)

๏ Include citation to data in trusted repositories

๏ Provide guidelines for documenting statistics and data

๏ Add data fluency to journalism school experience

Page 18: Can data access combat fake news?

Education is Key: New data programs in journalism

Data Concentration Program in Columbia School of Journalism offered since 2014

Barret, Phillips, 2016, Teaching Data and Computational Journalism Report

From 113 programs in Journalism Schools

Page 19: Can data access combat fake news?

“Journalism schools that embrace both teaching and research into data journalism methods will be poised to fundamentally improve the way future journalists will inquire into maters of public interest and communicate with their audiences.”

Page 20: Can data access combat fake news?

“Science is the subset of knowledge on which we can aspire to universal

agreement*”

*Richard Muller, 2016, Now: The Physics of Time

But, can we also aspire to universal agreement for information that adheres

to scientific rigor?

Page 21: Can data access combat fake news?

Report written by: David Lazer, Matthew Baum, Nir Grinberg, Lisa Friedland, Kenneth Joseph, Will Hobbs, and Carolina Mattsson

This recent comprehensive, useful

report says:“As a research community, …

collaborate more closely with journalists in order to make the

truth “louder,””

This talk adds:make the truth “louder”

and more verifiable,with access to data

Page 22: Can data access combat fake news?

Thanks

Mercè Crosas, Institute for Quantitative Social Science, Harvard Universitymercecrosas.com@mercecrosas