the promise and peril of data mining

26
The Promise and Peril of Data Mining Including a Trip to Iceland Steve Smela, CSCI 5707 October 8, 2013

Upload: mercury

Post on 24-Feb-2016

34 views

Category:

Documents


0 download

DESCRIPTION

The Promise and Peril of Data Mining. Including a Trip to Iceland Steve Smela, CSCI 5707 October 8, 2013. Connections to CSCI 5707. Text Chapter 28 (Weeks 13 & 14)—Data Mining. Iceland’s Flag. Iceland’s Coat of Arms. Iceland’s King. Map of Iceland. What Does Iceland Have?. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: The Promise and Peril of Data Mining

The Promise and Peril of Data Mining

Including a Trip to IcelandSteve Smela, CSCI 5707

October 8, 2013

Page 2: The Promise and Peril of Data Mining

Connections to CSCI 5707

• Text Chapter 28 (Weeks 13 & 14)—Data Mining

Page 3: The Promise and Peril of Data Mining

Iceland’s Flag

Page 4: The Promise and Peril of Data Mining

Iceland’s Coat of Arms

Page 5: The Promise and Peril of Data Mining

Iceland’s King

Page 6: The Promise and Peril of Data Mining

Map of Iceland

Page 7: The Promise and Peril of Data Mining

What Does Iceland Have?

Icelandic Sheep

Glaciers

Volcanoes and Stuff

Page 8: The Promise and Peril of Data Mining

But Most Importantly….

Page 9: The Promise and Peril of Data Mining

Icelanders!(324,000 of them)

Ingolfur Arnarson, 1st Permanent Norse Settler in Iceland

One of Ingolfur’s Descendants

Page 10: The Promise and Peril of Data Mining
Page 11: The Promise and Peril of Data Mining
Page 12: The Promise and Peril of Data Mining
Page 13: The Promise and Peril of Data Mining
Page 14: The Promise and Peril of Data Mining
Page 15: The Promise and Peril of Data Mining

Used “Cognitive Performance Scale” toassess performance of those withmutation vs. those without mutation

Only possible because ratingsare done frequently in nursinghomes in Iceland

Page 16: The Promise and Peril of Data Mining
Page 17: The Promise and Peril of Data Mining

T Jonsson et al. Nature 000, 1-4 (2012) doi:10.1038/nature11283

Cognition measured by CPS as a function of age.

Page 18: The Promise and Peril of Data Mining

The Promise of Data Mining

• Possible insights into Alzheimer’s Disease, leading to new treatments

Page 19: The Promise and Peril of Data Mining

The Perils of Data Mining-I

• Correlation does not equal causality• Just because A673T is associated with better

scores on the Cognitive Performance Scale, it doesn’t mean that the mutation protects against Alzheimer’s

• Independent confirmation needed

Page 20: The Promise and Peril of Data Mining

Ways to Verify Insights Gained from Data Mining

• In “Big Data” context, it’s common practice to split data into at least 2 (sometimes more) sets– Training set– Validation set

• In Icelandic study, what did they do?

Page 21: The Promise and Peril of Data Mining

Independent Testing of Hypothesis

Page 22: The Promise and Peril of Data Mining

Perils of Data Mining-II

Page 23: The Promise and Peril of Data Mining

What Does this Mean?

Page 24: The Promise and Peril of Data Mining

Take-Home Messages

• Data mining holds great potential, especially in ideal conditions like Iceland

• Associations found through data mining must be independently verified

• Data mining must take into account ethical issues such as the informed consent of the research subjects

Page 25: The Promise and Peril of Data Mining

Sources• http://news.sciencemag.org/people-events/2012/12/purchase-amgen-wont-affect-decode-genetics-research-founder-says• http://en.wikipedia.org/wiki/Iceland• Jocelyn Kaiser. 2013. “Agency Nixes deCODE's New Data-Mining Plan.” Science 340, 1388-1389 (21 June 2013).• http://www.decode.com/ and http://www.decode.com/news-events/• http://www.interrai.org/assets/files/Scales/Cognitive%20Performance%20Scale.pdf• Thorlakur Jonsson, Jasvinder K. Atwal, Stacy Steinberg, Jon Snaedal, Palmi V. Jonsson, et al. 2012. “A Mutation in APP Protects against Alzheimer’s

Disease and Age-Related Cognitive Decline.” Nature 488, 96–99 (02 August 2012).• Jeff Gulcher, Agnar Helgason, and Kári Stefánsson. “Genetic Homogeneity of Icelanders.” Nature Genetics 26, 395 (2000).

Page 26: The Promise and Peril of Data Mining

Questions?