Data: A Cautionary Tale by Daniel Katz

Upload: jooyoun-paek

Post on 27-Jan-2015

TRANSCRIPT

Page 1: Data: A Cautionary Tale by Daniel Katz

A Cautionary Tale

Page 5: Data: A Cautionary Tale by Daniel Katz

The Big Picture
Collect
Clean
Model
Store
Present

Page 6: Data: A Cautionary Tale by Daniel Katz

{
  "classes": [
    {
      "name": "Fundamental Process of Design",
      "professor": "Joo Youn Paek",
      "year": "2010",
      "semester": "fall",
      "students": [
        {
          "student": {
            "name": "Joe Student",
            "email": "[email protected]",
            "twitter_name": "@itp4life",
            "blog_url": "http://itp4life.blogspot.co"
          }
        }
      ]
    }
  ]
}

Page 7: Data: A Cautionary Tale by Daniel Katz

<classes>
  <class>
    <name>Fundamental Process of Design</name>
    <professor>Joo Youn Paek</professor>
    <year>2010</year>
    <semester>Fall</semester>
    <students>
      <student>
        <name>Joe Student</name>
        <email>[email protected]</email>
        <twitter_name>@itp4life</twitter_name>
        <blog_url>http://itp4life.blogspot.com</blog_url>
      </student>
    </students>
  </class>
</classes>
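
The JSON and XML slides describe the same class record in two formats. As a rough illustration, the Python sketch below reads both forms with the standard library and checks that they yield the same rows. The sample strings are trimmed copies of the slide data (the email and blog fields are left out), and the function names are hypothetical, not something from the talk.

```python
import json
import xml.etree.ElementTree as ET

# Trimmed copies of the slide data; email and blog_url are omitted on purpose.
JSON_TEXT = """
{"classes": [{"name": "Fundamental Process of Design",
              "professor": "Joo Youn Paek",
              "students": [{"student": {"name": "Joe Student",
                                        "twitter_name": "@itp4life"}}]}]}
"""

XML_TEXT = """
<classes><class>
  <name>Fundamental Process of Design</name>
  <professor>Joo Youn Paek</professor>
  <students>
    <student><name>Joe Student</name><twitter_name>@itp4life</twitter_name></student>
  </students>
</class></classes>
"""


def students_from_json(text):
    """Return (class name, student name, twitter name) tuples from the JSON form."""
    rows = []
    for cls in json.loads(text)["classes"]:
        for entry in cls["students"]:
            student = entry["student"]
            rows.append((cls["name"], student["name"], student["twitter_name"]))
    return rows


def students_from_xml(text):
    """Return the same tuples from the XML form."""
    rows = []
    root = ET.fromstring(text.strip())
    for cls in root.findall("class"):
        for student in cls.findall("students/student"):
            rows.append((cls.findtext("name"),
                         student.findtext("name"),
                         student.findtext("twitter_name")))
    return rows


json_rows = students_from_json(JSON_TEXT)
xml_rows = students_from_xml(XML_TEXT)
print(json_rows == xml_rows)
```

Both parsers end up with the same list of tuples, which is the point of the two slides: the format differs, the data does not.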

Page 10: Data: A Cautionary Tale by Daniel Katz

The Open Data Movement is in Full Swing
Governments
Institutions
Scientists
Enthusiasts

http://vimeo.com/2598878

Page 11: Data: A Cautionary Tale by Daniel Katz

Commercial tools and open source are starting to converge

Page 12: Data: A Cautionary Tale by Daniel Katz

There will always be assumptions

Page 13: Data: A Cautionary Tale by Daniel Katz

Bring it down

Page 14: Data: A Cautionary Tale by Daniel Katz

Freebase (entity graph)
Infochimps
Twitter
Facebook

Page 15: Data: A Cautionary Tale by Daniel Katz

Data.gov
MTA

Page 16: Data: A Cautionary Tale by Daniel Katz

Arduino
Smartphone
Other sensors

Page 18: Data: A Cautionary Tale by Daniel Katz

Don’t be intimidated by data from disparate sources

Page 21: Data: A Cautionary Tale by Daniel Katz

Clean up messy data
Inconsistent data points
Identify patterns
Combine data from disparate sources
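
A minimal Python sketch of what cleaning inconsistent data points and combining sources can look like in practice. The records are illustrative: one source has stray whitespace in the year and a lowercase semester, the other a capitalized semester. The `clean_record` and `combine` helpers are hypothetical names chosen for this example.

```python
def clean_record(record):
    """Normalize the fields that tend to drift between sources."""
    return {
        "name": record["name"].strip(),
        "year": int(str(record["year"]).strip()),
        "semester": record["semester"].strip().lower(),
    }


def combine(*sources):
    """Merge cleaned records from several sources, dropping duplicates."""
    seen = set()
    merged = []
    for source in sources:
        for record in source:
            cleaned = clean_record(record)
            key = (cleaned["name"], cleaned["year"], cleaned["semester"])
            if key not in seen:
                seen.add(key)
                merged.append(cleaned)
    return merged


# Illustrative records: same class, written inconsistently in two sources.
from_json = [{"name": "Fundamental Process of Design", "year": " 2010 ", "semester": "fall"}]
from_xml = [{"name": "Fundamental Process of Design", "year": "2010", "semester": "Fall"}]

merged = combine(from_json, from_xml)
print(merged)
```

Once the values are normalized, the two records compare equal and collapse into one, which is what makes combining disparate sources tractable.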

Page 22: Data: A Cautionary Tale by Daniel Katz

Collection of Twitter Responses from API

value.parseJson().user.screen_name
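
The expression above reads like one written for Google Refine, which is linked at the end of the deck: parse each cell as JSON, then walk to `user.screen_name`. For readers working outside that tool, here is a hedged Python equivalent. The response strings are made up for illustration, and the `screen_name` helper is a hypothetical name; it returns `None` for rows that are not valid JSON or lack a user.

```python
import json

# Made-up response bodies standing in for rows of collected API output.
raw_responses = [
    '{"text": "hello", "user": {"screen_name": "itp4life"}}',
    '{"text": "no user field"}',
    'not valid json',
]


def screen_name(raw):
    """Return user.screen_name from one JSON response, or None if absent."""
    try:
        parsed = json.loads(raw)
    except json.JSONDecodeError:
        return None
    if not isinstance(parsed, dict):
        return None
    user = parsed.get("user")
    if not isinstance(user, dict):
        return None
    return user.get("screen_name")


names = [screen_name(raw) for raw in raw_responses]
print(names)
```

Returning `None` instead of raising keeps one malformed row from stopping the whole pass, which matters when the collection is messy.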

Page 24: Data: A Cautionary Tale by Daniel Katz

Depending on the type of data you are collecting, there are appropriate places to store it

Page 25: Data: A Cautionary Tale by Daniel Katz

Non-programmers:
Google Fusion Tables

For programmers:
Geo database and programming tools
PostGIS (PostgreSQL)
GeoTools (Java)

Page 26: Data: A Cautionary Tale by Daniel Katz

Non-programmers:
Google Docs (read into Processing)
Microsoft Excel (internal charting tool)
Text-based formatting (visualize with Google Chart API)

For programmers:
Any relational database
MySQL
PostgreSQL
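
To make the relational option concrete, here is a small sketch that stores the class and student record from the earlier slides in two related tables. It uses SQLite through Python's built-in `sqlite3` module purely so the example runs without a server; the slide names MySQL and PostgreSQL, and the table and column names here are assumptions made for illustration.

```python
import sqlite3

# In-memory SQLite database; a stand-in for MySQL or PostgreSQL.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE classes (
        id INTEGER PRIMARY KEY,
        name TEXT NOT NULL,
        professor TEXT,
        year INTEGER,
        semester TEXT
    );
    CREATE TABLE students (
        id INTEGER PRIMARY KEY,
        class_id INTEGER NOT NULL REFERENCES classes(id),
        name TEXT NOT NULL,
        twitter_name TEXT
    );
""")

# Insert one class, then one student that points back to it.
cur = conn.execute(
    "INSERT INTO classes (name, professor, year, semester) VALUES (?, ?, ?, ?)",
    ("Fundamental Process of Design", "Joo Youn Paek", 2010, "fall"),
)
class_id = cur.lastrowid
conn.execute(
    "INSERT INTO students (class_id, name, twitter_name) VALUES (?, ?, ?)",
    (class_id, "Joe Student", "@itp4life"),
)
conn.commit()

# Join the two tables to rebuild the nested view from the JSON and XML slides.
rows = conn.execute(
    """
    SELECT c.name, s.name, s.twitter_name
    FROM students AS s
    JOIN classes AS c ON c.id = s.class_id
    """
).fetchall()
print(rows)
```

The nesting in the JSON and XML versions becomes a foreign key here: each student row carries the id of its class, and a join reassembles the pair.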

Page 27: Data: A Cautionary Tale by Daniel Katz

Graph Database

Page 29: Data: A Cautionary Tale by Daniel Katz

http://www.infochimps.com/datasets/tweets-during-state-of-the-union-address

http://code.google.com/p/google-refine/

http://dev.twitter.com/doc/get/geo/search

http://flowingdata.com/2009/07/14/how-does-the-average-consumer-spend-his-money/

http://www.bls.gov/cex/

http://www.google.com/fusiontables/Home