an obligatory introduction to data science
TRANSCRIPT
Agenda
● The dirty history of data science● Data scientist roles● Using data in your product and business● Tools and resources to get started
“...data scientists do three fundamentally different things: math, code (and engineer systems), and communicate.”
- Hilary Mason
“Many social/digital scientists are reluctant to invest in making data because it’s much more costly and risky than
analyzing data you already have available.” - Sean Taylor
1. Ask a question
2. Finding or building data
3. Using that data to understand the question
4. Presenting the data to stakeholders or users
Start smallLearn to code● Python● JavaScript
Learn to move data● SQL ● Mongo
Learn to question everything● No gut feelings● Data-based decision making
Find some data and play with it.● Government/municipality data● Social data● Open data
Online learning● Kaggle.com● Udacity.com● Lynda.com
Develop math skills● Regression● Error analysis● Data distributions● Linear Algebra
Then What?Mathematical Modeling● Logistics regression● One hot encoding● Decision trees● Correlation● Model assumptions
Data visualization● D3.js● Tell a story
Help a non-profit/municipality● Open up their data● Tell their story● Solve a problem
bit.ly/2bxnQgb