the democratization of data science education · 1. the data scientist’s toolbox 2. r programming...

18
The democratization of data science education Sean Kross UC San Diego Design Lab 2017-10-11

Upload: others

Post on 05-Jun-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

The democratization of data science education

Sean KrossUC San Diego Design Lab

2017-10-11

Page 2: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

A little about me

• New! Improved!• Advised by Philip Guo• Main interests:• Data Science• Online Education• Open Science

Page 3: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Jeff Leek Roger Peng Brian Caffo

The Johns Hopkins Data Science Lab

Page 4: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

https://peerj.com/collections/50-practicaldatascistats/

Page 5: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Rationale: “Let’s put in-person courses online to augment in-person teaching.”

We were on to something.

Page 6: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research
Page 7: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Nine Courses1. The Data Scientist’s Toolbox

2. R Programming

3. Getting and Cleaning Data

4. Exploratory Data Analysis

5. Reproducible Research

6. Statistical Inference

7. Regression Models

8. Practical Machine Learning

9. Developing Data Products

Page 8: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Enrollment and Completions of the Data Science Specialization

Page 9: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Key innovations

Page 10: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Give everything away for free

Page 11: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Capstones -> Portfolios -> Jobs

Page 12: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Run every course every month

Page 13: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Integrate content

https://ubc-mds.github.io/

Page 14: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research
Page 15: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research
Page 16: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research
Page 17: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Three areas I would like to concentrate on:

1. How can we develop new interactive learning systems for data science?

2. The great courses of the world are sitting in professor’s file cabinets. How can we make online course content creation easier?

3. How do folks do data analysis? Why do they make certain choices during an analysis? How do upstream decisions made during an analysis affect downstream results?

Page 18: The democratization of data science education · 1. The Data Scientist’s Toolbox 2. R Programming 3. Getting and Cleaning Data 4. Exploratory Data Analysis 5. Reproducible Research

Thank you!

Questions?

Link to these slides: seankross.com/dlab-talk-dss/

Let’s talk: [email protected]

Find me on Twitter: @seankross