data science from 3,209 feet john chandler university of montana and ars quanta
TRANSCRIPT
A Data Scientist Toolkit
• A scripting language (Python, C#, Java, Perl)• A statistical computing language (R, SAS, SPSS)• Database languages/environments (MSSQL, Oracle, Postgres, sqlite)• Distributed computing environment (MapReduce, in many flavors)
Fundamentally we are flipping bits, but this isn’t software development.
Tools for data preparation
• A scripting language (Python, C#, Java)• A statistical computing language (R, SAS, SPSS)• Database languages/environments (MSSQL, Oracle, Postgres, sqlite)• Distributed computing environment (MapReduce)