Doing data science with F# (BuildStuff)
DESCRIPTION
The ability to take data, understand it, visualize it and extract useful information from it is becoming a hugely important skill. How can you turn all those logs, histories of purchases and trades, or open government data into useful information that helps your business make money? In this talk, we’ll look at doing data science using F#. The F# language is perfectly suited for this task: type providers integrate external data directly into the language, so your language suddenly _understands_ CSV, XML, JSON, REST services and other sources. The interactive development style makes it easy to explore data and test your algorithms as you’re writing them. A rich set of libraries for working with data frames, time series and visualization gives you all the tools you need. And finally, F# easily integrates with statistical environments like R and Matlab, giving you access to the industry-standard libraries.
TRANSCRIPT
Doing data science with F#
Tomas Petricek | @tomaspetricek
PhD Student at Cambridge & Coordinator of http://fsharp.org
F# Software Foundation
http://www.fsharp.org
Word cloud: software stacks, trainings, teaching F#, user groups, snippets, Mac and Linux, community, books and tutorials, consulting, open-source, MonoDevelop, contributions, research, support, cross-platform, mailing lists
F# Data Science Working Group
www.fslab.org
Word cloud: Kaggle, Vega grammar, data sources, presentation, R provider, visualization, data acquisition, statistics, data cleaning, machine learning, data transformation, type providers, time-series, data aggregation
The Data
Demo: Understanding the world
F# Data type providers
First-class data
CSV, REST, WorldBank…
R Type provider
Statistics & visualization
5000 tested packages
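To make the "first-class data" idea concrete, here is a minimal sketch of the CSV type provider from the F# Data library, written as an F# script. It is not the code from the demo: the `Readings` type, the column names and all values are made up for illustration, and the script assumes the FSharp.Data NuGet package can be restored by `dotnet fsi`.

```fsharp
#r "nuget: FSharp.Data"
open FSharp.Data

// Inline sample text: the provider infers column names and types from it
// at compile time. Columns and values here are hypothetical.
type Readings = CsvProvider<"Label,Value\nfirst,1.5\nsecond,2.5">

// Parse data that has the same shape as the sample (again, made-up values)
let data = Readings.Parse("Label,Value\na,10.0\nb,32.5")

// Rows expose typed properties: Label is a string, Value is a decimal
let total = data.Rows |> Seq.sumBy (fun row -> row.Value)

for row in data.Rows do
    printfn "%s: %M" row.Label row.Value
printfn "Total: %M" total
```

Misspelling a column name (say `row.Valeu`) is a compile-time error rather than a runtime failure, which is the sense in which the language "understands" the data source.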
Visualize
Acquire
Analyze
Demo: Interactive visualizations
F# Charting library
Simple & composable
Interactive style
F# to JS with FunScript
Type-safe JS interop
Type provider support
Demo: US debt over the last century
Deedle data frame
Time-series alignment
Data transformations
Vega visualization
F# wrapper for Vega
Pre-alpha version
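As an illustration of the time-series alignment this slide mentions, here is a small sketch using Deedle in an F# script. It is not the US debt demo itself: the two series, their dates and their values are hypothetical, and the script assumes the Deedle NuGet package can be restored by `dotnet fsi`.

```fsharp
#r "nuget: Deedle"
open System
open Deedle

// Two hypothetical yearly series with only partly overlapping dates
let a = series [ DateTime(2000, 1, 1) => 1.0; DateTime(2001, 1, 1) => 2.0 ]
let b = series [ DateTime(2001, 1, 1) => 10.0; DateTime(2002, 1, 1) => 20.0 ]

// Building a frame aligns the columns on the union of the row keys;
// dates present in only one series show up as missing values
let frame = Frame.ofColumns [ "A" => a; "B" => b ]
frame.Print()

// Point-wise arithmetic also aligns by key; drop dates where either side is missing
let sum = (a + b) |> Series.dropMissing
sum.Print()
```

The point of the sketch is that no manual joining by date is needed: alignment on keys happens both when assembling the frame and when combining series with operators.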
F# for Data Science
acquire, analyze, visualize
interactive experience
safety and efficiency of .NET
ready for production
Going forward
Use #fsharp for fun & profit
Join local user groups
Help us build data science tools
fsharp.org | fslab.org | tomasp.net
@tomaspetricek