bringing open data science into excel with anaconda fusion

26
© 2016 Continuum Analytics- Confidential & Proprietary BRING OPEN DATA SCIENCE INTO EXCEL WITH ANACONDA FUSION Peter Wang, CTO Christine Doig, Senior Data Scientist & Anaconda Fusion Product Manager Fabio Pliger, Senior Software Engineer & Anaconda Fusion Tech Lead

Upload: continuum-analytics

Post on 10-Feb-2017

3.608 views

Category:

Data & Analytics


6 download

TRANSCRIPT

© 2016 Continuum Analytics- Confidential & Proprietary

BRING OPEN DATA SCIENCE INTO EXCEL WITH ANACONDA FUSION

Peter Wang, CTO Christine Doig, Senior Data Scientist & Anaconda Fusion Product Manager Fabio Pliger, Senior Software Engineer & Anaconda Fusion Tech Lead

2

• Empowering Analysts with Power of Anaconda • What is Anaconda? • Demo 1: Interactive Data Visualizations in Excel • Demo 2: Machine Learning in Excel • Demo 3: Big Data & ETL Processes in Excel • Demo 4: Python as a VBA replacement - Advanced mode • Summary

Agenda

Empower Business Analysts to: • Leverage data science assets easily through MS Excel • Enrich MS Excel analysis with the power of Python • Go beyond simple viz to powerful interactive viz

Empower Data Scientists to: • Extend their work to business analysts • Showcase their work through powerful interactive storytelling

3

Empowering Analysts with Power of Anaconda

ANACONDA

5

is…. leading Open Data Science platformPowered by Python the fastest growing data science language

• Accelerate Time-to-Value • Connect Data, Analytics & Compute • Empower Data Science Teams

© 2016 Continuum Analytics- Confidential & Proprietary 6

ACCELERATE Time-to-Value

INNOVATE faster through managed agile experimentation MOVE from analysis to deployment immediately DELIVER high performance analytics processing

CONNECT Data, Analytics & Compute

LEVERAGE innovative open source analytics to extract value from data MAXIMIZE your computational power to easily analyze all your data CONNECT and integrate all your data sources for predictive models

EMPOWER Data Science Teams

ITERATE quickly to create powerful analysis and predictive models COLLABORATE and share with your data science team PUBLISH interactive results to the business

© 2016 Continuum Analytics- Confidential & Proprietary 7

Data ScientistBiz Analyst Data EngineerDeveloper DevOps

Deploy & Operate

Explore & Analyze

Collaborate & Publish

Data Science Team

WHAT IS ANACONDA FUSION?

9

Connects Open Data Science with Microsoft Excel

Anaconda Fusion

• BRING interactive visualizations, machine learning and ETL to Excel

• BRIDGE Excel Data to Python and R through notebooks

• ACCESS all the power of Python and Big Data, natively embedded inside Excel

10

Biz Analysts

Explore & Analyze

Data Scientist

Collaborate

Empower the Data Science Team• Connect Analysts and Excel users with Open Data Science • Leverage analysis tools the team is already skilled with: Excel • Make it easy for Data Scientists to share their work with business leaders

11

Accelerate Time-to-Value• Increase impact of Data Science throughout the organization • Drive business value of analytic solutions by involving analysts and management • Make analyst teams more productive

Data ScientistBiz AnalystsBig Data & ETL

Interactive Data Visualizations

Client Machine Compute Node

Compute Node

Compute Node

Head Node

Machine Learning

Statistics and Advanced Analytics

12

• Connect Excel to the Anaconda Platform via Jupyter notebooks • Unified UI with spreadsheets (data), notebooks (analytics) and a kernel (compute) • Expose and explore Big Data in Excel

Connect Data, Analytics and Compute

ANALYTICS

COMPUTE

DATA

13

Excel, SQL, Tableau

Wor

ks

wit

hD

eliv

ers

Dat

a

spreadsheets spreadsheets, dataframes, tables dataframes, tables

spreadsheets, reports, visualizations

spreadsheets with macros predictive models, data transformations,

interactive data visualizations

Data ScientistAnalyst/Manager Advanced Analyst

Excel, SQL, VBA, SAS

(Python, R)

SQL, Hadoop Python, R

14

Data ScientistAnalyst/Manager Advanced Analyst

in magic mode in expert mode

Anaconda Fusion for Everyone

from fusion import fusion

15

Data ScientistAnalyst/Manager Advanced Analyst

16

Anaconda Repository

DeveloperBiz Analyst Data Scientist

Anaconda Enterprise Notebook Server

Anaconda Fusion

Build and publish

software packages

Install software packages

Collaborate with

notebooksAccess

notebooks functionality

in Excel

DevOps / IT

Manage and control accounts, software and data access and

guarantee security and complianceDeploying Anaconda in Enterprises

• No Python install needed for Business Analysts (small footprint)

• Computations can run server-side

• Python 2 and 3 support

DEMOS

18

Demo 1: Interactive Data Visualization for Analysts

Interactive visualizations for exploring Excel data • Move around, zoom in and out • Easily change visualization types • Select & highlight points

19

Demo 2: Machine Learning for everyone

• Analysts • Choose functions & parameters with “magic

mode” • Access and reference Excel data

• Data Scientists • Learn how to use Anaconda Fusion decorator

in notebooks to configure “magic mode” • Everyone

• Access all the rich capabilities in Python (including scikit-learn for machine learning)

20

Demo 3: Harness Big Data from Excel

• For Analysts: • Execute queries from Excel • Import and manage data from Hadoop

• For Data Scientists • Connect to Hadoop via Dask, Ibis, Impala • Build interactive query apps

Client Machine Compute Node

Compute Node

Compute Node

Head Node

21

Demo 4: Python as VBA replacement for Advanced Analysts

• Write Python scripts in Excel • Leverage Python ecosystem for

Open Data Science

SUMMARY

23

Why Anaconda Fusion?

Anaconda Fusion Connects Open Data Science with Microsoft Excel

EMPOWER Data Science Teams

ACCELERATE Time-to-Value

CONNECT Data, Analytics & Compute

24

https://www.continuum.io/anaconda-subscriptions

Anaconda Fusion is available with Anaconda Enterprise Subscription

25

Join the Innovators Program

Free of cost!

https://go.continuum.io/anaconda-fusion-innovators/

Q&A