Download - Bluegranite AA Webinar FINAL 28JUN16
![Page 1: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/1.jpg)
Microsoft R: A Revolution in Advanced Analytics
Presenters:• Andy Lathrop, Principal Consultant
• Mike Cornell, Senior Solution Consultant
Contributors:• David Eldersveld, Solution Consultant
• Jon Trapane, Staff Consultant
www.blue-granite.com A link to these slides and additional resources will be sent following the presentation
![Page 2: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/2.jpg)
Agenda (40-45 minutes):• Introduction of key organizations and
technologies (10 min)• Value of advanced analytics (5 min)• Demonstration; R in action (15 min)
• Local• AzureML• SQL2016• HDInsight• PowerBI
• Getting started (5 min)• Q&A (5-10 min)
Overview
Objectives:• Introduction to the R platform• Business value of enterprise-class R• Demonstration of R in multiple environments• Next steps
![Page 3: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/3.jpg)
Enable the business to store and analyze large volumes of data with optimized systems that can scale quickly to meet demand.
Help business users and decision makers understand past performance through visualizations, dashboards and automated reporting.
Solve challenging problems using mathematical models to prescribe actions and maximize business objectives.
Business Insights. Delivered.Founded in 1996, BlueGranite partners with Microsoft to deploy data warehousing, business intelligence, and advanced analytic solutions.
About BlueGranite
![Page 4: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/4.jpg)
Brief history of Microsoft R capabilities
About Microsoft R
• Revolution Analytics brought commercially supported, high performance computing to R, overcoming many obstacles for enterprise applications using large data sets.
• Revolution Analytics acquired by Microsoft in 2015.
• Microsoft is the highest rated vendor in both Business Intelligence and Advanced Analytics for Completeness of Vision (Gartner 2016)
“…this is about more than just R; it's about Microsoft's very identity. Microsoft has decided -- and I think rightly so -- that
the next era of computing, while enabled by the cloud, will feature data-driven intelligence; in platforms, in applications and in devices.” – ZDNet, “Microsoft's R Strategy”, May 2016
![Page 5: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/5.jpg)
Business Value of Advanced Analytics: Key Solution Areas
https://gallery.cortanaintelligence.com/
Buzzwords:
• Customer Analytics
• Predictive Maintenance
• Fraud Detection
• Demand Forecasting
• Price Optimization
• Customer Segmentation
• Campaign Analytics
![Page 6: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/6.jpg)
Business Value of Advanced Analytics: Key Analytic Capabilities
PWC: Data and Analytics: Creating or Destroying Shareholder Valuehttps://www.pwc.com/us/en/analytics/publications/assets/pwc-data-analytics-creating-or-destroying-shareholder-value.pdf
Buzzwords:
• Predictive Analytics
• Machine Learning
• Forecasting
• Regression
• Data Mining
• Clustering
• Segmentation
• Optimization
![Page 7: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/7.jpg)
Introduction to R
• 1993: Created by Ross Ihaka and Robert Gentlemen in Auckland, NZ as an open-source implementation of the S programming language
• Most widely used data analysis software#1 for data science; #6 general purpose (IEEE Spectrum Rankings)
• Common programming language of analytics and statistical computing
• Unique and immersive data visualizations
• Open-source, extensible, scalablelibrary of 7500+ add-on packages; community of millions of users
CRAN PackagesR Popularity is Growing!
![Page 8: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/8.jpg)
Introduction to Microsoft R Open (MRO)
o Enhanced open-source (CRAN) R distribution
o 100% compatible with all R-related software;CRAN packages, RStudio, and third-party R integrations
o Faster performance with multi-threaded math libraries
o CRAN “Time Machine” for reproducibility
o Available for Windows, Mac, Linux
o Free and Open Source
o Foundation for commercially-supported version (R Server)
o Available at mran.microsoft.com
MICROSOFT R OPEN: About
![Page 9: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/9.jpg)
Microsoft R Product Family
![Page 10: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/10.jpg)
Microsoft R: Write Once, Deploy Anywhere
![Page 11: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/11.jpg)
Deployment Environments
Cortana Intelligence SuiteA suite of products that allow you to Predict Outcomes, Prescribe Actions and Automate Decisions for Operationalized Solutions
Cloud On-Premises
![Page 12: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/12.jpg)
Demonstration
![Page 13: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/13.jpg)
SQL Server R Services
Data scientists / analysts work in
existing development environment
Use optimized RevoScaleR functions
(available with Microsoft R Client)
Compute in-database
instead of using local resources
Enhance SQL data with R features
Embed R code in
SQL stored procedures
Send SQL input
Receive dataset, model, or plot output
Consume in applications
or other client tools
Development Deployment
![Page 14: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/14.jpg)
SQL Server R Services
Development Deployment
EXEC sp_execute_external_script
@language = N’R’,
@script = N’[R code goes here]’,
@input_data_1 = N’[SQL input]’
[ , @input_data_1_name = N‘InputDataSet’ ]
[ , @output_data_1_name = N’OutputDataSet’ ]
[ , @params = N’parameter’ ]
WITH RESULT SETS (([SQL output]));
sql <- RxInSqlServer([SQL connection])
rxSetComputeContext(sql)
[…]
I/O RxSqlServerData(), etc.
Stats rxSummary(), etc.
Models rxLogit(), etc.
Plots rxRocCurve(), etc.
![Page 15: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/15.jpg)
HDInsight with R Server
Use familiar IDEs like R Studio
Machine Learning on Terabytes of Data
Bring your compute to the data in the data lake, not the other way around!
No Hadoop experience? No Problem!
R Server for HDInsight
![Page 16: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/16.jpg)
Local Local Parallel MapReduce or Spark
Execution Contexts in HDInsight with R Server
![Page 17: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/17.jpg)
Quick HDInsight with R Server Example
/data/income/income.csv
![Page 18: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/18.jpg)
Quick HDInsight with R Server Example
Set the compute context to ‘local’.
Check Hadoop version and explore the “data” folder in HDFS.
Define path to income.csv file, create a data source using the path specifying the HDFS file system, and view the variables for the data source.
Create a Logistic Regression model for the binary income variable using age, education, race, and sex as features.
Change the compute context to ‘localpar’ and create the same Logistic Regression model
Change the compute context to RxSpark() and create the same Logistic Regression model
![Page 19: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/19.jpg)
Azure Machine Learning
o ML Studio enables you to build an end-to-end, data science workflow in the form of an experiment
o Drag-and-drop predictive modeling
o Large library of modules to develop custom solutions
o Use existing R code with minimal modifications
o ML API service enables you to deploy predictive models as scalable web services
Azure ML: FeaturesA fully managed cloud service that enables you to easily build, deploy, and share predictive analytics solutions
Cloud
Local
![Page 20: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/20.jpg)
Power BI
o Display interactive reports across the whole organization
o Connect to 50+ data sources
o Including: Azure, Excel, GitHub, Visual Studio Online
o Rich visualization capability, including R graphics
Power BI: FeaturesA suite of business analytics tools to analyze data and share insights
Power BI Mobile apps
Power BI Desktop / Web
![Page 21: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/21.jpg)
R in Power BI
Use an R Script as a Data Source
Use an R Script to Create a Visualization
![Page 22: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/22.jpg)
Project Management and Analytics Governance
Source control, reporting, and project management
capabilities
Share code, track work, and ship software
Distributed version control system
Hosted public and private repositories; integrated with Visual Studio
![Page 23: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/23.jpg)
Review of Benefits
IT Director/CIOo Streamlined workflow: avoid re-engineeringo High-performance processing on high-value datao Align IT and data science developmento Integrate Microsoft data platform with advanced
analytics o Reduced data movement = higher performance and
security
Analytics Director/VPo Better analytics governance and collaboration via
centralized model and project management (AzureML, TFS, VSO, Jupyter)
o Broaden the reach of enterprise analytics
Data Scientisto Better tools = more opportunitieso Spend less time on data management – run your models
where the data resideso Take an active role in operationalizing analyticso Cool tech! Expand knowledge to cloud, Big Data, and
modern data platform
![Page 24: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/24.jpg)
Getting Started
![Page 25: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/25.jpg)
https://mran.revolutionanalytics.com/documents/getting-started/
Getting Started
![Page 26: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/26.jpg)
MICROSOFT AZURE: Getting Started
Create and share documents that contain live code, visualizations, and explanatory text
DATA SCIENCEvirtual machine
$200Azure credit on sign-up
Easily build, deploy and share predictive analytics solutions
![Page 27: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/27.jpg)
Thank you for attending!
For more information about Blue Granite, please visit us at www.blue-granite.com
Links to additional resources related to this webinar, including these slides, will be available via follow-up email
![Page 28: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/28.jpg)
Additional Information
![Page 29: Bluegranite AA Webinar FINAL 28JUN16](https://reader034.vdocument.in/reader034/viewer/2022051301/5a6d26547f8b9aff418b4e01/html5/thumbnails/29.jpg)
Microsoft R Parallelized Algorithms and Functions