rapidminer wisdom 2016 - hortonworks

Post on 22-Jan-2018

773 Views

Category:

Data & Analytics

3 Downloads

Preview:

Click to see full reader

TRANSCRIPT

#1 Agile Predictive Analytics Platform for Today’s Modern Analysts

Vamsi Chemitiganti

General Manager – Financial Services

How Predictive Analytics & Big Data are Disrupting Financial Services

©2016 RapidMiner, Inc. All rights reserved. - 2 -- 2 -©2016 RapidMiner, Inc. All rights reserved.

State of Global Banking

©2016 RapidMiner, Inc. All rights reserved. - 3 -

Financial Services and Big Data

Technology vectors

• Cloud computing (OpenStack)

• DevOps and PaaS

• Mobility

• Big Data and analytics

• BPM and microservices

• Software-defined datacenters

Business vectors

• Regulation and risk management

• Compliance and regulation

• Trading systems

• Omni-channel wealth management

• Payments systems

• Bank 3.0

Digital BankBank 3.0s

Focused around business and technology vectors

©2016 RapidMiner, Inc. All rights reserved. - 4 -- 4 -©2016 RapidMiner, Inc. All rights reserved.

Areas of Impact

©2016 RapidMiner, Inc. All rights reserved. - 5 -

Key areas within the financial services industry

©2016 RapidMiner, Inc. All rights reserved. - 6 -

Lifecycle of Big Data adoption

HDP helps FSIs drive efficiency gain..

©2016 RapidMiner, Inc. All rights reserved. - 7 -

Predictive Analytics on Hadoop

Write data prep and predictive analytics code for Hadoop

It’s complex, requires programming and specialized knowledge of each Hadoop

technology

Push automatically generated computations into Hadoop

It’s code-free, speaks Hadoop for you, and is 10 – 40 x faster to

implement Which would you prefer?

©2016 RapidMiner, Inc. All rights reserved. - 8 -

Impact of RapidMiner & Data Science

A representative sample only…

Survey of ML algorithms used (stated briefly for confidentiality purposes)

• Classification & Class Probability Estimation

• Regression

• Similarity Matching

• Clustering

• Co-Occurence Grouping

• Profiling

• Link Prediction

• Causal Modeling

• Most use cases typically revolve around a single view of Entity

©2016 RapidMiner, Inc. All rights reserved. - 9 -- 9 -©2016 RapidMiner, Inc. All rights reserved.

Digital Transformation

©2016 RapidMiner, Inc. All rights reserved. - 10 -

The digital journey in banking

©2016 RapidMiner, Inc. All rights reserved. - 11 -- 11 -©2016 RapidMiner, Inc. All rights reserved.

Cyber Security

©2016 RapidMiner, Inc. All rights reserved. - 12 -

Cyber security

©2016 RapidMiner, Inc. All rights reserved. - 13 -- 13 -©2016 RapidMiner, Inc. All rights reserved.

Customer Segmentation

©2016 RapidMiner, Inc. All rights reserved. - 14 -

Customer segmentation process

©2016 RapidMiner, Inc. All rights reserved. - 15 -- 15 -©2016 RapidMiner, Inc. All rights reserved.

Regulatory Risk Management

©2016 RapidMiner, Inc. All rights reserved. - 16 -

Proposed SolutionHortonworks Data Platform

LANDING DATA ZONE

L0

STANDARDIZED DATA ZONE

L1

CANONICAL DATA ZONE

L2 RegulatoryReports

Internal Reports

External Reports

Search

REPORTING/ANALYTICS ZONE

L3

Golden Source & Feeds

Master Data

Contrats

Balances

Transaction

Positions

Factors/Scenarios

Market Data

Unstructured Data(hdfs)

Original Data(hdfs)

RAW Datahdfs)

Standardized Data

(Hive/orc)

Materialized View

(Hive/orc)

sqoop/hadoop fs/nfs

Kafka/Storm

Java/Scala

Standardized Data

(Hive/Orc)

Hive/Spark/Scala

Hive/Spark/Scala

MHive/Spark/Scal

Hive/Spark/Scala

Standardized Data

(Hive/Orc)

Hive/Spark/Scala

Hive/Spark/Scala

Hive/Spark/Scala

TBD??

CanonicalPosition

Data(hive/orc)

CanonicalTransaction

Data(Hive/orc)

Hive/Spark

Scala/Python/R etc

Scala/Java

Hive/Spark

ScenariosResults

(Hive/orc)

Data Aggregations

(Hive/orc/Hbase)

Analytics/Reports

(Hive/orc/HBase)

Revision History

(Hive/orc)

Common Repositories/Meta Data Management

Security

Apache Atlas/Falcon/ Custom Solution

Apache Ranger/ Atlas and Custom/Partner Solution

©2016 RapidMiner, Inc. All rights reserved. - 17 -- 17 -©2016 RapidMiner, Inc. All rights reserved.

AML Compliance

©2016 RapidMiner, Inc. All rights reserved. - 18 -

Fraud/AML/Compliance Reference architecture

©2016 RapidMiner, Inc. All rights reserved. - 19 -- 19 -©2016 RapidMiner, Inc. All rights reserved.

Fraud Monitoring &

Detection

©2016 RapidMiner, Inc. All rights reserved. - 20 -

Fraud DetectionReference architecture

©2016 RapidMiner, Inc. All rights reserved. - 21 -

Modern Data Architecture with HWX and RM

©2016 RapidMiner, Inc. All rights reserved. - 22 -

RapidMiner™ Radoop

Big Data Predictive AnalyticsExtends RapidMiner’s visual predictive analytics to Hadoop and Spark

• We speak Hadoop so you don’t have toTranslates predictive analytics into native Hive, MapReduce, Spark, Pig and Mahout – you concentrate on competitive analytics, not Hadoop programming

• COMPLETE insights into your Big DataPushes analytic instructions into Hadoop for computation, so you can analyze the full breadth and variety of your Big Data

– Structured and non-structured

• Not just drag & drop: use your favorite Hadoop scripts, too!Incorporates your favorite SparkR, PySpark, Pig and HiveQLscripts within your predictive analytics workflow

• Safe and sound Integrates with Kerberos authentication, supports data access authorization for Apache Sentry and Apache Ranger – seamless for users, easy admin for IT

- 24 -CONFIDENTIAL

#1 Agile Predictive Analytics Platform for Today’s Modern Analysts

- 24 -©2016 RapidMiner, Inc. All rights reserved.

Vamsi ChemitigantiGeneral Manager Financial ServicesHortonworksvchemitiganti@hortonworks.com

top related