future of data - big data

Post on 25-Dec-2014

1.376 Views

Category:

Technology

0 Downloads

Preview:

Click to see full reader

DESCRIPTION

 

TRANSCRIPT

Future of Data : Big Data

Shankar Radhakrishnan

Cognizant

Topics

How did we get here ?

Data Explosion

Big Data

Big Data in an Enterprise

Big Data Platform - Hadoop

Big Data Adoption

Q & A

How did we get here?

Familiar World EDW

Datamarts

Familiar Problems

New World Newer type of data to integrate

Increase in volume

Newer analytical requirements

Data warehouse

Data Integration Problems

Data Processing Problems

Storage Management

Performance Problems

Limitations out of Complexity

Data Explosion

Newer Interests

Social Intelligence

DBIM, Sentiment Analysis, Social Customer Care

Predictive Analytics

Propensity, Price Elasticity, Anti-Fraud Analytics

Segmentation Insights

Funnel Analysis, Behavioral Patterns, Cohort Analysis

Mobile Analytics

Ad-Targeting, Geo-spatial Analytics

Categories

Structured Data

Enterprise Data (CRM, ERP, Data Stores, Reference Data)

Semi-structured Data

Machine Generated Data (Sensor Data, RFIDs)

Unstructured Data

Social Data (Comments, Tweets), Blog posts

Big Data

Big Data

Volume

Velocity

Variety

Complexity

“Big Data” refers to high volume, velocity, variety and complex information assets that

demand cost-effective, innovative forms of information processing for enhanced insight

and decision making

Big Data Platforms

• Data Integration o Informatica, Infosphere

o talenD, Pentaho, Karmasphere, Apache Sqoop, Apache Flume

• Database Framework o Hadoop (Distributions: Cloudera, Hortonworks, MapR)

o Hbase

o Hive

• NoSQL Databases o MongoDB, CouchDB

• Machine Data Processing o Splunk, Mahout

• Text Analytics o Clarabridge, Lexanalytics

Big Data in an Enterprise

Data warehouse Data

Sources ETL

Big Data Sources

ETL Big Data Platform

ETL Datamarts

Datamarts

Datamarts Analytical

Applications

Hadoop - Ecosystem

Big Data : Adoption Drivers

Platform

Cluster

Storage

Availability

Process

Distributed

Scalable

Performance

Possibilities

Data Integration

Data Processing

Actionable Insights

Ecosystem

Augmented

TCO

ROI

Big Data – Adoption Scenarios

Replatforming to Big Data (Hadoop, MapR)

Archival Solution (Hadoop)

Offloading Data warehouse, EDW (Hadoop, Hive)

Social Media Integration

Machine Data Analysis (Splunk, Mahout)

Complex Analytical Requirements (Hbase)

Q & A

top related