qdv
TRANSCRIPT
Categorize Data
Direct Value
Real Time Pricing
Real Time Bidding
Marketing Campaigns
Personalization
Item Availability
Relevance
Indirect Value Vs.
Inventory Optimization
Sales Transaction
Fraud Resolution
Product Management
Top Sellers
• Does it have revenue impact : Yes/No
• Is it critical for business : Yes/No
• Is it compliance related data : Yes/No
• Does it have external dependency: Yes/No
Categorize Data
Categorize Data
Compliance/Security
Strategic/ Business Critical
Revenue Impacting
Customer Impact
Impact Type
Last Access Metrics
Cost/Year …..
Customer Data Y Y Y Y Direct 1 Day $10000
Sales Transaction
Clickstream
Social
Sensor
Geo Spatial
Product Information
………
Data Platform : Architecture
May 21, 2015 19
Data Quality Service (Data Lineage & Profiling)
Security Scheduling & Cluster Monitoring
Applications & Visualization Tools
Dredge
Collection
• Apache Flume • Sqoop
Flow
• Kafka • Spark
Processing
• PIG • Spark • Map Reduce
Storage
• Hive • HBase • Vertica
Delivery
• Looker • Tableau • Visualization (d3.js) • Email/FTP
Data Platform
Data Access Abstraction
DAA is a abstraction layer for collecting data access patterns, enabling loosely coupled access to data.
WHAT IS DATA ACCESS ABSTRATION
Data Storage
Data Access API Data Ingestion Framework
Data Producers
Usage Patterns Storage
Data Consumers Usage Pattern Viewer
Data Telemetry
DAA
Data Access API
Request Manager
Data Formatter Query Service
Source End
Points Dredge
Events Management Log Streaming Configuration Abstraction
Delivery Service
Data Ingestion Framework
End Point Adapter Data Adapter
Data Consumers/ Producers
Data Consumers