big data education webcast: introducing dmx and dmx-h release 8
TRANSCRIPT
Greg Grubbs, Product Manager | Big Data
Jorge A. Lopez, Director Marketing | Big Data
DMX-h Release 8
Syncsort Confidential and Proprietary - do not copy or distribute2
Introducing Syncsort DMX-h Release 8
Intelligent Execution Layer isolates users from underlying complexities of Hadoop
Design once. Deploy anywhere - with or without Hadoop; on-premises or in the Cloud
Provides the most complete, end-to-end solution for offloading heavy legacy workloads to Hadoop
Delivers best-in-class, one-step data ingestion capabilities for Hadoop: mainframes, RDBMS, MPP, JSON, Avro/Parquet, NoSQL, and more
Facilitates metadata management and data lineage by automatically updating HCatalog when loading to Hive, Avro and Parquet
Syncsort Confidential and Proprietary - do not copy or distribute3
A Complete Solution to Harness the Power of Hadoop
Syncsort Confidential and Proprietary - do not copy or distribute4
Collect
Build Your Enterprise Data Hub
Hadoop + DMX-h
Avro
Parquet
Cassandra
MongoDB
Mainframe
Vertica
Oracle
Teradata
Netezza
JSON HBaseFiles
Cloud
• Collect virtually any data from mainframe to Big Data and NoSQL sources • Access, re-format and load data directly into Avro & Parquet. No staging
required• Access & translate mainframe data using Sqoop and Spark• Load more data into Hadoop in less time. Let DMX-h dynamically split the data
and load it to HDFS in parallel
Syncsort Confidential and Proprietary - do not copy or distribute5
Hive and HCatalog
Read & write to Hive
Support for multiple file formats including text, Avro, Parquet
Metadata using HCatalog (Hive Meta Store)
Dynamically parallelize load to Hive
0.00
2.00
4.00
6.00
8.00
10.00
12.00
1.01 2.03 4.50
Ho
ur
Hive Write Throughput (TB/hour)
DMX ODBC Parallel
Hive Command
Syncsort Confidential and Proprietary - do not copy or distribute6
Prepare
Get Your Data Ready for Analytics
• Sort• Cleanse• Partition• Translate
• Reformat• Compress• Validate
• Prepare your data on-the-fly at lightning speeds before loading into Hadoop
• Increase data compression ratios by up to 10x • Achieve significant storage savings
Hadoop + DMX-h
Syncsort Confidential and Proprietary - do not copy or distribute7
Blend
Find Bigger Insights by Combining New and Legacy Data
• Fastest, most efficient data joins• Best-in-class mainframe data access & translation• Common user experience with or without Hadoop!• No need to worry about mappers, reducers, big side, small side and so on• No code to generate, compile, maintain or tune!
Mainframe
JSON
RDBMS
Syncsort Confidential and Proprietary - do not copy or distribute8
Transform
Design Once, Deploy Anywhere!
• Free your users from the underlying complexities of Hadoop• Visually design data transformations once, and run anywhere • No changes or tuning required• Intelligent Execution Layer dynamically optimizes the job for each platform:
Hadoop, Windows, Unix, Linux or Cloud• Future-proof your applications!
Inte
llige
nt
Exec
uti
on
Lay
er Windows, Linux, Unix
Hadoop
Cloud
Syncsort Confidential and Proprietary - do not copy or distribute9
DEMO
DEMO
Syncsort Confidential and Proprietary - do not copy or distribute10
Distribute
Achieve the Fastest Path from Raw Data to Insight
• Create Tableau & Qlikview files with one click• Achieve the fastest data loads without tuning hassles:
• Fastest parallel loads to Greenplum, Netezza, Teradata & Vertica• High-performance connectivity to Big Data & NoSQL databases such as
Cassandra, Hbase & MongoDB
Hadoop + DMX-h
NoSQL
Syncsort Confidential and Proprietary - do not copy or distribute11
Not Using Hadoop?
Single Design Experience =
ETL Anywhere!
Best-in-class Data Visualizations,
Just a Click Away
Complete Access to All Your Data,
Big or Small
Web Based Monitoring &
Administration
Secure Mainframe Data Access
Intelligent Execution Layer
Windows, Linux, Unix, Cloud, and
more… when you’re ready
Syncsort Confidential and Proprietary - do not copy or distribute12
DEMO
DEMO
Syncsort Confidential and Proprietary - do not copy or distribute13
Plus… The Only Tool Specifically Designed for EDW Offload
Now with automatic DTL generation!
• Web-based utility
• Takes SQL as an input
• Provides visual analysis of SQL ELT jobs
• Generates metadata and data migration with DMX jobs
• Supports ANSI-SQL 2011, BTEQ, Netezza, Oracle PL/SQL
Syncsort Confidential and Proprietary - do not copy or distribute14
Save users from underlying Hadoop complexities Future-proof your applications. Design once, deploy anywhere! Offload heavy ELT workloads to Hadoop Secure, monitor, manage and scale with minimum effort
Sign up for a Free Trial!
Break Free from ETL Complexity
Experience DMX & DMX-h Release 8
Syncsort.com/dmxh8
Watch this webcast on demand – including the product demos!
http://bit.ly/1wI1SRN