big data launch keynote singapore patrick buddenbaum
DESCRIPTION
TRANSCRIPT
Open Platform for Next-Gen Analytics
Director, Enterprise Segment
Datacenter and Connected System Group
Patrick Buddenbaum
Today’s presentations contain forward-looking statements. All statements made that are not historical facts are subject to a number of risks and uncertainties, and actual results may differ materially. Please refer to our most recent Earnings Release and our most recent Form 10-Q or 10-K filing for more information on the risk factors that could cause actual results to differ.
If we use any non-GAAP financial measures during the presentations, you will find on our website, intc.com, the required reconciliation to the most directly comparable GAAP financial measure.
INFORMATION IN THIS DOCUMENT IS PROVIDED “AS IS”. NO LICENSE, EXPRESS OR IMPLIED, BY ESTOPPEL OR OTHERWISE, TO ANY INTELLECTUAL PROPERTY RIGHTS IS GRANTED BY THIS DOCUMENT. INTEL ASSUMES NO LIABILITY WHATSOEVER AND INTEL DISCLAIMS ANY EXPRESS OR IMPLIED WARRANTY, RELATING TO THIS INFORMATION INCLUDING LIABILITY OR WARRANTIES RELATING TO FITNESS FOR A PARTICULAR PURPOSE, MERCHANTABILITY, OR INFRINGEMENT OF ANY PATENT, COPYRIGHT OR OTHER INTELLECTUAL PROPERTY RIGHT.
Performance tests and ratings are measured using specific computer systems and/or components and reflect the approximate performance of Intel products as measured by those tests. Any difference in system hardware or software design or configuration may affect actual performance. Buyers should consult other sources of information to evaluate the performance of systems or components they are considering purchasing. For more information on performance tests and on the performance of Intel products, reference www.intel.com/software/products.
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult otherinformation and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.
Intel product plans in this presentation do not constitute Intel plan of record product roadmaps. Please contact your Intel representative to obtain Intel's current plan of record product roadmaps.
Legal Information
Making Sense of One Petabyte
50xTo read
in Library of Congress
13yTo view
as HD Video
11sTo generate
in 2012
Sources: IDC 2012, The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far Easthttp://blogs.loc.gov/digitalpreservation/2011/07/transferring-libraries-of-congress-of-data/
Analysis of Data can Transform Society
Enhance understanding, drive innovation, and accelerate medical cures
Create new business models and transform organizational processes
Improve public safety and increase energy efficiency with smart grids
Virtuous Cycle of Data-Driven User Experience
CLOUD
Richer data to analyze
CLIENTS
Richer data from devices
Richer user experiences
INTELLIGENT SYSTEMS
Democratize Data Analysis from Edge to Cloud
Unlock value in silicon
Support open platforms
Intelligent Systems Framework
Intel at the Intersection of Big Data Forces
Enabling exascale computing on massive data sets
Helping enterprises build open interoperable clouds
Contributing code and fostering ecosystem
HPC Cloud Open Source
Intel®TrueScaleInfiniband
* Other names and brands may be claimed as the property of others.
Research
Benchmarking
TuningOptimization
Product
History of Intel and Apache Hadoop*
2009 2013
Open Cirrus*
HiBenchRelease 1.0
(2011)
* Other names and brands may be claimed as the property of others.
Release 2.0(2012)Telco Smart City
Web
RetailHealthcare
Announcing Availability ofIntel® Distribution for Apache Hadoop* software
Hardware-enhanced performance & security
Enables partner innovation in analytics
Strengthens Apache Hadoop* ecosystem
* Other names and brands may be claimed as the property of others.
Intel® Distribution for Apache Hadoop* software
• Up to 20x faster decryption with AES-NI*• Granular access controls for Hbase
• Optimized with SSD and Cache Acceleration• Up to 8.5X faster queries in Hive• Hardware-enhanced compression with AVX & SSE4.2
• Automated tuning with Intel® Active Tuner
*Based on internal testing
Intel Distribution for Apache Hadoop* software
* Other names and brands may be claimed as the property of others.
Intel® Manager for Apache Hadoop softwareDeployment, Configuration, Monitoring, Alerts, and Security
HDFSHadoop Distributed File System
YARN (MRv2)Distributed Processing Framework
HB
ase
Colu
mna
r St
ore
Zook
eepe
rCo
ordi
natio
n
Flum
eLo
g Co
llect
orSq
oop
Dat
a Ex
chan
ge PigScripting
HiveSQL Query
OozieWorkflow
MahoutMachine Learning
R connectorsStatistics
Intel enhancements contributed back to open source
Open source components included without change
Intel unique
Sold with World-Class Intel Support
Annual Subscription with Technical Support
Support Coverage Options: 24x7 or 8x5
Via Solution Vendors and Service Providers
Continued Innovation
* Other names and brands may be claimed as the property of others.
Pipeline of innovation from Intel Labs• Machine Learning, Graph Lab & Graph Builder• Data-Intensive Algorithms & Computer Architecture
Roadmap of open source from Intel Software• Project Rhino: Hardening Apache Hadoop• Project Panthera: Standard SQL on Apache Hadoop
Backed by Broad Portfolio of Datacenter ProductsSoftware
CacheAccelerationSoftware
NetworkStorage & MemoryServer
* Other names and brands may be claimed as the property of others.
Antoine HueRegional Sales Manager
APJC Data Center
>4 Hours to 7 MinutesIntel Platform Benefits for Sorting 1TB Data
Intel® Xeon 5690
7200 HDD
1GbE Adapters
Intel® Xeon®
E5-2690processor
~50%improved
Intel® SSD 520
Series
~80%improved
Intel® 10GbE
Adapters
~50%improved
Deploy IntelDistribution for Apache Hadoop*
~40%improved
Software and workloads used in performance tests may have been optimized for performance only on Intel microprocessors. Performance tests, such as SYSmark and MobileMark, are measured using specific computer systems, components, software, operations and functions. Any change to any of those factors may cause the results to vary. You should consult other information and performance tests to assist you in fully evaluating your contemplated purchases, including the performance of that product when combined with other products.
Source: Intel Internal testingFor more information go to : intel.com/performance
`
>4 Hours
~7 mins
Proven in the Enterprise
Using the Intel® Distribution to gain tremendous results
* Other names and brands may be claimed as the property of others.
IT
Customer Video
With Broad Support from the Ecosystem
* Other names and brands may be claimed as the property of others.
Chris LevanesDirector of Cloud Business Development
Savvis Asia
The Promise of Big Data Requires Industrialized Services
• Trusted, mission critical, high-powered computing solutions
• Robust security options
• Enterprise-grade global storage capabilities
• Highly available compute power
• Cloud-based economic model
• Expert consulting services to aide in transformation of data assets
Big Data Customers Need
BIGDATA
A Longstanding Successful Alliance
Enterprise-Grade, Industrialized Infrastructure Services for Intel Distribution for Apache Hadoop Software
Summary
• Intel announced Intel® Distribution for Apache Hadoop* software
• Delivers performance, security and ease of deployment
• Backed by broad portfolio of Intel data center products
• Contributes to open source and supports Apache Hadoop
• Enabling ecosystem of partners to innovate on analytics solutions
Q&A
Legal DisclaimersAll products, computer systems, dates, and figures specified are preliminary based on current expectations, and are subject to change without notice.Intel processor numbers are not a measure of performance. Processor numbers differentiate features within each processor family, not across different processor families. Go to: http://www.intel.com/products/processor_number
Intel, processors, chipsets, and desktop boards may contain design defects or errors known as errata, which may cause the product to deviate from published specifications. Current characterized errata are available on request.
Intel® Virtualization Technology requires a computer system with an enabled Intel® processor, BIOS, virtual machine monitor (VMM). Functionality, performance or other benefits will vary depending on hardware and software configurations. Software applications may not be compatible with all operating systems. Consult your PC manufacturer. For more information, visit http://www.intel.com/go/virtualization
No computer system can provide absolute security under all conditions. Intel® Trusted Execution Technology (Intel® TXT) requires a computer system with Intel® Virtualization Technology, an Intel TXT-enabled processor, chipset, BIOS, Authenticated Code Modules and an Intel TXT-compatible measured launched environment (MLE). Intel TXT also requires the system to contain a TPM v1.s. For more information, visit http://www.intel.com/technology/security
Intel, Intel Xeon, Intel Atom, Intel Xeon Phi, Intel Itanium, the Intel Itanium logo, the Intel Xeon Phi logo, the Intel Xeon logo and the Intel logo are trademarks or registered trademarks of Intel Corporation or its subsidiaries in the United States and other countries.
Other names and brands may be claimed as the property of others.
Copyright © 2013, Intel Corporation. All rights reserved.
Apache Hadoop Performance Test Configuration4 hours to 7 minutes
Cluster Configuration 1 Head Node (name node, job tracker) 10 Workers (data nodes, task trackers) 10-Gigabit Switch: Cisco Nexus 5020
Software Configuration Intel Distribution for Apache Hadoop 2.1.1 Apache Hadoop 1.0.3 RHEL 6.3 Oracle Java 1.7.0_05
29
Head Node Hardware 1 x Dell r710 1U servers
Intel: 2x3.47GHz Intel® Xeon®
processor X5690 Memory: 48G RAM Storage: 10K SAS HDD Intel® Ethernet 10 Gigabit SFP+ Intel® Ethernet 1 Gigabit
Worker Node Hardware 10 x Dell r720 2U servers
Intel: 2 x 2.90Ghz Intel® Xeon® processor E5-2690 Memory: 128G RAM Storage: 520 Series SSDs Intel® Ethernet 10 Gigabit SFP+ Intel® Ethernet 1 Gigabit