john glendenning - real time data driven services in the cloud

26
John Glendenning DataStax ‘Real-time data driven services in the Cloud’

Upload: weareesynergy

Post on 26-Jan-2015

105 views

Category:

Technology


2 download

DESCRIPTION

John Glendenning from DataStax's presentation from our Big Data breakfast conference

TRANSCRIPT

Page 1: John Glendenning - Real time data driven services in the Cloud

John Glendenning

DataStax

‘Real-time data driven services in the Cloud’

Page 2: John Glendenning - Real time data driven services in the Cloud

Real-time Data Driven Services in the CloudJohn Glendenning, DataStaxVP & GM EMEA

Page 3: John Glendenning - Real time data driven services in the Cloud

Line of Business Manager: Adapt With Customers

“I have to move as fast as my market. I can’t get

slowed down by people telling me this is going to

take six months. It’s got to be ready, quickly. No matter

what. And I need to adapt quickly with my customers.

Page 4: John Glendenning - Real time data driven services in the Cloud

VP of IT: How Can I Scale Without Surprises?

“Given the explosion of data in the enterprise, how can I scale my IT investment to meet the demands of my lines of business, without taking on undue risk? (My choices are to spend $10 million to scale what I’ve got versus do something new)”

Page 5: John Glendenning - Real time data driven services in the Cloud

Nearly All Businesses Must Think Global

Datacenter

Cloud

About 1/2 OF ALL SALES will be online BY THE END

OF 2013

Source: (http://www.datastax.com/resources/whitepapers/bigdata)

24/7 monitoring demands

Globalmarket

demands

Localizationdeployment

Page 6: John Glendenning - Real time data driven services in the Cloud

Your Data Demands Can Change in an Instant

2012

2011

2010

2009

Fluctuating

traffic demands

14

24

25

13

Fi

5

24

Page 7: John Glendenning - Real time data driven services in the Cloud

Major Changes:

The Evolving

Data Center

Page 8: John Glendenning - Real time data driven services in the Cloud

DataStax in the News

Big movies, big data: Netflix embraces NoSQL in the cloud

With billions of reads and writes daily, Netflix relies on NoSQL database Cassandra to replace a legacy Oracle deployment

May 02, 2013

(AP) The company chose Cassandra from DataStax for its flexibility to create and manage data clusters quickly, particularly in the cloud. Christos Kalantzis, Netflix's manager of cloud and platform engineering, explains that "solutions like Oracle don't run very well on virtualized hardware ... the architecture of Cassandra and the availability and consistency tuning and scalability made it a clear choice." To address these

Page 9: John Glendenning - Real time data driven services in the Cloud

Major Changes: The Evolving Data Center

LOBApp

Oracle

LOBApp

MySQL

LOBApp

SQLServe

r

“What’s Happening?”Hyper VelocityTransactional

NoSQL

Data Warehouse

Teradata/Exadata

“What Happened?”Massive Volume

Bit Bucket

Hadoop

Page 10: John Glendenning - Real time data driven services in the Cloud

Not Only SQL

Page 11: John Glendenning - Real time data driven services in the Cloud

What is a NoSQL Solution?

NoSQL is a broad class of next-generation database management systems that differ from the classic model of the relational database management system (RDBMS) in some significant ways, most important being they are:

• Designed from the ground up to deal with the challenges of Big Data

• Massively scalable at a fraction of the cost of a traditional RDBMS

• Less-rigid, more dynamic data model that drives flexibility and agility

• Can store structured, semi-structured and unstructured data• Not beholden to traditional RDBMS constraints such as ACID

compliance

Page 12: John Glendenning - Real time data driven services in the Cloud

What is Apache Cassandra?

Apache Cassandra™ is a massively scalable distributed open source database.

Cassandra is designed to handle big data workloads across multiple data centres with no single point of failure, providing enterprises with continuous availability without compromising performance.

Page 13: John Glendenning - Real time data driven services in the Cloud

Cassandra Architecture Overview

• Fast / Linear performance• Elastic scalability • No single point of failure • Enterprise / multi-data center /

cloud data distribution• Location independence – read

and write anywhere• Tunable data consistency (per

operation) • Familiar SQL-Like language –

CQL • Dynamic / Flexible schema• Can store structured, semi-

structured and unstructured data

• Replication Strategies from Amazon Dynamo paper• Data structure and storage design from Google

BigTable paper

Page 14: John Glendenning - Real time data driven services in the Cloud

Apache Cassandra Leading in Performance“In terms of scalability, there is a clear winner throughout our experiments. Cassandra achieves the highest throughput for the maximum number of nodes in all experiments with a linear increasing throughput.”Solving Big Data Challenges for Enterprise Application Performance Management, Tilman Rable, et al., August 2013, p. 10. Benchmark paper presented at the Very Large Database Conference, 2013. http://vldb.org/pvldb/vol5/p1724_tilmannrabl_vldb2013.pdf

http://techblog.netflix.com/2011/11/benchmarking-cassandra-scalability-on.html

Netflix Cloud Benchmark…End Point Independent NoSQL

Benchmark

Highest in throughput…

Lowest in latency…

Page 15: John Glendenning - Real time data driven services in the Cloud

Who’s using Cassandra?

Page 16: John Glendenning - Real time data driven services in the Cloud

Why We Exist

“I can create a Cassandra cluster in any region of the world in 10 minutes. When marketing guys decide we want to move into a certain part of the world, we’re ready.”

Today’s applications must be always available and lightning fast as they scale to previously unimaginable levels.

Cassandra delivers both with a beautifully simple and elegant architecture.

Page 17: John Glendenning - Real time data driven services in the Cloud

What We Do Best

Cassandra was designed to do things that are impossible in other databases when it comes to availability and performance.  Forget about losing a machine here or there -- Cassandra delivers a world where you can lose an entire datacenter and still perform as your customers expect.

“We have to be ready for disaster recovery all the time. It’s really great that Cassandra allows for active-active multiple data centers where we can read and write anywhere”

Jay PatelTechnical Architect at eBay(Describing why they switched from legacy relational architecture)

Page 18: John Glendenning - Real time data driven services in the Cloud

Without Breaking Your Budget

“To do what we need to do today without Cassandra would cost a couple million dollars more and would be significantly harder to manage operationally.”

Page 19: John Glendenning - Real time data driven services in the Cloud

DataStax: An Overview• Founded in April 2010

• Home to Apache Cassandra Chair & most committers

• DataStax Enterprise – ‘Certified for Production’ Big Data platform

• 300+ customers

• 100+ employees

• Headquartered in San Francisco Bay area

• European HQ in London, UK

• Funded by prominent venture firms

Page 20: John Glendenning - Real time data driven services in the Cloud

DataStax Enterprise

Cassandra users come to DataStax

For Confidence and Innovation

Page 21: John Glendenning - Real time data driven services in the Cloud

What Innovation?

• Production-certified Cassandra

• Round-the-clock support by the world’s experts

• Your big data system is easy to manage

• Satisfy your top security officer

• Search and analyze your hot data in context

Page 22: John Glendenning - Real time data driven services in the Cloud

Ask Different Things of Your Hot Data

Analyze(Hadoop) Write

Read

Write Search(Solr)

Search(Solr)

Write

Read

DataStaxEnterpriseMulti-Data

Center

Page 23: John Glendenning - Real time data driven services in the Cloud

With the Security You Need

Analyze(Hadoop) Write

Read

Write Search(Solr)

Search(Solr)

Write

Read

Page 24: John Glendenning - Real time data driven services in the Cloud

Into the Mainstream

“Security is very important to us, so we’re naturally very pleased to see all the new security features in DataStax Enterprise 3. Its scalability and performance are enabling us to develop an exciting financial data analytics platform that will create a better experience for our audience.”

Page 25: John Glendenning - Real time data driven services in the Cloud

Managed From a Single Pane

ProvisionMonitorPlanOptimizeRecover

Page 26: John Glendenning - Real time data driven services in the Cloud

CALL FOR PAPERSSPONSORSHIP 30+ SessionsTWO DAYSTRAINING DAY

Cassandra Summit Europe 2013

CALL FOR PAPERSSPONSORSHIP OPPORTUNITY

TWO DAYS30+ SESSIONSTRAINING DAY

London Barbican 2013