big data get ready to be agile & to use multiple...

31
Olivier Flebus February 16 th , 2012 Big Data Big Data Get ready to be agile & to use multiple solutions Get ready to be agile & to use multiple solutions February 16 , 2012

Upload: others

Post on 22-Aug-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Olivier Flebus

February 16th, 2012

Big DataBig Data

Get ready to be agile & to use multiple solutionsGet ready to be agile & to use multiple solutions

February 16th, 2012

Page 2: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Olivier Flebus ( [email protected] )

• Managing Enterprise Architect

• Agile Community Leader for Capgemini Aerospace & De fence(Toulouse, France)

CIO Forum Big Data

(Toulouse, France)

• Twitter @olivierflebus

bit.ly/flebus

2

Page 3: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

BigData is really big

Big data:the next chapter ofthe information revolution

bit.ly/pFV1aw

CIO Forum Big Data

3

bit.ly/q8CZ6J

bit.ly/o8xB2E

Page 4: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Capgemini’s BIM Global Service Line

� Business Information Management isbusiness critical

� Turn Data into Competitive advantage through the right Decision-Making

CIO Forum Big Data

through the right Decision-Making

4

“ Drowning in data, Thirsting for knowledge ”

bit.ly/nsQgLk

Page 5: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

What is BigData ?

“Every two days now we create as much information as we did from the dawn of

1. Volume

CIO Forum Big Data

5

as we did from the dawn of civilization up until 2003 .”

Eric Schmidt

bit.ly/oNyJ2otcrn.ch/nfXTAC

Page 6: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

What is BigData ?

• Many more data types (localisation, sensors, etc)

2. Variety

unstructured

structured

CIO Forum Big Data

6

unstructured

Page 7: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

What is BigData ?

• Need for real-time (or business-time)• Data lifecycles are accelerating

3. Velocity

CIO Forum Big Data

7

bit.ly/xCT0Mf

Page 8: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

What is BigData ?

• Business driven• End-to-end approach• Experimental• Time-to-value over

guaranteed success

4. Agility

CIO Forum Big Data

guaranteed success

8

“Big Data analytics must be business led, and not all projects will be successful at finding the needle in the haystack.”

bit.ly/pW9Tj9

Page 9: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

What is BigData ?

BigData is not only about storage & volume

The whole value chain matters, from a business point of view.

analytics 1. Volume

CIO Forum Big Data

9

capturestorage

search sharing

analyticsvisualizing

1. Volume2. Variety3. Velocity4. Agility

Page 10: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Approaches&

CIO Forum Big Data

10© 2012 Capgemini. All rights reserved.

&Solutions

Page 11: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

More demanding environments

MassiveDatasets

FlexibleData model

UnstructuredData

Petabytes

Trillions of records

Complex relationships Schema-less

CIO Forum Big Data

11

AdvancedAnalytics

Real-Time

ScalableI/O &

Processing

What are your needs ?

Trends, StatisticsPredictive modelingSimulation Models

Sub-secondLatency

Thousands of concurrent accesses

Continuous data loading

Page 12: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Big Data, Cloud & Web-scale platforms are real

CIO Forum Big Data

12

bit.ly/Ah9hjY

Page 13: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Transaction-oriented appsof the last century

End-user

Front-End

CIO Forum Big Data

13

SQL

Back-End

Relational Database

Page 14: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

« Classical » BI

End-user

Reporting/AnalysisApplication

Data warehouse

CIO Forum Big Data12

SQL Datamarts

SQL SQL SQLSourcedatabasesSQL SQL

SQL

Data warehouseOLAP

Page 15: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

SQL

CIO Forum Big Data

15

General-purpose.One size fits all.

Page 16: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

NoSQL

CIO Forum Big Data

16

Choose the one that best fits……according to what you need

Page 17: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

What is NoSQL ?

NoSQL = Not only SQL

An familly of data management solutions targeting Big Data requirements

CIO Forum Big Data

17

targeting Big Data requirementsMassiveDatasets

FlexibleData model

AdvancedAnalytics

Real-Time

ScalableI/O &

Processing

UnstructuredData

Page 18: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

NoSQL: modern web-scale databases

CIO Forum Big Data

400-node Cassandra architecture for the analysis of hundreds of millions of

intelligence documents

Yahoo! runs Hadoop on 42,000 nodesholding 180-200 petabytes of data

Page 19: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

NoSQL: 4 datamodels

CIO Forum Big Data

19

Source: Xebia

Key-Value stores

Big Table clones(column family)

Document databases

Graph databases

Page 20: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

NoSQL: additional charateristics

A whole ecosystem. Many kinds of approaches & products.

CIO Forum Big Data

20

http://www.rackspace.com/cloud/blog/2009/11/09/nosq l-ecosystem/

Page 21: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Multiple approaches & solutions

NoSQL

SBAsSearch-Based Applications

Crowdsourcing

Self-service BI

Columnar databases

In-database

Batch-Oriented /Stream-Oriented

CIO Forum Big Data

21© 2012 Capgemini. All rights reserved.

NoSQLIn-database processing

In-Memory

Data-Grid

Appliances (HW+SW)

Page 22: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Hybrid Architectures – Polyglot Persistence

CIO Forum Big Data

22© 2012 Capgemini. All rights reserved.

Selecting one single solution is not the target !

Page 23: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Separate MDM & Big Data approaches

ACIDAtomicity, Consistency,

Isolation, Durability

GovernanceData QualityNormalized

BASEBasically Available, Soft state, Eventually consistent

AgilityScalabilityDenormalized

CIO Forum Big Data

23© 2012 Capgemini. All rights reserved.

MDM(Master Data

Management)

Big Data

Normalized

Data as critical asset

Denormalized

Data as raw materialfor analysis & insight

Page 24: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

CustomerCase

CIO Forum Big Data

24© 2012 Capgemini. All rights reserved.

Case

Page 25: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Customer Case: Aircrafts Data

4890

CIO Forum Big Data

25© 2012 Capgemini. All rights reserved.

4890Flight hours Hours of video

uploaded

(Aircrafts worldwide operations 2010sources: http://www.boeing.com/news/techissues/pdf/statsum.p dfhttp://www.youtube.com/t/press_statistics )

~160 MBytes

bit.ly/ySQR7g

Page 26: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Customer Case: Big Data ���� Big Value

� Leverage Aircraft Data…� … to enable new value-added services…� … delivered in a new way

CIO Forum Big Data

26© 2012 Capgemini. All rights reserved.

Page 27: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Customer Case: Aircraft Product Structure

� Check Data Quality ~100 ms~400 ms

CIO Forum Big Data

� Traverse product structure ~400 ms(extract all DS for an effectivity)

� Manufacturing use case ~7 s(complete BOM for an effectivity)

27© 2012 Capgemini. All rights reserved.

~ 1 hourwith current solution

(RDBMS)

Page 28: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Conclusion

Principles & Trends

� Big Data � Big Opportunity

� An end-to-end approach implying a new mindset• IO, Storage, Query, Analysis all together

� High Diversity & Specialization of the available solutions

CIO Forum Big Data

� High Diversity & Specialization of the available solutions• All major vendors: HP (acquired Vertica), IBM (acquired Netezza), SAP, …

• Many startups: 10gen, Acunu, Basho, Cloudera, Datastax… Zillabyte

• Get ready to use several solutions

� Choose the right tool for the right job…� …but do not take too much time to think

28

Page 29: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach
Page 30: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

Thank you !

[email protected]

@olivierflebusMeet our experts

Benchmark with other customers

CIO Forum Big Data

30© 2012 Capgemini. All rights reserved.

http://www.capgemini.com/architectureweek/

Page 31: Big Data Get ready to be agile & to use multiple solutionsweb.idg.no/app/web/online/Event/CIOforum/2012/BigData/... · 2012. 2. 15. · Big Data Big Opportunity An end-to-end approach

www.capgemini.com

The information contained in this presentation is p roprietary. ©2010 Capgemini. All rights reserved