big data

Upload: sameer-sawhney

Post on 19-Nov-2014

Category:

Technology


DESCRIPTION

My recent presentation on what Big Data is, why there is so much hype now, startling facts, the opportunity, history, important research papers such as GFS and MapReduce, technology platforms and organizations, Hadoop, Cassandra, an introduction to Hadoop, and the contributions of Indians working at Google, Cloudera, Hortonworks, Yahoo, Facebook, and Aadhaar to various Big Data technologies. "All your answers lie in data" - @Sameer Sawhney

TRANSCRIPT

Page 1: Big Data

BIG

DATA

Page 2: Big Data

WHY NOW ?

Page 3: Big Data

World’s information totaled over

2 Zettabytes

That’s 2 Trillion Gigabytes

By 2020, this number will be

35 ZB (35 Trillion Gigabytes)
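
The unit conversion on this slide can be checked with a few lines of arithmetic. This is a minimal sketch that assumes decimal (SI) prefixes, so 1 GB is 10^9 bytes and 1 ZB is 10^21 bytes, and that takes the 2020 projection to be 35 zettabytes; the function name `zb_to_gb` is only illustrative.

```python
# Assumption: decimal SI prefixes (1 GB = 10**9 bytes, 1 ZB = 10**21 bytes).
BYTES_PER_GB = 10**9
BYTES_PER_ZB = 10**21
TRILLION = 10**12


def zb_to_gb(zettabytes: int) -> int:
    """Convert a whole number of zettabytes to gigabytes."""
    return zettabytes * BYTES_PER_ZB // BYTES_PER_GB


if __name__ == "__main__":
    for size_zb in (2, 35):
        gigabytes = zb_to_gb(size_zb)
        # Express the result in trillions of gigabytes, as the slide does.
        print(f"{size_zb} ZB = {gigabytes // TRILLION} trillion GB")
```

Under these assumptions one zettabyte is exactly one trillion gigabytes, so the figure in zettabytes and the figure in trillions of gigabytes are the same number.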

Page 4: Big Data

“world’s data is doubling every 1.2 years”
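
To get a feel for what a fixed doubling period implies, the growth can be sketched as a simple exponential. The code below assumes steady doubling every 1.2 years and uses a 2 ZB starting point and a 35 ZB target purely as illustrative inputs; the function names are hypothetical and nothing here is a forecast.

```python
import math


def projected_size(start: float, years: float, doubling_years: float = 1.2) -> float:
    """Size after `years`, assuming it doubles every `doubling_years`."""
    return start * 2 ** (years / doubling_years)


def years_to_reach(start: float, target: float, doubling_years: float = 1.2) -> float:
    """Years needed to grow from `start` to `target` at a fixed doubling period."""
    return doubling_years * math.log2(target / start)


if __name__ == "__main__":
    # Illustrative inputs only: 2 ZB growing to 35 ZB.
    print(f"{years_to_reach(2, 35):.1f} years to grow from 2 ZB to 35 ZB")
    print(f"{projected_size(2, 6):.0f} ZB after 6 years, starting from 2 ZB")
```

Because the growth is exponential, small changes in the doubling period shift the result considerably, so the numbers are best read as orders of magnitude.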

Page 5: Big Data

“80% of this data is unstructured”

Page 6: Big Data

5 Vs

Page 7: Big Data

Money

Page 8: Big Data
Page 9: Big Data

2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 (Today)

Apache Cassandra

Apache Hadoop

Amazon Dynamo

Big Table

Map Reduce

Google File System

Dremel

Impala

Spanner ?

Page 10: Big Data

Analytics

(Hadoop)

Realtime

(“NoSQL”)

Page 11: Big Data

THE ECOSYSTEM

Page 12: Big Data

Hadoop Ecosystem

Page 13: Big Data

Apache Hadoop is an open-source software framework that supports running applications on large clusters of commodity hardware.

Page 14: Big Data
Page 15: Big Data

Replication

Fault Tolerant

Commodity Hardware

Page 16: Big Data

Map Reduce

Page 17: Big Data

Map Reduce

Page 18: Big Data

Word Count
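
Word count is the usual first MapReduce example. The sketch below imitates the three stages (map, shuffle, reduce) in plain Python on a single machine; it does not use the Hadoop API, and the function names `map_phase`, `shuffle`, `reduce_phase`, and `word_count` are illustrative assumptions rather than anything shown on the slide.

```python
from collections import defaultdict
from typing import Dict, Iterable, Iterator, List, Tuple


def map_phase(line: str) -> Iterator[Tuple[str, int]]:
    """Map: emit a (word, 1) pair for every word in one input line."""
    for word in line.lower().split():
        yield word, 1


def shuffle(pairs: Iterable[Tuple[str, int]]) -> Dict[str, List[int]]:
    """Shuffle: group all emitted values by key (the word)."""
    groups: Dict[str, List[int]] = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(word: str, counts: List[int]) -> Tuple[str, int]:
    """Reduce: sum the grouped values for a single word."""
    return word, sum(counts)


def word_count(lines: Iterable[str]) -> Dict[str, int]:
    """Run map, shuffle, and reduce in sequence over the input lines."""
    mapped = (pair for line in lines for pair in map_phase(line))
    grouped = shuffle(mapped)
    return dict(reduce_phase(word, counts) for word, counts in grouped.items())


if __name__ == "__main__":
    print(word_count(["big data", "big clusters"]))
    # prints {'big': 2, 'data': 1, 'clusters': 1}
```

On a real cluster the map and reduce calls would run in parallel on different machines and the shuffle would move data across the network; here everything runs in one process to keep the data flow visible.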

Page 19: Big Data
Page 20: Big Data

World's largest biometric identity platform

2,00,00,00,00,000 (200 billion) Biometric Matches

2 PB Data

Hadoop Stack

Page 21: Big Data

This is just the Beginning of “Big Data Revolution”

Page 22: Big Data

[email protected]

@sameersaw on Twitter

Images

Raymond Bryson Marius B IntelFreePress License Pedro Moura Pinheiro