big data by saikiran panjala

Upload: saikiran-panjala

Post on 21-Feb-2017

TRANSCRIPT

Page 1: BIG DATA BY SAIKIRAN PANJALA

BIG DATA

Presenting By

XXXXXX 12XXXXX

Under the guidance of XXXX

Page 2: BIG DATA BY SAIKIRAN PANJALA

Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, and information privacy. The term often refers simply to the use of predictive analytics or certain other advanced methods to extract value from data, and seldom to a particular size of data set. Accuracy in big data may lead to more confident decision making, and better decisions can mean greater operational efficiency, cost reductions, and reduced risk.

ABSTRACT

Page 3: BIG DATA BY SAIKIRAN PANJALA

No single standard definition…

“Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it…

Big Data Definition

Page 4: BIG DATA BY SAIKIRAN PANJALA

What is data?

Types of data:
1. Relational Data
2. Text Data
3. Semi-structured Data
4. Graph Data
5. Streaming Data

Data

Page 5: BIG DATA BY SAIKIRAN PANJALA

How much data do we use daily?

640K ought to be enough for anybody.

Page 6: BIG DATA BY SAIKIRAN PANJALA

Lots of data is being collected and warehoused:
◦ Web data, e-commerce
◦ Purchases at department/grocery stores
◦ Bank/credit card transactions
◦ Social networks

Big Data Everywhere!

Page 7: BIG DATA BY SAIKIRAN PANJALA

Maximilien Brice, © CERN

Page 8: BIG DATA BY SAIKIRAN PANJALA

The Model of Generating/Consuming Data has Changed

The Model Has Changed…

Old Model: Few companies are generating data, all others are consuming data

New Model: all of us are generating data, and all of us are consuming data

Page 9: BIG DATA BY SAIKIRAN PANJALA

Who’s Generating Big Data?

Social media and networks

Scientific instruments

Mobile devices

Sensor technology and networks

Page 10: BIG DATA BY SAIKIRAN PANJALA

How much data are we using?

Page 11: BIG DATA BY SAIKIRAN PANJALA

The Meaning of Big Data - 3 V’s

•Big Volume

•Big Velocity

•Big Variety

Page 12: BIG DATA BY SAIKIRAN PANJALA

Data Volume
◦ 44x increase from 2009 to 2020
◦ From 0.8 zettabytes to 35 zettabytes

Data volume is increasing exponentially

Characteristics of Big Data: 1-Scale (Volume)
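
As a quick check on the figures quoted for this slide, the sketch below recomputes the overall growth multiple and the constant yearly growth rate that would produce it. The assumption of a constant rate across all eleven years is ours, made only to give "exponentially" a concrete number; the slide does not state it.

```python
# Figures quoted on the slide: 0.8 ZB in 2009, 35 ZB projected for 2020.
start_zb = 0.8
end_zb = 35.0
years = 2020 - 2009

# Overall multiple between the two endpoints (the slide rounds this to 44x).
growth_factor = end_zb / start_zb

# Assumption: growth compounds at one constant rate per year.
annual_rate = growth_factor ** (1 / years) - 1

print(f"overall growth: {growth_factor:.2f}x")
print(f"implied annual growth: {annual_rate:.1%}")
```

Under that assumption the volume grows by roughly two fifths every year, which is what compounding at a fixed rate looks like.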

Page 13: BIG DATA BY SAIKIRAN PANJALA

Consider closing price on all trading days for the last 5 years for two stocks A and B

What is the covariance between the two time-series?

(1/N) * sum over i of [ (A_i - mean(A)) * (B_i - mean(B)) ]

Big Data - Analytics: An Example
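
The covariance formula on this slide can be sketched in a few lines of Python. The function name `covariance` and the price lists are made up for illustration; real closing prices over five years would simply be longer lists.

```python
from statistics import fmean

def covariance(a, b):
    """Population covariance: (1/N) * sum((a_i - mean(a)) * (b_i - mean(b)))."""
    if len(a) != len(b) or not a:
        raise ValueError("both series need the same non-zero length")
    mean_a, mean_b = fmean(a), fmean(b)
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / len(a)

# Hypothetical closing prices for stocks A and B on five trading days.
stock_a = [10.0, 11.0, 12.0, 13.0, 14.0]
stock_b = [20.0, 22.0, 21.0, 25.0, 27.0]

cov_ab = covariance(stock_a, stock_b)
print(cov_ab)
```

A positive result means the two prices tend to move in the same direction on the same days.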

Page 14: BIG DATA BY SAIKIRAN PANJALA

Ignoring the (1/N) and subtracting off the means ….

Stock * Stock^T

Now try it for companies headquartered in Charlotte!

Array Answer
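
The array form generalizes the two-stock case: put one stock per row and one trading day per column, subtract each row's mean, and multiply the matrix by its own transpose. The sketch below does this with plain lists so it needs no libraries; the helper name `covariance_matrix` and the prices are illustrative, and unlike the slide it keeps the (1/N) factor so the entries are actual covariances.

```python
from statistics import fmean

def covariance_matrix(stock):
    """stock: list of rows, one row per stock, one column per trading day."""
    n_days = len(stock[0])
    if n_days == 0 or any(len(row) != n_days for row in stock):
        raise ValueError("every stock needs the same non-zero number of days")

    # Subtract each stock's own mean from its row.
    centered = []
    for row in stock:
        row_mean = fmean(row)
        centered.append([price - row_mean for price in row])

    # Entry (i, j) of centered * centered^T is the dot product of rows i and j.
    return [
        [sum(x * y for x, y in zip(row_i, row_j)) / n_days for row_j in centered]
        for row_i in centered
    ]

# Hypothetical prices: two stocks, five trading days.
prices = [
    [10.0, 11.0, 12.0, 13.0, 14.0],
    [20.0, 22.0, 21.0, 25.0, 27.0],
]
cov = covariance_matrix(prices)
print(cov)
```

The diagonal holds each stock's variance and the off-diagonal entries hold the pairwise covariances, so adding more companies only adds rows to the input.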

Page 15: BIG DATA BY SAIKIRAN PANJALA

Trading volume on Wall Street going through the roof

Breaking all their infrastructure

And it will just get worse

Big Velocity

Page 16: BIG DATA BY SAIKIRAN PANJALA

Data is being generated fast and needs to be processed fast

Online Data Analytics

Late decisions

Examples:
◦ E-Promotions
◦ Healthcare monitoring

Characteristics of Big Data: Speed (Velocity)
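
One way to picture processing data as fast as it arrives is a rolling calculation that updates with every new reading instead of waiting for a complete batch. The sketch below is only an illustration of that idea for the healthcare monitoring example: the function name `monitor`, the window size, the threshold, and the readings are all invented.

```python
from collections import deque

def monitor(readings, window=3, threshold=100.0):
    """Yield (reading, rolling_mean, alert) as each reading arrives."""
    recent = deque(maxlen=window)  # keeps only the last `window` readings
    for value in readings:
        recent.append(value)
        rolling_mean = sum(recent) / len(recent)
        yield value, rolling_mean, rolling_mean > threshold

# Hypothetical heart-rate readings arriving one at a time.
heart_rate = [72, 75, 74, 110, 125, 130, 90]

alerts = [value for value, mean, alert in monitor(heart_rate) if alert]
print(alerts)
```

Because `monitor` is a generator, each decision is available as soon as its reading is consumed, and memory use stays bounded by the window size no matter how long the stream runs.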

Page 17: BIG DATA BY SAIKIRAN PANJALA
Page 18: BIG DATA BY SAIKIRAN PANJALA

There are three forms of variety:
1. Structured
2. Semi-structured
3. Unstructured

Big Variety
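
The three forms differ mainly in how much schema the data carries with it. The sketch below shows one made-up sample of each and the kind of handling it typically needs; the field names and values are hypothetical, not taken from the slides.

```python
import csv
import io
import json
import re

# Structured: fixed columns, every row has the same fields.
structured = "customer_id,amount\n101,25.50\n102,40.00\n"
rows = list(csv.DictReader(io.StringIO(structured)))
total = sum(float(row["amount"]) for row in rows)

# Semi-structured: self-describing, but fields may be nested or missing.
semi_structured = '{"customer_id": 101, "tags": ["grocery"], "address": {"city": "Charlotte"}}'
record = json.loads(semi_structured)
city = record.get("address", {}).get("city")
phone = record.get("phone")  # absent field, so this is None

# Unstructured: free text with no schema; meaning has to be extracted.
unstructured = "Customer 101 called to say the delivery was late again."
mentions_late = re.search(r"\blate\b", unstructured) is not None

print(total, city, phone, mentions_late)
```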

Page 19: BIG DATA BY SAIKIRAN PANJALA
Page 20: BIG DATA BY SAIKIRAN PANJALA

The World of Data Integration

[Figure labels: enterprise text data warehouse; the rest of your data]

Page 21: BIG DATA BY SAIKIRAN PANJALA

Some Make it 4V’s

Page 22: BIG DATA BY SAIKIRAN PANJALA

What is Hadoop?

Apache Hadoop is a framework that allows for the distributed processing of large data sets.

It is an open-source data management framework.

Hadoop
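
Hadoop's best-known processing model is MapReduce, which the slide does not name. The sketch below imitates its three steps (map, shuffle, reduce) in a single Python process to show the shape of a distributed word count. It does not call any Hadoop API; the function names and input chunks are illustrative, and on a real cluster each chunk would be handled on a different machine.

```python
from collections import defaultdict

def map_phase(chunk):
    """Emit a (word, 1) pair for every word in one chunk of input."""
    return [(word.lower(), 1) for word in chunk.split()]

def shuffle(pairs):
    """Group all emitted values by key, as the framework would between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(grouped):
    """Combine the values for each key into a single count."""
    return {key: sum(values) for key, values in grouped.items()}

# Hypothetical input, already split into chunks that could be mapped in parallel.
chunks = ["big data is big", "data is everywhere"]

mapped = [pair for chunk in chunks for pair in map_phase(chunk)]
counts = reduce_phase(shuffle(mapped))
print(counts)
```

The map step never needs to see more than its own chunk, which is what lets the work spread across many machines.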

Page 23: BIG DATA BY SAIKIRAN PANJALA

Hadoop key characteristics

Page 24: BIG DATA BY SAIKIRAN PANJALA

Hadoop Eco-System

Page 25: BIG DATA BY SAIKIRAN PANJALA

Advantages of big data

Disadvantages of big data

Page 26: BIG DATA BY SAIKIRAN PANJALA

The biggest challenge for any big application

Choose wisely and move forward; otherwise you can't get the value out of the data

CONCLUSION

Page 27: BIG DATA BY SAIKIRAN PANJALA

http://www.edureka.in/blog/the-hype-behind-big-data/

http://en.wikipedia.org/wiki/big-data

REFERENCES

Page 28: BIG DATA BY SAIKIRAN PANJALA