BIG DATA BY SAIKIRAN PANJALA
![Page 1: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/1.jpg)
BIG DATA
Presented by
XXXXXX12XXXXX
Under the guidance of XXXX
![Page 2: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/2.jpg)
ABSTRACT
Big data is a broad term for data sets so large or complex that traditional data processing applications are inadequate. Challenges include analysis, capture, data curation, search, sharing, storage, transfer, visualization, and information privacy. The term often refers simply to the use of predictive analytics or certain other advanced methods to extract value from data, and seldom to a particular size of data set. Accuracy in big data may lead to more confident decision making, and better decisions can mean greater operational efficiency, cost reductions, and reduced risk.
![Page 3: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/3.jpg)
Big Data Definition
No single standard definition…
“Big Data” is data whose scale, diversity, and complexity require new architecture, techniques, algorithms, and analytics to manage it and extract value and hidden knowledge from it…
![Page 4: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/4.jpg)
Data
What is data?
Types of data:
1. Relational Data
2. Text Data
3. Semi-structured Data
4. Graph Data
5. Streaming Data
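As a rough illustration of the five data types listed on this slide, the sketch below builds one tiny example of each in Python. All names and values (`relational`, `alice`, `sensor_stream`, and so on) are hypothetical and chosen only for illustration; real systems would hold this data in databases, files, or message queues rather than in-memory objects.

```python
import json

# 1. Relational data: rows that all share the same fixed columns.
relational = [
    {"id": 1, "name": "alice", "city": "Pune"},
    {"id": 2, "name": "bob", "city": "Delhi"},
]

# 2. Text data: free-form language with no predefined fields.
text = "Big data is a broad term for large or complex data sets."

# 3. Semi-structured data: self-describing, nested, fields may vary per record.
semi_structured = json.loads(
    '{"user": "alice", "tags": ["big", "data"], "profile": {"age": 30}}'
)

# 4. Graph data: entities (nodes) and the links between them (adjacency list).
graph = {"alice": ["bob"], "bob": ["alice", "carol"], "carol": ["bob"]}

# 5. Streaming data: records that arrive one at a time, modelled as a generator.
def sensor_stream(n):
    for seq in range(n):
        yield {"seq": seq, "value": 20 + seq}

first_three = list(sensor_stream(3))
```

The same fact (for example, that alice knows bob) could be stored in several of these forms; the choice mostly depends on how the data will be queried.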
![Page 5: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/5.jpg)
How much data do we use daily?
640K ought to be enough for anybody.
![Page 6: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/6.jpg)
Big Data Everywhere!
Lots of data is being collected and warehoused:
◦ Web data, e-commerce
◦ Purchases at department/grocery stores
◦ Bank/credit card transactions
◦ Social networks
![Page 7: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/7.jpg)
Maximilien Brice, © CERN
![Page 8: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/8.jpg)
The Model of Generating/Consuming Data has Changed
Old Model: A few companies generate data; all others consume it
New Model: All of us generate data, and all of us consume it
![Page 9: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/9.jpg)
Who’s Generating Big Data?
Social media and networks
Scientific instruments
Mobile devices
Sensor technology and networks
![Page 10: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/10.jpg)
How much data are we using?
![Page 11: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/11.jpg)
The Meaning of Big Data - 3 V’s
• Big Volume
• Big Velocity
• Big Variety
![Page 12: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/12.jpg)
Characteristics of Big Data: 1-Scale (Volume)
Data Volume
◦ 44x increase from 2009 to 2020
◦ From 0.8 zettabytes to 35 zettabytes
Data volume is increasing exponentially
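The figures on this slide can be checked with a little arithmetic. The sketch below recomputes the growth factor from the two volumes and, assuming a constant year-over-year growth rate (an assumption made here to illustrate "exponential" growth, not a claim from the slide), derives the implied annual multiplier.

```python
start_zb = 0.8        # data volume in 2009, in zettabytes (from the slide)
end_zb = 35.0         # projected data volume in 2020, in zettabytes (from the slide)
years = 2020 - 2009   # 11 years

# Overall growth factor: 35 / 0.8 = 43.75, which the slide rounds to 44x.
growth_factor = end_zb / start_zb

# Assumption: growth compounds at the same rate every year.
# Then annual_multiplier ** years == growth_factor.
annual_multiplier = growth_factor ** (1 / years)

print(f"growth factor: {growth_factor:.2f}x")
print(f"implied annual multiplier: {annual_multiplier:.2f}x")
```

Under that assumption the volume grows by roughly 41% per year, which means it doubles about every two years.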
![Page 13: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/13.jpg)
Big Data - Analytics: An Example
Consider the closing prices on all trading days over the last 5 years for two stocks, A and B
What is the covariance between the two time series?
(1/N) * sum over i of [ (A_i - mean(A)) * (B_i - mean(B)) ]
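The covariance formula on this slide translates almost directly into code. The following is a minimal sketch in plain Python; the function name `covariance` and the four-day price series are made up for illustration, whereas the slide's scenario would involve roughly five years of trading days per stock.

```python
def covariance(a, b):
    """Population covariance: (1/N) * sum((A_i - mean(A)) * (B_i - mean(B)))."""
    if len(a) != len(b) or not a:
        raise ValueError("series must be non-empty and of equal length")
    n = len(a)
    mean_a = sum(a) / n
    mean_b = sum(b) / n
    return sum((x - mean_a) * (y - mean_b) for x, y in zip(a, b)) / n

# Hypothetical closing prices for two stocks over four trading days.
prices_a = [10.0, 11.0, 12.0, 13.0]
prices_b = [20.0, 22.0, 21.0, 25.0]

cov_ab = covariance(prices_a, prices_b)
print(cov_ab)  # 1.75
```

A positive value means the two stocks tended to move in the same direction over the period.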
![Page 14: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/14.jpg)
Array Answer
Ignoring the (1/N) and subtracting off the means …
Stock * Stock^T
Now try it for companies headquartered in Charlotte!
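The array formulation can be sketched as follows: put each stock's prices in one row, subtract each row's mean, and multiply the matrix by its own transpose. Entry (i, j) of the result is then the covariance of stocks i and j without the (1/N) factor. This is a plain-Python sketch with hypothetical helper names (`center`, `times_transpose`) and toy prices; a real workload with many stocks would use an array library or an array database instead.

```python
def center(row):
    """Subtract the row's mean from every entry."""
    mean = sum(row) / len(row)
    return [x - mean for x in row]

def times_transpose(matrix):
    """Compute matrix * matrix^T: entry (i, j) is the dot product of rows i and j."""
    return [
        [sum(x * y for x, y in zip(row_i, row_j)) for row_j in matrix]
        for row_i in matrix
    ]

# Hypothetical prices: one row per stock, one column per trading day.
prices = [
    [10.0, 11.0, 12.0, 13.0],  # stock A
    [20.0, 22.0, 21.0, 25.0],  # stock B
]

stock = [center(row) for row in prices]
scatter = times_transpose(stock)   # 2 x 2, symmetric

n_days = len(prices[0])
cov_matrix = [[value / n_days for value in row] for row in scatter]
```

One matrix product answers the question for every pair of stocks at once, which is why the slide frames the query as an array operation rather than a pairwise loop.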
![Page 15: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/15.jpg)
Big Velocity
Trading volume on Wall Street is going through the roof
It is breaking all their infrastructure
And it will just get worse
![Page 16: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/16.jpg)
Characteristics of Big Data: Speed (Velocity)
Data is being generated fast and needs to be processed fast
Online data analytics
Late decisions
Examples:
◦ E-promotions
◦ Healthcare monitoring
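To make the idea of processing data as it arrives more concrete, here is a small sketch in the spirit of the healthcare monitoring example. It keeps only a short sliding window of recent readings and raises an alert as soon as the window average crosses a threshold. The function name `monitor`, the window size, the threshold, and the readings are all hypothetical, not taken from the slides.

```python
from collections import deque

def monitor(readings, window=3, threshold=100.0):
    """Return (index, window_average) for every full window whose average exceeds threshold."""
    recent = deque(maxlen=window)   # older readings are discarded automatically
    alerts = []
    for index, value in enumerate(readings):
        recent.append(value)
        average = sum(recent) / len(recent)
        if len(recent) == window and average > threshold:
            alerts.append((index, average))
    return alerts

# Hypothetical heart-rate readings arriving one at a time.
heart_rates = [72, 75, 80, 110, 120, 125, 90, 70]
alerts = monitor(heart_rates)
alert_positions = [index for index, _ in alerts]
print(alert_positions)  # [4, 5, 6]
```

Because each reading is handled once and only the last few are kept, the decision is available immediately instead of after a later batch job.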
![Page 17: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/17.jpg)
![Page 18: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/18.jpg)
Big Variety
There are three forms of variety:
1. Structured
2. Semi-structured
3. Unstructured
![Page 19: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/19.jpg)
![Page 20: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/20.jpg)
The World of Data Integration
enterprise text data warehouse
the rest of your data
![Page 21: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/21.jpg)
Some Make It 4 V’s
![Page 22: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/22.jpg)
Hadoop
What is Hadoop?
Apache Hadoop is a framework that allows for the distributed processing of large data sets.
It is an open-source data management framework.
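The slide does not show how the distributed processing is organised. One common programming model associated with Hadoop is MapReduce, and the sketch below imitates its three steps (map, shuffle, reduce) in a single Python process with a word count. This is not Hadoop's API; the function names and the two input chunks are hypothetical, and in a real cluster each chunk would be mapped on a different machine.

```python
from collections import defaultdict

def map_phase(chunk):
    """Map: emit a (word, 1) pair for every word in one chunk of input."""
    return [(word.lower(), 1) for word in chunk.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups

def reduce_phase(groups):
    """Reduce: combine the values of each key into one result."""
    return {key: sum(values) for key, values in groups.items()}

# Hypothetical input, already split into chunks as a distributed file system would.
chunks = ["big data is big", "data is everywhere"]

mapped = [pair for chunk in chunks for pair in map_phase(chunk)]
counts = reduce_phase(shuffle(mapped))
print(counts)  # {'big': 2, 'data': 2, 'is': 2, 'everywhere': 1}
```

The map step needs no coordination between chunks, which is what lets this kind of work be spread across many machines.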
![Page 23: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/23.jpg)
Hadoop key characteristics
![Page 24: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/24.jpg)
Hadoop Eco-System
![Page 25: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/25.jpg)
Advantages of big data
Disadvantages of big data
![Page 26: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/26.jpg)
CONCLUSION
The biggest challenge for any big application
Choose wisely and move forward; otherwise you can't get the value of the data
![Page 27: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/27.jpg)
REFERENCES
http://www.edureka.in/blog/the-hype-behind-big-data/
http://en.wikipedia.org/wiki/big-data
![Page 28: BIG DATA BY SAIKIRAN PANJALA](https://reader035.vdocument.in/reader035/viewer/2022062522/58ac2a031a28abf03a8b65e1/html5/thumbnails/28.jpg)