big data today and tomorrow
DESCRIPTION
TRANSCRIPT
![Page 1: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/1.jpg)
BIGdatatoday and tomorrow
Mariusz Gil
![Page 2: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/2.jpg)
/ ABOUT ME /
![Page 3: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/3.jpg)
BIG DATAThis talk is about
![Page 4: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/4.jpg)
BIG DATA?What is...
![Page 5: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/5.jpg)
![Page 6: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/6.jpg)
VOLUMElarge amounts of data
![Page 7: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/7.jpg)
VELOCITYneeds to be analyzed quickly
![Page 8: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/8.jpg)
VARIETYdifferent types of structured and unstructured data
![Page 9: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/9.jpg)
Big Data is data that is too large, complex and dynamics for any conventional data tools to capture, store, manage and analyze.
![Page 10: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/10.jpg)
30 billion pieces of content we added past month
![Page 11: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/11.jpg)
more than 2 billion videos were watched yesterday
![Page 12: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/12.jpg)
more than 58 millions messages were send yesterday
![Page 13: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/13.jpg)
WHY?
![Page 14: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/14.jpg)
![Page 15: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/15.jpg)
690 nodes Hadoop cluster for predictions and analytics
![Page 16: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/16.jpg)
HOW?
![Page 17: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/17.jpg)
![Page 18: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/18.jpg)
HBASECOLUMNAR STORAGE
HIVESQL DATA WAREHOUSE ENGINE
AVRODATA SERIALIZATION
MAHOUTSCALABLE MACHINE LEARNING
OOZIEWORKFLOWS ORCHESTRATION
ZOOKEEPERDISTRIBUTED COORDINATION SERVICE
FLUMELOG COLLECTOR
HDFSHADOOP DISTRIBUTED FILE SYSTEM
YARN / MapReduce v2DISTRIBUTED PROCESSING FRAMEWORK
AMBARIPROVISIONING, MANAGING AND MONITORING CLUSTERS
WHIRRRUNNING CLOUD SERVICES
![Page 19: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/19.jpg)
EVOLVE
![Page 20: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/20.jpg)
HADOOP!The future is not only
![Page 21: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/21.jpg)
REALTIMEFuture is low latency and
![Page 22: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/22.jpg)
![Page 23: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/23.jpg)
Apache Drill
Storm
![Page 24: Big data today and tomorrow](https://reader030.vdocument.in/reader030/viewer/2022020207/54b7461e4a79599a288b4577/html5/thumbnails/24.jpg)
BIG THINGData is the next