intro to big data

14
Intro to Big Data On Premise Presented by: Jon Bloom Senior Consultant, Agile Bay, Inc.

Upload: jonathan-bloom

Post on 27-May-2015

502 views

Category:

Technology


2 download

DESCRIPTION

Introduction to Big Data.

TRANSCRIPT

Page 1: Intro to Big Data

Intro to Big Data On Premise

Presented by: Jon BloomSenior Consultant, Agile Bay, Inc.

Page 2: Intro to Big Data

Jon BloomBlog: http://www.bloomconsultingbi.com

Twitter: @sqljon

Linked-in: http://www.linkedin.com/in/BloomConsultingBI

Email: [email protected]

Customers & Partners

Page 3: Intro to Big Data

w w w . a g i l e b a y . c o m

Page 4: Intro to Big Data

Session AgendaWhat is Big Data?What is Hadoop?BI vs. HadoopDemo:

Page 5: Intro to Big Data

Terms and Acronyms Hadoop:

Apache project (open source) project to develop software for reliable, scalable, distributed computing.

Cluster: A group of computers (nodes) linked together to perform a highly-available and high computation work

HDFS distributed file system that provides high-throughput access to application data.

YARNA framework for job scheduling and cluster resource management.

MapReduce A system for parallel processing of large data sets.

Page 6: Intro to Big Data

What is Big Data?

Page 7: Intro to Big Data

What is Big Data?Volume, Velocity, Variety

Page 8: Intro to Big Data

What is Hadoop?

Page 9: Intro to Big Data

What is HadoopApache open source project Batch Oriented Parallel Processing across

Commodity Servers Ecosystem

• Ambari• HBase• Avro• Cassandra• Chukwa

• Hive• Mahout• Pig• ZooKeeper

Page 10: Intro to Big Data

Distributed Computing & MapReduce

MapperReducer

Page 11: Intro to Big Data

BI vs. Hadoop?

Page 12: Intro to Big Data

BI vs. HadoopHadoop not a replacement of BIExtends BI capabilitiesBI = Scale up to 100s of GigabytesHadoop = From 100s of Gygabytes to

Terabytes (1,000s og Gygabytes) and Terabytes (1,000,000 Gigabytes)

Page 13: Intro to Big Data

Demo

Page 14: Intro to Big Data

Thank you for attending!Q & A

Blog: www.bloomconsultingbi.comTwitter: @sqljon

Linked-in: http://www.linkedin.com/in/BloomConsultingBI

Email: [email protected]