Page 1: Hadoop For Microsoft Devssddconf.com/.../Hadoop_KickStarter_For_Microsoft_Devs.pdf · 2015-05-05 · Storing & Querying Big Data in Hadoop Distributed File System ( HDFS ) Unstructured

Hadoop Kickstarter For Microsoft Devs

By Gary Short

Duncodin Limited

www.duncodin.it

Introduction

• Gary Short

• Microsoft MVP C#

• Freelance data scientist

• Big Data architect / engineer

• HDInsight / Hadoop / Pig / Hive

• Predictive Analytics

• Machine Vision

• Computational Linguistics

• [email protected]

• @garyshort

Image © @Blackmarble

Agenda

• What problem does Hadoop solve?

• How do I install it?

• How do I get my C# code running on it?

• Questions?

Demo – What Problem Does Hadoop Solve?

You Just Swapped One Set of Problems For Another!

Hadoop Architecture – Data Storage

Hadoop Architecture – Map Reduce

How do I Install It?

How do I get my C# Code Running?

Say “word count” one more time!

Which one will win?

Demo - Streaming
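
The demo code is not part of this transcript, so the following is only a minimal sketch of the word count job that Hadoop Streaming examples usually start with. Streaming treats the mapper and the reducer as ordinary executables: each reads lines from standard input and writes tab-separated key/value lines to standard output, and Hadoop sorts the mapper output by key before handing it to the reducer. The sketch is in Python for brevity; a C# console application reading standard input and writing standard output would follow the same contract. The names `map_lines`, `reduce_lines`, and `main` are hypothetical, chosen for this illustration.

```python
import sys
from itertools import groupby


def map_lines(lines):
    """Mapper: emit one 'word<TAB>1' record for every word seen."""
    for line in lines:
        for word in line.lower().split():
            yield f"{word}\t1"


def reduce_lines(sorted_records):
    """Reducer: sum the counts of consecutive records that share a key.

    Relies on the input being sorted by key, which Hadoop Streaming
    guarantees for reducer input, so equal words arrive next to each other.
    """
    pairs = (record.rstrip("\n").split("\t", 1) for record in sorted_records)
    for word, group in groupby(pairs, key=lambda pair: pair[0]):
        total = sum(int(count) for _, count in group)
        yield f"{word}\t{total}"


def main(role, stdin=sys.stdin, stdout=sys.stdout):
    """Run one stage over stdin/stdout; role is 'map' or 'reduce' (hypothetical convention)."""
    step = map_lines if role == "map" else reduce_lines
    for record in step(stdin):
        stdout.write(record + "\n")
```

To try the logic locally without a cluster, pipe the mapper output through a sort before the reducer, for example `reduce_lines(sorted(map_lines(lines)))`. On a cluster, the two stages would be passed to the streaming job through its `-mapper` and `-reducer` options, with a small script calling `main("map")` or `main("reduce")`.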

I’m Not Gonna Lie, That Was a Ballache. Is there an easier way?

Demo - SDK

Questions?

• Gary Short

• Duncodin Limited

• Freelance data scientist

• Big Data architect / engineer

• www.duncodin.it

• [email protected]

• @garyshort

Image © @Blackmarble