2.3 Methods for Big Data: What Is “Big Data”? Summarizing Big Data


Post on 24-Dec-2015






  • Slide 1
  • Slide 2
  • 2.3 Methods for Big Data: What is Big Data? Summarizing Big Data
  • Slide 3
  • The Flood of Big Data: 90% of all data created by humankind has been created in the last 2 years.
  • Slide 4
  • Data Creation: Data Flow a Decade Ago; Data Flow Now; Marketing Survey
  • Slide 5
  • What Exactly is BIG DATA? BIG DATA refers to a collection of tools, techniques and technologies that make it possible to work with data at any scale. BIG DATA is less about size, more about flow and velocity.
  • Slide 6
  • The 3 Vs of BIG DATA: (1) Volume: larger than conventional databases can handle; (2) Velocity: high rate at which data is generated, processed and analyzed in real time; (3) Variety: data formats are unstructured and inconsistent.
  • Slide 7
  • Volume
  • Slide 8
  • Walmart collects more than 2.5 petabytes of data every hour from its customer transactions.
  • Slide 9
  • Velocity: Twitter
  • Slide 10
  • Variety: Data formats are unstructured and inconsistent
  • Slide 11
  • Big Data Technologies: http://aws.amazon.com/big-data/ ; https://cloud.google.com/products/bigquery/ ; https://support.google.com/fusiontables/answer/2571232 ; http://www.microsoft.com/en-us/server-cloud/solutions/big-data.aspx. Word walls, word clouds, correlation wheels, heat maps, fusion tables, NoSQL, networks.
  • Slide 12
  • Correlation Wheel (sort of): http://www.bytemuse.com/post/nfl-football-schedule/
  • Slide 13
  • Time Warner Outage 8/27
  • Slide 14
  • End of Section 2.3
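
The slides describe big data as being less about size and more about flow and velocity, and the section title mentions summarizing big data. One way to picture this is a summary that is updated as each record arrives, so the full data set never has to be held in memory. The following is a minimal sketch, not the deck's own method: the class name `RunningSummary`, the helper `summarize`, and the sample transaction amounts are all hypothetical, and the running mean and variance use Welford's online algorithm as an assumed choice of technique.

```python
import math


class RunningSummary:
    """Single-pass summary of a numeric stream (hypothetical helper)."""

    def __init__(self):
        self.count = 0
        self.mean = 0.0
        self._m2 = 0.0          # running sum of squared deviations from the mean
        self.minimum = math.inf
        self.maximum = -math.inf

    def update(self, x):
        # Welford's update: adjust mean and squared deviations with one new value.
        self.count += 1
        delta = x - self.mean
        self.mean += delta / self.count
        self._m2 += delta * (x - self.mean)
        self.minimum = min(self.minimum, x)
        self.maximum = max(self.maximum, x)

    @property
    def variance(self):
        # Sample variance; defined as 0.0 here when fewer than two values were seen.
        return self._m2 / (self.count - 1) if self.count > 1 else 0.0


def summarize(stream):
    """Consume any iterable one value at a time and return its summary."""
    summary = RunningSummary()
    for value in stream:
        summary.update(value)
    return summary


# Made-up transaction amounts, fed through a generator to mimic a stream.
amounts = (float(v) for v in [2, 4, 4, 4, 5, 5, 7, 9])
summary = summarize(amounts)
print(summary.count, summary.mean, summary.variance, summary.minimum, summary.maximum)
```

Because each value is read once and then discarded, memory use stays constant regardless of how many records flow through, which is the property that matters when velocity rather than size is the constraint.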
