adatao bigdata summit v02

Upload: chewable

Post on 03-Jun-2018

218 views

Category:

Documents


0 download

TRANSCRIPT

  • 8/13/2019 Adatao BigData Summit v02

    1/43

    Big Data, Big OpportunitFrom Buzz to Biz

    Presented on December 6, 2013

    Christopher Nguyen, PhDCo-Founder & CEO

  • 8/13/2019 Adatao BigData Summit v02

    2/43

    Step 1

    Instrument Calibration

  • 8/13/2019 Adatao BigData Summit v02

    3/43

    do you get when you cross

    Atlantic Titan

    What

  • 8/13/2019 Adatao BigData Summit v02

    4/43

    AbouHalfway

  • 8/13/2019 Adatao BigData Summit v02

    5/43

    do you get when you cross

    What

  • 8/13/2019 Adatao BigData Summit v02

    6/43

    Big DataHadoopIf are not enough

    Whado we ne

  • 8/13/2019 Adatao BigData Summit v02

    7/43

    WHAT isBig Data ?

  • 8/13/2019 Adatao BigData Summit v02

    8/43

    Huge Volume

    High Velocity

    Great Variety

    Big DataProblemsStandard Denition

  • 8/13/2019 Adatao BigData Summit v02

    9/43

    Learn from Data

    Predict the Unknowns from the Knowns

    automatic custom

    B I G D A T A = P R O B L E M S

    BIG DATA + BIG COMPUTE=BIG OPPORTUNITIESAlternative Denition

  • 8/13/2019 Adatao BigData Summit v02

    10/43

    WHATs

    the bigdeal?Ive had

    big data

    since the 60s

  • 8/13/2019 Adatao BigData Summit v02

    11/43

    WHATs

    the bigdeal?

    Weve had the sun for a fewthousand years,

    right?

  • 8/13/2019 Adatao BigData Summit v02

    12/43

    Big Data:The questionisnt WHAT.Its WHY.

  • 8/13/2019 Adatao BigData Summit v02

    13/43

    Competitive Advantages It BringsHolistic business insights See underlying patterns Predict unknowns from knowns Automate decisions

    Technology Cost/Benet Threshold

  • 8/13/2019 Adatao BigData Summit v02

    14/43

    WHERE didBig DataTechnologycome from ?

  • 8/13/2019 Adatao BigData Summit v02

    15/43

    98 06 0904

    A Timeline ofBig Data Techology

    GoogleSearch

    Build biggest Index of the

    Internet

    Jeff Dean

    Doug Cutting

    MapReduce Paper

    Hadoop

    Qi Lu

    Eric14

    Hadoop

    MapR Cloudera

  • 8/13/2019 Adatao BigData Summit v02

    16/43

    WHAT SHOULD Big Data AnalyticsLOOK like ?

  • 8/13/2019 Adatao BigData Summit v02

    17/43

    HOW are peopleoperationalizingBig Data

    0%

    50%

    100%

    EMA Research

    2012 2

    % Respondents with Big DAlready in Opera

  • 8/13/2019 Adatao BigData Summit v02

    18/43

    Interactive, Ad Hoc Business Query

    Insight Discovery on AggregatedOperational Data

    Finance

    Engineering Sales

    Google BigQuery

    Employee Engagement with Operational Data goes thr

  • 8/13/2019 Adatao BigData Summit v02

    19/43

    Mobile AdPlatform

    Ad Targeting

    CTR Prediction

    100+ Million Devices

  • 8/13/2019 Adatao BigData Summit v02

    20/43

    Customer ServiceProvider

    Product Recommendation

    Cross-channelUser Experience Optimization

  • 8/13/2019 Adatao BigData Summit v02

    21/43

    Are therePa t erns ofBig-DataSUCCESS ?

  • 8/13/2019 Adatao BigData Summit v02

    22/43

    Have aData-drivenCULTURE

  • 8/13/2019 Adatao BigData Summit v02

    23/43

    Jim Barksdale, former Netscape CEO

    If we have data ,lets go with that.

    If all we have are opinions ,lets go with mine.

  • 8/13/2019 Adatao BigData Summit v02

    24/43

    User Survey Op We prefer 30-result pages to 10-res

    EmpiricalThe extra 500ms causes users to search by 2

    Thats $12 Billion per ye

    Google Search Latency Experiment

    Marissa Mayer, Googl

  • 8/13/2019 Adatao BigData Summit v02

    25/43

    VS. 2CentralizedData Service BureauDistributedSelf Service Da ta Tools

  • 8/13/2019 Adatao BigData Summit v02

    26/43

    Why?Users didnt ask enough

    Why noFriction too h

    CentralizedData Service Bureau

    Didnt

    work out

  • 8/13/2019 Adatao BigData Summit v02

    27/43

    Team self collect, analyze, & learn from own data

    Lower latency to insight

    Positive feedback loop to improve tools

    Distributed,Self-service Data Tools

  • 8/13/2019 Adatao BigData Summit v02

    28/43

    Watch WHICH pa t ernyour Chief DataOffi cer chooses

  • 8/13/2019 Adatao BigData Summit v02

    29/43

    3Think BIG about Big DataOpportunities

  • 8/13/2019 Adatao BigData Summit v02

    30/43

    Build

  • 8/13/2019 Adatao BigData Summit v02

    31/43

    BuildBiggest Hanga

    Longest R

    I will make surethe planes comeEric Schmidt, Google then-CEO

  • 8/13/2019 Adatao BigData Summit v02

    32/43

    WHY Think BigaboutBig Data ?

  • 8/13/2019 Adatao BigData Summit v02

    33/43

    Big Data + Machine Learning

    Algorithms ModelsData+ =Brain WisdomExperiences+ =

  • 8/13/2019 Adatao BigData Summit v02

    34/43

    Deep Learning Neuron

    (Source: http://capone.mtsu.edu/wlangsto/)

    Human

    Machine

    http://capone.mtsu.edu/wlangsto/
  • 8/13/2019 Adatao BigData Summit v02

    35/43

    Deep Learning Neural Networks

    Human

    Machine

    (Source: http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol2/cs11/article2.html)

    http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol2/cs11/article2.html
  • 8/13/2019 Adatao BigData Summit v02

    36/43

    Reading Digits in Zip CodesGeo ff Hinton, Yann Lecun, et al.

    Demo

    http://www.cs.toronto.edu/~hinton/adi/index.htm

    http://www.cs.toronto.edu/~hinton/adi/index.htm
  • 8/13/2019 Adatao BigData Summit v02

    37/43

  • 8/13/2019 Adatao BigData Summit v02

    38/43

    Words are Vectors Mikolov et al.

    Source: http://gigaom.com/2013/08/16/were-on-the-cusp-of-deep-learning-for-the-masses-you-can-thank

    Portugal

    -China+

    Bejing=

    Lisbon

    !!!

    http://gigaom.com/2013/08/16/were-on-the-cusp-of-deep-learning-for-the-masses-you-can-thank-google-later/
  • 8/13/2019 Adatao BigData Summit v02

    39/43

    Machine Translation Quoc V. Le et

    Source: http://arxiv.org/pdf/1309.4168v1.pdf

    http://arxiv.org/pdf/1309.4168v1.pdf
  • 8/13/2019 Adatao BigData Summit v02

    40/43

  • 8/13/2019 Adatao BigData Summit v02

    41/43

    Big Data + Big Computewill lead tosuper-human

    Machine Intelligencin 10+ y

  • 8/13/2019 Adatao BigData Summit v02

    42/43

    SUMMARY WHAT & WHY of Big Data

    EXAMPLES & BEST PRACTICES of Big

    EXCITING FUTURE IMPLICATIONS of

  • 8/13/2019 Adatao BigData Summit v02

    43/43

    Thank you!