adatao bigdata summit v02
TRANSCRIPT
-
8/13/2019 Adatao BigData Summit v02
1/43
Big Data, Big OpportunitFrom Buzz to Biz
Presented on December 6, 2013
Christopher Nguyen, PhDCo-Founder & CEO
-
8/13/2019 Adatao BigData Summit v02
2/43
Step 1
Instrument Calibration
-
8/13/2019 Adatao BigData Summit v02
3/43
do you get when you cross
Atlantic Titan
What
-
8/13/2019 Adatao BigData Summit v02
4/43
AbouHalfway
-
8/13/2019 Adatao BigData Summit v02
5/43
do you get when you cross
What
-
8/13/2019 Adatao BigData Summit v02
6/43
Big DataHadoopIf are not enough
Whado we ne
-
8/13/2019 Adatao BigData Summit v02
7/43
WHAT isBig Data ?
-
8/13/2019 Adatao BigData Summit v02
8/43
Huge Volume
High Velocity
Great Variety
Big DataProblemsStandard Denition
-
8/13/2019 Adatao BigData Summit v02
9/43
Learn from Data
Predict the Unknowns from the Knowns
automatic custom
B I G D A T A = P R O B L E M S
BIG DATA + BIG COMPUTE=BIG OPPORTUNITIESAlternative Denition
-
8/13/2019 Adatao BigData Summit v02
10/43
WHATs
the bigdeal?Ive had
big data
since the 60s
-
8/13/2019 Adatao BigData Summit v02
11/43
WHATs
the bigdeal?
Weve had the sun for a fewthousand years,
right?
-
8/13/2019 Adatao BigData Summit v02
12/43
Big Data:The questionisnt WHAT.Its WHY.
-
8/13/2019 Adatao BigData Summit v02
13/43
Competitive Advantages It BringsHolistic business insights See underlying patterns Predict unknowns from knowns Automate decisions
Technology Cost/Benet Threshold
-
8/13/2019 Adatao BigData Summit v02
14/43
WHERE didBig DataTechnologycome from ?
-
8/13/2019 Adatao BigData Summit v02
15/43
98 06 0904
A Timeline ofBig Data Techology
GoogleSearch
Build biggest Index of the
Internet
Jeff Dean
Doug Cutting
MapReduce Paper
Hadoop
Qi Lu
Eric14
Hadoop
MapR Cloudera
-
8/13/2019 Adatao BigData Summit v02
16/43
WHAT SHOULD Big Data AnalyticsLOOK like ?
-
8/13/2019 Adatao BigData Summit v02
17/43
HOW are peopleoperationalizingBig Data
0%
50%
100%
EMA Research
2012 2
% Respondents with Big DAlready in Opera
-
8/13/2019 Adatao BigData Summit v02
18/43
Interactive, Ad Hoc Business Query
Insight Discovery on AggregatedOperational Data
Finance
Engineering Sales
Google BigQuery
Employee Engagement with Operational Data goes thr
-
8/13/2019 Adatao BigData Summit v02
19/43
Mobile AdPlatform
Ad Targeting
CTR Prediction
100+ Million Devices
-
8/13/2019 Adatao BigData Summit v02
20/43
Customer ServiceProvider
Product Recommendation
Cross-channelUser Experience Optimization
-
8/13/2019 Adatao BigData Summit v02
21/43
Are therePa t erns ofBig-DataSUCCESS ?
-
8/13/2019 Adatao BigData Summit v02
22/43
Have aData-drivenCULTURE
-
8/13/2019 Adatao BigData Summit v02
23/43
Jim Barksdale, former Netscape CEO
If we have data ,lets go with that.
If all we have are opinions ,lets go with mine.
-
8/13/2019 Adatao BigData Summit v02
24/43
User Survey Op We prefer 30-result pages to 10-res
EmpiricalThe extra 500ms causes users to search by 2
Thats $12 Billion per ye
Google Search Latency Experiment
Marissa Mayer, Googl
-
8/13/2019 Adatao BigData Summit v02
25/43
VS. 2CentralizedData Service BureauDistributedSelf Service Da ta Tools
-
8/13/2019 Adatao BigData Summit v02
26/43
Why?Users didnt ask enough
Why noFriction too h
CentralizedData Service Bureau
Didnt
work out
-
8/13/2019 Adatao BigData Summit v02
27/43
Team self collect, analyze, & learn from own data
Lower latency to insight
Positive feedback loop to improve tools
Distributed,Self-service Data Tools
-
8/13/2019 Adatao BigData Summit v02
28/43
Watch WHICH pa t ernyour Chief DataOffi cer chooses
-
8/13/2019 Adatao BigData Summit v02
29/43
3Think BIG about Big DataOpportunities
-
8/13/2019 Adatao BigData Summit v02
30/43
Build
-
8/13/2019 Adatao BigData Summit v02
31/43
BuildBiggest Hanga
Longest R
I will make surethe planes comeEric Schmidt, Google then-CEO
-
8/13/2019 Adatao BigData Summit v02
32/43
WHY Think BigaboutBig Data ?
-
8/13/2019 Adatao BigData Summit v02
33/43
Big Data + Machine Learning
Algorithms ModelsData+ =Brain WisdomExperiences+ =
-
8/13/2019 Adatao BigData Summit v02
34/43
Deep Learning Neuron
(Source: http://capone.mtsu.edu/wlangsto/)
Human
Machine
http://capone.mtsu.edu/wlangsto/ -
8/13/2019 Adatao BigData Summit v02
35/43
Deep Learning Neural Networks
Human
Machine
(Source: http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol2/cs11/article2.html)
http://www.doc.ic.ac.uk/~nd/surprise_96/journal/vol2/cs11/article2.html -
8/13/2019 Adatao BigData Summit v02
36/43
Reading Digits in Zip CodesGeo ff Hinton, Yann Lecun, et al.
Demo
http://www.cs.toronto.edu/~hinton/adi/index.htm
http://www.cs.toronto.edu/~hinton/adi/index.htm -
8/13/2019 Adatao BigData Summit v02
37/43
-
8/13/2019 Adatao BigData Summit v02
38/43
Words are Vectors Mikolov et al.
Source: http://gigaom.com/2013/08/16/were-on-the-cusp-of-deep-learning-for-the-masses-you-can-thank
Portugal
-China+
Bejing=
Lisbon
!!!
http://gigaom.com/2013/08/16/were-on-the-cusp-of-deep-learning-for-the-masses-you-can-thank-google-later/ -
8/13/2019 Adatao BigData Summit v02
39/43
Machine Translation Quoc V. Le et
Source: http://arxiv.org/pdf/1309.4168v1.pdf
http://arxiv.org/pdf/1309.4168v1.pdf -
8/13/2019 Adatao BigData Summit v02
40/43
-
8/13/2019 Adatao BigData Summit v02
41/43
Big Data + Big Computewill lead tosuper-human
Machine Intelligencin 10+ y
-
8/13/2019 Adatao BigData Summit v02
42/43
SUMMARY WHAT & WHY of Big Data
EXAMPLES & BEST PRACTICES of Big
EXCITING FUTURE IMPLICATIONS of
-
8/13/2019 Adatao BigData Summit v02
43/43
Thank you!