Using BigQuery as a main Big Data solution

Nikolay Novozhilov Wego.com Using BigQuery as a main Big Data solution

Upload: nikolay-novozhilov

Post on 06-Aug-2015


Category: Data & Analytics



TRANSCRIPT

Page 1: Using BigQuery as a main Big Data solution

Nikolay Novozhilov Wego.com

Using BigQuery as a main Big Data solution

Page 2: Using BigQuery as a main Big Data solution

About Wego

Wego.com is Asia Pacific and the Middle East’s leading flight/hotel metasearch engine used by millions of travelers.

Wego was founded in 2005 in Singapore

Page 3: Using BigQuery as a main Big Data solution

Introducing BigQuery

Service for interactive analysis of massive datasets (TBs)

Query billions of rows: seconds to write, seconds to return

Uses a SQL-style query syntax

It's a service, accessed by a RESTful API

Pay only for what you use

Based on an internal Google tool: Dremel

Column oriented, append only…

Page 4: Using BigQuery as a main Big Data solution

Data architecture in Wego

...

Page 5: Using BigQuery as a main Big Data solution

Why did we do it?

MySQL

“Zoo”

BigQuery

Page 6: Using BigQuery as a main Big Data solution

Why is Hadoop more popular?

Page 7: Using BigQuery as a main Big Data solution

My collection of concerns

Your data goes to the cloud

Not open-source, Google can stop the service

“Strange” pricing model

Hadoop is trending, has bigger community

Append only database

???

Page 8: Using BigQuery as a main Big Data solution

Costs: storage + cost per query

Same fallacy again:

“I want to launch a mom-and-pop shop – let’s buy a building”

“I want to build a site – let’s buy servers”

“I want big data – let’s build a data warehouse”

Usual concerns:

No realistic estimate upfront

“Fear of running a query”
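The "no realistic estimate upfront" concern can be partly addressed with simple arithmetic over the two cost components named on this slide. The sketch below is only an illustration: the `Pricing` class, the `monthly_cost` function and both default prices are assumptions introduced here, not figures from the talk, and should be replaced with the current price list.

```python
from dataclasses import dataclass

GB = 1024 ** 3  # bytes in a gigabyte
TB = 1024 ** 4  # bytes in a terabyte


@dataclass(frozen=True)
class Pricing:
    # Placeholder prices (assumptions, not taken from the slides).
    storage_usd_per_gb_month: float = 0.02
    query_usd_per_tb_scanned: float = 5.0


def monthly_cost(stored_bytes, scanned_bytes_per_query, queries_per_month,
                 pricing=Pricing()):
    """Estimate a monthly bill as storage cost plus per-query scan cost."""
    storage = stored_bytes / GB * pricing.storage_usd_per_gb_month
    queries = (scanned_bytes_per_query / TB) * queries_per_month \
        * pricing.query_usd_per_tb_scanned
    return {"storage": storage, "queries": queries, "total": storage + queries}


# Hypothetical workload: 1 TB stored, 10 queries a month scanning 0.5 TB each.
estimate = monthly_cost(1024 * GB, TB // 2, 10)
print(estimate)
```

Because the storage is column oriented, the scanned size of a query depends on the columns it references rather than on the whole table, so selecting fewer columns lowers the `scanned_bytes_per_query` input.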

Page 9: Using BigQuery as a main Big Data solution

StackOverflow support

53 minutes!

Page 10: Using BigQuery as a main Big Data solution

Append only…

Slowly changing dimensions:

daily re-load from MySQL

daily upload from MySQL, keeping history

Absolutely necessary updates:

do you really need them?

BigQuery allows saving a query result back to the initial table:

Your table

Query
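The two workarounds on this slide can be sketched in plain Python, with a list standing in for an append-only table. This is a minimal illustration under stated assumptions, not the Wego pipeline: the function names, the `snapshot_date` column and the sample rows are all hypothetical.

```python
from datetime import date

history = []  # stands in for an append-only table


def load_snapshot(table, snapshot_date, source_rows):
    """Daily upload, keeping history: append each source row with its date."""
    table.extend({**row, "snapshot_date": snapshot_date} for row in source_rows)


def current_view(table, key="id"):
    """Return the most recent version of each key, ordered by key."""
    latest = {}
    for row in table:
        seen = latest.get(row[key])
        if seen is None or row["snapshot_date"] >= seen["snapshot_date"]:
            latest[row[key]] = row
    return sorted(latest.values(), key=lambda r: r[key])


def rewrite_table(table, query):
    """Emulate an update: run a query over the table and return the result,
    which would then be saved back as the new contents of the same table."""
    return [query(row) for row in table]


# Hypothetical dimension rows as they might arrive from MySQL on two days.
load_snapshot(history, date(2015, 8, 1),
              [{"id": 1, "status": "active"}])
load_snapshot(history, date(2015, 8, 2),
              [{"id": 1, "status": "paused"}, {"id": 2, "status": "active"}])

current = current_view(history)
updated = rewrite_table(
    history, lambda r: {**r, "status": "closed"} if r["id"] == 2 else r)
```

Nothing is modified in place: `history` only grows, the current state is derived at read time, and an "update" is a full rewrite of the table from a query result.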

Page 11: Using BigQuery as a main Big Data solution

Actually useful - “Discovery mode”

Page 12: Using BigQuery as a main Big Data solution

Actually useful

Huge joins

REGEXP_MATCH(), …

Rich SQL - window functions

Nested data
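To make the features on this slide concrete, here is a small Python analogue of what they compute: flattening nested records, filtering with a regular expression, and numbering rows within a partition as a `ROW_NUMBER()` window function would. It is not BigQuery code; the helper names and the sample rows are hypothetical.

```python
import re
from itertools import groupby
from operator import itemgetter

# Hypothetical nested rows: each search carries a repeated "clicks" field.
rows = [
    {"user": "a", "url": "/flights/SIN-DXB",
     "clicks": [{"partner": "p1", "price": 320},
                {"partner": "p2", "price": 300}]},
    {"user": "b", "url": "/hotels/dubai",
     "clicks": [{"partner": "p1", "price": 90}]},
    {"user": "a", "url": "/flights/SIN-BKK",
     "clicks": [{"partner": "p3", "price": 110}]},
]


def flatten(nested_rows):
    """Yield one flat row per nested click."""
    for row in nested_rows:
        for click in row["clicks"]:
            yield {"user": row["user"], "url": row["url"], **click}


def regexp_match(value, pattern):
    """Partial-match regex test, in the spirit of a REGEXP_MATCH predicate."""
    return re.search(pattern, value) is not None


def row_number(flat_rows, partition_key, order_key):
    """Number rows inside each partition, ordered by order_key."""
    ordered = sorted(flat_rows, key=lambda r: (r[partition_key], r[order_key]))
    numbered = []
    for _, group in groupby(ordered, key=itemgetter(partition_key)):
        for n, row in enumerate(group, start=1):
            numbered.append({**row, "row_number": n})
    return numbered


flight_clicks = [r for r in flatten(rows) if regexp_match(r["url"], r"^/flights/")]
ranked = row_number(flight_clicks, "user", "price")
cheapest = [r for r in ranked if r["row_number"] == 1]
```

In SQL the same result is a single query; the point of the sketch is only to show which rows each feature selects.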

Page 13: Using BigQuery as a main Big Data solution

My answer

Page 14: Using BigQuery as a main Big Data solution

What is the Big Data revolution?

There is no difference between big data and small data anymore

Page 15: Using BigQuery as a main Big Data solution

Contacts

Blog: www.novozhilov.co

Email: [email protected]

Page 16: Using BigQuery as a main Big Data solution

“Yes, Sir, I tried to build an ROI case for our BI project - but I couldn’t access any reliable data!”

TimoElliott.com