RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Upload: redis-labs

Post on 21-Jan-2018

TRANSCRIPT

Page 1: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Title

Visualize, Engage and Analyze Audience

https://near.co/products/allspark

2.0

REDIS POWERS NEXT-GEN AMBIENT INTELLIGENCE PLATFORM

REDISCONF 2017

Madhu Therani, CTO [email protected]

Page 2: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

www.near.co | CONFIDENTIAL**

Company Overview

Near’s Mission

Near is an “Ambient Intelligence” platform that uses massive data and artificial intelligence to understand consumers in smart environments

Unify streaming and static data from multiple sources to map out “consumer” journeys

Page 3: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Varied Data Sources – Connected via Location

Bringing massive data into a unified platform for the most accurate understanding of consumer behavior

Location Data: SINGLE IDENTIFIER

Page 4: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Our Current Platform

Ad Engine

Places API

Near Places DB

ID System

Location Refinement

Batch Analytics

To External Systems

Profile Cache

Count Store

Audience Analytics API

Audience Rules API

Campaign Analytics API

Sources

Profile Store

Product API

People API

ALLSPARK

DaaS APIs

Deduping ID Linkage

Realtime Audience Segmentation

Spatial Counts

Model-based Profile

Realtime Analytics

Page 5: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Our SaaS Product - AllSpark

Enables definition and curation of, and engagement with, “Audiences”

Page 6: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Our requirements - 24 months ago

Scale from 20K to 200K events per sec, with and without location

Map to the physical world - gather and index geo information and tie associated activity to “physical spaces”

Maintain “user” level aggregated identity - accumulate, summarize, analyze, expire

Evaluate utility of inferences in realtime and batch mode

Page 7: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Where are we now?

Process 150K events per sec globally, 6-10 TB of data every day

Global footprint - Washington DC, Amsterdam, Hong Kong, Singapore, Tokyo

Geo-data from 44 countries

1.5 Billion profiles actively managed

AllSpark actively manages nearly 3000 audience segments and 1000+ campaigns per quarter

Page 8: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Role of Redis

Powered incremental development

Scaling of key subsystems

Allowed “exploration” of data structures as platform requirements evolved

Enabled the development of a reliable realtime + batch pipeline - Storm/MR

Page 9: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Key Subsystem - ID System

Powers ID generation for profiles - across sources at different data centers

Deduplication

Opt-out management

ID unification - map multiple channel-specific IDs to a single system-wide AllSpark ID

Metrics - what is the traffic like? Location vs non-location? Source mix?
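
The ID unification and opt-out bullets above can be sketched roughly as follows. This is a minimal illustration, not Near's implementation: the key names (`idmap:`, `idlinks:`, `optout`, `id:next`), the `as-` ID prefix and the `InMemoryStore` stand-in are all assumptions made for the example. The stand-in mimics the handful of redis-py-style calls used (`incr`, `set` with `nx`, `get`, `sadd`, `sismember`) so the sketch runs without a server.

```python
class InMemoryStore:
    """Hypothetical stand-in exposing the few redis-py-style calls used below."""

    def __init__(self):
        self.data = {}
        self.sets = {}

    def incr(self, key):
        self.data[key] = int(self.data.get(key, 0)) + 1
        return self.data[key]

    def set(self, key, value, nx=False):
        if nx and key in self.data:
            return None  # mirrors SET ... NX refusing to overwrite
        self.data[key] = value
        return True

    def get(self, key):
        return self.data.get(key)

    def sadd(self, key, member):
        members = self.sets.setdefault(key, set())
        added = member not in members
        members.add(member)
        return int(added)

    def sismember(self, key, member):
        return member in self.sets.get(key, set())


def resolve_id(store, channel, channel_id):
    """Return the unified ID for a channel-specific ID, minting one if unseen."""
    key = f"idmap:{channel}:{channel_id}"
    existing = store.get(key)
    if existing is not None:
        return existing  # duplicate sighting: reuse the known ID
    candidate = f"as-{store.incr('id:next')}"
    # SET ... NX keeps the first writer's ID if two workers race on one key.
    if store.set(key, candidate, nx=True):
        return candidate
    return store.get(key)


def link_ids(store, unified_id, channel, channel_id):
    """Attach another channel-specific ID to an existing unified ID."""
    store.set(f"idmap:{channel}:{channel_id}", unified_id)
    store.sadd(f"idlinks:{unified_id}", f"{channel}:{channel_id}")


def opt_out(store, unified_id):
    """Record that a unified ID has opted out."""
    store.sadd("optout", unified_id)


def is_opted_out(store, unified_id):
    return store.sismember("optout", unified_id)
```

Against a real server, a redis-py client created with `decode_responses=True` could be passed in place of the stand-in; without that option `get` returns bytes rather than strings.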

Page 10: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Key Subsystem - GridStore

Measuring physical pings at scale

Every point is an HLL (HyperLogLog)

HLLs by hour and by day

For all countries

At multiple resolutions
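
One way to picture "every point is an HLL, by hour, for all countries, at multiple resolutions" is a key per grid cell, hour and resolution. The sketch below is an assumption-laden illustration: the key layout, the two resolutions in `RESOLUTIONS` and the function names are invented for the example. `pfadd` and `pfcount` mirror the Redis HyperLogLog commands PFADD and PFCOUNT; the `ExactSetStub` class replaces them with exact sets so the code runs without a server.

```python
import math
from datetime import datetime, timezone

# Hypothetical resolutions: name -> grid cells per degree.
RESOLUTIONS = {"coarse": 10, "fine": 100}


class ExactSetStub:
    """Stand-in for a Redis client: exact sets instead of HyperLogLogs."""

    def __init__(self):
        self.sets = {}

    def pfadd(self, key, *values):
        bucket = self.sets.setdefault(key, set())
        before = len(bucket)
        bucket.update(values)
        return int(len(bucket) > before)

    def pfcount(self, *keys):
        union = set()
        for key in keys:
            union |= self.sets.get(key, set())
        return len(union)


def cell_keys(country, lat, lon, when):
    """Build one key per resolution for the cell and UTC hour containing a ping."""
    hour = when.astimezone(timezone.utc).strftime("%Y%m%d%H")
    keys = []
    for name, per_degree in RESOLUTIONS.items():
        x = math.floor(lon * per_degree)
        y = math.floor(lat * per_degree)
        keys.append(f"grid:{country}:{name}:{x}:{y}:{hour}")
    return keys


def record_ping(client, country, lat, lon, when, device_id):
    """Add the device to every cell/hour bucket the ping falls in."""
    for key in cell_keys(country, lat, lon, when):
        client.pfadd(key, device_id)


def unique_devices(client, keys):
    """Distinct devices across buckets (PFCOUNT over several keys counts the union)."""
    return client.pfcount(*keys)
```

A daily figure can then be read by passing the 24 hourly keys of a cell to `unique_devices`. With real HyperLogLogs the result is an estimate, in exchange for a small fixed memory cost per key.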

Page 11: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Key Subsystem - CountStore

Counts about Audience Groups

Powers various kinds of demographic and behavioral analysis - which interests/behaviors?

Spatio-temporal distribution - What other locales were visited in a given time period?
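
The spatio-temporal question above ("what other locales were visited in a given time period?") could be answered from per-segment, per-day counters. The slides do not say which Redis structure the CountStore uses, so the hash-per-day layout, the key format and the function names here are assumptions. `hincrby` and `hgetall` follow the Redis hash commands HINCRBY and HGETALL; `HashStub` is a hypothetical in-memory stand-in so the sketch runs on its own.

```python
from collections import defaultdict


class HashStub:
    """Stand-in for a Redis client offering just HINCRBY and HGETALL."""

    def __init__(self):
        self.hashes = {}

    def hincrby(self, name, field, amount=1):
        fields = self.hashes.setdefault(name, {})
        fields[field] = fields.get(field, 0) + amount
        return fields[field]

    def hgetall(self, name):
        return dict(self.hashes.get(name, {}))


def count_visit(client, segment, day, locale):
    """Increment the visit counter of one locale for a segment on one day."""
    client.hincrby(f"count:{segment}:locale:{day}", locale, 1)


def top_locales(client, segment, days, n=3):
    """Sum the daily counters over a period and return the n busiest locales."""
    totals = defaultdict(int)
    for day in days:
        daily = client.hgetall(f"count:{segment}:locale:{day}")
        for locale, count in daily.items():
            totals[locale] += int(count)
    # Highest count first; ties broken alphabetically for a stable order.
    return sorted(totals.items(), key=lambda kv: (-kv[1], kv[0]))[:n]
```

Keeping one hash per day makes it cheap to expire old days independently while still allowing arbitrary date ranges to be summed at read time.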

Page 12: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Key Subsystem - RealTime Bidding Engine

Powering Programmatic Advertising

Metrics on a billion+ events per day

Caches - bid cache, event cache, ad cache

Realtime status for various kinds of optimization

Page 13: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Other Use Cases

Realtime analytics across pipelines

Caches across pipeline - Audience analytics

Powering model-as-a-service - metrics for data science

Shared state across both realtime and batch pipelines

Page 14: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Open source Redis deployment scale

60+ high-end servers

RAM requirements range from 128 GB to 512 GB

DBs are clustered per sub-system

Backup and maintenance with manual processes

Page 15: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Ad hoc Issues

Backups - snapshots versus AOF - how to evaluate

Replication across DCs

Key semantics and management - Key definition embeds a lot of ad hoc meaning
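
One common way to tame keys that "embed a lot of ad hoc meaning" is to build and parse them through a single typed definition rather than by string formatting scattered across services. The slides do not describe Near's approach; the `GridKey` class, its fields and the `grid` prefix below are purely illustrative.

```python
from dataclasses import dataclass

SEP = ":"


@dataclass(frozen=True)
class GridKey:
    """Hypothetical schema for one family of keys: grid:<country>:<resolution>:<cell>:<hour>."""

    country: str
    resolution: str
    cell: str
    hour: str

    def encode(self):
        """Serialize to a key string, rejecting parts that would break parsing."""
        parts = ("grid", self.country, self.resolution, self.cell, self.hour)
        for part in parts:
            if not part or SEP in part:
                raise ValueError(f"invalid key part: {part!r}")
        return SEP.join(parts)

    @classmethod
    def decode(cls, raw):
        """Parse a key string back into its named fields."""
        prefix, *fields = raw.split(SEP)
        if prefix != "grid" or len(fields) != 4:
            raise ValueError(f"not a grid key: {raw!r}")
        return cls(*fields)
```

With the meaning of each position written down in one place, a key such as `grid:SG:fine:10381_135:2017060109` can be decoded by any tool or language that shares the same definition.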

Page 16: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Redis Benefits

Enabled incremental development

Externalization of core data structures - developers of varying ability can develop scalable systems

Developing systems in multiple languages

Sweet spot between relational DBs (MySQL) and NoSQL (Couch/Mongo) - build using Redis, then decide

Page 17: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

What next

Evaluating Enterprise Redis

Reorganizing some of our other datastores - KyotoTycoon, ElasticSearch, Mongo, Cassandra, HBase

ML on the edge - Realtime analytics on samples - alert generation

Page 18: RedisConf17 - Redis Powers Next-gen Ambient Intelligence Platform

Thank You

Acknowledgements

Near’s Tech Team in Bangalore & SFO