tokyogo -- capstone project of galvanize dsi

11
TokyoGo City Attractions Recommender Wan-Ru Yang Dec, 2016

Upload: wan-ru-yang

Post on 23-Feb-2017

40 views

Category:

Data & Analytics


1 download

TRANSCRIPT

Page 1: Tokyogo -- capstone project of Galvanize DSI

TokyoGoCity Attractions Recommender

Wan-Ru YangDec, 2016

Page 2: Tokyogo -- capstone project of Galvanize DSI

Introduction Data Coll ResultsAnalysis

263103 Records

Discussion

Page 3: Tokyogo -- capstone project of Galvanize DSI

FoursquareAPI Query

Webscraping

AWS S3

Photos

PostgreSQLVenues Mongodb

Users

Tips

Introduction Data Coll ResultsAnalysis

Image Tagging

TensorFlow/ Spark EMR

Page 4: Tokyogo -- capstone project of Galvanize DSI

Feature Extraction

Tipgs

DBScan

NMF

Cluster Model User Interface

Web AppLocationVenue Stats

Introduction Data Coll ResultsAnalysis Discussion

Page 5: Tokyogo -- capstone project of Galvanize DSI

FoursquareAPI Query

Webscraping

AWS S3

Photos

PostgreSQL

Mongodb

Introduction Data Coll ResultsAnalysis Discussion

Tipgs

DBScan

NMF

Recommender Web App

LocationVenues

Tips

Users

Data Storage Features Process

Page 6: Tokyogo -- capstone project of Galvanize DSI

Introduction Data Coll ResultsAnalysis Discussion

Page 7: Tokyogo -- capstone project of Galvanize DSI

Introduction Data Coll ResultsAnalysis Discussion

Topic tag Top words

theme tour Famous shrine, totoro, national park …

Game or outdoor Video game, sunshine, bandit

History Garden, manju, oldtokyo

Culture Shinkansen, coast, market

theme park and shopping Indianajons, waterfall, waterpark

Page 8: Tokyogo -- capstone project of Galvanize DSI

Recommended VenuesUser Input

Introduction Data Coll ResultsAnalysis Discussion

Page 9: Tokyogo -- capstone project of Galvanize DSI

Introduction Data Coll ResultsAnalysis Discussion

Page 10: Tokyogo -- capstone project of Galvanize DSI

Next steps:• Improve the model with user – user similarity • Include the seasonality and full tips data• Train a neural network model to tag the image content

Introduction Data Coll ResultsAnalysis Discussion

• The method applied was able to distinguish (to a certain extent) preferences of different groups (local, visitors from other areas in Japan, and forigner travelers).

• My recommender system product of this project will include only the top 200 venues of each visitor source group (sum up to ~ 500 venues) as an toy example that can be deployed on a small amazon instance. The framework can be extended when more data available, and he business features and A/B testing evaluation can be added.

• The NMF analysis indicates visitors to all the venues tend to mention some food, which also indicates that food is an important element that shared among all city attractions! Restaurant recommender is not the topic of this project, but I am expecting to see interesting patterns among different tourist sources in Tokyo.

Page 11: Tokyogo -- capstone project of Galvanize DSI

https://github.com/WanRuYang

https://zuya.siraya.net

https://www.linkedin.com/in/WanRuYang