tokyogo -- capstone project of galvanize dsi
Post on 23-Feb-2017
20 Views
Preview:
TRANSCRIPT
FoursquareAPI Query
Webscraping
AWS S3
Photos
PostgreSQLVenues Mongodb
Users
Tips
Introduction Data Coll ResultsAnalysis
Image Tagging
TensorFlow/ Spark EMR
Feature Extraction
Tipgs
DBScan
NMF
Cluster Model User Interface
Web AppLocationVenue Stats
Introduction Data Coll ResultsAnalysis Discussion
FoursquareAPI Query
Webscraping
AWS S3
Photos
PostgreSQL
Mongodb
Introduction Data Coll ResultsAnalysis Discussion
Tipgs
DBScan
NMF
Recommender Web App
LocationVenues
Tips
Users
Data Storage Features Process
Introduction Data Coll ResultsAnalysis Discussion
Topic tag Top words
theme tour Famous shrine, totoro, national park …
Game or outdoor Video game, sunshine, bandit
History Garden, manju, oldtokyo
Culture Shinkansen, coast, market
theme park and shopping Indianajons, waterfall, waterpark
Next steps:• Improve the model with user – user similarity • Include the seasonality and full tips data• Train a neural network model to tag the image content
Introduction Data Coll ResultsAnalysis Discussion
• The method applied was able to distinguish (to a certain extent) preferences of different groups (local, visitors from other areas in Japan, and forigner travelers).
• My recommender system product of this project will include only the top 200 venues of each visitor source group (sum up to ~ 500 venues) as an toy example that can be deployed on a small amazon instance. The framework can be extended when more data available, and he business features and A/B testing evaluation can be added.
• The NMF analysis indicates visitors to all the venues tend to mention some food, which also indicates that food is an important element that shared among all city attractions! Restaurant recommender is not the topic of this project, but I am expecting to see interesting patterns among different tourist sources in Tokyo.
top related