data science presentation

VacAdvisor. Fred N. Kiwanuka, Fellow, Insight Data Science. August 6, 2015.

Upload: noah-kiwanuka

Post on 17-Aug-2015

TRANSCRIPT

Page 1: Data Science Presentation

VacAdvisor

Fred N. Kiwanuka, Fellow, Insight Data Science

August 6, 2015

Fred N. Kiwanuka Fellow Insight Data Science VacAdvisor

Page 2: Data Science Presentation

What are my options for a vacation on a specified budget?

Page 3: Data Science Presentation

The Data

Page 4: Data Science Presentation

The Data

Page 5: Data Science Presentation

The Data

Page 6: Data Science Presentation

The Data

Page 7: Data Science Presentation

Conceptual Framework

Page 8: Data Science Presentation

Algorithm: Clustering

Page 9: Data Science Presentation

Algorithm: Clustering

Page 10: Data Science Presentation

Algorithm: Clustering

Page 11: Data Science Presentation

Cluster Validation

Page 12: Data Science Presentation

Cluster Validation

Table: (Cluster Validation)

Number of Clusters   WSS (10^3)   City      Cities Closest to Centroid
2                    10.32        Seattle   Detroit, Charlotte, South Bend
3                    9.93         Seattle   Boston, Phoenix, Detroit
4                    9.91         Seattle   Charlotte, South Bend, Minneapolis
5                    8.40         Seattle   Detroit, Charlotte, South Bend
7                    9.62         Seattle   Detroit, Charlotte, South Bend
10                   9.63         Seattle   Sacramento, San Jose, Columbus
12                   9.52         Seattle   Sacramento, San Jose, Columbus
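
The WSS column in the table above is the within-cluster sum of squares, the quantity k-means minimizes. The sketch below shows one way such a table could be produced, and is not the presenter's code. It assumes plain Lloyd's k-means, invented two-dimensional toy points with placeholder names `a` to `f`, and a hypothetical helper `closest_to_centroid` that lists the members nearest the centroid of the query's cluster, excluding the query itself.

```python
import math
import random


def assign(points, centroids):
    """Index of the nearest centroid for every point."""
    return [min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points]


def kmeans(points, k, iters=100, seed=0):
    """Plain Lloyd's k-means on tuples of floats; returns the centroids."""
    centroids = random.Random(seed).sample(points, k)
    for _ in range(iters):
        labels = assign(points, centroids)
        updated = []
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            # Move the centroid to the mean of its members (keep it if empty).
            updated.append(tuple(sum(axis) / len(members) for axis in zip(*members))
                           if members else centroids[j])
        if updated == centroids:
            break
        centroids = updated
    return centroids


def wss(points, centroids):
    """Within-cluster sum of squares: squared distance to the nearest centroid."""
    return sum(min(math.dist(p, c) for c in centroids) ** 2 for p in points)


def closest_to_centroid(names, points, centroids, query, n=3):
    """Names of the n points nearest the centroid of the query's cluster."""
    labels = assign(points, centroids)
    target = labels[names.index(query)]
    ranked = sorted((math.dist(p, centroids[lab]), name)
                    for name, p, lab in zip(names, points, labels)
                    if lab == target and name != query)
    return [name for _, name in ranked[:n]]


# Hypothetical toy data: two well-separated groups of "cities".
names = ["a", "b", "c", "d", "e", "f"]
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]

for k in (1, 2, 3):
    print(k, round(wss(points, kmeans(points, k)), 3))
```

With a single fixed initialization, WSS need not fall monotonically as the number of clusters grows, so in practice several seeds per k are usually compared before reading an elbow off such a table.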

Page 13: Data Science Presentation

Silhouette Score

Page 14: Data Science Presentation

Silhouette Score

Page 15: Data Science Presentation

Silhouette Score

Page 16: Data Science Presentation

Silhouette Score

Page 17: Data Science Presentation

Silhouette Score
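
The silhouette slides survive only as titles, so the following is a generic sketch of the standard silhouette definition rather than the analysis shown in the talk. For each point, a is the mean distance to the other members of its own cluster, b is the smallest mean distance to the members of any other cluster, and the point's score is (b - a) / max(a, b). The toy points and labels are invented for illustration.

```python
import math


def silhouette_score(points, labels):
    """Mean silhouette over all points, using Euclidean distance."""
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            scores.append(0.0)  # usual convention for singleton clusters
            continue
        # a: mean distance to the other members of the point's own cluster
        # (the point's zero distance to itself does not change the sum).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(sum(math.dist(p, q) for q in other) / len(other)
                for other_lab, other in clusters.items() if other_lab != lab)
        scores.append(0.0 if max(a, b) == 0 else (b - a) / max(a, b))
    return sum(scores) / len(scores)


# Hypothetical toy data: two tight, well-separated groups.
toy_points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
              (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
good_labels = [0, 0, 0, 1, 1, 1]
mixed_labels = [0, 1, 0, 1, 0, 1]

print(silhouette_score(toy_points, good_labels))
print(silhouette_score(toy_points, mixed_labels))
```

Scores near 1 indicate compact, well-separated clusters. Scores near 0 or below suggest overlapping or misassigned points.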

Page 18: Data Science Presentation

Cluster Initialization and Validation

Table: (Cluster Initialization and Validation)

Algorithm       Time (s)   Homogeneity   Completeness   V-measure   ARI     AMI
k-means         0.03       0.971         0.971          0.971       0.988   0.970
VQ              0.04       1.000         1.000          1.000       1.000   1.000
PCA + k-means   0.00       1.000         1.000          1.000       1.000   1.000
Mean Shift      0.24       1.000         0.970          0.972       0.980   0.972
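
The homogeneity, completeness, and V-measure columns in the table above compare predicted cluster labels against reference labels. The sketch below shows the standard entropy-based definitions and is not the code used for the talk. Homogeneity is 1 - H(C|K) / H(C), completeness is 1 - H(K|C) / H(K), and V-measure is their harmonic mean, where C are the reference classes and K the predicted clusters. The label lists are invented examples.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy H(labels) in nats."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def conditional_entropy(target, given):
    """H(target | given) in nats."""
    n = len(target)
    joint = Counter(zip(given, target))
    given_counts = Counter(given)
    return -sum((c / n) * math.log(c / given_counts[g])
                for (g, _), c in joint.items())


def homogeneity_completeness_v(true_labels, pred_labels):
    h_true = entropy(true_labels)
    h_pred = entropy(pred_labels)
    # Homogeneity: each cluster contains members of a single class.
    homogeneity = (1.0 if h_true == 0
                   else 1.0 - conditional_entropy(true_labels, pred_labels) / h_true)
    # Completeness: all members of a class land in the same cluster.
    completeness = (1.0 if h_pred == 0
                    else 1.0 - conditional_entropy(pred_labels, true_labels) / h_pred)
    total = homogeneity + completeness
    v_measure = 0.0 if total == 0 else 2.0 * homogeneity * completeness / total
    return homogeneity, completeness, v_measure


# Cluster ids are arbitrary, so a relabelled perfect clustering scores 1.0 throughout.
print(homogeneity_completeness_v([0, 0, 1, 1], [1, 1, 0, 0]))
# Splitting every class into singletons stays homogeneous but loses completeness.
print(homogeneity_completeness_v([0, 0, 1, 1], [0, 1, 2, 3]))
```

A row with homogeneity at 1.000 and completeness slightly lower, like the Mean Shift row, is what these definitions give when clusters are pure but at least one class is split across clusters.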

Page 19: Data Science Presentation

Fred N. Kiwanuka

PhD (Groningen), MSc (London), MIT (Fellow)

Page 20: Data Science Presentation

Mobile Malaria Diagnosis

Page 21: Data Science Presentation

Classification Challenge

Page 22: Data Science Presentation

Feature Engineering

Number of images: 50,000, with a 60-feature vector for each image

Perimeter

Moment of Inertia [4 features]

Elongation

Jaggedness

Circularity

Moment features [9 features]
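
The slide lists the shape features without defining them, so the sketch below rests on common textbook definitions and may differ from the features actually used. It assumes an object outline given as a closed polygon of (x, y) vertices, computes the perimeter and the shoelace area, and takes circularity as 4 * pi * area / perimeter^2, which is 1 for a perfect circle and smaller for less compact shapes. Elongation, jaggedness, and the moment features are left out because their definitions vary.

```python
import math


def perimeter(vertices):
    """Length of the closed polygon through the vertices, in order."""
    n = len(vertices)
    return sum(math.dist(vertices[i], vertices[(i + 1) % n]) for i in range(n))


def area(vertices):
    """Polygon area by the shoelace formula (vertex order may be either way)."""
    n = len(vertices)
    twice_area = sum(vertices[i][0] * vertices[(i + 1) % n][1]
                     - vertices[(i + 1) % n][0] * vertices[i][1]
                     for i in range(n))
    return abs(twice_area) / 2.0


def circularity(vertices):
    """4 * pi * area / perimeter**2."""
    return 4.0 * math.pi * area(vertices) / perimeter(vertices) ** 2


# Hypothetical outlines: a unit square, a 3 x 1 rectangle, and a 360-sided
# polygon approximating a circle of radius 1.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]
rectangle = [(0.0, 0.0), (3.0, 0.0), (3.0, 1.0), (0.0, 1.0)]
near_circle = [(math.cos(2 * math.pi * i / 360), math.sin(2 * math.pi * i / 360))
               for i in range(360)]

for outline in (square, rectangle, near_circle):
    print(round(circularity(outline), 4))
```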

Page 23: Data Science Presentation

Results
