Using IPython to find correlation

Posted on 14-Apr-2017

Category: Data & Analytics

CORRELATION BETWEEN HOTEL PRICE V. VOTING WITH PYTHON AND PANDAS

BY APIPOJ PIASAK

• A sample of applying data science to real-world data.

• How to use a scatter plot to find a relationship between room price and voting.

• Checking whether the price range is associated with high votes.
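The scatter-plot idea in the bullets above can be sketched as follows. This is a minimal illustration, not the author's notebook code: the column names `price` and `vote` and all the numbers are made up, because the real hotel data is not disclosed. The `Agg` backend line is only there so the sketch runs without a display; inside an IPython Notebook it can be dropped.

```python
import matplotlib
matplotlib.use("Agg")  # non-interactive backend; not needed inside a notebook
import pandas as pd

# Made-up illustrative rows (assumption): price in THB, vote on a 0-10 scale.
hotels = pd.DataFrame({
    "price": [450, 800, 1200, 1500, 2500, 3000, 6000],
    "vote": [7.1, 8.0, 8.4, 8.2, 8.6, 8.3, 7.5],
})

# One point per hotel: price on the x-axis, vote on the y-axis.
ax = hotels.plot.scatter(x="price", y="vote")
ax.set_xlabel("Room price (THB)")
ax.set_ylabel("Vote")

# Pearson correlation coefficient between the two columns.
pearson_r = hotels["price"].corr(hotels["vote"])
print(f"Pearson r = {pearson_r:.3f}")
```

A coefficient near zero does not rule out a relationship: Pearson's r only measures linear association, which is why the plot is worth looking at as well. Neither the plot nor the coefficient shows that price causes high votes.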

HOTEL DATA IN THAILAND (source of data can't be disclosed)

Import *.csv data using pandas

Run the code in IPython Notebook
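The notebook code for this step is not part of the transcript. A minimal sketch of loading CSV data with `pandas.read_csv` might look like the following; the columns `hotel_name`, `price`, `vote` and `url` and every row are invented for illustration, and an in-memory string stands in for the undisclosed file.

```python
import io

import pandas as pd

# Stand-in for the real file (assumption). With a file on disk you would
# pass its path to read_csv instead of a StringIO object.
csv_text = """hotel_name,price,vote,url
Hotel A,800,8.1,http://example.com/a
Hotel B,1500,8.4,http://example.com/b
Hotel C,120000,7.9,http://example.com/c
"""

df = pd.read_csv(io.StringIO(csv_text))

# Quick look at the first rows and the inferred column types.
print(df.head())
print(df.dtypes)
```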

Using the pandas describe() function to check for outliers with percentiles

Run the code in IPython Notebook
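As a sketch of this step, `describe()` accepts a `percentiles` argument, so the summary can include the tails as well as the quartiles. The prices below are made up, and the 5% and 95% cut-offs are an assumption, not necessarily the ones used in the original notebook.

```python
import pandas as pd

# Made-up prices in THB; the last value plays the role of an outlier.
prices = pd.DataFrame(
    {"price": [500, 700, 900, 1100, 1300, 1500, 2000, 2500, 3000, 120000]}
)

# Summary statistics with extra percentiles added to the default quartiles.
summary = prices["price"].describe(percentiles=[0.05, 0.25, 0.5, 0.75, 0.95])
print(summary)
```

A maximum far above the 95th percentile, as in this toy data, is the kind of signal that suggests outliers worth filtering.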

Remove unused column and calculate quantile()

Run the code in IPython Notebook
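A minimal sketch of this step, assuming a hypothetical `url` column is the unused one and that the 5th and 95th percentiles of `price` are the cut-offs of interest (the transcript states neither):

```python
import pandas as pd

# Made-up rows; column names are assumptions for illustration only.
df = pd.DataFrame({
    "hotel_name": ["A", "B", "C", "D", "E"],
    "price": [100, 500, 1000, 1500, 3000],
    "vote": [7.0, 8.1, 8.4, 8.2, 8.5],
    "url": ["u1", "u2", "u3", "u4", "u5"],
})

# Drop the column that is not needed for the analysis.
df = df.drop(columns=["url"])

# quantile() with a list returns one value per requested quantile.
low, high = df["price"].quantile([0.05, 0.95])
print(f"5th percentile: {low}, 95th percentile: {high}")
```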

Using the apply() function to filter values based on percentile values

Run the code in IPython Notebook
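One way to sketch this step is to apply a small predicate to the price column and keep only the rows where it returns True. The helper name `within_percentiles`, the 5th/95th percentile band and the data are all assumptions for illustration.

```python
import pandas as pd

# Made-up rows; 120000 THB plays the role of an outlier.
df = pd.DataFrame({
    "price": [100, 500, 1000, 1500, 3000, 120000],
    "vote": [7.0, 8.1, 8.4, 8.2, 8.5, 7.8],
})

low = df["price"].quantile(0.05)
high = df["price"].quantile(0.95)


def within_percentiles(price):
    """Return True when the price lies inside the [low, high] band."""
    return low <= price <= high


# apply() calls the predicate once per value and yields a boolean mask.
mask = df["price"].apply(within_percentiles)
filtered = df[mask].reset_index(drop=True)
print(filtered)
```

For large frames, the vectorised `df["price"].between(low, high)` builds the same mask without a Python-level loop.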

Check the data frame again; the data looks much better now.

Run the code in IPython Notebook

BONUS (1)

Run the code in IPython Notebook

BONUS (2)

Run the code in IPython Notebook

CONCLUSION FROM OUR DATA PLOT

• The relationship between price and voting suggests that, if you want higher votes, you may want to keep your price between 500 THB and 3000 THB. Interesting.

• Most hotels in Thailand are priced between 500 THB and 1500 THB.

• It seems that customers who have booked a hotel give a score of at least 7.

• This means any vote below 7 is bad! You have to improve now!
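Conclusions like these are read off the plot, but they can also be checked numerically. A minimal sketch, assuming made-up data and price bands chosen to match the ranges mentioned above, groups hotels into bands with `pd.cut` and reports the count and mean vote per band. This is an illustration, not the author's method.

```python
import pandas as pd

# Made-up rows (assumption): price in THB, vote on a 0-10 scale.
hotels = pd.DataFrame({
    "price": [400, 600, 900, 1200, 1400, 2000, 2800, 3500, 5000],
    "vote": [7.2, 8.1, 8.3, 8.0, 8.4, 8.5, 8.2, 7.6, 7.4],
})

# Right-closed bands: (0, 500], (500, 1500], (1500, 3000], (3000, inf).
bands = pd.cut(
    hotels["price"],
    bins=[0, 500, 1500, 3000, float("inf")],
    labels=["<=500", "500-1500", "1500-3000", ">3000"],
)

# Number of hotels and mean vote in each price band.
summary = hotels.groupby(bands, observed=False)["vote"].agg(["count", "mean"])
print(summary)

# Lowest vote overall, to compare against the "at least 7" observation.
print("Minimum vote:", hotels["vote"].min())
```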

THE GOAL IS TO TURN DATA INTO INFORMATION, AND INFORMATION INTO INSIGHT.

Carly Fiorina, former executive of Hewlett-Packard Co.
