modeling human mobility using location based social networks

Post on 05-Jul-2015

83 Views

Category:

Mobile

0 Downloads

Preview:

Click to see full reader

TRANSCRIPT

Modeling Human MobilityUsing Location Based Social Networks

Gene Moo LeeUniversity of Texas at Austin

2012.08.11UKC 2012 - CSI Track

Introduction: LBSN

● LBSN (Location Based Social Network) becomes very popular○ Foursquare: 10M users, 1B checkins (Sept. 2011)○ Gowalla is acquired by Facebook○ Facebook, Twitter, Google+ also added location

features

● LBSN gives us a good chance to understand how people move around

Data CollectionCity (Data) Number of Users Number of Places Number of Check-ins

New York, NY (4SQ) 101,068 6,443 657,452

New York, NY (GWL) 5,121 13,651 116,376

Austin, TX (4SQ) 14,744 1,042 116,325

Austin, TX (GWL) 6,904 11,024 213,823

San Francisco, CA (4SQ) 41,114 2,325 237,666

San Francisco, CA (GWL) 6,386 14,234 220,405

● Foursquare: January 2012 ~ February 2012● Gowalla: May 2010 ~ July 2010

Airports in Foursquare

● LBSN is popular globally● Mainly US, Europe, South Asia

Manhattan in both datasets

● LBSN gives a representative coverage of cities

Understanding Human Movement

● Microscopic View○ We can focus on individual users and try to

understand their movements○ Many studies have been conducted on this line

● Macroscopic View○ We can focus on locations and analyze aggregated

user movements between locations○ In this study, we define "human traffic matrix" to

achieve this

Human Traffic Matrix

● U: set of users● V: set of venues/locations

● T is a traffic matrix where each row and column corresponds to locations

● The element of a human traffic matrix T(v1,v2;t) is defined as the number of users in U who moved from venue v1 to v2 during time t○ We use daily time snapshots for t○ Venues are collapsed into 0.01o x 0.01o grids

(around 1.1km x 0.7 km)

Gravity Model

● We model the human traffic matrix by gravity model○ Newton's law of gravitation: gravity between two

objects is proportional to the product of masses of two objects divided by the squared distance

● T(a,*) = number of check-ins at venue a● d(a,b) = the geographic distance between a and b● Try to find the optimal scaling factor "alpha" and

distance exponent "beta" (minimizing NMAE)

Error Metrics

● NMAE (Normalized Mean Average Error)○ Lower values are preferred

● CORR (Correlation Coefficient)○ +1 indicates strong positive correlation○ -1: negative correlation, 0: independence

Results: New York, NY

● New York (4SQ): CORR = 0.99, NMAE = 0.16● New York (GWL): CORR = 0.96, NMAE = 0.34* Green lines indicate 20% error boundaries

Results: San Francisco, CA

● SF (4SQ): CORR = 0.98, NMAE = 0.27● SF (GWL): CORR = 0.93, NMAE = 0.56

Results: Austin, TX

● Austin (4SQ): CORR = 0.97, NMAE = 0.31● Austin (GWL): CORR = 0.96, NMAE = 0.34

Evaluation Summary

● Gravity models give very good estimations○ High correlations: 0.93 ~ 0.99○ Good estimation errors: 16% ~ 56%

● Beta (distance exponent)○ 0.0 ~ 0.3 for Foursquare○ 0.4 ~ 0.6 for Gowalla○ Distance matters

● Alpha (constant factor)○ Vary due to the diverse properties of each city

Concluding Remarks

● Gravity models can be used to generate human traffic between locations

● LBSN is a great new data source● On going works to study more properties of

human movements using LBSN

Thank you!

Contact me atgene@cs.utexas.edu

top related