Download - Location Mining from Online Social Networks
![Page 1: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/1.jpg)
Location Mining from Online Social Networks
Satyen AbrolAdvisors:
Dr. Latifur KhanDr. Bhavani Thuraisingham
![Page 2: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/2.jpg)
Location Mining in Online Social Networks
What is the city level home location of a user?
![Page 3: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/3.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth
Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal
Analysis• Experiments and Results• Future Work
![Page 4: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/4.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth
Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal
Analysis
• Experiments and Results• Future Work
![Page 5: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/5.jpg)
Twitter - Basics
Tweets:Maximum 140 Characters
# of Tweets
# of Following
# of FollowersLocation
![Page 6: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/6.jpg)
Why is location so important?
![Page 7: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/7.jpg)
Privacy and Security
• Losing locational privacy forever Users leave field blank, don’t want
strangers to know their locations
• http://pleaserobme.com/
![Page 8: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/8.jpg)
Trustworthiness
• Corporate companies use social media for better advertising and marketing
• Iran Elections of 2009– US State Department used Twitter as a source
• Trustworthiness is important in such cases
To be able to trust/verify the correctness of location mentioned in user profile
![Page 9: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/9.jpg)
Marketing and Business
• Large corporations Walmart, Starbucks, United Airlines use social media Great tool for inexpensive advertising Getting feedback from users
![Page 10: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/10.jpg)
The Problem
• Leave the location field blank in their Twitter profiles• Do not provide valid geographic information
• “Justin Biebers heart”, “NON YA BISNESS!!”, “looking down on u people”
• Provide incorrect locations which may actually exist in real world• “Nothing” in Arizona, “Little Heaven” in Connecticut
• Provide several locations, difficult to identify the home location • “CALi b0Y $TuCC iN V3Ga$” – California boy stuck in Las Vegas, NV
• (~35%) enter just country, state, county, etc. and no city level locations1
1. B. Hecht, L. Hong, B. Suh, E. H. Chi, “Tweets from justin biebers heart: the dynamics of the location field in user profiles”, In SIGCHI ’11.
![Page 11: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/11.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth
Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal
Analysis
• Experiments and Results• Future Work
![Page 12: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/12.jpg)
Location Prediction in Social Networks
• Two Approaches– Content Based1,2
– Using Social Graph3,4,5
1. Z. Cheng, J. Caverlee, and K. Lee, “You are where you tweet: A content-based approach to geo-locating twitter users”. In CIKM ’10.2. B. Hecht, L. Hong, B. Suh, E. H. Chi, “Tweets from justin biebers heart: the dynamics of the location field in user profiles”, In SIGCHI ’11. 3. S. Abrol, L. Khan and B. Thuraisingham,“Tweeque: Spatio-Temporal Analysis of Social Networks for Location Mining Using Graph Partitioning,” The First ASE/IEEE
International Conference on Social Informatics, December 14-16, 2012, Washington D.C., USA.4. S. Abrol., L. Khan and B. Thuraisingham “Tweecalization: Efficient and intelligent location mining in Twitter using semi-supervised learning,” 8th IEEE International
Conference on Collaborative Computing, October 14–17, 2012 Pittsburgh, Pennsylvania.5. S. Abrol., L. Khan, “Agglomerative clustering on fuzzy k-closest friends with variable depth for location mining,” The Second IEEE International Conference on Social
Computing (SocialCom2010), Aug 20-22, 2010 Minneapolis, Minnesota.
![Page 13: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/13.jpg)
Content Based Approach
• Inaccurate – Location in Text not Location of User
• Involves Ambiguity: Paris can mean– Paris Hilton– Paris, the capital of France– Paris, a town in Texas
• Slow – Uses NLP/ Machine Learning techniques, searches gazetteers
![Page 14: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/14.jpg)
Using Social Graphs
• Based on Japanese Proverb - “When the character of a man is not clear to you, look at his friends.”
• Relationship between geospatial proximity and friendship
• Uses classical data mining algorithms for more accurate results
• Faster and can be used for real world applications
![Page 15: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/15.jpg)
Geospatial Proximity and Friendship
• Form 1012 Twitter user pairs and identify geo distance
• Curve follows power law, curve of form a(x+b)-c with exponent of -0.87
![Page 16: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/16.jpg)
Graph Construction
• Vertices (data points) represents users
• Edge represents ‘similarity’ between two users
• Deal with special cases• Spammers – follow random people• Celebrities – followed by random people
• Edge weight gets abbreviated
![Page 17: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/17.jpg)
Defining Edge Weight
• Consists of two components:– Trustworthiness (TW)– Mutual Friends (MF)
![Page 18: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/18.jpg)
Trustworthiness• Fraction of friends which have the same label as the user himself
• Intuition: A person who has stayed at the same place all his life will have most friends from same location and hence high trustworthiness
Location : Seattle/WA/USA
Location : Seattle/WA/USA
Location : Seattle/WA/USA
Location : Seattle/WA/USA
Location : Seattle/WA/USA
Location : Seattle/WA/USA
Friend
Trustw
orthiness:
0.6
Location:Seattle/WA/USA
![Page 19: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/19.jpg)
Mutual Friends
• Chose number common friends for similarity– Better Accuracy– Low Time Complexity
![Page 20: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/20.jpg)
• Defined as
Weightij=α×Max{TW(Ui), TW(Uj)} + (1- α) × MFij
• 0<α<1, typically chosen to be around 0.7
Defining Edge Weight
![Page 21: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/21.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth
Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal
Analysis
• Experiments and Results• Future Work
![Page 22: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/22.jpg)
Tweethood: Fuzzy k-Closest Friends with Variable Depth
• Choose k “closest” friends for the user
• If location is not found look further for the answer
• Each node is defined by a vector having locations with their respective probabilities
• Boost and Aggregate at each stepSatyen Abrol, Latifur Khan, “TweetHood: Agglomerative Clustering on Fuzzy k-Closest Friends with Variable Depth for Location Mining”. In Proc. of the Second IEEE International Conference on Social Computing (SocialCom-2010), Minneapolis, USA, August 20-22, 2010
![Page 23: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/23.jpg)
Find the location of John Doe
![Page 24: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/24.jpg)
Social Network of John Doe
CB1
CB2
CB3
CBn
![Page 25: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/25.jpg)
Choose k closest friends of John Doe
CB1
CB2
CB3
CBk
![Page 26: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/26.jpg)
Identify Locations
CB1
CB2
CB3
CBk
Location : NULL
Location : NULL
Location : NULL
Location : Seattle, USA
LO
W
AC
CU
RA
CY
![Page 27: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/27.jpg)
What if we have depth=2 ?
CB1
CB2
CB3
CBk
Location : Seattle/WA/USA
Location : NULL
Location : NULL
Location : Sydney/AU
Location : Dallas/TX/USA
Location : Richardson/TX/USA
Location : NULL
Location : NULL
Location : Dallas/TX/USA
Location : NULL
![Page 28: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/28.jpg)
CB1
CB2
CB3
CBk
Dallas/TX/USA 0.4Seattle/WA/USA 0.2Richardson/TX/USA 0.2Sydney/AU 0.2
Dallas/TX/USA 0.33New Delhi/Delhi/India 0.33Sunnyvale/CA/USA 0.33
Austin/TX/USA 0.50Minneapolis/MN/USA 0.50
Plano/TX/USA 0.25Boulder/CO/USA 0.25Salt Lake City/UT/USA 0.25London/London/GB 0.25
Location Vector for John Doe’s friends
![Page 29: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/29.jpg)
Location Vector for John Doe
Dallas/TX/USA 0.1825Seattle/WA/USA 0.05Richardson/TX/USA0.05Sydney/AU 0.05New Delhi/Delhi/IN 0.0825Sunnyvale/CA/USA 0.0825Austin/TX/USA 0.125Minneapolis/MN/USA 0.125Plano/TX/USA 0.0625Boulder/CO/USA 0.0625Salt Lake City/UT/US 0.0625London/GB 0.0625
![Page 30: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/30.jpg)
Agglomerative Clustering
Dallas/TX/USA 0.1825Seattle/WA/USA 0.05Richardson/TX/USA0.05Sydney/AU 0.05New Delhi/Delhi/IN 0.0825Sunnyvale/CA/USA 0.0825Austin/TX/USA 0.125Minneapolis/MN/USA 0.125Plano/TX/USA 0.0625Boulder/CO/USA 0.0625Salt Lake City/UT/US 0.0625London/GB 0.0625
![Page 31: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/31.jpg)
{Dallas, Plano, Richardson}/TX/USA 0.295
Seattle/WA/USA 0.05Sydney/AU 0.05New Delhi/Delhi/IN 0.0825Sunnyvale/CA/USA 0.0825Austin/TX/USA 0.125Minneapolis/MN/USA0.125Boulder/CO/USA 0.0625Salt Lake City/UT/US 0.0625London/GB 0.0625
Agglomerative Clustering
![Page 32: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/32.jpg)
Tweethood: Algorithm
![Page 33: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/33.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal Analysis
• Experiments and Results• Future Work
![Page 34: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/34.jpg)
Tweecalization: Label Propagation
• But the availability of users with location is limited
• Most of users do not have a location• Need a method that can learn from
unlabeled data
Satyen Abrol, Latifur Khan and Bhavani Thuraisingham, “Tweecalization: Efficient and Intelligent location mining in Twitter using semi- supervised learning,” 8th IEEE International Conference on Collaborative Computing, October 14–17, 2012, Pittsburgh, Pennsylvania
![Page 35: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/35.jpg)
Tweecalization: Label Propagation
• Ideal scenario for semi supervised learning: Only a few friends with locations(labeled data)1
• Use both labeled and unlabeled data for training
• Points which are close to each other are more likely to share a label
1. Y. Bengio, O. Dellalleau, and N. L. Roux, “Label propagation and quadratic criterion,” In O. Chapelle, B. Schlkopf and A. Zien (Eds.), Semi-supervised learning. MIT Press, 2006.
![Page 36: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/36.jpg)
Label Propagation: An Illustration
?
Central User
Friends with location
Friends without location
“CLAMPED LOCATIONS”
![Page 37: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/37.jpg)
Tweecalization: Algorithm
![Page 38: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/38.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal
Analysis
• Experiments and Results• Future Work
![Page 39: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/39.jpg)
What About Temporal Analysis?
• None of the existing works do temporal analysis
• What about migration/ geographical mobility?
![Page 40: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/40.jpg)
Migration/Geographical Mobility
• 4% to 6% every year, means 12 to 17 million each year
1. United States Census Bureau - Geographical Mobility/Migration Data - http://www.census.gov/hhes/migration/
![Page 41: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/41.jpg)
Migration/Geographical Mobility
• Migration as a function of age
• People aged 20-29 have a higher probability to move
High Migration Rate: College and Jobs
Low Migration Rate: Old age, people settle down
1. United States Census Bureau - Geographical Mobility/Migration Data - http://www.census.gov/hhes/migration/
![Page 42: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/42.jpg)
Facebook Users and Mobility
• Let us look at the cumulative effect
• Only 28% to 37% are currently living in their hometown
1. Based on our experiments on 300k Public Facebook Profiles
![Page 43: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/43.jpg)
Twitter Users and Mobility
• Linking Twitter users to migration
• 33% of all Twitter users are aged 25-34 years
1. Based on our findings by [1] ABI Research. Online. Available: http://www.abiresearch.com
![Page 44: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/44.jpg)
Tweeque: Graph Partitioning
• How do we know if “this” is the current location for a user?
• How do we perform temporal analysis of friendships?
• Propose a technique that indirectly infers the current location
Satyen Abrol, Latifur Khan and Bhavani Thuraisingham,“Tweeque: Spatio-Temporal Analysis of Social Networks for Location Mining Using Graph Partitioning,” The First ASE/IEEE International Conference on Social Informatics, December 14-16, 2012, Washington D.C., USA.
![Page 45: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/45.jpg)
Observation 1: Social Cliques and Location
• Our definition: A social clique is an inclusive group of people that share friendship
• Apart from friendship, what is the attribute that links members of a clique? Individual Locations
• All members of a clique were or are at a particular geographical location at a particular instant of time like college, school, a company, etc.
![Page 46: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/46.jpg)
• As shown previously over course of time, people have tendency to migrate
• Based on these two observations we hypothesize
• If we can divide the social graph of a particular user into cliques and check for location based purity of the cliques, we can accurately separate out his current location from previous locations.
• Migration is our latent time factor
Observation 2: Migration and Time
![Page 47: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/47.jpg)
Tweeque: An exampleFriends from high school in Dallas
Friends from college in Boston
Relatives/Cousins
Friends from job in Seattle
![Page 48: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/48.jpg)
Tweeque: An example
All Friends of the User
![Page 49: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/49.jpg)
Tweeque: An exampleSocial Clique #1 (High School)
Social Clique #2 (College)
Social Clique #3 (Current Work)
Social Clique #4 (Relatives)
![Page 50: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/50.jpg)
Tweeque: An Example
Dallas/TX/USA
Seattle/WA/USA
Dallas/TX/USA
San Diego/CA/USA
New York/NY/USA
Boston/MA/USA
Portland/OR/USA
Austin/TX/USA
Boston/MA/USA
Dallas/TX/USA
Singapore
Sydney/Australia
Dallas/TX/USA
Dallas/TX/USA
Ontario/Canada
Seattle/WA/USA
Seattle/WA/USA
Dallas/TX/USA
Seattle/WA/USA
Redmond/WA/USA
High School College Relatives Work
Purity (Dallas) = 0.32 Purity (Boston) = 0.45 Purity (Dallas) = 0.18 Purity (Seattle) = 0.69
![Page 51: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/51.jpg)
Tweeque: Graph Partitioning
![Page 52: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/52.jpg)
Tweeque: Graph Partitioning
J. Shi and J. Malik, “Normalized Cuts and Image Segmentation,” IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 22, no. 8, pp. 888-905, Aug. 2000.
![Page 53: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/53.jpg)
Tweeque: Graph Partitioning
![Page 54: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/54.jpg)
Tweeque: Algorithm
![Page 55: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/55.jpg)
Tweeque: Purity Voting
![Page 56: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/56.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal Analysis
• Experiments and Results• Future Work
![Page 57: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/57.jpg)
Experiment Data
• Randomly choose 1000 Twitter users
![Page 58: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/58.jpg)
Experiments and Results
• We observe that the accuracy saturates after depth 4• Six degrees of separation is the idea that everyone is on average
approximately six steps away, by way of introduction, from any other person in the world`
• For Twitter this distance is found to be 4.67
![Page 59: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/59.jpg)
Comparison of Different Approaches
Tweethood1 Tweecalization2 Tweeque3 Content Based4
Accuracy (City) 72.1% 75.5% 76.3% 35.6% - 51%
Accuracy (Country)
80.1% 80.1% 84.9% 52.3%
Complexity O(n) O(n3) O(n3) N/A
Temporal Analysis
No No Yes Yes
1. Satyen Abrol, Latifur Khan, “TweetHood: Agglomerative Clustering on Fuzzy k-Closest Friends with Variable Depth for Location Mining”. In Proc. of the Second IEEE International Conference on Social Computing (SocialCom-2010), Minneapolis, USA, August 20-22, 2010 (Nominated for best paper award, Acceptance Rate:13%)
2. Satyen Abrol, Latifur Khan and Bhavani Thuraisingham, “Tweecalization: Efficient and Intelligent location mining in Twitter using semi- supervised learning,” 8th IEEE International Conference on Collaborative Computing, October 14–17, 2012, Pittsburgh, Pennsylvania
3. Satyen Abrol, Latifur Khan and Bhavani Thuraisingham,“Tweeque: Spatio-Temporal Analysis of Social Networks for Location Mining Using Graph Partitioning,” The First ASE/IEEE International Conference on Social Informatics, December 14-16, 2012, Washington D.C., USA.
4. Z. Cheng, J. Caverlee, and K. Lee, “You are where you tweet: A content-based approach to geo-locating twitter users”. In CIKM ’10.
![Page 60: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/60.jpg)
Outline
• Introduction and Problem Statement• Different Approaches• Social Graph Based: Our Approaches
Tweethood: Fuzzy k – Closest Friends with Variable Depth Tweecalization: Label Propagation Tweeque: Graph Partitioning for Spatio-Temporal Analysis
• Experiments and Results• Future Work
![Page 61: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/61.jpg)
Contributions
• Developed three graph based location mining algorithms for online social networks Maps location mining problem to k-nearest
neighbor, semi supervised and graph partitioning problem
Outperform content based approach in time and accuracy
• Relationship between geospatial proximity and friendship
• Effect of geographical mobility on current location of users
![Page 62: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/62.jpg)
Future Work
• Combining Content and Graph based methods Score based geo-tagging technique1
Associating keywords with locations to build probabilistic model: “cowboys” Dallas, “casino” Las Vegas
Since tweets have timestamps, it leads to more accurate prediction of current location
1 Satyen Abrol, Latifur Khan, Tahseen Al-khateeb, “MapIt: Smarter Searches using Location Driven Knowledge Discovery and Mining”, In Proc. of 1st SIGSPATIAL ACM GIS 2009 International Workshop on Querying and Mining Uncertain Spatio-Temporal Data (QUeST), Nov 2009, Seattle.
![Page 63: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/63.jpg)
Future Work
• Improve scalability of current algorithms using cloud computing framework Each of the friends of a user is handled by a
separate node in the distributed environment• Micro-level location identification
Identify specific points of interests (POIs) such as restaurants, place of work, etc from tweets
Identify comfort zone for a user Use Foursquare check-in dataset: over 30
million POIs all over the world
![Page 64: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/64.jpg)
Publications
• Satyen Abrol, Latifur Khan and Bhavani Thuraisingham,“Tweeque: Spatio-Temporal Analysis of Social Networks for Location Mining Using Graph Partitioning,” The First ASE/IEEE International Conference on Social Informatics, December 14-16, 2012, Washington D.C., USA.
• Satyen Abrol, Latifur Khan and Bhavani Thuraisingham, “Tweecalization: Efficient and Intelligent location mining in Twitter using semi- supervised learning,” 8th IEEE International Conference on Collaborative Computing, October 14–17, 2012, Pittsburgh, Pennsylvania
• Satyen Abrol, Latifur Khan, “TweetHood: Agglomerative Clustering on Fuzzy k-Closest Friends with Variable Depth for Location Mining”. In Proc. of the Second IEEE International Conference on Social Computing (SocialCom-2010), Minneapolis, USA, August 20-22, 2010 (Nominated for best paper award, Acceptance Rate:13%)
![Page 65: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/65.jpg)
Publications
• Satyen Abrol And Latifur Khan, “TWinner: Understanding News Queries With Geo-Content Using Twitter”. In Proc. of 6th Workshop on Geographic Information Retrieval (GIR'10) At Zurich, Switzerland.
• Satyen Abrol, Latifur Khan, Tahseen Al-khateeb, “MapIt: Smarter Searches using Location Driven Knowledge Discovery and Mining”, In Proc. of 1st SIGSPATIAL ACM GIS 2009 International Workshop on Querying and Mining Uncertain Spatio-Temporal Data (QUeST), Nov 2009, Seattle.
• Satyen Abrol, Latifur Khan, Vaibhav Khadilkar, Bhavani M. Thuraisingham, Tyrone Cadenhead, “Design and implementation of SNODSOC: Novel class detection for social network analysis”, ISI 2012: 215-220
![Page 66: Location Mining from Online Social Networks](https://reader036.vdocument.in/reader036/viewer/2022062410/56815546550346895dc314dc/html5/thumbnails/66.jpg)
Thank You!
Questions?