real-time social event analytics - ibm€¦ · real-time social event analytics francesco calabrese...

3
Real-Time Social Event Analytics Francesco Calabrese * , Giusy Di Lorenzo * , Gavin McArdle *† , Fabio Pinelli * , Erik Van Lierde * IBM Research - Ireland, Dublin, Ireland {fcalabre, giusydil, gavinm, fabiopin}@ie.ibm.com National Centre for Geocomputation, Maynooth University, Maynooth, Co Kildare, Ireland [email protected] Mobistar, Brussels, Belgium [email protected] Managing public safety at large events is important. Crowd control and traffic management are particularly relevant for non-ticketed events in public spaces. In such cases, it can be difficult for organisers to anticipate the number of people who will attend and to validate the event’s success [1]. Given the ubiquitous nature of mobile phones, Call Detail Records (CDRs), which are the logs of user transactions with a mobile phone service provider, have been widely used to study urban processes [2], [3], [4]. By building on the work of [5] and [6] our research explores the use of real-time CDR data as a proxy to estimate the density of crowds in different areas of a city while events are taking place. The research has also been extended to estimate the density of vehicles on the main access routes to a city. This has led to the development of an application entitled Social Event Analytics (SEA) which provides both real-time and historic information about crowd and vehicle densities. The application can be used by authorities and event organisers to manage the event and gauge its success. The application was used in January 2015 for monitoring city wide events in Mons, Belgium which marked the launch of Mons as the European City of Culture for 2015. Using SEA local police simultaneously monitored the density of vehicles on the road network and the crowd density in different areas of the city. Below, we briefly describe the new real-time data analysis which we carried out for this specific case in which over 20 million CDRs were analysed each day. The results are useful for authorities but will also help to further our knowledge of human processes in urban environments. We use data from a Belgium telecommunication operator who provided details about the distribution and azimuth of their cell towers in the city of Mons. Using this information the city is divided into cells using a Voronoi tessellation [7]. In total, 319 distinct cells were created (figure 1). These formed the basic unit on which crowd and vehicle density are analysed. Prior to the opening ceremony, the police and organisers provided details of specific public squares in the city where events were taking place along with a list of important access routes to the city and our analysis focuses on these spatial features. Anonymous CDR data for individuals connected to cell towers in Mons and the surrounding area are supplied directly by the telecommunication operator. Each row of CDR data consists of an anonymous user ID, a time-stamp, the ID of the cell tower, the home country of the device and the type of record (call, SMS, data). Typically, data connections are an Fig. 1. 319 Voronoi cells were computed from the cell tower distribution and azimuth data provided by the telecommunication operator in Mons. always on service and lead to generation of comprehensive CDRs. In this case the data are recorded at a rate of more than 250 records every second. By analysing the cell IDs, it is possible to determine the number of users connected to each cell tower and determine the Voronoi cell they are in. By combining this data with the known area of each Voronoi cell and the known customer penetration rate of the specific mobile operator, the density of crowds in any given Voronoi cell is estimated. For the public squares which are of particular interest to the police, we also produce a count of the number of individuals in these spaces. Furthermore, an estimation of the number of people in the whole city is calculated by summing the amount people connected to the relevant cell towers. While the police are interested in public safety and crowd control, organisers are also interested in assessing the success of the event and the marketing campaigns used to attract people. In addition to the numbers attending the event, the organisers also wanted to know where people were travelling from in order to attend the events in the city. Therefore, the number of users by nationality are calculated using the home country of the device which is available directly from each CDR. In order to obtain the home city of Belgian users, further

Upload: others

Post on 17-Apr-2020

4 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Real-Time Social Event Analytics - IBM€¦ · Real-Time Social Event Analytics Francesco Calabrese , Giusy Di Lorenzo , Gavin McArdley, Fabio Pinelli , Erik Van Lierdez IBM Research

Real-Time Social Event Analytics

Francesco Calabrese∗, Giusy Di Lorenzo∗, Gavin McArdle∗†, Fabio Pinelli∗, Erik Van Lierde‡∗IBM Research - Ireland, Dublin, Ireland

{fcalabre, giusydil, gavinm, fabiopin}@ie.ibm.com†National Centre for Geocomputation, Maynooth University, Maynooth, Co Kildare, Ireland

[email protected]‡Mobistar, Brussels, Belgium

[email protected]

Managing public safety at large events is important. Crowdcontrol and traffic management are particularly relevant fornon-ticketed events in public spaces. In such cases, it canbe difficult for organisers to anticipate the number of peoplewho will attend and to validate the event’s success [1]. Giventhe ubiquitous nature of mobile phones, Call Detail Records(CDRs), which are the logs of user transactions with a mobilephone service provider, have been widely used to study urbanprocesses [2], [3], [4]. By building on the work of [5] and[6] our research explores the use of real-time CDR dataas a proxy to estimate the density of crowds in differentareas of a city while events are taking place. The researchhas also been extended to estimate the density of vehicleson the main access routes to a city. This has led to thedevelopment of an application entitled Social Event Analytics(SEA) which provides both real-time and historic informationabout crowd and vehicle densities. The application can beused by authorities and event organisers to manage the eventand gauge its success. The application was used in January2015 for monitoring city wide events in Mons, Belgium whichmarked the launch of Mons as the European City of Culturefor 2015. Using SEA local police simultaneously monitoredthe density of vehicles on the road network and the crowddensity in different areas of the city. Below, we briefly describethe new real-time data analysis which we carried out for thisspecific case in which over 20 million CDRs were analysedeach day. The results are useful for authorities but will alsohelp to further our knowledge of human processes in urbanenvironments.

We use data from a Belgium telecommunication operatorwho provided details about the distribution and azimuth oftheir cell towers in the city of Mons. Using this informationthe city is divided into cells using a Voronoi tessellation [7].In total, 319 distinct cells were created (figure 1). Theseformed the basic unit on which crowd and vehicle densityare analysed. Prior to the opening ceremony, the police andorganisers provided details of specific public squares in the citywhere events were taking place along with a list of importantaccess routes to the city and our analysis focuses on thesespatial features.

Anonymous CDR data for individuals connected to celltowers in Mons and the surrounding area are supplied directlyby the telecommunication operator. Each row of CDR dataconsists of an anonymous user ID, a time-stamp, the ID ofthe cell tower, the home country of the device and the typeof record (call, SMS, data). Typically, data connections are an

Fig. 1. 319 Voronoi cells were computed from the cell tower distributionand azimuth data provided by the telecommunication operator in Mons.

always on service and lead to generation of comprehensiveCDRs. In this case the data are recorded at a rate of morethan 250 records every second. By analysing the cell IDs,it is possible to determine the number of users connected toeach cell tower and determine the Voronoi cell they are in.By combining this data with the known area of each Voronoicell and the known customer penetration rate of the specificmobile operator, the density of crowds in any given Voronoicell is estimated. For the public squares which are of particularinterest to the police, we also produce a count of the number ofindividuals in these spaces. Furthermore, an estimation of thenumber of people in the whole city is calculated by summingthe amount people connected to the relevant cell towers.

While the police are interested in public safety and crowdcontrol, organisers are also interested in assessing the successof the event and the marketing campaigns used to attractpeople. In addition to the numbers attending the event, theorganisers also wanted to know where people were travellingfrom in order to attend the events in the city. Therefore, thenumber of users by nationality are calculated using the homecountry of the device which is available directly from eachCDR. In order to obtain the home city of Belgian users, further

Page 2: Real-Time Social Event Analytics - IBM€¦ · Real-Time Social Event Analytics Francesco Calabrese , Giusy Di Lorenzo , Gavin McArdley, Fabio Pinelli , Erik Van Lierdez IBM Research

Fig. 3. A screen capture from the SEA application showing the estimated crowd density for the Voronoi cells in Mons. Additional bar charts show the estimatedvolume of people at several locations in the city and the home country of visitors.

Fig. 2. 86 road segments were computed based on their intersection withthe Voronoi cells.

analysis of the complete CDR data for Belgium was carriedout off line. Several weeks of CDR data were analysed in orderto determine the city where each device generally connected

to network during the night time hours between Midnightand 6 AM. This is similar to approaches for calculating thesignificant places people visit [8], [9].

In addition to crowd estimation, the density of vehiclestravelling on the main access roads to the city is estimatedusing a minimal computational approach. Each road of interestis segmented so each segment is contained within a singleVoronoi cell. This produces 86 road segments (figure 2). Whilethe density of crowds in these cells can be estimated usingthe techniques described above, we are interested in thosetravelling through the cell. Historic CDR data was analysed todetermine users who regularly spend time in that cell. Thesetypically represent people who live or work in the area. Theseusers are removed from the analysis of the density for that cellwhich allows us to estimate the density of the major roads inthe cell. When two or more major roads are covered by a singleVoronoi cell, it is necessary to estimate the density of theseroads by averaging the known densities of the road segmentson either side of the particular cell.

The analysis above was used to produce an application,called SEA, which monitors vehicle and crowd density innear real-time. The data is supplied by the telecommunicationoperator every 15 minutes and contains the CDR data forthe previous 15 minutes. Based on the analysis, dynamicvisualisations are produced to present the results to the policeand organisers. The main visualisation, as seen in figure 1

Page 3: Real-Time Social Event Analytics - IBM€¦ · Real-Time Social Event Analytics Francesco Calabrese , Giusy Di Lorenzo , Gavin McArdley, Fabio Pinelli , Erik Van Lierdez IBM Research

Fig. 4. A screen capture from the SEA application showing the estimated traffic density on several access routes for Mons.

consists of a map with the Voronoi cell structure superimposed.The cells are coloured based on the crowd density. The Voronoicells containing the squares where events are taking place arefurther highlighted. A bar chart shows the count of the crowdfor each of these squares. An additional bar chart shows thehome country of international visitors along with an estimateof the total number of people in the city. A separate interactivemap shown in figure 4 presents the roads of interest. Eachroad segment is coloured based on the estimated traffic density.Roads which share cells are coloured with 50% transparencyto signify an average of the surrounding road segments wasused to calculate its density.

The real-time application was used by the police andorganisers to monitor the events during the opening ceremonyof Mons as the European City of Culture 2015. Preliminaryfeedback has been positive and we are awaiting the results ofa more detailed evaluation. We will also use the CDR data todetermine the amount of time individuals spend at events andto understand the fine grained mobility occurring in the city.These will serve as indicators regarding the success of eventswhile also furthering our understanding of urban processes.

ACKNOWLEDGMENTS

This publication has emanated from research conductedwith the financial support of Science Foundation Ireland underGrant ”SFI 13/IF/I2783”. The authors would like to thankMobistar for providing access to the anonymized data.

REFERENCES

[1] F. Calabrese, F. C. Pereira, G. Di Lorenzo, L. Liu, and C. Ratti, “Thegeography of taste: analyzing cell-phone mobility and social events,” inPervasive computing. Springer, 2010, pp. 22–37.

[2] F. Calabrese, L. Ferrari, and V. D. Blondel, “Urban sensing using mobilephone network data: A survey of research,” ACM Computing Surveys(CSUR), vol. 47, no. 2, p. 25, 2014.

[3] J. Reades, F. Calabrese, and C. Ratti, “Eigenplaces: analysing cities usingthe space-time structure of the mobile phone network,” Environment andPlanning B: Planning and Design, vol. 36, no. 5, pp. 824–836, 2009.

[4] R. Caceres, J. Rowland, C. Small, and S. Urbanek, “Exploring the useof urban greenspace through cellular network activity,” in Proc. of 2ndWorkshop on Pervasive Urban Applications (PURBA), 2012.

[5] T. Sohn, A. Varshavsky, A. LaMarca, M. Y. Chen, T. Choudhury, I. Smith,S. Consolvo, J. Hightower, W. G. Griswold, and E. De Lara, “Mobilitydetection using everyday gsm traces,” in UbiComp 2006: UbiquitousComputing. Springer, 2006, pp. 212–224.

[6] C. Ratti, S. Sobolevsky, F. Calabrese, C. Andris, J. Reades, M. Martino,R. Claxton, and S. H. Strogatz, “Redrawing the map of great britain froma network of human interactions,” PloS one, vol. 5, no. 12, p. e14248,2010.

[7] M. C. Gonzalez, C. A. Hidalgo, and A.-L. Barabasi, “Understandingindividual human mobility patterns,” Nature, vol. 453, no. 7196, pp. 779–782, 2008.

[8] S. Isaacman, R. Becker, R. Caceres, S. Kobourov, M. Martonosi, J. Row-land, and A. Varshavsky, “Identifying important places in peoples livesfrom cellular network data,” in Pervasive computing. Springer, 2011,pp. 133–151.

[9] F. Calabrese, Z. Smoreda, V. D. Blondel, and C. Ratti, “Interplay betweentelecommunications and face-to-face interactions: A study using mobilephone data,” PloS one, vol. 6, no. 7, p. e20814, 2011.