user generated data on extreme events

17
1 Dept. of Systems Engineering & Computer Science, Universidad del Norte, Colombia 2 Dept. of Civil and Environmental Engineering, Politecnico di Milano, Como Campus, Italy 3 Transport and ICT Global Practice, The World Bank User Generated Data During Extreme Events Mayra Zurbar á n 1 , Maria A. Brovelli 2 , Danilo Ardagna 2 , Mattia Manara 2 , Mark Iliffe 3

Upload: mayra-zurbaran

Post on 12-Apr-2017

47 views

Category:

Data & Analytics


6 download

TRANSCRIPT

Presentazione standard di PowerPoint

1 Dept. of Systems Engineering & Computer Science, Universidad del Norte, Colombia2 Dept. of Civil and Environmental Engineering, Politecnico di Milano, Como Campus, Italy3 Transport and ICT Global Practice, The World BankUser Generated Data During Extreme EventsMayra Zurbarn1, Maria A. Brovelli2, Danilo Ardagna2, Mattia Manara2, Mark Iliffe3

11The purpose is to provide links and resources

Social media APIs allow access to rich sources of user generated contentThis serves for manyfold applications, specially when it is geo-referenced data, e.g.:Spotting popular locationsDifferent activities within a cityRelevant local newsSentiment analysisMonitoring extreme eventsThis was done in Italy using the Twitter Streaming API for the following events:Monitor precipitation in the countryEarthquake detectionScript openly available at: https://github.com/mazucci/geocollect

Usage of Social Media APIs

22These maps are then validated using reference data obtained either through photo interpretation or field surveys.emphasizes user-generated contentadvancements in GPU (graphics processing unit) technologies and improvements in 3D graphics libraries available virtual globes: Google Earth, Cesium, NASA World Wind -> free and open source vs proprietary

The Twitter Search API:Size limited response per requestNeeds to reconnect on every queryRestricted requests over time; limited at 180 queries per 15 min window

Available at: https://dev.twitter.com/rest/public/search The Twitter Streaming API:Needs only one request and the connection remainsLow latency access up to an estimated 1% of all twitter dataParametrized queries get higher percentage of the full responseOptimized traffic due to less connection attemptsAvailable at: https://dev.twitter.com/streaming/overview Advantages of Using Streamig APIFor Italy around 30,000 geo-referenced tweets are collected daily through the Streaming API

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

33These maps are then validated using reference data obtained either through photo interpretation or field surveys.emphasizes user-generated contentadvancements in GPU (graphics processing unit) technologies and improvements in 3D graphics libraries available virtual globes: Google Earth, Cesium, NASA World Wind -> free and open source vs proprietary

The Twitter Search API:The Twitter Streaming API:

Architecture Comparison

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Tecnologies Used for Data CollectionThe web framework for perfectionists with deadlines

&

Streaming response&RetrievesSavesRequest

CSV to processDatabaseApplication

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Earthquake in Central ItalyCase Study:

HashtagAppered First AtNumber of Tweets#Italy03:4059#terremoto03:40287#earthquake03:40149#sismo03:4175#Roma03:415#terremotoRoma03:412#quake03:563#Norcia04:095#terremotoItalia8:387

Trends from TweetsFirst reported tweet after the earthquake at aprox. 250 km from the epicentre:

2 minutes after the official time of major earthquake (Mag: 6,0) at 3:36 AM

Popular Hashtags

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Intention of the Content

Number of Tweets by ProvincesA total of 498 tweets were related to the earthquake on Aug 24th

Number of Tweets by UsersMost users tweet less than 3 tweets related to the earthquake on Aug 24th

Timelapse of Occurrence of Tweets Related to the Earthquake on Aug 24

From 00 AM to 10 AM

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Density Map of Epicentres

*Values calculated using the Kernell formula by Silverman (1986, p. 76, equation 4.5)

Data: A total of 499 earthquakes reported from Aug 17th until Aug 24th of 2016.Source: INGV - http://www.ingv.it/it/

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Density Map of Georeferenced Tweets

Precipitations in ItalyCase Study:

Keywords to include while processing: Temporale, acquazzone, diluvio, alluvioneHeatmap of Tweets Distribution

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Acquired KnowledgeDuring extreme events, in our case study of earthquakes in central Italy:Users tweet more when they are near the epicentre but on non critical areasPopulation density impacts the content generation: There are more tweets from big citiesAggregating user-generated content aids in understanding users perceptions during extreme events e.g. EMSC - http://www.emsc-csem.orgThe precipitations study shows that are more tweets regarding rain near the Alps Area where it rains more during summer time according to official reportsFor Future Work it is interesting to supervise the behavior during different seasons

By GEOlab - Geomatics and Earth Observation Laboratory @ POLIMI

Any Questions?Thank you!