intensity analysis of spatial point patternschris/lecture5_210c_spring2011_pointpattern... ·...

Intensity Analysis of Spatial Point PatternsGeog 210C

Introduction to Spatial Data Analysis

Chris Funk

Lecture 5

Topic Overview

1) Introduction/Unvariate Statistics2) Bootstrapping/Monte Carlo Simulation/Kernel

Estimation3) Distance Matrices/Point Pattern Analysis4) Bivariate/Multivariate/Spatial Regression5) Spatial Covariance and Covariance Models

1) Spatial Stochastic Processes

6) Kriging and Spatial Estimation – I7) Kriging and Spatial Estimation – II8) Spatial Sampling Strategies9) Principal Components and Ordination10) Combining 1st order and 2nd effects

2

3

Spatial Point Patterns

DefinitionSet of point locations with recorded “events" within study region, e.g., locations of trees, disease or crime incidents

point locations could correspond to all possible events or to subsets of them (mapped versus sampled point pattern)attribute values could have also been measured at event locations, e.g., tree diameter (marked point pattern)

Objective: Introduce statistical tools for quantifying spatial interaction of events,e.g., clustering versus randomness or regularity

4

1D Kernel Density Estimation Flowchart

1D Kernel Density Estimation Flowchart1. choose a kernel function k(x-xi), i.e., a PDF, and a bandwidth parameter b controlling kernel extent and consequently the “smoothness" of the final estimated density profile f(x); this amounts to choosing a scaled kernel function k(x-xi;b)2. discretize 1D segment, i.e., choose a set of P x-coordinates {xp; p = 1…P} at which the density function f(x) will be estimated3. for each datum coordinate xi , evaluate the scaled kernel function k(xp-xi;b) for all P x-values; this yields N scaled kernel profiles {ki(b); i = 1…N} each one stemming from a particular event coordinate xi4. for each discretization coordinate xp, compute estimated density f(xp) as the sum of the N scaled kernel values k(xp-xi; b), after weighting each such value by 1/N:

OutputA (Px1) vector k(b) with estimated density values f(x) at the specified x-coordinates; the N scaled & weighted kernels {(1/N)ki (b); i = 1…N} can be regarded as N elementary profiles whose super-position builds up the final estimated density profile

Checkouthttp://parallel.vub.ac.be/research/causalModels/tutorial/kde.html

5

Constructing A Separable 2D Kernel

Two 1D Gaussian kernels for the x- and y-dimensions

Replicated 1D Gaussian kernels and 2D separable composite

Anisotropic kernel = multidimensional kernel with different bandwidths along different directions

6

2D Kernel Intensity Estimation

1. center a circle C(u;b) of radius b at any arbitrary location u in D2. estimate local intensity at u as: λ(u) =N(u; b)/|C(u;b)|

where N(u; b) = # of events within C(u; b)|C(u; b)| = kernel measure, b2 in 2D.

Note: steps 1 and 2 amount to choosing a 2D kernel function that plots like a cylinder with base radius b and height 1=(b2)3. repeat estimation for set of points (typically arranged at the grid nodes of a regular raster) in the study region to create an intensity map

Looping over # of grid nodes (P) instead over # of events (N),yields same results as kernel density estimation case

7

Recap (Lecture 3)

Event intensity of spatial point patternsλ(u): mean # of events over a unit area centered at uestimated overall intensity λ = N/|D|local intensity via quadrat counts or kernel density estimation

Kernel intensity estimationconversion of point data (events) to raster format (intensity surface)statistical multidimensional (multivariate) density estimation methods are used to estimate local intensity f(u). Note: Density surface integrates to 1, so multiply every such estimate f(u) by N to convert it to an intensity value f(u)resulting intensity surface depends on: (i) kernel type, and (ii) bandwidth; the latter is more influentialalternative approaches for non-parametric multivariate density estimation include: k-nearest neighbor and mixture of Gaussian densities methodsintensity surface can be linked (via regression models) to explanatory variables, e.g., disease occurrence intensity as function of air quality variables

8

Outline-Lecture 5

Concepts & NotationDistance & Distance MatricesDistances Involved in Spatial Point PatternsStatistical Tests –

Sampling Distributions Bootstrap exampleQuadrat count example

Quantifying Spatial Interaction: Nearest Neighbor Distance Metrics G Function

Proportion of minimum event-to-nearest-event distances no greater than given distance cutoff d

F FunctionProportion of minimum point-to-nearest-event distances no greater than given distance cutoff d

Measures based on the distribution of event intensities The K function Quadrat counts and KS-test?

Quantifying Spatial Interaction: K FunctionPoints To Remember

9

Some Notation

Point eventsSet of N locations of events occurring in a study area:

Variable of interesty(s) = number of events (a count) within arbitrary domain or support s with measure (length, area, volume) |s|; support s is centered at an arbitrary location u and can also be denoted as s(u); in statistics, y(s) is treated as a realization of a random variable (RV) Y(s)

ObjectiveQuantify interaction, e.g., covariation, between outcomes of any two RVs Y(s) and Y(s’). To do so, all RVs must lie in the same “environment"; in other words, the long-term average (expectation) of RV Y(s) should be similar to that of Y(s’)

10

Intensity of Events

Local intensity λ(u)Mean number of events per unit area at an arbitrary location or point u, formally defined as:

Overall intensity λ

First order stationarityAny RV should have the same long-term average, for a fixed areal unit s. This implies a constant intensity:

The expected number of events with a region s is just a function of |s|: E{Y(s)} =λ|s|

11

Interaction Between Count RVs

Second-order intensityLong-term average (expectation) of products of counts per unit areas at any two arbitrary points u and u’, formally defined as:

Some terminologysecond-order stationarity: expectation of all RVs is constant (first-order stationarity), and second-order intensity is a function of separation vector between any two locations u and u’isotropy: only distance (not orientation) of separation vector matters

OutlookQuantifying interaction in spatial point patterns within the above assumptions orworking hypotheses amounts to studying distances between events

12

Distance

A measure of proximity (typically along a crow's flight path) between any two locations or spatial entitiesEuclidean distance

Consider two points in a 2D (geographical or other) space with coordinates ui = (xi;yi) and uj = (xj;yj). The Euclidean distance dij between points ui and uj is computed via Pythagoras's theorem as:

13

Distance Metric

Formal characteristics of a distance metricA measure dij of proximity between locations ui and uj is a valid distance metric if it satisfies the following requirements:distance between a point and itself is always zero:

dii = 0distance between a point and another one is always positive:

dij > 0

distance between two points is the same no matter which point you consider first:

dij = dji

the triangular inequality holds: sum of length of two sides of a triangle cannot be smaller than length of third side:

dij ≤ dil + dlj

A metric dij need not always be Euclidean,and hence should be checked to ensure that it is a valid distance metric

14

Non-Euclidean Distances

Alternative “distance" measures(i) over a road, or railway, (ii) along a river, (ii) over a network

Even more exotic “distance" measures(i) travel time over a network, (ii) perceived travel time between urban landmarks, (iii) volume of exports/imports

Euclidean distances between network nodes ≠ actual or perceived distances on the network the latter might not even be formal distance metrics, i.e. dij ≠ dji

15

Minkowski's Generalized Distance

DefinitionConsider two points in a K-dimensional (geographical or other) space RK with coordinate vectors ui = [ui1…uik … uiK] and uj = [uj1…ujk…ujK ]. The Minkowski distance of order p (with p > 1), denoted as dij

(p), between points ui and uj is computed as:

Particular casesManhattan or city-block distance:

Euclidean distance

Distances computed from points in multidimensional spaces are routinely used in statistical pattern recognition; points represent objects or cases, each described by K attribute values

http://www.google.com/imgres?imgurl=http://www.statemaster.com/wikimir/images/upload.wikimedia.org/wikipedia/commons/thumb/0/08/Manhattan_distance.svg/300px-Manhattan_distance.svg.png&imgrefurl=http://www.statemaster.com/encyclopedia/Taxicab-geometry&h=300&w=300&sz=8&tbnid=HS-oGjEqXE7ZUM:&tbnh=116&tbnw=116&prev=/images%3Fq%3Dmanhattan%2Bdistance&hl=en&usg=__ukaVPHqn-NcYAb6SDHVAqesOhgU=&ei=Mlm6S_fxK5OysgPN5ZT1BA&sa=X&oi=image_result&resnum=4&ct=image&ved=0CBkQ9QEwAw�

16

Euclidean Distance Matrix: Single Set of Points

DefinitionConsider a set of N points {u1…ui…uN} in a K-dimensional (geographical or other) space. The distance matrix D is a square (NxN) matrix containing the distances {d(ui,uj); i = 1…N; j = 1…N} between all NxN possible pairs of points in the set

by convention, u1 is the coordinate vector of the 1st point in the set (1st entry in data file)

i-th row (or column) contains distances between i-th point ui and all others (including itself)D is symmetric with zeros along its diagonal

17

Euclidean Distance Matrix: Two Sets of Points

Consider 2 sets of points {u1…ui…uN} and {t1…ti…tM} in a K-dimensional (geographical or other) space. The distance matrix D is a (NxM) matrix containing the Euclidean distances {d(ui,tj); i = 1…N; j = 1…M} between all N x M possible pairs formed by these two sets of points

by convention, u1 is the coordinate vector of the 1st datum in the data set #1, and similarly for t1

i-th row contains distances between i-th point ui in set #1 and all points in set #2j-th column contains distances between j-th point tj in set #2 and all points in set #1

D is not symmetric, i.e., d12 ≠ d21: pair {u1,t2} is not the same as pair {u2,t1}

18

Distances Between Events in A Point Pattern

Event-to-event distanceDistance dij between event at location ui and another event at location uj

Point-to-event distanceDistance dpj between a randomly chosen point at location tp and an event at location uj :

Event-to-nearest-event distanceDistance dmin(ui) between an event at location ui and its nearest neighbor event:

Point-to-nearest-event distanceDistance dmin(tp) between a randomly chosen point at location tp and its nearest neighbor event:

19

Event-to-Nearest-Event Distances

Some events might be nearest neighbors of each other: e.g., u4,u5, or have same nearest neighbor: e.g., u2, u3, u4 are nearest neighbors of u5

Mean nearest neighbor distanceAverage of all dmin(ui) values:

Drawback: single number does not suffice to describe point pattern

20

The G Function

DefinitionProportion of event-to-nearest-event distances dmin(ui) no greater than given distance cutoff d, estimated as:

Cumulative distribution function (CDF) of all N event-to-nearest-event distances

Example

21

Event-to-Nearest-Event (E2NE) Distance Histograms

for evenly-spaced events, E2NE distances similar to spacing of eventsfor clustered events, more small E2NE distances and fewer large distances

Point Pattern Analysis Metrics

Ghat functionCumulative distribution function (CDF) of all N event-to-nearest-event distances

Fhat functionCumulative distribution function (CDF) of all N point-to-nearest-event distances

Khat functionRelative number of events at distance D calculated around all events

event-to-nearest-event distances → use the sample G function G(d)point-to-nearest-event distances → use the sample F function F(d)event-to-event distances → use the sample K function K(d)2

2

23

Sample G Function Examples

for evenly-spaced events, G(d) rises gradually up to the distance at which most events are spaced, and then increases rapidlyfor clustered events, G(d) rises rapidly at short distances, and then levels off at larger d-values

Ghat=Cumulative distribution function (CDF) of all N event-to-nearest-event distances

24

The F Function

DefinitionProportion of point-to-nearest-event distances dmin(tj) no greater than given distance cutoff d, estimated as:

Cumulative distribution function (CDF) of all M point-to-nearest-event distances

for larger number M of random points, F(d) becomes even smootherNote: The F function provides information on event proximity to voids

25

Point-to-Nearest-Event (P2NE) Distance Histograms

for evenly-spaced events, there are more nearest events at small distances from randomly placed pointsfor clustered events, P2NE distances are generally larger than the previous case, and there are a few large such distances

26

Sample F Function Examples

for evenly-spaced events, F(d) rises rapidly up to the distance at which most events are spaced, and then levels off (more nearest neighbors at small distances from randomly placed points)for clustered events, F(d) rises rapidly at short distances, and then levels off at larger d-values

27

The Sample K Function

Concept building1. construct set of concentric circles (of increasing radius d) around each event2. count number of events in each distance “band"3. cumulative number of events up to radius d around all events = sample K function, K(d)

Formal definition

28

Interpreting The Sample K Function

Re-expressing

In other words: Function K(d) is the sample cumulative distribution function (CDF) of all N2-N event-to-event distances, scaled by |D|

Note: Ignore bin at d = 0 (center plot) and point at d = 0 (right plot

29

Event-to-Event Distance Histograms

for evenly-spaced events, there are more medium-sized E2E distances than small or large such distancesfor clustered events, the distribution of E2E distances is multi-modal

Event-to-Event Distance CDFs

for clustered events, there are multiple bumps in the CDF of E2E distances due to the grouping of events in space

30

31

Sample K Function Examples

sample K function K(d) is monotonically increasing and is a scaled (by domainmeasure |D| version of the CDF of E2E distances

32

Recap

Quantifying interaction in spatial point patternsevent-to-nearest-event distances → use the sample G function G(d)point-to-nearest-event distances → use the sample F function F(d)event-to-event distances → use the sample K function K(d)K function looks at information beyond nearest neighbors

Caveatsclustering is always a function of the overall intensity of a point patternclustering might occur due to local intensity variations or due to interaction;it is very difficult to disentangle each contribution

Watch out forboundaries and edge effectsdistance distortions due to map projectionssampled versus mapped point patternsInteractions of 1st versus second order stationarity

intensity analysis of spatial point patternschris/lecture5_210c_spring2011_pointpattern... ·...

Documents