1 statistical bases for map reconstructions and comparisons jerry platt may 2005

57
1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

Upload: louise-knight

Post on 17-Dec-2015

214 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

1

Statistical Bases for Map Reconstructions and Comparisons

Jerry Platt

May 2005

Page 2: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

2

Preliminaries

Page 3: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

3

Outline• Motivation

– Do Different Maps “Differ”?

• Methods

– Singular-Value Decomposition

Multidimensional Scaling and PCA

Mantel Permutation TestProcrustean Fit and Permu. TestBidimensional Regression

• Working Example– Locational Attributes of Eight URSB Campuses

Page 4: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

4

Motivation• Comparing Maps Over Time

Accuracy of a 14th Century MapLeader Image Change in Great BritainWhere IS Wall Street, post-9/11?

• Comparing Maps Among Sub-samplesThings People Fear, M v. F Face-to-Face Comparisons

• Comparing Maps Across AttributesCompetitive Positioning of FirmsChinese Provinces & Human Dev. Indices

Page 5: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

5

Accuracy of a 14th Century Map

http://www.geog.ucsb.edu/~tobler/publications/pdf_docs/geog_analysis/Bi_Dim_Reg.pdf

Page 6: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

6http://www.mori.com/pubinfo/rmw/two-triangulation-models.pdf

Page 7: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

7http://igeographer.lib.indstate.edu/pohl.pdf

Page 8: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

8

http://www.analytictech.com/borgatti/papers/borgatti%2002%20-%20A%20statistical%20method%20for%20comparing.pdf

Things People Fear, F v. M

Page 9: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

9http://www.multid.se/references/Chem%20Intell%20Lab%20Syst%2072,%20123%20(2004).pdf

Face-to-Face Comparisons

Page 10: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

10http://www.gsoresearch.com/page2/map.htm

Page 11: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

11

Page 12: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

12

MethodsEigen-Analysis and Singular-Value Decomposition

Multidimensional Scaling & Principal Comps.

Mantel Permutation Test

Procrustean Fit and Permutation Test

Bidimensional Regression

Page 13: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

13

Eigen-analysis

• C = an NxN variance-covariance matrix

• Find the N solutions to C = = the N Eigenvalues, with 1 ≥ 2 ≥ …

= the N associated Eigenvectors

• C = LDL’, where

L = matrix of s

D = diagonal matrix of s

Page 14: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

14

Singular Value Decomposition

• Every NxP matrix A has a SVD

• A = U D V’

• Columns of U = Eigenvectors of AA’

• Entries in Diagonal Matrix D = Singular Values

= SQRT of Eigenvalues of either AA’ or A’A

• Columns of V = Eigenvectors of A’A

Page 15: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

15

SVD

Page 16: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

16

Principal Component Analysis

• A is a column-centered data matrix

• A = U D V’

• V’ = Row-wise Principal Components

• D ~ Proportional to variance explained

• UD = Principal Component Scores

• DV’ = Principle Axes

Page 17: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

17

Multidimensional Scaling• A is a column-centered dissimilarity matrix

• B =

• B = U D V’

• B = XX’, where X = UD1/2

• Limit X to 2 Columns Coordinates to 2d MDS

'

1'

1

2

1 2 iiN

IAiiN

I

Page 18: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

18

A RandomPermutation

Test

Given DissimilarityMatrices A and B:

N! Permutations37! = 1.4*E+43 8! = 40,320

Page 19: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

19

Permutation Tests

PermuteList & rerun

ObservedTestStatisticTS = 25# CorrectOf 37 SB.

Is 25Significantly> 18.5?

Ho: TS = 18.5HA: TS > 18.5

P = .069P > .05Do NotReject Ho

Page 20: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

20

Page 21: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

21http://www.entrenet.com/~groedmed/greekm/mythproc.html

Page 22: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

22

http://www.zoo.utoronto.ca/jackson/pro2.html

Centering &Scaling

MirrorReflection

Rotation &Dilation toMin ∑(є2)

Page 23: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

23

Procrustean Analysis

• Two NxP data configurations, X and Y• X’Y = U D V’• H = UV• OLS Min SSE = tr ∑(XH-Y)’(XH-Y)

= tr(XX’) + tr(YY’) -2tr(D)

= tr(XX’) + tr(YY’) – 2tr(VDV’)

Page 24: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

24

OLS Regression

• Y = X + • Y = Xb + e• X = UDV’• b = VrD-1Ur’Y, where r = first r columns (N>P)

• b = (X’X)-1X’Y

• b = VrVr’ • Estimated Y values = Ur Ur’Y

Page 25: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

25

Bidimensional Regression• (Y,X) = Coordinate pair in 2d Map 1

Y = 0 + 0X

• (A,B) = Coordinate pair in 2d Map 2

E[A] 1 1 -2 X 1

E[B] 1 2 1 Y 2

1 = Horizontal Translation

2 = Vertical Translation

= Scale Transformation = SQRT(12

+ 22)

= Angle Transformation = TAN-1(2 / 1 ) +1800

= + +

Iff 1 < 0

Page 26: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

26

Althoughr = 1,differ inlocation,scale, andangles ofrotationaroundorigin (0,0)

Horizontal& VerticalTranslation

Angle ofrotationaroundorigin (0,0)

Scaletransform,with < 1 ifcontration,& > 1 ifexpansion

Page 27: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

27

Working Example

• Eight URSB Campuses– RD, BK, TO, RC, SA, RV, SD, TA

• Data Sources– Locations– Housing Attributes– Tapestry Attributes

• Data Analyses

Page 28: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

28

Eight URSB Campuses

Page 29: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

29

87.5 miles

88.1 miles

Page 30: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

30

Page 31: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

31

Page 32: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

32

Page 33: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

33

EXAMPLE: Eight URSB Campuses

Page 34: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

34

Page 35: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

35

SD

TA

RDRVRCBK

TOSA

Page 36: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

36

… and if DISTANCES available, but COORDINATES Unavailable?

• Treat Distance Matrix as Dissimilarity Matrix

• Apply Multidimensional Scaling

• Apply the two-dimension solution “as if” it represents latitude and longitude coordinates

Page 37: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

37

Distance Estimates Vary

… But Not “Significantly”

Page 38: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

38

MDS RepresentationInput = D; Output = 2d

D8x8

Page 39: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

39

SD

TA

RD

RVRC

BK

TO

SA

Errors“appear”

to bequitesmall

…BUT

is therea wayto test

if errorsare

“STATSIGNIF”

?

Page 40: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

40

Mantel Test

Page 41: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

41

Procrustean Test:MDS Map Recreation

CONCLUDE: Near-perfect Map Recreation

Page 42: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

42

Driving Distances

Do these differ “significantly” from linear distances?

STATISTICAL PRACTICAL

Page 43: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

43

DriveD = Driving DistancesEight URSB Locations

Multidimensional Scaling,with 2-dimension solution

Page 44: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

44

SD

TA

RD

RVRC

BK

TO

SA

Page 45: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

45

Page 46: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

46

Bidimensional Regression:AB on YX

Page 47: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

47

PROTEST Comparison

BidimensionalRegression

ProcrusteanRotation

Page 48: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

48

Housing

Page 49: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

49

Tapestry (ESRI)

Page 50: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

50

Map Coordinates as Explanatory Variables in Linear Models

Page 51: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

51

Incremental Tests

So Map Coordinates seem sufficient as predictors

Page 52: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

52

Proxy Measures of lat-longin Linear Model

Translations& Transforms

Reduce 8

And ↑ R2

Page 53: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

53

Robust criterionwould help here:

Min (Med(є2))

Page 54: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

54

Bidimensional Regression

Is There a Linear RelationshipBetween Housing and Tapestry

Data?

r = 0.5449

MustStandardize

Data

Page 55: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

55

Page 56: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

56

It’s Still a 3-d World

Page 57: 1 Statistical Bases for Map Reconstructions and Comparisons Jerry Platt May 2005

57