chapters 9 and 10, longley et al. data bases: population and maintenance geog 176b lecture 8

42
Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Upload: nyah-clemmens

Post on 31-Mar-2015

229 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Chapters 9 and 10, Longley et al.

Data Bases: Population and Maintenance

Geog 176B Lecture 8

Page 2: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Data Collection

One of most expensive GIS activitiesMany diverse sources (source integration, data fusion, interoperability)Two broad types of collection

Data capture (direct collection)Data transfer

Two broad capture methodsPrimary (direct measurement)Secondary (indirect derivation)

Page 3: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Stages in Data Collection Projects

Planning

Preparation

Digitizing / TransferEditing / Improvement

Evaluation

Page 4: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Data Collection Techniques

Raster VectorPrimary Digital remote

sensing imagesGPS measurements

Digital aerial photographs

Survey measurements

Secondary Scanned maps Topographic surveys

DEMs from maps

Toponymy data sets from atlases

Page 5: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Primary Data Capture

Capture specifically for GIS useRaster – remote sensing

e.g. SPOT and IKONOS satellites and aerial photographyPassive and active sensors

Resolution is key considerationSpatialSpectralTemporal

Page 6: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

www.spot.ucsb.edu

Page 7: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Imagery for GIS

Page 8: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Vector Primary Data Capture

SurveyingLocations of objects determines by angle and distance measurements from known locationsUses expensive field equipment and crewsMost accurate method for large scale, small areas

GPSCollection of satellites used to fix locations on Earth’s surfaceDifferential GPS used to improve accuracy

Page 9: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Total Station

Page 10: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Pen/Portable PC and GPS

Page 11: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Secondary Geographic Data Capture

Data collected for other purposes can be converted for use in GISRaster conversion

Scanning of maps, aerial photographs, documents, etcImportant scanning parameters are spatial and spectral (bit depth) resolution

Page 12: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Scanner

Page 13: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Raster to vector conversion

Page 14: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Vector Secondary Data Capture

Collection of vector objects from maps, photographs, plans, etc.Digitizing

Manual (table) Heads-up and vectorization

Photogrammetry – the science and technology of making measurements from photographs, etc.

Page 15: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Digitizer

Page 16: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Data Transfer

Buy vs. build is an important questionMany widely distributed sources of GIIncludes geocodingKey catalogs include

Geodata.govGeography Network

Access technologiesTranslationDirect read

Page 17: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Managing Data Capture ProjectsKey principles

Clear plan, adequate resources, appropriate funding, and sufficient time

Fundamental tradeoff among Quality, accuracy, speed and price

Two strategiesIncremental‘Blitzkrieg’

Alternative resource optionsIn houseSpecialist external agency

Page 18: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Map scale Ground distance corresponding to 0.5 mm map distance

1:1250 62.5 cm

1:2500 1.25 m

1:5000 2.5 m

1:10,000 5 m

1:24,000 12 m

1:50,000 25 m

1:100,000 50 m

1:250,000 125 m

1:1,000,000 500 m

1:10,000,000 5 km

A useful rule of thumb is that positions measured from maps are accurate to about 0.5 mm on the map. Multiplying this by the

scale of the map gives the corresponding distance on the ground.

Page 19: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Positional Accuracy (cont.)within a database a typical UTM coordinate pair might be:Easting 579124.349 mNorthing 5194732.247 mIf the database was digitized from a 1:24,000 map sheet, the last four digits in each coordinate (units, tenths, hundredths, thousandths) would be questionable

Page 20: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Testing Positional AccuracyUse an independent source of higher accuracy:

find a larger scale mapuse precision GPS

Use internal evidence:digitized polygons that are unclosed, lines that overshoot or undershoot nodes, etc. are indications of errorsizes of gaps, overshoots, etc. may be a measure of positional accuracy

Page 21: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Testing Accuracy (cont.)Compute accuracy from knowledge of the errors introduced by different sourcese.g., 1 mm in source document0.5 mm in map registration for digitizing0.2 mm in digitizingif sources combine independently, we can get an estimate of overall accuracy...

(12 + 0.52 + 0.22) 0.5 = 1.14 mm

Page 22: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Definitions

Database – an integrated set of data (attributes) on a particular subjectGeographic (=spatial) database - database containing geographic data of a particular subject for a particular areaDatabase Management System (DBMS) – software to create, maintain and access databases

Page 23: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

A GIS links attribute and spatial data

Attribute Data• Flat File• Relations

Map Data• Point File• Line File• Area File• Topology• Theme

Page 24: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Advantages of Databases over Files

Avoids redundancy and duplicationReduces data maintenance costsFaster for large datasetsApplications are separated from the data

Applications persist over timeSupport multiple concurrent applications

Better data sharingSecurity and standards can be defined and enforced

Page 25: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Disadvantages of Databases over Files

ExpenseComplexityPerformance – especially complex data typesIntegration with other systems can be difficult

Page 26: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Types of DBMS Model

HierarchicalNetworkRelational - RDBMSObject-oriented - OODBMSObject-relational - ORDBMS

Page 27: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Relational Databases rule now

Page 28: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Characteristics of DBMS (1)

Data model support for multiple data types

e.g MS Access: Text, Memo, Number, Date/Time, Currency, AutoNumber, Yes/No, OLE Object (MS Object linking and embedding), Hyperlink, Lookup Wizard

Load data from files, databases and other applicationsIndex for rapid retrieval

Page 29: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Characteristics of DBMS (2)

Query language – SQLSecurity – controlled access to data

Multi-level groups (e.g. census, NGA)

Controlled update using a transaction managerVersioningBackup and recovery

Page 30: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Characteristics of DBMS (3)

ApplicationsForms builderReportwriterInternet Application ServerCASE tools

Programmable API (Applications program interface)

Page 31: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Geographic Information

System

Database Management

System

• Data load• Editing• Visualization• Mapping• Analysis

• Storage• Indexing• Security• Query

Data

System TaskRole of DBMS

Page 32: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Relational DBMS (1)

Data stored as tuples (tup-el), conceptualized as tablesTable – data about a class of objects

Two-dimensional list (array)Rows = objectsColumns = object states (properties, attributes)

Page 33: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Table

Row = objectVector feature

Column = attribute

Page 34: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Relational DBMS (2)

Most popular type of DBMSOver 95% of data in DBMS is in RDBMS

Commercial systemsIBM DB2InformixMicrosoft AccessMicrosoft SQL ServerOracleSybase

Page 35: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

SQL

Structured (Standard) Query Language – (pronounced SEQUEL)Developed by IBM in 1970sNow de facto and de jure standard for accessing relational databasesThree types of usage

Stand alone queriesHigh level programmingEmbedded in other applications

Page 36: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Types of SQL Statements

Data Definition Language (DDL)Create, alter and delete dataCREATE TABLE, CREATE INDEX

Data Manipulation Language (DML)Retrieve and manipulate dataSELECT, UPDATE, DELETE, INSERT

Data Control Languages (DCL)Control security of dataGRANT, CREATE USER, DROP USER

Page 37: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Relational Join

Fundamental query operationOccurs because

Data created/maintained by different users, but integration needed for queries

Table joins use common keys (column values)Table (attribute) join concept has been extended to geographic case

Page 38: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

JoinRecord ID

Address

#cars

1241 123 State St. 3

1242 1801 Main St. 1

1243 2106 Elm St. 2

1244 7262 Pine Drive 1

1241 Ford 2003

1241 Subaru 2000

1241 Honda 1999

1241 123 State St.

Ford

1241 123 State St.

Subaru

1241 123 State St.

Honda

1242 1801 Elm St.

Kia

Page 39: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Spatial indexing

Many maps tiledB-tree (Balanced) Grid indexingQuad tree: Points/regionsR-tree (Based on MBR)

Page 40: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

New global/spatial grids: QTM

Page 41: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Go2 Grids38:53:22.08N 077:02:06.86WUS.DC.WAS.54.18.28.83.11US.CA.SBA.UCSB.UCEN

Page 42: Chapters 9 and 10, Longley et al. Data Bases: Population and Maintenance Geog 176B Lecture 8

Spatial Search:Gateway to Spatial Analysis

Overlay is a spatial retrieval operation that is equivalent to an attribute join. Buffering is a spatial retrieval around points, lines, or areas based on distance.