geospatial analysis in the cloud

90
Use of Cloud Computing for scalable geospatial data processing and access Andrew Turner CTO, FortiusOne [email protected] Partner: U.S. Federal Geographic Data Committee

Upload: andrew-turner

Post on 16-May-2015

999 views

Category:

Technology


2 download

DESCRIPTION

Presented at the Government Cloud Service Oriented Architecture Workshop

TRANSCRIPT

Page 1: Geospatial Analysis in the Cloud

Use of Cloud Computing for scalable geospatial data processing and access

Andrew TurnerCTO, [email protected]

Partner: U.S. Federal Geographic Data Committee

Page 2: Geospatial Analysis in the Cloud

What is GeoCommons?A Brief History

Page 3: Geospatial Analysis in the Cloud

Vulnerability Identification

Chicago

Denver

Route 2

Route 1Los Angeles

Atlanta Fiber Density

Electric Transmission Line

Density

Page 4: Geospatial Analysis in the Cloud
Page 5: Geospatial Analysis in the Cloud
Page 6: Geospatial Analysis in the Cloud

Baseline connectivity of a fiber network provider in NYC. This particular provider is a good proxy for the structure of the entire island of Manhattan since they have about 80% of the right of ways on the island and a large number of egress points off the island. The higher the peak in the map the more frequently used the path is as a possible routing path.

WTC

Holland Tunnel

Columbus Circle

Page 7: Geospatial Analysis in the Cloud

Lastly a scenario is run where just 10,000 sq ft. of damage is done to the Holland Tunnel and the impact calculated. The result is a 8.6% loss of network connectivity, 134 times the impact of the WTC simulation. The dramatic impact is seen in the image from the loss as well as the stress put on the GW Bridge route out of the city.

Page 8: Geospatial Analysis in the Cloud

GeoCommons: Version 1

Page 9: Geospatial Analysis in the Cloud
Page 10: Geospatial Analysis in the Cloud

Find interesting data

Page 11: Geospatial Analysis in the Cloud

Find interesting data

Map arelevant area

Page 12: Geospatial Analysis in the Cloud

Find interesting data

Map arelevant area

Visualize to find meaning

Page 13: Geospatial Analysis in the Cloud

Find interesting data

Map arelevant area

Visualize to find meaning

Layer, Modify,and Analyze

Page 14: Geospatial Analysis in the Cloud

Find interesting data

Map arelevant area

Visualize to find meaning

Collaborate with others

Layer, Modify,and Analyze

Page 15: Geospatial Analysis in the Cloud

Find interesting data

Map arelevant area

Visualize to find meaning

Collaborate with others

Publish and share results

Layer, Modify,and Analyze

Page 16: Geospatial Analysis in the Cloud
Page 17: Geospatial Analysis in the Cloud
Page 18: Geospatial Analysis in the Cloud
Page 19: Geospatial Analysis in the Cloud

Visualization

Page 20: Geospatial Analysis in the Cloud

Analysis

Page 21: Geospatial Analysis in the Cloud

Applying Lessons Learned

Page 22: Geospatial Analysis in the Cloud

Modularize

MakerFinder

CoreRESTfulInterfaces

Application Programming Interface

Page 23: Geospatial Analysis in the Cloud

Relational Databases Don’t Scale Well

Page 24: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Page 25: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Page 26: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store

Page 27: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store

Page 28: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store

Page 29: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store

Page 30: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store

Download

Page 31: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store Analyze

Download

Page 32: Geospatial Analysis in the Cloud

Datasets as Databases

MakerFinder

Core

KML

Shapefile

CSV (Excel)

GeoRSS

Documents

Upload

Parse & Store Analyze

Download

Visualize

Page 33: Geospatial Analysis in the Cloud

Geospatial Catalog and Server

Page 34: Geospatial Analysis in the Cloud

Delivery Mechanisms

Page 35: Geospatial Analysis in the Cloud

Appliances

• Sun 4150• RAID Array

Page 36: Geospatial Analysis in the Cloud

Web Scaled Racks

• 3 Appliances• Network File Storage• Load Balancer• Monitoring and Tunnels• Production & Staging racks• Racks in office for development

Page 37: Geospatial Analysis in the Cloud

Limits in Scaling

Limits in Development

Page 38: Geospatial Analysis in the Cloud

Limits in Scaling

People

Limits in Development

Page 39: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePower

Limits in Development

Page 40: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSize

Limits in Development

Page 41: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSizeCost

Limits in Development

Page 42: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSizeCostTime

Limits in Development

Page 43: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSizeCostTime

Limits in Development

Page 44: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSizeCostTime

Limits in Development

Testing on “clean” machines

Page 45: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSizeCostTime

Limits in Development

Testing on “clean” machines

Deployment testing of upgrades

Page 46: Geospatial Analysis in the Cloud

Limits in Scaling

PeoplePowerSizeCostTime

Limits in Development

Testing on “clean” machines

Deployment testing of upgrades

Controlled Environments

Page 47: Geospatial Analysis in the Cloud

url

Leveraging the Cloud

http

://w

ww

.flic

kr.c

om/p

hoto

s/kk

y/70

4056

791

Page 48: Geospatial Analysis in the Cloud

Amazon Web Services

Page 49: Geospatial Analysis in the Cloud

Management Consoles

Page 50: Geospatial Analysis in the Cloud

Processing via MapReduce

Page 51: Geospatial Analysis in the Cloud

Launching New Instances

Page 52: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

Page 53: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

Page 54: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

bundle

register

Page 55: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

bundle

register

instantiate

Page 56: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

bundle

register

instantiate

Page 57: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

bundle

register

instantiate

Page 58: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

bundle

register

instantiate

Page 59: Geospatial Analysis in the Cloud

Elastic Computing Cluster - EC2

• Virtual Servers

• Machine Images (AMI)

• On-Demand

CentOS AMI

build

bundle

register

instantiate

Page 60: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

Create EBS

100 GB

Page 61: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

100 GB

Page 62: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Page 63: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Diff v1S3

Page 64: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Diff v2Diff v1S3

Page 65: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Diff v2

Create & AttachDiff v1S3

Page 66: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Diff v2

Create & AttachDiff v1S3

Page 67: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Diff v2

Create & AttachDiff v1S3

Page 68: Geospatial Analysis in the Cloud

Elastic Block Store - EBS

attach

Create EBS

snapshot100 GB

Diff v2

Create & AttachDiff v1S3

Page 69: Geospatial Analysis in the Cloud

Public Datasets

Page 70: Geospatial Analysis in the Cloud

Additional Benefits

• Federation

• Tile generation

• Content-delivery System

• Simple Queue System (SQS)

tiles/openstreetmap/9/74/97.png

tiles/openstreetmap/9/74/98.png

tiles/bluemarble/9/74/97.png

tiles/bluemarble/9/74/98.pngS3 Storage

Page 71: Geospatial Analysis in the Cloud

Cloud Architecture

• EC2 image of current system architecture

• EBS image stored to S3 of default database

• Current application release in S3

• Start an EC2, attach data, attach code, startup

Default

Datasets

v1.4.3

Page 72: Geospatial Analysis in the Cloud

Cloud Architecture

• EC2 image of current system architecture

• EBS image stored to S3 of default database

• Current application release in S3

• Start an EC2, attach data, attach code, startup

createinstance

Default

Datasets

v1.4.3

Page 73: Geospatial Analysis in the Cloud

Cloud Architecture

• EC2 image of current system architecture

• EBS image stored to S3 of default database

• Current application release in S3

• Start an EC2, attach data, attach code, startup

createinstance

Default

Datasets

v1.4.3

Page 74: Geospatial Analysis in the Cloud

Cloud Architecture

• EC2 image of current system architecture

• EBS image stored to S3 of default database

• Current application release in S3

• Start an EC2, attach data, attach code, startup

attach data

createinstance

Default

Datasets

v1.4.3

Page 75: Geospatial Analysis in the Cloud

Cloud Architecture

• EC2 image of current system architecture

• EBS image stored to S3 of default database

• Current application release in S3

• Start an EC2, attach data, attach code, startup

attach data

createinstance

Default

Datasets

v1.4.3

Backup BackupBackup

Snapshot

Page 76: Geospatial Analysis in the Cloud

Cloud Architecture

• EC2 image of current system architecture

• EBS image stored to S3 of default database

• Current application release in S3

• Start an EC2, attach data, attach code, startup

attach data

createinstance

Default

Datasets

v1.4.3

Backup BackupBackup

Snapshot

Cache Downloads

S3

Page 77: Geospatial Analysis in the Cloud

Scaling

• RESTful architecture

• Caching for speed, and CDN support

• Amazon Web Services

• CloudWatch

• Elastic Scaling

• Load Balancer

Page 78: Geospatial Analysis in the Cloud

Private Instances

Page 79: Geospatial Analysis in the Cloud

First Users: Meedan, Media

Page 80: Geospatial Analysis in the Cloud

Repeatable

Page 81: Geospatial Analysis in the Cloud

Repeatable

Page 82: Geospatial Analysis in the Cloud

Data Federation

community

Page 83: Geospatial Analysis in the Cloud

Geospatial Federated Search Search

Page 84: Geospatial Analysis in the Cloud

Geocoding

Page 85: Geospatial Analysis in the Cloud

Geocoding - Scale as Required

TIGER/LineSQLite

Geocoding Engine

API

UploadCSV

GeocodeCacheResults

Page 86: Geospatial Analysis in the Cloud

Geocoding - Scale as Required

TIGER/LineSQLite

Geocoding Engine

API

UploadCSV

GeocodeCacheResults

Page 87: Geospatial Analysis in the Cloud

Best Practices Applied to the Government

• Built using open, established tools

• Full choice - Linux, Windows

• Full Control

• Repeatable processes

• Continual backup

• Scaling dynamic and large datasets

• Synchronous and Asynchronous analysis

Page 88: Geospatial Analysis in the Cloud

Level of Maturity

• Widely adopted

• Broad support and ecosystem

• Full stack support

Page 89: Geospatial Analysis in the Cloud

Perceived Impediments to Adoption

• Single Vendor (open-source alternatives arising)

• Maintenance and Location

• Data Security