analyzing large earth data sets: new tools from the optiputer and looking projects

29
“Analyzing Large Earth Data Sets: New Tools from the OptIPuter and LOOKING Projects” Presentation to 3 rd Annual GEON Meeting Bahia Resort San Diego, CA May 5, 2005 Dr. Larry Smarr Director, California Institute for Telecommunications and Information Technology Harry E. Gruber Professor, Dept. of Computer Science and Engineering Jacobs School of Engineering, UCSD

Upload: larry-smarr

Post on 20-Aug-2015

354 views

Category:

Education


1 download

TRANSCRIPT

Page 1: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

“Analyzing Large Earth Data Sets: New Tools from the OptIPuter and

LOOKING Projects”

Presentation to 3rd Annual GEON Meeting

Bahia Resort

San Diego, CA

May 5, 2005

Dr. Larry Smarr

Director, California Institute for Telecommunications and Information Technology

Harry E. Gruber Professor,

Dept. of Computer Science and Engineering

Jacobs School of Engineering, UCSD

Page 2: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Abstract

Earth and ocean sciences are powerful application drivers for extending the Grid to the LambdaGrid. In the NSF OptIPuter project, the Grid, which is defined on the best effort shared internet, is extended to dedicated 1 or 10 Gb/s optical circuits, thereby adding predictability to the network underpinning the Grid middleware. This project is driven by both medical and earth sciences, in particular, EarthScope, the Mars rovers, and large scale integration of a variety of earth sciences data. Much progress has been made in scalable visualization nodes for the end user, which have been distributed through the GeoWall Consortium. A newer NSF grant LOOKING is extending the OptIPuter to include an integration of Web and Grid Services for remote control of ocean observatory instruments. Ontology for the ocean sciences is a central part of the LOOKING project, with strong overlap with GEON. We look toward the future in which GEON will utlize some of these more advanced services, creating a unified ontology and middleware system for the earth and ocean sciences.

Page 3: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Calit2 -- Research and Living Laboratorieson the Future of the Internet

www.calit2.net

UC San Diego & UC Irvine FacultyWorking in Multidisciplinary Teams

With Students, Industry, and the Community

Page 4: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Two New Calit2 Buildings Will Provide a Persistent Collaboration “Living Laboratory”

• Over 1000 Researchers in Two Buildings

• International Conferences and Testbeds

• Will Create New Laboratory Facilities– SDSC/Calit2 Synthesis Center– SDSC Data Group

Bioengineering

UC San Diego

UC Irvine

California Provided $100M for BuildingsIndustry Partners $85M, Federal Grants $250M

Page 5: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Challenge: Average Throughput of NASA Data Products to End User is Only < 50 Megabits/s

Tested from GSFC-ICESATJanuary 2005

http://ensight.eos.nasa.gov/Missions/icesat/index.shtml

Page 6: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

San Francisco Pittsburgh

Cleveland

National Lambda Rail (NLR) and TeraGrid Provides Researchers a Cyberinfrastructure Backbone

San Diego

Los Angeles

Portland

Seattle

Pensacola

Baton Rouge

HoustonSan Antonio

Las Cruces /El Paso

Phoenix

New York City

Washington, DC

Raleigh

Jacksonville

Dallas

Tulsa

Atlanta

Kansas City

Denver

Ogden/Salt Lake City

Boise

Albuquerque

UC-TeraGridUIC/NW-Starlight

Chicago

International Collaborators

NLR 4 x 10Gb Lambdas Initially Capable of 40 x 10Gb wavelengths at Buildout

NSF’s TeraGrid Has 4 x 10Gb Lambda Backbone

Links Two Dozen State and Regional Optical

Networks

DOE, NSF, & NASA

Using NLR

Page 7: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Lambdas Provide Global Access to Large Data Objects and Remote Instruments

Global Lambda Integrated Facility (GLIF)Integrated Research Lambda Network

Visualization courtesy of Bob Patterson, NCSA

www.glif.is

Created in Reykjavik, Iceland Aug 2003

Page 8: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

September 26-30, 2005Calit2 @ University of California, San Diego

California Institute for Telecommunications and Information Technology

The Networking Double Header of the Century Will Be Driven by LambdaGrid Applications

iGrid

2oo5T H E G L O B A L L A M B D A I N T E G R A T E D F A C I L I T Y

Maxine Brown, Tom DeFanti, Co-Organizers

www.startap.net/igrid2005/

http://sc05.supercomp.org

Page 9: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

The OptIPuter Project – Creating a LambdaGrid “Web” for Gigabyte Data Objects

• NSF Large Information Technology Research Proposal– Calit2 (UCSD, UCI) and UIC Lead Campuses—Larry Smarr PI– Partnering Campuses: USC, SDSU, NW, TA&M, UvA, SARA, NASA

• Industrial Partners– IBM, Sun, Telcordia, Chiaro, Calient, Glimmerglass, Lucent

• $13.5 Million Over Five Years• Linking User’s Linux Clusters to Remote Science ResourcesNIH Biomedical Informatics NSF EarthScope

and ORION

http://ncmir.ucsd.edu/gallery.html

siovizcenter.ucsd.edu/library/gallery/shoot1/index.shtml

Research Network

Page 10: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Optical Networking, Internet Protocol, ComputerBringing the Power of Lambdas to Users

• Complete the Grid Paradigm by Extending Grid Middleware to Control Jitter-Free, Fixed Latency, Predictable Optical Circuits– One or Parallel Dedicated Light-Pipes

– 1 or 10 Gbps WAN Lambdas

– Uses Internet Protocol, But Does NOT Require TCP – Exploring Both Intelligent Routers and Passive Switches

• Tightly Couple to End User Clusters Optimized for Storage, Visualization, or Computing– Linux Clusters With 1 or 10 Gbps I/O per Node– Scalable Visualization Displays with OptIPuter Clusters

• Applications Drivers: – Earth and Ocean Sciences– Biomedical Imaging– Designed to Work with any Discipline Driver

Page 11: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Earth and Planetary Sciences: High Resolution Portals to Global Earth Sciences DataEVL Varrier Autostereo 3D Image USGS 30 MPixel Portable Tiled Display

SIO HIVE 3 MPixel Panoram

Schwehr. K., C. Nishimura, C.L. Johnson, D. Kilb, and A. Nayak, "Visualization Tools Facilitate Geological Investigations of Mars Exploration Rover Landing Sites",

IS&T/SPIE Electronic Imaging Proceedings, in press, 2005

Page 12: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Tiled Displays Allow for Both Global Context and High Levels of Detail—150 MPixel Rover Image on 40 MPixel OptIPuter Visualization Node Display

"Source: Data from JPL/Mica; Display UCSD NCMIR, David Lee"

Page 13: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Interactively Zooming In Using UIC’s Electronic Visualization Lab’s JuxtaView Software

"Source: Data from JPL/Mica; Display UCSD NCMIR, David Lee"

Page 14: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Highest Resolution Zoom

"Source: Data from JPL/Mica; Display UCSD NCMIR, David Lee"

Page 15: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Landsat7 Imagery100 Foot Resolution

Draped on elevation data

High Resolution Aerial Photography Generates Images With 10,000 Times More Data than Landsat7

Shane DeGross, Telesis

USGSNew USGS Aerial ImageryAt 1-Foot Resolution

~10x10 square miles of 350 US Cities 2.5 Billion Pixel Images Per City!

Page 16: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Multi-Gigapixel Images are Available from Film Scanners Today

The Gigapxl Projecthttp://gigapxl.org

Balboa Park, San Diego

Page 17: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Large Image with Enormous DetailRequire Interactive LambdaVision Systems

The OptIPuter Project is Pursuing Obtaining Some of these Images for

LambdaVision 100M Pixel Walls

http://gigapxl.org 1/1000th the Area of Previous Image

Page 18: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

OptIPuter Scalable Displays Have Been Extended to Apple-Based Systems “iWall Driven by iCluster”

Source: Atul Nayak, SIOCollaboration of

Calit2/SIO/OptIPuter/USArray

Source: Falko Kuester, Calit2@UCINSF Infrastructure Grant

See GEON Poster: iCluster : Visualizing USArray Data on a Scalable High Resolution Tiled Display Using the OptIPuter

16 Mpixels50 Mpixels

36 Mpixels100 Mpixels

Apple G5s

MacApple 30-inch Cinema HD Display

Page 19: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Personal GeoWall 2 (PG2): Individual OptIPuter User Node

Dual-output for stereo visualization (GeoWall)

LCD array for high-resolution display (7.7 Mpixels)

Single 64-bit PC

Demonstrated by EVL (UIC) at 4th GeoWall Consortium Meeting

Page 20: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Campuses Must Provide Fiber Infrastructure to End-User Laboratories & Large Rotating Data StoresSIO Ocean Supercomputer

IBM Storage Cluster

2 Ten Gbps Campus Lambda Raceway

Streaming Microscope

Source: Phil Papadopoulos, SDSC, Calit2

UCSD Campus LambdaStore Architecture

Global LambdaGrid

Page 21: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

UCSD

StarLight Chicago

UIC EVL

NU

CENIC San Diego GigaPOP

CalREN-XD

8

8

The OptIPuter LambdaGrid is Rapidly Expanding

NetherLight Amsterdam

U Amsterdam

NASA Ames

NASA GoddardNLRNLR

2

SDSU

CICESE

via CUDI

CENIC/Abilene Shared Network

1 GE Lambda

10 GE Lambda

PNWGP Seattle

CAVEwave/NLR

NASA JPL

ISI

UCI

CENIC Los Angeles

GigaPOP

22

Source: Greg Hidley, Aaron Chin, Calit2

Page 22: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

OptIPuter Middleware Architecture-- The Challenge of Transforming Grids into LambdaGrids

Distributed Applications/ Web Services

Telescience

GTP XCP UDT

LambdaStreamCEP RBUDP

Vol-a-Tile

SAGE JuxtaView

Visualization

DVC ConfigurationDVC API

DVC Runtime Library

Data Services

LambdaRAM

Globus

XIOPIN/PDC

DVC Services

DVC Core Services

DVC Job Scheduling

DVCCommunication

Resource Identify/Acquire

NamespaceManagement

Security Management

High SpeedCommunication

Storage Services

GRAM GSI RobuStore

Photonic Infrastructure

Page 23: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Interactive Retrieval and Hyperwall Display of Earth Sciences Images Using NLR

Earth Science Data Sets Created by GSFC's Scientific Visualization Studio were Retrieved Across the NLR in Real Time from

OptIPuter servers in Chicago and San Diego and from GSFC Servers in McLean, VA, and Displayed at the SC2004 in Pittsburgh

Enables Scientists To Perform Coordinated Studies Of

Multiple Remote-Sensing Datasets

http://esdcd.gsfc.nasa.gov/LNetphoto3.html

Source: Milt Halem & Randall Jones, NASA GSFC& Maxine Brown, UIC EVL

Eric Sokolowsky

Page 24: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

LOOKING: (Laboratory for the Ocean Observatory

Knowledge Integration Grid)

Adding Web and Grid Services to Lambdas to Provide Real Time Control of Ocean Observatories

• Goal: – Prototype Cyberinfrastructure for NSF’s

Ocean Research Interactive Observatory Networks (ORION)

• LOOKING NSF ITR with PIs:– John Orcutt & Larry Smarr - UCSD– John Delaney & Ed Lazowska –UW– Mark Abbott – OSU

• Collaborators at:– MBARI, WHOI, NCSA, UIC, CalPoly,

UVic, CANARIE, Microsoft, NEPTUNE-Canarie

www.neptune.washington.edu

http://lookingtosea.ucsd.edu/

Page 25: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Pilot Project ComponentsPilot Project Components

LOOKING Builds on the Multi- Institutional SCCOOS Program, OptIPuter, and CENIC-XD

• SCCOOS is Integrating:– Moorings– Ships– Autonomous Vehicles – Satellite Remote Sensing– Drifters– Long Range HF Radar – Near-Shore

Waves/Currents (CDIP)– COAMPS Wind Model– Nested ROMS Models– Data Assimilation and

Modeling– Data Systems

www.sccoos.org/

www.cocmp.org

Yellow—Initial LOOKING OptIPuter Backbone Over CENIC-XD

Page 26: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

ROADNet Architecture: SensorNets, Storage Research Broker, Web Services, Work Flow

KeplerWeb ServicesSRBAntelope

Frank Vernon, SIO; Tony Fountain, Ilkay Altintas, SDSC

Page 27: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

LOOKING Service-Oriented System Software Architecture

http://lookingtosea.ucsd.edu/

Page 28: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

LOOKING High-Definition Interactive Instrument Cluster Goals

• Multiple Instruments on Ocean Floor– Operated Through Ocean Observing Workbench

• Feature Identification and Analysis– Exercising Metadata – Ontology Development

• Command & Control of an Instrument Cluster– Exercising Instrument Command Interface – Resource Management– Coordinated Control of Multiple Instruments

• Utilization of High-Bandwidth Cabled-Network– Linked to User’s Over NLR with OptIPuter Middleware

Page 29: Analyzing Large Earth Data Sets: New Tools from the OptiPuter and LOOKING Projects

Proposed Experiment for iGrid 2005 –Remote Interactive HD Imaging of Deep Sea Vent

Source John Delaney & Deborah Kelley, UWash

To Starlight, TRECC,

and ACCESS