grid computing for real world applications

44
Grid Computing for Real World Applications Suresh Marru Indiana University ([email protected]) 5th October 2005 OSCER Symposium @ OU

Upload: honey

Post on 13-Jan-2016

23 views

Category:

Documents


2 download

DESCRIPTION

Grid Computing for Real World Applications. Suresh Marru Indiana University ([email protected]). 5th October 2005 OSCER Symposium @ OU. Storms Forming. Forecast Model. Streaming Observations. Data Mining. On-Demand Storm predictions. Motivation: Scientific Challenges. - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Grid Computing for Real World Applications

Grid Computing forReal World Applications

Suresh MarruIndiana University([email protected])

5th October 2005OSCER Symposium @ OU

Page 2: Grid Computing for Real World Applications

Motivation: Scientific ChallengesScience Communities and Outreach

• Communities• CERN’s Large Hadron Collider

experiments

• Physicists working in HEP andsimilarly data intensive scientificdisciplines

• National collaborators and thoseacross the digital divide indisadvantaged countries

• Scope• Interoperation between LHC

Data Grid Hierarchy and ETF

• Create and Deploy ScientificData and Services Grid Portals

• Bring the Power of ETF to bearon LHC Physics Analysis: Helpdiscover the Higgs Boson!

• Partners• Caltech

• University of Florida

• Open Science Grid and Grid3

• Fermilab

• DOE PPDG

• CERN

• NSF GriPhyn and iVDGL

• EU LCG and EGEE

• Brazil (UERJ,…)

• Pakistan (NUST, …)

• Korea (KAIST,…)

LHC Data Distribution Model

Identify Genes

Phenotype 1 Phenotype 2 Phenotype 3 Phenotype 4

Predictive Disease Susceptibility

Physiology

Metabolism Endocrine

Proteome

Immune Transcriptome

BiomarkerSignatures

Morphometrics

Pharmacokinetics

EthnicityEnvironment

AgeGender

Genetics and Disease Susceptibility

Source: Terry Magnuson, UNC

On-DemandStorm predictions

StreamingObservations

Forecast Model

Data Mining

Storms Forming

The current and future generations

of scientific problems are:Data Oriented

Increasingly stream based. Often need petabyte archives.

In need of on-demand computing resources

Conducted by geographically distributed teams of specialists Users do not want to expend too

much time learning new technologies.

Page 3: Grid Computing for Real World Applications

Solution

• Grid Technology bridges the gap between the applications and the infrastructure.

Fine, but what the heck is grid computing? follow along to find out …

Adapt Grid Computing and solve every computing problem in this world.

Is this true? I wish it is, but not really ..then what ..

Page 4: Grid Computing for Real World Applications

Introduction to Grid

Grid Computing enables • sharing, • selection and • aggregation of a wide variety of geographically distributed resourcesincluding • supercomputers, • storage systems, • data sources and • specialized devices owned by different organizationsfor solving large-scale resource intensive problems inscience, engineering, and commerce.

Page 5: Grid Computing for Real World Applications

Power Grid Analogy

Supercomputers

Networks

Storage Devices

Resources

Grid Portal

MyProxy

Globus Server API

Globus ClientCog Kit API

Gateway Services

Instrumentation

Infrastructure

Interface

Users

Page 6: Grid Computing for Real World Applications

Key Features of Grid Computing

• Provides a secure infrastructure for computing on a distributed computing environment

• Provides single sign-on feature by which a user can authenticate once and perform multiple computations over extended period of time

• Facilitates inter-domain access mechanisms

• Better portability (code can run on many kinds of computers) and exportability (move files from one computer to another)

Page 7: Grid Computing for Real World Applications

Grid Security

• Grid Certificates:

• Needed for using the Grid

• Used to provide a set of privileges of one resource to another

• Provide the features of dynamic delegation, dynamic entities and repeated authentication

• Standard PKI infrastructure is used for validation

Page 8: Grid Computing for Real World Applications

Challenges of using Grid computing

• The concept of grid is promising but users have to cope up with .. Emerging technology Evolving standards Frequent new versions of middleware with little or no

backwards compatibilityUsers have to learn the technology to use it.

Alternatively, use grid-enabled science portals

Page 9: Grid Computing for Real World Applications

Science Portals

• The goals of a Science Portal are– To give a community of researchers easy access to

the tools, data and computational power needed to solve today’s scientific and engineering problems.

– To do this in a discipline specific language that is common to the target community.

– To hide any underlying Grid technology.

Page 10: Grid Computing for Real World Applications

Portal Science Capabilities

• Data Access is the most important– Allow the user community access to important shared data

resources• Visualize it, publish it, download it, curate it.

– Data Discovery• Searchable metadata directories.

• Web access to important tools– Web-form interfaces to allow users to run important community

codes– Webstart access to common java-based tools– Limited shell access - perhaps to a VM

• Workflow tools– Allow users to combine community codes into workflows

managed by the portal back-end resources.

Page 11: Grid Computing for Real World Applications

The Architecture of Gateway ServicesThe Users Desktop.The Users Desktop.

Gateway Services

Grid Portal Server

Grid Portal Server

Physical Resource Layer

Core Grid Services

Proxy CertificateServer / vault

Proxy CertificateServer / vault

Application EventsApplication EventsResource BrokerResource Broker

User MetadataCatalog

User MetadataCatalog

Replica MgmtReplica Mgmt

ApplicationWorkflow

ApplicationWorkflow

App. Resourcecatalogs

App. Resourcecatalogs

ApplicationDeployment

ApplicationDeployment

ExecutionManagement

ExecutionManagement

InformationServices

InformationServices

SelfManagement

SelfManagement

DataServices

DataServices

ResourceManagement

ResourceManagement

SecurityServicesSecurityServices

OGSA-like Layer

Page 12: Grid Computing for Real World Applications

Service Architecture

• The Foundation of the gateway science portal software is based on the concept of “services” and “service oriented architectures.”

Page 13: Grid Computing for Real World Applications

What’s a service anyway?

• A “web server” that runs an application for you.– You send it requests (XML documents) and it

processes the information and send replies (notifications) when it is done.

ApplicationService

1. Service Request

Compute Machine

2. Run Application

3. Publish notifications

Page 14: Grid Computing for Real World Applications

The Portal - Service interaction

Browser

Portal Server

1.

App serviceregistry

App ServiceInstance2.

3.

4. 5.

• Each application is deployed as a service which can be invoked by the portal or another service.– 1. User looks up & selects application services from

portal.– 2. Portal locates service instance.– 3. Service is contacted and replies with a interface

description– 4. Server displays the interface and user fills it out.– 5. Server create ws request and sends it to the app

service.

Page 15: Grid Computing for Real World Applications

What do we do with Applications?

• Service-oriented applications– Wrap applications as

services– Compose applications

into workflows– Execute applications on remote resources on

behalf of user in a secured manner

Page 16: Grid Computing for Real World Applications

What a User Gains By Using Grid and Portals

• As a direct user• – Can easily• Execute jobs at one or more remote sites• Move data between sites• All with single sign-on security• As a user of a grid enabled application• Will not see the grid• Will see an application whose development was• eased with grid functions or grid-based web services• Ease of development should result in more• applications or faster availability of applications

Page 17: Grid Computing for Real World Applications

What Application DevelopersGain by Using Grids and Portals

• Application web services can be built by re-using• capabilities provided by existing grid-enabled• Web services.• Applications can also be built by using grid• functions• Grid functions/services handle distributed• management of tasks and data• – Developer can focus on logic of application

and not• logic of distributed interaction

Page 18: Grid Computing for Real World Applications

Example Gateway LEAD – Linked Environments for Atmospheric Discovery

(Mesoscale Meteorology)NSF LEAD project - making the tools thatare needed to make accurate predictions of tornados and hurricanes. - Data exploration and Grid workflow

Page 20: Grid Computing for Real World Applications

LEAD utilizes grid tools adopting a strategy to deal with middleware issues

• Stick with pre-ws globus version (Globus 2.4) and slowly transition to GT 4

• Develop software on self-controlled test grid constituting of machines distributed at various partner institutions

• port the tested version on to teragrid – production grid resources by working closely with the resource providers

Page 21: Grid Computing for Real World Applications

Web ServicesComponents

Non-WS

Components

Data Management

SecurityCommonRuntime

Execution Management

Information Services

Pre-WSAuthenticationAuthorization

GridFTP

GridResource

Allocation Mgmt(Pre-WS GRAM)

Monitoring& Discovery

System(MDS2)

C CommonLibraries

GT2

WSAuthenticationAuthorization

ReliableFile

Transfer

OGSA-DAI[Tech Preview]

GridResource

Allocation Mgmt(WS GRAM)

Monitoring& Discovery

System(MDS4)

Java WS Core

CommunityAuthorization

ServiceGT3

ReplicaLocationService

XIO

GT3

CredentialManagement

GT4

Python WS Core[contribution]

C WS Core

CommunitySchedulerFramework

[contribution]

DelegationService

GT4

Pre-WS Globus components are still supported

Page 22: Grid Computing for Real World Applications

Java Services in Apache AxisPlus GT Libraries and Handlers

YourJavaService

YourPythonService

YourJavaService R

FT

GR

AM

Del

egat

ion

Inde

x

Trig

ger

Arc

hive

r

pyGlobusWS Core

YourC

Service

C WS Core

RLS

Pre

-WS

MD

S

CA

S

Pre

-WS

GR

AM

Sim

pleC

A

MyP

roxy

OG

SA

-DA

I

GT

CP

Grid

FT

P

C Services using GT Libraries and Handlers

SERVER

CLIENT

InteroperableWS-I-compliant

SOAP messaging

YourJavaClient

YourC

Client

YourPythonClient

YourJavaClient

YourC

Client

YourPythonClient

YourJavaClient

YourC

Client

YourPythonClient

YourJavaClient

YourC

Client

YourPythonClient

X.509 credentials =common authentication

Python hosting, GT Libraries

GT4 Components

Page 23: Grid Computing for Real World Applications

LEAD Test-bed Grid

Unidata

OU

UI IU

UAH

UNC

The LEAD Grid

Page 24: Grid Computing for Real World Applications

DEMO

• LEAD Portal demo

Page 25: Grid Computing for Real World Applications

The top level view

The current weather.Click on a location for

more data

The current testbed status

Top Level tabs to public Tools and information.

To get to your stuff, log in hereOr create a new account

Page 26: Grid Computing for Real World Applications

GEO Reference GUI Prototype• Use mouse to drag

a region of interest.• Fill in the data

requirements• The tool, when

finished will gather the data for you.

Page 27: Grid Computing for Real World Applications

Educational Resources

Page 28: Grid Computing for Real World Applications

Log in and see your MyLEAD Space

• x

Page 29: Grid Computing for Real World Applications

Searching MyLEAD

• Shots of the search tool.

Page 30: Grid Computing for Real World Applications

The Experiment Builder

• To review your previous experiments and create new ones• Experiments are organized into projects

– You can select an old one to look at,– Or create a new project or experiment.– Let’s do a new experiment! (click “new”)

Page 31: Grid Computing for Real World Applications

Creating a workflow for Data Mining• Use ADaM services from UAH

3DMesocyclone Detection

ESML_Converter

MinMaxNormalizer

Visualization

BayesClassifying

Feature Extraction

Service

Nexrad II RadarData

ESML Descriptor

Data Transformation

Service

Data Normalization

Service

Classification Service

Page 32: Grid Computing for Real World Applications

Provide a name and description• Next select an application from the dropdown list or create

a new workflow.• Once we have selected the app, we push “next” to add

data.

Page 33: Grid Computing for Real World Applications

Composing the Workflow• Graphical Composer

– Standard drop-and-drag composer model (like Kepler and others)

– Compiles Python or PBEL code

Page 34: Grid Computing for Real World Applications

Final Workflow

• Save it back to my lead• Next we must bind the inputs to the workflow

Page 35: Grid Computing for Real World Applications

Wizard understand the workflow requirements

Page 36: Grid Computing for Real World Applications

Select an output location

Page 37: Grid Computing for Real World Applications

Submitting the workflow

Page 38: Grid Computing for Real World Applications

Monitor results in real time

Page 39: Grid Computing for Real World Applications

Check it out in MyLEAD

Page 40: Grid Computing for Real World Applications

Click on the output file to see visualization

Page 41: Grid Computing for Real World Applications

Large workflows can be composed

Page 42: Grid Computing for Real World Applications

Output from the Weather Workflow

Page 43: Grid Computing for Real World Applications

Acknowledgements

Slide Courtesy:• Dr. Dennis Gannon

• Globus Website (http://globus.org)

Page 44: Grid Computing for Real World Applications

? ??