national center for supercomputing applications gridchem: integrated cyber infrastructure for...

47
National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry http://www.gridchem.org Sudhakar Pamidighantam [email protected] 10 Oct 2005

Upload: aron-dickerson

Post on 28-Dec-2015

220 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem: Integrated Cyber Infrastructure for Computational

Chemistryhttp://www.gridchem.org

Sudhakar [email protected]

10 Oct 2005

Page 2: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Acknowledgements

Page 3: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Motivation

Software - Mature and easy to use to address chemists

questions of interest

Community of Users - Need and capable of using the software Some are non traditional computational

chemistsResources - Various in capacity and capability

Page 4: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Background

Qauntum Chemistry Remote Job Monitor

( Quantum Chemistry Workbench)

1998, NCSA

Chemviz

1999-2001, NSF

Page 5: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Technologies

Web Based Client Server Models

Visual Interfaces

Distributed computing

Page 6: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem

NCSA Alliance was commissioned 1998

Diverse HPC systems deployed

both at NCSA and Alliance Partner Sites

Batch schedulers different at sites

Policies favored different classes and modes of

use at different sites/HPC systems

Page 7: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Grid and Gridlock

Alliance lead to Physical Grid

Grid lead to TeraGrid

Homogenous Grid was planned but it was difficult to keep it homogenous

Things got more complicated and we have heterogeneous grid now!

Page 8: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Extended TeraGrid Facility

www.teragrid.org

Page 9: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Initial Grid Status

Grid Hardware

Middleware

ScientificApplications

Page 10: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Current Grid Status

Grid Hardware

Middleware

Scientific Applications

Interfaces

Page 11: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

User Community

Chemistry and Computational Biology

User BaseSep 03 – Oct 04

NRAC AAB Small Allocations

-------------------------------------------------------------

#PIs 26 23 64

#SUs 5,953,100 1,374,100 640,000

Page 12: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

User Issues• New systems meant learning new commands• Porting Codes• Learning new job submissions and

monitoring protocols• New proposals for time• Computational modeling became more

popular and users increased • Batch queues are longer / waiting increased• Find resources where to compute - probably

multiple distributed sites• Multiple proposals/allocations/logins• Authentication and Data Security • Data management

Page 13: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Computational Chemistry Grid

Integrated Cyber Infrastructure for Computational Chemistry

Integrates Applications, Middleware, HPC

resources, Scheduling and Data

management

Allocations, User Services and Training

Page 14: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Resources• Over 400 processors and 3,525,000 CPU hours available annually

System (Site) Procs Avail Total CPU Hours/Year

HP Intel Cluster (OSC) 12 100,000

Intel Cluster (OSC) 36 315,000

Intel Cluster (UKy) 96 840,000

HP Integrity Superdome 33 290,000

Intel Cluster (NCSA) 64 560,000

SGI Origin2000 (NCSA)IA32 Linux Cluster

12864

1,000,000560,000

Intel Cluster (LSU)Intel Cluster (LSU)

321024

280,0001,000,000

IBM Power4 (TACC) 16 140,000

Page 15: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Other Resources

Extant HPC resources at variousSupercomputer CentersNCSA Tungsten and Copper

Optionally TeraGrid/local/personal resources

These may require existing allocations/Authorization

Page 16: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Applications

• GridChem supports some apps already– Gaussian 98/03, GAMESS, Molpro, NWChem

• Schedule of integration of additional software– ACES-2– Crystal– Q-Chem– Wein2K– MCCCS Towhee – More …..

Page 17: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Details

• Three tiered architecture– GridChem client– middleware server– computational

resources

Page 18: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Grid Middleware Proxy Server

GridChem System

user user useruser user

PPortal Clientortal Client

Grid ServicesGrid Services

GridGrid

applicationapplicationapplicationapplication

Mass Storage

http://www.nsf.gov/awardsearch/showAward.do?AwardNumber=0438312

Page 19: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Client• Graphical user

interface (GUI)• Java desktop

application• Pre and Post

Processing

– helps scientists generate input

– submit and monitor quantum chemistry jobs remotely

– visualize output data

• Leverages internal development projects

Page 20: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Client Architecture• Composed of several

modules– authentication– job-editor

• molecule builder• visual molecular editor• molecular fragment

database• crystal structure database

– job submission– job manager

• job status info• output monitoring and

retrieval

Page 21: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Allocation

• GridChem is in friendly user mode• GridChem allocations are ready!

Visit https://www.gridchem.org/allocations/policy.htm

Simple one page submission

Password is user generated ( with strong suggestions of the type)

Registration of External allocations possible“Add Users” Facility Available for PI’s

Page 22: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Client Launch

Page 23: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem AuthenticationRequires a GridChem Login

Page 24: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Allocation Information

Page 25: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

User Preferences

Page 26: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Creating Jobs

Page 27: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Job Editor

Page 28: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Job EditorGamess Template

Page 29: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Job EditorNWChem Template

Page 30: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Nanocad Molecular Editor

Page 31: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Molecular Editor Based Input

Page 32: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Job SubmissionJob Object

.

Page 33: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Submit Jobs

Page 34: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

GridChem Job Management

Page 35: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Job Status Information

Page 36: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Monitor Output

Page 37: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Post-ProcessingLexical Analysis And Parsing

• Follows the progress of the calculation while it is still running or has completed.

• Plots the energy, gradient, etc versus iteration number.

• Displays the current (optimized) structure in the Molecular Editor

Page 38: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Post ProcessingResults

Page 39: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Allocations

Existing allocations at various extant

resources are supported

Community allocation to reduce proposal

writing/review “headaches” for already

“funded” research.

Page 40: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

User/Community Support• Support provided by distributed set of staff involved in the

project - Chemists, Programmers, Systems Engineers

• Problems tracking through single mechanism - PCS Ticketing System

https://www.gridchem.org/consult/

• Online documentation to be provided on the CCG website - FAQs, Benchmarking Informations, Tips/Tricks etc..

Page 41: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Training and Outreach Integration

• Develop modules on a set of topics– interface fundamentals (e.g., inputs, choice lists, controls, etc.)

authentication/authorization molecular builderjob manager resource managementpost-processing visualizationintegration of additional applications

• Provide as workshops and seminars– 5th Annual Computational Chemistry Conference at the UIUC, April

17-19, 2005 • Annual updates

– track advancements and additional technologies developed and integrate them into CCG

Page 42: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Work In Progress

New Resource Integration Easily manageable Resource Description

Facility to add or remove resources and validation Software Provide Pre and Post Processing for additional

Application SoftwareAutomated Distributed computingMetascheduling across Grid

Page 43: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Applications

• Gamess

• NWChem

• MolPro

Pre and Post Processing Support

Page 44: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Post Processing

Some example Visuals Arginine Gamess/6-31G*Total electronic density 2D - Slices

Page 45: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Molecular Orbitals 2D

Page 46: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

3D VRML Views

Page 47: National Center for Supercomputing Applications GridChem: Integrated Cyber Infrastructure for Computational Chemistry  Sudhakar

National Center for Supercomputing Applications

Questions

We would like to be responsive to community requests

Source is OPEN. Contributions welcome!https://www.gridchem.org/software/software.htm