the center for computational research - university …university at buffalo the state university of...

38
Russ Miller Center for Computational Research Computer Science & Engineering SUNY-Buffalo Hauptman-Woodward Medical Inst The Center for Computational Research: Grid, Visualization, and BioMedical Computing University at Buffalo The State University of New York NSF, NIH, DOE NIMA, NYS, HP

Upload: others

Post on 20-Jun-2020

0 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

Russ MillerCenter for Computational ResearchComputer Science & Engineering SUNY-BuffaloHauptman-Woodward Medical Inst

The Center for Computational Research:Grid, Visualization, and BioMedical Computing

University at BuffaloThe State University of New York

NSF, NIH, DOE NIMA, NYS, HP

Page 2: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Center for Computational Research1999-2004 Snapshot

Raptor Image

n High-Performance Computing and High-End Visualizationq 110 Research Groups in 27 Deptsq 13 Local Companies q 10 Local Institutions

n External Fundingq $116M External Fundingm$16M as leadm$100M in support

q $43M Vendor Donationsq Total Leveraged: $0.5B

n Deliverablesq 400+ Publicationsq Software, Media, Algorithms, Consulting,

Training, CPU Cycles…

Page 3: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

n Apex Bioinformatics Systemq Sun V880 (3), Sun 6800q Sun 280R (2)q Intel PIIIsq Sun 3960: 7 TB Disk Storage

n HP/Compaq SANq 75 TB Diskq 190 TB Tapeq 64 Alpha Processors (400 MHz) q 32 GB RAM; 400 GB Disk

n IBM RS/6000 SP: 78 Processorsn Sun Cluster: 80 Processorsn SGI Intel Linux Clusterq 150 PIII Processors (1 GHz)qMyrinet

Major Compute Resources n Dell Linux Cluster: #22→#25→#38→#95q 600 P4 Processors (2.4 GHz)q 600 GB RAM; 40 TB Disk; Myrinet

n SGI Origin3700 (Altix)q 64 Processors (1.3GHz ITF2)q 256 GB RAMq 2.5 TB Disk

n SGI Origin3800q 64 Processors (400 MHz) q 32 GB RAM; 400 GB Disk

n Dell Linux Cluster: #187→#368→off q 4036 Processors (PIII 1.2 GHz)q 2TB RAM; 160TB Disk; 16TB SAN

n IBM BladeCenter Cluster: #106q 532 P4 Processors (2.8 GHz)q 5TB SAN

Page 4: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

CCR’s BioACE Systemn BioACE Computing Environmentq SunFire 6800 (12P, 24GB), 2 SunFire V880’s (16P, 32GB)q 16 RLX Pentium 3 Server Bladesq 104 GB of RAM; 7 TB of disk storage

n SoftwareqGenomics PackagesmGCG, Vector NTI, Vector Xpression, Vector PathBlazer

q Sequence Analysis mEMBOSS, ClustalW, MUMmer,

q Database SearchmBlast, PSI Blast, HMMER

qGene ExpressionmCluster/Xcluster, TreeView, J-Express

q Statistics PackagesmR & Bioconductor

Page 5: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

CCR Visualization Resourcesn Fakespace ImmersaDesk R2q Portable 3D Device

n Tiled-Display Wallq 20 NEC projectors: 15.7M pixelsq Screen is 11’×7’q Dell PCs with Myrinet2000

n Access Grid Nodes (2)qGroup-to-Group Communicationq Commodity components

n SGI Reality Center 3300Wq Dual Barco’s on 8’×4’ screen

Page 6: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

CCR Research & Projectsn Ground Water Modelingn Computational Fluid Dynamicsn Molecular Structure Determination

via Shake-and-Baken Protein Foldingn Digital Signal Processingn Grid Computingn Computational Chemistryn Bioinformatics

n Real-time Simulations and Urban Visualization

n Accident Reconstructionn Risk Mitigation (GIS)n Medical Visualizationn High School Workshopsn Virtual Reality

Page 7: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Molecular Structure Determinationvia Shake-and-Bake

n SnB Software by UB/HWIq “Top Algorithms of the

Century”

n Worldwide Utilizationn Critical Stepq Rational Drug Designq Structural Biologyq Systems Biology

n Vancomycinq “Antibiotic of Last Resort”

n Current EffortsqGridq Collaboratoryq Intelligent Learning

Page 8: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

n Objective: Provide a 3-D mapping of the atoms in a crystal.

n Procedure:1. Isolate a single crystal.2. Perform the X-Ray diffraction experiment.3. Determine molecular structure that agrees

with diffration data.

X-Ray Crystallography

Page 9: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Protein Foldingn Ability of protein to perform

biological function is attributed to 3D structure

n Protein folding problemq Predict 3D structure from amino-

acid sequence n Solving the folding problem

impacts drug designn Research underway at UB on

the development of models to improve accuracy and efficiency of 3D prediction

n 4000 processor Dell P3 cluster dedicated solely to protein folding problem

Page 10: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Computational Chemistryn UB Software Development in Quantum ChemistryqQ-Chem – development of parallel algorithms and combined

QM/MM methods for large molecular systemsq ADF – development of algorithms to calculate magnetic and optical

properties of moleculesn Used to determineqMolecular Structureq Electronic Spectraq Chemical Reactivity

n Applicationsq Pharmaceutical Drug Designq Industrial CatalysisqMaterials Scienceq Nanotechnologyq Solution Phase Chemistryq Chemical Kinetics

Page 11: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Groundwater Flow Modelingn Regional-scale modeling of groundwater

flow and contaminant transport (Great Lakes Region)

n Ability to include all hydrogeologicfeatures as independent objects

n Current work is based on Analytic Element Method

n Key features:q High precisionq Highly parallelq Object-oriented programmingq Intelligent user interfaceq GIS facilitates large-scale regional applications

n Utilized 10,661 CPU days (32 CPU years) of computing in past year on CCR’s commodity clusters

Page 12: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Geophysical Mass Flow Modeling

n Modeling of Volcanic Flows, Mud flows (flash flooding), and Avalanches

n Integrate information from several sourcesq Simulation resultsqRemote sensingqGIS data

n Develop realistic 3D models of mass flows

n Present information at appropriate level

Page 13: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Confocal Microscopy

n 3D Reconstruction of an Oral Epithelial Cell

n Translucent White Surface Represents the Cell Membrane

n Reddish Surface Represents Groups of Bacteria

Page 14: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

3D Medical Visualization App

n Collaboration with Children’s HospitalqLeading miniature

access surgery centern Application reads data

output from a CT Scann Visualize multiple surfaces

and volumesn Export images, movies or

CAD representation of model

Page 15: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Multiple Sclerosis Project

n Collaboration with Buffalo Neuroimaging Analysis Center (BNAC)qDevelopers of Avonex,

drug of choice for treatment of MS

nMS Project examines patients and compares scans to healthy volunteers

Page 16: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Multiple Sclerosis Project

n Compare caudate nuclei between MS patients and healthy controls

n Looking for size as well as structure changesqLocalized deformitiesqSpacing between halves

n Able to see correlation between disease progression and physical structure changes

Page 17: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

DISCOM

SinRG

APGrid

IPG …

Grid Computing

Page 18: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Grid Computing Overview

n Coordinate Computing Resources, People, Instruments in Dynamic Geographically-Distributed Multi-Institutional Environment

n Treat Computing Resources like Commoditiesq Compute cycles, data storage, instruments q Human communication environments

n No Central Control; No Trust

Imaging Instruments Large-Scale Databases

Data Acquisition AnalysisAdvanced Visualization

Computational ResourcesLHC

Page 19: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Factors Enabling the Gridn Internet is Infrastructureq Increased network bandwidth and advanced services

n Advances in Storage Capacityq Terabyte costs less than $5,000

n Internet-Aware Instrumentsn Increased Availability of Compute Resourcesq Clusters, supercomputers, storage, visualization devices

n Advances in Application Conceptsq Computational science: simulation and modelingq Collaborative environments → large and varied teams

n Grids TodayqMoving towards production; Focus on middleware

Page 20: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

NSF Extensible TeraGrid Facility

NCSA: Compute IntensiveSDSC: Data Intensive PSC: Compute Intensive

IA64

IA64 Pwr4EV68

IA32

IA32

EV7

IA64 Sun

10 TF IA-64128 large memory nodes

230 TB Disk StorageGPFS and data mining

4 TF IA-64DB2, Oracle Servers500 TB Disk Storage6 PB Tape Storage1.1 TF Power4

6 TF EV6871 TB Storage

0.3 TF EV7 shared-memory150 TB Storage Server

1.25 TF IA-6496 Viz nodes

20 TB Storage

0.4 TF IA-64IA32 Datawulf80 TB Storage

Extensible Backplane NetworkLA

HubChicago

Hub

IA32

Storage Server

Disk Storage

Cluster

Shared Memory

VisualizationCluster

LEGEND

30 Gb/s

IA64

30 Gb/s

30 Gb/s30 Gb/s

30 Gb/s

Sun

Sun

ANL: VisualizationCaltech: Data collection analysis

40 Gb/s

Backplane Router

Figure courtesy ofRob Pennington, NCSA

Page 21: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Advanced Computational Data Center ACDC: Grid Overview

300 Dual Processor2.4 GHz Intel XeonRedHat Linux 7.338.7 TB Scratch Space

Joplin: Compute Cluster 75 Dual Processor1 GHz Pentium IIIRedHat Linux 7.31.8 TB Scratch Space

Nash: Compute Cluster

9 Single Processor Dell P4 DesktopsSchool of Dental Medicine

13 Various SGI IRIX ProcessorsHauptman-Woodward Institute

25 Single Processor Sun Ultra5sComputer Science & Engineering

Crosby: Compute Cluster

SGI Origin 380064 - 400 MHz IP35IRIX 6.5.14m360 GB Scratch Space

9 Dual Processor1 GHz Pentium IIIRedHat Linux 7.3315 GB Scratch Space

Mama: Compute Cluster

16 Dual Sun Blades47 Sun Ultra5Solaris 8770 GB Scratch Space

Young: Compute Cluster

19 IRIX, RedHat, & WINNT Processors

CCRRedHat, IRIX, Solaris,

WINNT, etc

Expanding

ACDC: Grid Portal4 Processor Dell 66501.6 GHz Intel XeonRedHat Linux 9.066 GB Scratch Space

1 Dual Processor250 MHz IP30IRIX 6.5

Fogerty: Condor Flock Master

Page 22: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

FDDI

100 Mbps

1.54 Mbps (T1) - RPCI

1000 Mbps

1.54 Mbps (T1) - HWI

44.7 Mbps (T3) - BCOEB

OC-3 - I1

155 Mbps (OC-3) I2

NYSERNet350 Main St

NYSERNet350 Main St

CommercialAbilene

622 Mbps (OC-12)

100 Mbps

BCOEB

Medical/Dental

Network Connections

Page 23: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

300 Dual Processor2.4 GHz Intel XeonRedHat Linux 7.338.7 TB Scratch Space

Joplin: Compute Cluster 75 Dual Processor1 GHz Pentium IIIRedHat Linux 7.31.8 TB Scratch Space

Nash: Compute Cluster

4 Processor Dell 66501.6 GHz Intel XeonRedHat Linux 9.066 GB Scratch Space

ACDC: Grid Portal

Crosby: Compute Cluster

SGI Origin 380064 - 400 MHz IP35IRIX 6.5.14m360 GB Scratch Space

9 Dual Processor1 GHz Pentium IIIRedHat Linux 7.3315 GB Scratch Space

Mama: Compute Cluster

16 Dual Sun Blades47 Sun Ultra5Solaris 8770 GB Scratch Space

Young: Compute Cluster

182 GB Storage

100 GB Storage56 GB Storage

100 GB Storage

70 GB Storage

Network AttachedStorage1.2 TB

Storage Area Network75 TB

136 GB Storage

CSE MultiStore40 TB

ACDC Data Grid Overview(Grid-Available Data Repositories)

Page 24: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

ACDC-Grid

Browser view of “miller” group files published by user

“rappleye”

Page 25: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

ACDC-Grid Administration

Page 26: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Grid-Enabling Application Templates

nStructural BiologynEarthquake EngineeringnPollution AbatementnGeographic Information Systems & BioHazards

Page 27: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

ACDC-Grid Cyber-Infrastructure

n Predictive SchedulerqDefine quality of service estimates of job completion, by

better estimating job runtimes by profiling users.

n Data GridqAutomated Data File Migration based on profiling users.

n High-performance Grid-enabled Data RepositoriesqDevelop automated procedures for dynamic data

repository creation and deletion.

n Dynamic Resource AllocationqDevelop automated procedures for dynamic

computational resource allocation.

Page 28: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Initial ACDC Campus Grid

Page 29: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

ACDC-Grid Portal Condor Flockn CondorView integrated

into ACDC-Grid Portal

Page 30: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

ACDC-Grid Collaborations

n Grid3+ Collaboration / iVDGL Membern Open Science Grid Founding ParticipantqMonitoring & Information Services, co-chairq Security, Tech Working Group Participant

n WNY Grid Initiativen Grid-Based Visualizationq SGI Collaboration

n Grid-LiteqHP Labs Collaboration

n Innovative Laboratory Prototypeq Dell Collaboration

n NE Bio-Gridq IBM Research CollaborationqMIT, Harvard

Page 31: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Innovative LaboratoryPrototype

Grid3

High-PerformanceNetworking Infrastructure

HP Labs GridLite

Access Grid

Open Science Grid

NEES Grid

Northeast Bio-Grid

ACDC-Campus Grid

Western New York Grid Interoperable Grids

GRASEVO

ACDC-Grid Collaborations

Page 32: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Grid3 Snapshot of Sites

UBuffalo-CCR Virtual Organization

Grid Resources for Advanced Science and Engineering (GRASE)

Page 33: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Northeast Structural Genomics Consortium

n Consortiumq UB, Rutgers, Columbia, Cornell, PNNL, Yale, UToronto, Robert

Wood Johnson Medical Center, Hauptman-Woodward Medical Research Center

n Missionq Develop integrated technologies for high-throughput (htp) protein

production and 3D structure determinationq The goal is to determine 500 new protein structures over 5 yearsq Combination of strong parallel efforts in both X-ray

crystallography and solution-state NMR spectroscopyq UB Professor Thomas Szyperski awarded Scientific American’s

Top 50 Scientists in 2003 for novel work in high-throughput structure determination with NMR

Page 34: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Western New YorkHealth Information Project

Goals:n Build a secure community-

wide healthcare databasen Develop an electronic patient

medical record that “follows the patient”

n Provide care providers with real-time patient information wherever they are

n Provide a tool to aid agencies in community safety, epidemiology, resource allocation, and bioterrorismresponse

n Improve the overall quality of healthcare while reducing costs

Selected Participants:n University at Buffalo (CCR,

School of Informatics, School of Medicine, Health Science Library)

n Buffalo Academy of Medicinen Erie County DoHn New York State DoHn WNY HealtheNetn Involvement from Kaleida

Health, ECMC, Catholic Health System, Independent Health, HealthNow, and Univera Healthcare

Page 35: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Outreach

nHS Summer Workshops in Computational ScienceqChemistry, Bioinformatics, Visualizationq10-14 HS Students Participate Each Summer for 2 weeksqProject-Based Program

Page 36: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

OutreachnPilot HS Program in Computational ScienceqYear long extracurricular activity at Mount St. Mary’s,

City Honors, and Orchard Park HSqProduce next generation scientists and engineersqStudents learn Perl, SQL, Bioinformaticsq$50,000 startup funding from Verizon, PC’s from HP

Page 37: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

Media Coverage

Page 38: The Center for Computational Research - University …University at Buffalo The State University of New York Center for Computational Research CCR nApex Bioinformatics System qSun

University at Buffalo The State University of New York CCRCenter for Computational Research

www.ccr.buffalo.edu