extending the internet throughout the physical world keynote to the ec-us taskforce on biotechnology...
TRANSCRIPT
Extending the Internet Throughout the Physical World
Keynote to
The EC-US Taskforce on
Biotechnology Research
Arlington, VA
Sept 9, 2001
Larry SmarrDepartment of Computer Science and Engineering
Jacobs School of Engineering, UCSDDirector, California Institute for Telecommunications and Information Technology
Towards a Global Biological Knowledge Grid
• Capture and Integrate Multiple Scales of Science– Genomes, Proteins, Metabolic Pathways– Cellular Systems– Organism Models– Ecological Systems– Geographic Biodiversity– Environmental Interactions
• Adapting to the Emerging Information Infrastructure– Wireless Access--Anywhere, Anytime– Distributed Sensors, Data, People, Computers– From the Web to the Grid– Highly Parallel Light Waves Through Fiber– Emergence of a Distributed Planetary Computer
Dynamic Growth in Mobile InternetForecast of Internet users worldwide
0
200
400
600
800
1,000
1,200
1,400
1,600
1,800
2,000
1999 2000 2001 2002 2003 2004 2005
Mobile Internet
Fixed Internet
Subscribers (millions)
3G Adds Mobility, QoS, and High Speeds
Source: Ericsson
California Has Undertaken a Grand Experiment in Partnering
UCSBUCLA
The California NanoSystems Institute
UCSFUCB
The California Institute for Bioengineering, Biotechnology,
and Quantitative Biomedical Research
UCI
UCSD
The California Institute for Telecommunications
and Information Technology
The Center for Information Technology Research
in the Interest of Society
UCSC
UCD
UCM
½ Mile
•Commodity Internet, Internet2•CENIC’s ONI, Cal-REN2, Dig. Cal.•PACI Distributed Terascale Facility
• Wireless LANs
The UCSD “Living Grid Laboratory”—Fiber, Wireless, Compute, Data, Software
SIO
SDSC
CS
ChemMed
Eng. / Cal-(IT)2
Hosp
• High-speed optical core
Source: Phil Papadopoulos, SDSC
Wireless WAN
The High PerformanceWireless Research and Education Network
• Cal-(IT)2 Will Build on This Pioneering Experiment
• Add New Ecological Sensor Arrays
• Try Out New Wireless Technologies
• Data Analysis
• Outreach and Education
NSF FundedPI, Hans-Werner Braun, SDSC,
UCSDCo-PI, Frank Vernon, SIO, UCSD
45mbps Duplex Backbone
As Our Bodies Move On-LineBioengineering and Bioinformatics Merge
• New Sensors—Israeli Video Pill– Battery, Light, & Video Camera– Images Stored on Hip Device
• Next Step—Putting You On-Line!– Wireless Internet Transmission– Key Metabolic and Physical Variables– Model -- Dozens of 25 Processors and 60
Sensors / Actuators Inside of our Cars
• Post-Genomic Individualized Medicine– Combine
– Genetic Code
– Body Data Flow
– Use Powerful AI Data Mining Techniques
www.givenimaging.com
FDA ApprovedAug. 2001
Adding Brilliance to Wireless SensorsWith Systems-on-Chip
Memory
Protocol Processors
ProcessorsProcessors DSP
Applications
Sensors
Source: Sujit Dey, UCSD ECE
EmbeddedSoftware
Radio
Critical New Role of Power Aware Systems
Internet
Ad Hoc Hierarchical Network of Brilliant Sensors
Moore’s Law—Simple 2D Shrinking Reaches End by 2015
CMOSBipolar, NMOS ??
100nm
Fea
ture
siz
e (
nan
om
eter
s)
PentiumProPentiumIII
Intel8080
Intel386
Pentium
1000nm
10nm
1nm1970 1980 1990 2000 2010 2020 2030 2040 2050
Intel4861 m
illion tr
ansis
tors
IA-6410
0 milli
on
10 billi
on
15
DARPADARPA NanosciencesNanosciencesMolecular Electronics/
Quantum / Bio3-D CMOS + - HYBRIDS
Source: Shankar Sastry, DARPA ITO
The Perfect Storm: Convergence of Engineering with Bio, Physics, & IT
5 nanometersHuman Rhinovirus
IBM Quantum CorralIron Atoms on Copper
VCSELaser
2 mm
Nanogen MicroArray500x
Magnification
400x Magnification
Nanobioinfotechnology
Why the Grid is the Future
Scientific American, January 2001
Layered Software Approach to Building the Planetary Grid
Science Portals & Workbenches
Twenty-First Century Applications
Computational Services
Performance
Networking, Devices and Systems
Grid Services(resource independent)
Grid Fabric(resource dependent)
Access Services & Technology
Access Grid
Computational Grid
Edited by Ian Foster and Carl Kesselmanwww.mkp.com/grids
“A source book for the historyof the future” -- Vint Cerf
The Grid Physics Network Is Driving the Creation of an International Grid
• Paul Avery (Univ. of Florida) and Ian Foster (U. Chicago and ANL), Lead PIs– Largest NSF Information Technology Research Grant– 20 Institutions Involved– Built on Globus Middleware
CMS
ATLAS
Sloan Digital Sky Survey
LHC
The EUROGRIDCreates an EU Virtual Machine Room
• UNICORE– Java
Middleware
• Driven by Applications
• Links to Key Databases
• One Interface to Multiple Machines
STAR TAP:Science Technology And Research Transit Access Point
Japan
Korea (2)
Taiwan
Singapore (2)
Australia (2)
China
Norway
Iceland
Sweden
Finland
Denmark
Russia
France
Netherlands
CERN
Israel
Ireland
Belgium
Europe/DANTE
United Kingdom
Canada
Chile, Brazil ANSP, Brazil RNP, Mexico
US: Abilene, DREN, ESnet, NISN, NREN, vBNS/vBNS+
www.startap.nethttp://www.ucaid.edu/abilene/html/itnpeerparticipants.html (Abilene ITN)http://www.canet3.net/optical/peering_info/intl_peering.html (CA*net3 ITN)
Star Light International Wavelength Switching Hub
Seattle
Portland
Caltech
SDSC
NYC
SURFnet, CERN
CANARIE
Asia-Pacific
Asia-Pacific
AMPATH
TeraGrid
*ANL, UIC, NU, UC, IIT, MREN
AMPATH
Source: Tom DeFanti, Maxine Brown
The NSF TeraGridPartnerships for Advanced Computational Infrastructure
NCSA8 TF
4 TB Memory240 TB disk
Caltech0.5 TF
0.4 TB Memory86 TB disk
Argonne1 TF
0.25 TB Memory25 TB disk
TeraGrid Backbone (40 Gbps)
SDSC4.1 TF
2 TB Memory250 TB disk
This will Become the National Backbone to Support Multiple Large Scale Science and Engineering Projects
DataCompute
VisualizationApplications
• Pre-Blue Horizon (mid-1990s):– Model Electrostatic Forces of a
Structure up to 50,000 Atoms– a Single Protein or Small Assembly
• Pre-TeraGrid (2001):– Model One Million Atoms
– Simulate Drawing a Drug Molecule Through a Microtubule or Tugging RNA Into a Ribosome
• TeraGrid (2003): – Models of 10 Million Atoms
– Model Function, Structure Movement, and Interaction at the Cellular Level for Drug Design and to Understand Disease
Baker, N., Sept, D., Joseph, S., Holst, M., and McCammon, J. A. PNAS 98: 10037-10040 (2001)
Advancing Realism in Modeling Cell Structures
Source: Fran Berman
Prototyping the Grid Cyber-Infrastructurefor a Biomedical Imaging Research Network
Source: Mark Ellisman, UCSD
Part of the UCSD CRBS Center for Research on Biological Structure
Wireless “Pad” Web Interface
Surface Web
NCRR Imaging and Computing Resources UCSD
Cal-(IT)2SDSC
Deep Web
DukeUCLA
Cal Tech
Harvard
UCSD
Forming a National-Scale Grid Federating Multi-Scale Neuro-Imaging Data from Centers with High Field MRI and Advanced 3D Microscopes
BIRNBIRN
From Telephone Conference Calls to Access Grid International Video Meetings
Access Grid Lead-ArgonneNSF STARTAP Lead-UIC’s Elec. Vis. Lab
Creating a Virtual Global Research Lab
Vast Data Sets Will RequireHigh Resolution Data Analysis Facilities
SDSCSIO
Newsday PhotoIra Schwarz
Celera Control Room
Cal-(IT)2 Control Room
Cox CommunicationsTeraburst Networks
Panoram Technologies
Grid-Enabled Collaborative Analysisof Ecosystem Dynamics Datasets
Chesapeake Bay Data in Collaborative Virtual Environment
Web Browser - Portal Interface
Portal Engine User PreferencesState Values
Data Gather
XML HTML
Legacy and Problem Specific Databases, Collections, & Literature
Analysis Tools- Genome, Protein, & Metabolic Pathways- Cellular Models- Integrative Systems- Species Identification- GIS Biodiversity- Data Mining- ...
Common Portal ArchitectureCustomized for Biological Sciences
A Global IT Strategy Is Neededto Integrate the Emerging Plant Genomes
Immense Computing Power Will Be Required to Lead in Post-Genomic Research
• ”We Don’t Need an Evolution in Computing, We Need a Revolution”—Craig Venter
• Sandia and Celera Will Collaborate On: – Advanced Algorithms – Visualization Technologies for Analyzing
Massive Quantities of Experimental Data From High-Throughput Instruments
• Equivalent to 100,000 Pentium 4’s!• Prototype by 2004
Biology is at the Leading Edge of Using the Emerging Planetary Computer
Application Software Has Been Downloaded to Over 30,000 PCsOver 500 CPU-Years Computed
Total Storage 50 Terabytes, Peak Speed 13 Teraflops
In SilicoDrug Design
Art Olson, The Scripps
Research Institute
A Planetary MegaComputer—Distributed Computing & Mass Storage
• Napster Meets Seti@Home !
• Assume Ten Million PCs in Five Years– Average Speed Ten GigaFLOP– Average Free Storage 100 GB
• Planetary Computer Capacity– 100,000 TetaFLOP Speed– 1 Million Terabyte Storage
• Global Distributed Server for Mobile Clients
Will a New Form of Intelligence Join Human Kind?
1 Million x
Source: Hans Moravecwww.transhumanist.com/volume1/power_075.jpg
•Will the Grid Become Self-
–Organizing
–Powered
–Aware?