computing as a tool for high energy and nuclear physicshome.fnal.gov/~wolbers/jeffersonlab.pdf ·...

94
April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium 1 Computing as a Tool for High Energy and Nuclear Physics Stephen Wolbers Fermilab April 25, 2001

Upload: others

Post on 14-Aug-2020

1 views

Category:

Documents


0 download

TRANSCRIPT

Page 1: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

1

Computing as a Tool for High Energy andNuclear Physics

Stephen WolbersFermilab

April 25, 2001

Page 2: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

2

Outline• Introduction to High Energy and Nuclear

Physics (HENP) and HENP Computing– Accelerators, detectors, data volumes

• Data Rates, Facilities, Techniques– Run 2a at Fermilab– RHIC/JLAB Experiment Offline Computing – Lattice Gauge Calculations– Accelerator Design

• Future– Run 2b at Fermilab– Data GRIDs and Distributed Computing

Page 3: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

3

New York Times, Sunday, March 25, 2001

Page 4: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

4

Introduction

• Particle and nuclear physics use beams of particles striking targets to study the fundamental nature of matter and interactions.

• Advances in the field have come from:– Higher Energy Particles and Interactions– More collisions per unit time and space (luminosity)– Better detectors

• More sensitivity, more granular, fewer cracks, lower deadtime, more radiation-hard

– More “events” saved to storage (disk or tape)– More sophisticated analysis of “events”– Better simulation of the beams, collisions, and detector– Advances in Theory

Page 5: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

5

Collisions Simplified

• Collider:

• Fixed-Target:

p pAu Au

e

Page 6: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

6

Collisions – CDF/Fall 2000

Page 7: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

7

Collision at RHIC from STAR and PHOBOS

Page 8: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

8

Particle Acceleration

• Particle acceleration occurs in a multi-step process

• Example: Fermilab– Cockcroft-Walton– Linac– Booster– Main Injector– Tevatron

• Anti-Protons– Accumulator/debuncher– Recycler

Page 9: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

9

Page 10: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

10

Cockcroft-Walton

Page 11: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

11

Linac

Page 12: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

12

Booster

Main Injector

Tevatron

Page 13: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

13

Antiproton Source

Recycler

Page 14: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

14

Particle Detectors

• Large detectors are used to “see” the interactions of particles.

• Consist of subdetectors which record information about particle position, energy and momentum.– Used to measure the number and types of

particles coming from collisions.– Also used to identify particle decays.

• e, ���������p, K, W, Z, b, c, …

Page 15: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

15

CDF Detector

Page 16: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

16

D0 Detector

Page 17: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

17

Jefferson Lab

Hall A

Hall B

Hall C

Page 18: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

18

Physics to Raw Data(taken from Hans Hoffman, CERN)

e+

e-

f

fZ0 _

Fragmentation,Decay

Interaction withdetector materialMultiple scattering,interactions

2037 2446 1733 16994003 3611 952 13282132 1870 2093 32714732 1102 2491 32162421 1211 2319 21333451 1942 1121 34293742 1288 2343 7142

Raw data(Bytes)

Read-out addresses,ADC, TDCvalues,Bit patterns

DetectorresponseNoise, pile-up,cross-talk,inefficiency,ambiguity,resolution,response function,alignment,temperature

Page 19: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

19

From Raw Data to Physics

2037 2446 1733 16994003 3611 952 13282132 1870 2093 32714732 1102 2491 32162421 1211 2319 21333451 1942 1121 34293742 1288 2343 7142

Raw data

Convert tophysics quantities

Detectorresponseapplycalibration,alignment,

Fragmentation,DecayPhysics analysis

Interaction withdetector materialPattern,recognition,Particleidentification

_

e+

e-

f

fZ0

Basic physics

Results

AnalysisReconstruction

Simulation (Monte-Carlo)

Page 20: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

20

Computing Connection

Computing TechniqueAccelerator Design/simulationAcc. Design and controlsTriggers (networks, CPU), simulationDisk, tape, CPU, networks Disk, tape, CPU, networks, algorithmsCPU, algorithms, OO CPU, algorithms, OO

Desired ImprovementHigher energyMore collisionsBetter detectorsMore eventsBetter analysisSimulationTheory

Page 21: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

21

Example – Higgs Search at CDF and D0 at the Tevatron

• Proton and Antiproton beam energy has been increased to 1 TeV (actually 980 GeV) and the luminosity of collisions has been (will be) dramatically increased– The pbar upgrades were

modeled and tested on theFermilab compute farms, as were the Recycler Ring design and Tevatrontuning.

– Many of these calculations and tunings continue today.

Page 22: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

22

Computing and Particle/Nuclear Physics Advances

• HENP has always required substantial computing resources– Computing advances have enabled “better physics”– Physics research demands further computing advances– Physics and computing have worked together over the

years

Computing Advances Physics Advances

Page 23: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

23

Examples• Data acquisition systems have advanced to allow

increased data writing– Higher bandwidth DA systems– More sophisticated triggers, including software

triggers– Even with the same beam and detector, an

experiment can increase the “physics reach” by taking more data

• Increased computing power allows analysis of larger and larger datasets– This in turn allows for increased DA volume

Page 24: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

24

Charm Physics

• Early experiments were able to study only a few hundred events.

• A series of experiments at Fermilab pushed to write a huge amount of data (for the time) in order to increase the statistics dramatically.

• This was possible because of the availability of large offline computing capability (“compute farms”) to cope with the large data samples.

Page 25: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

25

Data Volumes for HENP Experiments(in units of 109 bytes)

0

200000

400000

600000

800000

1000000

1200000

1980 1985 1990 1995 2000 2005 2010

E691

E665

E769

E791

CDF/D0

KTeV

E871

BABAR

CMS/ATLAS

E831

ALEPH

JLAB

RHIC

NA48

Volume = ��L ���

����Cross SectionL = Luminosity����Efficiency for collecting dataE = Event size

Page 26: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

26

Data Volume per experiment per year (in units of 109 bytes)

100

1000

10000

100000

1000000

1980 1985 1990 1995 2000 2005 2010

E691

E665

E769

E791

CDF/D0

KTeV

E871

BABAR

CMS/ATLAS

E831

ALEPH

JLAB

RHIC

NA48

Data Volume doubles every 2.4 years

Page 27: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

27

Data Rates, Facilities and Techniques

Page 28: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

28

Run 2a Computing

• Data Rates and Volume• Software Development• Computing Model

– Event Reconstruction– Analysis– Mass Storage

Page 29: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

29

Run 2a Computing• Scope of Computing Problem

– Data volume of ~1 Pbyte (over 2 years, CDF+D0)(Pbyte = 1015 bytes = 1,000,000 Gbytes)

• Typical hard disk is 20 Gbytes, DVD is 5 Gbytes.– Rates out of detector to storage up to 20 Mbyte/sec

(each detector).– CPU for event reconstruction of about 5 sec/event on a

PIII/500 MHz PC (Each event is 250 Kbyte).– CPU for data analysis is supplied on large SMP systems

and Linux desktops.– Large disk arrays (>50 Tbyte).– 100’s of physicists spread across the world need to

analyze the data.

Page 30: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

30

Software Development

Page 31: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

31

Software Development• Event Reconstruction Software

– Written by physicists.– Translates detector output (ADC counts, TDC,

hit maps) into energy measurements, particle positions and directions and momentum.

– Written in FORTRAN in previous runs.– Written in C++ in Run 2. – Hundreds of packages or modules, millions of

lines of code, many 10’s of authors.

Page 32: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

32

C++ Experience• Big change from procedural to object-oriented

language.• Some resistance.• Large training requirements.• Need for C++ experts to support the physicists

on design and coding.– Two individuals were hired by Fermilab to

provide that support.• The code runs, is probably as fast or faster

than Fortran code, and in general the exercise has been successful.

• Most (not all) new experiments choose C++ for offline event reconstruction.

Page 33: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

33

Was the transition to C++ beneficial?

• I’m not an expert and haven’t worked with the code directly.

• The answer probably won’t be known for some time:– Will code be more easily maintainable?– Will the code be more robust?– Will the code be as fast or at least not too

slow?– Will we be aligned better with industry and

other code developers?

Page 34: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

34

Other software for Run 2

• Mixture of commercial, lab-developed and open source.

• Each product is chosen based on its ability to solve a problem and on its cost (both to write and to support).

• Long list of products, some examples:– Linux, gcc, emacs, MySQL– KAI C++ compiler, LSF (Batch system), Purify– FBS, Enstore, SAM, ftt, ZOOM– GEANT3/4, ROOT

Page 35: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

35

Computing Model

Page 36: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

36

Computing Model for Run 2a

• CDF and D0 have similar but not identical computing models.– In both cases data is logged to tape stored in

large robotic libraries.– Event reconstruction is performed on large

Linux PC farms.– Analysis is performed on medium to large multi-

processor computers– Final analysis, paper preparation, etc. is

performed on Linux desktops or Windows desktops.

Page 37: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

37

Support for Run 2 Computing

• It takes many people to build and support Run 2 computing systems– Design and build: 20-30 people per experiment

• This includes code development and systems integration. Other support like networks, system administration is not included here.

– Maintain: About 35 people per experiment• This includes product support, networks,

system administration, etc.– Some of this support is provided by scientists

from the two experiments.

Page 38: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

38

Page 39: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

39

Stephen Wolbers HEP-CCC June 25, 1999

15 MBps

20 MBps

100 Mbps

400 MBps

Page 40: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

40

Reconstruction Systems

Page 41: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

41

CDF/D0 Offline Production Farmsfor event reconstruction

• The CDF/D0 farms must have sufficient capacity for Run 2 Raw Data Reconstruction.

• The farms also must provide capacity for any reprocessing needs.

• Farms must be easy to configure and run.• The bookkeeping must be clear and easy to use• Error handling must be excellent.

Page 42: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

42

Requirements for CDF (CPU)

• CPU goal is <5 seconds/event on PIII/500• Assuming 70% efficiency this translates to

– 200 PIII/500 equivalents (each is about 20 SpecInt95)

– 4200 SpecInt95• Adding in reprocessing, simulation, responding to

peak rates– 300-400 PIII/500 equivalents (150-200 duals)– 6300-8400 SpecInt95

Page 43: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

43

Simple Model

Page 44: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

44

Run II CDF PC Farm

Page 45: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

45

Design/Model• Hardware

– Choose the most cost-effective CPU’s for the compute-intensive computing.

– This is currently the dual-Pentium architecture – Network is fast and gigabit ethernet, with all

machines being connected to a single or at most two large switches.

– A large I/O system to handle the buffering of data to/from mass storage and to provide a place to split the data into physics datasets.

Page 46: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

46

Page 47: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

47

Page 48: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

48

Software Model

• Software consists of independent modules– Well defined interfaces– Common bookkeeping– Standardized error handling

• Choices– Python – MySQL database (internal database)– FBSNG (Farms Batch System) – FIPC (Farms Interprocessor Communication) – CDF Data Handling Software

Page 49: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

49

Page 50: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

50

Physics Analysis Requirements and Impact

• Raw Data Files come in ~8 flavors, or streams– 1 Gbyte input files

• Reconstruction produces inclusive summary files– 250 Mbyte output files

• Output Files must be split into ~8 physics datasets per input stream – Target 1 Gbyte files– About 20% overlap

• Leads to a complicated splitting/concatenation problem, as input and output streams range from tiny (<few percent) to quite large (10’s of percent)

Page 51: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

51

A1

A2

A3Input Stream (x8)

A A4Farms

A5

A6

A7

Page 52: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

52

Status of CDF/D0 Farms

• 88+97 PC’s are in place.– 48+47 PIII/500 duals– 40+53 PIII/750 or 800 duals

• 60 more PC’s are on order (PIII/1 GHz duals)• I/O nodes are ready.• Integration and testing of the system is complete.

– 20 Mbyte/sec can be achieved. • The CDF system is being used to process and reprocess

data from the commissioning run (about 1.3 Tbytes taken in October, 2000) and both systems are used to generate and reconstruct simulated data.

• Both are ready for raw data reconstruction.

Page 53: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

53

Analysis Computing

Page 54: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

54

Analysis Computing – Run 2a• CDF and D0 have both acquired large Silicon

Graphics O2000 multi-processor systems for large analysis tasks.

Page 55: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

55

Analysis Computing

• Each system has:– Access to data on tape.

• D0 access is over the network• CDF access is via SCSI connected peripherals

– Access to disk storage.• About 30 Tbytes attached to central systems• This will increase, especially as disk prices continue to

fall– LSF Batch software is used to schedule jobs and manage

resources on these systems.

Page 56: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

56

Page 57: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

57

Data Access Model: CDF

• Ingredients:– Gigabit Ethernet– Raw data are

stored in tape robot located in FCC

– Multi-CPU analysis machine

– High tape access bandwidth

– Fiber Channel connected disks

Page 58: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

58

Analysis Computing

• The large SGI is a conservative (and expensive) solution to analysis computing needs.

• Both collaborations are exploring the use of PC’s + EIDE disk + 100 Mbit or 1 Gbit network connection for analysis.

• These projects may lead the way to more cost-effective solutions for the analysis of the large amount of data that will be taken in Run 2a and Run 2b.

Page 59: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

59

PC analysis computing -- examples

Page 60: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

60

Mass Storage

Page 61: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

61

Storing Petabytes of Data in mass storage• Storing (safely) petabytes of data is not easy

or cheap.– Need large robots (for storage and tape

mounting).– Need many tapedrives to get the necessary I/O

rates.• Tapedrives and tapes are an important part

of the solution, and has caused some difficulty for Run 2.

– Need bandwidth to the final application (network or SCSI).

– Need system to keep track of what is going on and schedule and prioritize requests.

Page 62: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

62

Robots and tapes

Page 63: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

63

Enstore Software System

Page 64: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

64

Tapedrives and tapes• Tapedrives are not always reliable, especially when one is

pushing for higher performance at lower cost.• Run 2 choice is Exabyte Mammoth 2.

– 60 Gbytes/tape.– 12 Mbyte/sec read/write speed.– About $1 per Gbyte for tape. (A lot of money.)– $5000 per tapedrive.

• AIT2 from SONY is the backup solution.• The robotics which exist can handle most any tapedrive

technology.• Given the Run 2 timescale, upgrades to newer technology

will occur.• Finally, Fermilab is starting to look at PC diskfarms to

replace tape completely.

Page 65: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

65

Software system, Jasmine, (from Jefferson Lab)

D is p a tc h e r

C a c h eM a n a g e r

D r iv eM a n a g e r

D r iv eD is k

L ib r a ryM a n a g e r

V o lu m eM a n a g e rC l ie n t

R e q u e s tM a n a g e r S c h e d u le r

D a ta M o v e r

L o gM a n a g e r

L ib r a ryM a n a g e r

D a ta b a s e

R e q u e s tM a n a g e r

Page 66: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

66

Data Flow into/out of mass storage (D0)

Page 67: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

67

RHIC and JLAB Computing

Page 68: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

68

RHIC Computing

• Computing problem is very similar and is in fact somewhat larger than the Run 2a computing problem.

• For Nominal Year Operations, 2001/2002– Aggregate Raw Data Recording at 60

MBytes/sec– Annual Data Storage: 1 PByte– Online Storage: 40 TBytes– Online Data Access at 1 GByte/sec– Installed Compute Capacity: 20,000 SPECint95

Page 69: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

69

RHIC Computing Facility

FarmCPU's

CentralReconstruction

Server

AFSServers

SoftwareDevel Gateway

User, etc.Disk

InteractiveServices

BackupSystem

General ComputingEnvironment

RoboticTape

HPSSDisk

UNIXDisk

ShelfTape

HPSS CoreServer HPSS Data Movers

Managed Data Server

SMPCPU's

FarmCPU's

InteractiveCPU's

Central AnalysisServer

Back

Plane

External WideArea Network

RHICExperiments

RCF Gbit/FastEthernet Switching

Fabric

NFS FileServer

Page 70: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

70

Page 71: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

71

Jefferson Lab Computing

• Jefferson Lab Computing Problem is also quite substantial– Data rates are comparable to CDF/D0/RHIC.– Need large reconstruction, storage, and

analysis systems.– Future increases (for new beams and facilities)

will have to be handled.

Page 72: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

72

Storage servers

Storage systems

Batch & Analysis Farm

DST cache servers5 TB – RAID 0

~6000 SPECint95

Gigabitswitching

Farm cache servers1.6 TB – RAID 0

Gb Ethernet

100 Mb Ethernet

SCSI

FC from CLAS DAQ

From A,C DAQ

JLABFarm and Mass Storage Systems

End FY00

Farm control front-endsInteractive

NFS work areas5 TB – RAID 5

Page 73: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

73

Lattice QCD

Page 74: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

74

Lattice QCD

• Lattice QCD is a powerful approach to study QCD and to calculate fundamental quantities in the theory.

• Lattice QCD calculations require extremely large computing power in a tightly-coupled computing architecture (because of the demand for fast, low-latency communications).

• The computers used for this are almost always special-purpose machines, designed for this class of calculations

• Examples include, APE, Columbia machine, ACPMAPS, etc.

Page 75: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

75

Commodity Lattice Gauge Machine

• Idea: Take advantage of commodity hardware and software to build a large lattice QCD machine.

• Goal: 10 Teraflop peak performance as cheaply as possible.

• R&D at Fermilab (similar work at Jefferson lab).– Small Machine (80 dual PC’s) has been purchased and is

being integrated at Fermilab.– Much larger machine (on order 1000 PC’s) will be built

assuming that funding is available, no serious problems are found in scaling, etc.

– Workshop was held March 26-28 at Fermilab to discuss the current ideas and progress.

Page 76: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

76

Old and New Lattice Gauge Computing at Fermilab

Page 77: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

77

Accelerator Physics

Page 78: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

78

Accelerator Design and Tuning

• Computing is essential for accelerator design and tuning:– Optimizing design of future accelerators:

• Electron-positron linear collider• Muon collider/neutrino factory• Very Large Hadron collider

– Getting the most out of current machines:• Higher luminosity • With lower backgrounds/halo, etc

– Parallel computing can potentially decrease the time taken to properly simulate each configuration and allow for faster and more complete design.

Page 79: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

79

Accelerator Design and Computing

Page 80: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

80

Future

Page 81: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

81

Run 2b at Fermilab• Run 2b will start in 2004 and will increase the integrated

luminosity to CDF and D0 by a factor of approximately 8 (or more if possible).

• It is likely that the computing required will increase by the same factor, in order to pursue the physics topics of interest:– B physics– Electroweak– Top– Higgs– Supersymmetry– QCD– Etc.

Page 82: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

82

Run 2b Computing

• Current estimates for Run 2b computing:– 8x CPU, disk, tape storage.– Expected cost is same as Run 2a because of

increased price/performance of CPU, disk, tape.– Plans for R&D testing, upgrades/acquisitions

will start next year.• Data-taking rate:

– May be as large as 80 Mbyte/s.– About 1 Petabyte/year to storage.

Page 83: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

83

LHC Computing

• LHC (Large Hadron Collider) will begin taking data in 2006-2007 at CERN in Switzerland.

• Data rates per experiment of 100 Mbytes/sec.• 1 Pbyte/year of storage for raw data per

experiment.• World-wide collaborations and analysis.

– Desirable to share computing and analysis throughout the world.

– GRID computing may provide the tools.

Page 84: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

84

level 1 - special hardware

40 MHz (40 TB/sec)level 2 - embedded processorslevel 3 - PCs

75 KHz (75 GB/sec)5 KHz (5 GB/sec)100 Hz(100 MB/sec)data recording &

offline analysis

Page 85: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

85

World Wide Collaboration � distributed computing & storage capacity

Page 86: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

86

CMS/ATLAS and GRID Computing

CERN – Tier 0

Tier 1 FNAL RAL

IN2P362

2 Mbps

2.5 Gbps

622

Mbp

s

155 mb

ps 155 mbps

Tier2 Lab aUni b Lab c

Uni n

Department ��

Desktop

DHL

From Les Robertson, CERN

Page 87: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

87

Fermilab Networking and connection to Internet

D0CDF

Fixed Tgt

TD

SDSSCD

PPD

BD

General

MREN ESNet

Ameritech

155 Mb/s off-site links

LSSBSS

ESHS

CorenetworkOff-Site Bytes per Day

0

50000000000

100000000000

150000000000

200000000000

250000000000

300000000000

350000000000

400000000000

D a t e

Byt e s pe r Da y

7 Da y Avg Byt e s

34 kb/s analog &128 kb/s

ISDN

Off Site

On Site

1-2 Mb/s ADSL

Network Management

Page 88: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

88

Are Grids a solution?Computational Grids• Change of orientation of Meta-computing activity

– From inter-connected super-computers… .. towards a more general concept of a

computational power Grid (The Grid – Ian Foster, Carl Kesselman**)

• Has found resonance with the press, funding agencies

But what is a Grid?“Dependable, consistent, pervasive access to

resources**”So, in some way Grid technology makes it easy to use

diverse, geographically distributed, locally managed and controlled computing facilities – as if they formed a coherent local cluster

Les Robertson, CERN

** Ian Foster and Carl Kesselman, editors, “The Grid: Blueprint for a New Computing Infrastructure,” MorganKaufmann, 1999

Page 89: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

89

What does the Grid do for you?• You submit your work • And the Grid

– Finds convenient places for it to be run– Organises efficient access to your data

• Caching, migration, replication– Deals with authentication to the different sites that

you will be using– Interfaces to local site resource allocation

mechanisms, policies– Runs your jobs– Monitors progress– Recovers from problems– Tells you when your work is complete

• If there is scope for parallelism, it can also decompose your work into convenient execution units based on the available resources, data distribution

Les Robertson

Page 90: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

90

PPDG GRID R&DRichard Mount, SLAC

PPDG LHC Computing ReviewNovember 15, 2000

PPDG Multi-site Cached File Access System

UniversityUniversityCPU, Disk, CPU, Disk,

UsersUsers

PRIMARY SITEPRIMARY SITEData Acquisition,Data Acquisition,Tape, CPU, Disk, Tape, CPU, Disk,

RobotRobot

Satellite SiteSatellite SiteTape, CPU, Tape, CPU, Disk, RobotDisk, Robot

Satellite SiteSatellite SiteTape, CPU, Tape, CPU, Disk, RobotDisk, Robot

UniversityUniversityCPU, Disk, CPU, Disk,

UsersUsers

UniversityUniversityCPU, Disk, CPU, Disk,

UsersUsers

Satellite SiteSatellite SiteTape, CPU, Tape, CPU, Disk, RobotDisk, Robot

Page 91: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

91

GriPhyN Overview(www.griphyn.org)

• 5-year, $12M NSF ITR proposal to realize the concept of virtual data, via:1) CS research on

• Virtual data technologies (info models, management of virtual data software, etc.)

• Request planning and scheduling (including policy representation and enforcement)

• Task execution (including agent computing, fault management, etc.)

2) Development of Virtual Data Toolkit (VDT)3) Applications: ATLAS, CMS, LIGO, SDSS

• PIs=Avery (Florida), Foster (Chicago)

Page 92: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

92

User View of PVDG Architecture

Virtual Data Tools Request Planning and Scheduling Tools

Request Execution Management Tools

Transforms

Distributed resources(code, storage,computers, and network)

Resource Management

Services

Resource Management

Services

Security and Policy

Services

Security and Policy

Services

Other Grid Services

Other Grid Services

Interactive User Tools

Production Team

Individual Investigator Other Users

Raw data source

Page 93: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

93

GRID Computing

• GRID computing is a very hot topic at the moment.

• HENP is involved in many GRID R&D projects, with the next steps aimed at providing real tools and software to experiments.

• The problem is a large one and it is not yet clear that the concepts will turned into effective computing.– CMS@HOME?

Page 94: Computing as a Tool for High Energy and Nuclear Physicshome.fnal.gov/~wolbers/jeffersonlab.pdf · • Particle and nuclear physics use beams of particles striking targets to study

April 25, 2001 Stephen Wolbers, Jefferson Lab Colloquium

94

Conclusions

• Computing in HENP is a vital tool for extracting physics results, especially as the volume of data continues to dramatically increase.

• The challenges grow rapidly, certainly as fast or faster than advances in computing hardware and software techniques.

• Future experiments/accelerators/theory will continue to more heavily rely on computing, and new ideas like GRID computing will be pursued.