the muon conditions data management: database architecture and software infrastructure monica...

30
The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009 11th ICATPP Villa Olmo Como (Italy)

Upload: brittney-glenn

Post on 12-Jan-2016

214 views

Category:

Documents


1 download

TRANSCRIPT

Page 1: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Monica VerducciUniversity of Wuerzburg & CERN5-9 October 2009 11th ICATPP Villa Olmo Como (Italy)

Page 2: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Outline

• Introduction @ the ATLAS Muon Spectrometer

•Muon Spectrometer Data Flow: Trigger, Streams and Conditions Data

•Muon Conditions Database Storage and Architecture

•Software Infrastructure•Applications and Commissioning

test

2The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 3: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

3

The ATLAS Muon Spectrometer

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 4: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

4

The ATLAS Detector

Page 5: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

The Muon Spectrometer

5

Three toroidal magnets create a magnetic field with:• Barrel: ∫Bdl = 2 – 6 Tm• Endcaps: ∫Bdl = 4 – 8 Tm

• RPC & TGC: Trigger the detector and measure the muons in the xy and Rz planes with an accuracy of several mm.• CSC: Measure the muons in Rz with ~80 mm accuracy and in xy with several mm. Cover 2<| |<m 2.7• MDT: Measure the muons in Rz with ~80 mm accuracy . Cover |m|<2

Page 6: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

6

The Muon Spectrometer IICathode-Strip

Chambers (CSC) : 32 chambers, 31k channels

Monitored Drift Tube (MDT): 1108 chambers, 339k channels

Thin Gap Chambers (TGC): 3588 chambers, 359k channels

Resistive Plate Chambers (RPC): 560 chambers, 359k channels

Trigger chambers

Precision chambers

Need a good resolution in the timing, pT and position measure to achieve the physics goals! Extremely fine checks of all the parts of each subdetector! Huge amount of information…

Need a good resolution in the timing, pT and position measure to achieve the physics goals! Extremely fine checks of all the parts of each subdetector! Huge amount of information…

Page 7: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

7

Data Flow and Conditions Data

Page 8: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

subset

ATLAS Event Flow

8

Output Streams

Detector Parameters

Configuration DB

TRIGGER

Conditions DB

Non event data

ATHENA

Offline reconstruction

Event Selection

Pri

mary

Pat

hol

ogic

al

Express

Calib

ratio

n &

Alig

n

Hierarchical trigger system ~MB/sec

~PB/year raw data

109 events/s =>1GHz1 event~ 1MB (~PB/s)

Page 9: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The MUON “Non Event Data”

A typical ATLAS “Non-Event data” could be a:▫ Calibration and Alignment data (from express and

calibration streams for a total data rate of about 32MB/s, dominated by the inclusive high pt leptons (13% EF bandwidth= 20Hz of 1.6MB events). RAW Data -> 450 TB/year. More streams are now subsumed into the express stream)

▫ PVSS Oracle Archive, i.e. the archive for the DCS « slow control system » data, and DAQ via OKS DB.

▫ Detector configuration and connectivity data, specific subdetector data

Mainly used for:▫ Diagnostic by detector experts▫ Geometry, DCS▫ Sub-Detector hardware and software▫ Data defining the configuration of the

TDAQ/DCS/subdetector hardware and software to be used for the following run

▫ Calibrations and Alignment▫ Event Reconstruction and analysis▫ Conditions data

9The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 10: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

10

Muons Conditions Data from Trigger Streams

Some Conditions Muon Data will be produced by detector analysis performed on the:▫ Calibration and Alignment Stream

Muons are extracted from the second level trigger (LVL2) at a rate of ~1 KHz, data are streamlined to 3 Calibration Centres (Ann Arbor, Munich, Rome (from to Naples for RPCs)

100 CPUs each, ~1 day latency for the full chain▫ Express Stream

10

The ATLAS Trigger will produce 4 streams (200Hz, 320 MB/s):

Primary stream (5 streams based on trigger info: e,m,jet)

Calibration and Alignment Stream (10%) Express Line Stream (Rapid processing of

events also included in the Primary Stream 30 MB/s, 10%)

Pathological events (events not accepted by EF)

40 MHz

105 Hz

103 Hz

102 Hz

Front end pipelines

Readout buffers

Processor farms

Switching network

Detectors

µsec

ms

sec

LVL 1

LVL 2

LVL 3

25ns

~PB/sec

~MB/sec

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 11: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Muon Conditions Data• Calibration for the precision chambers • Alignment from sensors and from tracks• Efficiency flags for the trigger chambers

• Data Quality flags (dead / noisy channels) and final status for the monitoring

• Temperature map, B field map

• DCS information (HV,LV,gas…)• DAQ run information (chamber

initialized)

• SubDetector Configuration parameters (cabling map, commissioning flags…)

11

Calibration Stream & offline algo (express stream)

Analysis algorithms

Hardware Sensor

OKS2COOL and PVSS2COOL

Constructor parameters

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 12: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Storage of the ‘non-event’ data

• There are different Database storage solution to deal the different hardware and software subdetector work point.

1. Hardware Configuration DB▫ Oracle private DB, architecture and maintenance

under detector’s experts 2. Calibration & Alignment DB

▫ Oracle private DBs, one for the MDT Calibration (replicated in three centers) and one for the Alignment sensors.

3. Condition DB▫ Contains a subset and less granularity

information▫ Cool Production DB

12The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 13: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Conditions DataBase

• The Conditions data are non-event data that could:

Vary with time May exist in different versions Data coming from both offline and online

• The Conditions DB is mainly accessed by the ATLAS offline reconstruction framework (ATHENA)

• Conditions Databases are distributed world-wide (for scalability)▫ accessed by an “unlimited” number of computers on the

Grid: simulations jobs, reconstruction jobs, analysis jobs,…

• Within ATLAS, the master conditions database is at CERN and using Oracle replica mechanism will be available in all Tier-1 centers

• The technology used in the Conditions DB is an LCG product: COOL (COnditions Objects for LHC) implemented using CORAL

13The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 14: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Cool Interface for Conditions Database• The interface provided by COOL:

▫ LCG RelationalAccessLayer software which allows database applications to be written independently of the underlying database technology (Oracle, in MySQL or in SQLite).

• COOL provides a C++ API, and an underlying database schema to support the data model.▫ Once a COOL database has been created and

populated, it is possible for users to interact with the database directly, using lower-level database tools

• COOL implements an interval of validity database▫ Database schema optimized for IOV retrieval &

look-up▫ objects stored or referenced in COOL have an

associated start and end time between which they are valid.

▫ times are specified either as run/event, or as absolute timestamps in agreement with the meta-data stored.

14The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 15: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Since(Time)

Until(Time)

ChannelId(Integer)

Payload(Data)

Tag(String)

15

• COOL data are stored in folders (tables)▫ Database = set of folders▫ Within each folder, several objects of the same

type are stored, each with their own interval of validity range

• COOL folders can be▫ SingleVersion: only one object can be valid at any

given time value DCS data, where the folder simply records the values

as they change with time▫ MultiVersion: several objects can be valid for the

same time, distinguished by different tags calibration data, where several valid calibration sets

may exist for the same range of runs (different processing pass or calibration algorithm)

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 16: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

16

Muon Spectrometer examples

DCS: Temperature or HV values depends on the IoV and are relative simple and small Inline Payload

Calibration Data and Alignment: Parameters with a high granularity, more parts can give the same IoV CLOB Payload

DCSHV11

Evt20Run10

Evt10Run1

TagPayloadCh IdUntilSince

CosmicsM4

T0CLOB

1Evt20Run10

Evt10Run1

TagPayloadCh IdUntilSince

LV

Page 17: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

17

Conditions Servers and Schema• Each subdetector (MDT, RPC, CSC, TGC) has a different schema necessary because of the options introduced by the Oracle Streams architecture used to replicate data from the ATLAS online server (ATONR) to the ATLAS offline server in the CERN computer centre (ATLR) and on to the servers at each of the Tier-1 sites. The different schema are as follows: ▫ Schema ATLAS_COOLONL_XYZ which can be written on the online server, and

is replicated to offline and Tier-1s. ▫ Schema ATLAS_COOLOFL_XYZ which can be written on the offline server, and

is replicated to Tier-1s.

• Each schema is associated with several accounts: ▫ A schema owner account ▫ A writer account ATLAS_COOLONL_XYZ_W is used for insert and tag

operations, ▫ A generic reader account ATLAS_COOL_READER is used for read-only

access to all the COOL schema and can be used on online, offline and Tier-1 servers.

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

T0 T1

ATONRCOOL

ATONR

COOL

ATONRCOOL

P1Oracle Stream

Oracle StreamT0 T1

ATRCOOL

ATRCOOL

Oracle Stream

Page 18: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Writing in the Conditions DB• We have several sources of conditions data

(online pit, offline, calibration stream)• The software analysis and the publishing

interface are in general different, depending on the framework where the analysis code works

• The final mechanism is unique, every insertion passes via a sqlite file and, after checking, stored in the official production DB, using python scripts.

• The service names are defined in the Oracle tnsnames.ora file, and the connection/password name are handled automatically in the authentication.xml file.

18The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 19: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Muon Software Interface

• Unique interface inside the reconstruction package: MuonConditionsSummarySvc

• Every subdetector part has its own code to handle the proper Conditions Data(DCS, Status flags, ...)using a

XYZConditionsSummarySvc which initializes several tools:

DCSConditionsTool, DQConditionsTool,...

• Using the IoVService the information used in the reconstruction algorithms is always “on time” with the current event processed

19The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 20: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

20

Access by the Reconstruction

7) Set IOV

Algorithm or AlgTool

Detector StoreService

IOVSvcService

IOVDbSvcService

IOV BD

1) Begin Run / Event

6) Get Payload

5) Update Address

2) Delete Objects (expired)

4)Retrieve CondObjColl

3)Callback

CondObj

Transient Detector Store

CondObjCollection

Access to COOL from Athena is done via the

Athena IOVDbSvc (provides an

interface between

conditions data objects in the

Athena transient detector store (TDS) and the

conditions database itself). Reading event

data, the IOVDbSvc

ensures that the correct Cond Data Obj are

always loaded into the Athena

TDS for the event currently being

analyzed.

Paylo

ad

Tim

eAddress

CondObjColl

ref

Page 21: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

21

Commissioning and TestsTests of all the chain, transfer of data

(streaming), access to the data in reconstruction job have been carried out

The cosmics data have been stored successfully (in particular alignment and calibration info)

The Muon data replica and access have been tested inside the overall ATLAS test with some dummy data:◦ The production schema for ATLAS have been

replicated from the online RAC ATONR to ATLR and then on to the active Tier-1 sites

◦ Tests on the access by ATHENA and on the replica/transfer data between Tier1 and Tier0 have been done, good performance @ Tier1 (~200 jobs in parallel – tests done in 2006).

• Tests of the Muon Data Access by Reconstruction partially done without any problems

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 22: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

Conclusions

•The Muon DataBase layout has been defined and tested.

•The Access and architecture for most of the Muon Conditions Data have been extensively tested.

•The Muon data replica and access during the ATLAS test have been positive

22The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 23: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

23

Backup

Page 24: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

24

Muon Calibration StreamMuons in the relevant trigger regions are

extracted from the second level trigger (LVL2) at a rate of ~1 KHz

Data are streamlined to 4 Calibration Centers◦ Ann Arbor, Munich, Rome and Naples for RPCs◦ 100 CPUs each

The stream is useful also for Data Quality Assessment, alignment with tracks and trigger efficiency studies

~1 day latency for the full chain◦ From data extraction, to calibration computation

at the Centers, to writing the calibration constants in the

Conditions DB at CERNNeed to carefully design the data flow and

the DB architecture

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 25: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

25

Calibration and Alignment Stream

Production of NON-EVENT DATA in different steps, used for the event recostruction

• Input Raw Data can come from the event stream or be processed by the sub-detector read-out system. RODs level (not read-out by the standard DAQ path, not physics events: pulse signal)

• At the event filter level (standard physics events: Z ->ee, muon sample, etc.)

• After the event filter but before the “prompt reconstruction”

• Offline after the “prompt reconstruction” (Z ->ee, Z+jet, ll )

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 26: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

26

• Calibration Stream (technical detector stream)▫ An Inner Detector Alignment Stream (100 Hz reco

tracks info 4kB)▫ A LAr electromagnetic calorimeter stream (50Hz of

inclusive electrons pt > 20 GeV up to 50kB)▫ A muon calibration stream (Level1 trigger region

~10kHz for 6kB)▫ Isolated hadron (5Hz for 400 kB)

• Express Stream (processed promptly, i.e. within < 8 hours)▫ contain all calibration samples needed to extract

calibration constants before the 1st-pass reconstruction of the rest of the data: Z ll, pre-scaled W l, tt, etc.

▫ Inclusive high-pt electrons and muons (20Hz with full event read-out 1.6MB)

• These streams sum to a total data rate of about 32MB/s, dominated by the inclusive high pt leptons (13% EF bandwidth= 20Hz of 1.6MB events). RAW Data -> 450 TB/year. More streams are now subsumed into the express stream

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 27: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

27

Some numbers: CondDB design

▫ATLAS daily reconstruction and/or analysis job rates will be in the range from 100k to 1M jobs/day

▫For each of ten Tier-1 centers that corresponds to the Conditions DB access rates of 400- 4000 jobs/hour

▫Each reconstruction job will read 10-100MB of data

▫Atlas requests to Tier-1s is a 3-node RAC cluster dedicated to the experiment.

▫Expected rate of data flow to Tier-1s is between 1-2 GB/day

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 28: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

28

0

500

1000

1500

2000

0 5 10 15 20 25 30

Concurrent Jobs per Oracle CPU

Jobs

/hr

COOL 2.2 "no DCS"

COOL 2.2 "with DCS" (to be optimized)"

COOL 2.2 with "10xDCS" (to be optimized)

COOL 2.1 "no DCS" (manual optimization)

CNAF result for "with DCS"

Vaniachine @ CHEP

Workload scalability results (2006)

In ATLAS we expect 400 to 4,000 jobs/hour for each Tier1For 1/10th of the Tier1 capacities that corresponds to the rates of 200 to 2,000 jobs/hour

Good Results!

For the Access by Athena we have obtained 1000 seconds per job events at ATLR due to the DCS and TDAQ schema access!

Page 29: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

29

M.Verducci 29

ATLASDetector

DCSDetector Con. Sys.•HV, LV•Temperature•Allingment

Front-End

EventFilter

Level2Trigger

ROSs

Level1Trigger

VME CrateRODs

ATHENA code

Configuration Database

Conditions Database

ByteStream Files

Manual Input

TCorddb

RODHLT/DAQ

DCS System

Online Calib. farm

RODHLT/DAQ

DCS System

Monitorqueries

Reco.farms

Offline analysis

Geom.

Setup

Calib

CONFIGURATION DB

Geom.

Setup

Calib

CONDITION DB

Monitordata

DCS

Databases are organized collections of data• Organized according to a certain data model• The data model defines not only the structure but also whichoperations can be performed

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

Page 30: The Muon Conditions Data Management: Database Architecture and Software Infrastructure Monica Verducci University of Wuerzburg & CERN 5-9 October 2009

The Muon Conditions Data Management: Database Architecture and Software Infrastructure

30

• Eµ~ 1TeV -> Δ~500µm• /m ~10% -> δΔ~50µm• Alignment accuracy

~30µm • B Field accuracy

|ΔB| ~ 1-2 mT

Alignment Accuracy

testbeam results

Muon Spectrometer Strategy for muon PID

Dilepton resonances (mostly Z) sensitive to:

• Tracker-spectrometer misalignment• Uncertainties on Magnetic field• Detector momentum scale• Width is sensitive to muon momentum resolution

Calibration Samples in 100pb-1:•J/ ~1600k (+~10% ’) • ~300k (+~40% /)•Z ~60k