online summary

21
CM26 March 2010 Jean-Sebastien Graulich Slide 1 Online Summary Online Summary o The heplnw17 case o DAQ o CAM o Online Reconstruction o Data Base o Data Storage Jean-Sebastien Graulich, Geneva Software also discussed in the same session, not Software also discussed in the same session, not reported here reported here

Upload: alcina

Post on 08-Jan-2016

25 views

Category:

Documents


0 download

DESCRIPTION

Online Summary. Jean-Sebastien Graulich, Geneva. The heplnw17 case DAQ CAM Online Reconstruction Data Base Data Storage. Software also discussed in the same session, not reported here. Online Activities. Main Issue: Breakdown of heplnw17 Not discussed in Online session - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 1

Online SummaryOnline Summary

o The heplnw17 case

o DAQ

o CAM

o Online Reconstruction

o Data Base

o Data Storage

Jean-Sebastien Graulich, Geneva

Software also discussed in the same session, not Software also discussed in the same session, not reported herereported here

Page 2: Online Summary

Online ActivitiesOnline Activities

Main Issue: Breakdown of heplnw17Main Issue: Breakdown of heplnw17 Not discussed in Online session Not discussed in Online session

Collaboration Forum

Revealed Revealed Lack of robustness, single point of failure Original misunderstanding:

Private network <-> Protected subnet Need for formally agreed support (from PPD or

ISIS ?)

ConsequenceConsequence 3 months of relative chaos (bad) Start working on a general computing and

network requirement document (good)

CM26 March 2010 Jean-Sebastien Graulich Slide 2

Page 3: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 3

DAQ achievementsDAQ achievements

DAQ system upgrade is readyDAQ system upgrade is ready Luminosity monitors integrated Luminosity monitors integrated Trigger system cabling optimizedTrigger system cabling optimized DAQ and Trigger System ConsolidationDAQ and Trigger System Consolidation Cabling documentation in progressCabling documentation in progress Progress in EMR front-end electronicsProgress in EMR front-end electronics

Page 4: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 4

Event BuildingEvent Building

The synchronization problem between The synchronization problem between the two crates persiststhe two crates persists

We incriminate the PCI/VME interface It couldn’t be replaced because

all the spares used for the mirror DAQ system Massive failure of boards: 4 out of 10 boards had to

be send for repair

In the meanwhileIn the meanwhile Online monitoring histogram allow to spot the

problem A VME and PC power cycle solve it temporarily Shifter’s attention is required

Page 5: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 5

Schedule Schedule MilestonesMilestones

From CM25From CM25 CAM data in Online Data StreamCAM data in Online Data Stream Nov 09Nov 09 -> Mai -> Mai

1010 Tracker integrated in DAQ and OLMTracker integrated in DAQ and OLM Jan 10Jan 10 -> July 10 -> July 10 TOF TDC Clock SynchronizationTOF TDC Clock Synchronization March 10March 10 -> Aug 10 -> Aug 10

More complicated than first thought. Need a dedicated board Burst Gate Signal in the Trigger SystemBurst Gate Signal in the Trigger SystemNeed support here

--------------------------------------------------------------------------

The priority has been set to the DAQ and Trigger system The priority has been set to the DAQ and Trigger system upgrade and consolidationupgrade and consolidation

DAQ System upgradeDAQ System upgrade Mai 10Mai 10 Production of SW/EMR Front End ElectronicsProduction of SW/EMR Front End Electronics Jan 10 Jan 10 -> -> StartedStarted

Page 6: Online Summary

Control and Control and MonitoringMonitoring

Outstanding ProgressOutstanding Progress “Control is under control”

Computer management ---> OKComputer management ---> OK Software Management ---> OKSoftware Management ---> OK Data Management ---> OKData Management ---> OK DocumentationDocumentation ---> OK ---> OK

All this sustained by mainly two All this sustained by mainly two individualsindividuals

James Leaver and Pierrick Hanlet

CM26 March 2010 Jean-Sebastien Graulich Slide 6

Page 7: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 7

Page 8: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 8

Page 9: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 9

Page 10: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 10

Page 11: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 11

Page 12: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 12

Page 13: Online Summary

CAMCAM

Decay solenoid included in the alarm Decay solenoid included in the alarm handlerhandler

Linde control panel mirrored into Linde control panel mirrored into EPICSEPICS

Very useful for expert remote monitoring

Next:Next: remote gateway and remote archive viewer new IOCs for new equipments

The “Long, hard road to ramp up CaM The “Long, hard road to ramp up CaM infrastructure and knowledge base” has lead infrastructure and knowledge base” has lead us to a point where we no longer foresee us to a point where we no longer foresee difficult hurdles to regularly add new IOCs, difficult hurdles to regularly add new IOCs, monitoring, alarm handling, and archiving…monitoring, alarm handling, and archiving…

CM26 March 2010 Jean-Sebastien Graulich Slide 13

Page 14: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 14

Page 15: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 15

Page 16: Online Summary

CM26 March 2010 Jean-Sebastien Graulich Slide 16

Page 17: Online Summary

Data baseData base

What it doesWhat it does Store Configuration = Set values != read values Document hardware status

- Geometry <-> G4MICE

- Cabling

- Alarm Handler settings, etc

Record automatically the magnet settings, ‘ISIS Record automatically the magnet settings, ‘ISIS settings’, target information and DAQ settings’, target information and DAQ information information

Superset of what is currently entered manually into the run configuration spreadsheet on the MICO page

Allow retrieving these settings at the start of Allow retrieving these settings at the start of the runthe run

Also allow saving settings not attached to a runAlso allow saving settings not attached to a run E.g. Pion at 300 MeV/c

EPICS client developed by James Leaver for thisEPICS client developed by James Leaver for this

CM26 March 2010 Jean-Sebastien Graulich Slide 17

Page 18: Online Summary

Data base statusData base status

Progress was suspended in January due Progress was suspended in January due to failure of heplnw17to failure of heplnw17

Local copy of DB system under Local copy of DB system under development in Glasgow, progress development in Glasgow, progress resumed resumed

The main server functionality requested The main server functionality requested has now been implemented has now been implemented (except cabling)(except cabling)

Proper migration to Rutherford Lab Proper migration to Rutherford Lab scheduledscheduled

the bulk of outstanding work

See See David ForrestDavid Forrest’s talk for details’s talk for details

CM26 March 2010 Jean-Sebastien Graulich Slide 18

Page 19: Online Summary

Data StorageData Storage

The only formally-agreed route for The only formally-agreed route for access to data (DAQ output) is via the access to data (DAQ output) is via the Grid.Grid.

The Grid Transfer Box (miceacq05) is The Grid Transfer Box (miceacq05) is located in the MLCR. It will eventually located in the MLCR. It will eventually run an autonomous agent that reads the run an autonomous agent that reads the data from the RAID system in the MLCR data from the RAID system in the MLCR and uploads it to the Grid, in particular and uploads it to the Grid, in particular the CASTOR tape system at RALthe CASTOR tape system at RAL

In the meantime data IS being uploaded In the meantime data IS being uploaded to the Grid, but on a manual, next-day to the Grid, but on a manual, next-day timescale.timescale.

CM26 March 2010 Jean-Sebastien Graulich Slide 19

Page 20: Online Summary

Data AccessData Access

Henry Nebrensky Henry Nebrensky presented a tutorial on presented a tutorial on how to access the data using the gridhow to access the data using the grid

Open IssuesOpen Issues Permanent storage is on Tape at RAL

Long access time (Robot loading the tape)We should foresee a place where actively used data is stored on disk

Files on tape must be at least 200 MB… Eventually, someone on duty (MOM or shifter) will

need to have a Grid certificateOnce again we have a single (human) point of failure here

CM26 March 2010 Jean-Sebastien Graulich Slide 20

Page 21: Online Summary

General CommentGeneral Comment

The MOG still suffers for a loose The MOG still suffers for a loose leadershipleadership

Compensated by the enthusiasm and Compensated by the enthusiasm and commitment of the individuals commitment of the individuals inside the groupinside the group

Most members are on short term Most members are on short term contractscontracts

Linda and Pierrick depend on NFS grant David has to write his Ph.D. James will leave in January 2011 I’ll leave on June 2011

CM26 March 2010 Jean-Sebastien Graulich Slide 21