southgrid technical meeting pete gronbech: 16 th march 2006 birmingham

10
Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Upload: janel-hopkins

Post on 29-Dec-2015

214 views

Category:

Documents


2 download

TRANSCRIPT

Page 1: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Southgrid Technical Meeting

Pete Gronbech: 16th March 2006

Birmingham

Page 2: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Present

• Pete Gronbech – Oxford• Rosario Esposito - Oxford• Chris Brew – RAL PPD• Yves Coppens – Birmingham• Winnie Lacesso - Bristol

Page 3: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Agenda

• 10:30 Start• Pete + Others• 12pm Lunch• Interactive Workshop!!• 3:15pm Coffee ??• 4:20pm Finish

Page 4: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Agenda Topics

• LCG-2_7_0 – Experiences from Yves & Chris– Plans for Oxford and Cambridge

• Monitoring– Network Monitoring Box– Ganglia at Bristol once webserver ready.– Ganglia mods for VO’s Help again from Chris?– Nagios anyone?– Pakiti– aid – tripwire??– swatch /ranger

• SC4 – T2 workshop Who is going?– Throughput tests –Bristol and Cam repeat tests at end of Month– Network connectivity at Bristol/ Cambridge at 1Gbps ?? Next week Bristol, Cam ??– UI FTS client works out of the box.– Storage security challenge?? Do we know which logs to look at or even are the

SRM’s doing enough logging.

– Re security Challenge a Best practice how to should now be made available on the wiki….

Page 5: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Agenda Topics

• ALICE paying for machine to act as VO box at Birmingham. Possibly also at other SouthGrid sites. Security. Root access. policy?? Yves will test first …

• Unified Naming Scheme? UKI-SOUTHGRID-OXF ?? Oxford to try it!!• Cambridge progress with APEL / Condor?• future upgrades • VO support Can all SouthGrid sites support the same VO’s? With lcg

2_7_0 vo tool available.• On clusters with various memory allocations have to advertise

memory available per job slot not per machine!!• Backups:

– DPM database on se.– ce– se– mon– lfc

• Central Logging Machine also useful as a secondary backup of logs• VRVS demo

Page 6: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

LCG 2_7_0

• Birmingham SL304 problems with mktmp so upgrading to sl305 first helps.

• Check SL mirror is still OK.• Bham other problems

– info provider on ce (extra info from maui only worked for default maui setup, as we have a customized config )

• Recent (March 7th) Bristol Upgrade was much smoother. Many bugs fixed

• SouthGrid Now using more modules eg for ganglia• For new nodes use pbsnodes –o fqdn then when

happy use pbsnodes –c fqdn• Plan for Oxford Next week and Camb shortly after.

Page 7: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

SL 3 – 4- 5

• Summary of Talk given at CERN wrt to SL versions.

• SL5 will be to late for LHC so push to certify SL4 by end of March and migrate in Autumn 2006

• There will be no OS upgrade planned for 2007!

Page 8: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Future Upgrades

• RAL PPD (£250K)– 52 dual core dual opteron wn’s (late March

06)– 7 * 8TB sata disk servers (2 sys, 1 parity, 1

Hot spare, 20 data)– Network: separate 1Gb/s fibre. separate from

rest of PPD. Nortell switches (8) 10Gb capable. so should be able to have a 10Gb link to RAL T1.

– CPU over MoU can be used for T3 local PPD VO

Page 9: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Future Upgrades

• Oxford– Still waiting for our new computer room…..now

Autumn 06– Short term air con upgrade to allow us to stay as

we are!

• Bristol– Uni cluster to go on 5th floor of physics.– New room ?? June?? seems un realistic– Need to know how to send LCG jobs to external

clusters – We should ask LT2 how they did escience centre.

Page 10: Southgrid Technical Meeting Pete Gronbech: 16 th March 2006 Birmingham

Future Upgrades

• Birmingham– Atlas farm may get integrated more but…– Babar farm increased nodes but h/w

reliability problems. Some new disks purchased and may buy some replacement PSU’s

– babar ce to be migrated to gridpp ui box.– Lawrie applying for esci clusters….