wlcg operations coordination

11
IT-SDC : Support for Distributed Computing WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013

Upload: gwyn

Post on 22-Feb-2016

28 views

Category:

Documents


0 download

DESCRIPTION

WLCG Operations Coordination. Andrea Sciabà IT/SDC GDB 11 th September 2013. Outline. Status of task forces News from EGI Conclusions. Middleware news. perfSONAR now tracked in baseline versions table End of support for dCache 1.9.12 was extended to September 30 - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: WLCG Operations Coordination

IT-SDC : Support for Distributed Computing

WLCG Operations Coordination

Andrea SciabàIT/SDC

GDB11th September 2013

Page 2: WLCG Operations Coordination

2IT-SDC

Outline

Status of task forces News from EGI Conclusions

WLCG Operations Coordination – A. Sciabà

Page 3: WLCG Operations Coordination

3IT-SDC

Middleware news

perfSONAR now tracked in baseline versions table

End of support for dCache 1.9.12 was extended to September 30 Due to delay in releasing SHA-2 ready version

dCache 2.2.17 (just released) is SHA-2 compliant

All sites should update their BDIIs to the latest version, including fixes for GLUE-2 and security

WLCG Operations Coordination – A. Sciabà

Page 4: WLCG Operations Coordination

4IT-SDC

gLExec

Tickets opened to sites 39 closed and verified 55 still open (but many on hold until

mid/late autumn to unify with SL6 migration)

Some countries already done, in particular France

CMS will make the gLExec SAM test critical by late fall

WLCG Operations Coordination – A. Sciabà

Page 5: WLCG Operations Coordination

5IT-SDC

SHA-2 Will be covered in detail today by Peter Just a few highlights

WLCG “deadline” extended to December 1st

By then we expect users to be able to use SHA-2 certificates for their work It should be compatible with measured state and progress

Validation status by experiment ALICE: various software still to be checked ATLAS and LHCb: all central services verified CMS: all OK but for DBS-2 (soon to be decommissioned anyway), end-to-end

job submission tests done Need testing job submission with SHA-2 certificates for pilots

VOMRS → VOMS-Admin migration IT to deploy VOMS-Admin servers for the VO managers to test Migration will be fully automated In the worst case, VOMRS can still be used if users authenticate with it using

a SHA-1 certificate and register DN and CA of their new SHA-2 certificate

WLCG Operations Coordination – A. Sciabà

Page 6: WLCG Operations Coordination

6IT-SDC

CVMFS

Patch released with security fix, sites should upgrade or apply hot fix

ALICE 33/51 sites deployed it, the rest by this fall:

excellent progress! CMS

Using gLite UI from CVMFS for Parrot Only ~10 CMS sites remaining!

New “CVMFS” SU in GGUS under “File System”

10 July 2013WLCG Operations Coordination – A. Sciabà

Page 7: WLCG Operations Coordination

7IT-SDC

FTS-3

ATLAS, CMS and LHCb using FTS3 for functional tests or production transfers Up to 30% of ATLAS production transfers Using CERN (Oracle, MySQL) and RAL

(MySQL) instances Instances at PIC, IN2P3-CC and ASGC not yet

used Some bugs found under load and fixed

Overall, very impressive results

10 July 2013WLCG Operations Coordination – A. Sciabà

Page 8: WLCG Operations Coordination

8IT-SDC

Tracking tools evolution

Savannah-to-JIRA migration Completed migration for some ALICE and LHCb

trackers ATLAS wrote the list of projects to migrate

Meetings in October to discuss various issues Savannah→JIRA for experiments and for the GGUS

tacker New GGUS functionality (e.g. tickets to multiple

sites) Savannah→GGUS for CMS

10 July 2013WLCG Operations Coordination – A. Sciabà

Page 9: WLCG Operations Coordination

WLCG Operations Coordination – A. Sciabà 9IT-SDC

Middleware readiness verification

Basically consisting in testing middleware and service updates on O(10) volunteering sites on a fraction of their production resources by running real workflows

Extends staged rollout using WLCG use cases

Membership (and coordinators) to be defined Start from the “old” middleware deployment

TF?

10 July 2013

Page 10: WLCG Operations Coordination

WLCG Operations Coordination – A. Sciabà 10IT-SDC

News from EGI operations

At the EGI Technical Forum (16-20/9) there will be training for site admins Including how to correctly publish

information in GLUE-2 Discussion started on the opportunity to

include Frontier/Squid in UMD It implies staged rollout, early adopters, etc. To be seen how many sites would find it

more convenient. Feedback?

10 July 2013

Page 11: WLCG Operations Coordination

11IT-SDC

Conclusions

Several task forces and activities on a very good track Progress measured by numbers of sites/tickets

Machine/job features TF fully set up, ready to start

Middleware readiness verification TF still to set up

Next WLCG Operations Coordination meeting on 19th September

10 July 2013WLCG Operations Coordination – A. Sciabà