dpm collaboration motivation and proposal

26
Grid Technology CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/ DB CF CF GT DPM Collaboration Motivation and proposal Oliver Keeble CERN On behalf of DPM/LFC team

Upload: arlais

Post on 19-Jan-2016

46 views

Category:

Documents


0 download

DESCRIPTION

DPM Collaboration Motivation and proposal. Oliver Keeble CERN On behalf of DPM/LFC team. Overview. Context: middleware funding Why DPM matters To whom it matters What DPM represents today What to do about point 1 above…. Overview. Context: middleware funding Why DPM matters - PowerPoint PPT Presentation

TRANSCRIPT

Page 1: DPM Collaboration Motivation and proposal

Grid Technology

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

DBCFCFGT

DPM CollaborationMotivation and proposal

Oliver Keeble

CERN

On behalf of DPM/LFC team

Page 2: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Overview

1. Context: middleware funding

2. Why DPM matters

3. To whom it matters

4. What DPM represents today

5. What to do about point 1 above…

DPM Collaboration / GDB / 11/07/2012

Page 3: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Overview

1. Context: middleware funding

2. Why DPM matters

3. To whom it matters

4. What DPM represents today

5. What to do about point 1 above…

DPM Collaboration / GDB / 11/07/2012

Page 4: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Context (I)

• Middleware has enjoyed European funding for over 10 years

• This ends with the EMI project• DPM/LFC is a project heavily reliant on this

source of support• At the same time, the project has never

been in better shape…• We propose a collaboration of stakeholders

to drive the project forward

DPM Collaboration / GDB / 11/07/2012

Page 5: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Context (II)

DPM Collaboration / GDB / 11/07/2012

Page 6: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Overview

1. Context: middleware funding

2. Why DPM matters

3. To whom it matters

4. What DPM represents today

5. What to do about point 1 above…

DPM Collaboration / GDB / 11/07/2012

Page 7: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT DPM today - deployment

DPM Collaboration / GDB / 11/07/2012

• 36PB– 10 sites with > 1PB

• Over 200 sites in 50 regions– ~90 WLCG sites

• Over 300 VOs have access to DPMs

Page 8: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Disk storage

DPM Collaboration / GDB / 11/07/2012

Source: Information System

Page 9: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Overview

1. Context: middleware funding

2. Why DPM matters

3. To whom it matters

4. What DPM represents today

5. What to do about point 1 above…

DPM Collaboration / GDB / 11/07/2012

Page 10: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Who depends on DPM

DPM Collaboration / GDB / 11/07/2012

Page 11: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Overview

1. Context: middleware funding

2. Why DPM matters

3. To whom it matters

4. What DPM represents today

5. What to do about point 1 above…

DPM Collaboration / GDB / 11/07/2012

Page 12: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Dispelling myths

• Some DPM myths:• DPM is only for small sites

– No, there are several multiPB DPMs, incl. a T1

• DPM doesn’t do xrootd– A new plugin has been created

• DPM doesn’t represent much storage– DPM represents an respectable fraction of grid

storage

• DPM hasn’t seen investment– DPM has seen significant investment thanks to the

EMI project• Not everything has been released yet

DPM Collaboration / GDB / 11/07/2012

Page 13: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT DPM today – new features

• HTTP/WebDAV: released with EMI2• NFSv4.1: released r/o with EMI2• Refactoring (dmlite) to make future additions

much easier and allow performance improvements

• Perfsuite: internal performance testing suite, completed recently

• Nagios probes: released during EMI1• EPEL compliance – you can find it there

now• + more!

DPM Collaboration / GDB / 11/07/2012

Page 14: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GTPerformance, performance, performance

DPM Collaboration / GDB / 11/07/2012

https://cdsweb.cern.ch/record/1458022?ln=en

Page 15: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT LFC

• LFC is a file catalogue derived from DPM’s nameserver

• It is currently used by Atlas and LHCb– Both have declared their intention to move away

from LFC– Timescales are not clear though

• It is also used by other VOs…

DPM Collaboration / GDB / 11/07/2012

Page 16: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GTDPM tomorrow – features and opportunities

• HTTP Federation: ongoing work with dCache

• Memcache: scheduled for Q3 release– Fast in-memory namespace cache

• Improved xrootd support– In production testing

• Different backends have been prototyped: – HDFS

– S3

• Disk/tape decoupling offers new deployment scenarios for DPM as a radically simpler solution for larger sites

DPM Collaboration / GDB / 11/07/2012

Page 17: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT DPM xrootd plugin

GOAL: functional and performance improvements.

• Performance is of primary importance– Profit from changes in DPM

• Allow xrootd clients to access and write files to DPM– Use X.509 authentication, with VOMS extensions

• DPM does the usual authorization– Optionally allow the ALICE “tkauthz” mechanism

• Mixed multi-VO setups– A single DPM can serve more than one VO

• Support for per-VO xrootd federations– Following the evolution of the ATLAS LFC-based name

translations• Currently under beta testing by Atlas

DPM Collaboration / GDB / 11/07/2012

Page 18: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GTGT

Providing XROOTD access to DPM (for all VOs)

Increasing GET performance

1. We were here (~1-20 req/sec)

1. We were here (~1-20 req/sec)

2. Aiming here now (red line)

2. Aiming here now (red line)

3. Will be here with the NS improvements

3. Will be here with the NS improvements

4. Will be higher with the newer architecture (see CHEP contribs)

4. Will be higher with the newer architecture (see CHEP contribs)

DPM Collaboration / GDB / 11/07/2012

Page 19: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Who depends on DPM

DPM Collaboration / GDB / 11/07/2012

Page 20: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Overview

1. Context: middleware funding

2. Why DPM matters

3. To whom it matters

4. What DPM represents today

5. What to do about point 1 above…

DPM Collaboration / GDB / 11/07/2012

Page 21: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Proposal

• Those to whom DPM matters are invited to express their interest in a collaboration

• Who– All stakeholders, sites, projects, NGIs…

• Why (ie the benefits)– A sustainable vehicle for evolving the software on which they depend

• Ensure continued support, updates and development

– Protection of investment. Ensure that existing DPM installations remain well maintained and able to evolve with the hardware and OS.

– Influence according to contribution• Credit system?

• Nature of collaboration to be decided by involved parties

• Potentially very important for larger sites considering investment in DPM

– Opportunity to seek funding in the context of participation in an international project

DPM Collaboration / GDB / 11/07/2012

Page 22: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Different Contributions

• Development and research– thanks to recent improvements this has become

easy• examples: memcached modules • interesting for experienced sites and universities

• Staged Rollout/stress testing– as important as development

• Operational tools• Support

– including improved documentation for sites– configuration management

DPM Collaboration / GDB / 11/07/2012

Page 23: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Ideas on organisation

• Current reality: – direct influence: WLCG, CERN, EMI – indirect influence: sites and volunteers– a lot depends on CERN IT – classical “producer – consumer” relation

• not helpful to establish a collaborative approach

• We need something where effort is translated into influence

DPM Collaboration / GDB / 11/07/2012

Page 24: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Possible collaboration model

– The shape of the collaboration will be defined by the members in the end

– straw-man: – influence based on contribution ( by effort )

• all forms have equal weight

– main contributors (3-5) agree on a MoU• CERN will be one of these• without a MoU local needs “win” too often

– see experience with staged rollout

– volunteers are welcome and encouraged– low threshold for contributing

– major directional changes agreed once or twice a year by a collaboration board meeting

– Combined with community workshop

DPM Collaboration / GDB / 11/07/2012

Page 25: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Alternative

• DPM will be in the best state possible by the end of EMI

• If we can’t find the resources: – Try to maintain DPM during the transition phase

after the end of EMI• To allow for an orderly migration of the data• not clear how long this will take ( 1-2 years??)• decreasing effort needed ( see LCG-CE )

– CERN’s contribution to a collaboration could be used to maintain a static DPM for an extended period

• Until natural obsolescence

DPM Collaboration / GDB / 11/07/2012

Page 26: DPM Collaboration Motivation and proposal

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

GT Contacts and References

• Collaboration– [email protected][email protected]

• DPM technical– [email protected]

• CHEP 2012 papers:– Overview

• https://cdsweb.cern.ch/record/1458022?ln=en

– HTTP/dav• https://cdsweb.cern.ch/record/1457962?ln=es

– Dynamic federations• https://cdsweb.cern.ch/record/1460525?ln=en

DPM Collaboration / GDB / 11/07/2012