cern it department ch-1211 geneva 23 switzerland t wlcg operation coordination 20-2-2014 luca...

17
CERN IT Department CH-1211 Geneva 23 Switzerland www.cern.ch/ WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

Upload: joella-curtis

Post on 18-Jan-2018

221 views

Category:

Documents


0 download

DESCRIPTION

CERN IT Department CH-1211 Geneva 23 Switzerland t Oracle Upgrades 2014 Hardware upgrades New servers and storage –Servers: more RAM, more CPU 128GB of RAM memory (48GB current prod machines) –Storage: more SSD cache Newer NetApp model Consolidated storage Refresh cycle of OS and OS related –Puppet & RHEL 6

TRANSCRIPT

Page 1: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

WLCG Operation Coordination20-2-2014

Luca Canali (for IT-DB)

Oracle Upgrades

Page 2: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Oracle Upgrades 2014

Scope

• Upgrade of the Oracle version for all DBs– Evolution to the latest Oracle releases– Keep the support levels, new features

• New HW– HW evolution (more capacity)– Move to the BARN (critical power)

• Simultaneous move HW+SW– Reduced downtime and risk

Page 3: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Oracle Upgrades 2014

Hardware upgrades

• New servers and storage – Servers: more RAM, more CPU

• 128GB of RAM memory (48GB current prod machines)– Storage: more SSD cache

• Newer NetApp model• Consolidated storage

• Refresh cycle of OS and OS related– Puppet & RHEL 6

Page 4: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Oracle Upgrades 2014

New Oracle software

• Latest Oracle releases– 11.2.0.4– 12.1.0.1– Note: production now on 11.2.0.3

• Evolution – how to balance– Stable services– Latest releases for bug fixes– Newest releases for new features– Fit with LHC schedule

Page 5: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Oracle Upgrades 2014

Support for Oracle versions

• 11.2.0.4– Terminal patch set of Oracle 11g – From January 2016: additional support fees– Extended support ends January 2018

• 12.1.0.1– First release of 12c– Next Patch Set: 12.1.0.2 coming in Q3 or Q4– Educated Guess: users of 12.1.0.1 will have to

upgrade to 12.1.0.2 or higher by 2016• Note

– No current Oracle version fits well the entire RUN 2

Page 6: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

DBAs and workload validation

• What the DBAs can do:– Test upgrades of integration and production

databases– Share experience across users communities– Database CAPTURE and REPLAY

• Capture workload from production and replay it in upgraded DB

• Useful to catch bugs and regressions• Unfortunately it cannot cover the edge cases

Oracle Upgrades 2014

Page 7: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Validation Infrastructure

• Application validation against new software releases & new hardware– Devdb11 is 11.2.0.4 since October 2013– Devdb12 is 12.1.0.1 since September 2013 – Integration DBs on new HW have been set up in

November 2013– Test and integration DBs upgraded

December/January – Test restores of production database servers on

new HW and SW available in Q1 2014

Oracle Upgrades 2014

Page 8: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Validation by the users community• Validation by the application owners is very

valuable to reduce risk– Functional tests– Tests with ‘real world’ data sizes– Tests with concurrent workload

• The criticality depends– on the complexity of the application– On how well they can test their SQL

• Note this is not the first major upgrade– See 10g -> 11g upgrade in Q1 2012

Oracle Upgrades 2014

Page 9: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Upgrade proposal, overview

• Move IT-DB services to 12c gradually– 12c clusterware deployed for RAC now– Select a few DB services for 12c now

• 7 DB clusters being tested and proposed for 12c– A few more services candidate to 12c later

• After tests of 12.1.0.2• We work on reduced downtime for upgrade too• Ideally another group of DBs upgraded to 12c in

December/January

• The rest of the services – To 11.2.0.4– See next slide for WLCG details

Oracle Upgrades 2014

Page 10: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Upgrade proposal, WLCG

• DB services for Experiments/WLCG– Target version 11.2.0.4– Proposed exceptions (target 12c)

• ATLARC• LHCBR

• ATLARC– Archive (low load) + TAGS (being phased out)

• LHCBR– Mainly LFC and DIRAC bookkeeping– Passed 12c DBA tests with CAPTURE-REPLAY – Pending user community tests and approval

Oracle Upgrades 2014

Page 11: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Notable Validations

• Accelerator DBs already in prod 11.2.0.4• CMSARC upgraded to 11.2.0.4

• PVSS validated for 11.2.0.4• CASTOR validation for 11.2.0.4 done• ATLAS RUCIO validated for 11.2.0.4

Oracle Upgrades 2014

Page 12: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Upgrade operation• Downtime

– Between 2h and 4h depending on the complexity– Additional time for ADG (read-only replicas)

where relevant• Tnsnames.ora

– Unchanged, we will move network ip aliases• Fallback

– If upgrade is stuck we can abort upgrade and restore services on old HW• Current prod system is unchanged (exception: in-place

upgrades for some of the online DBs)– Will keep compatible to 11.2.0.3

• Will allow rollback to 11.2.0.3 version

Oracle Upgrades 2014

Page 13: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

13

Upgrade technique: high level view

Clusterware 12c+

RDBMS target version

Redo Transport

NEW PRODUCTION

DB

CURRENT PRODUCTION DB

Redo Transport

RW AccessRW

Ace

ss

RDBMS upgrade

DATABASE downtime

Upgrade complete!

123456

Page 14: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Recap of the process

• Integration and test systems available for functional and stress-testing validation

• Test-upgraded production DBs on the future production HW available– Tests on the will-be production system

• IT-DB available to work together on regressions and bugs, if found

• We wait for the OK of the DB owners for the upgrade of production– We schedule together downtime for the

intervention

Oracle Upgrades 2014

Page 15: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Scheduling

• Target: schedule most of the upgrades in March

• Ready for upgrade and already scheduled– Castor, CMS

• The rest of upgrades should follow soon– Note: critical power will soon not be available for

what is current production (HW location called RAC10 and RAC11) due to IT works

Oracle Upgrades 2014

Page 16: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Tier 1 DBs and replication

• Tier 1 DBs – Advised to upgrade to 11.2.0.4

• TRIUMF already upgraded– When? Proposal: by the end of June

• Technology evolution for Streams – Replace Streams with its successor: Golden

Gate (Oracle)• Status: we are getting ready for rollout

– Proposed rollout for online-offline: Q2-Q3– Proposed rollout for Tier 1s: Q3

Oracle Upgrades 2014

Page 17: CERN IT Department CH-1211 Geneva 23 Switzerland  t WLCG Operation Coordination 20-2-2014 Luca Canali (for IT-DB) Oracle Upgrades

CERN IT Department

CH-1211 Geneva 23

Switzerlandwww.cern.ch/

it

Conclusions

• New HW and SW are ready for Experiments/WLCG databases

• Proposed versions for the upgrade – Most DBs to 11.2.0.4 for now– Phased approach to 12c

• Validation– Capture-replay done by DBAs– Need help of application owners too

• Upgrades– Schedule agreed with users community– Tentative: most upgrades by end of March

Oracle Upgrades 2014