tom levey (appdynamics) - the devops payrise

65
Realising the true value of DevOps The DevOps Payrise

Upload: ranger4-limited

Post on 20-Aug-2015

410 views

Category:

Technology


0 download

TRANSCRIPT

Page 1: Tom Levey (AppDynamics) - The DevOps Payrise

Realising the true value of DevOpsThe DevOps Payrise

Page 2: Tom Levey (AppDynamics) - The DevOps Payrise

Tom LeveyTech Evangelist

@tlevey

Page 3: Tom Levey (AppDynamics) - The DevOps Payrise

DevOps?

Page 4: Tom Levey (AppDynamics) - The DevOps Payrise

Developers working together with

Operations to get things done faster in an

automated and repeatable way

Page 5: Tom Levey (AppDynamics) - The DevOps Payrise

DevOps Success?

Page 6: Tom Levey (AppDynamics) - The DevOps Payrise

2005 Portal-services & handset testing

Page 7: Tom Levey (AppDynamics) - The DevOps Payrise

my����������� ������������������  desk

Page 8: Tom Levey (AppDynamics) - The DevOps Payrise

space

flashing����������� ������������������  lights

Page 9: Tom Levey (AppDynamics) - The DevOps Payrise

IREGInternational Roaming

Expert Group

Page 10: Tom Levey (AppDynamics) - The DevOps Payrise

International roaming

voice calls

video calls

signalingnetwork backbone

portal services

messaging

france network

spain network

italy network

germany network

norway network

Page 11: Tom Levey (AppDynamics) - The DevOps Payrise

Runbook1. Call L2 ops team 2. Call infrastructure ops 3. Call signaling partner 4. Call radio ops 5. Call backbone provider 6. Call international L2 ops team 7. Wait for reply... 8. Repeat 1-7 until resolution

Page 12: Tom Levey (AppDynamics) - The DevOps Payrise
Page 13: Tom Levey (AppDynamics) - The DevOps Payrise

“Everything is fine from our end.”

OPS:

Page 14: Tom Levey (AppDynamics) - The DevOps Payrise

Typical Day1. Open 30 new tickets 2. Make 200 phone calls 3. Attend executive P1 status update meeting 4. Argue about what a P1 and P2 really is 5. Reprioritise P2 tickets to P1 6. Reprioritise P3 tickets to P2 7. Close tickets as ‘Cannot reproduce’ or ‘Duplicate’

Page 15: Tom Levey (AppDynamics) - The DevOps Payrise

What did I care about?

Page 16: Tom Levey (AppDynamics) - The DevOps Payrise

P1’sSLA’s

What did I really care about?

Page 17: Tom Levey (AppDynamics) - The DevOps Payrise

What did the Business care about?

£

Page 18: Tom Levey (AppDynamics) - The DevOps Payrise

P1 = £Even though the business could never prove it.

Page 19: Tom Levey (AppDynamics) - The DevOps Payrise

How the Business really viewed me

Page 20: Tom Levey (AppDynamics) - The DevOps Payrise

How L2 & L3 Support viewed me

Page 21: Tom Levey (AppDynamics) - The DevOps Payrise

How Ops viewed me

Page 22: Tom Levey (AppDynamics) - The DevOps Payrise

False Alarms

Site is down

404 Errors

My search is slow

Page 23: Tom Levey (AppDynamics) - The DevOps Payrise

2am Friday - #FFS

We have had an alert that the load on one of your staging servers is critical.

Page 24: Tom Levey (AppDynamics) - The DevOps Payrise

How much time do false alarms waste?

Role Hours Per Week Cost Per Week Cost Per Year

Me 20 £400 £20,800

L2 10 £200 £10,400

L3 15 £300 £15,600

Hosting 6 £120 £6240

Network 6 £120 £6240

CMS 10 £200 £10,400

Total 55 £1,340 £69,680

Conservative estimates assuming £20/hour

Page 25: Tom Levey (AppDynamics) - The DevOps Payrise

How much revenue did the business lose?

No idea

Page 26: Tom Levey (AppDynamics) - The DevOps Payrise

Typical Day1. Open 30 new tickets 2. Make 300 phone calls 3. Attend executive P1 status update meeting 4. Argue about what a P1 and P2 really is 5. Reprioritize P2 tickets to P1 6. Reprioritize P3 tickets to P2 7. Close tickets as ‘Cannot reproduce’ or ‘Duplicate’

Page 27: Tom Levey (AppDynamics) - The DevOps Payrise

Things I wish I’d had

1. Automation

2. Collaboration

3. Better Tooling

4. Business Metrics

Page 28: Tom Levey (AppDynamics) - The DevOps Payrise

Things I wish I’d done

1. Baseline where I started

2. Measured my progress

3. Calculated Business Impact

4. Promote success not problems

5. Proved my value

Page 29: Tom Levey (AppDynamics) - The DevOps Payrise

Present Day

Page 30: Tom Levey (AppDynamics) - The DevOps Payrise

3 billion daily transactions

250 milliseconds

500+ updates/yr

Page 31: Tom Levey (AppDynamics) - The DevOps Payrise

1 million+ servers

100 million GB

1,000 man years

1,500 miles

Konstantin Karpov

Users Expectations

Page 32: Tom Levey (AppDynamics) - The DevOps Payrise

Web server 1

Internet FirewallLoad

Balancer

Web server 2

Database

Page 33: Tom Levey (AppDynamics) - The DevOps Payrise

Napkin architecture…

Page 34: Tom Levey (AppDynamics) - The DevOps Payrise

Key:

= bad

= not bad

Page 35: Tom Levey (AppDynamics) - The DevOps Payrise

Pre$Produc)on+APM+–+“Non+Produc)on+Data”+

Development Operations

Dev Test Staging Live

Monitor & Manage Profile QA Load Test

Pre-Production Production

Page 36: Tom Levey (AppDynamics) - The DevOps Payrise

Produc'on)APM)–)“Produc'on)Data”)

6

Development Operations

Dev Test Staging Live

Monitor & Manage

Pre-Production Production

Profile QA Load Test

Page 37: Tom Levey (AppDynamics) - The DevOps Payrise

tools can be helpful

Page 38: Tom Levey (AppDynamics) - The DevOps Payrise

right tools

right hands

right use

Page 39: Tom Levey (AppDynamics) - The DevOps Payrise

How much time and £ do these tools save?

INFRASTRUCTURE AUTOMATION

Page 40: Tom Levey (AppDynamics) - The DevOps Payrise

How much time and £ do these tools save?

DEPLOYMENT AUTOMATION

Page 41: Tom Levey (AppDynamics) - The DevOps Payrise

How much time and £ do these tools save?

LOG AUTOMATION

LogStash

Page 42: Tom Levey (AppDynamics) - The DevOps Payrise

Monitoring

How much time and $ do these tools save?

Page 43: Tom Levey (AppDynamics) - The DevOps Payrise
Page 44: Tom Levey (AppDynamics) - The DevOps Payrise

severe outage?

Page 45: Tom Levey (AppDynamics) - The DevOps Payrise

PLAN FOR FAILURE!be stronger than the weakest link

Page 46: Tom Levey (AppDynamics) - The DevOps Payrise

Traditional monitoring approach is limited

APPLICATION

BUSINESS TRANSACTION

Server

OS DB

MQ

Web

JVM

Silo’d domain visibility

EXISTING APPROACH

EXPANDED APPROACH

Business transaction

99.9% 99.9% 99.9%99.9%

END USER EXPERIENCE

Page 47: Tom Levey (AppDynamics) - The DevOps Payrise

How many of you use performance

management tools?

Page 48: Tom Levey (AppDynamics) - The DevOps Payrise
Page 49: Tom Levey (AppDynamics) - The DevOps Payrise

Identify early !Troubleshoot fast !Resolve quickly !Quantify impact

x

Page 50: Tom Levey (AppDynamics) - The DevOps Payrise

FOCUS

Page 51: Tom Levey (AppDynamics) - The DevOps Payrise
Page 52: Tom Levey (AppDynamics) - The DevOps Payrise

automatically map your application

Page 53: Tom Levey (AppDynamics) - The DevOps Payrise

Who is using my app?

Page 54: Tom Levey (AppDynamics) - The DevOps Payrise

device, browser, network, Carrier…

Page 55: Tom Levey (AppDynamics) - The DevOps Payrise

DETAILED CRASH ANALYSIS

Page 56: Tom Levey (AppDynamics) - The DevOps Payrise

device, carrier & browser performance

Page 57: Tom Levey (AppDynamics) - The DevOps Payrise

§

Show me the daily deals in my local area…

47 nodes

Page 58: Tom Levey (AppDynamics) - The DevOps Payrise

Search for a hotel in London…

215 nodes

Page 59: Tom Levey (AppDynamics) - The DevOps Payrise

Executed code diagnostics

Page 60: Tom Levey (AppDynamics) - The DevOps Payrise

Understand the impact of slow performance

* Screenshot from US e-Commerce AppDynamics Customer

Application Revenue

Application Errors

Application Response time

$64,499 per min

$11,987 per min

10.1 s

100 ms

Page 61: Tom Levey (AppDynamics) - The DevOps Payrise

Understand the benefit of an application release

Application Revenue

Application Response time

code release 1

code release 2

code release 3

$44,499 per min

$58,237 per min

1.9 s3.1 sec

Page 62: Tom Levey (AppDynamics) - The DevOps Payrise

Collaboration = ??

Productivity saves time Time is money.

Page 63: Tom Levey (AppDynamics) - The DevOps Payrise

DevOps Tasks Annual Cost £

Annual Benefit £

Infrastructure Automation ? ?

Deployment Automation ? ?

Log Automation ? ?

Monitoring ? ?

Collaboration Pride ?

Total X Y

DevOps Value £ Y - X

Page 64: Tom Levey (AppDynamics) - The DevOps Payrise

!

Tell the Business how much you save them.

DevOps

Automation & collaboration Time Money= =

Page 65: Tom Levey (AppDynamics) - The DevOps Payrise

Free trial: appdynamics.com