10 Things I Hate about DevOps

Dave Cliffe – PagerDuty

@CliffeHangers

Dearest DevOps:

“In faith, I do not love thee with mine eyes,

For they in thee a thousand errors note,

But ’tis my heart that loves what they despise,

Who in despite of view is pleased to dote.

…But my five wits nor my five senses can

Dissuade one foolish heart from serving thee.”

- Shakespeare, Sonnet 141

Don’t be fooled: I am not a poet

• Software Engineer @ Amazon.com

• Program Manager @ Microsoft (.NET, Azure)

• Group Product Manager @ PagerDuty

• 3x Co-organizer @ devopsdays Toronto

Collaboration/Resolution

Your Fastest Path to Incident Resolution

8000+ customers globally, 50 of the Fortune 100, proud supporters of DevOps

[Diagram: PagerDuty platform overview. Sources: microservices, apps & services, containers, cloud, network, database, servers. Event types: app, system, log, web, mobile app. Integrations: deployment tools, monitoring tools, ticketing tools. Functions: correlate, cluster, and manage events into alerts; on-call scheduling; automatic escalations. Responders: Developer, NOC, Helpdesk, IT Ops. Other labels: People, Tools, Process; System and User Efficiency.]

Agenda

• 10 Things I HATE About DevOps: From the Business, For Developers, For Ops

• The Five Wits of Incident Response w/ DevOps: Common Sense, Imagination, Fantasy, Estimation, Memory

1. You say Potato, I say DevOps

(Are we even talking the same language?)

What do we mean by DevOps?

Culture, Automation, Measurement, Sharing (CAMS)

Damon Edwards and John Willis

devopsdays.org

https://blog.chef.io/2010/07/16/what-devops-means-to-me/

2. DevOps is a tech fad, irrelevant to the business

$$$

https://puppet.com/resources/white-paper/2016-state-of-devops-report

“By 2016, DevOps will evolve from a niche to a mainstream strategy employed by 25% of global 2000 organizations”

- Gartner

“… high-performing IT organizations …”

3. DevOps means more changes and change is painful

$$$

4. DevOps seems to break things more frequently

$$$

http://www.ca.com/us/rewrite/articles/devops/face-to-face-devops-to-protect-and-serve-.html

Move Fast: Break, Fail, Learn

5. I shouldn’t have to care about the customer

Dev

I have people skills. I am good at dealing with people!

Can’t you understand that? What the hell is wrong with you people?!

6. I hate being accountable for what I build

Dev

7. It’s not my fault

(a.k.a. it works on my machine)

(a.k.a. that’s not a bug, it was spec’d that way)

Dev

8. DevOps isn’t ___ compliant

Ops

AUTOMATION CAN HELP ENFORCE COMPLIANCE

9. But I’m the expert

Ops

http://futurice.com/blog/from-t-to-pi-design-skill-expectations-in-change

10. DevOps is automating me out of a job

Ops

https://twitter.com/wattersjames/status/770375207219449858

Call to Action: Stop drinking the haterade!

The Five Wits: Incident Response w/ DevOps

1. Common Sense

2. Imagination • Realize what the mind is being presented

3. Estimation • Decide on matters of time, space, etc.

4. Fantasy • Act in accordance with judgment

5. Memory • Retain the recollection of what is imparted

Realize

Decide

Act

Retain

Common Sense: DevOps

Culture, Automation, Measurement, Sharing (CAMS)

Damon Edwards and John Willis

devopsdays.org

https://blog.chef.io/2010/07/16/what-devops-means-to-me/

Culture

“Giving developers operational responsibilities has greatly enhanced the quality of the services, both from a customer and a technology point of view. … You build it, you run it.”

- Dr. Werner Vogels, CTO Amazon

Realize a.k.a. Monitor

• Automation • Build the platform for effective monitoring

• Measurement • Know BEFORE your customer

• Sharing • Aggregate across tools, alert the right people

Decide a.k.a. Triage

• Automation • Find the signal in the noise

• Measurement • Business Metrics, not just Technical Metrics

• Sharing • “Are you dealing with something? We’re seeing a P1”
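
One way to picture "find the signal in the noise" is to cluster related alerts into a single incident before anyone gets paged. The sketch below is only an illustration of that idea, not how PagerDuty or any particular tool does it: the `Alert` and `Incident` types, their fields, and the five-minute window are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Alert:
    service: str      # hypothetical field: the service that raised the alert
    summary: str
    timestamp: float  # seconds since an arbitrary epoch


@dataclass
class Incident:
    service: str
    alerts: list = field(default_factory=list)


def cluster_alerts(alerts, window_seconds=300):
    """Group alerts from the same service into one incident when each
    alert arrives within window_seconds of the previous one."""
    incidents = []
    latest_by_service = {}  # service -> most recent Incident for that service
    for alert in sorted(alerts, key=lambda a: a.timestamp):
        incident = latest_by_service.get(alert.service)
        if incident and alert.timestamp - incident.alerts[-1].timestamp <= window_seconds:
            # Close enough in time to the last alert: treat as the same incident.
            incident.alerts.append(alert)
        else:
            # First alert for this service, or the gap is too long: new incident.
            incident = Incident(service=alert.service, alerts=[alert])
            incidents.append(incident)
            latest_by_service[alert.service] = incident
    return incidents


alerts = [
    Alert("db", "slow queries", 0),
    Alert("db", "slow queries", 60),
    Alert("web", "5xx spike", 100),
    Alert("db", "connection refused", 1000),
]
incidents = cluster_alerts(alerts)
```

With this sample data, the two early database alerts collapse into one incident, while the web alert and the much later database alert each open their own, so responders see three incidents instead of four alerts.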

Act a.k.a. Resolve

• Automation • ChatOps – more effective resolution

• Measurement • Test your hypotheses – differential diagnosis

• Sharing • Proactive communication with stakeholders

Retain a.k.a. Learn

• Automation • Build the timeline of events

• Measurement • How effective was our response?

• Sharing • Blameless Post-Mortem
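
As a rough sketch of the first two bullets, the snippet below orders recorded events into a timeline and derives two common response measurements from it. The function names, the event tuples, and the choice of time-to-acknowledge and time-to-resolve as the metrics are assumptions for illustration; the talk does not prescribe specific measurements.

```python
from datetime import datetime, timedelta


def build_timeline(events):
    """Return (timestamp, description) tuples in chronological order."""
    return sorted(events, key=lambda event: event[0])


def response_metrics(triggered, acknowledged, resolved):
    """Compute how long it took to acknowledge and to resolve an incident,
    both measured from the moment it was triggered."""
    return {
        "time_to_acknowledge": acknowledged - triggered,
        "time_to_resolve": resolved - triggered,
    }


# Hypothetical incident used only to exercise the functions.
start = datetime(2016, 1, 1, 12, 0)
events = [
    (start + timedelta(minutes=45), "resolved: rolled back deploy"),
    (start, "triggered: checkout error rate above threshold"),
    (start + timedelta(minutes=4), "acknowledged by on-call engineer"),
]
timeline = build_timeline(events)
metrics = response_metrics(timeline[0][0], timeline[1][0], timeline[2][0])

for timestamp, description in timeline:
    print(timestamp.isoformat(), description)
```

A timeline assembled this way can feed directly into a blameless post-mortem, since it records what happened and when without attributing fault.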

Recap: Incident Response w/ DevOps

Monitor

Triage

Resolve

Learn

Sharing, Measurement, Automation, Culture (SMAC)

Lay the SMAC down on Incidents!

Thank You!

Dave Cliffe - @CliffeHangers

www.pagerduty.com

Your Fastest Path to Incident Resolution

8,000+ Customers

150,000+ Users

80 Countries

50 of the Fortune 100

175+ Native Integrations

PagerDuty: The Fastest Path to Incident Resolution

Bringing together tools, process, and people

People: One platform for NOC, Helpdesk, & Responders that brings teams together

Process: To consolidate and cluster alerts to surface the real problems and notify the right people

Tools: Leverage existing tools to deliver modern ops for a unified view across all apps and services

“With PagerDuty, we spend less time worrying about on-call, and more time creating great products to impact lives.”

- Bryant Eastham, Chief Architect

“If we didn't get PagerDuty, we wouldn't have a business.”

- Martin Rhoads, Site Reliability Engineer

“Centralized alerts allow us to streamline communication, decreasing MTTR and increasing incident efficiency.”

- Shaun Boucher, Manager, Network Operations

“I’m calling it — PagerDuty is a must-have WebOps service.”

- Jesse Robbins, Co-founder