Clearly, I Have Made Some Bad Decisions

“I don’t have scaling problems”

Scaling is about change

not about quantity

Problems don’t occur when things are normal

If things change, you will have scaling problems

Work takes time to do

Email needs to be read

Code runs on a server

Mistake! “I don’t have scaling problems”

Not a mistake we’re making if we’re here?

Mistakes will be made

Problems will happen

But there are things we can do to be prepared

#1 Measure Everything

How do you know if something is wrong?

Or not wrong?

# uptime
 17:27:18 up 405 days, 2:36, 1 user, load average: 26.93, 10.46, 6.16
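
The uptime output above shows a 1-minute load far above the 15-minute load, which is the "something changed" signal. A minimal sketch of checking for that automatically, assuming a Unix-like host; the function names and the `ratio` threshold are illustrative, not from the talk:

```python
import os


def load_is_spiking(load1, load15, cpu_count, ratio=2.0):
    """True when the 1-minute load exceeds the CPU count and is
    well above the 15-minute load, i.e. things changed recently."""
    return load1 > cpu_count and load1 > ratio * load15


def check_host():
    """Read the live load averages (Unix-like systems only)."""
    load1, _load5, load15 = os.getloadavg()
    cpus = os.cpu_count() or 1
    return "spiking" if load_is_spiking(load1, load15, cpus) else "ok"
```

The comparison against the 15-minute figure matters more than the absolute number: a steady load is normal, a sudden jump is change.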

Read your log files

(Exceptions aren’t always exceptional)
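
One way to make "read your log files" routine is to count which exceptions appear and how often, so a common one stands out from a rare one. A rough sketch; the regular expression is an assumption about how exception names look in the log (names ending in `Error` or `Exception`) and would need adjusting for a real log format:

```python
import re
from collections import Counter

# Hypothetical pattern: class-style names such as "ValueError" or "TimeoutException".
EXCEPTION_NAME = re.compile(r"\b([A-Z][A-Za-z]*(?:Error|Exception))\b")


def count_exceptions(lines):
    """Count occurrences of each exception name across log lines."""
    counts = Counter()
    for line in lines:
        counts.update(EXCEPTION_NAME.findall(line))
    return counts
```

Calling `count_exceptions(open(path))` on a log file and printing `most_common(10)` gives a quick picture of what is routine and what is new.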

Measure in production (hat tip: Coda, “metrics, metrics everywhere”)

That’s the only place where things are really happening

But don’t let your metrics cause performance problems
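
A common way to keep metrics cheap is to send them fire-and-forget over UDP and to sample hot paths. The sketch below uses the StatsD-style counter line format (`name:value|c`, with `|@rate` when sampled); the address, port 8125, and function names are assumptions for illustration, not something the talk prescribes:

```python
import random
import socket


def format_counter(name, value=1, sample_rate=1.0):
    """Build a StatsD-style counter datagram, e.g. b"signups:1|c|@0.1"."""
    payload = f"{name}:{value}|c"
    if sample_rate < 1.0:
        payload += f"|@{sample_rate}"
    return payload.encode("ascii")


def send_counter(name, value=1, sample_rate=1.0,
                 addr=("127.0.0.1", 8125), rng=random.random):
    """Sample, send one UDP datagram, and never raise into the caller."""
    if sample_rate < 1.0 and rng() >= sample_rate:
        return False  # skipped by sampling
    try:
        with socket.socket(socket.AF_INET, socket.SOCK_DGRAM) as sock:
            sock.sendto(format_counter(name, value, sample_rate), addr)
    except OSError:
        return False  # a metrics failure must not break the request
    return True
```

UDP means the application never waits for the metrics server, and a dead metrics server costs nothing but lost data points.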

PING web (192.168.19.1): 56 data bytes
Request timeout for icmp_seq 0
Request timeout for icmp_seq 1
Request timeout for icmp_seq 2
Request timeout for icmp_seq 3

Sometimes you can just tell things are wrong

#2 Infrastructure as code (and config management)

Don’t do this.

Chef or Puppet (or CFEngine or Bcfg2)

Server config is code

Revision control

Feature branches

Commenting and authorship

Centralized (not in someone’s head)
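
Config management tools describe the desired state and only act when reality differs. A toy sketch of that idea for a single file; `ensure_file` is a hypothetical helper, not the API of Chef, Puppet, or any other tool:

```python
import os
import tempfile


def ensure_file(path, content):
    """Make `path` contain exactly `content`. Return True if a change was made."""
    try:
        with open(path, "r", encoding="utf-8") as f:
            if f.read() == content:
                return False  # already in the desired state
    except FileNotFoundError:
        pass
    # Write to a temporary file in the same directory, then swap it in.
    directory = os.path.dirname(path) or "."
    fd, tmp = tempfile.mkstemp(dir=directory)
    with os.fdopen(fd, "w", encoding="utf-8") as f:
        f.write(content)
    os.replace(tmp, path)
    return True
```

Running it twice with the same arguments changes nothing the second time, which is what makes it safe to keep in revision control and apply repeatedly.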

Should I choose Chef or Puppet?

Yes (Seriously, this is non-negotiable.)

How do I switch my servers to start using config management?

My advice: build new ones, throw the old ones away.

Clean, known state

Test clusters: build, destroy

Live machines: build, use, destroy

One-button servers

What about your code?

#3a Real deployment

Don’t do this.

$ svn up
U    www/index.php
U    www/payments.php
U    www/settings-live.php
U    www/settings-dev.php
A    www/specials.php
 U   .
Updated to revision 9703.

Deployment is more than just putting code in place.

reproducible, idempotent rollouts

tied to a known build number

with separately-versioned known configuration

triggered non-manually across any number of servers

with full dependency management

and automated regression testing.
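
As an illustration of "idempotent" and "tied to a known build number", here is a small sketch of one common layout: each build is unpacked into its own directory and a `current` symlink is switched atomically. The directory names and `activate_release` are assumptions for this example, and it relies on POSIX symlink and rename behaviour:

```python
import os


def activate_release(root, build):
    """Point <root>/current at <root>/releases/<build>.

    Returns True if the link changed, False if it already pointed there.
    """
    target = os.path.join("releases", str(build))
    if not os.path.isdir(os.path.join(root, target)):
        raise FileNotFoundError(f"build {build} has not been unpacked")
    current = os.path.join(root, "current")
    if os.path.islink(current) and os.readlink(current) == target:
        return False  # idempotent: nothing to do
    tmp = current + ".tmp"
    if os.path.lexists(tmp):
        os.remove(tmp)
    os.symlink(target, tmp)
    os.replace(tmp, current)  # atomic rename on POSIX
    return True
```

Rolling back is the same call with the previous build number, which is the point of keeping every release on disk under its own name.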

Etsy’s Deployinator

Vlad the Deployer

Fabric

Capistrano

OS Packages

Roll your own

#3b Continuous deployment

Holy Grail

trunk = live

tests block commits

feature flags?

dark launches?
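
Feature flags are what make "trunk = live" tolerable: code ships dark and is switched on for a growing share of users. A minimal sketch of a percentage rollout, assuming users have a stable identifier; the flag name and functions are hypothetical:

```python
import hashlib


def bucket(flag, user_id):
    """Map (flag, user) to a stable bucket in the range 0..99."""
    digest = hashlib.sha256(f"{flag}:{user_id}".encode("utf-8")).hexdigest()
    return int(digest, 16) % 100


def is_enabled(flag, user_id, rollout_percent):
    """True if this user falls inside the current rollout percentage."""
    return bucket(flag, user_id) < rollout_percent
```

Because the bucket is derived from a hash rather than a random draw, a user who sees the feature at 10% still sees it at 50%, and setting the percentage to 0 is an instant rollback without a deploy.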

Cowboy

Perfectionist

Fast iteration = fast test results

One huge feature tested... and rejected

Ten new tiny features tested

Two accepted

Failure is comfortable

Blame out, responsibility in

Consequences immediately visible

Okay, fine: Continuous Integration

After all that, things still go wrong

#4 Plan for failure

Take backups

Test backups
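
A backup only counts once it has been restored. A deliberately small sketch of that loop for a single file: copy it, restore the copy somewhere else, and compare checksums. Real backups involve databases and remote storage; the function names and paths here are illustrative only:

```python
import hashlib
import shutil


def sha256_of(path):
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(65536), b""):
            digest.update(chunk)
    return digest.hexdigest()


def backup_and_verify(source, backup_path, restore_path):
    """Take a backup, restore it elsewhere, and confirm it matches the source."""
    shutil.copyfile(source, backup_path)        # take the backup
    shutil.copyfile(backup_path, restore_path)  # restore it somewhere else
    return sha256_of(source) == sha256_of(restore_path)
```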

Automate servers

Test server crashes

Netflix’s Chaos Monkey

And cousins: the Simian Army

Server failures predicted and foiled
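
The idea behind Chaos Monkey can be sketched in a few lines: pick a server at random and kill it on purpose, so that recovery is exercised regularly. This is not Netflix's implementation; `terminate` is a hypothetical callback standing in for whatever actually stops a machine, and the `protected` list is an assumption added for safety:

```python
import random


def pick_victim(servers, protected=(), rng=random):
    """Choose one server at random, skipping any protected ones."""
    candidates = [s for s in servers if s not in protected]
    if not candidates:
        return None
    return rng.choice(candidates)


def unleash(servers, terminate, protected=(), rng=random):
    """Terminate one randomly chosen server and report which one."""
    victim = pick_victim(servers, protected, rng)
    if victim is not None:
        terminate(victim)
    return victim
```

Passing `rng=random.Random(seed)` makes a run repeatable, which helps when rehearsing against a test cluster before trying it on live machines.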

What about code? New features?

#5 Future Compatibility

ALTER TABLE `user` ADD COLUMN `twootr` VARCHAR(16);
CREATE INDEX `twootr_idx` ON `user` (`twootr`);

Don’t do this. (on live)

“Future compatible” schemas

“Future compatible” code

Normalized tables are performance heavy

Don’t assume any columns?
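
One reading of "don't assume any columns" is that code should look columns up by name and tolerate ones that are missing, so the same release works before and after a schema change. A sketch using SQLite as a stand-in for MySQL; the `id` and `name` columns are assumptions, while the `user` table and `twootr` column come from the slide above:

```python
import sqlite3


def load_user(conn, user_id):
    """Load a user by name-based column access, tolerating a missing `twootr`."""
    cursor = conn.cursor()
    cursor.row_factory = sqlite3.Row  # rows addressable by column name
    row = cursor.execute("SELECT * FROM user WHERE id = ?", (user_id,)).fetchone()
    if row is None:
        return None
    columns = row.keys()
    return {
        "id": row["id"],
        "name": row["name"],
        # Optional column: present only after the schema has been migrated.
        "twootr": row["twootr"] if "twootr" in columns else None,
    }
```

With this shape the code can be deployed first and the column added later, or the other way round, without the two having to land at the same moment.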

Shiny new

Yucky old

Read

Write

Migrate
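
The slide's diagram does not survive in text, so the following is only one plausible interpretation of "read, write, migrate" between an old and a new store: write to the new one, read from the new one with a fallback to the old, and move the remainder in the background. Plain dictionaries stand in for the two data stores, and all names are hypothetical:

```python
def read_user(user_id, new_store, old_store):
    """Prefer the new store; fall back to the old one and migrate on read."""
    if user_id in new_store:
        return new_store[user_id]
    record = old_store.get(user_id)
    if record is not None:
        new_store[user_id] = record  # lazily migrate this record
    return record


def write_user(user_id, record, new_store):
    """All writes go to the new store only."""
    new_store[user_id] = record


def migrate_all(new_store, old_store):
    """Background pass: copy anything not yet migrated. Returns the count moved."""
    moved = 0
    for user_id, record in old_store.items():
        if user_id not in new_store:
            new_store[user_id] = record
            moved += 1
    return moved
```

Records already written to the new store are never overwritten by the background pass, so the migration can run while the site is live.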

What about other bad decisions?

#6 Wing It

spof.yola.com (Django, MySQL)

Scheduled for reboot

New MySQL server, slave replication

New Django server

Drop DNS TTL

But it’s okay

Jonathan Hitchcock

@vhata

github.com/vhata
