High availability in IT: AAAARGH

Embed Size (px)

DESCRIPTION

My Barcamp presentation (#bca4) on High Availability, and how much energy and frustration it costs to implement it. In Dutch.

Citation preview

  • 1. High Availability in IT Mattias Geniar BarCamp 2010 High Attention High Annoyance High AAAAAARGH

2. High Availability

  • Introduction

3. Name: Mattias Geniar 4. Twitter: @mattiasgeniar 5. Blog: http://mattiasgeniar.be 6. E-mail: [email protected] 7. Job: System Engineer @ Nucleus 8. Walkthrough

  • What is High Availability?

9. What is downtime 10. How are systems currently built

  • What are the weak points

11. How can we prevent it What's so annoying about it 12. #define ha

  • What is high availability?

13. #define ha

  • What is High Availability?
  • High availability is asystem design approach and associated service implementation that ensures a prearranged level ofoperational performancewill be met during a contractual measurementperiod .

14. #define ha

  • What is High Availability?
  • We can use Facebook, Twitter, Work, Blog, ...

15. #define ha

  • What is High Availability?
  • Redundancy All good things come in pairs.

16. #define ha

  • What is High Availability?
  • Backup A working backup, when you need it.

17. #define ha

  • What is High Availability?
  • Connectivity You can reach your system, when you need it.

18. #define downtime

  • When is it considered downtime?

19. #define downtime

  • When is it considered downtime?
  • When it's unavailable site offline, application unreachable, ...

20. #define downtime

  • When is it considered downtime?
  • When it's partially unavailable site is working, but some key functions are malfunctioning

21. #define downtime

  • When is it considered downtime?
  • When it's slow to respond users click away on slow sites, get annoyed at loading times, ...

22. #define downtime

  • When is it considered downtime?
  • Planned vs Unplanned Is a planned maintenance downtime?

23. #define downtime

  • What is acceptable?

24. #define downtime

  • What is acceptable?
  • Webshop during the night: 30 minutes offline ?

25. #define downtime

  • What is acceptable?
  • Business Application during off-hours: 15 minutes offline ?

26. #define downtime

  • What is acceptable?
  • Airport during rush hour: 5 minutes offline ?

27. #define downtime

  • What is acceptable?
  • Twitter during World Championship: 2 minutes offline ?

28. #define downtime

  • What is acceptable?
  • Nuclear Power Plant during scheduled maintenance: 15 seconds offline ?

29. #define downtime

  • Impact of downtime?

30. #define downtime

  • Impact of downtime?
  • Webshop can't sell when they're offline.

31. #define downtime

  • Impact of downtime?
  • People ignore you, and move on.

32. #define downtime

  • Impact of downtime?
  • Your reputation takes (a) (s)hit.

33. Our savior: High Availability

  • No more downtime

34. Always on services 35. 24/7 availability 36. Responsive sites 37. = Increase in productivity = WIN? 38. Designing for High Availability

  • The datacenter

39. #define downtime

  • What's a datacenter?
  • A secure area

40. #define downtime

  • What's a datacenter?
  • Filled with racks.

41. #define downtime

  • What's a datacenter?
  • Who are filled with servers.

42. #define downtime

  • What's a datacenter?
  • Which are all kept cool.

43. Designing for High Availability

  • The typical datacenter design

44. High Availability: Datacenter This should be simple. Building POWER COOLING 45. High Availability: Datacenter UPS Hmm ... Building POWER COOLING POWER COOLING 46. High Availability: Datacenter UPS ... and done! Building POWER COOLING POWER COOLING UPS GENERATOR GENERATOR UPS 47. Designing for High Availability

  • More than just power & cooling

48. High Availability: Datacenter

  • Cages or seperate rooms.

49. High Availability: Datacenter

  • General security

50. Designing for High Availability

  • Your hosting

51. High Availability: Hosting Right. How hard can this be? INTERNET CONNECTIVITY SERVER 52. High Availability: Hosting UPS UPS I shouldn't have said that. INTERNET SERVER A NETWORK A INTERNET SERVER C NETWORK B SERVER B 53. Designing for High Availability

  • Example: very important, high traffic webshop

54. High Availability: Hosting Go get'm Frank! WEBSITE ORDER 55. High Availability: Hosting Holy servers, Batman! FRONTEND FRONTEND FRONTEND LOAD BALANCER LOAD BALANCER STORAGE STORAGE DATABASE DATABASE 56. Designing for High Availability

  • Doesn't that get messy?

57. High Availability: Hosting

  • Doesn't that get messy?
  • Sometimes.

58. High Availability: Hosting

  • Doesn't that get messy?
  • But it doesn't have to.

59. Designing for High Availability

  • That's it for hosting then?

60. High Availability: Hosting

  • That's it for hosting then?
  • First, there's backups. Lots of them.

61. High Availability: Hosting

  • That's it for hosting then?
  • Monitoring know what's happening.

62. High Availability: Hosting

  • That's it for hosting then?
  • Monitoring get alerted.

63. High Availability: Hosting

  • That's it for hosting then?
  • Monitoring react. 24/7

64. High Availability: Hosting

  • That's it for hosting then?
  • Automate as much as possible.

65. High Availability: Hosting

  • That's it for hosting then?
  • Think about every move you make.

66. High Availability: Hosting

  • That's it for hosting then?
  • Innovate. Change.

67. High Availability: Hosting

  • That's it for hosting then?
  • Eliminate further (hardware) failures.

68. Designing for High Availability

  • The Application Website, Business App, ...

69. High Availability: Developer

  • Seperate development environment.
  • The Application

70. High Availability: Developer

  • Dedicated & motivated programmers.
  • The Application

71. High Availability: Developer

  • Knowledge of underlying systems.
  • The Application

72. High Availability: Recap

  • Power

73. Cooling

  • Internet connectivity

74. Servers

  • Datacenter
  • Hosting Provider
  • Application
  • Developer

We've got it! 75. High Availability: 99% vs 99,99%

  • 99% availability

76. Monthly downtime: 7,20 hours

  • 99,99% availability

77. Monthly downtime: 4,32 minutes

  • 99,9999% availability

78. Monthly downtime: 2,59 seconds Numbers? 79. Presentation Down Mattias Geniar BarCamp 2010 Thanks for not running away.