26
© 2009 EMC Corporation. All rights reserved. EMC Proven Professional The #1 Certification Program in the information storage and management industry Introduction to Business Continuity Chapter 11 Section 3 : Business Continuity

Chapter 11

Embed Size (px)

Citation preview

Page 1: Chapter 11

© 2009 EMC Corporation. All rights reserved.

EMC Proven Professional

The #1 Certification Program in the information storage and management industry

Introduction to Business ContinuityIntroduction to Business Continuity

Chapter 11

Section 3 : Business Continuity

Page 2: Chapter 11

© 2009 EMC Corporation. All rights reserved.

ISM - Course OrganizationInformation Storage

and Management

Section 1

Storage System

Section 2

Storage Networking

Technologies and Virtualization

Section 3

Business Continuity

Section 4

Storage Security

and Management

Introduction to Information

Storage and Management

Storage System Environment

Data Protection: RAID

Intelligent Storage System

Direct-Attached Storage and

Introduction to SCSI

Storage Area Networks

Network-Attached Storage

IP SAN

Content-Addressed Storage

Storage Virtualization

Introduction to

Business Continuity

Backup and Recovery

Local Replication

Remote Replication

Securing the

Storage Infrastructure

Managing the

Storage Infrastructure

Page 3: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Chapter Objective

After completing this chapter, you will be able to:

o Define Business Continuity and Information Availability

o Detail impact of information unavailability

o Define BC measurement and terminologies

o Describe BC planning process

o Detail BC technology solutions

Page 4: Chapter 11

© 2009 EMC Corporation. All rights reserved.

What is Business Continuity

o Business Continuity is preparing for, responding to, and recovering from an application outage that adversely affects business operations

o Business Continuity solutions address unavailability and degraded application performance

o BC is an integrated and enterprise wide process and set of activities to ensure “information availability”

Page 5: Chapter 11

© 2009 EMC Corporation. All rights reserved.

What is Information Availability (IA)

o IA refers to the ability of an infrastructure to function according to business expectations during its specified time of operation

o IA can be defined in terms of three parameters:o Accessibility

o Information should be accessible at right place and to the right usero Reliability

o Information should be reliable and correcto Timeliness

o Information must be available whenever required

Page 6: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Causes of Information Unavailability

Disaster (<1% of Occurrences)

Natural or man made Flood, fire, earthquakeContaminated building

Unplanned Outages (20%)

FailureDatabase corruptionComponent failureHuman error

Planned Outages (80%)

Competing workloads Backup, reportingData warehouse extractsApplication and data restore

Page 7: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Impact of Downtime

Lost RevenueKnow the downtime costs (per hour, day, two days...)• Number of employees

impacted (x hours out * hourly rate)

Damaged Reputation

• Customers• Suppliers• Financial markets• Banks• Business partners

Financial Performance

• Revenue recognition• Cash flow• Lost discounts (A/P)• Payment guarantees• Credit rating• Stock price

Other ExpensesTemporary employees, equipment rental, overtime costs, extra shipping costs, travel expenses...

• Direct loss• Compensatory payments• Lost future revenue• Billing losses• Investment losses

Lost Productivity

Page 8: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Measuring Information Availability

o MTBF: Average time available for a system or component to perform its normal operations between failures

o MTTR: Average time required to repair a failed component

IA = MTBF / (MTBF + MTTR) or IA = uptime / (uptime + downtime)

Detection

IncidentTime

Detection elapsed

time

Diagnosis

Response Time

Repair

Recovery

Repair time

Restoration

Recovery Time

MTTR – Time to repair or ‘downtime’

Incident

MTBF – Time between failures or ‘uptime’

Page 9: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Availability Measurement – Levels of ‘9s’ Availability

% Uptime % Downtime Downtime per Year Downtime per Week

98% 2% 7.3 days 3hrs 22 min

99% 1% 3.65 days 1 hr 41 min

99.8% 0.2% 17 hrs 31 min 20 min 10 sec

99.9% 0.1% 8 hrs 45 min 10 min 5 sec

99.99% 0.01% 52.5 min 1 min

99.999% 0.001% 5.25 min 6 sec

99.9999% 0.0001% 31.5 sec 0.6 sec

Page 10: Chapter 11

© 2009 EMC Corporation. All rights reserved.

BC Terminologies

o Disaster recoveryo Coordinated process of restoring systems, data, and infrastructure

required to support ongoing business operations in the event of a disastero Restoring previous copy of data and applying logs to that copy to bring it

to a known point of consistencyo Generally implies use of backup technology

o Disaster restarto Process of restarting from disaster using mirrored consistent copies of data

and applicationso Generally implies use of replication technologies

Page 11: Chapter 11

© 2009 EMC Corporation. All rights reserved.

BC Terminologies (Cont.)

Recovery Point Objective (RPO)

o Point in time to which systems and data must be recovered after an outage

o Amount of data loss that a business can endure

Recovery Time Objective (RTO)

o Time within which systems, applications, or functions must be recovered after an outage

o Amount of downtime that a business can endure and survive

Recovery-point objective Recovery-time objective

Seconds

Minutes

Hours

Days

Weeks

Seconds

Minutes

Hours

Days

Weeks Tape Backup

Periodic Replication

Asynchronous Replication

Synchronous Replication

Tape Restore

Disk Restore

Manual Migration

Global Cluster

Page 12: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Business Continuity Planning (BCP) Process

o Identifying the critical business functions

o Collecting data on various business processes within those functions

o Business Impact Analysis (BIA) o Risk Analysis

o Assessing, prioritizing, mitigating, and managing risk

o Designing and developing contingency plans and disaster recovery plan (DR Plan)

o Testing, training and maintenance

Page 13: Chapter 11

© 2009 EMC Corporation. All rights reserved.

BC Technology Solutions

o Following are the solutions and supporting technologies that enable business continuity and uninterrupted data availability:o Single point of failureo Multi-pathing softwareo Backup and replication

o Backup recoveryo Local replicationo Remote replication

Page 14: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Resolving Single Points of Failure

FC Switches

Storage Array

Redundant Network

Clustered ServersRedundant Arrays

Remote Site

Redundant Ports

Redundant FC Switches

Redundant Paths

Heartbeat Connection

IP

Storage Array

Client

Page 15: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Multi-pathing Software

o Configuration of multiple paths increases data availability

o Even with multiple paths, if a path fails I/O will not reroute unless system recognizes that it has an alternate path

o Multi-pathing software helps to recognize and utilizes alternate I/O path to data

o Multi-pathing software also provide the load balancing

o Load balancing improves I/O performance and data path utilization

Page 16: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Backup and Replication o Local Replication

o Data from the production devices is copied to replica devices within the same array

o The replicas can then be used for restore operations in the event of data corruption or other events

o Remote Replicationo Data from the production devices is copied to replica devices on a remote

array o In the event of a failure, applications can continue to run from the target

device

o Backup/Restoreo Backup to tape has been a predominant method to ensure business

continuityo Frequency of backup is depend on RPO/RTO requirements

Page 17: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Chapter Summary

Key points covered in this chapter:

o Importance of Business Continuity

o Types of outages and their impact to businesses

o Information availability measurements

o Definitions of disaster recovery and restart, RPO and RTO

o Business Continuity technology solutions overview

Page 18: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Concept in Practice – EMC PowerPath

SE

RV

ER

ST

OR

AG

E

SCSISCSIDriverDriver

SCSISCSIDriverDriver

SCSISCSIDriverDriver

SCSISCSIDriverDriver

SCSISCSIDriverDriver

SCSISCSIDriverDriver

SCSISCSIControllerController

SCSISCSIControllerController

SCSISCSIControllerController

SCSISCSIControllerController

SCSISCSIControllerController

SCSISCSIControllerController

PowerPathPowerPath Host Based Software

Resides between application and SCSI device driver

Provides Intelligent I/O path management

Transparent to the application

Automatic detection and recovery from host-to-array path failures

Host Application (s)Host Application (s)

LUNLUN

LUNLUN

Storage Network

Page 19: Chapter 11

© 2009 EMC Corporation. All rights reserved.

PowerPath Video

Click here for video

Additional Task

Research on EMC Powerpath

Page 20: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Summarizing Read the Whitepaper on Virtualization and Business Continuity

Page 21: Chapter 11

© 2009 EMC Corporation. All rights reserved.

External Linkso Affordable Disaster Recovery Solutions - Minimize your

exposure to data loss from disasters while reducing total cost of ownership.

o Enterprise Business Continuity Solutions - EMC Enterprise Business Continuity solutions ensure that your applications and data are available during planned and unplanned outages.

o Heterogeneous Data Replication Solutions - Protect data consistently on heterogeneous server, network, and storage infrastructures with robust, scalable, asynchronous remote replication solutions.

o Visit http://www.emc.com/solutions/business-need/business-continuity-availability/index.htm for more information

Page 22: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Check Your Knowledge o Which concerns do business continuity solutions address?

o “Availability is expressed in terms of 9s.” Explain the relevance of the use of 9s for availability, using examples.

o What is the difference between RPO and RTO?

o What is the difference between Disaster Recovery and Disaster Restart?

o Provide examples of planned and unplanned downtime in the context of storage infrastructure operations.

o What are some of the Single Points of Failure in a typical data center environment?

Page 23: Chapter 11

© 2009 EMC Corporation. All rights reserved.

For your Reading on www.emc.com

o Understand Basics of BC, Review analyst paper, whitepapers & Customer Stories

o Visit : http://www.emc.com/campaign/business-continuity.htm

Page 24: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Watch the Full Keynote Address

on www.emcworld.com/keynotes.htm

or

www.youtube.com/EMCCorp

Keynote Address

Next Event

Frank M. HauckExecutive Vice President

EMC Storage Division and Global Marketing & Customer Quality

Additio

nal In

form

ation

Watch the 2009 KeynoteClick to View the Presentation

Page 25: Chapter 11

© 2009 EMC Corporation. All rights reserved.

Additional Information

Customer Showcase

FundamentalsDemo

Best Practices

Product Showcase

Tech Deep dives

Tech Trends

Attend Free Webinars on http://www.emc.com/events/ondemand-events.esp

www.youtube.com/PointBB

www.youtube.com/EMCCorp

More Videos on

Page 26: Chapter 11

© 2009 EMC Corporation. All rights reserved.

#1 ITcompany

For more information visit http://education.EMC.com