31
© Copyright 2012 EMC Corporation. All rights reserved. Get involved! Ask questions and participate. If you use Twitter, feel free to tweet about this session (use hashtag #VMwarePEX or #PEX) I encourage you to take photos or videos of this session and share them online This presentation will be made available online after the event Before we start

Exploring Stretched Clusters

Embed Size (px)

Citation preview

Page 1: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Get involved! Ask questions and participate.

• If you use Twitter, feel free to tweet about this session (use hashtag #VMwarePEX or #PEX)

• I encourage you to take photos or videos of this session and share them online

• This presentation will be made available online after the event

Before we start

Page 2: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

EXPLORING STRETCHED CLUSTERS

Scott Lowe, VCDX 39CTO, VMware Affinity TeamAuthor, Mastering VMware vSphere 5Blogger, http://blog.scottlowe.org

Examining the use of stretched clusters in a VMware vSphere environment

Page 3: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

Part 1: Stretched Cluster Or SRM?

Page 4: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Quick review of terminology

• Comparing SRM and vMSC requirements

• Comparing SRM and vMSC advantages

• Comparing SRM and vMSC disadvantages

• Mixing SRM and vMSC

Part 1 Agenda

Page 5: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• vMSC = vSphere Metro Stretched Cluster

• Introduces some new terms:• Uniform access = "stretched SAN"

• Non-uniform access = "distributed virtual storage"

• Provides boundaries for supportability of stretched cluster configurations

New vMSC HCL category

Page 6: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• RPO = Recovery Point Objective

• RPO is a measure of how much data loss the organization is willing to sustain

• RTO = Recovery Time Objective

• RTO is a measure of how long of a wait the organization is willing to tolerate before recovery is complete

RPO versus RTO

Page 7: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• DA = Disaster avoidance• Seeks to protect applications and data before a

disaster occurs

• How often do you know before a disaster is going to occur?

• DR = Disaster recovery• Seeks to recover applications and data after a

disaster occurs

• Think of DA as vMotion and DR as vSphere HA

DR versus DA

Page 8: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Some form of supported storage replication (synchronous or asynchronous)

• Layer 3 connectivity

• No minimum inter-site bandwidth requirements (driven by SLA/RPO/RTO)

• No maximum latency between sites (driven by SLA/RPO/RTO)

• At least two vCenter Server instances

Requirements for SRM

Page 9: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Some form of supported synchronous active/active storage architecture

• Stretched Layer 2 connectivity

• 622Mbps bandwidth (minimum) between sites

• Less than 5 ms latency between sites (10 ms with vSphere 5 Enterprise Plus/Metro vMotion)

• A single vCenter Server instance

Requirements for vMSC

Page 10: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Defined startup orders (with prerequisites)

• No need for stretched Layer 2 connectivity (but supported)

• The ability to simulate workload mobility without affecting production

• Supports multiple vCenter Server instances (including in Linked Mode)

Advantages of SRM

Page 11: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• The possibility of non-disruptive workload migration (disaster avoidance)

• No need to deal with issues changing IP addresses

• Potential for running active/active data centers and more easily balancing workloads between them

• Typically a near-zero RPO with RTO of minutes

• Requires only a single vCenter Server instance

Advantages of vMSC

Page 12: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Typically higher RPO and RTO than stretched clusters

• Workload mobility is always disruptive

• Requires at least two vCenter Server instances

• Operational overhead from managing protection groups and protection plans

Disadvantages of SRM

Page 13: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Greater physical networking complexity due to stretched Layer 2 connectivity requirement

• Greater cost resulting from higher-end networking equipment, more bandwidth, active/active storage solution

• No ability to test workload mobility

• Operational overhead from management of DRS host affinity groups

• Supports only a single vCenter Server instance

Disadvantages of vMSC

Page 14: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• It can be done, but it has its own set of design considerations

• For any given workload, it's an “either/or” situation

What about a mixed architecture?

Page 15: Exploring Stretched Clusters

© Copyright 2011 EMC Corporation. All rights reserved.

Diagram of a mixed architecture

Page 16: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

Part 2: Building Stretched Clusters

Page 17: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• vSphere recommendations

• Storage recommendations

• Networking recommendations

• Operational recommendations

Part 2 Agenda

Page 18: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Use vSphere 5

• Use DRS host affinity groups

• Run vSphere HA with N+2 capacity

vSphere recommendations

Page 19: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• vSphere 5 eliminates some vSphere HA limitations

• vSphere 5 introduces the vMSC HCL category

Use vSphere 5

Page 20: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Allows you to mimic site awareness

• Use PowerCLI to address manageability concerns

• Use "should" rules instead of "must" rules

Use vSphere DRS host affinity groups

Page 21: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Use some sort of unique property to "group" VMs

• Use this "grouping" to automate VM placement into groups

• Run the PowerCLI script regularly to ensure correct VM group assignment

Using PowerCLI with host affinity groups

Page 22: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Use storage from vMSC category

• Be aware of storage performance considerations

• Account for storage availability

• Plan Storage DRS carefully

• Use profile-driven storage

Storage recommendations

Page 23: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Consider cross-connect topology

• Ensure multiple storage controllers at each site for availability

• Provide redundant and independent inter-site storage connections

• With VPLEX, use the third-site cluster witness

Account for storage availability

Page 24: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Align datastore clusters to site/array boundaries

• Don't combine stretched/non-stretched datastores

• Understand the impact of SDRS on your storage solution

Plan Storage DRS carefully

Page 25: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Use user-defined capabilities to model site topology

• Create VM storage profiles to provide site affinity

• Can help avoid operational concerns with VM placement

Use profile-driven storage

Page 26: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Plan for different traffic patterns

• Where possible, separate management traffic onto a vSwitch

• Incorporate redundant and independent inter-site network connections

• Minimize latency as much as possible

Networking recommendations

Page 27: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Account for backup/restore in your design

• Handle inter-site vMotion carefully

• Don't split multi-tier apps across sites

Operational recommendations

Page 28: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• Consider a solution with client-side deduplication to reduce WAN traffic

• A mechanism to reduce restore traffic would be nice to have as well

• Might be able to leverage storage solution itself for restores• Restore to local side

• Allow storage solution to replicate to remote side

Backup/restore for stretched clusters

Page 29: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

• An inter-site vMotion will impact DRS host affinity rules

• An inter-site vMotion could require storage configuration updates

• Review inter-site vMotions to:• Reconcile DRS host affinity rules and VM locations

• Reconcile storage availability rules and VM locations

• Impact on other operational areas

Handling inter-site vMotion

Page 30: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.

Questions & Answers

Page 31: Exploring Stretched Clusters

© Copyright 2012 EMC Corporation. All rights reserved.