45
© 2015 VMware Inc. All rights reserved. VMware vSphere 6 Performance Technical Overview Mark Achtemichuk, VCDX, @vmMarkA Senior Technical Marketing Architect, Performance Specialist, VMware

VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Embed Size (px)

Citation preview

Page 1: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

© 2015 VMware Inc. All rights reserved.

VMware vSphere 6 Performance Technical Overview

Mark Achtemichuk, VCDX, @vmMarkA Senior Technical Marketing Architect, Performance Specialist, VMware

Page 2: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Agenda

2

1 Conquering Performance

2 New & Important Performance Features

3 Recommended Practices

4 Performance Troubleshooting

5 Resources

Page 3: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

What Do You Want to Run on vSphere 6.0? çEVERTHING!

3

Scale Up Apps/Business Critical

Apps

Scale Out Applications

Desktop Virtualization

Containers

Integrated OpenStack

•  vSphere is the platform for existing and new applications •  Scalability enhancements (VMs and Clusters) for all

application workloads* •  Big Data Extensions and Pivotal CF (PaaS) Support

•  Desktop Virtualization – 2D/3D Graphics, Instant Clone* •  OpenStack on vSphere = Success*

•  Linux Container Support

Overview

•  Increased scalability and performance •  SAP Hana – 400% performance gains over RDBMS and

9x gains in planning load times •  Rapid deployment of desktop virtual machines in seconds

•  10x faster than in previous releases •  Productivity and portability for application developers

•  Deliver Choice of Architecture

Benefits / Use Cases

*New with vSphere 6.0

Page 4: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Conquering Performance

Page 5: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Virtual Machine Scalability •  Virtualize 99.99% of workloads today

5

VMware vSphere 5.5

1mm+ IOPS >80Gb/s

HW v10

1TB RAM

64 VCPUs

VMware vSphere 6

HW v11

4 TB RAM

128 VCPUs

1mm+ IOPS >80Gb/s

Page 6: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Platform and Host Scalability

6

vSphere 5.5 vSphere 6.0

32 Hosts per Cluster 64 Hosts per Cluster

4000 Virtual Machines per Cluster 8000 Virtual Machines per Cluster

320 CPUs 480 CPUs

4 TB RAM 12 TB RAM

512 Virtual Machines per Host 1000 Virtual Machines Per Host

Page 7: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Virtual Machine Virtual Hardware 11

7

•  128 vCPUs •  4 TB RAM

•  Hot-add RAM now vNUMA aware •  WDDM 1.1 GDI acceleration features

•  xHCI 1.0 controller compatible with OS X 10.8+ xHCI driver

•  Serial and parallel port enhancements –  A virtual machine can now have a maximum of 32 serial ports –  Serial and parallel ports can now be removed

ESXi 6 Supports:

Shout Out: Virtual Hardware 10 - http://blogs.vmware.com/vsphere/2013/10/microsoft-operating-system-time-sources-and-virtual-hardware-10.html

Page 8: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Low Latency Storage IO •  1mm IOPS, >2ms latency, 8kb block, 32 OIO’s

8

Reference: www.vmware.com/files/pdf/1M-iops-perf-vsphere5.pdf

Page 9: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Low Latency Network IO •  Latency features reduce overhead to near native

9

Reference: http://www.vmware.com/files/pdf/techpaper/latency-sensitive-perf-vsphere55.pdf

Page 10: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

The Worlds First TPC-VMS Benchmark Result •  Compliant and audited by a 3rd party. •  While not a direct comparison, you can see how database consolidation scenarios could

achieve near native capabilities on the same hardware (>99%).

10

Reference: http://blogs.vmware.com/vsphere/2013/09/worlds-first-tpc-vms-benchmark-result.html

0

200

400

600

800

1000

1200

1400

Virtual VMStpsE

457.55

468.11

470.31

Thro

ughp

ut S

core

HP Proliant DL385 G8

OLTP Instance3

OLTP Instance2

OLTP Instance1

Page 11: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Virtualized Hadoop •  12% better performance than native for TeraSort

11

Reference: http://blogs.vmware.com/performance/2015/02/virtualized-hadoop-performance-vsphere-6.html

Page 12: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Super Monster VM Performance

12

- Linear Performance gains as VM is scaled up to Monster Size

Referencehttps://www.vmware.com/resources/techresources/10455 Reference: https://www.vmware.com/resources/techresources/10455

Page 13: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Super Monster VM Performance

13

- Linear Performance gains as VM is scaled up to Monster Size

Reference: http://blogs.vmware.com/performance/2015/03/vmware-vsphere-6-oracle-12c-scalability-study-scaling-monster-virtual-machines.html Reference: https://www.vmware.com/resources/techresources/10455

Page 14: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

New & Important Features

Page 15: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

vCenter Server Features - Enhanced Capabilities – Parity

15

Metric Windows Appliance

Hosts per VC 1,000 1,000

Powered-On VMs per VC 10,000 10,000

Hosts per Cluster 64 64

VMs per Cluster 6,000 6,000

Linked Mode ✔ ✔

•  Scalability supported by both Windows Install and vCenter Server appliance.

•  Windows install supports Postgres and External SQL and Oracle DBs.

•  vCSA supports embedded Postgres and external

Oracle DBs.

Overview

Page 16: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

vCenter Server 6.0 - vSphere Web Client Features

16

0

2

4

6

8

10

12

14

16

18

20

RO

DC

To

p le

vel O

bjec

ts

Clu

ster

s

Hos

t V

ms

Dat

asto

res

Net

wor

k D

Sw

itch

DP

ortg

roup

R

O C

lust

er

Topl

evel

H

osts

V

Ms

Dat

asto

res

Net

wor

k D

Sw

itch

RO

Hos

t V

ms.

N

etw

ork

DS

witc

h

Dat

asto

res

RO

VM

s N

etw

ork,

da

tast

ores

R

O d

Sw

itch

H

osts

V

ms

Vm

Tem

plat

e D

Por

tgro

up

Upl

inkP

ortg

roup

Thou

sand

s

Previous

Latest

Major Performance Improvements:

–  UI •  Screen by screen code optimization •  Login now 13x faster •  Right click menu now 4x faster •  Most tasks end to end are 50+% faster

–  Performance charts •  Charts are available and usable in less

then half the time

–  VMRC integration •  Advanced virtual machine operations

Page 17: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

vCenter Throughput

288  

471  

612  

464  

1292   1261  

0  

200  

400  

600  

800  

1000  

1200  

1400  

Small   Medium   Large  

Throughp

ut  (o

pera<o

ns/m

inute)  

Inventory  Size  

vCenter  5.5   vCenter  6.0  Windows  

Reference: What’s New in VMware vSphere 6 – Performance

Page 18: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

vCenter Latency

4.3  

10.5  

19.4  

2.9  4.2  

3.6  

0  

5  

10  

15  

20  

25  

Small   Medium   Large  

Med

ian  Latency  (secon

ds)  

Inventory  Size  

vCenter  5.5   vCenter  6.0  Windows  

Reference: What’s New in VMware vSphere 6 – Performance

Page 19: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

vCenter Latency

4.3  

10.5  

19.4  

2.9  4.2  

3.6  

0  

5  

10  

15  

20  

25  

Small   Medium   Large  

Med

ian  Latency  (secon

ds)  

Inventory  Size  

vCenter  5.5   vCenter  6.0  Windows  

Reference: What’s New in VMware vSphere 6 – Performance

Page 20: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Performance: Caution Using FireFox

20

+50%

Page 21: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

•  Superior 2D and 3D graphics on hosts with NVIDIA GRID GPU •  Enables shared access to physical GPUs

•  Uses native NVIDIA drivers •  Best for high performance needs like desktop virtualization with

graphic intensive applications in the fields of healthcare, manufacturing, higher education, and oil & gas

Overview

•  Superior 2D and 3D graphics •  Full capabilities of physical NVIDIA GPUs

•  High Density, Low Cost Graphics workstation

Benefits

NVIDIA GRID vGPU

21

Superior 2D and 3D Graphics for Virtual Environments

Page 22: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

•  Technology that enables the ability to rapidly clone and provision thousands of VMs in minutes.

•  Part of vSphere 6.0 but will be enabled by other applications such as BDE and Horizon View in later releases.

Overview

•  Clone VMs 10X faster than what is currently possible today

•  Deploy thousands of virtual desktops in a matter of minutes versus what would normally take hours

Benefits

Instant Clone

22

Rapidly Clone and Provision Virtual Machines

Lighting Fast Cloning

Page 23: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Network – New Features •  NetIOC v3

Reserve bandwidth to guarantee service levels at vnic and portgroup

•  Host-Wide Performance Tuning Engine 10% higher consolidation ratios with web farm use case

•  vmxnet3 LRO Support 15-20% improvement in receive throughput and efficiency for Windows

•  40GbE Improvements 35Gbps and 50% reduced compute cost

0  10  20  30  40  50  60  70  80  90  

0  

1  

2  

3  

4  

5  

6  

256  bytes   16KB   64KB  

Diff  (%

)  

CPU  Efficien

cy  (G

bps/Co

re)  

Message  Size  

LRO  vs  NoLRO  CPU  Efficiency  Comparison  with  NaFve-­‐VM    Traffic  

CpuEffi-­‐NoLRO   CpuEffi-­‐LRO   Diff  (%)  

Page 24: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Storage – New Features •  Storage Stack Optimizations

Effort spent reducing overhead and increasing capabilities to best leverage flash storage Examples: Samsung NVMe 240k -> 710k IOPS, EMC XtremSF 200k -> 670k IOPS

•  VSAN 6.0 7 Million IOPS, <2 ms Latency

•  VVOLs Performance the same or better as previous forms of storage integration

VSAN: http://www.vmware.com/files/pdf/products/vsan/VMware-Virtual-San6-Scalability-Performance-Paper.pdf

Page 25: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Recommended Practices

Page 26: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

vCenter – Recommendations •  Web Browser Selection Important

Firefox slowest, Chrome fastest, IE11 very close 2nd

•  Database Performance Critical vCenter experience most impacted by database performance, ensure proximity and speed

•  Place vCenter on Tier 1 Storage Placing the vCenter virtual machine on low latency storage will improve performance and experience

•  Don’t Change Statistics Levels Change only as necessary, short intervals, as it places a large demand on vCenter and the DB

•  Check JVM Sizing Adding RAM to VM doesn’t automatically adjust JVM service sizing, see KB2021302

Page 27: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Compute & Memory – Recommendations (1/2) •  Rightsize, Rightsize, Rightsize

Spend effort on rightsizing workloads for vCPU count and assigned memory

•  Size VM into pNUMA Node if Possible Doing this will reduce the potential for remote memory access and/or thread migration

•  Don’t use vCPU Hot-Add As it disables vNUMA and presents the virtual machine with UMA topology

•  Select High Performance in BIOS or vSphere Selecting anything else will save power but does potentially induce compute latency

•  Enable Hyper-Threading vSphere understands and uses Hyper-Threading to its advantage

Page 28: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Compute & Memory – Recommendations (2/2) •  Watch Memory Overcommit

Overcommit provides consolidation value at risk of performance during shortages

•  Do NOT Use ‘Active Memory’ in a Vacuum Active Memory is more a ‘rate’ counter than a ‘capacity’ counter, temper it with other counters like ‘Consumed’ or use vROPs

Page 29: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Network – Recommendations •  Use vmxnet3 Guest Network Driver

Very efficient and required for maximum performance

•  Evaluate Disabling Interrupt Coalescing Default mechanism may induce small amounts of latency in favor of throughout, evaluate disabling it as cost today is negligible

•  Jumbo Frames Provide Value While challenging to enable end-to-end sometimes they provide value to high throughput functions like VSAN, vMotion and NAS

•  It’s a 10Gb World 1Gb saturation is real, more bandwidth required today, especially in light of VSAN, MonsterVM vMotion

•  Use Latency Sensitivity ‘Cautiously’ While it can reduce latency and jitter in the 10us use case, it comes at a cost with core reservations, etc

Page 30: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Storage – Recommendations •  Use Multiple vSCSI Adapters

Allows for more queues and I/O’s in flight

•  Use pvscsi vSCSI Adapter More efficient I/O’s per cycle

•  Don’t Use RDM’s Unless needed for shared disk clustering, no longer a performance advantage

•  Leverage Your Storage OEM’s Integration Guide They provide necessary guidance around items like multi-pathing

Page 31: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Performance Troubleshooting

Page 32: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Define the Performance Issue •  Understand Application Function & Architecture

At a minimum know what your application does and what it’s dependent on.

•  Select Application KPIs Application performance must be measured using an application counters (tps, response time, etc) and not virtual resource consumption.

•  Define Success Criteria With your app owner, define at what level the application KPI’s must be to consider it performant.

•  Comparisons must be Apples-to-Apples Any changes to infrastructure (physical or virtual) create comparison challenges.

•  Now the Gap is Identified, Begin Troubleshooting With an understanding of the requirements and current deficiency, you can now begin to investigate and/or tune.

Page 33: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Use the Right Tool •  esxtop

2 sec data points, VERY granular, not scalable across hosts

•  vRealize Operations 5 min data points, very scalable, best starting view

•  vCenter Performance Charts 20 sec data points, okay real-time data, poor history, recommend vROPs

•  VSAN Observer Most detailed tool to troubleshoot VSAN related performance

•  3rd Party Ensure you know what the counters mean and their sample rate

Page 34: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Suggested Methodology

Storage

Guest

CPU Memory

Network What are my config/

tuning options?

Change & re-test

Acceptable?

yes no

Page 35: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Storage: What’s important in the stack

VMkernel

Guest I/O Drivers

File System

Virtual SCSI

File System

Application

K

D

G

Windows Device Queue

R = Perfmon Physical Disk “Disk Secs/transfer”

R A

K = ESX Kernel

G = Guest Latency

A = Application Latency

D = Device Latency

S S = Windows Physical Disk Service Time

Page 36: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Storage: Key Indicators •  Kernel Latency Average (KAVG)

This counter tracks the latencies of IO passing thru the Kernel Investigation Threshold: 1ms

•  Device Latency Average (DAVG) This is the latency seen at the device driver level. It includes the round-trip time between the HBA and the storage. Investigation Threshold: 10-15ms, lower is better, some spikes okay

•  Device Latency Average (GAVG) This is the latency seen at the guest level. It is effectively DAVG + KAVG. Needed for network attached storage. Investigation Threshold: 10-15ms, lower is better, some spikes okay

Page 37: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

CPU: Key Indicators •  Ready (%RDY)

% time a vCPU was ready to be scheduled on a physical processor but couldn’t due to processor contention Investigation Threshold: 10% per vCPU

•  Co-Stop (%CSTP) % time a vCPU in an SMP virtual machine is “stopped” from executing, so that another vCPU in the same virtual machine could be run to “catch-up” and make sure the skew between the two virtual processors doesn’t grow too large Investigation Threshold: 3%

•  Used (%USED) Make sure the VM is not oversized.

Page 38: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Memory: Key Indicators

•  Balloon driver size (MCTLSZ) the total amount of guest physical memory reclaimed by the balloon driver Investigation Threshold: 1

•  Swapping (SWCUR) the current amount of guest physical memory that is swapped out to the ESX kernel VM swap file. Investigation Threshold: 1

•  Swap Reads/sec (SWR/s) the rate at which machine memory is swapped in from disk. Investigation Threshold: 1

Page 39: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Network: Key Indicators

•  Transmit Dropped Packets (%DRPTX) The percentage of transmit packets dropped. Investigation Threshold: 1

•  Receive Dropped Packets (%DRPRX) The percentage of receive packets dropped. Investigation Threshold: 1

Page 40: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Resources

Page 41: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Resources VMware’s Performance – Technical Whitepapers http://www.vmware.com/resources/techresources/cat/91,96 VMware’s Tech-Marketing Performance Blog http://blogs.vmware.com/vsphere/performance/ VMware’s Perf-Eng Blog (VROOM!) http://blogs.vmware.com/performance Performance Community Forum http://communities.vmware.com/community/vmtn/general/performance VMware Performance Links – Master List https://communities.vmware.com/docs/DOC-25253 Virtualizing Business Critical Applications http://www.vmware.com/solutions/business-critical-apps/

41

Page 42: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Resources Performance Best Practices http://www.vmware.com/pdf/Perf_Best_Practices_vSphere4.0.pdf http://www.vmware.com/pdf/Perf_Best_Practices_vSphere4.1.pdf http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.0.pdf http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.1.pdf http://www.vmware.com/pdf/Perf_Best_Practices_vSphere5.5.pdf https://www.vmware.com/resources/techresources/10480 <-vSphere 6 Troubleshooting Performance Related Problems in vSphere Environments http://communities.vmware.com/docs/DOC-14905 (vSphere 4.1) http://communities.vmware.com/docs/DOC-19166 (vSphere 5) http://communities.vmware.com/docs/DOC-23094 (vSphere 5.x with vCOps)

42

Page 43: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Resources Virtualizing Microsoft Business Critical Applications on VMware vSphere

by: Matt Liebowitz, Alexander Fontana

vSphere High Performance Cookbook by: Prasenjit Sarkar

Troubleshooting Storage Performance By: Mike Preston

VMware vSphere Performance: Designing CPU, Memory, Storage, and Networking for Performance-Intensive Workloads

By: Matt Liebowitz, Christopher Kusek, Rynardt Spies

Virtualizing SQL Server with VMware: Doing IT Right By: Jeff Szastak, Michael Corey, Michael Webster

Virtualizing Oracle Databases on vSphere By: Don Sullivan, Kannan Mani

VMware vRealize Operations Performance and Capacity Management By: Ewan ‘e1’ Rahabok

43

Page 44: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Resources VMware Hands-On-Labs http://labs.hol.vmware.com/ HOL-SDC-1404:

vSphere Performance Optimization – This has always been one of the most popular labs and has content for both the beginner and the advanced vSphere Administrator. You can learn more about the basics of vSphere Performance or delve into esxtop, or vNUMA.

http://labs.hol.vmware.com/HOL/#lab/1474

44

Page 45: VMware vSphere 6 Performance · PDF file3 4 5 6 256#bytes# 16KB 64KB $ $ Message$Size$ LRO$vs$NoLRO$CPU$Efficiency$Comparison$ with$NaFveGVM$Traffic$ CpuEffiQNoLRO# CpuEffiQLRO# Diff(%)

Thank You