VMworld 2013. Praveen Kannan, VMware; Samuel McBride, VMware. Learn more about VMworld and register at http://www.vmworld.com/index.jspa?src=socmed-vmworld-slideshare
How to Troubleshoot VM Performance Issues Across Applications, Infrastructure and Storage Using vCenter Operations Management (Live Demonstration!)
Praveen Kannan, VMware
Samuel McBride, VMware
VCM5169
#VCM5169
Agenda
Introduction to vCenter Operations Management Suite
Real world troubleshooting scenarios
Q&A
VMware Cloud Management Portfolio
Simple, automated management for the cloud
Cloud Service Provisioning
• vCloud Automation Center
• vFabric Application Director
Cloud Operations Management
• vCenter Operations Management Suite
• vCenter Log Insight
Cloud Business Management
• VMware IT Business Management Suite
New!
vCenter Operations Management Suite
The VMware Cloud Operations Management Platform
Cloud Operations Console
Extensibility
• APIs
• SDKs
• 3rd-party adapters
• Content Packs
• Helpdesk
Integrated Management Disciplines
• Performance, Compliance, Config, Capacity, Cost
Patented Analytics
• App Visibility, Logs, Inventory, Reporting, Automation
Product Overview: vSphere Dashboard
Overview
• Comprehensive monitoring for vSphere operations with health, risk and efficiency badge scores
• Single tool to manage performance and capacity across multiple vCenter servers
Benefits
• End-to-end visibility into cloud infrastructure health
• Ensure and restore service levels
• Optimize for efficiency and cost
Dashboard callouts
• Immediate Problems
• Future Problems
• Opportunities to Optimize
Product Overview: Custom Dashboards
Overview
• Monitor your business-critical apps and infrastructure through customizable dashboards
• A dashboard can combine one or more widgets to show related information about resources
Benefits
• Highly flexible: customize to the needs of each environment
• Create dashboards for different roles within the team
Product Overview: Details pages
Provides the next level of drill-down
Key metrics that drive Workload
Product Overview: Events
Correlate events across time
• Choose Badge
• For which objects should I show Alerts and Events?
• Overlay Badge Alerts
• Overlay Change Events
• Health Score Line
Product Overview: All Metrics
Compare across different metrics and visualize side-by-side
Day in the Life of an Admin
What happened on Sunday at 9 am?
What’s going on right now?
What do I need in three months?
Alerts
All Metrics
Symptom Summary
Explore Find Alert
Details
All Metrics
What-If
Reports
Demo
Other VMware Activities Related to This Session
HOL:
HOL-SDC-1301
Applied Cloud Operations
Group Discussions:
VCM1002-GD, VCM1004-GD
Cloud Operations with Hicham Mourad or Sam McBride
VCM5169
THANK YOU
Backup Slides
Key Concepts: HEALTH – Immediate issues
Workload
• Measures demand for a resource vs. its effective capacity
• Low number is good – object has the resources it needs
Anomalies
• Measures stats that are outside their "normal" trended ranges, based on self-learned behavior
• Low number is good – less chance of a problem
Faults
• Problems based on hardware failures, HA issues, etc.
• Low number is good – fewer issues!
Health: combines Workload, Anomalies and Faults to help admins
react to service impacting issues that need to be resolved immediately
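
The Workload idea above (demand for a resource vs. effective capacity) can be illustrated with a small arithmetic sketch. The slides do not give the product's actual scoring formula or say how Health weights its three inputs, so the function name, the percentage scale and the sample figures below are assumptions for illustration only.

```python
def workload_percent(demand: float, effective_capacity: float) -> float:
    """Return demand as a percentage of effective capacity.

    Illustrative only: a lower value means the object has more headroom,
    matching the "low number is good" reading on the slide.
    """
    if effective_capacity <= 0:
        raise ValueError("effective_capacity must be positive")
    return 100.0 * demand / effective_capacity


# Hypothetical host: 12 GHz of CPU demand against 16 GHz of effective capacity.
cpu_workload = workload_percent(12.0, 16.0)
print(f"CPU workload: {cpu_workload:.1f}%")  # CPU workload: 75.0%
```

A value above 100 in this sketch would mean demand exceeds what the object can effectively supply, which is the kind of immediate issue the Health badge is meant to surface.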
Key Concepts: RISK – Future Issues
Time remaining
• Time left before resources are exhausted
• High number is good – enough capacity to meet
Capacity remaining
• How many more VMs can I fit with what I have
• High number is good – no need to provision more
capacity immediately
Stress
• Patterns of long-term or chronic strain
• Low number is good - resources are not strained
Risk: combines the scores of Time remaining, Capacity remaining and
Stress to help admins be proactive about building problems
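
As a rough worked example of Time remaining and Capacity remaining, the sketch below uses a simple linear projection. This is an assumption made for illustration: the slides do not describe how the product forecasts growth or sizes an average VM, and the function names and sample numbers are hypothetical.

```python
import math


def time_remaining_days(capacity: float, used: float, growth_per_day: float) -> float:
    """Days until the resource is exhausted, assuming constant linear growth."""
    if growth_per_day <= 0:
        return math.inf  # no growth: the resource never runs out in this model
    return max(capacity - used, 0.0) / growth_per_day


def capacity_remaining_vms(capacity: float, used: float, avg_vm_size: float) -> int:
    """How many more average-sized VMs fit into the free capacity."""
    if avg_vm_size <= 0:
        raise ValueError("avg_vm_size must be positive")
    return int(max(capacity - used, 0.0) // avg_vm_size)


# Hypothetical datastore: 1000 GB total, 700 GB used, growing 5 GB per day,
# with an average VM footprint of 40 GB.
print(time_remaining_days(1000, 700, 5))      # 60.0
print(capacity_remaining_vms(1000, 700, 40))  # 7
```

Both values follow the "high number is good" reading on the slide: more days and more VM slots mean less pressure to provision capacity immediately.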
Key Concepts: EFFICIENCY – Opportunities to Optimize
Waste
• Shows reclaimable waste: idle, over-sized and over-provisioned resources
• Low number is good – indicates low waste
Density
• Compares current vs. optimal consolidation ratios of
VM, CPU and memory
• High number is good – indicates efficient use without
impacting performance
Efficiency: combines Waste and Density to highlight how to save cost
and maximize the use of resources.
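
The two Efficiency inputs can be read as simple ratios. The sketch below is one hedged interpretation, not the product's published calculation: the function names, the choice of vCPUs per core as the consolidation ratio, and the sample figures are all assumptions.

```python
def waste_percent(reclaimable: float, provisioned: float) -> float:
    """Share of provisioned resources that could be reclaimed (lower is better)."""
    if provisioned <= 0:
        raise ValueError("provisioned must be positive")
    return 100.0 * reclaimable / provisioned


def density_ratio(current_ratio: float, optimal_ratio: float) -> float:
    """Current consolidation ratio relative to the optimal one (higher is better)."""
    if optimal_ratio <= 0:
        raise ValueError("optimal_ratio must be positive")
    return current_ratio / optimal_ratio


# Hypothetical cluster: 128 GB of 512 GB provisioned memory is reclaimable,
# and hosts run 6 vCPUs per core against an assumed optimum of 8.
print(waste_percent(128, 512))  # 25.0
print(density_ratio(6, 8))      # 0.75
```

In this reading, reclaiming the idle memory would lower Waste, and packing more vCPUs per core toward the optimum would raise Density, both of which improve Efficiency.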