Upload
others
View
3
Download
0
Embed Size (px)
Citation preview
For internal use only. Not for external use or consumption.
Welcome to the Data Center & Cloud Webinar Series
© 2020 Cisco and/or its affiliates. All rights reserved.
Bilal Sherrief – Cisco Technical Solution ArchitectData Center and Cloud
Ajaz Shahid- Cisco Technical Solution ArchitectData Center and Cloud
• AI and ML Transforming Day2 Operations
Cisco Data Center Network Assurance and Insights
© 2020 Cisco and/or its affiliates. All rights reserved.
The golden age of
app development
network operations
© 2020 Cisco and/or its affiliates. All rights reserved.
#1Config error is a leading cause of breaches and
hacks1
average cost of unplanned downtime2
$250K/hour 43%% of IT time spent troubleshooting2
2McKinsey Study of Network Operations for Cisco–2016
1IDC, Cybersecurity is a Daunting Challenge that Requires a Holistic Solution: Implications from Cloud Pulse, 1Q19 Survey, DOC #US45678519, December 2019
We all have been here…
4 hr down time
$10M for investigation
$$$ Cost on global trading 75000 cancelled flights
$68M in passenger reimbursements
2.8% drop in stock price
600K missing transactions
6M customers impacted
$72M in fines
Human Error
#1 most common cause of outagesInadequate system automationQuantity of data exceeds human reasoning
Limited Insights
Poor system wide visibilityNo actionable and useful insightsRapid change cycles
Reactive Posture
Vulnerabilities unknown until breachDifficulty in correlating eventsLacking failure pattern learning and anticipation
Common denominators
© 2020 Cisco and/or its affiliates. All rights reserved.
Main challenges in network operations
Lacking pervasive visibility and insights
No event and issue correlation
Inability to understand change impact
Limited performance and availability
© 2020 Cisco and/or its affiliates. All rights reserved.
Cisco Advantage, Best Data, Best Knowledge Base
Accurate insights
Improved performance
Cisco Intent Based Network Assurance
Diverse data: Network, application, security
IPAM
CMXAppD
IPSLA
SNMP
OID
Telnet
DNS
MIB
Ping
CLI
DHCP
Wireless
AAA
Syslog
Router
NetFlowTraceroute
Your network
Streaming telemetry :Cloud connected
Diverse networks:Local and global
35years of top engineering knowledge
Communities
Distinguished Engineers
Cisco Fellows
CX TAC
Worldwide data platform
Anonymized dataAI/ML
Knowledge base
© 2020 Cisco and/or its affiliates. All rights reserved.
Transform day 2 networking operations from reactive to proactive
Cisco Data Center Network Assurance and Insights Suite
Assure intent“Verify impact of changes before deployment to ensure business intent”
Guarantee reliability“Identify and resolve problems before they impact business”
Troubleshoot intelligently“Leverage real time telemetry to accelerate remediation”
© 2019 Cisco and/or its affiliates. All rights reserved.
Day 2 Operations Stack – What Is It?
10
Troubleshooting/Monitoring• Fabric Health monitoring• Fabric wide resource monitoring • Anomaly detection • Endpoint and Flow Analysis
NIR
Assurance• Change Management• Compliance and Connectivity• Policy/ Control/ Data plane Assurance• Incident and Problem Management
Proactive Maintenance• Fabric health and maintenance based on global
Cisco advisories• Network security maintenance based on PSIRTs and
known vulnerabilities• TAC Troubleshooting Assist
OPSTACK
NAE
NIA
DCNMAPIC
© 2019 Cisco and/or its affiliates. All rights reserved.
Intent Automation
“What you want to happen”
Cisco ACI and DCNM
Assurance
“What will/should happen”
Configuration analysisWhat is wrong and how to fix it
Cisco Network Assurance
Cisco ACI/NXOS and Day 2 Operations
Analytics
“What is happening”
Traffic analysis,Active monitoring
Cisco Network Insights
© 2019 Cisco and/or its affiliates. All rights reserved.
Deployment-specific recommendations & best practices, upgrade impact analysis
Advisories
How Can NIA Help with Day 2 Operations?
NetworkInsightsAdvisor
Alert to bugs, PSIRTs,Forwarding state checksAnomalies
TAC assist, Enhanced TAC AssistDiagnostics
Inbox function/Smart Inbox*, proactive EOL/EOS announcements, new Field Notices, new software/SMUs
Notices
System hardening checks, version-specific scale limits monitoring (NIR -> NIA) to generate advisory *
Compliance
* Roadmap
© 2020 Cisco and/or its affiliates. All rights reserved.
Prevention and RemediationUse case
Customer benefitsProactive advisory to preserve infrastructure health
Predictive alerts to fix Issues before impact
Automatically identify root cause of issues
TAC Assist and Remote Fix
Prevent Outages
Accelerate Remediation
© 2019 Cisco and/or its affiliates. All rights reserved.
Monitor fabric-wide and node-specific resource utilization Resources
How Can NIR Help with Day 2 Operations?
NetworkInsights
Resources
Track CPU & memory consumption, monitor power and temperatureEnvironmental
Track endpoint flows and paths, identify applications experiencing high latency or packet drops
Flows
Correlate changes to events, identify faultsEvents
Monitor network bandwidth utilization, packet drops, and network protocol statistics
Statistics
© 2020 Cisco and/or its affiliates. All rights reserved.
Productivity and AvailabilityUse case
Customer benefitsKnowledge base digitization to reduce manual process
Machine Learning assisted Anomaly detection
Utilize telemetry data to provide real time visibility
Automated risk exposure analysis
IncreaseProductivity
EnsureAvailability
© 2019 Cisco and/or its affiliates. All rights reserved. © 2019 Cisco and/or its affiliates. All rights reserved. Cisco Public
Network Assurance Engine: How it Works
• How it Works
16
Capture DC Wide Intent, Policy, Control/State across Forwarding &
Security
Precise Formal Models that codify Cisco’s 30+ Years of Networking and Cross Customer
Domain Knowledge
Data Collection Modeling of Network Continuous Analysis
Models verify that Network operates per Intent, and accurately tell what is wrong,
where, why, impact and how to fix
Expert in the NOC watching the state of fabric 24 hours a day x 365 days a year!
© 2019 Cisco and/or its affiliates. All rights reserved.
Verification Results Delivered via Smart Events
Reduce Mean Time to Repair with Precise Analysis and Remediation
What ? Who and Where ?
Why ? How to fix ?
© 2019 Cisco and/or its affiliates. All rights reserved.
Change management
Customer benefitsDetect network impact before and after changes
Use case
Reduce maintenance windows require for changes
Lifecycle management for network issues
Drive faster change approval and service deployment to meet business needs
Configchange deltaHealth delta
01/06/2020
© 2019 Cisco and/or its affiliates. All rights reserved.
Compliance and connectivityUse case
Customer benefitsEnsure network policies are compliant against business rules
Pass audits faster and comply with regulatory clauses
Ability to query the fabric to gain deeper insights to connectivity and asset relationship
Continuous fabric analysis and visibility to reduce security issues
Policy explorerCompliance health score
© 2019 Cisco and/or its affiliates. All rights reserved.
Cisco Application Services Engine
2.2 GHz 10 core CPU x 2
256 GB memory
2.4 TB x 4 HDD
10G/25G/40G connect
Network insights
Network Assurance Engine * 3rd Party apps *
Modern Scale-out application services stack to host Day-2 Operations applications
SE-CL-L3 Network automation Scale-out cluster
Supported fromACI 4.2
ACIDCNM
* In planning
© 2020 Cisco and/or its affiliates. All rights reserved.
Transforming day 2 operations from reactive to proactiveCisco Data Center Network Assurance and Insights Suite
No More Mistakes. Uncertainties. Surprises.www.cisco.com/go/dcassuranceinsights