Upload
duongdan
View
215
Download
0
Embed Size (px)
Citation preview
S O N i C D e v e l o p m e n t a n d D e p l o y m e n t a t A l i b a b aHaiyong WangSenior DirectorAlibaba Group
Search Engine AD Engine
Business Platform
Software
Infrastructure
Security
Productivity
Computing platform
TMall Taobao AliExpress Data AdsCloud
Computing
Digital Media
Entertainment CainiaoAnt
Financial
ML, Speech, NLP, Image & Video
Container/VirtualizationScheduling
Modular / Customization
High throughput Low latency
Intelligent
Cloud / Heterogeneous Anti-fraud
Privacy
Algorithm component / Deep learning framework / Visualization
EDAS / MQ / ARMS
Graph / Stream / Batch
Operation System Database Storage Middleware
Server NetworkData Center Operation
Data platform Algorithm platform
Green Available Scalable
OneID / OneData / OneService / GProfile
Alibaba Technology Stack
Alibaba Network OSAlibaba Network Management and Monitoring Services
Switch Hardware
DB
L2/L3 Monitoring Management
System Process
Hardware Abstraction
SONiC
• Focus on hyper-scale datacenters• Open source and vibrant community• Wide support from ODMs and ASIC vendors• Linux software and integration
Community Participation
• Announce joining SONiC@OCP community at ONS 2017• Expand ODM and ASIC vendor SONiC@OCP support• Lead SONiC adoption in China• Major contributor to SONiC
BGP Enhancements
• Announce low priority routes for graceful device restart• RFC7911 enhancement to support multipath advertisement• Enable ECMP member de-active on data plane before route removal• Replace periodic timer with event trigger for faster response and
lower CPU
Networking Telemetry
Networking data collectors
Networking data analytics
UI
Switch
Troubleshooting Monitoring/Alert
CLI SNMP Syslog Mirror INT gRPC
gRPC Architecture
Virtual DB paths
SONiC Redis DB
gRPC Get andSubscribe (poll and stream)
SONiC System Telemetry gRPC service
SONiC Redis DB
SONiC Redis DB DB Client
Non-DB Client
Platform data
Proc data
ASIC data(non-SAI )
gRPC Dial-in Server (gNMI server)
gRPC Dial-out Client ( gNMIDiaout Publish service)
System Telemetry data Collectors
gRPC Publish (periodic and stream)
Quagga/FRR Data
Ali CLI
TACACS+ Linux CMD APP CMD
Authorization
CFG DB APP DB Counter DB
Config/ShowShow
socket
APP Data
ShowConfig
Unified CLI for Operation
Get Ready for Production
• Feature development and customization• Testing
• Automate regression with Robot framework• Add test topology to align with Alibaba network architecture
• Operation• Integrate with network management and monitoring systems• Customized zero touch provision• Unified CLI for network operation team
Alibaba Data Center Network Deployment
���
POD
��� ��� ��� ���
POD
��� ��� ���
��� ��� ��� ��� ��� ��� ��� ���
Challenges
• Multi-vendor hardware troubleshooting• Interoperability of INT metadata across multi-vendor devices
Summary
• Alibaba has became a key contributor to SONiC community• Tremendous progress of developing and deploying NOS at Alibaba• Future work
• 10X deployment scale• Expand to more device roles• Programmable network