49
© 2006 Cisco Systems, Inc. All rights reserved. Cisco Confidential Presentation_I D 1 Nexus 5000 Architecture

Nexus 5000 Architecture

  • Upload
    sucol

  • View
    969

  • Download
    4

Embed Size (px)

DESCRIPTION

Nexus 5000 Switch Architecture

Citation preview

Page 1: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 1

Nexus 5000 ArchitectureNexus 5000 Architecture

Page 2: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 2

Agenda

System Hardware Overview

Internal Architecture

Fabric Data Path

Lossless data path

Forwarding and Policy Enforcement

Page 3: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 3

Nexus 5000 Solution Components

• VM-Optimized network services

• End Port Virtualization

• Unified I/O with 10Gb and FCoE

• Delivers increased, security operational agility, and better utilization of network assets

VirtualizationNetworking

VirtualizationNetworking

MACB

MACA

MACC

A & B C

End nodes

LAN

MACB

MACA

MACC

A & B C

End nodes

LAN

• L2 Multi-pathing for increased bandwidth, scalable L2 domains

• Priority Flow Control –based Lossless fabric

• Congestion Management

• DCE auto-negotiation

Lossless fabric & Greater Data Center Scale

Data Center Ethernet (DCE)

Data Center Ethernet (DCE)

LAN

N5000

Active-Active

MACB

MACA

• Unified fabric for LAN, SAN, HPC/IPC

• Enables FC connectivity across more servers

• Fewer switches, fewer points of management

• Significant cable and adapter reduction

• Lower cost, power, cooling

FCoEFCoE

LAN SAN BSAN ALAN SAN BSAN A

N5000

StandardsStandards

Eco System PartnersEco System Partners

• 10GE L2 non-blocking, wire-speed switch

• Competitive price-performance

• Low latency < 3.2s

• Lossless

• Unified fabric

• Future proof

Wire speed 10GE Access Switch

Wire speed 10GE Access Switch

LAN

Access LayerN5000

Page 4: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 4

System Hardware Overview

Page 5: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 5

Nexus 5000 Product PortfolioIndustry’s First I/O Consolidation Virtualization Fabric for

Enterprise Data CenterIndustry’s First I/O Consolidation Virtualization Fabric for

Enterprise Data Center

OS

Cisco Fabric Manager and Cisco Data Center Network Manager

Cisco NX-OS

FC + Ethernet • 4 Ports 10GbE/FCoE • 4 Ports 1/2/4G FC

Mgmt

ExpansionModules

Ethernet • 6 Ports

10GE/FCoE

Eco System PartnersEco System Partners

Fibre Channel • 8 Ports 1/2/4G FC

28-Port 1RU Switch 56-Port 2RU Switch

Page 6: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 6

Rear PanelsNX5020NX5020

Expansion Module(s)

Cables connect in the rear for ease of server wiringCables connect in the rear for ease of server wiring

Power Entry

Base 10GE 10/100/1000

Out of Band Mgmt Console

All 10GE ports are FCoE capable!

Support for 1 GE Support for crypto

Page 7: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 7

Front PanelsNX5020NX5020

N+1 redundant fans

Replaceable components on the front for easy accessReplaceable components on the front for easy access

Dual redundant power supplies

Page 8: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 8

Power

NX5020Power Supply

NX5020Power Supply

Fully redundant, load sharing and hot swappableFully redundant, load sharing and hot swappable

Maximum Power – 750WTypical Operating Power – 480WAC Input - 110/208 VoltsEfficiency - 82-88%Protection - 110%-150% max load

Page 9: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 9

Expansion Modules

Ethernet Expansion Module

Ethernet Expansion Module

CombinationExpansion Module

CombinationExpansion Module

Six 10G Ethernet

Four 10G Ethernet

Four 1/2/4GFibre Channel

Fibre ChannelExpansion Module

Fibre ChannelExpansion Module Eight 1/2/4G

Fibre Channel

Page 10: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 10

Cooling

NX5020Cooling Module

NX5020Cooling Module

Max RPM - 12KFailover - N+1Op Temp - 0 to 40 CHumidity - 95% non-condensingElevation - 10K feet

Page 11: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 11

SFP+ Transmission Media

SFP+ CuSFP+ Cu

CableTransceiver

Latency (link)Power

(each side)DistanceTechnology

Twinax ~0.25 s~0.1W10mSFP+ CUCopper

MM OM2MM OM3

~0.1 s1W82m300m

SFP+ SRshort reach

MM OM2MM OM3

~0.1 s1W10m100m

SFP+ USRultra short reach

Cat6Cat6a/7Cat6a/7

2.5s2.5s1.5s

~8W~8W~4W

55m100m30m

10GBASE-T

•Low power consumption•Low cable cost•Low transceivers latency•Low error rate (10 exp-17)

Page 12: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 12

Internal Architecture

Page 13: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 13

Hardware Architecture

10 GE& FC

Inte

l 31

00

PC

I Co

ntro

ller

IntelLV Xeon

(1.66 GHz)

FLASH

NVRAM

Serial

PCIe

1GE

10 GE Interfaces

10 GE Interfaces 10 GEInterfaces

RS-232 Console

10/100/1000 Management

XAUI

XFI

Dual NICXAUI

XFI

UnifiedPort

Controller

XAUI

XFI

UnifiedPort

Controller

UnifiedPort

Controller

10 GE

UnifiedPort

Controller

UnifiedPort

Controller

UnifiedPort

Controller

1/2/4 Gbps Fibre Channelto Storage Network

Memory

UnifiedCrossbar

Fabric

UnifiedPort

Controller

Dual NIC

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFPxcvr

SFPxcvr

SFPxcvr

SFPxcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

SFP+xcvr

Page 14: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 14

Supervisor Details

CPU 1.66 GHz Intel LV Xenon - LF80538KF0281M

IO Chip Set Intel 3100 South Bridge for embedded applications

DRAM 2 GBytes of DDR2 400 (PC2 3200) in two DIMM slots

Program Store 1 GBytes of USB based (NAND) Flash

Boot/BIOS 2 Mbytes of EEPROM with locked recovery image

On-board Fault Log 64 MBytes of Flash for failure analysisKernel Stack traces, boot record and fault logs

NVRAM 2 Mbytes of SRAM – Syslog and licensing information

Secure Keystore Renesas AE46C1 – Credentials and secure RNG

Management Interfaces RS-232 console port – console0

10/100/1000BASE-T – mgmt0 partitioned from inband VLANs

Page 15: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 15

Unified Crossbar Fabric

58 port crossbar and scheduler

3 unicast and 1 multicast crosspoints

Central tightly coupled scheduler

Request, propose, accept, grant, acknowledge semantics

Packet enhanced iSLIP scheduler

Distinct unicast and multicast schedulers

Eight classes of service

Egress buffer credits

DWRR class of service

DWRR ingress interface

Total SRAM 24.6 Mbits

Gates 12.4 Million

Transistors ~200 Million

Metal Layers 7

Signal Pins 1286

SerDes 232 @ 3.75Gbps

Page 16: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 16

Unified Port Controller

Media Access Controllers

1/10G Ethernet and 1/2/4G Fibre Channel

Packet Buffering and Queuing

Total of 1.875 MBytes used in four slices

Forwarding Controller

Ethernet, Fibre Channel

Layered policy engine

Four data path slices

One 1/10G Ethernet or two 1/2/4G Fibre Channel ports

Connects to one Altos port

All switching done in Altos crossbar

480 KBytes of buffering

Total SRAM 35 Mbits

Total TCAM 1 Mbit

Logic Gates 18 Million

Transistors ~300 Million

Metal Layers 7

Total Pins 900

SerDes 32 @ 3.75Gbps

Page 17: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 17Unified Crossbar Fabric

Unified Port Controller

Slice 4Slice 2 Slice 3

Switch ASIC Architecture

Slice 1

Forwarding

?

1/10G MAC

Transceiver

VirtualQueues

EgressQueues

PacketBuffer

VirtualQueues

EgressQueues

PacketBuffer

Unified Port Controller

Slice 2 Slice 3Slice 1

VirtualQueues

EgressQueues

PacketBuffer

Slice 4

4 @ 3.75G – 12Gbps 4 @ 3.75G – 12Gbps 4 @ 3.75G – 12Gbps

XAUI – 10 Gbps4 @ 3.125G

Transceiver

XAUI – 10 Gbps4 @ 3.125G

1/10GE Attached Server

10GE LAN Uplink

58 source busses in total

Parsing &Editing

Forwarding

?

ForwardingParsing &

Editing

1/10G MACFC

MAC

SAN B

FCMAC

Fibre ChannelSAN Uplinks

1/2/4G Fibre Channel1 @ 1.0625/2.125/4.25G

Parsing &Editing

Fabric Buffer Fabric Buffer Fabric BufferUnicast and

Multicast Schedulers

Page 18: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 19

Switch Fabric Data Path

Page 19: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 20

Altos

Gatos

Port 4

Data Path Deep Dive

Media Access Controllers

Crossbar operation

Unicast

Multicast

Latency

Port 1

Forwarding

?

1/10G MAC

Transceiver

VirtualQueues

EgressQueues

PacketBuffer

Fabric Buffer

VirtualQueues

EgressQueues

PacketBuffer

Fabric BufferUnicast and

Multicast Schedulers

10GE Attached Servers

58 source busses in total

Parsing &Editing

1/10G MAC

Parsing &Editing

Transceiver

Page 20: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 21

Media Access Controllers

Each Unified Port Controller slice has…One 1 Gigabit Ethernet MAC

One 10 Gigabit Ethernet MAC

Two 1/2/4 Gigabit Fibre Channel MACs

Two of the slices in each Gatos have an 802.1AE LinkSec encryption engine

Integrated Flow Control handlingEthernet – 802.3X “PAUSE” and Cisco Priority Flow Control

Fibre Channel – BB_credits

Page 21: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 22

Crossbar Overview

Tightly coupled scheduler and crosspoint

20% link speedup

12 Gbps

Unicast Scheduler

Virtual Output Queuing

3x fabric speed up

3 crosspoints

Multiple frames transferred per scheduling event

“Superframing”

Multicast Scheduler

System Class queuing

Separate crosspoint

Page 22: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 23

Unicast Virtual Output Queuing

CrossbarFabric

Ingress Port

Packet Buffer

Egress Port

Packet BufferPacket Buffer

SchedulerEgress Queue

Q1

Q8

Q1

Q8

Egress Queue

VOQ 1

VOQ N

Egress Port

Packet

Buffer

Packet

Buffer

Egress Queue

Q1

Q8

VOQ 1

VOQ N

Q1

Q8

Q1

Q8

Page 23: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 24

Day in the Life of a Unicast Frame

1. Frame arrive into Packet buffer

2. Frame pointer posted to Virtual Output Queue

3. VOQ posts request to Scheduler

4. Scheduler arbitrates and grants access

5. Frame sent to Fabric Buffer

6. Fabric Buffer sends to egress queue

7. Egress port sends frame on wire

8. Egress buffer indicates freed buffer resources

CrossbarFabric

Ingress Port

Packet BufferPacket Buffer

Egress Port

Packet BufferPacket Buffer

Scheduler

Egress QueueEgress Fabric buffer

Q1

Q8

Q1

Q8

Egress Queue

VOQ 1

VOQ N

VOQ 1

VOQ N

1

2

3

4

56

7

8

Page 24: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 25

Scheduler overview

RequestProposal/mandate

Grant

Accept

EgressScheduler

EgressScheduler

EgressScheduler

IngressScheduler

IngressScheduler

IngressScheduler

VOQs IF1

VOQs IF2

VOQs IF3

Page 25: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 26

Unicast scheduler algorithm

Egress Scheduler A priority is selected

Fixed priority, or

DWRR

An ingress is selected within that priority

Highest priority “current preferred” ingress is given a “mandate”

iSLIP maximally matches remaining requesters

Ingress SchedulerEgress schedulers make a proposal

Ingress scheduler selects an egress

Fixed Round Robin selection

The selected Egress Scheduler updates its own “Current preferred”

In multi-pass scheduling, this step happens only for first-pass selections

Page 26: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 27

Multicast

Page 27: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 28

Multicast MAC lookups

MAC table32K entries total (unicast, multicast, Fibre Channel)

1K entries (software setting) for multicast

Populating multicast MAC tableIGMP snooping

Static

Multicast MAC lookup missSource only multicast (for L3 multicast)

Forward frame to interfaces linked to Multicast Routers

Learned via PIM snooping

Flooding (for L2 multicast)

Page 28: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 29

Multicast, fabric replication

CC

Ingress Fabric Egress

BMMcast

A

Ucast

BMcast

C

Mcast

AU-VOQU-VOQ

BBU-VOQU-VOQ

U-VOQU-VOQ

Use cases• Ethernet multicast

M-VOQM-VOQAA

Mcast

A

Mcast

A

Page 29: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 30

Multicast Class Queuing

Port 1MC Ingress PriorityQueues

EgressPriorityQueues

Port 2MC Ingress PriorityQueues

EgressPriorityQueues

Port

4MC

Ingr

ess

Prio

rity

Queu

es

Egre

ssPr

iorit

yQu

eues

MC Ingress PriorityQueues

EgressPriorityQueues

Port 3Port 1MC Ingress PriorityQueues

EgressPriorityQueues

Port 1MC Ingress PriorityQueues

MC Ingress PriorityQueues

MC Ingress PriorityQueues

EgressPriorityQueues

EgressPriorityQueues

EgressPriorityQueues

Port 2MC Ingress PriorityQueues

EgressPriorityQueues

Port 2MC Ingress PriorityQueues

MC Ingress PriorityQueues

MC Ingress PriorityQueues

EgressPriorityQueues

EgressPriorityQueues

EgressPriorityQueues

Port

4MC

Ingr

ess

Prio

rity

Queu

es

Egre

ssPr

iorit

yQu

eues

Port

4MC

Ingr

ess

Prio

rity

Queu

es

MC In

gres

s Pr

iorit

yQu

eues

MC In

gres

s Pr

iorit

yQu

eues

Egre

ssPr

iorit

yQu

eues

Egre

ssPr

iorit

yQu

eues

Egre

ssPr

iorit

yQu

eues

MC Ingress PriorityQueues

EgressPriorityQueues

Port 3MC Ingress PriorityQueues

MC Ingress PriorityQueues

MC Ingress PriorityQueues

EgressPriorityQueues

EgressPriorityQueues

EgressPriorityQueues

Port 3

Page 30: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 31

Multicast Scheduling AlgorithmGl

obal

RR

poin

ter

Pref

erre

d in

gres

s =

X

DWRR priorityselection

Egress 1Scheduler

Egress 2Scheduler

…Egress 57Scheduler

Ingress 3Scheduler

Match

Ingress selected

Glob

al R

R po

inte

rPr

efer

red

ingr

ess

= X

Request vector Proposal vector

DWRR priorityselection

Egress 1SchedulerEgress 2

SchedulerEgress 2

SchedulerEgress 3

Scheduler……

Egress 57Scheduler

Egress 58Scheduler

Match

Egress 2SchedulerEgress 2

SchedulerEgress

Scheduler

Multicast Fabric Buffer

Free?

Egress BufferFree?

Multicast Fabric Buffer

Free?

Egress BufferFree?

Multicast Fabric Buffer

Free?

Egress BufferFree?

…..

1

Page 31: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 32

Altos

Gatos

Port 4

Latency 3.2 µsec port-to-port

First-In-First-Out

Full featured forwarding

6.7 µsec kernel to kernel

Stateless offloads

no DDP

1.4 µsec host send

2.1 µsec host receive

Increases with OS, interrupt, and transport overheads

Socket layer app-to-app

Linux 2.6

Raw – 10.1 µsec

UDP – 11.2 µsec

TCP – 11.8 µsec

Port 1

Forwarding

?

1/10G MAC

Transceiver

VirtualQueues

EgressQueues

PacketBuffer

Fabric Buffer

VirtualQueues

EgressQueues

PacketBuffer

Fabric BufferUnicast and

Multicast Schedulers

10GE Attached Servers

58 source busses in total

Parsing &Editing

1/10G MAC

Parsing &Editing

Transceiver

Page 32: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 33

Lossless Data Path

Page 33: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 34

QoS Processing Flow

STOP

MACMAC TrafficClassification

TrafficClassification

IngresspolicingIngresspolicing

MTU checking

MTU checking

Per-classBuffer usageMonitoring

Per-classBuffer usageMonitoring

VoQs for unicast

(8 per egress port)

Egress Queues

MarkingMarking

PFC ON/OFF signal

Trust CoSL2/L3/L4 info with ACL

Trust CoSL2/L3/L4 info with ACL

Truncate or droppackets if MTU is violated

Truncate or droppackets if MTU is violated

If buffer usage crosses threshold:• Tail drop for droppable class• Assert PFC signal to MAC

for no-drop system class

If buffer usage crosses threshold:• Tail drop for droppable class• Assert PFC signal to MAC

for no-drop system class

Mark packet with CoS value

Mark packet with CoS value

EgressScheduler

Strict priority +DWRR scheduling

Strict priority +DWRR scheduling

Central Scheduler

8 muticast VoQs

STOP

MACMAC TrafficClassification

TrafficClassification

IngresspolicingIngresspolicing

MTU checking

MTU checking

Per-classBuffer usageMonitoring

Per-classBuffer usageMonitoring

VoQs for unicast

(8 per egress port)

Egress Queues

MarkingMarking

PFC ON/OFF signal

Trust CoSL2/L3/L4 info with ACL

Trust CoSL2/L3/L4 info with ACL

Trust CoSL2/L3/L4 info with ACL

Trust CoSL2/L3/L4 info with ACL

Truncate or droppackets if MTU is violated

Truncate or droppackets if MTU is violated

Truncate or droppackets if MTU is violated

Truncate or droppackets if MTU is violated

If buffer usage crosses threshold:• Tail drop for droppable class• Assert PFC signal to MAC

for no-drop system class

If buffer usage crosses threshold:• Tail drop for droppable class• Assert PFC signal to MAC

for no-drop system class

If buffer usage crosses threshold:• Tail drop for droppable class• Assert PFC signal to MAC

for no-drop system class

If buffer usage crosses threshold:• Tail drop for droppable class• Assert PFC signal to MAC

for no-drop system class

Mark packet with CoS value

Mark packet with CoS value

Mark packet with CoS value

Mark packet with CoS value

EgressScheduler

Strict priority +DWRR scheduling

Strict priority +DWRR scheduling

Strict priority +DWRR scheduling

Strict priority +DWRR scheduling

Central Scheduler

Central Scheduler

8 muticast VoQs

Page 34: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 35

Class Based Data Path

Different classes of traffic require different treatment, e.g.

FC class of traffic requires lossless or no drop treatment

Market Data Ethernet class traffic may also require no drop

Remaining Ethernet Data may only require best effort

Nexus 5000 data path resource and features are all per class based; for example,

Per class VOQs and egress queues, buffers, MTU, drop behavior

Per Class behavior should be consistently configured system wide and network wide

Page 35: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 36

Nexus 5000 supports Modular QoS CLI (MQC) for all QoS configuration

System is a new target introduced at the global cfg level

System classes are instantiated within a system policy

System policy is a service-policy attached to the ‘system’ target

At FCS, parameters configurable under system class:

MTU

Drop, no Drop

At ingress, packets are classified into a system class

At FCS, classification can be based on .1p or interface

Once classified, this class assignment travels with the packet through the entire system to select per class treatment at every step

At Egress, 802.1p rewrite is supported. 802.1p value can then be consistently used throughout the network to select the same system class treatment

System Class

Page 36: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 37

PFC and BB_Credits

IEEE 802.3x Pause provides no drop flow control similar to BB credits for FC

Priority Flow Control is a finer grained mechanism of flow control over standard pause or link level BB credits

Priority Flow Control uses .1p CoS value mapping to a system class to send appropriate pause to previous hop

Page 37: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 38

Priority Flow Control

Priority based Flow ControlPriority based Flow Control

• Enables lossless behavior for each class of service• PAUSE sent per priority when buffers limit exceeded

Page 38: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 39

Priority based bandwidth managementPriority based

Bandwidth ManagementPriority based

Bandwidth Management

• Enables Intelligent sharing of bandwidth between traffic classes control of bandwidth• 802.1Qaz Enhanced Transmission

Page 39: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 40

Forwarding

Page 40: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 41

Data Path

Altos

Gatos

Port 4Port 1

Forwarding

?

1/10G MAC

Transceiver

VirtualQueues

EgressQueues

PacketBuffer

Fabric Buffer

VirtualQueues

EgressQueues

PacketBuffer

Fabric BufferUnicast and

Multicast Schedulers

10GE Attached Servers

58 source busses in total

Parsing &Editing

1/10G MAC

Parsing &Editing

Transceiver

Page 41: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 42

Forwarding Pipeline

Wire rate “fixed” latency

Parsed frame fields, configuration, and control plane state are evaluated to determine destination(s)

Policy engine filters based on configuration, bindings, and layered ACLs

Layered equal cost multi path expansion

Fibre Channel

EtherChannel/PortChannel

Parsed Packet

Collect Interface Configuration and

State

Virtual Interface Table (512)

Vlan Translation Table (4K)

Vlan State Table (1K)

Determine Destination

(ingress only)

Fibre Channel Switch Table (4K)

EthernetLearning

Policy EnforcementACL Search Engine

(2K)

MultipathExpansion

(ingress only)

Zoning Table(2K)

RBACL Label Table(2K)

Binding Table(2K)

Fibre Channel Multipath Table (1K)

PortChannel Table(16)

Multicast Vector Table (4K)Station Table

(16K)

Editing Instructions &Virtual Output Queue List

Page 42: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 43

Parsed Packet

Interface StateVirtual

Interface Table (512)

Vlan Translation Table (4K)

Vlan State Table (1K)

Forwarding(ingress only)

Fibre Channel Switch Table

(4K)

EthernetLearning

Policy Enforcement

ACL TCAM(2K)

MultipathExpansion

(ingress only)

Zoning Table(2K)

RBACL Label Table(2K)

Binding Table(2K)

Fibre Channel Multipath Table

(1K)

PortChannel Table(16)

Multicast Vector Table

(4K)Station Table

(16K)

Virtual Output Queue List

Destination address

Source address

Ethertype = 2

FCS

Ethertype = .1Q VLANCoS d

TOS Total lenVer IHL

Identification Flg Frgm offset

TTL Proto Header cksum

Source address

Destination address

IP options

Src port Dst port

Seq number

Ack number

Hdr len Flags Win size

Cksum Urgent ptr

TCP options and data

checksum check

FCS check

Parsing ethernet IP packets

Page 43: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 44

Parsed Packet

Interface StateVirtual

Interface Table (512)

Vlan Translation Table (4K)

Vlan State Table (1K)

Forwarding(ingress only)

Fibre Channel Switch Table

(4K)

EthernetLearning

Policy Enforcement

ACL TCAM(2K)

MultipathExpansion

(ingress only)

Zoning Table(2K)

RBACL Label Table(2K)

Binding Table(2K)

Fibre Channel Multipath Table

(1K)

PortChannel Table(16)

Multicast Vector Table

(4K)Station Table

(16K)

Virtual Output Queue List

Destination address

Source address

FCS

Ethertype = .1Q VLANCoS d

Ethertype = FCoE Ver

Reserved

SOF

EOF Reserved

r_ctl d_id

seq_id df_ctl seq_cnt

ox_id rx_id

Payload

Parameters

cs_ctl s_id

type f_ctl

CRC

CRC checkFCS check

Fibre Channel frames are FCoE encapsulated prior to

forwarding

Parsing FCoE packets

Page 44: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 45

Unified Fabric & FCoE

Page 45: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 46

Mapping of FC Frames over Ethernet

Enables FC to Run on a Lossless Ethernet Network

Mapping of FC Frames over Ethernet

Enables FC to Run on a Lossless Ethernet Network

Fewer Cables

Both block I/O & Ethernet traffic co-exist on same cable

Fewer adapters needed

Overall less power

Interoperates with existing SAN’s

Management SAN’s remains constant

No Gateway

Fewer Cables

Both block I/O & Ethernet traffic co-exist on same cable

Fewer adapters needed

Overall less power

Interoperates with existing SAN’s

Management SAN’s remains constant

No Gateway

FCoEFCoE BenefitsBenefits

FC over Ethernet (FCoE)

Fibre Channel Traffic

Ethernet

Page 46: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 47

Destination MAC Address

Source MAC Address

IEEE 802.1Q Tag

ET = FCoE Ver Reserved

Reserved

Reserved SOF

Encapsulated FC Frame(Including FC-CRC)

EOF Reserved

FCS

Reserved

FCoE Standard Specification Status

An extension of FC over lossless Ethernet

FCoE Standard Specification in ANSI INCITS FC T11.3

Frame Format agreement Aug. 2007

Addressing scheme ratified Feb. 2008

Target completion 2H08

24 companies behind the standard

FCoE Frame FormatBit 0 Bit 31

Eth

ern

etH

ead

er

FC

oE

Hea

der

FC

Hea

der

FC Payload CR

C

EO

F

FC

S

Byte 0 Byte 2197

Page 47: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 48

I/O consolidation Phase 1 (Mid 2008)

Reduction of server adapters

Simplification of access layer & cabling

Gateway free implementation – fits in installed base of existing LAN and SAN

L2 Multipathing Access – Distribution

Lower TCO

Fewer Cables

Investment Protection (LANs and SANs)

Consistent Operational Model

Today:Parallel LAN/SAN Infrastructure

Inefficient use of Network Infrastructure

5+ connections per server – higher adapter and cabling costs

Adds downstream port costs;

cap-ex and op-ex

Each connection adds additional points of failure in the fabric

Longer lead time for server provisioning

Multiple fault domains – complex diagnostics

Management complexity

I/O Consolidation

Enhanced Ethernet and FCoE Ethernet FC

LAN SAN BSAN A

Today I/O Consolidation with FCoE

LAN SAN BSAN A

Nexus 5000Nexus 5000

Page 48: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 49

Nexus 5000: I/O Consolidation

16 Servers Enet FC Total

Adapters 20+ 20 40*

Switches 2 2 4

Cables 40 40 80

Mgmt Pts 2 2 4

16 Servers Enet FC Total

Adapters 20 0 20

Switches 2 0 2

Cables 40 0 40

Mgmt Pts 2 0 2

4

2

8

2

LAN SAN BSAN A LAN SAN BSAN A

Nearly twice the Cables

Nearly twice the Cables

• Half the Cables• Half the Adapters• Power & Cooling Savings• Consistent Management

• Half the Cables• Half the Adapters• Power & Cooling Savings• Consistent Management

Page 49: Nexus 5000 Architecture

© 2006 Cisco Systems, Inc. All rights reserved. Cisco ConfidentialPresentation_ID 50

Menlo: I/O Consolidation Network Adapter

Off the shelf NIC and HBA ASICs from: Qlogic, Emulex

Dual 10GE/FCoE ports

Support for native drivers and utilities

Customer certified stacks

Replaces multiple adapters per server

Consolidates 10GE and FC on a single interface

Minimum disruption in existing customer environments

Supports PFC & DCBX

Linux (SLES & Redhat) and Windows versions

June 2008 availability

10GE/FCoE

PCIe Bus

Menlo ASIC Cisco/Nuova designed multiplexer and FCoE offload protocol engine

Menlo ASIC Cisco/Nuova designed multiplexer and FCoE offload protocol engine

FC10GE