16
RDMA In Virtual Environments (v1.0) Aaron Blasius, ESXi Product Manager Bhavesh Davda, Office of CTO VMware April 25, 2013 1 VMware

RDMA In Virtual Environments (v1.0)

  • Upload
    others

  • View
    21

  • Download
    0

Embed Size (px)

Citation preview

Page 1: RDMA In Virtual Environments (v1.0)

RDMA In Virtual Environments (v1.0) Aaron Blasius, ESXi Product Manager Bhavesh Davda, Office of CTO VMware

April 25, 2013 1 VMware

Page 2: RDMA In Virtual Environments (v1.0)

iSCSI vSAN FT vMotion

RDMA Verbs TCP/IP

RDMA for hypervisor services

April 25, 2013 VMware 2

10 GigE

Virtual Switch

10/40 GigE RoCE

Page 3: RDMA In Virtual Environments (v1.0)

vMotion/RDMA Performance

April 25, 2013 VMware 3

70.63

45.31

01020304050607080

TCP/IP RDMA

36 % faster

330,813.66

432,757.73

0 100000 200000 300000 400000 500000

TCP/IP

RDMA

30% Higher

14.18 Gbps

10.84 Gbps

Total vMotion Time (seconds) Precopy bandwidth (Pages/sec)

0

10

20

30

40

50

0:00

0:05

0:11

0:16

0:21

0:26

0:31

0:36

0:41

0:46

0:51

0:56

1:01

1:06

1:11

1:16

% C

ore

Util

izat

ion

used

by

vMot

ion

92% Lower

Destination CPU Utilization

05

101520253035404550

0:00

0:05

0:11

0:16

0:21

0:26

0:31

0:36

0:41

0:46

0:51

0:56

1:01

1:06

1:11

% C

ore

Util

izat

ion

used

by

vMot

ion

84% 84% Lower

Source CPU Utilization Time (s) Time (s)

Page 4: RDMA In Virtual Environments (v1.0)

Hypervisor level RDMA Requirements • Integrate OFED Kernel Space Mid-Layer and

Provider components into ESXi hypervisor • Add RDMA Verbs support for hypervisor

services: vMotion, FT, vSAN, vRDMA, iSCSI • Create RDMA Device layer for hardware drivers

to plug into • Work in progress

April 25, 2013 VMware 4

Page 5: RDMA In Virtual Environments (v1.0)

Options to offer RDMA to vSphere Virtual Machines • Full-function VM DirectPath (passthrough) • SR-IOV VF VM DirectPath (passthrough) • SoftRoCE over 10GbE in VM DirectPath mode • SoftRoCE over paravirtual Ethernet vNIC over

10GbE uplink • SoftRoCE over paravirtual Ethernet vNIC

between VMs • Paravirtual RDMA HCA (vRDMA) offered to VM

April 25, 2013 VMware 5

≥ ESXi 4.0

≥ ESXi 5.1

Prototyping/Future

Page 6: RDMA In Virtual Environments (v1.0)

SR-IOV VF VM DirectPath

• Single-Root IO Virtualization (SR-IOV): PCI-SIG standard

• Physical (IB/RoCE/iWARP) HCA can be shared between VMs or by the ESXi hypervisor – Virtual Functions direct

assigned to VMs – Physical Function controlled

by hypervisor • Still VM DirectPath, which is

incompatible with many important vSphere features

April 25, 2013 VMware 6

RDMA HCA VF Driver

RDMA HCA VF Driver

I/O MMU

PF Device Driver

VF VF VF

PF SR-IOV RDMA HCA

Guest OS

RDMA HCA VF Driver

Guest OS Guest OS

Virtualization Layer

OFED Stack

OFED Stack

OFED Stack

RDMA HCA VF Driver

RDMA HCA VF Driver

Page 7: RDMA In Virtual Environments (v1.0)

Paravirtual RDMA HCA (vRDMA) offered to VM • New paravirtualized device

exposed to Virtual Machine – Implements “Verbs” interface

• Device emulated in ESXi hypervisor – Translates Verbs from Guest to

Verbs to ESXi “OFED Stack” – Guest physical memory regions

mapped to ESXi and passed down to physical RDMA HCA

– Zero-copy DMA directly from/to guest physical memory

– Completions/interrupts “proxied” by emulation

• “Holy Grail” of RDMA options for vSphere VMs

April 25, 2013 VMware 7

vRDMA HCA Device Driver

Physical RDMA HCA Device Driver

Physical RDMA HCA

vRDMA Device Emulation

Guest OS OFED Stack

ESXi “OFED Stack” I/O

Stack

Page 8: RDMA In Virtual Environments (v1.0)

Guest Kernel vRDMA Driver

• Registers kernel verbs functionality with ib_core framework

• Re-uses response structures from ib_user_verbs.h

Page 9: RDMA In Virtual Environments (v1.0)

ESXi VMkernel vRDMA Backend

Page 10: RDMA In Virtual Environments (v1.0)

vRDMA: VMs On Different Hosts

Presenter
Presentation Notes
Text highlighted in purple is implemented by us
Page 11: RDMA In Virtual Environments (v1.0)

vRDMA: VMs On The Same Host

Page 12: RDMA In Virtual Environments (v1.0)

vRDMA Prototype Status

• Prototyped by intern in CTO office in summer ‘12 • MAD Verbs functional • Data path Verbs work-in-progress • Estimated performance

– 5 µs HRT for RDMA Write with polling completion – Back-of-envelope estimate based on VM-exit

overheads and VMCI performance benchmarks

April 25, 2013 VMware 12

Page 13: RDMA In Virtual Environments (v1.0)

RDMA Use Cases

• Traditional Enterprise • HPC • Big Data • Storage • Messaging • Other Scale Out Applications • What’s your preferred method for obtaining

device drivers? – Linux distribution OFED – Hardware vendor OFED

April 25, 2013 VMware 13

Page 14: RDMA In Virtual Environments (v1.0)

RDMA Adoption Trend

• What % of your organization’s applications leverage RDMA today?

• What % by 2016? • Is RDMA driving your hardware purchasing

decisions? • Are any of your RDMA-based applications

stateful?

April 25, 2013 VMware 14

Page 15: RDMA In Virtual Environments (v1.0)

Moving the enterprise to virtual RDMA • Do you want to run RDMA-based applications in

a VM? • Would you use a virtual RDMA device in a VM?

– vMotion, HA, Snapshots, DRS • What is your tolerance for latency overhead?

Aaron Blasius

[email protected]

April 25, 2013 VMware 15

Page 16: RDMA In Virtual Environments (v1.0)

VMware

Thank You

16 April 25, 2013