23
Network Requirements for Resource Disaggregation Peter Gao (Berkeley), Akshay Narayan (MIT), Sagar Karandikar (Berkeley), Joao Carreira (Berkeley), Sangjin Han (Berkeley), Rachit Agarwal (Cornell), Sylvia Ratnasamy (Berkeley), Scott Shenker (Berkeley/ICSI)

Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

  • Upload
    lyanh

  • View
    227

  • Download
    1

Embed Size (px)

Citation preview

Page 1: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Network Requirements for Resource Disaggregation

Peter Gao (Berkeley), Akshay Narayan (MIT), Sagar Karandikar (Berkeley), Joao Carreira (Berkeley), Sangjin Han (Berkeley),

Rachit Agarwal (Cornell), Sylvia Ratnasamy (Berkeley), Scott Shenker (Berkeley/ICSI)

Page 2: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Current Datacenter: Server-Centric Future datacenter: Disaggregated?

Disaggregated Datacenters

GPU GPUGPUGPU

Datacenter Network

Datacenter Network

HP – The Machine Intel - RSD Facebook

Huawei - NUWA SeaMicro Berkeley - FireBox2

Page 3: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Disaggregation Benefits (Architecture Community)

3

Overcome memory capacity wallHigher resource density

SimplifyHardware

Design

RelaxPower & Capacity

Scaling

Page 4: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Network is the key

Network

GPU

Network

GPUGPU

4

QPI, SMI, PCI-e

Existing prototypes use specialized hardware, such as Silicon Photonics, PCI-e

Server-Centric Disaggregated

GPU

Do we need specialized hardware?e.g.: Silicon photonics, PCI-e

Page 5: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

• What end-to-end latency and bandwidth must the network provide for legacy apps?

• Do existing transport protocols meet these requirements?

• Do existing OS network stacks meet these requirements?

• Can commodity network hardware meet these requirements?

Application

OS

Transport

NIC

Switch

Remote Resource

OS

Transport

NIC

5

OS

Transport

NIC

Switch

OS

Transport

NICCommodity hardware solutions may be sufficient ✔

Current OS and network stack are not(Solutions are feasible) ✘

Worst case performance degradation

Page 6: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Assumptions

CPU

Memory

Storage

Scale

Datacenter Network

• Limited cache coherence domain• Small amount of local cache (how much?)

• Page-level remote memory access

• Block-level distributed data placement

• Rack-scale?• Datacenter-scale?

6

Cache Coherence

Page 7: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Methodology: Workload Driven

• 10 workloads on 8 applications

• ~ 125 GB input data

• 5 m3.2xlarge EC2 nodes

• Virtual Private Cloud enabled

Latency and Bandwidth Requirements

7

Key-value Store

SQL

Streaming

Wordcount

Sort

Pagerank

Collaborative Filter.

Spark Hadoop

Timely Dataflow

Graphlab

Memcached HERD

Spark SQL

Spark Streaming

Batch Processing

Interactive

Workloads Applications

Page 8: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Disaggregated Datacenter EmulatorOS

Memory

Special Swap Device (Handles Page Fault)

Local RAM Emulated Remote RAM

8

• Backed by the machine’s own memory• Partition the memory into local and remote

Free to access Via swap device

Inject latency and bandwidth constraints• Using special swap device• Delay = latency + request size / bandwidth• Akin to a dedicated link between CPU and memory

Page 9: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

*Note: Delay = latency + request size / bandwidth

Latency and Bandwidth Requirement

5% degradation

9

5% degradation

1us 5us 10us

100 G

bps

40 G

bps

10 G

bps

100 G

bps

40 G

bps

10 G

bps

100 G

bps

40 G

bps

10 G

bps

~3us latency / 40Gbps bandwidth is enough, ignoring queueing delay

Page 10: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Understanding Performance Degradation

10

Spark StreamingWordcount

MemcachedYCSB

GraphlabCF

HadoopSort

HadoopWordcount

TimelyPagerank

HERDYCSB

SparkSQLBDB

SparkSort

SparkWordcount

Performance degradation is correlated with application memory bandwidth

Page 11: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Application

OS

Transport

NIC

Switch

Remote Resource

OS

Transport

NIC

Application Remote Resource

• 3us end-to-end latency• 40Gbps dedicated link (no queueing delay)

11

Page 12: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Transport Simulation Setting

Special Swap Instrumentation Network Simulator

Flow Trace

Flow completion time distribution

12

Need new transport protocols

Page 13: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Application Performance Degradation

~5% degradation

40Gbpsnetwork

100Gbpsnetwork

~5% degradation

13

40Gbps (no queueing delay) DC Scale (with queueing delay) Rack Scale (with queueing delay)

100Gbps (no queueing delay) DC Scale (with queueing delay) Rack Scale (with queueing delay)

100Gbps networkDC scale for some apps, rack scale for others

Page 14: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Application

OS

Transport

NIC

Switch

Remote Resource

OS

Transport

NIC

• 3us end-to-end latency• 40Gbps dedicated link

Transport Transport

• Efficient Transport• 100Gbps network

14

Page 15: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Is 100Gbps/3us achievable?

15

Page 16: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Feasibility of end-to-end latency within a rack

Application Remote Resource

0.32us 0.8us 2us 1.9us

Propagation Transmission Switching Data Copying OS

3us Target

16*Numbers estimated optimistically based on existing hardware

Page 17: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Feasibility of end-to-end latency within a rack

Application Remote Resource

0.32us 0.8us 2us 1.9us

Propagation Transmission Switching Data Copying OS

3us Target

Application Remote Resource

0.32us 0.8us

Propagation Transmission Switching

2us 1.9us

Data Copying OSCut-through Switch

0.48us

15*Numbers estimated optimistically based on existing hardware

Page 18: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Feasibility of end-to-end latency within a rack

Application Remote Resource

0.32us 0.8us 2us 1.9us

Propagation Transmission Switching Data Copying OS

3us Target

Application Remote Resource

0.32us 2us

Data Copying

0.48us

Cut-through Switch NIC Integration

1us 1.9us

OS

15*Numbers estimated optimistically based on existing hardware

Page 19: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Feasibility of end-to-end latency within a rack

Application Remote Resource

0.32us 0.8us 2us 1.9us

Propagation Transmission Switching Data Copying OS

3us Target

Application Remote Resource

0.32us 0.48us

Cut-through Switch NIC Integration

1us 1.9us

OSUse RDMA

3us Target

15*Numbers estimated optimistically based on existing hardware

Feasible to meet target across the datacenter?

Page 20: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Application

OS

Transport

NIC

Switch

Remote Resource

OS

Transport

NIC

• 3us end-to-end latency• 40Gbps dedicated link

• Efficient Transport (pFabric,SIGCOMM’13, pHost,CoNEXT’15)

• 100Gbps network (Available)

• Kernel bypassing (RDMA common)

• CPU-NIC Integration (Coming soon)• Cut-through switch (Common?)• 100Gbps links (Available)

Application

OS

Transport

NIC

Switch

Remote Resource

OS

Transport

NIC

16

Page 21: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

What’s next?

17

Please refer our paper for evaluations on improving application performance in disaggregated datacenters

Application Design

Rethinking OS Stack

StorageNetwork

StackFailure Models

Network Fabric Design

Page 22: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for

Thank You!

Peter X. Gao Akshay Narayan Sagar Karandikar Joao Carreira Sangjin Han

Sylvia Ratnasamy Scott ShenkerRachit Agarwal

Page 23: Network Requirements for Resource Disaggregation - … · Network Requirements for Resource Disaggregation ... • What end-to-end latency and bandwidth must the network provide for