CompSci514: Computer Networks Lecture 4: End-to-end ... · The TCP Congestion Collapse Incident...

CompSci 514: Computer NetworksLecture 4: End-to-end Congestion

Control

Xiaowei Yang

Outline• TCP congestion collapse

• TCP congestion control algorithm

• Analysis

• Throughput and loss rate modeling

A fundamental networking problem

• Packet switching– Statistical multiplexing

• Q: N users, and one network– How fast should each user send?– Or when should a packet be sent?– Why is it a difficult problem?

• A à C, B à D• How fast should A and B send?• Don’t know the number of users• Don’t know the bottleneck capacity

D1Gbps

10Mbps

The TCP Congestion Collapse Incident

• In October of 1986, the Internet had the first of what became a series of �congestion collapse�: increased load leads to decreased throughput

• The NSFnet phase-I backbone dropped three orders of magnitude from its capacity of 32 kbit/s to 40 bit/s, and continued until end nodes started implementing Van Jacobson's congestion control between 1987 and 1988.

What caused the TCP congestion collapse?

• Original TCP (Cerf74)– Static window W– W = # of unacknowledged bytes

Send 1, 2, 3, …, W bytes

ACK 1, 2, 3, …, WSend W+1, W+2, … 2W bytes

ACK W+1, W+2, … 2W bytes

Review of TCP’s sliding window algorithm

• A well-known algorithm in networking• Used for

– Reliable transmission– Flow control– Congestion control

Definitions

• Round Trip Time (RTT)– The time it takes from sending a packet to

receiving an ACK for that packet– RTTmin = minimum RTT observed during a

connection

• Bandwidth Delay Product (BDP)– Bottleneck Capacity * RTTmin

Drawbacks of static window sizes

• One TCP flow– MSS = 512 bytes, Window size = 10 MSS,

RTTmin = 100ms• 10 TCP flows, 100 TCP flows, …

What caused the TCP congestion collapse?

• Original TCP (Cerf74)– Static window W– W = # of unacknowledged bytes

• When user increases, queueing delay increases• TCP times out, retransmitting the whole window• TCP retransmits too early, wasting the

network’s bandwidth to retransmit the same packets already in transit and reducing useful throughput (goodput)

Retransmitting packets that are not lost à congestion collapse

How can we prevent this problem?

Design Goals• Congestion avoidance: making the

system operate around the knee to obtain low latency and high throughput

• Congestion control: making the system operate left to the cliff to avoid congestion collapse

• Congestion avoidance: making the system operate around the knee to obtain low latency and high throughput

Design goals

• More concretely– Efficiency

• High throughput/utilization– Low latency– Fairness– Fast convergence

Key Improvements from the TCP88 paper

• Revised RTO computation– Prevent spurious retransmissions

• ACK self-clocking– Determines when to send a packet

• Dynamic window sizing– Determines how fast a sender can send a

packet

Revised RTO estimates

• Old design: RTTn+1 = a RTT + (1- a ) RTTnRTO = β RTTn+1

• Improved designsrttn+1 = a RTT + (1- a ) srttnrttvarn+1 = b ( | RTT – srttn | ) + (1- b )rttvarn

RTOn+1 = srttn+1 + 4 rttvarn+1

Exponential backoff• When a timeout occurs , the RTO value is

doubledRTOn+1 = max (2*RTOn , 64) seconds

• Can save the day in the worst case

ACK self-clocking

• When pipe is full, the speed of ACK returns equals to the speed new packets should be injected into the network

Why is self-clocking important

• Determines when a packet should be sent– A flow should send no faster than what its

bottleneck allows– ACK spacing correlates with bottleneck

capacity

Does ACK clocking solve it all?• No

• A sender will not send faster, but may under-utilize bandwidth or cause packets being buffered at the bottleneck

Dynamic window sizing• Sending speed: W / RTT

• Ideally, W / RTT = available bandwidth at the bottleneck

• à Adjusting W based on available bandwidth

1. Increases W when there is no congestion2. Decreases W when there is congestion

How to detect congestion?

• Packet loss– Time out– Duplicate ACKs

• Latency– Current RTT – RTTmin

• Sending rate / ACKing rate

• ML? 31

TCP Reno: Two Modes of Congestion Control

1. Probing for the available bandwidth– slow start (W < ssthresh)

2. Avoid overloading the network– congestion avoidance (W >= ssthresh)

Slow Start• Initial value: Set W = 1 MSS

• Modern TCP implementation may set initial window size to a much larger value

• When receiving an ACK, W+= 1 MSS

• W doubles every RTT– Exponential increase

Congestion Avoidance

• If W >= ssthresh then each time an ACK is received, increases W as follows:

• W += 1 / W

• W increases by one MSS per RTT

Reaction to Congestion

• Reduce W

• Timeout: severe congestion– W is reset to one MSS:– ssthresh is set to half of the current size of the

congestion window:ssthressh = W / 2

– entering slow-start

Reaction to Congestion• Duplicate ACKs: not so congested (why?)• Fast retransmit

– Three duplicate ACKs indicate a packet loss

– Retransmit without timeout

Reaction to congestion: Fast

Recovery

• Avoiding slow start (changed from TCP88)

– ssthresh = W / 2

– W = W +3MSS

– Increase W by one MSS for each additional duplicate ACK

• When ACK arrives that acknowledges “new data,” set:

W = ssthresh = W/2

enter congestion avoidance, i.e., increase W by 1 MSS per RTT

The Sawtooth behavior of TCP

• For every ACK received– W += 1/W

• For every packet lost– W /= 2

• Only true in unit of RTT

Why does it work? [Chiu-Jain]

• A feedback control system• The network uses feedback y to adjust users�

load åx_i

Goals of Congestion Avoidance

– Efficiency: the closeness of the total load on the resource of its knee: åx_i ~= capacity

– Fairness:

• When all x_is are equal, F(x) = 1• When all x_i�s are zero but x_j = 1, F(x) = 1/n

– Distributedness• A centralized scheme requires complete knowledge of the state of

the system

– Convergence• The system approach the goal state from any starting state

Metrics to measure convergence

• Responsiveness• Smoothness

Model the system as a linear control system

• Four sample types of controls• AIAD, AIMD, MIAD, MIMD

Phase space

X1=W1/RTT

X2=W2/RTT

TCP congestion control is AIMD

• Problems:– Each source has to probe for its bandwidth– Congestion occurs first before TCP backs off– Unfair: long RTT flows obtain smaller bandwidth

shares

Macroscopic behavior of TCP

pRTTMSS••5.1

• Throughput is inversely proportional to RTT:

• In a steady state, total packets sent in one sawtooth cycle:– S = w + (w+1) + … (w+w) = 3/2 w2

• the maximum window size is determined by the loss rate– 1/S = p– w =

• The length of one cycle: w * RTT• Average throughput: 3/2 w * MSS / RTT

Why is congestion control still relevant

• The Mice & Elephant phenomenon of Internet traffic– Many small flows– But a few large flows sent most of the byte

Conclusion

• Congestion control is one of the fundamental issues in networking

• TCP congestion control algorithm• The AIMD algorithm• TCP macroscopic behavior model

CompSci514: Computer Networks Lecture 4: End-to-end ... · The TCP Congestion Collapse Incident...

Documents

Approximating Fair Queueing on Reconfigurable Switches · Congestion Control Today Primarily achieved using end-to-end protocols (TCP, DCTCP, TIMELY, ..) •End-hosts react to signals

Promoting the Use of End-to-End Congestion Control in the

End-to-End Monitoring of Network Traffic and Congestion Events on

CompSci514: Computer Networks Lecture 5: Congestion Control

Performance Analysis and Enhancement of TCP in Presence of ... · In addition to this, challenges for applying TCP over wireless link is an end-to-end congestion control. An end-to-end

End-to-End Bandwidth Estimation for Congestion Control in ... · packets experience congestion along the backward path they traverse and arrive bunched. The latter phenomena, known

End-to-end Congestion Management for the NGI Hari Balakrishnan MIT Laboratory for Computer Science DARPA NGI PI Meeting October

Detecting Shared Congestion of Flows Via End-to-end Measurement (and other inference problems)

Congestion Control - unipi.ita008149/corsi/reti/lucidi/07-Congestion-Control.pdf · TCP congestion control algorithm Congestion Control 4 ... Congestion Control 24 How Congestion

TCP End-To-End Congestion Control

End-to-End Congestion Control for High Performance Data

Congestion-aware Evacuation Routing using Augmented ... · from user-end Augmented Reality (AR) devices, is used to model the congestion distribution inside a building. To efﬁciently

This Lecture: Congestion Controlsrini/15-744/F09/lectures/04-tcpintro.pdf · congestion control: • Routers provide feedback to end systems • Single bit indicating congestion (SNA,

1. Congestion Control Congestion Control Factors that Cause Congestion Factors that Cause Congestion Congestion Control vs Flow Control Congestion

P802.1Qcz Congestion Isolation - Welcome to Mentor · P802.1Qcz –Congestion Isolation - Goals •Work in conjunction with higher-layer end-to-end congestion control (ECN, BBR, etc)

Promoting the Use of End-to-End Congestion Control in the Internet

End-to-End Congestion Control Protocols for Remote … · 2008-01-23 · End-to-End Congestion Control Protocols for Remote Programming of Robots using Heterogeneous Networks: A Comparative

TCP End-To-End Congestion Control Wanida Putthividhya Dept. of Computer Science Iowa State University Jan, 27 th 2002 (May, 25 th 2001)

JOB 629 Gridlock reportmyweb.liu.edu/~scarlin/reports/gridlock.pdfforeseeable future, congestion will continue to be with us; we cannot “solve” congestion. We ... The East End

End-to-end transport protocols, TCP, congestion control ...cseweb.ucsd.edu/~gmporter/classes/wi15/cse124/... · TCP Flow Control and Congestion Control Sender must determine maximum