56
Will E. Leland, Walter Willinger and Daniel V. Wilson BELLCORE Murad S. Taqqu Boston University CS634 ADVANCED COMPUTER NETWORKING COMPUTER SCIENCE COLLEGE OF WILLIAM AND MARY ON THE SELF-SIMILAR NATURE OF ETHERNET TRAFFIC Presented by: Feng Yan

CS634 Advanced Computer Networking Computer Science College of William and Mary

  • Upload
    kaz

  • View
    43

  • Download
    0

Embed Size (px)

DESCRIPTION

On the Self-Similar Nature of Ethernet Traffic . Presented by: Feng Yan. Will E. Leland, Walter Willinger and Daniel V. Wilson BELLCORE Murad S. Taqqu Boston University. CS634 Advanced Computer Networking Computer Science College of William and Mary. Overview. - PowerPoint PPT Presentation

Citation preview

Page 1: CS634      Advanced Computer Networking Computer Science College of William and Mary

Will E. Leland, Walter Willinger and Daniel V. Wilson BELLCOREMurad S. Taqqu Boston University

CS634 ADVANCED COMPUTER NETWORKING

COMPUTER SCIENCECOLLEGE OF WILLIAM AND MARY

ON THE SELF-SIMILAR NATURE OF ETHERNET TRAFFIC

Presented by: Feng Yan

Page 2: CS634      Advanced Computer Networking Computer Science College of William and Mary

OVERVIEW

What is Self Similarity?

Ethernet Traffic is Self-Similar

Implications of Self Similarity

Conclusion

Discussion2

Page 3: CS634      Advanced Computer Networking Computer Science College of William and Mary

PART 1:

WHAT IS SELF-SIMILARITY ?

Page 4: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

Something “feels the same” regardless of scale

4What is that???

Page 5: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

Something “feels the same” regardless of scale

5

Self-similar in nature

Page 6: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

Something “feels the same” regardless of scale

6

The Koch snowflake fractal

Page 7: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

Something “feels the same” regardless of scale

7

The Koch snowflake fractal

Page 8: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

Something “feels the same” regardless of scale

8

The Koch snowflake fractal

Page 9: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

Something “feels the same” regardless of scale

9

Page 10: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

10

Categories:

Exact self-similarity: Strongest Type

Approximate self-similarity: Loose Form

Statistical self-similarity: Weakest Type

Page 11: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION OF SELF-SIMILARITY

11

Approximate self-similarity:Recognisably similar but not exactly so.

e.g. Mandelbrot set

Statistical self-similarity:Only numerical or statistical

measures that are preserved across scales

Page 12: CS634      Advanced Computer Networking Computer Science College of William and Mary

STOCHASTIC OBJECTS

In case of Stochastic Objects

e.g. time-series

Self-similarity is used in the distributional sense

12

Page 13: CS634      Advanced Computer Networking Computer Science College of William and Mary

WHY SELF-SIMILARITY IMPORTANT?

Recently, network packet traffic has been identified as being self-similar.

Current network traffic modeling using Poisson distributing (etc.) does not take into account the self-similar nature of traffic.

This leads to inaccurate modeling of network traffic. 13

Page 14: CS634      Advanced Computer Networking Computer Science College of William and Mary

PROBLEMS WITH CURRENT MODELS

A Poisson process When observed on a fine time scale will

appear bursty When aggregated on a coarse time scale

will flatten (smooth) to white noise

A Self-Similar (fractal) process When aggregated over wide range of

time scales will maintain its bursty characteristic

14

Page 15: CS634      Advanced Computer Networking Computer Science College of William and Mary

SELF-SIMILARITY BY PICTURE

15

packets per time unit

Ethernet traffic August’89 trace

Page 16: CS634      Advanced Computer Networking Computer Science College of William and Mary

CURRENT MODELING BY PICTURE

16

Page 17: CS634      Advanced Computer Networking Computer Science College of William and Mary

SIDE-BY-SIDE VIEW

17

Page 18: CS634      Advanced Computer Networking Computer Science College of William and Mary

COMPARE BY VIEW

18

Page 19: CS634      Advanced Computer Networking Computer Science College of William and Mary

CONSEQUENCES OF SELF-SIMILARITY

19

Bursty DataStreams

Aggregation

Smooth PatternStreams

Bursty DataStreams

Aggregation

Bursty AggregateStreams

Reality (self-similar):

Current Model:

Consequence: Inaccuracy

Page 20: CS634      Advanced Computer Networking Computer Science College of William and Mary

MATHEMATICAL DEFINITIONS

Long-range Dependence autocorrelation decays slowly

Hurst Parameter Developed by Harold Hurst (1965) H is a measure of “burstiness”

▪ also considered a measure of self-similarity 0 < H < 1 H increases as traffic increases

▪ i.e., traffic becomes more self-similar20

Page 21: CS634      Advanced Computer Networking Computer Science College of William and Mary

PROPERTIES OF SELF SIMILARITY

X = (Xt : t = 0, 1, 2, ….) is covariance stationary random process (i.e. Cov(Xt,Xt+k) does not depend on t for all k)

Let X(m)={Xk(m)} denote the new process obtained by

averaging the original series X in non-overlapping sub-blocks of size m.

Mean , variance 2

Suppose that Autocorrelation Function r(k) k -β, 0<β<1

21

e.g. X(1)= 4,12,34,2,-6,18,21,35

Then X(2)=8,18,6,28

X(4)=13,17

Page 22: CS634      Advanced Computer Networking Computer Science College of William and Mary

DEFINITION BY AUTO-CORRELATION

X is exactly second-order self-similar if The aggregated processes have the same

autocorrelation structure as X. i.e. r (m) (k) = r(k), k0 for all m =1,2, …

X is asymptotically second-order self-similar ifthe above holds when [ r (m) (k) r(k), m ]

Most striking feature of self-similarity: Correlation structures of the aggregated process do not degenerate as m 22

Page 23: CS634      Advanced Computer Networking Computer Science College of William and Mary

23lag

ACF

DEFINITION BY AUTO-CORRELATION

Page 24: CS634      Advanced Computer Networking Computer Science College of William and Mary

24

DEFINITION BY AUTO-CORRELATION

Page 25: CS634      Advanced Computer Networking Computer Science College of William and Mary

TRADITIONAL MODELS

Correlation structures of their aggregated processes degenerate as m i.e. r (m) (k) 0 as m , for k = 1,2,3,...

Short Range Dependence Processes: Exponential Decay of autocorrelations i.e. r(k) ~ pk , as k , 0 < p < 1 Summation is finite

25

Page 26: CS634      Advanced Computer Networking Computer Science College of William and Mary

LONG RANGE DEPENDENCE Processes with Long Range Dependence are

characterized by an autocorrelation function that decays hyperbolically as k increases

Important Property: This is also called non-summability of correlation

kkr )(

26

Page 27: CS634      Advanced Computer Networking Computer Science College of William and Mary

INTUITION

The intuition behind long-range dependence:

While high-lag correlations are all individually small, their cumulative affect is important

Gives rise to features drastically different from conventional short-range dependent processes

27

Page 28: CS634      Advanced Computer Networking Computer Science College of William and Mary

THE MEASURE OF SELF-SIMILARITY

Hurst Parameter H , 0.5 < H < 1

Three approaches to estimate H (Based on properties of self-similar processes) Variance Analysis of aggregated

processes Rescaled Range (R/S) Analysis for

different block sizes: time domain analysis

Periodogram Analysis: frequency domain analysis (Whittle Estimator)

28

!

Page 29: CS634      Advanced Computer Networking Computer Science College of William and Mary

VARIANCE ANALYSIS Variance of aggregated processes decays

as: Var(X(m)) = am-b as m infinite,

For short range dependent processes (e.g. Poisson Process):

Var(X(m)) = am-1 as m infinite,

Plot Var(X(m)) against m on a log-log plot

Slope > -1 indicative of self-similarity 29

Page 30: CS634      Advanced Computer Networking Computer Science College of William and Mary

VARIANCE PLOT EXAMPLE

30

Slope=-1

Slope=-0.7

Page 31: CS634      Advanced Computer Networking Computer Science College of William and Mary

THE R/S STATISTIC

)],......,,0min(),......,,0[max()(

1)()(

2121 nn WWWWWWnSnS

nR

)(),(

),,....2,1:(2 nSVarianceSamplenXmeanSample

nkX k

)()....( 21 nXkXXXW kk

31

where

For a given set of observations,

Rescaled Adjusted Range or R/S statistic is given by

Page 32: CS634      Advanced Computer Networking Computer Science College of William and Mary

EXAMPLE

Xk = 14,1,3,5,10,3

Mean = 36/6 = 6W1 =14-(1*6 )=8W2 =15-(2*6 )=3W3 =18-(3*6 )=0W4 =23-(4*6 )=-1W5 =33-(5*6 )=3W6 =36-(6*6 )=0 32

R/S = 1/S*[8-(-1)] = 9/S

Page 33: CS634      Advanced Computer Networking Computer Science College of William and Mary

THE HURST EFFECT

For self-similar data, rescaled range or R/S statistic grows according to cnH H = Hurst Paramater, > 0.5

For short-range processes , R/S statistic ~ dn0.5

History: The Nile river In the 1940-50’s, Harold Edwin Hurst studied the 800-year record of

flooding along the Nile river. (yearly minimum water level) Finds long-range dependence.

33

Page 34: CS634      Advanced Computer Networking Computer Science College of William and Mary

POX PLOT EXAMPLE

34

Slope = 1.0

Slope = 0.5

Slope = 0.79

Page 35: CS634      Advanced Computer Networking Computer Science College of William and Mary

WHITTLE ESTIMATOR

Provides a confidence interval

Property: Any long range dependent process approaches fractional Gaussian noise (FGN), when aggregated to a certain level

Test the aggregated observations to ensure that it has converged to the normal distribution 35

Page 36: CS634      Advanced Computer Networking Computer Science College of William and Mary

SUMMARY

Self-similarity manifests itself in several equivalent fashions:

Non-degenerate autocorrelations Slowly decaying variance Long range dependence Hurst effect

36

Page 37: CS634      Advanced Computer Networking Computer Science College of William and Mary

PART 2:

ETHERNET TRAFFIC IS SELF-SIMILAR

Page 38: CS634      Advanced Computer Networking Computer Science College of William and Mary

THE FAMOUS DATA

Leland and Wilson collected hundreds of millions of Ethernet packets without loss and with recorded time-stamps accurate to within 100µs.

Data collected from several Ethernet LAN’s at the Bellcore Morristown Research and Engineering Center at different times over the course of approximately 4 years.

38

Page 39: CS634      Advanced Computer Networking Computer Science College of William and Mary

39

Page 40: CS634      Advanced Computer Networking Computer Science College of William and Mary

PLOTS SHOWING SELF-SIMILARITY (Ⅰ)

40H=0.5

H=0.5

H=1

Estimate H 0.8

Page 41: CS634      Advanced Computer Networking Computer Science College of William and Mary

PLOTS SHOWING SELF-SIMILARITY (Ⅱ)

41Higher Traffic, Higher H

High Traffic

Mid Traffic

Low Traffic1.3%-10.4%

3.4%-18.4%

5.0%-30.7%

Packets

Page 42: CS634      Advanced Computer Networking Computer Science College of William and Mary

H : A FUNCTION OF NETWORK UTILIZATION

Observation shows “contrary to Poisson”

Network Utilization H

42

As number of Ethernet users increases, the resulting aggregate traffic becomes burstier instead of smoother

Page 43: CS634      Advanced Computer Networking Computer Science College of William and Mary

DIFFERENCE IN LOW TRAFFIC H VALUES

Pre-1990: host-to-host workgroup traffic

Post-1990: Router-to-router traffic

Low period router-to-router traffic consists mostly of machine-generated packets Tend to form a smoother arrival stream,

than low period host-to-host traffic43

Page 44: CS634      Advanced Computer Networking Computer Science College of William and Mary

SUMMARY

Ethernet LAN traffic is statistically self-similar

H : the degree of self-similarityH : a function of utilizationH : a measure of “burstiness”

Models like Poisson are not able to capture self-similarity

44

Page 45: CS634      Advanced Computer Networking Computer Science College of William and Mary

PART 3:

IMPACT OF SELF SIMILARITY

Page 46: CS634      Advanced Computer Networking Computer Science College of William and Mary

COMPARISON

46

Page 47: CS634      Advanced Computer Networking Computer Science College of William and Mary

TWO EFFECTS

The superposition of many ON/OFF sources whose ON-periods and OFF-periods exhibit the Noah Effect produces aggregate network traffic that features the Joseph Effect.

47

Also known as packet train models

Noah Effect: high variability or infinite variance

Joseph Effect: Self-similar or

long-range dependent traffic

Page 48: CS634      Advanced Computer Networking Computer Science College of William and Mary

EXISTING MODELS

Traditional traffic models: finite variance ON/OFF source models

Superposition of such sourcesbehaves like white noise, with only short range correlations

48

Page 49: CS634      Advanced Computer Networking Computer Science College of William and Mary

EASY MODELING: NOAH EFFECT

Questions related to self-similarity can be reduced to practical implications of Noah Effect

Queuing and Network performance Network Congestion Controls Protocol Analysis

49

Page 50: CS634      Advanced Computer Networking Computer Science College of William and Mary

The Queue Length distribution Traditional (Markovian) traffic: decreases exponentially

fast Self-similar traffic: decreases much more slowly

Not accounting for Joseph Effect can lead to overly optimistic performance

50

Effect of H (Burstiness)

QUEUING PERFORMANCE

Page 51: CS634      Advanced Computer Networking Computer Science College of William and Mary

CONGESTION CONTROL

How to design the buffer size? Trade-off between Packet Lose and Packet

Delay

51

Page 52: CS634      Advanced Computer Networking Computer Science College of William and Mary

52

Packet Lose Packet Delay

Short Range Dependence

Decrease Exponentially

Fixed Limit

Long Range Dependence

Decrease Slowly Always Increase

Compare SRD and LRD when increase buffer size

CONGESTION CONTROL

Page 53: CS634      Advanced Computer Networking Computer Science College of William and Mary

PROTOCOL DESIGN

Protocol design should take into account knowledge about network traffic such as the presence or absence of the self-similarity.

53

Parsimonious Models Small number of parameters Every parameter has a physically meaningful

interpretation e.g. Mean , Variance 2, H Doesn’t quantify the effects of various factors in traffic

Page 54: CS634      Advanced Computer Networking Computer Science College of William and Mary

54

CONCLUSION Demonstrated the existence of self-similarity in Ethernet Traffic irrespective of time scales

Proposed the degree of self-similarity can be measured by Hurst parameter H (higher H implies burstier traffic)

Illustrated the difference between the self-similar and standard models

Explained Importance of self similarity in design, control, performance analysis

Page 55: CS634      Advanced Computer Networking Computer Science College of William and Mary

A USEFUL LINK AND MATERIALS

55

http://ita.ee.lbl.gov/html/contrib/BC.html

Page 56: CS634      Advanced Computer Networking Computer Science College of William and Mary

Questions?

THANK YOU!