21
February 12, 2017 1 Fault Tolerant Computing CS 530 Probabilistic Methods: Review LN 6 Yashwant K. Malaiya Colorado State University

Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 1

Fault Tolerant Computing CS 530

Probabilistic Methods: ReviewLN 6

Yashwant K. MalaiyaColorado State University

Page 2: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

2

Probabilistic Methods

•  Much of this may be a review of probability and statistics you have taken elsewhere.

•  We cannot predict exactly when something will fail, but we can calculate the probability of a failure, and what can be done to reduce that.

•  This is similar to what insurance industry does: they may not know when a person will die, but they can compute life-expectancy of someone who is say, 45 years old, and maintains an ideal weight.

Page 3: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

3

Probabilistic Methods: Overview•  We can have concrete numbers even in presence of uncertainty.Topics:•  Probability

§  Disjoint events§  Statistical dependence

•  Random variables and distributions§  Discrete distributions: Binomial, Poisson§  Continuous distributions: Gaussian, Exponential

•  Stochastic processes§  Markov process§  Poisson process

Page 4: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

4

Basics•  Probability of an event A

if A occurs n times among N equally likely outcomes.•  Probability is a number between 0 and 1.•  Ex: Roll of a die

•  If more information is available, probability of the same event changes. If we know die is loaded, perhaps

NnAP =}{

5.063}{ ==oddP

6.0}{ =oddP is possible.

Page 5: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

5

Basics Concepts•  Prob. Of union of two events:

§  •  Ex: Roll of a die

•  If A and B are disjoint, i.e. if (i.e. empty set),

A B

P{A∪ B} = P{A}+P{B}−P{A∩ B}

65

61

63

63

}3{}3{}{}3{

=−+=

≤−≤+=

∩∪

evenPPevenPoutcomeevenoutcomeP

∪A

P{A∪ B} = P{A}+P{B}

P{A} =1−P{A}

A∩ B = φ

A∩ B

Page 6: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

6

Conditional Probability

•  Conditional probability

•  If A and B are independent, P{A|B}= P{A}. Then

•  Example: A toss of a coin is independent of the outcome of the previous toss.

P{A | B} = P{A∩ B}P{B}

for P{B}> 0

P{A|B} is the probability of A, given we know B has happened.

P{A∩ B} = P{A}P{B}

Page 7: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

7

Conditional Probability

•  If A can be divided into disjoint Ai, i=1,..,n, then

•  Example: A chip is made by two factories A and B. One percent of chips from A and 0.5% from B are found defective. A produces 90% of the chips. What is the probability a randomly encountered chip will be defective?

•  P{a chip is defective} = (1/100)x0.9 + (0.5/100)x0.1 =0.0095 i.e. 0.95%

.}{}|{}{ ∑=i

ii APABPBP

Page 8: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

8

Bayes’ Rule

•  Conditional probability

•  Bayes’ Rule

•  Example: A drug test produces 99% true positive and 99% true negative results. 0.5% are drug users. If a person tests positive, what is the probability he is a drug user?

P{A | B} = P{A∩ B}P{B}

for P{B}> 0

P{A|B} is the probability of A, given we know B has happened.

P{A | B} = P{B | A}P{A}P{B}

for P{B}> 0

P{DU | P} = P{P |DU}P{DU}P{P |DU}P{DU}+P{P | nDU)P{nDU}

= 33.3%

Page 9: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

9

Random Variables•  A random variable (r.v.) may take a specific random value at a time. For example

§  X is a random variable that is the height of a randomly chosen student§  x is one specific value (say 5’9”)

•  A random variable is defined by its density function.•  A r.v. can be continuous or discrete

∑∫

∑∫

=

=

+≤≤

max

min

max

min

max

minmin

)()()(

)()()(

)(}{)(

i

iiii

x

x

i

iii

x

x

i

xpxdxxfxXE

xpdxxfxF

xpdxxXxPdxxfdiscretecontinuous

Density function

“Cumulative distribution function” (cdf)

Expected value (mean)

Page 10: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

10

Distributions, Binomial Dist.

•  Note that

•  Major distributions: §  Discrete: Bionomial, Poisson§  Continuous: Gaussian, expomential

•  Binomial distribution: outcome is either success or failure§  Prob. of r successes in n trials, prob. of one success being p

1)(1)(max

min

max

min

== ∑∫i

ii

x

x

xpdxxf

nrforppr

nrf rnr ,,0)1()( …=−⎟

⎟⎠

⎞⎜⎜⎝

⎛= −

)!(!!rnr

nCrn

rn

−==⎟⎟

⎞⎜⎜⎝

⎛incidentally

Page 11: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

11

Distributions: Poisson•  Poisson: also a discrete distribution, λ is a parameter.

•  Example: µ = occurrence rate of something.§  Probability of r occurrences in time t is given by

!)()(retrf

tr µµ −

=

!)(

xexfx λλ −

=

Often applied to fault arrivals in a system

Page 12: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

12

Distributions: Gaussian1809 AD

•  Continuous. Also termed Normal (called Laplacian in France!1774 AD)

+∞≤≤−∞

−−

=

x

x

exf ,21)( 2

2

2)(

2σµ

πσ

40 50 60 70 80 90 100

0.00

0.01

0.02

0.03

0.04

0.05

0.06

0.07

0.08

Grades

Den

sity

Bell-shaped curve

µ = 70 σ = 5

µ = 70 σ = 10

mean : )variance(

iswhich deviation standard :

µ

σ

Laplace discovered it beforeGauss in 1774 AD!

Page 13: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

13

Normal distribution (2)

•  Tables for normal distribution are available, often in terms of standardized variable z=(x- µ)/σ.

•  (µ-σ, µ+σ) includes 68.3% of the area under the curve.

•  (µ-3σ, µ+3σ) includes 99.7% of the area under the curve.

•  Central Limit Theorem: Sum of a large number of independent random variables tends to have a normal distribution. The reason why normal distribution is applicable

in many cases

Page 14: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

14

Exponential & Weibull Dist.Exponential Distribution: is a

continuous distribution. §  Density function

f(t) = λ e- λ t 0<t≤∞Example:•  λ: exit or failure rate.•  Pr{exit the good state during (t, t

+dt)}= e- λt λ dt

•  The time T spent in good state has an exponential distribution

•  Weibull Distribution: is a 2-parameter generalization of exponential distribution. Used when better fit is needed, but is more complex.

λ

State 0

0

0 5 0 10 0 15 0

t i me

f(t)

e-λ t

1/λ

0. 37 λ

λ

Page 15: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

15

Variance & Covariance

•  Variance: a measure of spread§  Var{X} = E[X-µx]2

§  Standard deviation = (Var{x})1/2 §  σ = standard deviation (usually for normal dist)

•  Covariance: a measure of statistical dependence§  Cov{X,Y} = E[(X-µx)(Y-µy)]§  Correlation coefficient: normalizedρxy = Cov{X,Y}/ σx σy

Note that 0<|ρxy|<1

Page 16: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

16

Stochastic Processes

•  Stochastic process: that takes random values at different times.§  Can be continuous time or discrete time

•  Markov process: discrete-state, continuous time process. Transition probability from state i to state j depends only on state i (It is memory-less)

•  Markov chain: discrete-state, discrete time process.•  Poisson process: is a Markov counting process N(t),

t ≥ 0, such that N(t) is the number of arrivals up to time t.

Page 17: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

17

Poisson Process: properties

•  Poisson process: A Markov counting process N(t), t ≥ 0, N(t) is the number of arrivals up to time t.

•  Properties of a Poisson process:§  N(0) = 0§  P{an arrival in time Δt} = λΔt§  No simultaneous arrivals

•  We will next see an important example. Assuming that arrivals are occurring at rate λ, we will calculate probability of n arrivals in time t.

Page 18: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

18

Poisson process: analysis•  A process is in state I, if I arrivals have occurred.•  Pi(t) is the probability the process is in state i.

•  In state i, probability is flowing in from state i-1, and is flowing out to state i+1, in both cases governed by the rate λ. Thus

λ λ λ λ

…0 1 i

i arrivals

,..1,0)()()(

1 =+−= − ntPtPdttdP

iii λλ

We’ll solve it first for P0(t), then for P1(t), then …

Page 19: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

19

Poisson process: Solution for P0(t)

λ λ λ λ

…0 1 i

i arrivals

)()(

)()()(]1)[()(}0{

00

000

00

0

tPdttdP

tPt

tPttPttPttP

stateinprocessPP

λ

λ

λ

−=

−=Δ−Δ+

Δ−=Δ+

=

t

t

etPCPSince

eCtP

CttPSolution

λ

λ

λ

=

==

=

+−=

)(

,1,1)0()(

))(ln(:

0

20

20

0

Page 20: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

20

Poisson Process: General solution

,..1,0!)()(

,

== − nenttP

getweyrecursivelSolving

tn

nλλ

We need to solve ,..1,0)()(

)(1 =+−= − ntPtP

dttdP

iii λλ

Using the expression for P0(t), we can solve it for P1(t).

Which we know is Poisson distribution!

Page 21: Fault Tolerant Computing - Colorado State Universitycs530dl/s17/6Prob.pdf · 2017. 2. 13. · February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya 2 Probabilistic Methods •

February 12, 2017 Fault Tolerant Computing ©Y.K. Malaiya

21

Poisson Process: Time between Two Events

time

T

ith arrival

t

t

tiii

etf

etTPtF

etttinarrivalnoPttP

λ

λ

λ

λ −

−+

=

−=≤≤=

=+=>

)(get wesides,both atingdifferenti

cdf, of derivative isfunction density theSince1}0{)(

bygiven is (cdf)function on distributi cumulative theThus)},({}{ 1

Exponential distribution

i+1th arrival

Here we’ll show that the time to next arrival is exponentially distributed.