8/2/2019 Chapter 2 Probabilistic Design
1/30
2-1
The "Bijlmer disaster": an El Al freight plane crashed into a block of flats in the Bijlmermeer (6-10-'92).
PROBABILITY IN CIVIL ENGINEERING
Chapter 2: Probability calculus
2.1 THE CONCEPT OF PROBABILITY
2.1.1 LAPLACE'S DEFINITION OF PROBABILITY
Laplace's (1812) classic definition of probability reads:
probability = (number of outcomes for which the event E occurs) / (number of possible results)    (2.1)
"If during an experiment in total n different and equally probable results are possible, and if for precisely m of the n outcomes the event E occurs, then the probability of this event equals m/n."
Laplace's definition mentions "equally probable results". This concept presupposes that all results have an equal probability, which means that the definition already uses the concept of probability it is trying to define. Furthermore, the probability of an event is not defined if the outcomes are not all equally probable.

It is therefore clear that a more rigorous formulation of the concept of probability is needed.
2.1.2 EXPERIMENTAL DEFINITION OF PROBABILITY
When an experiment is repeated n times, in which the event E occurs m times, the frequency quotient of the event equals m/n. The experimental law of large numbers states that, for increasing n, the value of the frequency quotient m/n of a certain event converges to a fixed value. In formula:

P(E) = lim_{n→∞} m/n    (2.2)
The larger the number of experiments n, the better the probability of event E is estimated.
An objection to the use of this definition for the determination of probabilities is the large number of
experiments needed for a reliable estimate.
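The convergence of the frequency quotient can be illustrated with a small Python sketch (the die-rolling experiment and all numbers are chosen here for illustration, not taken from the text):

```python
import random

# Hypothetical experiment (not from the text): rolling a fair die, event E = "a six".
# By the experimental law of large numbers, m/n should converge to 1/6.
random.seed(42)

def frequency_quotient(n_trials):
    # m = number of trials in which the event E occurs
    m = sum(1 for _ in range(n_trials) if random.randint(1, 6) == 6)
    return m / n_trials

for n in (100, 10_000, 1_000_000):
    print(n, frequency_quotient(n))
```

Running this shows the quotient wandering for small n and settling near 1/6 only for very large n, which is exactly the objection raised above: a reliable estimate demands many experiments.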
2.1.3 DEFINITION OF THE CONCEPT OF PROBABILITY BY MEANS OF AXIOMS
The concept of probability can be introduced according to a number of basic rules, which are imposed as axioms. Before stating the probability axioms, a number of notions have to be clarified.
Consider e.g. the wave height at sea. Measuring these heights leads to a number of distinguishable
results. The set of all possible outcomes of the wave height is called the solution space Ω. An event E occurs when the outcome of an experiment meets the description of E; e.g. E is the set of all wave heights between 0 and 1 m. The results corresponding to E form a subset of Ω. The result of the experiment is certainly an element of Ω; therefore Ω is called the certain event.

The empty set ∅, which does not contain any result of the experiment, is called the impossible event. The complementary event is the event that E does not occur.
The intersection ⋂_{i=1}^{n} E_i of E1, E2, ..., En is the event that each of these events occurs. In the Venn diagram in figure 2.1 the intersection of six events is indicated by the hatched area.
Figure 2.1 Intersection. Figure 2.2 Union.
The union ⋃_{i=1}^{n} E_i of E1, E2, ..., En is the event that at least one of these events occurs (see the hatched part of figure 2.2). If E1 ∩ E2 = ∅, then E1 and E2 are called mutually exclusive or disjoint events.
The definition of probability by means of axioms reads:
"The probability is a function, symbolised by the letter P, defined for events E_i in the total set of all possible events. It is a measure for the probability of the occurrence of the events E_i."
The function has to satisfy three probability axioms, namely:
P(E) ≥ 0 for every E ⊂ Ω    (2.3)

P(Ω) = 1    (2.4)

P(⋃_{i=1}^{n} E_i) = Σ_{i=1}^{n} P(E_i), if the events E_i are disjoint.    (2.5)
The rules for probability calculations are defined with the help of these probability axioms. These
mathematical rules are stated in appendix A.
2.1.4 SUBJECTIVE CONCEPT OF PROBABILITY
In many cases a mathematical substantiation of the probability of a certain event E is not possible.
This is caused by a lack of relevant statistical data. In such a situation the determination of the
probability of such an event will generally be a case of instinctive considerations. In that case a
subjective probability is involved.
Even when a probability is determined by means of statistics, subjectivity may be involved, for example when someone does not have all relevant information at his disposal, or when only part of the available information is used for the sake of simplicity.
The subjective concept of probability is usually a controversial subject in discussions among mathematicians and users of statistics. For risk analysts it is mostly inevitable to make use of the subjective concept of probability.
Usually instinctive considerations are combined with available statistical information. The foundation
of the creation of such a combination was laid by Thomas Bayes. The method is thus known as the
Bayesian method (see appendix C).
2.2 RANDOM VARIABLES
2.2.1 TYPES OF RANDOM VARIABLES
If the outcomes of an experiment are uncertain, one speaks of a random variable. The random variable is denoted by a variable X or Y, which describes the probability of the results of the experiment. This is schematically shown in fig. 2.3.
Figure 2.3 Schematic representation of a random variable as an uncertain outcome.
As an example consider again the wave height at sea. The value of the wave height, H, has units of
meters. A possible definition of the random variable might simply be Y(H) = H (m). In this case, the
parameter which is measured is called the random variable; this is common in many engineering
applications.
A random variable can be either continuous or discrete. In the previous paragraph the wave height H is a continuous random variable, because the variable could attain any value between 0 and ∞. An example of a discrete random variable arises when we classify the continuous wave height as follows:

Y(H) = 1 if 0 ≤ H < 0.5 m
       2 if 0.5 m ≤ H < 1.0 m
       3 if 1.0 m ≤ H < 1.5 m
       4 if H ≥ 1.5 m
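This classification can be sketched in Python (the function name is ours, chosen for illustration):

```python
def classify_wave_height(h_m):
    """Discrete random variable Y(H) from the classification above (h_m in metres)."""
    if h_m < 0.5:
        return 1
    elif h_m < 1.0:
        return 2
    elif h_m < 1.5:
        return 3
    else:
        return 4

print([classify_wave_height(h) for h in (0.2, 0.7, 1.2, 3.0)])  # [1, 2, 3, 4]
```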
2.2.2 PROBABILITY MASS FUNCTION, PROBABILITY DISTRIBUTION FUNCTION AND PROBABILITY DENSITY FUNCTION

For discrete random variables it is possible to define the probability that the variable will attain a certain value. This is done with the probability mass function:
p_X(X) = P(X = X)    (2.6)

where X is a real number. What is actually stated is that p_X(X) is the probability that the variable X will attain the specified value X. Note that the random variable (with an uncertain value) is denoted by an upright capital letter, whereas a specific value of the variable is denoted by a capital italic letter. Figure 2.4 shows a probability mass function. The variable X can attain only four discrete values.
The probability axioms directly lead to the following:
0 ≤ p_X(X) ≤ 1    (2.7)

Σ_{i=1}^{n} p_X(X_i) = 1    (2.8)

P(a < X ≤ b) = Σ_{X_i ≤ b} p_X(X_i) − Σ_{X_i ≤ a} p_X(X_i)    (2.9)
The probability distribution function of a random variable gives the probability that this variable is smaller than or equal to a certain value (see fig. 2.4). The probability distribution function of X is related to the probability mass function by:

P_X(X) = P(X ≤ X) = Σ_{X_i ≤ X} p_X(X_i)    (2.10)
Figure 2.4 Probability mass function and probability distribution function.
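Equations (2.7)–(2.10) can be checked numerically. The following Python sketch uses a hypothetical mass function with four values, echoing figure 2.4:

```python
# Hypothetical probability mass function with four values (illustrative numbers).
pmf = {1: 0.1, 2: 0.4, 3: 0.3, 4: 0.2}

# Axioms: every mass lies in [0, 1] (eq. 2.7) and the masses sum to 1 (eq. 2.8).
assert all(0 <= p <= 1 for p in pmf.values())
assert abs(sum(pmf.values()) - 1.0) < 1e-12

def cdf(x):
    """Probability distribution function P(X <= x), eq. (2.10)."""
    return sum(p for xi, p in pmf.items() if xi <= x)

# P(a < X <= b) as the difference of the two sums in eq. (2.9):
print(round(cdf(3), 6), round(cdf(3) - cdf(1), 6))
```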
A continuous random variable can attain an infinite number of values (even within a small domain). The probability that the variable will attain one certain value exactly is therefore zero, so a probability mass function is of no use for a continuous random variable. The probability distribution function, however, remains defined as:
F_X(X) = P(X ≤ X)    (2.11)
The probability axioms lead to:
F_X(∞) = 1    (2.12)

F_X(−∞) = 0    (2.13)

F_X(X) is monotonically non-decreasing.
In general it is useful to know the derivative of the probability distribution function. This derivative is
called the probability density function and is defined as (see fig. 2.5):
f_X(X) = dF_X(X) / dX    (2.14)
Figure 2.5 Probability density function and probability distribution function.
The mathematical rules for probabilities (probability mass functions) are also valid for the derived
function. The following applies for the probability density function:
f_X(−∞) = 0    (2.15)

f_X(∞) = 0    (2.16)

f_X(X) ≥ 0 for all X    (2.17)

∫_{−∞}^{∞} f_X(X) dX = 1    (2.18)
There are probability distributions, for which the probability density function is defined, but for which
the probability distribution function can only be presented as:
F_X(X) = ∫_{−∞}^{X} f_X(t) dt
because the integral does not have an analytical solution. This underlines the importance of the
probability density function. Appendix B gives some common probability distribution types with
corresponding probability density functions.
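A sketch of this situation in Python: the normal density is a well-known case whose distribution function has no analytical solution, so F_X is approximated here by numerical integration with the trapezoidal rule (the lower integration bound of −10 and the step count are ad-hoc choices):

```python
import math

def normal_pdf(x):
    """Standard normal probability density function."""
    return math.exp(-0.5 * x * x) / math.sqrt(2.0 * math.pi)

def normal_cdf_numeric(x, lower=-10.0, n=100_000):
    """F_X(x) = integral of f_X from -inf to x, approximated by the trapezoidal
    rule; below -10 the density is negligible, so the truncation error is tiny."""
    h = (x - lower) / n
    s = 0.5 * (normal_pdf(lower) + normal_pdf(x))
    s += sum(normal_pdf(lower + i * h) for i in range(1, n))
    return s * h

print(round(normal_cdf_numeric(0.0), 4))   # 0.5
print(round(normal_cdf_numeric(1.96), 4))  # 0.975
```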
2.2.3 CHARACTERISTICS OF A RANDOM VARIABLE
A random variable is defined by its probability distribution. For the evaluation of and calculation with random variables, it is often useful to also know a couple of characteristics that can be derived from the probability distribution. A random variable can be completely determined by the type of probability distribution and these characteristics. In practice it is often simpler to estimate these characteristics than to determine the exact probability distribution.
The expected value of a random variable X is defined by:
E(X) = ∫_{−∞}^{∞} X f_X(X) dX / ∫_{−∞}^{∞} f_X(X) dX    (2.19)
This is the determination of the weighted average of X, or the X co-ordinate of the centre of mass of the probability density function. The expected value of X is therefore also called the average and is denoted as μ_X.
As ∫_{−∞}^{∞} f_X(X) dX = 1:

E(X) = ∫_{−∞}^{∞} X f_X(X) dX    (2.20)

applies, which corresponds to the determination of the static moment of the probability density function relative to the axis X = 0 (see fig. 2.6). For this reason the expected value of X is also referred to as the first moment.

E(X²) can be determined in the same way. This corresponds to the moment of inertia of the probability density function relative to the axis X = 0 and is therefore referred to as the second moment.
In general the kth moment is defined by:
E(X^k) = ∫_{−∞}^{∞} X^k f_X(X) dX    (2.21)
Analogously, the moments can be calculated relative to the axis X = μ_X (see fig. 2.7). These moments are called the central moments; the general formulation reads:
m_k = E((X − μ_X)^k) = ∫_{−∞}^{∞} (X − μ_X)^k f_X(X) dX    (2.22)
Figure 2.6 First moment. Figure 2.7 First central moment.
By definition the first central moment equals zero. The second central moment is known as the variance and is denoted as Var(X) or σ_X²:

Var(X) = σ_X² = E((X − μ_X)²)    (2.23)
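The moments defined in equations (2.19)–(2.23) can be approximated numerically. The Python sketch below uses the midpoint rule on a uniform density on (0,1), for which the average (1/2) and the variance (1/12) are known; the function names are ours, and the variance is obtained via the expansion Var(X) = E(X²) − μ_X², which follows from eq. (2.23):

```python
def kth_moment(pdf, k, a, b, n=100_000):
    """E(X^k) = integral of X^k f(X) dX (eq. 2.21), midpoint rule on [a, b]."""
    h = (b - a) / n
    total = 0.0
    for i in range(n):
        x = a + (i + 0.5) * h
        total += (x ** k) * pdf(x)
    return total * h

uniform_pdf = lambda x: 1.0                  # density of U(0,1) on its interval
mu = kth_moment(uniform_pdf, 1, 0.0, 1.0)    # average, exactly 1/2
m2 = kth_moment(uniform_pdf, 2, 0.0, 1.0)    # second moment, exactly 1/3
var = m2 - mu ** 2                           # variance, exactly 1/12
print(round(mu, 6), round(var, 6))
```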
The positive square root of the variance, σ_X, is called the standard deviation and is a measure for the spread around the average. For the evaluation of the spread of a random variable around the average, not so much the absolute value of the standard deviation as its value relative to the average is of importance. This relative value is represented by the coefficient of variation V_X, which is defined as:

V_X = σ_X / μ_X    (2.24)
In the case μ_X = 0 the coefficient of variation is undefined; however, there is also no need for a relative measure of the standard deviation in this case. For negative mean values, the absolute value of the mean is used to calculate the coefficient of variation.
"Standardised" central moments are also defined; note the standard deviation in the denominator. The third standardised central moment, m₃/σ_X³, is a measure for the asymmetry or skewness, and the fourth standardised central moment, m₄/σ_X⁴, is a measure for the kurtosis or peakedness of the probability density function.
Figure 2.8 gives three different probability density functions, drawn with the same average and the same standard deviation.
Figure 2.8 Skewness and kurtosis.
The standardised third central moment equals zero for probability density function 1; this function is symmetrical relative to the average. Probability density function 2 slants to the right and its third central moment is thus positive. For probability density function 3 the tail is left of the average and the third central moment is negative. The standardised fourth central moments of probability density functions 2 and 3 are larger than that of function 1.
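Sample-based estimates of these standardised moments can be sketched as follows (Python; the exponential distribution is used here only because it is clearly right-skewed, it is not an example from the text):

```python
import math
import random

# The exponential distribution Exp(1) is right-skewed; its standardised third
# central moment (skewness) is 2 in theory, so the estimate should be positive
# and close to 2.
random.seed(1)
xs = [random.expovariate(1.0) for _ in range(200_000)]

mean = sum(xs) / len(xs)
m2 = sum((x - mean) ** 2 for x in xs) / len(xs)   # second central moment
m3 = sum((x - mean) ** 3 for x in xs) / len(xs)   # third central moment
sd = math.sqrt(m2)
skewness = m3 / sd ** 3                            # standardised third central moment
print(round(skewness, 2))
```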
2.3 RANDOM VECTORS
Frequently, observations contain pairs of random variables, e.g. simultaneously humidity X1 and
temperature X2 . This observation will be expressed as a random vector:
X = (X1, X2)    (2.25)
X is a two-dimensional random variable. Like the one-dimensional random variable, the random vector is defined by a probability distribution function. The probability distribution function for this vector reads:

F_X(X) = F_X1,X2(X1, X2) = P((X1 ≤ X1) ∩ (X2 ≤ X2))    (2.26)
From this it is easy to determine the joint probability density function of this random vector:

f_X(X) = ∂²F_X(X) / (∂X1 ∂X2)    (2.27)
This function is depicted in figure 2.9 by means of contour levels. On the edges the single or marginal density functions f_X1(X1) and f_X2(X2) are given.
Figure 2.9 Joint probability density function
The joint probability density function f_X(X) has the shape of a hill. It reveals that there is a certain correlation between the humidity X1 and the temperature X2. This correlation will be expressed as a number.
First, analogous to equation (2.20), the expected values of the variables of a random vector are
defined by:
E(X_i) = μ_Xi = ∫_{−∞}^{∞} X_i f_Xi(X_i) dX_i    (2.28)
and the variances can be found with:
σ_Xi² = E((X_i − μ_Xi)²) = ∫_{−∞}^{∞} (X_i − μ_Xi)² f_Xi(X_i) dX_i    (2.29)
(2.29)
Beside the variances of the single random variables, the mixed central moment or covariance also
plays an important part. This is defined by:
Cov(X1, X2) = E((X1 − μ_X1)(X2 − μ_X2)) = E(X1 X2) − μ_X1 μ_X2    (2.30)
The correlation coefficient is a parameter derived from the covariance and the variances. It reads:
ρ_X1,X2 = Cov(X1, X2) / (σ_X1 σ_X2),   −1 ≤ ρ_X1,X2 ≤ 1    (2.31)
This coefficient is a measure for the linear dependence between two random variables. If ρ_X1,X2 = 0, the variables are linearly uncorrelated. However, this tells nothing about a possible non-linear correlation. The variables are fully correlated if ρ_X1,X2 = ±1.
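A small Python sketch of equations (2.30) and (2.31), estimating the covariance and the correlation coefficient from paired samples with a hypothetical linear dependence built in:

```python
import math
import random

# Hypothetical pair: X2 = 0.8*X1 + 0.6*Z with X1, Z standard normal, so that
# Var(X2) = 0.64 + 0.36 = 1 and the theoretical correlation coefficient is 0.8.
random.seed(7)
n = 100_000
x1 = [random.gauss(0.0, 1.0) for _ in range(n)]
x2 = [0.8 * a + 0.6 * random.gauss(0.0, 1.0) for a in x1]

m1 = sum(x1) / n
m2 = sum(x2) / n
cov = sum((a - m1) * (b - m2) for a, b in zip(x1, x2)) / n      # eq. (2.30)
rho = cov / (math.sqrt(sum((a - m1) ** 2 for a in x1) / n)
             * math.sqrt(sum((b - m2) ** 2 for b in x2) / n))   # eq. (2.31)
print(round(rho, 2))  # ≈ 0.8
```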
Physical relations may exist between the random variables of a vector. Such a case involves so-called dependent variables. If the relation is known exactly, it is possible to write a vector with dependent variables as a vector with independent variables by substituting the physical relations in the vector.
In appendix A, the probability that two events both occur is described with:
P(E1 ∩ E2) = P(E2) P(E1 | E2)    (2.32)
According to this formulation the combined probability density function of a vector of two random
variables X1 and X2 can be written as:
f_X1,X2(X1, X2) = f_X1(X1) f_X2|X1(X2 | X1)    (2.33)
in which f_X2|X1(X2 | X1) is the conditional probability density function of X2, given that X1 = X1:

f_X2|X1(X2 | X1) = f_X1,X2(X1, X2) / f_X1(X1)    (2.34)
If the variables X1 and X2 are statistically independent, then:
f_X2|X1(X2 | X1) = f_X2(X2)    (2.35)
applies.
In that case, the joint probability density function is defined by the product of the marginal probability
density functions:
f_X1,X2(X1, X2) = f_X1(X1) f_X2(X2)    (2.36)
Given that the variables X1 and X2 are statistically independent, the following applies:
E(X1 X2) = ∫_{−∞}^{∞} ∫_{−∞}^{∞} X1 X2 f_X1,X2(X1, X2) dX1 dX2
         = ∫_{−∞}^{∞} ∫_{−∞}^{∞} X1 X2 f_X1(X1) f_X2(X2) dX1 dX2
         = ∫_{−∞}^{∞} X2 f_X2(X2) ( ∫_{−∞}^{∞} X1 f_X1(X1) dX1 ) dX2
         = E(X1) ∫_{−∞}^{∞} X2 f_X2(X2) dX2 = E(X1) E(X2)    (2.37)
From this it follows that the covariance, according to equation (2.30), is zero.
For vectors of n marginal random variables the functions can easily be extended. The probability distribution function for a vector reads:
F_X(X) = F_X1,X2,...,Xn(X1, X2, ..., Xn)
       = P((X1 ≤ X1) ∩ (X2 ≤ X2) ∩ ... ∩ (Xn ≤ Xn)) = P(⋂_{i=1}^{n} (X_i ≤ X_i))    (2.38)
The joint probability density function of the random vector is:
f_X(X) = ∂^n F_X(X) / (∂X1 ∂X2 ... ∂Xn)    (2.39)
Obviously, the reverse applies:
F_X(X) = ∫_{−∞}^{X1} ∫_{−∞}^{X2} ... ∫_{−∞}^{Xn} f_X(x) dx_n ... dx_2 dx_1    (2.40)
From this the probability distribution function of the single random variable X1 can be determined by multiple (n-fold) integration:

F_X1(X1) = ∫_{−∞}^{X1} ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} f_X(x) dx_n ... dx_2 dx_1    (2.41)
The marginal probability density function of X1 can be found by differentiation:

f_X1(X1) = dF_X1(X1) / dX1

f_X1(X1) = ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} f_X(x) dx_n ... dx_3 dx_2    (2.42)
The preceding is based on a known probability distribution function or probability density function of the random vector, from which the marginal probability density functions can be determined. In practice, however, the interest is often in a random vector consisting of a number of random variables for which only the marginal probability density functions are known. The question is then whether the random vector can be determined from these marginals.
In addition to eq. (2.33), the joint probability density function of a vector of n random variables can be written as:

f_X(X) = f_X1(X1) f_X2|X1(X2 | X1) ... f_Xn|X1...Xn−1(Xn | X1, ..., Xn−1)    (2.43a)
When all n random variables of the vector are independent, the joint probability density function is given by (extending equation 2.36):

f_X(X) = f_X1(X1) f_X2(X2) ... f_Xn(Xn) = ∏_{i=1}^{n} f_Xi(X_i)    (2.43)
For random dependent variables the marginal probability density functions offer insufficient
information to be able to determine the joint probability density function.
For random vectors with n variables the mutual linear correlations between the variables are
described with the covariance matrix, given by:

        | Var(X1)        Cov(X1, X2)  ...  Cov(X1, Xn) |
C_XX =  | Cov(X2, X1)    Var(X2)      ...  Cov(X2, Xn) |    (2.44)
        | ...            ...               ...         |
        | Cov(Xn, X1)    Cov(Xn, X2)  ...  Var(Xn)     |

in which:

Var(X1) = Cov(X1, X1)
If the covariance matrix only has values on the diagonal, it involves so-called uncorrelated base variables. With the help of linear algebra it is possible to transform the set of correlated variables into a set of uncorrelated variables. This transformation reads:

Y = Aᵀ X    (2.45)

in which:

X is the vector with the correlated base variables;
A is the matrix with the orthonormal eigenvectors of C_XX as column vectors;
C_XX is the covariance matrix of X;
Y is the vector with uncorrelated base variables.
The expected values of the uncorrelated base variables are determined with:

E(Y) = Aᵀ E(X)    (2.46)

The covariance matrix of Y contains, on its diagonal, the variances, which equal the eigenvalues of C_XX.
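The transformation (2.45) can be sketched for a hypothetical 2×2 covariance matrix; for a symmetric 2×2 matrix the eigenvalues and orthonormal eigenvectors can be written out by hand, so no linear algebra library is needed (all numbers below are illustrative):

```python
import math

# Hypothetical covariance matrix C_XX = [[4.0, 1.5], [1.5, 2.0]].
c11, c12, c22 = 4.0, 1.5, 2.0

# Eigenvalues of a symmetric 2x2 matrix, from its trace and determinant.
tr, det = c11 + c22, c11 * c22 - c12 ** 2
disc = math.sqrt(tr ** 2 / 4.0 - det)
lam1, lam2 = tr / 2.0 + disc, tr / 2.0 - disc

# Orthonormal eigenvectors: v1 = (c12, lam1 - c11) normalised, v2 perpendicular.
norm = math.hypot(c12, lam1 - c11)
a11, a21 = c12 / norm, (lam1 - c11) / norm
a12, a22 = -a21, a11

def quad(u, v):
    """u^T C_XX v, i.e. the covariance of u^T X and v^T X."""
    return u[0] * (c11 * v[0] + c12 * v[1]) + u[1] * (c12 * v[0] + c22 * v[1])

var_y1 = quad((a11, a21), (a11, a21))   # diagonal entry: should equal lam1
cov_y12 = quad((a11, a21), (a12, a22))  # off-diagonal entry: should vanish
print(round(var_y1 - lam1, 12), round(cov_y12, 12))
```

The off-diagonal covariance of Y comes out as (numerically) zero, and the diagonal entries equal the eigenvalues, as stated above.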
2.4 FUNCTIONS OF RANDOM VARIABLES
2.4.1 FUNCTIONS OF ONE RANDOM VARIABLE
In practice, the risk analyst often encounters functions of random variables. For example, consider the wind velocity causing a pressure p. The velocity v is a random variable. For the pressure, p = ½ρv² applies, where ρ is the density. Because v is a random variable, p is consequently also a random variable.
A random variable is characterised by its average and variance; sometimes it is also necessary to know its probability distribution. In this subsection it is explained how to determine the average and the variance by determining the moments of the random variable Y. Subsequently the determination of the probability functions is explained.
In 2.2.3 the kth moments are defined as the expected values of X^k. If in equation (2.21) X^k is replaced by an arbitrary function Y = g(X), the expected value of the variable Y is defined as:

E(Y) = μ_Y = ∫_{−∞}^{∞} Y f_X(X) dX = ∫_{−∞}^{∞} g(X) f_X(X) dX    (2.47)
For linear functions g(X):

μ_Y = g(μ_X)    (2.48)

applies.
The kth central moment of the function Y is determined by:

E((Y − μ_Y)^k) = ∫_{−∞}^{∞} (Y − μ_Y)^k f_X(X) dX    (2.49)
(2.49)
The variance of Y is therefore:
Var(Y) = E((Y − μ_Y)²) = ∫_{−∞}^{∞} (g(X) − μ_Y)² f_X(X) dX    (2.50)
(2.50)
For the linear function Y = a X + b:
Var(Y) = E((g(X) − g(μ_X))²) = E((aX + b − aμ_X − b)²) = a² σ_X²    (2.51)
For non-linear functions, the function can be approximated by means of a Taylor polynomial around the average of X. This is denoted as:

g(X) ≈ g(μ_X) + (dg(μ_X)/dX)(X − μ_X) + (1/2!)(d²g(μ_X)/dX²)(X − μ_X)² + ... + (1/n!)(d^n g(μ_X)/dX^n)(X − μ_X)^n    (2.52)
Subsequently, the expected value of the polynomial can be determined as an approximation of the
expected value of Y:
E(Y) ≈ ∫_{−∞}^{∞} [ g(μ_X) + (dg(μ_X)/dX)(X − μ_X) + (1/2!)(d²g(μ_X)/dX²)(X − μ_X)² + ... + (1/n!)(d^n g(μ_X)/dX^n)(X − μ_X)^n ] f_X(X) dX
     = g(μ_X) + (1/2!)(d²g(μ_X)/dX²) E((X − μ_X)²) + ... + (1/n!)(d^n g(μ_X)/dX^n) E((X − μ_X)^n)    (2.53)

(the first-order term vanishes because E(X − μ_X) = 0)
The approximation of the expected value of Y is therefore a function of the average and the central moments of X. When a function is approximated by the first two terms of the Taylor polynomial, one speaks of a linearised function:

g(X) ≈ g(μ_X) + (dg(μ_X)/dX)(X − μ_X)    (2.54)
The expected value of Y can then be approximated by:

E(Y) ≈ g(μ_X)    (2.55)
This approximation is known as the Mean Value approximation.
By substituting the derivative of g(X) for a in equation (2.51), an approximation for the variance of Y is found:

Var(Y) ≈ (dg(μ_X)/dX)² σ_X²    (2.56)
In the preceding, the average and the variance of Y were determined exactly or approximately. Hereafter it is explained how to determine the probability functions. If the function g(X) is monotonic, the probability distribution function is given by:

F_Y(Y) = P(Y ≤ Y) = P(X ≤ g⁻¹(Y)) = F_X(g⁻¹(Y))    (2.57)
The probability density function of Y can be derived by differentiating equation (2.57) with respect to Y. This gives:

f_Y(Y) = dF_Y(Y)/dY = (dF_X(g⁻¹(Y))/dX) (dg⁻¹(Y)/dY) = f_X(g⁻¹(Y)) dg⁻¹(Y)/dY    (2.58)
This is illustrated in figure 2.10 for a function Y = X^n.
Figure 2.10 Probability density functions for random variables X and Y=g(X)
The properties of a random variable Y, which is a function of another random variable X, can be
determined using these equations.
2.4.2 FUNCTIONS OF SEVERAL RANDOM VARIABLES (FROM ℝⁿ TO ℝⁿ)

A given random vector Y = (Y1, Y2, ..., Yn) is a function g = (g1, g2, ..., gn) of X = (X1, X2, ..., Xn), so that:

Y_i = g_i(X1, X2, ..., Xn)    (2.59)
In 2.4.1 formulae are given for the expected value and the probability density function of a function of
a random variable. Completely analogously, the expected value and probability density function for a
function of a random vector can be determined.
The expected value of Y is determined by:

E(Y) = ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} g(X) f_X(X) dX1 ... dXn    (2.60)
The probability density function of Y is given by:

f_Y(Y) = f_X(X) · | ∂X1/∂Y1  ...  ∂X1/∂Yn |
                  |    .               .   |    (2.61)
                  | ∂Xn/∂Y1  ...  ∂Xn/∂Yn |
The determinant with the partial derivatives is known as Jacobi's determinant or the Jacobian, and is denoted as J. Equation (2.61) can now simply be written as:

f_Y(Y) = f_X(X) |J|    (2.62)

Though the formulation seems relatively simple, the determination of f_Y(Y) usually requires a lot of calculations in practice.
The foregoing will be clarified with an example.
EXAMPLE 2.1
The question is to determine the probability density function of the function Y1 = X1 + X2,
in which X1 and X2 are independent uniformly distributed random variables in the interval
(0,1). X1 and X2 are graphically represented in figure 2.11.
Figure 2.11 Random variables X1 and X2.
Auxiliary variable Y2 is defined as Y2 = X1. X1 and X2 can be written as functions of Y1 and Y2, namely:

X1 = Y2
X2 = Y1 − Y2 = Y1 − X1

The Jacobian is:

J = | 0   1 | = −1, thus |J| = 1
    | 1  −1 |
The probability density function of Y1 is found by integration:

f_Y1(Y1) = ∫_{−∞}^{∞} f_X(X1, Y1 − X1) |J| dX1
As it is given that X1 and X2 are independent:

f_X(X1, Y1 − X1) = f_X1(X1) f_X2(Y1 − X1)

From which follows:

f_Y1(Y1) = ∫_{−∞}^{∞} f_X1(X1) f_X2(Y1 − X1) dX1
In a couple of cases this integral can be solved analytically. The solvability is dependent
on the marginal probability density functions of X1 and X2.
In this example it has been assumed that both X1 and X2 are uniformly distributed in the
interval (0,1), so:
if 0 ≤ X1 ≤ 1 then f_X1(X1) = 1, and otherwise f_X1(X1) = 0
if 0 ≤ Y1 − X1 ≤ 1 then f_X2(Y1 − X1) = 1, and otherwise f_X2(Y1 − X1) = 0
When integrating, these limits, within which the probability density functions are defined, have to be observed. This leads to the following solution:

if 0 ≤ Y1 ≤ 1 then: f_Y1(Y1) = ∫_{0}^{Y1} 1 · 1 dX1 = Y1
if 1 < Y1 ≤ 2 then: f_Y1(Y1) = ∫_{Y1−1}^{1} 1 · 1 dX1 = 2 − Y1
and otherwise: f_Y1(Y1) = 0
See figure 2.12.

Figure 2.12 Probability density function of Y1.
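The triangular density derived in example 2.1 can be verified with a quick Monte Carlo sketch in Python:

```python
import random

# Y1 = X1 + X2 with X1, X2 independent U(0,1). From the triangular density
# f(Y1) = Y1 on (0,1) and 2 - Y1 on (1,2):
#   P(Y1 <= 0.5) = 0.5^2 / 2 = 0.125 and P(Y1 <= 1) = 0.5.
random.seed(11)
trials = 500_000
ys = [random.random() + random.random() for _ in range(trials)]
p_half = sum(y <= 0.5 for y in ys) / trials
p_one = sum(y <= 1.0 for y in ys) / trials
print(round(p_half, 3), round(p_one, 3))
```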
2.4.3 FUNCTIONS OF A RANDOM VECTOR (FROM ℝⁿ TO ℝ¹)

In many cases the risk analyst is interested in the probability distribution of a random variable Y which is defined as a function of a random vector X:

Y = g(X), g: ℝⁿ → ℝ¹    (2.61)

Often this form applies to the limit state function Z (see chapter 5).
The probability density function can be found by first considering a function from ℝⁿ to ℝⁿ.
Suppose:
Y1 = g(X) and Y_i = X_i for i = 2, 3, ..., n    (2.62)

If the function g(X) is monotonic, it is possible to define the following inverse functions:

X_{i+1} = Y_{i+1} for i = 1, 2, ..., n−1
X1 = g⁻¹(Y) = h(Y) = h(Y1, Y2, ..., Yn) = h(Y1, X2, X3, ..., Xn)    (2.63)
The Jacobian can be defined by partial differentiation of the functions for X_i:

J = | ∂X1/∂Y1  ∂X1/∂Y2  ...  ∂X1/∂Yn |
    |    0         1     ...     0    |    (2.64)
    |    .         .      .      .    |
    |    0         0     ...     1    |

Since X_i = Y_i for i = 2, ..., n, the determinant reduces to J = ∂X1/∂Y1.
The probability density function of Y is:

f_Y(Y) = f_X(X) |J|
       = f_X(X1, X2, ..., Xn) |J|
       = f_X(h(Y1, X2, ..., Xn), X2, ..., Xn) |J|    (2.65)
From the probability density function of Y, the marginal probability density function of Y1 can be determined by means of integration:
f_Y1(Y1) = ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} f_Y(Y) dYn ... dY3 dY2
         = ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} f_X(h(Y1, X2, ..., Xn), X2, ..., Xn) |J| dXn ... dX3 dX2    (2.66)
It is also possible to determine the probability density function of Y1 by first calculating the probability distribution function and subsequently differentiating it. Calculating the probability distribution function can be done with the help of the total probability theorem (see appendix A5). The formulation of the probability distribution function is:

F_Y1(Y1) = ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} P(Y1 < Y1 | X) f_X(X1, X2, ..., Xn) dX1 dX2 ... dXn
         = ∫_{−∞}^{∞} ... ∫_{−∞}^{∞} 1(Y1 − g(X)) f_X(X1, X2, ..., Xn) dX1 dX2 ... dXn    (2.67)

in which:

1(Y1 − g(X)) = 1 if Y1 − g(X) ≥ 0;
1(Y1 − g(X)) = 0 if Y1 − g(X) < 0.
The probability density is now found by differentiation. This method for determining the probability density function
and the probability distribution function of Y1 is particularly suitable for application in numerical methods.
Using the marginal probability density function the expected value and the variance of Y1 can be
calculated. In the case of a linear function it is possible to calculate these values without determining
the probability density. In this case the function Y1 is:
Y1 = g(X) = a1 X1 + a2 X2 + ... + an Xn + b    (2.68)
The expected value of Y1 is then:
E(Y1) = E(a1 X1 + a2 X2 + ... + an Xn + b)
      = a1 E(X1) + ... + an E(Xn) + b
      = a1 μ_X1 + ... + an μ_Xn + b = g(μ_X)    (2.69)
and the variance of Y1 is:
Var(Y1) = E((Y1 − μ_Y1)²)
        = E((a1 X1 + ... + an Xn + b − (a1 μ_X1 + ... + an μ_Xn + b))²)
        = E((a1 (X1 − μ_X1) + ... + an (Xn − μ_Xn))²)
        = E( Σ_{i=1}^{n} Σ_{j=1}^{n} a_i a_j (X_i − μ_Xi)(X_j − μ_Xj) )
        = Σ_{i=1}^{n} Σ_{j=1}^{n} a_i a_j Cov(X_i, X_j)    (2.70)
If the function is non-linear, it can be approximated around an arbitrary point by the first two terms of
the Taylor-polynomial:
Y1 = g(X) ≈ g(X⁰) + Σ_{i=1}^{n} (∂g(X⁰)/∂X_i)(X_i − X_i⁰)    (2.71)
The expected value of Y1 can then be approximated by:
E(Y1) = E(g(X)) ≈ g(X⁰) + Σ_{i=1}^{n} (∂g(X⁰)/∂X_i)(μ_Xi − X_i⁰)    (2.72)
and the variance by:
Var(Y1) ≈ Σ_{i=1}^{n} Σ_{j=1}^{n} (∂g(X⁰)/∂X_i)(∂g(X⁰)/∂X_j) Cov(X_i, X_j)    (2.73)
If the expected value of X is chosen for X⁰, a so-called Mean Value approximation is used.
2.4.4 CENTRAL LIMIT THEOREM
This theorem concerns a special case of a function of a random vector as in section 2.4.3, namely:

Y = Σ_{i=1}^{n} X_i

An important property of random variables is given by the central limit theorem:

"When a large number of independent random variables, of which none dominates, are added up, this results in a random variable that is normally distributed, irrespective of the starting distributions of the added variables."

In figure 2.13 this is demonstrated for the sum of respectively 2, 3 and 4 independent random variables that are uniformly distributed between 0 and 1. Already the sum of 4 variables results in a distribution that is fairly similar to a normal distribution (except for the tails).
A consequence of the central limit theorem is that the sum of two normally distributed variables is
once again normally distributed.
Figure 2.13 Central limit theorem.
Analogous to the central limit theorem for the sum of a large number of independent random variables, the product of a large number of independent (positive) random variables is lognormally distributed.
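The demonstration of figure 2.13 can be reproduced numerically. The Python sketch below standardises the sum of n = 4 uniform variables (mean n/2, variance n/12) and compares two probabilities against the standard normal values:

```python
import math
import random

# Standardised sum of n = 4 independent U(0,1) variables. If the central limit
# theorem applies, P(Z <= 0) should be close to 0.5 and P(Z <= 1.96) close to
# the standard normal value 0.975.
random.seed(5)
n, trials = 4, 400_000
zs = [(sum(random.random() for _ in range(n)) - n / 2) / math.sqrt(n / 12)
      for _ in range(trials)]
p0 = sum(z <= 0.0 for z in zs) / trials
p196 = sum(z <= 1.96 for z in zs) / trials
print(round(p0, 3), round(p196, 3))
```

The centre matches the normal distribution closely even for n = 4; the tails, as noted above, converge more slowly.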
2.5 EXTREME VALUE DISTRIBUTIONS
Many applications in civil engineering concern the largest or smallest value of a group of random variables. For instance, a designer would like to know the maximum wind load on a structure during the design period, and not per storm. Also, for a structure consisting of a number of elements, where the weakest element determines the strength of the structure, he would like to know the minimum strength.
In case of the wind velocity the group of random variables can be defined as:
X1: wind velocity in year 1
X2: wind velocity in year 2
X3: wind velocity in year 3
...
Xn: wind velocity in year n
From this group of variables the maximum or minimum value is extracted; these functions are the so-called extreme value functions. They are written as:

Y = Min_{i=1}^{n} X_i = smallest value of X1, X2, ..., Xn
Y = Max_{i=1}^{n} X_i = largest value of X1, X2, ..., Xn    (2.74)
If the variables X1 up to and including Xn are random, the extreme values are random variables too.
The probability distributions of the variables X1 up to and including Xn are called the mother distributions. Those of the largest and smallest values are known as the extreme value distributions. This paragraph focuses on the extreme value distribution of a number of identically distributed random variables. This corresponds to the probability distribution of the largest or smallest value the variable can attain in a number of realisations.
Using the mathematical laws for calculus of probability it is possible to determine the extreme value
distributions from the mother distributions.
Suppose that the mother distribution of a random variable X is known. The question is now: what is
the probability distribution of the maximum and the minimum value of X in n realisations, given that
the realisations do not influence each other? This actually concerns n independent random variables
with the same probability distribution.
The probability that all realisations deliver values of X which are smaller than or equal to x is given
by:

P(X1 ≤ x ∧ X2 ≤ x ∧ ... ∧ Xn ≤ x) = P(X1 ≤ x) P(X2 ≤ x) ... P(Xn ≤ x)    (2.75)

Because P(X1 ≤ x) = P(X2 ≤ x) = ... = P(Xn ≤ x) = F_X(x):

P(X1 ≤ x ∧ X2 ≤ x ∧ ... ∧ Xn ≤ x) = ( F_X(x) )^n    (2.76)

This defines the probability that all n values of X in n realisations are smaller than or equal to x. The
probability distribution for maximums is usually written as:

F_Xn,max(x) = ( F_X(x) )^n    (2.77)

Analogously, the probability that all realisations deliver values that are larger than x is determined by:

P(X1 > x ∧ X2 > x ∧ ... ∧ Xn > x) = ( 1 - F_X(x) )^n    (2.78)

The probability that at least one of the realisations gives a value that is smaller than or equal to x is
complementary to the foregoing. The probability distribution of the minimum value of X with n samples
is therefore:

F_Xn,min(x) = 1 - ( 1 - F_X(x) )^n    (2.79)

From the probability distributions of the extreme values for maximums and minimums the probability
density functions can be determined by differentiating with respect to x. The result is:

f_Xn,max(x) = d F_Xn,max(x) / dx = n f_X(x) ( F_X(x) )^(n-1)        (extreme value distribution for maximum)    (2.80)

f_Xn,min(x) = d F_Xn,min(x) / dx = n f_X(x) ( 1 - F_X(x) )^(n-1)    (extreme value distribution for minimum)    (2.81)
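The relations for maximums and minimums can be checked by simulation. The sketch below is an added illustration (the exponential mother distribution, the evaluation points and all parameter values are arbitrary choices): it draws n realisations per trial and compares the empirical distribution of the maximum and the minimum with ( F_X(x) )^n and 1 - ( 1 - F_X(x) )^n respectively.

```python
import numpy as np

rng = np.random.default_rng(1)
n = 10          # realisations per trial
trials = 100_000
lam = 1.0       # parameter of the exponential mother distribution

x = rng.exponential(1.0 / lam, size=(trials, n))
maxima = x.max(axis=1)
minima = x.min(axis=1)

def F_mother(t):
    """F_X(t) of the exponential mother distribution."""
    return 1.0 - np.exp(-lam * t)

t = 2.0
# eq. (2.77): distribution of the maximum, checked at x = t
F_max_theory = F_mother(t) ** n
F_max_empirical = (maxima <= t).mean()
# eq. (2.79): distribution of the minimum, checked at x = t/n
F_min_theory = 1.0 - (1.0 - F_mother(t / n)) ** n
F_min_empirical = (minima <= t / n).mean()

print(F_max_theory, F_max_empirical)
print(F_min_theory, F_min_empirical)
```

The empirical and theoretical values agree to within the Monte Carlo sampling error.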
In figures 2.14 and 2.15 the probability density functions are given of the maximums and the
minimums of X respectively, for a number of values of n. The mother probability density function of X
is also drawn in both figures. The extreme value distribution does not have to be of the same type as the mother
distribution.
Figure 2.14 Probability density functions of the maximums of X.
Figure 2.15 Probability density functions of the minimums of X.
For large values of n the extreme value distributions of the random variable approach a limited
number of possible distribution types. These distributions are known as the asymptotic extreme value
distributions (see appendix B6) and are subdivided in three main types. For the theoretical
substantiation one is referred to [2.1] and [2.7]. It is noted that the convergence to the asymptotic
extreme value distributions is very slow for increasing n, much slower than for example the
convergence to a normal distribution.
For the middle area of the mother distribution there usually is convergence, but for the tail of the
distribution there is often hardly any convergence to speak of. Therefore one has to be cautious with
assumptions concerning the type of the extreme value distribution.
2.6 ESTIMATING DISTRIBUTIONS
2.6.1 INTRODUCTION
An important element in the risk analysis is the determination of the distributions of the random
variables. The selection of the distribution type and the distribution parameters mainly determine the
outcome of the analysis.
If a large amount of statistical data is available, for example hourly observations of the wind velocity
over a long period, frequentistic methods from classical statistics can be used.
In many cases the amount of statistical data is inadequate. There are almost no statistical data
available when considering the probability of human failure of the pilot of a Boeing 747 landing at
Schiphol airport in the dark in bad weather. Then one has to resort to more subjective
methods for the estimation of distribution types and parameters.
Prior to making an estimate of the distribution of a random variable, a number of considerations have
to be taken into account.
In the first place, distributions often have a theoretical basis, which makes them suited or unsuited to
describe a certain phenomenon. The most well-known considerations are:

> the central limit theorem, according to which the sum of a large number of random variables is normally distributed and the product of the variables is lognormally distributed;
> the maximum/minimum of a large number of independent random variables is often distributed according to one of the asymptotic extreme value distributions.
In some cases the sought random variables are functions of other random variables whose
distribution types are known. For example, the Rayleigh distribution applies to the height of sea
waves. In such cases a distribution can be derived theoretically.
In other cases there are considerations that exclude certain distributions. For example, if a
variable can, in theory, only attain positive values, a normal distribution does not qualify. However,
one must not be too strict in applying this argument and should only use it when the coefficient of
variation is large: for example, when the probability of a negative value is in the order of 10^-8,
there is no reason to dismiss the normal distribution for most applications.
In general however, these considerations won't suffice or will at least require verification. This can be
done in two ways, namely the classical and the Bayesian way. Both procedures will be described.
Besides following the formal procedure, it is always helpful to plot the fitted distribution together with
the observation material. Preferably, both the distribution function and the density function are drawn.
At first sight some conclusions are already possible, and considerations can be involved that are
difficult to formalise. For example, sometimes the right or left tail is important and sometimes only the
middle area.
In 2.6.2 up to and including 2.6.4 several methods for the estimation of the parameters of a known
distribution type are discussed, and in 2.6.5 methods for choosing or rejecting a distribution type are
mentioned.
The estimated parameters are generally called estimators and are denoted with a hat: thus p̂ is an
estimate of p. The estimators for the average and the standard deviation are usually denoted by m
and s respectively.
2.6.2 SUBJECTIVE PARAMETER ESTIMATION
Frequently there is a lack of statistical data concerning random variables, for example in the earlier
mentioned landing of a Boeing. In such cases one relies on the experience and intuition of experts,
supported by data from the literature. The estimate of the properties of random variables gathered in
this way is subjective and often open to discussion. But even when a lot of statistical material is
available, more knowledge than just the statistical data is necessary to estimate the probability
distribution of a variable. Some subjectivity can therefore not be ruled out.
In general one attempts to define a lower and an upper limit, within which the value of the variable
most probably lies, and to define a most likely value. In such cases, "most probably" is usually
understood to mean "with a probability of 95 %". This gives two points of the probability distribution,
namely:

F(upper limit) = 97.5 %
F(lower limit) = 2.5 %    (2.82)
Obviously, other limits can be chosen too. If the distribution type of the variable is known, the
parameters of the probability distribution can be estimated, using the chosen values.
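As an illustration added here (the normal distribution type and the numerical limits are assumptions for the example), suppose the expert limits of (2.82) are given for a normally distributed variable. The limits then lie approximately 1.96 standard deviations on either side of the mean, so the two parameters follow directly:

```python
def normal_params_from_limits(lower, upper, z=1.96):
    """Estimate (mean, std) of a normal variable from its 2.5 % and
    97.5 % limits; z is the 97.5 % quantile of the standard normal."""
    mean = 0.5 * (lower + upper)
    std = (upper - lower) / (2.0 * z)
    return mean, std

# hypothetical expert estimate of a load variable: between 10 and 30
# with 95 % probability
mu, sigma = normal_params_from_limits(10.0, 30.0)
print(mu, sigma)   # mean 20.0, standard deviation about 5.1
```

For other distribution types the same two quantile conditions give two equations for the parameters, which may require numerical solution.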
In most cases, the selection of the distribution type is also based on experience, intuition and
literature. It is advisable to base the choice of the distribution type on an analysis of the physical
factors that influence the value of the random variable.
2.6.3 FREQUENTISTIC PARAMETER ESTIMATION (SEE ALSO ANNEX C)
If the distribution type of a random variable is known and observations of the variable are available,
the parameters of the distribution can be estimated by means of frequentistic methods.
The estimate of a parameter is a function of the observations and can be written as:
p̂ = g(X) = g(X1, X2, ..., Xn)    (2.83)
in which:
X is a vector with the results of random samples (observations) of X.

Because X is a random variable, the observations can also be considered random variables
beforehand. The estimator p̂ is therefore a function of n random variables.
Based on the expected value and the standard deviation of p̂, several properties can be defined.
The estimator p̂ of a parameter p is called an unbiased estimator if the expected value of g(X)
equals p, so if E(g(X)) = E(p̂) = p.
EXAMPLE 2.2

From a random sample of n observations the average m_X is determined as an estimate
of the mean μ_X of the distribution of X, thus:

m_X = ( X1 + X2 + ... + Xn ) / n

The expected value of m_X is:

E(m_X) = E( ( X1 + X2 + ... + Xn ) / n )
       = (1/n) ( ∫ x f_X(x) dx + ∫ x f_X(x) dx + ... + ∫ x f_X(x) dx )
       = (1/n) ( μ_X + μ_X + ... + μ_X ) = μ_X

in which the integrals run from -∞ to ∞. From this it follows that m_X is an unbiased
estimator of μ_X.
If E(p̂) = p is only valid for large values of n, then p̂ is an asymptotically unbiased estimator.
Sometimes more than one unbiased estimator can be defined for a single parameter of a probability
distribution. The mean of a normal distribution, for instance, can be estimated with:

> the average of the random sample;
> the median of the observations;
> the average of the highest and the lowest observation.

The difference between these estimators lies in their standard deviations. The average of the random
sample has the smallest standard deviation and is therefore the most efficient estimator.
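The difference in efficiency can be made visible by simulation. The sketch below is an added illustration (the standard-normal variable, the sample size and the number of repetitions are arbitrary choices): it repeatedly draws samples and compares the scatter of the three estimators of the mean.

```python
import numpy as np

rng = np.random.default_rng(7)
n, reps = 21, 20_000          # odd n, so the median is a single observation
samples = rng.normal(0.0, 1.0, size=(reps, n))

mean_est = samples.mean(axis=1)                                   # sample average
median_est = np.median(samples, axis=1)                           # sample median
midrange_est = 0.5 * (samples.max(axis=1) + samples.min(axis=1))  # midrange

sd_mean = mean_est.std()
sd_median = median_est.std()
sd_midrange = midrange_est.std()
print(sd_mean, sd_median, sd_midrange)
```

All three estimators average to the true mean, but the sample average has the smallest standard deviation and the midrange the largest.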
It is illustrative to deal with two often used estimators for the standard deviation:
s_X = √( Σ_i ( X_i - m_X )² / n )        (2.84)

s_X = √( Σ_i ( X_i - m_X )² / (n - 1) )  (2.85)
If the average is unknown, the first-mentioned estimator is biased and the latter is unbiased. However,
the biased estimator does have the smallest mean square error.
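The bias can be verified numerically. The sketch below is an added illustration (a standard-normal variable with true variance 1, and all sample sizes, are arbitrary choices); it works with the squares of (2.84) and (2.85), i.e. with the two variance estimators, and averages them over many small samples.

```python
import numpy as np

rng = np.random.default_rng(3)
n, reps = 5, 200_000
samples = rng.normal(0.0, 1.0, size=(reps, n))   # true variance = 1

var_biased = samples.var(axis=1, ddof=0)    # square of eq. (2.84): divide by n
var_unbiased = samples.var(axis=1, ddof=1)  # square of eq. (2.85): divide by n-1

print(var_biased.mean())    # close to (n-1)/n = 0.8
print(var_unbiased.mean())  # close to 1.0
```

Dividing by n underestimates the variance by the factor (n-1)/n on average; dividing by n-1 removes this bias.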
Some methods of frequentistic parameter estimation are:

> the method of moments;
> the method of maximum likelihood;
> the least squares method.
Appendix C elaborates on these methods.
2.6.4 BAYESIAN PARAMETER ESTIMATION
The Bayesian parameter estimation is a mix of the subjective and the frequentistic parameter
estimation. First, one determines the so-called a priori probabilities that a number of hypotheses
concerning the parameters to be estimated are true. These hypotheses reflect the knowledge when
no statistical data are available yet. These probabilities are subjective and thus form the debatable
side of the Bayesian analysis.
For example, statistical data from a Dutch laboratory are not yet available, but statistical data from a
German laboratory are used to formulate hypotheses concerning the requested parameters. From
these, the a priori probabilities are estimated. When the statistical data from the Dutch laboratory
become available (the objective material), they are combined with the a priori parameters by means
of a standard procedure, to arrive at the so-called a posteriori parameters. This procedure is
described in appendix C.
The a posteriori probabilities give the likelihood of the posed hypotheses concerning the parameters
to be estimated.
2.6.5 SELECTION OF DISTRIBUTION
In the foregoing it was presumed that the probability distribution type of the considered random
variable was already known. Usually, however, this probability distribution type is not known. The
choice of the type depends on the knowledge concerning the random variable. Often a subjective
estimate of the type will have to be based on the intuition and knowledge of experts and on data from
the literature.
If statistical data are available, an estimate of the distribution type can be made by employing
frequentistic methods. One of these methods uses the estimates of the standardised asymmetry or
skewness and the standardised kurtosis or peakedness.
The central moments of the distribution of a random variable can be estimated by determining the
moments of the statistical material:
m_k = Σ_{i=1}^{n} ( X_i - m_X )^k / n    (2.86)
in which:
m_k is the estimate of the k-th central moment;
X_i is observation i of X;
n is the number of observations of X.
With the help of the estimators for the central moments it is possible to estimate the so-called
standardised skewness and the standardised kurtosis of the distribution:

β1 = ( m3 )² / ( m2 )³    respectively    β2 = m4 / ( m2 )²    (2.87)
For various known distribution types the relations between the standardised skewness (β1) and the
standardised kurtosis (β2) have been investigated by Pearson. Some of these relations are given in
figure 2.16. Based on the estimates b1 and b2 of β1 and β2, this figure can help select a distribution
type.
Figure 2.16 Relations between β1 and β2 for different distribution types (after Professor E.S. Pearson,
University College, London).
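A minimal sketch, added here, of estimating the central moments and the standardised measures from data (the normal sample and its parameters are arbitrary choices, and the β1, β2 definitions follow the Pearson convention of squared skewness and m4/m2²). For a sample from a normal distribution, β1 should be near 0 and β2 near 3.

```python
import numpy as np

def central_moment(x, k):
    """Estimate of the k-th central moment of a sample."""
    return np.mean((x - x.mean()) ** k)

def beta1(x):
    """Standardised skewness measure: beta_1 = m3^2 / m2^3."""
    return central_moment(x, 3) ** 2 / central_moment(x, 2) ** 3

def beta2(x):
    """Standardised kurtosis: beta_2 = m4 / m2^2."""
    return central_moment(x, 4) / central_moment(x, 2) ** 2

rng = np.random.default_rng(11)
x = rng.normal(5.0, 2.0, size=200_000)
print(beta1(x), beta2(x))   # near 0 and 3 for a normal sample
```

Plotting the estimated pair (b1, b2) in a diagram such as figure 2.16 then indicates which distribution types are plausible candidates.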
Practice has revealed that, even if many observations are available, it is almost impossible to
find the exact distribution. Using figure 2.16, a number of possible distributions is usually selected.
From these candidates a further selection can be made by means of a number of tests.
Appendix C describes two such tests, namely the chi-square test and the Kolmogorov-Smirnov test.
These tests are based purely on the statistical material; they do not take other knowledge,
relevant for the choice of the distribution type, into account. A method with which this
knowledge can be taken into account is the Bayesian procedure for the selection of the distribution
type. This procedure is also described in appendix C.
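As an illustration of such a test, the sketch below applies scipy's standard Kolmogorov-Smirnov test to synthetic data (the normal sample and all parameter values are assumptions for the example): a correct hypothesis gives a small test statistic, a wrong one a large statistic and a negligible p-value.

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(2)
data = rng.normal(10.0, 2.0, size=5000)

# Kolmogorov-Smirnov test against the correct hypothesis N(10, 2)
result = stats.kstest(data, "norm", args=(10.0, 2.0))
print(result.statistic, result.pvalue)

# the same test against a wrong hypothesis N(12, 2)
wrong = stats.kstest(data, "norm", args=(12.0, 2.0))
print(wrong.statistic, wrong.pvalue)
```

The test statistic is the largest distance between the empirical distribution function of the data and the hypothesised distribution function; the wrong hypothesis is rejected at any reasonable significance level.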
LITERATURE
Recommended literature:
2.1. GUMBEL, E.J., Statistics of extremes. Columbia University Press, 1958.
Consulted literature:
2.2. BENJAMIN, J.R. and C.A. CORNELL, Probability, statistics and decision for civil engineers.
McGraw-Hill, 1970.
2.3. BOLOTIN, V.V., Statistical methods in structural dynamics. Holden-Day, San Francisco, 1989.
2.4. GROENEBOOM, P. et al., Statistics and operational analysis. (In Dutch: "Statistiek en
operationele analyse.") Delft University of Technology, Delft, 1993.
2.5. NOWAK, A.S. and R.K. COLLINS, Reliability of Structures. McGraw-Hill, 2000.
2.6. SCHNEIDER, J. and H.P. SCHLATTER, Sicherheit und Zuverlässigkeit im Bauwesen. Verlag
der Fachvereine an den schweizerischen Hochschulen und Techniken AG, Zürich, und B.G.
Teubner Verlag, Stuttgart, 1994.
2.7. VRIJLING, J.K. and P.H.A.J.M. van GELDER, Probabilistic design in Hydraulic engineering.Lecture notes CT5310, 2002.