24
President University Erwin Sitompul PBST 11/1 Lecture 11 Probability and Statistics Dr.-Ing. Erwin Sitompul President University http://zitompul.wordpress.com 2 0 1 3

Sampling Distribution of S 2

  • Upload
    vern

  • View
    89

  • Download
    0

Embed Size (px)

DESCRIPTION

Chapter 8.6. Sampling Distribution of S 2. Sampling Distribution of S 2. If S 2 is the variance of a random sample of size n taken from a normal population having the variance σ 2 , then the statistic. has a chi-squared distribution with v = n – 1 degrees of freedom. Chapter 8.6. - PowerPoint PPT Presentation

Citation preview

Page 1: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/1

Lecture 11

Probability and Statistics

Dr.-Ing. Erwin SitompulPresident University

http://zitompul.wordpress.com

2 0 1 3

Page 2: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/2

Sampling Distribution of S2

If S2 is the variance of a random sample of size n taken from a normal population having the variance σ2, then the statistic

Chapter 8.6 Sampling Distribution of S2

222

2 21

( 1) ni

i

X Xn S

has a chi-squared distribution with v = n – 1 degrees of freedom.

Page 3: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/3

Sampling Distribution of S2

Table A.5 gives values of for various values of α and v. The column headings are the areas α. The left column shows the degrees of freedom. The table entry are the value.

Chapter 8.6 Sampling Distribution of S2

2

2

Page 4: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/4

Table A.5 Chi-Squared DistributionChapter 8.6 Sampling Distribution of S2

Page 5: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/5

Table A.5 Chi-Squared DistributionChapter 8.6 Sampling Distribution of S2

Page 6: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/6

Sampling Distribution of S2

Chapter 8.6 Sampling Distribution of S2

A manufacturer of car batteries guarantees that his batteries will last, on the average, 3 years with a standard deviation of 1 year. If five of these batteries have lifetimes of 1.9, 2.4, 3.0, 3.5, and 4.2 years, is the manufacturer still convinced that his batteries have a standard deviation of 1 year? Assume that the battery lifetime follows a normal distribution.

2

2

1 12

( 1)

n n

i ii i

n x x

sn n

2(5)(48.26) (15)

(5)(4)

0.815

22

2

( 1)n s

3.26

From the table, 95% of the χ2 values with 4 degrees of freedom fall between 0.484 and 11.143.

The computed value with σ2 = 1 is reasonable.

The manufacturer has no reason to doubt the current standard deviation.

(4)(0.815)

(1)

Page 7: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/7

t-Distribution In the previous section we discuss the utility of the Central Limit

Theorem to infer a population mean or the difference between two population means.

These utility is based on the assumption that the population standard deviation is known.

However, in many experimental scenarios, knowledge of σ is not reasonable than knowledge of the population mean μ.

Chapter 8.7 t-Distribution

Often, an estimate of σ must be supplied by the same sample information that produced the sample average x.

As a result, a natural statistic to consider to deal with inferences on μ is

_

XT

S n

Page 8: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/8

t-Distribution If the sample size is large enough, say n ≥ 30, the distribution of T

does not differ considerably from the standard normal. However, for n < 30, the values of S2 fluctuate considerably from

sample to sample and the distribution of T deviates appreciably from that of a standard normal distribution.

In the case that sample size is small, it is useful to deal with the exact distribution of T.

Chapter 8.7 t-Distribution

In developing the sampling distribution of T, we shall assume that the random sample was selected from a normal population,

2 2 ( 1)

X n ZT

V nS

XZ

n

2

2

( 1)n SV

where

Page 9: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/9

t-DistributionChapter 8.7 t-Distribution

Let Z be a standard normal random variable and V a chi-squared random variable with v degrees of freedom. If Z and V are independent, then the distribution of the random variable T, where

is given by the density function

ZT

V v

1 22( /1) 2( ) 1 ,

2

vv t

h t tvv v

This is known as the t-distribution with v degrees of freedom.

Page 10: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/10

t-DistributionChapter 8.7 t-Distribution

Let X1, X2,..., Xn be independent random variables that are all normal with mean μ and standard deviation σ. Let

Then the random variable has a t-distribution with v = n – 1 degrees of freedom.

22

1 1 1and

n nii

i i

X XXX S

n n

X

TS n

The shape of t-distribution curves for v = 2, 5, and ∞

Page 11: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/11

Table A.4 t-DistributionChapter 8.7 t-Distribution

It is customary to let tα represent the t-value above which we find an area equal to α.

The t-distribution is symmetric about a mean of zero, that is, t1–α = –tα

Page 12: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/12

Table A.4 t-DistributionChapter 8.7 t-Distribution

A t-value that falls below –t0.025 or above t0.025 would tend to make us believe that either a very rare event has taken place or perhaps our assumption about μ is in error.

Should this happen, we shall make the latter decision and claim that our assumed value of μ is in error.

Page 13: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/13

t-DistributionChapter 8.7 t-Distribution

The t-value with v = 14 degrees of freedom that leaves an area of 0.025 to the left, and therefore an area of 0.975 to the right, is

0.975 0.025t t 2.145

Find P(–t0.025 < T < t0.05).

1 0.05 0.025 0.925Area

0.025 0.05( ) 0.925P t T t

Page 14: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/14

t-DistributionChapter 8.7 t-Distribution

Find k such that P(k < T < –1.761) = 0.045, for a random sample of size 15 selected from a normal distribution and

0.05( ) 0.045 0.05P t T t t

xT

s n

From t-Distribution Table, the value 1.761 corresponds to t0.05 for v = 14. So, t–0.05 = –1.761.

0.005t t

( 2.977 1.761) 0.045P T

0.005t 2.977k

Page 15: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/15

t-DistributionChapter 8.7 t-Distribution

A chemical engineer claims that the population mean yield of a certain batch process is 500 grams per millimeter of raw material. To check this claim he samples 25 batches each month. If the computed t-value falls between –t0.05 and t0.05, he is satisfied with his claim.What conclusion should he draw from a sample that has a mean x = 518 gr/mm and a sample standard deviation s = 40 gr? Assume the distribution of yields to be approximately normal.

_

xt

s n

518 5002.25

40 25

From t-Distribution Table, the value 2.25 corresponds to α between 0.02 and 0.015.

This means, the probability of obtaining a mean of 518 gr/mm for a certain sample while the mean of population is 500 gr/mm is only approximately 2%.

It is more reasonable to assume that μ > 500. Hence, the manufacturer is likely to conclude that the process

produces a better product than he thought.

0.05 0.05( ) ( 1.711 1.711)P t T t P T

Page 16: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/16

F-Distribution If the t-distribution is motivated by the comparison between two

sample means, the F-distribution finds enormous application in comparing sample variances.

The statistic F is defined to be the ratio of two independent chi-squared random variables, each divided by its number of degrees of freedom. Hence, we can write

Chapter 8.8 F-Distribution

1

2

U vF

V v

where U and V are independent random variables having chi-squared distributions with v1 and v2 degrees of freedom, respectively.

Page 17: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/17

F-Distribution Let U and V be two independent random variables having chi-

squared distributions with v1 and v2 degrees of freedom, respectively. Then the distribution of the random variable is given by the density

Chapter 8.8 F-Distribution

1

2

U vF

V v

11

1 2

2 12 1

1 2 1 2

( ) 21 2 1 2

2, 0( ) 2 2 (1 )

0, elsewhere

vv

v v

v v v v ffh f v v v f v

This is known as the F-distribution with v1 and v2 degrees of freedom.

Table A.6 in the reference gives values of fα for α = 0.05 and α = 0.01 for various combinations of the degrees of freedom v1 and v2.

Table A.6 can also be used to find values of f0.95 and f0.99. The theorem will be presented later.

Page 18: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/18

Table A.6 F-Distribution

α = 0.05

Chapter 8.8 F-Distribution

Page 19: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/19

Table A.6 F-Distribution

α = 0.05

Chapter 8.8 F-Distribution

Page 20: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/20

Table A.6 F-Distribution

α = 0.01

Chapter 8.8 F-Distribution

Page 21: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/21

Table A.6 F-Distribution

α = 0.01

Chapter 8.8 F-Distribution

Page 22: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/22

F-DistributionChapter 8.8 F-Distribution

Typical F-distribution Tabulated values of the F-distribution

Writing fα(v1, v2) for fα with v1 and v2 degrees of freedom, we obtain

1 1 22 1

1( , )

( , )f v v

f v v

Page 23: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/23

F-Distribution with Two Sample Variances If and are the variances of independent random samples of

size n1 and n2 taken from normal populations with variances and , respectively, then

Chapter 8.8 F-Distribution

has an F-distribution with v1 = n1 – 1 and v2 = n2 – 1 degrees of freedom.

2 2 2 21 1 2 12 2 2 22 2 1 2

S SF

S S

21S 2

2S212

2

Page 24: Sampling Distribution of  S 2

President University Erwin Sitompul PBST 11/24

Homework 10AProbability and Statistics

1. A maker of a famous chocolate candies claims that their average calorie is 5 cal/g with a standard deviation of 1.2 cal/g. In a random sample of 8 candies of this famous brand, the calorie content was found to be 6, 7, 7, 3, 4, 5, 4, and 2 cal/g. Would you agree with the claim? Use Chi-squared distribution and assume a normal distribution.

(Wal8.852 ep.283)

2. A small cleaning service company obtains a contract proposal from a customer owning an office tower with 100 rooms. The company has only 5 workers. It needs to determine its profit margin by first finding out the time required by the workers to finish cleaning 100 rooms. The first estimation is that the workers would need 5.5 hours to clean the 100 rooms.The company starts a probation period for two week, while collecting data so that it can later charge the customer correctly. The data collected by the company can be seen on th next table.After collecting this data, the company wants to determine if the first estimation of 5.5 hours to finish cleaning 100 rooms was reasonable. If the computed t-value falls between –t0.025 and t0.025, the company would be satisfied and will stay with its first estimation. What is your opinion?

(Int.Rndvz ep.283)

Time to clean 100

rooms

5.5

7

6.4

4.5

3.9

7.1

5.6

5.8

7.8

4.6

4.5

5.5