Interval Estimation of the Population Mean for a Normal Population with s Unknown

Preview:

DESCRIPTION

Interval Estimation of the Population Mean for a Normal Population with s Unknown. Student’s t-distribution. If s is not known and n 30, the derivation of the confidence interval must be changed slightly. - PowerPoint PPT Presentation

Citation preview

10 - 1

Interval Estimation of the Interval Estimation of the Population Mean for a Normal Population Mean for a Normal Population with Population with Unknown Unknown

10 - 2

Student’s t-distribution

• If is not known and n 30, the derivation of the confidence interval must be changed slightly.

• Provided the population from which the sample is drawn is normally distributed, the distribution of the quantity

has a Student’s t-distribution, which was discovered by W. S. Gossett in 1908.

t (x - )sn

10 - 3

Degrees of Freedom

• The t-distribution has one parameter, degrees of freedom.

• Degrees of freedom for any t-distribution is computed in the following manner

d.f. = n - 1,

where n is the number of sample observations.

10 - 4

Shape of the t-distribution

• The t-distribution is very much like the normal; it is a symmetrical, bell-shaped distribution with slightly larger tails than a normal.

• In fact, as the degrees of freedom become larger, the t-distribution approaches the normal distribution.

normal

t

10 - 5

Confidence Interval for the Population Mean

Definition:Definition:If n 30, is unknown, and the sample is drawn from a normal population, a (1 - ) confidence interval for the population mean is given by

where is the critical value for a t-distribution with n - 1 degrees of freedom which captures an area of /2 in the right tail of the distribution.

x t sn2 , n - 1

t2 , n - 1

10 - 6

Example 5

Construct an 80% confidence interval for the mean of a normal population assuming that the values listed below comprise a random sample taken from the population.

83.9 87.4 65.2 86.0 73.1 80.3 92.7 87.5 69.3 77.5 91.9 71.1 79.1 72.4 88.2

The population standard deviation is unknown.

10 - 7

Example 5 - Solution

We are given:

X has a normal distribution,

the variance is unknown,

n = 15,

and the confidence level = .80.

10 - 8

Example 5 - Solution

2

.202

.10 1 - = .80 = .20

t 1 .345,.10 14

.40

.10

.40

.10

.80

10 - 9

Example 5 - Solution

We must calculate = 80.37 and s = 8.68. We want to determine an 80% confidence interval for the mean.

Since X is normal, is unknown and n 30, the confidence interval is given by

x

x t sn

80.37 1.345 8.68152 , n 1

80.37 3.01

( 77 .36 , 83 .38 ) .

10 - 10

Precision and Sample Precision and Sample SizeSize

10 - 11

Precision

• The best interval estimates a decision maker could hope for would be those which are very small in size and possess a large amount of confidence.

• The width of the confidence interval defines the precision with which the population mean is estimated; the smaller the interval the greater the precision.

10 - 12

Components Which Affect the Width

Note: The entire confidence interval width = 2 .

represents the distance the confidence interval boundary is from the mean in standard deviation units. The distance is related to the specified level of confidence.

represents the population standard deviation.

n represents the sample size.

zn

2

z2

10 - 13

Which component can change?

• Since the population standard deviation () is a constant, it does not change.

• However, the sample size, n, is selected by the decision maker.

• Since the sample size can enlarge or reduce the width of the confidence interval, how large should the sample be?

10 - 14

Error

• The sample size should be selected in relation to the size of the maximum positive or negative error the decision maker is willing to accept.

• This can be achieved by setting the error equal to one half the confidence interval width,

error = zn

2 .

10 - 15

Determining the Sample Size

• The equation for error can be solved for the sample size,

• By selecting a level of confidence and the maximum error, the relationship can be used to determine the sample size necessary to estimate the sample mean with the desired accuracy.

• In order to assure the desired level of confidence, always round the value obtained for the sample size up to the next whole integer.

n =zerror

2

2

.

10 - 16

Example 6

A computer software company would like to estimate how long it will take a beginner to become proficient at creating a graph using their new spreadsheet package.

10 - 17

Example 6

• Past experience has indicated that the time required for a beginner to become proficient with a particular function of new software products has an approximately a normal distribution with a standard deviation of 15 minutes.

• Find the sample size necessary to estimate the average time required for a beginner to become proficient at creating a graph with the new spreadsheet package to within 5 minutes with 95% confidence.

10 - 18

Example 6 - Solution

We are given:

= 15,

error = E = 5,

and the confidence level = .95.

10 - 19

Example 6 - Solution

z 1.96.025

.475

.025

1 - = .95

2

.052

.025 = .05

10 - 20

Example 6 - Solution

We want to determine the sample size necessary to estimate the mean time required for a beginner to become proficient at creating a graph with the new spreadsheet package.

The sample size is given by

To get the desired accuracy, we must round up to 35.

n z

E1.96(15)

534.5744. [ ] [ ]

2

2 2

10 - 21

Sample Size for Unknown

• The most obvious method for obtaining an estimate of is to take a small sample and use the sample standard deviation as an estimate of the population standard deviation.

• Replacing with s in the sample size determination relationship will provide an initial estimate of the required sample size.

10 - 22

Estimating Population Estimating Population AttributesAttributes

10 - 23

Attribute

• An attribute is a characteristic that members of a population either do or do not possess.

• Attributes are almost always measured as the proportion of the population that possess the characteristic.

• Example: An attribute of a person would be

whether they smoke cigarettes or not.

p = proportion of the population which smoke cigarettes

10 - 24

Estimating the Proportion

• Estimating the proportion of the population that possesses an attribute is straightforward.

• A random sample is selected and the sample proportion is computed as follows:

X = number in the sample that possess the attribute,

n = sample size, and

p Xn.

10 - 25

Example 7

The Richland Gazette, a local newspaper, conducted a poll of 1,000 randomly selected readers to determine their views concerning the city's handling of snow removal. The paper found that 650 people in the sample felt the city did a good job.

10 - 26

Example 7

Compute the best point estimate for the percentage of readers who believe the city is doing a good job in snow removal.

10 - 27

Example 7 - Solution

X = number of people who believe the city is doing a good job

X = 650

n = number of randomly selected readers

n = 1000

p Xn 6501000

.65

10 - 28

Interval Estimation of a Interval Estimation of a Population AttributePopulation Attribute

10 - 29

Sampling Distribution of the Point Estimate

• In order to develop the confidence interval for a population proportion the sampling distribution of the point estimate must be developed.

• The random variable, , has a binomial distribution which is approximated with a normal random variable.

p

10 - 30

The Sample Proportion

The sample proportion, , is distributed normally with mean, p, and variance, .

p

p(1 p)n

p

10 - 31

The Sample Proportion

• The standard deviation of the sample proportion ( ) is denoted symbolically as and is given by

• where is used as an estimate of p if the population proportion is unknown.

p p

p p(1 p)

np(1 p)

n

p

10 - 32

Confidence Interval for the Population Proportion

Definition:Definition:If the sample size is sufficiently large, np 5 and n(1-p) 5, the 1 - confidence interval for the population proportion is given by the expression

where is the distance from the point estimate to the end of theinterval in standard deviation units,and is the standard deviation of . p

p

p z

2 p

z2

10 - 33

Example 8

The Peacock Cable Television Company thinks that 40% of their customers have more outlets wired than they are paying for.

A random sample of 400 houses reveals that 110 of the houses have excessive outlets.

10 - 34

Example 8

Construct a 99% confidence interval for the true proportion of houses having too many outlets.

Do you feel the company is accurate in its belief about the proportion of customers who have more outlets wired than they are paying for?

10 - 35

Example 8 - Solution

We are given:

X = number of houses that have excessive outlets

= 110,

n = 400,

and the confidence level = .99.

Thus, p 110400

.275 .

10 - 36

Example 8 - Solution

z 2 575.005 .

1 - = .99

2

.012

.005 = .01

.495

.005

10 - 37

Example 8 - Solution

A 99% confidence interval for the true proportion of houses having too many outlets is given by

.275 .0575

(.2175 , .3325 ) .

p z p (1 p)n

2

.275 2.575 .275 (1 .275 )400

10 - 38

Example 8 - Solution

• Interpretation: We are 99% confident that the true proportion of houses having too many outlets is between .2175 and .3325.

• Note: We do not believe that 40% of customers have more outlets wired than they are paying for because we are 99% confident that the true proportion is in the interval 21.75% to 33.25% and 40% is not in that interval.

10 - 39

Precision and Sample Size Precision and Sample Size for Population Attributesfor Population Attributes

10 - 40

Accuracy in Estimating a Population Proportion

• Just as for the population mean, a specific level of accuracy in estimating a population proportion is desirable.

• When we estimate extremely small quantities, highly precise estimates are necessary.

10 - 41

Deriving the Sample Size

• The technique for deriving the sample size parallels the discussion of the sample mean.

• Setting one half the entire width of the confidence interval equal to the maximum allowable error yields

error =

• Solving for n yields

n =

z z p (1 p)n .

2 2p

z p (1 p)error

.2

2

2

10 - 42

Sample Size when is Known

• Generally the population proportion is unknown and is estimated from a pilot study.

• In this case, the sample size necessary to estimate the population proportion to within a particular error with a certain level of confidence is given by

n =

where is the estimate obtained from the pilot study.

z p (1 p)error

,2

2

2

p

p

10 - 43

Sample Size when is Unknown

• If an estimate of the population proportion is not available, then the population proportion is set equal to .5 and the sample size is given by

n =

• The value .5 maximizes the quantity p(1 - p) and thus provides the most conservative estimate of the sample size.

z .5 (1 .5)error

.2

2

2

p

10 - 44

Example 9

Researchers working in a remote area of Africa feel that 40% of the families in the area are without adequate drinking water either through contamination or unavailability.

What sample size will be necessary to estimate the percent without adequate water to within 5% with 95% confidence?

10 - 45

Example 9 - Solution

We are given:

,

error = E = .05,

and the confidence level = .95.

p .40

10 - 46

Example 9 - Solution

z 1.96.025

.475

.025

1 - = .95

2

.052

.025 = .05

10 - 47

Example 9 - Solution

We want to determine the sample size necessary to estimate the proportion of families in the area without adequate drinking water. Since we have an estimate of , the sample size is given by

To get the desired accuracy, we must round up to 369 families.

p

nz p (1 p)

E

2

2

2

368.79.

= (1.96) .40 (1 .40)(.05)

2

2

Recommended