50
1 Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved. Estimates and Sample Sizes

Estimates and Sample Sizes

  • Upload
    redford

  • View
    40

  • Download
    0

Embed Size (px)

DESCRIPTION

Estimates and Sample Sizes. But, first. Example 1:. In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming. Q: What is the proportion of the adult population that believe in global warming? - PowerPoint PPT Presentation

Citation preview

Page 1: Estimates and Sample Sizes

1Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Estimates and Sample Sizes

Page 2: Estimates and Sample Sizes

2Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

But, first

Page 3: Estimates and Sample Sizes

3Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 1:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

Q: What is the proportion of the adult population that believe in global warming?

Notation: p is the population proportion (an unknown parameter).

is the sample proportion (computed). From the poll data = 0.70.

Apparently, 0.70 will be the best estimate of the proportion of all adults who believe in global warming.

p̂p̂

pg 329

Page 4: Estimates and Sample Sizes

4Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Definition

A point estimate is a single value (or point) used to approximate a population parameter.

pg 329

Page 5: Estimates and Sample Sizes

5Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

The sample proportion p is the best point estimate of the population proportion p.

ˆ

Definition

Page 6: Estimates and Sample Sizes

6Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example (continued)

We say that 0.70, or 70% is the best point estimate of the proportion of all adults who believe in global warming.

But how reliable (accurate) is this estimate?We will see that its margin of error is 2.3%. This

means the true proportion of adults who believe in global warming is between 67.7% and 72.3%. This gives an interval (from 67.7% to 72.3%) containing the true (but unknown) value of the population proportion.

Page 7: Estimates and Sample Sizes

7Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

What Is the Problem?

We have a point estimate, but we don’t

know how good it is.

We need to have a range of values.

Page 8: Estimates and Sample Sizes

8Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Definition

A confidence interval (or interval estimate) is a range (or an interval) of values used to estimate the true value of a population parameter.

A confidence interval is sometimes abbreviated as CI.

pg 329

Page 9: Estimates and Sample Sizes

9Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Notation for Critical Value

α = Greek letter alpha

Usually the confidence interval is 1- α

Page 10: Estimates and Sample Sizes

10Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

A confidence level is the probability 1 – (often expressed as the equivalent percentage value) that the confidence interval actually does contain the population parameter.

The confidence level is also called degree of confidence, or the confidence coefficient.

Most common choices are 90%, 95%, or 99%.

( = 10%), ( = 5%), ( = 1%)

Definition

pg 330

Page 11: Estimates and Sample Sizes

11Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

The sample proportion = 0.70 is the best estimate of the population proportion p.

A 95% confidence interval for the unknown population parameter is

0.677 < p < 0.723

What does it mean, exactly?

Example (continued)

Page 12: Estimates and Sample Sizes

12Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

We are 95% confident that the interval from 0.677 to 0.723 actually does contain the true value of the population proportion p.

This means that if we were to select many different samples of size 1501 and construct the corresponding confidence intervals, then 95% of them would actually contain the value of the population proportion p.

Interpreting a Confidence Interval

Correct

pg 330

Page 13: Estimates and Sample Sizes

13Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

There is a 95% chance that the true value of p will fall between .0677 and .0723.

It would also be incorrect to say that 95% of sample proportions fall between .0677 and .0723

Interpreting a Confidence Interval

Incorrect

pg 330

Page 14: Estimates and Sample Sizes

14Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Know the correct interpretation of a confidence interval.

It is wrong to say “the probability that the population parameter belongs to the confidence interval is 95%”

because the population parameter is not a random variable, it does not change its value.

Caution

Page 15: Estimates and Sample Sizes

15Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

A confidence level of 95% tells us that the process we are using will result in confidence level limits that contain the true population proportion 95% of the time.

We expect that 19 out of 20 samples should contain confidence levels that contain the true value of p

Caution

Page 16: Estimates and Sample Sizes

16Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Do not confuse two percentages: the proportion may be represented by percents (like 70% in the example), and the confidence level may be represented by percents (like 95% in the example).

Proportion may be any number from 0% to 100%.

Confidence level is usually 90% or 95% or 99%.

Caution

Page 17: Estimates and Sample Sizes

17Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

p – E < p < + E

p + E

p ˆ ˆ

Confidence Interval for a Population Proportion p

ˆ

(p – E, p + E)ˆ ˆ

Page 18: Estimates and Sample Sizes

18Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Finding the Point Estimate and E from a Confidence Interval

Margin of Error:

E = (upper confidence limit) — (lower confidence limit)

2

Point estimate of p:

p = (upper confidence limit) + (lower confidence limit)

ˆ

Page 19: Estimates and Sample Sizes

19Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Next we learn how to construct confidence

intervals

pg 331

Page 20: Estimates and Sample Sizes

20Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Critical ValuesA z score can be used to distinguish between sample statistics that are likely to occur and those that are unlikely to occur. Such a z score is called a critical value.

The standard normal distribution is divided into three regions: middle part has area 1-α and two tails (left and right) have area α/2 each:

pg 331

Page 21: Estimates and Sample Sizes

21Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Critical ValuesThe z scores separate the middle interval (likely values)

from the tails (unlikely values). They are zα/2 and – zα/2 , found from Table A-2.

Page 22: Estimates and Sample Sizes

22Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

DefinitionA critical value is the number on the borderline separating sample statistics that are likely to occur from those that are unlikely to occur.

Page 23: Estimates and Sample Sizes

23Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Notation for Critical Value

The critical value z/2 separates an area of /2 in the right tail of the standard normal distribution. The value of –z/2

separates an area of /2 in the left tail.

The subscript /2 is simply a reminder that the z score separates an area of /2

in the tail.

Page 24: Estimates and Sample Sizes

24Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Finding zα/2

pg 332

Confidence α zα/2

Level

90% 0.10 1.64595% 0.05 1.9699% 0.01 2.575

Page 25: Estimates and Sample Sizes

25Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Finding zα/2 for a 95% Confidence Level

- zα/2zα/2

Critical Values

α/ 2 = 2.5% = .025α = 5%

pg 332

Page 26: Estimates and Sample Sizes

26Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Definition

Margin of error, denoted by E, is the maximum likely

difference (with probability 1 – , such as 0.95)

between the observed proportion and the true

value of the population proportion p.

The margin of error E is also called the maximum

error of the estimate.

pg 333

Page 27: Estimates and Sample Sizes

27Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Margin of Error for Proportions

2

ˆ ˆpqE z

n

Formula 7-1, pg 333

Page 28: Estimates and Sample Sizes

28Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Notation

p = sample proportion

n = number of sample values

E = margin of error

q = 1 – p^ ^

^

Page 29: Estimates and Sample Sizes

29Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Confidence Interval for a Population Proportion p

p – E < < + Eˆ p ˆ

p

where

2

ˆ ˆpqE z

n

Page 30: Estimates and Sample Sizes

30Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

p – E < < + E

p + E

p p ˆ ˆ

Confidence Interval for a Population Proportion p

ˆ

(p – E, p + E)ˆ ˆ

Also written as:

Also written as:

Page 31: Estimates and Sample Sizes

31Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Finding the Point Estimate and E from a Confidence Interval

Margin of Error:

E = (upper confidence limit) — (lower confidence limit)

2

Point estimate of p:

p = (upper confidence limit) + (lower confidence limit)

ˆ

Page 32: Estimates and Sample Sizes

32Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Round-Off Rule for Confidence Interval Estimates of p

Round the confidence interval limits for p to

three significant digits.

Page 33: Estimates and Sample Sizes

33Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 3:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

a.Find the margin of error E that corresponds to a 95% confidence level.

b.Find the 95% confidence interval estimate of the population proportion p.

c.Based on the results, can we safely conclude that the majority of adults believe in global warming?

d.Assuming that you are a newspaper reporter, write a brief statement that describes the results accurately, with all relevant information.

pg 334

Page 34: Estimates and Sample Sizes

34Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 3:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

a.Find the margin of error E that corresponds to a 95% confidence level.

Zα/2 = 1.96 (Slide 24)

p=.70

q=.30

n=1501

E = 1.96 * sqrt(.70*.30/1501) = .023183 Slide 27

pg 334

Page 35: Estimates and Sample Sizes

35Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 3:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

b. Find the 95% confidence interval estimate of the population proportion p.

pg 334

p – E < p < p + E

.70 - .023183 < p < .70 + .023183

.677 < p < .723

ˆ ˆ

Page 36: Estimates and Sample Sizes

36Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 3:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

c.Based on the results, can we safely conclude that the majority of adults believe in global warming?

Based on the confidence interval obtained in b. it appears that the proportion of adults who believe in global warming is greater than .5, so we can conclude that the majority of adults believe in global warming.

pg 334

Page 37: Estimates and Sample Sizes

37Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 3:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

c.Based on the results, can we safely conclude that the majority of adults believe in global warming?

Yes.

pg 334

Page 38: Estimates and Sample Sizes

38Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example 3:

In a recent poll, 70% of 1501 randomly selected adults said they believed in global warming.

d.Assuming that you are a newspaper reporter, write a brief statement that describes the results accurately, with all relevant information.70% of U.S. adults believe that the earth is getting warmer, based on a Pew Research Center poll of 1501 randomly selected adults in the US. In theory, in 95% of such polls, the percentage should differ by no more than 2.3% in either direction from the percentage that would be found by interviewing all adults in the U.S.

pg 334

Page 39: Estimates and Sample Sizes

39Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

• Press STAT and select TESTS

• Scroll down to 1-PropZInt

• and press ENTER

• Type in x: (number of successes)

• n: (number of trials)

• C-Level: (confidence level)

• Press on Calculate

• Read the confidence interval (…..,..…)

• and the point estimate p=…

Confidence Intervals by TI-83/84

^

Page 40: Estimates and Sample Sizes

40Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Confidence Intervals by Excel

Example 3, pg 334. Enter data

Page 41: Estimates and Sample Sizes

41Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Confidence Intervals by Excel

Enter formula

Page 42: Estimates and Sample Sizes

42Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Confidence Intervals by Excel

Page 43: Estimates and Sample Sizes

43Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Sample Size

pg 336

Page 44: Estimates and Sample Sizes

44Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Sample SizeSuppose we want to collect sample data in order to estimate some population proportion. The question is how many sample items must be obtained?

Page 45: Estimates and Sample Sizes

45Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Determining Sample Size

(solve for n by algebra)

Page 46: Estimates and Sample Sizes

46Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Sample Size for Estimating Proportion p

When an estimate of p is known: ˆ

When no estimate of p is known:ˆ

Page 47: Estimates and Sample Sizes

47Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Round-Off Rule for Determining Sample Size

If the computed sample size n is not a whole number, round the value of n up to the next larger whole number.

Examples: n=310.67 round up to 311 n=310.23 round up to 311 n=310.01 round up to 311

pg 336

Page 48: Estimates and Sample Sizes

48Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

Example:

A manager for E-Bay wants to determine the current percentage of U.S. adults who now use the Internet.

How many adults must be surveyed in order to be 95% confident that the sample percentage is in error by no more than three percentage points?

a) In 2006, 73% of adults used the Internet.b) No known possible value of the proportion.

Example 4, pg 336

Page 49: Estimates and Sample Sizes

49Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

a) Use

To be 95% confident that our sample percentage is within three percentage points of the true percentage for all adults, we should obtain a random sample of 842 adults.

Example:

2

ˆ ˆ ˆ0.73 and 1 0.27

0.05 so 1.96

0.03

p q p

z

E

Page 50: Estimates and Sample Sizes

50Copyright © 2010, 2007, 2004 Pearson Education, Inc. All Rights Reserved.

b) Use

This time we don’t know the sample proportion

To be 95% confident that our sample percentage is within three percentage points of the true percentage for all adults, we should obtain a random sample of 1068 adults.

Example: