Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved.9 | 1 Chapter 9...

Preview:

Citation preview

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 1

Chapter 9 Hypothesis Testing

• Introduction to Statistical Tests• Testing the Mean µ• Testing a Proportion p• Tests Involving Paired Differences• Testing µ1-µ2 and p1-p2

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 2

9.1 Introduction to Statistical Tests

• We can draw inference on a population parameter in two ways:

1) Estimation (Chapter 8)

2) Hypothesis Testing (Chapter 9)

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 3

Hypothesis Testing

• In essence, hypothesis testing is the process of making decisions about the value of a population parameter.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 4

Establishing the Hypotheses

• Null Hypothesis: A hypothesis about the parameter in question that often denotes a theoretical value, an historical value, or a production specification.– Denoted as H0

• Alternate Hypothesis: A hypothesis that differs from the null hypothesis, such that if we reject the null hypothesis, we will accept the alternate hypothesis.– Denoted as H1 (in other sources HA).

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 5

Hypotheses Restated

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 6

Example

A car manufacturer advertises that its new subcompact models get 47 miles per gallon (mpg). Let µ be the mean of the mileage distribution for these cars. You assume that the manufacturer will not underrate the car, but you suspect that the mileage might be overrated.

(a) What shall we use for H0?(b) What shall we use for H1?

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 7

Types of Tests

• The null hypothesis is always a statement of equality.– H0: μ = k, where k is a specified value

• The alternate hypothesis states that the parameter (μ, p) is less than, greater than, or not equal to a specified value.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 8

Types of Tests

• Left-Tailed Tests: H1: μ < k

H1: p < k

• Right-Tailed Tests: H1: μ > k

H1: p > k

• Two-Tailed Tests: H1: μ ≠ k

H1: p ≠ k

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 9

Hypothesis Testing Procedure

1) Select appropriate hypotheses.2) Draw a random sample.3) Calculate the test statistic.4) Assess the compatibility of the test statistic

with H0.5) Make a conclusion in the context of the

problem.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 10

Hypothesis Test of μx is Normal, σ is Known

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 11

ExampleRosie is an aging sheep dog in Montana who get regular check-ups from her owner, the local veterinarian. Let x be a random variable that represents Rosie’s resting heart rate (in beats per minute). From past experience, the vet knows that x has a normal distribution with σ=12. The vet checked the Merck Veterinary Manual and found that for dogs of this breed, µ=115 beats per minute. Over the past six weeks, Rosie’s heart rate (beats/min) measured 93 109 110 89 112 117The sample mean is =105. The vet is concerned that Rosie’s heart rate may be slowing. Do the data indicate that this is the case?

x

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 12

P-Value

P-values are sometimes called the probability of chance.

Low P-values are a good indication that your test results are not due to chance.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 13

P-Value for Left-Tailed Test

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 14

P-Value for Right-Tailed Test

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 15

P-Value for Two-Tailed Test

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 16

Types of Errors in Statistical Testing

• Since we are making decisions with incomplete information (sample data), we can make the wrong conclusion!!

– Type I Error: Rejecting the null hypothesis when the null hypothesis is true.

– Type II Error: Accepting the null hypothesis when the null hypothesis is false.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 17

Errors in Statistical Testing

• Unfortunately, we usually will not know when we have made an error!!

• We can only talk about the probability of making an error.

• Decreasing the probability of making a type I error will increase the probability of making a type II error (and vice versa).

• We can only decrease the probability of both types of errors by increasing the sample size (obtain more information), but this may not be feasible in practice.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 18

Type I and Type II Errors

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 19

Level of Significance

• Good practice is to specify in advance the level of type I error we are willing to risk.

• The probability of type I error is the level of significance for the test, denoted by α (alpha).

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 20

Type II Error

• The probability of making a type II error is denoted by β (Beta).

• 1 – β is called the power of the test.

– 1 – β is the probability of rejecting H0 when H0 is false (a correct decision).

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 21

The ProbabilitiesAssociated with Testing

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 22

Concluding a Statistical Test

For our purposes, significant is defined as follows:

At our predetermined α level of risk, the evidence against H0 is sufficient to discredit H0. Thus we adopt H1.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 23

Statistical Testing Comments

• In most statistical applications, α = 0.05 or α = 0.01 is used

• When we “accept” the null hypothesis, we are not proving the null hypothesis to be true. We are only saying that the sample evidence is not strong enough to justify the rejection of the null– Some statisticians prefer to say “fail to reject

the null” rather than “accept the null.”

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 24

Interpretation of Testing Terms

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 25

Testing µ When σ is Known

1) State the null hypothesis, alternate hypothesis, and level of significance.

2) If x is normally distributed, any sample size will suffice. If not, n ≥ 30 is required.

Calculate:

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 26

Testing µ When σ is Known

3) Use the standard normal table and the type of test (one or two-tailed) to determine the P-value.

4) Make a statistical conclusion:

If P-value ≤ α, reject H0.

If P-value > α, do not reject H0.5) Make a context-specific conclusion.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 27

ExampleThe Environmental Protection Agency has been studying Miller Creek regarding ammonia nitrogenconcentration. For many years, the concentration has been 2.3 mg/l. However, a new golf course and housing developments are raising concern that the concentration may have changed because of lawn fertilizer. Any change (either an increase or a decrease) in the ammonia nitrogen concentration can affect plant and animal life in and around the creek (Reference: EPA Report 832-R-93-005). Let X be a random variable representing ammonia nitrogen concentration (in mg/l). Based on recent studies of Miller Creek, we may assume that x has a normal distribution with σ=0.3. Recently, a

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 28

random sample of eight water tests from the creek gave the following x values2.1 2.5 2.2 2.8 3.0 2.2 2.4 2.9The sample mean is =2.51Let us construct a statistical test to examine the claim that the concentration of ammonia nitrogenhas changed from 2.3 mg/l. Use level of significance α =0.01(a)What is the null hypothesis? What is the alternate hypothesis? What is the level of significance?(b)Is this a right-tailed, left-tailed, or two-tailed test?

x

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 29

Example

(c) What is the sample test statistic? Convert thesample mean to a standard z value.(d) Draw a sketch showing the P-value area on thestandard normal distribution. Find the P-value(e) Compare the level of significance and the P-value. What is your conclusion?

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 30

9.2 Testing µ When σ is Unknown

1) State the null hypothesis, alternate hypothesis, and level of significance.

2) If x is normally distributed (or mound-shaped), any sample size will suffice. If not, n ≥ 30 is required. Calculate:

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 31

Testing µ When σ is Unknown

3) Use the Student’s t table and the type of test (one or two-tailed) to determine (or estimate) the P-value.

4) Make a statistical conclusion:

If P-value ≤ α, reject H0.

If P-value > α, do not reject H0.5) Make a context-specific conclusion.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 32

Using Table 6 to Estimate P-Values

Suppose we calculate t = 2.22 for a one-tailed test from a sample size of 6.

Thus, df = n – 1 = 5.We obtain: 0.025 < P-Value < 0.050

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 33

Testing µ Using theCritical Value Method

• The values of that will result in the rejection of the null hypothesis are called the critical region of the distribution

• When we use a predetermined significance level α, the Critical Value Method and the P-Value Method are logically equivalent.

x

x

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 34

Critical Regions for H0: µ = k

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 35

Critical Regions for H0: µ = k

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 36

Critical Regions for H0: µ = k

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 37

Testing µ When σ is Known (Critical Region Method)

1) State the null hypothesis, alternate hypothesis, and level of significance.

2) If x is normally distributed, any sample size will suffice. If not, n ≥ 30 is required.

Calculate:

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 38

Testing µ When σ is Known (Critical Region Method)

3) Show the critical region and critical value(s) on a graph (determined by the alternate hypothesis and α).

4) Conclude in favor of the alternate hypothesis if z is in the critical region.

5) State a conclusion within the context of the problem.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 39

Left-Tailed Tests

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 40

Right-Tailed Tests

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 41

Two-Tailed Tests

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 42

9.3 Testing a Proportion p

• Test assumptions:r is a binomial variablen is the number of independent trialsp is the probability of success on each trialnp > 5 and n(1-p) > 5

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 43

Types of Proportion Tests

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 44

The Distributionof the Sample Proportion

n

ppp

)1( and

: withnormalely approximat isn

r p̂ ofon distributi theRecall

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 45

Converting the Sample Proportion to z

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 46

Testing p

1) State the null hypothesis, alternate hypothesis, and level of significance.

2) Check np > 5 and nq > 5 (recall q = 1 – p). Compute:

p = the specified value in H0

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 47

Testing p

3) Use the standard normal table and the type of test (one or two-tailed) to determine the P-value.

4) Make a statistical conclusion:

If P-value ≤ α, reject H0.

If P-value > α, do not reject H0.5) Make a context-specific conclusion.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 48

Using the Critical Value Method for p

• As when testing for means, we can use the critical value method when testing for p.

• Use the critical value graphs exactly as when testing µ.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 49

Critical Thinking: Issues Related to Hypothesis Testing

• Central question – Is the value of sample test statistics too far away from the value of the population parameter proposed in H0 to occur by chance alone? – P-value tells the probability for that to occur by chance alone.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 50

Critical Thinking: Issues Related to Hypothesis Testing

• If the P-value is so close to α, then we might attempt to clarify the results by

- Increasing the sample size - controlling the experiment to reduce the

standard deviation.• How reliable is the study and the

measurements in the sample? – consider the source of the data and the reliability of the organization doing the study.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 51

9.4 Tests Involving Paired Differences

• Data pairs occur naturally in many settings:– Before and after measurements on the same

observation after a treatment.– Be sure to have a definite and uniform

method for creating pairs of data points.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 52

Advantages to Using Paired Data

• Reduces the danger of extraneous or uncontrollable variables

• Theoretically reduces measurement variability.• Increases the accuracy of statistical

conclusions.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 53

Testing for Differences

• We take the difference between each pair of data points.– Denoted by d

• We then test the average difference against the Student’s t distribution.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 54

Hypotheses for Differences

H0: µd = 0

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 55

Sample Test Statistic for Differences

with

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 56

Finding the P-Value

• Just as in the test for µ when σ is unknown, use Table 6 to estimate the P-Value of the test.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 57

Testing d

1) State the null hypothesis, alternate hypothesis, and level of significance.

2) If you can assume d is normal (mound-shaped), any sample size will do. If not, make sure n ≥ 30. Calculate:

df = n-1

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 58

Testing d

3) Use the Student’s t table and the type of test (one or two-tailed) to determine (or estimate) the P-value.

4) Make a statistical conclusion:

If P-value ≤ α, reject H0.

If P-value > α, do not reject H0.5) Make a context-specific conclusion.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 59

9.5 Testing the Differences Between Independent Samples

• Many practical applications involve testing the difference between population means or population proportions.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 60

Testing µ1 - µ2 when σ1, σ2 are Known

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 61

Hypotheses for Testing µ1 - µ2 when σ1, σ2 are Known

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 62

Testing µ1 - µ2 when σ1, σ2 are Known

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 63

Testing µ1 - µ2 when σ1, σ2 are Known

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 64

Testing µ1 - µ2 when σ1, σ2 are Known

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 65

Testing µ1 - µ2

when σ1, σ2 are Unknown

• Just as in the one-sample test for the mean, we will resort to the Student’s t distribution and proceed in a similar fashion.

– Remark: in practice, the population standard deviation will be unknown in most cases .

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 66

Testing µ1 - µ2

when σ1, σ2 are Unknown

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 67

Testing µ1 - µ2

when σ1, σ2 are Unknown

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 68

Testing µ1 - µ2

when σ1, σ2 are Unknown

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 69

Deciding Which Testto Use for H0: µ1 - µ2

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 70

Testing for a Differencein Proportions, p1 – p2

• Suppose we have two independent binomial experiments.

• We would like to test if the two population proportions are equal.

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 71

Testing for a Difference in Proportions

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 72

Testing for a Difference in Proportions

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 73

Testing for a Difference in Proportions

• The test statistic is as follows:

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 74

The Test Procedure

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 75

The Test Procedure

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 76

The Test Procedure

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 77

The Test Procedure

Copyright © Houghton Mifflin Harcourt Publishing Company. All rights reserved. 9 | 78

Critical Regions For Tests of Differences

Recall, our emphasis is on the P-Value method. Most scientificstudies use this technique. Also, for a fixed α-level test, the

methodsare equivalent and lead to identical results.

Recommended