33
MORE ABOUT TESTS Statistics

More about tests

  • Upload
    gayle

  • View
    72

  • Download
    0

Embed Size (px)

DESCRIPTION

Statistics. More about tests. Statistically significant – When the P-value falls below the alpha level, we say that the tests is “statistically significant” at the alpha level. - PowerPoint PPT Presentation

Citation preview

Page 1: More about tests

MORE ABOUT TESTSStatistics

Page 2: More about tests

TERMS Statistically significant– When the P-

value falls below the alpha level, we say that the tests is “statistically significant” at the alpha level.

Alpha level– The threshold P-value that determines when we reject a null hypothesis. If we observe a statistic whose P-value based on the null hypothesis is less than , we reject that null hypothesis.

α

Page 3: More about tests

TERMS Significance level– The alpha level is also

called the significance level, most often in the phrase such as a conclusion that a particular test is “significant at the 5% significant level.”

Critical value—The value in the sampling distribution model of the statistic whose P-value is equal to the alpha level. Any statistic value farther from the null hypothesis value than the critical value will have a smaller P-value than and will lead to rejecting the null hypothesis. The critical value is often denoted with an asterisk, as z*, for example.

α

Page 4: More about tests

TERMS Type I error– The error of rejecting a null

hypothesis when in fact it is true (also called a “false positive”). The probability of a Type I error is .

Type II error– The error of failing to reject a null hypothesis when in fact it is false (also called a “false negative”). The probability of a Type II error is commonly denoted and depends on the effect size.€

α

β

Page 5: More about tests

TERMS Power– The probability that a hypothesis

test will correctly reject a false null hypothesis is the power of the test. To find power, we must specify a particular alternative parameter value as the “true” value. For any specific value in the alternative, the power is .

Effect size—The difference between the null hypothesis value and true value of a model parameter is called the effect size.€

1−β

Page 6: More about tests

ZERO IN ON THE NULL Null hypothesis have special

requirements. The null must be a statement about the

value of a parameter for a model. This value is used to compute the

probability that the observed sample statistic– or something even farther from the null value– would occur.

Page 7: More about tests

CHOOSING AN APPROPRIATE NULL Arises directly from the context of the

problem. It is not dictated by the data, but

instead by the situation. One good way to identify both the null

and alternative hypothesis is to think about the Why of the situation.

Page 8: More about tests

CHOOSING AN APPROPRIATE NULL To write a null hypothesis, you can’t

just choose any value you like. The null must relate to the question at

hand. Even though the null usually means no

difference or no change, you can’t automatically interpret “null” to mean zero.

Page 9: More about tests

STATING NULL AND ALTERNATIVE HYPOTHESES

Example– Fourth-graders in Elmwood School perform the same in math than fourth-graders in Lancaster School.

H0 :μ1 = μ2

HA :μ1 ≠ μ2

Page 10: More about tests

STATING NULL AND ALTERNATIVE HYPOTHESES

Example– Fourth-graders in Elmwood School perform better in math than fourth-graders in Lancaster School.

H0 :μ1 ≤ μ2

HA :μ1 > μ2

Page 11: More about tests

STATING NULL AND ALTERNATIVE HYPOTHESES EXAMPLE

Suppose you want to test the theory that sunlight helps prevent depression. One hypothesis derived from this theory might be that hospital admission rates for depression in sunny regions of the country are lower than the national average. Suppose that you know the national annual admission rate for depression to be 17 per 10,000. You intend to take the mean of a sample of admission rates from hospitals in sunny parts of the country and compare it to the national average.

Page 12: More about tests

STATING NULL AND ALTERNATIVE HYPOTHESES EXAMPLE

Your research hypothesis is: The mean annual admission rate for

depression from the hospitals in sunny areas is less than 17 per 10,000.

The null hypothesis is: The mean annual admission rate for

depression from the hospitals in sunny areas is equal to or greater than 17 per 10,000

HA :μ1 <17 per 10,000

H0 :μ1 ≥17 per 10,000

Page 13: More about tests

STATING NULL AND ALTERNATIVE HYPOTHESES EXAMPLE

You know that the mean must be lower than 17 per 10,000 in order to reject the null hypothesis, but how much lower?

You decide on the probability level of 95%. In other words, if the mean admission rate for

the sample of sunny hospitals selected at random from the national population is less than 5%, you will reject the null hypothesis and conclude that there is evidence to support the hypothesis that exposure to the sun reduced the incidence of depression.

Page 14: More about tests

EXAMPLE CONTINUED Next, look up the critical z-score– the z-score

that corresponds to your chosen level of probability– in the standard normal table.

It is important to remember what end of the scale you are looking at.

Because a computed test statistic in the lower end of the distribution will allow you to reject your null hypothesis, you look up the z-score for the probability (or area) of .05 and find that it is -1.65.

Page 15: More about tests

EXAMPLE CONTINUED The z-score defines the boundary of the

zones of rejection and acceptance.

Z: -1.65 -1.20 0

Rate (per 10,000) 13 17

Region of Acceptance

Region of Rejection

Page 16: More about tests

EXAMPLE CONTINUED Suppose the mean admission rate for the sample

hospitals in sunny regions is 13 per 10,000 and suppose also that the corresponding z-score for that mean is -1.20.

The test statistic falls in the region of acceptance; so you cannot reject the null hypothesis that the mean in sunny parts of the country is significantly lower than the mean in the national average.

There is a greater than 5% chance of obtaining a mean admission rate of 13 per 10,000 or lower from a sample of hospitals chosen at random from the national population, so you cannot conclude that your sample mean could not have come from the population.

Page 17: More about tests

HOW TO THING ABOUT P-VALUES A P-value is a conditional probability. It tells us the probability of getting

results at least as unusual as the observed statistic given that the null hypothesis is true.

The P-value is not the probability that the null hypothesis is true.

The P-value is not the conditional probability that the null hypothesis is true given the data.

Page 18: More about tests

STATISTICALLY SIGNIFICANT How do you know how much confidence to put

in the outcome of a hypothesis test? The statistician’s criterion is the statistical

significance of the test, or the likelihood of obtaining a given result by chance.

This is called the alpha level. Common alpha levels are 0.10, 0.05, and 0.01. The smaller the alpha level, the more stringent

the test and the greater the likelihood that the conclusion is correct.

Page 19: More about tests

STATISTICALLY SIGNIFICANT The following statements are all

equivalent. The finding is significant at the .05 level. The confidence level is 95%. The Type I error rate is .05. The alpha level is .05. There is a 95% certainty that the result

is not due to chance.

Page 20: More about tests

STATISTICALLY SIGNIFICANT There is a 1 in 20 chance of obtaining

this result. The area of the region of rejection

is .05. The P-value is .05 P = .05

α =.05

Page 21: More about tests

STATISTICALLY SIGNIFICANT Traditional critical values from the Normal model.

1-sided 2-sided0.05 1.645 1.960.01 2.33 2.5760.001 3.09 3.29

α

α

When the alternative is one-sided, the critical value puts all of alpha on one side.

α2

α2

When the alternative is two-sided, the critical value splits alpha equally into two tails.

Page 22: More about tests

TYPE I AND TYPE II ERRORS Even with lots of evidence, we can still

make the wrong decision. When we perform a hypothesis test, we

can make mistakes in two ways: I. The null hypothesis is true, but we

mistakenly reject it. II. The null hypothesis is false, but we fail

to reject it.

Page 23: More about tests

TYPE I AND TYPE II ERRORS Types of Statistical Errors

H0 True H0 FalseReject H0 Type I

ErrorOK

Fail to reject H0

OK Type II Error

My Decisio

n

The Truth

Page 24: More about tests

TYPE I ERROR Represented by the Greek letter alpha. In choosing the level of probability for a test,

you are actually deciding how much you want to risk committing a Type I error– rejecting the null hypothesis when, in fact, it is true.

This is why the threshold level or area in the region of rejections is called the alpha level.

It represents the likelihood of committing a Type I error.

α

Page 25: More about tests

TYPE II ERROR Type II errors are represented by the

Greek letter beta. This is harder to find because it

requires estimating the distribution of the alternative hypothesis, which is usually unknown.

β

Page 26: More about tests

POWER Power is the probability that a test will reject

the null hypothesis when it, in fact, false. In other words, the power of a test is the

probability that it correctly rejects a false null hypothesis.

When the power is high, we can be confident that we have looked hard enough.

We know that beta is the probability that a test fails to reject a false null hypothesis.

The power of the test is the complement

1−β

Page 27: More about tests

TRY THIS An advertiser wants to know if the

average age of people watching a particular TV show regularly is less than 24 years. Is this a one- or two-tailed test?

One State the alternative and null hypotheses.

HA :μ < 24H0 :μ ≥ 24

Page 28: More about tests

TRY THIS-- CONTINUED An advertiser wants to know if the

average age of people watching a particular TV show regularly is less than 24 years. A random survey of 50 viewers determines

that their mean age is 19 years, with a standard deviation of 1.7 years. Find the 90% confidence interval of the age of the viewers. Name the variables

x =19 z =1.645σ =1.7 n = 50

Page 29: More about tests

TRY THIS-- CONTINUED Here is the work to find the confidence

interval.

x ± z ⋅ σn

19 ±1.645 ⋅ 1.750

19 ±1.645 .24( )19 ± .394819 ± .418.6, 19.4( )

We are 90% confident that the mean age of viewers is between 18.6 and 19.4.

Page 30: More about tests

TRY THIS-- CONTINUED An advertiser wants to know if the

average age of people watching a particular TV show regularly is less than 24 years. What is the significance level for rejecting

the null hypothesis? .0016

Page 31: More about tests

TRUE OR FALSE Statistical tests should always be

performed on a null hypothesis. True

If two variables are correlated, then they must casually related. False

A result with a high level of significance is always very important. False

Page 32: More about tests

TRY THIS Rejecting the null hypothesis when it is

actually true is: No error A Type I error A Type II error Neither a Type I or Type II error Impossible.

Answer: A Type I error

Page 33: More about tests

WHAT CAN GO WRONG? Don’t interpret the P-value as the

probability that H0 is true. Don’t believe too strongly in arbitrary

alpha values. Don’t confuse practical and statistical

significance. Don’t forget that in spite of all your

care, you might make a wrong decision.