View
6
Download
0
Category
Preview:
Citation preview
Eastern Mediterranean University Faculty of Medicine Biostatistics course
Non-parametric methods March 4&7, 2016
Instructor: Dr. Nimet İlke Akçay (ilke.cetin@emu.edu.tr)
Learning Objectives
1. Distinguish Parametric & Nonparametric Test Methods
2. Explain commonly used Nonparametric Test Methods
3. Perform Hypothesis Tests Using Nonparametric Procedures
Nonparametric Statistics Introduction to nonparametric methods
• Parametric tests (t-test, z-test, etc.)
involve estimating population parameters such as the mean
are based on the assumptions of normality or known variances (stringent assumptions)
• What if data does not follow a Normal distribution?
• Non-parametric tests
were developed for these situations where no (or fewer) assumptions have to be made
are also called as Distribution-free tests
Still have assumptions but they are less stringent
Can be applied for a normally distributed data, but Parametric tests have greater power IF the assumptions met
Parametric vs. Nonparametric test procedures Parametric
• Involve population parameters Example: Population mean
• Require interval scale or ratio scale Whole numbers or fractions
Example: height in inches
(72, 60.5, 54.7)
• Have stringent assumptions Example: Normal dist.
• Examples: z-test, t-test, F-test
Nonparametric
• Do not involve population parameters Example: Probability
distributions, indepedence
• Data measured on any scale Ratio or Interval
Ordinal Example: good-better-best
Nominal Example: male-female
• Have no stringent assumptions
• Examples: Sign test, Mann-Whitney U test, Wilcoxon test
Advantages and Disadvantages of Non-parametric tests Advantages
• Used with all measurement scales Analysis possible for ranked
or categorical data
• Distribution-free: may be used when the form of the sampled population is unknown
• Easy to compute
• Make fewer assumptions
• No need to involve population parameters
• Results may be as exact as parametric procedures
Disadvantages
• May waste information, if data and assumptions permit using parametric procedures Example: converting data
from ratio to ordinal scale
E.g. Height values -> short-average-tall
• Difficult to compute by hand for large samples
• Tables not widely available
Determination: Parametric or Non-parametric?
Hypothesis Testing Procedures
Type of Design Parametric
Test
Nonparametric
Test
One sample One-sample t-
test
Sign Test
Two indepent
samples
Independent-
samples t-test
Wilcoxon Rank
Sum test
Mann Whitney U
test
Two paired
sample
Paired-samples
t-test
Wilcoxon Signed
Ranks test
.
.
.
.
.
.
.
.
.
One sample hypothesis testing design: Sign Test
Sign Test • This test is used as an alternative to one sample t-test,
when normality assumption is not met
• The only assumption is that the distribution of the underlying variable (data) is continuous.
• Test focuses on median rather than mean.
• The test is based on signs, plus and minuses
• Test is used for one sample as well as for two samples
Test Statistic in Sign Test • Test Statistic: The test statistic for the sign test is
either the observed number of plus signs or the observed number of minus signs.
• The nature of the alternative hypothesis determines which of these test statistics is appropriate. In a given test, any one of the following alternative hypotheses is possible:
• HA: P(+) > P(-) one-sided alternative
• HA: P(+) < P(-) one-sided alternative
• HA: P(+) ≠ P(-) two-sided alternative
Test Statistic in Sign Test • If the alternative hypothesis is HA: P(+) > P(-) a
sufficiently small number of minus signs causes rejection of H0. The test statistic is the number of minus signs.
• If the alternative hypothesis is HA: P(+) < P(-) a sufficiently small number of plus signs causes rejection of H0. The test statistic is the number of plus signs.
• If the alternative hypothesis is HA: P(+) ≠ P(-) either a sufficiently small number of plus signs or a sufficiently small number of minus signs causes rejection of the null hypothesis. We may take as the test statistic the less frequently occurring sign.
• Calculation of test statistic:
P (X ≤ k | n, p) =
qpCxnxk
x
n
x
0
Sign Test – Example • 7 patients were asked to rate the
services in a private hospital on a 5-point scale (1=terrible,..., 5=excellent)
• The obtained ratings are given in the table
• At the α=0.05 level, is there evidence that the median rating is at least 3?
Patient Rating Sign
1
2
3
4
5
6
7
2
5
3
4
1
4
5
-
+
+
+
-
+
+
Sign Test – Example (cont’d) Hypotheses:
H0: = 3
Ha: < 3
Significance Level:
= .05
Test Statistic:
P-Value:
Decision:
Conclusion:
Do Not Reject at = .05
There is No evidence
for Median < 3
P(x 2) = 1 - P(x 1)
= 0.9375
(Binomial Table, n = 7, p = 0.50)
Or calculate from the formula
S = 2
(Ratings 1 & 2
are < = 3:
2, 5, 3, 4, 1, 4, 5)
Comparing Two Populations: Independent Samples Wilcoxon Rank Sum test
Wilcoxon Rank Sum test
• Tests two independent population probability distributions
• Corresponds to t-test for two independent means
• Assumptions — Independent, random samples
— Populations are continuous
• Can use normal approximation if ni 30
© 2
01
1 P
ea
rson
Ed
uca
tion
, In
c
Wilcoxon Rank Sum Test: Independent Samples Let D1 and D2 represent the probability distributions for populations 1 and 2, respectively.
One-Tailed Test
H0: D1 and D2 are identical
Ha: D1 is shifted to the right of D2
[or D1 is shifted to the left of D2]
Test statistic:
T1, if n1 < n2; T2, if n2 < n1
(Either rank sum can be used if n1 = n2.)
Wilcoxon Rank Sum Test: Independent Samples Let D1 and D2 represent the probability distributions for populations 1 and 2, respectively.
Two-Tailed Test
H0: D1 and D2 are identical
Ha: D1 is shifted to the left or to the right of D2
Test statistic:
T1, if n1 < n2; T2, if n2 < n1
(Either rank sum can be used if n1 = n2.) We will denote this rank sum as T.
Conditions Required for Valid Wilcoxon Rank Sum Test
1. The two samples are random and independent.
2. The two probability distributions from which the samples are drawn are continuous.
Comparing Two Populations: Paired Differences Experiment Wilcoxon Signed Ranks test
© 2
01
1 P
ea
rson
Ed
uca
tion
, In
c
Wilcoxon Signed Rank Test
• Tests probability distributions of two related populations
• Corresponds to t-test for dependent (paired) means
• Assumptions — Random samples
— Both populations are continuous
• Can use normal approximation if n 30
© 2011 Pearson Education, Inc
Wilcoxon Signed Rank Test for a Paired Difference Experiment
Let D1 and D2 represent the probability distributions for populations 1 and 2, respectively.
One-Tailed Test
H0: D1 and D2 are identical
Ha: D1 is shifted to the right of D2
[or D1 is shifted to the left of D2]
© 2011 Pearson Education, Inc
Wilcoxon Signed Rank Test for a Paired Difference Experiment
Let D1 and D2 represent the probability distributions for populations 1 and 2, respectively.
Two-Tailed Test
H0: D1 and D2 are identical
Ha: D1 is shifted to the left or to the right of D2
© 2011 Pearson Education, Inc
Conditions Required for a Valid Signed Rank Test
1. The sample of differences is randomly selected from the population of differences.
2. The probability distribution from which the sample of paired differences is drawn is continuous.
Recommended