Upload
jake-harrington
View
228
Download
0
Tags:
Embed Size (px)
Citation preview
Aims of a Variance Components Analysis
• Estimate the amount of variation between groups (level 2 variance) relative to within groups (level 1 variance)– How much variation is there in life expectancy between and within
countries?– How much of the variation in student exam scores is between
schools? i.e. is there within-school clustering in achievement?
• Compare groups– Which countries have particularly low and high life expectancies?– Which schools have the highest proportion of students achieving
grade A-C, and which the lowest?
• As a baseline for further analysis
Revision of Fixed Effects Approach
One way of allowing for group effects is to include as explanatory variables a set of dummy variables for groups. E.g. for 20 countries define 2021 ,,, DDD such that 1jD for
individuals in country j and 0jD otherwise. Denote the coefficient of
jD by j and the overall intercept by 0 as before. Model for outcome
of individual i in country j is:
ijk
kkij eDy
20
10
Cannot estimate 0 and all 20 j s. We usually estimate 0 and take
one country as the reference, e.g. fixing 20 =0 makes country 20 the reference.
Limitations of the Fixed Effects Approach
• When number of groups is large, there will be many extra parameters to estimate. (Only one in ML model.)
• For groups with small sample sizes, the estimated group effects may be unreliable. (In a ML model residual estimates for such
groups ‘shrunken’ towards zero.)
• Fixed effects approach originated in experimental design where number of groups is small (e.g. treatment vs. control) and all groups sampled. More generally, our groups may be a sample from a population. (The multilevel approach allows inferences to this population.)
Multilevel (Random Effects) Model
ijy is the value of y for the i th individual in the j th group. A model that
allows for (random) group effects is:
ijjij euy 0
0 is the overall mean of y (across all groups).
ju0 is the mean of y for group j
ju is the difference between group j ’s mean and the overall mean.
ije is the difference between the y-value for the i th individual and that
individual’s group mean, i.e. )( 0 jijij uye .
Individual (e) and Group (u) Residuals in a Variance Components Model
0
y42
e42
u2
u1
Partitioning Variance
Assume ),0(~ 2uj Nu and ),0(~ 2
eij Ne
2u is the between-group (level 2) variance and 2
e is the within-group (level 1) variance
Total variance = 2u + 2
e Variance partition coefficient is proportion of total variance due to differences between groups
22
2
eu
uVPC
= 0 if no group effects and 1 if no within-group differences.
Model Assumptions
Individual- and group-level residuals are normally distributed ),0(~ 2
uj Nu and ),0(~ 2eij Ne
Residuals at the same level are uncorrelated with one another
0),cov(
21jj uu for two different groups
0),cov(
2211jiji ee for two different individuals in the same or different groups
Residuals at different levels are uncorrelated with one another
0),cov(21
jij ue for the same or different groups
Intra-class Correlationρ = correlation between responses for 2 individuals in the same group:
)var()var(
),cov(),corr(
21
21
21jiji
jijijiji yy
yyyy
Now 2)var(),cov(),cov(),(co2121 ujjjjijjijjiji uuueueuyyv
because 0),cov(),cov(),(co2112
iijijjij eeeueuv
and 22)var()var()var( euijjij euy
so VPCeu
u
22
2
Examples of ρ
0
10
●●●●●
●●●●●
●●●●●
●●●●●
School1 2 3 4
0
10
●●●●●
●●●●●
●●●●●
●●●●●
School1 2 3 4
0
10●●●●●
●●●●●
●●●●●
●●●●●
School1 2 3 4
ρ ≈ 0 ρ ≈ 0.4 ρ ≈ 0.8
Note: Schools ranked according to mean of y
Example: Between-Country Differences in Hedonism
Parameter Estimate Standard error
0 -0.203 0.069 2u 0.094 0.030
2e 0.885 0.007
Total variance = 0.094+0.885=0.979 ρ = 0.094/(0.094 + 0.885) = 0.096. Thus 9.6% of the total variance in hedonism scores is due to between-country differences. Can also interpret 0.096 as the correlation between the hedonism scores for two randomly selected individuals from the same country.
Testing for Group Effects
We wish to test the null hypothesis (H0) that 2u =0, i.e. no between-
group variance. We compare a single-level (SL) model ii ey 0 and a multilevel (ML) model ijjij euy 0 in a likelihood ratio test.
Test statistic is LR = -2 log LSL – (-2 log LML) where LSL and LML are likelihood values for the two models.
The test statistic LR is compared with a chi-squared distribution. Here
we have 1 extra parameter ( 2u ) so d.f. = 1. [Sometimes take (p-
value)/2 as correction for fact that 2u must always be greater than 0]
Example: Testing for Country Differences in Hedonism
Null hypothesis is that there are no differences between countries in their mean hedonism scores, i.e. zero between-country variance. -2 log LSL = 102590 -2 log LML = 99303 LR = -2 log LSL – (-2 log LML) = 3287 which we compare to a chi-squared distribution on 1 d.f. The critical value for a test at the 5% significance level is 3.84, so strong evidence of betwen-country differences.
Residuals in a Variance Components Model
Group effects ju (level 2 residuals) are random variables assumed to
follow a normal distribution, so their distribution is summarised by two
parameters: the mean (fixed at zero) and variance 2u .
To make comparisons among groups we need to estimate ju . These
estimates are obtained after fitting the model. The total residual is ijj eu , estimated as 0ˆˆ ijijijij yyyr .
Need to split this into separate estimates of ju and ije .
Level 2 Residuals
A starting point for an estimate of ju would be to take the mean of
0ijy for group j . This is sometimes called the mean raw residual:
0 jj yr
where jy is the mean of ijy in group j .
We multiply this raw residual by a factor k called the shrinkage factor:
jj rku ˆ
where )/ˆ(ˆ
ˆ22
2
jeu
u
nk
and jn is sample size in group j
Shrinkage
k is always lies between 0 and 1 so jj ru ˆ
For large jn , k will be close to 1 and so ju will be close to jr
k also close to 1 when 2ˆe small relative to 2ˆu
Greater shrinkage (k closer to zero) when jn small or 2ˆe is large
relative to 2ˆu (high within-group variability), i.e. when we have little
information about the group. Then the group mean juˆ0 is pulled
towards the overall mean 0
Example: Mean Raw Residuals vs. Shrunken Residuals for Selected Countries
Country (j) jr ju jn
Poland -0.727 -0.723 1980 Czech Rep. -0.480 -0.476 1213
. . . . Belgium 0.392 0.390 1819 Denmark 0.425 0.423 1455
In this case, there is little shrinkage because nj is very large.
Example: Caterpillar Plot showing Country Residuals and 95% CIs
Residual Diagnostics
• Use normal Q-Q plots to check assumptions that level 1 and 2 residuals are normally distributed– Nonlinearities suggest departures from normality
• Residual plots can also be used to check for outliers at either level– Under normal distribution assumption, expect 95% of standardised residuals to lie between -2 and +2
– Can assess influence of a suspected outlier by comparing results after its removal
Example: Normal Plot of Individual (Level 1) Residuals
Linearity suggestsnormality assumption is reasonable
Example: Normal Plot of Country (Level 2) Residuals
Some nonlinearity butonly 20 level 2 units