27
Does Religion Make You Healthier and Longer Lived? Evidence for Germany Bruce Headey Gerhard Hoehne Gert G. Wagner Accepted: 16 December 2013 Ó Springer Science+Business Media Dordrecht 2013 Abstract Researchers in the US have consistently reported substantial—not just statis- tically significant—links between religious belief and practice, and improved health and longevity. In this paper we report evidence for Germany, using data from the long-running, nationally representative German Socio-Economic Panel (SOEP 1984). The SOEP dataset includes multiple measures of health, plus many ‘controls’ which it is appropriate to use in assessing links between religious practice, health and longevity. These controls include personality traits known to be associated with better health (notably conscientiousness), and also the age of death of parents of the survey respondents. Initial results suggested that religious practice (church attendance) may be linked only to subjective (self-rated) mea- sures of health, not to more objective measures. It seemed possible that results in some previous research could be due to what may be termed satisfaction bias or positivity bias; the known tendency of religious people to report higher than average satisfaction with almost all aspects of life. Further investigation indicated that relationships between church attendance and subjective measures of health were weaker, when a control for satisfaction bias was in place. However, there was countervailing evidence that the subjective measures in SOEP may actually be more not less valid than the objective measures; they are better not worse predictors of mortality. It was also clear that religious belief and church attendance are associated with health-protective behaviors and attitudes, including taking more exercise, not smoking and higher life satisfaction. At the end of the paper we estimate B. Headey (&) Melbourne Institute, University of Melbourne, Parkville 3052, Australia e-mail: [email protected]; [email protected] B. Headey G. G. Wagner German Institute for Economic Research (DIW), Berlin, Germany G. Hoehne G. G. Wagner Berlin University of Technology, Berlin, Germany G. G. Wagner Max Planck Institute for Human Development, Berlin, Germany 123 Soc Indic Res DOI 10.1007/s11205-013-0546-x

Does Religion Make You Healthier and Longer Lived? Evidence for Germany

  • Upload
    gert-g

  • View
    213

  • Download
    1

Embed Size (px)

Citation preview

Does Religion Make You Healthier and Longer Lived?Evidence for Germany

Bruce Headey • Gerhard Hoehne • Gert G. Wagner

Accepted: 16 December 2013� Springer Science+Business Media Dordrecht 2013

Abstract Researchers in the US have consistently reported substantial—not just statis-

tically significant—links between religious belief and practice, and improved health and

longevity. In this paper we report evidence for Germany, using data from the long-running,

nationally representative German Socio-Economic Panel (SOEP 1984). The SOEP dataset

includes multiple measures of health, plus many ‘controls’ which it is appropriate to use in

assessing links between religious practice, health and longevity. These controls include

personality traits known to be associated with better health (notably conscientiousness),

and also the age of death of parents of the survey respondents. Initial results suggested that

religious practice (church attendance) may be linked only to subjective (self-rated) mea-

sures of health, not to more objective measures. It seemed possible that results in some

previous research could be due to what may be termed satisfaction bias or positivity bias;

the known tendency of religious people to report higher than average satisfaction with

almost all aspects of life. Further investigation indicated that relationships between church

attendance and subjective measures of health were weaker, when a control for satisfaction

bias was in place. However, there was countervailing evidence that the subjective measures

in SOEP may actually be more not less valid than the objective measures; they are better

not worse predictors of mortality. It was also clear that religious belief and church

attendance are associated with health-protective behaviors and attitudes, including taking

more exercise, not smoking and higher life satisfaction. At the end of the paper we estimate

B. Headey (&)Melbourne Institute, University of Melbourne, Parkville 3052, Australiae-mail: [email protected]; [email protected]

B. Headey � G. G. WagnerGerman Institute for Economic Research (DIW), Berlin, Germany

G. Hoehne � G. G. WagnerBerlin University of Technology, Berlin, Germany

G. G. WagnerMax Planck Institute for Human Development, Berlin, Germany

123

Soc Indic ResDOI 10.1007/s11205-013-0546-x

a structural equation model which maps links between religious practice, these protective

behaviors and attitudes, and improved health outcomes.

Keywords Subjective and objective health measures � Church attendance �Satisfaction bias � German Socio-Economic Panel (SOEP)

1 Introduction

Obviously, social scientists have nothing useful to say about the truth or falsity of the

world’s religions. But social scientists certainly have had a great deal to say about the

benefits which religion may confer on its adherents. In particular, it has been claimed that

Christian belief and practice are strongly linked to better health and longer life. Because

most of the available evidence comes from the United States, it is worthwhile examining

whether similar results hold for Germany, another Western country in which most people

who practise religion are Protestant or Catholic Christians. In Germany about 17 % (21 %

in the West, 8 % in the East) of the population report that they attend a religious service at

least once a month, compared with about 35 % in the United States.

Evidence comes from the nationally representative, long-running German Socio-Eco-

nomic Panel Survey (SOEP) in which respondents have been interviewed annually since

1984 (Frick et al. 2007). The questionnaire includes a fairly comprehensive set of health

measures. We use five of these measures, described in the Methods section. Here we note

that the measures can be classified as ranging from relatively subjective (e.g. health sat-

isfaction reported on a 0–10 scale) to relatively objective (e.g. grip strength measured by

squeezing a dynamometer). Our initial results showed that links between religious practice

and health differed systematically according to the subjectivity–objectivity of the measures

deployed. The more subjective the measure, the stronger the evidence seemed that reli-

gious practice benefits health. But the more objective the measure, the less convincing was

the evidence.

These first results drove the investigation in new directions. However, it must imme-

diately be pointed out that we do not assume that measurement objectivity can be equated

with measurement validity. Objective measures are nearly always more reliable (consis-

tent, repeatable) than subjective measures, but there is no reason to assume that they are

more (or less) valid as ‘true’ indicators of health. Measurement validity will be discussed

more fully in the ‘‘Methods’’ and ‘‘Results’’ sections. Here we just flag the issue that

religious practice is more strongly linked to subjective than objective indicators of health.

There has to be a reason for this difference. One hypothesis is that religious people may

have a tendency to report relatively high levels of satisfaction with their lives-as-a-whole,

and with virtually all specific aspects of life, including health (Campbell et al. 1976;

Argyle 2001). They may tend to overrate their health, compared to ratings that professional

observers (e.g. a team of physicians) would give. This tendency (if it exists) might be

termed satisfaction bias or positivity bias.

We find some initial evidence to support a hypothesis of satisfaction bias. But, again

unexpectedly, we also find strong countervailing evidence that the subjective measures in

SOEP are more not less valid than the objective measures. The strong evidence is that the

subjective measures are better predictors of mortality than the objective ones. Eventually,

by a rather tortuous route, we reach the conclusion that religious people are healthier and

B. Headey et al.

123

longer lived than less religious and non-religious people. They have attitudes and behaviors

(one could say a lifestyle) which promotes longevity. Relative to the population average,

religious people are more likely to marry and less likely to divorce. They smoke less, take

more exercise, are more socially active and volunteer more (partly through their church).

Their higher than average life satisfaction also contributes to longevity. Towards the end of

the paper we estimate a structural equation model which summarises linkages between

church attendance, exercise, smoking, life satisfaction and health. Church attendance is

shown to improve health primarily via a healthier lifestyle and increased life satisfaction.

There appear to be two-way causal links between health and life satisfaction; good health

enhances life satisfaction and vice versa.

As well as providing a comprehensive set of health measures, SOEP includes a wide

range of variables that it is desirable to take into account in assessing links between religious

practice and health. These include both health risk and protective factors. Among the risk

factors are smoking and risk-taking, including health risk-taking. Among the protective

factors are getting married and staying married, social networks and participation, and the

personality trait of conscientiousness (Friedman and Martin 2011). For many respondents

information is available about the age at which their father and mother died. It is desirable to

control for these variables in assessing the impact of religiosity on longevity, although

previous research has found that it is only in very long-lived families that there are sig-

nificant associations between the death ages of successive generations (Gudmundsson et al.

2000; Lach et al. 2006). More technically, the panel design makes it possible to undertake

fixed-effects panel regression analyses in which the effects on health of all time-invariant

factors are automatically controlled. These include family background, genetic effects,

intelligence and personality traits (assuming these are time invariant).

1.1 Previous Research

The claim that the religious are especially healthy and long lived has a long history. Biblical

scholars used to claim that Abraham lived to be 175 and Moses to 120. More plausibly,

Schnall et al. (2010), analysing data from the longitudinal US National Institute of Health’s

Women’s Health Initiative (N = 95,000), reported that women over fifty were 20 % less

likely to die in any given year if they attended a religious service every week, and 15 % less

likely to die if they were less frequent attenders. These findings held, even controlling for

baseline measures of health (physician assessments). Strawbridge et al. (1997) conducted a

28-year follow-up study and found that regular church-goers were significantly more likely

to be still alive, even after this long period. During the course of the study, religious

individuals were more likely than average to take up regular exercise, stop smoking,

improve their social networks and stay married; all factors associated with longevity.

Strawbridge et al. (1997) found that religion benefited the health of women more than

men. By contrast, Maselko and Kubzansky (2006), analysing data from the US General

Social Survey, found that religious activity benefited the health of men more than women,

although both genders gained. Hall (2006) reported that church-goers live 2–3 years longer

than non church-goers, partly because churches provide effective social networks. He

found that church networks were about as beneficial as regular exercise and cholesterol-

reducing drugs in promoting these extra years of life.

The Handbook of Health and Religion (Koenig et al. 2012) and The Handbook of

Mental Health and Religion (Koenig 1998) report that positive associations have been

found between religious practice and better physical and mental health in several hundred

studies, nearly all of which were conducted in the US and based on cross-sectional data.

Evidence for Germany

123

Research which explores possible mechanisms promoting better health among churchgoers

is particularly interesting. A Norwegian study (Sørensen et al. 2011) found that attending

church services directly lowers blood pressure and that the more often a person goes to

church, the lower blood pressure falls. Maselko (2006) found that churchgoers have better

lung health than non-attenders, noting that lung health is a valid measure of general

physical health and that smoking, which greatly affects lung health, is less prevalent among

churchgoers than non-attenders.

The most astonishing claim about the health benefits of religion was made in the Byrd

study (1988) conducted at a large hospital in San Francisco. This appeared to show that the

incidence of complications following a heart attack among patients who had a group of

believers praying for them was significantly lower than the incidence for a control group, who

did not receive intercessory prayer. However, a recent large scale investigation, sponsored by

the Templeton Foundation, failed to replicate this finding and, in fact, found that patients who

were prayed for suffered more complications. The difference between the prayer group and

the control groups was small but statistically significant (Benson et al. 2006).

All the American studies reviewed so far have claimed direct positive associations

between religious practice, health and longevity. The US study to which our German results

are closest is Friedman and Martin’s (2011) reanalysis of data related to longevity in the

eight-decade long Terman Study of the Gifted. Friedman and Martin analysed a wide range

of factors affecting longevity; their study was not focussed primarily on religion. However,

they did find that church attendance was one of many ‘lifestyle’ variables associated with

longevity. Good social networks, the personality trait of conscientiousness, volunteering,

community consciousness, high career achievement (see also Marmot et al. 1978, 1991)

and, for men, avoiding marital break-up were also found to be protective health factors.

1.2 Health and Longevity Hypotheses

We began this research with the following hypotheses:

1. Regular Christian religious practice (attendance at church services) is associated with

better health.

2. Regular religious practice is associated with increased longevity.

3.1–3.2 Regular religious practice is more beneficial for women than men in relation to

(i) health and (ii) longevity.

The finding that the strength of the apparent link between religious practice and health

depends partly on the subjectivity–objectivity of health measures only emerged in the

course of our research. Similarly, we did not envisage that subjective measures of health

might prove to be more valid, at least as predictors of mortality, than more objective

measures. Precisely because some main findings emerged in the course of the investiga-

tion, rather than being envisaged as hypotheses, it will be particularly important in future

work to check results and see if they replicate.

2 Methods

2.1 The German Socio-Economic Panel (SOEP): Annual Interviews Since 1984

The German (SOEP) panel began in 1984 in West Germany with a sample of 12,541

respondents (Frick et al. 2007). Interviews have been conducted annually ever since.

B. Headey et al.

123

Everyone in the household aged 16 and over is interviewed. The cross-sectional repre-

sentativeness of the panel is maintained by interviewing ‘split-offs’ and their new families.

So when a young person leaves home (‘splits off’) to marry and set up a new family, the

entire new family becomes part of the panel. The sample was extended to East Germany in

1990, shortly after the Berlin Wall came down, and since then has been boosted by the

addition of new immigrant samples, a special sample of the rich, and recruitment of new

respondents partly to increase numbers in ‘policy groups’. There are now over 60,000

respondents on file, including a few grandchildren, as well as children of the original

respondents. The main topics covered in the annual questionnaire are family, income and

labour force dynamics.

2.2 Measures

2.2.1 Religious Practice: Church Attendance

The main measure of religious practice available in SOEP is attendance at religious ser-

vices or activities (church attendance) reported in four categories: every week, every

month, seldom or never. The question was first asked in 1990 and has been asked on

thirteen occasions since. Attendance has declined steadily. In 1990 27.5 % attended reli-

gious services either weekly or at least monthly. By 2009 (latest date available) regular

attendance had dropped to 17.5 %.

On just three occasions (1994, 1998 and 1999) respondents have been asked about the

‘importance’ of religion in their lives, responding on a scale running from 1 (not at all

important) to 4 (very important). This question is probably intended to tap the spiritual

dimension of religion; the extent to which it provides a sense of purpose and meaning to

life. In 1999 11.1 % said that religion was ‘very important’ to them. Information is also

available about which Christian denomination respondents belong to. In Germany about

34 % are Protestant (Evangelisch), 29 % are Roman Catholic, and 2.5 % belong to other

Christian denominations.1 In SOEP about 3 % report belonging to other religions, with

most being Moslems. 31 % report ‘no denomination’.

Because our focus is on the effects of Christian religious practice on health, we exclude

those who practise religions other than Christianity. Our sample/panel therefore comprises

Christians, plus those who report no denomination.

We have imputed values for church attendance in years since 1990 in which the

question was not asked. This was done by averaging results for the nearest year before, and

the nearest year after any year with missing data.

The church attendance measure used here is almost identical to measures used in most

US surveys which report evidence on religious practice and health. It may be noted that

social scientists generally prefer behavioral measures of religiosity rather than less reliable

attitudinal measures intended to assess beliefs or the spiritual dimensions of religion. In

SOEP the behavioral church attendance measure has a Spearman rank order correlation of

0.65 with the ‘importance of religion’ measure. This is a very high correlation for two

items measured on 4-point response scales, so essentially both measures are identifying the

same people as ‘religious’.

Preliminary analysis indicated that relationships between church attendance and health

measures are not linear, or at least not clearly so. The main break appeared to be between

‘high’ attenders who attend every week or every month, and those who attend seldom or

1 These figures are for 2007; the latest year in which denomination was asked in SOEP.

Evidence for Germany

123

never. Accordingly, for inclusion in regression analyses, we constructed a dichotomous

variable, church attendance, scored 1 for high attenders and 0 for seldom and non-

attenders.

2.2.2 Health Measures Rank Ordered from Subjective to Objective

The five health measures used in this article can be ranked in a common sense way from

most subjective to most objective. Perhaps the most subjective is health satisfaction, which

has been asked every year in SOEP and is measured on a single item 0–10 scale, on which

zero is labelled ‘very dissatisfied’ and 10 is labelled ‘very satisfied’. The next most (or

perhaps equally) subjective measure is based on asking respondents to rate their current

state of health on a 1–5 scale ranging from ‘very good’ to ‘bad’. This question has been

asked annually since 1992. It correlates satisfactorily with physician ratings of health

(Schwarze et al. 2000). Somewhat less subjective is the physical health measure (pcs)

based on the SF12 health survey, which has been included in SOEP in alternate years since

2002. The SF12 is a short form of the internationally widely used Medical Outcomes Study

SF36 survey (Ware et al. 2000). The twelve items cover physical and mental health issues.

The data are factor analysed to yield physical (pcs) and mental health (mcs) summary

scales.2 The pcs is partly based on the 1–5 self-report item described earlier, plus other

items focussed on physical problems in the past 4 weeks (e.g. ‘‘During the past 4 weeks,

how much did pain interfere with your normal work…?’’ and ‘‘Does your health now limit

you in these activities…moving a table? Pushing a vacuum cleaner? Climbing several

flights of stairs?’’ Scores on the pcs scale are standardized to range from 0 to 100.

A somewhat more objective measure, although open to recall bias, is reported number

of doctor visits (general practitioners and all other types of medical doctor) in the last

3 months.3 Answers are multiplied by four to provide annual estimates. The doctor visits

question has been asked every year in SOEP. In a typical year about a third of the

population reports no visits, the mean is about ten, and over 20 % are estimated to have

made more than twelve visits.

Clearly, the most objective measure currently included in SOEP is grip strength,

measured in SOEP in 2006 and 2008.4 Grip strength is assessed by squeezing a dyna-

mometer. Respondents squeeze twice each with their left and right hands, with results for

each hand being averaged. Current grip strength of the dominant hand, measured in

kilograms, has been shown to be well correlated with physician ratings of health (Bo-

hannon 2001, 2008). Furthermore, change in grip strength is claimed to be a fairly accurate

predictor of death, especially for older people (Metter et al. 2002; Rantanen et al. 2000;

Ambrasat et al. 2011).

Tables 1 (men) and 2 (women) report the means and standard deviations of the health

measures, plus correlations among them.

The summary statistics in Tables 1 and 2 indicate that men report slightly better health

than women and go to the doctor less. These are standard results found in surveys in all

Western countries. The correlations among the measures are all moderate to high. The

measure which has the highest average correlation with the other items in the set is the pcs

2 Pcs and mcs stand for physical component summary and mental component summary.3 The longer ago a doctor visit occurred, the more likely it is that it will be forgotten and not reported,especially if the health problem was minor. Recall over a 3-month period is likely to be subject to a smalldegree of downward bias (Ayhan and Isiksal 2004).4 Data have also been collected in 2010 and 2012 but are not yet available.

B. Headey et al.

123

physical health index, which may be taken as an indication of its convergent validity; a

point to which we will return later.5

The remaining health outcome used in this article is mortality (1 = dead 0 = still

alive). Plainly, this is an unimpeachably objective measure. The obvious way to measure

the effects of church attendance on longevity might seem to be to relate it to age of death

(controlling for other factors). But this is not the approach usually taken by health

researchers, because it yields a biased sample in which the dead are included but those who

survive are right censored. The usual approach, followed here, is to ‘predict’ mortality

some years ahead on the basis of explanatory variables of interest, plus standard controls.

Here we predict mortality by church attendance, taking account of gender, age and many

other controls described below.

2.2.3 Personality Traits and Other Variables Included in Equations Primarily as Controls

We now describe a range of variables included in our equations mainly as ‘controls’ in order

to estimate the net effects of church attendance on health and longevity.

In 2005 and again in 2009 a short version of the Big Five Personality Domains (Costa

and McCrae 1991) was included in SOEP. The traits in the Big Five are neuroticism,

extroversion, openness, agreeableness and conscientiousness. The data managers report

that the short version has proved satisfactorily reliable and valid, and has yielded trait

measures which correlate highly with longer versions of the NEO (Gerlitz and Schupp

2005).

The Big Five personality traits are estimated to be about 30–50 % genetic (Lucas 2008),

so it makes sense to regard them as causally antecedent to measures of religiosity and

health. Traits are usually assumed to be stable in adulthood, so in the case of panel

members who provided trait measures in both 2005 and 2009, the average of their two

scores was taken. If only one measure was available that was used. Our working

assumption of trait stability is not entirely correct. Even if traits are 50 % genetic, that still

leaves 50 % of the variance to be explained by environmental effects. It is thought that

ratings on traits can be changed to some degree by life experiences like having a stable

marriage or an absorbing job (Roberts et al. 2006; Scollon and Diener 2006).

It is well known that neuroticism is quite strongly related to subjective measures of

health; neurotic individuals are almost by definition hypochondriacal (Costa and McCrae

1991). Conscientiousness is usually found to be positively related to health (Costa and

McCrae 1991).

Other variables included in equations primarily as controls are: gender, age (also age-

squared), marital/partnership status (1 = partnered 0 = not partnered), years of education,

household net income, unemployed (1 = unemployed 0 = employed or not in the labour

force), age of death of father and age of death of mother. There are straightforward

justifications for all of these controls. Women generally report somewhat worse health than

men, although they live longer. Age: health declines with slowly with age, and then sharply

in old age (hence the need for a quadratic term). Years of education and household net

income are included because both are positively associated with health. Being unemployed

is quite strongly associated with ill-health. Age of father’s death and age of mother’s death

are valuable inclusions as indicators of genetic ‘healthiness’. In SOEP new respondents are

5 However, the correlation of pcs with self-reported health cannot be considered in this context because theself-reported health measure (a single item) is included in the pcs index. It remains the case that pcs has thehighest mean correlation with the other 3 items.

Evidence for Germany

123

asked a lengthy set of questions about their parents, including whether they are still

alive…and, if they are dead, the age at which they died. Also, as the years pass, the death

of SOEP respondents is recorded. As explained above, some are the parents of other

respondents, so the age of death of these respondents’ parents is known too.

2.2.4 Panel Attrition and Panel Conditioning Effects

In any panel survey, what are called ‘panel conditioning effects’ are a possible source of

bias. That is, panel members might tend to change their answers over time—and answer

differently from the way non-panel members would answer—as a consequence just of

being panel members. In SOEP there is some evidence that panel members, in their first

few years of responding, tended to report slightly higher life satisfaction scores than when

they had been participating for a good many years (Frijters et al. 2004). This could be due

to ‘social desirability bias’; a desire to look good and appear to be a happy person, which

may be stronger in the first few years of responding than in later years. Or it could be due to

a ‘learning effect’; learning to use the middle points of the 0–10 scale, rather than the

extremes and particularly the top end.

To compensate for these possible sources of bias, we constructed a variable which

measures the number of years panel members have already responded to survey questions.

This variable (‘years in panel’) is included as a control on the right hand side of all

equations in this paper.

Table 1 Means, standard deviations and Pearson correlations for health measures: men

Na Mean SD Correlations

Health satisfaction 179,947 6.753 2.235 1.000

Self-reported health 178,590 3.462 0.954 0.748 1.000

Physical health (pcs) 49,753 50.053 9.436 0.659 0.780 1.000

Doctor visitsb 166,886 9.067 16.662 -0.389 -0.427 -0.456 1.000

Grip strengthb (kg) 4,522 46.524 0.123 0.212 0.265 0.334 -0.232 1.000

a The observations are person years for the period 1990–2010; i.e. a person is counted for each year heresponded in SOEPb The logarithm of this variable was used for calculating correlations

Table 2 Means, standard deviations and Pearson correlations for health measures: women

Na Mean SD Correlations

Health satisfaction 194,249 6.561 2.291 1.000

Self-reported health 193,024 3.347 0.975 0.749 1.000

Physical health (pcs) 53,786 49.086 9.946 0.664 0.780 1.000

Doctor visitsb 180,462 11.699 17.571 -0.374 -0.404 -0.414 1.000

Grip strengthb (kg) 4,892 29.589 7.031 0.256 0.300 0.374 -0.185 1.000

a The Ns are person years for the period 1990–2010; i.e. a person is counted for each year she responded inSOEPb The logarithm of this variable was used for calculating correlations

B. Headey et al.

123

2.3 Data Analysis Methods

Most of the statistical analysis in the paper involves use of standard econometric tech-

niques for analysing panel data; mainly Generalized Least Squares random effects and

fixed effects regressions. These techniques will be briefly described in the ‘‘Results’’

section.

Towards the end of the paper we estimate a 3-wave structural equation model of

relationships between church attendance, life satisfaction and the pcs measure of health. A

non-recursive (two-way causation) relationship is found between life satisfaction and pcs.

Structural equation modelling, rather than regression analysis, is necessary whenever

the aim is to estimate a set of equations, rather than a single equation, and especially when

two-way causal linkages are involved.6 The structural equations in this paper are estimated

using maximum likelihood (ML) analysis.7 ML coefficients and their associated standard

errors can be given the same interpretation as regression coefficients. However, assessing

the ‘goodness of fit’ of structural models is more complicated than for regression models. It

is necessary to assess the overall fit between estimates for several equations and the input

data for the model (usually a variance–covariance matrix).8 Several measures of fit are

conventionally used. The root mean squared error of approximation (RMSEA) and the

Standardized Root Mean Squared Residual are directly based on comparing differences

(residuals) between the actual input matrix and the matrix implied by model estimates. It

has become conventional to regard an RMSEA under 0.05 and an SRMR under 0.08 as

satisfactory (Bentler 1990; Browne and Cudeck 1993).

A more complicated assessment is provided by the comparative fit index (CFI). This is

based on a likelihood ratio (LR) Chi square test and takes account of the contribution of

each estimate in the model to overall goodness of fit. A somewhat similar index, the

Tucker–Lewis Index (TLI), more strongly rewards parsimony and penalises models

including explanatory variables which account for little variance, even if statistically

significant. CFI and TLI results above 0.90 used to be regarded as satisfactory, but some

recent reviews recommend 0.95 (Joreskog and Sorbom 1986; Browne and Cudeck 1993).

Of course, results should also be assessed by reference to the variance accounted for in the

whole model, which is given by the coefficient of determination (CD).

Models which involve two-way causation raise issues of identification; issues about

whether there are there sufficient independent pieces of information (variances and

covariances) in the input matrix to enable all free parameters in the model to be estimated.

In the model presented in this paper identification is achieved partly by constraining

parameters linking the same variables to be equal across waves. Equality constraints of this

6 Regression analysis is essentially a single equation technique. Regression estimates derived from multi-equation systems are likely to be biased, due to correlations between explanatory variables and error terms insome or all equations. A key assumption of OLS regression is that such correlations are zero.7 ML estimates are usually consistent and asymptotically normal under the (not very restrictive) assumptionof conditional normality (STATA 2011). Only paths or covariances linking conditioning (i.e. control)variables may not be consistent and asymptotically normal (even then, the main problem lies just withestimates of standard errors). These paths are not usually of substantive interest; substantive interest lies inpaths (1) linking exogenous with endogenous variables and (2) between endogenous variables.8 From a mathematical standpoint, a model can be viewed as a set of constraints—or a set of restrictedpaths—limiting the possibilities of simply reproducing the input data. Attempts by a researcher to improvehis/her model involve modifying these constraints to improve model fit…subject to the theory/hypothesesunderlying the model.

Evidence for Germany

123

kind are reasonably plausible for panel data in which the observed covariances are

approximately equal within and across waves (Kessler and Greenberg 1981; Finkel 1995).

Models involving two-way causation can be highly unstable; that is, small changes in

model specification can produce large and non-credible changes in results. In view of this,

Bentler and Freeman (1983) developed a test of model stability, which is used here.

We used the new STATA module for structural equation modelling to generate the

results reported here (STATA, Release 12, 2011). This package offers a range of esti-

mators, including maximum likelihood, and includes the tests of goodness of fit described

above. It also includes checks for model identification.

Statistics textbooks usually assert that regression analysis and structural equation

modelling both require the assumption that the endogenous variables of main interest

(church attendance, health measures and life satisfaction) are measured on an interval or

ratio scale. In fact, our main endogenous variables are only measured on ordinal (rank

order) scales. In recent years, it has become quite common to treat ordinal-level variables

as interval-level, provided that their distributions are not seriously skewed.9 The argument

for this approach is that interval-level techniques are more flexible, better understood by

most readers, and generally enable stronger causal inferences than nominal and ordinal-

level statistics.

3 Results: Effects of Religion Practice on Health and Longevity

3.1 Links Between Church Attendance and Health Measures

Our estimates of the effects of church attendance on health are mostly based on generalized

least squares (GLS) random effects (RE) or fixed effects (FE) longitudinal (panel)

regressions. Each method has advantages and disadvantages. An advantage of the RE

regressions is that they take account of both cross-sectional, between-person differences

and also longitudinal within-person changes in associations between church attendance and

health. A wide range of other variables can be included in equations—mostly as ‘con-

trols’—and their effects estimated. The main disadvantage of RE analysis is that, as with

ordinary least squares regression, it is assumed that there are no omitted variables which

are significantly associated with both the outcome variable (a health measure) and

explanatory variables. This assumption is usually dubious and cannot be checked. If it is

not correct, regression estimates of the effects of explanatory variables are likely to be

biased (omitted variables bias).

FE analysis is entirely based on within-person changes over time (within-person

regression), so that variables which vary between persons, but do not change within-person

(e.g. gender, personality traits) are automatically ‘controlled’ and just drop out of the

analysis. Because one major source of omitted variables bias is removed, FE estimates

allow for stronger causal inferences about the effects of explanatory variables on outcomes

of interest.10 A disadvantage is that information about the effects of variables which are

fixed within-person is lost. Further, a consequence of discarding between-person variance

is that estimates usually have much larger standard errors than would be the case if

between-person variance had been retained.

9 The pcs health measure (0–100 scale) and life satisfaction (0–10 scale) both have quasi-normaldistributions.10 However, omitted variables which vary within-person over time can still bias coefficients.

B. Headey et al.

123

Tables 3 and 4 provide RE and FE estimates of the effects of church attendance on

health for men and women. The health measures are arrayed from left to right in the tables

according to their assumed subjectivity–objectivity: health satisfaction, self-reported

health, pcs health, doctor visits and grip strength. It should be noted that results in these

two tables are net of controls for variables routinely shown to be associated with health:

age, age-squared, partnered, years of education, household income and being unemployed.

The NEO personality traits are also included in the RE equations.

The German results for the two most subjective measures—health satisfaction and self-

rated health—replicate results reported in the American literature. For both men and

women statistically significant relationships are found between church attendance and

these subjective measures. For men the metric regression coefficient linking church

attendance to health satisfaction is 0.168 (p \ 0.001) in the RE equation and 0.102

(p \ 0.001) in the FE equation. The equivalent results for women are RE = 0.118

(p \ 0.001) and FE = 0.048 (p \ 0.05). For the self-reported health measure, the esti-

mated effects of church attendance for men are RE = 0.054 (p \ 0.001) and FE = 0.034

(p \ 0.01). The results for women are RE = 0.050 (p \ 0.001) and FE = 0.032

(p \ 0.001).

The RE equations indicate that the personality trait of conscientiousness is positively

related to health, while the neuroticism trait is strongly negatively related. Being unem-

ployed is strongly associated with poor health for both men and women. It is perhaps worth

mentioning that, if we had run simpler models and omitted personality traits and the

unemployment variable (as is done in many published studies), then the effects of church

attendance on these subjective measures of health would have seemed even stronger.

The pcs index, which is partly based on respondent reports of health-related difficulties

in performing daily activities, may be regarded as intermediate in terms of subjectivity–

objectivity. For both men and women statistically significant coefficients (men b = 0.590,

p \ 0.001; women b = 0.296, p \ 0.05) are found between church attendance and pcs in

RE equations, but the FE equations yield non-significant results.

Finally, no statistically significant relationships were found for either men or women

between church attendance and the two most objective measures, namely annual doctor

visits and grip strength of the dominant hand. Nor were the grip strength results significant

when the sample was confined to people age 60 and over, or when the outcome was change

in grip strength.

It should be mentioned that the regression models used for these last two health mea-

sures were different from the models previously described, although in terms of causal

rigour much closer to RE than FE. Doctor visits is a ‘count’ measure with a large standard

deviation (over-dispersion).11 This made it sensible to use a Poisson-type regression, and

specifically a negative binomial regression which caters for over-dispersion (Winkelmann

2000). In the case of the grip strength measure, it was inappropriate to use longitudinal RE

and FE estimators, because the sample size was relatively small and only two time points

were available (2006 and 2008). Ordinary least squares regression was used, with the

outcome variable being logged (ln grip strength).12

It was hypothesized that church attendance would have a more beneficial effect on

women’s health than men’s. This hypothesis has perhaps been rendered less interesting by

11 A large number of respondents reported never have been to the doctor in the period in question, and a fewreported going hundreds of times.12 Because some individuals have very strong grips, the variable is not normally distributed, but isapproximately log-normal.

Evidence for Germany

123

Ta

ble

3M

en:

rela

tionsh

ipbet

wee

nch

urc

hat

tendan

cean

dhea

lth

mea

sure

sra

nked

from

subje

ctiv

eto

obje

ctiv

e:G

LS

random

effe

cts

(RE

),fi

xed

effe

cts

(FE

),neg

ativ

eb

ino

mia

lan

dO

LS

reg

ress

ion

s(m

etri

cco

effi

cien

tsw

ith

tv

alu

esin

par

enth

eses

)

Eff

ects

of

churc

hat

tendan

ce(1

–0)

on

hea

lth

Subje

ctiv

em

easu

res !

obje

ctiv

em

easu

res

Expla

nat

ory

var

iable

sH

ealt

hsa

tisf

acti

on

(0–10)

Sel

f-re

port

edhea

lth

(1–5)

Physi

cal

hea

lth

(pcs

)(0

–100)

Doct

or

vis

itsa

(Ln)

Gri

pst

rength

RE

FE

RE

FE

RE

FE

NB

RE

GO

LS

Churc

hat

tendan

ce0.1

68***

(6.8

0)

0.1

02***

(3.9

8)

0.0

54***

(5.0

2)

0.0

34**

(3.1

4)

0.5

90***

(3.6

0)

0.2

16

(0.9

9)

0.0

11

ns

(0.7

0)

20.0

01

ns

(0.4

6)

Age

-0.0

47***

(9.2

0)

-0.0

70

(1.8

4)

-0.0

21***

(9.4

4)

-0.0

34

(1.8

1)

-0.1

30***

(4.9

2)

-0.5

95

(1.4

6)

0.0

06**

(2.6

3)

0.0

16***

(10.8

6)

Age-

squar

ed0.0

11*

(2.2

0)

-0.0

16*(2

.52)

0.0

02

(0.7

0)

-0.0

09**

(2.9

6)

-0.1

46***

(5.5

9)

-0.2

45***

(4.3

4)

0.0

15***

(6.5

4)

-0.0

23***

(15.8

1)

Par

tner

ed0.0

43

(1.5

4)

0.0

37(1

.20)

0.0

08

(0.6

3)

0.0

16

(1.1

7)

-0.6

88***

(4.4

8)

-1.0

94***

(4.7

5)

0.0

10

(0.6

8)

0.0

43***

(5.4

7)

Yrs

.ed

uca

tion

0.0

36***

(6.3

2)

-0.0

07(0

.70)

0.0

21***

(8.5

5)

-0.0

01

(0.3

1)

0.5

39***

(20.3

3)

0.0

29

(0.3

6)

-0.0

22***

(9.9

8)

-0.0

00

(0.1

0)

HH

net

inco

me

0.0

18***

(5.3

2)

0.0

07**

(2.8

0)

0.0

07***

(5.0

6)

0.0

03**

(2.8

0)

0.0

68***

(3.3

2)

0.0

11

(0.7

0)

-0.0

11***

(6.9

5)

-0.0

00

(0.4

0)

Unem

plo

yed

-0.2

78***

(8.2

6)

-0.2

26***

(7.4

5)

-0.1

20***

(8.8

1)

-0.0

93***

(7.5

8)

-1.1

16***

(4.5

3)

-0.5

89*

(2.2

4)

0.1

50***

(6.0

2)

-0.0

67***

(3.9

3)

Yrs

.in

pan

el-

0.0

31***

(16.2

5)

0.0

16

(0.4

3)

-0.0

14***

(15.9

7)

0.0

08

(0.4

0)

-0.0

52***

(5.1

5)

0.5

23

(1.2

8)

0.0

01

(1.2

5)

-0.0

02*

(3.1

3)

Neu

roti

cism

-0.3

45***

(25.3

0)

-0.1

44***

(25.4

1)

-0.9

01***

(13.8

1)

0.1

53***

(31.5

7)

-0.0

13***

(4.0

4)

Extr

over

sion

0.0

54***

(3.6

9)

0.0

20***

(3.3

2)

-0.0

58

(0.8

3)

0.0

19***

(3.6

2)

0.0

04

(1.3

4)

Open

nes

s0.0

27

(1.8

2)

0.0

05

(0.7

6)

0.0

29

(0.4

0)

0.0

32***

(6.0

2)

-0.0

01

(0.1

7)

Agre

eable

0.0

77***

(4.5

8)

0.0

15*

(2.1

7)

0.1

21

(1.4

9)

-0.0

06

(0.9

4)

-0.0

18***

(4.7

7)

Consc

ienti

ous

0.1

54***

(8.6

3)

0.0

57***

(7.4

0)

0.5

13***

(5.7

9)

-0.0

83***

(12.5

9)

0.0

04

(1.0

2)

R-s

quar

ed0.1

63

0.0

90

0.2

04

0.1

47

0.2

90

0.2

10

–0.3

17

Nb

96,8

49

123,2

98

96,8

32

122,8

05

28,8

56

32,8

52

92,0

97

3,6

58

Tval

ues

are

bas

edon

robust

stan

dar

der

rors

aA

neg

ativ

ebin

om

ial

regre

ssio

nw

asuse

dto

esti

mat

eth

eeq

uat

ion

for

doct

or

vis

its.

Mea

sure

of

fit:

Chi

squar

e(1

3)

=5,0

97.6

0b

The

obse

rvat

ions

are

per

son-y

ears

***

Sig

nifi

cant

at0.0

01;

**

signifi

cant

at0.0

1;

*si

gnifi

cant

at0.0

5

B. Headey et al.

123

Ta

ble

4W

om

en:

rela

tionsh

ipbet

wee

nch

urc

hat

tendan

cean

dhea

lth

mea

sure

sra

nked

from

subje

ctiv

eto

obje

ctiv

e:G

LS

random

effe

cts,

fixed

effe

cts,

neg

ativ

eb

ino

mia

lan

dO

LS

regre

ssio

ns

(met

ric

coef

fici

ents

wit

ht

val

ues

inp

aren

thes

es)

Eff

ects

of

churc

hat

tendan

ce(1

–0)

on

hea

lth

Subje

ctiv

em

easu

res !

obje

ctiv

em

easu

res

Expla

nat

ory

var

iable

sH

ealt

hsa

tisf

acti

on

(0–10)

Sel

f-re

port

edhea

lth

(1–5)

Physi

cal

hea

lth

(pcs

)(0

–100)

Doct

or

vis

itsa

(Ln)

Gri

pst

rength

RE

FE

RE

FE

RE

FE

NB

RE

GO

LS

Churc

hat

tendan

ce0.1

18***

(5.2

4)

0.0

48*

(2.1

1)

0.0

50***

(5.3

2)

0.0

32***

(3.3

0)

0.2

96*

(2.0

0)

20.0

09

(0.0

5)

0.0

15

(1.4

0)

20.0

02

(0.1

9)

Age

-0.0

28***

(6.2

7)

-0.1

52***

(3.7

8)

-0.0

15***

(7.3

8)

-0.0

51***(2

.72)

-0.0

55*

(2.2

6)

-0.4

49

(1.1

2)

-0.0

06***

(3.4

7)

0.0

18***

(10.9

6)

Age-

squar

ed-

0.0

09

(1.9

0)

-0.0

20***

(3.5

2)

-0.0

05**

(2.6

8)

-0.0

10***

(3.9

6)

-0.2

39***

(9.8

8)

-0.3

74***

(6.9

9)

0.0

17***

(10.4

1)

-0.0

25***

(15.2

3)

Par

tner

ed0.0

13

(0.5

1)

-0.0

38

(1.3

8)

0.0

04

(0.3

8)

-0.0

09

(0.7

4)

-0.4

70***

(3.2

1)

-0.8

97***

(3.9

6)

0.0

07

(0.7

5)

0.0

16

(1.8

5)

Yrs

.ed

uca

tion

0.0

51***

(9.3

4)

0.0

29***

(3.2

3)

0.0

25***

(10.3

2)

0.0

12**(2

.95)

0.3

77***

(13.3

2)

0.0

00

(0.0

0)

-0.0

09***

(4.9

4)

0.0

04***

(3.2

9)

HH

net

inco

me

0.0

06

(1.6

2)

-0.0

00

(0.3

3)

0.0

02

(1.1

5)

-0.0

01*

(2.0

0)

0.0

19

(1.7

5)

-0.0

22*

(2.3

2)

-0.0

03***

(4.6

7)

0.0

00

(1.0

2)

Unem

plo

yed

-0.2

05***

(6.3

9)

-0.1

35***

(4.5

8)

-0.0

86***

(6.3

9)

-0.0

57***

(4.5

8)

-0.5

06*

(2.2

8)

0.0

94

(0.4

0)

0.1

01***

(5.2

3)

-0.0

42*

(2.1

9)

Yrs

.in

pan

el-

0.0

20***

(10.8

1)

0.1

15**

(2.8

6)

-0.0

08***

(10.4

8)

0.0

32

(1.7

2)

-0.0

40***

(3.9

0)

0.5

09

(1.2

7)

0.0

03***

(3.9

2)

-0.0

00

(0.3

6)

Neu

roti

cism

-0.3

57***

(28.4

8)

-0.1

61***

(30.7

7)

-0.9

69***

(15.3

9)

0.1

29***

(36.1

9)

-0.0

11***

(3.7

8)

Extr

over

sion

0.0

80***

(5.4

0)

0.0

28***

(4.6

0)

0.1

03

(1.4

3)

0.0

21***

(4.9

2)

-0.0

01

(0.1

7)

Open

nes

s0.0

24

(1.7

2)

0.0

01

(0.1

7)

-0.0

09

(0.1

2)

0.0

19***

(4.9

4)

0.0

10**

(2.7

9)

Agre

eable

0.0

73***

(4.2

4)

0.0

07

(1.0

2)

-0.0

51

(0.5

9)

-0.0

23***

(4.6

4)

-0.0

16***

(3.8

1)

Consc

ienti

ous

0.1

31***

(6.7

8)

0.0

46***

(5.6

7)

0.4

37***

(4.6

4)

-0.0

41***

(7.6

0)

0.0

02

(0.4

6)

R-s

quar

ed0.1

62

0.0

92

0.2

12

0.1

50.3

03

0.2

4–

0.3

40

Nb

107,5

38

135,0

41

107,5

87

134,6

03

32,1

46

36,0

77

102,3

26

4,0

01

Tval

ues

are

bas

edon

robust

stan

dar

der

rors

aA

neg

ativ

ebin

om

ial

regre

ssio

nw

asuse

dto

esti

mat

eth

eeq

uat

ion

for

doct

or

vis

its.

Mea

sure

of

fit:

Chi

squar

e(1

3)

=3,8

49.3

4b

The

obse

rvat

ions

are

per

son-y

ears

***

Sig

nifi

cant

at0.0

01;

**

signifi

cant

at0.0

1;

*si

gnifi

cant

at0.0

5

Evidence for Germany

123

finding that the apparent strength of all effects depends heavily on which measures are

used. However, if we focus just on the three more subjective measures in Tables 3 and 4,

the evidence could be read as indicating that, if anything, men benefit somewhat more than

women.13 We will find similar pro-male evidence in a later section on mortality. The

thorny issue of whether the church attendance-health link is mainly an artefact of sub-

jective measurement is covered in the next section.

3.1.1 Is the Link Between Church Attendance and Health Measures Partly Due

to Satisfaction Bias/Positivity Bias? Issues of Measurement Validity

In this section we present evidence which suggests that the apparent link between church

attendance and better health might be due to satisfaction bias/positivity bias. This evidence

will be more or less contradicted later, but first let us see where the trail leads us.

It is certainly the case that religious people report that they are relatively satisfied with

everything under the sun (Andrews and Withey 1976; Campbell et al. 1976; Argyle 2001).

In SOEP there are positive correlations between church attendance and such miscellaneous

items as job satisfaction, leisure satisfaction, satisfaction with one’s school grades (years

ago), satisfaction with local public transport, and satisfaction with the introduction of the

Euro. It may be, of course, that religious people really are more satisfied with everything—

we come back to this possibility later—but it is also possible that what may be termed

satisfaction bias or positivity bias, is wholly or partly responsible for apparent links

between church attendance and subjective measures of health.

Our first approach to assessing this issue is to construct a composite measure of sat-

isfaction from four survey items which have been included in SOEP in most survey years:

job satisfaction, leisure satisfaction, satisfaction with household income, and satisfaction

with your household work. These items are all measured on the same 0–10 scale as life

satisfaction; the composite measure was constructed simply by averaging respondents’

scores. The Pearson correlation between this composite satisfaction measure and church

attendance is 0.078 (p \ 0.001). Its correlations with the subjective health measures are all

substantial: 0.446 with health satisfaction, 0.320 with self-reported health, and 0.207 with

the pcs health index. Correlations with the more objective measures of health are in the

expected direction but not statistically significant: -0.028 with (ln)doctor visits and 0.011

with (ln)grip strength of the dominant hand. Notice that these correlations decline

monotonically, moving from the most subjective to the most objective measures of health.

The next step is to see what happens to the associations reported in Tables 3 (men) and

4 (women) between church attendance and the subjective health measures, when we add

the composite satisfaction measure as a ‘control’. The original estimates (which were net

of demographic, socio-economic and personality trait controls) and the revised estimates

with the extra control are compared in Tables 5 and 6.

The evidence in these two tables could be read as indicating that the link between

church attendance and subjective health measures is in large part due to satisfaction/

positivity bias. Most of the relationships initially estimated to be statistically significant in

Tables 3 and 4 turn out to be non-significant, when the composite satisfaction measure is

added to equations as an extra ‘control’. In the men’s equations five out of six coefficients

for the effects of church attendance were initially estimated to be statistically significant, at

13 Again using the subjective measures, no differences in the church attendance-health link were foundbetween Protestants and Catholics. Nor did church attendance appear to benefit older people, or specificallyolder widows, more than younger people.

B. Headey et al.

123

Ta

ble

5M

en:

effe

cto

fch

urc

hat

ten

dan

ceo

nsu

bje

ctiv

eh

ealt

hm

easu

res,

con

tro

llin

gfo

rp

oss

ible

sati

sfac

tio

n/p

osi

tiv

ity

bia

s:ra

nd

om

effe

cts

(RE

)an

dfi

xed

effe

cts

(FE

)p

anel

regre

ssio

ns

(met

ric

coef

fici

ents

wit

ht

val

ues

inp

aren

thes

es)

Su

bje

ctiv

em

easu

res !

ob

ject

ive

mea

sure

s

Expla

nat

ory

var

iable

sH

ealt

hsa

tisf

acti

on

(0–

10)

Hea

lth

sati

sfac

tio

n(0

–1

0)

Sel

f-re

port

edh

ealt

h(1

–5

)S

elf-

rep

ort

edh

ealt

h(1

–5

)P

hysi

cal

hea

lth

(pcs

)(0

–1

00

)P

hy

sica

lh

ealt

h(p

cs)

(0–

100

)R

Ea

FE

bR

Ea

FE

bR

Ea

FE

b

Ch

urc

hat

ten

dan

cein

Tab

le3

0.1

68

**

*(6

.80

)0

.102

**

*(3

.98

)0

.054

**

*(5

.02

)0

.034

**

(3.1

4)

0.5

90

**

*(3

.60

)0

.216

(0.9

9)

Ch

urc

hat

ten

dan

cen

eto

fco

mp

osi

tesa

tisf

acti

on

mea

sure

0.1

03

**

(2.8

5)

0.0

36

(0.7

7)

0.0

24

(1.4

6)

-0

.021

(1.0

7)

0.6

04

*(2

.26

)-

0.4

71

(1.9

4)

Tv

alues

are

bas

edo

nro

bu

stst

andar

der

rors

Th

eo

bse

rvat

ion

sar

ep

erso

n-y

ears

aT

he

RE

coef

fici

ents

are

net

of

age,

age-

squ

ared

,p

artn

ered

(1–

0),

yea

rso

fed

uca

tio

n,

ho

use

ho

ldn

etin

com

e,u

nem

plo

yed

(1–

0),

yea

rsin

pan

el,

and

the

NE

OA

Cp

erso

nal

ity

trai

tsb

The

FE

coef

fici

ents

are

net

of

the

effe

cts

of

age,

age-

squar

ed,

par

tner

ed(1

–0),

yea

rso

fed

uca

tion,

house

hold

net

inco

me,

unem

plo

yed

(1–0),

and

yea

rsin

pan

el

**

*S

ign

ifica

nt

at0

.00

1;

**

sig

nifi

can

tat

0.0

1;

*S

ign

ifica

nt

at0

.05

Evidence for Germany

123

Ta

ble

6W

om

en:

effe

ctof

churc

hat

tendan

ceon

subje

ctiv

ehea

lth

mea

sure

s,co

ntr

oll

ing

for

poss

ible

sati

sfac

tion/p

osi

tivit

ybia

s:ra

ndom

effe

cts

(RE

)an

dfi

xed

effe

cts

(FE

)pan

elre

gre

ssio

ns

(met

ric

coef

fici

ents

wit

ht

val

ues

inp

aren

thes

es)

Su

bje

ctiv

em

easu

res !

ob

ject

ive

mea

sure

s

Expla

nat

ory

var

iable

sH

ealt

hsa

tisf

acti

on

(0–

10)

Hea

lth

sati

sfac

tio

n(0

–1

0)

Sel

f-re

port

edh

ealt

h(1

–5

)S

elf-

rep

ort

edh

ealt

h(1

–5

)P

hysi

cal

hea

lth

(pcs

)(0

–1

00

)P

hy

sica

lh

ealt

h(p

cs)

(0–

100

)R

Ea

FE

bR

Ea

FE

bR

Ea

FE

b

Ch

urc

hat

ten

dan

cein

Tab

le3

0.1

18

**

*(5

.24

)0

.048

*(2

.11

)0

.050

**

*(5

.32

)0

.032

**

*(3

.30

)0

.29

6*

(1.6

8)

-0

.00

9(0

.05

)

Ch

urc

hat

ten

dan

cen

eto

fco

mp

osi

tesa

tisf

acti

on

mea

sure

0.0

39

(1.3

1)

-0

.010

(0.2

7)

0.0

31

*(2

.25

)0

.018

(1.0

9)

0.2

70

(0.5

3)

-0

.22

2(0

.50

)

Tv

alues

are

bas

edo

nro

bu

stst

andar

der

rors

Th

eo

bse

rvat

ion

sar

ep

erso

n-y

ears

aT

he

RE

coef

fici

ents

are

net

of

age,

age-

squ

ared

,p

artn

ered

(1–

0),

yea

rso

fed

uca

tio

n,h

ou

seh

old

net

inco

me,

un

emp

loy

ed(1

–0

),y

ears

inp

anel

,an

dN

EO

AC

per

son

alit

ytr

aits

bT

he

FE

coef

fici

ents

are

net

of

the

effe

cts

of

age,

age-

squar

ed,

par

tner

ed(1

–0),

yea

rso

fed

uca

tion,

house

hold

net

inco

me,

unem

plo

yed

(1–0),

and

yea

rsin

pan

el

**

*S

ign

ifica

nt

at0

.00

1;

**

sig

nifi

can

tat

0.0

1;

*si

gn

ifica

nt

at0

.05

B. Headey et al.

123

least at the 0.01 level. Of these just one remains significant at the 0.01 level, and another at

the 0.05 level. In the women’s equations only one out of an initial five coefficients remains

significant, and that only at the 0.05 level.

3.2 Mortality: Do Churchgoers Live Longer?

This section on mortality provides strong countervailing evidence. The results just given

only undermine the validity of the link between church attendance and health if it is the

case that subjective measures of health lack validity, and objective measures are preferable

on grounds of validity (and not just reliability). This might seem a ‘natural’ assumption,

but is it correct?

A reasonable way to check is to compare the different measures for their ability to

predict mortality. It can surely be assumed that the evidence in SOEP about whether people

who were once respondents are now dead or alive is almost 100 percent reliable and valid.

So if a particular health measure is a relatively good predictor of mortality, it can be

regarded as relatively valid, whereas if it is a poor predictor it is relatively invalid. In

Table 7 we report Pearson correlations between the health measures and whether SOEP

respondents were dead or alive 2, 4, 6 and 8 years after the measures were taken. Separate

tables are not given here for men and women, because the correlations for both sexes were

very similar.

A key result emerging from Table 7 is the subjective measures of health clearly do a

better job of predicting mortality than the objective measures (Ambrasat et al. 2011).14

Health satisfaction and self-reported health are better predictors of mortality 4, 6 and 8

years ahead than the two most objective measures (doctor visits and grip strength of the

dominant hand). The doctor visits measure performs poorly at all time intervals, and grip

strength is only as predictive as the subjective measures 2 years ahead.15

The most predictive measure at all time intervals is the pcs health index. Further, as the

evidence in Table 7 makes clear, its superiority over other measures steadily increases as

the time interval of prediction increases. It may also be recalled that this measure has the

highest mean correlation with all other measures, and so may be regarded as having the

highest convergent validity, as well as the highest predictive validity. An interesting and

Table 7 Predicting mortality (dead = 1 alive = 0) 2, 4, 6 and 8 years ahead: Pearson correlations

Subjective measures !objective measures

Time period Health satisfaction(0–10)

Self-reportedhealth (1–5)

Physical health(pcs) (0–100)

(ln) Doctorvisits

(ln) Gripstrength

2 Years ahead -0.092 -0.098 20.117 0.056 -0.091

4 Years ahead -0.123 -0.138 20.170 0.081 -0.097

6 Years ahead -0.151 -0.172 20.212 0.102 na

8 Years ahead -0.168 -0.193 20.263 0.112 na

All correlations are significant at the 0.001 level. Observations are person-years. The pcs measure was firsttaken in 2002, so 8-year predictions are the longest feasible

14 It has been suggested to us that this may be partly because the subjective measures, as well as being self-assessments of health, may also to some extent measure optimism or cheerfulness, which could itself berelated to longevity.15 (Ln) Grip strength of dominant hand proved to be a better predictor than change in grip strength(Bohannon 2008).

Evidence for Germany

123

perhaps critical feature of the pcs measure is that it is intermediate in the range from

subjective to objective. Recall that it combines self-rated health on a 1–5 scale with other

items measuring the difficulty a person experiences in performing routine tasks of daily

living.

The mortality results in the next two tables—Tables 8 (men) and 9 (women)—are the

key results in the paper. The aim is to predict who will be dead or alive 8 years into the

future (1990–1998, 1991–1999…2002–2010). Obviously the main predictor of interest is

church attendance, with other explanatory variables being included mainly as ‘controls’.

Only people age 52 and over at the starting date (1990, 1991…), who therefore died or

remained alive at age 60?, are included in the analyses. To have included younger people

would have largely negated the chance of getting significant results, since death rates for

under-60 s are low. The coefficients in the tables are odds ratios from logit regressions. An

odds ratio less than 1.00 mean that the explanatory variable in question reduces the odds of

dying within 8 years (net of the effects of other variables in the equation). An odds ratio

greater than 1.00 means that the variable increases the odds of dying.

The four models in Tables 8 and 9 differ from each other only in the type and number of

‘controls’ included. Model 1 only includes what may be regarded as standard controls (age,

partnered, income etc.). In model 2 the NEO personality traits are also included. Then in

model 3 father’s age of death and mother’s age of death are added. In model 4 the pcs

health index (which we are now treating as the ‘best’ measure of health) is also used as a

‘control’, in order to assess whether church attendance affects longevity, even controlling

for health status at baseline.

Table 8 Mortality predictions (death within 8 years) based on church attendance ? control variables: logitregression odds ratios for men age 52?

Explanatory variables Men model 1 Men model 2 Men model 3 Men model 4

Church attendance 0.801*** (4.50) 0.687*** (4.03) 0.620*** (4.54) 0.677* (1.99)

Age 1.115*** (41.14) 1.141*** (27.01) 1.124*** (20.47) 1.095*** (8.70)

Partnered 0.740*** (5.51) 0.608*** (5.44 0.634*** (4.42) 0.756 (1.57)

Yrs. education 0.921*** (6.57) 0.924*** (4.25) 0.922*** (3.69) 0.949 (1.40)

HH net income 0.926** (2.86) 1.005 (0.23) 0.971 (0.67) 0.963 (0.49)

Unemployed 0.990 (0.08) 1.119 (0.57) 1.209 (0.82) 1.220 (0.44)

Yrs. in panel 1.003 (0.83) 0.995 (0.69) 0.994 (0.72) 1.003 (0.27)

Neuroticism 1.063 (1.89) 1.069 (1.85) 0.978 (0.35)

Extroversion 0.969 (1.01) 0.994 (0.17) 1.021 (0.32)

Openness 0.861*** (4.82) 0.861*** (4.47) 0.841** (2.74)

Agreeableness 1.046 (1.05) 1.050 (1.07) 1.152 (1.74)

Conscientiousness 0.847*** (4.03) 0.836*** (3.87) 0.815* (2.47)

Age father died 0.999 (0.22) 1.000 (0.13)

Age mother died 1.002 (1.03) 1.003 (0.65)

Pcs health measure 0.953*** (6.38)

Pseudo R-squared 0.151 0.167 0.139 0.167

Na 24,737 19,210 11,208 2,237

T values are based on robust standard errorsa The observations are person-years

*** Significant at 0.001; ** significant at 0.01; * significant at 0.05

B. Headey et al.

123

The parental death variables in model 3 are somewhat problematic because only just

over two-thirds of respondents (more women than men) had recorded this information by

the time they themselves were age 60?. In many cases their parents probably had died, but

SOEP information on this matter is mainly obtained when new survey respondents provide

family history information, rather than being updated every year. It was essential to impute

values for parents whose year of birth was recorded but age of death was missing;

otherwise we would have been left with a sub-sample biased towards those whose parents

died relatively young. We imputed an age of death of 78 for male parents under 73; 78

being the current median death age of males in Germany. For female parents who were

under 78 we imputed an age of death of 83; this being the current median death age for

females. In the case of parents who were already within 5 years of the median death age for

their gender, or were already older than that, we assumed they would live for another

5 years. In the case of parents who, following this last decision rule, had an apparent

current age over 90, we assumed that death had already occurred, but had not been

recorded, and so left their age of death as missing.16 Plainly, a possible consequence of

imputing close to a third of cases (in this relatively crude way), is under-estimation of the

relationship between age of parental death and respondents’ own mortality.

For both men and women in this age group church attendance is strongly related to

longevity. The evidence from model 3—a model with all reasonable controls in place—

Table 9 Mortality predictions (death within 8 years) based on church attendance ? control variables: logitregression odds ratios for women age 52?

Explanatory variables Women model 1 Women model 2 Women model 3 Women model 4

Church attendance 0.835*** (4.01) 0.788** (2.82) 0.805* (2.33) 0.687* (2.05)

Age 1.117*** (41.94) 1.124*** (22.40) 1.111*** (17.10) 1.089*** (8.00)

Partnered 0.715*** (7.11) 0.750*** (3.37) 0.726*** (3.35) 0.636** (2.58)

Yrs. education 0.963*** (3.48) 1.047* (2.74) 1.066*** (3.25) 1.038 (1.01)

HH net income 0.980 (1.24) 1.014 (1.10) 1.014 (1.07) 0.996 (0.13)

Unemployed 1.074 (0.49) 1.205 (0.76) 1.499 (1.55) 1.061 (0.10)

Yrs. in panel 1.003 (0.79) 0.997 (0.43) 1.000 (0.05) 1.018 (1.60)

Neuroticism 1.058 (1.91) 1.038 (1.16) 1.046 (0.72)

Extroversion 0.965 (1.04) 0.987 (0.34) 0.994 (0.09)

Openness 0.951 (1.49) 0.878*** (3.72) 0.850* (2.41)

Agreeableness 0.983 (0.43) 0.942 (1.38) 0.974 (0.31)

Conscientiousness 0.763*** (6.62) 0.761*** (6.10) 0.718*** (4.09)

Age father died 0.994* (2.32) 0.995 (1.03)

Age mother died 0.994* (2.40) 0.994 (1.23)

Pcs health measure 0.960*** (5.20)

Pseudo R-squared 0.170 0.147 0.134 0.189

Na 28,617 19,878 13,464 2,421

T values are based on robust standard errorsa The observations are person-years

*** Significant at 0.001; ** significant at 0.01; * significant at 0.05

16 Without making this assumption, we would have had to leave an implausible number of apparentcentenarians in the dataset!

Evidence for Germany

123

indicates that the odds of men dying within 8 years are 38 % lower (odds ratio = 0.620,

p \ 0.001) if they attend church regularly rather than seldom or never. For women church

attendance reduces mortality by just under 20 % (odds ratio = 0.805, p \ 0.05).

The evidence in Tables 8 and 9 confirms previous research in showing that longevity is

related to being partnered (odds ratio for men in model 3 = 0.634, p \ 0. 001; for women

odds ratio = 0.726, p \ 0.001) and is also increased by rating high on the personality trait

of conscientiousness (men’s odds ratio in model 3 = 0.836, p \ 0.001; women’s

ratio = 0.761, p \ 0.001) (Strawbridge et al. 1997; Friedman and Martin 2011; Koenig

et al. 2012).

In this dataset the age at which their parents died appears to be associated with women’s

but not men’s mortality rates.17 For every extra year their father lived, women’s odds of

dying within the next 8 years are reduced by about half of one percent, with a similar

margin for every extra year their mother lived. As noted above, it is possible that inter-

generational longevity links would have appeared stronger if more complete data were

available on age of parental death (but see Lach et al. 2006 for not dissimilar results in a

large Israeli study).

The results for model 4—the final column in these tables—are astonishing. Here we

control for health status at baseline.18 It transpires that church attendance has a statistically

significant effect on mortality, even controlling for pcs 8 years before. The odds of

churchgoing men and women dying within 8 years are nearly one-third lower (men’s odds

ratio 0.677, p \ 0.05; women’s odds ratio 0.687, p \ 0.05) than the odds for individuals

who are non-churchgoers but otherwise have the same health status (and the same age,

education, income and personality traits) at baseline.

The wheel has come full circle. In the previous section we reviewed evidence that lent

itself to the interpretation that the apparent link between church attendance and health is

due to satisfaction/positivity bias in the more subjective measures of health. But then

countervailing evidence made it clear that the subjective measures, and even more the

intermediate pcs measure, are in one key respect—ability to predict mortality—more valid

than the objective measures. Finally, evidence on mortality showed that church attendance

significantly reduces the probability of dying, even controlling for health status at baseline.

4 Church Attendance is Linked to Better Health Indirectly Via Health ProtectiveAttitudes and Behaviors: A Structural Equation Model

Plainly, it is improbable that attending church could by itself make individuals healthier

and longer lived.19 There must be intermediate causal ‘‘mechanisms’’—intermediate

17 Age of father’s death (but not age of mother’s death) appears significantly related to men’s own longevityin alternative equations in which personality traits are omitted. Further, the results linking women’s lon-gevity to parental longevity are strengthened if traits are omitted. It may be noted that personality traits werenot included as controls in the Israeli and Icelandic studies cited in the ‘‘Introduction’’, which showedstatistically significant but weak intergenerational longevity linkages.18 In this table only the coefficients for church attendance and the pcs health measure are of interest. Theother variables are worth inclusion as ‘controls’, but their coefficients are somewhat misleading because thecausal ordering is inappropriate. For example, the variables measuring age, partnership status and years ofeducation should be regarded as causally antecedent to the pcs health measure. See Table 5 for preferableestimates.19 Note, however, the Norwegian study referenced above which indicated that attending religious serviceslowers blood pressure (Sorensen et al. 2011).

B. Headey et al.

123

links—between church attendance and health which account for health benefits. In this

section we offer a structural equation model which estimates the effects of some of these

causal mechanisms. The mechanisms include non-smoking, taking regular exercise and

enjoying higher than average life satisfaction, all of which are associated with both church

attendance and religious belief. There are likely to be other more intangible mechanisms,

which for lack of evidence in SOEP, we cannot investigate here. These may include the

spiritual comfort and sense of purpose and meaning in life which religion can provide.

First some bivariate and partial correlations which show associations between church

attendance and a range of health protective (and risk-avoiding) attitudes and behaviors. In

the SOEP dataset the correlation between church attendance and smoking (1 = smoker

0 = non-smoker) is -0.173 (p \ 0.001). Correlations with being risk averse and health

risk averse are, respectively, 0.079 (p \ 0.001) and 0.084 (p \ 0.001). Controlling for

gender and age, the partial correlation between church attendance and being partnered

which, as we have seen, is quite strongly related to better health, is 0.108 (p \ 0.001).

Churchgoers are somewhat less likely to become separated or divorced; the partial cor-

relation, controlling for gender and age is -0.030 (p \ 0.001). Churchgoers also take more

exercise (partial correlation = 0.044, p \ 0.001) and have higher rates of social engage-

ment/participation (partial correlation = 0.085, p \ 0.001). Both these behaviors are

associated with better health; the partial correlation of the pcs index with exercise (again

controlling for gender and age) is 0.044 (p \ 0.001) and with social participation it is 0.085

(p \ 0.001).

Religious individuals mostly come from religious families. In the SOEP data the

bivariate correlation between respondents’ own level of church attendance and fathers’

attendance is 0.580 (p \ 0.001); with mothers’ attendance it is 0.571 (p \ 0.001). To a

considerable extent, parents inculcate not only religious belief and practice, but also the

health-protective and health risk-avoidance attitudes and behaviors just reviewed. There

are positive associations in the German data between parent and ‘child’ levels of smoking,

risk aversion, exercise and social participation (Headey et al. 2013).

More important for health and longevity than any of the attitudes or behaviors con-

sidered so far is life satisfaction/happiness. It has been shown that church attendance is

quite strongly associated with higher than average life satisfaction (Myers 2013; Clark and

Lelkes 2009). The link holds in the German dataset even in fixed effects models (Headey

et al. 2010). Furthermore, it is well established that life satisfaction is associated with better

health and longevity (Deeg and van Zonneveld 1989; Diener et al. 1999; Argyle 2001;

Ambrasat et al. 2011). Just by way of illustration: in the SOEP data the correlation of life

satisfaction with 8-year mortality is -0.120 (p \ 0.001). So, if life satisfaction had been

included in Table 8 as a quasi-health measure, it would have performed better than doctor

visits and, presumably, grip strength (Ambrasat et al. 2011).20

The structural equation model in Fig. 1 shows links from church attendance to exercise

and (not) smoking, and then through to life satisfaction and our preferred measure of

health, the pcs index. It is a 3-wave panel model of a somewhat unusual kind. It covers the

years 2002–2010 with variables being grouped as contemporaneous, 2-year lagged and

4-year lagged. So the years covered are 2002–2004–2006, then 2004–2006–2008 and

finally 2006–2008–2010. The reason for the 2-year lags is that the pcs measure was only

taken biennially in these particular years. It was thought preferable to run a 3-wave model,

rather than the maximum possible 5-wave model, in order to obtain more reliable results

20 The grip strength measure was first taken in 2006, so it is not possible to make an 8-year prediction.However, grip strength performs less well than life satisfaction for 2- and 4-year predictions.

Evidence for Germany

123

due to larger sample numbers at each time point. An assumption implied by collapsing to

3-waves is that relationships among variables did not change within the overall time

period. Inspection of bivariate correlations within and across waves suggests that this

assumption is plausible.

The three mechanism variables which are included in the final model are smoking,

taking regular exercise and life satisfaction. Other variables, including social participation,

and both general and health-related risk aversion, were tried in earlier versions of the

model but had negligible effects on health, net of standard controls.

In Fig. 1 no direct link is drawn between church attendance and health. This is because

successive iterations of the model showed that, while church attendance substantially

affects health, the links are only indirect via the mechanism variables. Another important

point about the lay-out of the model is that it shows two-way (reciprocal) causation

between health and life satisfaction. In the life satisfaction literature health is usually

viewed as a cause of life satisfaction (Diener et al. 1999; Argyle 2001; Veenhoven 2009),

but it is reasonable to hypothesize that the reverse link may also be significant (Headey

et al. 1991). Indeed it virtually has to be so, otherwise life satisfaction would not predict

longevity. In the technical modelling literature, the links drawn in Fig. 1 between life

satisfaction and health are referred to as ‘‘cross-lagged’’ (Kessler and Greenberg 1981;

Finkel 1995). In cross-lagged models it is assumed that there is lagged two-way causation

between variables x and y, but not simultaneous causation (Kessler and Greenberg 1981;

Fig. 1 Effects of church attendance on exercise, smoking, life satisfaction and health (pcs): a three-wavestructural equation modelabcd. a Maximum likelihood (ML) coefficients *1,000. Results were initiallyobtained net od age, age-squared, years of education, household net income, NEOAC personality traits, andnumber of years in the panel. Variables not significant at 0.05, which detracted from the fit, were droppedfrom the model. b The pairs of coefficients labelled =a, =b, =c, d, =e, =f, =g, =h and =i were constrained(fixed) to be equal. c Covariances among exogenous variables, and also among the error terms ofendogenous variables measured at the same time point, were estimated as part of the model. This is routinelydone in structural equation modelling. It is necessary to achieve adequate model fit. d Coefficients in thisfigure are statistically significant at the 0.001 level unless marked ns (not significant). Measures of fit LRChi2 (229) = 1,575.303 (p \ 0.001); CD = 0.601; CFI = 0.993, TLI = 0.998; stability test = all eigen-values within unit circle; RMSEA = 0.017, SRMR = 0.012; Sample N = 19,423

B. Headey et al.

123

Finkel 1995). It can be shown by calculus that a cross-lagged model is mathematically

justified if the variables implicated in the two-way causation are assumed to influence each

other more or less continuously (Coleman 1968; Tuma and Hannan 1984; Finkel 1995).

This seems plausible in regard to health and life satisfaction.

In this model we make use of a concept of causation due to Nobel Laureate, Clive

Granger. Granger (1969) proposed that x can be regarded as a cause of y if any lagged

values of x remain statistically significant in equations which include multiple lagged

values of y. The intuitions behind what is usually termed ‘‘Granger-causation’’ are that (1)

causes must precede effects in time and (2) if spikes in values of time series variable x are

associated with subsequent spikes in time series y, despite taking account of multiple

lagged values of y, then there is prima facie evidence that x is a cause of y.21 In our model

the estimated links between explanatory variables (e.g. church attendance) lagged by two

waves are net of outcome variables (e.g. smoking) lagged by both two and four waves.22

Not everyone shares Granger’s view of causation. Some specialists in panel data

modelling take the view that, if researchers believe that causal effects are more or less

simultaneous (or occur at time intervals that are closer to simultaneous than the time

between panel waves), then it is better to omit cross-lagged links and instead include

simultaneous links (Kessler and Greenberg 1981).23 An argument in favour of this

approach is that, although coefficients obtained from a cross-lagged model will have the

right sign, they are likely to be biased downwards if simultaneous causation is more or less

what is occurring. Here we rely primarily on a Granger-style model, but we also estimated

a simultaneous causation model and will mention a few results from it.24

A further point of both substantive and methodological importance is that the Fig. 1

model contains nine pairs of equality constraints (pairs of coefficients fixed equal to each

other), marked =a, =b, =c, =d, =e, =f, =g, =h and =i. These coefficients are estimates of

the nine most interesting links in the model: links between church attendance and the

lifestyle mechanism variables, and between life satisfaction and pcs health. It is a rea-

sonable supposition that these links should be stable. That is, one would expect links

between consecutive waves of data to be the same. By imposing these equality constraints,

and then later relaxing them one by one, we can provide sensitivity tests of whether the

links really are stable; stable when nested within the overall model (Kessler and Greenberg

1981; Finkel 1995).25

The Fig. 1 model proves to be a very satisfactory fit to the data. The Comparative Fit

Index (CFI) is 0.993 and the Tucker–Lewis index (TLI) is 0.988; both well above the ‘good

fit’ threshold of 0.95. The RMSEA is 0.017 (a good fit is generally taken to be below 0.05)

21 Initially, Granger’s reasoning only applied to pairs of variables and took no account of possible 3rd, 4thetc confounding variables which might be associated with both x and y. However, the approach was laterextended to relationships involving multiple variables by using vector autoregression.22 In practice, when 6- and 8-year lags were included, these ML models began to show symptoms ofmulticollinearity. This was another reason for preferring 3-wave panel models to models with 4 or 5 waves.23 It can be technically feasible to estimate 3? wave models which include both lagged and simultaneouslinks. However, it is difficult to achieve identification for such models, so few examples are found in theliterature.24 The simultaneous causation model included 2-year lagged links between the same variable at differenttime points, but not 4-year (Granger-style) lags.25 The sensitivity tests are all the more stringent because two lags of the ‘‘mechanism’’ variables areincluded in estimates of links between church attendance lagged by 2 years and contemporaneous (time t)‘‘mechanisms’’.

Evidence for Germany

123

and the SRMR is 0.012 (under 0.08 is usually regarded as a good fit).26 The coefficient of

determination (CD) for the whole model is 0.601 (p \ 0.001) and the model satisfies the

Bentler and Freeman (1983) stability criterion with all eigenvalues falling within the unit

circle.

Results for men and women were so similar that separate models were unnecessary.

A key result is that religious practice/church attendance improves pcs health indirectly

through the mechanisms of taking more exercise (b = 0.051 p \ 0.001), being less likely

to smoke (b = -0.015, p \ 0.001) and enjoying greater life satisfaction (b = 0.165,

p \ 0.001). While the effect of life satisfaction is substantial, it might seem that the effects

of exercise and (not) smoking are quite small. However, it should be recalled that these

estimates have survived the rigorous test imposed by Granger-causation.27 The effect sizes

are still significant at the 0.001 level despite inclusion of lagged versions of dependent

variables (which of course have very strong associations, shown in Fig. 1, with ‘them-

selves’ at time t). Similarly, the two-way causal links between life satisfaction and pcs

health remain significant in this Granger framework. The coefficient for the effect of life

satisfaction on health is 0.349 (p \ 0.001) and the reverse link from pcs health to life

satisfaction is 0.016 (p \ 0.001).

The finding that links between church attendance and pcs health are indirect does not

mean that they are insubstantial. The total effect of church attendance on health is the sum

of its indirect effects via the mechanisms (Sobel 1987). Estimates of total effects are

routinely provided by structural equation modelling programs and are often the chief point

of interest in social science research (rather than more routinely reported direct effects).

Our best estimate of the total effect of church attendance on health is 0.172 (p \ 0.001),

which comes from the alternative simultaneous causation model rather than the Granger-

style model in Fig. 1.28

Most of this effect is transmitted via enhanced life satisfaction, which in the simulta-

neous model has a very strong direct effect on pcs (b = 0.726, p \ 0.001).

As explained above, equality constraints were imposed on the nine inter-wave pairs of

coefficients in Fig. 1 linking church attendance, exercise, smoking, life satisfaction and

health. Pairs of equality constraints can be removed (one pair at a time) to check whether

the link in question is stable over time, and to check whether the overall fit of the model is

improved (Kessler and Greenberg 1981; Finkel 1995). When these sensitivity tests were

undertaken, and results assessed by Lagrange multiplier tests (Kline 2005), it transpired

that seven of the nine equalities were well justified. However, by small margins (just

significant at the 0.05 level), the equality constraints on the links between church atten-

dance and life satisfaction, and between smoking and health, were diagnosed as not jus-

tified; the overall model fit would be slightly better if they were removed and each

26 Our alternative simultaneous causation model is also a good fit: CFI = 0.993; TLI = 0.985;RMSEA = 0.020; SRMR = 0.013 and CD = 0.627 (p \ 0.001).27 In our alternative simultaneous equation model the coefficients (bs) for links between church attendanceand exercise, smoking and life satisfaction are a bit larger: exercise b = 0.066 (p \ 0.001), smoking b =-0.019 (p \ 0.001) and life satisfaction b = 0.196 (p \ 0.001).28 The reason for preferring the simultaneous model at this juncture is largely technical. In the simultaneousmodel it is possible to obtain estimates of the total effect of contemporaneous church attendance on pcs(given in the text), as well as church attendance lagged by 2 years and 4 years. In the Fig. 1 model, solelybecause of the Granger-style specification, there are neither direct nor indirect links between pcs andcontemporaneous church attendance, or church attendance lagged by 2 years. The only available estimate isbetween pcs and church attendance lagged by 4 years. This estimate is 0.075 (p \ 0.001) which, given the4-year lag, seems certain to be downwardly biased.

B. Headey et al.

123

parameter estimated separately. On theoretical grounds we decided to leave all equality

constraints in place, taking the view that (1) these repeated links between successive waves

of data are expected to be the same and (2) the overall fit of the model is excellent, so

improving it by margins that are barely statistically significant is not a major consideration.

As would be expected, the same causal mechanisms that affect health also affect

mortality. Logit regressions (sample restricted to age 52?) similar to those in Tables 8 and

9 indicate that exercise, smoking and life satisfaction mediate the relationship between

church attendance and 8-year mortality. When exercise, smoking and life satisfaction are

added as explanatory variables to model 3 in Tables 8 and 9, they have quite strong direct

effects on the odds of dying within 8 years. Church attendance ceases to have a statistically

significant direct effect, but it is clear from all the evidence assembled that it must have

substantial indirect effects on mortality.

5 Discussion

We have found somewhat different results for Germany compared with the US American

researchers have mostly reported strong direct links between church attendance, health and

longevity. In the German case we initially reported that such direct links appeared to hold

only in very simple models—models with minimal demographic and socio-economic

controls—and models in which health outcomes were assessed by subjective measures.

Fixed effects models for the intermediate pcs index, and for the two most objective health

measures, suggested that links to church attendance were not statistically significant. It

seemed that links between church attendance and subjective measures were at least partly

due to a general tendency among religious people to report greater satisfaction with vir-

tually all aspects of life (Andrews and Withey 1976; Campbell et al. 1976; Argyle 2001).

This interim assessment was then thrown into question by finding that the subjective

measures, and even more the intermediate pcs index, were actually better predictors of

mortality than the objective measures. So, using the pcs index as our preferred outcome

measure, we investigated a range of causal mechanisms which might account for links

between religious practice and better health. Our final structural equation panel model can

be read as indicating that church attendance improves health and mortality indirectly, in

part via the mechanisms of regular exercise, not smoking and enhanced life satisfaction.

This is a similar conclusion to that reached in Friedman and Martin’s (2011) magisterial

study of longevity in the US.

Contrary to hypothesis, we found no evidence that women’s health and mortality

benefits more than men’s from religious practice. There was little difference between the

sexes, but the mortality results in particular (Tables 8, 9) tended to favour religious men.

The most striking result in the paper is that both male and female churchgoers have

significantly better 8-year survival rates, even controlling for health status (pcs index) at

baseline.

Most of our results were not hypothesized in advance of data analysis. Instead they

emerged from following unexpected trails of evidence. This being so, it will be especially

important to see if results replicate in future research and other countries. In this paper we

have unavoidably focussed on church attendance as the main measure of religiosity. In

future work, it will be desirable to investigate the potential health benefits of differing

religious beliefs and the more spiritual dimensions of religion. Christian religious belief

and practice appear to be linked to improved health and mortality; is the same true for other

religions?

Evidence for Germany

123

Acknowledgments We are grateful for comments and suggestions from Ruud Muffels of Tilburg Uni-versity and Stephen Headey of Monash University.

References

Ambrasat, J., Schupp, J., & Wagner, G. G. (2011). Comparing the predictive power of subjective andobjective health indicators: Changes in hand grip strength and overall satisfaction with life as pre-dictors of mortality. SOEP papers on Multidisciplinary Panel Data Research from DIW Berlin No. 398.

Andrews, F. M., & Withey, S. B. (1976). Social indicators of well-being. New York: Plenum.Argyle, M. (2001). The psychology of happiness (2nd ed.). London: Taylor and Francis.Ayhan, H. O., & Isiksal, S. (2004). Memory recall errors in retrospective surveys: A review check record

study. Quality & Quantity, 38, 475–493.Benson, H., Dusek, J. A., Sherwood, J. B., et al. (2006). Study of the therapeutic effects of intercessory

prayer (STEP) in cardiac bypass patients: A multicenter randomized trial of uncertainty and certaintyof receiving intercessory prayer. American Heart Journal, 151(4), 934–942.

Bentler, P. M. (1990). Comparative fit indices in structural models. Psychological Bulletin, 107, 238–246.Bentler, P. M., & Freeman, E. H. (1983). Tests for stability in linear structural equation systems, Psy-

chometrika, 48, 143–145.Bentler, P. M., & Raykov, T. (2000). On measures of explained variance in non-recursive structural equation

models. Journal of Applied Psychology, 85, 125–131.Bohannon, R. W. (2001). Dynamometer measurements of hand-grip strength predict multiple outcomes.

Perceptual and Motor Skills, 93, 323–328.Bohannon, R. W. (2008). Hand-grip dynamometry predicts future outcomes in aging adults. Journal of

Geriatric Physical Therapy, 31, 3–10.Browne, M. W., & Cudeck, R. (1993). Alternative ways of assessing model fit. In J. S. Bollen & K. A. Long

(Eds.), Testing structural equation models. Newbury Park, CA: Sage.Byrd, R. C. (1988). Positive therapeutic effects of intercessory prayer in a coronary care unit population.

Southern Medical Journal, 81(7), 826–829.Campbell, A., Converse, P. E., & Rodgers, W. R. (1976). The quality of American Life. New York: Sage.Clark, A. E., & Lelkes, O. (2009). Let us pray: Religious interactions in life satisfaction. PSE Working

Papers, halshs-00566120, HAL.Coleman, J. S. (1968). The mathematical study of change. In H. Blalock & A. Blalock (Eds.), Methodology

in social research (pp. 428–478). New York: McGraw Hill.Costa, P. T., & McCrae, R. R. (1991). NEO PI-R. Odessa, FL: PAR.Deeg, D., & van Zonneveld (1989). Does happiness lengthen life? In R. Veenhoven (Ed.), How harmful is

happiness?, Chap. 5. Rotterdam: Erasmus University Press.Diener, E., Suh, E. M., Lucas, R. E., & Smith, H. L. (1999). Subjective well-being: Three decades of

progress. Psychological Bulletin, 25, 276–302.Finkel, S. E. (1995). Causal analysis with panel data. Thousand Oaks, CA: Sage.Frick, J. R., Schupp, J., & Wagner, G. G. (2007). Enhancing the power of the German Socio-Economic

Panel Study (SOEP)—evolution, scope and enhancements. Schmollers Jahrbuch, 127, 139–169.Friedman, H. S., & Martin, L. R. (2011). The longevity project: Surprising discoveries for health and long

life from the landmark eight-decade study. Harmondsworth: Penguin.Frijters, P., Haisken-DeNew, J. P., & Shields, M. A. (2004). Money does matter! Evidence from increasing

real incomes and life satisfaction in East Germany following reunification. American EconomicReview, 94, 730–741.

Gerlitz, J. -Y., & Schupp, J. (2005). Zur Erhebung der Big-Five-basierten Persoenlichkeitsmerkmale imSOEP. www.diw.de/deutsche/produkte/publikationen/researchnotes/docs/papers/rn4.pdf.

Granger, C. W. J. (1969). Investigating causal relations by econometric models and cross-spectral methods.Econometrica, 37(3), 424–438.

Gudmundsson, H., et al. (2000). Inheritance of human longevity in Iceland. European Journal of HumanGenetics, 10, 743–749.

Hall, D. E. (2006). Religious attendance: More cost-effective than Lipitor? The Journal of the AmericanBoard of Family Medicine, 19, 103–109.

Headey, B. W., Muffels, R. J. A., & Wagner, G. G. (2013). Parents transmit happiness along with associatedvalues and behaviors to their children: A lifelong happiness dividend? Social Indicators Research,.doi:10.1007/s/11205-013-0326-7.

B. Headey et al.

123

Headey, B. W., Schupp, J., Tucci, I., & Wagner, G. G. (2010). Authentic happiness theory supported byimpact of religion on life satisfaction: A longitudinal analysis with data for Germany. Journal ofPositive Psychology, 5, 73–82.

Headey, B. W., Veenhoven, R., & Wearing, A. J. (1991). Top-down versus bottom-up theories of subjectivewell-being. Social Indicators Research, 24, 81–90.

Joreskog, K. G, & Sorbom, D. (1986). LISREL VI: Analysis of linear structural relationships by the methodof maximum likelihood. Mooresville, IN: Scientific Software.

Kessler, R. C., & Greenberg, D. F. (1981). Linear panel analysis. New York: Academic Press.Kline, R. B. (2005). Principles and practice of structural equation modelling. New York: Guilford Press.Koenig, H. G. (1998). Handbook of religion and mental health. New York: Academic Press.Koenig, H. G., King, D., & Carson, V. (2012). Handbook of religion and health. New York: Oxford

University Press.Lach, S., Riton, Y., & Simhon, A. (2006). Longevity across generations. Discussion Paper No. 4.06.

Maurice Falk Institute for Economic Research in Israel, Hebrew University of Jerusalem.Lucas, R. E. (2008). Personality and subjective well-being. In M. Eid & R. J. Larsen (Eds.), The science of

subjective well-being (pp. 171–194). New York: Guilford Press.Marmot, M. G., Davey Smith, G., Stansfield, S., et al. (1991). Health inequalities among British Civil

Servants: The Whitehall II study. Lancet, 337(8754), 1387–1393.Marmot, M. G., Rose, G., Shipley, M., & Hamilton, P. J. (1978). Employment grade and coronary heart

disease in British civil servants. Journal of Epidemiology and Community, 32(4), 244–249.Maselko, J. (2006). Senior citizens who attend religious services may prolong lung health. www.

seniorjournal.com, 1 Nov 2006 [cached].Maselko, J., & Kubzansky, L. (2006). Gender differences in religious practices, spiritual experiences and

health: Results from the General Social Survey. Social Science and Medicine, 32(3), 245–253.Metter, E. J., Talbot, L. A., Schrager, M., et al. (2002). Skeletal muscle strength as a predictor of all-cause

mortality in healthy men. Journal of Gerontological and Biological Sciences Medical Sciences, 57(10),B359–B365.

Myers, D. G. (2013). Religious engagement and well-being. In S. A. David, I. Boniwell, & A. Conley Ayers(Eds.), The Oxford handbook of happiness (pp. 88–100). Oxford: Oxford University Press.

Rantanen, T., Harris, T., Leveille, S. G., Visser, M., Foley, D., Masaki, K., et al. (2000). Muscle strength andbody mass index as long-term predictors of mortality in initially healthy men. Journal of Gerontology:Medical Sciences, 55A, M168–M173.

Roberts, B. W., Walton, K., & Viechtbauer, W. (2006). Patterns of mean-level change in personality traitsacross the life course: A meta-analysis of longitudinal studies. Psychological Bulletin, 132, 3–27.

Schnall, E., Wassertheil-Smoller, S., Swencionis, C., Zemon, V., et al. (2010). The relationship betweenreligion and cardiovascular outcomes and all-cause mortality in the women’s health initiative obser-vational study. Psychology and Health, 25, 249–263.

Schwarze, J., Andersen, H., & Silke, A. (2000). Self-rated health and changes in self-rated health aspredictors of mortality—first evidence from the German panel data. DIW Discussion Paper No. 203.Berlin, DIW.

Scollon, C. N., & Diener, E. (2006). Love, work and changes in extroversion and neuroticism over time.Journal of Personality and Social Psychology, 91, 1152–1165.

Sobel, M. E. (1987). Direct and indirect effects in linear structural equation models. Sociological Methodsand Research, 16, 155–176.

Sørensen, T., Danbolt, L. J., Lien, L., Koenig, H. G., & Holmen, J. (2011). The relationship betweenreligious attendance and blood pressure: The Hunt Study, Norway. The International Journal ofPsychiatry in Medicine, 42(1), 13–28.

STATA. (2011). Structural equation modeling reference manual. College Station, TX: STATA Press.Strawbridge, W. J., Cohen, R. D., Shema, S. J., & Kaplan, G. A. (1997). Frequent attendance at religious

services and mortality over 28 years. American Journal of Public Health, 87(6), 957–961.Tuma, N., & Hannan, M. (1984). Social dynamics. New York: Academic Press.Veenhoven (2009). World database of happiness. Rotterdam: Erasmus University Press. http://

worlddatabaseofhappiness.eur.nl/. Accessed 6 August 2013.Ware, J. E., Snow, K., & Kosinski, M. (2000). SF-36 health survey: Manual and interpretation guide.

Lincoln, RI: QualityMetric Inc.Winkelmann, R. (2000). Seemingly unrelated negative binomial regression. Oxford Bulletin of Economics

and Statistics, 62, 553–560.

Evidence for Germany

123