Upload
roger-watson
View
216
Download
3
Embed Size (px)
Citation preview
Janforum
Publishing the results of factor analysis:interpretation and presentation
The use of factor analysis in the Journal of Advanced
Nursing is increasing. There were two papers published
using this technique in 1994 and 1995, three in 1996 and
1997 and by July 1998 there were ®ve papers. Factor
analysis is a statistical technique for the reduction to
underlying dimensions of multivariate data (Dillon &
Goldstein 1984), where many variables are simultaneously
measured in a group of individuals. As someone who
employs factor analysis regularly and always pays partic-
ular attention to studies in which it is used I would like to
propose some standards for the presentation of factor
analytical results which will both enhance their utility to
readers and demonstrate that those who use the technique
really understand its complexities.
Speci®cally I would like to address the following
aspects:
� reasons for using factor analysis;
� sample size;
� presentation of ®rst unrotated principal component;
� communalities;
� rotation, the presentation of Eigenvalues and explained
variance;
� selecting the number of factors from a principal com-
ponents analysis;
� presenting the rotated solution; and
� limitations of exploratory factor analysis.
For the purpose of this article all papers using factor
analysis in the Journal of Advanced Nursing between
January 1994 and July 1998 were reviewed.
Reasons for using factor analysis
Factor analysis encompasses a range of statistical tech-
niques for the reduction of multivariate data sets to fewer
underlying dimensions or factors (Hair et al. 1987). There
is considerable laxity in the literature where factor anal-
ysis is used over the use of the terms `dimensions',
`constructs' and `factors' (Kline 1994) and writers should
try to con®ne themselves to one term within a publication
and to explain what they mean by it. Whichever term is
used, a factor is essentially a undimensional construct or
dimension within a data set which is characterized by the
variables of which it is comprised.
There are two branches of factor analysis, explorato-
ry and con®rmatory. All of the studies except one
(Dunn & Burnett 1995) published in the Journal of
Advanced Nursing are purely exploratory and the term
exploratory factor analysis (e.g. principal components
analysis, common factor analysis, principal axis factor-
ing and maximum likelihood factor analysis) should be
more commonly used in order to distinguish these
techniques from con®rmatory factor analysis. The lim-
itation of exploratory factor analysis will be discussed
below.
Sample size
Popular textbooks on factor analysis give speci®c advice
on sample size (Child 1990, Kline 1994). However, spe-
ci®c comment on sample size is rarely made in papers
using factor analysis in the Journal of Advanced Nursing.
It is quite frequently the case that adequate sample sizes
are not reported (Zhan & Shen 1994, Lauri & SalanteraÈ
1995, Crow et al. 1996, Drummond & Rickwood 1997,
Persson et al. 1997, Sitzia et al. 1997, Gilleard & Reed
1998, Severinsson 1998). The required variable to subject
ratio lies between 1:5 and 1:10 (Kline 1994), with the
former being the absolute minimum and the latter being
suf®cient. In order to detect underlying dimensions
resulting from individual differences between subjects,
there must be a greater number of variables being mea-
sured than subjects in the study.
Presentation of ®rst unrotated principal component
The most commonly applied technique of exploratory
factor analysis is principal components analysis. Strictly
speaking this is not really a method of factor analysis but a
way of making a preliminary investigation of the correla-
tion between all of the variables in a set of data (Kline
1994). However, this is an easily applied method of
identifying factors and the results, especially where sam-
ple sizes are adequate, differ very little from other meth-
ods such as principal axis factoring. However, it is very
exceptional, where principal components analysis has
been used, to see the loadings for the ®rst unrotated
principal component being reported in the Journal of
Advanced Nursing (Watson & Deary 1994). Nevertheless, I
would advocate this practice, in addition to reporting the
variance extracted by this component, because useful
Journal of Advanced Nursing, 1998, 28(6), 1361±1363
Ó 1998 Blackwell Science Ltd 1361
information can be gleaned, such as whether or not there
is a general factor underlying the data.
Communalities
Similarly to the ®rst unrotated principal component, it is
very exceptional to see communalities being reported in
the Journal of Advanced Nursing and these are also useful
(Watson & Deary 1994). Communality represents the
amount of variance in one variable which is shared by
all the other variables and, generally speaking, these
should be high. Where communalities are low it can
indicate that variables are unreliable and it is helpful if the
reader can judge the contribution of each variable to the
overall analysis (Child 1990).
Rotation, the presentation of Eigenvaluesand explained variance
While it is common practice to present the Eigenvalues
(Bryman & Cramer 1997) and the amount of variance
extracted by the unrotated principal components in a table
along with each of the factors (Zee et al. 1994, Hicks 1996,
Persson et al. 1997, Andrew 1998, Rosenkoetter & Garris
1998, Servinsson 1998) for the rotated solution this is,
strictly speaking, incorrect. The Eigenvalues are derived
from the variance extracted by the unrotated principal
components Ð which merely serves as a guide to the
number of factors to be rotated Ð the Eigenvalues and the
variance extracted by the unrotated principal components
are only relevant to the unrotated solution.
Once a rotational procedure has been imposed the
variance is distributed differently across the rotated fac-
tors compared with the unrotated principal components.
In fact, that redistribution is the point behind the rotation
which is aimed at achieving a simpler and more easily
interpreted solution than is possible merely from the
principal unrotated components (Child 1990). The Eigen-
values and extracted variances should be presented sep-
arately from the rotated solution or merely reported in the
text (Watson & Deary 1994, Zhan & Shen 1994, Lauri &
SalanteraÈ 1995).
Selecting the number of factors from principalcomponents analysis
Admittedly there is no approved method of selecting the
number of factors to be rotated from a principal compo-
nents analysis. The choice for principal components anal-
ysis is between using Eigenvalues greater than unity or the
scree slope method. The scree slope method, reported only
once in the Journal of Advanced Nursing (Dunn & Bynett
1995) involves plotting the Eigenvalues against the number
of factors and selecting the number of factors by visual
inspection of the resulting curve. Using Eigenvalues great-
er than one is commonly employed because it requires no
judgement on behalf of the researcher.
However, it frequently leads to solutions which are not
parsimonious leading to a large number of factors which
cannot be interpreted. Using Eigenvalues greater than
unity is often suf®cient but I lodge a plea for a routine
combination of methods in order that the scree slope may
be used either to verify the number of factors selected
using Eigenvalues greater than unity or to rotate a more
economical number of factors.
Presenting the rotated solution
The practice, in presenting the results of the rotated
solution, of only showing the variables which load on the
rotated factors should be discouraged (Crow et al. 1996,
Hicks 1996, Andrew 1998, Manninen 1998, Rosenkoetter
& Garris 1998). This practice can be very misleading to the
reader because it may make the derived solution look
simpler than it really is. What is masked by the practice of
only showing those variables loading on the rotated
factors are those variables which either cross-load on
other factors or variables with loadings on factors other
than those to which they are ascribed which are relatively
high. Quite simply, the loadings of all variables on all
factors should be shown (van der Zee et al. 1994, Watson
& Deary 1994, Lauri & SalanteraÈ 1995, Clifford 1996,
Drummond & Rickwood 1997, Persson et al. 1997).
Limitations of exploratory factor analysis
Exploratory factor analysis explores multivariate data for
putative factorial solutions whereby the data may be
reduced. In contrast, con®rmatory factor analysis tests
hypothesized factorial solutions and provides estimates of
how closely hypothesized solutions explain the variance
in the data. In other words, the investigator can set up
models based on previous research, on intuition or on the
intercorrelations in the data and test how closely these ®t
the data (Dunn & Burnett 1995).
Researchers presenting the results of exploratory factor
analysis should acknowledge the limitations of this ap-
proach and refer to the superior, but vastly more complex,
technique of con®rmatory factor analysis. Nevertheless,
the value of exploratory factor analysis in exploring data
and providing putative models for testing should be
maintained, in my view.
Conclusion
Hopefully these views will stimulate others to contribute
to the discussion about the presentation of results from
multivariate statistical analysis and to a more informative
way of using such results in papers published in the
Journal of Advanced Nursing. The excitement, individu-
Janforum
1362 Ó 1998 Blackwell Science Ltd, Journal of Advanced Nursing, 28(6), 1361±1363
ality and interest in factor analysis should be in the
interpretation and not in the presentation.
Roger Watson
BSc PhD RGN CBiol MIBiol
Professor of Nursing,
Dublin City University,
Dublin, Republic of Ireland
References
Andrew S. (1998) Self-ef®cacy as a predictor of academic perfor-
mance in science. Journal of Advanced Nursing 27, 596±603.
Bryman A. & Cramer D. (1997) Quantitative Data Analysis with
SPSS for Windows. Routledge, London.
Child D. (1990) The Essentials of Factor Analysis. Cassel, London.
Clifford C. (1996) Nurse teacher's clinical work: a survey report.
Journal of Advanced Nursing 23, 603±611.
Crow C.S., Olivet L.W., Burry-Stock J. & VanderMeer J.L. (1996)
Assessment of pain coping styles: development of an inventory.
Journal of Advanced Nursing 24, 890±898.
Dillon W.R. & Goldstein M. (1984) Multivariate Analysis. John
Wiley, New York.
Drummond J. & Rickwood D. (1997) Childbirth con®dence:
validating the childbirth self-ef®cacy inventory (CBSEI) in an
Australian sample. Journal of Advanced Nursing 26, 613±622.
Dunn S.V. & Burnett P. (1995) The development of a clinical
learning environment scale. Journal of Advanced Nursing 22,
1166±1173.
Gilleard C. & Reed R. (1998) Validating a measure of patient
satisfaction with community nursing. Journal of Advanced
Nursing 28, 84±100.
Hair J.F., Anderson R.E. & Tatham R.L. (1987) Multivariate Data
Analysis 2nd edition. Macmillan, New York.
Hicks C. (1996) A study of nurses' attitudes towards research: a
factor analytic approach. Journal of Advanced Nursing 23, 373±
379.
Kline P. (1994) An Easy Guide to Factor Analysis. Routledge,
London.
Lauri S. & SalanteraÈ S. (1995) Decision-making models of Finnish
nurses and public health nurses. Journal of Advanced Nursing
21, 520±527.
Manninen E. (1998) Changes in nursing students' perceptions of
nursing as they progress through their education. Journal of
Advanced Nursing 27, 390±398.
Persson L., Hallberg I.R. & Ohlsson O. (1997) Survivors of acute
leukaemia and highly malignant lymphoma Ð retrospective
views of daily life problems during treatment and when in
remission. Journal of Advanced Nursing 25, 68±76.
Rosenkoetter M.M. & Garris J.M. (1998) Psychological changes
following retirement. Journal of Advanced Nursing 27, 966±
976.
Severinsson E.L. (1998) Bridging the gap between theory and
practice: a supervision programme for nursing students. Journal
of Advanced Nursing 27, 1269±1277.
Sitzia J., Dikken C. & Hughes J. (1997) Psychometric evaluation of
a questionnaire to document side-effects of chemotherapy.
Journal of Advanced Nursing 25, 999±1007.
van der Zee J., Kramer K., Derksen A., Kerkstra A. &
Stevens F.C.J. (1994) Community nursing in Belgium,
Germany and the Netherlands. Journal of Advanced Nurs-
ing 20, 791±801.
Watson R. & Deary I.J. (1994) Measuring feeding dif®culty in
patients with dementia: multivariate analysis of feeding prob-
lems, nursing interventions and indicators of feeding dif®culty.
Journal of Advanced Nursing 20, 283±287.
Zhan L. & Shen C. (1994) The development of an instrument to
measure self-consistency. Journal of Advanced Nursing 20,
509±516.
Janforum
Ó 1998 Blackwell Science Ltd, Journal of Advanced Nursing, 28(6), 1361±1363 1363