Meta-Analyses of Experimental Data in the Animal Sciences

Embed Size (px)

Citation preview

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    1/16

    Revista Brasileira de Zootecnia 2007 Sociedade Brasileira de ZootecniaISSN impresso: 1516-3598ISSN on-line : 1806-9290www.sbz.org.br

    R. Bras. Zootec., v.36, suplemento especial , p.343-358, 2007

    Correspondncias devem ser enviadas para: [email protected] and research support were provided by state and federal funds appropriated to the Ohio

    Agricultural Research and Development Center, the Ohio State University.

    Meta-analyses of experimental data in the animal sciences

    Normand Roger St-Pierre

    1 - Department of Animal Sciences, The Ohio State University, 2029 Fyffe Rd., Columbus, OH-43210, USA. [email protected]

    ABSTRACT - In certain areas of animal research, such as nutrition, quantitative summarizations of literaturedata are periodically needed. In such instances, statistical methods dealing with the analysis of summary data (generallyfrom the literature) must be used. These methods are known as meta-analyses. The implementation of a meta-analysisis done in several phases. The first phase defines the study objectives and identifies the criteria for selecting prior

    publications to be used in the construction of the database. Publications must be scrupulously evaluated for theirquality before being entered into the database. During this phase, it is important to carefully encode each record withpertinent descriptive attributes (experiments, treatments, etc.) to serve as important reference points later on. Statistically,databases from literature data are inherently unbalanced, leading to considerable analytical and interpretation difficulties.Missing data are frequent, and data are not the outcomes of a classical experimental system. A graphical examinationof the data is useful in getting a global view of the system as well as to hypothesize specific relationships to beinvestigated. This phase is followed by a statistical study of the meta-system using the database previously assembled.The statistical model used must follow the data structure. Variance decomposition must account for inter-and intra-study sources; dependent and independent variables must be identified either as discrete (qualitative) or continuous(quantitative). Effects must be defined as either fixed or random. Often observations must be weighed to account fordifferences in the precision of the reported means. Once model parameters are estimated, extensive analyses of residualvariations must be performed. The roles of the different treatments and studies in the results obtained must be identified.Often, this requires returning to an earlier step in the process. Thus, meta-analyses have inherent heuristic qualitiesthat can guide in the design of future experiments as well as aggregating prior knowledge into a quantitative predictionsystem.

    Key Words: meta-analysis, mixed models, nutritionAbbreviation key: DMI = Dry matter intake, GLM = Generalized linear model, GLMM = Generalized linearmixed model, NDF = Neutral detergent fiber, SAS = Statistical Analysis System, SEM = Standard error of themean, VFA = Volatile fatty acids, VIF = Variance inflation factors.

    Introduction

    The research environment in the animal

    sciences, especially nutrition, has markedlychanged in recent years. In particular, there hasbeen a noticeable increase in the number of publications, each containing an increasingnumber of quantitative measurements. Meanwhile,treatments often have smaller effects on the systemsbeing studied than in the past. Additionally, controlledand non-controlled factors, such as the basal planeof nutrition, vary from study to study, thusrequiring at some point a quantitativesummarization, an integration of prior research.

    Fundamental research in the basic animalscience disciplines generates results that generally

    are at a much lower level of aggregation than thoseof applied research (organs, whole animals), thussupporting the necessity of integrative research.

    Research stakeholders, those who ultimatelyuse the research outcome, increasingly want morequantitative knowledge, particularly on animalresponse to diet. Forecasting and decision supportsoftwares require quantitative information asinputs. Lastly, research prioritization by publicfunding agencies may force abandoning activeresearch activities in certain areas. In suchinstances, meta-analyses can still supportdiscovery-type activities from aggregating resultsfrom published literature.

    The objectives of this paper are to describe theapplication of meta-analytic methods to animal

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    2/16

    2007 Sociedade Brasileira de Zootecnia

    344 St -Pierre, N.R.

    nutrition studies, including the development andvalidation of literature derived databases, and thequantitative statistical techniques used to extractthe quantitative information.

    Definitions and nature of problems

    Limits to Classical ApproachesResults from a single classical experiment

    cannot be the basis for a large inference spacebecause the conditions under which observationsare made in a single experiment are forcibly verynarrow,i.e., specific to the study in question. Suchstudies are ideal to demonstrate cause and effect, andto test specific hypothesis regarding mechanisms andmodes of action. In essence, a single experimentationmeasures the effects of one or a very few factorswhile maintaining all other factors as constant aspossible. Often, experiments are repeated by othersto verify the generality and repeatability of theobservations that were made, as well as to challengethe range of applicability of the observed results andconclusions. Hence, it is not uncommon that overtime, many studies are published even on a relativelynarrow subject. In this context, there is a need tosummarize the findings across all the publishedstudies. Meta-analytic methods are concerned withhow best to achieve this integration process.

    The classical approach to synthesizingscientific knowledge has been based on qualitativeliterature reviews. A limitation of this approach isthe obvious subjectivity involved in the process.The authors subjectively weigh outcomes fromdifferent studies. Criteria for the inclusion or non-inclusion of studies are poorly defined. Differentauthors can draw dramatically differentconclusions from the same initial set of published

    studies. Additionally, the limitation of the humanbrain to differentiate the effects of many factorsgrows with the number of publications involved.

    Definitions and objectives of meta-analysesMeta-analyses use objective, scientific

    methods based on statistics to summarize andquantify knowledge acquired through priorpublished research. Meta-analytic methods wereinitially developed in psychology, medicine andsocial sciences a few decades ago. In general,

    meta-analyses are conducted for one of thefollowing four objectives:

    For Global hypothesis testing , such astesting for the effect of a certain drug or of afeed additive using the outcomes of manypublications that had as an objective thetesting of such effect. This was by far the

    predominant objective of the first meta-analyses published (Mantel & Haenszel,1959; Glass, 1976). Early on, it was realizedthat many studies lacked statistical power forstatistical testing, so that the aggregation of results from many studies would lead tomuch greater power (hence lower type IIerror), more precise point estimation of themagnitude of effects, and narrowerconfidence intervals of the estimated effects.

    For Empi rica l modeli ng of biolog ical responses , such as the response of animalsto nutritional practices. Because the dataextracted from many publications cover amuch wider set of experimental conditionsthan those of each individual study,conclusions and models derived from thewhole set have a much greater likelihood of yielding relevant predictions to assistdecision-makers. There are numerousexamples of such application of meta-

    analytical methods in recent nutritionpublications, such as the quantification of thephysiological response of ruminants to typesof dietary starch (Offner & Sauvant, 2004),grain processing (Firkinset al ., 2001) andrumen defaunation (Eugeneet al. , 2004).Others have used meta-analyses to quantifyin situ starch degradation (Offneret al .,2003), and microbial N flow in ruminants(Oldick et al ., 1999; St-Pierre, 2003).

    For co ll ec ti ve su mm ar iz at ion s of measurements that only had a secondary or mi no r ro le in pr ior ex pe ri me nts .Generally, results are reported with theobjective of supporting the hypothesisrelated to the effect of one or a fewexperimental factors. For example, ruminalVFA concentrations are reported in studiesinvestigating the effects of dietary starch, orforage types. None of these studies have asan objective the prediction of ruminal VFAs.But the aggregation of measurements frommany studies can lead to a betterunderstanding of factors controlling VFA

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    3/16

    2007 Sociedade Brasileira de Zootecnia

    345Meta-analyses of experimental data in the animal sciences

    concentrations, or allow the establishmentof new research hypotheses. A meta-analysisof ruminal liquid flow rates allowed theidentification of an indirect criterion to salivaproduction and buffer recycling, which

    criterion is linked to ruminal conditions(Sauvant & Mertens, 2000). In mechanistic modeling, for parameter

    estimates and estimates of initial conditionsof state variables . Mechanistic modelsrequire parameterization, and meta-analysesoffer a mechanism of estimation that makesparameter estimation more precise and moreapplicable to a broader range of conditions.Meta-analyses can also be used for externalmodel validation (Sauvant & Martin, 2004),or for a critical comparison of alternatemechanistic models (Offner & Sauvant,2004).

    Types of Data and Factors in Meta-AnalysesAs in conventional statistical analyses,

    dependent variables in meta-analyses can be of various types such as binary [0, 1] (e.g ., forpregnancy), counts or percentages, categorical-ordinal (good, very good, excellent), andcontinuous, which is the most frequent type inmeta-analyses related to nutrition.

    Independent factors (or variables) have eithera fixed or random effects on the dependent variableof interest (McCulloch & Searle, 2001). In general,factors related to nutrition (grain types, DMI, etc)should be considered as fixed effects factors. Thestudy effect can either be considered as random orfixed. If a dataset comprised many individualstudies from multiple research centers, thestudyeffect should be considered random because each

    study is conceptually a random outcome from alarge population of studies to which inference isto be made (St-Pierre, 2001). This is especiallyimportant if the meta-analysis has for objectivethe empirical modeling of biological responses,or the collective summarizations of measurementsthat only had a secondary or minor role in priorexperiments, because it is likely that the researcherin those instances has a targeted range of inferencemuch larger than the limited conditionsrepresented by the specific studies. There areinstances, however, where each experiment canbe considered as an outcome each from a different

    population. In such instances, the levels ofstudyor trial are in essence arbitrarily chosen by theresearch community, and thestudy effect shouldthen be considered fixed. In such instance, therange of inference for the meta-analysis is limited

    to the domain of the specific experiments in thedataset. This is of little concern if the objective of the meta-analysis is that of global hypothesistesting, but it does severely limit the applicabilityof its results for other objectives.

    Difficulties Inherent to the DataThe meta-analytic database is best

    conceptualized with rows representing treatments,groups or lots, while the columns consist of themeasured variables (those for which least-squares

    means are reported) and characteristics (classlevels) of the treatments or trials. A primarycharacteristic of most meta-analytic databases isthe large frequency of missing data in the table.This reduces the possibility of using large multidimensional descriptive models, and generallyforces the adoption of models with a small subsetof independent variables. Additionally, theunderlying data design, referred as the meta-design, is not determined prior to the datacollection as in classic randomized experiments.Consequently, meta-analytic data are generallyseverally unbalanced and factor effects are far frombeing orthogonal (independent). This leads tounique statistical estimation problems similar tothose observed in observational studies, such asleverage points, near collinearity, and evencomplete factor disconnectedness, thus prohibitingthe testing of the effects that are completelyconfounded with others.

    An example of factor disconnectedness is

    shown in Table 1, where two factors each takingthree possible levels are investigated. In thisexample, factor A and B are disconnectedbecause one cannot join all bordering pairs of cells with both horizontal and vertical links.Consequently, the effect of the third level of Acannot be estimated separately from the effectof the third level of B. This would be diagnoseddifferently depending on the software used withdifferent combination of error messages, zerodegrees of freedom for some effects in theANOVA table, or a missing value for the statisticused for testing.

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    4/16

    2007 Sociedade Brasileira de Zootecnia

    346 St -Pierre, N.R.

    Table 1 - Example of factor disconnectedness. Factor A Levels

    Factor B Levels 1 2 3

    1 x x2 x x

    3 x

    In general, the variance between studies is largecompared to the variance within studies, henceunderlying the importance of including the studyeffect into the meta-analytical model. The studyeffect represents the combined effect of manyfactors that differ between studies, but whichfactors are not in the model because they eitherwere not measured, or have been excluded fromthe model, or for which the functional form in themodel is inadequately representing the true butunknown functional form (e.g ., the model assumesa linear relationship between the dependent andone independent continuous variable whereas thetrue relationship is nonlinear). In the absence of interactions between design variables (e.g .,studies) and the covariates (e.g. , all continuousindependent variables of interest), parameterestimates for the covariates are unbiased, but thestudy effect adds uncertainty to future predictions

    (St-Pierre, 2001). The presence of significantinteractions between studies and at least onecovariate is more problematic since this indicatesthat the effect of the covariate is dependent on thestudy, implying that the effect of a factor isdependent on the levels of unidentified factors.

    Steps in the meta-analytic process

    There are several inherent steps to meta-analyses, the important ones being summarizedin Figure 1. An important aspect of this type of

    analyses is the iteration process, which is underthe control of the analyst. This circular patternwhere prior steps are re-visited and refined is animportant aspect of meta-analyses and contributemuch to their heuristic characteristic.

    Objectives of the studyEstablishing a clear set of study objectives is a

    critical step that guides most ulterior decisionssuch as the database structure, data filtering,weighing of observations, and choice of statistical

    model. Objectives can cover a wide range of targets, ranging from preliminary analyses toidentify potential factors affecting a system, thusserving an important role to the formulation of research hypotheses in future experiments, to thequantification of the effect of a nutritional factorsuch as a specific feed additive.

    Data EntryResults from prior research found in the

    literature must be entered in a database.Thestructure and coding of the database must includenumerous variables identifying the experimentalobjectives. Hence numerous columns are addedto code the objectives of each study. This codingis necessary so as to avoid the improper

    Figure 1 - Schematic representation of the meta-analytic process.

    ObjectivesConceptual

    Basis

    Selection of experiments& Data entry

    Literature search, ExperimentsSurvey.

    Graphical Analyses

    Determination of meta-debion

    Selection of statistical model

    User Post-analysis

    evaluation

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    5/16

    2007 Sociedade Brasileira de Zootecnia

    347Meta-analyses of experimental data in the animal sciences

    aggregation of results across studies with verydifferent objectives. During this coding phase, theanalyst may chose to transform a continuousvariable to a discrete variable withn levels codedin a single column with levels of the discrete

    variable as entries, or inn columns with 0-1 entriesto be used as dummy variables in the meta-analyticmodel. Different criteria can guide the selectionof classes, such as equidistant classes, or classeswith equal frequencies or probability of occurrence. The important point is that the sum of these descriptive columns must entirelycharacterize the objectives of all studies used.

    Data FilteringThere are at least three steps necessary to

    effective data filtering. First, the analyst mustensure that the study under consideration iscoherent with the objectives of the meta-analysis.That is, the meta-analytic objectives dictate thatsome variables must have been measured andreported. If, for example, the meta-analysisobjective is to quantify the relationship betweendietary NDF concentration and DM intake, thenone must ensure that both NDF concentration andDM intake were measured and reported in allstudies. The second step consists of a thoroughand critical review of each publication underconsideration, focusing on the detection of errorsin the reporting of results. This underlines theimportance of having a highly trained professionalinvolved in this phase of the study. Only afterpublications have passed thisexpert quality filtershould their results be entered in the database.Verification of data entries is then another essentialcomponent to the process. In this third step, it isimportant to ensure that a selected publication does

    not appear to be an outlier with respect to thecharacteristics and relations under consideration.

    Preliminary Graphical AnalysesA thorough visual analysis of the data is an

    essential step to the meta-analytic process. Duringthis phase, the analyst can form a global viewregarding the coherence and heterogeneity of thedata, as well as to the nature and relative importanceof the inter-study and intra-study relationships of prospective variables taken two at a time.

    Systematic graphical analyses should lead tospecific hypotheses and initial selection of

    alternate statistical models. Graphics can also helpidentifying observations that appear unique oreven outliers. The general structure of relationships can also be identified, such as linearvs. nonlinear relationships as well as the presence

    of interactions. As an example, Figure 2 shows anintra-study curvilinear relationship between twovariables in the presence of a significant inter-study effect (i.e. , different intercepts betweenstudies). In this example, the inter-study effectassociated with the X variable indicates thepresence of a latent (hidden) variable that differedacross studies. Another example is shown in Figure3, which suggests the presence of a linear intra-study effect interacting with the study effect (i.e. ,different regression slopes across studies). Thismay be due to a narrow range of the X variable ineach experiment, or that again experimentsdiffered with respect to a latent, interactingvariable that was maintained constant or nearlyconstant within each experiment. Thisvisualization phase of the data should always betaken as a preliminary step to the statistical analysisand not as conclusive evidence. The reason is thatas the multi dimensions of the data are collapsedinto two or possibly three dimension graphics, the

    unbalance that clearly is an inherent characteristicof meta-analytic data can lead to false visualrelationships. This is because simple X-Y graphicsdo not correct the observations for the effects of all other variables that can affect Y.

    Figure 2 - Example of a curvilinear intra-studyeffect in the presence of a random study effect.

    Graphical analyses should also be done inregards to the joint coverage of predictor variables,identifying their possible ranges, plausible ranges,and joint distributions, all being closely related tothe inference range. Figure 4 shows the conceptsinvolved when the dependent variable is plotted

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    6/16

    2007 Sociedade Brasileira de Zootecnia

    348 St -Pierre, N.R.

    against one of the predictor variable. Similargraphics should be drawn to explore therelationships between predictor variables takentwo at the time. In such graphics, the presence of any linear trends indicates correlations betweenpredictor variables. Strong positive or negativecorrelations of predictor variables have twoundesirable effects. First, they may induce nearcollinearity, implying that the effect of onepredictor cannot be uniquely identified (i.e. , isnearly confounded with the effect of anotherpredictor). In such instance, the statistical model

    can include only one of the two predictors at atime. Second, the range of a predictor X1 given alevel of a second predictor X2 is considerably lessthan the unconditional range of predictor X1. Inthese instances, although the range of a predictorappears considerable in a univariate setting, itseffective range is actually very much reduced inthe multivariate space.

    Figure 4 - Illustration of the concepts of possible range, plausible range, and actual range in a meta-analysis with a single predictor variable.

    Figure 3 - Example of a linear intra-study effectin the presence of an interaction of study bypredictor variable.

    Figure 5 illustrates some of these conceptsusing an actual set of data on chewing activity incattle and the NDF content of the diet. Visually,one concludes that the intra-study relationshipbetween chewing activity and diet NDF is

    nonlinear, and that the intercept (i.e ., height of thecurves) differed across studies. This observationcan be formally tested using the statistical methodsto be outlined later. In this example, the possibleNDF range is 0 100%, whereas its plausiblerange is more likely between 20 and 60%.Interestingly, the figure illustrates thatexperimental measurements were frequently in the20-40% and 45-55% ranges, leaving a gap withvery few observations in the 40-45% range.

    Figure 5 - Effect of dietary neutral detergentfiber (NDF) content on chewing activity incattle.1 -Data are from 88 published experimentswhere the NDF content of the diet was theexperimental treatment.

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    7/16

    2007 Sociedade Brasileira de Zootecnia

    349Meta-analyses of experimental data in the animal sciences

    Study of the experimental meta-designThe meta-design is determined by the structure

    of the experiments for each of the predictorvariables. To characterize the meta-design,numerous steps must take place before and after

    the statistical analyses. The specific steps dependon the number of predictor variables in the model.

    One predictor variable. The experimental design used in each of the

    studies forming the database must beidentified and coded, and their relativefrequencies calculated. This information canbe valuable during the interpretation of theresults.

    Frequency plots (histograms) of the predictorvariable can identify areas of focus of priorresearch. For example, Figure 6 shows thefrequency distributions of NDF for the 517treatment groups for the meta-analyticdataset used to draw Figure 5. Both figuresindicate a substantial research effort towardsdiets containing 30-35% NDF, an area of dietary NDF density that borders the lowerlimit of recommended dietary NDF forlactating dairy cows (NRC, 2001). Because

    of the high frequency of observations in the30-35% NDF range, thea priori expectationis that the effect of dietary NDF will beestimated most precisely in this range.

    One should also consider the intra and inter-study variances for the predictor variable.Small intra-study variances reduce the ability

    of assessing the structural form of therelationship between the predictor and thedependent variable. Large intra-studyvariances but with only two levels of thepredictor variable in all or most of the studies

    hides completely any potential nonlinearrelationships. Figure 7 shows the intra-studystandard deviation (S) of NDF as a functionof the mean dietary NDF for experimentswith two treatments, and for those with morethan two treatments. The analyst candetermine a minimum threshold of S toexclude experiments with little intra-studyvariation in the predictor variable. Ininstances where the study effect is consideredrandom, this is not necessary and generallynot desirable. In such instances the inclusionof studies with little variation in the predictor

    Figure 6 - Frequency plot of neutral detergent fiber (NDF) content of diets across experimental treatments.

    Figure 7 - Intra-study standard deviation of dietary neutral detergent fiber (NDF) and meanNDF of experiments designed to study theeffects of NDF in cattle.

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    8/16

    2007 Sociedade Brasileira de Zootecnia

    350 St -Pierre, N.R.

    variable does little in determining therelationship between the predictor and thedependent variable, but adds observations anddegrees of freedom to estimate the variancecomponent associated with the study effect.

    When looking at a possible nonlinear intra-study relationship, it is intuitive to retain onlythose experiments with three or more levels of the predictor variables in the dataset. In thisinstance, intuition is incorrect. Experimentswith two levels of the predictor variables addinformation on the study component (intercept)and the linear parameter of the model. In Figure7, eliminating studies with less than three levelsof the predictor variable would eliminate 40%of all the observations.

    Another important aspect at this stage of theanalysis is to determine the significance of thestudy effect on the predictor variables. Asexplained previously in the context of Figures4 and 5, one must be very careful in regards tothe statistical model used to investigate theintra-study effect when there is an interactionbetween the predictor variable(s) and study. Inthese instances, the relationship between thepredictor variable and the dependent variable

    is dependent on the study, which itself represents the sums of a great many factorssuch as measurement errors, systematicdifferences in the methods of measurementsof the dependent variable across studies, and,more importantly, latent variables (hidden) notbalanced across experiments. In thoseinstances, the analyst must exert great cautionin the interpretation of the results, especiallyregarding the applicability of these results.

    It is generally useful to calculate the leverage

    of each observation (Tomassoneet al ., 1983).Traditionally, leverage values are calculatedafter the model is fitted to the data, but nothingprohibits the calculation of leverage values atan earlier stage because their calculationsdepend only on the design of the predictorvariable in the model. For example, in the caseof the simple linear regression withnobservations, the leverage point for theithobservation is calculated as:

    h i = 1/n + (X i X m) / (X i X m)2 [1]

    where:hi is the leverage value,Xi is the value of the Ith predictorvariable, andXm is the mean of all Xi.

    Equation [1] clearly indicates that the leverageof an observation,i.e. its weight in thedetermination of the slope, grows with its distancefrom the mean of the predictor variable. Theextension of the leverage point calculations tomore than one predictor variables is straightforward (St-Pierre & Glamocic, 2000).

    In a final step, the analyst must graphicallyinvestigate the functional form of therelationship between the dependent variableand the predictor variable.

    Two or more predictor variablesIn the case of two or more predictor variables,

    the analyst must examine graphically and thenstatistically the inter and intra-study relationshipsbetween the predictor variables. Leverage valuesshould be examined. With fixed models (all effectsin the models are fixed with the exception of theerror term), variance inflation factors (VIF) shouldbe calculated for each predictor variables (St-Pierre & Glamocic, 2000). An equivalent statistichas not been proposed for mixed models (e.g. ,when the study effect is random), but asymptotictheory would support the calculation of the VIFsfor the fixed effect factors in cases where the totalnumber of observations is large. The objective inthis phase is to assess the degree of inter-dependence between the predictor variables.Because predictor variables in meta-analyses arenever structured prior to their determination, they

    are always non-orthogonal and, hence, showvariable degrees of inter-dependency. Collinearitydeterminations (VIF) assess ones ability toseparate the effects of inter-dependent factorsbased on a given set of data. Collinearity is notmodel driven, but completely data driven.

    Weighing of observationsBecause meta-analytic data are extracted from

    the results of many experiments conducted undermany different statistical designs and number of experimental units, the observations (treatmentmeans) have a wide range of standard errors.

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    9/16

    2007 Sociedade Brasileira de Zootecnia

    351Meta-analyses of experimental data in the animal sciences

    Intuition and classical statistical theory wouldindicate that observations should be subjected tosome sort of weighing scheme. Systems used forweighing observations form two broad categories.

    Weighing based on classical statistical theoryUnder a general linear model whereobservations have heterogeneous but knownvariances, maximum likelihood parametersestimates are obtained by weighing eachobservation by the inverse of its variance. In thecontext of a meta-analysis where observations areleast-squares (or population marginal) means,observations should be weighed by the inverse of the squares of their standard errors, which are thestandard errors of each mean (SEM).Unfortunately, when such weights are used, theresulting measures of model errors (i.e ., standarderror, standard error of predictions, etc.) are nolonger expressed in the original scale of the data.To maintain the expressions of dispersion in theoriginal scale of the measurements, St-Pierre(2001) suggested dividing each weight by themean of all weights, and to use the resulting valuesas weighing factors in the analysis. Under thisprocedure, the average weight used is algebraicallyequal to 1.0, thus resulting in expressions of dispersion that are in the same scale as the originaldata.

    Weighing based on other criteriaOther weighing criteria have been suggested

    for the weighing of observations, such as the powerof an experiment to detect an effect of a sizedefineda priori , the duration of an experiment,etc. The weighing scheme can actually be basedon an expert assessment, partially subjective, of

    the overall quality (precision) of the data. Theopinion of more than one expert may be useful inthis context. From a Bayesian statistical paradigm,the use of subjective information for decision-making is perfectly coherent and acceptable, assubjective probabilities are often used to establishprior distributions in Bayesian decision theory (DeGroot, 1970). Traditional scientific objectiveness,however, may restrict the use of this weighingscheme in scientific publications.

    Predictably, the importance of weighing

    observations decreases with the number of observations used in the analysis, especially if the

    observations that would receive a small weighthave relatively small leverage values. An exampleof this is shown in Figure 8.

    Whether the analyst should weight theobservations based on the SEM for each individual

    treatments or the pooled SEM from the studies isopen for debate. There are many reasons why theSEM of each treatment within a study can bedifferent. First, the original observationsthemselves could have been homoscedastic(homogeneous variance) but the least-squaresmeans would have different SEM due to unequalfrequencies (e.g ., missing data). In such case, it isclear that the weight should be based on the SEMof each treatment. Second, the treatments may haveinduced heteroscedasticity, meaning that theoriginal observations did not have equal variancesacross sub-classes. In such instances, the originalauthors should have conducted a test to assess theusual homoscedasticity assumption in linearmodels. The problem is that a lack of significance(i.e ., P > 0.05) when testing the homogeneityassumption does not prove homoscedasticity, butonly that the null hypothesis (homogeneousvariance) cannot be rejected at aP < 0.05. In ameta-analytic setting, the analyst may deem the

    means with larger apparent variance to be lesscredible and reduce the weigh of theseobservations in the analysis. Unfortunately, mostpublications lack the information necessary to thisoption.

    Among the more subjective criteria availablefor weighing is the quality of the experimentaldesign used in the original study in regard to themeta-analytic objective.

    Experimental designs have various trade-offs

    Figure 8 - Estimated response of lactating dairycows to concentrate intake using weighed andunweighed observations.

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    10/16

    2007 Sociedade Brasileira de Zootecnia

    352 St -Pierre, N.R.

    due to their underlying assumptions. For example,the Latin square is often used in instances whereanimal units are relatively expensive, such as inmetabolic studies. The double orthogonal blockingused to construct Latin square designs can remove

    a lot of variation from the residual error.Thus very few animals can be used comparedto a completely randomized design for an equalpower of detecting treatment effect. The downside,however, is that the periods are generally relativelyshort to reduce the likelihood of a period bytreatment interaction (animals in differentphysiological status across time periods), thusreducing the magnitude of the treatment effectson certain traits, such as production and intake forexample. In those instances, the analyst shouldlegitimately weigh down observations fromexperiments whose designs limited the expressionof the treatment effects.

    Statistical modelsThe independent variables can be either

    discrete or continuous. With binary data (healthy/ sick, for example), generalized linear models(GLM) based on the logit or probit link functionsare generally recommended (Agresti, 2002).Because of advances in computational power, theGLM has been extended to include random effectsin what is called the generalized linear mixedmodel (GLMM). In its version 9, the SAS systemincludes a beta release of the GLIMMX procedureto fit these complex models.

    In nutrition, however, the large majority of thedependent variables subjected to meta-analyses arecontinuous, and their analyses are treated at lengthin the remainder of this paper.

    St-Pierre (2001) made a compelling argument

    to include the study effect in all meta-analyticmodels. Because of the severe imbalance in mostdatabases used for meta-analyses, the exclusionof the study effect in the model leads to biasedparameter estimates of the effects of other factorsunder investigation, and severe biases in varianceestimates. In general, the study effect should beconsidered random because it represents the sumof the effects of a great many factors, all withrelatively small effects on the dependent variable.Statistical theory indicates that these effects wouldbe close to Gaussian (normal), thus much betterestimated if treated as random effects.

    Practical recommendations regarding theselection of the type of effect for the studies arepresented in Table 2. In short, the choice dependson the size of the conceptual population, and thesample size (the number of studies in the meta-

    analysis).The ultimate (and correct) meta-analysis wouldbe one where all the primary (raw) data used toperform the analyses in each of the selectedpublications were available to the analyst. In suchinstance, a large segmented model that includesall the design effects of the original studies (e.g .,the columns and rows effects in Latin squares) plusthe effects to be investigated by the meta-analysiscould be fitted by least-squares or maximumlikelihood methods. Although computationallycomplex, such huge meta-analytic models shouldbe no more difficult to solve than the large modelsused by geneticists to estimate the breeding valuesof animals using very large national databases of production records. Raw data availability shouldnot be an issue in instances where meta-analysesare conducted with the purpose of summarizingresearch at a given research center. This, however,is very infrequent, and meta-analyses are almostalways conducted using observations that are

    themselves summaries of prior experiments (i.e .,treatment means). It seems evident that a meta-analysis conducted on summary statistics shouldlead to the same results as a meta-analysisconducted on the raw data, which itself would haveto include a study effect because the design effectsare necessarily nested within studies (e.g ., cow 1in the Latin square of study 1 is different than thecow 1 of study 2), which itself would be consideredrandom. Thus, analytical consistency dictates theinclusion of the study effect in the model, generally

    as a random effect. The study effect will beconsidered random in the remainder of this paper,with the understanding that under certainconditions explained previously it should beconsidered a fixed effect factor.

    Beside the study effect, meta-analytic modelsinclude one or more predictor variables that areeither discrete, or continuous. For clarity, we willinitially treat the case of each variable typeseparately, understanding that a model can easilyinclude a mixture of variables of both types, aswe describe later.

    Model with discrete predictor variable(s)

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    11/16

    2007 Sociedade Brasileira de Zootecnia

    353Meta-analyses of experimental data in the animal sciences

    A linear mixed model easily models thissituation as follows:

    Y ijk = + S i + j + S tij + e ijk [4]

    where:Yijk = the dependent variable, = overall mean,Si = the random effect of the ith study, assumed

    ~ iidN (0, 2S),t j = the fixed effect of the jth level of factor ,Stij = the random interaction between the ith

    study and the jth level of factor , assumed~ iidN (0, 2S ), and

    eijk = the residual errors, assumed ~iidN (0, 2

    e).

    eijk, S ij and Si are assumed to be independentrandom variables.

    For simplicity reasons, model [4] is writtenwithout weighing the observations. The weightswould appear as multiplicative factors of thediagonal elements of the error variance-covariancematrix (Draper & Smith, 1998). Model [4]corresponds to an incomplete, unbalanced

    randomized block design with interactions inclassic experimental research. The following SASstatements would solve this model:

    PROC MIXED DATA=Mydata CL COVTEST;CLASSES study tau;MODEL Y = tau; [5]RANDOM study study*tau;LSMEANS tau;

    RUN;

    Standard tests of significance on the effect of are easily conducted and least-squares meanscan be separated using an appropriate meanseparation procedure. Although it may be temptingto remove the study effect from the model ininstances where it is not significant (also calledpooling of effects), this practice can lead to biasedprobability estimations (i.e ., final tests on fixedeffects are conditional on tests for random effects)and is not recommended. This is because not beingable to reject the null hypothesis of no study effect(i.e. , variance due to study is not significantlydifferent from zero) is a very different proposition

    than proving that the effect of study is negligible.At the very least, the probability threshold forsignificance of study should be much larger thanthe traditionalP = 0.05. Ideally, the analyst shouldstate before the analysis is performed what size of

    estimated variance due to study should beconsidered negligible, such as 2S< 0.1 2e.

    Model with continuous predictor variable(s)A linear mixed model is used:

    Y ij = B 0 + S i + B 1 X ij + b i X ij + e ij [6]

    where:Yij = the dependent variable,B0 = overall (inter-study) intercept (a fixed

    effect equivalent to in [4]),Si = the random effect of the ith study,

    assumed ~iidN (0, 2S),B1 = the overall regression coefficient of Y

    on X (a fixed effect),Xij = value of the continuous predictor

    variable,bi = random effect of study on the regression

    coefficient of Y on X, assumed ~iidN(0, 2b), and

    eij = the residual errors, assumed ~

    iidN (0,

    2e).

    eij, bi and Si are assumed to be independentrandom variables.

    The following SAS statements solve thismodel:

    PROC MIXED DATA=Mydata CL COVTEST;CLASSES study;MODEL Y = X / SOLUTION; [7]RANDOM study study*X;

    RUN;

    Using a simple Monte Carlo simulation, St-Pierre (2001) demonstrated the application of thismodel to a synthetic dataset, showing the powerof this approach, and the interpretation of theestimated parameters.

    Model with both discre te and continuous predictor variable(s)Statistically, this model is a simple combination

    of [4] and [6] as follows:Y ijk = + S i + j + S ij + B 1 X ij + b i X ij + B j X ij

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    12/16

    2007 Sociedade Brasileira de Zootecnia

    354 St -Pierre, N.R.

    + e ijk [8]

    where:B j = the effect of the jth level of the discrete

    factor on the regression coefficient (a

    fixed effect).

    The following SAS statements would be usedto solve this model:

    PROC MIXED DATA=Mydata CL COVTEST;CLASSES study tau;MODEL Y = tau X tau*X; [9]RANDOM study study*tau study*X;LSMEANS tau;

    RUN;

    In theory, [8] is solvable, but the large numberof variance components and interaction terms thatmust be estimated, in combination with theimbalance in the data makes it often numericallyintractable. In such instances, at least one of thetwo random interactions must be removed fromthe model.

    In [4], [6], and [8], the analyst secretly wishesfor the interactions between study and the predictorvariables to be highly nonsignificant. Recall that

    the study effect represents an aggregation of theeffects of many uncontrollable and unknownfactors that differed between studies. A significantstudy x interaction in [4] implies that the effectof (the intercept) is dependent on the study, henceof factors that are unaccounted for. Similarly, asignificant interaction of study by X in [6] (the biterms) indicates that the slope of the linearrelationship of Y on X is dependent on the study,hence of unidentified factors. In such situation,the analysis produces a model that can explain very

    well the observations, but whose predictions of future outcomes are generally not precise becausethe actual realization of a future study effect isunknown.

    The maximum likelihood predictor of a futureobservation is computed by setting the study andthe interaction of study with the fixed effect factorsto their mean effect values of zero (McCulloch &Searle, 2001), but the standard error of thisprediction is very much amplified by theuncertainty regarding the realized effect of thefuture study.

    When the study effect and its interaction with

    fixed effect is correctly viewed as an aggregationof many factors not included in the model, but thatdiffered across studies, the desirability of includingas many fixed factors in the model as can beuniquely identified from the data becomes

    obvious. In essence, the fixed effects shouldultimately make the study effect and its interactionswith fixed effects predictors small and negligible.In such instances, the resulting model should havewide forecasting applicability. Imagine forinstance that much of the study effect on bodyweight gain of animal is in fact due to largedifference in the initial body weight across studies.In such instances, the inclusion of initial bodyweight as a covariate would remove much of thestudy effect, and the diet effects (as continuous ordiscrete variables) would be estimated withoutbiases, with a wide range of applicability (i.e ., afuture prediction would require a measurement of initial body weight as well as measurements of the other predictor variables).

    Whether one chooses [4] or [6] as a meta-model is somewhat arbitrary when the predictorvariable has an inherent scale (i.e ., is a measurednumber). The assumptions regarding the relationshipbetween Y and the predictor variable are, however,

    very different between the two models. In [4], themodel does not assume any functional form for therelationship. In [6], the model explicitly assumes alinear relationship between the dependent and thepredictor variables. Different methods can be usedto determine whether the relationship has a linearor nonlinear structure.

    The first method consists in classifyingobservations into five sub-classes based onthe quintiles for the predictor variable, andperforming the analysis according to [4] with

    five discrete levels of the predictor variable.Although the selection of five sub-classes issomewhat arbitrary, there are substantivereferences in the statistical literatureindicating that this number of levelsgenerally works well (Cochran, 1968; Rubin,1997). A visual inspection of the five least-squares means, or the partitioning of the fourdegrees of freedom associated with the fivelevels of the discrete variable into singularorthogonal polynomial contrasts can rapidlyidentify an adequate functional form to usefor modeling the Y-X relationship.

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    13/16

    2007 Sociedade Brasileira de Zootecnia

    355Meta-analyses of experimental data in the animal sciences

    The second method can be directly appliedto the data, or can be a second step thatfollows the identification of an adequatedegree for a polynomial function. Model [6]is augmented with the square (and possibly

    higher order terms) of the predictor variable.In the MIXED procedure of SAS, this canbe done simply by adding an X*X term tothe model statement. It is important tounderstand that in the context of a linear(mixed or not) model, the matrixrepresentation of the model and the solutionprocedure used are no different when X andX2 are in the model compared to a situationwhere two different continuous variables(say X and Z) are included in the model. Theproblem, however, is that X and X2 areimplicitly dependent; after all, there is analgebraic function relating the two. Thisdependence can result in a large correlationbetween the two variables, thus leading topossible problems of collinearity.

    A third method can be used in more complexsituations where the degree of thepolynomial exceeds two, or the form of therelationship is sigmoid, for example. The

    relationship can be modeled as successivelinear segments, an approach conceptuallyclose to the first method explained previously.Martin and Sauvant (2002) used this methodto study the variation in the shape of thelactation curves of cows subjected to variousconcentrate supplementation strategies, usingthe model of Grossman & Koops (1988) as itsfundamental basis. Using this approach,lactation curves were summarized by a vectorof 9 parameter estimates, which estimates

    could be compared across supplementationstrategies.

    In [8], the interest may be in the effect of thediscrete variable after adjusting for the effect of a continuous variable X as in a traditional covariateanalysis, or the interest may be inverse,i.e ., theinterest is in the effect of the continuous variableafter adjusting for the effect of the discretevariable. The meta-analyses of Firkins et al . (2001)provide examples of both situations. In oneinstance, the effect of grain processing (a discretevariable) on milk fat content was being

    investigated while correcting for the effect of drymatter intake (DMI, a continuous variable). In thisinstance, the interest was in determining the effectof the discrete variable. In another instance, theeffects of various dietary factors such as dietary

    NDF, DMI and proportion of forage in the diet(all continuous variables) on starch and NDFdigestibility, and microbial N synthesis wereinvestigated, while correcting for the effect of themethod of grain processing. In this case, theinterest was much more towards the effects of thecontinuous variables exempt from possible biasesdue to different grain processing methods acrossexperiments.

    Accounting for interfering factors

    Differences in experimental conditionsbetween studies can affect the treatment response.The nature of these conditions can be representedby quantitative or qualitative variables. In the firstinstance the variable and possibly its interactionwith other factors can be added to the model if there are sufficient degrees of freedom. Themagnitude of treatment response is sometimesdependent on the observed value in the controlgroup. For example, Figure 9 shows the milk fatresponse in lactating cows to dietary buffersupplementation as a function of the milk fat of the control group (Meschyet al ., 2004). Theresponse was small or non-existent when the milkfat of control cows was near 40 g/L, but increasedmarkedly when the control cows had low milk fat,possibly reflecting a higher likelihood of sub-clinical rumen acidosis in these instances.

    The presence of a study by predictor variableinteraction can indicate a nonlinear relationshipand the need for a higher degree of polynomial in

    the model. Applying model [6] with the additionof a square term for the Xij to the data shown inFigure 5 results in a relatively good quantificationof the relationship between chewing time anddietary NDF, as shown in Figure 10. In this typeof plot, it is important to adjust the observationsfor the study effect, or the regression may appearto poorly fit the data because of the many hiddendimensions represented by the studies (St-Pierre,2001).

    In instances where the interacting factor isdiscrete, the examination of the sub-classes least-squares means can clarify the nature of the

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    14/16

    2007 Sociedade Brasileira de Zootecnia

    356 St -Pierre, N.R.

    interaction. For example, the effect of a dietarytreatment may be dependent on the physiologicalstatus of the animals used in the study. Thisphysiological status can be coded using multipledummy variables, as explained previously.

    Figure 9 - Response in milk fat content (FAT)to dietary buffer supplementation.

    Figure 10 - Effect of dietary neutral detergentfiber (NDF) content on chewing activity incattle. 1Data are from published experimentswhere the NDF content of the diet was theexperimental treatment. Observations wereadjusted for the study effect before being plotted,as suggested by St-Pierre (2001).

    Post-optimization analyses

    a variance 2e. The normality assumption can betested using a standard Chi2 test, or a Shapiro-Wilkes test, both available in the UNIVARIATEprocedure of the SAS system. Residuals can alsobe expressed as Studentized residuals, with

    absolute values exceeding 3 being suspectedoutliers (Tomassoneet al ., 1983). In meta-analysis, the removal of a suspected outlierobservation should be done only with extremecaution. This is because the observations in ameta-analysis are the calculated outcomes (least-squares means) of models and experiments thatshould themselves be nearly free of the influenceof outliers. Thus, meta-analytic outliers aremuch more likely indicative of a faulty modelthan of a defective observation. In addition, theremoval of one treatment mean as an observationin a meta-analysis might be removing all thevariation in the predictor variable for theexperiment in question, thus making the valueof the experiment in a meta-analytic settingnearly worthless. In addition, the analyst shouldexamine for possible intra and inter-studyrelationships between the residuals and thepredictor variables.

    Structure of study variationWhen a model of the type described in [4],[6], or [8] is being fitted, it is possible toexamine each study on the basis of its ownresiduals. For example, Figure 11 shows thedistribution of the residual standard errors forthe different studies used in the meta-analysisof chewing time in cattle (Figures 5 and 10).Predictably, the distribution is asymmetrical andfollows the law of Raleigh for the standarderrors, while variances have a Chi2 distribution.

    Studies with unusual standard errors, say thosewith a Chi2 probability exceeding 0.999 couldbe candidates for exclusion from theanalysis.Alternatively, one could consider usingthe inverse of the estimated standard errors asweights to be attached to the observation beforere-iterating the meta-analysis.

    Other calculations such as leverage values,Cooks distances, and other statistics can be usedto determine the influence of each observation onthe parameter estimates (Tomassoneet al ., 1983).

    As when fitting conventional statistical models,numerous analyses should follow the fitting of ameta-analytic model. These analyses are used toassess the assumptions underlying the model, andto determine whether additional meta-analyticmodels should be investigated.

    Structure of Residual VariationIn [4], [6], and [8], the residuals (errors) are

    assumed independent, and identically distributedfrom a normal population with a mean of zero and

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    15/16

    2007 Sociedade Brasileira de Zootecnia

    357Meta-analyses of experimental data in the animal sciences

    Figure 11 - Frequency distribution of theresidual standard errors of the studies used inthe meta-analysis of chewing activity in cattle.

    Table 2 - Guidelines to establish whether an effect should be considered fixed orrandom in a meta-analytic model1.

    Population Experiment Effect of t in the modelCase 1 T is small2 t T Fixed effectCase 2 T is large t

  • 8/12/2019 Meta-Analyses of Experimental Data in the Animal Sciences

    16/16

    2007 Sociedade Brasileira de Zootecnia

    358 St -Pierre, N.R.

    Chapter of the American Statistical Association, 1999. 120p.

    National Research Council.Nutrient Requirements of DairyCattle . 7ed. Washington, D.C.: National Academic Press,2001. 381p.

    OFFNER, A.; BACH, A.; SAUVANT, D. Quantitative reviewof in situ starch degradation in the rumen.Animal Feed

    Science and Technology , v.106, p.81-93, 2003.OFFNER, A.; SAUVANT, D. Prediction ofin vivo starchdigestion in cattle from in situ data.Animal Feed Scienceand Technology , v.111, p.41-56, 2004.

    OLDICK, B.S.; FIRKINS, J.L.; ST-PIERRE, N.R. Estimationof microbial nitrogen flow to the duodenum of cattle basedon dry matter intake and diet composition.Journal of DairyScience , v.82, p.1497-1511, 1999.

    RUBIN, D.B. Estimating causal effects from large data sets usingpropensity scores.Annals of Internal Medicine , v.127,p.757-763, 1997.

    SAUVANT, D.; MARTIN O. Empirical modelling through meta-

    analysis vs mechanistic modelling. In: KEBREAB, E. (Ed.)Nutrient Digestion and Utilization in Farm Animals:Modelling approaches . London:CAB International, 2004.

    SAUVANT, D.; MERTENS, D. Relationship betweenfermentation and liquid outflow rate in the rumen.Reproduction Nutrition Development , v.40, p.206-207,2000.

    ST-PIERRE, N.R. Invited review: integrating quantitativefindings from multiple studies using mixed modelmethodology.Journal of Dairy Science , v.84, p.741-755,2001.

    ST-PIERRE, N.R. Reassessment of biases in predicted nitrogenflows to the duodenum by NRC 2001.Journal of DairyScience , v.86, p.344-350, 2003.

    ST-PIERRE, N.R.; GLAMOCIC, D. Estimating unit costs of nutrients from market prices of feedstuffs.Journal of DairyScience , v.83, p.1402-1411, 2000.

    TOMASSONE R.; LESQUOY, E.; MILLIER C. La rgression:nouveau regard sur une anciennemthode statistique. In:MASSON (Ed.)Actualits Scientifiques et Agronomiquesde lINRA . Paris:INRA, 1983.