Upload
lu
View
0
Download
0
Embed Size (px)
Citation preview
PETER GXRDENFORS
D Y N A M I C M O D E L S AS T O O L S F O R F O R E C A S T I N G
AND P L A N N I N G : A P R E S E N T A T I O N A N D SOME
M E T H O D O L O G I C A L A S P E C T S *
I. INTRODUCTION
Many of the mathematical models which have proved to be most useful as tools for forecasting and planning can be described as dynamic models. A
dynamic model consists essentially of a set of equations where the variables
included in the equations can be divided into input variables (or exogenous variables) and state variables (or endogenous variables). Dynamic models can
be used to combine, in a strictly mathematical manner, an analysis of causes and effects, with or without time-lags, and various feed-back relationships.
The abstract definition of a dynamic model has grown out from various applications within control theory. But also within other areas dynamic models have been used to model processes. Above all, many macroeconomic models are of this kind, but applications can be found also in other social and biological sciences.
As regards the classical applications within control theory, the equfitions
which describe the connections between the variables in the model can be obtained from some well-developed theory, viz. some branch of physics.
However, when the applications within the 'soft' social sciences are con- sidered, there is no such generally accepted theory from which the mathe-
matical formalism can be acquired. When a dynamic model is used to produce forecasts within an area of this kind, the forecasting process is not the methodologically most difficult part of the work, but it is rather the model
construction, i.e., the description of the dynamic model, that requires the greatest caution and the soundest judgments. There is always some portion
of fortuitousness in the choice of equations in, for instance, macroeconomic
models. A central problem, when judging a forecast obtained from such a model, is thus how much the uncertainty about the form of the equations influences the accuracy of the forecast.
It would be an exaggeration to maintain that the dynamic models which
Theory and Decision 14 (1982) 237-273. 0040-5833/82/0143-0237503.70. Copyright �9 1982 by D. ReideI Publishing Co., Dordrecht, Holland, and Boston, U.S.A.
238 PETER G.~RDENFORS
are used within the social sciences are applications of the mathematical control theory and that the models can be compared to steam engine governors, autopilots etc. The reason is that the methodological problems connected with forecasting models are quite different from the problems connected with physical applications of control theory. Much of the mathe- matical apparatus which belongs to the engineer's tool kit is simply irrelevant to applications within the 'soft' sciences. On the other hand, problems of equation choice and parameter estimation do not bother the engineer to the same extent as when constructing forecasting models.
This argument does not prevent that some of the conceptual framework which has developed within control theory may be useful also when non- classical applications are considered. For instance, in connection with the evaluation of forecasting models I will emphasize the methodological importance of the concepts of stability and sensitivity of a model. I believe that, when social science applications are considered, these aspects have been swept under the carpet in favour of statistical concepts of 'goodness of fit', etc.
One of the main reasons why dynamic models have become so frequently used as tools for forecasting and planning is that much of the heavy work in connection with model construction and model analysis can be done by a computer. The computer can be used when estimating equation parameters, when analysing the dynamic properties of the model by simulation, when forecasting and when planning with the aid of the model. A number of user directed programs (and even programming languages) have been specially constructed for simulating and analysing dynamic models.
When studying the use of dynamic models in forecasting and planning it is easy to be swamped with mathematical analysis and statistical hair-splitting. Here I will try to keep my head above the surface, but still look for what is going on in the depth. The paper will mainly be a surveyal presentation of dynamic models and their use in forecasting and planning. This will be complemented with some methodological remarks and recommendations in connection with model construction and model evaluation.
TOOLS FOR FORECASTING AND PLANNING 239
II. DYNAMIC MODELS
1. Definitions o f Deterministic Dynamic Models
The abstract definition of a dynamic model has developed from various ways of analysing and synthesising control systems. The methods used when con-
structing centrifugal governors for steam engines, electronic amplifiers, auto-
pilots, etc. have resulted in an abstract mathematical theory where the basic
'objects' of study are models of dynamic systems. In this section I will intro-
duce some basic concepts from control theory and give a definition of a
discrete deterministic dynamic modelJ Some examples of dynamic models will also be outlined.
A. Basic Concepts A fundamental postulate is that the system or process which is to be
modelled can be described by a set of state variables taking real numbers as values. These variables are functions of time and they will be denoted by
xl(t), x2(t) . . . . . The state of a system at a time to is identified with the set
of values xl(to), x2(to) , . . . . In this paper only models with finitely many
state variables will be discussed. The state of a model can then be described
by a vector 2(t) = (xl( t) , x2(t) . . . . . Xn(t)). The set of all possible states of a dynamic model is called the state space. The first state during the time
interval which is considered is called the initial state. Normally, a system can be controlled by external factors. It is assumed
that the control can be characterised by a set of real-valued input variables (or
exogenous variables). These variables are also functions of time and will be
denoted ul(t), u2(t) . . . . . Here, only finitely many input variables will be considered, which then can be described by a vector ti(t) = (ul(t), u 2 ( t ) , . . . , urn(t)). The set of all possible input vectors is called the input space.~
In a deterministic model the state of the model is uniquely determined by
the input vector together with earlier and present values of the state variables.
If the earlier and present values of the state variables have no influence on the
state of the model, but this is determined from the present values of the input variables only, then the model is said to be static. In a stochastic model the input vector and the earlier and present states of the model determine a prob- ability distribution over the state space.
The time variable is either characterised by a continuous line or by a
240 PETER G)~RDENFORS
discrete set o f points o f time. In the first case, the system is said to be con- tinuous, and in the second, discrete.
The central part in the definition of a dynamic model is to describe how
the state o f the model depends on the time variable and the input and state
variables. In this paper I shall concentrate on the case when the state variables
are differentiable functions of time. The change of the state of the model can
then be defined as a function of the previous state 2 ( t -- 1), the present state
~(t), the present input vector fi(t) and the time t. For continuous models the
changes can be characterised by first order differential equations and for
discrete models first order difference equations can be used.
B. Discrete dynamic models Next, a more systematic definition of a discrete dynamic model will be given.
DEFINITION 1. A discrete deterministic dynamic model consists of:
(i) a set T which is a set of consecutive natural numbers to, tl, t2, �9 �9 �9
representing the points of time;
(ii) a set X which is a continuous subset of R n, representing states of the
system. The number of state variables is n and an element in X, a state, is
written Yc = (xl, x2 . . . . . Xn); (iii) a set U which is a subset of R m, representing the input space. The
number of input variables is m and an element in U, an input vector is written
17 = ( u l . . . . ,Urn); (iv) a set Y which is a subset o f R k, representing observation vectors. The
number of observation variables is k and an element in Y, an observation
vector, is written)7 = (Yl . . . . , Yk);
(v) for each state variable xi(t), a function fi from X x U x T to R which
is continuously differentiable with respect to time such that
xi(t) = k ( ~ ( t - 1),~(t), • t) , t). []
A special case o f this definition is when xi(t ) can be described as a function
of 2 ( t -- 1) and t~(t) only. This case is of interest in many respects, and if the
equations of a model are written in this form one says that the model is given
a state-space description or that the model is given in reduced form. A definition of a continuous dynamic model would be quite similar
except that the points of time in (i) would be replaced by a time interval and
TOOLS FOR FORECASTING AND PLANNING 241
that the difference equations in (v) would be replaced with differential
equations. The rather abstract definition of a discrete deterministic dynamic model
which has been given here will now be illustrated by two examples.
Example 1. A very simple macroeconomic model will be outlined and it wiP be shown how it can be given a state-space description. The model contains four variables: aggregate consumption C(t), gross investment I(t), gross national product Y(t), and government spending G(t). These variables are related by the following three equations:
(1) C(t) = al + a2Y(t -- 1);
(2 ) I ( t ) = b~ + b 2 ( r ( t - 1) - - Y ( t - - 2));
(3) r ( t ) = C(t) + I(t) + G(t).
In these equations as, a2, bl and b2 are parameters. 2 A good starting point when reformulating these equations as a state-space
description is to sort up the variables. According to economic terminology C, I and Y are endogenous variables while G is an exogenous variable. Exogenous variables correspond to input variables in a dynamic model, and G is thus the only input variable in the desired dynamic model.
The Equations (2) and (3) do not quite fit the pattern of a state-space description since they do not only contain values of state variables at t - 1, but also at t and t - - 2. We can get away from this deviation by some mani- pulations of the Equations (1)-(3). If we put in (1) and (2) in (3), we obtain
(4) Y(t) = a 1 + a 2 Y ( t - - 1 ) + b l + b 2 Y ( t - - 1 ) - - b 2 Y ( t - - 2 ) + G ( t ) .
From (1) it follows that
(5) Y ( t - - 2 ) = 1 C ( t _ 1) al a2 a2"
If we now put (5) in (2) and (4) we get finally
(6) I(t) = bl + b2ala2 + b2Y( t - - 1) --~2 2 C(t - - 1); and
242 PETER G~ .RDENFORS
(7) Y(t) = a i + b l + b 2 a l +(a2+b2)Y( t - -1) - -b-~2C(t - -1) a2 a2
+ G(t).
The form of the Equations (1), (6) and (7) now accords with the require- ments of a state-space descdption. []
Example 2. The 'world models' which have been constructed by, among others, Forrester [11] and Meadows et al. [23] can easily be rewritten as
discrete dynamic models. These models do not explicitly contain any input
variables, but the value of a state variable at t is determined solely from the
state of the model at t - 1. On the other hand, some of the parameters that appear in the model equations can be interpreted as input variables and
with the aid of this it is possible to perform simple policy experiments. In Forrester's model the state of the model is described by five state vari-
ables, viz., population, natural resources, pollution, capital investment, and capital-investment-in-agriculture fraction. Besides these state variables the
model contains a number of auxiliary variables which are introduced by static
equations. The model constructed by Meadows et al. contains eleven state variables and also here a large number of auxiliary variables. All state variables are assumed to be measurable, but in many cases the high level of aggregation,,
makes this assumption seem very artificial. For example, Forrester measures natural resources in 'natural resource units' and pollution in 'pollution units'.
The equations of Forrester's model are well documented in [ 11 ]. In Limits to Growth, the equations which have been used when simulating the model are not explicitly stated, but these have been presented in a technical report ([24]) which has been published only afterwards. []
C. Classifying Dynamic Systems A model with input variables is said to be autonomous (or closed). The world models presented in Example 2 are autonomous models. A model is said to be time invariant, if the functions f l . . . . , fn and gl . . . . . gh do not depend on t. In this paper only time invariant models will be considered. A model is said to be linear, if the functions f l . . . . , fn and g~ . . . . . g~ are linear in the state variables, the control variables and t. The model in Example 1 is linear, while both Forrester's and Meadows's world models contain non-linear equations.
T O O L S F O R F O R E C A S T I N G A N D P L A N N I N G 243
2. Some Comments on the Definitions o f Dynamic Models
A. Classifying Variables
Although some variable may be classified as an input variable in a dynamic
model it is not always possible for human agents to have any influence to 'control' the values of the variable; or sometimes the 'control' may be very
restricted. For example, in a dynamic model which is intended to model
agricultural production, meteorological variables as temperature and precipitation may occur as input variables. For this reason, we can divide the input variables of a model used for forecasting and planning into policy
variables (or control variables), which are controlled by those who use the
model when forecasting and planning, and external variables, which are not
controlled. The borderline between these kinds of variables is not sharp since
some variables may be controlled only to a limited degree. In econometric models a distinction is made between endogenous and
exogenous variables. Exogenous variables correspond to input variables in the definition of a dynamic model. Endogenous variables correspond to state variables and other auxiliary variables which may be introduced by static
equations or identities.
B. Higher Order Equations
The equations used in (v) of Definition 1 are first order difference equations. It is now easy to give examples from models in physics or econometrics where higher order difference (or differential) equations are exploited. For
this reason, the requirement that when formulating dynamic models only first order equations may be used seems, a priori, as a serious restriction.
It is, however, easy to show that this requirement does not involve any real restriction. By introducing an appropriate number of new state variables it is possible to reduce a higher order equation to a set of first order equations. Suppose, for example, that we are given a third order difference equation of the form
x ( t ) = f ( x ( t -- 1), x ( t -- 2), x ( t -- 3), t~, t).
If we now introduce two variablesy(t) and z(t) which represent x ( t - 1) and x ( t -- 2) respectively, then the equation above can be replaced by the follow- ing set of first order equations:
x ( t ) = f ( x ( t - - 1 ) , y ( t - - 1 ) ,z ( t - - 1),K, t);
244 PETER G~RDENFORS
y ( t ) = x ( t - - 1);
z( t ) = y ( t - - 1).
If all equations are linear (and in many other cases), then it is, in this way,
possible to present all equations in accordance with (v) of Definition 1.
C. Stochastic Models
A generalisation of Definition 1 is to relax the assumption that the model e q~Ja~.ms are deterministic and introduce stochastic equations which will
then give us stochastic dynamic models. In continuous stochastic models the equations are so-called stochastic differential equations. It would take me too far to present here this kind of equations and their analysis. The interested reader is referred to Astr6m [4].
The equations used in discrete stochastic systems are called stochastic difference equations. These equations have the following form:
xi( t) = f i ( x ( t -- 1), x( t ) , u(t), t) + v i ( x ( t - 1), x( t ) , u(t) , t).
Here v i is supposed to be a random variable with fixed mean. Sometimes it is assumed to have certain additional statistical properties as being normally distributed, having some fixed standard deviation, being serially uncorrelated,
etc. Another form of stochastic equations is obtained if the input variables are
regarded as stochastic variables. For the solution of stochastic difference equations and further analysis of
stochastic models the reader is referred to Astr6m [4]. I will return to stochastic dynamic models in connection with simulation and when analysing forecasts made with the aid of dynamic models.
3. Analysing Dynamic Models
There are two main ways of studying the properties of a dynamic model. The first way is to search for analytic solutions to the set of equations given by the model. An analytic solution consists of l~mding, for each state variable, an equation which expresses the state variable as a function of the time and of the input variables (and with the initial value as a parameter), but which is independent of all state variables, including earlier values of the variable itself. From these equations the interesting properties of the model can then be derived.
TOOLS FOR FORECASTING AND PLANNING 245
The second way to study a dynamic model is simulation, By providing
initial values to the state variables and values to the input variables during
some subsequent time interval, one can use the equations in a deterministic model to compute the values of the state variables at all points of time in the
interval. By varying the initial values, the values for the input variables and
the equation parameters one can then obtain some information about the
dynamic properties of the model. I will now turn to a more detailed
discussion of these two methods of analysing dynamic models.
A. Analytic Solutions The goal, when looking for the analytic solution to a set of model equations,
is to 'solve' these equations so that the references to the state variables in the
equations are eliminated. In deterministic models the equations are ordinary
difference (or differential) equations. When solving these kinds of equations
we can rely on standard classical techniques.
Example 3. Let us assume that the following difference equation is one of
the equations in a discrete dynamic model:
x(t) = 2 + 0 .9"x ( t - - 1).
This equation is a purely autoregressive equation since the new value of the
variable depends only on an earlier value of the same variable and neither on
time nor on any input variable.
If we put X(to) = Xo, the solution to this equation is given by
x(t) = (Xo-- 20)" 0.9(t -to) + 20.
If xo > 20, x(t) will describe a curve which decreases exponentially towards
the limit value 2 0 ; i f x o < 20, the curve will increase towards 20. []
If the model is linear and if the input variables are ignored in the sense that
their values during the time interval are assumed to be constantly zero, then
there is a general method of solving a differential or difference equation which essentially involves finding the roots of the so-called characteristic equation of the model equation. With the aid of these roots the transient part of the solution of the model equations is obtained. If now each input variable varies according to some simple function u(t), it may be possible to find some particular solution to the model equation. The complete solution
246 PETER G~RDENFORS
is then obtained by adding the transient part to the particular solution (and
fixing the arbitrary constants so that the initial conditions are satisfied). For many simple equations particular solutions are known and in these cases the
analytical solutions are easy to obtain. However, as soon as the course of the input variables becomes more complex, it is in general extremely difficult,
not to say impossible, to fred particular solutions, a For non-linear models
there is no such general method but the equations in such a model have to be
attacked by more or less ad hoc methods. A deterministic difference or differential equation for a state variable x(t)
is solved if we know the values of x(t) for all t. The solution of a stochastic equation will not produce determinate values but probability distributions
over the values of the state variable. Thus a stochastic equation for a state
variable x(t) is solved if we know the probability distribution ofx( t ) for all t. The solution of a stochastic difference equation will be a Markov process. It would take us too far to present here any solution methods for stochastic
equations. The interested reader is referred to Astr6m [4].
B. Simulation It does not take many variables and complicated functions for the input vari- ables before it becomes overwhelmingly difficult to find the analytic solution
even for a linear deterministic model; and as soon as non-linearities appear in the equations or stochastic equations are considered, the search for an analytic solution is, from a practical point of view, hopeless. In these situations other means for analysing dynamic models must be exploited; and
here computer simulation techniques offer great possibilities. A simulation of a dynamic model during a time interval T consists of
supplying initial values for the state variables and values for the input vari- ables during T and then using the model equations to compute the values of
the state variables for a number of points of time during T. As regards discrete deterministic models, the equations of which are given
in reduced form, this process is computationally simple. From the initial values of the state variables, the values for the next period can be obtained immediately from the system equations. These new values can then be put back into the system equations and the values for the subsequent period can
be computed, and so on. If the equations cannot be put in reduced form, then the computational
TOOLS FOR FORECASTING AND PLANNING 247
process may be more complicated. In this case one may have to rely on
algorithms for solutions of equation systems where an approximative solution
is computed by an iterative procedure. Among the most common are algorithms based on the Gauss-Seidel, Newton-Raphson and Runge-Kutta procedures.
There is a danger in using approximate solutions. During a simulation, computed values for the state variables, which contain some approximation
errors, are' used to compute new values for the state variables at a later point
of time. Although the error at each step may be small, it may happen that
these errors concur and magnify each other so that the simulated value of a
state variable soon deviates considerably from the analytic value. For this reason some form of error analysis of the approximations used in a simu- lation is necessary.
When stochastic models are considered the advantages of using simulation
are even greater since analytic solutions are difficult to obtain and use for this
kind of models. The inclusion of random variables in the model equations involves no computational problems. There are, however, some methodo-
logical problems in connection with simulation of stochastic models as is pointed out by Howrey and Keleijan [17]. I will retum to this topic in connection with the evaluation of stochastic dynamic models.
With the aid of a computer the simulation methods can be programmed and the simulations performed rapidly and efficiently. A number of special programming languages have been developed with the particular purpose of simulating dynamic models. 4 As input to such a program one gives a model
description, essentially consisting of edited versions of the difference (or differential) equations, together with initial values for the state variables and values for the input variables during the simulation interval. The program then performs the simulation and produces as output either tables of values of the state variables during the simulation interval, or plotted curves
describing how the state variables vary with time.
Example 4. The curves which can be found in the books by Forrester and
Meadows et al. are the results of a number of computer simulations of their world models. Since these models are autonomous only initial values for the state variables are necessary as input when the system equations are described.
248 PETER GARDENFORS
The programming language which has been used for the simulation process is DYNAMO. s []
If one is able to work with an interactive simulation program, it is possible to
produce the state variable curves directly on a screen. One can then comfort-
ably see the effects on the dynamic behaviour of the model of changes in
initial values, parameters in the equations, and the values for the input vari-
ables. In this way one may, without knowing anything about the analytic
solution of the model, work out a 'Fingerspitzengefuhl' for the dynamic
properties of the studied model, even if it is quite complex.
C. Stability and Sensitivity A central problem within control theory is whether a system is stable or not.
In an unstable system a small disturbance may, for example, lead to self-
oscillations which reinforce and the system collapses. The disturbances may
originate from different sources. The initial conditions for a system may not
be set accurately, which will lead to a different course of the system than the
desired one. Another type of disturbance occurs if the equations in the model
used to represent the system are not exact because of error in the equation parameters or because some aspect of the modelled system is neglected or
simplified when the equation is formulated. A third type occurs if the control
variables cannot be perfectly controlled but are subject to some error. An
important question is now how these kinds of disturbances affect the course
of the state variables in general and the output variables in particular. For
some of the possible kinds of disturbances of a dynamic model criteria of
stability have been formulated.
The first definition of stability is connected with disturbances in the initial
values of a dynamic model. The basic idea is that small disturbances in the
initial values will only lead to small changes in the course of the state vari-
ables, if the model is stable.
DEFINITION 2. A dynamic model with given initial values at to for the state variables is said to be Lyapunov stable if and only if for every 8 > 0 there is
some e > 0 such that whenever an initial value is changed at to by an amount
the absolute value of which is smaller than 8, then the changes of the values of the state variables for all points of time t after to never exceed e. []
TOOLS FOR FORECASTING AND PLANNING 249
Example 5. I return to the simple macroeconometric model which was pre-
sented in Example 1. If the equation for Y(t) is solved as a function of lagged
values and the control variable G(t), one obtains the following equation:
Y ( t ) - (a2 + b~)" Y ( t -- l) + b2" Y ( t -- 2) = al + bl + G(t) .
Now suppose the initial values for this equation are Y(0) and Y(1). For simplicity, it will be assumed that G(t) is kept constant and equal to
- - (al + bl) during the relevant time interval. 6 Thus we wilt be studying the
behaviour of the homogenous equation
Y ( t ) -- (a2 + b2)" Y ( t -- 1) + b2" Y ( t -- 2) = 0
under disturbances of the initial conditions. The analytic solution to this
second order difference equation, under the given initial conditions, is (unless
as = 2x/b2 -- b2)
Y ( t ) = Ax" 3`I + A2" 3 t2
where Aa = (3`2 Y(0) -- Y(1))/(3`2 -- 3`1), A2 = (Y(1) -- 3 a Y(0))/(3,2 -- 3`a) and where 3`1 and 3 2 are the roots of the characteristic polynom
3`2_(a 2+b2 ) . 3 `+b2 = 0
of the homogenous equation.
(If a2 = 2x/b2 -- b2, then 3 1 = 3`2 and the solution is then Y ( t ) = Y(O)" 3`t,
which entails that Y(1) = 3,1" Y(0).) The dynamic behaviour of Y(t) depends on the values of 3 1 and 3`2 and
thereby on the values of a2 and b2. For some values of 3 1 and X2, Y(t) will converge towards 0 without oscillations (independently of the values of Y(0)
and Y(1)) and the model described by the equation above will be Lyapunov stable. For other values, Y ( t ) will converge towards 0 with damped oscil-
lations and also be Lyapunov stable. For still other values of 3`I and X2, Y(O will grow exponentially, with or without oscillations, and in these cases the model is not Lyapunov stable] []
It can be shown that if a linear time invariant dynamic model is Lyapunov stable for some initial values of the state variables, then it is stable for all initial values. The stability of a linear time invariant dynamic model is thus a property of the model, and not only a property referring to a particular set of initial conditions.
250 PETER G)kRDENFORS
Another concept of stability is introduced in the following definition:
DEFINITION 3. A dynamic model is input-output stable if and only if
limited values for the input variables result in limited values for the output variables. []
It can be shown that if a linear time invariant dynamic model where all out- put variables are state variables is Lyapunov stable, then it is also input-
output stable. Input-output stability covers disturbances in the input variables, since such
will only have a limited effect if the system is input-ouput stable. In a sense,
this criterion also covers disturbances in the equation specifications since these can be seen as changes in the values of the input parameters. As regards linear time invariant models the result mentioned above shows that if a model is stable under disturbances of initial values, then it is stable under other kinds of disturbances as well.
When investigating the dynamic properties of a linear economic model, the
method most frequently used is that of examining the multipliers associated with the model's exogenous variables. If the model is stable then a one-period increase in the value for one of the input variables would result in limited
changes in the following values of the state variables. These changes are called the dynamic multipliers of the input variable. The first-period change of a
state variable is called the impact multiplier of the variable.
The long-run multiplier of a state variable is the sum of all the dynamic multipliers over time. The multipliers of a model often give a good picture
of the dynamic properties of the model, but it should be noted that as regards
non-linear models all questions concerning the stability and sensitivity of such
a model are not settled by multiplier analysis. Another important part of the study of a dynamic model is the sensitivity
analysis where the goal is to find out how sensitive the model is to different kinds of changes. This kind of analysis is, of course, closely related to the
analysis of the stability of the model. The most important aspects are to analyse the model with respect to (i) changes in initial values for the state variables, (ii)changes in the parameters in the model equations, and 0ii) changes in the values of the input variables. The sensitivity of a factor in a model is a measure of how much a change in the factor is magnified when
TOOLS FOR FORECASTING AND PLANNING 251
comparing the behaviour of the state variables before and after the change. If
it is possible to compute an analytic solution to the equations of a dynamic
model, then it is possible to answer all questions concerning sensitivity
from this solution. If simulation is used, the sensitivity of a model can be
determined by running through various combinations of changes in initial
values, equation parameters, and input variables. I will return to this kind of
analysis in the sequel since the sensitivity of a dynamic model is an important
feature when evaluating the adequacy of using the model as a tool for fore-
casting.
III . FORECASTING AND PLANNING WITH DYNAMIC MODELS
1. Model Construction
When one wishes to use a dynamic model as a tool for forecasting and plan-
ning the most difficult part, from a methodological point of view, is not how
to do the forecasting or planning with the model, but how to construct the
model. In this section I will dis.cuss some methodological issues connected
with the choice of variables in the model, the choice of the form of the model equations, and the estimation of the equation parameters.
A. Choice of Variables The 'normal' procedure when constructing a dynamic model as a model of
a real process consists roughly of the following main steps: (i) choose the set
of variables to be included in the model and divide them into input and state
variables; (ii) formulate the system equations (at least with respect to 'form'
and which variables should be included in each equation); and (iii) use avail-
able data for the state and input variables to estimate the parameters in the equations. I do not intend to say that one of these steps must be finished
off before the next may be commenced. In practice one often starts with a
rough work on step (i) and formulates some preliminary equation in step (ii)
to obtain some first estimation results and then returns to step (i) or (ii) to search for less approximate solutions.
It is hardly possible to formulate any general rules for which variables should be included in a dynamic model as state variables or as input variables.
For this problem any form of theory of the process one wishes to construct
a model for may be useful. Theoretical considerations may result in anything
252 PETER G ~ R D E N F O R S
from well corroborated mathematical formulas to loose speculations about causal connections.
A variable, for which one intends to obtain a forecast, is never an isolated
phenomenon. It is part of some causal structure and is influenced by other
variables and the variable also influences other factors itself. It may be a
part of causal cycles, i.e., feed-back loops. A first delimitation when con-
structing a model is to find some causal network, which includes the variables
to be forecasted, but which is relatively isolated in the sense that variables
'within' the network, i.e., those which are potential state variables, do not
influence the variables 'outside' the network, i.e., potential input variables,
in any essential way. This causal network can be chosen more or less
inclusive. To make the demarcation one has to weigh the completeness and
richness of detail against perspicuity and manageability.
Another problem when selecting the set of variables for a dynamic model
is the level of aggregation. For the construction of a macroeconometric model
we may have available sufficient data for a large number of variables as individual consumers, individual entrepreneurs etc. It might be possible to
write out various kinds of relationships between these magnitudes as model equations and thus obtain an extremely detailed model. Such a dynamic
model, however, would have a rather limited practical interest. In order to
explain and predict the behaviour of macroeconometric variables, it is
necessary to deliberately disregard a considerable amount of details. Here,
again, completeness and manageability must be weighed against each other.
The advantage of a model which contains a large number of variables is
that it may be possible to reach a closer representation of the 'real' process
than with a model containing fewer variables. The main disadvantage is that
things get complicated and much more work is needed; more model equations
have to be specified, more equation parameters have to be estimated, and it
may be difficult to collect sufficient data for all variables. Another, more fundamental, disadvantage is that if the theory for the process to be modelled is 'weak', i.e., not articulated in an exact way or not so well corroborated,
then it is difficult to motivate the choice of model equations; and the more equations one has to construct, the more uncertain will thermodel as a whole be.
Even if some variable has been chosen to be included in the model, one has to decide how the variable is to be represented in the model equations.
TOOLS FOR FORECASTING AND PLANNING 253
Sometimes the derivative of a variable may seem more appropriate than the
variable itself or some functional transformation of the values of a variable
may yield a representation which is easier to include in the equations. Even if
the resulting models are, in a sense, equivalent, the choice of the represent-
ation of a variable may have devastating consequences for the outcomes of
parameter estimation procedures.
When choosing the control variables to be included in the model one is
faced with the same problems as when choosing state variables. Since a
dynamic model is often used as a tool for planning, the choice of control
variables may even have political consequences. A dynamic model can be
used to give recommendations of which values of the control variables to
aim at, if a certain goal is to be attained. Since different control variables
may be governed by different political institutions, the mere choice of
control variables may reflect or influence a political power structure.
When choosing input variables which are not policy variables, i.e., what
was earlier called external variables, the accessibility of actual and future
data for these variables is an important aspect. The theory of the process to
be modelled may prescribe an input variable which, because of measurement
problems or other data collecting problems, may be useless when the model is
to be used for forecasting or planning. Actual and past data of input variables
are necessary for the estimation of equation parameters and forecasted" future
data are necesary when using the model to forecast future values of the state
variables. If the forecast for an external variable is very unreliable, then this
usually affects the reliability of the forecast for the state variables of the model; and it may then be preferable to replace the external variable with
another for which there exists a more reliable forecast.
B. Choice of Equations Within areas where precisely formulated and well corroborated theories are
available, as e.g. the classical theories of physics, the choice of model
equations does not normally present any fundamental problems, since the
equations are, on the whole, provided by the theory. On the other hand,
within the 'soft ' sciences, where the theoretical foundations are more shaky and less informative, the choice of model equations often is the most difficult part when constructing dynamic models.
Even if an incomplete theory does not provide us with the exact model
254 PETER G ~ R D E N F O R S
equation for a state variable, it may nevertheless give some guidance as to which state and input variables are causally connected with the state variable
under consideration and thus determine, at least, which variables ought to be
essentially included in the equation. The theory may also indicate the 'form'
of the equation, e.g., whether it should be linear, exponential or logarithmic in the variables. But frequently the information provided by the theory is
scanty and one will then be forced to experiment with different types of equations.
Within economic theory it is often assumed that all equations are linear in the variables. The main motivations for this assumption seem to be
practical: Having linear equations allows you to, more or less mechanically,
solve the system of equations analytically and thereby you will be able to
study the dynamic properties of the model in a careful way; and, further-
more, the statistical techniques available for estimating linear equations are
more thoroughly investigated, better developed and have more appealing
properties than the corresponding methods for non-linear equations. It is
also argued that the restriction to linear equations is not a serious short-
coming since linear equations (the linear terms of a Taylor expansion) are
good approximations of the 'real' equations as long as the variations in the
variables are relatively small. This argument is, naturally, of limited value.
For many processes stochastic models seem more appropriate than deter-
ministic ones. When formulating the equations of a stochastic model the
choice of the error term involves another source of problems. In most appli-
cations the error term is simply added to the rest of the model equation. This
error term is then furnished with various statistical properties (normally dis-
tributed, zero mean, no serial correlation with earlier errors, homoscedastic,
etc.) depending on what the theory of the process says about the error and what data show.
An important problem is how the error term in an equation is to be inter-
preted. One way is that the model equation for a state variable is not perfectly specified but, due to some idealisations or some unknown features
of the modelled process, there will be a small residual difference between the true value of the state variable at a certain point of time and the value obtained from the equation at the same point of time. The residuals can be seen as the unexplained part of the value of the variable.
Another way of interpreting the error term, which can be combined with
TOOLS F O R F O R E C A S T I N G AND P L A N N I N G 255
the first, is that some or all of the variables used in the equations are
measured with error. For this kind of error, assumptions o f normal distribu-
tion, zero mean and no serial correlation between different error terms seem
quite natural. I f the stochastic parts of the model equations consist only o f
measurement errors, then the underlying process is essentially deterministic
and the stochastic dynamic model may, i f correctly specified, be replaced by
a deterministic one. If, on the other hand, the error terms are principally
residuals, then there is no reason, a priori, why they should have the statistical
properties ment ioned above.
In many cases the theory of the process to be modelled does not give
much guidance as to which form of the model equations is the appropriate
one. In these cases one may acquire some information by studying the avail-
able data for the state variables. The curves may suggest a linear or an
exponential trend or there may be some cyclical movements; and this
information can be helpful when formulating the model equations.
I t is dangerous, however, to let the apparent form of the curves for the
state variables dictate the choice of the model equations. This danger was
pointed out by Haavelmo [15] so long ago that much of his argument needs
be repeated:
The degree of conformity between. . , theoretical solution and the corresponding observed time series is used as a test of the validity of the model. In particular, since most economic time series show cyclical movements, one is led to consider only mathematical models the solutions of which are cycles corresponding approximately to those appearing in the data. This means that one restricts the class of admissible hypotheses by inspecting the apparent form of the observed time series.
This condit ion. . , may not even be a necessary condition, and its application may result in a dangerous and misleading discrimination between theories. (p. 312)
The reason why inspection of data series may lead to prejudiced conclusions
about the form of the model equation is, according to Haavelmo, that
the whole question is connected with the type of errors we have to"lntroduce as a bridge between pure theory and actual observations. (p. 312)
Haavelmo then discusses two kinds of errors corresponding to the two inter-
pretat ions of stochastic equations, i.e., measurement error and unexplained
residuals, mentioned above.
256 PETER GXRDENFORS
The conclusion to be drawn from this discussion is that the form of a
model equation should not be chosen only because the data series shows a
certain pattern - there should be some theoretical reason, even if it is weak, for making the choice.
This ends my discussion of the choice of model equations. The results of
the search for a set of appropriate equations may not be univocal but one is
often left with a number of equation forms which all are reasonable candi- dates for being included in the final model. The final choice among these
equations is postponed until their ability to 'explain' data and their dynamic properties are investigated.
C. Estimation o f Equation Parameters
Apart from state and input variables the model equations contain a number
of parameters. Even if the mathematical form of the equation can be fully
inferred from the theory of the process to be modelled, the actual value of
the parameters are not derivable from theory alone. In very particular cases
only it is possible to obtain the 'correct' values from data in combination
with theory - in general one must be content with estimates.
The theory of parameter estimation in multi-equation models is complex.
There are a great number of statistical procedures available for the estimation
of the parameters in a system of equations. Some of the more well-known
methods are ordinary least squares, two-stage least squares, and full-
information maximum likelihood. These methods are constructed to be
applicable under certain assumptions concerning the error terms in the
equations. It would, however, take us too far to present any of these methods here.
There are no general rules for determining the choice of estimation pro-
cedure for a particular system of equations. Partly the choice depends on
what is known about the statistical properties of the errors in the equations, partly it depends on the purpose for which the estimated system of equations is to be used. For example, the choice of estimation technique can have sub-
stantial effects on the dynamic properties of the estimated model.
Example 6. Pindyck and Rubinfeld [29], pp. 329-330, present a simple
macroeconomic dynamic model. The equation for the consumption function C(t) is estimated as a linear function of C(t -- 1), G(t -- 1), Y ( t -- 2) and
TOOLS FOR F O R E C A S T I N G AND P L A N N I N G 2 5 7
R ( t -- 4), where G is government spending, Y is GNP, and R is a short-term
interest rate. Using quarterly data during the period 1955-1970, the
equation was estimated by two-stage least squares combined with an auto-
regressive transformation to correct for serial correlations. The result was
C(t) = 2.097 + 0.966C(t -- 1) + O.187G(t -- 1) --
-- 0.009Y(t -- 2) -- 0.609R(t -- 4).
I f the equation is estimated using ordinary least squares instead of the pro-
cedure above, then the result is:
C(t) = 5 .842+ 0 . 9 2 6 C ( t - 1)+ 0 . 4 3 7 G ( t - 1 ) -
- O . l O O r ( t - 2) - 1.687R(t - 4).
The dominating part of both of these equations is the autoregressive term
C(t -- 1). The behaviour of the variable C is very sensitive to changes in the
parameter for this term. So even if there seems to be little difference between
the two values 0.966 and 0.926, the dynamic properties of the two equations
are quite different. Which equation is 'best ' cannot be determined by
statistical techniques only, but the dynamic behaviour of the model as a
whole must be considered when choosing the final estimates of the
parameters. []
2. Model Evaluation
When a dynamic model has been constructed its performance must be evalu-
ated. This means that the values for the state variables produced by the sys-
tem is to be compared to the 'real' values, past, present, or future, of the state
variables. The better the values produced by the model fit the real values, the
better is the model. But how is one to estimate the degree of fit between
model output and actual values? Frijda [13] notes that
There is hardly any methodology existing here. As much ingenuity as has been invested in the making of computer programs, as little has been spent on the assessment of their value. Next to high precision there always seem to be spots of rough approximations which undercut this very precision. We are left largely to our subjective impressions of what we consider good or bad correspondence. (p. 65)
There exists a large number of criteria which can be used to evaluate dynamic
258 PETER G.g, RDENFORS
models. Here, I will not attempt at a systematic presentation but confine my-
self to a discussion of some of the most important criteria.
A. Goodness o f Fit The quantitative measure of how closely individual state variables track their
corresponding data series that is most often used is called the root-mean- square (RMS) error. In order to define this measure for a discrete state vari-
able x( t ) , we distinguish between xs(t), the value o fx ( t ) at t predicted by the
given model under the given initial conditions, and Xa(t), the actual value of
x( t ) at t.
DEFINITION 4. Let xs(to), Xs(tl ) . . . . . xs(tn) be a series of values obtained from a discrete dynamic model and let xa(to ), Xa(tl ), . . . , xa(tn) be the
corresponding data series. Then the RMS error for this time interval is
x/Zi(xs(t i) -- xa(ti))2/(n + 1). []
The square root of the mean square error is taken in order to obtain a
measure which has the same dimension as the predicted and actual values themselves. If a dimensionless measure is desired, one can use either the RMS
percent error which is defined as
x/Zi(xs(t i) -- xa(tt)/xa(ti))2/(n + 1)
or Theil's inequality coefficient as it is defined in [31], which is just a way of
normalising the RMS error.
All these measures can be decomposed into three parts. The first part, the
error in central tendency, is zero if and only if the average predicted change
from the mean of the predicted values coincides with the average actual
change. The second part, the error due to unequal variation, is zero if and
only if the standard deviations of predicted and actual changes are equal. The third term, the error due to incomplete covariation, is zero if and only if the
covariance of predicted and actual values takes its maximum value. This kind of decomposition is often useful since each term refers to a particular kind of
prediction error, s
B. The R 2 Measure Another measure which is often used as an estimate of the goodness of fit is
TOOLS FOR F O R E C A S T I N G AND P L A N N I N G 259
the R 2 measure. It measures the proportion of the variation of a discrete state
variable x( t ) which is 'explained' by the system equation. R 2 is often
informally used to compare the validity of alternative specifications of system
equations for a state variable.
To define this measure we begin by defining the total variation of x(t) about its mean (or total sum of squares) as
TSS = ~i(xa(ti) -- ~a) 2.
Here, Xa(ti) are the actual values, as before, and xa is the mean of these values
during the time interval t o , . . . , tn. The residual variation o f x ( t ) ( o r error sum o f squares) is
ESS = "ti(xa(ti)-- xs(ti)) 2.
As before, xs(ti) denotes the value of x(t) at t i produced by the model.
Finally, the explained variation o f x ( t ) (or regression sum o f squares) is
RSS = ~i(xs(ti)-- ~a) 2.
Now, if the mean error of the predicted values is zero and the errors are
perfectly uncorrelated with the variables in the equation, it can be shown that
TSS = ESS + RSS.
DEFINITION 5. The R 2 measure for a given discrete equation during the
time interval to . . . . . t,, isR 2 = RSS/TSS. []
When the above mentioned equality holds, this is equivalent to R 2 = 1 -- ESS/
TSS. Thus R 2 is the proportion of the total variation in x( t ) explained by the
model equation. The R 2 ranges in value between 0 and 1, and is higher the
better the xs(t) values correspond with the Xa(t ) values. An R ~ value of 1 can
occur only in case o f a perfect fit.
A high R 2 value cannot be the only criterion to determine whether an
equation describes some data 'correctly', since if further variables are added
to the model equation, the R 2 value is never lowered and it is likely to be
raised (the addition of a new explanatory variable does not alter TSS, but is likely to increase RSS). There must be additional reasons for including a vari- able in an equation - theoretical reasons or, if the theory is 'weak', tests for statistical significance.
260 PETER G~RDENFORS
In Gardenfors [14] the R 2 measure is further criticised as a measure of the explanatory value of a model equation. In that paper it is argued that the explanatory value of a forecasting equation is to be measured by the improve- ment of fit to the actual values obtained from the forecasting model in com- parison to the values obtained from a 'naive' model. Here, 'naive' means that
the forecast of a variable is determined without the aid of any other explanatory variables, as e.g. in a time series model.
C. The t Statistic
A mistake that is difficult to discover when evaluating a system of dynamic
equations is that some variable is missing in an equation. This problem occurs
mainly when the theory of the process to be modelled does not give much guidance as to which variables have a direct influence on a given variable. For
this reason it is easy to include too many variables in the first attempts at
formulating the model equations and then later exclude some variables when
the statistical analysis shows that they have no explanatory value. When the model is linear, one of the main tools in this statistical anaysis is the so called t statistic which, for each variable, determines whether the estimated parameter coefficient for the variable is significantly different from 0, i.e.,
that the null hypothesis that the coefficient equals 0 can be rejected with a
certain degree of confidence. It is often demanded, for each estimated coefficient in a linear equation, that the t statistic should show that the null
hypothesis can be rejected at the 5 percent level of significance. However, if this demand is followed too strictly, then it may very well
happen that some important variables will be excluded from the model
equations. If, for example, an input variable (say, oil price), which has a strong influence on the variable one wishes to explain; is constant or almost
constant during the time interval which is used for the parameter estimation, then it is likely that the t statistic for the coefficient for this input variable is not large enough to warrant the rejection of the null hypothesis. If there are theoretical reasons for including a certain variable in an equation, my recom- mendation is that it should be included even if the t statistic for the corre- sponding parameter is low. It should be noted, however, that, in these cases, the estimated value of the parameter may not be reliable.
Example 7. The following quotation from Pindyck and Rubinfeld [29], p. 379, illustrates the argument above:
TOOLS F O R F O R E C A S T I N G AND P L A N N I N G 261
Despite the fact that the estimated coefficient of the long-term interest rate is not significant at the 5 percent level, we do not drop the variable from the equation. Because we believe a priori that the interest rate variable provides the link between monetary policy and investment, it is necessary that we include an interest rate as one of the deter- minants of interest. []
The conclusion to be drawn is that one should not rely blindly on the
statistical tools when choosing and specifying model equations. The
parameter values, acquired by some estimation procedure, must be inter-
preted and their reasonableness checked against whatever theory is available.
D. Dynamic Properties When studying a ' real ' process one acquires some intuitive knowledge about
the dynamic propert ies of the process. One may know that it is stable under
quite general condit ions and one may have a fairly detailed knowledge of how
sensitive the process is to different changes in the input variables. When trying
to represent such a process by a dynamic model it is important that the
model has roughly the same dynamic properties.
By using the definitions of stability presented earlier, it may be possible to
express some of the properties of the model in a precise manner. But even if
the available knowledge about the stability and sensitivity properties of a pro-
cess cannot be expressed formally, it is nevertheless an important comple-
ment to the data series for the state and control variables when the tentative
dynamic model is to be evaluated. Since a system of equations, by including
more and more variables, can be made to fit actual data series for the state
variables arbitrarily well, the goodness of fit cannot be the only critierion of
the validity o f the model, but the dynamic properties also play an important
part in the evaluation. Kmenta [21], p. 593, notes that
The stability condition is clearly important from the economic point of view. It should also be realised that the existence of stability (or, at worst, of a regular oscillatory pattern) is, in fact, assumed in the process of estimation. The assumption that the pre- determined variables of the system have finite variances as t-~ ~ applies also to the lagged endogenous variables, and this assumption would be violated ff the endogenous variables were to grow or decline without limit. If this assumption were not made, there would be difficulties in proving the asymptotic properties of estimators.
Example 8. Pindyck [28] aims at formulating an equation for inventory
investment, which is denoted by I(t). As explanatory variables he chooses
262 PETER G~ ,RDE NFORS
Y(t), disposable income, C(t), consumption, together with lagged values of
I( t) , Y(t) and C(t). The model is discrete and the equation is assumed to be
linear in these variables. The following two equation forms are then selected
for estimation:
( i ) I ( t ) = a l Y ( t ) + b l ( Y ( t ) - - Y ( t - - 2)) + c l ( C ( t ) - - C ( t - - 2)) +
+ d ~ I ( t - - i) + k~ ;
(i i) I ( t ) = a 2 ( Y ( t ) - - Y ( t - - 1)) + b 2 ( C ( t ) - - C ( t - - 1)) +
+ e 2 I ( t - - 1) + k2.
The results of the estimations are:
(i) I ( t ) = 0.0113Y(t) + 0.4647(Y(t) - Y ( t - - 2)) - 0.6002(C(t) --
- - C ( t - 2)) + 0.4219I( t -- 1 ) - 2.4615;
(ii) I ( t ) = 0.8407(Y(t) -- Y ( t - 1)) -- 1.0087(C(t) -- C ( t - - 1)) +
+ 0.7362I( t - 1) + 0.8726.
The R 2 value for (i) is 0.740 and the standard error is 2.228 while the R 2
value for (ii) is 0.905 and the standard error 1.349. Pindyck has the following
comment on these statistical results:
[Equation (ii)] gives better statistical fit along the regression line, which is not surprising since a large proportion of the high frequency change in Y can be accounted for by fluctuations in inventory investment. But ff [Equation (ii)] is used in simulation of the model as a whole, it results in extreme instability. This is not surprising - a sudden large increase results in a large I, which in turn results in a larger increase in Y and thus an even larger I, etc. This is an example of statistical fit that is somewhat artificial - we in effect are regressing a noisy variable against its own noise. For these reasons [Equation (i)] was found preferable for use in the final model. (p. 59)
There are no particular reasons within macroeconomic theory for choosing (i) instead of (ii). Pindyck's argument shows that the good statistical fit of
Equation (ii) is outweighed by its dynamic properties. The interpretation of
the equation coefficients shows that Equation (i) is preferable to (ii). []
E x a m p l e 9 . In connection with the world models constructed by Forrester
[11] and Meadows e t al. [23] some interesting sensitivity analyses have been
TOOLS FOR FORECASTING AND PLANNING 263
made. The authors are well aware of the problems connected with the esti-
mation of the equation parameters in their models. It is argued, however, that
accuracy is not necessary since the main variables of the models show the
same qualitative behaviour, which essentially means an impending population
catastrophe, even if the parameter values are allowed to vary within 'reason- able' limits. In support of this claim both Forrester and Meadows et al. pro-
vide the results of a number of simulations of their models where various
equation parameters and initial values are changed. For instance, Meadows
et al. show that even if the assumed initial amount of natural resources is
doubled, there is no essential change in the behaviour of the population vari-
able, but a catastrophe is also in this case predicted to occur in the middle of the 21st century.
It is, however, not sufficient to make sensitivity tests for some model
parameters only in order to evaluate the reliability of the models, but a more
systematic investigation is necessary. Zwicker [32] has pointed out a most
undesirable case of sensitivity in Meadows' model. He studies the effects of
changing three coefficients in some equations concerning capital. In Meadows'
model, the constants are called ALIC (average lifetime of industrial capital),
ICOR (industrial capital coefficient) and FIOAC (consumption fraction).
Zwicker increases the values for ALIC and ICOR by 10% and decreases the
value for FIOAC by 10%. These changes are clearly within the error limits of
the estimations of the coefficients. Zwicker shows that if these constants are
changed in the year 1900 in a simulation run, then all important state vari-
ables (population, natural resources, food per capita, industrial production,
and pollution) will remain at a constant or almost constant level for all foreseeable future. In another simulation it is shown that if these changes of
the equation coefficients are made in the year 1975, instead of 1900, then
the curves will have almost the same form as those considered desirable by Meadows et al.
Zwicker's investigation shows, contrary to what is maintained in Limits to
Growth, that Meadows' model is unwarrantably sensitive to changes in the
values of the equation parameters. His simulations are alone sufficient to
considerably lower the trustworthiness of the conclusions and recommend- ations presented in Limits to Growth. []
264 PETER G ~ R D E N F O R S
3. Forecasting
Once a dynamic model has been constructed, forecasting with the aid of the
model is, from a methodological point of view, comparatively simple. What is
needed when using a deterministic model is a set of initial values for the state
variables and a set of values for the input variables for the time interval of the
forecast. With the aid of these requisites the model is then run into the
future. If the model is stochastic it is necessary to rely on so-called stochastic
simulation. Such a simulation can be made by including various kinds of
errors in the description of the model - making the external variables and the
equation parameters random variables and adding error terms to the
equations. By running a large number of simulations with this kind of model
one gains a comprehension o f the most likely values of the forecasting vari-
ables and with the aid of the distribution of the simulation curves one also
obtains a rough measure of the uncertainty of the forecast. 9
The acquisition of future values of the input variables may be problematic.
As regards policy variables, the forecaster chooses their values himself or
relies on the decision makers' plans, but when it comes to external variables,
their values must be obtained from already given forecasts.
A. Conditional and Unconditional Forecasts
A distinction is often made between conditional and unconditional forecasts.
In an unconditional forecast, it is assumed that values for all the input vari-
ables are known with certainty. In a conditional forecast, values for one or
more of the input variables are not known with certainty, but the forecast is
made under the condition that certain assumed values of the input variables
will become the actual values.
If the dynamic model, which is used for making a forecast, is a closed
model, i.e., without input variables, then all forecasts will, by definition, be
unconditional. When models with control variables are used, which are the most common and most important cases, unconditional forecasts will occur only in extreme cases, for instance when the input variables influence the state variables only after some lag and the forecast interval is so short that only known values of the input variables are needed. In all other cases, the required values of the input variables have to be found in some other way.
For all external variables utilised when forecasting with a dynamic model
TOOLS FOR FORECASTING AND PLANNING 265
one needs forecasts of their future values. These forecasts are afflicted with
some uncertainty and this uncertainty is the source of a corresponding uncer-
tainty in the forecast made by the dynamic model. If the uncertainty of the
external variables can be described by some probability distribution around
an expected mean, then, with the aid of statistical analysis or stochastic simu-
lation, this distribution can be used to determine the sensitivity of the fore-
cast obtained by the dynamic model to errors in the forecasts of the external
variables.
As regards the policy variables, one cannot, in the same sense, forecast their future values, since they are dependent, at least to some extent, on
decisions which are not yet made. This means that a forecast which depends
on policy variables is made under the condition that certain decisions are
made. Thus, the distinction between forecasting and planning is dissolved in
these cases, since the forecasts are dependent on the planning for the future.
B. The Uncertainty of Forecasts A fundamental source of uncertainty is the model used when producing the
forecast. The choices of variables and model equations are, as pointed out
earlier, always dependent on the theory of the process which is to be
modelled; and within the fields where forecasts are most desirable, i.e., much of the social sciences, theories are often vaguely formulated and considered
to be first approximations only. Dynamic models which are constructed with the aid of such theories are, at best, rough simplifications of the process of
interest. This kind of uncertainty is unavoidable and it is extremely difficult
to judge how much the simplifications and idealisations influence a given
forecast.
When judging the uncertainty of forecast obtained from econometric models it is often assumed, simply, that the model is a correct description of
the real process to the extent that the right variables are included in the
model and the mathematical form, at least, of the model equations is correct.
Even if this strong assumption is made, there are several sources of uncer-
tainty and error in the forecast. I have already mentioned the uncertainty due
to the forecasts of the values of the external variables. Other uncertain factors are the parameters in the model equations. These are almost never the 'real'
coefficients but only estimated values. The process of estimating the parameters may give some clues to the degree of uncertainty associated with
266 PETER G~RDENFORS
these values. For instance, the standard error of the parameter can be estimated.
If a model equation contains a stochastic error term, this is another source
of uncertainty since the future values of this error term have to be predicted. Here it is possible to form a picture of the properties of the error term by studying the residuals of the estimation from the set of available data. For instance, one may estimate the mean of the error (in general zero), the standard error and the serial correlation of the errors. These estimated properties are then assumed to be valid also for the errors in the forecasting
period.
For each of these types of uncertainty it may be possible to use some kind of statistical analysis to compute a set of confidence intervals for the forecasts
of the state variables. However, this kind of analysis is complicated even if strong assumptions are made and as soon as several kinds of uncertainty are
to be combined one has to use other tools. Here stochastic simulation can be
helpful.
4. Policy Experiments
A. Trial and Error
One of the main advantages of using dynamic models is that they are very
suitable for policy experiments. If we assume that reliable forecasts for the external variables are available, then by studying the behaviour of the model
for various choices of the future values of the policy variables one may get a rough idea of which planning leads to the most desired combinations of values for the input and state variables. As was pointed out above, forecasting with the aid of a dynamic model in general involves a certain amount of plan-
ning and there is thus little difference between the two kinds of activities. Necessary prerequisites for this method of planning are ( i)a certain
knowledge of the dynamic properties of the model, (ii)enough energy to investigate a large number of choices of future values of the policy variables in a systematic manner, and (iii)a sufficient amount of computer time (analytic solutions are, in general, too complicated and therefore expensive to use). By repeated simulation trials and by learning from mistakes, it is possible to pick out some plans for the future which will result in acceptable consequences.
TOOLS FOR FORECASTING AND PLANNING 267
Example 10. Both Forrester and Meadows et al. use their dynamic models as
tools for policy recommendations. Their world models are, as pointed out
above, defined as closed models, but, in practice, some of the equation parameters function as policy variables. In both cases, the authors' aim is an
equilibrium state of the world with constant population and constant quality
of life. Forrester shows, by some examples, that such an equilibrium is
possible in his model, but he does not give any account of an attempt to show that the final equilibrium is the 'optimal' one (which may mean that it has as high quality of life as possible). The changes of model parameters which leads
to the equilibrium state in his final example can hardly be taken as recom-
mendations for future planning since one change is that food production is
reduced by 20% in 1970. The method used by Meadows et al. is similar. It is not argued that the
recommended changes are optimal, but it is claimed that if the changes are
postponed (to 2000 instead of 1975), then it is no longer possible to maintain the equilibrium state. []
B. Optimal Control Trial-and-error is no efficient method of finding the optimal planning for the future with a dynamic model and there is no guarantee that it will ever be
found. As a consequence, one has tried to use certain results from'control
theory by constructing the optimal planning problem as a mathematical optimisation problem.
Such an optimisation problem contains the following components:
(i) a dynamic model; (ii) a time interval for which the optimal planning is to be computed;
(ii 0 a set of initial values of the state variables, and, possibly, a set of final values which the state variables must reach in the final state;
(iv) restrictions on the state and input variables (these restrictions may consist of maximum or minimum limits which must not be overstepped);
(v) a function which measures the utility of the behaviour of the model for various choices of values for the input variables (this utility function is a quantitative aggregation of the planner's goals, values and costs).
When these five components are given, the optimisation problem is then to find the set of values for the control variables during the time interval such that the utility of the behaviour of the model is maximised.
268 PETER G~RDENFORS
For certain special cases there are general algorithms for solving this optimisation problem. The cases which are most extensively studied are linear time-invariant models. As regards the utility measure, it is constantly assumed that the utility of the behaviour of the model during the time interval is the sum of the utilities of the states of the model during that interval. The utility of a state of the model is then, normally, determined in the following way: for each state and input variable, first determine an 'optimal' value of the variable, and then let the utility of the actual value of the variable in the given state depend on its distance from this optimum. It is often assumed that the utility of a variable is inversely proportional to the square of the distance from the optimal value. The utility of a state, finally, is then computed as a weighted sum of the utilities of the state and input variables, where the weight of a variable depends on its relative importance.
For the kind of optimisation problem outlined above, there are numerical
methods for computing the optimal sequence of values for the input variables
(cf. e.g. Chow [6] and Pindyck [28]). The computations soon get complex and a computer is unavoidable. I f the dynamic model to be used corresponds
well to the process to be analysed and if the utility function realistically
describes actual goals, costs and values, then one may avoid intuitive planning
and instead rely on the computed solution.
Example 11. Pindyck [28] describes a small macroeconometric model for the American economy which he then uses as a basis for optimisation investi-
gations. He uses the same kind of utility function as described above. He com-
putes the optimal series of values for the input variables during a five year period and studies the effects of using different variants of the utility func-
tion (obtained by varying the weights of the variables). In this way it is possible to acquire a conception of how sensitive the optimal planning is to
the uncertainty of the utility valuations. Pindyck also studies how sensitive the optimal planning is to small vari-
ations of the parameters in the model equations. He shows, by an example, that a change in a parameter (for the autoregressive term of the price equation) which is definitely within the margin of error, will radically change the series of optimal values for the poficy variables. Pindyck confesses that
these results are rather disconcerting, since they say that a small change in a coefficient value can result in a large change in the optimal policy and in the resulting behaviour of
TOOLS F O R F O R E C A S T I N G AND P L A N N I N G 269
the economy. This throws some doubt on the usefulness of our particular econometric model for policy planning. In particular, the explanatory power of the model as indi- cated by the standard regression statistics is probably misleading because autoregressive terms were used so extensively. In any case, a sensitivity testing of the optimal policy solution is, as we have here seen, essential before that policy is used. (pp. 146, 148)
Somewhat later, in connection with a similar result for a small change in the
autoregressive term in the consumption function, he complements the argu-
ment above with the following methodological remark:
What this says is that the optimal policy is very sensitive to a change in a particular coefficient because the model itself is sensitive to such a change. In particular, the use of autoregressive terms in building models must be done with caution, since the apparent increase in explanatory power that results is just an extrapolation of past trends, the effects of changes in other coefficients are magnified, and a change in the autoregressive coefficient itself has a large effect on the behaviour of the model. (pp. 149, 151)
Pindyck's remark that the standard way of judging the explanatory value of a
model may be misleading accords well with the discussion in Ggrdenfors [14]
of the appropriateness of the R 2 value as a measure of the explanatory value. []
The discussion so far has been limited to the optimal control of deterhainistic
systems. When stochastic systems are considered one faces additional
problems when determining the optimal sequence of control values. The com-
putat ional techniques become even more complicated. But, apart from this
complication, there is a more fundamental problem which arises from the fact
that for a stochastic system we cannot 'know' the state of the system at a
future point of time, even if all values of the input variables are known up to
that point.
When considering deterministic systems the planner may specify the time
paths o f the policy variables at the beginning of the planning period. These
paths can then be followed without regard to future events. For stochastic
systems another type of pol icy making may be necessary. Here the policy
variables should be specified as functions of observations of the state of the
system yet to be made, so that the future values of the variables depend on
future observations. This type of control is called feedback control or a
closed loop policy, while the former type is called an open loop policy.
270 PETER G~RDENFORS
IV. CONCLUSION
Dynamic models are, no doubt, among the best tools for forecasting and plan-
ning. The conceptual framework is rich enough to cover most mathematically formulated models in which temporal aspects are central, and it allows precise formulations of causal relationships and mutual dependencies between vari-
ables. The equations in a model are formal representations of the structure of a complex situation. Our intuitions and theoretical education may be of good
help when formulating these equations. But when the pieces of the model
have been assembled, our intuitions are poor when estimating the dynamic interactions of the parts of the model. Here the computer may be extremely helpful. The mould of a dynamic model, as given by the definitions, is well
suited for computer representations. Much of the estimation and experiment- ing in connection with the forecasting and planning process can be efficiently
carried out on a computer. The fundamental problem when using dynamic models is how to find the
model which is the most suitable representation of the 'real' system or pro-
cess one wishes to forecast. Since dynamic models are such powerful tools for forecasting and planning it is easy to forget that the choice of model for a
particular process presents the greatest methodological problems. In my opinion, too much emphasis has been put on mathematical and statistical
methods for analysing given dynamic models and too little on how to choose among the potential models. Only when it comes to parameter estimation,
some detailed work has been done. However, the efforts put into the con- struction of efficient estimation procedures have tended to overshadow other, more fundamental, problems of modal construction. Also, the work with
estimation procedures has concentrated on the statistical fit of the model to given data and neglected the dynamic properties of the resulting model. Even though the problems connected with choice of variables and choice of model equations do not lend themselves to mathematical formalism as parameter estimation does, these problems should not be neglected since they are of fundamental importance for the success of the dynamic model to be
constructed. Throughout the paper I have illustrated my arguments with examples
taken from world models and macroeconomic models. I will now, finally, make some brief remarks on the methodological status of these models.
T O O L S F O R F O R E C A S T I N G A N D P L A N N I N G 271
The main advantage of a small world model is that it may present a general view of a complex structure. Even if such a small model must be based on
gross simplifications, it may have a basically correct dynamic structure, which can be studied and evaluated. Already in models of this small size our mental
powers are not sufficient to keep count of the dynamic changes in the model,
but a computer is necessary. I have already commented upon some of the drawbacks of Forrester's and
Meadows' world models. The variables in the models are difficult to measure, and this is mainly due to their high level of aggregation. This makes any
numerical relationships in the system equations hard to test. Even if simu- lations of the world models show a reasonable fit with actual data, the dynamic properties of the models must be considered highly unrealistic. They are, for instance, unwantedly sensitive to changes of some parameters, and thus the forecasts produced by the models are on the verge of nonsense.
The macroeconomic models fare somewhat better. Variables are only allowed in these models if they are objectively measureable. A great deal of effort is spent on estimating the parameters in the model equations. Some
analysis of the sensitivity of the models is often made, in particular by studying dynamic multipliers.
From a methodological point of view, the standard way of constructing and exploiting dynamic macroeconomic models nevertheless has some flaws. The class of permitted model equations is too often restricted to linear equations. This restriction is motivated by the applicability of the 'best'
estimation procedures. However, this argument points to another drawback: too much emphasis is put on the statistical fitting of the model to data and
too little on the dynamic properties of the model. Finally, among the many
measures used to evaluate a model, few have been supplied with a good motivation for being appropriate for forecasting and planning models.
Lund University, Sweden
N O T E S
* The work with this paper has been supported by a research project on future studies sponsored by the Planning Division o f the Research Inst i tute of Swedish National Defence (FOA P). I wish to thank Professor Eckhart Zwicker for extensive and construc- tive criticism of earlier versions o f this paper.
272 P E T E R G X R D E N F O R S
i I have befitted from the presentation in AstrSm [3]. 2 A note on terminology: Economists sometimes distinguish between models, where the parameters in the equations are undetermined variables, and structures, which are models where the parameters have been replaced by fixed numbers. a For an elementary presentation of the solutions of difference equations and their economic interpretations, the reader is referred to Baumol [5 ]. 4 For a presentation of simulation languages, cf. Kiviat [19]. A popular tool for econo- metric forecasts is the TROLL system. The use of the language DYNAMO is presented in Pugh [30], and the SIMNON language, which is a general programming language for simulating (linear and non-linear) dynamic systems, is presented in Elmqvist [9]. 5 Cf. Forrester [12] and [11] and Pugh [30]. 6 This assumption is not realistic, but it will simplify the mathematics.
For stability analyses of difference equations, cf. Baumol [5]. 8 This decomposition was first presented and discussed by Thei1 [ 31 ]. 9 In the case of non-linear stochastic models Howrey and Keleijan [17] argue that only stochastic simulations of models containing stochastic errors are appropriate since the non-stochastic portions of the model may yield inconsistent forecasts. Their arguments are questioned by Aigner [2]. Also cf. Haavelmo's paper which was mentioned earlier.
R E F E R E N C E S
[ 1 ] Aberg, C. J., Samhiillsekonomisk prognosteknik, Norstedts, Stockholm, 1966. [2] Aigner, D. J., 'A Note on Verification of Computer Simulation Models', Manage-
ment Science 18 (1972), 615 -619. [3 ] AstrSm, C. J., Reglerteori, 2rid ed., Almqvist & Wiksell, Uppsala, 1976. [4] AstrSm, C. J., Introduction to Stochastic Control Theory, Academic Press, New
York, 1970. [5] Baumol, W. J., Economic Dynamics, 3rd ed., Macmillan, Toronto, 1970. [6] Chow, G. C., Analysis and Control of Dynamic Economic Systems, Wiley, New
York, t975. [7] Cole, H. S. D. and Curnow, R. C., 'An Evaluation of the World Models', Futures
5 (1973), 108-134. [8] Dutton, J. M. and Starbuck, W. H., Computer Simulation of Human Behaviour,
Wiley, New York, 1971. [9] Elmqvist, H., SIMNON: User's Manual, Lund Institute of Technology, Lund,
1975. [10] Evans, M. K., Macroeconomic Activity, Harper & Row, New York, 1969. [ 11 ] Forrester, J. W., World DynamiCs, Wright-Allen Press, Cambridge, Mass., 1971. [12] Forrester, J. W., Principles of Systems, 2nd ed., Wright-Allen Press, Cambridge,
Mass., 1968. [ 13 ] Frijda, N. H., 'Problems of Computer Simulation', Behavioural Science 12 (1967),
59-67. (Reprinted in Dutton & Starbuck [8] .) [14] G~denfors, P., 'On the Information Provided by Forecasting Models', Techno-
logical Forecasting and Social Change 16 (1980), 351- 361. [15] Haavelmo, T., 'The Inadequacy of Testing Dynamic Theory by Comparing
Theoretical Solutions and Observed Cycles', Econometrica 8 (1940), 312-321.
TOOLS FOR F O R E C A S T I N G AND P L A N N I N G 273
[16] Hanna, J.F., 'Information-theoretic Techniques for Evaluating Simulation Models', in Dutton & Starbuck [8], pp. 682-692.
[17] Howrey, E. P. and Keleijan, H. H., 'Simulation Versus Analytical Solutions: The Case of Econometric Models', in Naylor [26], pp. 299-319.
[18] Johnston, J.,EeonometrieMethods, McGraw-Hill, New York, 1963. [19] Kiviat, P. J., 'Simulation Languages', in Naylor [26], pp. 406-408. [20] Klein, L. R., An Essay on the Theory of Economic Prediction, Yrj6 Jahnssonin
Siiiiti/5, Helsinki, 1968. [21] Kmenta, J., Elements of Eeonometrics, Macmillan, New York, 1971. [22] Maier, H. and Hugger, W., 'Invariant Structures of Prognosis Curves in Forrester's
World Dynamics', Theory and Decision 3 (1973), 231-261. [23] Meadows, D. L., Meadows, D. H., Randers, J., and Behrens, W. W., The Limits to
Growth, Wright-Allen Press, Cambridge, Mass., 1972. [24] Meadows, D. L. et al., Dynamics of Growth in a Finite World, Wright-Allen Press,
Cambridge, Mass., 1974. [25] Meadows, D. L. etal., 'A Response to Sussex', Futures 5 (1973), 135-152. .~ [26] Naylor, T.H., Computer Simulation Experiments with Models of Economic
Systems, Wiley, New York, 1971. [27] Nordhaus, W.D., 'World Dynamics: Measurement Without Data', Economic
Journal 9 (1973), 1156-1183. [28] Pindyck, R. S., Optimal Planning for Economic Stabilisation, North-Holland,
Amsterdam, 1973. [29] Pindyck, R. S. and Rubinfeld, D. L., Econometric Models and Economic Fore-
casts, McGraw-Hill, New York, 1976. [ 30 ] Pugh, A. L., DYNAMO User's Manual, the MIT Press, Cambridge, Mass., 1963. [31] Theft, H.,Economic Forecasts and Policy, North-Holland, Amsterdam, 1961. [32] Zwicker, E., 'M6glichkeiten und Grenzen der modellgesttitzten Prognose
sozio6konomischer Entwicklungen: Dargestellt am Beispiel der Weltmodelle yon Meadows und Forrester', Preprint Series of the International Institute of Manage- ment 1/76-84, Berlin.