View
839
Download
0
Tags:
Embed Size (px)
DESCRIPTION
Sustainable development in three newspapers: How does coverage in a particular newspaper influence other newspapers’ attention to sustainable development? Vector autoregression
Citation preview
Sustainable development in three
newspapers:
How does coverage in a particular newspaper influence other newspapers’ attention to
sustainable development?
Vector autoregression
Assignment 4
Mark Boukes ([email protected])5616298
1st semester 2010/2011Dynamic Data Analysis
Lecturer: Dr. R. VliegenthartDecember 9, 2010
Communication Science (Research MSc) Faculty of Social and Behavioural Sciences
University of Amsterdam
Table of contents
INTRODUCTION...................................................................................................................................................1
METHOD.................................................................................................................................................................1
RESULTS.................................................................................................................................................................2
VAR MODEL...........................................................................................................................................................2
CONCLUSION........................................................................................................................................................8
Reference...................................................................................................................................................................8
IntroductionIn this study I aim to investigate the influence news coverage in a particular newspaper has on
the coverage of another newspaper, and viceversa. For this purpose I have chosen a specific
topic, sustainable development, that seems to get a lot media attention in the last years. The
topic of sustainable development was chosen, because it can be related to several parts of
society, such as the economy, science and also for the man in the street this topic is relevant.
As those different parts of society are represented by different media, it is interesting to
see how they influence each other on the amount of attention that is paid to this issue. Will an
increase in attention of business men’s newspapers result in an increase of attention in
newspapers that deal for a large part with science; and is this also occurring the other way
around? How is the attention in a popular newspaper caused by or perhaps causing itself
attention in scientific or economic newspapers. The most read popular newspaper in the
Netherlands is De Telegraaf, a newspaper that has a main economic or business focus is Het
Financieele Dagblad, and NRC Handelsblad is known for its relative large attention to
scientific developments.
To study the relationship between the attention to sustainable development in news
coverage of these three newspapers, my main research question was: Does the amount of
attention in one newspaper for sustainable development cause attention in the other newspapers
and viceversa? The sub research questions were therefore the following:
Did the amount of attention to sustainable development in De Telegraaf causes the amount of attention in NRC Handelsblad?
Did the amount of attention to sustainable development in NRC Handelsblad causes the amount of attention in De Telegraaf?
Did the amount of attention to sustainable development in Het Financieele Dagblad causes the amount of attention in NRC Handelsblad?
Did the amount of attention to sustainable development in NRC Handelsblad causes the amount of attention in Het Financieele Dagblad?
Did the amount of attention to sustainable development in Het Financieele Dagblad causes the amount of attention in De Telegraaf?
Did the amount of attention to sustainable development in NRC Handelsblad causes the amount of attention in De Telegraaf?
MethodA dataset was created via a computer assisted content analysis that was conducted using the
digital archive of LexisNexis. Articles were selected via the Boolean search term duurza! OR
"groene energie" OR "zonne-energie" OR "windenergie". The period I analyzed was from 1
1
January 1999 until 31 December 2009. This period was chosen, because information about De
Telegraaf is only available from 1999. The search procedure was repeated three times; one time
for every newspaper, so three variables could be created by aggregating the data on a weekly
basis. A total of 35225 articles were found for 581 weeks; 18501 in Het Financieele Dagblad,
10335 in NRC Handelsblad and 6389 in De Telegraaf.
To analyse the effects of the different newspapers on each other, a vector autoregression
(VAR) analysis was conducted in Stata 10.1. A VAR analysis was chosen, because there are no
clear theories about which variables are exogenous in which relations; in a VAR analysis all our
newspaper article variables could be endogenous. To control for possible effects of the various
climate conventions that were held in the period under study, a dummy variable was created for
the weeks1 in which such conferences were held.
ResultsI specify in this results section, how the VAR analysis was conducted and which results it
found. In doing this, I follow the procedure described by Brandt and Williams (2007).
VAR modelFigure 1 plots the time series of the attention in the three newspapers for the period that we are
studying. It seems that Het Financieele Dagblad (FD) pays the most attention to sustainable
development and De Telegraaf the least attention. The amount of attention seems to be quite stable
over time, and that is also what augmented Dickey-Fuller tests confirm (see Table 1). Hypotheses
for unit root are rejected, so the data is treated as stationary and I did not need to integrate the data.
Figure 1. The number of articles about sustainable development over time in the three newspapers of interest.
1 Based on http://en.wikipedia.org/wiki/United_Nations_Framework_Convention_on_Climate_Change
2
Table 1. The results of augmented Dickey-Fuller tests for the amount of articles over time Augmented Dickey-Fuller test FD NRC Telegraaf
Random walk without drift -4.990 -5.596 -6.560
Random walk with drift -10.104 -14.525 -10.878Random walk with drift and trend -15.716 -16.751 -16.487
Note. All tests indicate the absence of a unit root.
In order to create the VAR, I had to select the appropriate number of lags for the model. Various
models were tested with lag lengths ranging from 1 to 8. Model fit statistics suggest that a model
that includes either two (AIC= 20.93, SBIC= 21.11) or seven lags (AIC = 20.90, SBIC = 21.43)
has the best fit. When lag-order selection statistics for the different VARs are studied, according
to Akaike Info Criterion (AIC) and by inspecting the differences in Log Likelihood a VAR
model with seven lags is preferred; however, according to the Schwarz's Bayesian information
criterion (SBIC) and the Hannan and Quinn information criterion (HQIC) the VAR model with
two lags should be preferred. Finally, I chose to use the VAR model with 2 lags as this model
was much more parsimonious and differences in AIC with models including more lags were
very small.
To check whether this model can be used, the residuals of the three newspaper variables
in this model were tested for autocorrelation with the Ljung–Box Q test statistic and for the
presence of conditional heteroscedasticity with the Engle-Granger test. The residuals of the
number of articles in Het Financieele Dagblad (Q = 26.33, p = .16) and NRC Handelsblad (Q =
15.28, p =.76) reflected white noise, however the residuals of the number of articles in De
Telegraaf seem to autocorrelate (Q = 32.60, p = .04). To solve this, the model was extended by
including another lag, resulting in the VAR model with three lags. This model does not seem to
the problems of autocorrelation in the residuals for Het Financieele Dagblad (Q = 24.89, p
= .21), NRC Handelsblad (Q = 15.76, p =.73) and De Telegraaf (Q = 31.38, p =.05). However,
the squared residuals of Het Financieele Dagblad (Q = 82.49, p < .01) and De Telegraaf (Q =
32.39, p =.04) indicate the presence of heteroscedasticity; the squared residuals of NRC
Handelsblad do not reflect heteroscedasticity (Q = 10.61, p = .96). However, besides noting it, I
did not pay attention to this, but it will be solved with ARCH and GARCH models later in this
Dynamic Data Analysis course.
The VAR model with three lags explains more than half of the variation in the number of
articles per week in Het Financieele Dagblad (R2 = .577) and De Telegraaf (R2 = .558). The
proportion of explained variance of the number of articles per week in NRC Handelsblad is
3
somewhat less, though still substantial, because almost a third of the variation is explained (R2 =
.316). The estimated impact of the various variables in the model can be found in Table 2. The
values in this table can be used to understand the direction of effects, but not to understand the
effects of particular lags on a dependent variable. The values are imprecise and standard errors
are high due to multicollinearity in the VAR model; therefore, we used Granger causality
testing to inspect the joint statistical significances of the different independent variables (see
Table 3).
Table 2. VAR estimates for the 3-lag model of the number of articles in different newspapersDependent variable
Het Financieele Dagblad NRC Handelsblad De Telegraaf
Het Financieele Dagblad (t-1) 0,32 (-0,04)* 0,11 (0,03)* 0,10 (0,02)*
Het Financieele Dagblad (t-2) 0,17 (0,05)* 0,03 (0,03) 0,02 (0,02)
Het Financieele Dagblad (t-3) 0,03 (0,04) -0,05 (0,03) 0,05 (0,02)*
NRC Handelsblad (t-1) 0,16 (0,07)* 0,23 (0,04)* 0,08 (0,04)*
NRC Handelsblad (t-2) -0,07 (0,07) 0,11 (0,04)* -0,01 (0,04)
NRC Handelsblad (t-3) 0,05 (0,07) 0,01 (0,04) -0,12 (0,04)*
De Telegraaf (t-1) 0,47 (0,08)* 0,08 (0,05) 0,26 (0,04)*
De Telegraaf (t-2) 0,03 (0,09) -0,01 (0,05) 0,20 (0,04)*
De Telegraaf (t-3) 0,11 (0,08) 0,09 (0,05) 0,09 (0,04)*
Climate conventions 0,35 (2,46) 0,71 (1,50) -2,25 (1,28)
Constant 4,97 (1,39)* 7,25 (0,85)* 0,86 (0,72)
Ljung-Box Q(20) residuals 24.89 15.76 31.38Ljung-Box Q(20) residuals² 82.49* 10.61 32.39*
R2 .577 .316 .558Note. Unstandardized coefficients. Standard errors in parentheses; * p<.05
To test whether the news coverage about sustainable development in one newspaper Granger-
causes news coverage about sustainable development in another newspaper, I look at the test for
Granger-causality. It becomes clear that De Telegraaf is Granger-caused by both Het
Financieele Dagblad and NRC Handelsblad; the chi-squared test suggest that excluding the
lagged values of Het Financieele Dagblad results in a worse prediction of De Telegraaf (χ2 =
35.67, p < .001), the prediction also becomes worse when NRC Handelsblad’s lagged values are
excluded from the model (χ2 = 14.62, p < .01). This substantially means that news coverage
about sustainability in both Het Financieele Dagblad and NRC Handelsblad cause coverage in
De Telegraaf. From Table 2 we can know that these effects will be positive; more news
coverage about sustainability in Financieele Dagblad and NRC Handelsblad will result in more
news coverage about this issue in De Telegraaf. Similar Granger causality effects are found for
4
the news coverage of De Telegraaf on Het Financieele Dagblad (χ2 = 42.98, p < .001) and of
coverage in Het Financieele Dagblad on coverage in NRC Handelsblad (χ2 = 19.10, p < .001);
these effects will be positive as can be found in table 2. Two investigated relations between
newspaper were not significant; news coverage about sustainable development in Het
Financieele Dagblad is not Granger-caused by NRC Handelsblad and news coverage about this
issue in NRC Handelsblad is also not Granger-caused by De Telegraaf. The results of the
Granger causality tests indicate that the coverage of not any of the newspapers are exogenous,
as they all are Granger-caused by one or two other newspapers.
Table 3. Granger causality tests for the number of articles in the different newspapers
Hypothesid exogenous variable Block coefficients restriced χ2 df pHet Financieele Dagblad NRC Handelsblad 62.998 3 0.098Het Financieele Dagblad De Telegraaf 42.983 3 0.000
NRC Handelsblad Het Financieele Dagblad 19.099 3 0.000NRC Handelsblad De Telegraaf 73.269 3 0.062
De Telegraaf Het Financieele Dagblad 35.666 3 0.000De Telegraaf NRC Handelsblad 14.615 3 0.002
The values of the Granger causality tests do not really help in having a good understanding of
the different effects. Therefore, impulse response analysis were used, by which it is possible to
see the over-time effects of an unexpected one-unit increase in an independent variable in the
future values for the dependent variable. Figure 2 shows the different graphs belonging to the
impulse response analysis. It shows a significant and positive effect of De Telegraaf on Het
Financieele Dagblad, which endures the complete period; one week after an unexpected one-
article increase in De Telegraaf, an increase of about half an article is found in Het Financieele
Dagblad, this effect decays but still resulted in a quarter article increase after eight weeks. The
effect of Het Financieele Dagblad on NRC Handelsblad seems smaller; one week after an
unexpected one-article increase in Het Financieele Dagblad, about a tenth of an article increase
in NRC Handelsblad is expected, this effect drops to zero but does not become insignificant. A
similar effect is found for the coverage about sustainable development in Het Financieele
Dagblad on coverage about this issue by De Telegraaf. The effect of coverage in NRC
Handelsblad on coverage in De Telegraaf is also rather weak, however this effect becomes
insignificant after one week.
5
Figure 2. Graphs of the impulse response analysis, title of graph is [impulse variable], [response variable]
When the consequences of the effects are added together in a cumulative impulse response
analysis, it is possible to see what the result was of a one-article increase in a newspaper on the
other newspapers’ number of articles about this topic summed together over a period of time
(eight weeks in this case). Figure 2 shows the graphs belonging to the cumulative impulse
response analysis. The graph which displays the cumulative effect of De Telegraaf on Het
Financieele Dagblad, shows that an unexpected one-article increase in this popular newspaper
resulted in about three more article in Het Financieele Dagblad after eight weeks. Viceversa, an
unexpected increase of one article in Het Financieele Dagblad results after eight weeks in about
just one more article in De Telegraaf. About the same was found for the effect of Het
Financieele Dagblad on NRC Handelsblad. The weakness of the effect of NRC Handelsblad on
De Telegraaf also becomes clear in this figure; eight weeks after a one-article increase in NRC
Handelsblad this did not led to any increase in the number of articles about sustainable
development in De Telegraaf.
6
Figure 3. Graphs of the cumulative impulse response analysis, title of graph is [impulse variable], [response variable]
Finally, the decomposition of the Forecast Error Variance is assessed to investigate the effects
of the different variables. The graphs belonging to this decomposition are shown in Figure 4.
The decomposition of the Forecast Error Variance determines the impact of one variable’s
forecast error on the error in forecasting other variables; it indicates how much of the series
future values cannot be explained due to unexpected shocks in all the different variables in the
system. After eight weeks for example, 13 percent of the error variance of the number of articles
about sustainable development in Het Financieele Dagblad can be attributed to shocks in the
number of article about this issue in De Telegraaf. Substantially, this means that about a
twentieth of the proportion of variance in this number of articles in Het Financieele Dagblad is
caused by variance in De Telegraaf. Remarkable is the relatively large proportion of Forecast
Error Variance in De Telegraaf that can be attributed to variance in Het Financieele Dagblad;
after eight weeks about 33 percent of this error variance of De Telegraaf can be attributed to
shock in Het Financieele Dagblad. When the diagonal of Figure 4 is assessed, it is clear that the
proportion of error variance that can be attributed to other newspapers is the largest for De
Telegraaf; the error variance of Het Financieele Dagblad and NRC Handelsblad seem to be less
influenced by other newspapers.
7
Figure 4. Graphs of the decomposition of the Forecast Error Variance, title of graph is [impulse variable], [response variable]
ConclusionThe answers on the various research questions are: yes, Het Financieele Dagblad Granger-
caused news coverage about sustainable development in both De Telegraaf and NRC
Handelsblad; NRC Handelsblad Granger-caused coverage in De Telegraaf, but not in Het
Financieele Dagblad; and De Telegraaf Granger-caused news coverage about sustainable
development in Het Financieele Dagblad, but not in NRC Handelsblad.
Based on these findings, I conclude that developments in the economy or business world
attract both the attention of popular media and media with a focus on science. In addition,
attention in popular newspapers also causes coverage in business media. On the other hand,
coverage in the scientifically oriented NRC Handelsblad did not causes differences in attention
of business newspaper Het Financieele Dagblad, and NRC Handelsblad, for its part, seems not
to be influenced by the popular medium De Telegraaf. To extend this discussion: the world of
science seems thus not to be interested in what the ordinary people think and business people
are not interested in what science has to offer them. The man in the street on the other hand is
interested in developments of both science and economics, while business people also follow
what ordinary people think is interesting.
ReferenceBrandt, P., & Williams, J. T. (2007). Multiple time series models. Thousand Oaks: Sage
Publications.
8
Do File:*Climate conventionsgen convention=0replace convention=1 if week==43 |week== 44|week==99 |week== 100|week== 135|week==136|week== 150|week== 151 |week==201 |week==202 |week==260 |week==261 |week==314 |week==315 |week==365 |week==366 |week==415 |week==416 |week==471 |week==472 |week==524 |week==525 |week==578 |week==579
replace week = week + 2027
tsset week, weekly
twoway (tsline FD, lcolor(red)) (tsline NRC, lcolor(green) lpattern(dash) lwidth(medthick)) (tsline Telegraaf, lcolor(blue) lpattern(dash) lwidth(medium))
*Data for FD was double entered in 2006replace FD=FD/2 if week>=2398 & week<=2449
twoway (tsline FD, lcolor(red)) (tsline NRC, lcolor(green) lpattern(dash) lwidth(medthick)) (tsline Telegraaf, lcolor(blue) lpattern(dash) lwidth(medium))
*with driftdfuller FD*random walkdfuller FD, noconstant*trenddfuller FD, trend
*with driftdfuller NRC*random walkdfuller NRC, noconstant*trenddfuller NRC, trend
*with driftdfuller Telegraaf*random walkdfuller Telegraaf, noconstant*trenddfuller Telegraaf, trend
varsoc FD NRC Telegraaf, maxlag(8) ex(convention)*two models are prefered: with 2 lags or with 7 lags
*Test the VAR model with 2 lagsvar FD NRC Telegraaf, lags(1,2) ex(convention)vargranger
*Is an independent variable Granger-causing the dependent variable?When we exclude a variable, does the model get signicantly worse?*df is the number of variables the model gains.
predict rv1, resid equation (FD)predict rv2, resid equation (NRC)predict rv3, resid equation (Telegraaf)
gen rv1_s = rv1* rv1gen rv2_s = rv2* rv2gen rv3_s = rv3* rv3
*test for white noise; p>.05wntestq rv1, lags(20)wntestq rv2, lags(20)wntestq rv3, lags(20)wntestq rv1_s, lags(20)wntestq rv2_s, lags(20)wntestq rv3_s, lags(20)
*autocorrelation in the residuals of De Telegraaf; so one more lag included
*Test the VAR model with 3 lagsvar FD NRC Telegraaf, lags(1,2,3) ex(convention)vargranger
drop rv1 rv2 rv3drop rv1_s rv2_s rv3_s
predict rv1, resid equation (FD)predict rv2, resid equation (NRC)predict rv3, resid equation (Telegraaf)
gen rv1_s = rv1* rv1gen rv2_s = rv2* rv2gen rv3_s = rv3* rv3
*test for white noise; p>.05wntestq rv1, lags(20)wntestq rv2, lags(20)wntestq rv3, lags(20)wntestq rv1_s, lags(20)wntestq rv2_s, lags(20)wntestq rv3_s, lags(20)
* no autocorrelation, heteroscedasticity for FD and Telegraaf
var FD NRC Telegraaf, lags(1,2,3) ex(convention)
*impulse response analysis irf create cms, set(cms)irf graph irfirf table irf
irf table cirfirf graph cirf
*Decompositition of forecast varianceirf graph fevdirf table fevd
ii