39
Publié par : Published by : Publicación de la : Faculté des sciences de l’administration Université Laval Québec (Québec) Canada G1K 7P4 Tél. Ph. Tel. : (418) 656-3644 Fax : (418) 656-7047 Édition électronique : Electronic publishing : Edición electrónica : Aline Guimont Vice-décanat - Recherche et partenariats Faculté des sciences de l’administration Disponible sur Internet : Available on Internet Disponible por Internet : http ://www.fsa.ulaval.ca/rd [email protected] DOCUMENT DE TRAVAIL 2001-015 MODELLING INTERACTIONS OF LOCATION WITH SPECIFIC VALUE OF HOUSING ATTRIBUTES Marius THÉRIAULT, François DES ROSIERS, Paul VILLENEUVE and Yan KESTENS Version originale : Original manuscript : Version original : ISBN 2-89524-136-8 ISBN - ISBN - Série électronique mise à jour : One-line publication updated : Seria electrónica, puesta al dia 11-2001

DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

  • Upload
    others

  • View
    3

  • Download
    0

Embed Size (px)

Citation preview

Page 1: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

Publié par : Published by : Publicación de la :

Faculté des sciences de l’administration Université Laval Québec (Québec) Canada G1K 7P4 Tél. Ph. Tel. : (418) 656-3644 Fax : (418) 656-7047

Édition électronique : Electronic publishing : Edición electrónica :

Aline Guimont Vice-décanat - Recherche et partenariats Faculté des sciences de l’administration

Disponible sur Internet : Available on Internet Disponible por Internet :

http ://www.fsa.ulaval.ca/rd [email protected]

DOCUMENT DE TRAVAIL 2001-015

MODELLING INTERACTIONS OF LOCATION WITH SPECIFIC

VALUE OF HOUSING ATTRIBUTES Marius THÉRIAULT, François DES ROSIERS, Paul VILLENEUVE and Yan KESTENS

Version originale : Original manuscript : Version original :

ISBN – 2-89524-136-8 ISBN - ISBN -

Série électronique mise à jour : One-line publication updated : Seria electrónica, puesta al dia

11-2001

Page 2: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

Modelling Interactions of Location with Specific Value of Housing Attributes

Marius Thériault, Ph.D.

Director – Planning and Development Research Centre (CRAD), Félix-Antoine-Savard Building, 16th floor, Laval University, Canada G1K 7P4

Phone (418) 656-2131 ext. 5899; Fax (418) 656-2018 e-mail: [email protected]

François Des Rosiers, Ph.D.

Professor – Faculty of Business Administration, Laval University, Canada G1K 7P4 e-mail: [email protected]

Paul Villeneuve, Ph.D.

Professor – Department of Planning, Laval University, Canada G1K 7P4 e-mail: [email protected]

Yan Kestens

Ph.D. Candidate - Department of Planning, Laval University, Canada G1K 7P4 e-mail: [email protected]

Page 3: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

1

Modelling Interactions of Location with Specific Value of Housing Attributes

Abstract

This paper presents a procedure for considering interactions of neighbourhood quality and property specifics

within hedonic models of housing price. It handles interactions between geographical factors and the marginal

contribution of each property attribute for enhancing values assessment. Making use of simulation procedures,

it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

economic census related to transaction prices of single-family homes. An application to the housing market of

the Quebec Urban Community (more than 3,600 bungalows transacted in 1990 and 1991) illustrates its

usefulness for building spatial hedonic models, while controlling for multicollinearity, spatial autocorrelation

and heteroskedasticity. Distance weighted averages of each property attribute in the neighbourhood and

interactions of property attributes with each principal component are used to detect any spatial effect on sale

price variations. This first-stage spatial hedonic model approximates market prices which are then used in

order to compare "expected" and actual property tax amounts, which are added to obtain a second-stage model

incorporating fiscal effects on house values. Interactions between geographical factors and property specifics

are computed using formulas avoiding multicollinearity problems, while considering several processes

responsible for spatial variability. For each property attribute, they define sub-models which can be used to

map variations, across the city, of its marginal value, assessing the cross effect of geographical location (in

terms of neighbourhood profiles and accessibility to services) and its own valuation parameters. Moreover,

this procedure distinguishes property attributes exerting a stable contribution to value (constant over the entire

region) from those whose implicit price significantly varies over space.

Keywords Hedonic modelling, Housing markets, Neighbourhood effects, Location, Spatial

interaction, Spatial gradients.

Page 4: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

2

Problem statement

This piece of research presents a procedure aimed at handling interactions between spatial factors and housing

attributes. In traditional hedonic modelling, the contextual variations over space are usually specified using

“fixed” coefficients to assess their direct effect on housing values. This is based on the assumption that the

marginal prices of structural housing attributes are invariant across the city. This stable price assumption does

not hold if various types of households have different needs and are not, at the same time, distributed evenly

within the city. These households form the market of home buyers and sellers. The heterogeneity of their

distribution and tastes could locally distort the demand for specific structural attributes and amenities of

homes, thus creating significant geographical trends that should be reflected in the coefficients of hedonic

models.

Moreover, any spatial heterogeneity characterising the valuation process will produce spatial autocorrelation

among model residuals, if it is not appropriately handled in the model specification. Following Can (1990), we

believe that the influence of property specifics on house prices is, itself, influenced by the spatial variability of

demand, which is linked to heterogeneity in the distribution of household types and services within the city.

Therefore, hedonic modelling procedures cannot rely entirely on fixed space coefficients. There is a clear need

to test, and eventually incorporate, interactions between the structural characteristics of urban space and those

housing specifics which are putatively linked to them.

According to Griffith (1992, p. 278), “spatial autocorrelation may be defined as an average correlation

between observations based upon replicated realisations of the geographic distribution of some attribute.” It is

often linked to processes of diffusion occurring in geographical space. Anselin and Can (1986) have linked it

to the spatial distribution of externalities which produces urban density gradients. Autocorrelation is highly

detrimental to the efficiency of statistical tests used to assess the statistical significance of OLS (ordinary least

square) regression coefficients (Anselin, 1990b). In applied regression analysis, two methodological concerns

should be addressed: (1) testing for the presence of spatial heterogeneity and (2) implementing alternative

estimation techniques. Anselin (1990b, p. 186) suggests that: “With respect to estimation, the best-known

Page 5: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

3

approach is probably the spatial expansion method of Casetti (1972, 1986), which introduces models of spatial

drift in the regression coefficients.” The expansion method has been used in numerous geographical

applications for investigating spatial and temporal drifts, especially in the study of migration and labour

market transformation. Can (1990) was first to use it in the context of hedonic modelling. This paper extends

its procedure, using several neighbourhood quality indexes, and combining them with comparisons of each

property with its nearest immediate neighbours. Thus, it is allowing for a much more sensitive assessment of

spatial dependence.

Conceptual framework

During the last twenty years, rapid social and economic changes have been restructuring the distribution of

activities in general, and housing demand in particular, in North American cities. These changes are related to

the pervasive processes associated with so-called “post-fordist” and “post-modern” emergent forms. The

social and demographic correlates of these emergent forms include three interrelated tendencies capable of

profoundly modifying the structure of residential markets. First, rapid increase in female labour force

participation rates, especially during the seventies, has diversified household profiles. Now, many working

couples, dual earner families and non-family households tend to rely on market solutions and practise

domestic outsourcing, because of their small size, the time constraints they operate under, or their high

disposable income (Rose and Villeneuve, 1993 and 1998). Second, rising income inequalities in the

occupational sphere, as well as among households now counting on varying numbers of working members,

produce forms of social polarisation that are inscribed into urban social space (Bourne, 1993). Third, the

development of the service sector of most cities, together with the decentralisation of manufacturing activities,

has also affected the social profile of many neighbourhoods, especially by redistributing services' locations.

All these changes have an effect on residential relocation behaviour, which is highly related to household

cycles (Nijkamp et al., 1993). As social diversity is rapidly increasing within cities, it is less likely to translate

into spatial variation in the price of housing structured around monocentric rent gradients. Distance from the

CBD as the sole measure of access to employment and consumption opportunities becomes less relevant

Page 6: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

4

because households value access to places distributed at various locations. Therefore, assessing accessibility

requires the measurement of distance and travel time to a larger set of amenities. For Baltimore, Dubin and

Sung (1987) have found that the influence of suburban employment centres on housing prices is restricted to a

two-kilometre radius. Considering the existence of several non-CBD peaks in the rent gradient, estimation of

an area-wide function may give misleading results concerning the influence of the CBD on housing prices.

In the field of ecology, Legendre (1993) makes a useful distinction between “true” and “false” gradients. For a

trend (true gradient), values observed at specific locations can be modelled using a polynomial function of

their geographic coordinates (trend surface), plus an error term that is specific to each location. Error terms

around the trend surface are not spatially autocorrelated. A true gradient violates the stationarity assumptions

of most spatial-analysis methods and should be removed from the data. Otherwise, it should be explicitly

integrated in the model. Such trends are the results of exogenous processes, and they invalidate the

assumption of independence among the observations. In a “false” gradient, the trend-like structure is caused

by spatial autocorrelation. There is no change in expected value throughout the surface, although the value

observed at each locality is partly determined by neighbouring values: ( )∑ ++= ijjiij zfbz ε''0 . In this

model, the second term represents the sum of the effects of the points located within some distance from the

observation point. Here, autocorrelation is an outcome of endogenous processes (E.g. diffusion, competition,

association) while the phenomenon remains stationary. This raises the central issue of distinguishing true

gradients from false ones, a distinction which partly depends on the overall extension of the study area and on

the significance scale of the phenomenon variation over space.

For hedonic modelling purpose, we must distinguish three types of gradients: city-wide gradients related to

structural factors (E.g. urban form, socio-economic status of neighbourhoods, location rent), local gradients

linked to externalities (E.g. noise caused by a motorway), and local gradients reflecting local market internal

dynamics (E.g. diffusion of home renovation among neighbours). The first two types are true gradients

because they are mostly related to exogenous effects (not entirely driven by the housing market), the last one

is a “false” gradient related to an endogenous emulation effect. If an independent variable is spatially

structured and is assumed to be driving price formation, then, a true gradient, where present in both the

Page 7: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

5

controlling and the dependent variables, is seen as a trend and should be explicitly handled (or removed using

trend surfaces and/or autoregressive techniques). If, however, the investigator has good reasons to believe that

the spatial structure (yielding autocorrelation on the independent variable) is the result of local market internal

dynamics (endogenous processes), then the situation should be assimilated to a “false” gradient which should

not be removed before modelling because this effect, being caused by the market itself, will not generate

additional spatial autocorrelation among the residuals of the hedonic model. These differences between types

of gradients are important when tests of statistical hypotheses are involved, and not important when the

purpose is only to describe spatial structures.

Socio-demographic profiles of the population are certainly major determinants of housing market

differentiation and form city-wide trends (exogenous effects). However, the distribution of population is also

constrained by the housing market (endogenous effect). Those effects are intermingled as demand and supply

are driven by the availability and affordability of housing. Therefore, their interactions with housing attributes

should be modelled explicitly and translated into space-varying coefficients using Casetti’s expansion method

to take into account the interactions of the housing market with urban socio-economic and accessibility

gradients.

How should we conceptualise these interactions? Let us first note that macro-economic analyses of the

housing sector pay great attention to household formation and household income as determinant of demand.

The rate of household formation is probably, with household size, the main quantitative component of housing

demand, determining as it does the number of units demanded, while household income certainly constitutes a

major qualitative component of housing demand (Miron, 1987). As put by Muth and Goodman (1988), both

determinants are closely related since household formation is as dependent upon economic conditions as it is

upon mere demographic factors. It so happens that household status and socio-economic status form basic

dimensions of residential differentiation in urban space, as identified through factorial ecologies (Le Bourdais

and Beaudry, 1988; Wyly, 1999). The others dimensions refer to ethnicity, a component that can be discarded

in the Quebec urban region because the population is highly homogenous in terms of language, origin and

religion. In general, for very similar houses within specific segments of the housing market, we would expect

Page 8: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

6

a positive impact on demand, and hence, on prices, in neighbourhoods showing large households and/or high

rates of household formation. Similarly, with respect to socio-economic status, we should expect increasing

demand at the upper end of the market segment.

Furthermore, we should also expect the presence of local endogenous effects among housing attributes of

neighbouring homes. As pointed out by Can (1990, p. 256), most people prefer to live in neighbourhoods

where they think the return for their housing investment will be highest. For the same reason, people are

willing to invest in maintaining dwellings where the return on such expenditures are sufficiently high. This

suggests that home owners are observing their immediate neighbours and will be more prone to improve their

property if the neighbourhood itself is upgrading. Conversely, home buyers generally try to find homes in

neighbourhoods having socio-economic status similar to their own, implying that similar people, with the

same needs, tend to agglomerate at specific locations. Hence the intrinsic nature of spatial dependence which

governs real estate markets, each social group tending to value specific sets of housing attributes. Conversely,

a specific attribute may be valued differently by various segments of the population. If similar people are

concentrating at certain locations, thereby increasing residential segregation between social groups, then this

could seriously distort the valuation of specific amenities, hence the need to explicitly measure the conformity

of a house with its neighbours when put on the market. Obviously, a cosy house surrounded by poorly-

maintained properties will loose a large part of its value, eventually more if it belongs to the higher segment of

the local market.

Previous work

The hedonic approach, on which is based this investigation, is applied to the residential market of the Quebec

City region. As is known, this approach aims at explaining property prices on the basis of their physical and

neighbourhood-related characteristics. Its purpose is to evaluate the respective contribution of each attribute of

the residential bundle to market value (Can, 1990 and 1993; Dubin, 1998), using multiple regression analysis.

From a conceptual point of view, land – and property - prices are a combination of externality effects and

location rents (Krantz et al., 1982; Hickman et al., 1984; Shefer, 1986; Yinger et al., 1987; Strange, 1992;

Page 9: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

7

Can, 1993; Dubin, 1998). Hoch and Waddell (1993) point out that the overlapping of access and

neighbourhood characteristics leads to highly complex influences on rent levels and values. As shown by Des

Rosiers et al. (1996 and 2001), specific transformation may be applied to the hedonic equation to account for

the non-monotonicity of some of the distance functions.

While hedonic models have long proved their usefulness as an analytical device, previous research has shown

that substantial portions of price variability remain unexplained (Anselin and Can, 1986; Dubin and Sung,

1987; Can, 1993; Dubin, 1998). Moreover, the appropriate neighbourhood factors needed to improve hedonic

models will vary among locations and market segments (Adair et al., 1996), making it difficult to integrate all

significant factors. These may include socio-demographic, household mobility and local housing stock

structural attributes, as well as macroeconomic determinants operating at a micro level (Adair et al., 1998).

Finally, multicollinearity of model attributes, as well as structural heteroskedasticity and spatial

autocorrelation among residuals are detrimental to the stability of regression coefficients, and even more to

the accuracy of their standard errors (Dubin, 1988; Anselin and Rey, 1991; Can and Megbolugbe, 1997; Basu

and Thibodeau, 1998; Pace et al., 1998; Des Rosiers and Thériault, 1999). These are issues deserving

substantial research efforts for improving robustness of hedonic models.

Previous research (Des Rosiers et al., 2000) demonstrates how principal component analysis can be

introduced into hedonic models of housing markets. Highly significant additive models of house prices were

built, while simultaneously controlling for multicollinearity and heteroskedasticity. The usefulness of

principal components to summarize geographical factors was illustrated using two sets of neighbourhood

attributes and factoring them using Hotelling’s (1933) method.

Take in Table I

The first set is formed of fifteen accessibility variables at various points of the road network computed

through simulation of travelling times in a transportation GIS (Thériault et al., 1998 and 1999). This set of

variables is decomposed into a regional component and a local component (Table I). The regional component

refers to locations with poor (positive loadings and factor scores) and good (negative loadings and factor

Page 10: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

8

scores) access to regional-level services (universities and colleges, regional shopping centres, CBD). The local

component refers to locations with poor and good access to community-level services (primary schools, high

schools, neighbourhood shopping centres). These externalities have been previously identified for their

specific contribution to house values (Colwell, 1990, 1998; Dubin, 1992, 1998; Guntermann and Colwell,

1983; Shefer, 1986; Sirpal, 1994; Des Rosiers et al., 1996, 1999, 2001).

Another principal component analysis was performed on thirty socio-economic characteristics of

neighbourhoods taken from the 1991 population census, and known to be associated with spatial variations in

house values (Jackson, 1979; Anselin and Can, 1986; Can, 1990, 1992; Can and Megbolugbe, 1997; Dubin

and Sung, 1987; Dubin, 1992, 1998; Rodriguez et al., 1995; Des Rosiers and Thériault, 1992, 1999; Strange,

1992) or identifying socio-demographic and neighbourhood features having a potential qualitative or

quantitative effect on housing demand (E.g., children in families, age groups of population, single-person

households, age of houses in the neighbourhood, population density). Four components are found to extract

75% of the total variance and identify complementary dimensions of socio-economic urban structure (Table

II).

Take in Table II

The first component measures household structure, especially household size, larger households tending to be

family households and more frequently homeowners (positive loadings and factor scores), while the smaller

ones being more frequently non-family households and tenants (negative loadings). This component shows a

centre-periphery spatial gradient. We expect the scores on this component to be positively related to house

prices (families needing larger homes). The second component is more specific, opposing young families with

children living in newer suburbs and in gentrified neighbourhoods (positive loadings) to empty-nesters and

retirees living in older suburbs (negative loadings). With the aging of the population that pushes demand up in

older suburbs, scores on this component are negatively related to house values, especially since the first

component has already been hypothesised to account for the effect of larger households on house prices. The

third component clearly expresses the spatial structure of socio-economic status, with the upper town and its

Page 11: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

9

westward extension showing high positive scores. As found by Can (1990) in Columbus (OH), we expect here

a strong positive influence of this component on prices. The fourth factor is much more specific. It opposes

young adults (positive loadings) still living with their parents in low density suburbs to retirees living in the

old city core (negative loadings). Similarly, because of a well-off aging population, a negative relationship

between this component and house values is expected.

With these two principal component analyses, Des Rosiers et al. (2000) were able to replace, in the hedonic

model, 45 highly interrelated variables by six independent (or nearly independent) factors obtained after

Varimax rotation with Kaiser’s normalisation is applied to the initial set of components in order to optimise

their contrasts. The principal components are mutually orthogonal within each set, a feature which provides an

efficient mean of controlling multicollinearity problems while including highly correlated neighbourhood

effects. Indeed, accessibility and socio-economic features of neighbourhoods are well known for their role in

the household location choice process (Landau et al., 1981; Shefer, 1986) through the complex interplay of

local positive and negative externalities (Colwell, 1990; Grieson, 1989; Hickman, 1984; Krantz et al., 1982;

Yinger et al., 1987). However, adding all these neighbourhood effects in hedonic models often raises

heteroskedasticity problems and hides the mechanisms by which they influence house values. Is it because

people are selecting their home location based on their neighbours’ socio-economic status (social attraction

process) or because the property attributes they are searching for appear only at some specific places, that is in

neighbourhoods offering the appropriate socio-economic status (enabling factor)? Is it because some major

externalities are concentrated in space (e.g. quality of view, noise related to motorways, power lines) (site

effect) or because amenities are not evenly distributed as a consequence of transportation-induced

disturbances (communication effect)? Obviously, simply adding geographical factors in hedonic models

cannot distinguish among all these effects.

This paper aims at illustrating the efficiency of interaction indexes to help answer these questions. Previous

work by Pavlov (2000) shows that space-varying regression coefficients can reveal geographical components

behind housing markets and detect the spatial dependence affecting value coefficients. Is the value of a square

meter of living area valid for the entire metropolitan region or is it changing over space? What could explain

Page 12: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

10

these variations? It is the kind of issue Pavlov (2000) addresses using a semi-parametric approach. The same

issue is dealt with here, although using an approach based on OLS. Spatial interactions are formed using

Casetti’s expansion method (spatial trends or true gradients) and consideration is given to local endogenous

market features (false gradients) using distance-weighted averages of neighbouring property attributes.

Hedonic Modelling Procedure

Our case study develops hedonic models using as units of observation a sample of 4040 bungalows (one-story

single family house) sold within the Quebec Urban Community (QUC) from January 1990 to December 1991

(Figure 1). Each property is described using a large set of property-specific attributes (Table III) and

neighbourhood-related attributes. Among these neighbourhood characteristics, accessibility components are

linked with the home through selecting the nearest street corner and socio-economic components are

attributed using a point-in-polygon algorithm. Other externalities are measured using buffers around, or

distance statistics from, features that are known to produce externalities (E.g. motorways, power lines). Table

III shows the attributes found to have a significant relationship (Models A to E below) with the sale price of

bungalows sold in the QUC in 1990-91. It is a mix of property specifics, externality indicators and principal

components of neighbourhood-related features.

Take in Figure 1

Take in Table III

In order to provide a way to counter-validate results, the hedonic models are built using 90% of transactions,

selected at random with a proportional stratification by municipality. This provides 3633 cases for model

building, the 407 remaining observations being kept apart for an independent model effectiveness assessment.

Table III shows averages for each set of transactions and clearly illustrates the similitude of both model and

control samples on almost every attribute.

Although quite useful for explanatory purposes, integrating large sets of variables into a single regression

model may prove problematic. Multicollinearity, temporal or spatial autocorrelation and heteroskedasticity

Page 13: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

11

isues should find solutions in order to allow reliable hypothesis testing (Anselin and Can, 1986; Anselin and

Rey, 1991; Goodman and Thibodeau, 1995). In particular, sorting out accessibility and neighbourhood

attributes can prove quite tricky considering the cross-influences between these two sets of factors. The

general procedure to be followed involves a succession of checks (Figure 2) designed to achieve optimal

model performances, subject to coefficient stability in the regression model. The order of checks is relevant.

While heteroskedasticity can exist without autocorrelation, temporal or spatial autocorrelation generates

heteroskedasticity. Therefore, two strategies could be used to avoid heteroskedasticity. The first one is to

define housing markets controlling simultaneously for home types, macro economic trends, homogeneity of

geographical areas and similar price segments (E.g. lower/middle/higher segments). This leads to very local

market segmentation valid for a specific period of time (economic cycle). Such models are useful for

assessment purpose and are using only property specifics. However, this approach cannot be used to study

urban dynamics at the city-wide level. Hence the need for the second strategy that specifically handles

variation of price effects over time and space, leading to autocorrelation problems.

Take in Figure 2

In the worked example, there is no temporal trend in the data and our attention will be focussed on

reducing spatial autocorrelation among model residuals. According to Anselin (1990b, p. 186),

“Significance tests and measures of fit that ignore spatial autocorrelation may be misleading. Also,

the presence of spatial error autocorrelation can make the indication tests for error heteroskedasticity

highly unreliable.” In fact, ecological fallacy effects often arise strictly from the presence of spatial

dependency (Griffith, 1992). Therefore, causality could falsely be attributed to the environmental

variables, when in fact the correlation merely reflects a common spatial structure present in both the

dependent and the independent data sets as a result of either inappropriate spatial aggregation or

some exogenous common underlying factor. Spatial autocorrelation is based on positional

information of geo-referenced data which is not captured by classical statistics, including OLS.

However, in most geographical study, including urban economics, location conveys non-trivial information

Page 14: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

12

needed to understand the mobility of persons, land use patterns and socio-economic structures. According to

Griffith (1992, p. 273), spatial autocorrelation “[…] may be defined as a measure of true but masked

information content in geo-referenced data.” A complementary view point sees autocorrelation as an

artifact of specification error in spatial modelling. “[…] If a single variable is missing from a

regression equation, the spatial distribution of this variable constitutes a communality across

regression residuals, causing them to appear to be spatially autocorrelated. This problem is

exacerbated by multiple missing variables.” This is clearly the case for housing market modelling,

mainly because it is very difficult to quantitatively specify every factor influencing price formation.

Including all relevant attributes implies measuring such features as the quality of view, the effect of

vegetation, the attractiveness of historical monuments, the perception of crime rates by the

population, traffic noise, etc.

It is possible to work around this problem by adding autoregressive terms in the hedonic function to model

functional interdependence between the transaction price of a given house at time t and the prior sale prices

within its immediate neighbourhood (Ord and Getis, 1995). This approach was used by Can and Megbolugbe

(1997) in their hedonic model of 944 house transactions in Miami. They used various time and space lags to

build distance-weighted lagged price variables, combining these through the expansion method to assess the

spatial drift. Another way, used here, is to search for spatial dependence among housing attributes by trying to

measure the extent to which the departure of a given attribute from the surrounding values (each attribute of

each house being compared to its neighbours) can influence the spatial variation of its implicit price. Are the

“false” gradients on some attributes (neighbourhood specifics) influencing house values?

Take in Table IV

To answer this question, it is first useful to assess spatial autocorrelation (Odland, 1988). Table IV presents

autocorrelation measured among the 4040 bungalows. Their computation is based on the Moran’s I (Moran,

Page 15: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

13

1950) formula:

( )( )

( )∑

∑∑

∑∑=

=

=

=

=

−−= n

ii

j

n

ii

ijn

j ij

n

i

ijn

j ijzz

zzzzd

d

nI

1

2

1

,

12

1

,

12

1

1.

Where, zi and zj are values at locations i and j; z is the average of all n values; 2ijd is the squared Euclidean

distance between location i and j. The use of squared-inverse distance is based on the assumption that mutual

influence among perceptions follows a gravity-like process (Niedercorn and Ammari, 1987). These

autocorrelation coefficients were computed using, for each home, the 15 nearest neighbours transacted in a

circular radius of less than two kilometres. This yielded 60340 pairs of neighbours (some have less than 15

neighbours within the distance threshold). Cases (8) having less than five neighbours in the two kilometre

radius were discarded from the analysis. Here, due to the high density of transactions in urban areas, we can

use the 15 nearest transactions as a “proxy” to compute the average of the property’s neighbourhood on each

housing attribute. Results indicate that most of the housing attributes are significantly autocorrelated. Only

three very scarce attributes did not show autocorrelation of their values (presence/absence) because less than

7% of the properties had these attributes: InfCeilQual (0.8%), CentVacuum (6%) and Balcony (2.9%). As

expected, sale prices are highly autocorrelated. The same holds true for accessibility and socio-economic

factors. We can hypothesise that part of this autocorrelation is reflected in the spatial dependence among sale

prices, and that explicitly integrating it into the hedonic model should help reducing spatial autocorrelation

among residuals.

For each attribute showing significant autocorrelation two new vectors of values were computed, one

conveying the weighted average of the 15 nearest neighbours (expressed as NhbdiAttribute in the tables),

while the other is the specific difference between the value of each house and the neighbourhood average

(PdifiAttribute). This formulation is similar to that used for computing Moran’s I:

ki

d

dAttribute

AttributeNhbd

k ik

k ik

k

i ≠=

=

= ;1

15

12

15

12 and AttributeNhbdAttributeAttributePdif iii −= .

Take in Table V

Page 16: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

14

The hedonic analytical procedure rests upon five steps (models A to E, Table V), gradually improving spatial

attributes specification:

1. Build an hedonic model using only property specifics and geographical components in order to

identify attributes significantly contributing to the price (Model A).

2. Try to replace each property specific attribute by a combination of the 15 nearest neighbours

weighted average and specific departure from the neighbourhood trend; retain only those showing

significant relationships (t test) on both indicators, otherwise keep the original variable because there

is no measurable local endogenous gradient (Model B).

3. Compute interactions between each principal component and each initially defined property attribute

and externality index using the Casetti’s expansion method. In order to avoid multicollinearity

problems, both indicators should be centred before making the product:

))(( eFactorScoreFactorScorAttributeAttribute ii −− . Repeat step 2, using both interactions

and initial variable in a combined stepwise regression after inclusion of property specifics identified

at step 1 (Model C). Eventually, remove variables that fall below significance threshold.

4. Integrate specific attributes, neighbourhood trends/specific departures from the local mean and

spatial interactions, choosing, for each attribute the combination yielding higher t tests for all

coefficients. It is preferable to include or discard simultaneously both NhbdiAttribute and

PdifiAttribute. Indeed, if one of the two fails the significance test, it is better to keep only the

original attribute since it suggests that the local (false) gradient is not significant. Notwithstanding, a

significant interaction of this attribute with a geographical component may indicate the presence of a

spatial trend (true gradient). This integrated model is called Model D.

5. From our previous research, one specific variable, the " relative tax differential" was found to have a

highly significant influence on sale prices. When people try to find housing, they generally visit

many properties on the market and make their own estimation of their value. They also consider the

tax burden and the appraisal value. In Model D, the local tax rate is already included. It assesses the

effect of taxation at the regional level (13 municipalities) using global rates at time of transaction.

However, a property may be over/under-assessed by the municipality, thereby giving rise to

Page 17: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

15

opportunities/externalities that could influence the offer price. If the property is an opportunity for

the buyer (lower taxes than expected from the price he is going to pay), or in contrast “over-taxed”,

the price will probably be adjusted somewhat (Timmermans and Golledge, 1990). The problem is

that the only indication we have to compute this differential between buyer’s and municipalities’

assessments is the sale price. Obviously, it should not be used to compute any independent variable

because this will result in heteroskedasticity. Therefore, a two-stage process is resorted to here, with

estimates from model D being used as a “proxy” for the estimated value. A relative tax differential

ratio is then computed and reintroduced in the equation at a second stage (using spatial expansion) to

build Model E (Annex 1):

{ } iiiiii EstModelDlueTaxationVaLTaxRateEstModelDLTaxRatelTaxDif )01.0()01.0(100Re −=

Summary of Results

Table V shows summary results from the five models. At each step, the adjusted R-square is increasing, from

0.768 to 0.822. Standard errors of estimates are decreasing, from 0.118 to 0.104, yielding fairly good

estimates of housing prices. Thus, adding 23 spatial variables has a rather marginal effect on explanatory

power. Standard deviations of residuals are falling very slowly from 0.1177 to 0.1027 for the model sample,

which is consistent with the trend for the control sample (0.1247 to 0.1055). F ratios decrease until model D

(366 to 247) but show a net improvement for model E (324), which has 56 significant independent variables.

All model estimates show spatial autocorrelation higher than that of the dependent variable (natural logarithm

of the sale price). However, Moran’s I for estimates of Model E (0.549) seems closer to that of the actual sale

prices and shows the same distance range (1125 metres).

The most important improvement concerns the reduction of spatial autocorrelation among models’ residuals.

It is dropping by nearly one half from Model A (0.16) to Model E (0.08). This is a clear indication that well

specified interaction variables can help reducing the spatial autocorrelation among residuals, thereby

improving in the long term, the reliability of OLS hypothesis testing. More importantly, despite the inclusion

of “false” gradients which did not lower the spatial range of autocorrelation among residuals (see correlogram

of Model B in Figure 3; Table V, section b), the inclusion of spatial interaction terms and of the relative tax

Page 18: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

16

differential did a fairly good job, reducing the range of significant autocorrelation to about 375 metres (Table

V, sections c and e; Figure 3).

Take in Figure 3

A crude test for heteroskedasticity was carried out (Table V). It is using the Goldfeld and Quandt (1965)

approach dividing the total sample of transactions in three parts based on sale price: the lower end (40% of

transactions), the mid price (20%) and the higher segment of the market (40%). Table V shows sums of

squares of each model’s residuals for lower and higher market segment. Tests (F) indicate that Models “A”

and “E” seem to display a homoskedastic distribution of errors. However, Models "B", "C" and "D" have a

heteroskedastic distribution of error relative to values of the dependent variable. More sophisticated

approaches suggested by Anselin (1990a) will be needed to account for relationships of spatial variation,

Casetti's expansion and normality of residuals. However, this is far beyond the scope of this paper.

Take in Table VI

Table VI (Model A) shows a standard hedonic model based on the assumption that every coefficient is stable

in space. Multicollinearity (VIF) is low (lower than 5) reaching a maximum which is showing unsurprising

relationships between accessibility and socio-economic factors. Each coefficient expresses the relative

marginal contribution of the pertaining and all values are consistent with expectations.

Take in Table VII

Table VII (Model D) shows the results of step 4. Spatial factors and externalities are simply added to Model A

in order to test the existence of measurable spatial effects in the market under analysis. All the six components

do contribute significantly to explain sale prices and stay in the model as interactions and local endogenous

gradients are included. Slowly decreasing significance of their coefficients (Table VI versus Table VII)

suggest that a small part of their effect is transferred on the other spatial trend indicators. Twenty-three spatial

interaction and neighbourhood effects entered the model, all of these emerging as statistically significant. Lot

size, local tax rate, sheds and apparent age are showing multidimensional spatial effects. Apparent age of the

Page 19: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

17

property (age adjusted for improvements) has both interaction and local gradient components. The interaction

component expresses a structural effect linked to the historic development of the city: in mature suburbs, non-

renovated bungalows are depreciating faster than elsewhere. Conversely, the local gradient is showing a

premium for renovating homes in older central neighbourhoods. Here, it is important to note that the

functional form of the relationship (Sc means spatially adjusted coefficient):

[ ] ( ) ( ) ( )( )( )01373.0248170.2ln01848.0ln12083.0ln11291.0ln +−+−+−= iiii CSFAppAgeAppAgePdifAppAgeNhbdi eAppAgeSc could also be

written [ ] ( ) ( ) ( )( )( )01373.0248170.2ln01848.0ln00079.0ln12083.0ln +−++−= iiii CSFAppAgeAppAgeNhbdAppAgei eAppAgeSc .

This second form is more convenient and will be used to specify equations of Model E (Annex 1). It is also

noteworthy that the most important determinants of sale price show spatial trends (AppAge, LotSize, LivArea,

LtaxRate).

Take in Table VIII

Table VIII (Model E) is very similar to the previous, except for the inclusion of RelTaxDif and its spatial

trend. Property specific attributes were retained to give a clearer view of the sub-models. Annex 1 presents the

mathematical specifications of the model with all its coefficients and constants. There is an increasing

collinearity in the AppAge spatial relationship but it is still acceptable (less than 5). Local tax effects become

highly multidimensional, suggesting strong spatial variations in the way taxation influences housing prices.

Analysing this complex relationship requires more than the mapping of coefficients and will be the subject of

a forthcoming paper. It is also important to point out that most of the internal attributes of the property did not

define spatial gradients. There are three exceptions (inferior ceiling quality, kitchen cabinets made of hard

wood and number of washrooms) which can be used to distinguish cheaper and more expensive properties.

This improved form of the model clearly identifies those attributes that are space-independent (i.e. they do not

generate interaction) and those needing space-dependent adjustments. More importantly, this procedure

specifically identifies which socio-economic and/or accessibility factors are mostly responsible for (or related

to) the spatial variation of the relative value. Finally, since the interactions involve geographical factors that

Page 20: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

18

can be mapped, it is possible to compute local values of space-varying coefficients using equations of Annex

1 to build maps of their distributions.

Discussion

Take in Figure 4

Figure 4 shows an interpolated view of the location rent effect for a bungalow situated on a larger than

average lot (1000 sq. m. versus a geometric mean of 622 sq. m. for all sold properties in 1990-91). Building

this map implies calculating the following expression for every house in the market, replacing its lot size by

1000 sq. m. (fixed parameter) in order to obtain a location-dependent size-related spatially adjusted

coefficient: [ ] ( )( )( ) ( )( )( )09166.0343297.61000ln02342.001373.0243297.61000ln03047.0)1000ln05931.0(4 −−++−−+= ii CSFCSFi eMapSc . Then

local values are interpolated within a short radius (say 1000 meters) around the property, using IWD (inverse

weighted distance) with exponent 2 in order to estimate coefficients at every location. These maps were easily

realised using MapInfo GIS software. It takes about ten minutes to build such a map, allowing for easy

exploration of spatial dependence on many housing attributes. As the above expression suggests, the relative

value of land is related to socio-economic status (CSF3, positive influence) and family cycle A (CSF2,

negative scores), favouring mature suburbs inhabited by empty-nesters. The absolute effect is very strong,

from 44% to 60% of house value. The spatial pattern is closely associated with the major axis of the historic

development of the city, defining two sectors originating from Old Quebec (in white on the map because there

is no bungalow in that part of the city).

Take in Figures 5 and 6

Figures 5 and 6 show depreciation-related spatial trends for a 10 year-old and a 40 year-old house,

respectively, computed using the following functional forms:

[ ] ( ) ( ) ( )( )( )01373.0248170.210ln01855.0ln01234.010ln12424.05 +−++−= ii CSFAppAgeNhbdi eMapSc and

[ ] ( ) ( ) ( )( )( )01373.0248170.240ln01855.0ln01234.040ln12424.06 +−++−= ii CSFAppAgeNhbdi eMapSc . The absolute ranges of spatial

variability are smaller for new houses (79-75% versus 68-62%). These functions integrate two effects: a local

Page 21: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

19

gradient that compare the property to its immediate neighbourhood (less depreciated if younger than average)

and a regional trend that favours older neighbourhoods. This is consistent with Can's (1997, p. 213) findings

that an older house in a transitional neighbourhood would be less valuable than an older house in a

neighbourhood that is undergoing gentrification. The effect of aging cannot be isolated from household

maintenance/repair decisions, which are not solely determined by households income levels but also by

households’ perception concerning the future value of their residential asset, in turn primarily determined by

the “signals” they receive from their immediate neighbourhood. Maps 5 and 6 clearly support this statement:

life cycle is more related to depreciation than socio-economic status and there is more incentive for

maintenance (higher depreciation rate) in older than in newer suburbs. This could be a spatial expression of

the growing market of housing renovation carried out by aging baby boomers.

Take in Figure 7

Figure 7 shows the marginal effect on house value of adding a second washroom. It is computed using the

ratio of two coefficients (2 versus 1 washroom) : [ ]( ) ( )( )( )

( )( )( )42520.0125406.1102594.005440.0

42520.0125406.1202594.005440.02

7 −−−+

−−−+

=i

i

ACF

ACF

i eeMapSc . The

spatial trend estimator is accessibility to regional-level services. Per se, a washroom adds 5.4% to the house

value. There was an average of 1.25 washroom per house in that market and the average accessibility

component score is 0.42520. Therefore being far away from regional-level services (positive scores) nearly

counterbalances the advantage of having a second washroom, although it is needed near the centre of the

agglomeration. This is probably related to young families with middle income being forced to go to new

suburbs in order to access home ownership. These new homes are generally equipped with only one

washroom. In older neighbourhoods, the second washroom is prevalent and there is a price penalty for houses

having only one.

Take in Figure 8

The last example concerns the marginal effect of having a shed on the lot. It is linked to both socio-economic

status and family-cycle B (young adults still at home versus retirees). Its spatial trend is shown on Figure 8:

Page 22: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

20

[ ] ( )( )( ) ( )( )( )25839.044572.0102090.009166.034572.0101686.002050.08 −−+−−−+= ii CSFCSFi eMapSc . In general, a shed adds about 2%

to house value. However, people living in neighbourhoods with higher socio-economic status do not

appreciate this amenity. These areas are mostly inhabited by professionals and retirees. A large proportion of

them outsource maintenance of their lawn and garden. Conversely other people give a higher than average

value to the presence of a shed. These are mostly sectors showing less than average household income, where

the proportion of manual workers is higher than average.

Conclusion

All in all, these worked examples clearly show the benefits of including spatial interactions with urban

dynamics within hedonic models. The purpose is not primarily to improve explanatory power, but rather to

enhance our understanding on the complex linkages between housing prices and the socio-demographic

evolution of North American cities. An important side benefit of this approach is that it clearly helps reduce

spatial autocorrelation among residuals. At the present stage, autocorrelation is not completely removed and

autoregressive methods could be helpful for handling the remaining space-related effects. We can identify at

least four ways to improve the handling of spatial dependence in hedonic models: (1) by considering a wider

range of spatial attributes, especially those related to environment; (2) by defining more flexible ways of

measuring spatial dependence (non linear functions, multivariate interactions, may be some interaction among

property specifics); (3) by considering information on the socio-economic status and perceptions of home

buyers to complement information on socio-economic profiles of their new neighbourhood; (4) by improving

measurement of interactions adjusting trend surfaces over principal components thus dividing their meaning in

two parts: firstly the city-wide trend, the departure from the trend being secondly related to peculiarities of

specific areas.

Moreover, similar models must be designed for different market segments. It is also necessary to compare

temporal trends in the housing demand itself, and in the transformation of spatial interactions linked as they

are to population aging and its long-term replacement. In this research, we were advantaged by an

insignificant temporal trend and by the absence of interaction between time and space. In 1990-91, the

Page 23: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

21

bungalow market in the Quebec urban region was experiencing stable, or slowly rising values. Comparison

with models built using data for the period of 1993 to 1996 (depressed market), and, eventually, 1997 to 2000

(progressive recovery) will allow testing for persistence of the spatial trends identified here. Finally,

additional work must be carried out to assess variations in coefficients’ error-terms over space.

Acknowledgement The authors gratefully acknowledge Martin Lee-Gosselin, Corinne Thomas, Josée Bouchard, Pierre Lemieux, Raynald Sirois, Marie-Hélène Vandersmissen and Maxime Pètre for their valuable help at various stages of this research. This project was funded by the Quebec Province’s FCAR program, the Canadian Social Sciences and Humanities Research Council, the Canadian Central Mortgage and Housing Corporation, the Canadian Network of Centres of Excellence in Geomatics (GEOIDE) and the Canadian Natural Sciences and Engineering Research Council. It was realised in close co-operation with the Quebec Urban Community Appraisal Division, the Quebec Ministry of Transport and the Quebec Urban Community Transit Society (STCUQ).

References Adair, A. S., Berry, J. N. and McGreal, S. W. (1998) “Assessing influences upon the housing market in

Northern Ireland”, Journal of Property Research, Vol. 15 No 2, pp. 121-134. Adair, A. S., Berry, J. N. and McGreal, S. W. (1996) “Hedonic modelling, housing submarkets and residential

valuation”, Journal of Property Research, Vol. 13 No 1, pp. 67-83. Anselin, L. (1990a) “Some robust approaches to testing and estimation in spatial econometrics”, Regional

Science and Urban Economics, Vol. 20 pp. 141-163. Anselin, L. (1990b) “Spatial dependence and spatial structural instability in applied regression analysis”,

Journal of Regional Science, Vol. 30 No 2, pp. 185-207. Anselin, L., and Can, A. (1986) “Model comparison and model validation issues in empirical work on urban

density functions”, Geographical Analysis, Vol. 18, pp. 179-197. Anselin, L., and Rey, S. (1991) “Properties of tests for spatial dependence in linear regression models”,

Geographical Analysis, Vol. 23 No 2, pp. 112-131. Basu, S., and Thibodeau, T. G. (1998) “Analysis of spatial autocorrelation in house prices”, The Journal of

Real Estate Finance and Economics, Vol. 17 No 1, pp. 61-86. Bourne, L. S. (1993) “Close together and worlds apart: An analysis of changes in the ecology of income in

Canadian cities”, Urban Studies, Vol. 30, pp. 1293-1317. Can, A. (1990) “The measurement of neighbourhood dynamics in urban house prices”, Economic Geography,

Vol. 66, pp. 254-272. Can, A. (1992) “Residential quality assessment: Alternative approaches using GIS”, The Annals of Regional

Science, Vol. 26, pp. 97-110. Can, A. (1993) “Specification and estimation of hedonic housing price models”, Regional Science and Urban

Economics, Vol. 22, pp. 453-474. Can, A., and Megbolugbe, I. (1997) “Spatial dependence and house price index construction”, Journal of Real

Estate Finance and Economics, Vol. 14, pp. 203-222. Casetti, E. (1972) “Generating models by the expansion method: Applications to geographical research”,

Geographical Analysis, Vol. 4, pp. 81-91.

Page 24: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

22

Casetti., E. (1986) “The dual expansion method: An application for evaluating the effects of population growth on development”, IEEE Transactions on Systems, Man, and Cybernetics, Vol. 16, pp. 29-39.

Colwell, P. F., Gujral, S. S. and Coley, C. (1985) “The impact of shopping centers on the value of surrounding properties”, Real Estate Issues, Vol. 10 No 1, pp. 35-39.

Colwell, P. F. (1990) “Power lines and land values”, Journal of Real Estate Research, Vol. 5 No 1, pp. 117-127.

Colwell, P. F. (1998) “A primer on piecewise parabolic multiple regression analysis via estimations of Chicago CBD land prices”, The Journal of Real Estate Finance and Economics, Vol. 17 No 1, pp. 87-98.

Des Rosiers, F., Thériault, M. and Villeneuve, P. Y. (2000) “Sorting out access and neighbourhood factors in hedonic price modelling”, Journal of Property Investment and Finance, Vol. 18 No 3, pp. 291-315.

Des Rosiers, F., and Thériault, M. (1992) “Integrating geographic information systems to hedonic price modelling: An application to the Quebec region”, Property Tax Journal, Vol. 11 No 1, pp. 29-57.

Des Rosiers, F. and Thériault, M. (1999) House Prices and Spatial Dependence: Towards an Integrated Procedure to Model Neighborhood Dynamics. Laval University, Faculty of Business Admin., Québec, Publ. 1999-002.

Des Rosiers, F., Lagana, A., Thériault, M. and Beaudoin, M. (1996) “Shopping centers and house values: An empirical investigation”, Journal of Property Valuation and Investment, Vol. 14 No 4, pp. 41-63.

Des Rosiers, F., Lagana, A. and Thériault, M. (2001) “Size and proximity effect of primary schools on surrounding house values”, Journal of Property Research, Vol. 18 No 2, pp. 1-20.

Dubin, R. A. (1988) “Estimation of regression coefficients in the presence of spatially autocorrelated error terms”, Review of Economics and Statistics, Vol. 70, pp. 466-474.

Dubin, R. A. (1992) “Spatial autocorrelation and neighbourhood quality”, Regional Science and Urban Economics, Vol. 22, pp. 433-452.

Dubin, R. A. (1998) “Predicting house prices using multiple listings data”, The Journal of Real Estate Finance and Economics, Vol. 17 No 1, pp. 35-60.

Dubin, R. A., and Sung, C. H. (1987) “Spatial variation in the price of housing: Rent gradients in non-monocentric cities”, Urban Studies, Vol. 24, pp. 193-204.

Goldfeld, S. M. and Quandt, R. E. (1965) “Some tests for homoskedasticity“, Journal of the American Statistical Association, Vol. 60, pp. 539-547.

Goodman, A. C., and Thibodeau, T. G. (1995) “Age-related heteroskedasticity in hedonic house price equations”, Journal of Housing Research, Vol. 6, No 25-42.

Grieson, R. E., and White, J. R. (1989) “The existence and capitalization of neighborhood externalities: A reassessment”, Journal of Urban Economics, Vol. 25 No 1, pp. 68-76.

Griffith, D. A. (1992) „What is spatial autocorrelation? Reflections on the past 25 years of spatial statistics”, L’Espace géographique, Vol. 21 No 3, pp. 265-280.

Guntermann, K. L., and Colwell, P. F. (1983) “Property values and accessibility to primary schools”, Real Estate Appraiser and Analyst, Vol. 49 No 1, pp. 62-68.

Hickman, E. P., Gaines, J. P. and Ingram, F. J. (1984) “The influence of neighbourhood quality on residential values”, The Real Estate Appraiser and Analyst, Vol. 50 No 2, pp. 36-42.

Hoch, I., and Waddell, P. (1993) “Apartment rents: Another challenge to the monocentric model”, Geographical Analysis, Vol. 25 No 1, pp. 20-34.

Hotelling, H. (1933) “Analysis of a Complex of Statistical Variables into Principal Components”, Journal of Educational Psychology, Vol. 24, pp 417-441 and 498-520.

Jackson, J. R. (1979) “Intraurban variation in the price of housing”, Journal of Urban Economics, Vol. 6, pp. 464-479.

Page 25: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

23

Krantz, D. P., Waever, R. D. and Alter, T. R. (1982) “Residential tax capitalization: Consistent estimates using micro-level data”, Land Economics, Vol. 58 No 4, pp. 488-496.

Landau, U., Prashker, J. N. and Hirsh, M. (1981) “The effect of temporal constraints in household travel behaviour”, Environment and Planning A, Vol. 13, pp. 435-444.

Le Bourdais, C. and Beaudry, M. (1988) “The changing residential structure of Montreal 1971-81”, The Canadian Geographer, Vol. 32 No 2, pp. 98-113.

Legendre, P. (1993) “Spatial autocorrelation: trouble or new paradigm?”, Ecology, Vol. 74 No 6, pp. 1659-1673.

Miron, J. R. (2000) Housing in Postwar Canada, McGill-Queen’s University Press, Montreal. Moran, P. A. P. (1950) “Notes on continuous stochastic phenomena”, Biometrika, Vol. 37, pp. 17-23. Muth, R. F. and Goodman, A. C. (1988) The Economics of Housing Markets. Fundamentals of Pure and

Applied Economics, No. 31, Harwood Academic Publishers, 148 p. Niedercorn, J. H., and Ammari, N. S. (1987) “New evidence on the specification and performance of

neoclassical gravity models in the study of urban transportation”, The Annals of Regional Sciences, Vol. 21 No 1, pp. 56-64.

Nijkamp, P., Van Wissen, L. and Rima, A. M. (1993) “A household life cycle model for residential relocation behaviour”, Socio-Economic Planning Science, Vol. 27 No 1, pp. 35-53.

Odland, J. (1988) Spatial Autocorrelation. Scientific Geography, Vol. 9, Sage, Newbury Park. Ord, J. K., and Getis, A. (1995) “Spatial autocorrelation statistics: Distribution issues and an application”,

Geographical Analysis, Vol. 27 No 4, pp. 286-306. Pace, R. K., Barry, R. and Sirmans, C. F. (1998) “Spatial statistics and real estate”, The Journal of Real Estate

Finance and Economics, Vol. 17 No 1, pp. 5-14. Pavlov, A. D. (2000) “Space-varying regression coefficients: A semi-parametric approach applied to real

estate markets”, Real Estate Economics, Vol. 28 No 2, pp. 249-283. Rodriguez, M., Sirmans, C. F. and Marks, A. P. (1995) “Using geographic information systems to improve

real estate analysis”, The Journal of Real Estate, Vol. 10 No 2, pp. 163-173. Rose, D. and Villeneuve, P. Y. (1983) “Work, labour markets and households in transition”, in L. Bourne and

D. Ley (eds.), The Social Geography of Canadian Cities, McGill-Queens Press, Montreal, pp. 153-174. Rose, D. and Villeneuve, P. Y. (1998) “Engendering class in the metropolitan city: occupational pairing and

income disparities among two-earner couples”, Urban Geography, Vol. 19 No 2, pp. 123-159. Shefer, D. (1986) “Utility changes in housing and neighbourhood services for households moving into and out

of distressed neighbourhoods”, Journal of Urban Economics, Vol. 19, pp. 107-124. Sirpal, R. (1994) “Empirical modelling of the relative impacts of various sizes of shopping centres on the

values of surrounding residential properties”, Journal of Real Estate Research, Vol. 9 No 4, pp. 487-505. Strange, W. (1992) “Overlapping neighbourhoods and housing externalities”, Journal of Urban Economics,

Vol. 32, pp. 17-39. Thériault, M., Leroux, D. and Vandersmissen, M. H. (1998) “Modelling travel route and time within GIS: Its

use for planning”, in A. Bargiela and E. Kerckhoffs (eds.) Simulation Technology: Science and Art. Proceedings of the 10th European Simulation Symposium, The Nottingham-Trent University and Society for Computer Simulation International, pp. 402-407.

Thériault, M., Des Rosiers, F. and Vandersmissen, M. H. (1999) “GIS-based simulation of accessibility to enhance hedonic modelling and property value appraisal”, Joint IAAO-URISA 1999 Conference, New Orleans, April 11-14, 10 pages.

Timmermans, H., and Golledge, R. G. (1990) “Applications of behavioural research on spatial problems: preferences and choice”, Progress in Human Geography, Vol. 14, pp. 311-354.

Page 26: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

24

Wyly, E. K. (1999) “Continuity and change in the restless urban landscape”, Economic Geography, Vol. 75 No 4, pp. 309-338.

Yinger, J. H., Bloom, H. S. Borsch-Supan, A. and Ladd, H. F. (1987) “Property Taxes and House Values: The Theory and Estimation of Intra-jurisdictional Property Tax Capitalization”, Studies in Urban Economics, Academic Press, Princeton Univ.

Travel time computed using transportation GIS to evaluate accessibility at 17,871 local street corners in Quebec Urban Community Total variance explained

Extraction sums of squared loadings Principal component analysis

Rotation sums of squared loadings Varimax with Kaiser normalisation

Components Total % of variance

Cumulative %

Total % of variance

Cumulative %

ACF1 9.683 64.556 64.556 6.313 42.086 42.086 ACF2 1.668 11.122 75.678 5.039 33.592 75.678

Rotated component matrix Components

Travel times simulated with TransCAD using the road network with impedance and turn penalties (minutes)

ACF1 ACF2

Travel time to nearest highway entrance by car 0.647 0.352 Travel time to nearest regional shopping centre by car 0.758 0.555 Travel time to nearest local shopping centre by car 0.557 0.727 Travel time to nearest neighbourhood shopping centre by car 0.546 0.671 Travel time to nearest high school by car 0.351 0.797 Travel time to nearest college or university by car 0.911 Travel time to Laval University by car 0.916 Travel time to downtown Quebec City by car 0.610 0.534 Travel time to downtown Sainte-Foy by car 0.893 Travel time to “La Capitale” shopping centre by car 0.320 0.701 Walking time to nearest neighbourhood shopping centre 0.407 0.776 Walking time to nearest primary school 0.724 Walking time to nearest high school 0.847 Walking time to nearest college or university 0.801 Walking time to Laval University 0.875 0.361 Accessibility to

regional services Accessibility to neighbourhood

services Interpretation of positive values

Far away from regional-level

services

Far away from local-level services

Interpretation of negative values

Close to regional-level

services

Close to local-level services

Adapted from Des Rosiers, F., Thériault M. and Villeneuve P. (2000) Sorting out access and neighbourhood factors in hedonic price modelling. Journal of Property Investment and Finance, vol. 18, no 3, p. 291-315.

Table I. Principal component analysis of distance to services

Page 27: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

25

Principal component analysis on 1991 census attributes of 416 neighbourhoods in Quebec Metropolitan Area Total variance explained

Extraction sums of squared loadings Principal component analysis

Rotation sums of squared loadings Varimax with Kaiser normalisation

Components Total % of variance

Cumulative % Total % of variance

Cumulative %

CSF1 11.798 39.326 39.326 10.931 36.438 36.438 CSF2 4.724 15.748 55.074 4.789 15.964 52.402 CSF3 4.076 13.588 68.661 4.774 15.914 68.317 CSF4 1.791 5.971 74.632 1.895 6.315 74.632

Rotated component matrix Components

1991 Census Attributes CSF1 CSF2 CSF3 CSF4 % of persons 0-14 years -0.811 0.505 % of persons 15-24 years 0.808 % of persons 25-44 years 0.933 % of persons 45-64 years -0.883 % of persons 65+ years 0.648 -0.548 -0.370 % of women 0.545 -0.339 Persons per household -0.959 % of non-family households 0.945 % of single person households 0.933 Children per family -.0574 % of lone-parent families 0.759 -0.304 % of families with children -0.820 % of families with children 0-6 years 0.850 % of families with children 6-14 years -0.585 0.557 % of detached dwellings -0.932 % of dwellings in large buildings 0.473 Persons per room 0.425 -0.668 % of dwellings built before 1946 0.525 -0.494 % of dwellings built 1946-1960 0.309 -0.504 % of dwellings built 1961-1970 -0.575 0.636 % of tenants 0.910 % of households with housing costs >30% of income 0.635 -0.306 % of adults with highschool degree 0.903 % of adults with university degree 0.944 % of men with college degree 0.935 % of women with college degree 0.919 Average household income ($) -0.699 0.629 % households mowing during last 5 years 0.605 0.608 Population density (persons/hectare) 0.760 Dwelling density (density/hectare) 0.804 Centrality Family cycle

A Socio-economic

status Family cycle

B Interpretation of positive values

Small and non-family households tenants in the city

centre

Young families with children living in new

suburbs

Well educated persons with high

income in the upper town

Young adults living with their parents in low

density suburbs Interpretation of negative values

Family households homeowners in

suburbs

Empty-nesters and retirees living in

older suburbs

Low educated poor persons living mainly

in the lower town

Retirees living in the old city core

Table II. Principal component analysis of socio-economic census attributes

Page 28: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

26

3633 bungalows used in the model Control

sample N=407

Variable Definition Type Minimum Maximum Average Std. Dev AverageSalePrice Sale price of the property ($) N 50500 336000 87779.23 25911.57 88382.69LnSalePrice Natural logarithm of sale price ($) N 10.82973 12.72487 11.34922 0.245897 11.35537LnAppAge Natural logarithm of apparent age (years) N -.69315 3.91202 2.48170 .89609 2.53634LnLotSize Natural logarithm of the lot size (sq. m.) N 5.27300 9.41638 6.43298 .32269 6.45284LivArea Living area of the property (sq. m.) N 16.36000 264.26000 96.28459 17.61973 96.56872WaterSewer Property is (1) / is not (0) linked to the municipal

aqueduct and sewer networks B 0 1 .99009 .09906 .98526

InfFound Property has(1) / does not have (0) inferior quality foundations

B 0 1 .03799 .19119 .02703

BaseFinh Basement is (1) / is not (0) finished B 0 1 .56813 .49541 .56757CathCeil Property has (1) / does not have (0) cathedral

ceiling B 0 1 .17066 .37626 .18182

InfCeilQual Property has (1) / does not have (0) inferior quality ceiling

B 0 1 .00853 .09199 .01229

Skylight Property has (1) / does not have (0) skylights B 0 1 .01239 .11062 .01229Quality House quality index (number of attributes

– below / + above average) N -2 2 -.03413 .21999 -.01966

Washrooms Total number of washrooms (bath = 1; toilet = 0.5)

N .5 5.5 1.25406 .32957 1.26536

Fireplace Number of fireplaces N 0 4 .18827 .41821 .18182HardwStair Presence (1) / absence (0) of an indoor staircase

made of hard wood B 0 1 .07707 .26674 .08845

Oven Presence (1) / absence (0) of a built-in oven and/or cooking top

B 0 1 .10570 .30749 .12285

KitchCab Presence (1) / absence (0) of kitchen cabinets made of hard wood

B 0 1 .11781 .32243 .12531

Dishwasher Presence (1) / absence (0) of a permanent dishwasher

B 0 1 .55216 .49734 .62162

CentVacuum Presence (1) / absence (0) of a central vacuum system

B 0 1 .05863 .23496 .07125

Veranda Presence (1) / absence (0) of a veranda B 0 1 .47702 .49954 .48403Balcony Presence (1) / absence (0) of a balcony B 0 1 .02945 .16909 .02703ExcavPool Presence (1) / absence (0) of an excavated pool B 0 1 .03881 .19317 .04668AttGarage Presence (1) / absence (0) of an attached garage B 0 1 .02753 .16363 .01966DetGarage Presence (1) / absence (0) of a detached garage B 0 1 .10515 .30679 .12776Shed80cf Presence (1) / absence (0) of a shed with more

than 80 cubic feet B 0 1 .45720 .49823 .46437

LTaxRate Local tax rate ($/$100 of assessed value) N 1.25310 3.08350 2.38474 .42820 2.37913TaxationValue Property value for taxation purpose ($) N 38000 307000 81946.42 23015.95 84044.23RelTaxDif Relative tax differential; adjusted tax using

estimated price – effective tax (% of estprice) N -2.19988 1.27402 .14690 .25444 .11640

NbMonthJ90 Number of month elapsed since January 1990 N 1.000 24.000 12.196 6.735 12.13022Motorway150 Property is (1) / is not (0) located at less than 150

metres from a motorway B 0 1 .01927 .13748 .00983

DistMotorExit Euclidean distance from the nearest motorway (km)

N .034 5.661 1.50665 .92099 1.52957

ACF1 Accessibility comp. #1 – Regional-level services N -1.92565 2.50911 .42520 .91465 .42634ACF2 Accessibility comp. #2 – Local-level services N -1.44784 3.83155 .01101 .78073 .00252CSF1 Census comp. #1 – Centrality N -1.61231 2.55393 -.64900 .52924 -.65465CSF2 Census comp. #2 – Family cycle A N -2.29043 2.66786 -.01373 1.19485 -.06834CSF3 Census comp. #3 – Socio-economic status N -1.99671 2.77619 .09166 .86869 .08234CSF4 Census comp. #4 – Family cycle B N -2.99326 3.29053 .25839 .77453 .25282

Table III. Operational definition of variables and descriptive statistics

Page 29: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

27

4040 houses; 60340 valid pairs of neighbouring properties within a distance of 2000 metres.

Variable Distance Range of Significant

Autocorrelation (metres)

Moran’s I 0-2000 m

Probability (Ho I = 0)

Correlogram Shape

SalePrice 0-1125 0.39332 < 0.0001 Increase 0-625; fast decrease 625-1125 LnSalePrice 0-1125 0.44847 < 0.0001 Stable 0-1000; fast decrease 1000-1125 LnAppAge 0-1250 1.07452 < 0.0001 Fast decrease 0-375; decrease 375-1250 LnLotSize 0-1500 0.45483 < 0.0001 Fast decrease 0-250; stable 250-1500 LivArea 0-1125 0.42225 < 0.0001 Decrease 0-1125 WaterSewer 0-750 0.04132 0.0320 Stable 0-750 InfFound 0-750 0.10212 < 0.0001 Decrease 0-750 BaseFinh 0-875 0.19180 < 0.0001 Decrease 0-250; stable 250-875 CathCeil 0-500 0.46591 < 0.0001 Very fast decrease 0-500 InfCeilQual none 0.00928 0.3352 Non significant Skylight 0-500 0.03823 0.0428 Decrease 0-500 Quality 0-500 0.31956 < 0.0001 Very fast decrease 0-500 Washrooms 0-1125 0.13714 < 0.0001 Stable 0-1125 Fireplace 0-750 0.14418 < 0.0001 Decrease 0-750 HardwStair 0-750 0.08716 < 0.0001 Low 0-750 Oven 0-250 0.06379 0.0021 Decrease 0-250 KitchCab 0-500 0.15552 < 0.0001 Fast decrease 0-500 Dishwasher 0-500 0.07691 0.0003 Slow decrease 0-500 CentVacuum none 0.00215 0.4573 Non significant Veranda 0-750 0.10982 < 0.0001 Decrease 0-750 Balcony none 0.00630 0.3846 Non significant ExcavPool 0-750 0.06855 0.0010 Low 0-750 AttGarage 0-1000 0.11599 < 0.0001 Increase 0-750; fast decrease 750-1000 DetGarage 0-750 0.04978 0.0127 Stable 0-500; decrease 500-750 Shed80cf 0-1000 0.15184 < 0.0001 Decrease 0-1000 LTaxRate 0->2000 0.85783 < 0.0001 Stable 0-750; slow decrease 750-2000 TaxationValue 0-1150 0.45611 < 0.0001 Stable 0-1000; fast decrease 1000-1150 RelTaxDif 0-1125 0.33982 < 0.0001 Decrease 0-1125 NbMonthJ90 none 0.03363 0.0651 Non significant Motorway150 0-1000 0.51851 < 0.0001 Fast decrease 0-1000 DistMotorExit 0->2000 0.86620 < 0.0001 Stable 0-2000 ACF1 0->2000 0.89417 < 0.0001 Stable 0-2000 ACF2 0->2000 0.83594 < 0.0001 Stable 0-1850 CSF1 0->2000 0.62426 < 0.0001 Stable 0-1500; increase 1500+ CSF2 0->2000 1.18043 < 0.0001 Regular decrease 0-2000 CSF3 0->2000 0.80025 < 0.0001 Stable 0-1500; decrease 1500+ CSF4 0-1250 0.82272 < 0.0001 Fast decrease 0-1250

Table IV. Spatial autocorrelation of property and geographical attributes

Page 30: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

28

a) Standard hedonic model of property attributes and geographical components

N of Cases : 3633 Indep. Var : 33 Max. VIF : 4.566 Adj. R-sq: 0.768 SEE : 0.1182 Constant : 10.60501 F ratio : 366.1 Df : 33 / 3599 Prob. : 0.000

Model Minimum Maximum Mean Std. Dev. N of cases Predicted 10.707112 12.749734 11.348533 0.215678 3633 Residual -0.460806 0.452916 0.0 0.117705 3633

Control sample Minimum Maximum Mean Std. Dev. N of cases Predicted 10.773187 12.218870 11.352782 0.222641 407 Residual -0.610692 0.621067 0.002583 0.124783 407

Autocorrelation Moran’s I Probability N of cases Pairs Range (m) Dependent 0.44847 < 0.0001 4040 60340 0-1125 Predicted 0.60066 < 0.0001 4040 60340 0-1375 Residuals 0.15971 < 0.0001 4040 60340 0-875 Heteroskedasticity SS Res. Low SS Res. High F D. of freedom Prob. Goldfeld-Quandt 24.0253 25.6554 1.0678 1616, 1616 0.0936

b) Weighted average of the 15 nearest neighbours and property difference on independent variables

N of Cases : 3633 Indep. Var : 44 Max. VIF : 4.659 Adj. R-sq: 0.773 SEE : 0.1170 Constant : 10.34441 F ratio : 282.4 Df : 44 / 3588 Prob. : 0.000

Model Minimum Maximum Mean Std. Dev. N of cases Predicted 10.684851 12.719604 11.348533 0.216437 3633 Residual -0.470449 0.467345 0.0 0.116304 3633

Control sample Minimum Maximum Mean Std. Dev. N of cases Predicted 10.768578 12.205505 11.352311 0.222562 407 Residual -0.606884 0.614113 0.003054 0.124341 407

Autocorrelation Moran’s I Probability N of cases Pairs Range (m) Dependent 0.44847 < 0.0001 4040 60340 0-1125 Predicted 0.61545 < 0.0001 4040 60340 0-1500 Residuals 0.15623 < 0.0001 4040 60340 0-875 Heteroskedasticity SS Res. Low SS Res. High F D. of freedom Prob. Goldfeld-Quandt 23.2169 25.2566 1.0878 1616, 1616 0.0453

c) Interactions between property attributes and geographical factors

N of Cases : 3633 Indep. Var : 48 Max. VIF : 4.314 Adj. R-sq: 0.787 SEE : 0.1133 Constant : 10.64469 F ratio : 280.8 Df : 48 / 3584 Prob. : 0.000

Model Minimum Maximum Mean Std. Dev. N of cases Predicted 10.727795 12.902420 11.348533 0.218383 3633 Residual -0.441963 0.424379 0.0 0.112607 3633

Control sample Minimum Maximum Mean Std. Dev. N of cases Predicted 10.789083 12.352350 11.354625 0.225295 407 Residual -0.611310 0.638933 0.000740 0.123674 407

Autocorrelation Moran’s I Probability N of cases Pairs Range (m) Dependent 0.44847 < 0.0001 4040 60340 0-1125 Predicted 0.56392 < 0.0001 4040 60340 0-1250 Residuals 0.12354 < 0.0001 4040 60340 0-750 Heteroskedasticity SS Res. Low SS Res. High F D. of freedom Prob. Goldfeld-Quandt 21.3444 24.7560 1.1598 1616, 1616 0.0014

Table V. Summary of hedonic model (Natural logarithm of sale price)

Page 31: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

29

d) First stage integration of neighbourhood and geographical interaction relationships

N of Cases : 3633 Indep. Var : 55 Max. VIF : 4.011 Adj. R-sq: 0.789 SEE : 0.1129 Constant : 10.64469 F ratio : 247.4 Df : 55 / 3577 Prob. : 0.000

Model Minimum Maximum Mean Std. Dev. N of cases Predicted 10.763083 12.858742 11.348533 0.218650 3633 Residual -0.453463 0.422567 0.0 0.112087 3633

Control sample Minimum Maximum Mean Std. Dev. N of cases Predicted 10.796546 12.346214 11.354337 0.224621 407 Residual -0.615016 0.622688 0.001028 0.123791 407

Autocorrelation Moran’s I Probability N of cases Pairs Range (m) Dependent 0.44847 < 0.0001 4040 60340 0-1125 Predicted 0.57769 < 0.0001 4040 60340 0-1500 Residuals 0.12557 < 0.0001 4040 60340 0-750 Heteroskedasticity SS Res. Low SS Res. High F D. of freedom Prob. Goldfeld-Quandt 21.1189 24.4930 1.1598 1616, 1616 0.0014

e) Second stage integration of neighbourhood and geographical interaction relationships including relative tax differential

N of Cases : 3633 Indep. Var : 56 Max. VIF : 4.698 Adj. R-sq: 0.822 SEE : 0.1036 Constant : 10.68007 F ratio : 324.5 Df : 52 / 3580 Prob. : 0.000

Model Minimum Maximum Mean Std. Dev. N of cases Predicted 10.774527 12.743113 11.348533 0.2231699 3633 Residual -0.5016142 0.3498444 0.0 0.1027944 3633

Control sample Minimum Maximum Mean Std. Dev. N of cases Predicted 10.794561 12.421995 11.361402 0.2364209 407 Residual -0.6270931 0.3649263 -0.00604 0.1056451 407

Autocorrelation Moran’s I Probability N of cases Pairs Range (m) Dependent 0.44847 < 0.0001 4040 60340 0-1125 Predicted 0.54906 < 0.0001 4040 60340 0-1125 Residuals 0.08338 0.0001 4040 60340 0-375 Heteroskedasticity SS Res. Low SS Res. High F D. of freedom Prob. Goldfeld-Quandt 18.2043 18.9623 1.0416 1616, 1616 0.2062

Table V (Continued). Summary of hedonic model (Natural logarithm of sale price)

Page 32: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

30

Coefficients Significance Collinearity B Std Error Beta t Sig. VIF

(Constant) 10.60501 .05428 195.363 .000 LnAppAge -.11075 .00349 -.404 -31.719 .000 2.543 LnLotSize .06631 .00720 .087 9.209 .000 1.402 LivArea .00494 .00015 .354 33.042 .000 1.803 WaterSewer .10587 .02087 .043 5.073 .000 1.110 InfFound -.06440 .01079 -.050 -5.969 .000 1.105 BaseFinh .02209 .00457 .045 4.835 .000 1.331 CathCeil .01888 .00610 .029 3.094 .002 1.368 InfCeilQual -.10531 .02182 -.039 -4.826 .000 1.047 Skylight .07768 .01805 .035 4.304 .000 1.035 Quality .06296 .00948 .056 6.644 .000 1.129 Washrooms .06312 .00698 .085 9.037 .000 1.376 Fireplace .03825 .00510 .065 7.498 .000 1.183 HardwStair .03710 .00772 .040 4.807 .000 1.101 Oven .02539 .00665 .032 3.819 .000 1.085 KitchCab .02396 .00619 .031 3.871 .000 1.034 Dishwasher .02482 .00416 .050 5.961 .000 1.114 CentVacuum .03817 .00858 .037 4.451 .000 1.055 Veranda .01124 .00407 .023 2.763 .006 1.073 Balcony .03250 .01184 .022 2.744 .006 1.042 ExcavPool .08543 .01057 .067 8.083 .000 1.083 AttGarage .04333 .01271 .029 3.410 .001 1.124 DetGarage .04585 .00684 .057 6.707 .000 1.143 Shed80cf .02077 .00435 .042 4.773 .000 1.221 LTaxRate -.05500 .00672 -.096 -8.185 .000 2.151 NbMonthJ90 .00129 .00029 .035 4.404 .000 1.012 Motorway150 -.05084 .01466 -.028 -3.468 .001 1.055 DistMotorExit .01703 .00305 .064 5.584 .000 2.048 ACF1 Regional-level services -.06657 .00458 -.248 -14.524 .000 4.566 ACF2 Local-level services -.02736 .00348 -.087 -7.855 .000 1.920 CSF1 Centrality .03077 .00469 .066 6.563 .000 1.600 CSF2 Family cycle A -.01914 .00278 -.093 -6.890 .000 2.861 CSF3 Socio-economic status .08140 .00392 .288 20.757 .000 3.015 CSF4 Family cycle B -.01600 .00321 -.050 -4.991 .000 1.602 Location-related attributes are greyed.

Table VI. Model A : Standard hedonic model (Natural logarithm of sale price)

Page 33: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

31

Coefficients Significance Collinearity B Std Error Beta t Sig. VIF

(Constant) 10.59499 .05382 196.873 .000 Nhbd LnAppAge -.11291 .00469 -.353 -24.060 .000 3.707 Pdif LnAppAge -.12083 .00436 -.246 -27.713 .000 1.359 LnAppAge * CSF2 .01848 .00257 .080 7.182 .000 2.111 LnLotSize .08031 .00787 .105 10.205 .000 1.836 LnLotSize * ACF2 -.01612 .00733 -.020 -2.199 .028 1.476 LnLotSize * CSF2 -.01897 .00699 -.026 -2.715 .007 1.565 LnLotSize * CSF3 .02591 .00742 .031 3.490 .000 1.313 Nhbd LivArea .00516 .00025 .279 20.761 .000 3.108 Pdif LivArea .00437 .00015 .265 28.561 .000 1.483 WaterSewer * CSF4 -.12834 .02695 -.038 -4.763 .000 1.081 Nhbd InfFound -.11313 .02040 -.049 -5.545 .000 1.337 Pdif InfFound -.06305 .01043 -.053 -6.043 .000 1.328 BaseFinh .02180 .00442 .044 4.935 .000 1.363 CathCeil .01907 .00609 .029 3.133 .002 1.494 CathCeil * CSF2 .01431 .00439 .029 3.258 .001 1.392 InfCeilQual -.08952 .02141 -.034 -4.182 .000 1.104 InfCeilQual * CSF2 .04358 .01924 .018 2.265 .024 1.070 Nhbd Skylight .16807 .03573 .041 4.704 .000 1.331 Pdif Skylight .06998 .01732 .035 4.040 .000 1.279 Nhbd Quality .09040 .01614 .050 5.599 .000 1.348 Pdif Quality .07668 .00963 .065 7.961 .000 1.146 Nhbd Washrooms .05701 .01335 .045 4.270 .000 1.873 Pdif Washrooms .05779 .00694 .081 8.324 .000 1.618 Washrooms * ACF1 -.01594 .00620 -.022 -2.572 .010 1.264 Nhbd Fireplace .03971 .00983 .040 4.038 .000 1.717 Pdif Fireplace .03180 .00494 .056 6.440 .000 1.290 HardwStair .03608 .00741 .039 4.871 .000 1.112 Oven .02297 .00638 .029 3.602 .000 1.095 Nhbd KitchCab .03908 .01175 .029 3.325 .001 1.283 Pdif KitchCab .02583 .00601 .036 4.296 .000 1.199 Dishwasher .02757 .00400 .056 6.894 .000 1.126 CentVacuum .03990 .00824 .038 4.842 .000 1.067 Veranda .01090 .00391 .022 2.788 .005 1.085 Veranda * ACF2 -.01192 .00488 -.019 -2.441 .015 1.025 Balcony .03783 .01137 .026 3.329 .001 1.052 Nhbd ExcavPool .06983 .02134 .030 3.272 .001 1.482 Pdif ExcavPool .08768 .01016 .075 8.626 .000 1.301 AttGarage .03861 .01231 .026 3.135 .002 1.156 DetGarage .04574 .00659 .057 6.944 .000 1.163 Shed80cf .01789 .00421 .036 4.245 .000 1.255 Shed80cf * CSF3 -.01351 .00467 -.023 -2.889 .004 1.114 Shed80cf * CSF4 .01544 .00512 .024 3.018 .003 1.093 LTaxRate -.04818 .00654 -.084 -7.370 .000 2.231 LTaxRate * CSF2 .03075 .00436 .067 7.051 .000 1.541 LTaxRate * CSF3 -.05924 .00694 -.103 -8.531 .000 2.509 LTaxRate * CSF4 -.02322 .00577 -.038 -4.027 .000 1.546 NbMonthJ90 .00139 .00028 .038 4.927 .000 1.021 Motorway150 -.05425 .01415 -.030 -3.833 .000 1.078 DistMotorExit * CSF2 .00711 .00244 .027 2.918 .004 1.422 ACF1 Regional-level services -.04799 .00387 -.179 -12.406 .000 3.564 ACF2 Local-level services -.03364 .00354 -.107 -9.513 .000 2.170 CSF1 Centrality .02196 .00473 .047 4.646 .000 1.781 CSF2 Family cycle A -.00850 .00314 -.041 -2.706 .007 4.011 CSF3 Socio-economic status .07212 .00406 .255 17.752 .000 3.546 CSF4 Family cycle B -.01156 .00329 -.036 -3.517 .000 1.846 Location-related attributes are greyed.

Table VII. Model D : First stage integration of neighbourhood and geographical interactions

Page 34: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

32

Coefficients Significance Collinearity B Std Error Beta t Sig. VIF

(Constant) 10.68007 .04410 242.202 .000 LnAppAge -.12424 .00399 -.453 -31.107 .000 4.340 Nhbd LnAppAge .01234 .00484 .039 2.547 .011 4.698 LnAppAge * CSF2 .01855 .00235 .080 7.882 .000 2.100 LnLotSize .05931 .00698 .078 8.503 .000 1.717 LnLotSize * CSF2 -.03047 .00599 -.042 -5.085 .000 1.369 LnLotSize * CSF3 .02342 .00686 .028 3.416 .001 1.333 LivArea .00430 .00014 .308 30.692 .000 2.065 Nhbd LivArea .00070 .00020 .038 3.463 .001 2.428 WaterSewer * CSF4 -.11031 .02458 -.032 -4.489 .000 1.070 InfFound -.05870 .00960 -.046 -6.117 .000 1.140 Nhbd InfFound -.05662 .01671 -.024 -3.389 .001 1.067 BaseFinh .01080 .00407 .022 2.653 .008 1.377 CathCeil .01826 .00558 .028 3.272 .001 1.493 CathCeil * CSF2 .01709 .00402 .035 4.251 .000 1.389 InfCeilQual -.10458 .01964 -.039 -5.326 .000 1.105 InfCeilQual * CSF2 .07995 .01775 .033 4.504 .000 1.083 Skylight .07677 .01588 .035 4.834 .000 1.046 Nhbd Skylight .14678 .02932 .036 5.005 .000 1.067 Quality .07136 .00838 .064 8.521 .000 1.150 Washrooms .05440 .00635 .073 8.567 .000 1.484 Washrooms * ACF1 -.02594 .00569 -.036 -4.561 .000 1.266 FirePlace .02942 .00449 .050 6.548 .000 1.196 HardwStair .04459 .00679 .048 6.567 .000 1.111 Oven .02414 .00584 .030 4.135 .000 1.092 KitchCab .02565 .00551 .034 4.655 .000 1.069 Nhbd KitchCab .02122 .01000 .016 2.122 .034 1.106 Dishwasher .03104 .00366 .063 8.474 .000 1.125 CentVacuum .05108 .00757 .049 6.751 .000 1.071 Veranda .00763 .00359 .016 2.127 .034 1.089 Veranda * ACF2 -.01660 .00451 -.026 -3.681 .000 1.040 Balcony .03603 .01043 .025 3.455 .001 1.053 ExcavPool .10037 .00933 .079 10.754 .000 1.101 AttGarage .04600 .01127 .031 4.081 .000 1.152 DetGarage .04759 .00603 .059 7.886 .000 1.161 Shed80cf .02050 .00386 .042 5.307 .000 1.254 Shed80cf * CSF3 -.01686 .00428 -.029 -3.934 .000 1.113 Shed80cf * CSF4 .02090 .00469 .033 4.455 .000 1.094 LocalTaxRate * CSF2 .02339 .00416 .051 5.620 .000 1.670 LocalTaxRate * CSF3 -.05571 .00643 -.097 -8.669 .000 2.557 LocalTaxRate * CSF4 -.01779 .00526 -.029 -3.380 .001 1.533 RelTaxDif -.20769 .00748 -.215 -27.784 .000 1.226 RelTaxDif * ACF2 -.03092 .01047 -.023 -2.953 .003 1.197 RelTaxDif * CSF2 .02660 .00701 .031 3.795 .000 1.384 RelTaxDif * CSF3 -.03928 .00891 -.034 -4.407 .000 1.227 Motorway150 -.07162 .01299 -.040 -5.513 .000 1.081 DistMotorExit * CSF2 .01176 .00223 .044 5.264 .000 1.423 ACF1 Regional-level services -.05966 .00353 -.222 -16.914 .000 3.527 ACF2 Local-level services -.03799 .00311 -.121 -12.198 .000 2.003 CSF1 Centrality .01753 .00432 .038 4.061 .000 1.770 CSF2 Family cycle A -.00855 .00283 -.042 -3.017 .003 3.884 CSF3 Socio-economic status .07540 .00342 .267 22.056 .000 2.988 CSF4 Family cycle B -.01385 .00300 -.044 -4.621 .000 1.827 Location-related attributes are greyed.

Table VIII. Model E : Second stage integration of relative tax differential

Page 35: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

33

0 2.5

kilometres5

Municipalities

Hydrography

Road Network

Boundaries

Lake / River

RiverStream

MotorwaysMain RoadsPairs of Nearest Neighbours

Bungalows

Pair of Bungalows

Transactions in 1990-91

Figure 1. Location of bungalows and network of 15 nearest neighbours

Starting point

Stepwise MultipleRegressionProcedure

Yes Test forMulticollinearity

Test for SpatialAutocorrelation

No

Yes

Test forHeteroskedasticity

Unbiased andstable Hedonic

Model

No

No

Add Geographicand NeighborhoodAttributes and/or

Interactions

Yes

Property Attributes

Factor Analysis ofInterrelatedIndependent

Variables

Test for TemporalAutocorrelation Yes

Add TemporalCovatiates and/or

Interactions

No

Improve MarketSegmentation or

Use BinaryInteractions with

Property Specifics

Figure 2. Hedonic modelling procedure

Page 36: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

34

Figure 3. Correlograms of Models Residuals

0 2.5

kilometres5

Spatial Coefficients1000 Square metres Lot

1.601.561.521.481.44

Municipalities

Hydrography

Road Network

Boundaries

Lake / River

RiverStream

MotorwaysMain Roads

Figure 4. Location rent effect of a 1000 sq.-m. lot on house value (Bungalows 1990-91)

Correlogram of Residuals from Models A to E

-0.15000

-0.10000

-0.05000

0.00000

0.05000

0.10000

0.15000

0.20000

1-12

5

125-

249

250-

374

375-

499

500-

624

625-

749

750-

874

875-

999

1000

-112

4

1125

-124

9

1250

-137

4

1375

-149

9

1500

-162

4

1625

-174

9

Distance Lag (metres)

Mor

an's

IModel A ResidualsModel B ResidualsModel C ResidualsModel D ResidualsModel E Residuals

Page 37: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

35

0 2.5

kilometres5

Spatial Coefficients10 years-old (Apparent Age)

0.790.780.770.760.75

Municipalities

Hydrography

Road Network

Boundaries

Lake / River

RiverStream

MotorwaysMain Roads

Figure 5. Spatial trend of depreciation for a 10 years-old house (Bungalows 1990-91)

0 2.5

kilometres5

Spatial Coefficients40 years-old (Apparent Age)

0.6850.6680.6520.6350.619

Municipalities

Hydrography

Road Network

Boundaries

Lake / River

RiverStream

MotorwaysMain Roads

Figure 6. Spatial trend of depreciation for a 40 years-old house (Bungalows 1990-91)

Page 38: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

36

0 2.5

kilometres5

Spatial CoefficientNet Effect of the Second Washroom

1.121.091.061.031

Municipalities

Hydrography

Road Network

Boundaries

Lake / River

RiverStream

MotorwaysMain Roads

Figure 7. Marginal effect of a second washroom on house value (Bungalows 1990-91)

0 2.5

kilometres5

Spatial CoefficientsPresence of a Shed

1.0561.0341.0130.9910.969

Municipalities

Hydrography

Road Network

Boundaries

Lake / River

RiverStream

MotorwaysMain Roads

Figure 8. Marginal effect of the presence of a shed on house value (Bungalows 1990-91)

Page 39: DOCUMENT DE TRAVAIL 2001-015 L S - FSA ULaval · 2001. 11. 13. · it is combining GIS technology and spatial statistics to define principal components of accessibility and socio-

37

[ ] [ ] [ ] [ ] [ ]

[ ] [ ] [ ] [ ]

[ ]

[ ] [ ]

[ ] [ ] [ ]iiiii

ii

iiii

iiiii

ii

CSFCSFCSFCSFACF

ACFi

Motorwayii

iDetGarageAttGarageExcavPoolBalcony

i

CentVacuumDishwasheri

OvenHardwStairFireplace

iQuality

iiiBaseFinh

iiiiii

eeeee

exitDistMotorEScelTaxDifScLTaxRateSc

cfShedSceeeeVerandaSc

eeKitchCabSceee

WashroomsSceSkylightSclInfCeilQuaScCathCeilSce

InfFoundScWaterSewerScLivAreaScLotSizeScAppAgeScEstModelE

401385.0307540.0200855.0101753.0203799.0

105966.015007162.0

04759.004600.010037.003603.0

05108.003104.002414.004459.002942.0

07136.001071.0

Re

80

lnln59.43480$

−−−

−−

=

Where [1] [ ] ( ) ( ) ( )( )( )01373.0248170.2ln01855.0ln01234.0ln12424.0ln +−++−= iiii CSFAppAgeAppAgeNhbdAppAge

i eAppAgeSc

[2] [ ] ( )( )( ) ( )( )( )09166.0343297.6ln02342.001373.0243297.6ln03047.0)ln05931.0(ln −−++−−+= iiiii CSFLotsizeCSFLotSizeLotSizei eLotSizeSc

[3] [ ] ( ) ( )LivAreaNhbdLivAreai

iieLivAreaSc 00070.000430.0 +=

[4] [ ] ( )( ))25839.0499009.011031.0( −−−= ii CSFWaterSeweri eWaterSewerSc

[5] [ ] ( ) ( )InfFoundNhbdInfFoundi

iieInfFoundSc 05662.005870.0 −+−=

[6] [ ] ( )( )( )01373.0217066.001709.0)01826.0( +−−+= iii CSFCathCeilCathCeili eCathCeilSc

[7] [ ] ( )( )( )01373.0200853.007995.0)10458.0( +−+−= iii CSFlInfCeilQualInfCeilQuai elInfCeilQuaSc

[8] [ ] ( ) ( )SkylightNhbdSkylighti

iieSkylightSc 14678.007677.0 +=

[9] [ ] ( ) ( )( )( )42520.0125406.102594.005440.0 −−−+= iii ACFWashroomsWashroomsi eWashroomsSc

[10] [ ] ( ) ( )KitchCabNhbdKitchCabi

iieKitchCabSc 02122.002565.0 +=

[11] [ ] ( )( )( )01101.0247702.001660.0)00763.0( −−−+= iii ACFVerandaVerandai eVerandaSc

[12] [ ] ( )( )( ) ( )( )( ))25839.044572.08002090.009166.034572.08001686.0)8002050.0(80 −−+−−−+= iiiii CSFcfShedCSFcfShedcfShedi ecfShedSc

[13] [ ]( )( )( ) ( )( )( )

( )( )( )25839.043847.201779.009166.033847.205571.001373.023847.202339.0

−−−+−−−++−

= iiiiii

CSFLTaxRateCSFLTaxRateCSFLTaxRate

i eLTaxRateSc

[14] [ ]( )( )( ) ( )( )( )

( )( )( )01101.021469.0Re03092.009166.031469.0Re03928.001373.021469.0Re02660.0)Re20769.0(

Re −−−+−−−++−−

= iiiiiii

ACFlTaxDifCSFlTaxDifCSFlTaxDiflTaxDif

i elTaxDifSc

[15] [ ] ( )( )( )01373.025066.101176.0 +−= ii CSFayExitDistMotorwi exitDistMotorESc

[16] ki

d

dAttribute

AttributeNhbd

k ik

k ik

k

i ≠=

=

= ;1

15

12

15

12 and AttributeNhbdAttributeAttributePdif iii −=

[17] { } iiiiii EstModelDlueTaxationVaLTaxRateEstModelDLTaxRatelTaxDif )01.0()01.0(100Re −=

Appendix 1. Mathematical specification of Model E