Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

Embed Size (px)

Citation preview

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    1/22

    Christopher Dougherty

    EC220 - Introduction to econometrics(chapter 1)

    Slideshow: interpretation of a regression equation

    Original citation:

    Dougherty, C. (2012) EC220 - Introduction to econometrics (chapter 1). [Teaching Resource]

    2012 The Author

    This version available at: http://learningresources.lse.ac.uk/127/

    Available in LSE Learning Resources Online: May 2012

    This work is licensed under a Creative Commons Attribution-ShareAlike 3.0 License. This license allows

    the user to remix, tweak, and build upon the work even for commercial purposes, as long as the user

    credits the author and licenses their new creations under the identical terms.

    http://creativecommons.org/licenses/by-sa/3.0/

    http://learningresources.lse.ac.uk/

    http://learningresources.lse.ac.uk/http://creativecommons.org/licenses/by-sa/3.0/http://creativecommons.org/licenses/by-sa/3.0/http://creativecommons.org/licenses/by-sa/3.0/http://creativecommons.org/licenses/by-sa/3.0/http://learningresources.lse.ac.uk/
  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    2/22

    1

    INTERPRETATION OF A REGRESSION EQUATION

    The scatter diagram shows hourly earnings in 2002 plotted against years of schooling,defined as highest grade completed, for a sample of 540 respondents from the National

    Longitudinal Survey of Youth.

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    3/22

    2

    INTERPRETATION OF A REGRESSION EQUATION

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    Highest grade completed means just that for elementary and high school. Grades 13, 14,and 15 mean completion of one, two and three years of college.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    4/22

    3

    INTERPRETATION OF A REGRESSION EQUATION

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    Grade 16 means completion of four-year college. Higher grades indicate years ofpostgraduate education.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    5/22

    . reg EARNINGS S

    Source | SS df MS Number of obs = 540

    -------------+------------------------------ F( 1, 538) = 112.15

    Model | 19321.5589 1 19321.5589 Prob > F = 0.0000

    Residual | 92688.6722 538 172.283777 R-squared = 0.1725

    -------------+------------------------------ Adj R-squared = 0.1710

    Total | 112010.231 539 207.811189 Root MSE = 13.126

    ------------------------------------------------------------------------------EARNINGS | Coef. Std. Err. t P>|t| [95% Conf. Interval]

    -------------+----------------------------------------------------------------

    S | 2.455321 .2318512 10.59 0.000 1.999876 2.910765

    _cons | -13.93347 3.219851 -4.33 0.000 -20.25849 -7.608444

    ------------------------------------------------------------------------------

    INTERPRETATION OF A REGRESSION EQUATION

    This is the output from a regression of earnings on years of schooling, using Stata.

    4

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    6/22

    . reg EARNINGS S

    Source | SS df MS Number of obs = 540

    -------------+------------------------------ F( 1, 538) = 112.15

    Model | 19321.5589 1 19321.5589 Prob > F = 0.0000

    Residual | 92688.6722 538 172.283777 R-squared = 0.1725

    -------------+------------------------------ Adj R-squared = 0.1710

    Total | 112010.231 539 207.811189 Root MSE = 13.126

    ------------------------------------------------------------------------------EARNINGS | Coef. Std. Err. t P>|t| [95% Conf. Interval]

    -------------+----------------------------------------------------------------

    S | 2.455321 .2318512 10.59 0.000 1.999876 2.910765

    _cons | -13.93347 3.219851 -4.33 0.000 -20.25849 -7.608444

    ------------------------------------------------------------------------------

    INTERPRETATION OF A REGRESSION EQUATION

    5

    For the time being, we will be concerned only with the estimates of the parameters. Thevariables in the regression are listed in the first column and the second column gives the

    estimates of their coefficients.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    7/22

    . reg EARNINGS S

    Source | SS df MS Number of obs = 540

    -------------+------------------------------ F( 1, 538) = 112.15

    Model | 19321.5589 1 19321.5589 Prob > F = 0.0000

    Residual | 92688.6722 538 172.283777 R-squared = 0.1725

    -------------+------------------------------ Adj R-squared = 0.1710

    Total | 112010.231 539 207.811189 Root MSE = 13.126

    ------------------------------------------------------------------------------EARNINGS | Coef. Std. Err. t P>|t| [95% Conf. Interval]

    -------------+----------------------------------------------------------------

    S | 2.455321 .2318512 10.59 0.000 1.999876 2.910765

    _cons | -13.93347 3.219851 -4.33 0.000 -20.25849 -7.608444

    ------------------------------------------------------------------------------

    INTERPRETATION OF A REGRESSION EQUATION

    6

    In this case there is only one variable,S, and its coefficient is 2.46. _cons, in Stata, refers tothe constant. The estimate of the intercept is -13.93.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    8/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    7

    Here is the scatter diagram again, with the regression line shown.

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    9/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    8

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    What do the coefficients actually mean?

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    10/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    9

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    To answer this question, you must refer to the units in which the variables are measured.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    11/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    10

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    Sis measured in years (strictly speaking, grades completed),EARNINGSin dollars perhour. So the slope coefficient implies that hourly earnings increase by $2.46 for each extra

    year of schooling.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    12/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    11

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    We will look at a geometrical representation of this interpretation. To do this, we willenlarge the marked section of the scatter diagram.

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    13/22

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    14/22

    You should ask yourself whether this is a plausible figure. If it is implausible, this could bea sign that your model is misspecified in some way.

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    13

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    15/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    14

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    For low levels of education it might be plausible. But for high levels it would seem to be anunderestimate.

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    16/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    15

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    What about the constant term? (Try to answer this question yourself before continuing withthis sequence.)

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    17/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    16

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    Literally, the constant indicates that an individual with no years of education would have topay $13.93 per hour to be allowed to work.

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    18/22

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    17

    INTERPRETATION OF A REGRESSION EQUATION

    SEARNINGS 46.293.13 ^

    This does not make any sense at all. In former times craftsmen might require an initialpayment when taking on an apprentice, and might pay the apprentice little or nothing for

    quite a while, but an interpretation of negative payment is impossible to sustain.

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    19/22

    18

    INTERPRETATION OF A REGRESSION EQUATION

    A safe solution to the problem is to limit the interpretation to the range of the sample data,and to refuse to extrapolate on the ground that we have no evidence outside the data range.

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    SEARNINGS 46.293.13 ^

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    20/22

    19

    INTERPRETATION OF A REGRESSION EQUATION

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    SEARNINGS 46.293.13 ^

    With this explanation, the only function of the constant term is to enable you to draw theregression line at the correct height on the scatter diagram. It has no meaning of its own.

    INTERPRETATION OF A REGRESSION EQUATION

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    21/22

    20

    Another solution is to explore the possibility that the true relationship is nonlinear and thatwe are approximating it with a linear regression. We will soon extend the regression

    technique to fit nonlinear models.

    INTERPRETATION OF A REGRESSION EQUATION

    -20

    0

    20

    40

    60

    80

    100

    120

    0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20

    Years of schooling

    Hourly

    earnings

    ($)

    SEARNINGS 46.293.13 ^

  • 8/10/2019 Chapter 1 Interpretation of a Regression Equation %28EC220%29 (1)

    22/22

    Copyright Christopher Dougherty 2011.

    These slideshows may be downloaded by anyone, anywhere for personal use.Subject to respect for copyright and, where appropriate, attribution, they may be

    used as a resource for teaching an econometrics course. There is no need to

    refer to the author.

    The content of this slideshow comes from Section 1.4 of C. Dougherty,

    In troduct ion to Econ ometr ics, fourth edition 2011, Oxford University Press.

    Additional (free) resources for both students and instructors may be

    downloaded from the OUP Online Resource Centre

    http://www.oup.com/uk/orc/bin/9780199567089/.

    Individuals studying econometrics on their own and who feel that they might

    benefit from participation in a formal course should consider the London School

    of Economics summer school course

    EC212 Introduction to Econometrics

    http://www2.lse.ac.uk/study/summerSchools/summerSchool/Home.aspxor the University of London International Programmes distance learning course

    20 Elements of Econometrics

    www.londoninternational.ac.uk/lse.

    11 07 25

    http://www.oup.com/uk/orc/bin/9780199567089/http://www2.lse.ac.uk/study/summerSchools/summerSchool/Home.aspxhttp://g/www.londoninternational.ac.uk/lsehttp://g/www.londoninternational.ac.uk/lsehttp://www2.lse.ac.uk/study/summerSchools/summerSchool/Home.aspxhttp://www.oup.com/uk/orc/bin/9780199567089/