Regression for class teaching

Preview:

Regression

Understanding

Example

Diagnostics

Regression With SPSS

Transformation

Regression

What is Regression

A statistical technique used to relate two or more variables.

Objective: use the independent variable(s) to predict the value of the dependent variable.

Example

For a given value of advertisement expenditure, how much in sales will be generated?

With a given diet plan, how much weight will an individual be able to lose?

With a unit increase in greenhouse gases, how much will the temperature rise?

Regression Understanding

A layman's question: Suppose we want to find out how much the age of a car helps us determine the price of the car.

A layman's answer: The older the car, the ______ will be the price.

Regression in simple words: As the age of the car increases by one year, the price of the car is estimated to decrease by a certain amount.

Regression in statistical terms: Y(Estimated) = b0 + b1 X

Regression Understanding

Data set: age & price of the cars.

What relation do you see? A negative relationship.

Age:   1  2  1  2  3  4  3  4  3
Price: 90 85 93 84 80 74 81 76 79

A convenient way to look at it (what is this tool called?)

[Scatter plot: Price (70 to 90) against Age (1 to 4), showing the downward trend]

How to Show it Statistically

Y(E) = b0 + b1 X

Y(E) = 97 – 5 X

Y = 97 – 5 X + E

Term: What it is

Y(E): Dependent variable whose behavior is to be determined

X: Independent variable whose effect is to be determined

b0: Intercept, the value of Y(E) when X = 0

b1: Estimated change in Y in response to a unit change in X

E: Difference between the actual and estimated values
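The line above can be recovered from the age/price data tabulated earlier with ordinary least squares. A minimal numpy sketch (the data values are the ones on the earlier slide; the exact coefficients come out near the rounded 97 and –5 shown above):

```python
import numpy as np

# Age/price data tabulated on the earlier slide.
age = np.array([1, 2, 1, 2, 3, 4, 3, 4, 3], dtype=float)
price = np.array([90, 85, 93, 84, 80, 74, 81, 76, 79], dtype=float)

# Least-squares slope (b1) and intercept (b0).
b1 = np.sum((age - age.mean()) * (price - price.mean())) / np.sum((age - age.mean()) ** 2)
b0 = price.mean() - b1 * age.mean()

print(f"Price(Estimated) = {b0:.2f} + ({b1:.2f}) * Age")
```

The exact fit is roughly Price = 96.25 – 5.40 Age, which the slide rounds to 97 – 5X.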

Assessing the Goodness of Fit: Graphical Way

Goodness of fit means how well the model fits the actual data. Smaller residuals mean a good fit; larger residuals mean a poor fit.

[Panels: Bad Fit, Good Fit, Perfect Fit]

Assessing the Goodness of Fit: Statistical Way

Expected Y (the mean of Y), Estimated Y (the fitted value), Actual Y (the observed value)

SST = Σ (Actual – Expected)²

SSR = Σ (Estimated – Expected)²

SSE = Σ (Actual – Estimated)²

A good model is one in which SSE is the lowest; for a perfect fit, SSE = 0.

SST = SSR + SSE, so R² = SSR/SST = 1 – SSE/SST
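The decomposition and both R² identities can be verified numerically on the same age/price data. A numpy sketch, with the fitted line playing the role of "Estimated" and the mean of Y the role of "Expected":

```python
import numpy as np

age = np.array([1, 2, 1, 2, 3, 4, 3, 4, 3], dtype=float)
price = np.array([90, 85, 93, 84, 80, 74, 81, 76, 79], dtype=float)

# Fit the line, then decompose the variation in price.
b1 = np.sum((age - age.mean()) * (price - price.mean())) / np.sum((age - age.mean()) ** 2)
b0 = price.mean() - b1 * age.mean()
estimated = b0 + b1 * age        # Estimated Y (fitted values)
expected = price.mean()          # Expected Y (the mean)

sst = np.sum((price - expected) ** 2)      # total variation
ssr = np.sum((estimated - expected) ** 2)  # variation explained by the line
sse = np.sum((price - estimated) ** 2)     # leftover (residual) variation

r2 = ssr / sst
print(f"SST={sst:.2f}  SSR={ssr:.2f}  SSE={sse:.2f}  R2={r2:.3f}")
```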

Residual Analysis

Why? The purpose of modeling is to predict (interpolate), and the interpolation can only be correct when the assumptions about the behavior of the data hold true.

Assumptions: the response variable is independent, is normally distributed, has constant variance, and has a straight-line relation with the IV.

Residual Analysis

Assumption / in terms of the response variable / in terms of the residuals:

Independence: the response variable is independent / the random errors are independent

Normality: the response variable is normally distributed / the random errors are normally distributed

Constant variance: the response variable has constant variance / the random errors have constant variance

Linearity: the response variable has a straight-line relation with the IV / the random errors have a straight-line relation with the IV

Inferring About the Population

Assumption / what it means / how to check it:

Expected value of residual: E(e_i) = 0; check: no apparent pattern in the residual plot.

Variance of residual: σe1 = σe2 = … = σei (constant); check: the residual plot has a consistent spread.

Distribution of residual: normal; check: the histogram is symmetric or normal (histogram & probability plot of the residuals).

Dependency of residuals: independent; check: no systematic pattern in the residuals over the observation order.

Relationship b/w IV & DV: linear; check: scatter plot.

The Three Conditions Shown Together

As the distribution is symmetric, the mean of the error-term distribution will be zero.

The distribution of the error term is shown to be normally distributed.

The variance of the error term for different values of x appears to be the same.

Residual Analysis: Types of Residuals

Normal (raw) residual, RESID: Y – Y(Estimated)

Standardized residual, ZRESID: {Y – Y(Estimated)} / standard error of the residuals

Studentized residual, SRESID: {Y – Y(Estimated)} / varying (case-specific) standard error of the residuals
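The three residual types can be computed by hand. A numpy sketch on the age/price data, using the common textbook definitions: ZRESID divides by the overall residual standard error, while SRESID divides by a case-specific standard error built from the leverages:

```python
import numpy as np

age = np.array([1, 2, 1, 2, 3, 4, 3, 4, 3], dtype=float)
price = np.array([90, 85, 93, 84, 80, 74, 81, 76, 79], dtype=float)
n = len(age)

X = np.column_stack([np.ones(n), age])            # design matrix with intercept
beta, *_ = np.linalg.lstsq(X, price, rcond=None)
fitted = X @ beta

raw = price - fitted                               # RESID
s = np.sqrt(np.sum(raw ** 2) / (n - 2))            # residual standard error
zresid = raw / s                                   # ZRESID

# SRESID: each residual divided by its own, leverage-adjusted standard error.
H = X @ np.linalg.inv(X.T @ X) @ X.T               # hat matrix
h = np.diag(H)                                     # leverage of each case
sresid = raw / (s * np.sqrt(1 - h))

print(np.round(sresid, 2))
```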

Influential Observation

Outliers: observations with a large error (residual).

Leverage points: observations distinct from the other cases on the basis of their independent-variable values.

Influential observation: a value whose inclusion can change the coefficients of the regression line.

Any value can be an influential observation.

Outliers With Residuals

Unstandardized residuals cannot tell us how big a residual must be before it counts as big. Using the properties of the normal distribution helps us make a rule for deciding large or small.

The model is unacceptable when:

any case has |SR| > 3.29 (rule of 3.29)

1% or more of cases have |SR| > 2.58 (rule of 2.58)

5% or more of cases have |SR| > 1.96 (rule of 1.96)
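A sketch of how the three rules could be screened, with a made-up vector of standardized residuals (the values are illustrative only):

```python
import numpy as np

# Assumed standardized residuals from some fitted model (illustrative values).
zresid = np.array([0.5, -1.2, 2.7, -0.3, 1.1, -2.0, 0.8, -0.9, 1.6, -0.1])

pct_above_196 = np.mean(np.abs(zresid) > 1.96) * 100   # % of cases beyond 1.96
pct_above_258 = np.mean(np.abs(zresid) > 2.58) * 100   # % of cases beyond 2.58
any_above_329 = bool(np.any(np.abs(zresid) > 3.29))    # any extreme case at all

flags = []
if any_above_329:
    flags.append("a case with |SR| > 3.29")
if pct_above_258 >= 1:
    flags.append("1% or more cases with |SR| > 2.58")
if pct_above_196 >= 5:
    flags.append("5% or more cases with |SR| > 1.96")

print(flags)
```

With only ten cases the percentage rules are crude, but the screening logic is the same at any sample size.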

Identifying Influential Cases

"I will look at the world without you": the regression is run with a particular case removed, and that case's value is then predicted.

How it looks: if this adjusted predicted value is similar to the original predicted value, then the case is not an influential observation.

Identifying Influential Cases

Adjusted predicted value: the predicted value of a case obtained without including that case in the fit.

DFFit: original predicted value – adjusted predicted value

Deleted residual: original observed value – adjusted predicted value

Studentized deleted residual: deleted residual / its standard error
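These leave-one-out quantities can be computed directly by refitting the model without each case. A numpy sketch on the age/price data:

```python
import numpy as np

age = np.array([1, 2, 1, 2, 3, 4, 3, 4, 3], dtype=float)
price = np.array([90, 85, 93, 84, 80, 74, 81, 76, 79], dtype=float)
n = len(age)
X = np.column_stack([np.ones(n), age])

beta, *_ = np.linalg.lstsq(X, price, rcond=None)
predicted = X @ beta

dffit = np.empty(n)
deleted_resid = np.empty(n)
for i in range(n):
    mask = np.arange(n) != i                        # drop case i
    beta_i, *_ = np.linalg.lstsq(X[mask], price[mask], rcond=None)
    adj_pred = X[i] @ beta_i                        # adjusted predicted value
    dffit[i] = predicted[i] - adj_pred              # DFFit
    deleted_resid[i] = price[i] - adj_pred          # deleted residual
```

A handy check: for least squares the deleted residual equals the ordinary residual divided by (1 – leverage), so the loop is not strictly necessary.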

Influential Cases

Compare the coefficients with (xa, ya) included and with (xa, ya) not included:

Large change in the coefficients: influential observation.

No large change in the coefficients: not an influential observation.

Influential Cases (Adjusted Predicted Value)

DFFit = difference = predicted value (PV) – adjusted predicted value (APV)

Large difference: influential observation.

Small difference: not an influential observation.

Influential Cases (Adjusted Predicted Value)

Deleted residual (DR) = original value (OV) – adjusted predicted value (APV)

Studentized deleted residual (SDR) = DR / SE

SDR can be compared across different regression models.

Identifying Influential Cases

Measure / what is it? / observation is influential if:

Cook's distance: a measure of the overall influence of the case on the model; influential if CD > 1.

Leverage: the influence of the observed value on the predicted value; average leverage (AL) = (k+1)/n, and a case is flagged if its leverage exceeds 2(k+1)/n or 3(k+1)/n.

Mahalanobis distance: the distance of a case from the means of the predictor variables; use the Barnett & Lewis table.
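Leverage and Cook's distance can be computed from the hat matrix. A numpy sketch on the age/price data, using the standard Cook's distance formula D_i = e_i² / ((k+1) MSE) * h_i / (1 - h_i)²:

```python
import numpy as np

age = np.array([1, 2, 1, 2, 3, 4, 3, 4, 3], dtype=float)
price = np.array([90, 85, 93, 84, 80, 74, 81, 76, 79], dtype=float)
n, k = len(age), 1                                  # k = number of predictors

X = np.column_stack([np.ones(n), age])
H = X @ np.linalg.inv(X.T @ X) @ X.T
h = np.diag(H)                                      # leverage values

avg_leverage = (k + 1) / n                          # average leverage = (k+1)/n
high_leverage = h > 2 * avg_leverage                # common cut-off: 2(k+1)/n

beta, *_ = np.linalg.lstsq(X, price, rcond=None)
e = price - X @ beta
mse = np.sum(e ** 2) / (n - k - 1)
cooks_d = (e ** 2 / (mse * (k + 1))) * (h / (1 - h) ** 2)
influential = cooks_d > 1                           # rule of thumb: CD > 1
```

On this toy data set exactly one case (the third: age 1, price 93) comes out just above the CD > 1 cut-off.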

Identifying Influential Cases

DfBeta: the difference between a parameter estimated with and without the case; scale sensitive, therefore it does not provide a good critical value.

Standardized DfBeta (DfBeta / standard error): the same difference on a standardized scale; the observation is influential if it exceeds 1 (some rules use 2).

Covariance ratio (CVR): measures whether the case affects the variance of the regression parameters. Consider deleting the case if CVR < 1 - 3(k+1)/n; do not delete the case if CVR > 1 + 3(k+1)/n.

k = number of predictors

Heteroscedasticity

What is it? Changing variance at different levels of the predictor.

[Scatter plot of residuals against y: the spread of the residuals increases with y]

Multicollinearity

What is it? A strong correlation between the predictor variables.

Effects: untrustworthy b's, restricted R², difficulty in picking the right variable, inflated standard errors, non-significant b's, and b's that vary from sample to sample.

If a new variable that is strongly correlated with the first one is included, R² will not increase.

Multicollinearity: Measures

VIF (Variance Inflation Factor): VIF = 1/(1 - R²), where R² comes from regressing one predictor on the remaining predictors. Interpretation: the lower the value the better; VIF < 10 is the usual cut-off.

Durbin-Watson (a check of residual independence rather than of multicollinearity): the range of values is between 0 and 4; 0 = positive autocorrelation, 4 = negative autocorrelation, 2 = no autocorrelation. The desired value is 2 or near it.
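Both measures are easy to compute by hand. A numpy sketch on simulated data (the variable names and the deliberately collinear pair are assumptions for illustration): VIF comes from an auxiliary regression, Durbin-Watson from the first differences of the residuals.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 200
x1 = rng.normal(size=n)
x2 = x1 + 0.2 * rng.normal(size=n)          # built to track x1 closely
y = 1.0 + 2.0 * x1 - 1.0 * x2 + rng.normal(size=n)

def r_squared(y, X):
    """R^2 of an OLS fit of y on X (X includes the intercept column)."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    e = y - X @ beta
    return 1 - np.sum(e ** 2) / np.sum((y - y.mean()) ** 2)

# VIF for x1: regress x1 on the other predictor(s), then 1 / (1 - R^2).
r2_aux = r_squared(x1, np.column_stack([np.ones(n), x2]))
vif_x1 = 1 / (1 - r2_aux)

# Durbin-Watson on the residuals of the full model (case order matters).
X = np.column_stack([np.ones(n), x1, x2])
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
e = y - X @ beta
dw = np.sum(np.diff(e) ** 2) / np.sum(e ** 2)
```

Because x2 tracks x1 closely, the VIF comes out well above 10, while the independently drawn errors leave the Durbin-Watson statistic near 2.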

Measures of Multicollinearity

Measure / desired behavior / critical value:

VIF: the lower the better; VIF > 10 signals a problem.

Tolerance: the higher the better; T < 0.1 signals a problem.

Eigenvalues: eigenvalues of similar magnitude are better; very small eigenvalues signal a problem.

Variance proportions: each dimension should be related to a separate variable.

Checking Assumptions Through Plots

Normality: P-P plot of the standardized residuals; Q-Q plot of the standardized residuals.

Heteroscedasticity & outliers: scatter plot of standardized residuals against standardized predicted values; scatter plot of residuals against standardized predicted values.

Transformation of a Variable

Reason: a nonlinear relation is translated into a linear one, and the methods of estimation and interpretation for linear relations are well known.

How: justified theoretically and by diagnostic plots; transform X, Y, or both.

Transformation of a Variable

Function / form / transform / linear form:

Reciprocal: Y = α + β/x; X' = 1/x; Y = α + βX'

Exponential: Y = αe^(βx); Y' = ln(Y); Y' = ln α + βx

Power: Y = αx^β; Y' = log(Y), X' = log(X); Y' = log α + βX'

Log: Y = α + β log x; X' = log(x); Y = α + βX'
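The exponential row can be demonstrated numerically: taking logs of Y = αe^(βx) gives ln Y = ln α + βx, so an ordinary linear fit on (x, ln Y) recovers α and β. A numpy sketch on simulated data (α = 2 and β = 0.7 are assumed for illustration):

```python
import numpy as np

rng = np.random.default_rng(1)
x = np.linspace(1, 5, 50)
alpha, beta = 2.0, 0.7
# Multiplicative noise keeps Y positive so the log is always defined.
y = alpha * np.exp(beta * x) * np.exp(rng.normal(scale=0.05, size=x.size))

# Linearize: ln(Y) = ln(alpha) + beta * x, then fit a straight line.
y_log = np.log(y)
X = np.column_stack([np.ones(x.size), x])
coef, *_ = np.linalg.lstsq(X, y_log, rcond=None)
alpha_hat, beta_hat = np.exp(coef[0]), coef[1]

print(f"alpha_hat={alpha_hat:.3f}, beta_hat={beta_hat:.3f}")
```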

Regression through SPSS

Coefficients: b0 & b1, tested with t.

Model fit: SST = SSR + SSE, tested with F = MSR/MSE.

Assumptions: e is independent; e is normally distributed; e has constant variance; e has a straight-line relation with the IV; no multicollinearity.

Data Set

Variables: Study Time, Interest, Marks.

Saved casewise statistics: standardized predicted values, standardized residuals, deleted residuals, adjusted predicted values, studentized residuals, studentized deleted residuals.


Normality: normal probability plot of the standardized residuals; histogram of the standardized residuals; K-S and Shapiro-Wilk tests.

Getting the residuals & standardized residuals.



[Scatter plot: standardized residuals against standardized predicted values, both axes from -3 to 3, showing a random, patternless spread (the recommended pattern)]
