74
STATISTICAL ANALYSIS - LECTURE 3 Dr. Mahmoud Mounir [email protected]

STATISTICAL ANALYSIS - LECTURE 3

  • Upload
    others

  • View
    4

  • Download
    0

Embed Size (px)

Citation preview

Page 1: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS -LECTURE 3Dr. Mahmoud Mounir

[email protected]

Page 2: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 2

Page 3: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 3

VARIANCE AND STANDARD DEVIATION

Page 4: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 4

VARIANCE AND STANDARD DEVIATIONโ‘VARIANCE (UNGROUPED DATA)

Page 5: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 5

VARIANCE AND STANDARD DEVIATIONโ‘VARIANCE (UNGROUPED DATA)

โžขMean is the pointof balance, so wehave positive andnegative deviationsfrom the mean.

โžขThe sum ofdeviation sum tozero. Thatโ€™s why wedonโ€™t use theoriginal deviations,but the squareddeviations.

Page 6: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 6

VARIANCE AND STANDARD DEVIATIONโ‘VARIANCE (UNGROUPED DATA)

Page 7: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 7

VARIANCE AND STANDARD DEVIATIONโ‘VARIANCE (UNGROUPED DATA)

๐’” =ฯƒ๐’™๐Ÿ โˆ’

ฯƒ๐’™ ๐Ÿ

๐’๐’ โˆ’ ๐Ÿ

(X) (X - าง๐‘ฅ) (X - าง๐‘ฅ)2

50 -112.5 12656.25

100 -62.5 3906.25

200 37.5 1406.25

300 137.5 18906.25

าง๐‘ฅ = 162.5ฯƒ (X - าง๐‘ฅ)2

= 36875

Page 8: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 8

VARIANCE AND STANDARD DEVIATIONโ‘VARIANCE (UNGROUPED DATA)

๐’” =ฯƒ๐’™๐Ÿ โˆ’

ฯƒ๐’™ ๐Ÿ

๐’๐’ โˆ’ ๐Ÿ

(X) (X2)

50 2500

100 10000

200 40000

300 90000

ฯƒ X = 650ฯƒ X2 =

142500

Page 9: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 9

x = class midpoint

๐‘† =ฯƒ ๐‘ฅ โˆ’ ว‰๐‘ฅ 2๐‘“

๐‘› โˆ’ 1

โ‘VARIANCE (GROUPED DATA)

VARIANCE AND STANDARD DEVIATION

Page 10: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 10

x = class midpoint

โ‘VARIANCE (GROUPED DATA)

VARIANCE AND STANDARD DEVIATION

๐’” =ฯƒ๐’™๐Ÿ ๐’‡ โˆ’

ฯƒ๐’™๐’‡ ๐Ÿ

๐’๐’ โˆ’ ๐Ÿ (X) (X2) f X2 * f

50 2500 5 12500

100 10000 3 30000

200 40000 6 240000

300 90000 2 180000

ฯƒ X = 650ฯƒ X2 =

142500n = 16

ฯƒ X2 * f =

462500

Page 11: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 11

Age Frrquency (f) Midpoint (x) X-Mean (X-Mean)2 (X-Mean)2 f

30-34 4 32 -9 81 324

35-39 5 37 -4 16 80

40-44 2 42 1 1 2

45-49 9 47 6 36 324

Total 20 730

โˆ‘f = n = 20Mean = 820/20 = 41

โˆ‘(X-Mean)2 f = 730

โ‘VARIANCE (GROUPED DATA)

๐‘บ =๐Ÿ•๐Ÿ‘๐ŸŽ

๐Ÿ๐ŸŽ โˆ’ ๐Ÿ

= ๐Ÿ‘๐Ÿ– . ๐Ÿ’๐Ÿ โ‰ˆ ๐Ÿ”. ๐Ÿ๐ŸŽ

VARIANCE AND STANDARD DEVIATION

Page 12: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 12

โ‘Sometimes researchers want to know if a specific

observation is common or exceptional.

โ‘To answer that question, they express a score in terms of

how many standard deviations below or above the

population mean a raw score is.

โ‘This number is what we call a z-score.

โ‘If we recode original scores into z-scores, we say that we

standardize a variable.

Z-SCORE

Page 13: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 13

Z-SCORE

Page 14: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 14

Z-SCORE

Page 15: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 15

Z-SCORE

Page 16: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 16

Z-SCOREโ‘EMPIRICAL RULE

NORMAL DISTRIBUION (BELL SHAPED)

Page 17: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 17

Z-SCOREโ‘EMPIRICAL RULE โ€œAPPROXIMATIONโ€

NORMAL DISTRIBUION (BELL SHAPED)โ–ช Approximately 68%of the data lie within one standard deviation of the mean, that is, in the interval with endpoints าง๐‘ฅ ยฑs for samples and with endpoints ฮผยฑฯƒ for populations.โ–ช Approximately 95%

of the data lie within two standard deviations of the mean, that is, in the interval with endpoints าง๐‘ฅ ยฑ2s for samples and with endpoints ฮผยฑ2ฯƒ for populations.โ–ช Approximately 99.7%

of the data lies within three standard deviations of the mean, that is, in the interval with endpoints าง๐‘ฅ ยฑ3s for samples and with endpoints ฮผยฑ3ฯƒ for populations.

Page 18: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 18

Z-SCOREโ‘EMPIRICAL RULE

NORMAL DISTRIBUION (BELL SHAPED)

Page 19: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 19

Z-SCOREโ‘CHEBYSHEV'S RULE

ANY DISTRIBUION

Page 20: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 20

Z-SCOREโ‘CHEBYSHEV'S RULE

ANY DISTRIBUION

Page 21: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 21

Z-SCOREโ‘CHEBYSHEV'S RULE

ANY DISTRIBUION

Page 22: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 22

Z-SCOREโ‘CHEBYSHEV'S RULE โ€œFACTโ€

ANY DISTRIBUIONโ–ช At Least 75%

of the data lie within two standard deviations of the mean, that is, in the interval with endpoints าง๐‘ฅ ยฑ2s for samples and with endpoints ฮผยฑ2ฯƒ for populations.โ–ช At Least 89%

of the data lies within three standard deviations of the mean, that is, in the interval with endpoints าง๐‘ฅ ยฑ3s for samples and with endpoints ฮผยฑ3ฯƒ for populations.

Page 23: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 23

Z-SCORE

Page 24: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 24

EXERCISE (1)โ–ช What does the distribution

of the variable look like?

โ–ช What is the center of the

distribution?

โ–ช Study the variability of the

distribution.

โ–ช Construct a box plot.

โ–ช What is the z-score of school

#3?

Page 25: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 25

EXERCISE (2)

68.7 72.3 71.3 72.5 70.6 68.2 70.1 68.4 68.6 70.673.7 70.5 71.0 70.9 69.3 69.4 69.7 69.1 71.5 68.670.9 70.0 70.4 68.9 69.4 69.4 69.2 70.7 70.5 69.969.8 69.8 68.6 69.5 71.6 66.2 72.4 70.7 67.7 69.168.8 69.3 68.9 74.8 68.0 71.2 68.3 70.2 71.9 70.471.9 72.2 70.0 68.7 67.9 71.1 69.0 70.8 67.3 71.870.3 68.8 67.2 73.0 70.4 67.8 70.0 69.5 70.1 72.072.2 67.6 67.0 70.3 71.2 65.6 68.1 70.8 71.4 70.270.1 67.5 71.3 71.5 71.0 69.1 69.5 71.1 66.8 71.869.6 72.7 72.8 69.6 65.9 68.0 69.7 68.7 69.8 69.7

โ€ข The following table shows the heights in inches of 100 randomly selected

adult men measured in inches.

Mean าง๐‘ฅ = 69.92 inchesStandard Deviation S = 1.70 inches

MIN = 65.6 inchesMAX = 74.8 inches

Page 26: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 26

EXERCISE (2)โ€ข A relative frequency histogram for the data

Page 27: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 27

EXERCISE (2)โ‘ The number of observations that are within ONE standard deviation of the mean

าง๐‘ฅ - S าง๐‘ฅ + S69.92 โˆ’ 1.70 = 68.22 inches and 69.92 + 1.70 = 71.62 inches

69

โ‘ The number of observations that are within TWO standard deviation of the mean

าง๐‘ฅ - 2S าง๐‘ฅ + 2S69.92 โˆ’ 2(1.70) = 66.52 inches and 69.92 + 2(1.70) = 73.32 inches

95โ‘ The number of observations that are within THREE standard deviation of the mean

าง๐‘ฅ - 3S าง๐‘ฅ + 3S69.92 โˆ’ 3(1.70) = 64.822 inches and 69.92 + 3(1.70) = 75.02 inches

ALL

Page 28: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 28

EXERCISE (3)

โ‘ Heights of 18-year-old males have a bell-shaped distribution with

mean 69.6 inches and standard deviation 1.4 inches.

1. About what proportion of all such men are between 68.2 and

71 inches tall?

2. What interval centered on the mean should contain about

95% of all such men?

Page 29: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 29

EXERCISE (3) SOLUTIONโ‘ The observations that are within ONE standard deviation of the mean

าง๐‘ฅ - S าง๐‘ฅ + S69.6 โˆ’ 1.40 = 68.2 inches and 69.6 + 1.40 = 71.71 inches

68 %

โ‘ The observations that are within TWO standard deviation of the mean

าง๐‘ฅ - 2S าง๐‘ฅ + 2S69.6 โˆ’ 2(1.40) = 66.80 inches and 69.6 + 2(1.40) = 72.40 inches

95 %

โ‘ The observations that are within THREE standard deviation of the mean

าง๐‘ฅ - 3S าง๐‘ฅ + 3S69.6 - 3(1.40) = 65.40 inches and 69.6 + 3(1.40) = 73.80 inches

ALL

Page 30: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 30

EXERCISE (4)

โ‘ Scores on IQ tests have a bell-shaped distribution with mean ฮผ=100

and standard deviation ฯƒ=10. Discuss what the Empirical Rule

implies concerning individuals with IQ scores of 110, 120, and 130.

โ–ช Approximately 68% of the IQ scores in the population lie between 90

and 110,

โ–ช Approximately 95% of the IQ scores in the population lie between 80

and 120, and

โ–ช Approximately 99.7% of the IQ scores in the population lie between

70 and 130.

Page 31: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 31

EXERCISE (4)

Page 32: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 32

EXERCISE (5)

โ‘A sample of size n=50 has mean เดฅ๐’™ =28 and standard deviation

s=3. Without knowing anything else about the sample,

โ–ช what can be said about the number of observations that lie in

the interval (22,34)?

โ–ช What can be said about the number of observations that lie

outside that interval?

Page 33: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 33

EXERCISE (5) SOLUTIONBy Chebyshevโ€™s Theorem:โ‘ The observations that are within TWO standard deviation of the

meanาง๐‘ฅ - 2S าง๐‘ฅ + 2S

28 โˆ’ 2(3) = 22 and 28 + 2(3) = 34 75 %

โ‘ The observations that are within THREE standard deviation of the mean

าง๐‘ฅ - 3S าง๐‘ฅ + 3S28 โˆ’ 3(3) = 19 and 28 + 3(3) = 37

89 %

Page 34: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 34

EXERCISE (5) SOLUTIONโ–ช The interval (22,34) is the one that is formed by adding and subtracting

two standard deviations from the mean.

By Chebyshevโ€™s Theorem,

โ–ช At least 75 % of the data are within this interval.

โ–ช Since 75 % of 50 is 37.5, this means that at least 37.5

observations are in this interval or at least 38 observations.

โ–ช If at least 75 % of the observations are in the interval, then at

most 25 % of them are outside it.

โ–ช Since 1/4 of 50 is 12.5, at most 12.5 observations are outside the

interval or 38 observations.

Page 35: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 35

EXERCISE (5) SOLUTION

Page 36: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 36

EXERCISE (6)

Page 37: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 37

EXERCISE (6)

Page 38: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 38

(2 Marks) Here are some summary statistics for the numbers of acres of

soybeans ุงู„ุตูˆูŠุงููˆู„ and peanuts ุงู„ุณูˆุฏุงู†ูŠุงู„ููˆู„ harvested per county in

Alabama in 2009, for counties that planted those crops.

In one southern county, there were 9 thousand acres of soybeansharvested and 3 thousand acres of peanuts harvested. Relative to itscrop, which plant had a better harvest?

EXERCISE (7)

Page 39: STATISTICAL ANALYSIS - LECTURE 3

CORRELATION AND REGRESSION

STATISTICAL ANALYSIS - LECTURE 3 39

Page 40: STATISTICAL ANALYSIS - LECTURE 3

CORRELATION: CROSSTABS AND SCATTER PLOTS

STATISTICAL ANALYSIS - LECTURE 3 40

Page 41: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 41

CORRELATION: CROSSTABS AND SCATTER PLOTS

Page 42: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 42

CORRELATION: CROSSTABS AND SCATTER PLOTS

Page 43: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 43

CROSSTABS (CONTINGENCY TABLES)

Page 44: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 44

CROSSTABS (CONTINGENCY TABLES)

Page 45: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 45

CROSSTABS (CONTINGENCY TABLES)

Page 46: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 46

CROSSTABS (CONTINGENCY TABLES)

Page 47: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 47

CROSSTABS (CONTINGENCY TABLES)

Page 48: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 48

CROSSTABS (CONTINGENCY TABLES)

Page 49: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 49

CROSSTABS (CONTINGENCY TABLES)

Page 50: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 50

CROSSTABS (CONTINGENCY TABLES)

Page 51: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 51

SCATTER PLOTS

Page 52: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 52

SCATTER PLOTS

Page 53: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 53

SCATTER PLOTSTYPE OF RELATIONSHIP โ€œDIRECTIONโ€

POSITIVE, NEGATIVE, OR NO RELATIONSHIP

Page 54: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 54

SCATTER PLOTSSTRENGTH OF RELATIONSHIP

STRONG OR WEAK RELAIONSHIP

Page 55: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 55

SCATTER PLOTS

Page 56: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 56

CORRELATION

Page 57: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 57

CORRELATION

Page 58: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 58

CORRELATION

Page 59: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 59

CORRELATION

Page 60: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 60

CORRELATION

Page 61: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 61

CORRELATION

Page 62: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 62

CORRELATION

0.93

Page 63: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 63

CORRELATION

Page 64: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 64

CORRELATION

The coefficient of determination ๐‘Ÿ2

0 โ‰ค r2 โ‰ค +1

Example :

If ๐‘Ÿ2 = 0.86

This means that 86% of the variation in y can

be described by x.

Page 65: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 65

REGRESSION ANALYSIS

โ–ช Deals with finding the best relationship

between Y and X, quantifying the strength of

that relationship, and using methods that

allow for prediction of the response values

given values of the X.

LINEAR REGRESSION

Page 66: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 66

SIMPLE REGRESSION

LINEAR REGRESSION

Page 67: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 67

LINEAR REGRESSION

Page 68: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 68

LINEAR REGRESSION

Page 69: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 69

LINEAR REGRESSION

Page 70: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 70

LINEAR REGRESSION

เท๐’š = ๐’ƒ๐Ÿ๐’™ + ๐’ƒ๐ŸŽ

LINEAR REGRESSION EQUATION

๐’ƒ๐Ÿ = ๐’“๐‘บ๐’š

๐‘บ๐’™๐’ƒ๐Ÿ =

ฯƒ ๐’™ โˆ’ เดฅ๐’™ (๐’š โˆ’ เดฅ๐’š)

ฯƒ ๐’™ โˆ’ เดฅ๐’™ ๐Ÿ๐’ƒ๐Ÿ =

๐’ฯƒ๐’™๐’š โˆ’ ฯƒ๐’™ฯƒ๐’š

๐’ฯƒ๐’™๐Ÿ โˆ’ (ฯƒ๐’™)๐Ÿ

๐’ƒ๐ŸŽ = เดฅ๐’š โˆ’ ๐’ƒ๐Ÿ เดฅ๐’™

๐’ƒ๐ŸŽ =ฯƒ๐‘ฆ โˆ’ ๐‘1 ฯƒ๐‘ฅ

๐‘›

There are different forms of these

formulasDonโ€™t get

confused please โ˜บ

Page 71: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 71

EXAMPLE (1) ON CORRELATION

0.93

Page 72: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 72

(X) 50 100 200 300

ZX -1.01 -0.56 0.34 1.24

(Y) 50 70 70 95

ZY -.1.15 -0.07 -0.07 1.29

ZX * ZY 1.17 0.04 -0.02 1.60ฯƒ ZX * ZY

= 2.78

EXAMPLE (1) ON CORRELATION

ZX = ๐‘ฟ โˆ’ ๐‘ฟ

๐‘บ๐‘ฟ

ZY = ๐’€ โˆ’ ๐’€

๐‘บ๐’€

r = ฯƒ ZX โˆ— ZY

๐’โˆ’๐Ÿ= 2.78

๐Ÿ‘= 0.93

Strong Positive or Direct Relationship

Page 73: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 73

ii. What would be the values of Y at X = 400 and 500?

เทœ๐‘ฆ = ๐‘๐‘œ + ๐‘1 ๐‘‹

b1 = r ๐‘†๐‘ฆ

๐‘†๐‘ฅ= 0.93

18.4

110.9= 0.154

bo = เดค๐‘ฆ โ€“ b1 าง๐‘ฅ = 71.3- (0.154)(162.5) = 46.275

เทœ๐‘ฆ = ๐‘๐‘œ + ๐‘1 ๐‘‹ = เท๐’š = ๐ŸŽ. ๐Ÿ๐Ÿ“๐Ÿ’ ๐‘ฟ + ๐Ÿ’๐Ÿ”. ๐Ÿ๐Ÿ•๐Ÿ“

At X = 400 ๐’š ฬ‚ = ๐ŸŽ. ๐Ÿ๐Ÿ“๐Ÿ’ (๐Ÿ’๐ŸŽ๐ŸŽ) + ๐Ÿ’๐Ÿ”. ๐Ÿ๐Ÿ•๐Ÿ“ = 107.875

At X = 500 ๐’š ฬ‚ = ๐ŸŽ. ๐Ÿ๐Ÿ“๐Ÿ’ (๐Ÿ“๐ŸŽ๐ŸŽ) + ๐Ÿ’๐Ÿ”. ๐Ÿ๐Ÿ•๐Ÿ“ = 123.275

EXAMPLE (1) ON CORRELATION

Page 74: STATISTICAL ANALYSIS - LECTURE 3

STATISTICAL ANALYSIS - LECTURE 3 74

iii. What is the error in the predicted value of Y at X = 200 and 300?

เท๐’š = ๐ŸŽ. ๐Ÿ๐Ÿ“๐Ÿ’ ๐‘ฟ + ๐Ÿ’๐Ÿ”. ๐Ÿ๐Ÿ•๐Ÿ“

At X = 200 เท๐’š = ๐ŸŽ. ๐Ÿ๐Ÿ“๐Ÿ’ ๐Ÿ๐ŸŽ๐ŸŽ + ๐Ÿ’๐Ÿ”. ๐Ÿ๐Ÿ•๐Ÿ“ = ๐Ÿ•๐Ÿ•. ๐ŸŽ๐Ÿ•๐Ÿ“

Error = |๐’š ฬ‚ - y| = |77.075 โ€“ 70|= 7.075

At X = 300 เท๐’š = ๐ŸŽ. ๐Ÿ๐Ÿ“๐Ÿ’ ๐Ÿ‘๐ŸŽ๐ŸŽ + ๐Ÿ’๐Ÿ”. ๐Ÿ๐Ÿ•๐Ÿ“ = ๐Ÿ—๐Ÿ. ๐Ÿ’๐Ÿ•๐Ÿ“

Error = |๐’š ฬ‚ - y| = |92.475 โ€“ 95|= 2.525

EXAMPLE (1) ON CORRELATION