16
Scatter Diagrams - I know how to interpret a scatter diagram - I know how to identify a correlation - I know the difference between cause and effect

Scatter Diagrams - I know how to interpret a scatter diagram - I know how to identify a correlation - I know the difference between cause and effect

Embed Size (px)

Citation preview

Scatter Diagrams- I know how to interpret a scatter diagram- I know how to identify a correlation- I know the difference between cause and effect

Investigation

• We need to measure the following for everyone:• Foot length• Arm Span• Height• Hand Span• Circumference of Head• Make sure you know each of these details for

yourself, so write it down!

Graphs we can plot

• Foot length v Height• Arm Span v Height• Hand Span v Height• Circumference of Head v Hand Span• Etc, etc, etc…

© Boardworks Ltd 2005 4 of 44

Scatter graphs

As height increases, weight increases.

What does this scatter graph show about the relationship between the height and weight of twenty Grade 10 boys?

This is called a positive correlation.This is called a positive correlation.

40

45

50

55

60

140 150 160 170 180 190Height (cm)

Wei

gh

t (k

g)

© Boardworks Ltd 2005 5 of 44

Scatter graphs

What does this scatter graph show?

50

55

60

65

70

75

80

85

0 20 40 60 80 100 120Number of cigarettes smoked in a week

Lif

e e

xp

ect

ancy

It shows that life expectancy decreases as the number of cigarettes smoked increases.

This is called a negative correlation.This is called a negative correlation.

© Boardworks Ltd 2005 6 of 44

Interpreting scatter graphs

Scatter graphs can show a relationship between two variables.

This relationship is called correlation.

Correlation is a general trend. Some data items will not fit this trend, as there are often exceptions to a rule. They are called outliers.

Scatter graphs can show:

positive correlation: as one variable increases, so does the other variable

negative correlation: as one variable increases, the other variable decreases

zero correlation: no linear relationship between the variables.

Correlation can be weak or strong.

© Boardworks Ltd 2005 7 of 44

Identifying correlation from scatter graphs

strong positive correlation

strong negative correlation

zero correlation

weak positive correlation

weak negative correlation.

0

5

10

15

20

25

0 5 10 15 20 25

A

0

2

4

6

8

10

12

0 2 4 6 8 10 12

B

0

5

10

15

20

25

0 5 10 15 20 25

C

0

5

10

15

20

25

0 5 10 15 20 25

D

20

0

5

10

15

20

25

0 5 10 15 20 25

E

0

5

10

15

20

25

0 5 10 15 20 25

F

Decide whether each of the following graphs shows,

Correlation Coefficient

• The correlation coefficient gives us a numerical description of how strong the relationship between two variables is.

• The range of the value of r (the correlation coefficient) is from -1 to +1.

• From 0 to 1, there is a positive relationship. The closer to 1, the stronger the relationship.

• For -1 to 0, there is a negative relationship. The closer to -1, the stronger the relationship.

© Boardworks Ltd 2005 9 of 44

Relationships between two variables

© Boardworks Ltd 2005 10 of 44

0

5

10

15

20

25

0 5 10 15 20 25

Strong positive correlation

0

5

10

15

20

25

0 5 10 15 20 25

Strong negative correlation

0

5

10

15

20

25

0 5 10 15 20 25

Weak negative correlation

The line of best fit

Lines of best fit are drawn on graphs by eye so that there are a roughly equal number of points above and below the line.

Look at these examples,

0

5

10

15

20

25

0 5 10 15 20 25

Weak positive correlation

Notice that the stronger the correlation, the closer the points are to the line.

If the gradient is positive, the correlation is positive and if the gradient is negative, then the correlation is also negative.

© Boardworks Ltd 2005 11 of 44

The line of best fit

The line does not have to pass through the origin.

When drawing the line of best fit remember the following points,

For an accurate line of best fit, find the mean for each variable. This forms a coordinate, which can be plotted. The line of best fit should pass through this point.

The line of best fit can be used to predict one variable from another.

It should not be used for predictions outside the range of data used.

The equation of the line of best fit can be found using the slope and the y-intercept.

© Boardworks Ltd 2005 12 of 44

The line of best fit

50

55

60

65

70

75

80

85

0 20 40 60 80 100Number of cigarettes smoked in a week

Lif

e ex

pec

tan

cyThis graph shows the relationship between life expectancy and the number of cigarettes smoked in a week.

© Boardworks Ltd 2005 13 of 44

Solving problems with lines of best fit

50

55

60

65

70

75

80

85

0 20 40 60 80 100 120Number of cigarettes smoked in a week

Life

exp

ecta

ncy

Work out an estimate for the equation of the line of best fit using the slope and y-intercept.

Use the equation to estimate the life expectancy for someone who smokes 10 cigarettes a day.

Why would an estimate of the number of cigarettes smoked for a life expectancy of 40 years not be reliable?

Can you explain why there are so many outliers for this data?

© Boardworks Ltd 2005 14 of 44

Cause and effect

A study finds a positive correlation between the number of cars in a town and the number of babies born.

“Buying a new car can help you get pregnant!”

The local newspaper reports,

Does the study support this conclusion?

What might this be in the example above?

Correlation does not necessarily imply that there is a causal relationship between the two variables. There may be some other cause.

© Boardworks Ltd 2005 15 of 44

Cause and effect

A study finds a negative correlation between the number of sleds sold and the temperature.

Explain.

Does the study support this conclusion?

“If you want it to snow, go out and buy a sled!”

The local newspaper reports,

© Boardworks Ltd 2005 16 of 44

Homework

• Page 39 #s 1 - 5