1 Review • Sections 2.1-2.4 • Descriptive Statistics – Qualitative (Graphical) – Quantitative (Graphical) – Summation Notation – Qualitative (Numerical) • Central Measures (mean, median, mode and modal class) • Shape of the Data


Review

• Sections 2.1-2.4
• Descriptive Statistics
  – Qualitative (Graphical)
  – Quantitative (Graphical)
  – Summation Notation
  – Qualitative (Numerical)
• Central Measures (mean, median, mode and modal class)
• Shape of the Data
• Measures of Variability


Outlier

A data measurement which is unusually large or small compared to the rest of the data.

Usually from:
– Measurement or recording error
– Measurement from a different population
– A rare, chance event


Advantages/Disadvantages Mean

• Disadvantages
  – is sensitive to outliers
• Advantages
  – always exists
  – very common
  – nice mathematical properties


Advantages/Disadvantages Median

• Disadvantages
  – does not take all data into account
• Advantages
  – always exists
  – easily calculated
  – not affected by outliers
  – nice mathematical properties


Advantages/Disadvantages Mode

• Disadvantages
  – does not always exist: there could be just one of each data point
  – sometimes there is more than one
• Advantages
  – appropriate for qualitative data
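The trade-offs listed for the mean, median and mode can be checked numerically. The following is a minimal Python sketch using the standard `statistics` module; all data values are made up for illustration and do not come from the slides.

```python
import statistics

# Hypothetical data (not from the slides): five typical values, then one outlier added.
data = [2, 3, 3, 4, 5]
with_outlier = data + [50]

# The mean moves a lot when the outlier is added; the median barely moves.
mean_before = statistics.mean(data)              # 17 / 5 = 3.4
mean_after = statistics.mean(with_outlier)       # 67 / 6
median_before = statistics.median(data)          # middle value: 3
median_after = statistics.median(with_outlier)   # (3 + 4) / 2 = 3.5

# The mode may not be unique: multimode returns every most-frequent value.
modes_tied = statistics.multimode([1, 1, 2, 2, 3])
# With just one of each data point, every value is tied and no mode stands out.
modes_none = statistics.multimode([1, 2, 3])

# The mode also works for qualitative data.
colour_mode = statistics.mode(["red", "blue", "red"])

print("mean:  ", mean_before, "->", mean_after)
print("median:", median_before, "->", median_after)
print("tied modes:", modes_tied, "| all distinct:", modes_none)
print("qualitative mode:", colour_mode)
```

`statistics.multimode` needs Python 3.8 or later.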


Review

A data set is skewed if one tail of the distribution has more extreme observations than the other.

http://www.shodor.org/interactivate/activities/SkewDistribution/

Skewed to the right: The mean is bigger than the median ($M < \bar{x}$).

Skewed to the left: The mean is less than the median ($\bar{x} < M$).

Symmetric: The mean and median are equal ($\bar{x} = M$).
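As a rough illustration of these rules of thumb, the sketch below compares the mean and median of a data set and reports the direction they suggest. The function name `skew_direction` and the three data sets are hypothetical, chosen only to show each case.

```python
import statistics

def skew_direction(values):
    """Compare mean and median, following the rule of thumb above."""
    mean = statistics.mean(values)
    median = statistics.median(values)
    if mean > median:
        return "skewed to the right"
    if mean < median:
        return "skewed to the left"
    return "symmetric by this rule"

# Hypothetical data sets (not from the slides).
right_tail = [1, 2, 2, 3, 3, 3, 4, 20]         # one large value pulls the mean up
left_tail = [1, 17, 18, 18, 19, 19, 19, 20]    # one small value pulls the mean down
balanced = [1, 2, 3, 4, 5]

for name, values in [("right_tail", right_tail),
                     ("left_tail", left_tail),
                     ("balanced", balanced)]:
    print(name, "->", skew_direction(values))
```

This only compares two numbers, so it is a quick check rather than a full description of the shape; a histogram is still the better guide.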


Numerical Measures of Variability

These measure the variability or spread of the data.

[Figures: three relative frequency histograms (vertical axis from 0.1 to 0.5), each with the mean $\bar{x}$ and median $M$ marked on the horizontal axis. The first two cover the values 0 to 5; the third covers 0 to 7.]

These measure the variability, spread or relative standing of the data:

– Range
– Standard Deviation
– Percentile Ranking
– Z-score


Range

The range of quantitative data is denoted R and is given by:

R = Maximum – Minimum


In the previous examples the first two graphs have a range of 5 and the third has a range of 7.

Disadvantages:
– Since the range uses only two values in the sample, it is very sensitive to outliers.
– It gives no idea of how much of the data lies near the center.
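Both disadvantages are easy to see in a small sketch. The helper `data_range` and the sample values below are hypothetical and are not taken from the slides.

```python
def data_range(values):
    """R = Maximum - Minimum."""
    return max(values) - min(values)

# Hypothetical samples with the same range but very different shapes:
clustered = [0, 5, 5, 5, 5, 5, 5, 10]    # almost everything sits at the center
spread_out = [0, 1, 3, 4, 6, 7, 9, 10]   # values spread evenly

# A single outlier changes the range dramatically.
without_outlier = [4, 5, 5, 6]
with_outlier = [4, 5, 5, 6, 100]

print(data_range(clustered), data_range(spread_out))
print(data_range(without_outlier), data_range(with_outlier))
```

The first pair shares a range of 10 even though one sample is tightly clustered, which is the motivation for looking at a measure that uses every data point.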


What else?

We want a measure which shows how far away most of the data points are from the mean.


One option is to keep track of the average distance each point is from the mean.


Mean Deviation

The Mean Deviation is a measure of dispersion which calculates the distance between each data point and the mean, and then finds the average of these distances.

$$\text{Mean Deviation} = \frac{\text{sum of } |x_i - \bar{x}|}{n} = \frac{\sum |x_i - \bar{x}|}{n}$$
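A minimal Python sketch of this definition follows. The function name `mean_deviation` and the data are illustrative assumptions. The sketch also shows why the absolute values are needed: without them, the deviations from the mean cancel out.

```python
def mean_deviation(values):
    """Average absolute distance of each data point from the mean."""
    n = len(values)
    mean = sum(values) / n
    return sum(abs(x - mean) for x in values) / n

# Hypothetical data (not from the slides); the mean is 3.
data = [1, 2, 3, 4, 5]
mean = sum(data) / len(data)

# Signed deviations are -2, -1, 0, 1, 2 and sum to zero.
signed_total = sum(x - mean for x in data)

# Absolute deviations are 2, 1, 0, 1, 2, so the mean deviation is 6 / 5.
print("sum of signed deviations:", signed_total)
print("mean deviation:", mean_deviation(data))
```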


Advantages: The mean deviation takes into account all values in the sample.

Disadvantages: The absolute value signs are very cumbersome in mathematical equations.

Standard Deviation

The sample variance, denoted by s², is:

$$s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1}$$

The sample standard deviation is

$$s = \sqrt{s^2}.$$

The sample standard deviation is much more commonly used as a measure of variability.
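The two definitions translate directly into code. This is a sketch written from the formulas, with hypothetical function names and data; the standard library's `statistics.variance` and `statistics.stdev` compute the same sample quantities and are used here only as a cross-check.

```python
import math
import statistics

def sample_variance(values):
    """s^2: sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)

def sample_std(values):
    """s: the square root of the sample variance."""
    return math.sqrt(sample_variance(values))

# Hypothetical data (not from the slides): squared deviations are 4, 1, 0, 1, 4.
data = [1, 2, 3, 4, 5]

print("s^2 =", sample_variance(data))   # 10 / 4 = 2.5
print("s   =", sample_std(data))
print("library check:", statistics.variance(data), statistics.stdev(data))
```

Because s is in the same units as the data while s² is in squared units, s is usually the easier of the two to interpret.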


Example

Let the following be data from a sample:

2, 4, 3, 2, 5, 2, 1, 4, 5, 2.

Find:

a) The range

b) The standard deviation of this sample.

a) The range:

$$R = 5 - 1 = 4$$

b) The standard deviation of this sample. First find the mean:

$$\bar{x} = \frac{2+4+3+2+5+2+1+4+5+2}{10} = \frac{30}{10} = 3$$

$x_i$:                  2   4   3   2   5   2   1   4   5   2
$(x_i - \bar{x})$:     -1   1   0  -1   2  -1  -2   1   2  -1
$(x_i - \bar{x})^2$:    1   1   0   1   4   1   4   1   4   1

$$s^2 = \frac{\sum (x_i - \bar{x})^2}{n - 1} = \frac{1+1+0+1+4+1+4+1+4+1}{10 - 1} = 2$$

Standard Deviation:

$$s = \sqrt{s^2} = \sqrt{2} \approx 1.41$$
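The same worked example can be reproduced step by step in Python. This sketch mirrors the table of deviations and squared deviations; the variable names are illustrative choices.

```python
import math

sample = [2, 4, 3, 2, 5, 2, 1, 4, 5, 2]
n = len(sample)

mean = sum(sample) / n                      # 30 / 10 = 3.0
sample_range = max(sample) - min(sample)    # 5 - 1 = 4

# One row of the table per list.
deviations = [x - mean for x in sample]
squared = [d ** 2 for d in deviations]

variance = sum(squared) / (n - 1)           # 18 / 9 = 2.0
std_dev = math.sqrt(variance)

print("x_i:           ", sample)
print("x_i - mean:    ", deviations)
print("(x_i - mean)^2:", squared)
print("R =", sample_range, " s^2 =", variance, " s =", round(std_dev, 2))
```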

More Standard Deviation

There is a “short cut” formula for finding the variance and the standard deviation:

$$s^2 = \frac{\sum x_i^2 - \frac{\left(\sum x_i\right)^2}{n}}{n - 1}$$

Use this to find the standard deviation of the previous example:

$x_i$:      2   4   3   2   5   2   1   4   5   2     (sum = 30)
$x_i^2$:    4  16   9   4  25   4   1  16  25   4     (sum = 108)

$$s^2 = \frac{108 - \frac{30^2}{10}}{10 - 1} = 2$$

$$s = \sqrt{s^2} = \sqrt{2} \approx 1.41$$
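For readers wondering why the short cut gives the same answer as the definition, the following derivation is a sketch (it is not part of the original slides). It expands the squared deviations and uses $\bar{x} = \frac{\sum x_i}{n}$:

$$\sum (x_i - \bar{x})^2 = \sum x_i^2 - 2\bar{x}\sum x_i + n\bar{x}^2 = \sum x_i^2 - \frac{2\left(\sum x_i\right)^2}{n} + \frac{\left(\sum x_i\right)^2}{n} = \sum x_i^2 - \frac{\left(\sum x_i\right)^2}{n}$$

Dividing both sides by $n - 1$ gives the short cut formula. With the example's totals, $108 - \frac{30^2}{10} = 18$, which matches the sum of squared deviations $1+1+0+1+4+1+4+1+4+1 = 18$ from the definition.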

Like the mean, we are also interested in the population variance (i.e. your sample is the whole population) and the population standard deviation.

The population variance and standard deviation are denoted σ² and σ respectively.

Note: The formula for population variance is slightly different from the formula for sample variance. It divides by n instead of n − 1:

$$\sigma^2 = \frac{\sum (x_i - \bar{x})^2}{n} = \frac{\sum x_i^2 - \frac{\left(\sum x_i\right)^2}{n}}{n}$$
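To see the effect of dividing by n rather than n − 1, the sketch below applies both formulas to the data from the earlier worked example, using Python's standard `statistics` module (`variance` and `stdev` for a sample, `pvariance` and `pstdev` for a population). Treating that sample as a whole population is an assumption made only for comparison.

```python
import statistics

data = [2, 4, 3, 2, 5, 2, 1, 4, 5, 2]

# Sample formulas divide the sum of squared deviations (18) by n - 1 = 9.
sample_var = statistics.variance(data)
sample_sd = statistics.stdev(data)

# Population formulas divide the same sum by n = 10.
population_var = statistics.pvariance(data)
population_sd = statistics.pstdev(data)

print("sample:     s^2 =", sample_var, " s =", round(sample_sd, 2))
print("population: sigma^2 =", population_var, " sigma =", round(population_sd, 2))
```

The population values are always a little smaller for the same data, and the gap shrinks as n grows.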


Example - Calculator

Find the mean, median, mode, range and standard deviation for the following sample of data:

2.3, 2.5, 2.6, 2.7, 3.0, 3.4,

3.4, 3.5, 3.5, 3.5, 3.7, 3.8

Use your calculator


Using your Calculator

• Change calculator to statistics mode. (SD if you have it)

• Enter a data value and then press the data key (its label varies by calculator).

• Keep entering data, pressing the data key after each value, until complete.

• To obtain the summary data, find the $\bar{x}$ key for the sample mean and the s key or n-1 key to display the sample standard deviation.


Answer:

$\bar{x} \approx 3.16$

M = 3.4

Mode = 3.5

Range = 1.5

$s \approx 0.51$
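If a calculator with a statistics mode is not at hand, the same summary can be obtained with Python's standard `statistics` module. This is offered as an alternative check, not as the method the slides use.

```python
import statistics

data = [2.3, 2.5, 2.6, 2.7, 3.0, 3.4,
        3.4, 3.5, 3.5, 3.5, 3.7, 3.8]

mean = statistics.mean(data)
median = statistics.median(data)       # average of the 6th and 7th sorted values
mode = statistics.mode(data)           # 3.5 appears three times
data_range = max(data) - min(data)
std_dev = statistics.stdev(data)       # sample standard deviation (n - 1)

print("mean   =", round(mean, 2))
print("median =", median)
print("mode   =", mode)
print("range  =", round(data_range, 1))
print("s      =", round(std_dev, 2))
```

The rounding is only for display; floating-point subtraction can leave tiny errors in the last digits of the range.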


Example – Using Standard Deviation

Here are eight test scores from a previous Stats 201 class:

35, 59, 70, 73, 75, 81, 84, 86.

The mean and standard deviation are 70.4 and 16.7, respectively.

We wish to know if any of our data points are outliers, that is, whether they don’t fit with the general trend of the rest of the data.

To find this we calculate the number of standard deviations each point is from the mean.

To simplify things for now, work out which data points are within

a) one standard deviation from the mean, i.e. within $(\bar{x} - s,\ \bar{x} + s)$:

$$(70.4 - 16.7,\ 70.4 + 16.7) = (53.7,\ 87.1)$$

59, 70, 73, 75, 81, 84, 86

b) two standard deviations from the mean, i.e. within $(\bar{x} - 2s,\ \bar{x} + 2s)$:

$$(70.4 - 2(16.7),\ 70.4 + 2(16.7)) = (37.0,\ 103.8)$$

59, 70, 73, 75, 81, 84, 86

c) three standard deviations from the mean, i.e. within $(\bar{x} - 3s,\ \bar{x} + 3s)$:

$$(70.4 - 3(16.7),\ 70.4 + 3(16.7)) = (20.3,\ 120.5)$$

35, 59, 70, 73, 75, 81, 84, 86
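The interval checks above can be sketched in Python as follows. The helper `within_k_sd` is a hypothetical name, and the code uses the unrounded mean and standard deviation rather than 70.4 and 16.7, which does not change which scores fall in each interval.

```python
import statistics

scores = [35, 59, 70, 73, 75, 81, 84, 86]
mean = statistics.mean(scores)     # 70.375, reported as 70.4
s = statistics.stdev(scores)       # about 16.7 (sample standard deviation)

def within_k_sd(values, center, spread, k):
    """Return the values lying inside (center - k*spread, center + k*spread)."""
    low, high = center - k * spread, center + k * spread
    return [x for x in values if low < x < high]

for k in (1, 2, 3):
    low, high = mean - k * s, mean + k * s
    inside = within_k_sd(scores, mean, s, k)
    print(f"k={k}: ({low:.1f}, {high:.1f}) contains {inside}")
```

The score of 35 is the only one outside the two standard deviation interval, which makes it the candidate outlier in this data set.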