21
Statistics

Statistics. The collection, evaluation, and interpretation of data

Embed Size (px)

Citation preview

Page 1: Statistics. The collection, evaluation, and interpretation of data

Statistics

Page 2: Statistics. The collection, evaluation, and interpretation of data

StatisticsThe collection, evaluation, and interpretation of data

Page 3: Statistics. The collection, evaluation, and interpretation of data

Statistics

Statistics

Descriptive Statistics

Describe collected data

Inferential Statistics

Generalize and evaluate a population based on sample data

Page 4: Statistics. The collection, evaluation, and interpretation of data

Data

Values that possess names or labelsColor of M&Ms, breed of dog, etc.

Categorical or Qualitative Data

Values that represent a measurable quantityPopulation, number of M&Ms, number of defective parts, etc.

Numerical or Quantitative Data

Page 5: Statistics. The collection, evaluation, and interpretation of data

Data CollectionSampling

Random

Systematic

Stratified

Cluster

Convenience

Page 6: Statistics. The collection, evaluation, and interpretation of data

Graphic Data RepresentationHistogram

Frequency Polygons

Bar Chart

Pie Chart

Frequency distribution graph

Frequency distribution graph

Categorical data graph

Categorical data graph %

Page 7: Statistics. The collection, evaluation, and interpretation of data

Measures of Central Tendency

xx

n

Most frequently used measure of central tendency

Strongly influenced by outliers – very large or very small values

Mean Arithmetic average

Sum of all data values divided by the number of data values within the array

x

Page 8: Statistics. The collection, evaluation, and interpretation of data

Measures of Central Tendency

xx

n

48, 63, 62, 49, 58, 2, 63, 5, 60, 59, 55Determine the mean value of

(48 63 62 49 58 2 63 5 60 59 55)

11x

524

11x

47.64x

Page 9: Statistics. The collection, evaluation, and interpretation of data

Measures of Central TendencyMedian

Data value that divides a data array into two equal groups

Data values must be ordered from lowest to highest

Useful in situations with skewed data and outliers (e.g., wealth management)

Page 10: Statistics. The collection, evaluation, and interpretation of data

Measures of Central TendencyDetermine the median value of

Organize the data array from lowest to highest value.

59, 60, 62, 63, 63

48, 63, 62, 49, 58, 2, 63, 5, 60, 59, 55

Select the data value that splits the data set evenly.

2, 5, 48, 49, 55, 58,

Median = 58

What if the data array had an even number of values?

60, 62, 63, 635, 48, 49, 55, 58, 59,

Page 11: Statistics. The collection, evaluation, and interpretation of data

Measures of Central Tendency

Usually the highest point of curve

ModeMost frequently occurring response within a data array

May not be typical

May not exist at all

Modal, bimodal, and multimodal

Page 12: Statistics. The collection, evaluation, and interpretation of data

Measures of Central TendencyDetermine the mode of

48, 63, 62, 49, 58, 2, 63, 5, 60, 59, 55Mode = 63

Determine the mode of

48, 63, 62, 59, 58, 2, 63, 5, 60, 59, 55Mode = 63 & 59 Bimodal

Determine the mode of

48, 63, 62, 59, 48, 2, 63, 5, 60, 59, 55Mode = 63, 59, & 48 Multimodal

Page 13: Statistics. The collection, evaluation, and interpretation of data

Data Variation

Range

Standard Deviation

Measure of data scatter

Difference between the lowest and highest data value

Square root of the variance

Page 14: Statistics. The collection, evaluation, and interpretation of data

Range

63 2R

Calculate by subtracting the lowest value from the highest value.

R h l

2, 5, 48, 49, 55, 58, 59, 60, 62, 63, 63

Calculate the range for the data array.

R h l

61R

Page 15: Statistics. The collection, evaluation, and interpretation of data

Standard Deviation 2

( 1)

x x

N

σ for a sample, not population

1.Calculate the mean

2. Subtract the mean from each value and then square it.

3.Sum all squared differences.

4. Divide the summation by the number of values in the array minus 1.

5. Calculate the square root of the product.

x

Page 16: Statistics. The collection, evaluation, and interpretation of data

Standard Deviation 2

( 1)

x x

N

2, 5, 48, 49, 55, 58, 59, 60, 62, 63, 63

Calculate the standard deviation for the data array.

x

x

n

524

111. 47.64

2. (2 - 47.64)2 = 2083.01

(5 - 47.64)2 = 1818.17

(48 - 47.64)2 = 0.13

(49 - 47.64)2 = 1.85

(55 - 47.64)2 = 54.17

(58 - 47.64)2 = 107.33

(59 - 47.64)2 = 129.05

(60 - 47.64)2 = 152.77

(62 - 47.64)2 = 206.21

(63 - 47.64)2 = 235.93

(63 - 47.64)2 = 235.93

2x x

Page 17: Statistics. The collection, evaluation, and interpretation of data

Standard Deviation 2

( 1)

x xs

N

2, 5, 48, 49, 55, 58, 59, 60, 62, 63, 63

Calculate the standard deviation for the data array.

4.

2083.01 + 1818.17 + 0.13 + 1.85 + 54.17 + 107.33 + 129.05 + 152.77 + 206.21 + 235.93 + 235.93

2x x

= 5,024.55

5. 2

( 1)

x x

N

5,024.55

10 502.46

6. 2

( 1)

x xs

N

502.46S = 22.42

Page 18: Statistics. The collection, evaluation, and interpretation of data

Graphing Frequency DistributionNumerical assignment of each outcome of a chance experiment

A coin is tossed 3 times. Assign the variable X to represent the frequency of heads occurring in each toss.

Toss Outcome X Value

HHH

HHT

HTH

THH

HTT

THT

TTH

TTT

3

2

2

2

1

1

1

0

X =1 when?

HTT,THT,TTH

Page 19: Statistics. The collection, evaluation, and interpretation of data

Graphing Frequency DistributionThe calculated likelihood that an outcome variable will occur within an experiment

Toss Outcome X value

HHH

HHT

HTH

THH

HTT

THT

TTH

TTT

3

2

2

2

1

1

1

0

x P(x)

0

1

2

3

xx

a

FP

F

0

1P

8

1

3P

8

2

3P

8

3

1P

8

Page 20: Statistics. The collection, evaluation, and interpretation of data

Graphing Frequency Distribution

x P(x)

0

1

2

3

0

1P

8

1

3P

8

2

3P

8

3

1P

8 x

Histogram

Page 21: Statistics. The collection, evaluation, and interpretation of data

HistogramAvailable airplane passenger seats one week before departure

What information does the histogram provide the airline carriers?

What information does the histogram provide prospective customers?

open seats

perc

ent

of t

he t

ime