38
© The McGraw-Hill Companies, Inc., 2000 2-1 2-1 Chapter 2 Chapter 2 Describing, Describing, Exploring and Exploring and Comparing Data Comparing Data

© The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

Embed Size (px)

Citation preview

Page 1: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-12-1

Chapter 2Chapter 2

Describing, Exploring and Describing, Exploring and Comparing DataComparing Data

Page 2: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-22-2 OutlineOutline

2-1 Introduction 2-2 Frequency Distributions 2-3 Visualizing Data

Page 3: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-32-3 ObjectivesObjectives

Organize data using frequency distributions.

Represent data in frequency distributions graphically using histograms, frequency polygons and ogives.

Page 4: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-52-5 2-2 Frequency Distributions2-2 Frequency Distributions

When data are collected in original form, they are called raw dataraw data.

When the raw data is organized into a frequency distributionfrequency distribution, the frequencyfrequency will be the number of values in a specific class of the distribution.

Page 5: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-62-6 2-2 Frequency Distributions2-2 Frequency Distributions

A frequency distributionfrequency distribution is the organizing of raw data in table form, using classes and frequencies.

The following slide shows an example of a frequency distribution.

Page 6: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-72-72-2 Three Types of Frequency 2-2 Three Types of Frequency

DistributionsDistributions

Categorical frequency distributionsCategorical frequency distributions -- can be used for data that can be placed in specific categories, such as nominal- or ordinal-level data.

Examples -Examples - political affiliation, religious affiliation, blood type etc.

Page 7: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-82-82-2 Blood Type Frequency 2-2 Blood Type Frequency

Distribution - Distribution - Example

Class Frequency Percent

A 5 20

B 7 28

O 9 36

AB 4 16

Page 8: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-92-92-2 Ungrouped Frequency 2-2 Ungrouped Frequency

DistributionsDistributions

Ungrouped frequency distributions -Ungrouped frequency distributions - can be used for data that can be enumerated and when the range of values in the data set is not large.

Examples -Examples - number of miles your instructors have to travel from home to campus, number of girls in a 4-child family etc.

Page 9: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-102-102-2 Number of Miles Traveled2-2 Number of Miles Traveled -

Example

Class Frequency

5 24

10 16

15 10

Page 10: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-112-112-2 Grouped Frequency 2-2 Grouped Frequency

DistributionsDistributions

Grouped frequency distributions -Grouped frequency distributions - can be used when the range of values in the data set is very large. The data must be grouped into classes that are more than one unit in width.

Examples -Examples - the life of boat batteries in hours.

Page 11: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-122-122-2 Lifetimes of Boat Batteries2-2 Lifetimes of Boat Batteries -

Example

Classlimits

ClassBoundaries

Cumulative

24 - 30 23.5 - 37.5 4 4

38 - 51 37.5 - 51.5 14 18

52 - 65 51.5 - 65.5 7 25

frequencyFrequency

Page 12: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-132-132-2 Terms Associated with a 2-2 Terms Associated with a

Grouped Frequency DistributionGrouped Frequency Distribution

Class limits represent the smallest and largest data values that can be included in a class.

In the lifetimes of boat batteries example, the values 24 and 30 of the first class are the class class limitslimits.

The lower classlower class limit is 24 and the upper classupper class limit is 30.

Page 13: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-142-14

The class boundariesclass boundaries are used to separate the classes so that there are no gaps in the frequency distribution.

2-2 Terms Associated with a 2-2 Terms Associated with a Grouped Frequency DistributionGrouped Frequency Distribution

Page 14: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-152-15

The class widthclass width for a class in a frequency distribution is found by subtracting the lower (or upper) class limit of one class minus the lower (or upper) class limit of the previous class.

2-2 Terms Associated with a 2-2 Terms Associated with a Grouped Frequency DistributionGrouped Frequency Distribution

Page 15: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-162-162-2 Guidelines for Constructing 2-2 Guidelines for Constructing

a Frequency Distributiona Frequency Distribution

There should be between 5 and 20 classes.

The class width should be an odd number.

The classes must be mutually exclusive.

Page 16: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-172-172-2 Guidelines for Constructing 2-2 Guidelines for Constructing

a Frequency Distributiona Frequency Distribution

The classes must be continuous. The classes must be exhaustive. The class must be equal in width.

Page 17: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-182-182-2 Procedure for Constructing 2-2 Procedure for Constructing

a Grouped Frequency Distributiona Grouped Frequency Distribution

Find the highest and lowest value. Find the range. Select the number of classes

desired. Find the width by dividing the range

by the number of classes and rounding up.

Page 18: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-192-19

Select a starting point (usually the lowest value); add the width to get the lower limits.

Find the upper class limits. Find the boundaries. Tally the data, find the frequencies and

find the cumulative frequency.

2-2 Procedure for Constructing 2-2 Procedure for Constructing a Grouped Frequency Distributiona Grouped Frequency Distribution

Page 19: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-202-20

In a survey of 20 patients who smoked, the following data were obtained. Each value represents the number of cigarettes the patient smoked per day. Construct a frequency distribution using six classes. (The data is given on the next slide.)

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 20: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-212-21

10 8 6 14

22 13 17 19

11 9 18 14

13 12 15 15

5 11 16 11

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 21: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-222-22

Step 1:Step 1: Find the highest and lowest values: H = 22 and L = 5.

Step 2:Step 2: Find the range: R = H – L = 22 – 5 = 17.

Step 3:Step 3: Select the number of classes desired. In this case it is equal to 6.

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 22: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-232-23

Step 4:Step 4: Find the class width by dividing the range by the number of classes. Width = 17/6 = 2.83. This value is rounded up to 3.

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 23: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-242-24

Step 5:Step 5: Select a starting point for the lowest class limit. For convenience, this value is chosen to be 5, the smallest data value. The lower class limits will be 5, 8, 11, 14, 17 and 20.

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 24: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-252-25

Step 6:Step 6: The upper class limits will be 7, 10, 13, 16, 19 and 22. For example, the upper limit for the first class is computed as 8 - 1, etc.

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 25: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-262-26

Step 7:Step 7: Find the class boundaries by subtracting 0.5 from each lower class limit and adding 0.5 to the upper class limit.

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 26: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-272-27

Step 8:Step 8: Tally the data, write the numerical values for the tallies in the frequency column and find the cumulative frequencies.

The grouped frequency distribution is shown on the next slide.

2-2 Grouped Frequency Distribution - 2-2 Grouped Frequency Distribution - Example

Page 27: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-282-28

Class Limits Class Boundaries Frequency Cumulative Frequency

05 to 07 4.5 - 7.5 2 2

08 to 10 7.5 - 10.5 3 5

11 to 13 10.5 - 13.5 6 11

14 to 16 13.5 - 16.5 5 16

17 to 19 16.5 - 19.5 3 1920 to 22 19.5 - 22.5 1 20

Note: The dash “-” represents “to”.

Page 28: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-292-29 2-3 Visualizing Data2-3 Visualizing Data

The three most commonly used The three most commonly used graphs in research are:graphs in research are:

The histogram. The frequency polygon. The cumulative frequency graph,

or ogive (pronounced o-jive).

Page 29: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-302-30

The histogramhistogram is a graph that displays the data by using vertical bars of various heights to represent the frequencies.

2-3 Visualizing Data2-3 Visualizing Data

Page 30: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-312-31 Example of a HistogramExample of a Histogram

2017141185

6

5

4

3

2

1

0

Number of Cigarettes Smoked per Day

Freq

uenc

y

Page 31: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-322-32

A frequency polygonfrequency polygon is a graph that displays the data by using lines that connect points plotted for frequencies at the midpoint of classes. The frequencies represent the heights of the midpoints.

2-3 2-3 Visualizing DataVisualizing Data

Page 32: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-332-33 Example of a Frequency PolygonExample of a Frequency Polygon

Frequency Polygon

262320171411852

6

5

4

3

2

1

0

Number of Cigarettes Smoked per Day

Fre

quen

cy

Page 33: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-342-34

A cumulative frequency graphcumulative frequency graph or ogiveogive is a graph that represents the cumulative frequencies for the classes in a frequency distribution.

2-3 Visualizing Data2-3 Visualizing Data

Page 34: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-352-35 Example of an OgiveExample of an Ogive

262320171411852

20

10

0

Number of Cigarettes Smoked per Day

Cum

ulat

ive

Fre

quen

cyOgive

Page 35: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-392-39 2-3 Other Types of Graphs2-3 Other Types of Graphs

Time series graph -Time series graph - A time series graph represents data that occur over a specific period of time.

Page 36: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-402-402-3 Other Types of Graphs - 2-3 Other Types of Graphs -

Time Series GraphTime Series Graph

19941993199219911990

89

87

85

83

81

79

77

75

Year

Rid

ersh

ip (

in m

illio

ns)PORT AUTHORITY TRANSIT RIDERSHIP

Page 37: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-412-41 2-3 Other Types of Graphs2-3 Other Types of Graphs

Pie graph -Pie graph - A pie graph is a circle that is divided into sections or wedges according to the percentage of frequencies in each category of the distribution.

Page 38: © The McGraw-Hill Companies, Inc., 2000 2-1 Chapter 2 Describing, Exploring and Comparing Data

© The McGraw-Hill Companies, Inc., 2000

2-422-422-3 Other Types of Graphs - 2-3 Other Types of Graphs - Pie Pie GraphGraph

Robbery (29, 12.1%)

Rape (34, 14.2%)

Assaults (164, 68.3%)

Homicide (13, 5.4%)

Pie Chart of the Number of Crimes Investigated byLaw Enforcement Officers In U.S. National Parks During 1995