22
Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Embed Size (px)

Citation preview

Page 1: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Chapter 6

Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Page 2: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Basic Logic And Terminology Problem: The

populations we wish to study are almost always so large that we are unable to gather information from every case.

Page 3: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Basic Logic And Terminology Solution: We

choose a sample -- a carefully chosen subset of the population – and use information gathered from the cases in the sample to generalize to the population.

Page 4: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Basic Logic And Terminology

Statistics are mathematical characteristics of samples.

Parameters are mathematical characteristics of populations.

Statistics are used to estimate parameters.

PARAMETER

STATISTIC

Page 5: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Samples

Must be representative of the population. Representative: The sample has the same

characteristics as the population. How can we ensure samples are

representative? Samples drawn according to the rule of EPSEM

(every case in the population has the same chance of being selected for the sample) are likely to be representative.

Page 6: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Six Key Terms Population and Sample Parameter and Statistic Representative and EPSEM

Page 7: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Sampling Techniques

Simple Random Sampling (SRS) Systematic Random Sampling Stratified Random Sampling Cluster Sampling

Page 8: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Simple Random Sampling (SRS)

To begin, we need: A list of the population. A method for selecting cases from the

population so each case has the same probability of being selected. The principle of EPSEM. A sample selected this way is very likely to

be representative of the population.

Page 9: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Example of SRS You want to know what % of students at a

large university work during the semester. Draw a sample of 500 from a list of all

students (N =20,000). Assume the list is available from the Registrar.

How can you draw names so every student has the same chance of being selected?

Page 10: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Example of SRS Each student has a unique, 6 digit ID

number that ranges from 000000 to 999999.

Use a Table of Random Numbers or a computer program to select 500 ID numbers with 6 digits each.

Each time a randomly selected 6 digit number matches the ID of a student, that student is selected for the sample.

Page 11: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Example of SRS Part of the list of selected students might

look like this:

ID # Student

015782 Andrea Mitchell

992541 Joseph Campbell

325798 Mary Margaret Hayes

772398 Keisha Magnum

648900 Kenneth Albright

Page 12: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Example of SRS

Continue until 500 names are selected. Disregard duplicate numbers. Ignore cases in which no student ID

matches the randomly selected number. After questioning each of these 500

students, you find that 368 (74%) work during the semester.

Page 13: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Systematic Random Sampling

Same situation: you want a sample of 500 students from a population of 20,000

500/20,000 = 1/400 Use a table of random numbers to

pick the first student in your sample Take every 400th students thereafter

Page 14: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Applying Logic and Terminology

In this research situation, identify each the following: Population Sample Statistic Parameter

Page 15: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Applying Logic and Terminology

Population = All 20,000 students. Sample = The 500 students

selected and interviewed. Statistic =74% (% of sample that

held a job during the semester). Parameter = % of all students in the

population who held a job.

Page 16: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

The Sampling Distribution

The single most important concept in inferential statistics.

Definition: The distribution of a statistic for all possible samples of a given size (N).

The sampling distribution is a theoretical concept.

Page 17: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

The Sampling Distribution Every application

of inferential statistics involves 3 different distributions.

Information from the sample is linked to the population via the sampling distribution.

Population

Sampling Distribution

Sample

Page 18: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

The Sampling Distribution: Properties

1. Normal in shape.2. Has a mean equal to the population

mean.3. Has a standard deviation (standard

error) equal to the population standard deviation divided by the square root of N.

Page 19: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

First Theorem If we begin with a trait that is normally

distributed across a population (IQ, height) and take an infinite number of equally sized random samples from that population, the sampling distribution of sample means will be normal.

Its mean will be the mean of the population and its standard deviation will be the population standard deviation divided by the square root of N.

Page 20: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Central Limit Theorem For any trait or variable, even those that

are not normally distributed in the population, as sample size grows larger, the sampling distribution of sample means will become normal in shape.

The importance of the Central Limit Theorem is that it removes the constraint of normality in the population.

Page 21: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

The Sampling Distribution

The S.D. is normal so we can use Appendix A to find areas.

We do not know the value of the population mean (μ) but the mean of the S.D. is the same value as μ.

We do not know the value of the pop. Stnd. Dev. (σ) but the Stnd. Dev. of the S.D. is equal to σ divided by the square root of N.

Page 22: Chapter 6 Introduction to Inferential Statistics: Sampling and the Sampling Distribution

Three Distributions

ShapeCentral

TendencyDispersion

Sample Varies_X s

Sampling Distribution

Normal μx=μ σx= σ/√N

Population Variesμ

σ