27
SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONS Let’s do a little review :

Embed Size (px)

Citation preview

Page 1: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

Page 2: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

A statistic is a numerical descriptive measure of a sample.

Page 3: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

A statistic is a numerical descriptive measure of a sample.

A parameter is a numerical descriptive measure of a population.

Page 4: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

A statistic is a numerical descriptive measure of a sample.

A parameter is a numerical descriptive measure of a population.

From a statistical point of view, a population can be thought of as a complete set of measurements ( counts ), either existing or conceptual. But it is sometimes impossible to get the entire population as our data set.

Page 5: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

A statistic is a numerical descriptive measure of a sample.

A parameter is a numerical descriptive measure of a population.

From a statistical point of view, a population can be thought of as a complete set of measurements ( counts ), either existing or conceptual. But it is sometimes impossible to get the entire population as our data set.

Imagine if we wanted to find the average height of 15 year old males in the United States? We might take years to get all the data which makes it impractical.

Page 6: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

A statistic is a numerical descriptive measure of a sample.

A parameter is a numerical descriptive measure of a population.

From a statistical point of view, a population can be thought of as a complete set of measurements ( counts ), either existing or conceptual. But it is sometimes impossible to get the entire population as our data set.

Imagine if we wanted to find the average height of 15 year old males in the United States? We might take years to get all the data which makes it impractical.

So we SAMPLE the population by maybe only measuring the height of fifty random 15 – year old males in each state.

Page 7: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSLet’s do a little review :

A statistic is a numerical descriptive measure of a sample.

A parameter is a numerical descriptive measure of a population.

From a statistical point of view, a population can be thought of as a complete set of measurements ( counts ), either existing or conceptual. But it is sometimes impossible to get the entire population as our data set.

Imagine if we wanted to find the average height of 15 year old males in the United States? We might take years to get all the data which makes it impractical.

So we SAMPLE the population by maybe only measuring the height of fifty random 15 – year old males in each state.

We can then make inferences on the entire population based on the sample data or our population parameter.

Page 8: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSWe have different types of inferences :

Page 9: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSWe have different types of inferences :

1. Estimation – we can estimate the value of a population parameter

Page 10: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSWe have different types of inferences :

1. Estimation – we can estimate the value of a population parameter

2. Testing – we can formulate a decision about the value of a population parameter

Page 11: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSWe have different types of inferences :

1. Estimation – we can estimate the value of a population parameter

2. Testing – we can formulate a decision about the value of a population parameter

3. Regression – we can make predictions or forecasts about the value of a statistical variable

Page 12: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSWe have different types of inferences :

1. Estimation – we can estimate the value of a population parameter

2. Testing – we can formulate a decision about the value of a population parameter

3. Regression – we can make predictions or forecasts about the value of a statistical variable

We have worked with frequency tables, relative frequencies, and histograms to help understand sampling distributions.

Page 13: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSWe have different types of inferences :

1. Estimation – we can estimate the value of a population parameter

2. Testing – we can formulate a decision about the value of a population parameter

3. Regression – we can make predictions or forecasts about the value of a statistical variable

We have worked with frequency tables, relative frequencies, and histograms to help understand sampling distributions.

We have also understood what a normal distribution looks like and how we can calculate probabilities of data items occurring or not occurring.

Page 14: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSA SAMPLING DISTRIBUTION is a probability distribution of a sample statistic based on all possible simple random samples of the same size from the same population.

Page 15: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSA SAMPLING DISTRIBUTION is a probability distribution of a sample statistic based on all possible simple random samples of the same size from the same population.

Here is an example :

Becky has a project for her biometrics class where she has to track the measurements of trout taken from a lake where only children under 12 years old can fish. The kids can keep 5 trout per day and the trout must be 6 inches or longer to keep. Hundreds of children fished the lake last summer so Becky has a lot of data. To make things easier, she decides to choose 100 children at random and list their lengths of each trout kept.

Page 16: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSA SAMPLING DISTRIBUTION is a probability distribution of a sample statistic based on all possible simple random samples of the same size from the same population.

Here is an example :

Becky has a project for her biometrics class where she has to track the measurements of trout taken from a lake where only children under 12 years old can fish. The kids can keep 5 trout per day and the trout must be 6 inches or longer to keep. Hundreds of children fished the lake last summer so Becky has a lot of data. To make things easier, she decides to choose 100 children at random and list their lengths of each trout kept.

Becky can find the mean and standard deviation of the sample she took but what does that really tell her ?

Page 17: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSA SAMPLING DISTRIBUTION is a probability distribution of a sample statistic based on all possible simple random samples of the same size from the same population.

Here is an example :

Becky has a project for her biometrics class where she has to track the measurements of trout taken from a lake where only children under 12 years old can fish. The kids can keep 5 trout per day and the trout must be 6 inches or longer to keep. Hundreds of children fished the lake last summer so Becky has a lot of data. To make things easier, she decides to choose 100 children at random and list their lengths of each trout kept.

Becky can find the mean and standard deviation of the sample she took but what does that really tell her ?

She decides to organize the data into a frequency table. Since relative frequencies can be thought of as probabilities, we effectively have a probability distribution. She organizes the data according to the mean ( for each child.

Page 18: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSA SAMPLING DISTRIBUTION is a probability distribution of a sample statistic based on all possible simple random samples of the same size from the same population.

Here is an example :

Becky has a project for her biometrics class where she has to track the measurements of trout taken from a lake where only children under 12 years old can fish. The kids can keep 5 trout per day and the trout must be 6 inches or longer to keep. Hundreds of children fished the lake last summer so Becky has a lot of data. To make things easier, she decides to choose 100 children at random and list their lengths of each trout kept.

Becky can find the mean and standard deviation of the sample she took but what does that really tell her ?

She decides to organize the data into a frequency table. Since relative frequencies can be thought of as probabilities, we effectively have a probability distribution. She organizes the data according to the mean ( for each child.

The results appear on the next page…

Page 19: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONS Class Limits

Class Lower Upper

1 8.39 8.76 1 0.01

2 8.77 9.14 5 0.05

3 9.15 9.52 10 0.10

4 9.53 9.90 19 0.19

5 9.91 10.28 27 0.27

6 10.29 10.66 18 0.18

7 10.67 11.04 12 0.12

8 11.05 11.42 5 0.05

9 11.43 11.80 3 0.03

The frequency represents how many children had a mean within the class limits. The relative frequency represents the theoretical probability of a child having a mean within the class limit.

Page 20: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONS Class Limits

Class Lower Upper

1 8.39 8.76 1 0.01

2 8.77 9.14 5 0.05

3 9.15 9.52 10 0.10

4 9.53 9.90 19 0.19

5 9.91 10.28 27 0.27

6 10.29 10.66 18 0.18

7 10.67 11.04 12 0.12

8 11.05 11.42 5 0.05

9 11.43 11.80 3 0.03

The frequency represents how many children had a mean within the class limits. The relative frequency represents the theoretical probability of a child having a mean within the class limit.

You can see the data is pretty close to a normal curve.In fact, if we made a histogram of the data, it is somewhat bell shaped.

Page 21: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONS Class Limits

Class Lower Upper

1 8.39 8.76 1 0.01

2 8.77 9.14 5 0.05

3 9.15 9.52 10 0.10

4 9.53 9.90 19 0.19

5 9.91 10.28 27 0.27

6 10.29 10.66 18 0.18

7 10.67 11.04 12 0.12

8 11.05 11.42 5 0.05

9 11.43 11.80 3 0.03

The frequency represents how many children had a mean within the class limits. The relative frequency represents the theoretical probability of a child having a mean within the class limit.

You can see the data is pretty close to a normal curve.In fact, if we made a histogram of the data, it is somewhat bell shaped.

So we have sampling distribution because :a) We have a probability distribution

Page 22: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONS Class Limits

Class Lower Upper

1 8.39 8.76 1 0.01

2 8.77 9.14 5 0.05

3 9.15 9.52 10 0.10

4 9.53 9.90 19 0.19

5 9.91 10.28 27 0.27

6 10.29 10.66 18 0.18

7 10.67 11.04 12 0.12

8 11.05 11.42 5 0.05

9 11.43 11.80 3 0.03

The frequency represents how many children had a mean within the class limits. The relative frequency represents the theoretical probability of a child having a mean within the class limit.

You can see the data is pretty close to a normal curve.In fact, if we made a histogram of the data, it is somewhat bell shaped.

So we have sampling distribution because :a) We have a probability distributionb) We have a sampling statistic of

Page 23: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONS Class Limits

Class Lower Upper

1 8.39 8.76 1 0.01

2 8.77 9.14 5 0.05

3 9.15 9.52 10 0.10

4 9.53 9.90 19 0.19

5 9.91 10.28 27 0.27

6 10.29 10.66 18 0.18

7 10.67 11.04 12 0.12

8 11.05 11.42 5 0.05

9 11.43 11.80 3 0.03

The frequency represents how many children had a mean within the class limits. The relative frequency represents the theoretical probability of a child having a mean within the class limit.

You can see the data is pretty close to a normal curve.In fact, if we made a histogram of the data, it is somewhat bell shaped.

So we have sampling distribution because :a) We have a probability distributionb) We have a sampling statistic of c) We have our sample statistic based on simple

random samples from the population

Page 24: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSSo a SAMPLING DISTRIBUTION is :

1. A probability distribution

Page 25: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSSo a SAMPLING DISTRIBUTION is :

1. A probability distribution2. That has a sample statistic - mean , variance , standard deviation ,

( is called “p hat” and has to do with proportion, we’ll get to later )

Page 26: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSSo a SAMPLING DISTRIBUTION is :

1. A probability distribution2. That has a sample statistic - mean , variance , standard deviation ,

( is called “p hat” and has to do with proportion, we’ll get to later )

3. Based on all possible samples of the same size

Page 27: SAMPLING DISTRIBUTIONS Let’s do a little review :

SAMPLING DISTRIBUTIONSSo a SAMPLING DISTRIBUTION is :

1. A probability distribution2. That has a sample statistic - mean , variance , standard deviation ,

( is called “p hat” and has to do with proportion, we’ll get to later )

3. Based on all possible samples of the same size

We will use sampling distributions to make statistical inferences