21
Elementary Statistics Professor K. Leppel

Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Embed Size (px)

Citation preview

Page 1: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Elementary Statistics

Professor K. Leppel

Page 2: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Introduction and Data Collection

Page 3: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Definitions

Population: All observations of interest in a given context

Sample: A subset of a population

Page 4: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Example 1Suppose you are the president of Widener University.

Population: All Widener students.

Sample: All Widener students taking classes in the School of Business Administration.

Page 5: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Example 2Suppose you are the head of the Economics Dept.

Population: All Widener students taking Economics classes.

Sample: All Widener students taking Professor Leppel’s classes.

Page 6: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

More Definitions

Population parameter or parameter:numerical characteristics of a population

Sample statistics:numerical characteristics of a sample

Page 7: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Deductive vs. Inductive Reasoning

Deductive:

population sample

general specific

Probability

Example: Suppose you have a bowl with 2 red marbles & 3 green ones. If you pick one, what is the probability that the marble is green?

Page 8: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Deductive vs. Inductive Reasoning

Inductive:

sample population

specific general

Statistics

Example: If you take a poll & note the voting preferences of this sample, we will be able to draw some conclusions about the votes of the population.

Page 9: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Sampling with & without Replacement

Sampling without replacement: once an element of a population has been selected as part of a sample, it cannot be selected again.

Sampling with replacement: an element of a population that has been selected as part of a sample can be selected again.

Page 10: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Random sampling vs. non-random sampling

Random sampling or probability sampling:sampling in which the probability of inclusion of each element in the population is known.

Non-random sampling or judgment sampling:sampling in which judgment is exercised in deciding which elements of population to include in the sample.

Page 11: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Simple Random Sample

A sample of n elements is a simple random sample if sampling is performed such that every combination of n elements has an equal chance of being the sample selected.

Page 12: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Two Types of Studies

1. Observational or comparative studies

The analyst examines historical relationships among variables of interest.

Problem: Deriving cause & effect relationships from historical data is difficult because important environmental factors are generally not controlled & not stable.

Page 13: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

2. Direct experimentation or controlled studies

The investigator directly manipulates factors that affect a variable of interest.

Page 14: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Control Group

To understand the effect of a “treatment,” we need to compare a group that received a treatment with a group that received no treatment. The “no-treatment” group is the control group.

Page 15: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Two types of errors

1. Systematic errors or bias:

These errors cause measurement to be incorrect in some systematic way.

They are caused by inaccuracies or deficiencies in the measuring instrument.

Systematic errors persist even when the sample size is increased.

Page 16: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

These errors arise from a large number of uncontrolled factors - chance.

Random errors decrease on average as the sample size is increased.

2. Random error or sampling error:

Page 17: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Some of the variables with which you will work are qualitative and some are quantitative.

Qualitative variables are categorical and can be subdivided into nominal and ordinal measures.

Quantitative variables are numerical and can be subdivided into interval and ratio measures.

Page 18: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Qualitative (categorical) variables that are nominal have no order to them.

Example 1: U.S. citizenship (yes, no)

Example 2: On what continent were you born? (N. America, S. America, Africa, Antarctica, Asia, Australia, Europe)

Sex (male, female) is sometimes considered as a nominal variable. However, if you take into consideration intersex individuals, who can have any of a variety of anatomical conditions that don’t fit the typical definitions of female or male, you no longer have a simple nominal measure.

Page 19: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Qualitative (categorical) variables that are ordinal have an implied ranking of a characteristic.

Example 1: student class (freshman, sophomore, junior, senior)

Example 2: customer service satisfaction(very dissatisfied, somewhat dissatisfied, neither satisfied nor dissatisfied, somewhat satisfied, very satisfied)

Page 20: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Example 1: IntelligenceA person with an IQ of 150 is much more intelligent than a person with an IQ of 100, while a person with an IQ of 140 is somewhat more intelligent than a person with an IQ of 125.However, what would an IQ of 0 mean? And a person with an IQ of 200 is not twice as smart as one with an IQ of 100.

Example 2: SAT scores

They are measured on an ordered scale in which the difference between measurements is meaningful.However, there is no true zero point where there is none of a specific characteristic. Also, if the measure is twice as large, that does not imply that there is twice as much of the characteristic.

Switching to quantitative (numerical) variables, interval variables are a bit tricky.

Page 21: Elementary Statistics Professor K. Leppel. Introduction and Data Collection

Example 1: IncomeA person with zero income has no earnings or other source of money. And someone who has income of $100,000 has twice as much money coming in as someone who has income of $50,000.

Example 2: Age

Quantitative variables (numerical) that are ratio variables have true zero points and ratios work in the expected way.