28
Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Embed Size (px)

Citation preview

Page 1: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Collecting Samples

Chapter 2.3 – In Search of Good DataMathematics of Data Management (Nelson)MDM 4U

Page 2: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Why Sampling?

sampling is done because a census is too expensive or time consuming

the challenge is being confident that the sample represents the population accurately

convenience sampling occurs when you simply take data from the most convenient place (for example collecting data by walking around the hallways at school)

convenience sampling is not representative

Page 3: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Random Sampling

representative samples involve random sampling random events are events that are considered to

occur by chance random numbers are described as numbers that

occur without pattern random numbers can be generated using a

calculator, computer or random number table random choice is used as a method of selecting

members of a population without introducing bias

Page 4: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

1) Simple Random Sampling

this sample requires that all selections be equally likely and that all combinations of selections be equally likely

the sample is likely to be representative of the population

but if it isn’t, this is due to chance example: put entire population’s names in a

hat and draw them

Page 5: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

2) Systematic Random Sampling you decide to sample a fixed percent of the

population using some random starting point and you select every nth individual

n in this case is determined by calculating the sampling interval (population size ÷ sample size)

example: you decide to sample 10% of 800 people. n = 800 ÷ 80 = 10, so generate a random number between 1 and 10, start at this number and sample each 10th person

Page 6: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

3) Stratified Random Sampling the population is divided into groups called

strata (which could be MSIPs or grades) a simple random sample is taken of each of

these with the size of the sample determined by the size of the strata

example: sample CPHS students by MSIP, with samples randomly drawn from each MSIP (the number drawn is relative to the size of the MSIP)

Page 7: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

4) Cluster Random Sampling

the population is ordered in terms of groups (like MSIPs or schools)

groups are randomly chosen for sampling and then all members of the chosen groups are surveyed

example: student attitudes could be measured by randomly choosing schools from across Ontario, and then surveying all students in those

Page 8: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

5) Multistage Random Sampling groups are randomly chosen from a

population, subgroups from these groups are randomly chosen and then individuals in these subgroups are then randomly chosen to be surveyed

example: to understand student attitudes a school might randomly choose one period, randomly choose MSIPs during that period then randomly choose students from within those MSIPs

Page 9: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

6) Destructive Sampling

sometimes the act of sampling will restrict the ability of a surveyor to return the element to the population

example: cars used in crash tests cannot be used again for the same purpose

example: taking a standardized test (individuals may acquire learning during sampling that would introduce bias if they were tested again)

Page 10: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Example: do students at CPHS want a longer lunch? (sample 200 of 800 students) Simple Random Sampling

Create a numbered, alphabetic list of students, have a computer generate 200 names and interview those students

Systematic Random Sampling sampling interval n = 800 ÷ 200 = 4 generate a random number between 1 and 4 start with that number on the list and interview

each 4th person after that

Page 11: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Example: do students at CPHS want a longer lunch? Stratified Random Sampling

group students by grade and have a computer generate a random group of names from each grade to interview

the number of students interviewed from each grade is probably not equal, rather it is proportional to the size of the group

if there were 180 grade 10’s, 180 ÷ 800 = 0.225 800 × 0.225 = 45 so we would need to interview 45

grade 10s

Page 12: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Example: do students at CPHS want a shorter lunch? Cluster Random Sampling

randomly choose enough MSIPs to sample 200 students

say there are 25 per MSIP, we would need 8 MSIPs, since 8 x 25 = 200

interview every student in each of these rooms

Page 13: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Example: do UCDSB high school students want a shorter lunch? Multi Stage Random Sampling

Randomly select 4 high schools in the UCDSB Randomly choose a period from 1-5 randomly choose 2 MSIP classes of 25 interview every student in those MSIPs 200 students total

Page 14: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Sample Size

the size of the sample will have an effect on the reliability of the results

the larger the better factors:

variability in the population (the more variation, the larger the sample required to capture that variation)

degree of precision required for the survey the sampling method chosen

Page 15: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Techniques for Experimental Studies Experimental studies are different from

studies where a population is sampled as it exists

in experimental studies some treatment is applied to some part of the population

however, the effect of the treatment can only be known in comparison to some part of the population that has not received the treatment

Page 16: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Vocabulary treatment group

the part of the experimental group that receives the treatment

control group the part of the experimental group that does not

receive the treatment

Page 17: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Vocabulary

placebo a treatment that has no value given to the control

group to reduce bias in the experiment no one knows whether they are receiving the

treatment or not (why?) double-blind test

in this case, neither the subjects or the researchers doing the testing know who has received the treatment (why?)

Page 18: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Class Activity

How would we take a sample of the students in this class using the following methods:

a) 40% Simple Random Sampling b) 20% Systematic Random Sampling? c) 40% Stratified Random Sampling? d) 50% Cluster Random Sampling?

Page 19: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

MSIP / Homework

p. 99 #1, 5, 6, 10, 11 For 6b, see Ex. 1 on p. 95

Page 20: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Creating Survey Questions

Chapter 2.4 – In Search of Good DataMathematics of Data Management (Nelson)MDM 4U

Page 21: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Surveys

A series of carefully designed questions Commonly used in data collection Types: interview, questionnaire, mail-in,

telephone, WWW, focus group Bad questions lead to bad data (why?) Good questions may create good data (why?)

Page 22: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Question Styles

Open Questions respondents answer in their own words (written) gives a wide variety of answers may be difficult to interpret offer the possibility of gaining data you did not know

existed sometimes used in preliminary collection of

information, to gain a sense of what is going on can clarify the categories of data you will end up

studying

Page 23: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Question Styles

Closed Questions questions that require the respondent to select from

pre-defined responses responses can be easily analyzed the options present may bias the result options may not represent the population and the

researcher may miss what is going on sometimes used after an initial open ended survey

as the researcher has already identified data categories

Page 24: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Types of Survey Questions

Information ex: Circle your Age: 16 17 18+

Checklist ex: Courses currently being taken (check all

that apply):□ Data Management

□ Advanced Functions

□ Calculus and Vectors

□ Other _________________

Page 25: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Types of Survey Questions

Ranking Questions ex: rank the following in order of importance (1 =

most important, 3 = least important) __ Work __ Homework __ Sports

Rating Questions ex: How would you rate your teacher?

(choose 1)

□ Great □ Fabulous □ Incredible □ Outstanding

Page 26: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

Questions should…

Be simple, relevant, specific, readable Be written without jargon/slang,

abbreviations, acronyms, etc. Not lead the respondents Allow for all possible responses on closed Qs Be sensitive to the respondents

Page 27: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

MSIP / Homework

Complete p. 105 #1, 2, 4, 5, 8, 9, 12

Page 28: Collecting Samples Chapter 2.3 – In Search of Good Data Mathematics of Data Management (Nelson) MDM 4U

References

Wikipedia (2004). Online Encyclopedia. Retrieved September 1, 2004 from http://en.wikipedia.org/wiki/Main_Page