Upload
sajid-ali-maari
View
215
Download
0
Embed Size (px)
Citation preview
7/30/2019 10 Sampling
1/6
11/14/20
Sampling TechniquesAdvanced Research Methods
Muhammad Zaki Rashidi
Overview
Sampling techniques
Determination of sample size
Probability sampling
Simple random sampling Stratified random sampling
Systematic sampling
Cluster sampling
Non-probability sampling Quota, judgment, convenience sampling
Snowball sampling
Errors in sampling
Table of sample size
2
Basics of sampling I
A sample is a
part of a whole
to show what
the rest is like.
Sampling helps
to determine the
corresponding
value of the
population and
plays a vital role
in marketingresearch.
Samples offer many benefits:
Save costs:Less expensive to study thesample than the population.
Save time:Less time needed to studythe sample than the population .
Accuracy:Since sampling is done withcare and studies are conducted byskilled and qualified interviewers, theresults are expected to be accurate.
Destructive nature of elements:Forsome elements, sampling is the way totest, since tests destroy the elementitself.
Basics of sampling II
Limitations ofSampling
Demands more rigidcontrol in undertakingsample operation.
Minority and smallnessin number of sub-groupsoften render study to besuspected.
Accuracy level may beaffected when data issubjected to weighing.
Sample results are goodapproximations at best.
Sampling Process
Defining the
population
Developing
a sampling
Frame
Determining
Sample
Size
Specifying
Sample
Method
SELECTING THE SAMPLE
Sampling: Step 1
Defining the Universe
Universe or population is the
whole mass under study.
How to define a universe:
What constitutes the units of
analysis (HDB apartments)?
What are the sampling units
(HDB apartments occupied in
the last three months)?
What is the specific
designation of the units to be
covered (HDB in town area)?
What time period does the
data refer to (December 31,
1995)
Sampling: Step 2Establishing the Sampling
Frame
A sample frame is the list ofall elements in thepopulation (such astelephone directories,electoral registers, club
membership etc.) fromwhich the samples are
drawn.
A sample frame which doesnot fully represent anintended population willresult inframe error andaffect the degree of reliability
of sample result.
Step - 3
Determination of Sample Size
Sample size may be determined by using: Subjective methods (less sophisticated methods)
The rule of thumb approach: eg. 5% of population
Conventional approach: eg. Average of sample sizes of
similar other studies;
Cost basis approach: The number that can be studied
with the available funds;
Statistical formulae (more sophisticated methods)
Confidence interval approach.
7/30/2019 10 Sampling
2/6
11/14/20
Choice of Sample Size - Large
PopulationsSample Sizes
% Margin of Error 95% Confidence 99% Confidence
1 9,604 16,590
2 2,401 4,148
3 1,068 1,844
4 601 1,037
5 385 664
6 267 461
7 196 339
8 151 260
9 119 250
10 97 166
Source :Parker & Rea, Designing and Conducting Research
Table 1
Choice of Sample Size - Small Populations
Sample Sizes
95% Level of Confidence 99% Level of Confidence
N 3% 5% 10% 3% 5% 10%
500 250 218 81 250 250 124
1000 500 278 88 500 399 143
1500 624 306 91 750 460 150
2,000 696 323 92 959 498 154
3,000 788 341 94 1,142 544 158
5,000 880 357 95 1,347 586 161
10,000 965 370 96 1,556 622 164
20,000 1,014 377 96 1,687 642 165
50,000 1,045 382 96 1,777 655 166
100,000 1,058 383 96 1,809 659 166
Source : Parker & Rea, Designing and Conducting Research
Table 2
Conventional approach of Sample size determination using
Sample sizes used in different marketing research studies
TYPE OF STUDY MINIMUM
SIZE
TYPICAL
RANGE
Identifying a problem (e.g.marketsegmentation) 500 1000-2500
Problem-solving (e.g., promotion) 200 300-500
Product tests 200 300-500
Advertising (TV, Radio, or print Mediaper commercial or ad tested) 150 200-300Test marketing 200 300-500
Test market audits 10stores/outlets 10-20stores/outletsFocus groups 2 groups 4-12 groups
Sample size determination using statistical formulae:
The confidence interval approach
To determine sample sizes using statistical formulae,
researchers use the confidence interval approach based on the
following factors:
Desired level of data precision or accuracy;
Amount of variability in the population (homogeneity);
Level of confidence required in the estimates of population values.
Availability of resources such as money, manpower and time
may prompt the researcher to modify the computed sample
size.
Students are encouraged to consult any standard marketingresearch textbook to have an understanding of these formulae.
Step 4:
Specifying the sampling method
Probability Sampling Every element in the target population or universe
[sampling frame] has equal probability of being chosen in
the sample for the survey being conducted.
Scientific, operationally convenient and simple in theory.
Results may be generalized.
Non-Probability Sampling
Every element in the universe [sampling frame] does not
have equal probability of being chosen in the sample.
Operationally convenient and simple in theory.
Results may not be generalized.
Probability sampling
Appropriate for
homogeneous population
Simple random sampling
Requires the use of a
random number table.
Systematic sampling
Requires the sample frame
only,
No random number table is
necessary
Appropriate for
heterogeneous
population
Stratified sampling
Use of random number
table may be necessary
Cluster sampling
Use of random number
table may be necessary
Four types of probability sampling
7/30/2019 10 Sampling
3/6
11/14/20
Non-probability sampling
Four types of non-probability samplingtechniques
Very simple types, based on subjective criteria Convenient sampling
Judgmental sampling
More systematic and formal
Quota sampling
Special type
Snowball Sampling
Simple Random Sampling
Also called
random sampling
Simplest method
of probability
sampling
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
1 37 75 10 49 98 66 03 86 34 80 98 44 22 22 45 83 53 86 23 51
2 50 91 56 41 52 82 98 11 57 96 27 10 27 16 35 34 47 01 36 083 99 14 23 50 21 01 03 25 79 07 80 54 55 41 12 15 15 03 68 56
4 70 72 01 00 33 25 19 16 23 58 03 78 47 43 77 88 15 02 55 67
5 18 46 06 49 47 32 58 08 75 29 63 66 89 09 22 35 97 74 30 80
6 65 76 34 11 33 60 95 03 53 72 06 78 28 14 51 78 76 45 26 45
7 83 76 95 25 70 60 13 32 52 11 87 38 49 01 82 84 99 02 64 00
8 58 90 07 84 20 98 57 93 36 65 10 71 83 93 42 46 34 61 44 01
9 54 74 67 11 15 78 21 96 43 14 11 22 74 17 02 54 51 78 76 76
10 56 81 92 73 40 07 20 05 26 63 57 86 48 51 59 15 46 09 75 64
11 34 99 06 21 22 38 22 32 85 26 37 00 62 27 74 46 02 61 59 81
12 02 26 92 27 95 87 59 38 18 30 95 38 36 78 23 20 19 65 48 50
13 43 04 25 36 00 45 73 80 02 61 31 10 06 72 39 02 00 47 06 98
14 92 56 51 22 11 06 86 88 77 86 59 57 66 13 82 33 97 21 31 61
15 67 42 43 26 20 60 84 18 68 48 85 00 00 48 35 48 57 63 38 84
Need to use
Random
Number Table
How to Use Random Number Tables
________________________________________________
1. Assign a unique number to each population element in the
sampling frame. Start with serial number 1, or 01, or 001,
etc. upwards depending on the number of digits required.
2. Choose a random starting position.
3. Select serial numbers systematically across rows or down
columns.
4. Discard numbers that are not assigned to any population
element and ignore numbers that have already been
selected.
5. Repeat the selection process until the required number of
sample elements is selected.
How to Use a Table of Random Numbers to Select a Sample
Your marketing research lecturer wants to randomly select 20 students from
your class of 100 students. Here is how he can do it using a random number table.
Step 1: Assign all the 100 members of the population a unique number.You may
identify each element by assigning a two-digit number. Assign 01 to the first name
on the list, and 00 to the last name. If this is done, then the task of selecting the
sample will be easier as you would be able to use a 2-digit random number table.
NAME NUMBER NAME NUMBER
Adam, Tan 01 Tan Teck Wah 42
Carrol, Chan 08 Tay Thiam Soon 61
. .. Jerry Lewis 18 Teo Tai Meng 87
. .
Lim Chin Nam 26 . Yeo Teck Lan 99
Singh, Arun 30 Zailani bt Samat 00
Step 2: Select any starting point in the Random Number Table and find the first number that
corresponds to a number on the list of your population. In the example below, # 08 has been
chosen as the starting point and the first student chosen is Carol Chan.
10 09 73 25 33 76
37 54 20 48 05 64
08 42 26 89 53 19
90 01 90 25 29 09
12 80 79 99 70 80
66 06 57 47 17 34
31 06 01 08 05 45
Step 3: Move to the next number, 42 and select the person corresponding to that number into
the sample. #87 Tan Teck Wah
Step 4: Continue to the next number that qualifies and select that person into the sample.
# 26 -- Jerry Lewis, followed by #89, #53 and #19
Step 5: After you have selected the student # 19, go to the next line and choose #90. Continue
in the same manner until the full sample is selected. If you encounter a number selected
earlier (e.g., 90, 06 in this example) simply skip over it and choose the next number.
Starting point:move right to the endof the row, then downto the next row row;move left to the end,then down to the nextrow, and so on.
How to use random number table to select a random sampleSystematic sampling
Very similar to simple random sampling with one exception.
In systematic sampling only one random number is needed throughout the
entire sampling process.
To use systematic sampling, a researcher needs:
[i] a sampling frame of the population; and is needed.
[ii] a skip interval calculated as follows:
Skip interval = population list size
Sample size
Names are selected using the skip interval.
If a researcher were to select a sample of 1000 people using the local telephone
directory containing 215,000 listings as the sampling frame, skip interval is
[215,000/1000], or 215. The researcher can select every 215th name of the entire
directory [samplingframe], and select his sample.
7/30/2019 10 Sampling
4/6
11/14/20
Example: How to Take a Systematic Sample
Step 1: Select a listing of the population, say the City Telephone Directory, from which to
sample. Remember that the list will have an acceptable level of sample frame error.
Step 2: Compute the skip interval by dividing the number of entries in the directory by the
desired sample size.
Example: 250,000 names in the phone book, desired a sample size of 2500,
So skip interval = every 100
th
name
Step 3: Using random number(s), determine a starting position for sampling the list.
Example: Select: Random number for page number. (page 01)
Select: Random number of column on that page. (col. 03)
Select: Random number for name position in that column (#38, say, A..Mahadeva)Step 4: Apply the skip interval to determine which names on the list will be in the sample.
Example: A. Mahadeva (Skip 100 names), new name chosen is A Rahman b Ahmad.
Step 5: Consider the list as circular; that is, the first name on the list is now the initial name
you selected, and the last name is now the name just prior to the initially selected one.
Example: When you come to the end of the phone book names (Zs), just continue on
through the beginning (As).
Stratified sampling I
A three-stage process:
Step 1- Divide the population intohomogeneous, mutually exclusiveand collectively exhaustive
subgroups or strata using somestratification variable;
Step 2- Select an independentsimple random sample from eachstratum.
Step 3- Form the final sample byconsolidating all sample elementschosen in step 2.
May yield smaller standard errors ofestimators than does the simplerandom sampling. Thus precision canbe gained with smaller sample sizes.
Stratified samples can be:
Proportionate: involvingthe selection of sampleelements from each
stratum, such that the ratioof sample elements fromeach stratum to the samplesize equals that of thepopulation elements withineach stratum to the totalnumber of populationelements.
Disproportionate: thesample is disproportionatewhen the above mentionedratio is unequal.
To select a proportionate stratified sample of 20 members of the Island Video Club which has
100 members belonging to three language based groups of viewers i.e., English (E), Mandarin
(M) and Others (X).
Step 1: Identify each member from the membership list by his or her respective language groups
00 (E ) 20 (M) 40 (E ) 60 ( X ) 80 (M)
01 (E ) 21 ( X ) 41 ( X ) 61 (M) 81 (E )
02 ( X ) 22 (E ) 42 ( X ) 62 (M) 82 (E )
03 (E ) 23 ( X ) 43 (E ) 63 (E ) 83 (M)
04 (E ) 24 (E ) 44 (M) 64 (E ) 84 ( X )
05 (E ) 25 (M) 45 (E ) 65 ( X ) 85 (E )
06 (M) 26 (E ) 46 ( X ) 66 (M) 86 (E )
07 (M) 27 (M) 47 (M) 67 (E ) 87 (M)
08 (E ) 28 ( X ) 48 (E ) 68 (M) 88 ( X )
09 (E ) 29 (E ) 49 (E ) 69 (E ) 89 (E )
10 (M) 30 (E ) 50 (E ) 70 (E ) 90 ( X )
11 (E ) 31 (E ) 51 (M) 71 (E ) 91 (E )
12 ( X ) 32 (E ) 52 ( X ) 72 (M) 92 (M)
13 (M) 33 (M) 53 (M) 73 (E ) 93 (E )
14 (E ) 34 (E ) 54 (E ) 74 ( X ) 94 (E )
15 (M) 35 (M) 55 (E ) 75 (E ) 95 ( X )16 (E ) 36 (E ) 56 (M) 76 (E ) 96 (E )
17 ( X ) 37 (E ) 57 (E ) 77 (M) 97 (E )
18 ( X ) 38 ( X ) 58 (M) 78 (M) 98 (M)
19 (M) 39 ( X ) 59 (M) 79 (E ) 99 (E )
Selection of a proportionate Stratified Sample
Step 2: Sub-divide the club members into three homogeneous sub-groups or strata by the
language groups: English, Mandarin and others.
EnglishLanguage Mandarin Language Other Language
Stratum Stratum Stratum .
00 22 40 64 82 06 35 66 02 42
01 24 43 67 85 07 44 68 12 46
03 26 45 69 86 10 47 72 17 52
04 29 48 70 89 13 51 77 18 60
05 30 49 71 91 15 53 78 21 65
08 31 50 73 93 19 56 80 23 74
09 32 54 75 94 20 58 83 28 84
11 34 55 76 96 25 59 87 38 88
14 36 57 79 97 27 61 92 39 90
16 37 63 81 99 33 62 98 41 95
1.Calculate the overall sampling fraction, f, in the following manner:
f = n = 20 = 1 =N 100 5
where n = sample size and N = population size
0.2
Selection of a proportionate stratified sample II
Determine the number of sample elements (n1) to be selected from the English
language stratum. In this example, n1 = 50 x f = 50 x 0.2 =10. By using a simple
random sampling method [using a random number table] members whose numbersare 01, 03, 16, 30, 43, 48, 50, 54, 55, 75, are selected.
Next, determine the number of sample elements (n2) from the Mandarin language
stratum. In this example, n2 = 30 x f = 30 X 0.2 = 6. By using a simple random
sampling method as before, members having numbers 10,15, 27, 51, 59, 87 are
selected from the Mandarin language stratum.
In the same manner, the number of sample elements (n3) from the Other language
stratum is calculated. In this example, n3 = 20 x f = 20 X 0.2 = 4. For this stratum,
members whose numbers are 17, 18, 28, 38 are selected
These three different sets of numbers are now aggregated to obtain the ultimate
stratified sample as shown below.
S = (01, 03, 10, 15, 16, 17, 18, 27, 28, 30, 38, 43, 48, 50, 51, 54, 55, 59, 75, 87)
Selection of a proportionate stratified sample III
Cluster sampling
Is a type of sampling in which clusters or groups ofelements are sampled at the same time.
Such a procedure is economic, and it retains thecharacteristics of probability sampling.
A two-step-process:
Step 1- Defined population is divided into number ofmutually exclusive and collectively exhaustive subgroups orclusters;
Step 2- Select an independent simple random sample ofclusters.
One special type of cluster sampling is called area sampling, wherepieces of geographical areas are selected.
7/30/2019 10 Sampling
5/6
11/14/20
Example : One-stage and two-stage Cluster sampling
Consider the same Island Video Club example involving 100 club members:
Step 1: Sub-divide the club members into 5 clusters, each cluster containing 20 members.
Cluster
No. English Mandarin Others
1 00, 22, 40, 64, 82 06, 35, 66 02, 42
0 1, 24 , 4 3, 67 , 8 5 0 7, 44 , 6 8 1 2, 46
2 03, 26, 45, 69, 86 10, 47, 72 17, 52
0 4, 29 , 4 8, 70 , 8 9 1 3, 51 , 7 7 1 8, 60
3 05, 30, 49, 71, 91 15, 53, 78 21, 65
0 8, 31 , 5 0, 73 , 9 3 1 9, 56 , 8 0 2 3, 74
4 09, 32, 54, 75, 94 20, 58, 83 28, 84
1 1, 34 , 5 5, 76 , 9 6 2 5, 59 , 8 7 3 8, 88
5 14, 36, 57, 79, 97 27, 61, 92 39, 90
1 6, 37 , 6 3, 81 , 9 9 3 3, 62 , 9 8 4 1, 95
Step 2: Select one of the 5 clusters. If cluster 4 is selected, then all its elements (i.e. Club
Members with numbers 09, 11, 32, 34, 54, 55, 75, 76, 94, 96, 20, 25, 58, 59, 83, 87, 28, 38, 84,
88) are selected.
Step 3: If a two-stage cluster sampling is desired, the researcher may randomly select 4 members
from each of the five clusters. In this case, the sample will be different from that shown in step 2
above.
Stratified Sampling vs Cluster Sampling
Stratified Sampling Cluster Sampling
1.The target population is sub-divided
into a few subgroups or strata, each
containing a large number of elements.
1.The target population is sub-
divided into a large number of
sub-population or clusters, eachcontaining a few elements.
2.Within each stratum, the elements are
homogeneous. However, high degree of
heterogeneity exists between strata.
2.Within each cluster, the elements
are heterogeneous. Between
clusters, there is a high degree of
homogeneity.
3.A sample element is selected each time. 3.A cluster is selected each time.
4.Less sampling error. 4.More prone to sampling error.
5.Object ive i s to increase precision. 5 .Objective is to increase sampling
efficiency by decreasing cost.
AREA SAMPLING
A common form of cluster sampling where clusters consist of geographic areas, such as
districts, housing blocks or townships. Area sampling could be one-stage, two-stage, or
multi-stage.
How to Take an Area Sample Using Subdivisions
Your company wants to conduct a survey on the expected patronage of its new outlet in a new
housing estate. The company wants to use area sampling to select the sample households to be
interviewed. The sample may be drawn in the manner outlined below.
___________________________________________________________________________________
Step 1: Determine the geographic area to be surveyed, and identify its subdivisions. Each
subdivision cluster should be highly similar to all others. For example, choose ten housing
blocks within 2 kilometers of the proposed site [say, Model Town ] for your new retail outlet;
assign each a number.
Step 2: Decide on the use of one-step or two-step cluster sampling. Assume that you decide to
use a two-stage cluster sampling.
Step 3: Using random numbers, select the housing blocks to be sampled. Here, you select 4
blocks randomly, say numbers #102, #104, #106, and #108.Step 4: Using some probability method of sample selection, select the households in each of the
chosen housing block to be included in the sample. Identify a random starting point (say,apartment no. 103), instruct field workers to drop off the survey at every fifth house
(systematic sampling).
Non-probability samples
Convenience sampling
Drawn at the convenience of the researcher. Common in exploratoryresearch. Does not lead to any conclusion.
Judgmental sampling
Sampling based on some judgment, gut-feelings or experience of theresearcher. Common in commercial marketing research projects. If
inference drawing is not necessary, these samples are quite useful. Quota sampling
An extension of judgmental sampling. It is something like a two-stagejudgmental sampling. Quite difficult to draw.
Snowball sampling
Used in studies involving respondents who are rare to find. To start with,
the researcher compiles a short list of sample units from various sources.Each of these respondents are contacted to provide names of otherprobable respondents.
Quota Sampling
To select a quota sample comprising 3000 persons in country X using three control
characteristics: sex, age and level of education.
Here, the three control characteristics are considered independently of one another.
In order to calculate the desired number of sample elements possessing the various
attributes of the specified control characteristics, the distribution pattern of the
general population in country X in terms of each control characteristics is examined.
Control
Characteristics Population Distribution Sample Elements .
Gender: .... Male ...................... 50.7% Male 3000 x 50.7% = 1521
................. Female .................. 49.3% Female 3000 x 49.3% = 1479
A ge : .. .. .. .. . 2 0-2 9 ye ar s .. .. .. ... .. 1 3. 4% 2 0-2 9 y ea rs 3 00 0 x 1 3. 4% = 4 0 2
. .. .. .. .. .. .. .. .. 3 0-3 9 ye ar s .. .. .. ... .. 5 3. 3% 3 0-3 9 y ea rs 3 00 0 x 52 .3 % = 15 69
. .. .. .. .. .. .. .. .. 4 0 y ea rs & o ve r .. .. 3 3. 3% 4 0 y ea rs & o ve r 3 00 0 x 3 4. 3% = 1 02 9
R el ig io n: .. C hri st ia ni ty .. .. .. ... .. 7 6. 4% C hr is ti an ity 3 00 0 x 7 6. 4% = 2 29 2
................. Is la m..................... 14.8% Islam 3000 x 14.8% = 4 44
................. Hinduism .............. 6.6% Hinduism 3000 x 6 .6% = 1 98
. .. .. .. .. .. .. .. .. O th ers . .. .. ... .. .. .. .. .. . 2 .2 % O th ers 3 00 0 x 2 .2 % = 6 6
__________________________________________________________________________________
Sampling vs non-sampling errors
Sampling Error [SE] Non-sampling Error [NSE]
Very small sampleSize
Larger sample size
Still larger sample
Complete census
7/30/2019 10 Sampling
6/6
11/14/20
Choosing probability vs. non-probability sampling
Probability Evaluation Criteria Non-probabilitysampling sampling
Conclusive Nature of research Exploratory
Larger sampling Relative magnitude Larger non-
samplingerrors sampling vs. error
non-sampling error
High Population variability Low
[Heterogeneous][Homogeneous]
Favorable Statistical Considerations Unfavorable
High Sophistication Needed Low
Relatively Longer Time Relativelyshorter
High Budget Needed Low
THANK YOU
32