
SAMPLING DEFINITIONS
®
Population:
The group of people, items or units
under investigation
®
Census:
Obtained by collecting information
about each member of a population
®
Sample
Obtained by collecting information
only about some members of a "population"
®
Sampling
Frame:
The list of people from which the sample is
taken. It should be comprehensive, complete and up-to-date.
TYPES OF SAMPLING
®
A probability sample is one in which
each member of the population has an known chance of being selected.
®
In a non-probability sample, some people
have a greater, but unknown, chance than others of selection.
TYPES OF SAMPLING
®
Probability Sampling
®
Non-Probability Sampling
Non-Probability - Advantages
®
Cheaper
®
Used when sampling frame
is not available
®
Useful when population
is so widely dispersed that cluster sampling would not be efficient
®
Often used in
exploratory studies, e.g. for hypothesis generation
®
For research interested
in obtaining an idea of the range of responses on ideas that people have
Purposive Sampling
®
A purposive sample is
one which is selected by the researcher subjectively. The researcher attempts to
obtain sample that appears to him/her to be representative of the population and
will usually try to ensure that a range from one extreme to the other is
included.
®
Often used in political
polling - districts chosen because their pattern has in the past provided good
idea of outcomes for whole electorate.
Snowball Sampling
®
With this approach, you
initially contact a few potential respondents and then ask them whether they
know of anybody with the same characteristics that you are looking for in your
research. For example, if you wanted to interview a sample of vegetarians /
cyclists / people with a particular disability / people who support a particular
political party etc., your initial contacts may well have knowledge
(through e.g. support group) of others.
Convenience Sample
®
A convenience sample is
used when you simply ask people you meet whether they will answer your
questions. The sample comprises subjects who are simply available in a
convenient way to the researcher. There is no randomness and the likelihood of
bias is high. You can't draw any meaningful conclusions from the results you
obtain. However, this method can legitimately be used provided its limitations
are clearly understood and stated.
Self-selection Sample
®
Self-selection is when respondents themselves
decide that they would like to take part in your survey.
Quota Sampling
®
Often used in market
research. Interviewers are given quota of particular types of people to
interview and the final sample should be representative of population.
®
Disadvantage -
Interviewers choose who they like (within above criteria) and may therefore
select those who are easiest to interview, so bias can result. Also, impossible
to estimate accuracy (because not random sample)
Stages of Quota Sampling
®
Decide on
characteristic(s) of which sample is to be representative, e.g. age, sex, race
®
Find out distribution of
this variable in population and set quota accordingly. (If 20% of population is
between 20 and 30, and sample is to be 1,000 then 200 respondents must be
between 20 and 30).
®
Complex quotas can be
developed so that several characteristics are used simultaneously.
Definitions
®
Element: the unit about
which information is collected and provides the basis of analysis (analogous to
units of analysis)
®
Population: the
theoretically specified aggregation of elements in a study
®
Study population: the
aggregation of elements from which the sample is actually selected
Probability Sampling
®
Basic principle: sample
will representative of the population from which it is selected
®
Key to probability
sampling is random selection
®
Random selection: each
element has an equal chance of selection independent of any other event in the
selection process
®
If all members of sample
have equal chance of being selected – EPSEM – equal probability of selection
method
Probability Theory
®
Allows researchers to
calculate the population parameters
®
Parameter:
the summary description of a given variable in a population
®
Generalize:
we estimate the population parameters from the sample observations. Probability
theory allows us to estimate how accurate those estimates are to the true
population parameter. How “off” we are is called the “margin of error.”
An important note - there will always be some sampling error.
Sampling Error
1. Random Error
Difference between the sample results and the true results because of chance
variation. This error cannot be avoided, only reduced by increasing the sample
size. It is possible to estimate the range of random error at a particular level
of confidence.
2. Systematic Error
Systematic error occurs when sample results consistently vary in one
direction from the true values for that population. Systematic error is made up
of sample design error and measurement error.
Sample Design Error.
®
Frame Error.
The sampling frame is the list of population
elements or members from which sample is selected.
®
specification error
Results from an incorrect definition of the
universe or population from which the sample is to be selected
®
Selection error
Selection error involves a systematic bias
in the manner in which respondents are selected for participation in the survey.
Random Error
®
Based on sampling distributions and the Central
Limit Theorem
®
Can be calculated with 3 pieces of information
®
The population parameter
®
The sample size
®
The standard error
Simple Random Sample
®
Each person has same chance as any other of being
selected
®
Standard against which other methods are sometimes
evaluated
®
Suitable where population is relatively small and
where sampling frame is complete and up-to-date
Procedure for SRS:
®
Obtain a complete sampling frame
®
Give each case a unique number, starting at one
®
Decide on the required sample size
®
Select that many numbers from a table of random
numbers or using computer
Table of random numbers
®
Decide on a pattern of
movement through table and stick to it, e.g. numbers from every second column
and every row. If a number comes up twice or a number is selected which is
larger than population number, discard it.
92941 04999 77422 25992
27372 94157
43252 48135 34237 46293 46178 50110
78907 37586 50940 88094 28209 67334
82843 43383 32561 62108 46076 91276
Possible questionnaire-based surveys.
Which of them do you feel would have
simple random sampling as an appropriate sampling method?
®
A survey of shoppers in
a clothes shop on a Saturday morning
®
A study of employees in
an organization
®
A survey of teenage
prostitutes
®
Survey of a theatre
audience following a play
®
School-children's
attitudes to branded clothing
®
Feedback from hotel
clients on perceptions of their last visit to that hotel
Systematic sampling
®
Similar to SRS; instead
of selecting random numbers from tables, you move through sample frame picking
every nth name.
®
Procedure
1. Calculate SAMPLING FRACTION
by dividing population size by required sample size. E.g. for a population of
500 and a sample of 100, the sampling fraction is 1/5 i.e. you will select one
person out of every five in the population.
2. Random number needs to be used only to
decide on starting point. With the sampling fraction of 1/5, the starting point
must be within the first 5 people in your list
®
Disadvantage:
Effect of periodicity (bias caused by particular characteristics arising
in the sampling frame at regular units). An example of this would occur if you
used a sampling frame of adult residents in an area composed of predominantly
couples or young families. If this list was arranged: Husband / Wife / Husband /
Wife etc. and if every tenth person was to be interviewed, there would be an
increased chance of males being selected.
Stratified Sampling
®
All people in sampling frame are divided into
"strata" (groups or categories). Within each stratum, a simple random sample or
systematic sample is selected.
Example of stratified sampling
®
If we want to ensure
that a sample of 5 students from a population of 50 contains both male and
female students in same proportions as in the full population, we first divide
that population into male and female.
®
There are 20 male
and 30 female students in the population. To work out the number of males and
females in the sample…
No. of males in sample = (5 / 50) x 20 = 2.0 No. of females in sample = (5 / 50)
x 30 = 3.0
We choose 2 males and 3 females for the sample, which are then
selected using
simple random
or
systematic sample
methods.
Exercise
®
A company
employs 200 part-time staff and 800 full-time staff, and you want to do 20
interviews. If you were to take a random sample, you might find that all
of the names you selected were part-time staff. For this reason you have decided
to do a stratified sample. Work out how many part-time and how many full-time
employees you should interview so as to accurately reflect the proportions of
the two groups in the whole workforce.
Multi-stage cluster sampling
®
As the name implies,
this involves drawing several different samples. It does so in such a way that
cost of final interviewing is minimized.
®
Basic procedure:
First draw sample of areas. Initially large areas selected then progressively
smaller areas within larger area are sampled. Eventually end up with sample of
households (or other units such as schools) and use method of selecting
individuals from these selected households.
Example of Multi-Stage Cluster Sampling
®
You have been
asked to undertake a study across the whole of American universities to
determine student drug use.
What are the possible sampling frames for this study?
®
You have been
asked to interview 2,000 people in total. Consider what would happen if you
undertook a simple random survey. Of these 2,000 you might have to interview
students in hundreds of colleges all over the country. It would be more
efficient to undertake a Multi-stage Cluster Sample.
