View
213
Download
0
Category
Preview:
Citation preview
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 1/20
GSSR
Research Methodology and Methods of Social Inquiry
www.socialinquiry.wordpress.com
October 25, 2010
Assessing Measurement
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 2/20
The measure needs to be:
Valid
Reliable
Exhaustive
Mutually Exclusive
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 3/20
VALIDITY
Claims of having appropriately measured the DV and IVsare valid;
Validity of measurement
Assuming that there is a relationship in this study , if weclaim causality, is the relationship causal ?
Internal Validity of the causal argument
Generalizations from the results of our study to other units ofobservations (e.g. persons) in other places and at other
times
External Validity of our conclusions
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 4/20
Validity of Measurement
An empirical measure is valid to the extent to which itadequately captures the real meaning of the concept
under consideration
(how well it measures the concept it is intended to
measure)
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 5/20
Face validity
Content validity
Criterion-related Validity
Construct Validity
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 6/20
Face Validity
- look at the operationalization; assess whether "on itsface" it seems like a good translation of the construct
To improve the quality of face validity assessment, make it
more systematic.
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 7/20
Content Validity
- concerns the extent to which a measure represents allfacets of a concept;
- Identify clearly the components of the total content‘domain’; then show that the items adequately represent
the components;
Ex: knowledge tests
- assumes a good detailed description of the content
domain, which may not always be so;
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 8/20
Criterion-related Validity I
Applies to measures (‘tests’) that shouldindicate a person’s present/future standing
on a specific behavior (trait)
The behavior (trait) - criterion;
Validation is a matter of how well scores on
the measure correlate with the criterion ofinterest.
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 9/20
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 10/20
Construct Validity I
- based upon accumulation of research evidence (lit. rev)
- ‘construct’ – ‘concept’
Assumption: the meaning of any concept is implied bystatements of its theoretical relation to other concepts
Hence:
- Examine theory;
- Hypotheses about variables that should be related tomeasure(s) of the concept;
- Hypotheses about variables that should NOT be relatedto measure(s) of the concept;
- Gather ‘evidence’
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 11/20
Construct Validity II
Convergent Validity
- examine the degree to which the measure is similar to(converges on) other measures that it theoreticallyshould be similar to.
Discriminant Validity
- examine the degree to which the measure is not similar to(diverges from) other measures that it theoreticallyshould be not be similar to.
To estimate the degree to which any two measures arerelated to each other one typically uses the correlationcoefficient.
http://www.socialresearchmethods.net/kb/constval.php - construct
validity
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 12/20
RELIABILITY(consistency of measurement)
-deals with the quality of measurementA measure is considered reliable if it would give the same
result over and over again
(Assumption: what we are measuring is not changing!)
True Score Theoryevery measurement has two additive components: true
ability (or the true level) of the respondent on thatmeasure; PLUS random error.
- foundation of reliability theory:A measure that has no random error (i.e., is all true score) is perfectlyreliable; a measure that has no true score (i.e., is all random error)has zero reliability.
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 13/20
Assessing Reliability
Inter-coder reliability :
- check the degree to which differentinterviewers/observers/raters/coders give consistent
estimates of the same phenomenon.
A. Nominal measure & raters are checking off whichcategory each observation falls in: calculate % ofagreement between raters.
Ex: One measure, with 3 categories; N = 100 observations,
rated by two raters.On 86 of the 100 observations the raters checked thesame category (i.e., 86% inter-rater agreement)
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 14/20
B. If the measure = continuous: calculate the correlation
between the ratings of the two raters/observers.
Ex: rating the overall level of activity in a classroom on a 1-
to-7 scale. Ask raters to give their rating at regular time intervals (e.g.,
every 60 seconds). The correlation between theseratings would give you an estimate of the consistencybetween the raters.
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 15/20
Test-retest reliability
- administer the same test to the same sample on two
different occasions.
- calculate the correlation btw repeated applications of themeasure through time
Problems: - people remember answers;
- Real change in attitudes may occur
- First application of measure may have produced changein the subject
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 16/20
Internal consistency reliability Examines the consistency of responses across all items
(simultaneously) in a composite measure (uses a singlemeasurement instrument administered to a group of
people on one occasion to estimate reliability).
How consistent are the results for different items for thesame construct within the measure.
various stats. procedures;
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 17/20
Split-half reliability
- calculate the correlation btw. responses to subsets ofitems from the same measure (apply scale to sample;then divide scale in 2, randomly; reapply each half; makecorrelation of results; the higher the better)
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 18/20
- when factors systematically influence the process ofmeasurement, or the concept we measure - systematicmeasurement error
- when temporary, chance factors affect measurement – random error
- its presence, extent and direction are unpredictable fromone question to the next, or from one respondent to thenext
See:
www.socialresearchmethods.net
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 19/20
Relation Validity – Reliability – Measurement Error
Systematic error affects distance from center;
Random error affects tightness of pattern AND distancefrom center
Target metaphor- A tight pattern, irrespective of its location, reflects
RELIABLE measure, because it’s consistent.
- How closely the shots cluster around the center indicates
VALIDITY
7/29/2019 Lect 4 Oct 25 Measurement Notes1
http://slidepdf.com/reader/full/lect-4-oct-25-measurement-notes1 20/20
Appropriate Measurement
Valid
Reliabile
Exhaustive
It should exhaust the possibilities of what it is intended to
measure.There must be sufficient categories so that virtually all units
of observations being classified will fit into one of the
categories.
Mutually exclusive
each observation fits one and only one of the scale values(categories).
Recommended