|
| |
Criterion-Related Validity:
- How well do our tests predict behaviour or events? This is an important question for
making decisions on the basis of our tests. The stronger our test scores correlate with
independent behaviours, attitudes, or events the better our decisions will be, and the
greater our criterion-related validity will be.
- For example, suppose a test developer wants to develop a test to be used in a clinical
setting to diagnose depression. What criterion would you used to determine if indeed the
test does diagnose depression? Probably, a diagnosis made by a psychologist or
psychiatrist independent of the test developer's test. If the test correlates well with
the independent diagnosis of professionals then the test is said to have criterion-related
validity.
- A criterion is simple the measure of performance that is correlated with the test
scores. Or, the criterion is the measure that is used to determine how accurate your
decision is.
- The criterion for the GRE test is most often the student's grade point average.
Methods for Demonstrating Criterion-Related Validity:
- Basically, there are 2 methods used for demonstrating criterion-related validity.
- The Predictive Method:
- The Concurrent Method:
The Predictive Method:
- When you are trying to show a relationship between test scores and some future
behaviour, the predictive method should be used to determine validity.
- The general procedure for predictive validity is as follows:
- 1) A large group of people take the test;
- 2) The scores for those people are held for a predetermined period of time;
- 3) Once the time period elapses, a measure of some behaviour (i.e., the criterion) is
taken.
- 4) The test scores are then correlated with the criterion scores.
- 5) If the scores correlate, the test has predictive validity.
- 6) The resulting correlation coefficient is called the validity coefficient.
- Can you think of some situations that are meant for predictive validity?
- What would the ideal predictive validation study be like? Is it realistic?
- When determining predictive validity it is important that for everyone who took the
test, there is also measured on the criterion. Why?
- Because a restriction in the range of the distribution of test scores will lower your
correlation.
- This 'restriction of range' occurs fairly often in the real world. Why?
- Consider an industrial setting, where 1000 candidates apply for 100 jobs. Generally of
all those who take the test, usually those who score low are weeded out, and only those
with high scores are selected.
- Thus, we need to make a correction in our correlation when we know that a restriction in
range has occurred. See pg. 176 in your text.
rc = {r (SDu / SDres)} / SQRT {1 - r2
+ r2 (SD2u / SD2res)}
- Where:
- rc = correct correlation for restriction of range;
- r = sample observed correlation;
- SDu = standard deviation of the sample before range restriction;
- SDres = standard deviation of sample after range restriction.
The Concurrent Method:
- Concurrent validity is the practical alternative to the ideal predictive method.
- With concurrent validity you obtain at roughly the same time both test scores and
criterion scores in some predetermined population. Once this is accomplished, you simply
correlate test scores with the criterion scores.
- What is the basic difference between concurrent and predictive validation strategies?
- With predictive validation you use a more or less random sample of the population, and
with concurrent validation you are using a preselected sample. The preselected sample may
be different than the population at large and thus is not as powerful as the predictive
method--it theoretically gives an underestimate of the true population validity. However,
most studies show that concurrent validities and predictive validities are very similar.
- The concurrent is more practical and thus is more commonly used than the predictive
model.
- The concurrent method does not predict! Instead it provides information about the
present state of affairs and the status quo. Why is this distinction important? See pg.
177 of your text.
- Can you think of some methods used for concurrent validity?
- Since your selected sample may also be restricted in range, the adjustment formula for
restriction of range can also apply here.
Interpreting Validity Coefficients:
- In theory, validity coefficients have values that, like correlation, range from -1 to
+1.
- However, in practice most of the validity scores you'll see will be relatively small.
Most usually occur in the .3 to .5 range, and few exceed .6 or .7. Thus, there's lots of
room for improvement in most of our psychological measurements.
- Suppose a test used to select graduate students has a criterion-related validity
coefficient of rv = .5. How might you interpret this finding?
- One way of interpreting the finding is to consider the squared correlation coefficient
(rv2). The squared coefficient gives you an indication of how much
of the variation in the criterion can be accounted for by the predictor (your test). Thus,
in our example, 25% of the variance in graduate student performance can be accounted for
by our test. Or, 75% of graduate student performance cannot be accounted for by our test.
- Are tests with such low criterion-related validity coefficients really helping us to
make important decisions?
Tests and Decisions:
|