Abstract
Established results on latent variable models are applied to the study of the validity of a psychological test. When the test predicts a criterion by measuring a unidimensional latent construct, not only must the total score predict the criterion, but the joint distribution of criterion scores and item responses must exhibit a certain pattern. The presence of this population pattern may be tested with sample data using the stratified Wilcoxon rank sum test. Often, criterion information is available only for selected examinees, for instance, those who are admitted or hired. Three cases are discussed: (i) selection at random, (ii) selection based on the current test, and (iii) selection based on other measures of the latent construct. Discriminant validity is also discussed.
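As an illustration of the kind of check the abstract describes, the following is a minimal sketch of a one-sided stratified Wilcoxon rank sum test, not the article's own procedure. It assumes that examinees are grouped into strata (for example, by their score on the remaining items) and that, within each stratum, examinees who answer a given item correctly should tend to have higher criterion scores. The abstract does not specify the stratification, and the function names `midranks` and `stratified_wilcoxon` are hypothetical.

```python
import math
from collections import defaultdict


def midranks(values):
    """Return 1-based ranks, assigning tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j across the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def stratified_wilcoxon(item, criterion, stratum):
    """One-sided stratified Wilcoxon rank sum test (normal approximation).

    item      : 0/1 item responses (1 = correct)
    criterion : numeric criterion scores
    stratum   : hashable stratum labels (assumed here; e.g. a rest score)
    Returns (z, p), where p is the upper-tail p-value for a positive
    within-stratum association between item response and criterion.
    """
    groups = defaultdict(list)
    for x, y, s in zip(item, criterion, stratum):
        groups[s].append((x, y))

    stat = mean = var = 0.0
    for pairs in groups.values():
        n = len(pairs)
        m = sum(x for x, _ in pairs)
        if m == 0 or m == n:
            continue  # all responses alike: stratum carries no information
        ranks = midranks([y for _, y in pairs])
        # Rank sum of criterion scores among examinees who got the item right.
        stat += sum(r for (x, _), r in zip(pairs, ranks) if x == 1)
        mean += m * (n + 1) / 2
        # Permutation variance of the rank sum, valid with ties.
        centre = (n + 1) / 2
        ss = sum((r - centre) ** 2 for r in ranks)
        var += m * (n - m) * ss / (n * (n - 1))

    if var == 0:
        raise ValueError("no informative strata")
    z = (stat - mean) / math.sqrt(var)
    p = 0.5 * math.erfc(z / math.sqrt(2))  # upper tail of standard normal
    return z, p


# Toy data (invented for illustration): two strata of four examinees each.
item = [0, 0, 1, 1, 0, 1, 1, 0]
criterion = [1, 2, 3, 4, 5, 7, 8, 6]
stratum = ["a"] * 4 + ["b"] * 4
z, p = stratified_wilcoxon(item, criterion, stratum)
```

Ranking within strata and summing the rank sums means that examinees are only compared with others in the same stratum. The normal approximation is a convenience for this sketch; with strata as small as those in the toy data, an exact permutation distribution would be more appropriate.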
Additional information
This work was supported in part by Grant SES-87-01890 from the Measurement Methods and Data Improvement Program of the U.S. National Science Foundation.
Cite this article
Rosenbaum, P.R. Criterion-related construct validity. Psychometrika 54, 625–633 (1989). https://doi.org/10.1007/BF02296400
DOI: https://doi.org/10.1007/BF02296400