Test validity

In psychological and educational testing, validity “refers to the degree to which evidence and theory support the interpretations of test scores entailed by proposed uses of tests”^[1]. Although classical models divided the concept into various "validities," such as content validity, criterion validity, and construct validity^[2], the modern view is that validity is a single unitary construct^[3].

Introduction

Validity is the most important issue in psychological and educational testing^[4] because it concerns the meaning placed on test results^[3]. Though many textbooks present validity as a static construct^[5], various models of validity have evolved since the first published recommendations for constructing psychological and educational tests^[6]. These models fall into two primary groups: classical models, which include several types of validity, and modern models, which present validity as a single construct. The modern models reorganize the classical "validities" into either "aspects" of validity^[3] or types of validity-supporting evidence^[1].

Historical background

Although psychologists and educators were aware of several facets of validity before World War II, their methods for establishing validity were commonly restricted to correlations of test scores with some known criterion^[7]. Under the direction of Lee Cronbach, the 1954 Technical Recommendations for Psychological Tests and Diagnostic Techniques^[6] attempted to clarify and broaden the scope of validity by dividing it into four parts: (a) concurrent validity, (b) predictive validity, (c) content validity, and (d) construct validity. Cronbach and Meehl’s subsequent publication^[8] grouped predictive and concurrent validity into a "criterion-orientation", which eventually became criterion validity.

Over the next four decades, many theorists, including Cronbach himself^[9], voiced their dissatisfaction with this three-in-one model of validity^[10]^[11]^[12]. Their arguments culminated in Samuel Messick’s 1995 article, which described validity as a single construct composed of six "aspects"^[3]. In his view, the various inferences made from test scores may require different types of evidence, but not different validities.

The 1999 Standards for Educational and Psychological Testing^[1] largely codified Messick’s model. They describe five types of validity-supporting evidence that incorporate each of Messick’s aspects, and make no mention of the classical models’ content, criterion, and construct validities.

References

[1] American Educational Research Association, American Psychological Association, & National Council on Measurement in Education. (1999). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
[2] Guion, R. M. (1980). On trinitarian doctrines of validity. Professional Psychology, 11, 385-398.
[3] Messick, S. (1995). Validity of psychological assessment: Validation of inferences from persons’ responses and performances as scientific inquiry into score meaning. American Psychologist, 50, 741-749.
[4] Popham, W. J. (2008). All About Assessment / A Misunderstood Grail. Educational Leadership, 66(1), 82-83.
[5] See the otherwise excellent text: Nitko, J.J., Brookhart, S. M. (2004). Educational assessment of students. Upper Saddle River, NJ: Merrill-Prentice Hall.
[6] American Psychological Association, American Educational Research Association, & National Council on Measurement in Education. (1954). Technical recommendations for psychological tests and diagnostic techniques. Washington, DC: The Association.
[7] Angoff, W. H. (1988). Validity: An evolving concept. In H. Wainer & H. Braun (Eds.), Test Validity (pp. 19-32). Hillsdale, NJ: Lawrence Erlbaum.
[8] Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52, 281-302.
[9] Cronbach, L. J. (1969). Validation of educational measures. Proceedings of the 1969 Invitational Conference on Testing Problems. Princeton, NJ: Educational Testing Service, 35-52.
[10] Loevinger, J. (1957). Objective tests as instruments of psychological theory. Psychological Reports, 3, 634-694.
[11] Tenopyr, M. L. (1977). Content-construct confusion. Personnel Psychology, 30, 47-54.
[12] Guion, R. M. (1977). Content validity–The source of my discontent. Applied Psychological Measurement, 1, 1-10.

