Abstract
During the past 30 years or so, a new theoretical basis for educational and psychological testing and measurement has emerged. It has been variously referred to as latent trait theory, item characteristic curve theory, and, more recently, item response theory (IRT). Although this new test theory holds considerable promise as a successor to classical test theory, it has been underutilized by test practitioners. One important reason for this underutilization is that many test developers have not had sufficient time to devote to the study of the technical and mathematical intricacies involved in this new test theory and its mathematical models. This chapter is intended as an overview of IRT for individuals with some background in the basic methods of classical test theory. Readers are referred to Hambleton (1989) and Hambleton and Swaminathan (1985) for other overviews of IRT.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Angoff, W. H. (1982). Summary and derivation of equating methods used at ETS. In P. W. Holland & D. R. Rubin (eds.), Testing equating. New York: Academic Press, pp. 55–69.
Assessment Systems Corporation. (1989). User’s manual for the MicroCAT Testing System, (3rd ed.) St. Paul, MN: Author.
Birnbaum, A. (1968). Test scores, sufficient statistics, and the information structures of tests. In F. M. Lord & M. R. Novick, Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika 37:29–51.
Fisher, R. A. (1922). On the mathematical foundations of theoretical statistics. Philosophical Transactions of the Royal Society of London (A) 222:309–368.
Guttman, L. (1944). A basis for scaling qualitative data. American Sociological Review 9:139–150.
Hambleton, R. K. (1989). Principles and selected applications of item response theory. In R. Linn (ed.), Educational measurement, 3rd ed. New York: Macmillan, pp. 147–200.
Hambleton, R.K., & Swaminathan, H. (1985). Item response theory. Principles and applications. Norwell, MA: Kluwer Academic Publishers.
Hambleton, R. K., & Traub, R.E. (1973). Analysis of empirical data using two logistic latent trait models. British Journal of Mathematical and Statistical Psychology 26:195–211.
Harris, D. (1989). Comparison of 1-, 2-, and 3-parameter IRT models. Educational Measurement: Issues and Practice 8:35–44.
Harwell, M.R., Baker, F. B., & Zwarts, M. (1988). Item parameter estimation via marginal maximum likelihood and an EM algorithm: A didactic. Journal of Educational Statistics 13:243–271.
Lazarsfeid, P. F. (1950). The logical and mathematical foundation of latent structure analysis. In S. A. Stouffer, L. Guttman, E. A. Suchman, P. F. Lazarsfeid, S.A. Star, & J.A. Clausen (eds.), Measurement and prediction: Studies in social psychology in World War II, Vol. IV. New York: Wiley, 1966. (Originally published by Princeton University Press, 1950.)
Lord, F. M. (1952). A theory of test scores (Psychometric Monograph, No. 7). Psychometric Society.
Lord, F. M. (1980). Applications of item response theory to practical testing problems. Hillsdale, NJ: Lawrence Erlbaum Associates.
Lord, F. M., & Novick, M. R. (1968). Statistical theories of mental test scores. Reading, MA: Addison-Wesley.
McBride, J. R. (1977). A brief overview of adaptive testing. In D. J. Weiss (ed.), Applications of computerized adaptive testing (Research Report 77–1). University of Minnesota, Department of Psychology, Psychometric Methods Program.
McDonald, R. P. (1985). Factor analysis and related methods. Hillsdale, NJ: Lawrence Erlbaum Associates.
Mislevy, R. J., & Bock, R. D. (1982). BILOG: Maximum likelihood item analysis and test scoring with logistic models for binary items. Chicago: International Educational Services.
Mislevy, R.J., & Stocking, M. L. (1989). A consumer’s guide to LOGIST and BILOG. Applied Psychological Measurement 13:57–75.
Mosier, C. I. (1940). Psychophysics and mental test theory: Fundamental postulates and elementary theorems. Psychological Review 47:355–366.
Rasch, G. (1960/1980). Probabilistic models for some intelligence and attainment tests. Chicago: University of Chicago Press. (Originally published by The Danish Institute for Educational Research, Copenhagen, 1960.)
Reckase, M. D. (1983, April). The definition of difficulty and discrimination for multidimensional item response theory models. Paper presented at the meeting of the American Educational Research Association, Montreal, Canada
Reckase, M. D. (1985). The difficulty of test items that measure more than one dimension. Applied Psychological Measurement 9:401–422.
Samejima, F. (1969). Estimation of latent ability using a response pattern of graded scores (Psychometric Monograph No. 17). Psychometric Society.
Samejima, F. (1972). A general model for free-response data (Psychometric Monograph No. 18). Psychometric Society.
Samejima, F. (1973). Homogeneous case of the continuous response model. Psychometrika 38:203–219.
Swaminathan, H. (1983). Parameter estimation in item response models. In R.K. Hambleton (ed.), Applications of item response theory. Vancouver, BC: Educational Research Institute of British Columbia, pp. 24–44.
Vale, C D. (1986). Linking item parameters onto a common scale. Applied Psychological Measurement 10:333–344.
van der Linden, W.J. (1986). The changing conception of measurement in education and psychology. Applied Psychological Measurement 10:325–332.
van der Linden, W.J., & Boekkooi-Timminga, E. (1989). A maximin model for test design with practical constraints. Psychometrika 54:237–247.
Wainer, H. (Ed.). (1990). Computerized adaptive testing: A primer. Hillsdale, NJ: Lawrence Erlbaum Associates.
Weiss, D.J. (1985). Adaptive testing by computer. Journal of Consulting and Clinical Psychology 53:774–789.
Weiss, D.J., & Kingsbury, G. G. (1984). Application of computerized adaptive testing to educational problems. Journal of Educational Measurement 21:361–375.
Wingersky, M.S., Barton, M. A., & Lord, F.M. (1982). LOGIST User’s Guide. Princeton, NJ: Educational Testing Service.
Wright, B.D., & Mead, R.J. (1976). BICAL: Calibrating rating scores with the Rasch model (Research Memorandum No. 23). Chicago: Chicago Statistical Laboratory, Department of Education, University of Chicago.
Editor information
Rights and permissions
Copyright information
© 1991 Springer Science+Business Media New York
About this chapter
Cite this chapter
Weiss, D.J., Yoes, M.E. (1991). Item Response Theory. In: Hambleton, R.K., Zaal, J.N. (eds) Advances in Educational and Psychological Testing: Theory and Applications. Evaluation in Education and Human Services Series, vol 28. Springer, Dordrecht. https://doi.org/10.1007/978-94-009-2195-5_3
Download citation
DOI: https://doi.org/10.1007/978-94-009-2195-5_3
Publisher Name: Springer, Dordrecht
Print ISBN: 978-94-010-7484-1
Online ISBN: 978-94-009-2195-5
eBook Packages: Springer Book Archive