Research Article
BibTex RIS Cite

The Relation of Item Difficulty Between Classical Test Theory and Item Response Theory: Computerized Adaptive Test Perspective

Year 2023, Volume: 14 Issue: 2, 118 - 127, 30.06.2023
https://doi.org/10.21031/epod.1209284

Abstract

This study aims to transform the calculated item difficulty statistics according to Classical Test Theory (CTT) into the item difficulty parameter of Item Response Theory (IRT) by utilizing the normal distribution curve and to analyze the effectiveness of this transformation based on Rasch model. In this regard, 36 different data sets created with catR package were studied. For each data set, item difficulty parameters and transformed item difficulty parameters were calculated and the correlation coefficients between these parameters were analyzed. Then, Computerized Adaptive Test (CAT) simulations were performed using these parameters. According to the simulation results, the correlation coefficients between the estimated theta values with both methods were high. Furthermore, in CAT simulations in which both parameters were used, especially in the samples which were over 250, it was found to have similar bias, RMSE values, and the average number of administered items.

Supporting Institution

Pamukkale University

Project Number

2021BSP008

References

  • Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37(1), 29-51. https://doi.org/10.1007/BF02291411
  • Crocker, L., & Algina, J. (1986). Introduction to Classical and Modern Test Theory. Cengage Learning.
  • Çelen, Ü., & Aybek, E. C. (2013). Öğrenci başarısının öğretmen yapımı bir test ile klasik test kuramı ve madde tepki kuramı yöntemleriyle elde edilen puanlara göre karşılaştırılması. Journal of Measurement and Evaluation in Education and Psychology, 4(2), 64-75. https://dergipark.org.tr/tr/pub/epod/issue/5800/77213
  • De Ayala, R.J. (2009). The theory and practice of item response theory. The Guilford Press.
  • Hambleton, R., & Swaminathan, R. (1985). Fundementals of Item Response Theory. Sage Pub.
  • Hambleton, R., Swaminathan, R., & Rogers, H.J. (1991). Fundementals of Item Response Theory. Sage Pub.
  • Kohli, N., Koran, J., & Henn, L. (2015). Relationships among classical test theory and item response theory frameworks via factor analytic models. Educational and psychological measurement, 75(3), 389–405. https://doi.org/10.1177/0013164414559071
  • Lord, F.M. (1980). Applications of Item Response Theory to Practical Testing Problems. Routledge.
  • Magis, D. & Barrada, J.R. (2017). Computerized adaptive testing with R: Recent updates of the package catR. Journal of Statistical Software, Code Snippets, 76(1), 1-19. https://doi.org/10.18637/jss.v076.c01
  • Magis, D. & Raiche, G. (2012). Random generation of response patterns under computerized adaptive testing with the R package catR. Journal of Statistical Software, 48(8), 1-31. https://doi.org/10.18637/jss.v048.i08
  • Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149-174. https://doi.org/10.1007/BF02296272
  • Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. ETS Research Report Series, 1992(1), i-30. https://doi.org/10.1002/j.2333-8504.1992.tb01436.x
  • Pitman, J. (1993). Probability (6th Edition). Springer.
  • R Core Team (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. URL https://www.R-project.org/.
  • Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability (pp. 321-333). University of California Press.
  • Raykov, T., & Marcoulides, G. A. (2016). On the relationship between classical test theory and item response theory: From one to the other and back. Educational and psychological measurement, 76(2), 325–338. https://doi.org/10.1177/0013164415576958
  • Reckase, D. (2009). Multidimensional Item Response Theory. Springer.
  • Samejima, F. (1996). Polychotomous responses and the test score. The University of Tennessee.
  • van der Linden, W. J. & Glas, G.A.W. (2022). Computerized Adaptive Testing: Theory and Practice. Kluwer Academic Publishers.
Year 2023, Volume: 14 Issue: 2, 118 - 127, 30.06.2023
https://doi.org/10.21031/epod.1209284

Abstract

Project Number

2021BSP008

References

  • Bock, R. D. (1972). Estimating item parameters and latent ability when responses are scored in two or more nominal categories. Psychometrika, 37(1), 29-51. https://doi.org/10.1007/BF02291411
  • Crocker, L., & Algina, J. (1986). Introduction to Classical and Modern Test Theory. Cengage Learning.
  • Çelen, Ü., & Aybek, E. C. (2013). Öğrenci başarısının öğretmen yapımı bir test ile klasik test kuramı ve madde tepki kuramı yöntemleriyle elde edilen puanlara göre karşılaştırılması. Journal of Measurement and Evaluation in Education and Psychology, 4(2), 64-75. https://dergipark.org.tr/tr/pub/epod/issue/5800/77213
  • De Ayala, R.J. (2009). The theory and practice of item response theory. The Guilford Press.
  • Hambleton, R., & Swaminathan, R. (1985). Fundementals of Item Response Theory. Sage Pub.
  • Hambleton, R., Swaminathan, R., & Rogers, H.J. (1991). Fundementals of Item Response Theory. Sage Pub.
  • Kohli, N., Koran, J., & Henn, L. (2015). Relationships among classical test theory and item response theory frameworks via factor analytic models. Educational and psychological measurement, 75(3), 389–405. https://doi.org/10.1177/0013164414559071
  • Lord, F.M. (1980). Applications of Item Response Theory to Practical Testing Problems. Routledge.
  • Magis, D. & Barrada, J.R. (2017). Computerized adaptive testing with R: Recent updates of the package catR. Journal of Statistical Software, Code Snippets, 76(1), 1-19. https://doi.org/10.18637/jss.v076.c01
  • Magis, D. & Raiche, G. (2012). Random generation of response patterns under computerized adaptive testing with the R package catR. Journal of Statistical Software, 48(8), 1-31. https://doi.org/10.18637/jss.v048.i08
  • Masters, G. N. (1982). A Rasch model for partial credit scoring. Psychometrika, 47(2), 149-174. https://doi.org/10.1007/BF02296272
  • Muraki, E. (1992). A generalized partial credit model: Application of an EM algorithm. ETS Research Report Series, 1992(1), i-30. https://doi.org/10.1002/j.2333-8504.1992.tb01436.x
  • Pitman, J. (1993). Probability (6th Edition). Springer.
  • R Core Team (2020). R: A language and environment for statistical computing. R Foundation for Statistical Computing. Vienna, Austria. URL https://www.R-project.org/.
  • Rasch, G. (1961). On general laws and the meaning of measurement in psychology. Proceedings of the Fourth Berkeley Symposium on Mathematical Statistics and Probability (pp. 321-333). University of California Press.
  • Raykov, T., & Marcoulides, G. A. (2016). On the relationship between classical test theory and item response theory: From one to the other and back. Educational and psychological measurement, 76(2), 325–338. https://doi.org/10.1177/0013164415576958
  • Reckase, D. (2009). Multidimensional Item Response Theory. Springer.
  • Samejima, F. (1996). Polychotomous responses and the test score. The University of Tennessee.
  • van der Linden, W. J. & Glas, G.A.W. (2022). Computerized Adaptive Testing: Theory and Practice. Kluwer Academic Publishers.
There are 19 citations in total.

Details

Primary Language English
Subjects Test Theories
Journal Section Articles
Authors

Eren Can Aybek 0000-0003-3040-2337

Project Number 2021BSP008
Publication Date June 30, 2023
Acceptance Date June 28, 2023
Published in Issue Year 2023 Volume: 14 Issue: 2

Cite

APA Aybek, E. C. (2023). The Relation of Item Difficulty Between Classical Test Theory and Item Response Theory: Computerized Adaptive Test Perspective. Journal of Measurement and Evaluation in Education and Psychology, 14(2), 118-127. https://doi.org/10.21031/epod.1209284