Examining the Discrimination of Binary Scored Test Items with ROC Analysis

Sait Çüm

doi:10.21449/ijate.894851

Research Article

Year 2021, Volume: 8 Issue: 4, 948-958, 04.12.2021

Cited By: 2

Abstract

This study argues that ROC analysis, which is used to determine how well medical diagnostic tests differentiate between patients and non-patients, can also be used to examine the discrimination of binary scored items in cognitive tests. To gather evidence for this claim, the 2x2 contingency table used in ROC analysis was adapted to the logic of item discrimination. The article proposes that the area under the ROC curve (AUC), obtained from the sensitivity and specificity values calculated with the adapted contingency table, can be treated as a measure of item discrimination. Statistical analyses of simulated data showed that the AUC values were positively and highly correlated with the items' D, r_bis, and a-parameter values, and that AUC values obtained from samples of different sizes were consistent. In addition, ROC analysis was more stable under range narrowing than the other methods. It was therefore concluded that very large groups are not needed to examine item discrimination with the proposed method.
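As a rough illustration of the idea, and not the article's actual procedure (which the abstract does not spell out), the sketch below assumes that the adapted contingency table treats the upper and lower total-score groups as the two "true" classes and a correct answer to the item as a "positive" result. The function name `item_roc_summary` and the sample responses are hypothetical. Under these assumptions a single binary item yields one operating point on the ROC curve, so the trapezoidal AUC reduces to (sensitivity + specificity) / 2, which equals 0.5 + D/2.

```python
from typing import Dict, Sequence


def item_roc_summary(upper: Sequence[int], lower: Sequence[int]) -> Dict[str, float]:
    """Summarize one binary item from 0/1 responses of two score groups.

    Assumption (not taken from the article): `upper` and `lower` hold the
    item responses of the high- and low-scoring examinee groups.
    """
    if len(upper) == 0 or len(lower) == 0:
        raise ValueError("both groups must contain at least one response")

    # Cells of the adapted 2x2 table.
    tp = sum(upper)            # upper group, item correct
    fn = len(upper) - tp       # upper group, item incorrect
    fp = sum(lower)            # lower group, item correct
    tn = len(lower) - fp       # lower group, item incorrect

    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)

    # ROC curve through (0, 0), (1 - specificity, sensitivity), (1, 1):
    # its trapezoidal area simplifies to the mean of the two rates.
    auc = (sensitivity + specificity) / 2

    # Classical upper-lower discrimination index D = p_upper - p_lower.
    d_index = tp / len(upper) - fp / len(lower)

    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "auc": auc,
        "d_index": d_index,
    }


# Hypothetical responses: 8 of 10 correct in the upper group, 3 of 10 in the lower.
upper_group = [1, 1, 1, 1, 0, 1, 1, 1, 0, 1]
lower_group = [0, 1, 0, 0, 1, 0, 0, 1, 0, 0]
summary = item_roc_summary(upper_group, lower_group)
print(summary)
```

In this simplified setting the AUC is a linear function of D, which is consistent with the high positive correlation between the two reported in the abstract; the article's own computation may differ in how groups and cut points are defined.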

Keywords

Item analysis, Discrimination, ROC analysis, Test development, Psychometrics

Details

Primary Language	English
Subjects	Studies on Education
Section	Articles
Authors	Sait Çüm 0000-0002-0428-5088
Publication Date	December 4, 2021
Submission Date	March 11, 2021
Published in Issue	Year 2021 Volume: 8 Issue: 4

Cite

APA	Çüm, S. (2021). Examining the Discrimination of Binary Scored Test Items with ROC Analysis. International Journal of Assessment Tools in Education, 8(4), 948-958. https://doi.org/10.21449/ijate.894851

Cited By

A novel approach for calculating the item discrimination for Likert type of scales

International Journal of Assessment Tools in Education

https://doi.org/10.21449/ijate.1173356

A Special Case of Brennan's Index for Tests That Aim to Select a Limited Number of Students: A Monte Carlo Simulation Study

Educational Measurement: Issues and Practice

https://doi.org/10.1111/emip.12528