Machine learning algorithm for grading open-ended physics questions in Turkish

被引:20
作者
Cinar, Ayse [1 ]
Ince, Elif [2 ]
Gezer, Murat [3 ]
Yilmaz, Ozgur [4 ]
机构
[1] Marmara Univ, Fac Business Adm, Dept Business Adm, English Subdept Quantitat Methods, Istanbul, Turkey
[2] Istanbul Univ Cerrahpasa, Hasan Ali Yucel Educ Fac, Dept Sci Educ, Istanbul, Turkey
[3] Istanbul Univ, Dept Informat, Istanbul, Turkey
[4] Istanbul Univ Cerrahpasa, Hasan Ali Yucel Educ Fac, Dept Comp Educ & Instruct Technol, Istanbul, Turkey
关键词
Machine learning; Automatic short answer grading; Short-answer scoring; AUTOMATIC ASSESSMENT; ANSWERS;
D O I
10.1007/s10639-020-10128-0
中图分类号
G40 [教育学];
学科分类号
040101 ; 120403 ;
摘要
Worldwide, open-ended questions that require short answers have been used in many exams in fields of science, such as the International Student Assessment Program (PISA), the International Science and Maths Trends Research (TIMSS). However, multiple-choice questions are used for many exams at the national level in Turkey, especially high school and university entrance exams. This study aims to develop an objective and useful automatic scoring model for open-ended questions using machine learning algorithms. Within the scope of this aim, an automated scoring model construction study was conducted on four Physics questions at a University level course with the participation of 246 undergraduate students. The short-answer scoring was handled through an approach that addresses students' answers in Turkish. Model performing machine learning classification techniques such as SVM (Support Vector Machines), Gini, KNN (k-Nearest Neighbors), and Bagging and Boosting were applied after data preprocessing. The score indicated the accuracy, precision and F1-Score of each predictive model of which the AdaBoost.M1 technique had the best performance. In this paper, we report on a short answer grading system in Turkish, based on a machine learning approach using a constructed dataset about a Physics course in Turkish. This study is also the first study in the field of open-ended exam scoring in Turkish.
引用
收藏
页码:3821 / 3844
页数:24
相关论文
共 34 条
  • [1] Alfonseca E, 2004, LECT NOTES ARTIF INT, V3230, P25
  • [2] Alvarado J.G., 2018, EDM, P55
  • [3] [Anonymous], 1997, HDB MODERN ITEM RESP
  • [4] [Anonymous], 2016, P 9 INT C ED DATA MI
  • [5] [Anonymous], ADABAG APPLIES MULTI
  • [6] [Anonymous], 2009, European Chapter of the Association for Computational Linguistics
  • [7] Bailey S., 2008, P 3 ACL WORKSHOP INN, P107
  • [8] Basu S., 2013, T ASS COMPUTATIONAL, V1, P391, DOI [DOI 10.1162/TACL_A_00236, DOI 10.1162/TACLA00236]
  • [9] Bukai O., 2006, P 20 IITSE C, P1
  • [10] The Eras and Trends of Automatic Short Answer Grading
    Burrows, Steven
    Gurevych, Iryna
    Stein, Benno
    [J]. INTERNATIONAL JOURNAL OF ARTIFICIAL INTELLIGENCE IN EDUCATION, 2015, 25 (01) : 60 - 117