ACKNOWLEDGEMENTS
The research was supported by the European
Regional Development Fund’s project ‘IT-based
support system prototype for providing feedback and
improve student performance in literacy and
numeracy acquisition’, Project No. 1.1.1.1/19/A/076.
REFERENCES
Biggs, J., and Collis, K. (1982). Evaluating the Quality of
Learning: the SOLO taxonomy. New York: Academic
Press.
Black, P., and Wiliam, D. (2009). Developing the theory of
formative assessment. Educational Assessment,
Evaluation and Accountability, 21(1), 5–31.
Bond, T. G. and Fox, C. M. (2013). Applying the Rasch
model: Fundamental measurement in the human
sciences. New Jersey: Lawrence Erlbaum Associates
Publishers, 2013.
Boone, W. J. (2016). Rasch analysis for instrument
development: Why, when, and how?. CBE—Life
Sciences Education, 15(4), 1.-7.
Bridgeman, B., Trapani, C., and Attali, Y. (2012).
Comparison of human and machine scoring of essays:
Differences by gender, ethnicity, and country. Applied
Measurement in Education, 25(1), 27-40.
Cheng, Y. C. (2020). Education reform phenomenon: A
typology of multiple dilemmas. In Fan, G., and T.
Popkewitz (Eds.), Handbook of education policy
studies (pp. 85-109). Singapore: Springer
Csapó, B., and Molnár, G. (2019). Online diagnostic
assessment in support of personalized teaching and
learning: The eDia system. Frontiers in psychology, 10,
1522.
Csapó, B., Ainley, J., Bennett, R. E., Latour, T., and Law,
N. (2012). Technological issues for computer-based
assessment. In P. Griffin, B. McGaw, & E. Care (Eds.),
Assessment and teaching of 21st Century skills (pp.
143–230). New York, NY: Springer.
Cuthbert, A. S. (2021). Disciplinary knowledge and its role
in the school curriculum. In A. S. Cuthbert and A.
Standish, What should schools teach? (pp. 15-37).
London: UCL Press.
Dihoff, R. E., Brosvic, G. M., and Epstein, M. L. (2003).
The role of feedback during academic testing: The
delay retention effect revisited. The Psychological
Record, 53(4), 533-548.
Dunlap, K., Piro, J. (2016). Diving into data: Developing
the capacity for data literacy in teacher education.
Cogent Education, 3(1), 1132526.
Gibbs, G., and Simpson, C. (2005). Conditions under which
assessment supports students’ learning. Learning and
Teaching in Higher Education, (1), 3-31.
Hattie, J., and Timperley, H. (2007). The power of
feedback. Review of educational research, 77(1), 81-
112.
Holmes-Smith, P. (2005). Assessment for Learning: Using
statewide literacy and numeracy tests as diagnostic
tools. Paper presented at the Using data to support
learning research conference, Mlbourne.
Hume, A. and Coll, R. K. (2009). Assessment of learning,
for learning, and as learning: New Zealand case studies.
Assessment in Education: Principles, Policy &
Practice, 16(3), 269-290.
Ketterlin-Geller, L. R., and Yovanoff, P. (2009). Diagnostic
assessments in mathematics to support instructional
decision making. Practical Assessment, Research, and
Evaluation, 14(1), 1-11.
Kulik, J. A., and Fletcher, J. D. (2016). Effectiveness of
intelligent tutoring systems: a meta-analytic review.
Review of educational research, 86(1), 42-78.
Leber, J., Renkl, A., Nückles, M., and Waschle, K. (2018).
When the type of assessment counteracts teaching for
understanding.
Learning: Research and Practice, 4(2),
161-179.
Lee, H. S., Pallant, A., Pryputniewicz, S., Lord, T.,
Mulholland, M., & Liu, O. L. (2019). Automated text
scoring and real‐time adjustable feedback: Supporting
revision of scientific arguments involving uncertainty.
Science Education, 103(3), 590-622.
Leung, C. F. (2000). Assessment for learning: Using SOLO
taxonomy to measure design performance of design &
technology students. International Journal of
Technology and Design Education, 10(2), 149-161.
Linacre, J., M. (2015). Winsteps Rasch Measurement
(computer software), Beaverton, Oregon: Winsteps.com
Liu, O. L., Brew, C., Blackmore, J., Gerard, L., Madhok, J.,
and Linn, M. C. (2014). Automated scoring of
constructed‐response science items: Prospects and
obstacles. Educational Measurement: Issues and
Practice, 33(2), 19-28.
Ljungdahl, L., and Prescott, A. (2009). Teachers' use of
diagnostic testing to enhance students' literacy and
numeracy learning. International Journal of Learning,
16, 461-476.
Lockton, M., Weddle, H., Datnow, A. (2020). When data
don’t drive: teacher agency in data use efforts in low-
performing schools. School Effectiveness and School
Improvement, 31(2), 243-265.
Lonsdale, M., and McCurry, D. (2004). Literacy in the new
millennium. Adelaide: NCVER.
Maier, U., Wolf, N., and Randler, C. (2016). Effects of a
computer-assisted formative assessment intervention
based on multiple-tier diagnostic items and different
feedback types. Computers & Education, 95, 85-98.
Masters, G., N. (1982). A Rasch model for partial credit
scoring. Psychometrika, 47(2), 149-174.
Mills, K. A. (2010). A review of the “digital turn” in the
new literacy studies. Review of educational research,
80(2), 246-271.
Molnar, G. nad Csapo, B. (2019). Making the
psychological dimension of learning visible: using
technology-based assessment to monitor students’
cognitive development. Frontiers in Psychology, 10,
1368