Leveraging LLM With RAG For Feedback In Medical Data Science Courses
DOI:
https://doi.org/10.61007/QdC.2025.2.378
Keywords:
Data Science, Large Language Models, Technology Enhanced Learning, Retrieval Augmented Generation
Abstract
Providing feedback during formative assessment has been shown to improve learning outcomes. Recently, the authors explored using large language models (LLMs) to produce scalable, cost-effective, and time-efficient feedback. The research focuses on short written answers from students concerning the interpretation of normality and hypothesis testing. Preliminary findings show promising performance: the LLaMA-3.3-7B model achieved an average accuracy of 0.93 in judging whether an answer is right or wrong, and produced suitable explanations in over 75% of cases. This study revisits the previously unsatisfactory LLM-generated explanations using Retrieval-Augmented Generation (RAG). A blind evaluator scored 64 responses (three RAG variants and one non-RAG). RAG-based methods improved explanation quality, making up to 25% of previously inadequate responses satisfactory. Despite the small sample size, these results underscore the flexibility of LLMs in multilingual, domain-specific contexts and highlight RAG's potential to enhance performance without retraining. Further research is needed to improve the alignment between the LLM's focus and the pedagogical intent.
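As a rough illustration of the retrieval step in RAG, the following minimal sketch ranks course passages by similarity to the question and student answer, then places the best match in the prompt sent to the LLM. It is not the pipeline used in the study: the passages, the prompt wording, and the function names (`retrieve`, `build_prompt`) are hypothetical, and the bag-of-words cosine similarity stands in for whatever retriever a real system would use (typically dense sentence embeddings).

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two bag-of-words Counters."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def retrieve(query, passages, k=1):
    """Return the k passages most similar to the query."""
    q = Counter(tokenize(query))
    ranked = sorted(
        passages,
        key=lambda p: cosine(q, Counter(tokenize(p))),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, answer, passages, k=1):
    """Assemble a feedback prompt that includes retrieved course material."""
    context = "\n".join(retrieve(question + " " + answer, passages, k))
    return (
        "Course material:\n" + context + "\n\n"
        "Question: " + question + "\n"
        "Student answer: " + answer + "\n"
        "Say whether the answer is right or wrong and explain why."
    )


# Hypothetical course passages, invented for illustration only.
PASSAGES = [
    "A Shapiro-Wilk p-value below 0.05 suggests the data are not normally distributed.",
    "A t-test compares the means of two groups under the assumption of normality.",
    "A confidence interval gives a range of plausible values for a parameter.",
]
```

Calling `build_prompt("How do you interpret the Shapiro-Wilk p-value?", "The data are normal because p is 0.20.", PASSAGES)` yields a prompt whose context is the Shapiro-Wilk passage. Because the added context is supplied at inference time, the model itself is left unchanged, which is the sense in which RAG can improve explanations without retraining.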
License
Copyright (c) 2025 Community Notebook. People, Education and Welfare in the Society 5.0

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.