Leveraging LLM With RAG For Feedback In Medical Data Science Courses

Ivan Letteri; Pierpaolo Vittorini; Francesca Tusoni; Leila Fabiani

doi:10.61007/QdC.2025.2.378

Authors

Ivan Letteri Research Fellow, Department of Life, Health and Environmental Sciences, University of L’Aquila
Pierpaolo Vittorini Assistant Professor, Department of Life, Health and Environmental Sciences, University of L’Aquila
Francesca Tusoni PhD Student, Department of Life, Health and Environmental Sciences, University of L’Aquila
Leila Fabiani Full Professor, Department of Life, Health and Environmental Sciences, University of L’Aquila

DOI:

https://doi.org/10.61007/QdC.2025.2.378

Keywords:

Data Science, Large Language Models, Techology Enhanced Learning, Retrieval Augmented Generation

Abstract

Providing feedback during formative assessment has proved to increase learning outcomes. Recently, the authors explored using large language models (LLMs) to produce scalable, cost-effective, and time-efficient feedback. The research focuses on short written answers from students concerning the interpretation of normality and hypothesis testing. Preliminary findings show promising performance: the LLaMA-3.3-7B model achieved an average accuracy of 0.93 in understanding if right or wrong, and suitable explanations in over 75% of cases. This study examines previously unsatisfactory LLM-generated explanations using Retrieval-Augmented Generation (RAG). A blind evaluator scored 64 responses (three RAG variants and one non-RAG). RAG-based methods improved explanation quality, making up to 25% of previously inadequate responses satisfactory. Besides the small sample size, these results underscore the flexibility of LLMs in multilingual, domain-specific contexts and highlight RAG's potential to enhance performance without retraining. Further research is needed to improve the alignment between the LLM's focus and the pedagogical intent.

References

Bernardi, A., Innamorati, C., Padovani, C., Romanelli, R., Saggino, A., Tommasi, M., & Vittorini, P. (2019). On the design and development of an assessment system with adaptive capabilities. Advances in Intelligent Systems and Computing (Vol. 804). Springer, Cham.

Black, P., & Wiliam, D. (1998). Assessment and classroom learning. Assessment in Education; Principles, Policy & Practice, 5(1), 7-74

Brooks, C., Quintana, R. M., Choi, H., Quintana, C., NeCamp, T., Gardner, J. (2021). Towards Culturally Relevant Personalization at Scale: Experiments with Data Science Learners, International Journal of Artificial Intelligence in Education 31 516–537.

Choi Y, McClenen C (2020) Development of Adaptive Formative Assessment System Using Computerized Adaptive Testing and Dynamic Bayesian Networks. Applied Sciences 2020, Vol 10, Page 8196 10(22):8196

Cofini, V., Jobe, T., Letteri, I., Vittorini, P. (2025), Preliminary evaluation of an LLM-based system for grading and providing feedback on short-text answers in data science exercises, in: Methodologies and Intelligent Systems for Technology Enhanced Learning, 15th International Conference, Springer, Lille.

Gupta S, Ojeh N, Sa B, et al (2020) Use of an Adaptive e-Learning Platform as a Formative Assessment Tool in the Cardiovascular System Course Component of an MBBS Programme. Advances in medical education and practice 11:989–996.

Mertens, U., Finn, B., & Lindner, M. A. (2022). Effects of Computer-Based Feedback on Lower and Higher-Order Learning Outcomes: A Network Meta-Analysis. Journal of Educational Psychology, 114(8), 1743–1772.

Oord, A. v. d., Li, Y. & Vinyals, O. (2018). Representation Learning with Contrastive Predictive Coding (arxiv:1807.03748)

Reimers, N., & Gurevych, I. (2019). Sentence-BERT: Sentence Embeddings using Siamese BERT-Networks. Proceedings of EMNLP-IJCNLP 2019, 3982–3992.

Vittorini, P., Menini, S., & Tonelli, S. (2020). An AI-Based System for Formative and Summative Assessment in Data Science Courses. International Journal of Artificial Intelligence in Education, 1–27.

Vittorini, P. (2023). The Design of an Adaptive Tool Supporting Formative Assessment in Data Science Courses. In C. S. González-González, B. Fernández-Manjón, F. Li, F. J. García-Peñalvo, F. Sciarrone, M. Spaniol, A. García-Holgado, M. Area-Moreira, M. Hemmje, & T. Hao (Eds.), ICWL 2022 - International Conference on Web-based Learning (pp. 86–97). Springer, Cham.

Vygotsky, L. S. (1978). Mind in society: The development of higher psychological processes (Vol. 86). Harvard University Press.

Leveraging LLM With RAG For Feedback In Medical Data Science Courses

Authors

DOI:

Keywords:

Abstract

References

Published

How to Cite

Issue

Section

License

Make a Submission

Language

Guidelines

Information

Open Access Logo

The Journal is covered by the following indexing services

Scientific Journal