A recent systematic review published in npj Science of Learning examines the effects of intelligent tutoring systems (ITSs) on students’ learning and performance in K-12 education. As artificial intelligence in education (AIEd) has expanded rapidly, ITSs have emerged as a key application with the potential to personalize learning and improve educational outcomes. However, despite their growing adoption, their actual educational value remains uncertain. While some studies suggest that ITSs can enhance learning outcomes and even outperform traditional instruction, others report limited or inconsistent effects. In addition, existing research often conflates different educational contexts or focuses on broader AI applications, leaving a lack of systematic understanding of ITS effectiveness specifically in K-12 settings. This study therefore aims to assess the effects of ITSs on K-12 students’ learning and performance and to examine the experimental designs used to evaluate these systems.
The authors conducted a systematic review of 28 empirical studies involving a total of 4,597 students. Most studies adopted quasi-experimental designs, typically comparing an ITS-based intervention group with control conditions such as traditional teacher-led instruction, non-intelligent tutoring systems, modified ITSs, or no control group. The studies covered a range of countries, subjects, and school levels, with a strong concentration in middle and high school STEM education. Intervention durations varied considerably, from a single class session to several weeks or months. The review categorized studies based on educational context, experimental design, and intervention characteristics to enable a structured comparison of findings.
The review finds that ITSs generally have a positive effect on students’ learning and performance in K-12 education, particularly when compared to traditional teacher-led instruction, where most studies report medium to large effects. However, when compared with non-intelligent tutoring systems, the results are more mixed, with several studies finding no significant differences. Substantial heterogeneity is observed across studies due to differences in design, duration, and context. Importantly, the effectiveness of ITSs depends on key features such as personalization, adaptivity, and real-time feedback, as well as on implementation conditions. ITSs that are integrated with teacher support, encourage self-regulated learning, and are used over longer periods tend to produce better outcomes. In contrast, short interventions may be influenced by novelty effects, and learner characteristics such as prior knowledge and educational level also shape outcomes.
Taken together, the findings suggest that ITSs can enhance learning and performance in K-12 education, but their effectiveness is contingent upon pedagogical design and implementation conditions rather than technology alone. ITSs are most effective when aligned with sound instructional principles and used in combination with teacher guidance. The study also highlights limitations in the existing literature, including short intervention durations, limited sample diversity, and a lack of attention to ethical considerations. It calls for future research with more robust experimental designs, longer interventions, and greater attention to ethical issues, particularly as AI technologies continue to evolve and play an increasing role in education.
Source (Open Access): Létourneau, A., Deslandes Martineau, M., Charland, P., Karran, J. A., Boasen, J., & Léger, P. M. (2025). A systematic review of AI-driven intelligent tutoring systems (ITS) in K-12 education. npj Science of Learning, 10(1), 29.