Predicting at-risk students in the early stage of a blended learning course via machine learning using limited data
Academic failure is a persistent challenge in education. Despite the limited available data, in this study, we focus on identifying at-risk students in a blended learning (BL) course. Several motivational variables are analyzed to determine their effect on student performance. We use a machine-learn...
Gespeichert in:
Veröffentlicht in: | Computers and education. Artificial intelligence 2024-12, Vol.7, p.100261, Article 100261 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Academic failure is a persistent challenge in education. Despite the limited available data, in this study, we focus on identifying at-risk students in a blended learning (BL) course. Several motivational variables are analyzed to determine their effect on student performance. We use a machine-learning classifier to compare two approaches: 1) The benchmark study, which uses data from the same academic year for both training and testing; and 2) a prospective study, which focuses on extrapolating a model trained on the previous fall semester to predict outcomes for the upcoming fall semester. We categorize the motivational variables, including time management and event-occurrence frequency. The window-expansion strategy is adopted to enhance performance through periodic evaluation to facilitate timely intervention. Consequently, the prospective approach is consistent with the benchmark results, thus demonstrating its generalizability across academic years and student populations. This approach is promising for identifying at-risk students in the BL course early. The Shapley additive explanations (SHAP) method emphasizes the importance of time-management variables, particularly study in time, for identifying at-risk students. This study enhances our understanding of early detection methods for at-risk students. By analyzing time-related behaviors, we establish a robust foundation for data-driven decisions to improve future BL implementations as well as support motivation and self-regulated learning practices.
•Two approaches are compared in identifying at-risk students in Blended learning (BL) course: a benchmark study using same-year data for training and testing, and a prospective study extrapolating a model trained on the previous fall semester to predict upcoming fall out-comes, as our primary focus.•Motivational variables, including time management and event occurrence frequency, are categorized, and a window-expansion strategy facilitates timely prediction.•The prospective approach aligns closely with benchmark results, showing promise for early-stage identification of at-risk students in BL courses.•The SHAP method underscores the importance of time-management variables, particularly study in time, in identifying at-risk students.•This study enhances our understanding of early detection methods, aiding data-driven decisions to improve future BL implementations and support motivation and self-regulated learning practices in education. |
---|---|
ISSN: | 2666-920X 2666-920X |
DOI: | 10.1016/j.caeai.2024.100261 |