Early dropout prediction using data mining: a case study with high school students

Early prediction of school dropout is a serious problem in education, but it is not an easy issue to resolve. On the one hand, there are many factors that can influence student retention. On the other hand, the traditional classification approach used to solve this problem normally has to be impleme...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems 2016-02, Vol.33 (1), p.107-124
Hauptverfasser: Márquez-Vera, Carlos, Cano, Alberto, Romero, Cristobal, Noaman, Amin Yousef Mohammad, Mousa Fardoun, Habib, Ventura, Sebastian
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Early prediction of school dropout is a serious problem in education, but it is not an easy issue to resolve. On the one hand, there are many factors that can influence student retention. On the other hand, the traditional classification approach used to solve this problem normally has to be implemented at the end of the course to gather maximum information in order to achieve the highest accuracy. In this paper, we propose a methodology and a specific classification algorithm to discover comprehensible prediction models of student dropout as soon as possible. We used data gathered from 419 high schools students in Mexico. We carried out several experiments to predict dropout at different steps of the course, to select the best indicators of dropout and to compare our proposed algorithm versus some classical and imbalanced well‐known classification algorithms. Results show that our algorithm was capable of predicting student dropout within the first 4–6 weeks of the course and trustworthy enough to be used in an early warning system.
ISSN:0266-4720
1468-0394
DOI:10.1111/exsy.12135