Automatic detection of students’ affective states in classroom environment using hybrid convolutional neural networks
Published in: | Education and information technologies 2020-03, Vol.25 (2), p.1387-1415 |
---|---|
Main authors: | , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Summary: | Predicting students’ emotional and behavioral engagement using computer vision techniques is a challenging task. Although there are several state-of-the-art techniques for analyzing a student’s affective states in an e-learning environment (single-person engagement detection in a single image frame), very few works address students’ affective states in a classroom environment (multiple people in a single image frame). Hence, in this paper, we propose a novel hybrid convolutional neural network (CNN) architecture for analyzing the students’ affective states in a classroom environment. The proposed architecture consists of two models: the first model (CNN-1) is designed to analyze the affective states of a single student in a single image frame, and the second model (CNN-2) handles multiple students in a single image frame. Thus, our proposed hybrid architecture predicts the overall affective state of the entire class. The proposed architecture uses the students’ facial expressions, hand gestures and body postures for analyzing their affective states. Further, due to the unavailability of standard datasets for students’ affective state analysis, we created, annotated and tested on our own dataset of over 8000 single-face image frames and 12000 multiple-face image frames, with three different affective states, namely: engaged, boredom and neutral. The experimental results demonstrate an accuracy of 86% and 70% for posed and spontaneous affective states of classroom data, respectively. |
---|---|
ISSN: | 1360-2357 1573-7608 |
DOI: | 10.1007/s10639-019-10004-6 |