Deep operational audio-visual emotion recognition
Published in: | Neurocomputing (Amsterdam) 2024-07, Vol.588, p.127713, Article 127713 |
---|---|
Main authors: | , |
Format: | Article |
Language: | eng |
Keywords: | |
Online access: | Full text |
Summary: | Emotions play a large role in interpersonal communication, marketing, healthcare and the service industry, and emotion classification has therefore been studied extensively. Audio-visual emotion recognition is a field within artificial intelligence and machine learning that focuses on recognizing and understanding human emotions from both visual and audio cues. It combines computer vision and audio processing techniques to analyze and interpret the emotional states that individuals express. This paper presents a multi-input deep learning model, built on an operational neural network, for audio-visual emotion recognition. The proposed network uses both visual and audio information in an end-to-end approach. The primary objective of this work is to demonstrate that multi-input models can achieve better results than single-input models in emotion classification. A second objective is to demonstrate that the weight calculation methods employed in operational neural networks outperform the conventional weight calculation methods used in convolutional neural networks, so that replacing convolutional neural network approaches with operational neural network methods can yield superior emotion classification models. In the proposed architecture, regular convolutional layers are replaced with operational layers. The experimental results show that the operational convolutional architecture performs better than the classical convolutional neural network architecture. |
ISSN: | 0925-2312 1872-8286 |
DOI: | 10.1016/j.neucom.2024.127713 |