BANSpEmo: A Bangla Language Emotional Speech Recognition Dataset

Bibliographic details
Main author: Md Gulzar Hussain
Format: Dataset
Language: English
Description
Summary: BANSpEmo is the third audio dataset for speech emotion recognition (SER) in Bangla, a low-resource language. It consists of 792 utterance recordings covering six basic emotions, spoken over two sets of sentences, each containing six sentences. The emotional states were explained to the speakers, and the utterances were recorded in a more realistic manner than by simply reading the sentences aloud. The six emotional states are Disgust (বিতৃষ্ণা), Happy (খুশি), Sad (দুঃখজনক), Surprised (বিস্মিত), Anger (রাগ), and Fear (ভয়). The corpus includes voice recordings from 22 non-professional speakers, 11 male and 11 female.
DOI: 10.17632/rdwn4bs5ky.2