Nonspeech7k dataset

The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided into train and test sets. The train set consists of 6,289, and the test set consists of 725 files. The files were strongly manually annotated with a single ground-truth label. The length of each file...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Rashid, Muhammad Mamunur, Guiqing Li, Chengrui Du
Format:	Dataset
Sprache:	eng
Schlagworte:	audio dataset, audio classification, human non-speech sound, voice dataset
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The dataset consists of 7,014 files delivered as 32kHz, mono audio files in .wav format and divided into train and test sets. The train set consists of 6,289, and the test set consists of 725 files. The files were strongly manually annotated with a single ground-truth label. The length of each file is from 500 milliseconds to 4 seconds. The dataset is only allowed for non-commercial and academic research purposes under the creative commons (CC BY-NC-SA 4.0) license. If you use the dataset, please cite our paper and acknowledge the source(freesound.org, Youtube, and Aigei). More details about the Nonspeech7k dataset are available in our article. Article title: "Nonspeech7k dataset: Classification and analysis of human nonspeech sound"
DOI:	10.5281/zenodo.6967441