Folk music structural segment classification using GRU-based hierarchical attention network

This paper introduces a structural classification scheme for folk music using a multimodal fusion of musical features and textual information. A data corpus of Thiruvathirakali music (a popular folk dance in India) is compiled for this purpose. The musical features are learned using a gated recurren...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Sadhana (Bangalore) 2023-11, Vol.48 (4), Article 254
Hauptverfasser: Rajan, Rajeev, Nithin, S K
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper introduces a structural classification scheme for folk music using a multimodal fusion of musical features and textual information. A data corpus of Thiruvathirakali music (a popular folk dance in India) is compiled for this purpose. The musical features are learned using a gated recurrent unit (GRU) with an attention mechanism from musical texture features. A term frequency-inverse document frequency (TF-IDF) model is employed to derive text-based features from transcription data. Early integration of the GRU-derived features and TF-IDF features is employed for multimodal fusion. The performance is evaluated using Thiruvathirakali music corpus. The performance of the proposed system is also compared with that of a support vector machine-based classifier. The proposed system reports an overall precision, recall and F1 measure of 0.81 each for the GRU-attention-based model for multi-modal fusion. The scope of the proposed work for structural segmentation is also demonstrated with the help of an example.
ISSN:0973-7677
0973-7677
DOI:10.1007/s12046-023-02321-x