A bi-stage approach to North Indian raga distinction

Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia tools and applications 2024-05, Vol.83 (15), p.45163-45183
Hauptverfasser: Basu, Debjyoti, Mukherjee, Himadri, Marciano, Matteo, Sen, Shibaprasad, Singh, Sajai Vir, Obaidullah, Sk Md, Roy, Kaushik
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57 K clips from 11 ragas belonging to the 2 time periods and a performance improvement of 0.7 % was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of 96.47 % was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.
ISSN:1573-7721
1380-7501
1573-7721
DOI:10.1007/s11042-023-17322-5