A bi-stage approach to North Indian raga distinction

Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia tools and applications 2024-05, Vol.83 (15), p.45163-45183
Hauptverfasser: Basu, Debjyoti, Mukherjee, Himadri, Marciano, Matteo, Sen, Shibaprasad, Singh, Sajai Vir, Obaidullah, Sk Md, Roy, Kaushik
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 45183
container_issue 15
container_start_page 45163
container_title Multimedia tools and applications
container_volume 83
creator Basu, Debjyoti
Mukherjee, Himadri
Marciano, Matteo
Sen, Shibaprasad
Singh, Sajai Vir
Obaidullah, Sk Md
Roy, Kaushik
description Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57 K clips from 11 ragas belonging to the 2 time periods and a performance improvement of 0.7 % was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of 96.47 % was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.
doi_str_mv 10.1007/s11042-023-17322-5
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3048261555</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3048261555</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-b2b36e498794e6c28df9e0689b0fd49b614d43566214eaa1c5aadf938bc18e953</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwB5gsMRt8_kzGqgJaqYIFZst2nDQVJMF2B_49CUGCielueN73Tg9C10BvgVJ9lwCoYIQyTkBzxog8QQuQmhOtGZz-2c_RRUoHSkFJJhZIrLBrScq2CdgOQ-yt3-Pc46c-5j3edlVrOxxtY3HVptx2Prd9d4nOavuWwtXPXKLXh_uX9Ybsnh-369WOeKZpJo45roIoC12KoDwrqroMVBWlo3UlSqdAVIJLpRiIYC14ae2I8MJ5KEIp-RLdzL3jXx_HkLI59MfYjScNp6JgCqScKDZTPvYpxVCbIbbvNn4aoGayY2Y7ZrRjvu2YKcTnUBrhrgnxt_qf1BcE_WXC</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3048261555</pqid></control><display><type>article</type><title>A bi-stage approach to North Indian raga distinction</title><source>SpringerLink Journals - AutoHoldings</source><creator>Basu, Debjyoti ; Mukherjee, Himadri ; Marciano, Matteo ; Sen, Shibaprasad ; Singh, Sajai Vir ; Obaidullah, Sk Md ; Roy, Kaushik</creator><creatorcontrib>Basu, Debjyoti ; Mukherjee, Himadri ; Marciano, Matteo ; Sen, Shibaprasad ; Singh, Sajai Vir ; Obaidullah, Sk Md ; Roy, Kaushik</creatorcontrib><description>Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57 K clips from 11 ragas belonging to the 2 time periods and a performance improvement of 0.7 % was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of 96.47 % was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.</description><identifier>ISSN: 1573-7721</identifier><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-023-17322-5</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Classical music ; Classification ; Clips ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Emotions ; Feature extraction ; Identification ; Information retrieval ; Machine learning ; Multimedia ; Multimedia Information Systems ; Musical performances ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2024-05, Vol.83 (15), p.45163-45183</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-b2b36e498794e6c28df9e0689b0fd49b614d43566214eaa1c5aadf938bc18e953</cites><orcidid>0000-0002-3360-7576</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-023-17322-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-023-17322-5$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids></links><search><creatorcontrib>Basu, Debjyoti</creatorcontrib><creatorcontrib>Mukherjee, Himadri</creatorcontrib><creatorcontrib>Marciano, Matteo</creatorcontrib><creatorcontrib>Sen, Shibaprasad</creatorcontrib><creatorcontrib>Singh, Sajai Vir</creatorcontrib><creatorcontrib>Obaidullah, Sk Md</creatorcontrib><creatorcontrib>Roy, Kaushik</creatorcontrib><title>A bi-stage approach to North Indian raga distinction</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57 K clips from 11 ragas belonging to the 2 time periods and a performance improvement of 0.7 % was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of 96.47 % was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.</description><subject>Classical music</subject><subject>Classification</subject><subject>Clips</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Emotions</subject><subject>Feature extraction</subject><subject>Identification</subject><subject>Information retrieval</subject><subject>Machine learning</subject><subject>Multimedia</subject><subject>Multimedia Information Systems</subject><subject>Musical performances</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1573-7721</issn><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwB5gsMRt8_kzGqgJaqYIFZst2nDQVJMF2B_49CUGCielueN73Tg9C10BvgVJ9lwCoYIQyTkBzxog8QQuQmhOtGZz-2c_RRUoHSkFJJhZIrLBrScq2CdgOQ-yt3-Pc46c-5j3edlVrOxxtY3HVptx2Prd9d4nOavuWwtXPXKLXh_uX9Ybsnh-369WOeKZpJo45roIoC12KoDwrqroMVBWlo3UlSqdAVIJLpRiIYC14ae2I8MJ5KEIp-RLdzL3jXx_HkLI59MfYjScNp6JgCqScKDZTPvYpxVCbIbbvNn4aoGayY2Y7ZrRjvu2YKcTnUBrhrgnxt_qf1BcE_WXC</recordid><startdate>20240501</startdate><enddate>20240501</enddate><creator>Basu, Debjyoti</creator><creator>Mukherjee, Himadri</creator><creator>Marciano, Matteo</creator><creator>Sen, Shibaprasad</creator><creator>Singh, Sajai Vir</creator><creator>Obaidullah, Sk Md</creator><creator>Roy, Kaushik</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-3360-7576</orcidid></search><sort><creationdate>20240501</creationdate><title>A bi-stage approach to North Indian raga distinction</title><author>Basu, Debjyoti ; Mukherjee, Himadri ; Marciano, Matteo ; Sen, Shibaprasad ; Singh, Sajai Vir ; Obaidullah, Sk Md ; Roy, Kaushik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-b2b36e498794e6c28df9e0689b0fd49b614d43566214eaa1c5aadf938bc18e953</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Classical music</topic><topic>Classification</topic><topic>Clips</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Emotions</topic><topic>Feature extraction</topic><topic>Identification</topic><topic>Information retrieval</topic><topic>Machine learning</topic><topic>Multimedia</topic><topic>Multimedia Information Systems</topic><topic>Musical performances</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Basu, Debjyoti</creatorcontrib><creatorcontrib>Mukherjee, Himadri</creatorcontrib><creatorcontrib>Marciano, Matteo</creatorcontrib><creatorcontrib>Sen, Shibaprasad</creatorcontrib><creatorcontrib>Singh, Sajai Vir</creatorcontrib><creatorcontrib>Obaidullah, Sk Md</creatorcontrib><creatorcontrib>Roy, Kaushik</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Basu, Debjyoti</au><au>Mukherjee, Himadri</au><au>Marciano, Matteo</au><au>Sen, Shibaprasad</au><au>Singh, Sajai Vir</au><au>Obaidullah, Sk Md</au><au>Roy, Kaushik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A bi-stage approach to North Indian raga distinction</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2024-05-01</date><risdate>2024</risdate><volume>83</volume><issue>15</issue><spage>45163</spage><epage>45183</epage><pages>45163-45183</pages><issn>1573-7721</issn><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57 K clips from 11 ragas belonging to the 2 time periods and a performance improvement of 0.7 % was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of 96.47 % was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-023-17322-5</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-3360-7576</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1573-7721
ispartof Multimedia tools and applications, 2024-05, Vol.83 (15), p.45163-45183
issn 1573-7721
1380-7501
1573-7721
language eng
recordid cdi_proquest_journals_3048261555
source SpringerLink Journals - AutoHoldings
subjects Classical music
Classification
Clips
Computer Communication Networks
Computer Science
Data Structures and Information Theory
Emotions
Feature extraction
Identification
Information retrieval
Machine learning
Multimedia
Multimedia Information Systems
Musical performances
Special Purpose and Application-Based Systems
title A bi-stage approach to North Indian raga distinction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T14%3A16%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20bi-stage%20approach%20to%20North%20Indian%20raga%20distinction&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Basu,%20Debjyoti&rft.date=2024-05-01&rft.volume=83&rft.issue=15&rft.spage=45163&rft.epage=45183&rft.pages=45163-45183&rft.issn=1573-7721&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-023-17322-5&rft_dat=%3Cproquest_cross%3E3048261555%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3048261555&rft_id=info:pmid/&rfr_iscdi=true