A bi-stage approach to North Indian raga distinction
Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic...
Gespeichert in:
Veröffentlicht in: | Multimedia tools and applications 2024-05, Vol.83 (15), p.45163-45183 |
---|---|
Hauptverfasser: | , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 45183 |
---|---|
container_issue | 15 |
container_start_page | 45163 |
container_title | Multimedia tools and applications |
container_volume | 83 |
creator | Basu, Debjyoti Mukherjee, Himadri Marciano, Matteo Sen, Shibaprasad Singh, Sajai Vir Obaidullah, Sk Md Roy, Kaushik |
description | Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57
K
clips from 11 ragas belonging to the 2 time periods and a performance improvement of
0.7
%
was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of
96.47
%
was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments. |
doi_str_mv | 10.1007/s11042-023-17322-5 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3048261555</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3048261555</sourcerecordid><originalsourceid>FETCH-LOGICAL-c270t-b2b36e498794e6c28df9e0689b0fd49b614d43566214eaa1c5aadf938bc18e953</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhi0EEqXwB5gsMRt8_kzGqgJaqYIFZst2nDQVJMF2B_49CUGCielueN73Tg9C10BvgVJ9lwCoYIQyTkBzxog8QQuQmhOtGZz-2c_RRUoHSkFJJhZIrLBrScq2CdgOQ-yt3-Pc46c-5j3edlVrOxxtY3HVptx2Prd9d4nOavuWwtXPXKLXh_uX9Ybsnh-369WOeKZpJo45roIoC12KoDwrqroMVBWlo3UlSqdAVIJLpRiIYC14ae2I8MJ5KEIp-RLdzL3jXx_HkLI59MfYjScNp6JgCqScKDZTPvYpxVCbIbbvNn4aoGayY2Y7ZrRjvu2YKcTnUBrhrgnxt_qf1BcE_WXC</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3048261555</pqid></control><display><type>article</type><title>A bi-stage approach to North Indian raga distinction</title><source>SpringerLink Journals - AutoHoldings</source><creator>Basu, Debjyoti ; Mukherjee, Himadri ; Marciano, Matteo ; Sen, Shibaprasad ; Singh, Sajai Vir ; Obaidullah, Sk Md ; Roy, Kaushik</creator><creatorcontrib>Basu, Debjyoti ; Mukherjee, Himadri ; Marciano, Matteo ; Sen, Shibaprasad ; Singh, Sajai Vir ; Obaidullah, Sk Md ; Roy, Kaushik</creatorcontrib><description>Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57
K
clips from 11 ragas belonging to the 2 time periods and a performance improvement of
0.7
%
was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of
96.47
%
was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.</description><identifier>ISSN: 1573-7721</identifier><identifier>ISSN: 1380-7501</identifier><identifier>EISSN: 1573-7721</identifier><identifier>DOI: 10.1007/s11042-023-17322-5</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Classical music ; Classification ; Clips ; Computer Communication Networks ; Computer Science ; Data Structures and Information Theory ; Emotions ; Feature extraction ; Identification ; Information retrieval ; Machine learning ; Multimedia ; Multimedia Information Systems ; Musical performances ; Special Purpose and Application-Based Systems</subject><ispartof>Multimedia tools and applications, 2024-05, Vol.83 (15), p.45163-45183</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c270t-b2b36e498794e6c28df9e0689b0fd49b614d43566214eaa1c5aadf938bc18e953</cites><orcidid>0000-0002-3360-7576</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11042-023-17322-5$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11042-023-17322-5$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27923,27924,41487,42556,51318</link.rule.ids></links><search><creatorcontrib>Basu, Debjyoti</creatorcontrib><creatorcontrib>Mukherjee, Himadri</creatorcontrib><creatorcontrib>Marciano, Matteo</creatorcontrib><creatorcontrib>Sen, Shibaprasad</creatorcontrib><creatorcontrib>Singh, Sajai Vir</creatorcontrib><creatorcontrib>Obaidullah, Sk Md</creatorcontrib><creatorcontrib>Roy, Kaushik</creatorcontrib><title>A bi-stage approach to North Indian raga distinction</title><title>Multimedia tools and applications</title><addtitle>Multimed Tools Appl</addtitle><description>Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57
K
clips from 11 ragas belonging to the 2 time periods and a performance improvement of
0.7
%
was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of
96.47
%
was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.</description><subject>Classical music</subject><subject>Classification</subject><subject>Clips</subject><subject>Computer Communication Networks</subject><subject>Computer Science</subject><subject>Data Structures and Information Theory</subject><subject>Emotions</subject><subject>Feature extraction</subject><subject>Identification</subject><subject>Information retrieval</subject><subject>Machine learning</subject><subject>Multimedia</subject><subject>Multimedia Information Systems</subject><subject>Musical performances</subject><subject>Special Purpose and Application-Based Systems</subject><issn>1573-7721</issn><issn>1380-7501</issn><issn>1573-7721</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhi0EEqXwB5gsMRt8_kzGqgJaqYIFZst2nDQVJMF2B_49CUGCielueN73Tg9C10BvgVJ9lwCoYIQyTkBzxog8QQuQmhOtGZz-2c_RRUoHSkFJJhZIrLBrScq2CdgOQ-yt3-Pc46c-5j3edlVrOxxtY3HVptx2Prd9d4nOavuWwtXPXKLXh_uX9Ybsnh-369WOeKZpJo45roIoC12KoDwrqroMVBWlo3UlSqdAVIJLpRiIYC14ae2I8MJ5KEIp-RLdzL3jXx_HkLI59MfYjScNp6JgCqScKDZTPvYpxVCbIbbvNn4aoGayY2Y7ZrRjvu2YKcTnUBrhrgnxt_qf1BcE_WXC</recordid><startdate>20240501</startdate><enddate>20240501</enddate><creator>Basu, Debjyoti</creator><creator>Mukherjee, Himadri</creator><creator>Marciano, Matteo</creator><creator>Sen, Shibaprasad</creator><creator>Singh, Sajai Vir</creator><creator>Obaidullah, Sk Md</creator><creator>Roy, Kaushik</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-3360-7576</orcidid></search><sort><creationdate>20240501</creationdate><title>A bi-stage approach to North Indian raga distinction</title><author>Basu, Debjyoti ; Mukherjee, Himadri ; Marciano, Matteo ; Sen, Shibaprasad ; Singh, Sajai Vir ; Obaidullah, Sk Md ; Roy, Kaushik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c270t-b2b36e498794e6c28df9e0689b0fd49b614d43566214eaa1c5aadf938bc18e953</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Classical music</topic><topic>Classification</topic><topic>Clips</topic><topic>Computer Communication Networks</topic><topic>Computer Science</topic><topic>Data Structures and Information Theory</topic><topic>Emotions</topic><topic>Feature extraction</topic><topic>Identification</topic><topic>Information retrieval</topic><topic>Machine learning</topic><topic>Multimedia</topic><topic>Multimedia Information Systems</topic><topic>Musical performances</topic><topic>Special Purpose and Application-Based Systems</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Basu, Debjyoti</creatorcontrib><creatorcontrib>Mukherjee, Himadri</creatorcontrib><creatorcontrib>Marciano, Matteo</creatorcontrib><creatorcontrib>Sen, Shibaprasad</creatorcontrib><creatorcontrib>Singh, Sajai Vir</creatorcontrib><creatorcontrib>Obaidullah, Sk Md</creatorcontrib><creatorcontrib>Roy, Kaushik</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Multimedia tools and applications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Basu, Debjyoti</au><au>Mukherjee, Himadri</au><au>Marciano, Matteo</au><au>Sen, Shibaprasad</au><au>Singh, Sajai Vir</au><au>Obaidullah, Sk Md</au><au>Roy, Kaushik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A bi-stage approach to North Indian raga distinction</atitle><jtitle>Multimedia tools and applications</jtitle><stitle>Multimed Tools Appl</stitle><date>2024-05-01</date><risdate>2024</risdate><volume>83</volume><issue>15</issue><spage>45163</spage><epage>45183</epage><pages>45163-45183</pages><issn>1573-7721</issn><issn>1380-7501</issn><eissn>1573-7721</eissn><abstract>Music embraces an intricate assembly of auditory elements, carefully organized in diverse configurations to articulate a variety of human emotions, moods, thoughts, feelings, temporal contexts, and situations. One of the primary aspects of any music composition is the raga which governs its melodic framework. Thus, the delineation of ragas assumes an enormous significance as a preliminary step preceding a more deeper and intricate analysis. Each of these ragas is meant to be practiced during a particular time of the day to amplify the emotional content and physical involvement. In this current work, a machine learning-based approach has been proposed to classify the dawn and dusk time (Sandhi Prakash) ragas. Here, mel-frequency cepstral coefficients (MFCC) based feature extraction technique has been applied which was further processed to generate second-level statistical features. This brought down the original feature dimension by means of effective representation of the raw features. Several classification techniques were employed and a new bi-stage raga distinction technique has been proposed. The first stage classifies ragas as dawn/ dusk while the second stage performs deeper classification for these groups separately to identify the exact raga. Experiments were performed with over 57
K
clips from 11 ragas belonging to the 2 time periods and a performance improvement of
0.7
%
was obtained for the dusk ragas using the bi-stage approach over the single shot classification technique. The highest possible accuracy of
96.47
%
was obtained for distinguishing the dusk ragas with only 2-second clips in the experiments.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11042-023-17322-5</doi><tpages>21</tpages><orcidid>https://orcid.org/0000-0002-3360-7576</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1573-7721 |
ispartof | Multimedia tools and applications, 2024-05, Vol.83 (15), p.45163-45183 |
issn | 1573-7721 1380-7501 1573-7721 |
language | eng |
recordid | cdi_proquest_journals_3048261555 |
source | SpringerLink Journals - AutoHoldings |
subjects | Classical music Classification Clips Computer Communication Networks Computer Science Data Structures and Information Theory Emotions Feature extraction Identification Information retrieval Machine learning Multimedia Multimedia Information Systems Musical performances Special Purpose and Application-Based Systems |
title | A bi-stage approach to North Indian raga distinction |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T14%3A16%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20bi-stage%20approach%20to%20North%20Indian%20raga%20distinction&rft.jtitle=Multimedia%20tools%20and%20applications&rft.au=Basu,%20Debjyoti&rft.date=2024-05-01&rft.volume=83&rft.issue=15&rft.spage=45163&rft.epage=45183&rft.pages=45163-45183&rft.issn=1573-7721&rft.eissn=1573-7721&rft_id=info:doi/10.1007/s11042-023-17322-5&rft_dat=%3Cproquest_cross%3E3048261555%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3048261555&rft_id=info:pmid/&rfr_iscdi=true |