A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique

In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition mo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Circuits, systems, and signal processing systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681
1. Verfasser: Mavaddati, Samira
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 3681
container_issue 7
container_start_page 3652
container_title Circuits, systems, and signal processing
container_volume 39
creator Mavaddati, Samira
description In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.
doi_str_mv 10.1007/s00034-019-01338-0
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2403561019</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2403561019</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</originalsourceid><addsrcrecordid>eNp9kF9LwzAUxYMoOKdfwKeAz9WbpG3Sxzn_wtSHTfEtpOnN1tE1tekEv71xE3wTEsINv3PuvYeQcwaXDEBeBQAQaQKsiFcIlcABGbFMsCRTUh2SEXAZPxV7PyYnIawhkmnBR2Qxoc_-Exs6r9tlPPTN1xbpHDvTm6H2LX3CYeUrem0CVjTWhs7Q9K0pG6Q3aP2m86HekQu0q7b-2OIpOXKmCXj2-47J693tYvqQzF7uH6eTWWJFlg8JZ5nloApTqQwkc5V0FguwLgduOMc8T7EUKAtVygxcwdPMiNJh6sBIVFaMycXet-t9bBsGvfbbOFoTNE8h9mBxzUjxPWV7H0KPTnd9vTH9l2agf9LT-_R0hPUuPQ1RJPaiEOF2if2f9T-qbz4KcbA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2403561019</pqid></control><display><type>article</type><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><source>SpringerNature Journals</source><creator>Mavaddati, Samira</creator><creatorcontrib>Mavaddati, Samira</creatorcontrib><description>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</description><identifier>ISSN: 0278-081X</identifier><identifier>EISSN: 1531-5878</identifier><identifier>DOI: 10.1007/s00034-019-01338-0</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Circuits and Systems ; Coherence ; Computer simulation ; Data retrieval ; Decomposition ; Dictionaries ; Electrical Engineering ; Electronics and Microelectronics ; Engineering ; Instrumentation ; Learning ; Optimization ; Separation ; Signal processing ; Signal,Image and Speech Processing ; Singing ; Voice activity detectors ; Voice recognition</subject><ispartof>Circuits, systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</citedby><cites>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</cites><orcidid>0000-0002-8138-1014</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00034-019-01338-0$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00034-019-01338-0$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Mavaddati, Samira</creatorcontrib><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><title>Circuits, systems, and signal processing</title><addtitle>Circuits Syst Signal Process</addtitle><description>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</description><subject>Algorithms</subject><subject>Circuits and Systems</subject><subject>Coherence</subject><subject>Computer simulation</subject><subject>Data retrieval</subject><subject>Decomposition</subject><subject>Dictionaries</subject><subject>Electrical Engineering</subject><subject>Electronics and Microelectronics</subject><subject>Engineering</subject><subject>Instrumentation</subject><subject>Learning</subject><subject>Optimization</subject><subject>Separation</subject><subject>Signal processing</subject><subject>Signal,Image and Speech Processing</subject><subject>Singing</subject><subject>Voice activity detectors</subject><subject>Voice recognition</subject><issn>0278-081X</issn><issn>1531-5878</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kF9LwzAUxYMoOKdfwKeAz9WbpG3Sxzn_wtSHTfEtpOnN1tE1tekEv71xE3wTEsINv3PuvYeQcwaXDEBeBQAQaQKsiFcIlcABGbFMsCRTUh2SEXAZPxV7PyYnIawhkmnBR2Qxoc_-Exs6r9tlPPTN1xbpHDvTm6H2LX3CYeUrem0CVjTWhs7Q9K0pG6Q3aP2m86HekQu0q7b-2OIpOXKmCXj2-47J693tYvqQzF7uH6eTWWJFlg8JZ5nloApTqQwkc5V0FguwLgduOMc8T7EUKAtVygxcwdPMiNJh6sBIVFaMycXet-t9bBsGvfbbOFoTNE8h9mBxzUjxPWV7H0KPTnd9vTH9l2agf9LT-_R0hPUuPQ1RJPaiEOF2if2f9T-qbz4KcbA</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Mavaddati, Samira</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7SP</scope><scope>7XB</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M2P</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>Q9U</scope><scope>S0W</scope><orcidid>https://orcid.org/0000-0002-8138-1014</orcidid></search><sort><creationdate>20200701</creationdate><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><author>Mavaddati, Samira</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Circuits and Systems</topic><topic>Coherence</topic><topic>Computer simulation</topic><topic>Data retrieval</topic><topic>Decomposition</topic><topic>Dictionaries</topic><topic>Electrical Engineering</topic><topic>Electronics and Microelectronics</topic><topic>Engineering</topic><topic>Instrumentation</topic><topic>Learning</topic><topic>Optimization</topic><topic>Separation</topic><topic>Signal processing</topic><topic>Signal,Image and Speech Processing</topic><topic>Singing</topic><topic>Voice activity detectors</topic><topic>Voice recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mavaddati, Samira</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Science Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><collection>DELNET Engineering &amp; Technology Collection</collection><jtitle>Circuits, systems, and signal processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mavaddati, Samira</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</atitle><jtitle>Circuits, systems, and signal processing</jtitle><stitle>Circuits Syst Signal Process</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>39</volume><issue>7</issue><spage>3652</spage><epage>3681</epage><pages>3652-3681</pages><issn>0278-081X</issn><eissn>1531-5878</eissn><abstract>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s00034-019-01338-0</doi><tpages>30</tpages><orcidid>https://orcid.org/0000-0002-8138-1014</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0278-081X
ispartof Circuits, systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681
issn 0278-081X
1531-5878
language eng
recordid cdi_proquest_journals_2403561019
source SpringerNature Journals
subjects Algorithms
Circuits and Systems
Coherence
Computer simulation
Data retrieval
Decomposition
Dictionaries
Electrical Engineering
Electronics and Microelectronics
Engineering
Instrumentation
Learning
Optimization
Separation
Signal processing
Signal,Image and Speech Processing
Singing
Voice activity detectors
Voice recognition
title A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T15%3A59%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Singing%20Voice%20Separation%20Method%20Based%20on%20a%20Learnable%20Decomposition%20Technique&rft.jtitle=Circuits,%20systems,%20and%20signal%20processing&rft.au=Mavaddati,%20Samira&rft.date=2020-07-01&rft.volume=39&rft.issue=7&rft.spage=3652&rft.epage=3681&rft.pages=3652-3681&rft.issn=0278-081X&rft.eissn=1531-5878&rft_id=info:doi/10.1007/s00034-019-01338-0&rft_dat=%3Cproquest_cross%3E2403561019%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2403561019&rft_id=info:pmid/&rfr_iscdi=true