A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique
In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition mo...
Gespeichert in:
Veröffentlicht in: | Circuits, systems, and signal processing systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 3681 |
---|---|
container_issue | 7 |
container_start_page | 3652 |
container_title | Circuits, systems, and signal processing |
container_volume | 39 |
creator | Mavaddati, Samira |
description | In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures. |
doi_str_mv | 10.1007/s00034-019-01338-0 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2403561019</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2403561019</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</originalsourceid><addsrcrecordid>eNp9kF9LwzAUxYMoOKdfwKeAz9WbpG3Sxzn_wtSHTfEtpOnN1tE1tekEv71xE3wTEsINv3PuvYeQcwaXDEBeBQAQaQKsiFcIlcABGbFMsCRTUh2SEXAZPxV7PyYnIawhkmnBR2Qxoc_-Exs6r9tlPPTN1xbpHDvTm6H2LX3CYeUrem0CVjTWhs7Q9K0pG6Q3aP2m86HekQu0q7b-2OIpOXKmCXj2-47J693tYvqQzF7uH6eTWWJFlg8JZ5nloApTqQwkc5V0FguwLgduOMc8T7EUKAtVygxcwdPMiNJh6sBIVFaMycXet-t9bBsGvfbbOFoTNE8h9mBxzUjxPWV7H0KPTnd9vTH9l2agf9LT-_R0hPUuPQ1RJPaiEOF2if2f9T-qbz4KcbA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2403561019</pqid></control><display><type>article</type><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><source>SpringerNature Journals</source><creator>Mavaddati, Samira</creator><creatorcontrib>Mavaddati, Samira</creatorcontrib><description>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</description><identifier>ISSN: 0278-081X</identifier><identifier>EISSN: 1531-5878</identifier><identifier>DOI: 10.1007/s00034-019-01338-0</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Circuits and Systems ; Coherence ; Computer simulation ; Data retrieval ; Decomposition ; Dictionaries ; Electrical Engineering ; Electronics and Microelectronics ; Engineering ; Instrumentation ; Learning ; Optimization ; Separation ; Signal processing ; Signal,Image and Speech Processing ; Singing ; Voice activity detectors ; Voice recognition</subject><ispartof>Circuits, systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</citedby><cites>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</cites><orcidid>0000-0002-8138-1014</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00034-019-01338-0$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00034-019-01338-0$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Mavaddati, Samira</creatorcontrib><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><title>Circuits, systems, and signal processing</title><addtitle>Circuits Syst Signal Process</addtitle><description>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</description><subject>Algorithms</subject><subject>Circuits and Systems</subject><subject>Coherence</subject><subject>Computer simulation</subject><subject>Data retrieval</subject><subject>Decomposition</subject><subject>Dictionaries</subject><subject>Electrical Engineering</subject><subject>Electronics and Microelectronics</subject><subject>Engineering</subject><subject>Instrumentation</subject><subject>Learning</subject><subject>Optimization</subject><subject>Separation</subject><subject>Signal processing</subject><subject>Signal,Image and Speech Processing</subject><subject>Singing</subject><subject>Voice activity detectors</subject><subject>Voice recognition</subject><issn>0278-081X</issn><issn>1531-5878</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kF9LwzAUxYMoOKdfwKeAz9WbpG3Sxzn_wtSHTfEtpOnN1tE1tekEv71xE3wTEsINv3PuvYeQcwaXDEBeBQAQaQKsiFcIlcABGbFMsCRTUh2SEXAZPxV7PyYnIawhkmnBR2Qxoc_-Exs6r9tlPPTN1xbpHDvTm6H2LX3CYeUrem0CVjTWhs7Q9K0pG6Q3aP2m86HekQu0q7b-2OIpOXKmCXj2-47J693tYvqQzF7uH6eTWWJFlg8JZ5nloApTqQwkc5V0FguwLgduOMc8T7EUKAtVygxcwdPMiNJh6sBIVFaMycXet-t9bBsGvfbbOFoTNE8h9mBxzUjxPWV7H0KPTnd9vTH9l2agf9LT-_R0hPUuPQ1RJPaiEOF2if2f9T-qbz4KcbA</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Mavaddati, Samira</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7SP</scope><scope>7XB</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M2P</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>Q9U</scope><scope>S0W</scope><orcidid>https://orcid.org/0000-0002-8138-1014</orcidid></search><sort><creationdate>20200701</creationdate><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><author>Mavaddati, Samira</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Circuits and Systems</topic><topic>Coherence</topic><topic>Computer simulation</topic><topic>Data retrieval</topic><topic>Decomposition</topic><topic>Dictionaries</topic><topic>Electrical Engineering</topic><topic>Electronics and Microelectronics</topic><topic>Engineering</topic><topic>Instrumentation</topic><topic>Learning</topic><topic>Optimization</topic><topic>Separation</topic><topic>Signal processing</topic><topic>Signal,Image and Speech Processing</topic><topic>Singing</topic><topic>Voice activity detectors</topic><topic>Voice recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mavaddati, Samira</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Science Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><collection>DELNET Engineering & Technology Collection</collection><jtitle>Circuits, systems, and signal processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mavaddati, Samira</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</atitle><jtitle>Circuits, systems, and signal processing</jtitle><stitle>Circuits Syst Signal Process</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>39</volume><issue>7</issue><spage>3652</spage><epage>3681</epage><pages>3652-3681</pages><issn>0278-081X</issn><eissn>1531-5878</eissn><abstract>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s00034-019-01338-0</doi><tpages>30</tpages><orcidid>https://orcid.org/0000-0002-8138-1014</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0278-081X |
ispartof | Circuits, systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681 |
issn | 0278-081X 1531-5878 |
language | eng |
recordid | cdi_proquest_journals_2403561019 |
source | SpringerNature Journals |
subjects | Algorithms Circuits and Systems Coherence Computer simulation Data retrieval Decomposition Dictionaries Electrical Engineering Electronics and Microelectronics Engineering Instrumentation Learning Optimization Separation Signal processing Signal,Image and Speech Processing Singing Voice activity detectors Voice recognition |
title | A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T15%3A59%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Singing%20Voice%20Separation%20Method%20Based%20on%20a%20Learnable%20Decomposition%20Technique&rft.jtitle=Circuits,%20systems,%20and%20signal%20processing&rft.au=Mavaddati,%20Samira&rft.date=2020-07-01&rft.volume=39&rft.issue=7&rft.spage=3652&rft.epage=3681&rft.pages=3652-3681&rft.issn=0278-081X&rft.eissn=1531-5878&rft_id=info:doi/10.1007/s00034-019-01338-0&rft_dat=%3Cproquest_cross%3E2403561019%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2403561019&rft_id=info:pmid/&rfr_iscdi=true |