A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique

In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition mo...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Circuits, systems, and signal processing systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681
1. Verfasser:	Mavaddati, Samira
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Circuits and Systems Coherence Computer simulation Data retrieval Decomposition Dictionaries Electrical Engineering Electronics and Microelectronics Engineering Instrumentation Learning Optimization Separation Signal processing Signal,Image and Speech Processing Singing Voice activity detectors Voice recognition
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	3681
container_issue	7
container_start_page	3652
container_title	Circuits, systems, and signal processing
container_volume	39
creator	Mavaddati, Samira
description	In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.
doi_str_mv	10.1007/s00034-019-01338-0
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2403561019</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2403561019</sourcerecordid><originalsourceid>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</originalsourceid><addsrcrecordid>eNp9kF9LwzAUxYMoOKdfwKeAz9WbpG3Sxzn_wtSHTfEtpOnN1tE1tekEv71xE3wTEsINv3PuvYeQcwaXDEBeBQAQaQKsiFcIlcABGbFMsCRTUh2SEXAZPxV7PyYnIawhkmnBR2Qxoc_-Exs6r9tlPPTN1xbpHDvTm6H2LX3CYeUrem0CVjTWhs7Q9K0pG6Q3aP2m86HekQu0q7b-2OIpOXKmCXj2-47J693tYvqQzF7uH6eTWWJFlg8JZ5nloApTqQwkc5V0FguwLgduOMc8T7EUKAtVygxcwdPMiNJh6sBIVFaMycXet-t9bBsGvfbbOFoTNE8h9mBxzUjxPWV7H0KPTnd9vTH9l2agf9LT-_R0hPUuPQ1RJPaiEOF2if2f9T-qbz4KcbA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2403561019</pqid></control><display><type>article</type><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><source>SpringerNature Journals</source><creator>Mavaddati, Samira</creator><creatorcontrib>Mavaddati, Samira</creatorcontrib><description>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</description><identifier>ISSN: 0278-081X</identifier><identifier>EISSN: 1531-5878</identifier><identifier>DOI: 10.1007/s00034-019-01338-0</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Algorithms ; Circuits and Systems ; Coherence ; Computer simulation ; Data retrieval ; Decomposition ; Dictionaries ; Electrical Engineering ; Electronics and Microelectronics ; Engineering ; Instrumentation ; Learning ; Optimization ; Separation ; Signal processing ; Signal,Image and Speech Processing ; Singing ; Voice activity detectors ; Voice recognition</subject><ispartof>Circuits, systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681</ispartof><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020</rights><rights>Springer Science+Business Media, LLC, part of Springer Nature 2020.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</citedby><cites>FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</cites><orcidid>0000-0002-8138-1014</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00034-019-01338-0$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00034-019-01338-0$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,780,784,27924,27925,41488,42557,51319</link.rule.ids></links><search><creatorcontrib>Mavaddati, Samira</creatorcontrib><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><title>Circuits, systems, and signal processing</title><addtitle>Circuits Syst Signal Process</addtitle><description>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</description><subject>Algorithms</subject><subject>Circuits and Systems</subject><subject>Coherence</subject><subject>Computer simulation</subject><subject>Data retrieval</subject><subject>Decomposition</subject><subject>Dictionaries</subject><subject>Electrical Engineering</subject><subject>Electronics and Microelectronics</subject><subject>Engineering</subject><subject>Instrumentation</subject><subject>Learning</subject><subject>Optimization</subject><subject>Separation</subject><subject>Signal processing</subject><subject>Signal,Image and Speech Processing</subject><subject>Singing</subject><subject>Voice activity detectors</subject><subject>Voice recognition</subject><issn>0278-081X</issn><issn>1531-5878</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kF9LwzAUxYMoOKdfwKeAz9WbpG3Sxzn_wtSHTfEtpOnN1tE1tekEv71xE3wTEsINv3PuvYeQcwaXDEBeBQAQaQKsiFcIlcABGbFMsCRTUh2SEXAZPxV7PyYnIawhkmnBR2Qxoc_-Exs6r9tlPPTN1xbpHDvTm6H2LX3CYeUrem0CVjTWhs7Q9K0pG6Q3aP2m86HekQu0q7b-2OIpOXKmCXj2-47J693tYvqQzF7uH6eTWWJFlg8JZ5nloApTqQwkc5V0FguwLgduOMc8T7EUKAtVygxcwdPMiNJh6sBIVFaMycXet-t9bBsGvfbbOFoTNE8h9mBxzUjxPWV7H0KPTnd9vTH9l2agf9LT-_R0hPUuPQ1RJPaiEOF2if2f9T-qbz4KcbA</recordid><startdate>20200701</startdate><enddate>20200701</enddate><creator>Mavaddati, Samira</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7SP</scope><scope>7XB</scope><scope>88I</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L6V</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M2P</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>Q9U</scope><scope>S0W</scope><orcidid>https://orcid.org/0000-0002-8138-1014</orcidid></search><sort><creationdate>20200701</creationdate><title>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</title><author>Mavaddati, Samira</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c356t-215c2089ad85071fd7fce90cf602a22e664eb3e798b750f9245a3bfe4f0a7e8c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Algorithms</topic><topic>Circuits and Systems</topic><topic>Coherence</topic><topic>Computer simulation</topic><topic>Data retrieval</topic><topic>Decomposition</topic><topic>Dictionaries</topic><topic>Electrical Engineering</topic><topic>Electronics and Microelectronics</topic><topic>Engineering</topic><topic>Instrumentation</topic><topic>Learning</topic><topic>Optimization</topic><topic>Separation</topic><topic>Signal processing</topic><topic>Signal,Image and Speech Processing</topic><topic>Singing</topic><topic>Voice activity detectors</topic><topic>Voice recognition</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mavaddati, Samira</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Science Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Engineering Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Science Database</collection><collection>Engineering Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>ProQuest Central Basic</collection><collection>DELNET Engineering & Technology Collection</collection><jtitle>Circuits, systems, and signal processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mavaddati, Samira</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique</atitle><jtitle>Circuits, systems, and signal processing</jtitle><stitle>Circuits Syst Signal Process</stitle><date>2020-07-01</date><risdate>2020</risdate><volume>39</volume><issue>7</issue><spage>3652</spage><epage>3681</epage><pages>3652-3681</pages><issn>0278-081X</issn><eissn>1531-5878</eissn><abstract>In this paper, a new monaural singing voice separation algorithm is presented. This field of signal processing provides important information in many areas dealing with voice recognition, data retrieval, and singer identification. The proposed approach includes a sparse and low-rank decomposition model using spectrogram of the singing voice signals. The vocal and non-vocal parts of a singing voice signal are investigated as sparse and low-rank components, respectively. An alternating optimization algorithm is applied to decompose the singing voice frames using the sparse representation technique over the vocal and non-vocal dictionaries. Also, a novel voice activity detector is presented based upon the energy of the sparse coefficients to learn atoms related to the non-vocal data in the training step. In the test phase, the learned non-vocal atoms of the music instrumental part are updated according to the non-vocal components captured from the test signal using domain adaptation technique. The proposed dictionary learning process includes two coherence measures: atom–data coherence and mutual coherence to provide a learning procedure with low reconstruction error along with a proper separation in the test step. The simulation results using different measures show that the proposed method leads to significantly better results in comparison with the earlier methods in this context and the traditional procedures.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s00034-019-01338-0</doi><tpages>30</tpages><orcidid>https://orcid.org/0000-0002-8138-1014</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0278-081X
ispartof	Circuits, systems, and signal processing, 2020-07, Vol.39 (7), p.3652-3681
issn	0278-081X 1531-5878
language	eng
recordid	cdi_proquest_journals_2403561019
source	SpringerNature Journals
subjects	Algorithms Circuits and Systems Coherence Computer simulation Data retrieval Decomposition Dictionaries Electrical Engineering Electronics and Microelectronics Engineering Instrumentation Learning Optimization Separation Signal processing Signal,Image and Speech Processing Singing Voice activity detectors Voice recognition
title	A Novel Singing Voice Separation Method Based on a Learnable Decomposition Technique
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-19T15%3A59%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Novel%20Singing%20Voice%20Separation%20Method%20Based%20on%20a%20Learnable%20Decomposition%20Technique&rft.jtitle=Circuits,%20systems,%20and%20signal%20processing&rft.au=Mavaddati,%20Samira&rft.date=2020-07-01&rft.volume=39&rft.issue=7&rft.spage=3652&rft.epage=3681&rft.pages=3652-3681&rft.issn=0278-081X&rft.eissn=1531-5878&rft_id=info:doi/10.1007/s00034-019-01338-0&rft_dat=%3Cproquest_cross%3E2403561019%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2403561019&rft_id=info:pmid/&rfr_iscdi=true