Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decom...
Gespeichert in:
Veröffentlicht in: | IEEE access 2021, Vol.9, p.157800-157811 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 157811 |
---|---|
container_issue | |
container_start_page | 157800 |
container_title | IEEE access |
container_volume | 9 |
creator | Tokgoz, Serkan Panahi, Issa M. S. |
description | Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications. |
doi_str_mv | 10.1109/ACCESS.2021.3130180 |
format | Article |
fullrecord | <record><control><sourceid>proquest_webof</sourceid><recordid>TN_cdi_webofscience_primary_000725781500001CitationCount</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9624952</ieee_id><doaj_id>oai_doaj_org_article_55bae1f996e24073ac53f1f8037ab17c</doaj_id><sourcerecordid>2605707851</sourcerecordid><originalsourceid>FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</originalsourceid><addsrcrecordid>eNqNkl1v0zAUhiMEYtPYL5iEInGDhFr8UX_dIE1hg0lFSMvGrXGck9ZVGhc7AbFfP2cpZeMKX_jzOa98znmz7AyjOcZIvT8viouynBNE8JxiirBEz7JjgrmaUUb580f7o-w0xg1KQ6YrJl5mR3ShCMcIH2ffr301xD6_WQeA2Rdng9-tfQd5uQOw67z0Q7CQL701rbszvfNdfhtdt8qvTVf7rbuDOi_TeWhNyL-ZdoD8I1i_3fnoRvpV9qIxbYTT_XqS3V5e3BSfZ8uvn66K8-XMMqz6mRHU1qoBVtVVhQ2qMeXQMGQrZYXlJOWHrZWS8FoQUJxTAMktU42tKasZPcmuJt3am43eBbc14bf2xumHCx9W2oTe2RY0Y5UB3CjFgSyQoMYy2uBGIipMhYVNWh8mrd1QbaG20PXBtE9En750bq1X_qeWXGIpcBJ4uxcI_scAsddbFy20renAD1Gn4qfOpXlE3_yDblLJu1SqRCEmkJBspOhEpf7EGKA5fAYjPRpCT4bQoyH03hAp6vXjPA4xf9qfADkBv6DyTbQOOgsHLDlGECYkZqN5cOH6h_4Xfuj6FPru_0MTfTbRDuAvpThZKEboPTDb3AU</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2605707851</pqid></control><display><type>article</type><title>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</title><source>IEEE Open Access Journals</source><source>Directory of Open Access Journals</source><source>Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /></source><source>EZB Electronic Journals Library</source><creator>Tokgoz, Serkan ; Panahi, Issa M. S.</creator><creatorcontrib>Tokgoz, Serkan ; Panahi, Issa M. S.</creatorcontrib><description>Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3130180</identifier><identifier>PMID: 34926101</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>PISCATAWAY: IEEE</publisher><subject>Algorithms ; Arrays ; Auditory system ; Beamforming ; Computer Science ; Computer Science, Information Systems ; Decomposition ; Direction of arrival ; Direction-of-arrival estimation ; Engineering ; Engineering, Electrical & Electronic ; Estimation ; Hearing aid device ; Hearing aids ; Location awareness ; low SNR ; Microphone arrays ; Microphones ; non-uniform microphone arrays ; Performance enhancement ; Randomization ; randomized algorithm ; Real time ; real-time implementation ; Science & Technology ; Signal processing ; Signal processing algorithms ; Signal to noise ratio ; Singular value decomposition ; smartphone ; Smartphones ; Speech processing ; speech source localization ; Technology ; Telecommunications ; Time lag</subject><ispartof>IEEE access, 2021, Vol.9, p.157800-157811</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>2</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000725781500001</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</citedby><cites>FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</cites><orcidid>0000-0001-7674-096X ; 0000-0002-1852-3104</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9624952$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>230,315,781,785,865,886,2103,2115,4025,27638,27928,27929,27930,39263,54938</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34926101$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Tokgoz, Serkan</creatorcontrib><creatorcontrib>Panahi, Issa M. S.</creatorcontrib><title>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</title><title>IEEE access</title><addtitle>Access</addtitle><addtitle>IEEE ACCESS</addtitle><addtitle>IEEE Access</addtitle><description>Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.</description><subject>Algorithms</subject><subject>Arrays</subject><subject>Auditory system</subject><subject>Beamforming</subject><subject>Computer Science</subject><subject>Computer Science, Information Systems</subject><subject>Decomposition</subject><subject>Direction of arrival</subject><subject>Direction-of-arrival estimation</subject><subject>Engineering</subject><subject>Engineering, Electrical & Electronic</subject><subject>Estimation</subject><subject>Hearing aid device</subject><subject>Hearing aids</subject><subject>Location awareness</subject><subject>low SNR</subject><subject>Microphone arrays</subject><subject>Microphones</subject><subject>non-uniform microphone arrays</subject><subject>Performance enhancement</subject><subject>Randomization</subject><subject>randomized algorithm</subject><subject>Real time</subject><subject>real-time implementation</subject><subject>Science & Technology</subject><subject>Signal processing</subject><subject>Signal processing algorithms</subject><subject>Signal to noise ratio</subject><subject>Singular value decomposition</subject><subject>smartphone</subject><subject>Smartphones</subject><subject>Speech processing</subject><subject>speech source localization</subject><subject>Technology</subject><subject>Telecommunications</subject><subject>Time lag</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>HGBXW</sourceid><sourceid>DOA</sourceid><recordid>eNqNkl1v0zAUhiMEYtPYL5iEInGDhFr8UX_dIE1hg0lFSMvGrXGck9ZVGhc7AbFfP2cpZeMKX_jzOa98znmz7AyjOcZIvT8viouynBNE8JxiirBEz7JjgrmaUUb580f7o-w0xg1KQ6YrJl5mR3ShCMcIH2ffr301xD6_WQeA2Rdng9-tfQd5uQOw67z0Q7CQL701rbszvfNdfhtdt8qvTVf7rbuDOi_TeWhNyL-ZdoD8I1i_3fnoRvpV9qIxbYTT_XqS3V5e3BSfZ8uvn66K8-XMMqz6mRHU1qoBVtVVhQ2qMeXQMGQrZYXlJOWHrZWS8FoQUJxTAMktU42tKasZPcmuJt3am43eBbc14bf2xumHCx9W2oTe2RY0Y5UB3CjFgSyQoMYy2uBGIipMhYVNWh8mrd1QbaG20PXBtE9En750bq1X_qeWXGIpcBJ4uxcI_scAsddbFy20renAD1Gn4qfOpXlE3_yDblLJu1SqRCEmkJBspOhEpf7EGKA5fAYjPRpCT4bQoyH03hAp6vXjPA4xf9qfADkBv6DyTbQOOgsHLDlGECYkZqN5cOH6h_4Xfuj6FPru_0MTfTbRDuAvpThZKEboPTDb3AU</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Tokgoz, Serkan</creator><creator>Panahi, Issa M. S.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-7674-096X</orcidid><orcidid>https://orcid.org/0000-0002-1852-3104</orcidid></search><sort><creationdate>2021</creationdate><title>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</title><author>Tokgoz, Serkan ; Panahi, Issa M. S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Arrays</topic><topic>Auditory system</topic><topic>Beamforming</topic><topic>Computer Science</topic><topic>Computer Science, Information Systems</topic><topic>Decomposition</topic><topic>Direction of arrival</topic><topic>Direction-of-arrival estimation</topic><topic>Engineering</topic><topic>Engineering, Electrical & Electronic</topic><topic>Estimation</topic><topic>Hearing aid device</topic><topic>Hearing aids</topic><topic>Location awareness</topic><topic>low SNR</topic><topic>Microphone arrays</topic><topic>Microphones</topic><topic>non-uniform microphone arrays</topic><topic>Performance enhancement</topic><topic>Randomization</topic><topic>randomized algorithm</topic><topic>Real time</topic><topic>real-time implementation</topic><topic>Science & Technology</topic><topic>Signal processing</topic><topic>Signal processing algorithms</topic><topic>Signal to noise ratio</topic><topic>Singular value decomposition</topic><topic>smartphone</topic><topic>Smartphones</topic><topic>Speech processing</topic><topic>speech source localization</topic><topic>Technology</topic><topic>Telecommunications</topic><topic>Time lag</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tokgoz, Serkan</creatorcontrib><creatorcontrib>Panahi, Issa M. S.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) Online</collection><collection>IEEE Electronic Library Online</collection><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tokgoz, Serkan</au><au>Panahi, Issa M. S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><stitle>IEEE ACCESS</stitle><addtitle>IEEE Access</addtitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>157800</spage><epage>157811</epage><pages>157800-157811</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.</abstract><cop>PISCATAWAY</cop><pub>IEEE</pub><pmid>34926101</pmid><doi>10.1109/ACCESS.2021.3130180</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-7674-096X</orcidid><orcidid>https://orcid.org/0000-0002-1852-3104</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2169-3536 |
ispartof | IEEE access, 2021, Vol.9, p.157800-157811 |
issn | 2169-3536 2169-3536 |
language | eng |
recordid | cdi_webofscience_primary_000725781500001CitationCount |
source | IEEE Open Access Journals; Directory of Open Access Journals; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />; EZB Electronic Journals Library |
subjects | Algorithms Arrays Auditory system Beamforming Computer Science Computer Science, Information Systems Decomposition Direction of arrival Direction-of-arrival estimation Engineering Engineering, Electrical & Electronic Estimation Hearing aid device Hearing aids Location awareness low SNR Microphone arrays Microphones non-uniform microphone arrays Performance enhancement Randomization randomized algorithm Real time real-time implementation Science & Technology Signal processing Signal processing algorithms Signal to noise ratio Singular value decomposition smartphone Smartphones Speech processing speech source localization Technology Telecommunications Time lag |
title | Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T18%3A58%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_webof&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Robust%20Three-Microphone%20Speech%20Source%20Localization%20Using%20Randomized%20Singular%20Value%20Decomposition&rft.jtitle=IEEE%20access&rft.au=Tokgoz,%20Serkan&rft.date=2021&rft.volume=9&rft.spage=157800&rft.epage=157811&rft.pages=157800-157811&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3130180&rft_dat=%3Cproquest_webof%3E2605707851%3C/proquest_webof%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2605707851&rft_id=info:pmid/34926101&rft_ieee_id=9624952&rft_doaj_id=oai_doaj_org_article_55bae1f996e24073ac53f1f8037ab17c&rfr_iscdi=true |