Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition

Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decom...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE access 2021, Vol.9, p.157800-157811
Hauptverfasser: Tokgoz, Serkan, Panahi, Issa M. S.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 157811
container_issue
container_start_page 157800
container_title IEEE access
container_volume 9
creator Tokgoz, Serkan
Panahi, Issa M. S.
description Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.
doi_str_mv 10.1109/ACCESS.2021.3130180
format Article
fullrecord <record><control><sourceid>proquest_webof</sourceid><recordid>TN_cdi_webofscience_primary_000725781500001CitationCount</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>9624952</ieee_id><doaj_id>oai_doaj_org_article_55bae1f996e24073ac53f1f8037ab17c</doaj_id><sourcerecordid>2605707851</sourcerecordid><originalsourceid>FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</originalsourceid><addsrcrecordid>eNqNkl1v0zAUhiMEYtPYL5iEInGDhFr8UX_dIE1hg0lFSMvGrXGck9ZVGhc7AbFfP2cpZeMKX_jzOa98znmz7AyjOcZIvT8viouynBNE8JxiirBEz7JjgrmaUUb580f7o-w0xg1KQ6YrJl5mR3ShCMcIH2ffr301xD6_WQeA2Rdng9-tfQd5uQOw67z0Q7CQL701rbszvfNdfhtdt8qvTVf7rbuDOi_TeWhNyL-ZdoD8I1i_3fnoRvpV9qIxbYTT_XqS3V5e3BSfZ8uvn66K8-XMMqz6mRHU1qoBVtVVhQ2qMeXQMGQrZYXlJOWHrZWS8FoQUJxTAMktU42tKasZPcmuJt3am43eBbc14bf2xumHCx9W2oTe2RY0Y5UB3CjFgSyQoMYy2uBGIipMhYVNWh8mrd1QbaG20PXBtE9En750bq1X_qeWXGIpcBJ4uxcI_scAsddbFy20renAD1Gn4qfOpXlE3_yDblLJu1SqRCEmkJBspOhEpf7EGKA5fAYjPRpCT4bQoyH03hAp6vXjPA4xf9qfADkBv6DyTbQOOgsHLDlGECYkZqN5cOH6h_4Xfuj6FPru_0MTfTbRDuAvpThZKEboPTDb3AU</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2605707851</pqid></control><display><type>article</type><title>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</title><source>IEEE Open Access Journals</source><source>Directory of Open Access Journals</source><source>Web of Science - Science Citation Index Expanded - 2021&lt;img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /&gt;</source><source>EZB Electronic Journals Library</source><creator>Tokgoz, Serkan ; Panahi, Issa M. S.</creator><creatorcontrib>Tokgoz, Serkan ; Panahi, Issa M. S.</creatorcontrib><description>Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.</description><identifier>ISSN: 2169-3536</identifier><identifier>EISSN: 2169-3536</identifier><identifier>DOI: 10.1109/ACCESS.2021.3130180</identifier><identifier>PMID: 34926101</identifier><identifier>CODEN: IAECCG</identifier><language>eng</language><publisher>PISCATAWAY: IEEE</publisher><subject>Algorithms ; Arrays ; Auditory system ; Beamforming ; Computer Science ; Computer Science, Information Systems ; Decomposition ; Direction of arrival ; Direction-of-arrival estimation ; Engineering ; Engineering, Electrical &amp; Electronic ; Estimation ; Hearing aid device ; Hearing aids ; Location awareness ; low SNR ; Microphone arrays ; Microphones ; non-uniform microphone arrays ; Performance enhancement ; Randomization ; randomized algorithm ; Real time ; real-time implementation ; Science &amp; Technology ; Signal processing ; Signal processing algorithms ; Signal to noise ratio ; Singular value decomposition ; smartphone ; Smartphones ; Speech processing ; speech source localization ; Technology ; Telecommunications ; Time lag</subject><ispartof>IEEE access, 2021, Vol.9, p.157800-157811</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>2</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000725781500001</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</citedby><cites>FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</cites><orcidid>0000-0001-7674-096X ; 0000-0002-1852-3104</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/9624952$$EHTML$$P50$$Gieee$$Hfree_for_read</linktohtml><link.rule.ids>230,315,781,785,865,886,2103,2115,4025,27638,27928,27929,27930,39263,54938</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34926101$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Tokgoz, Serkan</creatorcontrib><creatorcontrib>Panahi, Issa M. S.</creatorcontrib><title>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</title><title>IEEE access</title><addtitle>Access</addtitle><addtitle>IEEE ACCESS</addtitle><addtitle>IEEE Access</addtitle><description>Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.</description><subject>Algorithms</subject><subject>Arrays</subject><subject>Auditory system</subject><subject>Beamforming</subject><subject>Computer Science</subject><subject>Computer Science, Information Systems</subject><subject>Decomposition</subject><subject>Direction of arrival</subject><subject>Direction-of-arrival estimation</subject><subject>Engineering</subject><subject>Engineering, Electrical &amp; Electronic</subject><subject>Estimation</subject><subject>Hearing aid device</subject><subject>Hearing aids</subject><subject>Location awareness</subject><subject>low SNR</subject><subject>Microphone arrays</subject><subject>Microphones</subject><subject>non-uniform microphone arrays</subject><subject>Performance enhancement</subject><subject>Randomization</subject><subject>randomized algorithm</subject><subject>Real time</subject><subject>real-time implementation</subject><subject>Science &amp; Technology</subject><subject>Signal processing</subject><subject>Signal processing algorithms</subject><subject>Signal to noise ratio</subject><subject>Singular value decomposition</subject><subject>smartphone</subject><subject>Smartphones</subject><subject>Speech processing</subject><subject>speech source localization</subject><subject>Technology</subject><subject>Telecommunications</subject><subject>Time lag</subject><issn>2169-3536</issn><issn>2169-3536</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ESBDL</sourceid><sourceid>RIE</sourceid><sourceid>HGBXW</sourceid><sourceid>DOA</sourceid><recordid>eNqNkl1v0zAUhiMEYtPYL5iEInGDhFr8UX_dIE1hg0lFSMvGrXGck9ZVGhc7AbFfP2cpZeMKX_jzOa98znmz7AyjOcZIvT8viouynBNE8JxiirBEz7JjgrmaUUb580f7o-w0xg1KQ6YrJl5mR3ShCMcIH2ffr301xD6_WQeA2Rdng9-tfQd5uQOw67z0Q7CQL701rbszvfNdfhtdt8qvTVf7rbuDOi_TeWhNyL-ZdoD8I1i_3fnoRvpV9qIxbYTT_XqS3V5e3BSfZ8uvn66K8-XMMqz6mRHU1qoBVtVVhQ2qMeXQMGQrZYXlJOWHrZWS8FoQUJxTAMktU42tKasZPcmuJt3am43eBbc14bf2xumHCx9W2oTe2RY0Y5UB3CjFgSyQoMYy2uBGIipMhYVNWh8mrd1QbaG20PXBtE9En750bq1X_qeWXGIpcBJ4uxcI_scAsddbFy20renAD1Gn4qfOpXlE3_yDblLJu1SqRCEmkJBspOhEpf7EGKA5fAYjPRpCT4bQoyH03hAp6vXjPA4xf9qfADkBv6DyTbQOOgsHLDlGECYkZqN5cOH6h_4Xfuj6FPru_0MTfTbRDuAvpThZKEboPTDb3AU</recordid><startdate>2021</startdate><enddate>2021</enddate><creator>Tokgoz, Serkan</creator><creator>Panahi, Issa M. S.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>ESBDL</scope><scope>RIA</scope><scope>RIE</scope><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7SR</scope><scope>8BQ</scope><scope>8FD</scope><scope>JG9</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0001-7674-096X</orcidid><orcidid>https://orcid.org/0000-0002-1852-3104</orcidid></search><sort><creationdate>2021</creationdate><title>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</title><author>Tokgoz, Serkan ; Panahi, Issa M. S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c519t-a73cd9fe5bdbb1a0d136ef50cb9c7c621801cc8826d72e9663ee86c59fcd35d53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Arrays</topic><topic>Auditory system</topic><topic>Beamforming</topic><topic>Computer Science</topic><topic>Computer Science, Information Systems</topic><topic>Decomposition</topic><topic>Direction of arrival</topic><topic>Direction-of-arrival estimation</topic><topic>Engineering</topic><topic>Engineering, Electrical &amp; Electronic</topic><topic>Estimation</topic><topic>Hearing aid device</topic><topic>Hearing aids</topic><topic>Location awareness</topic><topic>low SNR</topic><topic>Microphone arrays</topic><topic>Microphones</topic><topic>non-uniform microphone arrays</topic><topic>Performance enhancement</topic><topic>Randomization</topic><topic>randomized algorithm</topic><topic>Real time</topic><topic>real-time implementation</topic><topic>Science &amp; Technology</topic><topic>Signal processing</topic><topic>Signal processing algorithms</topic><topic>Signal to noise ratio</topic><topic>Singular value decomposition</topic><topic>smartphone</topic><topic>Smartphones</topic><topic>Speech processing</topic><topic>speech source localization</topic><topic>Technology</topic><topic>Telecommunications</topic><topic>Time lag</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tokgoz, Serkan</creatorcontrib><creatorcontrib>Panahi, Issa M. S.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE Open Access Journals</collection><collection>IEEE All-Society Periodicals Package (ASPP) Online</collection><collection>IEEE Electronic Library Online</collection><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Engineered Materials Abstracts</collection><collection>METADEX</collection><collection>Technology Research Database</collection><collection>Materials Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>Directory of Open Access Journals</collection><jtitle>IEEE access</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tokgoz, Serkan</au><au>Panahi, Issa M. S.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition</atitle><jtitle>IEEE access</jtitle><stitle>Access</stitle><stitle>IEEE ACCESS</stitle><addtitle>IEEE Access</addtitle><date>2021</date><risdate>2021</risdate><volume>9</volume><spage>157800</spage><epage>157811</epage><pages>157800-157811</pages><issn>2169-3536</issn><eissn>2169-3536</eissn><coden>IAECCG</coden><abstract>Direction-of-arrival (DOA) estimation is a fundamental technique in array signal processing due to its wide applications in beamforming, speech enhancement and many other assistive speech processing technologies. In this paper, we devise a novel DOA technique based on randomized singular value decomposition (RSVD) to improve the performance of non-uniform non-linear microphone arrays (NUNLA). The accurate and efficient singular value decomposition of large data matrices is computationally challenging, and randomization provides an effective tool for performing matrix approximation, therefore, the developed DOA estimation utilizes a modified dictionary-based RSVD method for localizing single speech sources under low signal-to-noise ratios (SNR). Unlike previous methods developed for uniform linear microphone arrays, the proposed approach with L-shaped three microphone setup has no 'left-right' ambiguity. We present the performance of our proposed method in comparison to other techniques. The demonstrated experiments shows at-least 20% performance improvement using simulated data and 25% performance improvement using real data when compared with similar DoA estimation techniques for NUNLA. The proposed method exploits frame-based online time delay of arrival (TDOA) measurements which facilitates the proposed algorithm to run on real-time devices. We also show an efficient real-time implementation of the proposed method on a Pixel 3 Android smartphone using its built-in three microphones for hearing aid applications.</abstract><cop>PISCATAWAY</cop><pub>IEEE</pub><pmid>34926101</pmid><doi>10.1109/ACCESS.2021.3130180</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-7674-096X</orcidid><orcidid>https://orcid.org/0000-0002-1852-3104</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2169-3536
ispartof IEEE access, 2021, Vol.9, p.157800-157811
issn 2169-3536
2169-3536
language eng
recordid cdi_webofscience_primary_000725781500001CitationCount
source IEEE Open Access Journals; Directory of Open Access Journals; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />; EZB Electronic Journals Library
subjects Algorithms
Arrays
Auditory system
Beamforming
Computer Science
Computer Science, Information Systems
Decomposition
Direction of arrival
Direction-of-arrival estimation
Engineering
Engineering, Electrical & Electronic
Estimation
Hearing aid device
Hearing aids
Location awareness
low SNR
Microphone arrays
Microphones
non-uniform microphone arrays
Performance enhancement
Randomization
randomized algorithm
Real time
real-time implementation
Science & Technology
Signal processing
Signal processing algorithms
Signal to noise ratio
Singular value decomposition
smartphone
Smartphones
Speech processing
speech source localization
Technology
Telecommunications
Time lag
title Robust Three-Microphone Speech Source Localization Using Randomized Singular Value Decomposition
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T18%3A58%3A22IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_webof&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Robust%20Three-Microphone%20Speech%20Source%20Localization%20Using%20Randomized%20Singular%20Value%20Decomposition&rft.jtitle=IEEE%20access&rft.au=Tokgoz,%20Serkan&rft.date=2021&rft.volume=9&rft.spage=157800&rft.epage=157811&rft.pages=157800-157811&rft.issn=2169-3536&rft.eissn=2169-3536&rft.coden=IAECCG&rft_id=info:doi/10.1109/ACCESS.2021.3130180&rft_dat=%3Cproquest_webof%3E2605707851%3C/proquest_webof%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2605707851&rft_id=info:pmid/34926101&rft_ieee_id=9624952&rft_doaj_id=oai_doaj_org_article_55bae1f996e24073ac53f1f8037ab17c&rfr_iscdi=true