Two-level Noise Robust and Block Featured PNN Model for Speaker Recognition in Real Environment

Speaker recognition is gaining popularity in a device and application-specific verification and validation to avoid complex textual passwords and keep remembering them. Various devices and applications have adapted speaker-based verification to ensure online and offline access. However, speaker reco...

Detailed description

Bibliographic details
Published in:	Wireless personal communications, 2022, Vol.125 (4), p.3741-3771
Main author:	Juneja, Kapil
Format:	Article
Language:	eng
Subjects:	Accuracy; Communications Engineering; Computer Communication Networks; Deep learning; Engineering; Feature extraction; Machine learning; Networks; Neural networks; Noise; Robustness; Signal,Image and Speech Processing; Speech recognition; Statistical analysis; Subtraction; Verification
Online access:	Full text

container_end_page	3771
container_issue	4
container_start_page	3741
container_title	Wireless personal communications
container_volume	125
creator	Juneja, Kapil
description	Speaker recognition is gaining popularity for device- and application-specific verification and validation, as it avoids complex textual passwords and the need to remember them. Various devices and applications have adopted speaker-based verification to secure online and offline access. However, speaker recognition is affected by multiple device- and environment-specific disturbances. In this paper, the two-level noise-robust PNN model (2LNR-PNN) is presented for speaker recognition. Noise is handled at both the pre-processing level and the feature-set generation stage. High-level noise and situational turbulence are addressed using spectral subtraction and the GMM method. The denoised signal is processed with frequency- and window-based computation to extract MFCC, LPC, and statistical features. This composite feature set is passed to a Probabilistic Neural Network (PNN) to identify the speaker. The proposed model was evaluated on the THUYG-20 SRE corpus and a self-collected real-time dataset. Separate experiments were conducted under different noise conditions with car, fan, white, cafeteria and babble noise. The results are compared against various feature processors, machine learning and deep learning models. Performance is measured using accuracy, EER and FRR. The proposed model claims an average accuracy of over 80% and a maximum FRR of 0.2 across the varied noises at 1 dB, 5 dB and 9 dB SNR. The proposed model outperformed the evaluated machine learning and deep learning models with a significant performance gain.
doi_str_mv	10.1007/s11277-022-09734-7
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2700751583</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2700751583</sourcerecordid><originalsourceid>FETCH-LOGICAL-c249t-931e9dab7bd3f23dea3da68302ac150929b2ebc41e87da8a5a83ba78599c7a8b3</originalsourceid><addsrcrecordid>eNp9kF1LwzAUhoMoOKd_wKuA19F8tEtzqWNTYU6ZE7wLaXs6unXJTNqJ_97MCt55dTjwPu85PAhdMnrNKJU3gTEuJaGcE6qkSIg8QgOWSk4ykbwfowFVXJERZ_wUnYWwpjRiig-QXn460sAeGjx3dQC8cHkXWmxsie8aV2zwFEzbeSjxy3yOn1wZk5Xz-HUHZgMeL6BwK1u3tbO4tnE1DZ7Yfe2d3YJtz9FJZZoAF79ziN6mk-X4gcye7x_HtzNS8ES1RAkGqjS5zEtRcVGCEaUZZYJyU7D08HzOIS8SBpksTWZSk4ncyCxVqpAmy8UQXfW9O-8-OgitXrvO23hScxkNpSzNREzxPlV4F4KHSu98vTX-SzOqDyJ1L1JHkfpHpJYREj0UYtiuwP9V_0N9A2midjk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2700751583</pqid></control><display><type>article</type><title>Two-level Noise Robust and Block Featured PNN Model for Speaker Recognition in Real Environment</title><source>Springer Nature - Complete Springer Journals</source><creator>Juneja, Kapil</creator><creatorcontrib>Juneja, Kapil</creatorcontrib><description>Speaker recognition is gaining popularity in a device and application-specific verification and validation to avoid complex textual passwords and keep remembering them. Various devices and applications have adapted speaker-based verification to ensure online and offline access. However, speaker recognition is also affected by multiple devices and environment-specific disturbances. In this paper, the Two-level noise-robust PNN model (2LNR-PNN) is presented for the significant recognition of the speaker. The noise is handled during the pre-processing level and the featureset generation stage. The high-level noise and situational turbulence were addressed in this work using spectral subtraction and the GMM method. 
This rectified noise is processed under frequency and window-based computation to extract the MFCC, LPC, and statistical features. This composite featureset is processed under Probabilistic Neural Network (PNN) for identifying the speaker. The proposed model has experimented on THUYG-20 SRE Corpus and self-collected real-time dataset. The separate experiments are conducted in different noise conditions with car, fan, white, cafeteria and babble noises. The experiments are validated against various feature processors, machine learning and deep learning models. The analytical observations are collected using accuracy, EER and FRR measures. The proposed model claims an average accuracy of over 80% and a maximum FRR of 0.2 in varied noises with 1db, 5db and 9db SNR conditions. The proposed model outperformed the experimented machine learning and deep learning models with a significant performance gain.</description><identifier>ISSN: 0929-6212</identifier><identifier>EISSN: 1572-834X</identifier><identifier>DOI: 10.1007/s11277-022-09734-7</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Accuracy ; Communications Engineering ; Computer Communication Networks ; Deep learning ; Engineering ; Feature extraction ; Machine learning ; Networks ; Neural networks ; Noise ; Robustness ; Signal,Image and Speech Processing ; Speech recognition ; Statistical analysis ; Subtraction ; Verification</subject><ispartof>Wireless personal communications, 2022, Vol.125 (4), p.3741-3771</ispartof><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2022</rights><rights>The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 
2022.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c249t-931e9dab7bd3f23dea3da68302ac150929b2ebc41e87da8a5a83ba78599c7a8b3</citedby><cites>FETCH-LOGICAL-c249t-931e9dab7bd3f23dea3da68302ac150929b2ebc41e87da8a5a83ba78599c7a8b3</cites><orcidid>0000-0002-6351-3351</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11277-022-09734-7$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11277-022-09734-7$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27903,27904,41467,42536,51297</link.rule.ids></links><search><creatorcontrib>Juneja, Kapil</creatorcontrib><title>Two-level Noise Robust and Block Featured PNN Model for Speaker Recognition in Real Environment</title><title>Wireless personal communications</title><addtitle>Wireless Pers Commun</addtitle><description>Speaker recognition is gaining popularity in a device and application-specific verification and validation to avoid complex textual passwords and keep remembering them. Various devices and applications have adapted speaker-based verification to ensure online and offline access. However, speaker recognition is also affected by multiple devices and environment-specific disturbances. In this paper, the Two-level noise-robust PNN model (2LNR-PNN) is presented for the significant recognition of the speaker. The noise is handled during the pre-processing level and the featureset generation stage. The high-level noise and situational turbulence were addressed in this work using spectral subtraction and the GMM method. This rectified noise is processed under frequency and window-based computation to extract the MFCC, LPC, and statistical features. 
This composite featureset is processed under Probabilistic Neural Network (PNN) for identifying the speaker. The proposed model has experimented on THUYG-20 SRE Corpus and self-collected real-time dataset. The separate experiments are conducted in different noise conditions with car, fan, white, cafeteria and babble noises. The experiments are validated against various feature processors, machine learning and deep learning models. The analytical observations are collected using accuracy, EER and FRR measures. The proposed model claims an average accuracy of over 80% and a maximum FRR of 0.2 in varied noises with 1db, 5db and 9db SNR conditions. The proposed model outperformed the experimented machine learning and deep learning models with a significant performance gain.</description><subject>Accuracy</subject><subject>Communications Engineering</subject><subject>Computer Communication Networks</subject><subject>Deep learning</subject><subject>Engineering</subject><subject>Feature extraction</subject><subject>Machine learning</subject><subject>Networks</subject><subject>Neural networks</subject><subject>Noise</subject><subject>Robustness</subject><subject>Signal,Image and Speech Processing</subject><subject>Speech recognition</subject><subject>Statistical 
analysis</subject><subject>Subtraction</subject><subject>Verification</subject><issn>0929-6212</issn><issn>1572-834X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp9kF1LwzAUhoMoOKd_wKuA19F8tEtzqWNTYU6ZE7wLaXs6unXJTNqJ_97MCt55dTjwPu85PAhdMnrNKJU3gTEuJaGcE6qkSIg8QgOWSk4ykbwfowFVXJERZ_wUnYWwpjRiig-QXn460sAeGjx3dQC8cHkXWmxsie8aV2zwFEzbeSjxy3yOn1wZk5Xz-HUHZgMeL6BwK1u3tbO4tnE1DZ7Yfe2d3YJtz9FJZZoAF79ziN6mk-X4gcye7x_HtzNS8ES1RAkGqjS5zEtRcVGCEaUZZYJyU7D08HzOIS8SBpksTWZSk4ncyCxVqpAmy8UQXfW9O-8-OgitXrvO23hScxkNpSzNREzxPlV4F4KHSu98vTX-SzOqDyJ1L1JHkfpHpJYREj0UYtiuwP9V_0N9A2midjk</recordid><startdate>2022</startdate><enddate>2022</enddate><creator>Juneja, Kapil</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope><orcidid>https://orcid.org/0000-0002-6351-3351</orcidid></search><sort><creationdate>2022</creationdate><title>Two-level Noise Robust and Block Featured PNN Model for Speaker Recognition in Real Environment</title><author>Juneja, Kapil</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c249t-931e9dab7bd3f23dea3da68302ac150929b2ebc41e87da8a5a83ba78599c7a8b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Communications Engineering</topic><topic>Computer Communication Networks</topic><topic>Deep learning</topic><topic>Engineering</topic><topic>Feature extraction</topic><topic>Machine learning</topic><topic>Networks</topic><topic>Neural networks</topic><topic>Noise</topic><topic>Robustness</topic><topic>Signal,Image and Speech Processing</topic><topic>Speech recognition</topic><topic>Statistical analysis</topic><topic>Subtraction</topic><topic>Verification</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Juneja, 
Kapil</creatorcontrib><collection>CrossRef</collection><jtitle>Wireless personal communications</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Juneja, Kapil</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Two-level Noise Robust and Block Featured PNN Model for Speaker Recognition in Real Environment</atitle><jtitle>Wireless personal communications</jtitle><stitle>Wireless Pers Commun</stitle><date>2022</date><risdate>2022</risdate><volume>125</volume><issue>4</issue><spage>3741</spage><epage>3771</epage><pages>3741-3771</pages><issn>0929-6212</issn><eissn>1572-834X</eissn><abstract>Speaker recognition is gaining popularity in a device and application-specific verification and validation to avoid complex textual passwords and keep remembering them. Various devices and applications have adapted speaker-based verification to ensure online and offline access. However, speaker recognition is also affected by multiple devices and environment-specific disturbances. In this paper, the Two-level noise-robust PNN model (2LNR-PNN) is presented for the significant recognition of the speaker. The noise is handled during the pre-processing level and the featureset generation stage. The high-level noise and situational turbulence were addressed in this work using spectral subtraction and the GMM method. This rectified noise is processed under frequency and window-based computation to extract the MFCC, LPC, and statistical features. This composite featureset is processed under Probabilistic Neural Network (PNN) for identifying the speaker. The proposed model has experimented on THUYG-20 SRE Corpus and self-collected real-time dataset. The separate experiments are conducted in different noise conditions with car, fan, white, cafeteria and babble noises. The experiments are validated against various feature processors, machine learning and deep learning models. 
The analytical observations are collected using accuracy, EER and FRR measures. The proposed model claims an average accuracy of over 80% and a maximum FRR of 0.2 in varied noises with 1db, 5db and 9db SNR conditions. The proposed model outperformed the experimented machine learning and deep learning models with a significant performance gain.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11277-022-09734-7</doi><tpages>31</tpages><orcidid>https://orcid.org/0000-0002-6351-3351</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 0929-6212
ispartof	Wireless personal communications, 2022, Vol.125 (4), p.3741-3771
issn	0929-6212 1572-834X
language	eng
recordid	cdi_proquest_journals_2700751583
source	Springer Nature - Complete Springer Journals
subjects	Accuracy; Communications Engineering; Computer Communication Networks; Deep learning; Engineering; Feature extraction; Machine learning; Networks; Neural networks; Noise; Robustness; Signal,Image and Speech Processing; Speech recognition; Statistical analysis; Subtraction; Verification
title	Two-level Noise Robust and Block Featured PNN Model for Speaker Recognition in Real Environment
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-24T06%3A08%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Two-level%20Noise%20Robust%20and%20Block%20Featured%20PNN%20Model%20for%20Speaker%20Recognition%20in%20Real%20Environment&rft.jtitle=Wireless%20personal%20communications&rft.au=Juneja,%20Kapil&rft.date=2022&rft.volume=125&rft.issue=4&rft.spage=3741&rft.epage=3771&rft.pages=3741-3771&rft.issn=0929-6212&rft.eissn=1572-834X&rft_id=info:doi/10.1007/s11277-022-09734-7&rft_dat=%3Cproquest_cross%3E2700751583%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2700751583&rft_id=info:pmid/&rfr_iscdi=true