Speaker Identification using Frequency Distribution in the Transform Domain
Published in: | International journal of advanced computer science & applications 2012-01, Vol.3 (2) |
---|---|
Main authors: | B, H ; Kulkarni, Vaishali |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
container_end_page | |
---|---|
container_issue | 2 |
container_start_page | |
container_title | International journal of advanced computer science & applications |
container_volume | 3 |
creator | B, H ; Kulkarni, Vaishali |
description | This paper proposes speaker identification using the frequency distribution of seven transforms: DFT (Discrete Fourier Transform), DCT (Discrete Cosine Transform), DST (Discrete Sine Transform), Hartley, Walsh, Haar and Kekre. The speech signal of a given speaker is converted into the frequency domain by applying each transform. The distribution in the transform domain is used to extract feature vectors in both the training and matching phases. The results from all seven transforms are analyzed and compared. The DFT, DCT, DST and Hartley transforms give comparable results (above 96%), whereas the Haar and Kekre transforms perform very poorly. The best result is obtained with the DFT (97.19% for a feature vector of size 40). |
doi_str_mv | 10.14569/IJACSA.2012.030213 |
format | Article |
publisher | West Yorkshire: Science and Information (SAI) Organization Limited |
rights | 2012. This work is licensed under https://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. |
oa | free_for_read |
fulltext | fulltext |
identifier | ISSN: 2158-107X |
ispartof | International journal of advanced computer science & applications, 2012-01, Vol.3 (2) |
issn | 2158-107X 2156-5570 |
language | eng |
recordid | cdi_proquest_journals_2656757481 |
source | EZB-FREE-00999 freely available EZB journals |
subjects | Discrete cosine transform ; Feature extraction ; Fourier transforms ; Frequency distribution ; Phase matching ; Trigonometric functions |
title | Speaker Identification using Frequency Distribution in the Transform Domain |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T18%3A29%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Speaker%20Identification%20using%20Frequency%20Dsitribution%20in%20the%20Transform%20Domain&rft.jtitle=International%20journal%20of%20advanced%20computer%20science%20&%20applications&rft.au=B,%20H&rft.date=2012-01-01&rft.volume=3&rft.issue=2&rft.issn=2158-107X&rft.eissn=2156-5570&rft_id=info:doi/10.14569/IJACSA.2012.030213&rft_dat=%3Cproquest_cross%3E2656757481%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2656757481&rft_id=info:pmid/&rfr_iscdi=true |
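
The abstract describes a pipeline of transform, frequency-distribution feature vector, and matching, but this record does not say how the distribution is computed or which distance is used. The sketch below is one possible reading, not the authors' method. It assumes that the DFT magnitude spectrum is split into equal-width bands, that each band's share of the total magnitude forms the feature vector, and that matching uses Euclidean distance. The names `dft_magnitudes`, `feature_vector`, `identify` and `tone`, and the synthetic test tones, are hypothetical.

```python
import cmath
import math


def dft_magnitudes(signal):
    """Magnitudes of the first half of the DFT of a real signal (naive O(n^2))."""
    n = len(signal)
    mags = []
    for k in range(n // 2):
        acc = 0j
        for t, x in enumerate(signal):
            acc += x * cmath.exp(-2j * math.pi * k * t / n)
        mags.append(abs(acc))
    return mags


def feature_vector(signal, size=40):
    """Assumed feature definition: split the magnitude spectrum into `size`
    equal bands and return each band's share of the total magnitude."""
    mags = dft_magnitudes(signal)
    band = len(mags) // size
    if band == 0:
        raise ValueError("signal too short for the requested feature size")
    sums = [sum(mags[i * band:(i + 1) * band]) for i in range(size)]
    total = sum(sums) or 1.0  # avoid division by zero on silent input
    return [s / total for s in sums]


def identify(test_signal, enrolled, size=40):
    """Return the enrolled name whose stored feature vector is closest to the
    test signal's vector (Euclidean distance is an assumption)."""
    probe = feature_vector(test_signal, size)
    return min(enrolled, key=lambda name: math.dist(probe, enrolled[name]))


def tone(freq, n=160):
    """Synthetic stand-in for a speech frame: a sine with `freq` cycles."""
    return [math.sin(2 * math.pi * freq * t / n) for t in range(n)]


# Training phase: store one feature vector per (hypothetical) speaker.
enrolled = {
    "speaker_a": feature_vector(tone(5), size=8),
    "speaker_b": feature_vector(tone(30), size=8),
}

# Matching phase: a tone close to speaker_a's frequency.
print(identify(tone(6), enrolled, size=8))
```

Replacing `dft_magnitudes` with another transform (DCT, DST, Hartley, and so on) would leave the rest of the sketch unchanged, which matches the comparison of transforms described in the abstract. The feature size of 40 reported there is kept as the default, while the toy example uses 8 bands so that short synthetic frames suffice.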