Domain Adaptation For Formant Estimation Using Deep Learning

In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes t...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Dissen, Yehoshua, Keshet, Joseph, Goldberger, Jacob, Clopper, Cynthia
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computation and Language Computer Science - Sound
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Dissen, Yehoshua Keshet, Joseph Goldberger, Jacob Clopper, Cynthia
description	In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes the obtained network universal in the sense that it works well for a variety of speakers and speech domains with very different characteristics. We evaluated our adapted network on three datasets, each of which has different speaker characteristics and speech styles. The performance of our method compares favorably with alternative methods for formant estimation.
doi_str_mv	10.48550/arxiv.1611.01783
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1611_01783</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1611_01783</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-6c75159fa445e78b227f56ec7af820a556036d84cc53028970057ed7f4a98e0a3</originalsourceid><addsrcrecordid>eNotj71OxDAQhN1QoIMHoMIvkLCOvV5HojndDyBFuuaooyWxkSXiRE6E4O25H4rRaKYYfSPEg4LSOER44vwTv0tllSpBkdO34nk7DhyTXPc8LbzEMcn9mM8aOC1yNy9xuNbvc0yfcuv9JBvPOZ3SnbgJ_DX7-39fieN-d9y8Fs3h5W2zbgq2pAvbESqsAxuDntxHVVFA6zvi4CpgRAva9s50HWqoXE0ASL6nYLh2HlivxON19oLfTvmElH_b8432ckP_AVINQR4</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Domain Adaptation For Formant Estimation Using Deep Learning</title><source>arXiv.org</source><creator>Dissen, Yehoshua ; Keshet, Joseph ; Goldberger, Jacob ; Clopper, Cynthia</creator><creatorcontrib>Dissen, Yehoshua ; Keshet, Joseph ; Goldberger, Jacob ; Clopper, Cynthia</creatorcontrib><description>In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes the obtained network universal in the sense that it works well for a variety of speakers and speech domains with very different characteristics. We evaluated our adapted network on three datasets, each of which has different speaker characteristics and speech styles. The performance of our method compares favorably with alternative methods for formant estimation.</description><identifier>DOI: 10.48550/arxiv.1611.01783</identifier><language>eng</language><subject>Computer Science - Computation and Language ; Computer Science - Sound</subject><creationdate>2016-11</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1611.01783$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1611.01783$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Dissen, Yehoshua</creatorcontrib><creatorcontrib>Keshet, Joseph</creatorcontrib><creatorcontrib>Goldberger, Jacob</creatorcontrib><creatorcontrib>Clopper, Cynthia</creatorcontrib><title>Domain Adaptation For Formant Estimation Using Deep Learning</title><description>In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes the obtained network universal in the sense that it works well for a variety of speakers and speech domains with very different characteristics. We evaluated our adapted network on three datasets, each of which has different speaker characteristics and speech styles. The performance of our method compares favorably with alternative methods for formant estimation.</description><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Sound</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj71OxDAQhN1QoIMHoMIvkLCOvV5HojndDyBFuuaooyWxkSXiRE6E4O25H4rRaKYYfSPEg4LSOER44vwTv0tllSpBkdO34nk7DhyTXPc8LbzEMcn9mM8aOC1yNy9xuNbvc0yfcuv9JBvPOZ3SnbgJ_DX7-39fieN-d9y8Fs3h5W2zbgq2pAvbESqsAxuDntxHVVFA6zvi4CpgRAva9s50HWqoXE0ASL6nYLh2HlivxON19oLfTvmElH_b8432ckP_AVINQR4</recordid><startdate>20161106</startdate><enddate>20161106</enddate><creator>Dissen, Yehoshua</creator><creator>Keshet, Joseph</creator><creator>Goldberger, Jacob</creator><creator>Clopper, Cynthia</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20161106</creationdate><title>Domain Adaptation For Formant Estimation Using Deep Learning</title><author>Dissen, Yehoshua ; Keshet, Joseph ; Goldberger, Jacob ; Clopper, Cynthia</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-6c75159fa445e78b227f56ec7af820a556036d84cc53028970057ed7f4a98e0a3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Computer Science - Computation and Language</topic><topic>Computer Science - Sound</topic><toplevel>online_resources</toplevel><creatorcontrib>Dissen, Yehoshua</creatorcontrib><creatorcontrib>Keshet, Joseph</creatorcontrib><creatorcontrib>Goldberger, Jacob</creatorcontrib><creatorcontrib>Clopper, Cynthia</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Dissen, Yehoshua</au><au>Keshet, Joseph</au><au>Goldberger, Jacob</au><au>Clopper, Cynthia</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Domain Adaptation For Formant Estimation Using Deep Learning</atitle><date>2016-11-06</date><risdate>2016</risdate><abstract>In this paper we present a domain adaptation technique for formant estimation using a deep network. We first train a deep learning network on a small read speech dataset. We then freeze the parameters of the trained network and use several different datasets to train an adaptation layer that makes the obtained network universal in the sense that it works well for a variety of speakers and speech domains with very different characteristics. We evaluated our adapted network on three datasets, each of which has different speaker characteristics and speech styles. The performance of our method compares favorably with alternative methods for formant estimation.</abstract><doi>10.48550/arxiv.1611.01783</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.1611.01783
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_1611_01783
source	arXiv.org
subjects	Computer Science - Computation and Language Computer Science - Sound
title	Domain Adaptation For Formant Estimation Using Deep Learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T09%3A53%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Domain%20Adaptation%20For%20Formant%20Estimation%20Using%20Deep%20Learning&rft.au=Dissen,%20Yehoshua&rft.date=2016-11-06&rft_id=info:doi/10.48550/arxiv.1611.01783&rft_dat=%3Carxiv_GOX%3E1611_01783%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true