Comparison of Methods for Topic Classification of Spoken Inquiries

In this work, we address the topic classification of spoken inquiries in Japanese that are received by a speech-oriented guidance system operating in a real environment. The classification of spoken inquiries is often hindered by automatic speech recognition (ASR) errors, the sparseness of features...

Bibliographic details
Published in:	Information and Media Technologies 2013, Vol.8(2), pp.438-448
Main authors:	Torres, Rafael; Kawanami, Hiromichi; Matsui, Tomoko; Saruwatari, Hiroshi; Shikano, Kiyohiro
Format:	Article
Language:	eng
Subjects:	maximum entropy; PrefixSpan boosting; stacked generalization; support vector machine; topic classification
Online access:	Full text

container_end_page	448
container_issue	2
container_start_page	438
container_title	Information and Media Technologies
container_volume	8
creator	Torres, Rafael; Kawanami, Hiromichi; Matsui, Tomoko; Saruwatari, Hiroshi; Shikano, Kiyohiro
description	In this work, we address the topic classification of spoken inquiries in Japanese that are received by a speech-oriented guidance system operating in a real environment. The classification of spoken inquiries is often hindered by automatic speech recognition (ASR) errors, the sparseness of features and the shortness of spontaneous speech utterances. Here, we compare the performances of a support vector machine (SVM) with a radial basis function (RBF) kernel, PrefixSpan boosting (pboost) and the maximum entropy (ME) method, which are supervised learning methods. We also combine their predictions using a stacked generalization (SG) scheme. We also perform an evaluation using words or characters as features for the classifiers. Using characters as features is possible in Japanese owing to the presence of kanji, ideograms originating from Chinese characters that represent not only sounds but also meanings. We performed analyses on the performance of the above methods and their combination in dealing with the indicated problems. Experimental results show an F-measure of 86.87% for the classification of ASR results from children's inquiries with an average performance improvement of 2.81% compared with the performance of individual classifiers, and an F-measure of 93.96% with an average improvement of 1.89% for adults' inquiries when using the SG scheme and character features.
doi_str_mv	10.11185/imt.8.438
format	Article
fullrecord	<record><control><sourceid>proquest_jstag</sourceid><recordid>TN_cdi_proquest_journals_1477988268</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3184165341</sourcerecordid><originalsourceid>FETCH-LOGICAL-j2768-321e468a8dfefbc5711384e92388ab60cd70a05697e5b066ca9c644539a572153</originalsourceid><addsrcrecordid>eNo9kMFKxDAQhoMguKx78QkKnrsmTZNM8aRF3YUVD67nME1TN3XbdJP24NtbWXHgZ-DnYwY-Qm4YXTPGQNy5blzDOudwQRYMgKUUCnlFVjG29HcUZUotyGPpuwGDi75PfJO82vHg65g0PiR7PziTlEeM0TXO4OjOzPvgv2yfbPvT5IKz8ZpcNniMdvW3l-Tj-WlfbtLd28u2fNilbaYkpDxjNpeAUDe2qYxQjHHIbZFxAKwkNbWiSIUslBUVldJgYWSeC16gUBkTfEluz3eH4E-TjaNu_RT6-aVmuVIFQCZhpu7PVBtH_LR6CK7D8K0xjM4crZ69aNDZnNnNf2sOGLTt-Q-XQl8g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1477988268</pqid></control><display><type>article</type><title>Comparison of Methods for Topic Classification of Spoken Inquiries</title><source>J-STAGE Free</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><creator>Torres, Rafael ; Kawanami, Hiromichi ; Matsui, Tomoko ; Saruwatari, Hiroshi ; Shikano, Kiyohiro</creator><creatorcontrib>Torres, Rafael ; Kawanami, Hiromichi ; Matsui, Tomoko ; Saruwatari, Hiroshi ; Shikano, Kiyohiro</creatorcontrib><description>In this work, we address the topic classification of spoken inquiries in Japanese that are received by a speech-oriented guidance system operating in a real environment. The classification of spoken inquiries is often hindered by automatic speech recognition (ASR) errors, the sparseness of features and the shortness of spontaneous speech utterances. Here, we compare the performances of a support vector machine (SVM) with a radial basis function (RBF) kernel, PrefixSpan boosting (pboost) and the maximum entropy (ME) method, which are supervised learning methods. We also combine their predictions using a stacked generalization (SG) scheme. 
We also perform an evaluation using words or characters as features for the classifiers. Using characters as features is possible in Japanese owing to the presence of kanji, ideograms originating from Chinese characters that represent not only sounds but also meanings. We performed analyses on the performance of the above methods and their combination in dealing with the indicated problems. Experimental results show an F-measure of 86.87% for the classification of ASR results from children's inquiries with an average performance improvement of 2.81% compared with the performance of individual classifiers, and an F-measure of 93.96% with an average improvement of 1.89% for adults' inquiries when using the SG scheme and character features.</description><identifier>EISSN: 1881-0896</identifier><identifier>DOI: 10.11185/imt.8.438</identifier><language>eng</language><publisher>Tokyo: Information and Media Technologies Editorial Board</publisher><subject>maximum entropy ; PrefixSpan boosting ; stacked generalization ; support vector machine ; topic classification</subject><ispartof>Information and Media Technologies, 2013, Vol.8(2), pp.438-448</ispartof><rights>2013 Information Processing Society of Japan</rights><rights>Copyright Japan Science and Technology Agency 2013</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,1877,4010,27900,27901,27902</link.rule.ids></links><search><creatorcontrib>Torres, Rafael</creatorcontrib><creatorcontrib>Kawanami, Hiromichi</creatorcontrib><creatorcontrib>Matsui, Tomoko</creatorcontrib><creatorcontrib>Saruwatari, Hiroshi</creatorcontrib><creatorcontrib>Shikano, Kiyohiro</creatorcontrib><title>Comparison of Methods for Topic Classification of Spoken Inquiries</title><title>Information and Media 
Technologies</title><addtitle>IMT</addtitle><description>In this work, we address the topic classification of spoken inquiries in Japanese that are received by a speech-oriented guidance system operating in a real environment. The classification of spoken inquiries is often hindered by automatic speech recognition (ASR) errors, the sparseness of features and the shortness of spontaneous speech utterances. Here, we compare the performances of a support vector machine (SVM) with a radial basis function (RBF) kernel, PrefixSpan boosting (pboost) and the maximum entropy (ME) method, which are supervised learning methods. We also combine their predictions using a stacked generalization (SG) scheme. We also perform an evaluation using words or characters as features for the classifiers. Using characters as features is possible in Japanese owing to the presence of kanji, ideograms originating from Chinese characters that represent not only sounds but also meanings. We performed analyses on the performance of the above methods and their combination in dealing with the indicated problems. 
Experimental results show an F-measure of 86.87% for the classification of ASR results from children's inquiries with an average performance improvement of 2.81% compared with the performance of individual classifiers, and an F-measure of 93.96% with an average improvement of 1.89% for adults' inquiries when using the SG scheme and character features.</description><subject>maximum entropy</subject><subject>PrefixSpan boosting</subject><subject>stacked generalization</subject><subject>support vector machine</subject><subject>topic classification</subject><issn>1881-0896</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><recordid>eNo9kMFKxDAQhoMguKx78QkKnrsmTZNM8aRF3YUVD67nME1TN3XbdJP24NtbWXHgZ-DnYwY-Qm4YXTPGQNy5blzDOudwQRYMgKUUCnlFVjG29HcUZUotyGPpuwGDi75PfJO82vHg65g0PiR7PziTlEeM0TXO4OjOzPvgv2yfbPvT5IKz8ZpcNniMdvW3l-Tj-WlfbtLd28u2fNilbaYkpDxjNpeAUDe2qYxQjHHIbZFxAKwkNbWiSIUslBUVldJgYWSeC16gUBkTfEluz3eH4E-TjaNu_RT6-aVmuVIFQCZhpu7PVBtH_LR6CK7D8K0xjM4crZ69aNDZnNnNf2sOGLTt-Q-XQl8g</recordid><startdate>2013</startdate><enddate>2013</enddate><creator>Torres, Rafael</creator><creator>Kawanami, Hiromichi</creator><creator>Matsui, Tomoko</creator><creator>Saruwatari, Hiroshi</creator><creator>Shikano, Kiyohiro</creator><general>Information and Media Technologies Editorial Board</general><general>Japan Science and Technology Agency</general><scope>7SC</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2013</creationdate><title>Comparison of Methods for Topic Classification of Spoken Inquiries</title><author>Torres, Rafael ; Kawanami, Hiromichi ; Matsui, Tomoko ; Saruwatari, Hiroshi ; Shikano, 
Kiyohiro</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-j2768-321e468a8dfefbc5711384e92388ab60cd70a05697e5b066ca9c644539a572153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>maximum entropy</topic><topic>PrefixSpan boosting</topic><topic>stacked generalization</topic><topic>support vector machine</topic><topic>topic classification</topic><toplevel>online_resources</toplevel><creatorcontrib>Torres, Rafael</creatorcontrib><creatorcontrib>Kawanami, Hiromichi</creatorcontrib><creatorcontrib>Matsui, Tomoko</creatorcontrib><creatorcontrib>Saruwatari, Hiroshi</creatorcontrib><creatorcontrib>Shikano, Kiyohiro</creatorcontrib><collection>Computer and Information Systems Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Information and Media Technologies</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Torres, Rafael</au><au>Kawanami, Hiromichi</au><au>Matsui, Tomoko</au><au>Saruwatari, Hiroshi</au><au>Shikano, Kiyohiro</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Comparison of Methods for Topic Classification of Spoken Inquiries</atitle><jtitle>Information and Media Technologies</jtitle><addtitle>IMT</addtitle><date>2013</date><risdate>2013</risdate><volume>8</volume><issue>2</issue><spage>438</spage><epage>448</epage><pages>438-448</pages><eissn>1881-0896</eissn><abstract>In this work, we address the topic classification of spoken inquiries in Japanese that are received by a speech-oriented guidance system operating in a real 
environment. The classification of spoken inquiries is often hindered by automatic speech recognition (ASR) errors, the sparseness of features and the shortness of spontaneous speech utterances. Here, we compare the performances of a support vector machine (SVM) with a radial basis function (RBF) kernel, PrefixSpan boosting (pboost) and the maximum entropy (ME) method, which are supervised learning methods. We also combine their predictions using a stacked generalization (SG) scheme. We also perform an evaluation using words or characters as features for the classifiers. Using characters as features is possible in Japanese owing to the presence of kanji, ideograms originating from Chinese characters that represent not only sounds but also meanings. We performed analyses on the performance of the above methods and their combination in dealing with the indicated problems. Experimental results show an F-measure of 86.87% for the classification of ASR results from children's inquiries with an average performance improvement of 2.81% compared with the performance of individual classifiers, and an F-measure of 93.96% with an average improvement of 1.89% for adults' inquiries when using the SG scheme and character features.</abstract><cop>Tokyo</cop><pub>Information and Media Technologies Editorial Board</pub><doi>10.11185/imt.8.438</doi><tpages>11</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 1881-0896
ispartof	Information and Media Technologies, 2013, Vol.8(2), pp.438-448
issn	1881-0896
language	eng
recordid	cdi_proquest_journals_1477988268
source	J-STAGE Free; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals
subjects	maximum entropy; PrefixSpan boosting; stacked generalization; support vector machine; topic classification
title	Comparison of Methods for Topic Classification of Spoken Inquiries
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-02T03%3A45%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_jstag&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Comparison%20of%20Methods%20for%20Topic%20Classification%20of%20Spoken%20Inquiries&rft.jtitle=Information%20and%20Media%20Technologies&rft.au=Torres,%20Rafael&rft.date=2013&rft.volume=8&rft.issue=2&rft.spage=438&rft.epage=448&rft.pages=438-448&rft.eissn=1881-0896&rft_id=info:doi/10.11185/imt.8.438&rft_dat=%3Cproquest_jstag%3E3184165341%3C/proquest_jstag%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1477988268&rft_id=info:pmid/&rfr_iscdi=true
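
The description field above notes that characters, not only words, can serve as classifier features for Japanese. The following is a minimal sketch of that idea, not the authors' implementation: the function names `char_ngrams` and `word_unigrams`, the use of n-grams up to length 2, the sample utterance, and its hand-made word segmentation are all assumptions made for illustration. It shows that character features can be read directly off the recognized text, whereas word features presuppose a segmentation step.

```python
from collections import Counter


def char_ngrams(text, n_max=2):
    """Count character n-grams (n = 1..n_max) in text, skipping whitespace.

    Hypothetical helper: the n-gram range is an illustrative choice,
    not a setting taken from the article.
    """
    chars = [c for c in text if not c.isspace()]
    feats = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(chars) - n + 1):
            feats["".join(chars[i:i + n])] += 1
    return feats


def word_unigrams(tokens):
    """Count word features from an already segmented utterance."""
    return Counter(tokens)


# Illustrative utterance ("Where is the library?"), not from the paper's data.
utterance = "図書館はどこですか"
# Hand-made segmentation (assumption); a real system would need a
# morphological analyzer to produce these tokens.
tokens = ["図書館", "は", "どこ", "です", "か"]

char_feats = char_ngrams(utterance)
word_feats = word_unigrams(tokens)

print(len(char_feats), "distinct character features")
print(len(word_feats), "distinct word features")
```

Either feature dictionary could then be passed to a vectorizer and a supervised classifier. Character features sidestep segmentation errors, at the cost of a larger feature space.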