A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

Recently there have been efforts to introduce new benchmark tasks for spoken language understanding (SLU), like semantic parsing. In this paper, we describe our proposed spoken semantic parsing system for the quality track (Track 1) in Spoken Language Understanding Grand Challenge which is part of I...

Bibliographic Details
Main Authors:	Arora, Siddhant; Futami, Hayato; Wu, Shih-Lun; Huynh, Jessica; Peng, Yifan; Kashiwagi, Yosuke; Tsunoo, Emiru; Yan, Brian; Watanabe, Shinji
Format:	Article
Language:	eng
Subjects:	Computer Science - Computation and Language; Computer Science - Sound
Online Access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Arora, Siddhant; Futami, Hayato; Wu, Shih-Lun; Huynh, Jessica; Peng, Yifan; Kashiwagi, Yosuke; Tsunoo, Emiru; Yan, Brian; Watanabe, Shinji
description	Recently there have been efforts to introduce new benchmark tasks for spoken language understanding (SLU), like semantic parsing. In this paper, we describe our proposed spoken semantic parsing system for the quality track (Track 1) in Spoken Language Understanding Grand Challenge which is part of ICASSP Signal Processing Grand Challenge 2023. We experiment with both end-to-end and pipeline systems for this task. Strong automatic speech recognition (ASR) models like Whisper and pretrained Language models (LM) like BART are utilized inside our SLU framework to boost performance. We also investigate the output level combination of various models to get an exact match accuracy of 80.8, which won the 1st place at the challenge.
doi_str_mv	10.48550/arxiv.2305.01620
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2305_01620</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2305_01620</sourcerecordid><originalsourceid>FETCH-LOGICAL-a670-8d8238b2f41493cf01b895911090b16400ea4b0b96bcc041270f0999b434915c3</originalsourceid><addsrcrecordid>eNotz0FPgzAYxvFePJjpB_Dk-wXAt6UwelwI6pIlw4Bn0kJhjVBI6VS-vbp5evK_PMmPkAeKIU_jGJ-k-zafIYswDpEmDG-J30Hpz-0KkwV_0rC3XvdOevPbUweFmfVgrAZpW8hZDuXhHZZ18XpcoJsclPP0oS2UepTWmwYK6RZje_DTl3QtlNWxgLezHIxfITvJYdC213fkppPDou__d0Oq57zKXoPD8WWf7Q6BTLYYpG3KolSxjlMuoqZDqlIRC0pRoKIJR9SSK1QiUU2DnLItdiiEUDzigsZNtCGP19sLu56dGaVb6z9-feFHP7keU8U</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge</title><source>arXiv.org</source><creator>Arora, Siddhant ; Futami, Hayato ; Wu, Shih-Lun ; Huynh, Jessica ; Peng, Yifan ; Kashiwagi, Yosuke ; Tsunoo, Emiru ; Yan, Brian ; Watanabe, Shinji</creator><creatorcontrib>Arora, Siddhant ; Futami, Hayato ; Wu, Shih-Lun ; Huynh, Jessica ; Peng, Yifan ; Kashiwagi, Yosuke ; Tsunoo, Emiru ; Yan, Brian ; Watanabe, Shinji</creatorcontrib><description>Recently there have been efforts to introduce new benchmark tasks for spoken language understanding (SLU), like semantic parsing. In this paper, we describe our proposed spoken semantic parsing system for the quality track (Track 1) in Spoken Language Understanding Grand Challenge which is part of ICASSP Signal Processing Grand Challenge 2023. We experiment with both end-to-end and pipeline systems for this task. Strong automatic speech recognition (ASR) models like Whisper and pretrained Language models (LM) like BART are utilized inside our SLU framework to boost performance. 
We also investigate the output level combination of various models to get an exact match accuracy of 80.8, which won the 1st place at the challenge.</description><identifier>DOI: 10.48550/arxiv.2305.01620</identifier><language>eng</language><subject>Computer Science - Computation and Language ; Computer Science - Sound</subject><creationdate>2023-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,776,881</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2305.01620$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2305.01620$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Arora, Siddhant</creatorcontrib><creatorcontrib>Futami, Hayato</creatorcontrib><creatorcontrib>Wu, Shih-Lun</creatorcontrib><creatorcontrib>Huynh, Jessica</creatorcontrib><creatorcontrib>Peng, Yifan</creatorcontrib><creatorcontrib>Kashiwagi, Yosuke</creatorcontrib><creatorcontrib>Tsunoo, Emiru</creatorcontrib><creatorcontrib>Yan, Brian</creatorcontrib><creatorcontrib>Watanabe, Shinji</creatorcontrib><title>A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge</title><description>Recently there have been efforts to introduce new benchmark tasks for spoken language understanding (SLU), like semantic parsing. In this paper, we describe our proposed spoken semantic parsing system for the quality track (Track 1) in Spoken Language Understanding Grand Challenge which is part of ICASSP Signal Processing Grand Challenge 2023. We experiment with both end-to-end and pipeline systems for this task. 
Strong automatic speech recognition (ASR) models like Whisper and pretrained Language models (LM) like BART are utilized inside our SLU framework to boost performance. We also investigate the output level combination of various models to get an exact match accuracy of 80.8, which won the 1st place at the challenge.</description><subject>Computer Science - Computation and Language</subject><subject>Computer Science - Sound</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz0FPgzAYxvFePJjpB_Dk-wXAt6UwelwI6pIlw4Bn0kJhjVBI6VS-vbp5evK_PMmPkAeKIU_jGJ-k-zafIYswDpEmDG-J30Hpz-0KkwV_0rC3XvdOevPbUweFmfVgrAZpW8hZDuXhHZZ18XpcoJsclPP0oS2UepTWmwYK6RZje_DTl3QtlNWxgLezHIxfITvJYdC213fkppPDou__d0Oq57zKXoPD8WWf7Q6BTLYYpG3KolSxjlMuoqZDqlIRC0pRoKIJR9SSK1QiUU2DnLItdiiEUDzigsZNtCGP19sLu56dGaVb6z9-feFHP7keU8U</recordid><startdate>20230502</startdate><enddate>20230502</enddate><creator>Arora, Siddhant</creator><creator>Futami, Hayato</creator><creator>Wu, Shih-Lun</creator><creator>Huynh, Jessica</creator><creator>Peng, Yifan</creator><creator>Kashiwagi, Yosuke</creator><creator>Tsunoo, Emiru</creator><creator>Yan, Brian</creator><creator>Watanabe, Shinji</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20230502</creationdate><title>A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge</title><author>Arora, Siddhant ; Futami, Hayato ; Wu, Shih-Lun ; Huynh, Jessica ; Peng, Yifan ; Kashiwagi, Yosuke ; Tsunoo, Emiru ; Yan, Brian ; Watanabe, Shinji</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a670-8d8238b2f41493cf01b895911090b16400ea4b0b96bcc041270f0999b434915c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Computer Science - Computation and Language</topic><topic>Computer 
Science - Sound</topic><toplevel>online_resources</toplevel><creatorcontrib>Arora, Siddhant</creatorcontrib><creatorcontrib>Futami, Hayato</creatorcontrib><creatorcontrib>Wu, Shih-Lun</creatorcontrib><creatorcontrib>Huynh, Jessica</creatorcontrib><creatorcontrib>Peng, Yifan</creatorcontrib><creatorcontrib>Kashiwagi, Yosuke</creatorcontrib><creatorcontrib>Tsunoo, Emiru</creatorcontrib><creatorcontrib>Yan, Brian</creatorcontrib><creatorcontrib>Watanabe, Shinji</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Arora, Siddhant</au><au>Futami, Hayato</au><au>Wu, Shih-Lun</au><au>Huynh, Jessica</au><au>Peng, Yifan</au><au>Kashiwagi, Yosuke</au><au>Tsunoo, Emiru</au><au>Yan, Brian</au><au>Watanabe, Shinji</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge</atitle><date>2023-05-02</date><risdate>2023</risdate><abstract>Recently there have been efforts to introduce new benchmark tasks for spoken language understanding (SLU), like semantic parsing. In this paper, we describe our proposed spoken semantic parsing system for the quality track (Track 1) in Spoken Language Understanding Grand Challenge which is part of ICASSP Signal Processing Grand Challenge 2023. We experiment with both end-to-end and pipeline systems for this task. Strong automatic speech recognition (ASR) models like Whisper and pretrained Language models (LM) like BART are utilized inside our SLU framework to boost performance. We also investigate the output level combination of various models to get an exact match accuracy of 80.8, which won the 1st place at the challenge.</abstract><doi>10.48550/arxiv.2305.01620</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2305.01620
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2305_01620
source	arXiv.org
subjects	Computer Science - Computation and Language; Computer Science - Sound
title	A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T06%3A17%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20Study%20on%20the%20Integration%20of%20Pipeline%20and%20E2E%20SLU%20systems%20for%20Spoken%20Semantic%20Parsing%20toward%20STOP%20Quality%20Challenge&rft.au=Arora,%20Siddhant&rft.date=2023-05-02&rft_id=info:doi/10.48550/arxiv.2305.01620&rft_dat=%3Carxiv_GOX%3E2305_01620%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true