Using automated syllable counting to detect missing information in speech transcripts from clinical settings

•There are significant problems with ‘off the shelf’ automatic speech transcription systems in real world settings such that they can misrepresent what and how much was actually spoken.•An automated syllable counter can more accurately represent the true number of words spoken in real world settings...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Psychiatry research 2022-09, Vol.315, p.114712-114712, Article 114712
Hauptverfasser:	Diaz-Asper, Marama, Holmlund, Terje B., Chandler, Chelsea, Diaz-Asper, Catherine, Foltz, Peter W., Cohen, Alex S., Elvevåg, Brita
Format:	Artikel
Sprache:	eng
Schlagworte:	Automatic speech recognition Syllables Word error rate
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	114712
container_issue
container_start_page	114712
container_title	Psychiatry research
container_volume	315
creator	Diaz-Asper, Marama Holmlund, Terje B. Chandler, Chelsea Diaz-Asper, Catherine Foltz, Peter W. Cohen, Alex S. Elvevåg, Brita
description	•There are significant problems with ‘off the shelf’ automatic speech transcription systems in real world settings such that they can misrepresent what and how much was actually spoken.•An automated syllable counter can more accurately represent the true number of words spoken in real world settings, and the proportion of missing syllables in transcripts strongly correlates with the automatic speech recognition word error rate.•With low quality recordings and challenging speaker characteristics the missing syllables can be used to estimate the quality of transcripts derived with automatic speech recognition systems.•This solution can be easily implemented as an automated fail-safe method. Speech rate and quantity reflect clinical state; thus automated transcription holds potential clinical applications. We describe two datasets where recording quality and speaker characteristics affected transcription accuracy. Transcripts of low-quality recordings omitted significant portions of speech. An automated syllable counter estimated actual speech output and quantified the amount of missing information. The efficacy of this method differed by audio quality: the correlation between missing syllables and word error rate was only significant when quality was low. Automatically counting syllables could be useful to measure and flag transcription omissions in clinical contexts where speaker characteristics and recording quality are problematic.
doi_str_mv	10.1016/j.psychres.2022.114712
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9378537</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0165178122003079</els_id><sourcerecordid>2691049506</sourcerecordid><originalsourceid>FETCH-LOGICAL-c472t-1bde29e9b4dfd987f1ca025ef02d267da501b623108ac2aba67dbf2bb51e9cc93</originalsourceid><addsrcrecordid>eNqFkU9v1DAQxS0EokvhK4CPXLLYziaOLwhUAa1UiQs9W_4z6XqV2MHjVNpvXy_bInHi5JHnzc_P8wh5z9mWM95_OmwXPLp9BtwKJsSW853k4gXZ8EGKppbtS7Kpwq7hcuAX5A3igTEmuFKvyUXbDa3q22FDpjsM8Z6ataTZFPAUj9Nk7ATUpTWWU68k6qGAK3QO-Ecd4phylYcUa01xAXB7WrKJ6HJYCtIxp5m6KcTgzEQRyomEb8mr0UwI757OS3L3_duvq-vm9uePm6uvt43bSVEabj0IBcru_OjVIEfuDBMdjEx40UtvOsZtL1rOBuOEsabe2VFY23FQzqn2knw-c5fVzuAdxOpt0ksOs8lHnUzQ_3Zi2Ov79KBVK4eulRXw4Qyo38FqXceUjeaMtVIL2StWFR-fnsjp9wpYdF2Og7q7CGlFLXrF2U51rK_S_hmWEDOMf41wpk9h6oN-DlOfwtTnMOvgl_Mg1F09BMgaXYDowIdc49A-hf8hHgGJ2q3O</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2691049506</pqid></control><display><type>article</type><title>Using automated syllable counting to detect missing information in speech transcripts from clinical settings</title><source>NORA - Norwegian Open Research Archives</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Diaz-Asper, Marama ; Holmlund, Terje B. ; Chandler, Chelsea ; Diaz-Asper, Catherine ; Foltz, Peter W. ; Cohen, Alex S. ; Elvevåg, Brita</creator><creatorcontrib>Diaz-Asper, Marama ; Holmlund, Terje B. ; Chandler, Chelsea ; Diaz-Asper, Catherine ; Foltz, Peter W. ; Cohen, Alex S. ; Elvevåg, Brita</creatorcontrib><description>•There are significant problems with ‘off the shelf’ automatic speech transcription systems in real world settings such that they can misrepresent what and how much was actually spoken.•An automated syllable counter can more accurately represent the true number of words spoken in real world settings, and the proportion of missing syllables in transcripts strongly correlates with the automatic speech recognition word error rate.•With low quality recordings and challenging speaker characteristics the missing syllables can be used to estimate the quality of transcripts derived with automatic speech recognition systems.•This solution can be easily implemented as an automated fail-safe method. Speech rate and quantity reflect clinical state; thus automated transcription holds potential clinical applications. We describe two datasets where recording quality and speaker characteristics affected transcription accuracy. Transcripts of low-quality recordings omitted significant portions of speech. An automated syllable counter estimated actual speech output and quantified the amount of missing information. The efficacy of this method differed by audio quality: the correlation between missing syllables and word error rate was only significant when quality was low. Automatically counting syllables could be useful to measure and flag transcription omissions in clinical contexts where speaker characteristics and recording quality are problematic.</description><identifier>ISSN: 0165-1781</identifier><identifier>ISSN: 1872-7123</identifier><identifier>EISSN: 1872-7123</identifier><identifier>DOI: 10.1016/j.psychres.2022.114712</identifier><identifier>PMID: 35839638</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Automatic speech recognition ; Syllables ; Word error rate</subject><ispartof>Psychiatry research, 2022-09, Vol.315, p.114712-114712, Article 114712</ispartof><rights>2022 The Authors</rights><rights>info:eu-repo/semantics/openAccess</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c472t-1bde29e9b4dfd987f1ca025ef02d267da501b623108ac2aba67dbf2bb51e9cc93</citedby><cites>FETCH-LOGICAL-c472t-1bde29e9b4dfd987f1ca025ef02d267da501b623108ac2aba67dbf2bb51e9cc93</cites><orcidid>0000-0003-0441-3729</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0165178122003079$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>230,314,776,780,881,3537,26544,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Diaz-Asper, Marama</creatorcontrib><creatorcontrib>Holmlund, Terje B.</creatorcontrib><creatorcontrib>Chandler, Chelsea</creatorcontrib><creatorcontrib>Diaz-Asper, Catherine</creatorcontrib><creatorcontrib>Foltz, Peter W.</creatorcontrib><creatorcontrib>Cohen, Alex S.</creatorcontrib><creatorcontrib>Elvevåg, Brita</creatorcontrib><title>Using automated syllable counting to detect missing information in speech transcripts from clinical settings</title><title>Psychiatry research</title><description>•There are significant problems with ‘off the shelf’ automatic speech transcription systems in real world settings such that they can misrepresent what and how much was actually spoken.•An automated syllable counter can more accurately represent the true number of words spoken in real world settings, and the proportion of missing syllables in transcripts strongly correlates with the automatic speech recognition word error rate.•With low quality recordings and challenging speaker characteristics the missing syllables can be used to estimate the quality of transcripts derived with automatic speech recognition systems.•This solution can be easily implemented as an automated fail-safe method. Speech rate and quantity reflect clinical state; thus automated transcription holds potential clinical applications. We describe two datasets where recording quality and speaker characteristics affected transcription accuracy. Transcripts of low-quality recordings omitted significant portions of speech. An automated syllable counter estimated actual speech output and quantified the amount of missing information. The efficacy of this method differed by audio quality: the correlation between missing syllables and word error rate was only significant when quality was low. Automatically counting syllables could be useful to measure and flag transcription omissions in clinical contexts where speaker characteristics and recording quality are problematic.</description><subject>Automatic speech recognition</subject><subject>Syllables</subject><subject>Word error rate</subject><issn>0165-1781</issn><issn>1872-7123</issn><issn>1872-7123</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>3HK</sourceid><recordid>eNqFkU9v1DAQxS0EokvhK4CPXLLYziaOLwhUAa1UiQs9W_4z6XqV2MHjVNpvXy_bInHi5JHnzc_P8wh5z9mWM95_OmwXPLp9BtwKJsSW853k4gXZ8EGKppbtS7Kpwq7hcuAX5A3igTEmuFKvyUXbDa3q22FDpjsM8Z6ataTZFPAUj9Nk7ATUpTWWU68k6qGAK3QO-Ecd4phylYcUa01xAXB7WrKJ6HJYCtIxp5m6KcTgzEQRyomEb8mr0UwI757OS3L3_duvq-vm9uePm6uvt43bSVEabj0IBcru_OjVIEfuDBMdjEx40UtvOsZtL1rOBuOEsabe2VFY23FQzqn2knw-c5fVzuAdxOpt0ksOs8lHnUzQ_3Zi2Ov79KBVK4eulRXw4Qyo38FqXceUjeaMtVIL2StWFR-fnsjp9wpYdF2Og7q7CGlFLXrF2U51rK_S_hmWEDOMf41wpk9h6oN-DlOfwtTnMOvgl_Mg1F09BMgaXYDowIdc49A-hf8hHgGJ2q3O</recordid><startdate>20220901</startdate><enddate>20220901</enddate><creator>Diaz-Asper, Marama</creator><creator>Holmlund, Terje B.</creator><creator>Chandler, Chelsea</creator><creator>Diaz-Asper, Catherine</creator><creator>Foltz, Peter W.</creator><creator>Cohen, Alex S.</creator><creator>Elvevåg, Brita</creator><general>Elsevier B.V</general><scope>6I.</scope><scope>AAFTH</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>3HK</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0003-0441-3729</orcidid></search><sort><creationdate>20220901</creationdate><title>Using automated syllable counting to detect missing information in speech transcripts from clinical settings</title><author>Diaz-Asper, Marama ; Holmlund, Terje B. ; Chandler, Chelsea ; Diaz-Asper, Catherine ; Foltz, Peter W. ; Cohen, Alex S. ; Elvevåg, Brita</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c472t-1bde29e9b4dfd987f1ca025ef02d267da501b623108ac2aba67dbf2bb51e9cc93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Automatic speech recognition</topic><topic>Syllables</topic><topic>Word error rate</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Diaz-Asper, Marama</creatorcontrib><creatorcontrib>Holmlund, Terje B.</creatorcontrib><creatorcontrib>Chandler, Chelsea</creatorcontrib><creatorcontrib>Diaz-Asper, Catherine</creatorcontrib><creatorcontrib>Foltz, Peter W.</creatorcontrib><creatorcontrib>Cohen, Alex S.</creatorcontrib><creatorcontrib>Elvevåg, Brita</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>NORA - Norwegian Open Research Archives</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Psychiatry research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Diaz-Asper, Marama</au><au>Holmlund, Terje B.</au><au>Chandler, Chelsea</au><au>Diaz-Asper, Catherine</au><au>Foltz, Peter W.</au><au>Cohen, Alex S.</au><au>Elvevåg, Brita</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Using automated syllable counting to detect missing information in speech transcripts from clinical settings</atitle><jtitle>Psychiatry research</jtitle><date>2022-09-01</date><risdate>2022</risdate><volume>315</volume><spage>114712</spage><epage>114712</epage><pages>114712-114712</pages><artnum>114712</artnum><issn>0165-1781</issn><issn>1872-7123</issn><eissn>1872-7123</eissn><abstract>•There are significant problems with ‘off the shelf’ automatic speech transcription systems in real world settings such that they can misrepresent what and how much was actually spoken.•An automated syllable counter can more accurately represent the true number of words spoken in real world settings, and the proportion of missing syllables in transcripts strongly correlates with the automatic speech recognition word error rate.•With low quality recordings and challenging speaker characteristics the missing syllables can be used to estimate the quality of transcripts derived with automatic speech recognition systems.•This solution can be easily implemented as an automated fail-safe method. Speech rate and quantity reflect clinical state; thus automated transcription holds potential clinical applications. We describe two datasets where recording quality and speaker characteristics affected transcription accuracy. Transcripts of low-quality recordings omitted significant portions of speech. An automated syllable counter estimated actual speech output and quantified the amount of missing information. The efficacy of this method differed by audio quality: the correlation between missing syllables and word error rate was only significant when quality was low. Automatically counting syllables could be useful to measure and flag transcription omissions in clinical contexts where speaker characteristics and recording quality are problematic.</abstract><pub>Elsevier B.V</pub><pmid>35839638</pmid><doi>10.1016/j.psychres.2022.114712</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0003-0441-3729</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0165-1781
ispartof	Psychiatry research, 2022-09, Vol.315, p.114712-114712, Article 114712
issn	0165-1781 1872-7123 1872-7123
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_9378537
source	NORA - Norwegian Open Research Archives; Elsevier ScienceDirect Journals Complete
subjects	Automatic speech recognition Syllables Word error rate
title	Using automated syllable counting to detect missing information in speech transcripts from clinical settings
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T18%3A39%3A25IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20automated%20syllable%20counting%20to%20detect%20missing%20information%20in%20speech%20transcripts%20from%20clinical%20settings&rft.jtitle=Psychiatry%20research&rft.au=Diaz-Asper,%20Marama&rft.date=2022-09-01&rft.volume=315&rft.spage=114712&rft.epage=114712&rft.pages=114712-114712&rft.artnum=114712&rft.issn=0165-1781&rft.eissn=1872-7123&rft_id=info:doi/10.1016/j.psychres.2022.114712&rft_dat=%3Cproquest_pubme%3E2691049506%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2691049506&rft_id=info:pmid/35839638&rft_els_id=S0165178122003079&rfr_iscdi=true