Rhythm measures and dimensions of durational variation in speech

Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into cons...

Bibliographic details
Published in: The Journal of the Acoustical Society of America 2011-05, Vol.129 (5), p.3258-3270
Main authors: Loukina, Anastassia; Kochanski, Greg; Rosner, Burton; Keane, Elinor; Shih, Chilin
Format: Article
Language: eng
Online access: Full text
container_end_page 3270
container_issue 5
container_start_page 3258
container_title The Journal of the Acoustical Society of America
container_volume 129
creator Loukina, Anastassia
Kochanski, Greg
Rosner, Burton
Keane, Elinor
Shih, Chilin
description Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into consonantal and vocalic regions. The resulting duration measurements rest strictly on acoustic criteria. Machine classification showed that rhythm measures could separate languages at rates above chance. Within-language variability in rhythm measures, however, was large and comparable to that between languages. Therefore, different languages could not be identified reliably from single paragraphs. In experiments separating pairs of languages, a rhythm measure that was relatively successful at separating one pair often performed very poorly on another pair: there was no broadly successful rhythm measure. Separation of all five languages at once required a combination of three rhythm measures. Many triplets were about equally effective, but the confusion patterns between languages varied with the choice of rhythm measures.
doi_str_mv 10.1121/1.3559709
format Article
coden JASMAN
eissn 1520-8524
pmid 21568427
publisher Melville, NY: Acoustical Society of America
rights 2011 Acoustical Society of America; 2015 INIST-CNRS
tpages 13
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2011-05, Vol.129 (5), p.3258-3270
issn 0001-4966
1520-8524
language eng
recordid cdi_proquest_miscellaneous_867321363
source MEDLINE; AIP Journals Complete; Alma/SFX Local Collection; AIP Acoustical Society of America
subjects Adult
Algorithms
Automation
Biological and medical sciences
China - ethnology
England
Female
France - ethnology
Fundamental and applied biological sciences. Psychology
Greece - ethnology
Humans
Language
Male
Pattern Recognition, Physiological - physiology
Production and perception of spoken language
Psychology. Psychoanalysis. Psychiatry
Psychology. Psychophysiology
Russia - ethnology
Speech Acoustics
Speech Recognition Software
Time Factors
Young Adult
title Rhythm measures and dimensions of durational variation in speech
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T11%3A19%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Rhythm%20measures%20and%20dimensions%20of%20durational%20variation%20in%20speech&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Loukina,%20Anastassia&rft.date=2011-05-01&rft.volume=129&rft.issue=5&rft.spage=3258&rft.epage=3270&rft.pages=3258-3270&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.3559709&rft_dat=%3Cproquest_cross%3E867321363%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1010699909&rft_id=info:pmid/21568427&rfr_iscdi=true