Rhythm measures and dimensions of durational variation in speech
Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into cons...
Gespeichert in:
Veröffentlicht in: | The Journal of the Acoustical Society of America 2011-05, Vol.129 (5), p.3258-3270 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 3270 |
---|---|
container_issue | 5 |
container_start_page | 3258 |
container_title | The Journal of the Acoustical Society of America |
container_volume | 129 |
creator | Loukina, Anastassia Kochanski, Greg Rosner, Burton Keane, Elinor Shih, Chilin |
description | Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into consonantal and vocalic regions. The resulting duration measurements rest strictly on acoustic criteria. Machine classification showed that rhythm measures could separate languages at rates above chance. Within-language variability in rhythm measures, however, was large and comparable to that between languages. Therefore, different languages could not be identified reliably from single paragraphs. In experiments separating pairs of languages, a rhythm measure that was relatively successful at separating one pair often performed very poorly on another pair: there was no broadly successful rhythm measure. Separation of all five languages at once required a combination of three rhythm measures. Many triplets were about equally effective, but the confusion patterns between languages varied with the choice of rhythm measures. |
doi_str_mv | 10.1121/1.3559709 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_867321363</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>867321363</sourcerecordid><originalsourceid>FETCH-LOGICAL-c467t-33542267af5ad2ae3f97a93829c5813d4aa5cc2a5ff415c167ef9d503783ac053</originalsourceid><addsrcrecordid>eNp90ctKxDAUBuAgio6XhS8g3Yi6qOYkzW0hKIM3GBBE1-WYJkyllzFphXl763TUla6SAx8n5P8JOQR6DsDgAs65EEZRs0EmIBhNtWDZJplQSiHNjJQ7ZDfGt2EUmpttssNASJ0xNSFXT_NlN6-T2mHsg4sJNkVSlLVrYtk2MWl9UvQBu2HAKvnAUK7uSdkkceGcne-TLY9VdAfrc4-83N48T-_T2ePdw_R6ltpMqi7lXGSMSYVeYMHQcW8UGq6ZsUIDLzJEYS1D4X0GwoJUzptCUK40R0sF3yMn495FaN97F7u8LqN1VYWNa_uYa6k4Ay75IE__lUCBSmMMNQM9G6kNbYzB-XwRyhrDckD5V7Q55OtoB3u0Xtu_1q74kd9ZDuB4DTBarHzAxpbx12WgQemvn1yOLtqyW6X596tjO_l3O_nQDv8ExeuUEQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1010699909</pqid></control><display><type>article</type><title>Rhythm measures and dimensions of durational variation in speech</title><source>MEDLINE</source><source>AIP Journals Complete</source><source>Alma/SFX Local Collection</source><source>AIP Acoustical Society of America</source><creator>Loukina, Anastassia ; Kochanski, Greg ; Rosner, Burton ; Keane, Elinor ; Shih, Chilin</creator><creatorcontrib>Loukina, Anastassia ; Kochanski, Greg ; Rosner, Burton ; Keane, Elinor ; Shih, Chilin</creatorcontrib><description>Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into consonantal and vocalic regions. The resulting duration measurements rest strictly on acoustic criteria. Machine classification showed that rhythm measures could separate languages at rates above chance. Within-language variability in rhythm measures, however, was large and comparable to that between languages. Therefore, different languages could not be identified reliably from single paragraphs. In experiments separating pairs of languages, a rhythm measure that was relatively successful at separating one pair often performed very poorly on another pair: there was no broadly successful rhythm measure. Separation of all five languages at once required a combination of three rhythm measures. Many triplets were about equally effective, but the confusion patterns between languages varied with the choice of rhythm measures.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.3559709</identifier><identifier>PMID: 21568427</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><publisher>Melville, NY: Acoustical Society of America</publisher><subject>Adult ; Algorithms ; Automation ; Biological and medical sciences ; China - ethnology ; England ; Female ; France - ethnology ; Fundamental and applied biological sciences. Psychology ; Greece - ethnology ; Humans ; Language ; Male ; Pattern Recognition, Physiological - physiology ; Production and perception of spoken language ; Psychology. Psychoanalysis. Psychiatry ; Psychology. Psychophysiology ; Russia - ethnology ; Speech Acoustics ; Speech Recognition Software ; Time Factors ; Young Adult</subject><ispartof>The Journal of the Acoustical Society of America, 2011-05, Vol.129 (5), p.3258-3270</ispartof><rights>2011 Acoustical Society of America</rights><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c467t-33542267af5ad2ae3f97a93829c5813d4aa5cc2a5ff415c167ef9d503783ac053</citedby><cites>FETCH-LOGICAL-c467t-33542267af5ad2ae3f97a93829c5813d4aa5cc2a5ff415c167ef9d503783ac053</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/jasa/article-lookup/doi/10.1121/1.3559709$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>207,208,314,776,780,790,1559,4498,27901,27902,76126</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=24181785$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/21568427$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Loukina, Anastassia</creatorcontrib><creatorcontrib>Kochanski, Greg</creatorcontrib><creatorcontrib>Rosner, Burton</creatorcontrib><creatorcontrib>Keane, Elinor</creatorcontrib><creatorcontrib>Shih, Chilin</creatorcontrib><title>Rhythm measures and dimensions of durational variation in speech</title><title>The Journal of the Acoustical Society of America</title><addtitle>J Acoust Soc Am</addtitle><description>Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into consonantal and vocalic regions. The resulting duration measurements rest strictly on acoustic criteria. Machine classification showed that rhythm measures could separate languages at rates above chance. Within-language variability in rhythm measures, however, was large and comparable to that between languages. Therefore, different languages could not be identified reliably from single paragraphs. In experiments separating pairs of languages, a rhythm measure that was relatively successful at separating one pair often performed very poorly on another pair: there was no broadly successful rhythm measure. Separation of all five languages at once required a combination of three rhythm measures. Many triplets were about equally effective, but the confusion patterns between languages varied with the choice of rhythm measures.</description><subject>Adult</subject><subject>Algorithms</subject><subject>Automation</subject><subject>Biological and medical sciences</subject><subject>China - ethnology</subject><subject>England</subject><subject>Female</subject><subject>France - ethnology</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Greece - ethnology</subject><subject>Humans</subject><subject>Language</subject><subject>Male</subject><subject>Pattern Recognition, Physiological - physiology</subject><subject>Production and perception of spoken language</subject><subject>Psychology. Psychoanalysis. Psychiatry</subject><subject>Psychology. Psychophysiology</subject><subject>Russia - ethnology</subject><subject>Speech Acoustics</subject><subject>Speech Recognition Software</subject><subject>Time Factors</subject><subject>Young Adult</subject><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp90ctKxDAUBuAgio6XhS8g3Yi6qOYkzW0hKIM3GBBE1-WYJkyllzFphXl763TUla6SAx8n5P8JOQR6DsDgAs65EEZRs0EmIBhNtWDZJplQSiHNjJQ7ZDfGt2EUmpttssNASJ0xNSFXT_NlN6-T2mHsg4sJNkVSlLVrYtk2MWl9UvQBu2HAKvnAUK7uSdkkceGcne-TLY9VdAfrc4-83N48T-_T2ePdw_R6ltpMqi7lXGSMSYVeYMHQcW8UGq6ZsUIDLzJEYS1D4X0GwoJUzptCUK40R0sF3yMn495FaN97F7u8LqN1VYWNa_uYa6k4Ay75IE__lUCBSmMMNQM9G6kNbYzB-XwRyhrDckD5V7Q55OtoB3u0Xtu_1q74kd9ZDuB4DTBarHzAxpbx12WgQemvn1yOLtqyW6X596tjO_l3O_nQDv8ExeuUEQ</recordid><startdate>20110501</startdate><enddate>20110501</enddate><creator>Loukina, Anastassia</creator><creator>Kochanski, Greg</creator><creator>Rosner, Burton</creator><creator>Keane, Elinor</creator><creator>Shih, Chilin</creator><general>Acoustical Society of America</general><general>American Institute of Physics</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope><scope>7X8</scope></search><sort><creationdate>20110501</creationdate><title>Rhythm measures and dimensions of durational variation in speech</title><author>Loukina, Anastassia ; Kochanski, Greg ; Rosner, Burton ; Keane, Elinor ; Shih, Chilin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c467t-33542267af5ad2ae3f97a93829c5813d4aa5cc2a5ff415c167ef9d503783ac053</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Adult</topic><topic>Algorithms</topic><topic>Automation</topic><topic>Biological and medical sciences</topic><topic>China - ethnology</topic><topic>England</topic><topic>Female</topic><topic>France - ethnology</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Greece - ethnology</topic><topic>Humans</topic><topic>Language</topic><topic>Male</topic><topic>Pattern Recognition, Physiological - physiology</topic><topic>Production and perception of spoken language</topic><topic>Psychology. Psychoanalysis. Psychiatry</topic><topic>Psychology. Psychophysiology</topic><topic>Russia - ethnology</topic><topic>Speech Acoustics</topic><topic>Speech Recognition Software</topic><topic>Time Factors</topic><topic>Young Adult</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Loukina, Anastassia</creatorcontrib><creatorcontrib>Kochanski, Greg</creatorcontrib><creatorcontrib>Rosner, Burton</creatorcontrib><creatorcontrib>Keane, Elinor</creatorcontrib><creatorcontrib>Shih, Chilin</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>MEDLINE - Academic</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Loukina, Anastassia</au><au>Kochanski, Greg</au><au>Rosner, Burton</au><au>Keane, Elinor</au><au>Shih, Chilin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Rhythm measures and dimensions of durational variation in speech</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><addtitle>J Acoust Soc Am</addtitle><date>2011-05-01</date><risdate>2011</risdate><volume>129</volume><issue>5</issue><spage>3258</spage><epage>3270</epage><pages>3258-3270</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>Patterns of durational variation were examined by applying 15 previously published rhythm measures to a large corpus of speech from five languages. In order to achieve consistent segmentation across all languages, an automatic speech recognition system was developed to divide the waveforms into consonantal and vocalic regions. The resulting duration measurements rest strictly on acoustic criteria. Machine classification showed that rhythm measures could separate languages at rates above chance. Within-language variability in rhythm measures, however, was large and comparable to that between languages. Therefore, different languages could not be identified reliably from single paragraphs. In experiments separating pairs of languages, a rhythm measure that was relatively successful at separating one pair often performed very poorly on another pair: there was no broadly successful rhythm measure. Separation of all five languages at once required a combination of three rhythm measures. Many triplets were about equally effective, but the confusion patterns between languages varied with the choice of rhythm measures.</abstract><cop>Melville, NY</cop><pub>Acoustical Society of America</pub><pmid>21568427</pmid><doi>10.1121/1.3559709</doi><tpages>13</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0001-4966 |
ispartof | The Journal of the Acoustical Society of America, 2011-05, Vol.129 (5), p.3258-3270 |
issn | 0001-4966 1520-8524 |
language | eng |
recordid | cdi_proquest_miscellaneous_867321363 |
source | MEDLINE; AIP Journals Complete; Alma/SFX Local Collection; AIP Acoustical Society of America |
subjects | Adult Algorithms Automation Biological and medical sciences China - ethnology England Female France - ethnology Fundamental and applied biological sciences. Psychology Greece - ethnology Humans Language Male Pattern Recognition, Physiological - physiology Production and perception of spoken language Psychology. Psychoanalysis. Psychiatry Psychology. Psychophysiology Russia - ethnology Speech Acoustics Speech Recognition Software Time Factors Young Adult |
title | Rhythm measures and dimensions of durational variation in speech |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T11%3A19%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Rhythm%20measures%20and%20dimensions%20of%20durational%20variation%20in%20speech&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Loukina,%20Anastassia&rft.date=2011-05-01&rft.volume=129&rft.issue=5&rft.spage=3258&rft.epage=3270&rft.pages=3258-3270&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.3559709&rft_dat=%3Cproquest_cross%3E867321363%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1010699909&rft_id=info:pmid/21568427&rfr_iscdi=true |