Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis

Acoustic variation in expressive speech at the syllable level is studied. As emotions or attitudes can be conveyed by short spoken words, analysis of paradigmatic variations in vowels is an important issue to characterize the expressive content of such speech segments. The corpus contains 160 senten...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Journal of the Acoustical Society of America 2018-01, Vol.143 (1), p.109-122
Hauptverfasser: Rilliard, Albert, d'Alessandro, Christophe, Evrard, Marc
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 122
container_issue 1
container_start_page 109
container_title The Journal of the Acoustical Society of America
container_volume 143
creator Rilliard, Albert
d'Alessandro, Christophe
Evrard, Marc
description Acoustic variation in expressive speech at the syllable level is studied. As emotions or attitudes can be conveyed by short spoken words, analysis of paradigmatic variations in vowels is an important issue to characterize the expressive content of such speech segments. The corpus contains 160 sentences produced under seven expressive conditions (Neutral, Anger, Fear, Surprise, Sensuality, Joy, Sadness) acted by a French female speaker (a total of 1120 sentences, 13 140 vowels). Eleven base acoustic parameters are selected for voice source and vocal tract related feature analysis. An acoustic description of the expressions is drawn, using the dimensions of melodic range, intensity, noise, spectral tilt, vocalic space, and dynamic features. The first three functions of a discriminant analysis explain 95% of the variance in the data. These statistical dimensions are consistently associated with acoustic dimensions. Covariation of intensity and F0 explains over 80% of the variance, followed by noise features (8%), covariation of spectral tilt, and F0 (7%). On the basis of isolated vowels alone, expressions are classified with a mean accuracy of 78%.
doi_str_mv 10.1121/1.5018433
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmed_primary_29390730</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1993998512</sourcerecordid><originalsourceid>FETCH-LOGICAL-c424t-ea347bd8a27897273e3898e57b8c7303e4f570174b0a5404882032765eadfe793</originalsourceid><addsrcrecordid>eNp9kUtPWzEQha2qqKS0C_5AdZel0gU_Y7u7CEFBigSLdm059lxwdV_1JBf49zgkhVW7mjnWN0fjM4QcM3rKGGdn7FRRZqQQ78iMKU5ro7h8T2aUUlZLO58fko-Iv4tURtgP5JBbYakWdEbirc8-prvOr1OoJp9TaYa-GppqGh6gxSr1FTyOGRDTBBWOAOH-e7UIwwa3IxEw5DS-DPk-VjF10GNRvi3at0-Y8BM5aHyL8Hlfj8ivy4uf51f18ubH9fliWQfJ5boGL6ReReO5NlZzLUAYa0DplQllWQGyUZoyLVfUK0mlMZwKrucKfGxAW3FETna-9751Y06dz09u8MldLZZu-0aZZVJaPbHCft2xYx7-bADXrksYoG19D-VrjtmSkTWK8TfbkAfEDM2rN6NuewDH3P4Ahf2yt92sOoiv5N_EC_BtB2BI65es_-v2T3ga8hvoxtiIZ-4tm10</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1993998512</pqid></control><display><type>article</type><title>Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis</title><source>AIP Journals Complete</source><source>Alma/SFX Local Collection</source><source>AIP Acoustical Society of America</source><creator>Rilliard, Albert ; d'Alessandro, Christophe ; Evrard, Marc</creator><creatorcontrib>Rilliard, Albert ; d'Alessandro, Christophe ; Evrard, Marc</creatorcontrib><description>Acoustic variation in expressive speech at the syllable level is studied. As emotions or attitudes can be conveyed by short spoken words, analysis of paradigmatic variations in vowels is an important issue to characterize the expressive content of such speech segments. The corpus contains 160 sentences produced under seven expressive conditions (Neutral, Anger, Fear, Surprise, Sensuality, Joy, Sadness) acted by a French female speaker (a total of 1120 sentences, 13 140 vowels). Eleven base acoustic parameters are selected for voice source and vocal tract related feature analysis. An acoustic description of the expressions is drawn, using the dimensions of melodic range, intensity, noise, spectral tilt, vocalic space, and dynamic features. The first three functions of a discriminant analysis explain 95% of the variance in the data. These statistical dimensions are consistently associated with acoustic dimensions. Covariation of intensity and F0 explains over 80% of the variance, followed by noise features (8%), covariation of spectral tilt, and F0 (7%). On the basis of isolated vowels alone, expressions are classified with a mean accuracy of 78%.</description><identifier>ISSN: 0001-4966</identifier><identifier>EISSN: 1520-8524</identifier><identifier>DOI: 10.1121/1.5018433</identifier><identifier>PMID: 29390730</identifier><identifier>CODEN: JASMAN</identifier><language>eng</language><publisher>United States: Acoustical Society of America</publisher><subject>Computer Science ; Human-Computer Interaction ; Humanities and Social Sciences ; Linguistics ; Musicology and performing arts ; Signal and Image Processing ; Sound</subject><ispartof>The Journal of the Acoustical Society of America, 2018-01, Vol.143 (1), p.109-122</ispartof><rights>Acoustical Society of America</rights><rights>Distributed under a Creative Commons Attribution 4.0 International License</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c424t-ea347bd8a27897273e3898e57b8c7303e4f570174b0a5404882032765eadfe793</citedby><cites>FETCH-LOGICAL-c424t-ea347bd8a27897273e3898e57b8c7303e4f570174b0a5404882032765eadfe793</cites><orcidid>0000-0001-6490-2386 ; 0000-0002-2629-8752</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://pubs.aip.org/jasa/article-lookup/doi/10.1121/1.5018433$$EHTML$$P50$$Gscitation$$H</linktohtml><link.rule.ids>207,208,230,314,780,784,794,885,1564,4021,4509,27921,27922,27923,76154</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29390730$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink><backlink>$$Uhttps://hal.science/hal-01914497$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Rilliard, Albert</creatorcontrib><creatorcontrib>d'Alessandro, Christophe</creatorcontrib><creatorcontrib>Evrard, Marc</creatorcontrib><title>Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis</title><title>The Journal of the Acoustical Society of America</title><addtitle>J Acoust Soc Am</addtitle><description>Acoustic variation in expressive speech at the syllable level is studied. As emotions or attitudes can be conveyed by short spoken words, analysis of paradigmatic variations in vowels is an important issue to characterize the expressive content of such speech segments. The corpus contains 160 sentences produced under seven expressive conditions (Neutral, Anger, Fear, Surprise, Sensuality, Joy, Sadness) acted by a French female speaker (a total of 1120 sentences, 13 140 vowels). Eleven base acoustic parameters are selected for voice source and vocal tract related feature analysis. An acoustic description of the expressions is drawn, using the dimensions of melodic range, intensity, noise, spectral tilt, vocalic space, and dynamic features. The first three functions of a discriminant analysis explain 95% of the variance in the data. These statistical dimensions are consistently associated with acoustic dimensions. Covariation of intensity and F0 explains over 80% of the variance, followed by noise features (8%), covariation of spectral tilt, and F0 (7%). On the basis of isolated vowels alone, expressions are classified with a mean accuracy of 78%.</description><subject>Computer Science</subject><subject>Human-Computer Interaction</subject><subject>Humanities and Social Sciences</subject><subject>Linguistics</subject><subject>Musicology and performing arts</subject><subject>Signal and Image Processing</subject><subject>Sound</subject><issn>0001-4966</issn><issn>1520-8524</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNp9kUtPWzEQha2qqKS0C_5AdZel0gU_Y7u7CEFBigSLdm059lxwdV_1JBf49zgkhVW7mjnWN0fjM4QcM3rKGGdn7FRRZqQQ78iMKU5ro7h8T2aUUlZLO58fko-Iv4tURtgP5JBbYakWdEbirc8-prvOr1OoJp9TaYa-GppqGh6gxSr1FTyOGRDTBBWOAOH-e7UIwwa3IxEw5DS-DPk-VjF10GNRvi3at0-Y8BM5aHyL8Hlfj8ivy4uf51f18ubH9fliWQfJ5boGL6ReReO5NlZzLUAYa0DplQllWQGyUZoyLVfUK0mlMZwKrucKfGxAW3FETna-9751Y06dz09u8MldLZZu-0aZZVJaPbHCft2xYx7-bADXrksYoG19D-VrjtmSkTWK8TfbkAfEDM2rN6NuewDH3P4Ahf2yt92sOoiv5N_EC_BtB2BI65es_-v2T3ga8hvoxtiIZ-4tm10</recordid><startdate>201801</startdate><enddate>201801</enddate><creator>Rilliard, Albert</creator><creator>d'Alessandro, Christophe</creator><creator>Evrard, Marc</creator><general>Acoustical Society of America</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>1XC</scope><scope>BXJBU</scope><scope>IHQJB</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0001-6490-2386</orcidid><orcidid>https://orcid.org/0000-0002-2629-8752</orcidid></search><sort><creationdate>201801</creationdate><title>Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis</title><author>Rilliard, Albert ; d'Alessandro, Christophe ; Evrard, Marc</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c424t-ea347bd8a27897273e3898e57b8c7303e4f570174b0a5404882032765eadfe793</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science</topic><topic>Human-Computer Interaction</topic><topic>Humanities and Social Sciences</topic><topic>Linguistics</topic><topic>Musicology and performing arts</topic><topic>Signal and Image Processing</topic><topic>Sound</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rilliard, Albert</creatorcontrib><creatorcontrib>d'Alessandro, Christophe</creatorcontrib><creatorcontrib>Evrard, Marc</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Hyper Article en Ligne (HAL)</collection><collection>HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société</collection><collection>HAL-SHS: Archive ouverte en Sciences de l'Homme et de la Société (Open Access)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>The Journal of the Acoustical Society of America</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rilliard, Albert</au><au>d'Alessandro, Christophe</au><au>Evrard, Marc</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis</atitle><jtitle>The Journal of the Acoustical Society of America</jtitle><addtitle>J Acoust Soc Am</addtitle><date>2018-01</date><risdate>2018</risdate><volume>143</volume><issue>1</issue><spage>109</spage><epage>122</epage><pages>109-122</pages><issn>0001-4966</issn><eissn>1520-8524</eissn><coden>JASMAN</coden><abstract>Acoustic variation in expressive speech at the syllable level is studied. As emotions or attitudes can be conveyed by short spoken words, analysis of paradigmatic variations in vowels is an important issue to characterize the expressive content of such speech segments. The corpus contains 160 sentences produced under seven expressive conditions (Neutral, Anger, Fear, Surprise, Sensuality, Joy, Sadness) acted by a French female speaker (a total of 1120 sentences, 13 140 vowels). Eleven base acoustic parameters are selected for voice source and vocal tract related feature analysis. An acoustic description of the expressions is drawn, using the dimensions of melodic range, intensity, noise, spectral tilt, vocalic space, and dynamic features. The first three functions of a discriminant analysis explain 95% of the variance in the data. These statistical dimensions are consistently associated with acoustic dimensions. Covariation of intensity and F0 explains over 80% of the variance, followed by noise features (8%), covariation of spectral tilt, and F0 (7%). On the basis of isolated vowels alone, expressions are classified with a mean accuracy of 78%.</abstract><cop>United States</cop><pub>Acoustical Society of America</pub><pmid>29390730</pmid><doi>10.1121/1.5018433</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-6490-2386</orcidid><orcidid>https://orcid.org/0000-0002-2629-8752</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0001-4966
ispartof The Journal of the Acoustical Society of America, 2018-01, Vol.143 (1), p.109-122
issn 0001-4966
1520-8524
language eng
recordid cdi_pubmed_primary_29390730
source AIP Journals Complete; Alma/SFX Local Collection; AIP Acoustical Society of America
subjects Computer Science
Human-Computer Interaction
Humanities and Social Sciences
Linguistics
Musicology and performing arts
Signal and Image Processing
Sound
title Paradigmatic variation of vowels in expressive speech: Acoustic description and dimensional analysis
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T14%3A59%3A30IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Paradigmatic%20variation%20of%20vowels%20in%20expressive%20speech:%20Acoustic%20description%20and%20dimensional%20analysis&rft.jtitle=The%20Journal%20of%20the%20Acoustical%20Society%20of%20America&rft.au=Rilliard,%20Albert&rft.date=2018-01&rft.volume=143&rft.issue=1&rft.spage=109&rft.epage=122&rft.pages=109-122&rft.issn=0001-4966&rft.eissn=1520-8524&rft.coden=JASMAN&rft_id=info:doi/10.1121/1.5018433&rft_dat=%3Cproquest_pubme%3E1993998512%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1993998512&rft_id=info:pmid/29390730&rfr_iscdi=true