Automatic analysis of Mandarin accented English using phonological features

► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American En...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Speech communication 2012, Vol.54 (1), p.40-54
Hauptverfasser: Sangwan, Abhijeet, Hansen, John H.L.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 54
container_issue 1
container_start_page 40
container_title Speech communication
container_volume 54
creator Sangwan, Abhijeet
Hansen, John H.L.
description ► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with p < 0.01). The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).
doi_str_mv 10.1016/j.specom.2011.06.003
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_926319783</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639311000872</els_id><sourcerecordid>1024642277</sourcerecordid><originalsourceid>FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</originalsourceid><addsrcrecordid>eNqN0U1LJDEQBuAgLji6_gMPfRG9dJuvTrovgoirosteds8hk1SPGXqSMdUt-O83w4hH8VJ1eaoK6iXkjNGGUaau1g1uwaVNwyljDVUNpeKALFinea1Zxw_JojBdK9GLI3KMuKaUyq7jC_J0M09pY6fgKhvt-I4BqzRUv230NodYWecgTuCru7gaA75UM4a4qrYvKaYxrYKzYzWAneYM-JP8GOyIcPrRT8i_X3d_bx_q5z_3j7c3z7WTQk5161vtwS-lV2rJnfBaglc9CCGHftlpNvCOWcUkWM8106VQwb10AqhfdlSckIv93m1OrzPgZDYBHYyjjZBmND1XgvW6E0VefikZ5VJJzrX-BmVU9UK1Oyr31OWEmGEw2xw2Nr8XtHPKrM0-ELMLxFBlSiBl7PzjgsXytiHb6AJ-znKp21ZKVdz13kH54VuAbNAFiA58yOAm41P4-tB_VdeieA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1010693657</pqid></control><display><type>article</type><title>Automatic analysis of Mandarin accented English using phonological features</title><source>Access via ScienceDirect (Elsevier)</source><creator>Sangwan, Abhijeet ; Hansen, John H.L.</creator><creatorcontrib>Sangwan, Abhijeet ; Hansen, John H.L.</creatorcontrib><description>► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with p &lt; 0.01). The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2011.06.003</identifier><identifier>CODEN: SCOMDH</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Accent analysis ; Applied sciences ; Classification ; Cross sections ; Exact sciences and technology ; Information, signal and communications theory ; Mandarins ; Markov models ; Mathematical models ; Methodology ; Miscellaneous ; Non-native speaker traits ; Phonological features ; Signal processing ; Speech ; Speech processing ; Strategy ; Telecommunications and information theory</subject><ispartof>Speech communication, 2012, Vol.54 (1), p.40-54</ispartof><rights>2011 Elsevier B.V.</rights><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</citedby><cites>FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.specom.2011.06.003$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>315,781,785,3551,4025,27928,27929,27930,46000</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=24755446$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Sangwan, Abhijeet</creatorcontrib><creatorcontrib>Hansen, John H.L.</creatorcontrib><title>Automatic analysis of Mandarin accented English using phonological features</title><title>Speech communication</title><description>► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with p &lt; 0.01). The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).</description><subject>Accent analysis</subject><subject>Applied sciences</subject><subject>Classification</subject><subject>Cross sections</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Mandarins</subject><subject>Markov models</subject><subject>Mathematical models</subject><subject>Methodology</subject><subject>Miscellaneous</subject><subject>Non-native speaker traits</subject><subject>Phonological features</subject><subject>Signal processing</subject><subject>Speech</subject><subject>Speech processing</subject><subject>Strategy</subject><subject>Telecommunications and information theory</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNqN0U1LJDEQBuAgLji6_gMPfRG9dJuvTrovgoirosteds8hk1SPGXqSMdUt-O83w4hH8VJ1eaoK6iXkjNGGUaau1g1uwaVNwyljDVUNpeKALFinea1Zxw_JojBdK9GLI3KMuKaUyq7jC_J0M09pY6fgKhvt-I4BqzRUv230NodYWecgTuCru7gaA75UM4a4qrYvKaYxrYKzYzWAneYM-JP8GOyIcPrRT8i_X3d_bx_q5z_3j7c3z7WTQk5161vtwS-lV2rJnfBaglc9CCGHftlpNvCOWcUkWM8106VQwb10AqhfdlSckIv93m1OrzPgZDYBHYyjjZBmND1XgvW6E0VefikZ5VJJzrX-BmVU9UK1Oyr31OWEmGEw2xw2Nr8XtHPKrM0-ELMLxFBlSiBl7PzjgsXytiHb6AJ-znKp21ZKVdz13kH54VuAbNAFiA58yOAm41P4-tB_VdeieA</recordid><startdate>2012</startdate><enddate>2012</enddate><creator>Sangwan, Abhijeet</creator><creator>Hansen, John H.L.</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope><scope>8BM</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2012</creationdate><title>Automatic analysis of Mandarin accented English using phonological features</title><author>Sangwan, Abhijeet ; Hansen, John H.L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Accent analysis</topic><topic>Applied sciences</topic><topic>Classification</topic><topic>Cross sections</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Mandarins</topic><topic>Markov models</topic><topic>Mathematical models</topic><topic>Methodology</topic><topic>Miscellaneous</topic><topic>Non-native speaker traits</topic><topic>Phonological features</topic><topic>Signal processing</topic><topic>Speech</topic><topic>Speech processing</topic><topic>Strategy</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sangwan, Abhijeet</creatorcontrib><creatorcontrib>Hansen, John H.L.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ComDisDome</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sangwan, Abhijeet</au><au>Hansen, John H.L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic analysis of Mandarin accented English using phonological features</atitle><jtitle>Speech communication</jtitle><date>2012</date><risdate>2012</risdate><volume>54</volume><issue>1</issue><spage>40</spage><epage>54</epage><pages>40-54</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><coden>SCOMDH</coden><abstract>► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with p &lt; 0.01). The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2011.06.003</doi><tpages>15</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0167-6393
ispartof Speech communication, 2012, Vol.54 (1), p.40-54
issn 0167-6393
1872-7182
language eng
recordid cdi_proquest_miscellaneous_926319783
source Access via ScienceDirect (Elsevier)
subjects Accent analysis
Applied sciences
Classification
Cross sections
Exact sciences and technology
Information, signal and communications theory
Mandarins
Markov models
Mathematical models
Methodology
Miscellaneous
Non-native speaker traits
Phonological features
Signal processing
Speech
Speech processing
Strategy
Telecommunications and information theory
title Automatic analysis of Mandarin accented English using phonological features
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T18%3A23%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20analysis%20of%20Mandarin%20accented%20English%20using%20phonological%20features&rft.jtitle=Speech%20communication&rft.au=Sangwan,%20Abhijeet&rft.date=2012&rft.volume=54&rft.issue=1&rft.spage=40&rft.epage=54&rft.pages=40-54&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2011.06.003&rft_dat=%3Cproquest_cross%3E1024642277%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1010693657&rft_id=info:pmid/&rft_els_id=S0167639311000872&rfr_iscdi=true