Automatic analysis of Mandarin accented English using phonological features
► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American En...
Gespeichert in:
Veröffentlicht in: | Speech communication 2012, Vol.54 (1), p.40-54 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 54 |
---|---|
container_issue | 1 |
container_start_page | 40 |
container_title | Speech communication |
container_volume | 54 |
creator | Sangwan, Abhijeet Hansen, John H.L. |
description | ► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with
p
<
0.01).
The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition). |
doi_str_mv | 10.1016/j.specom.2011.06.003 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_926319783</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0167639311000872</els_id><sourcerecordid>1024642277</sourcerecordid><originalsourceid>FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</originalsourceid><addsrcrecordid>eNqN0U1LJDEQBuAgLji6_gMPfRG9dJuvTrovgoirosteds8hk1SPGXqSMdUt-O83w4hH8VJ1eaoK6iXkjNGGUaau1g1uwaVNwyljDVUNpeKALFinea1Zxw_JojBdK9GLI3KMuKaUyq7jC_J0M09pY6fgKhvt-I4BqzRUv230NodYWecgTuCru7gaA75UM4a4qrYvKaYxrYKzYzWAneYM-JP8GOyIcPrRT8i_X3d_bx_q5z_3j7c3z7WTQk5161vtwS-lV2rJnfBaglc9CCGHftlpNvCOWcUkWM8106VQwb10AqhfdlSckIv93m1OrzPgZDYBHYyjjZBmND1XgvW6E0VefikZ5VJJzrX-BmVU9UK1Oyr31OWEmGEw2xw2Nr8XtHPKrM0-ELMLxFBlSiBl7PzjgsXytiHb6AJ-znKp21ZKVdz13kH54VuAbNAFiA58yOAm41P4-tB_VdeieA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1010693657</pqid></control><display><type>article</type><title>Automatic analysis of Mandarin accented English using phonological features</title><source>Access via ScienceDirect (Elsevier)</source><creator>Sangwan, Abhijeet ; Hansen, John H.L.</creator><creatorcontrib>Sangwan, Abhijeet ; Hansen, John H.L.</creatorcontrib><description>► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with
p
<
0.01).
The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).</description><identifier>ISSN: 0167-6393</identifier><identifier>EISSN: 1872-7182</identifier><identifier>DOI: 10.1016/j.specom.2011.06.003</identifier><identifier>CODEN: SCOMDH</identifier><language>eng</language><publisher>Amsterdam: Elsevier B.V</publisher><subject>Accent analysis ; Applied sciences ; Classification ; Cross sections ; Exact sciences and technology ; Information, signal and communications theory ; Mandarins ; Markov models ; Mathematical models ; Methodology ; Miscellaneous ; Non-native speaker traits ; Phonological features ; Signal processing ; Speech ; Speech processing ; Strategy ; Telecommunications and information theory</subject><ispartof>Speech communication, 2012, Vol.54 (1), p.40-54</ispartof><rights>2011 Elsevier B.V.</rights><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</citedby><cites>FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.specom.2011.06.003$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>315,781,785,3551,4025,27928,27929,27930,46000</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=24755446$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Sangwan, Abhijeet</creatorcontrib><creatorcontrib>Hansen, John H.L.</creatorcontrib><title>Automatic analysis of Mandarin accented English using phonological features</title><title>Speech communication</title><description>► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with
p
<
0.01).
The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).</description><subject>Accent analysis</subject><subject>Applied sciences</subject><subject>Classification</subject><subject>Cross sections</subject><subject>Exact sciences and technology</subject><subject>Information, signal and communications theory</subject><subject>Mandarins</subject><subject>Markov models</subject><subject>Mathematical models</subject><subject>Methodology</subject><subject>Miscellaneous</subject><subject>Non-native speaker traits</subject><subject>Phonological features</subject><subject>Signal processing</subject><subject>Speech</subject><subject>Speech processing</subject><subject>Strategy</subject><subject>Telecommunications and information theory</subject><issn>0167-6393</issn><issn>1872-7182</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNqN0U1LJDEQBuAgLji6_gMPfRG9dJuvTrovgoirosteds8hk1SPGXqSMdUt-O83w4hH8VJ1eaoK6iXkjNGGUaau1g1uwaVNwyljDVUNpeKALFinea1Zxw_JojBdK9GLI3KMuKaUyq7jC_J0M09pY6fgKhvt-I4BqzRUv230NodYWecgTuCru7gaA75UM4a4qrYvKaYxrYKzYzWAneYM-JP8GOyIcPrRT8i_X3d_bx_q5z_3j7c3z7WTQk5161vtwS-lV2rJnfBaglc9CCGHftlpNvCOWcUkWM8106VQwb10AqhfdlSckIv93m1OrzPgZDYBHYyjjZBmND1XgvW6E0VefikZ5VJJzrX-BmVU9UK1Oyr31OWEmGEw2xw2Nr8XtHPKrM0-ELMLxFBlSiBl7PzjgsXytiHb6AJ-znKp21ZKVdz13kH54VuAbNAFiA58yOAm41P4-tB_VdeieA</recordid><startdate>2012</startdate><enddate>2012</enddate><creator>Sangwan, Abhijeet</creator><creator>Hansen, John H.L.</creator><general>Elsevier B.V</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7T9</scope><scope>8BM</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>2012</creationdate><title>Automatic analysis of Mandarin accented English using phonological features</title><author>Sangwan, Abhijeet ; Hansen, John H.L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c434t-5d57dedb4d66b2c3d74ed69e334f9b871f281a614ead2717d27032d4c3e0db803</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Accent analysis</topic><topic>Applied sciences</topic><topic>Classification</topic><topic>Cross sections</topic><topic>Exact sciences and technology</topic><topic>Information, signal and communications theory</topic><topic>Mandarins</topic><topic>Markov models</topic><topic>Mathematical models</topic><topic>Methodology</topic><topic>Miscellaneous</topic><topic>Non-native speaker traits</topic><topic>Phonological features</topic><topic>Signal processing</topic><topic>Speech</topic><topic>Speech processing</topic><topic>Strategy</topic><topic>Telecommunications and information theory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Sangwan, Abhijeet</creatorcontrib><creatorcontrib>Hansen, John H.L.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Linguistics and Language Behavior Abstracts (LLBA)</collection><collection>ComDisDome</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Speech communication</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sangwan, Abhijeet</au><au>Hansen, John H.L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Automatic analysis of Mandarin accented English using phonological features</atitle><jtitle>Speech communication</jtitle><date>2012</date><risdate>2012</risdate><volume>54</volume><issue>1</issue><spage>40</spage><epage>54</epage><pages>40-54</pages><issn>0167-6393</issn><eissn>1872-7182</eissn><coden>SCOMDH</coden><abstract>► This study presents a new model for accent based on phonological features (PFs). ► Pronunciation variations are viewed as different paths in the PFs space-time. ► Markov models learn the pronunciations variations between native and non-native speakers. ► Native speakers of Mandarin and American English are used for evaluation. ► Proposed system shows high correlation with human judgment of accent (0.89 with
p
<
0.01).
The problem of accent analysis and modeling has been considered from a variety of domains, including linguistic structure, statistical analysis of speech production features, and HMM/GMM (hidden Markov model/Gaussian mixture model) model classification. These studies however fail to connect speech production from a temporal perspective through a final classification strategy. Here, a novel accent analysis system and methodology which exploits the power of phonological features (PFs) is presented. The proposed system exploits the knowledge of articulation embedded in phonology by building Markov models (MMs) of PFs extracted from accented speech. The Markov models capture information in the PF space along two dimensions of articulation: PF state-transitions and state-durations. Furthermore, by utilizing MMs of native and non-native accents, a new statistical measure of “accentedness” is developed which rates the articulation of a word by a speaker on a scale of native-like (+1) to non-native like (−1). The proposed methodology is then used to perform an automatic cross-sectional study of accented English spoken by native speakers of Mandarin Chinese (N-MC). The experimental results demonstrate the capability of the proposed system to perform quantitative as well as qualitative analysis of foreign accents. The work developed in this study can be easily expanded into language learning systems, and has potential impact in the areas of speaker recognition and ASR (automatic speech recognition).</abstract><cop>Amsterdam</cop><pub>Elsevier B.V</pub><doi>10.1016/j.specom.2011.06.003</doi><tpages>15</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0167-6393 |
ispartof | Speech communication, 2012, Vol.54 (1), p.40-54 |
issn | 0167-6393 1872-7182 |
language | eng |
recordid | cdi_proquest_miscellaneous_926319783 |
source | Access via ScienceDirect (Elsevier) |
subjects | Accent analysis Applied sciences Classification Cross sections Exact sciences and technology Information, signal and communications theory Mandarins Markov models Mathematical models Methodology Miscellaneous Non-native speaker traits Phonological features Signal processing Speech Speech processing Strategy Telecommunications and information theory |
title | Automatic analysis of Mandarin accented English using phonological features |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-14T18%3A23%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Automatic%20analysis%20of%20Mandarin%20accented%20English%20using%20phonological%20features&rft.jtitle=Speech%20communication&rft.au=Sangwan,%20Abhijeet&rft.date=2012&rft.volume=54&rft.issue=1&rft.spage=40&rft.epage=54&rft.pages=40-54&rft.issn=0167-6393&rft.eissn=1872-7182&rft.coden=SCOMDH&rft_id=info:doi/10.1016/j.specom.2011.06.003&rft_dat=%3Cproquest_cross%3E1024642277%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1010693657&rft_id=info:pmid/&rft_els_id=S0167639311000872&rfr_iscdi=true |