Hierarchical Bayesian modeling of vowel formant data: Speaker-intrinsic and speaker-extrinsic approaches compared
Published in: | The Journal of the Acoustical Society of America 2012-09, Vol.132 (3_Supplement), p.2002-2002 |
---|---|
Main authors: | , |
Format: | Article |
Language: | English |
Online access: | Full text |
Abstract: | Vowel formant data is traditionally normalized across speakers by transforming a set of ‘raw’ measurements into ‘standardized’ ones in one of two ways. With a speaker-extrinsic method, data from each individual is normalized with respect to external baseline measures calculated across the population of all speakers in a corpus, whereas a speaker-intrinsic method normalizes entirely with respect to speaker-dependent variables. The present study reports on implementations of both these methods in terms of hierarchical statistical models whereby probability distributions for various model parameters can be obtained using Bayesian analysis (rather than merely ‘converting’ the measurements). In this new framework, a speaker-extrinsic approach can estimate (1) the size and shape of each speaker’s vowel space, (2) the locations of vowel categories across a speech community within a normalized space, and (3) individual speakers’ deviations from the community norms. However, this process relies on a number of assumptions that are not needed with a speaker-intrinsic approach, which instead makes many low-level discrete ‘decisions’ on a speaker-by-speaker basis. By testing multiple models on the same dataset (a large corpus of vowel data collected from 132 speakers of American English), the present study explores the comparative merits of speaker-extrinsic and speaker-intrinsic Bayesian models. |
---|---|
ISSN: | 0001-4966, 1520-8524 |
DOI: | 10.1121/1.4755408 |
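
The abstract contrasts two traditional ways of 'converting' raw formant measurements. The sketch below illustrates only that distinction, not the hierarchical Bayesian models the study reports. It assumes a simple z-score transform in both cases; the toy F1 values, the speaker labels, and the function names `normalize_intrinsic` and `normalize_extrinsic` are all hypothetical and do not come from the study's corpus or methods.

```python
from statistics import mean, pstdev

# Hypothetical toy data (invented for illustration): speaker -> F1 values in Hz.
f1_hz = {
    "spk_a": [300.0, 500.0, 700.0],
    "spk_b": [400.0, 600.0, 800.0],
}


def normalize_intrinsic(data):
    """Speaker-intrinsic sketch: z-score each speaker's values using
    only that speaker's own mean and standard deviation."""
    out = {}
    for speaker, values in data.items():
        mu, sd = mean(values), pstdev(values)
        out[speaker] = [(v - mu) / sd for v in values]
    return out


def normalize_extrinsic(data):
    """Speaker-extrinsic sketch: z-score every value against a baseline
    (mean and standard deviation) pooled over all speakers."""
    pooled = [v for values in data.values() for v in values]
    mu, sd = mean(pooled), pstdev(pooled)
    return {
        speaker: [(v - mu) / sd for v in values]
        for speaker, values in data.items()
    }


intrinsic = normalize_intrinsic(f1_hz)
extrinsic = normalize_extrinsic(f1_hz)

for speaker in f1_hz:
    print(speaker, "intrinsic:", [round(z, 3) for z in intrinsic[speaker]])
    print(speaker, "extrinsic:", [round(z, 3) for z in extrinsic[speaker]])
```

With this toy data the intrinsic transform maps both speakers onto the same values, because each speaker is centered and scaled on their own measurements, while the extrinsic transform preserves the offset between the two speakers relative to the pooled baseline. Either function returns single point values per measurement, which is the 'converting' step that the abstract's Bayesian framing replaces with probability distributions over model parameters.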