Hierarchical Bayesian modeling of vowel formant data: Speaker-intrinsic and speaker-extrinsic approaches compared

Bibliographic Details
Published in:	The Journal of the Acoustical Society of America 2012-09, Vol.132 (3_Supplement), p.2002-2002
Main authors:	Albin, Aaron L., Rankinen, Wil A.
Format:	Article
Language:	English
Online access:	Full text

Description
Abstract:	Vowel formant data is traditionally normalized across speakers by transforming a set of ‘raw’ measurements into ‘standardized’ ones in one of two ways. With a speaker-extrinsic method, data from each individual is normalized with respect to external baseline measures calculated across the population of all speakers in a corpus, whereas a speaker-intrinsic method normalizes entirely with respect to speaker-dependent variables. The present study reports on implementations of both these methods in terms of hierarchical statistical models whereby probability distributions for various model parameters can be obtained using Bayesian analysis (rather than merely ‘converting’ the measurements). In this new framework, a speaker-extrinsic approach can estimate (1) the size and shape of each speaker’s vowel space, (2) the locations of vowel categories across a speech community within a normalized space, and (3) individual speakers’ deviations from the community norms. However, this process relies on a number of assumptions that are not needed with a speaker-intrinsic approach, which instead makes many low-level discrete ‘decisions’ on a speaker-by-speaker basis. By testing multiple models on the same dataset (a large corpus of vowel data collected from 132 speakers of American English), the present study explores the comparative merits of speaker-extrinsic and speaker-intrinsic Bayesian models.
ISSN:	0001-4966 1520-8524
DOI:	10.1121/1.4755408