Robust speaker identification via fusion of subglottal resonances and cepstral features
This paper investigates the use of subglottal resonances (SGRs) for noise-robust speaker identification (SID). It is motivated by the speaker specificity and stationarity of subglottal acoustics, and the development of noise-robust SGR estimation algorithms which are reliable at low SNRs for large d...
Gespeichert in:
Veröffentlicht in: | The Journal of the Acoustical Society of America 2017-05, Vol.141 (5), p.3468-3468 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper investigates the use of subglottal resonances (SGRs) for noise-robust speaker identification (SID). It is motivated by the speaker specificity and stationarity of subglottal acoustics, and the development of noise-robust SGR estimation algorithms which are reliable at low SNRs for large datasets. A two-stage framework is proposed which combines the SGRs with different cepstral features. The cepstral features are used in the first stage to reduce the number of target speakers for a test utterance, and then SGRs are used as complementary second-stage features to conduct identification. Experiments with the TIMIT and NIST 2008 databases show that SGRs, when used in conjunction with PNCCs and LPCCs, can improve the performance significantly (2-6% absolute accuracy improvement) across all noise conditions in mismatched situations. |
---|---|
ISSN: | 0001-4966 1520-8524 |
DOI: | 10.1121/1.4987208 |