Person authentication by fusing face and speech information

Bibliographic details
Main authors:	Duc, Benoît; Maître, Gilbert; Fischer, Stefan; Bigün, Josef
Format:	Book chapter
Language:	English
Subjects:	Face Image; Fusion Method; Receiver Operating Characteristic Curve; Speaker Verification; World Model
Online access:	Full text

Description
Abstract:	A person authentication technique using two modalities is presented. Results from individual experts, namely face and speech recognisers, are merged by a supervisor. The visual part involves matching of a coarse grid containing Gabor phase information from face images. The acoustic part is performed by a text-dependent speaker verification system based on Hidden Markov Models, which assumes as text a spelled sequence of digits. The merging of individual decisions is accomplished by one of two different methods: a simple averaging and a more sophisticated Bayesian method. Experimental results show that even the simple method provides improvements compared to single modalities. The improvements are significant with the Bayesian method. These results show that the use of two modalities increases authentication performance, at least under certain circumstances.
ISSN:	0302-9743 1611-3349
DOI:	10.1007/BFb0016010
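
The abstract names two ways a supervisor can merge the face and speech experts: simple averaging and a Bayesian method. The record gives no further detail, so the following is only a minimal sketch of the general idea, not the authors' implementation. The function names, the 0.5 threshold, the assumption that scores lie in [0, 1], and the use of independent Gaussian score models for clients and impostors are all hypothetical choices made for illustration.

```python
import math


def average_fusion(face_score, speech_score, threshold=0.5):
    """Accept the identity claim if the mean of the two expert scores
    reaches the threshold (threshold value is an illustrative assumption)."""
    fused = (face_score + speech_score) / 2.0
    return fused, fused >= threshold


def gaussian_pdf(x, mean, std):
    """Density of a normal distribution at x."""
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def bayes_fusion(scores, client_models, impostor_models, prior_client=0.5):
    """Posterior probability that the claim is genuine.

    Assumes (hypothetically) that each expert's score is Gaussian and that
    the experts are conditionally independent given the true class.
    Each model is a (mean, std) pair, one per expert.
    """
    weight_client = prior_client
    weight_impostor = 1.0 - prior_client
    for score, (mean_c, std_c), (mean_i, std_i) in zip(
        scores, client_models, impostor_models
    ):
        weight_client *= gaussian_pdf(score, mean_c, std_c)
        weight_impostor *= gaussian_pdf(score, mean_i, std_i)
    return weight_client / (weight_client + weight_impostor)
```

In this sketch, averaging treats both experts as equally reliable, whereas the Bayesian variant weights each expert by how well its score separates clients from impostors. The (mean, std) pairs would have to be estimated from training scores, which the record does not describe.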