A Biologically Inspired Sound Localisation System Using a Silicon Cochlea Pair

We present a biologically inspired sound localisation system for reverberant environments using the Cascade of Asymmetric Resonators with Fast-Acting Compression (CAR-FAC) cochlear model. The system exploits a CAR-FAC pair to pre-process binaural signals that travel through the inherent delay line o...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Applied sciences 2021-02, Vol.11 (4), p.1519
Hauptverfasser:	Xu, Ying, Afshar, Saeed, Wang, Runchun, Cohen, Gregory, Singh Thakur, Chetan, Hamilton, Tara Julia, van Schaik, André
Format:	Artikel
Sprache:	eng
Schlagworte:	Cochlea Compression Correlation analysis Delay lines electronic cochlea Field programmable gate arrays Hair ITD Learning algorithms Localization Neural networks neuromorphic engineering Neurons Nonlinearity onset detection process innovation Signal processing Sound sound localisation Sound localization Sound sources Timing
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	We present a biologically inspired sound localisation system for reverberant environments using the Cascade of Asymmetric Resonators with Fast-Acting Compression (CAR-FAC) cochlear model. The system exploits a CAR-FAC pair to pre-process binaural signals that travel through the inherent delay line of the cascade structures, as each filter acts as a delay unit. Following the filtering, each cochlear channel is cross-correlated with all the channels of the other cochlea using a quantised instantaneous correlation function to form a 2-D instantaneous correlation matrix (correlogram). The correlogram contains both interaural time difference and spectral information. The generated correlograms are analysed using a regression neural network for localisation. We investigate the effect of the CAR-FAC nonlinearity on the system performance by comparing it with a CAR only version. To verify that the CAR/CAR-FAC and the quantised instantaneous correlation provide a suitable basis with which to perform sound localisation tasks, a linear regression, an extreme learning machine, and a convolutional neural network are trained to learn the azimuthal angle of the sound source from the correlogram. The system is evaluated using speech data recorded in a reverberant environment. We compare the performance of the linear CAR and nonlinear CAR-FAC models with current sound localisation systems as well as with human performance.
ISSN:	2076-3417 2076-3417
DOI:	10.3390/app11041519