Synthesis of soundfields through irregular loudspeaker arrays based on convolutional neural networks

Most soundfield synthesis approaches deal with extensive and regular loudspeaker arrays, which are often not suitable for home audio systems, due to physical space constraints. In this article, we propose a technique for soundfield synthesis through more easily deployable irregular loudspeaker array...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	EURASIP journal on audio, speech, and music processing speech, and music processing, 2024-03, Vol.2024 (1), p.17-20, Article 17
Hauptverfasser:	Comanducci, Luca, Antonacci, Fabio, Sarti, Augusto
Format:	Artikel
Sprache:	eng
Schlagworte:	Acoustics Arrays Artificial neural networks Complex-valued convolutional neural networks Decomposition Deep learning Empirical Research Engineering Engineering Acoustics Loudspeakers Machine learning Mathematics in Music Neural networks Plane waves Pressure-matching method Signal,Image and Speech Processing Soundfield synthesis Spatial audio Synthesis
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Most soundfield synthesis approaches deal with extensive and regular loudspeaker arrays, which are often not suitable for home audio systems, due to physical space constraints. In this article, we propose a technique for soundfield synthesis through more easily deployable irregular loudspeaker arrays, i.e., where the spacing between loudspeakers is not constant, based on deep learning. The input are the driving signals obtained through a plane wave decomposition-based technique. While the considered driving signals are able to correctly reproduce the soundfield with a regular array, they show degraded performances when using irregular setups. Through a complex-valued convolutional neural network (CNN), we modify the driving signals in order to compensate the errors in the reproduction of the desired soundfield. Since no ground truth driving signals are available for the compensated ones, we train the model by calculating the loss between the desired soundfield at a number of control points and the one obtained through the driving signals estimated by the network. The proposed model must be retrained for each irregular loudspeaker array configuration. Numerical results show better reproduction accuracy with respect to the plane wave decomposition-based technique, pressure-matching approach, and linear optimizers for driving signal compensation.
ISSN:	1687-4722 1687-4714 1687-4722
DOI:	10.1186/s13636-024-00337-7