Spike Timing Dependent Gradient for Direct Training of Fast and Efficient Binarized Spiking Neural Networks

Spiking neural networks (SNNs) are well-suited for neuromorphic hardware due to their biological plausibility and energy efficiency. These networks utilize sparse, asynchronous spikes for communication and can be binarized. However, the training of such networks presents several challenges due to th...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE journal on emerging and selected topics in circuits and systems 2023-12, Vol.13 (4), p.1083-1093
Hauptverfasser:	Cai, Zhengyu, Kalatehbali, Hamid Rahimian, Walters, Ben, Rahimi Azghadi, Mostafa, Amirsoleimani, Amirali, Genov, Roman
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Algorithms Back propagation networks binarized neural network Biological neural networks Circuits and systems Encoding event-driven computing Membrane potentials Motorcycles Neural networks neuromorphic computing Neurons Spike-timing-dependent learning Spiking spiking neural networks Timing Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Spiking neural networks (SNNs) are well-suited for neuromorphic hardware due to their biological plausibility and energy efficiency. These networks utilize sparse, asynchronous spikes for communication and can be binarized. However, the training of such networks presents several challenges due to their non-differentiable activation function and binarized inter-layer data movement. The well-established backpropagation through time (BPTT) algorithm used to train SNNs encounters notable difficulties because of its substantial memory consumption and extensive computational demands. These limitations restrict its practical utility in real-world scenarios. Therefore, effective techniques are required to train such networks efficiently while preserving accuracy. In this paper, we propose Binarized Spike Timing Dependent Gradient (BSTDG), a novel method that utilizes presynaptic and postsynaptic timings to bypass the non-differentiable gradient and the need of BPTT. Additionally, we employ binarized weights with a threshold training strategy to enhance energy savings and performance. Moreover, we exploit latency/temporal-based coding and the Integrate-and-Fire (IF) model to achieve significant computational advantages. We evaluate the proposed method on Caltech101 Face/Motorcycle, MNIST, Fashion-MNIST, and Spiking Heidelberg Digits. The results demonstrate that the accuracy attained surpasses that of existing BSNNs and single-spike networks under the same structure. Furthermore, the proposed model achieves up to 30 \times \times \times speedup in inference and effectively reduces the number of spikes emitted in the hidden layer by 50% compared to previous works.
ISSN:	2156-3357 2156-3365
DOI:	10.1109/JETCAS.2023.3328926