Enhanced Temporal Processing in Spiking Neural Networks for Static Object Detection Using 3D Convolutions
Spiking Neural Networks (SNNs) are a class of network models capable of processing spatiotemporal information, with event-driven characteristics and energy efficiency advantages. Recently, directly trained SNNs have shown potential to match or surpass the performance of traditional Artificial Neural...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Spiking Neural Networks (SNNs) are a class of network models capable of
processing spatiotemporal information, with event-driven characteristics and
energy efficiency advantages. Recently, directly trained SNNs have shown
potential to match or surpass the performance of traditional Artificial Neural
Networks (ANNs) in classification tasks. However, in object detection tasks,
directly trained SNNs still exhibit a significant performance gap compared to
ANNs when tested on frame-based static object datasets (such as COCO2017).
Therefore, bridging this performance gap and enabling directly trained SNNs to
achieve performance comparable to ANNs on these static datasets has become one
of the key challenges in the development of SNNs.To address this challenge,
this paper focuses on enhancing the SNN's unique ability to process
spatiotemporal information. Spiking neurons, as the core components of SNNs,
facilitate the exchange of information between different temporal channels
during the process of converting input floating-point data into binary spike
signals. However, existing neuron models still have certain limitations in the
communication of temporal information. Some studies have even suggested that
disabling the backpropagation in the time dimension during SNN training can
still yield good training results. To improve the SNN handling of temporal
information, this paper proposes replacing traditional 2D convolutions with 3D
convolutions, thus directly incorporating temporal information into the
convolutional process. Additionally, temporal information recurrence mechanism
is introduced within the neurons to further enhance the neurons' efficiency in
utilizing temporal information.Experimental results show that the proposed
method enables directly trained SNNs to achieve performance levels comparable
to ANNs on the COCO2017 and VOC datasets. |
---|---|
DOI: | 10.48550/arxiv.2412.17654 |