Optical convolution operations with optical neural networks for incoherent color image recognition
•Developed ONNs that accelerate optical convolution operations using RGB pixel arrays integrated into conventional display technology.•Achieved 96% classification accuracy on an unseen 1,000-image test dataset after on-system iterative training with a normal convolution network.•Demonstrated efficie...
Gespeichert in:
Veröffentlicht in: | Optics and lasers in engineering 2025-02, Vol.185, p.108740, Article 108740 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •Developed ONNs that accelerate optical convolution operations using RGB pixel arrays integrated into conventional display technology.•Achieved 96% classification accuracy on an unseen 1,000-image test dataset after on-system iterative training with a normal convolution network.•Demonstrated efficient operation with classification accuracy exceeding 94%, using less than 34 aJ of optical energy per MAC operation.•Implemented depth-wise convolution with a rank-3 kernel, successfully classifying complex patterns containing three MNIST handwritten digits encoded in RGB.
This study introduces optical neural networks (ONNs) designed to accelerate optical convolution operations using a red, green, and blue (RGB) pixel array integrated into conventional display technology. In a proof-of-concept demonstration, we initially employed a rank-4 kernel for a normal convolution network, which was integrated with a fully connected layer, to accurately classify color images across five fruit categories. Following seven epochs of on-system iterative training for a 3,000 training dataset, the ONN achieved 96% classification accuracy and maintained robust performance on an unseen 1,000 test dataset. Our analysis also showed its potential for efficient operation, with a classification accuracy exceeding 94% using an average less than 34 aJ of optical energy per MAC operation. Additionally, we demonstrated depth-wise convolution with a rank-3 kernel, recurring the system to spectrally resolve the signals into independent R, G, and B channels. This architecture enabled the successful classification of complex patterns containing three MNIST handwritten digits encoded in RGB. Our strategy contributes significantly to optical computing and neuromorphic vision, facilitating efficient recognition of real-world, multi-color, and incoherent light images. |
---|---|
ISSN: | 0143-8166 |
DOI: | 10.1016/j.optlaseng.2024.108740 |