Silicon photonic architecture for training deep neural networks with direct feedback alignment

There has been growing interest in using photonic processors for performing neural network inference operations; however, these networks are currently trained using standard digital electronics. Here, we propose on-chip training of neural networks enabled by a CMOS-compatible silicon photonic archit...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Optica 2022-12, Vol.9 (12), p.1323
Hauptverfasser:	Filipovich, Matthew J., Guo, Zhimu, Al-Qadasi, Mohammed, Marquez, Bicky A., Morison, Hugh D., Sorger, Volker J., Prucnal, Paul R., Shekhar, Sudip, Shastri, Bhavin J.
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	There has been growing interest in using photonic processors for performing neural network inference operations; however, these networks are currently trained using standard digital electronics. Here, we propose on-chip training of neural networks enabled by a CMOS-compatible silicon photonic architecture to harness the potential for massively parallel, efficient, and fast data operations. Our scheme employs the direct feedback alignment training algorithm, which trains neural networks using error feedback rather than error backpropagation, and can operate at speeds of trillions of multiply–accumulate (MAC) operations per second while consuming less than one picojoule per MAC operation. The photonic architecture exploits parallelized matrix–vector multiplications using arrays of microring resonators for processing multi-channel analog signals along single waveguide buses to calculate the gradient vector for each neural network layer in situ . We also experimentally demonstrate training deep neural networks with the MNIST dataset using on-chip MAC operation results. Our approach for efficient, ultra-fast neural network training showcases photonics as a promising platform for executing artificial intelligence applications.
ISSN:	2334-2536 2334-2536
DOI:	10.1364/OPTICA.475493