Non-ideal program-time conservation in charge trap flash for deep learning
Published in: | Semiconductor Science and Technology, 2023-10, Vol. 38 (10), p. 105008 |
---|---|
Main authors: | , , , , |
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Full text |
Abstract: | Training deep neural networks (DNNs) is computationally intensive, but arrays of non-volatile memories such as charge trap flash (CTF) can accelerate DNN operations using in-memory computing. Specifically, the resistive processing unit (RPU) architecture uses a voltage-threshold program with stochastic encoded pulse trains and analog memory features to accelerate vector-vector outer product and weight update for gradient descent algorithms. Although CTF, offering high precision, has been regarded as an excellent choice for implementing RPU, the accumulation of charge due to the applied stochastic pulse trains is ultimately of critical significance in determining the final weight update. In this paper, we report on the non-ideal program-time conservation in CTF through pulsing input measurements. We experimentally measure the effect of pulse width and pulse gap, keeping the total ON-time of the input pulse train constant, and report three non-idealities: (1) the cumulative $V_T$ shift reduces when total ON-time is fragmented into a larger number of shorter pulses, (2) the cumulative $V_T$ shift drops abruptly for pulse widths |
---|---|
ISSN: | 0268-1242 1361-6641 |
DOI: | 10.1088/1361-6641/aceea6 |