Fast Implementation of 4-bit Convolutional Neural Networks for Mobile Devices

Quantized low-precision neural networks are very popular because they require fewer computational resources for inference and can provide high performance, which is vital for real-time and embedded recognition systems. However, their advantages are apparent for FPGA and ASIC devices, while general-pu...

Bibliographic Details
Published in:	arXiv.org 2020-10
Main authors:	Trusov, Anton; Limonova, Elena; Slugin, Dmitry; Nikolaev, Dmitry; Arlazarov, Vladimir V
Format:	Article
Language:	English
Subjects:	Accuracy; Artificial neural networks; Electronic devices; Embedded systems; Floating point arithmetic; Inference; Measurement; Microprocessors; Multiplication; Neural networks; Optical character recognition; Wireless networks
Online access:	Full text