METHOD AND APPARATUS FOR QUANTIZING NEURAL NETWORK MODEL, AND METHOD AND APPARATUS FOR PROCESSING DATA

A method and apparatus for quantizing a neural network model, and a method and apparatus for processing data, which belong to the field of artificial intelligence. An original neural network model comprises a first operator, a second operator and a first operation module, the first operation module...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: SUN, Fangxuan, LIAN, Shuo, CHANG, Jing, ZHOU, Jun, WANG, Chenxi
Format: Patent
Sprache:chi ; eng ; fre
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method and apparatus for quantizing a neural network model, and a method and apparatus for processing data, which belong to the field of artificial intelligence. An original neural network model comprises a first operator, a second operator and a first operation module, the first operation module being used for performing operation on an output of the first operator and an output of the second operator. The method for quantizing a neural network model comprises: determining a data quantization parameter according to the range of first training input data of a first operator and the range of second training input data of a second operator; and determining a quantized neural network model, and the quantized neural network model respectively quantizing quantized first input data of the first operator and quantized second input data of the second operator by using the data quantization parameter. A quantized processing result of a first operator and a quantized processing result of a second operator can be dire