Neural Network Inference Acceleration Method, Target Detection Method, Device, and Storage Medium

A neural network inference acceleration method includes: acquiring a neural network model to be accelerated and an accelerated data set; automatically performing accelerating process on the neural network model to be accelerated by using the accelerated data set to obtain the accelerated neural netw...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: ZU, Chunshan
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A neural network inference acceleration method includes: acquiring a neural network model to be accelerated and an accelerated data set; automatically performing accelerating process on the neural network model to be accelerated by using the accelerated data set to obtain the accelerated neural network model, wherein the accelerating process includes at least one of the following: model compression, graph optimization and deployment optimization, wherein the model compression includes at least one of the following: model quantification, model pruning and model distillation, wherein the graph optimization is the optimization for the directed graph of the neural network model to be accelerated, and the deployment optimization is the optimization for the deployment platform of the neural network model to be accelerated; and performing inference evaluation on the accelerated neural network model.