GPU-based deep neural network model training method and apparatus, and computer device

Bibliographic details
Main authors: LI KEQIN, TANG ZHUO, TAN GUANGHUA, LI KENLI, LIU CHUBO, YANG WANGDONG, CHEN ZAILONG, ZHU NINGBO, XIAO GUOQING, ZHOU XU
Format: Patent
Language: Chinese; English
Description
Summary: The invention relates to a GPU-based deep neural network model training method and device, computer equipment and a storage medium. The method comprises the following steps: when a deep neural network model is trained for the first time, compressing the output data of all hidden layers into GPU main memory for storage, and obtaining the compressed output data and the main memory margin of the GPU; when the main memory margin does not reach the preset margin threshold, determining a preliminary hidden layer according to the sparsity value of the output data and the proportion of time for which the compressed output data occupies the GPU main memory; when the deep neural network model is trained iteratively, compressing the output data of the preliminary hidden layer into GPU main memory for storage to obtain a preliminary margin of the GPU main memory, until the preliminary margin reaches the preset margin threshold; and when the preliminary margin reaches the preset margin threshold, determini
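
The layer-selection step in the summary can be illustrated with a minimal sketch. This is one possible reading of the truncated abstract, not the patented procedure: the names `LayerStats`, `sparsity`, and `select_layers`, the product score `sparsity * time_ratio`, and the byte figures are all assumptions made for illustration. The sketch ranks hidden layers by how sparse their output is and how long that output stays resident, then marks layers for compression until the free-memory margin reaches the threshold.

```python
from dataclasses import dataclass


@dataclass
class LayerStats:
    name: str
    sparsity: float    # fraction of zero values in the layer's output
    time_ratio: float  # fraction of an iteration the output stays in GPU memory
    saved_bytes: int   # bytes freed if this layer's output is compressed


def sparsity(values):
    """Return the fraction of entries that are exactly zero."""
    if not values:
        return 0.0
    return sum(1 for v in values if v == 0) / len(values)


def select_layers(stats, margin_bytes, threshold_bytes):
    """Greedily choose layers to compress until the margin reaches the threshold.

    The ranking score is a hypothetical choice: the abstract only says that
    sparsity and residency time are both taken into account.
    """
    ranked = sorted(stats, key=lambda s: s.sparsity * s.time_ratio, reverse=True)
    chosen = []
    for layer in ranked:
        if margin_bytes >= threshold_bytes:
            break  # enough free memory; stop adding layers
        chosen.append(layer.name)
        margin_bytes += layer.saved_bytes
    return chosen, margin_bytes


# Illustrative numbers only.
stats = [
    LayerStats("conv1", sparsity=0.2, time_ratio=0.9, saved_bytes=100),
    LayerStats("relu2", sparsity=0.8, time_ratio=0.5, saved_bytes=300),
    LayerStats("fc3", sparsity=0.5, time_ratio=0.1, saved_bytes=200),
]
chosen, margin = select_layers(stats, margin_bytes=50, threshold_bytes=400)
```

With these made-up figures the sketch stops after two layers, because compressing them already lifts the margin above the threshold; a real system would measure sparsity and residency time during the first training pass rather than hard-code them.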