GPU-based deep neural network model training method and apparatus, and computer device
The invention relates to a GPU-based deep neural network model training method and device, computer equipment and a storage medium. The method comprises the steps: when a deep neural network model istrained for the first time, compressing output data of all hidden layers to a GPU main memory for sto...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention relates to a GPU-based deep neural network model training method and device, computer equipment and a storage medium. The method comprises the steps: when a deep neural network model istrained for the first time, compressing output data of all hidden layers to a GPU main memory for storage, and obtaining the compressed output data and the main memory allowance of the GPU; when the main memory margin does not reach the preset margin threshold, determining a preliminary hidden layer according to the sparse degree value of the output data and the time proportion of the compressed output data occupying the GPU main memory; when the deep neural network model is iteratively trained, according to the preliminary hidden layer, compressing output data of the preliminary hidden layerto a GPU main memory for storage to obtain a preliminary margin of the GPU main memory until the preliminary margin reaches a preset margin threshold; and when the preliminary margin reaches a presetmargin threshold, determini |
---|