Model training method and device and readable storage medium
Main author: | |
---|---|
Format: | Patent |
Language: | chi ; eng |
Subjects: | |
Summary: | Embodiments of the invention relate to a model training method and device and a readable storage medium. The method comprises: obtaining a sample data set corresponding to a target task, a pre-trained teacher model, and an i-th initial student model; performing the i-th channel pruning on the i-th initial student model to obtain the student model after the i-th channel pruning, where the initial value of i is 1; performing knowledge distillation using the sample data set, the teacher model, and the student model after the i-th channel pruning to obtain an (i + 1)-th initial student model, where the compression ratio between the (i + 1)-th initial student model and the i-th initial student model equals a preset i-th compression ratio; and updating i = i + 1 and returning to the i-th channel pruning step until the updated i is greater than a preset threshold N, at which point the target student model is obtained. Step-by-step compression is realized through successive prunin |
---|
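
The loop described in the summary (prune, distill, increment i, stop once i exceeds N) can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration and is not taken from the patent: the toy one-hidden-layer network whose hidden units stand in for channels, the magnitude-based pruning criterion, the squared-error distillation loss, the reading of "compression ratio" as the fraction of channels kept, and all function names (`prune_channels`, `distill`, `progressive_compress`).

```python
import random

def forward(model, x):
    """Toy one-hidden-layer ReLU net; each hidden unit stands in for one channel."""
    hidden = [max(0.0, sum(w * xi for w, xi in zip(row, x))) for row in model["w1"]]
    return sum(v * h for v, h in zip(model["w2"], hidden)), hidden

def prune_channels(model, ratio):
    """Return a copy keeping the `ratio` fraction of channels with the largest
    weight magnitude (assumed criterion; the abstract does not specify one)."""
    n_keep = max(1, round(len(model["w1"]) * ratio))
    scores = [sum(abs(w) for w in row) * abs(v)
              for row, v in zip(model["w1"], model["w2"])]
    ranked = sorted(range(len(scores)), key=lambda c: -scores[c])
    keep = sorted(ranked[:n_keep])
    return {"w1": [list(model["w1"][c]) for c in keep],
            "w2": [model["w2"][c] for c in keep]}

def distill(student, teacher, samples, epochs=50, lr=0.01):
    """Fit student outputs to teacher outputs by per-sample gradient descent
    on squared error (assumed stand-in for the distillation loss)."""
    for _ in range(epochs):
        for x in samples:
            target, _ = forward(teacher, x)
            pred, hidden = forward(student, x)
            err = pred - target
            for c, h in enumerate(hidden):
                v = student["w2"][c]          # old output weight for the w1 gradient
                student["w2"][c] -= lr * err * h
                if h > 0:                     # ReLU passes gradient only when active
                    for j, xj in enumerate(x):
                        student["w1"][c][j] -= lr * err * v * xj
    return student

def progressive_compress(first_student, teacher, samples, ratios):
    """Run N = len(ratios) rounds: the i-th round prunes the i-th initial student
    by the i-th preset ratio, then distills to get the (i + 1)-th initial student."""
    student = first_student
    for ratio in ratios:                      # i = 1 .. N
        pruned = prune_channels(student, ratio)
        student = distill(pruned, teacher, samples)
    return student                            # target student model

random.seed(0)
teacher = {"w1": [[random.uniform(-1, 1) for _ in range(3)] for _ in range(16)],
           "w2": [random.uniform(-1, 1) for _ in range(16)]}
samples = [[random.uniform(-1, 1) for _ in range(3)] for _ in range(32)]
first_student = {"w1": [list(r) for r in teacher["w1"]], "w2": list(teacher["w2"])}
target_student = progressive_compress(first_student, teacher, samples, ratios=[0.5, 0.5])
print(len(target_student["w1"]))  # 4 channels remain: 16 -> 8 -> 4
```

Passing one ratio per round keeps each compression step small, which matches the step-by-step compression the summary describes; a real implementation would prune convolutional channels and use whatever distillation loss the full patent text specifies.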