Model training method and device and readable storage medium


Bibliographic details
Main author: ZENG HAIEN
Format: Patent
Language: Chinese; English
Description
Summary: An embodiment of the invention relates to a model training method and device and a readable storage medium. The method comprises: obtaining a sample data set corresponding to a target task, a pre-trained teacher model, and an i-th initial student model; performing the i-th channel pruning on the i-th initial student model to obtain the student model after the i-th channel pruning, where the initial value of i is 1; performing knowledge distillation according to the sample data set, the teacher model, and the student model after the i-th channel pruning to obtain the (i+1)-th initial student model, where the compression ratio between the (i+1)-th initial student model and the i-th initial student model equals a preset i-th compression ratio; and updating i = i + 1 and returning to the step of performing the i-th channel pruning on the i-th initial student model until the updated i is greater than a preset threshold N, thereby obtaining a target student model. Step-by-step compression is realized through successive prunin
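
The loop in the summary (prune, distill, increment i, repeat until i exceeds N) can be illustrated with a small toy sketch. It is not the patented implementation. Several things here are assumptions made only for illustration: the models are linear with one weight per "channel", channels are ranked by weight magnitude, distillation is a mean-squared-error fit of student outputs to teacher outputs, the "compression ratio" is read as the fraction of channels kept in each round, and the first student starts as a copy of the teacher. All function names (`prune_channels`, `distill`, `progressive_compress`) are hypothetical.

```python
import random


def prune_channels(weights, keep_ratio):
    """Channel pruning (assumed criterion): keep the largest-|weight| channels."""
    n_keep = max(1, round(len(weights) * keep_ratio))
    ranked = sorted(weights, key=lambda c: abs(weights[c]), reverse=True)
    return {c: weights[c] for c in sorted(ranked[:n_keep])}


def predict(weights, x):
    """Output of a toy linear model that only reads its surviving channels."""
    return sum(w * x[c] for c, w in weights.items())


def distill(student, teacher, samples, lr=0.05, epochs=200):
    """Knowledge distillation (assumed form): SGD on the squared error
    between student and teacher outputs over the sample data set."""
    student = dict(student)  # do not modify the caller's model
    for _ in range(epochs):
        for x in samples:
            err = predict(student, x) - predict(teacher, x)
            for c in student:
                student[c] -= lr * err * x[c]
    return student


def progressive_compress(student, teacher, samples, keep_ratios):
    """Round i prunes the i-th initial student, then distills it into the
    (i+1)-th initial student; len(keep_ratios) plays the role of N."""
    for ratio in keep_ratios:
        pruned = prune_channels(student, ratio)
        student = distill(pruned, teacher, samples)
    return student


random.seed(0)
# Toy teacher with 8 channels; the values are made up for the demo.
teacher = dict(enumerate([0.9, -0.1, 0.5, 0.05, -0.7, 0.3, 0.2, -0.4]))
samples = [[random.uniform(-1, 1) for _ in range(8)] for _ in range(64)]

# Two rounds (N = 2): 8 channels -> 6 channels -> 3 channels.
target = progressive_compress(dict(teacher), teacher, samples, [0.75, 0.5])
print(sorted(target))  # indices of the channels that survive both rounds
```

Because distillation in this sketch does not change the channel count, the size ratio between consecutive initial students equals the keep ratio chosen for that round, which mirrors the preset per-round compression ratio described in the summary.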