Code processing model training method, code processing model processing method, code processing model training system, code processing model processing equipment and medium

The invention discloses a code processing model training method and system, a code processing model processing method and system, equipment and a medium. A to-be-trained code processing model is obtained; training the code processing model through grammar and semantic training tasks to obtain a firs...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: WANG ZIHAN, SONG SHUANGYONG, YAO YITONG, LIU SHIXUAN, LIU XINZHANG, WANG CHAO, WANG YAN
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention discloses a code processing model training method and system, a code processing model processing method and system, equipment and a medium. A to-be-trained code processing model is obtained; training the code processing model through grammar and semantic training tasks to obtain a first processing model; training the first processing model by generating and retrieving a training task to obtain a second processing model; and performing instruction fine tuning on the second processing model to obtain a trained code processing model. A code processing model established in the method can flexibly switch an encoder mode, a decoder mode and an encoder-decoder mode so as to adapt to various task scenes; moreover, in the training process, grammar and semantic training tasks, generation and retrieval training tasks and instruction fine tuning are used for training the model, so that the code data processing performance of the model can be improved, the application effect of the model can be improved, and