Information processing method and device, model training method and device, equipment and medium

The invention provides an information processing method, a deep learning model training method and device, electronic equipment, a storage medium and a program product, and relates to the technical field of artificial intelligence, in particular to the technical field of large models, large language...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GUO ZHI, ZHU KUNHONG, LIU LIN, LIN KUNHAI, LIANG ZHIHAO, YE CHAO, CUI ZIXIN, HE DENGWU, LI SHUANGLONG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides an information processing method, a deep learning model training method and device, electronic equipment, a storage medium and a program product, and relates to the technical field of artificial intelligence, in particular to the technical field of large models, large language models, Transformers, dialogue models, generative models and the like. According to the specific implementation scheme, input information is processed through a result generation model, intermediate features and an initial output result used for responding to the input information are obtained, a target function plug-in is integrated in the result generation model, and the target function plug-in is used for evaluating the initial output result; processing the intermediate feature by using the target function plug-in to obtain an evaluation result for evaluating the initial output result; and adjusting the initial output result according to the evaluation result to obtain a target output result. 本公开提供了信息处理方法、深度学习模