Information processing method and device, model training method and device, equipment and medium
Main authors: | , , , , , , , , |
---|---|
Format: | Patent |
Language: | Chinese; English |
Summary: | The invention provides an information processing method, a deep learning model training method and device, electronic equipment, a storage medium, and a program product. It relates to the technical field of artificial intelligence, in particular to large models, large language models, Transformers, dialogue models, generative models, and the like. In the specific implementation scheme, a result generation model processes input information to obtain intermediate features and an initial output result that responds to the input information. A target function plug-in, which evaluates the initial output result, is integrated in the result generation model. The target function plug-in processes the intermediate features to obtain an evaluation result for the initial output result, and the initial output result is then adjusted according to the evaluation result to obtain a target output result. |
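
The three steps in the summary (generate, evaluate from intermediate features, adjust) can be illustrated with a minimal Python sketch. The record does not say how the plug-in is integrated, what it measures, or how the adjustment is made, so every name here (`toy_generate`, `toy_plugin`, `adjust`, `process`), the word-count feature, and the 0.5 threshold are hypothetical stand-ins rather than the patented implementation.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Features = List[float]


@dataclass
class GenerationResult:
    """What the result generation model returns in this sketch."""
    features: Features      # intermediate features exposed to the plug-in
    initial_output: str     # initial response to the input information


def toy_generate(text: str) -> GenerationResult:
    """Hypothetical stand-in for the result generation model.

    It reverses the word order as its "answer" and exposes two toy
    intermediate features: word count and total character count.
    """
    words = text.split()
    features = [float(len(words)), float(sum(len(w) for w in words))]
    return GenerationResult(features=features,
                            initial_output=" ".join(reversed(words)))


def toy_plugin(features: Features) -> float:
    """Hypothetical evaluation plug-in.

    It reads only the intermediate features (not the output text) and
    returns a score in [0, 1]; here, inputs with more words score higher.
    """
    word_count = features[0]
    return min(word_count / 5.0, 1.0)


def adjust(initial_output: str, score: float, threshold: float = 0.5) -> str:
    """Assumed adjustment rule: flag outputs that score below the threshold."""
    if score >= threshold:
        return initial_output
    return initial_output + " [low confidence]"


def process(
    text: str,
    generate: Callable[[str], GenerationResult] = toy_generate,
    plugin: Callable[[Features], float] = toy_plugin,
) -> Tuple[str, float]:
    """Run the generate -> evaluate -> adjust sequence once."""
    result = generate(text)                 # features + initial output
    score = plugin(result.features)         # evaluation from features
    target_output = adjust(result.initial_output, score)
    return target_output, score


if __name__ == "__main__":
    print(process("hello world"))
    print(process("a b c d e"))
```

In this sketch the plug-in sees only the intermediate features, which mirrors the summary's wording that the plug-in processes the intermediate features rather than the output itself. A real system would presumably use learned hidden states and a trained evaluation head, which this record does not describe.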