Large language model operation chain device and method based on tree structure


Bibliographic details
Main author: GUO HONGSEN
Format: Patent
Language: Chinese; English
Description
Summary: The invention relates to a large language model operation chain device and method based on a tree structure. The device comprises: a hierarchical structure processing module, which decomposes a large language model into a plurality of sub-models and organizes them in a tree structure; a context modeling module, which extracts context information from the input data and uses it to determine the sub-model path that needs to be computed; a computing resource allocation module, which receives the sub-model path determined by the context modeling module, calculates the resource requirements for that path, and dynamically allocates computing resources to the computation; and a model parameter storage module, which stores the parameters of the large language model and of its sub-models. Compared with the prior art, the method has the advantages that efficient o...
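
The summary names four modules but gives no implementation detail. The sketch below is only one possible reading of how they could fit together, not the patented method. Every name in it (`SubModelNode`, `extract_context`, `select_path`, `estimate_memory_bytes`) is hypothetical, and both the keyword-overlap routing and the bytes-per-parameter estimate are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class SubModelNode:
    """One sub-model in the tree (hypothetical structure, not from the patent)."""
    name: str
    param_count: int                    # illustrative size of this sub-model
    keywords: frozenset = frozenset()   # toy routing signal (assumption)
    children: list = field(default_factory=list)


def extract_context(text: str) -> set:
    """Toy stand-in for the context modeling module: a lowercase word set."""
    return set(text.lower().split())


def select_path(root: SubModelNode, context: set) -> list:
    """Walk down from the root, always entering the child whose keywords
    overlap the context most; stop at a leaf or when no child matches."""
    path = [root]
    node = root
    while node.children:
        best = max(node.children, key=lambda c: len(c.keywords & context))
        if not best.keywords & context:
            break
        path.append(best)
        node = best
    return path


def estimate_memory_bytes(path: list, bytes_per_param: int = 2) -> int:
    """Toy stand-in for the resource allocation module: memory needed for
    the parameters on the selected path only (2 bytes per parameter assumed)."""
    return sum(n.param_count for n in path) * bytes_per_param


# Hypothetical three-level tree; names and sizes are made up for illustration.
tree = SubModelNode("root", 1_000_000, children=[
    SubModelNode("code", 4_000_000, frozenset({"python", "function", "bug"}), [
        SubModelNode("code.debug", 2_000_000, frozenset({"bug", "error"})),
    ]),
    SubModelNode("math", 3_000_000, frozenset({"integral", "prove", "equation"})),
])

path = select_path(tree, extract_context("Fix the bug in this Python function"))
print([n.name for n in path])       # ['root', 'code', 'code.debug']
print(estimate_memory_bytes(path))  # 14000000
```

In this toy version only the sub-models on the selected path count toward the resource estimate, which is the efficiency idea the summary hints at. A real system would presumably replace the keyword match with a learned router and load the corresponding weights from the parameter store.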