Large language model operation chain device and method based on tree structure
Main author: | |
---|---|
Format: | Patent |
Language: | Chinese; English |
Subjects: | |
Summary: | The invention relates to a tree-structured operation chain device and method for large language models. The device comprises four modules: a hierarchical structure processing module, which decomposes a large language model into a plurality of sub-models and organizes them in a tree structure; a context modeling module, which extracts context information from the input data and uses it to determine the sub-model path that needs to be computed; a computing resource allocation module, which receives the sub-model path determined by the context modeling module, computes the resource requirements for that path, and dynamically allocates computing resources for the computation; and a model parameter storage module, which stores the parameters of the large language model and of its sub-models. Compared with the prior art, the method has the advantages that efficient o |
---|
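
The summary names the modules but gives no implementation detail, so the following is only a minimal Python sketch of how the pieces could fit together. Every name in it (`SubModel`, `extract_context`, `select_path`, `allocate`), the keyword-overlap routing rule, and the integer cost units are assumptions made for illustration; none of them come from the patent.

```python
from dataclasses import dataclass, field


@dataclass
class SubModel:
    """One node of the sub-model tree (hypothetical structure)."""
    name: str
    keywords: frozenset = frozenset()   # illustrative routing hint, not from the patent
    cost: int = 1                       # assumed resource units needed to run this node
    children: list = field(default_factory=list)


def extract_context(text: str) -> set:
    """Toy context extraction: the set of lowercase words in the input."""
    return set(text.lower().split())


def select_path(root: SubModel, context: set) -> list:
    """Walk down from the root, following at each level the child whose
    keywords overlap the context most; stop when no child matches."""
    path = [root]
    node = root
    while node.children:
        best = max(node.children, key=lambda c: len(c.keywords & context))
        if not best.keywords & context:
            break
        path.append(best)
        node = best
    return path


def allocate(path: list, budget: int) -> dict:
    """Reserve resources only for the sub-models on the selected path."""
    needed = sum(m.cost for m in path)
    if needed > budget:
        raise RuntimeError(f"path needs {needed} units, only {budget} available")
    return {m.name: m.cost for m in path}


# A small example tree: a root with two branches, one of which has a child.
tree = SubModel("root", children=[
    SubModel("code", frozenset({"python", "bug", "compile"}), cost=4, children=[
        SubModel("debugging", frozenset({"bug", "traceback"}), cost=2),
    ]),
    SubModel("legal", frozenset({"patent", "claim"}), cost=3),
])

path = select_path(tree, extract_context("Why does this Python bug appear"))
plan = allocate(path, budget=8)
```

In this sketch only the sub-models on the chosen path are costed and reserved, which is one plausible reading of determining "a sub-model path needing to be calculated" and allocating resources for it; a real system would replace the keyword match with a learned context model and the integer costs with actual memory or compute estimates.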