Model reasoning acceleration method and device
The invention provides a model reasoning acceleration method and device, and the method comprises the steps: obtaining request information, analyzing the request information, determining a target model service with a dependency relationship corresponding to the request information, and enabling an i...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides a model reasoning acceleration method and device, and the method comprises the steps: obtaining request information, analyzing the request information, determining a target model service with a dependency relationship corresponding to the request information, and enabling an integrated model service system to comprise at least one model service; determining a model parameter value, and processing the model parameter value through at least one target model service with a dependency relationship and a message queue arranged between the target model services in sequence to obtain a final output model calculation result; wherein the message queue between every two target model services with the dependency relationship is used for receiving task data generated by the previous target model service according to the processing capability value of the previous target model service, and sending the task data to the latter target model service according to the processing capability value of the la |
---|