Model reasoning acceleration method and device

Bibliographic Details
Main Authors: ZHANG DE, HUANG LELE, LIU SHUAICHAO
Format: Patent
Language: chi; eng
Description
Summary: The invention provides a model reasoning acceleration method and device. The method comprises: obtaining request information and analyzing it to determine the target model services, linked by a dependency relationship, that correspond to the request, where the integrated model service system comprises at least one model service; and determining a model parameter value and processing it in sequence through the target model services and the message queues arranged between them to obtain the final model calculation result. The message queue between every two dependent target model services receives task data generated by the previous target model service according to that service's processing capability value, and sends the task data to the latter target model service according to the processing capability value of the la...
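
The arrangement in the summary, dependent model services chained through message queues, can be illustrated with a minimal Python sketch. It is not the patented implementation: the names run_stage and run_pipeline are hypothetical, each "model service" is reduced to a plain function, and the "processing capability value" is assumed here to be the capacity of the queue feeding a service, so that a slower downstream service applies backpressure to the one before it.

```python
import queue
import threading

_STOP = object()  # sentinel marking the end of the task stream


def run_stage(func, inbox, outbox):
    """Hypothetical model service: take tasks from inbox, push results to outbox."""
    while True:
        item = inbox.get()
        if item is _STOP:
            outbox.put(_STOP)  # pass the end marker on to the next service
            return
        outbox.put(func(item))


def run_pipeline(stages, inputs):
    """Run `inputs` through `stages`, a list of (func, capacity) pairs given
    in dependency order.

    `capacity` is an assumed stand-in for the processing capability value:
    it bounds the queue in front of that stage, so a producer blocks once
    the consumer's queue is full.
    """
    queues = [queue.Queue(maxsize=capacity) for _, capacity in stages]
    queues.append(queue.Queue())  # unbounded queue collecting final results

    threads = [
        threading.Thread(target=run_stage, args=(func, queues[i], queues[i + 1]))
        for i, (func, _) in enumerate(stages)
    ]
    for t in threads:
        t.start()

    # Feed the first service; put() blocks while its queue is full.
    for x in inputs:
        queues[0].put(x)
    queues[0].put(_STOP)

    # Drain the final queue until the end marker arrives.
    results = []
    while True:
        item = queues[-1].get()
        if item is _STOP:
            break
        results.append(item)

    for t in threads:
        t.join()
    return results
```

For example, run_pipeline([(lambda x: x * 2, 2), (lambda x: x + 1, 1)], range(5)) passes each value through a doubling service and then an incrementing service. Because each stage runs in a single thread and the queues are first-in first-out, results come back in input order.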