INFERENCING ENDPOINT DISCOVERY IN COMPUTING SYSTEMS

Techniques for machine learning inferencing endpoint discovery in a distributed computing system are discloses herein. In one example, a method includes searching a database containing machine learning endpoint records having data representing values of execution latency or prediction accuracy corre...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: FERRE, Juan Diego, HUANG, Hao, YANG, Zhenghua, PINNINTI, Ashish, AMLESHWARAM, Amit Anand, QIU, Long
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Techniques for machine learning inferencing endpoint discovery in a distributed computing system are discloses herein. In one example, a method includes searching a database containing machine learning endpoint records having data representing values of execution latency or prediction accuracy corresponding inferencing endpoints deployed in the distributed computing system. The method also includes generating a list of inferencing endpoints matching the individual target values and determining whether a count of the inferencing endpoints in the generated list exceeds a preset threshold. In response to determining that the identified count does not exceed the preset threshold, the method includes instantiating one or more additional inferencing endpoints in the distributed computing system based on the individual target values in the received query.