Model deployment method and device

The invention provides a model deployment method and equipment. The method comprises the following steps: acquiring historical access information of an algorithm model library and performance information of each algorithm model in the algorithm model library; according to the historical access infor...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LIN YINGJIE, JIAN RENXIAN, XIN CHENGJIE
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The invention provides a model deployment method and equipment. The method comprises the following steps: acquiring historical access information of an algorithm model library and performance information of each algorithm model in the algorithm model library; according to the historical access information and the performance information of each algorithm model, determining the types of target models needing to be deployed and the number of copies of each target model; determining the memory occupation amount of each target model; and deploying the target models on a server cluster according to the memory occupation amount of each target model, the number of copies of each target model and the total memory resource of the server cluster. According to the invention, the utilization rate of server system resources during model deployment is improved, and the model deployment efficiency is improved. 本申请提供一种模型部署方法和设备,该方法包括:获取算法模型库的历史访问信息和所述算法模型库中每个算法模型的性能信息;根据所述历史访问信息和所述每个算法模型的性能信息,确定需要部署的目标模型种类和每种所述目标模型的副本数量;确定每个