Model deployment method and device
The invention provides a model deployment method and equipment. The method comprises the following steps: acquiring historical access information of an algorithm model library and performance information of each algorithm model in the algorithm model library; according to the historical access infor...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The invention provides a model deployment method and equipment. The method comprises the following steps: acquiring historical access information of an algorithm model library and performance information of each algorithm model in the algorithm model library; according to the historical access information and the performance information of each algorithm model, determining the types of target models needing to be deployed and the number of copies of each target model; determining the memory occupation amount of each target model; and deploying the target models on a server cluster according to the memory occupation amount of each target model, the number of copies of each target model and the total memory resource of the server cluster. According to the invention, the utilization rate of server system resources during model deployment is improved, and the model deployment efficiency is improved.
本申请提供一种模型部署方法和设备,该方法包括:获取算法模型库的历史访问信息和所述算法模型库中每个算法模型的性能信息;根据所述历史访问信息和所述每个算法模型的性能信息,确定需要部署的目标模型种类和每种所述目标模型的副本数量;确定每个 |
---|