Allocation of machine learning tasks into a shared cache

The present invention relates to allocation of machine learning tasks into a shared cache. The subject technology receives code corresponding to a neural network (NN) model, the code including particular operations that are performed by the NN model. The subject technology determines, among the part...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: XIAOZHONG YAO, CECILE M. FORET, SUNDARARAMAN HARIHARASUBRAMANIAN, FABIAN P. WANNER
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The present invention relates to allocation of machine learning tasks into a shared cache. The subject technology receives code corresponding to a neural network (NN) model, the code including particular operations that are performed by the NN model. The subject technology determines, among the particular operations, a set of operations that are to be allocated to a cache of the electronic devicethat is to execute the NN model. The subject technology generates a set of cache indicators corresponding to the determined set of operations. The subject technology compiles the code and the generated set of cache indicators to provide a compiled binary for the NN model to execute on a target device. 本公开涉及机器学习任务到共享高速缓存中的分配。本主题技术接收对应于神经网络(NN)模型的代码,所述代码包括由NN模型执行的特定操作。在所述特定操作中,本主题技术确定将分配给要执行所述NN模型的电子设备的高速缓存的一组操作。本主题技术生成对应于所述确定的一组操作的一组高速缓存指示器。本主题技术编译所述代码和所述生成的一组高速缓存指示器,以提供用于所述NN模型的编译的二进制文件以在目标设备上执行。