Allocation of machine learning tasks into a shared cache

The present invention relates to allocation of machine learning tasks into a shared cache. The subject technology receives code corresponding to a neural network (NN) model, the code including particular operations that are performed by the NN model. The subject technology determines, among the part...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	XIAOZHONG YAO, CECILE M. FORET, SUNDARARAMAN HARIHARASUBRAMANIAN, FABIAN P. WANNER
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING ELECTRIC DIGITAL DATA PROCESSING PHYSICS
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	The present invention relates to allocation of machine learning tasks into a shared cache. The subject technology receives code corresponding to a neural network (NN) model, the code including particular operations that are performed by the NN model. The subject technology determines, among the particular operations, a set of operations that are to be allocated to a cache of the electronic devicethat is to execute the NN model. The subject technology generates a set of cache indicators corresponding to the determined set of operations. The subject technology compiles the code and the generated set of cache indicators to provide a compiled binary for the NN model to execute on a target device. 本公开涉及机器学习任务到共享高速缓存中的分配。本主题技术接收对应于神经网络(NN)模型的代码，所述代码包括由NN模型执行的特定操作。在所述特定操作中，本主题技术确定将分配给要执行所述NN模型的电子设备的高速缓存的一组操作。本主题技术生成对应于所述确定的一组操作的一组高速缓存指示器。本主题技术编译所述代码和所述生成的一组高速缓存指示器，以提供用于所述NN模型的编译的二进制文件以在目标设备上执行。