Towards optimized tensor code generation for deep learning on Sunway many-core processor

The flourish of deep learning frameworks and hardware platforms has been demanding an efficient compiler that can shield the diversity in both software and hardware in order to provide application portability. Among the existing deep learning compilers, TVM is well known for its efficiency in code g...

Bibliographic details
Published in: Frontiers of Computer Science 2024-04, Vol.18 (2), p.182101, Article 182101
Main authors: LI, Mingzhen; LIU, Changxi; LIAO, Jianjin; ZHENG, Xuegui; YANG, Hailong; SUN, Rujun; XU, Jun; GAN, Lin; YANG, Guangwen; LUAN, Zhongzhi; QIAN, Depei
Format: Article
Language: English
Online access: Full text