Proactive Embedding on Cold Data for Deep Learning Recommendation Model Training

Deep learning recommendation model (DLRM) is an important class of deep learning networks that are commonly used in many applications. DRLM presents unique challenges, especially for scale-out training since it not only has compute and memory-intensive components but the communication between the mu...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE computer architecture letters 2024-07, Vol.23 (2), p.203-206
Hauptverfasser:	Cho, Haeyoon, Son, Hyojun, Choi, Jungmin, Koh, Byungil, Ha, Minho, Kim, John
Format:	Artikel
Sprache:	eng
Schlagworte:	Backpropagation Data models Deep learning Graphics processing units Parallel processing Pipelines recommendation system Training
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Deep learning recommendation model (DLRM) is an important class of deep learning networks that are commonly used in many applications. DRLM presents unique challenges, especially for scale-out training since it not only has compute and memory-intensive components but the communication between the multiple GPUs is also on the critical path. In this work, we propose how cold data in DLRM embedding tables can be exploited to propose proactive embedding. In particular, proactive embedding allows embedding table accesses to be done in advance to reduce the impact of the memory access latency by overlapping the embedding access with communication. Our analysis of proactive embedding demonstrates that it can improve overall training performance by 46%.
ISSN:	1556-6056 1556-6064
DOI:	10.1109/LCA.2024.3445948