Dual tensor parallel method for mixed word frequency embedding

Bibliographic details
Main authors: BIAN ZHENGDA, MAI SIQI, LI YONGBIN, LEE, SEUNG-GYE, HUANG HAICHEN, LOU YUXUAN, HAN JIATONG, LIU YULIANG, LU GUANGYANG, WU JUNMING, CHEN WEIWEN, LIU HONGXIN, FANG JIARUI
Format: Patent
Language: Chinese; English
Description
Summary: The invention discloses a dual tensor parallel method for mixed word frequency embedding, comprising the following steps: S1, a task distributor scans the training data set once and counts the occurrence frequency of the word id of each query; a greedy algorithm (minimax: making the maximum difference in word frequency between the embedding tables after partitioning as small as possible) then partitions the rows of the embedding table evenly across the parallel devices according to total word frequency, so that the total word frequency on each device is roughly the same. The invention relates to the technical field of deep learning. In this method, the word frequency distribution information of the embedding table is used to partition the table horizontally and evenly by access volume during tensor parallelism, and uniform spreading of the training amount is guar
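
Step S1 as summarized in this record can be illustrated with a minimal Python sketch. The record does not specify the exact greedy rule, so the sketch assumes a common one: visit word ids in descending order of frequency and assign each to the device with the smallest accumulated frequency so far. The function names `count_word_frequencies` and `greedy_partition`, and the representation of a query as a list of word ids, are hypothetical and chosen only for illustration.

```python
import heapq
from collections import Counter


def count_word_frequencies(queries):
    """Scan the data set once and count how often each word id occurs.

    Assumption: each query is an iterable of integer word ids.
    """
    counts = Counter()
    for query in queries:
        counts.update(query)
    return counts


def greedy_partition(freqs, num_devices):
    """Assign embedding-table rows (word ids) to devices.

    Assumed greedy rule (not confirmed by the record): take word ids in
    descending order of frequency and give each one to the device whose
    accumulated frequency is currently smallest.
    """
    # Min-heap of (accumulated frequency, device index).
    heap = [(0, device) for device in range(num_devices)]
    heapq.heapify(heap)
    assignment = {device: [] for device in range(num_devices)}

    # Ties in frequency are broken by word id so the result is deterministic.
    for word_id, freq in sorted(freqs.items(), key=lambda kv: (-kv[1], kv[0])):
        load, device = heapq.heappop(heap)
        assignment[device].append(word_id)
        heapq.heappush(heap, (load + freq, device))

    loads = {d: sum(freqs[w] for w in ids) for d, ids in assignment.items()}
    return assignment, loads


if __name__ == "__main__":
    # Toy data: four queries over word ids 0..3.
    queries = [[0, 1, 2], [0, 1], [0, 3], [0]]
    freqs = count_word_frequencies(queries)
    assignment, loads = greedy_partition(freqs, num_devices=2)
    print(assignment)
    print(loads)
```

In this toy example the single hot word id ends up alone on one device while the three rarer ids share the other, so both devices carry the same total frequency even though they hold different numbers of rows. A real implementation would additionally have to map the assigned ids to contiguous row ranges of each device's table shard, which the record does not describe.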