TOTAL MOMENTUM FEEDBACK LOOP FOR BETTER ASGD GENERALIZATION
A system and a method for distributed training of a machine learning model are disclosed. They are characterized by using feedback loop for total momentum control, and comprising detection of ASGD executions with compromised generalization, a feedback loop for adjusting the total momentum of ASGD ex...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Patent |
Sprache: | eng ; fre ; ger |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A system and a method for distributed training of a machine learning model are disclosed. They are characterized by using feedback loop for total momentum control, and comprising detection of ASGD executions with compromised generalization, a feedback loop for adjusting the total momentum of ASGD execution, i.e. the sum of the explicit, parametric momentum, and the implicit momentum, related to gradient staleness, toward zero, and tuning ASGD hyperparameters, in accordance with total momentum minimization. |
---|