Barzilai–Borwein-based adaptive learning rate for deep learning

•A Barzilai–Borwein-based method for adaptive learning rate of training DNNs.•The method is highly insensitive to initial learning rate which greatly reduces computational effort.•The method has its advantage over others on learning speed and generalization performance.•Convergence guarantee of the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pattern recognition letters 2019-12, Vol.128, p.197-203
Hauptverfasser: Liang, Jinxiu, Xu, Yong, Bao, Chenglong, Quan, Yuhui, Ji, Hui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A Barzilai–Borwein-based method for adaptive learning rate of training DNNs.•The method is highly insensitive to initial learning rate which greatly reduces computational effort.•The method has its advantage over others on learning speed and generalization performance.•Convergence guarantee of the method for training DNNs. Learning rate is arguably the most important hyper-parameter to tune when training a neural network. As manually setting right learning rate remains a cumbersome process, adaptive learning rate algorithms aim at automating such a process. Motivated by the success of the Barzilai–Borwein (BB) step-size method in many gradient descent methods for solving convex problems, this paper aims at investigating the potential of the BB method for training neural networks. With strong motivation from related convergence analysis, the BB method is generalized to adaptive learning rate of mini-batch gradient descent. The experiments showed that, in contrast to many existing methods, the proposed BB method is highly insensitive to initial learning rate, especially in terms of generalization performance. Also, the BB method showed its advantages on both learning speed and generalization performance over other available methods.
ISSN:0167-8655
1872-7344
DOI:10.1016/j.patrec.2019.08.029