Methods, systems, apparatuses, and devices for Sifrian-based neural network training

A method for training a neural network model having layers and parameters, comprises providing an input corresponding to each of samples comprised in a batch of a training dataset to an input layer, obtaining outputs from the neural network model, calculating a loss function for each of the samples...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
1. Verfasser: Mehouachi, Fares
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A method for training a neural network model having layers and parameters, comprises providing an input corresponding to each of samples comprised in a batch of a training dataset to an input layer, obtaining outputs from the neural network model, calculating a loss function for each of the samples based on the outputs and corresponding desired values, and determining values for minimizing a mismatch between the outputs and the corresponding desired values across the samples for the parameters based on the loss function. The determining of the values comprises running a forward model through the layers, determining a Sifrian functional for the layers based on the forward model, backpropgation, and gradient definition, determining equations from the Sifrian functional, performing transformation and optionally reduction to get a pivotal Sifr equation and solving the equation to estimate a second-order update for the purpose of the neural network model training.