Methods, systems, apparatuses, and devices for Sifrian-based neural network training
A method for training a neural network model having layers and parameters, comprises providing an input corresponding to each of samples comprised in a batch of a training dataset to an input layer, obtaining outputs from the neural network model, calculating a loss function for each of the samples...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Patent |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | A method for training a neural network model having layers and parameters, comprises providing an input corresponding to each of samples comprised in a batch of a training dataset to an input layer, obtaining outputs from the neural network model, calculating a loss function for each of the samples based on the outputs and corresponding desired values, and determining values for minimizing a mismatch between the outputs and the corresponding desired values across the samples for the parameters based on the loss function. The determining of the values comprises running a forward model through the layers, determining a Sifrian functional for the layers based on the forward model, backpropgation, and gradient definition, determining equations from the Sifrian functional, performing transformation and optionally reduction to get a pivotal Sifr equation and solving the equation to estimate a second-order update for the purpose of the neural network model training. |
---|