REUSING WEIGHTS AND BIASES IN AN ARTIFICIAL INTELLIGENCE ACCELERATOR FOR A NEURAL NETWORK FOR DIFFERENT MINIBATCH SIZES OF INFERENCES

Provided are a computer program product, system, and method for reusing weights and biases in an artificial intelligence accelerator for a neural network for different minibatch sizes of inferences. A minibatch size is selected of inference jobs batched to process in the accelerator. A representatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Venkataramani, Swagath, Schaal, Marcel, Srinivasan, Vijayalakshmi, Nagarajan, Amrit, Sen, Sanchari, Ramji, Shyam
Format: Patent
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!