Learning Accurate Integer Transformer Machine-Translation Models

We describe a method for training accurate Transformer machine-translation models to run inference using 8-bit integer (INT8) hardware matrix multipliers, as opposed to the more costly single-precision floating-point (FP32) hardware. Unlike previous work, which converted only 85 Transformer matrix m...

Bibliographic Details
Published in:	SN Computer Science, 2021-07, Vol. 2 (4), p. 291, Article 291
Main author:	Wu, Ephrem
Format:	Article
Language:	English
Subjects:	Accuracy; Computer Imaging; Computer Science; Computer Systems Organization and Communication Networks; Data Structures and Information Theory; Floating point arithmetic; Hardware; Information Systems and Communication Service; Integers; Machine translation; Mathematical analysis; Numbers; Original Research; Pattern Recognition and Graphics; Software Engineering/Programming and Operating Systems; Tensors; Training; Transformers; Vision