Transformers as a Physical Model in AI

The structure of a transformer (an artificial intelligence model based on attention models) is considered. Analogies with physical models are discussed (‘‘transformer as an evolution operator for a system of attention models as a Hamiltonian’’), with the approaches by Yu.I. Manin (‘‘renormalization...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Lobachevskii journal of mathematics 2024-02, Vol.45 (2), p.710-717
1. Verfasser: Kozyrev, S. V.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The structure of a transformer (an artificial intelligence model based on attention models) is considered. Analogies with physical models are discussed (‘‘transformer as an evolution operator for a system of attention models as a Hamiltonian’’), with the approaches by Yu.I. Manin (‘‘renormalization and computations’’) and M. Marcolli (‘‘generative linguistics as algebraic and physical model’’), and with the concept of a genome as a ‘‘gas of interacting genes’’ (E.V. Koonin).
ISSN:1995-0802
1818-9962
DOI:10.1134/S1995080224600353