Transformers as a Physical Model in AI
Published in: | Lobachevskii Journal of Mathematics, 2024-02, Vol. 45 (2), p. 710-717 |
---|---|
Main author: | |
Format: | Article |
Language: | English |
Subjects: | |
Online access: | Full text |
Abstract: | The structure of a transformer (an artificial intelligence model based on attention models) is considered. Analogies are discussed with physical models ("transformer as an evolution operator for a system of attention models as a Hamiltonian"), with the approaches of Yu.I. Manin ("renormalization and computations") and M. Marcolli ("generative linguistics as algebraic and physical model"), and with the concept of a genome as a "gas of interacting genes" (E.V. Koonin). |
---|---|
ISSN: | 1995-0802, 1818-9962 |
DOI: | 10.1134/S1995080224600353 |