A Mathematical Investigation of Hallucination and Creativity in GPT Models

Bibliographic details
Published in:	Mathematics (Basel) 2023-05, Vol.11 (10), p.2320
First author:	Lee, Minhyeok
Format:	Article
Language:	English
Subjects:	ChatGPT; Computational linguistics; Creative ability; Creativity; Data mining; Datasets; generative pretrained transformers; GPT; hallucination; Hallucinations; Information theory; Investigations; Language processing; large language model; Large language models; LLM; Mathematical analysis; Mathematics; Natural language; Natural language interfaces; Numerical analysis; Optimization; Probability; Probability distribution; Probability theory; R&D; Research & development

Description
Abstract:	In this paper, we present a comprehensive mathematical analysis of the hallucination phenomenon in generative pretrained transformer (GPT) models. We rigorously define and measure hallucination and creativity using concepts from probability theory and information theory. By introducing a parametric family of GPT models, we characterize the trade-off between hallucination and creativity and identify an optimal balance that maximizes model performance across various tasks. Our work offers a novel mathematical framework for understanding the origins and implications of hallucination in GPT models and paves the way for future research and development in the field of large language models (LLMs).
ISSN:	2227-7390
DOI:	10.3390/math11102320