SAUP: Situation Awareness Uncertainty Propagation on LLM Agent
Large language models (LLMs) integrated into multistep agent systems enable complex decision-making processes across various applications. However, their outputs often lack reliability, making uncertainty estimation crucial. Existing uncertainty estimation methods primarily focus on final-step outpu...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Large language models (LLMs) integrated into multistep agent systems enable
complex decision-making processes across various applications. However, their
outputs often lack reliability, making uncertainty estimation crucial. Existing
uncertainty estimation methods primarily focus on final-step outputs, which
fail to account for cumulative uncertainty over the multistep decision-making
process and the dynamic interactions between agents and their environments. To
address these limitations, we propose SAUP (Situation Awareness Uncertainty
Propagation), a novel framework that propagates uncertainty through each step
of an LLM-based agent's reasoning process. SAUP incorporates situational
awareness by assigning situational weights to each step's uncertainty during
the propagation. Our method, compatible with various one-step uncertainty
estimation techniques, provides a comprehensive and accurate uncertainty
measure. Extensive experiments on benchmark datasets demonstrate that SAUP
significantly outperforms existing state-of-the-art methods, achieving up to
20% improvement in AUROC. |
---|---|
DOI: | 10.48550/arxiv.2412.01033 |