FastRM: An efficient and automatic explainability framework for multimodal generative models

While Large Vision Language Models (LVLMs) have become highly capable of reasoning over human prompts and visual inputs, they are still prone to producing responses that contain misinformation. Identifying incorrect responses that are not grounded in evidence has become a crucial task in building...

Bibliographic details
Published in:	arXiv.org 2024-12
Main authors:	Gabriela Ben-Melech Stan, Estelle Aflalo, Man Luo, Shachar Rosenman, Tiep Le, Sayak Paul, Shao-Yen Tseng, Vasudev Lal
Format:	Article
Language:	English
Subjects:	Explainable artificial intelligence; Visual flight; Visual tasks
Online access:	Full text