ADAGENT: Anomaly Detection Agent With Multimodal Large Models in Adverse Environments

Multimodal Language Models (MMLMs), such as LLaVA and GPT-4V, have shown zero-shot generalization capabilities for understanding images and text across various domains. However, their effectiveness in open-world visual tasks, particularly anomaly detection under challenging conditions, such as low l...

Bibliographic details
Published in:	IEEE Access 2024, Vol. 12, pp. 172061-172074
Main authors:	Zhang, Miao; Shen, Yiqing; Yin, Jun; Lu, Shuai; Wang, Xueqian
Format:	Article
Language:	English
Subjects:	Accuracy; AI agent; Anomalies; Anomaly detection; Artificial intelligence; Benchmark testing; Benchmarks; Cognition; Context modeling; Error analysis; Error detection; Feature extraction; Image quality; Information retrieval; Lighting; Multimodal language model; Multisensory integration; Performance evaluation; Prompt engineering; Semantics; Training; Visual tasks; Visualization
Online access:	Full text