A fine-tuned multimodal large model for power defect image-text question-answering

In power defect detection, the complexity of scenes and the diversity of defects pose challenges for manual defect identification. Considering these issues, this paper proposes utilizing a multimodal large model to assist power professionals in identifying power scenes and defects through image-text...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Signal, image and video processing image and video processing, 2024-12, Vol.18 (12), p.9191-9203
Hauptverfasser: Wang, Qiqi, Zhang, Jie, Du, Jianming, Zhang, Ke, Li, Rui, Zhao, Feng, Zou, Le, Xie, Chengjun
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!