MM-PhyQA: Multimodal Physics Question-Answering With Multi-Image CoT Prompting

While Large Language Models (LLMs) can achieve human-level performance in various tasks, they continue to face challenges when it comes to effectively tackling multi-step physics reasoning tasks. To identify the shortcomings of existing models and facilitate further research in this area, we curated...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2024-04
Hauptverfasser:	Anand, Avinash, Kapuriya, Janak, Singh, Apoorv, Saraf, Jay, Lal, Naman, Verma, Astha, Gupta, Rushali, Shah, Rajiv
Format:	Artikel
Sprache:	eng
Schlagworte:	Datasets Human performance Large language models Performance evaluation Physics Questions
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Schreiben Sie den ersten Kommentar!