Benchmarking Medical LLMs on Anesthesiology: A Comprehensive Dataset in Chinese

With the recent success of large language models (LLMs), interest in developing them for medical domains has increased. However, due to the lack of benchmark datasets, evaluating the capabilities of medical LLMs remains challenging, particularly in highly specialized fields such as anesthesiology. T...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on emerging topics in computational intelligence 2025-01, p.1-15
Hauptverfasser:	Zhou, Bohao, Zhan, Yibing, Wang, Zhonghai, Li, Yanhong, Zhang, Chong, Yu, Baosheng, Ding, Liang, Jin, Hua, Liu, Weifeng, Wang, Xiongbin, Tao, Dapeng
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Anesthesia Anesthesiology benchmark Benchmark testing Data collection dataset evaluation Hospitals large language model medicine Physiology Question answering (information retrieval) Safety Tag clouds
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Schreiben Sie den ersten Kommentar!