Benchmarking Medical LLMs on Anesthesiology: A Comprehensive Dataset in Chinese

With the recent success of large language models (LLMs), interest in developing them for medical domains has increased. However, due to the lack of benchmark datasets, evaluating the capabilities of medical LLMs remains challenging, particularly in highly specialized fields such as anesthesiology. T...

Bibliographic Details
Published in: IEEE Transactions on Emerging Topics in Computational Intelligence, 2025-01, p.1-15
Main authors: Zhou, Bohao; Zhan, Yibing; Wang, Zhonghai; Li, Yanhong; Zhang, Chong; Yu, Baosheng; Ding, Liang; Jin, Hua; Liu, Weifeng; Wang, Xiongbin; Tao, Dapeng
Format: Article
Language: English