Comprehensive evaluation of molecule property prediction with ChatGPT

Bibliographic details
Published in:	Methods (San Diego, Calif.), 2024-02, Vol.222, p.133-141
Main authors:	Cai, Xibao; Lai, Houtim; Wang, Xing; Wang, Longyue; Liu, Wei; Wang, Yijun; Wang, Zixu; Cao, Dongsheng; Zeng, Xiangxiang
Format:	Article
Language:	English
Subjects:	ChatGPT; Drug Discovery; Evaluation; Few-shot; Molecule property prediction; Prompt; Research Design
Online access:	Full text

Description
Abstract:	The versatility of ChatGPT in performing a diverse range of tasks has elicited considerable interest on its potential applications within professional fields. Taking drug discovery as a testbed, this paper provides a comprehensive evaluation of ChatGPT's ability on molecule property prediction. The study focuses on three aspects: 1) Effects of different prompt settings, where we investigate the impact of varying prompts on the prediction outcomes of ChatGPT; 2) Comprehensive evaluation on molecule property prediction, where we conduct a comprehensive evaluation on 53 ADMET-related endpoints; 3) Analysis of ChatGPT's potential and limitations, where we make comparisons with models tailored for molecule property prediction, thus gaining a more accurate understanding of ChatGPT's capabilities and limitations in this area. Through comprehensive evaluation, we find that 1) With appropriate prompt settings, ChatGPT can attain satisfactory prediction outcomes that are competitive with specialized models designed for those tasks. 2) Prompt settings significantly affect ChatGPT's performance. Among all prompt settings, the strategy of selecting examples in few-shot has the greatest impact on results. Scaffold sampling greatly outperforms random sampling. 3) The capacity of ChatGPT to accomplish high-precision predictions is significantly influenced by the quality of examples provided, which may constrain its practical applicability in real-world scenarios. This work highlights ChatGPT's potential and limitations on molecule property prediction, which we hope can inspire future design and evaluation of Large Language Models within scientific domains.
• ChatGPT can achieve satisfactory results on molecule property prediction with proper prompt settings.
• Zero-shot prompts cause high ChatGPT rejection; few-shot prompts effectively resolve this.
• The strategy of selecting examples in few-shot has the greatest impact on ChatGPT's performance among all prompt settings.
• ChatGPT is essentially insensitive to the balance of the datasets.
ISSN:	1046-2023, 1095-9130
DOI:	10.1016/j.ymeth.2024.01.004
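
The abstract contrasts scaffold sampling with random sampling when choosing few-shot examples, but this record does not describe how either is implemented. The following is only a minimal sketch of one plausible reading: examples that share the query molecule's scaffold are preferred, and the rest are drawn at random. The names `Example`, `select_examples`, and `build_prompt`, the prompt wording, and the assumption that scaffold strings are precomputed (for instance as Murcko scaffolds from a cheminformatics toolkit) are all hypothetical and not taken from the paper.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Example:
    smiles: str    # molecule as a SMILES string
    scaffold: str  # ASSUMPTION: scaffold string precomputed elsewhere
    label: str     # property label as text, e.g. "Yes" / "No"


def select_examples(pool, query_scaffold, k, rng, by_scaffold=True):
    """Pick up to k few-shot examples from the labeled pool.

    by_scaffold=True: examples sharing the query's scaffold come first,
    then randomly ordered examples from other scaffolds fill the rest.
    by_scaffold=False: plain random sampling without replacement.
    """
    if not by_scaffold:
        return rng.sample(pool, min(k, len(pool)))
    same = [e for e in pool if e.scaffold == query_scaffold]
    other = [e for e in pool if e.scaffold != query_scaffold]
    rng.shuffle(same)
    rng.shuffle(other)
    return (same + other)[:k]


def build_prompt(task, examples, query_smiles):
    """Assemble a few-shot prompt: task text, worked examples, open query."""
    lines = [task, ""]
    for e in examples:
        lines.append(f"SMILES: {e.smiles}")
        lines.append(f"Answer: {e.label}")
        lines.append("")
    lines.append(f"SMILES: {query_smiles}")
    lines.append("Answer:")
    return "\n".join(lines)


if __name__ == "__main__":
    # Toy pool; the labels are placeholders, not measured data.
    pool = [
        Example("c1ccccc1O", "c1ccccc1", "Yes"),
        Example("c1ccccc1N", "c1ccccc1", "No"),
        Example("OC1CCCCC1", "C1CCCCC1", "No"),
        Example("CCO", "", "Yes"),
    ]
    rng = random.Random(0)
    shots = select_examples(pool, "c1ccccc1", k=2, rng=rng)
    print(build_prompt("Does the molecule have the property? Answer Yes or No.",
                       shots, "c1ccccc1C"))
```

Passing `by_scaffold=False` gives the random-sampling baseline with the same prompt template, so the two selection strategies can be compared on equal footing.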