Boosting the performance of molecular property prediction via graph–text alignment and multi-granularity representation enhancement

Deep learning is playing an increasingly important role in accurate prediction of molecular properties. Prior to being processed by a deep learning model, a molecule is typically represented in the form of a text or a graph. While some methods attempt to integrate these two forms of molecular repres...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Journal of molecular graphics & modelling 2024-11, Vol.132, p.108843, Article 108843
Hauptverfasser:	Zhao, Zhuoran, Zhou, Qing, Wu, Chengkai, Su, Renbin, Xiong, Weihong
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Align Deep Learning Models, Molecular Molecular property prediction Multi-granularity Multimodal
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Deep learning is playing an increasingly important role in accurate prediction of molecular properties. Prior to being processed by a deep learning model, a molecule is typically represented in the form of a text or a graph. While some methods attempt to integrate these two forms of molecular representations, the misalignment of graph and text embeddings presents a significant challenge to fuse two modalities. To solve this problem, we propose a method that aligns and fuses graph and text features in the embedding space by using contrastive loss and cross attentions. Additionally, we enhance the molecular representation by incorporating multi-granularity information of molecules on the levels of atoms, functional groups, and molecules. Extensive experiments show that our model outperforms state-of-the-art models in downstream tasks of molecular property prediction, achieving superior performance with less pretraining data. The source codes and data are available at https://github.com/zzr624663649/multimodal_molecular_property. [Display omitted]
ISSN:	1093-3263 1873-4243 1873-4243
DOI:	10.1016/j.jmgm.2024.108843