Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning

In the field of chemistry, the objective is to create novel molecules with desired properties, facilitating accurate property predictions for applications such as material design and drug screening. However, existing graph deep learning methods face limitations that curb their expressive power. To a...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2024-08
Hauptverfasser:	Sakhinana Sagar Srinivas, Runkana, Venkataramana
Format:	Artikel
Sprache:	eng
Schlagworte:	Deep learning Effectiveness Graph neural networks Harnesses Large language models Machine learning Molecular properties Predictions Structured data
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Sakhinana Sagar Srinivas Runkana, Venkataramana
description	In the field of chemistry, the objective is to create novel molecules with desired properties, facilitating accurate property predictions for applications such as material design and drug screening. However, existing graph deep learning methods face limitations that curb their expressive power. To address this, we explore the integration of vast molecular domain knowledge from Large Language Models (LLMs) with the complementary strengths of Graph Neural Networks (GNNs) to enhance performance in property prediction tasks. We introduce a Multi-Modal Fusion (MMF) framework that synergistically harnesses the analytical prowess of GNNs and the linguistic generative and predictive abilities of LLMs, thereby improving accuracy and robustness in predicting molecular properties. Our framework combines the effectiveness of GNNs in modeling graph-structured data with the zero-shot and few-shot learning capabilities of LLMs, enabling improved predictions while reducing the risk of overfitting. Furthermore, our approach effectively addresses distributional shifts, a common challenge in real-world applications, and showcases the efficacy of learning cross-modal representations, surpassing state-of-the-art baselines on benchmark datasets for property prediction tasks.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3097957948</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3097957948</sourcerecordid><originalsourceid>FETCH-proquest_journals_30979579483</originalsourceid><addsrcrecordid>eNqNjM8LgjAYhkcQJOX_MOgsrE1Tu0o_DgkdusvQT53YZt_mof--BdG5y_s-8L48CxJwIXZRFnO-IqG1A2OM71OeJCIgqkBjbVSaRo70ChK10h1tDdKih4eyDl_0hmYCdB-ARtVOGX2gV4kd-NTdLD14AYyWlgCOnlFOPS1l3SsNP-mGLFs5Wgi_vSbb0_FeXKIJzXMG66rBzKj9VAmWp3mS5nEm_nu9AVqVR8A</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3097957948</pqid></control><display><type>article</type><title>Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning</title><source>Free E- Journals</source><creator>Sakhinana Sagar Srinivas ; Runkana, Venkataramana</creator><creatorcontrib>Sakhinana Sagar Srinivas ; Runkana, Venkataramana</creatorcontrib><description>In the field of chemistry, the objective is to create novel molecules with desired properties, facilitating accurate property predictions for applications such as material design and drug screening. However, existing graph deep learning methods face limitations that curb their expressive power. To address this, we explore the integration of vast molecular domain knowledge from Large Language Models (LLMs) with the complementary strengths of Graph Neural Networks (GNNs) to enhance performance in property prediction tasks. We introduce a Multi-Modal Fusion (MMF) framework that synergistically harnesses the analytical prowess of GNNs and the linguistic generative and predictive abilities of LLMs, thereby improving accuracy and robustness in predicting molecular properties. Our framework combines the effectiveness of GNNs in modeling graph-structured data with the zero-shot and few-shot learning capabilities of LLMs, enabling improved predictions while reducing the risk of overfitting. Furthermore, our approach effectively addresses distributional shifts, a common challenge in real-world applications, and showcases the efficacy of learning cross-modal representations, surpassing state-of-the-art baselines on benchmark datasets for property prediction tasks.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Deep learning ; Effectiveness ; Graph neural networks ; Harnesses ; Large language models ; Machine learning ; Molecular properties ; Predictions ; Structured data</subject><ispartof>arXiv.org, 2024-08</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by-nc-nd/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Sakhinana Sagar Srinivas</creatorcontrib><creatorcontrib>Runkana, Venkataramana</creatorcontrib><title>Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning</title><title>arXiv.org</title><description>In the field of chemistry, the objective is to create novel molecules with desired properties, facilitating accurate property predictions for applications such as material design and drug screening. However, existing graph deep learning methods face limitations that curb their expressive power. To address this, we explore the integration of vast molecular domain knowledge from Large Language Models (LLMs) with the complementary strengths of Graph Neural Networks (GNNs) to enhance performance in property prediction tasks. We introduce a Multi-Modal Fusion (MMF) framework that synergistically harnesses the analytical prowess of GNNs and the linguistic generative and predictive abilities of LLMs, thereby improving accuracy and robustness in predicting molecular properties. Our framework combines the effectiveness of GNNs in modeling graph-structured data with the zero-shot and few-shot learning capabilities of LLMs, enabling improved predictions while reducing the risk of overfitting. Furthermore, our approach effectively addresses distributional shifts, a common challenge in real-world applications, and showcases the efficacy of learning cross-modal representations, surpassing state-of-the-art baselines on benchmark datasets for property prediction tasks.</description><subject>Deep learning</subject><subject>Effectiveness</subject><subject>Graph neural networks</subject><subject>Harnesses</subject><subject>Large language models</subject><subject>Machine learning</subject><subject>Molecular properties</subject><subject>Predictions</subject><subject>Structured data</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNjM8LgjAYhkcQJOX_MOgsrE1Tu0o_DgkdusvQT53YZt_mof--BdG5y_s-8L48CxJwIXZRFnO-IqG1A2OM71OeJCIgqkBjbVSaRo70ChK10h1tDdKih4eyDl_0hmYCdB-ARtVOGX2gV4kd-NTdLD14AYyWlgCOnlFOPS1l3SsNP-mGLFs5Wgi_vSbb0_FeXKIJzXMG66rBzKj9VAmWp3mS5nEm_nu9AVqVR8A</recordid><startdate>20240827</startdate><enddate>20240827</enddate><creator>Sakhinana Sagar Srinivas</creator><creator>Runkana, Venkataramana</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240827</creationdate><title>Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning</title><author>Sakhinana Sagar Srinivas ; Runkana, Venkataramana</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30979579483</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Deep learning</topic><topic>Effectiveness</topic><topic>Graph neural networks</topic><topic>Harnesses</topic><topic>Large language models</topic><topic>Machine learning</topic><topic>Molecular properties</topic><topic>Predictions</topic><topic>Structured data</topic><toplevel>online_resources</toplevel><creatorcontrib>Sakhinana Sagar Srinivas</creatorcontrib><creatorcontrib>Runkana, Venkataramana</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Sakhinana Sagar Srinivas</au><au>Runkana, Venkataramana</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning</atitle><jtitle>arXiv.org</jtitle><date>2024-08-27</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>In the field of chemistry, the objective is to create novel molecules with desired properties, facilitating accurate property predictions for applications such as material design and drug screening. However, existing graph deep learning methods face limitations that curb their expressive power. To address this, we explore the integration of vast molecular domain knowledge from Large Language Models (LLMs) with the complementary strengths of Graph Neural Networks (GNNs) to enhance performance in property prediction tasks. We introduce a Multi-Modal Fusion (MMF) framework that synergistically harnesses the analytical prowess of GNNs and the linguistic generative and predictive abilities of LLMs, thereby improving accuracy and robustness in predicting molecular properties. Our framework combines the effectiveness of GNNs in modeling graph-structured data with the zero-shot and few-shot learning capabilities of LLMs, enabling improved predictions while reducing the risk of overfitting. Furthermore, our approach effectively addresses distributional shifts, a common challenge in real-world applications, and showcases the efficacy of learning cross-modal representations, surpassing state-of-the-art baselines on benchmark datasets for property prediction tasks.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2024-08
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_3097957948
source	Free E- Journals
subjects	Deep learning Effectiveness Graph neural networks Harnesses Large language models Machine learning Molecular properties Predictions Structured data
title	Cross-Modal Learning for Chemistry Property Prediction: Large Language Models Meet Graph Machine Learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-25T02%3A50%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Cross-Modal%20Learning%20for%20Chemistry%20Property%20Prediction:%20Large%20Language%20Models%20Meet%20Graph%20Machine%20Learning&rft.jtitle=arXiv.org&rft.au=Sakhinana%20Sagar%20Srinivas&rft.date=2024-08-27&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3097957948%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3097957948&rft_id=info:pmid/&rfr_iscdi=true