Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction

Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The journal of physical chemistry. B 2024-03, Vol.128 (10), p.2281-2292
Hauptverfasser: Wang, Xiaotao, Zong, Yuwei, Zhou, Xuanjie, Xu, Li, He, Wei, Quan, Shu
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2292
container_issue 10
container_start_page 2281
container_title The journal of physical chemistry. B
container_volume 128
creator Wang, Xiaotao
Zong, Yuwei
Zhou, Xuanjie
Xu, Li
He, Wei
Quan, Shu
description Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.
doi_str_mv 10.1021/acs.jpcb.3c06526
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2937701977</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2937701977</sourcerecordid><originalsourceid>FETCH-LOGICAL-a289t-caed333590790e4958899d52d4d5cd2442c0c11d885787cebe0c5df1884a13e93</originalsourceid><addsrcrecordid>eNp1kc1O3DAUhS3UqlDafVfIyy7I1D9xbC_RQOlIIFjQdeTYN9RoEgfb0QhehNeth5miblhY14vznatzD0LfKFlQwugPY9PiYbLdglvSCNYcoCMqGKnKkx_2_4aS5hB9TumBECaYaj6hQ65qLqnkR-jlLGbfe-vNGq_GDOu1v4fRQnUbNhDB4WUYU46zzT6MOPTY4GtvY-i2wM2U_VDmZQyb_AffwTBBNHmOgM9NNp1JgM3o8ConvBomYzMuJhfj89MAb_D_1G3Z6F9XfUEfe7NO8HU_j9Hvnxd3y1_V1c3lanl2VRmmdK6sAcc5F5pITaDWQimtnWCudsI6VtfMEkupU0pIJS10QKxwPVWqNpSD5sfo-853iuFxhpTbwSdbzmBGCHNqmeZSEqqlLFKyk5b4KUXo2ymWBPGppaTd1tGWOtptHe2-joKc7N3nbgD3Bvy7fxGc7gSvaJjjWMK-7_cXfEyZGQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2937701977</pqid></control><display><type>article</type><title>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</title><source>American Chemical Society Journals</source><creator>Wang, Xiaotao ; Zong, Yuwei ; Zhou, Xuanjie ; Xu, Li ; He, Wei ; Quan, Shu</creator><creatorcontrib>Wang, Xiaotao ; Zong, Yuwei ; Zhou, Xuanjie ; Xu, Li ; He, Wei ; Quan, Shu</creatorcontrib><description>Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.</description><identifier>ISSN: 1520-6106</identifier><identifier>EISSN: 1520-5207</identifier><identifier>DOI: 10.1021/acs.jpcb.3c06526</identifier><identifier>PMID: 38437173</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>B: Biophysical and Biochemical Systems and Processes</subject><ispartof>The journal of physical chemistry. B, 2024-03, Vol.128 (10), p.2281-2292</ispartof><rights>2024 American Chemical Society</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a289t-caed333590790e4958899d52d4d5cd2442c0c11d885787cebe0c5df1884a13e93</cites><orcidid>0000-0001-9234-094X ; 0000-0002-6672-4947</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/acs.jpcb.3c06526$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/acs.jpcb.3c06526$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,776,780,2752,27053,27901,27902,56713,56763</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38437173$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Xiaotao</creatorcontrib><creatorcontrib>Zong, Yuwei</creatorcontrib><creatorcontrib>Zhou, Xuanjie</creatorcontrib><creatorcontrib>Xu, Li</creatorcontrib><creatorcontrib>He, Wei</creatorcontrib><creatorcontrib>Quan, Shu</creatorcontrib><title>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</title><title>The journal of physical chemistry. B</title><addtitle>J. Phys. Chem. B</addtitle><description>Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.</description><subject>B: Biophysical and Biochemical Systems and Processes</subject><issn>1520-6106</issn><issn>1520-5207</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp1kc1O3DAUhS3UqlDafVfIyy7I1D9xbC_RQOlIIFjQdeTYN9RoEgfb0QhehNeth5miblhY14vznatzD0LfKFlQwugPY9PiYbLdglvSCNYcoCMqGKnKkx_2_4aS5hB9TumBECaYaj6hQ65qLqnkR-jlLGbfe-vNGq_GDOu1v4fRQnUbNhDB4WUYU46zzT6MOPTY4GtvY-i2wM2U_VDmZQyb_AffwTBBNHmOgM9NNp1JgM3o8ConvBomYzMuJhfj89MAb_D_1G3Z6F9XfUEfe7NO8HU_j9Hvnxd3y1_V1c3lanl2VRmmdK6sAcc5F5pITaDWQimtnWCudsI6VtfMEkupU0pIJS10QKxwPVWqNpSD5sfo-853iuFxhpTbwSdbzmBGCHNqmeZSEqqlLFKyk5b4KUXo2ymWBPGppaTd1tGWOtptHe2-joKc7N3nbgD3Bvy7fxGc7gSvaJjjWMK-7_cXfEyZGQ</recordid><startdate>20240314</startdate><enddate>20240314</enddate><creator>Wang, Xiaotao</creator><creator>Zong, Yuwei</creator><creator>Zhou, Xuanjie</creator><creator>Xu, Li</creator><creator>He, Wei</creator><creator>Quan, Shu</creator><general>American Chemical Society</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9234-094X</orcidid><orcidid>https://orcid.org/0000-0002-6672-4947</orcidid></search><sort><creationdate>20240314</creationdate><title>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</title><author>Wang, Xiaotao ; Zong, Yuwei ; Zhou, Xuanjie ; Xu, Li ; He, Wei ; Quan, Shu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a289t-caed333590790e4958899d52d4d5cd2442c0c11d885787cebe0c5df1884a13e93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>B: Biophysical and Biochemical Systems and Processes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Xiaotao</creatorcontrib><creatorcontrib>Zong, Yuwei</creatorcontrib><creatorcontrib>Zhou, Xuanjie</creatorcontrib><creatorcontrib>Xu, Li</creatorcontrib><creatorcontrib>He, Wei</creatorcontrib><creatorcontrib>Quan, Shu</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>The journal of physical chemistry. B</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Xiaotao</au><au>Zong, Yuwei</au><au>Zhou, Xuanjie</au><au>Xu, Li</au><au>He, Wei</au><au>Quan, Shu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</atitle><jtitle>The journal of physical chemistry. B</jtitle><addtitle>J. Phys. Chem. B</addtitle><date>2024-03-14</date><risdate>2024</risdate><volume>128</volume><issue>10</issue><spage>2281</spage><epage>2292</epage><pages>2281-2292</pages><issn>1520-6106</issn><eissn>1520-5207</eissn><abstract>Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>38437173</pmid><doi>10.1021/acs.jpcb.3c06526</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-9234-094X</orcidid><orcidid>https://orcid.org/0000-0002-6672-4947</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1520-6106
ispartof The journal of physical chemistry. B, 2024-03, Vol.128 (10), p.2281-2292
issn 1520-6106
1520-5207
language eng
recordid cdi_proquest_miscellaneous_2937701977
source American Chemical Society Journals
subjects B: Biophysical and Biochemical Systems and Processes
title Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T05%3A43%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Artificial%20Intelligence-Powered%20Construction%20of%20a%20Microbial%20Optimal%20Growth%20Temperature%20Database%20and%20Its%20Impact%20on%20Enzyme%20Optimal%20Temperature%20Prediction&rft.jtitle=The%20journal%20of%20physical%20chemistry.%20B&rft.au=Wang,%20Xiaotao&rft.date=2024-03-14&rft.volume=128&rft.issue=10&rft.spage=2281&rft.epage=2292&rft.pages=2281-2292&rft.issn=1520-6106&rft.eissn=1520-5207&rft_id=info:doi/10.1021/acs.jpcb.3c06526&rft_dat=%3Cproquest_cross%3E2937701977%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2937701977&rft_id=info:pmid/38437173&rfr_iscdi=true