Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction
Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary...
Gespeichert in:
Veröffentlicht in: | The journal of physical chemistry. B 2024-03, Vol.128 (10), p.2281-2292 |
---|---|
Hauptverfasser: | , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 2292 |
---|---|
container_issue | 10 |
container_start_page | 2281 |
container_title | The journal of physical chemistry. B |
container_volume | 128 |
creator | Wang, Xiaotao Zong, Yuwei Zhou, Xuanjie Xu, Li He, Wei Quan, Shu |
description | Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures. |
doi_str_mv | 10.1021/acs.jpcb.3c06526 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2937701977</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2937701977</sourcerecordid><originalsourceid>FETCH-LOGICAL-a289t-caed333590790e4958899d52d4d5cd2442c0c11d885787cebe0c5df1884a13e93</originalsourceid><addsrcrecordid>eNp1kc1O3DAUhS3UqlDafVfIyy7I1D9xbC_RQOlIIFjQdeTYN9RoEgfb0QhehNeth5miblhY14vznatzD0LfKFlQwugPY9PiYbLdglvSCNYcoCMqGKnKkx_2_4aS5hB9TumBECaYaj6hQ65qLqnkR-jlLGbfe-vNGq_GDOu1v4fRQnUbNhDB4WUYU46zzT6MOPTY4GtvY-i2wM2U_VDmZQyb_AffwTBBNHmOgM9NNp1JgM3o8ConvBomYzMuJhfj89MAb_D_1G3Z6F9XfUEfe7NO8HU_j9Hvnxd3y1_V1c3lanl2VRmmdK6sAcc5F5pITaDWQimtnWCudsI6VtfMEkupU0pIJS10QKxwPVWqNpSD5sfo-853iuFxhpTbwSdbzmBGCHNqmeZSEqqlLFKyk5b4KUXo2ymWBPGppaTd1tGWOtptHe2-joKc7N3nbgD3Bvy7fxGc7gSvaJjjWMK-7_cXfEyZGQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2937701977</pqid></control><display><type>article</type><title>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</title><source>American Chemical Society Journals</source><creator>Wang, Xiaotao ; Zong, Yuwei ; Zhou, Xuanjie ; Xu, Li ; He, Wei ; Quan, Shu</creator><creatorcontrib>Wang, Xiaotao ; Zong, Yuwei ; Zhou, Xuanjie ; Xu, Li ; He, Wei ; Quan, Shu</creatorcontrib><description>Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.</description><identifier>ISSN: 1520-6106</identifier><identifier>EISSN: 1520-5207</identifier><identifier>DOI: 10.1021/acs.jpcb.3c06526</identifier><identifier>PMID: 38437173</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>B: Biophysical and Biochemical Systems and Processes</subject><ispartof>The journal of physical chemistry. B, 2024-03, Vol.128 (10), p.2281-2292</ispartof><rights>2024 American Chemical Society</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-a289t-caed333590790e4958899d52d4d5cd2442c0c11d885787cebe0c5df1884a13e93</cites><orcidid>0000-0001-9234-094X ; 0000-0002-6672-4947</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/acs.jpcb.3c06526$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/acs.jpcb.3c06526$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,776,780,2752,27053,27901,27902,56713,56763</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/38437173$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wang, Xiaotao</creatorcontrib><creatorcontrib>Zong, Yuwei</creatorcontrib><creatorcontrib>Zhou, Xuanjie</creatorcontrib><creatorcontrib>Xu, Li</creatorcontrib><creatorcontrib>He, Wei</creatorcontrib><creatorcontrib>Quan, Shu</creatorcontrib><title>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</title><title>The journal of physical chemistry. B</title><addtitle>J. Phys. Chem. B</addtitle><description>Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.</description><subject>B: Biophysical and Biochemical Systems and Processes</subject><issn>1520-6106</issn><issn>1520-5207</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp1kc1O3DAUhS3UqlDafVfIyy7I1D9xbC_RQOlIIFjQdeTYN9RoEgfb0QhehNeth5miblhY14vznatzD0LfKFlQwugPY9PiYbLdglvSCNYcoCMqGKnKkx_2_4aS5hB9TumBECaYaj6hQ65qLqnkR-jlLGbfe-vNGq_GDOu1v4fRQnUbNhDB4WUYU46zzT6MOPTY4GtvY-i2wM2U_VDmZQyb_AffwTBBNHmOgM9NNp1JgM3o8ConvBomYzMuJhfj89MAb_D_1G3Z6F9XfUEfe7NO8HU_j9Hvnxd3y1_V1c3lanl2VRmmdK6sAcc5F5pITaDWQimtnWCudsI6VtfMEkupU0pIJS10QKxwPVWqNpSD5sfo-853iuFxhpTbwSdbzmBGCHNqmeZSEqqlLFKyk5b4KUXo2ymWBPGppaTd1tGWOtptHe2-joKc7N3nbgD3Bvy7fxGc7gSvaJjjWMK-7_cXfEyZGQ</recordid><startdate>20240314</startdate><enddate>20240314</enddate><creator>Wang, Xiaotao</creator><creator>Zong, Yuwei</creator><creator>Zhou, Xuanjie</creator><creator>Xu, Li</creator><creator>He, Wei</creator><creator>Quan, Shu</creator><general>American Chemical Society</general><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0001-9234-094X</orcidid><orcidid>https://orcid.org/0000-0002-6672-4947</orcidid></search><sort><creationdate>20240314</creationdate><title>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</title><author>Wang, Xiaotao ; Zong, Yuwei ; Zhou, Xuanjie ; Xu, Li ; He, Wei ; Quan, Shu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a289t-caed333590790e4958899d52d4d5cd2442c0c11d885787cebe0c5df1884a13e93</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>B: Biophysical and Biochemical Systems and Processes</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wang, Xiaotao</creatorcontrib><creatorcontrib>Zong, Yuwei</creatorcontrib><creatorcontrib>Zhou, Xuanjie</creatorcontrib><creatorcontrib>Xu, Li</creatorcontrib><creatorcontrib>He, Wei</creatorcontrib><creatorcontrib>Quan, Shu</creatorcontrib><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>The journal of physical chemistry. B</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wang, Xiaotao</au><au>Zong, Yuwei</au><au>Zhou, Xuanjie</au><au>Xu, Li</au><au>He, Wei</au><au>Quan, Shu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction</atitle><jtitle>The journal of physical chemistry. B</jtitle><addtitle>J. Phys. Chem. B</addtitle><date>2024-03-14</date><risdate>2024</risdate><volume>128</volume><issue>10</issue><spage>2281</spage><epage>2292</epage><pages>2281-2292</pages><issn>1520-6106</issn><eissn>1520-5207</eissn><abstract>Accurate prediction of enzyme optimal temperature (Topt) is crucial for identifying enzymes suitable for catalytic functions under extreme bioprocessing conditions. The optimal growth temperature (OGT) of microorganisms serves as a key indicator for estimating enzyme Topt, reflecting an evolutionary temperature balance between enzyme-catalyzed reactions and the organism’s growth environments. Existing OGT databases, collected from culture collection centers, often fall short as culture temperature does not precisely represent the OGT. Models trained on such databases yield inadequate accuracy in enzyme Topt prediction, underscoring the need for a high-quality OGT database. Herein, we developed AI-based models to extract the OGT information from the scientific literature, constructing a comprehensive OGT database with 1155 unique organisms and 2142 OGT values. The top-performing model, BioLinkBERT, demonstrated exceptional information extraction ability with an EM score of 91.00 and an F1 score of 91.91 for OGT. Notably, applying this OGT database in enzyme Topt prediction achieved an R 2 value of 0.698, outperforming the R 2 value of 0.686 obtained using culture temperature. This emphasizes the superiority of the OGT database in predicting the enzyme Topt and underscores its pivotal role in identifying enzymes with optimal catalytic temperatures.</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>38437173</pmid><doi>10.1021/acs.jpcb.3c06526</doi><tpages>12</tpages><orcidid>https://orcid.org/0000-0001-9234-094X</orcidid><orcidid>https://orcid.org/0000-0002-6672-4947</orcidid></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1520-6106 |
ispartof | The journal of physical chemistry. B, 2024-03, Vol.128 (10), p.2281-2292 |
issn | 1520-6106 1520-5207 |
language | eng |
recordid | cdi_proquest_miscellaneous_2937701977 |
source | American Chemical Society Journals |
subjects | B: Biophysical and Biochemical Systems and Processes |
title | Artificial Intelligence-Powered Construction of a Microbial Optimal Growth Temperature Database and Its Impact on Enzyme Optimal Temperature Prediction |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T05%3A43%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Artificial%20Intelligence-Powered%20Construction%20of%20a%20Microbial%20Optimal%20Growth%20Temperature%20Database%20and%20Its%20Impact%20on%20Enzyme%20Optimal%20Temperature%20Prediction&rft.jtitle=The%20journal%20of%20physical%20chemistry.%20B&rft.au=Wang,%20Xiaotao&rft.date=2024-03-14&rft.volume=128&rft.issue=10&rft.spage=2281&rft.epage=2292&rft.pages=2281-2292&rft.issn=1520-6106&rft.eissn=1520-5207&rft_id=info:doi/10.1021/acs.jpcb.3c06526&rft_dat=%3Cproquest_cross%3E2937701977%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2937701977&rft_id=info:pmid/38437173&rfr_iscdi=true |