Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2

The combination of deep learning and sequence data has transformed protein structure prediction and modeling, evidenced in the success of AlphaFold (AF). For this reason, many methods have been developed to take advantage of this success in areas where inaccurate structural modeling may limit comput...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The journal of physical chemistry. B 2022-10, Vol.126 (42), p.8439-8446
Hauptverfasser: He, Jiadi, Turzo, SM Bargeen Alam, Seffernick, Justin T., Kim, Stephanie S., Lindert, Steffen
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 8446
container_issue 42
container_start_page 8439
container_title The journal of physical chemistry. B
container_volume 126
creator He, Jiadi
Turzo, SM Bargeen Alam
Seffernick, Justin T.
Kim, Stephanie S.
Lindert, Steffen
description The combination of deep learning and sequence data has transformed protein structure prediction and modeling, evidenced in the success of AlphaFold (AF). For this reason, many methods have been developed to take advantage of this success in areas where inaccurate structural modeling may limit computational predictiveness. For example, many methods have been developed to predict protein intrinsic disorder from sequence, including our Rosetta ResidueDisorder (RRD) approach. Intrinsically disordered regions in proteins are parts of the sequence that do not form ordered, folded structures under typical physiological conditions. In the original implementation of RRD, Rosetta ab initio models were generated, and disordered regions were predicted based on residue scores (disordered residues typically exist in regions of unfavorable scores). In this work, we show that by (i) replacing the ab initio modeling with AF (using the same scoring and disorder assignment approach) and (ii) updating the score function, the predictiveness improved significantly. Residues were better ranked by the order/disorder, evidenced by an improvement in receiver operating characteristic area-under-the-curve from 0.69 to 0.78 on a large (229 protein) and balanced data set (relatively even ordered versus disordered residues). Finally, the binary prediction accuracy also improved from 62% to 74% on the same data set. Our results show that the combined AF-RRD approach was as good as or better than all existing methods by these metrics (AF-RRD had the highest prediction accuracy).
doi_str_mv 10.1021/acs.jpcb.2c05508
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2725441718</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2725441718</sourcerecordid><originalsourceid>FETCH-LOGICAL-a266t-a7c86806b4ca7ed7fd0cbfce0df4899636e179b4ff09bc6a2d4aa46c5e2d4bb73</originalsourceid><addsrcrecordid>eNp1kD1PwzAQhi0EoqWwM6GMDKTYbmInY1UoVCoCVXS2_Amu0jjYycC_xyWhG8PpTno_pHsAuEZwiiBG91yG6a6RYoolzHNYnIAxyjFM49DT4SYIkhG4CGEHIc5xQc7BaEZwHkU8Bi9vXisrW-vqxJlkVbfe1sHK5MEG55X2yTbY-iPZuKDblicbHazq9FHltUrmVfPJl65S-BKcGV4FfTXsCdguH98Xz-n69Wm1mK9TjglpU05lQQpIRCY51YoaBaUwUkNlsqIsyYxoREuRGQNLIQnHKuM8IzLX8RKCzibgtu9tvPvqdGjZ3gapq4rX2nWBYYrzLEMUFdEKe6v0LgSvDWu83XP_zRBkB4gsQmQHiGyAGCM3Q3sn9lodA3_UouGuN_xGXefr-Oz_fT-bTn6k</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2725441718</pqid></control><display><type>article</type><title>Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2</title><source>ACS Publications</source><source>MEDLINE</source><creator>He, Jiadi ; Turzo, SM Bargeen Alam ; Seffernick, Justin T. ; Kim, Stephanie S. ; Lindert, Steffen</creator><creatorcontrib>He, Jiadi ; Turzo, SM Bargeen Alam ; Seffernick, Justin T. ; Kim, Stephanie S. ; Lindert, Steffen</creatorcontrib><description>The combination of deep learning and sequence data has transformed protein structure prediction and modeling, evidenced in the success of AlphaFold (AF). For this reason, many methods have been developed to take advantage of this success in areas where inaccurate structural modeling may limit computational predictiveness. For example, many methods have been developed to predict protein intrinsic disorder from sequence, including our Rosetta ResidueDisorder (RRD) approach. Intrinsically disordered regions in proteins are parts of the sequence that do not form ordered, folded structures under typical physiological conditions. In the original implementation of RRD, Rosetta ab initio models were generated, and disordered regions were predicted based on residue scores (disordered residues typically exist in regions of unfavorable scores). In this work, we show that by (i) replacing the ab initio modeling with AF (using the same scoring and disorder assignment approach) and (ii) updating the score function, the predictiveness improved significantly. Residues were better ranked by the order/disorder, evidenced by an improvement in receiver operating characteristic area-under-the-curve from 0.69 to 0.78 on a large (229 protein) and balanced data set (relatively even ordered versus disordered residues). Finally, the binary prediction accuracy also improved from 62% to 74% on the same data set. Our results show that the combined AF-RRD approach was as good as or better than all existing methods by these metrics (AF-RRD had the highest prediction accuracy).</description><identifier>ISSN: 1520-6106</identifier><identifier>EISSN: 1520-5207</identifier><identifier>DOI: 10.1021/acs.jpcb.2c05508</identifier><identifier>PMID: 36251522</identifier><language>eng</language><publisher>United States: American Chemical Society</publisher><subject>B: Biophysical and Biochemical Systems and Processes ; Computational Biology ; Protein Conformation ; Proteins - chemistry</subject><ispartof>The journal of physical chemistry. B, 2022-10, Vol.126 (42), p.8439-8446</ispartof><rights>2022 American Chemical Society</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-a266t-a7c86806b4ca7ed7fd0cbfce0df4899636e179b4ff09bc6a2d4aa46c5e2d4bb73</citedby><cites>FETCH-LOGICAL-a266t-a7c86806b4ca7ed7fd0cbfce0df4899636e179b4ff09bc6a2d4aa46c5e2d4bb73</cites><orcidid>0000-0002-3976-3473</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://pubs.acs.org/doi/pdf/10.1021/acs.jpcb.2c05508$$EPDF$$P50$$Gacs$$H</linktopdf><linktohtml>$$Uhttps://pubs.acs.org/doi/10.1021/acs.jpcb.2c05508$$EHTML$$P50$$Gacs$$H</linktohtml><link.rule.ids>314,776,780,2752,27053,27901,27902,56713,56763</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/36251522$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>He, Jiadi</creatorcontrib><creatorcontrib>Turzo, SM Bargeen Alam</creatorcontrib><creatorcontrib>Seffernick, Justin T.</creatorcontrib><creatorcontrib>Kim, Stephanie S.</creatorcontrib><creatorcontrib>Lindert, Steffen</creatorcontrib><title>Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2</title><title>The journal of physical chemistry. B</title><addtitle>J. Phys. Chem. B</addtitle><description>The combination of deep learning and sequence data has transformed protein structure prediction and modeling, evidenced in the success of AlphaFold (AF). For this reason, many methods have been developed to take advantage of this success in areas where inaccurate structural modeling may limit computational predictiveness. For example, many methods have been developed to predict protein intrinsic disorder from sequence, including our Rosetta ResidueDisorder (RRD) approach. Intrinsically disordered regions in proteins are parts of the sequence that do not form ordered, folded structures under typical physiological conditions. In the original implementation of RRD, Rosetta ab initio models were generated, and disordered regions were predicted based on residue scores (disordered residues typically exist in regions of unfavorable scores). In this work, we show that by (i) replacing the ab initio modeling with AF (using the same scoring and disorder assignment approach) and (ii) updating the score function, the predictiveness improved significantly. Residues were better ranked by the order/disorder, evidenced by an improvement in receiver operating characteristic area-under-the-curve from 0.69 to 0.78 on a large (229 protein) and balanced data set (relatively even ordered versus disordered residues). Finally, the binary prediction accuracy also improved from 62% to 74% on the same data set. Our results show that the combined AF-RRD approach was as good as or better than all existing methods by these metrics (AF-RRD had the highest prediction accuracy).</description><subject>B: Biophysical and Biochemical Systems and Processes</subject><subject>Computational Biology</subject><subject>Protein Conformation</subject><subject>Proteins - chemistry</subject><issn>1520-6106</issn><issn>1520-5207</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp1kD1PwzAQhi0EoqWwM6GMDKTYbmInY1UoVCoCVXS2_Amu0jjYycC_xyWhG8PpTno_pHsAuEZwiiBG91yG6a6RYoolzHNYnIAxyjFM49DT4SYIkhG4CGEHIc5xQc7BaEZwHkU8Bi9vXisrW-vqxJlkVbfe1sHK5MEG55X2yTbY-iPZuKDblicbHazq9FHltUrmVfPJl65S-BKcGV4FfTXsCdguH98Xz-n69Wm1mK9TjglpU05lQQpIRCY51YoaBaUwUkNlsqIsyYxoREuRGQNLIQnHKuM8IzLX8RKCzibgtu9tvPvqdGjZ3gapq4rX2nWBYYrzLEMUFdEKe6v0LgSvDWu83XP_zRBkB4gsQmQHiGyAGCM3Q3sn9lodA3_UouGuN_xGXefr-Oz_fT-bTn6k</recordid><startdate>20221027</startdate><enddate>20221027</enddate><creator>He, Jiadi</creator><creator>Turzo, SM Bargeen Alam</creator><creator>Seffernick, Justin T.</creator><creator>Kim, Stephanie S.</creator><creator>Lindert, Steffen</creator><general>American Chemical Society</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-3976-3473</orcidid></search><sort><creationdate>20221027</creationdate><title>Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2</title><author>He, Jiadi ; Turzo, SM Bargeen Alam ; Seffernick, Justin T. ; Kim, Stephanie S. ; Lindert, Steffen</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a266t-a7c86806b4ca7ed7fd0cbfce0df4899636e179b4ff09bc6a2d4aa46c5e2d4bb73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>B: Biophysical and Biochemical Systems and Processes</topic><topic>Computational Biology</topic><topic>Protein Conformation</topic><topic>Proteins - chemistry</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>He, Jiadi</creatorcontrib><creatorcontrib>Turzo, SM Bargeen Alam</creatorcontrib><creatorcontrib>Seffernick, Justin T.</creatorcontrib><creatorcontrib>Kim, Stephanie S.</creatorcontrib><creatorcontrib>Lindert, Steffen</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>The journal of physical chemistry. B</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>He, Jiadi</au><au>Turzo, SM Bargeen Alam</au><au>Seffernick, Justin T.</au><au>Kim, Stephanie S.</au><au>Lindert, Steffen</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2</atitle><jtitle>The journal of physical chemistry. B</jtitle><addtitle>J. Phys. Chem. B</addtitle><date>2022-10-27</date><risdate>2022</risdate><volume>126</volume><issue>42</issue><spage>8439</spage><epage>8446</epage><pages>8439-8446</pages><issn>1520-6106</issn><eissn>1520-5207</eissn><abstract>The combination of deep learning and sequence data has transformed protein structure prediction and modeling, evidenced in the success of AlphaFold (AF). For this reason, many methods have been developed to take advantage of this success in areas where inaccurate structural modeling may limit computational predictiveness. For example, many methods have been developed to predict protein intrinsic disorder from sequence, including our Rosetta ResidueDisorder (RRD) approach. Intrinsically disordered regions in proteins are parts of the sequence that do not form ordered, folded structures under typical physiological conditions. In the original implementation of RRD, Rosetta ab initio models were generated, and disordered regions were predicted based on residue scores (disordered residues typically exist in regions of unfavorable scores). In this work, we show that by (i) replacing the ab initio modeling with AF (using the same scoring and disorder assignment approach) and (ii) updating the score function, the predictiveness improved significantly. Residues were better ranked by the order/disorder, evidenced by an improvement in receiver operating characteristic area-under-the-curve from 0.69 to 0.78 on a large (229 protein) and balanced data set (relatively even ordered versus disordered residues). Finally, the binary prediction accuracy also improved from 62% to 74% on the same data set. Our results show that the combined AF-RRD approach was as good as or better than all existing methods by these metrics (AF-RRD had the highest prediction accuracy).</abstract><cop>United States</cop><pub>American Chemical Society</pub><pmid>36251522</pmid><doi>10.1021/acs.jpcb.2c05508</doi><tpages>8</tpages><orcidid>https://orcid.org/0000-0002-3976-3473</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 1520-6106
ispartof The journal of physical chemistry. B, 2022-10, Vol.126 (42), p.8439-8446
issn 1520-6106
1520-5207
language eng
recordid cdi_proquest_miscellaneous_2725441718
source ACS Publications; MEDLINE
subjects B: Biophysical and Biochemical Systems and Processes
Computational Biology
Protein Conformation
Proteins - chemistry
title Prediction of Intrinsic Disorder Using Rosetta ResidueDisorder and AlphaFold2
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T16%3A36%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Prediction%20of%20Intrinsic%20Disorder%20Using%20Rosetta%20ResidueDisorder%20and%20AlphaFold2&rft.jtitle=The%20journal%20of%20physical%20chemistry.%20B&rft.au=He,%20Jiadi&rft.date=2022-10-27&rft.volume=126&rft.issue=42&rft.spage=8439&rft.epage=8446&rft.pages=8439-8446&rft.issn=1520-6106&rft.eissn=1520-5207&rft_id=info:doi/10.1021/acs.jpcb.2c05508&rft_dat=%3Cproquest_cross%3E2725441718%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2725441718&rft_id=info:pmid/36251522&rfr_iscdi=true