Flavors of protein disorder

Intrinsically disordered proteins are characterized by long regions lacking 3‐D structure in their native states, yet they have been so far associated with 28 distinguishable functions. Previous studies showed that protein predictors trained on disorder from one type of protein often achieve poor ac...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Proteins, structure, function, and bioinformatics structure, function, and bioinformatics, 2003-09, Vol.52 (4), p.573-584
Hauptverfasser: Vucetic, Slobodan, Brown, Celeste J., Dunker, A. Keith, Obradovic, Zoran
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 584
container_issue 4
container_start_page 573
container_title Proteins, structure, function, and bioinformatics
container_volume 52
creator Vucetic, Slobodan
Brown, Celeste J.
Dunker, A. Keith
Obradovic, Zoran
description Intrinsically disordered proteins are characterized by long regions lacking 3‐D structure in their native states, yet they have been so far associated with 28 distinguishable functions. Previous studies showed that protein predictors trained on disorder from one type of protein often achieve poor accuracy on disorder of proteins of a different type, thus indicating significant differences in sequence properties among disordered proteins. Important biological problems are identifying different types, or flavors, of disorder and examining their relationships with protein function. Innovative use of computational methods is needed in addressing these problems due to relative scarcity of experimental data and background knowledge related to protein disorder. We developed an algorithm that partitions protein disorder into flavors based on competition among increasing numbers of predictors, with prediction accuracy determining both the number of distinct predictors and the partitioning of the individual proteins. Using 145 variously characterized proteins with long (>30 amino acids) disordered regions, 3 flavors, called V, C, and S, were identified by this approach, with the V subset containing 52 segments and 7743 residues, C containing 39 segments and 3402 residues, and S containing 54 segments and 5752 residues. The V, C, and S flavors were distinguishable by amino acid compositions, sequence locations, and biological function. For the sequences in SwissProt and 28 genomes, their protein functions exhibit correlations with the commonness and usage of different disorder flavors, suggesting different flavor‐function sets across these protein groups. Overall, the results herein support the flavor‐function approach as a useful complement to structural genomics as a means for automatically assigning possible functions to sequences. Proteins 2003;52:573–584. © 2003 Wiley‐Liss, Inc.
doi_str_mv 10.1002/prot.10437
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_73537471</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>73537471</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4297-8877828d9519c7493d3e407add0442eb713af754aa8a194ddfce80b5acb82c683</originalsourceid><addsrcrecordid>eNp9kEFPAjEQRhujEUQvXk0MJw8mq9Ntu22PSgRNiCDB6K3ptt1kdWGxBZV_b3FRb55mDm_ezDcIHWO4wADp5cLXy9hRwndQG4PkCWBCd1EbhOAJYYK10EEILwCQSZLtoxZOZeQZb6OTfqXfax-6ddHdeFw579oy1N46f4j2Cl0Fd7StHfTYv5n2bpPhaHDXuxomhqZxWVzCRSqsZFgaTiWxxFHg2lqgNHU5x0QXnFGthcaSWlsYJyBn2uQiNZkgHXTWeOMBbysXlmpWBuOqSs9dvQqKE0Y4jZoOOm9A4-sQvCvUwpcz7dcKg9q8Qm0iqO9XRPh0a13lM2f_0G32COAG-Cgrt_5HpcaT0fRHmjQzZVi6z98Z7V9Vxgln6ul-oLLr50n_4Z6pMfkCqeV26Q</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>73537471</pqid></control><display><type>article</type><title>Flavors of protein disorder</title><source>MEDLINE</source><source>Wiley Online Library Journals Frontfile Complete</source><creator>Vucetic, Slobodan ; Brown, Celeste J. ; Dunker, A. Keith ; Obradovic, Zoran</creator><creatorcontrib>Vucetic, Slobodan ; Brown, Celeste J. ; Dunker, A. Keith ; Obradovic, Zoran</creatorcontrib><description>Intrinsically disordered proteins are characterized by long regions lacking 3‐D structure in their native states, yet they have been so far associated with 28 distinguishable functions. Previous studies showed that protein predictors trained on disorder from one type of protein often achieve poor accuracy on disorder of proteins of a different type, thus indicating significant differences in sequence properties among disordered proteins. Important biological problems are identifying different types, or flavors, of disorder and examining their relationships with protein function. Innovative use of computational methods is needed in addressing these problems due to relative scarcity of experimental data and background knowledge related to protein disorder. We developed an algorithm that partitions protein disorder into flavors based on competition among increasing numbers of predictors, with prediction accuracy determining both the number of distinct predictors and the partitioning of the individual proteins. Using 145 variously characterized proteins with long (&gt;30 amino acids) disordered regions, 3 flavors, called V, C, and S, were identified by this approach, with the V subset containing 52 segments and 7743 residues, C containing 39 segments and 3402 residues, and S containing 54 segments and 5752 residues. The V, C, and S flavors were distinguishable by amino acid compositions, sequence locations, and biological function. For the sequences in SwissProt and 28 genomes, their protein functions exhibit correlations with the commonness and usage of different disorder flavors, suggesting different flavor‐function sets across these protein groups. Overall, the results herein support the flavor‐function approach as a useful complement to structural genomics as a means for automatically assigning possible functions to sequences. Proteins 2003;52:573–584. © 2003 Wiley‐Liss, Inc.</description><identifier>ISSN: 0887-3585</identifier><identifier>EISSN: 1097-0134</identifier><identifier>DOI: 10.1002/prot.10437</identifier><identifier>PMID: 12910457</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc., A Wiley Company</publisher><subject>Algorithms ; Archaeal Proteins - chemistry ; Bacterial Proteins - chemistry ; Computational Biology - methods ; Databases, Protein ; Drosophila Proteins - chemistry ; Fungal Proteins - chemistry ; genomics ; Helminth Proteins - chemistry ; Plant Proteins - chemistry ; Protein Conformation ; Protein Folding ; Protein Structure, Secondary ; Proteins - chemistry ; Reproducibility of Results ; secondary structure ; sequence composition ; signaling and regulatory proteins ; structure prediction ; unfolded proteins</subject><ispartof>Proteins, structure, function, and bioinformatics, 2003-09, Vol.52 (4), p.573-584</ispartof><rights>Copyright © 2003 Wiley‐Liss, Inc.</rights><rights>Copyright 2003 Wiley-Liss, Inc.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4297-8877828d9519c7493d3e407add0442eb713af754aa8a194ddfce80b5acb82c683</citedby><cites>FETCH-LOGICAL-c4297-8877828d9519c7493d3e407add0442eb713af754aa8a194ddfce80b5acb82c683</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fprot.10437$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fprot.10437$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,777,781,1412,27905,27906,45555,45556</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/12910457$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Vucetic, Slobodan</creatorcontrib><creatorcontrib>Brown, Celeste J.</creatorcontrib><creatorcontrib>Dunker, A. Keith</creatorcontrib><creatorcontrib>Obradovic, Zoran</creatorcontrib><title>Flavors of protein disorder</title><title>Proteins, structure, function, and bioinformatics</title><addtitle>Proteins</addtitle><description>Intrinsically disordered proteins are characterized by long regions lacking 3‐D structure in their native states, yet they have been so far associated with 28 distinguishable functions. Previous studies showed that protein predictors trained on disorder from one type of protein often achieve poor accuracy on disorder of proteins of a different type, thus indicating significant differences in sequence properties among disordered proteins. Important biological problems are identifying different types, or flavors, of disorder and examining their relationships with protein function. Innovative use of computational methods is needed in addressing these problems due to relative scarcity of experimental data and background knowledge related to protein disorder. We developed an algorithm that partitions protein disorder into flavors based on competition among increasing numbers of predictors, with prediction accuracy determining both the number of distinct predictors and the partitioning of the individual proteins. Using 145 variously characterized proteins with long (&gt;30 amino acids) disordered regions, 3 flavors, called V, C, and S, were identified by this approach, with the V subset containing 52 segments and 7743 residues, C containing 39 segments and 3402 residues, and S containing 54 segments and 5752 residues. The V, C, and S flavors were distinguishable by amino acid compositions, sequence locations, and biological function. For the sequences in SwissProt and 28 genomes, their protein functions exhibit correlations with the commonness and usage of different disorder flavors, suggesting different flavor‐function sets across these protein groups. Overall, the results herein support the flavor‐function approach as a useful complement to structural genomics as a means for automatically assigning possible functions to sequences. Proteins 2003;52:573–584. © 2003 Wiley‐Liss, Inc.</description><subject>Algorithms</subject><subject>Archaeal Proteins - chemistry</subject><subject>Bacterial Proteins - chemistry</subject><subject>Computational Biology - methods</subject><subject>Databases, Protein</subject><subject>Drosophila Proteins - chemistry</subject><subject>Fungal Proteins - chemistry</subject><subject>genomics</subject><subject>Helminth Proteins - chemistry</subject><subject>Plant Proteins - chemistry</subject><subject>Protein Conformation</subject><subject>Protein Folding</subject><subject>Protein Structure, Secondary</subject><subject>Proteins - chemistry</subject><subject>Reproducibility of Results</subject><subject>secondary structure</subject><subject>sequence composition</subject><subject>signaling and regulatory proteins</subject><subject>structure prediction</subject><subject>unfolded proteins</subject><issn>0887-3585</issn><issn>1097-0134</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kEFPAjEQRhujEUQvXk0MJw8mq9Ntu22PSgRNiCDB6K3ptt1kdWGxBZV_b3FRb55mDm_ezDcIHWO4wADp5cLXy9hRwndQG4PkCWBCd1EbhOAJYYK10EEILwCQSZLtoxZOZeQZb6OTfqXfax-6ddHdeFw579oy1N46f4j2Cl0Fd7StHfTYv5n2bpPhaHDXuxomhqZxWVzCRSqsZFgaTiWxxFHg2lqgNHU5x0QXnFGthcaSWlsYJyBn2uQiNZkgHXTWeOMBbysXlmpWBuOqSs9dvQqKE0Y4jZoOOm9A4-sQvCvUwpcz7dcKg9q8Qm0iqO9XRPh0a13lM2f_0G32COAG-Cgrt_5HpcaT0fRHmjQzZVi6z98Z7V9Vxgln6ul-oLLr50n_4Z6pMfkCqeV26Q</recordid><startdate>200309</startdate><enddate>200309</enddate><creator>Vucetic, Slobodan</creator><creator>Brown, Celeste J.</creator><creator>Dunker, A. Keith</creator><creator>Obradovic, Zoran</creator><general>Wiley Subscription Services, Inc., A Wiley Company</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>200309</creationdate><title>Flavors of protein disorder</title><author>Vucetic, Slobodan ; Brown, Celeste J. ; Dunker, A. Keith ; Obradovic, Zoran</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4297-8877828d9519c7493d3e407add0442eb713af754aa8a194ddfce80b5acb82c683</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Algorithms</topic><topic>Archaeal Proteins - chemistry</topic><topic>Bacterial Proteins - chemistry</topic><topic>Computational Biology - methods</topic><topic>Databases, Protein</topic><topic>Drosophila Proteins - chemistry</topic><topic>Fungal Proteins - chemistry</topic><topic>genomics</topic><topic>Helminth Proteins - chemistry</topic><topic>Plant Proteins - chemistry</topic><topic>Protein Conformation</topic><topic>Protein Folding</topic><topic>Protein Structure, Secondary</topic><topic>Proteins - chemistry</topic><topic>Reproducibility of Results</topic><topic>secondary structure</topic><topic>sequence composition</topic><topic>signaling and regulatory proteins</topic><topic>structure prediction</topic><topic>unfolded proteins</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Vucetic, Slobodan</creatorcontrib><creatorcontrib>Brown, Celeste J.</creatorcontrib><creatorcontrib>Dunker, A. Keith</creatorcontrib><creatorcontrib>Obradovic, Zoran</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Proteins, structure, function, and bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Vucetic, Slobodan</au><au>Brown, Celeste J.</au><au>Dunker, A. Keith</au><au>Obradovic, Zoran</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Flavors of protein disorder</atitle><jtitle>Proteins, structure, function, and bioinformatics</jtitle><addtitle>Proteins</addtitle><date>2003-09</date><risdate>2003</risdate><volume>52</volume><issue>4</issue><spage>573</spage><epage>584</epage><pages>573-584</pages><issn>0887-3585</issn><eissn>1097-0134</eissn><abstract>Intrinsically disordered proteins are characterized by long regions lacking 3‐D structure in their native states, yet they have been so far associated with 28 distinguishable functions. Previous studies showed that protein predictors trained on disorder from one type of protein often achieve poor accuracy on disorder of proteins of a different type, thus indicating significant differences in sequence properties among disordered proteins. Important biological problems are identifying different types, or flavors, of disorder and examining their relationships with protein function. Innovative use of computational methods is needed in addressing these problems due to relative scarcity of experimental data and background knowledge related to protein disorder. We developed an algorithm that partitions protein disorder into flavors based on competition among increasing numbers of predictors, with prediction accuracy determining both the number of distinct predictors and the partitioning of the individual proteins. Using 145 variously characterized proteins with long (&gt;30 amino acids) disordered regions, 3 flavors, called V, C, and S, were identified by this approach, with the V subset containing 52 segments and 7743 residues, C containing 39 segments and 3402 residues, and S containing 54 segments and 5752 residues. The V, C, and S flavors were distinguishable by amino acid compositions, sequence locations, and biological function. For the sequences in SwissProt and 28 genomes, their protein functions exhibit correlations with the commonness and usage of different disorder flavors, suggesting different flavor‐function sets across these protein groups. Overall, the results herein support the flavor‐function approach as a useful complement to structural genomics as a means for automatically assigning possible functions to sequences. Proteins 2003;52:573–584. © 2003 Wiley‐Liss, Inc.</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc., A Wiley Company</pub><pmid>12910457</pmid><doi>10.1002/prot.10437</doi><tpages>12</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0887-3585
ispartof Proteins, structure, function, and bioinformatics, 2003-09, Vol.52 (4), p.573-584
issn 0887-3585
1097-0134
language eng
recordid cdi_proquest_miscellaneous_73537471
source MEDLINE; Wiley Online Library Journals Frontfile Complete
subjects Algorithms
Archaeal Proteins - chemistry
Bacterial Proteins - chemistry
Computational Biology - methods
Databases, Protein
Drosophila Proteins - chemistry
Fungal Proteins - chemistry
genomics
Helminth Proteins - chemistry
Plant Proteins - chemistry
Protein Conformation
Protein Folding
Protein Structure, Secondary
Proteins - chemistry
Reproducibility of Results
secondary structure
sequence composition
signaling and regulatory proteins
structure prediction
unfolded proteins
title Flavors of protein disorder
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T09%3A31%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Flavors%20of%20protein%20disorder&rft.jtitle=Proteins,%20structure,%20function,%20and%20bioinformatics&rft.au=Vucetic,%20Slobodan&rft.date=2003-09&rft.volume=52&rft.issue=4&rft.spage=573&rft.epage=584&rft.pages=573-584&rft.issn=0887-3585&rft.eissn=1097-0134&rft_id=info:doi/10.1002/prot.10437&rft_dat=%3Cproquest_cross%3E73537471%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=73537471&rft_id=info:pmid/12910457&rfr_iscdi=true