FreeContact: fast and free software for protein contact prediction from residue co-evolution
20 years of improved technology and growing sequences now renders residue-residue contact constraints in large protein families through correlated mutations accurate enough to drive de novo predictions of protein three-dimensional structure. The method EVfold broke new ground using mean-field Direct...
Gespeichert in:
Veröffentlicht in: | BMC bioinformatics 2014-03, Vol.15 (1), p.85-85, Article 85 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 85 |
---|---|
container_issue | 1 |
container_start_page | 85 |
container_title | BMC bioinformatics |
container_volume | 15 |
creator | Kaján, László Hopf, Thomas A Kalaš, Matúš Marks, Debora S Rost, Burkhard |
description | 20 years of improved technology and growing sequences now renders residue-residue contact constraints in large protein families through correlated mutations accurate enough to drive de novo predictions of protein three-dimensional structure. The method EVfold broke new ground using mean-field Direct Coupling Analysis (EVfold-mfDCA); the method PSICOV applied a related concept by estimating a sparse inverse covariance matrix. Both methods (EVfold-mfDCA and PSICOV) are publicly available, but both require too much CPU time for interactive applications. On top, EVfold-mfDCA depends on proprietary software.
Here, we present FreeContact, a fast, open source implementation of EVfold-mfDCA and PSICOV. On a test set of 140 proteins, FreeContact was almost eight times faster than PSICOV without decreasing prediction performance. The EVfold-mfDCA implementation of FreeContact was over 220 times faster than PSICOV with negligible performance decrease. EVfold-mfDCA was unavailable for testing due to its dependency on proprietary software. FreeContact is implemented as the free C++ library "libfreecontact", complete with command line tool "freecontact", as well as Perl and Python modules. All components are available as Debian packages. FreeContact supports the BioXSD format for interoperability.
FreeContact provides the opportunity to compute reliable contact predictions in any environment (desktop or cloud). |
doi_str_mv | 10.1186/1471-2105-15-85 |
format | Article |
fullrecord | <record><control><sourceid>gale_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3987048</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A539461331</galeid><sourcerecordid>A539461331</sourcerecordid><originalsourceid>FETCH-LOGICAL-b614t-42be4064607f43894e149d8c780d16a1ac98a885f880b9ff2aa6affbeb60d7223</originalsourceid><addsrcrecordid>eNqNks9rFTEQxxdRbK2evcmCFz1sm9_JehDqw2qhIPjjJoRsdvJM2d28Jtmq_71ZXn32SQXJIcnMZ74M35mqeorRMcZKnGAmcUMw4g3mjeL3qsNd5P6t90H1KKVLhLBUiD-sDggTopWcHlZfzyLAKkzZ2Pyqdibl2kx97Uq0TsHl7yZC7UKsNzFk8FNtt2z5Q-9t9mEqcBjrCMn3M5R8A9dhmJfM4-qBM0OCJzf3UfXl7O3n1fvm4sO789XpRdMJzHLDSAcMCSaQdIyqlgFmba9sabbHwmBjW2WU4k4p1LXOEWOEca6DTqBeEkKPqtdb3c3cjdBbmHI0g95EP5r4Uwfj9X5m8t_0Olxr2iqJmCoCb7YCnQ__ENjP2DDqxV29uKsx14oXkRc3XcRwNUPKevTJwjCYCcKcCkUYQy3l6D9QTAiRvMUFff4XehnmOBU7F0phSbGUf6i1GUD7yYXSpl1E9SmnLROY0kXr-A6qnB5GXwYLzpf4XsHLvYJl-PAjr82ckj7_9HGfPdmyNoaUIridfRjpZVnvMOzZ7bHt-N_bSX8BA1njEQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1518173177</pqid></control><display><type>article</type><title>FreeContact: fast and free software for protein contact prediction from residue co-evolution</title><source>MEDLINE</source><source>SpringerLink Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>TestCollectionTL3OpenAccess</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Kaján, László ; Hopf, Thomas A ; Kalaš, Matúš ; Marks, Debora S ; Rost, Burkhard</creator><creatorcontrib>Kaján, László ; Hopf, Thomas A ; Kalaš, Matúš ; Marks, Debora S ; Rost, Burkhard</creatorcontrib><description>20 years of improved technology and growing sequences now renders residue-residue contact constraints in large protein families through correlated mutations accurate enough to drive de novo predictions of protein three-dimensional structure. The method EVfold broke new ground using mean-field Direct Coupling Analysis (EVfold-mfDCA); the method PSICOV applied a related concept by estimating a sparse inverse covariance matrix. Both methods (EVfold-mfDCA and PSICOV) are publicly available, but both require too much CPU time for interactive applications. On top, EVfold-mfDCA depends on proprietary software.
Here, we present FreeContact, a fast, open source implementation of EVfold-mfDCA and PSICOV. On a test set of 140 proteins, FreeContact was almost eight times faster than PSICOV without decreasing prediction performance. The EVfold-mfDCA implementation of FreeContact was over 220 times faster than PSICOV with negligible performance decrease. EVfold-mfDCA was unavailable for testing due to its dependency on proprietary software. FreeContact is implemented as the free C++ library "libfreecontact", complete with command line tool "freecontact", as well as Perl and Python modules. All components are available as Debian packages. FreeContact supports the BioXSD format for interoperability.
FreeContact provides the opportunity to compute reliable contact predictions in any environment (desktop or cloud).</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/1471-2105-15-85</identifier><identifier>PMID: 24669753</identifier><language>eng</language><publisher>England: BioMed Central Ltd</publisher><subject>Algorithms ; Bioinformatics ; Biology ; C plus plus ; Colleges & universities ; Computational Biology - methods ; Internet service providers ; Methods ; Programming languages ; Protein Conformation ; Protein structure prediction ; Proteins ; Proteins - chemistry ; Proteins - genetics ; Public software ; Sequence Analysis, Protein - methods ; Software ; Technology application</subject><ispartof>BMC bioinformatics, 2014-03, Vol.15 (1), p.85-85, Article 85</ispartof><rights>COPYRIGHT 2014 BioMed Central Ltd.</rights><rights>2014 Kaján et al.; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.</rights><rights>Copyright © 2014 Kaján et al.; licensee BioMed Central Ltd. 2014 Kaján et al.; licensee BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b614t-42be4064607f43894e149d8c780d16a1ac98a885f880b9ff2aa6affbeb60d7223</citedby><cites>FETCH-LOGICAL-b614t-42be4064607f43894e149d8c780d16a1ac98a885f880b9ff2aa6affbeb60d7223</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3987048/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3987048/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,724,777,781,861,882,27905,27906,53772,53774</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/24669753$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Kaján, László</creatorcontrib><creatorcontrib>Hopf, Thomas A</creatorcontrib><creatorcontrib>Kalaš, Matúš</creatorcontrib><creatorcontrib>Marks, Debora S</creatorcontrib><creatorcontrib>Rost, Burkhard</creatorcontrib><title>FreeContact: fast and free software for protein contact prediction from residue co-evolution</title><title>BMC bioinformatics</title><addtitle>BMC Bioinformatics</addtitle><description>20 years of improved technology and growing sequences now renders residue-residue contact constraints in large protein families through correlated mutations accurate enough to drive de novo predictions of protein three-dimensional structure. The method EVfold broke new ground using mean-field Direct Coupling Analysis (EVfold-mfDCA); the method PSICOV applied a related concept by estimating a sparse inverse covariance matrix. Both methods (EVfold-mfDCA and PSICOV) are publicly available, but both require too much CPU time for interactive applications. On top, EVfold-mfDCA depends on proprietary software.
Here, we present FreeContact, a fast, open source implementation of EVfold-mfDCA and PSICOV. On a test set of 140 proteins, FreeContact was almost eight times faster than PSICOV without decreasing prediction performance. The EVfold-mfDCA implementation of FreeContact was over 220 times faster than PSICOV with negligible performance decrease. EVfold-mfDCA was unavailable for testing due to its dependency on proprietary software. FreeContact is implemented as the free C++ library "libfreecontact", complete with command line tool "freecontact", as well as Perl and Python modules. All components are available as Debian packages. FreeContact supports the BioXSD format for interoperability.
FreeContact provides the opportunity to compute reliable contact predictions in any environment (desktop or cloud).</description><subject>Algorithms</subject><subject>Bioinformatics</subject><subject>Biology</subject><subject>C plus plus</subject><subject>Colleges & universities</subject><subject>Computational Biology - methods</subject><subject>Internet service providers</subject><subject>Methods</subject><subject>Programming languages</subject><subject>Protein Conformation</subject><subject>Protein structure prediction</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Proteins - genetics</subject><subject>Public software</subject><subject>Sequence Analysis, Protein - methods</subject><subject>Software</subject><subject>Technology application</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2014</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNqNks9rFTEQxxdRbK2evcmCFz1sm9_JehDqw2qhIPjjJoRsdvJM2d28Jtmq_71ZXn32SQXJIcnMZ74M35mqeorRMcZKnGAmcUMw4g3mjeL3qsNd5P6t90H1KKVLhLBUiD-sDggTopWcHlZfzyLAKkzZ2Pyqdibl2kx97Uq0TsHl7yZC7UKsNzFk8FNtt2z5Q-9t9mEqcBjrCMn3M5R8A9dhmJfM4-qBM0OCJzf3UfXl7O3n1fvm4sO789XpRdMJzHLDSAcMCSaQdIyqlgFmba9sabbHwmBjW2WU4k4p1LXOEWOEca6DTqBeEkKPqtdb3c3cjdBbmHI0g95EP5r4Uwfj9X5m8t_0Olxr2iqJmCoCb7YCnQ__ENjP2DDqxV29uKsx14oXkRc3XcRwNUPKevTJwjCYCcKcCkUYQy3l6D9QTAiRvMUFff4XehnmOBU7F0phSbGUf6i1GUD7yYXSpl1E9SmnLROY0kXr-A6qnB5GXwYLzpf4XsHLvYJl-PAjr82ckj7_9HGfPdmyNoaUIridfRjpZVnvMOzZ7bHt-N_bSX8BA1njEQ</recordid><startdate>20140326</startdate><enddate>20140326</enddate><creator>Kaján, László</creator><creator>Hopf, Thomas A</creator><creator>Kalaš, Matúš</creator><creator>Marks, Debora S</creator><creator>Rost, Burkhard</creator><general>BioMed Central Ltd</general><general>BioMed Central</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7SC</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>L7M</scope><scope>LK8</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20140326</creationdate><title>FreeContact: fast and free software for protein contact prediction from residue co-evolution</title><author>Kaján, László ; Hopf, Thomas A ; Kalaš, Matúš ; Marks, Debora S ; Rost, Burkhard</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b614t-42be4064607f43894e149d8c780d16a1ac98a885f880b9ff2aa6affbeb60d7223</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2014</creationdate><topic>Algorithms</topic><topic>Bioinformatics</topic><topic>Biology</topic><topic>C plus plus</topic><topic>Colleges & universities</topic><topic>Computational Biology - methods</topic><topic>Internet service providers</topic><topic>Methods</topic><topic>Programming languages</topic><topic>Protein Conformation</topic><topic>Protein structure prediction</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Proteins - genetics</topic><topic>Public software</topic><topic>Sequence Analysis, Protein - methods</topic><topic>Software</topic><topic>Technology application</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kaján, László</creatorcontrib><creatorcontrib>Hopf, Thomas A</creatorcontrib><creatorcontrib>Kalaš, Matúš</creatorcontrib><creatorcontrib>Marks, Debora S</creatorcontrib><creatorcontrib>Rost, Burkhard</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Computer and Information Systems Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>ProQuest Biological Science Collection</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kaján, László</au><au>Hopf, Thomas A</au><au>Kalaš, Matúš</au><au>Marks, Debora S</au><au>Rost, Burkhard</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>FreeContact: fast and free software for protein contact prediction from residue co-evolution</atitle><jtitle>BMC bioinformatics</jtitle><addtitle>BMC Bioinformatics</addtitle><date>2014-03-26</date><risdate>2014</risdate><volume>15</volume><issue>1</issue><spage>85</spage><epage>85</epage><pages>85-85</pages><artnum>85</artnum><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>20 years of improved technology and growing sequences now renders residue-residue contact constraints in large protein families through correlated mutations accurate enough to drive de novo predictions of protein three-dimensional structure. The method EVfold broke new ground using mean-field Direct Coupling Analysis (EVfold-mfDCA); the method PSICOV applied a related concept by estimating a sparse inverse covariance matrix. Both methods (EVfold-mfDCA and PSICOV) are publicly available, but both require too much CPU time for interactive applications. On top, EVfold-mfDCA depends on proprietary software.
Here, we present FreeContact, a fast, open source implementation of EVfold-mfDCA and PSICOV. On a test set of 140 proteins, FreeContact was almost eight times faster than PSICOV without decreasing prediction performance. The EVfold-mfDCA implementation of FreeContact was over 220 times faster than PSICOV with negligible performance decrease. EVfold-mfDCA was unavailable for testing due to its dependency on proprietary software. FreeContact is implemented as the free C++ library "libfreecontact", complete with command line tool "freecontact", as well as Perl and Python modules. All components are available as Debian packages. FreeContact supports the BioXSD format for interoperability.
FreeContact provides the opportunity to compute reliable contact predictions in any environment (desktop or cloud).</abstract><cop>England</cop><pub>BioMed Central Ltd</pub><pmid>24669753</pmid><doi>10.1186/1471-2105-15-85</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1471-2105 |
ispartof | BMC bioinformatics, 2014-03, Vol.15 (1), p.85-85, Article 85 |
issn | 1471-2105 1471-2105 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3987048 |
source | MEDLINE; SpringerLink Journals; PubMed Central Open Access; Springer Nature OA Free Journals; TestCollectionTL3OpenAccess; EZB-FREE-00999 freely available EZB journals; PubMed Central |
subjects | Algorithms Bioinformatics Biology C plus plus Colleges & universities Computational Biology - methods Internet service providers Methods Programming languages Protein Conformation Protein structure prediction Proteins Proteins - chemistry Proteins - genetics Public software Sequence Analysis, Protein - methods Software Technology application |
title | FreeContact: fast and free software for protein contact prediction from residue co-evolution |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-21T08%3A55%3A12IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=FreeContact:%20fast%20and%20free%20software%20for%20protein%20contact%20prediction%20from%20residue%20co-evolution&rft.jtitle=BMC%20bioinformatics&rft.au=Kaj%C3%A1n,%20L%C3%A1szl%C3%B3&rft.date=2014-03-26&rft.volume=15&rft.issue=1&rft.spage=85&rft.epage=85&rft.pages=85-85&rft.artnum=85&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/1471-2105-15-85&rft_dat=%3Cgale_pubme%3EA539461331%3C/gale_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1518173177&rft_id=info:pmid/24669753&rft_galeid=A539461331&rfr_iscdi=true |