Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core
Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformatio...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2001-10 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Laboulais, Cyril Ouali, Mohammed Marc Le Bret Gabarro-Arpa, Jacques |
description | Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2090730950</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2090730950</sourcerecordid><originalsourceid>FETCH-proquest_journals_20907309503</originalsourceid><addsrcrecordid>eNqNjsFOwzAQRCMkJCroP6zEOci1G0qPCIHCHXGtVu4mdWR7g3dzyB_xmaSFD-A00mjmzVxVK-vcpn7aWntTrUUGY4x93Nmmcavqu8WUQu7hGEQxe4KeOJGWGbgDhLGwUsjgOXdcEmrgjBFkRE8P8DyOMfiLCcqgJwIfJ1EqZ-QFsIUskDiSnyIWOM4ZU_ACWnAgr_w7dG6275_1BkJW6gvKQkLFOGvwy3ihu-q6wyi0_tPb6v7t9eOlrZeHXxOJHgaeyvJNDtbszc6ZfWPc_1I__flfQg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2090730950</pqid></control><display><type>article</type><title>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</title><source>Free E- Journals</source><creator>Laboulais, Cyril ; Ouali, Mohammed ; Marc Le Bret ; Gabarro-Arpa, Jacques</creator><creatorcontrib>Laboulais, Cyril ; Ouali, Mohammed ; Marc Le Bret ; Gabarro-Arpa, Jacques</creatorcontrib><description>Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Catalysis ; Cluster analysis ; Clustering ; Coding ; Data points ; Deviation ; Molecular dynamics ; Proteins ; Sequences ; Trajectories</subject><ispartof>arXiv.org, 2001-10</ispartof><rights>2001. This work is published under https://arxiv.org/licenses/assumed-1991-2003/license.html (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Laboulais, Cyril</creatorcontrib><creatorcontrib>Ouali, Mohammed</creatorcontrib><creatorcontrib>Marc Le Bret</creatorcontrib><creatorcontrib>Gabarro-Arpa, Jacques</creatorcontrib><title>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</title><title>arXiv.org</title><description>Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.</description><subject>Catalysis</subject><subject>Cluster analysis</subject><subject>Clustering</subject><subject>Coding</subject><subject>Data points</subject><subject>Deviation</subject><subject>Molecular dynamics</subject><subject>Proteins</subject><subject>Sequences</subject><subject>Trajectories</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2001</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNjsFOwzAQRCMkJCroP6zEOci1G0qPCIHCHXGtVu4mdWR7g3dzyB_xmaSFD-A00mjmzVxVK-vcpn7aWntTrUUGY4x93Nmmcavqu8WUQu7hGEQxe4KeOJGWGbgDhLGwUsjgOXdcEmrgjBFkRE8P8DyOMfiLCcqgJwIfJ1EqZ-QFsIUskDiSnyIWOM4ZU_ACWnAgr_w7dG6275_1BkJW6gvKQkLFOGvwy3ihu-q6wyi0_tPb6v7t9eOlrZeHXxOJHgaeyvJNDtbszc6ZfWPc_1I__flfQg</recordid><startdate>20011023</startdate><enddate>20011023</enddate><creator>Laboulais, Cyril</creator><creator>Ouali, Mohammed</creator><creator>Marc Le Bret</creator><creator>Gabarro-Arpa, Jacques</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20011023</creationdate><title>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</title><author>Laboulais, Cyril ; Ouali, Mohammed ; Marc Le Bret ; Gabarro-Arpa, Jacques</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20907309503</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2001</creationdate><topic>Catalysis</topic><topic>Cluster analysis</topic><topic>Clustering</topic><topic>Coding</topic><topic>Data points</topic><topic>Deviation</topic><topic>Molecular dynamics</topic><topic>Proteins</topic><topic>Sequences</topic><topic>Trajectories</topic><toplevel>online_resources</toplevel><creatorcontrib>Laboulais, Cyril</creatorcontrib><creatorcontrib>Ouali, Mohammed</creatorcontrib><creatorcontrib>Marc Le Bret</creatorcontrib><creatorcontrib>Gabarro-Arpa, Jacques</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Laboulais, Cyril</au><au>Ouali, Mohammed</au><au>Marc Le Bret</au><au>Gabarro-Arpa, Jacques</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</atitle><jtitle>arXiv.org</jtitle><date>2001-10-23</date><risdate>2001</risdate><eissn>2331-8422</eissn><abstract>Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2001-10 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2090730950 |
source | Free E- Journals |
subjects | Catalysis Cluster analysis Clustering Coding Data points Deviation Molecular dynamics Proteins Sequences Trajectories |
title | Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T20%3A44%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Hamming%20distance%20geometry%20of%20a%20protein%20conformational%20space.%20Application%20to%20the%20clustering%20of%20a%204%20ns%20molecular%20dynamics%20trajectory%20of%20the%20HIV-1%20integrase%20catalytic%20core&rft.jtitle=arXiv.org&rft.au=Laboulais,%20Cyril&rft.date=2001-10-23&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2090730950%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2090730950&rft_id=info:pmid/&rfr_iscdi=true |