Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core

Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2001-10
Hauptverfasser: Laboulais, Cyril, Ouali, Mohammed, Marc Le Bret, Gabarro-Arpa, Jacques
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Laboulais, Cyril
Ouali, Mohammed
Marc Le Bret
Gabarro-Arpa, Jacques
description Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2090730950</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2090730950</sourcerecordid><originalsourceid>FETCH-proquest_journals_20907309503</originalsourceid><addsrcrecordid>eNqNjsFOwzAQRCMkJCroP6zEOci1G0qPCIHCHXGtVu4mdWR7g3dzyB_xmaSFD-A00mjmzVxVK-vcpn7aWntTrUUGY4x93Nmmcavqu8WUQu7hGEQxe4KeOJGWGbgDhLGwUsjgOXdcEmrgjBFkRE8P8DyOMfiLCcqgJwIfJ1EqZ-QFsIUskDiSnyIWOM4ZU_ACWnAgr_w7dG6275_1BkJW6gvKQkLFOGvwy3ihu-q6wyi0_tPb6v7t9eOlrZeHXxOJHgaeyvJNDtbszc6ZfWPc_1I__flfQg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2090730950</pqid></control><display><type>article</type><title>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</title><source>Free E- Journals</source><creator>Laboulais, Cyril ; Ouali, Mohammed ; Marc Le Bret ; Gabarro-Arpa, Jacques</creator><creatorcontrib>Laboulais, Cyril ; Ouali, Mohammed ; Marc Le Bret ; Gabarro-Arpa, Jacques</creatorcontrib><description>Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Catalysis ; Cluster analysis ; Clustering ; Coding ; Data points ; Deviation ; Molecular dynamics ; Proteins ; Sequences ; Trajectories</subject><ispartof>arXiv.org, 2001-10</ispartof><rights>2001. This work is published under https://arxiv.org/licenses/assumed-1991-2003/license.html (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,780</link.rule.ids></links><search><creatorcontrib>Laboulais, Cyril</creatorcontrib><creatorcontrib>Ouali, Mohammed</creatorcontrib><creatorcontrib>Marc Le Bret</creatorcontrib><creatorcontrib>Gabarro-Arpa, Jacques</creatorcontrib><title>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</title><title>arXiv.org</title><description>Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.</description><subject>Catalysis</subject><subject>Cluster analysis</subject><subject>Clustering</subject><subject>Coding</subject><subject>Data points</subject><subject>Deviation</subject><subject>Molecular dynamics</subject><subject>Proteins</subject><subject>Sequences</subject><subject>Trajectories</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2001</creationdate><recordtype>article</recordtype><sourceid>BENPR</sourceid><recordid>eNqNjsFOwzAQRCMkJCroP6zEOci1G0qPCIHCHXGtVu4mdWR7g3dzyB_xmaSFD-A00mjmzVxVK-vcpn7aWntTrUUGY4x93Nmmcavqu8WUQu7hGEQxe4KeOJGWGbgDhLGwUsjgOXdcEmrgjBFkRE8P8DyOMfiLCcqgJwIfJ1EqZ-QFsIUskDiSnyIWOM4ZU_ACWnAgr_w7dG6275_1BkJW6gvKQkLFOGvwy3ihu-q6wyi0_tPb6v7t9eOlrZeHXxOJHgaeyvJNDtbszc6ZfWPc_1I__flfQg</recordid><startdate>20011023</startdate><enddate>20011023</enddate><creator>Laboulais, Cyril</creator><creator>Ouali, Mohammed</creator><creator>Marc Le Bret</creator><creator>Gabarro-Arpa, Jacques</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20011023</creationdate><title>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</title><author>Laboulais, Cyril ; Ouali, Mohammed ; Marc Le Bret ; Gabarro-Arpa, Jacques</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20907309503</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2001</creationdate><topic>Catalysis</topic><topic>Cluster analysis</topic><topic>Clustering</topic><topic>Coding</topic><topic>Data points</topic><topic>Deviation</topic><topic>Molecular dynamics</topic><topic>Proteins</topic><topic>Sequences</topic><topic>Trajectories</topic><toplevel>online_resources</toplevel><creatorcontrib>Laboulais, Cyril</creatorcontrib><creatorcontrib>Ouali, Mohammed</creatorcontrib><creatorcontrib>Marc Le Bret</creatorcontrib><creatorcontrib>Gabarro-Arpa, Jacques</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Laboulais, Cyril</au><au>Ouali, Mohammed</au><au>Marc Le Bret</au><au>Gabarro-Arpa, Jacques</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core</atitle><jtitle>arXiv.org</jtitle><date>2001-10-23</date><risdate>2001</risdate><eissn>2331-8422</eissn><abstract>Protein structures can be encoded into binary sequences, these are used to define a Hamming distance in conformational space: the distance between two different molecular conformations is the number of different bits in their sequences. Each bit in the sequence arises from a partition of conformational space in two halves. Thus, the information encoded in the binary sequences is also used to characterize the regions of conformational space visited by the system. We apply this distance and their associated geometric structures, to the clustering and analysis of conformations sampled during a 4 ns molecular dynamics simulation of the HIV-1 integrase catalytic core. The cluster analysis of the simulation shows a division of the trajectory into two segments of 2.6 and 1.4 ns length, which are qualitatively different: the data points to the fact that equilibration is only reached at the end of the first segment. Some length of the paper is devoted to compare the Hamming distance to the r.m.s. deviation measure. The analysis of the cases studied so far, shows that under the same conditions the two measures behave quite differently, and that the Hamming distance appears to be more robust than the r.m.s. deviation.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2001-10
issn 2331-8422
language eng
recordid cdi_proquest_journals_2090730950
source Free E- Journals
subjects Catalysis
Cluster analysis
Clustering
Coding
Data points
Deviation
Molecular dynamics
Proteins
Sequences
Trajectories
title Hamming distance geometry of a protein conformational space. Application to the clustering of a 4 ns molecular dynamics trajectory of the HIV-1 integrase catalytic core
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T20%3A44%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Hamming%20distance%20geometry%20of%20a%20protein%20conformational%20space.%20Application%20to%20the%20clustering%20of%20a%204%20ns%20molecular%20dynamics%20trajectory%20of%20the%20HIV-1%20integrase%20catalytic%20core&rft.jtitle=arXiv.org&rft.au=Laboulais,%20Cyril&rft.date=2001-10-23&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2090730950%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2090730950&rft_id=info:pmid/&rfr_iscdi=true