Using Huffman coding method to visualize and analyze DNA sequences

On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequence. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. Then a multicomponent vector from the representation is i...

Bibliographic details
Published in: Journal of computational chemistry 2011-11, Vol.32 (15), p.3233-3240
Main authors: Qi, Zhao-Hui; Li, Ling; Qi, Xiao-Qin
Format: Article
Language: eng
Online access: Full text
container_end_page 3240
container_issue 15
container_start_page 3233
container_title Journal of computational chemistry
container_volume 32
creator Qi, Zhao-Hui
Li, Ling
Qi, Xiao-Qin
description On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequence. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. Then a multicomponent vector from the representation is introduced to characterize quantitatively DNA sequences. The components of the vector are derived from the graphical representation of DNA primary sequence. The examination of similarities and dissimilarities among the complete coding sequences of β‐globin gene of 11 species and six ND6 proteins shows the utility of the scheme. © 2011 Wiley Periodicals, Inc. J Comput Chem, 2011
doi_str_mv 10.1002/jcc.21906
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_894815005</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>894815005</sourcerecordid><originalsourceid>FETCH-LOGICAL-c4246-68f4642f2ce5bd68b96c70c61a754042c45bddb1a9db0b256d1d41cd4bde6cf3</originalsourceid><addsrcrecordid>eNp10E1PwyAABmBiNG5OD_4B03gxHroBBdoeteqmmfMyo9mFUKDa2Y9ZVnX-eqnddjDxQPjIwwt5AThGsI8gxIO5lH2MQsh2QBfBkLlh4D_vgi5EIXYDRlEHHBgzhxB6lJF90LGYepT6XXD5aNLixRnVSZKLwpGlara5Xr6WylmWzkdqapGl39oRhbJDZCu7vppcOEa_17qQ2hyCvURkRh-t5x6Y3lxPo5E7fhjeRhdjVxJMmMuChDCCEyw1jRUL4pBJH0qGhE8JJFgSe6xiJEIVwxhTppAiSCoSK81k4vXAWRu7qEr7slnyPDVSZ5kodFkbHoQkQBRCauXpHzkv68p-vUEMhwwGDTpvkaxKYyqd8EWV5qJacQR50yq3rfLfVq09WQfWca7VVm5qtGDQgs8006v_k_hdFG0i3fZGapb6a3tDVG-c-Z5P-dNkyNlsNBpG9zM-8X4AUSiPRg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>896296085</pqid></control><display><type>article</type><title>Using Huffman coding method to visualize and analyze DNA sequences</title><source>MEDLINE</source><source>Wiley Online Library Journals Frontfile Complete</source><creator>Qi, Zhao-Hui ; Li, Ling ; Qi, Xiao-Qin</creator><creatorcontrib>Qi, Zhao-Hui ; Li, Ling ; Qi, Xiao-Qin</creatorcontrib><description>On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequence. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. Then a multicomponent vector from the representation is introduced to characterize quantitatively DNA sequences. The components of the vector are derived from the graphical representation of DNA primary sequence. The examination of similarities and dissimilarities among the complete coding sequences of β‐globin gene of 11 species and six ND6 proteins shows the utility of the scheme. © 2011 Wiley Periodicals, Inc. 
J Comput Chem, 2011</description><identifier>ISSN: 0192-8651</identifier><identifier>EISSN: 1096-987X</identifier><identifier>DOI: 10.1002/jcc.21906</identifier><identifier>PMID: 21953557</identifier><identifier>CODEN: JCCHDD</identifier><language>eng</language><publisher>Hoboken: Wiley Subscription Services, Inc., A Wiley Company</publisher><subject>Base Sequence ; Computer Graphics ; Deoxyribonucleic acid ; DNA ; DNA - analysis ; DNA - genetics ; DNA sequence ; Genes ; Graph representations ; graphical representation ; Huffman coding method ; Medical coding ; Methods ; Proteins - genetics ; sequence analysis ; Sequence Analysis, DNA - methods</subject><ispartof>Journal of computational chemistry, 2011-11, Vol.32 (15), p.3233-3240</ispartof><rights>Copyright © 2011 Wiley Periodicals, Inc.</rights><rights>Copyright John Wiley and Sons, Limited Nov 30, 2011</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c4246-68f4642f2ce5bd68b96c70c61a754042c45bddb1a9db0b256d1d41cd4bde6cf3</citedby><cites>FETCH-LOGICAL-c4246-68f4642f2ce5bd68b96c70c61a754042c45bddb1a9db0b256d1d41cd4bde6cf3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fjcc.21906$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fjcc.21906$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,778,782,1414,27911,27912,45561,45562</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/21953557$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Qi, Zhao-Hui</creatorcontrib><creatorcontrib>Li, Ling</creatorcontrib><creatorcontrib>Qi, Xiao-Qin</creatorcontrib><title>Using Huffman coding method to visualize and analyze DNA 
sequences</title><title>Journal of computational chemistry</title><addtitle>J. Comput. Chem</addtitle><description>On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequence. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. Then a multicomponent vector from the representation is introduced to characterize quantitatively DNA sequences. The components of the vector are derived from the graphical representation of DNA primary sequence. The examination of similarities and dissimilarities among the complete coding sequences of β‐globin gene of 11 species and six ND6 proteins shows the utility of the scheme. © 2011 Wiley Periodicals, Inc. J Comput Chem, 2011</description><subject>Base Sequence</subject><subject>Computer Graphics</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA - analysis</subject><subject>DNA - genetics</subject><subject>DNA sequence</subject><subject>Genes</subject><subject>Graph representations</subject><subject>graphical representation</subject><subject>Huffman coding method</subject><subject>Medical coding</subject><subject>Methods</subject><subject>Proteins - genetics</subject><subject>sequence analysis</subject><subject>Sequence Analysis, DNA - 
methods</subject><issn>0192-8651</issn><issn>1096-987X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp10E1PwyAABmBiNG5OD_4B03gxHroBBdoeteqmmfMyo9mFUKDa2Y9ZVnX-eqnddjDxQPjIwwt5AThGsI8gxIO5lH2MQsh2QBfBkLlh4D_vgi5EIXYDRlEHHBgzhxB6lJF90LGYepT6XXD5aNLixRnVSZKLwpGlara5Xr6WylmWzkdqapGl39oRhbJDZCu7vppcOEa_17qQ2hyCvURkRh-t5x6Y3lxPo5E7fhjeRhdjVxJMmMuChDCCEyw1jRUL4pBJH0qGhE8JJFgSe6xiJEIVwxhTppAiSCoSK81k4vXAWRu7qEr7slnyPDVSZ5kodFkbHoQkQBRCauXpHzkv68p-vUEMhwwGDTpvkaxKYyqd8EWV5qJacQR50yq3rfLfVq09WQfWca7VVm5qtGDQgs8006v_k_hdFG0i3fZGapb6a3tDVG-c-Z5P-dNkyNlsNBpG9zM-8X4AUSiPRg</recordid><startdate>20111130</startdate><enddate>20111130</enddate><creator>Qi, Zhao-Hui</creator><creator>Li, Ling</creator><creator>Qi, Xiao-Qin</creator><general>Wiley Subscription Services, Inc., A Wiley Company</general><general>Wiley Subscription Services, Inc</general><scope>BSCLL</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>JQ2</scope><scope>7X8</scope></search><sort><creationdate>20111130</creationdate><title>Using Huffman coding method to visualize and analyze DNA sequences</title><author>Qi, Zhao-Hui ; Li, Ling ; Qi, Xiao-Qin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c4246-68f4642f2ce5bd68b96c70c61a754042c45bddb1a9db0b256d1d41cd4bde6cf3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>Base Sequence</topic><topic>Computer Graphics</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA - analysis</topic><topic>DNA - genetics</topic><topic>DNA sequence</topic><topic>Genes</topic><topic>Graph representations</topic><topic>graphical representation</topic><topic>Huffman coding method</topic><topic>Medical 
coding</topic><topic>Methods</topic><topic>Proteins - genetics</topic><topic>sequence analysis</topic><topic>Sequence Analysis, DNA - methods</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Qi, Zhao-Hui</creatorcontrib><creatorcontrib>Li, Ling</creatorcontrib><creatorcontrib>Qi, Xiao-Qin</creatorcontrib><collection>Istex</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Computer Science Collection</collection><collection>MEDLINE - Academic</collection><jtitle>Journal of computational chemistry</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qi, Zhao-Hui</au><au>Li, Ling</au><au>Qi, Xiao-Qin</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Using Huffman coding method to visualize and analyze DNA sequences</atitle><jtitle>Journal of computational chemistry</jtitle><addtitle>J. Comput. Chem</addtitle><date>2011-11-30</date><risdate>2011</risdate><volume>32</volume><issue>15</issue><spage>3233</spage><epage>3240</epage><pages>3233-3240</pages><issn>0192-8651</issn><eissn>1096-987X</eissn><coden>JCCHDD</coden><abstract>On the basis of the Huffman coding method, we propose a new graphical representation of DNA sequence. The representation can avoid degeneracy and loss of information in the transfer of data from a DNA sequence to its graphical representation. Then a multicomponent vector from the representation is introduced to characterize quantitatively DNA sequences. The components of the vector are derived from the graphical representation of DNA primary sequence. 
The examination of similarities and dissimilarities among the complete coding sequences of β‐globin gene of 11 species and six ND6 proteins shows the utility of the scheme. © 2011 Wiley Periodicals, Inc. J Comput Chem, 2011</abstract><cop>Hoboken</cop><pub>Wiley Subscription Services, Inc., A Wiley Company</pub><pmid>21953557</pmid><doi>10.1002/jcc.21906</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0192-8651
ispartof Journal of computational chemistry, 2011-11, Vol.32 (15), p.3233-3240
issn 0192-8651
1096-987X
language eng
recordid cdi_proquest_miscellaneous_894815005
source MEDLINE; Wiley Online Library Journals Frontfile Complete
subjects Base Sequence
Computer Graphics
Deoxyribonucleic acid
DNA
DNA - analysis
DNA - genetics
DNA sequence
Genes
Graph representations
graphical representation
Huffman coding method
Medical coding
Methods
Proteins - genetics
sequence analysis
Sequence Analysis, DNA - methods
title Using Huffman coding method to visualize and analyze DNA sequences
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T10%3A29%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Using%20Huffman%20coding%20method%20to%20visualize%20and%20analyze%20DNA%20sequences&rft.jtitle=Journal%20of%20computational%20chemistry&rft.au=Qi,%20Zhao-Hui&rft.date=2011-11-30&rft.volume=32&rft.issue=15&rft.spage=3233&rft.epage=3240&rft.pages=3233-3240&rft.issn=0192-8651&rft.eissn=1096-987X&rft.coden=JCCHDD&rft_id=info:doi/10.1002/jcc.21906&rft_dat=%3Cproquest_cross%3E894815005%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=896296085&rft_id=info:pmid/21953557&rfr_iscdi=true