Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes

The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Genomics (San Diego, Calif.) Calif.), 2015-07, Vol.106 (1), p.30-42
Hauptverfasser: Lee, Kang-Hoon, Shin, Kyung-Seop, Lim, Debora, Kim, Woo-Chan, Chung, Byung Chang, Han, Gyu-Bum, Roh, Jeongkyu, Cho, Dong-Ho, Cho, Kiho
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 42
container_issue 1
container_start_page 30
container_title Genomics (San Diego, Calif.)
container_volume 106
creator Lee, Kang-Hoon
Shin, Kyung-Seop
Lim, Debora
Kim, Woo-Chan
Chung, Byung Chang
Han, Gyu-Bum
Roh, Jeongkyu
Cho, Dong-Ho
Cho, Kiho
description The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes. •Genomes of microbes are visualized by representative repetitive element (RE) sets.•The RE profile-based GSI system is developed for genome-scale evolutionary studies.•The GSI system enables an efficient computing and cataloging of big genome data.
doi_str_mv 10.1016/j.ygeno.2015.04.004
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1735913374</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0888754315000749</els_id><sourcerecordid>1686414986</sourcerecordid><originalsourceid>FETCH-LOGICAL-c387t-246d8461bbe71693adff1fe2f8a0a6912e351464ec8a237ab17a71dde23d3d5b3</originalsourceid><addsrcrecordid>eNqFkU2LFDEQhoMo7rj6CwTJ0YPdpjrpJH3wIItfsCCInkN1Ur1k6I-xkx5Yf72ZndGjngqKp1KV92HsJYgaBOi3-_r-jualbgS0tVC1EOoR24GwXWW10o_ZTlhrK9MqecWepbQXQnTSNk_ZVdN2YIWUO5a_0YFyzPFInEaaaM48xbsZ87ZS1WOiwI8xbTjGX5jjMr_hIaaMsyful-mw5UsX58D9iCnFIfqHHl8GDkZrPkW_Ln3EkZ_unSg9Z08GHBO9uNRr9uPjh-83n6vbr5--3Ly_rby0JleN0sEqDX1PBnQnMQwDDNQMFgXqDhqSLSityFtspMEeDBoIgRoZZGh7ec1en989rMvPjVJ2U0yexhFnWrbkwMgShJRG_R_VJVNQndUFlWe0_CqllQZ3WOOE670D4U5m3N49mHEnM04oV8yUqVeXBVs_Ufg780dFAd6dASqJHCOtLvlIJecQV_LZhSX-c8FvxnSiBQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1686414986</pqid></control><display><type>article</type><title>Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><creator>Lee, Kang-Hoon ; Shin, Kyung-Seop ; Lim, Debora ; Kim, Woo-Chan ; Chung, Byung Chang ; Han, Gyu-Bum ; Roh, Jeongkyu ; Cho, Dong-Ho ; Cho, Kiho</creator><creatorcontrib>Lee, Kang-Hoon ; Shin, Kyung-Seop ; Lim, Debora ; Kim, Woo-Chan ; Chung, Byung Chang ; Han, Gyu-Bum ; Roh, Jeongkyu ; Cho, Dong-Ho ; Cho, Kiho</creatorcontrib><description>The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes. •Genomes of microbes are visualized by representative repetitive element (RE) sets.•The RE profile-based GSI system is developed for genome-scale evolutionary studies.•The GSI system enables an efficient computing and cataloging of big genome data.</description><identifier>ISSN: 0888-7543</identifier><identifier>EISSN: 1089-8646</identifier><identifier>DOI: 10.1016/j.ygeno.2015.04.004</identifier><identifier>PMID: 25918033</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>Algorithms ; Archaea ; Cluster Analysis ; DNA - chemistry ; Evolution, Molecular ; Genome distance ; Genome signature ; Genome visualization ; Genome, Archaeal ; Genome, Bacterial ; Genome-scale classification ; Genomics - methods ; Microbial genomes ; Mutation, Missense ; Repetitive element ; Repetitive Sequences, Nucleic Acid</subject><ispartof>Genomics (San Diego, Calif.), 2015-07, Vol.106 (1), p.30-42</ispartof><rights>2015 Elsevier Inc.</rights><rights>Copyright © 2015 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c387t-246d8461bbe71693adff1fe2f8a0a6912e351464ec8a237ab17a71dde23d3d5b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0888754315000749$$EHTML$$P50$$Gelsevier$$Hfree_for_read</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25918033$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Lee, Kang-Hoon</creatorcontrib><creatorcontrib>Shin, Kyung-Seop</creatorcontrib><creatorcontrib>Lim, Debora</creatorcontrib><creatorcontrib>Kim, Woo-Chan</creatorcontrib><creatorcontrib>Chung, Byung Chang</creatorcontrib><creatorcontrib>Han, Gyu-Bum</creatorcontrib><creatorcontrib>Roh, Jeongkyu</creatorcontrib><creatorcontrib>Cho, Dong-Ho</creatorcontrib><creatorcontrib>Cho, Kiho</creatorcontrib><title>Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes</title><title>Genomics (San Diego, Calif.)</title><addtitle>Genomics</addtitle><description>The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes. •Genomes of microbes are visualized by representative repetitive element (RE) sets.•The RE profile-based GSI system is developed for genome-scale evolutionary studies.•The GSI system enables an efficient computing and cataloging of big genome data.</description><subject>Algorithms</subject><subject>Archaea</subject><subject>Cluster Analysis</subject><subject>DNA - chemistry</subject><subject>Evolution, Molecular</subject><subject>Genome distance</subject><subject>Genome signature</subject><subject>Genome visualization</subject><subject>Genome, Archaeal</subject><subject>Genome, Bacterial</subject><subject>Genome-scale classification</subject><subject>Genomics - methods</subject><subject>Microbial genomes</subject><subject>Mutation, Missense</subject><subject>Repetitive element</subject><subject>Repetitive Sequences, Nucleic Acid</subject><issn>0888-7543</issn><issn>1089-8646</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkU2LFDEQhoMo7rj6CwTJ0YPdpjrpJH3wIItfsCCInkN1Ur1k6I-xkx5Yf72ZndGjngqKp1KV92HsJYgaBOi3-_r-jualbgS0tVC1EOoR24GwXWW10o_ZTlhrK9MqecWepbQXQnTSNk_ZVdN2YIWUO5a_0YFyzPFInEaaaM48xbsZ87ZS1WOiwI8xbTjGX5jjMr_hIaaMsyful-mw5UsX58D9iCnFIfqHHl8GDkZrPkW_Ln3EkZ_unSg9Z08GHBO9uNRr9uPjh-83n6vbr5--3Ly_rby0JleN0sEqDX1PBnQnMQwDDNQMFgXqDhqSLSityFtspMEeDBoIgRoZZGh7ec1en989rMvPjVJ2U0yexhFnWrbkwMgShJRG_R_VJVNQndUFlWe0_CqllQZ3WOOE670D4U5m3N49mHEnM04oV8yUqVeXBVs_Ufg780dFAd6dASqJHCOtLvlIJecQV_LZhSX-c8FvxnSiBQ</recordid><startdate>201507</startdate><enddate>201507</enddate><creator>Lee, Kang-Hoon</creator><creator>Shin, Kyung-Seop</creator><creator>Lim, Debora</creator><creator>Kim, Woo-Chan</creator><creator>Chung, Byung Chang</creator><creator>Han, Gyu-Bum</creator><creator>Roh, Jeongkyu</creator><creator>Cho, Dong-Ho</creator><creator>Cho, Kiho</creator><general>Elsevier Inc</general><scope>6I.</scope><scope>AAFTH</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope></search><sort><creationdate>201507</creationdate><title>Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes</title><author>Lee, Kang-Hoon ; Shin, Kyung-Seop ; Lim, Debora ; Kim, Woo-Chan ; Chung, Byung Chang ; Han, Gyu-Bum ; Roh, Jeongkyu ; Cho, Dong-Ho ; Cho, Kiho</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c387t-246d8461bbe71693adff1fe2f8a0a6912e351464ec8a237ab17a71dde23d3d5b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Algorithms</topic><topic>Archaea</topic><topic>Cluster Analysis</topic><topic>DNA - chemistry</topic><topic>Evolution, Molecular</topic><topic>Genome distance</topic><topic>Genome signature</topic><topic>Genome visualization</topic><topic>Genome, Archaeal</topic><topic>Genome, Bacterial</topic><topic>Genome-scale classification</topic><topic>Genomics - methods</topic><topic>Microbial genomes</topic><topic>Mutation, Missense</topic><topic>Repetitive element</topic><topic>Repetitive Sequences, Nucleic Acid</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lee, Kang-Hoon</creatorcontrib><creatorcontrib>Shin, Kyung-Seop</creatorcontrib><creatorcontrib>Lim, Debora</creatorcontrib><creatorcontrib>Kim, Woo-Chan</creatorcontrib><creatorcontrib>Chung, Byung Chang</creatorcontrib><creatorcontrib>Han, Gyu-Bum</creatorcontrib><creatorcontrib>Roh, Jeongkyu</creatorcontrib><creatorcontrib>Cho, Dong-Ho</creatorcontrib><creatorcontrib>Cho, Kiho</creatorcontrib><collection>ScienceDirect Open Access Titles</collection><collection>Elsevier:ScienceDirect:Open Access</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><jtitle>Genomics (San Diego, Calif.)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lee, Kang-Hoon</au><au>Shin, Kyung-Seop</au><au>Lim, Debora</au><au>Kim, Woo-Chan</au><au>Chung, Byung Chang</au><au>Han, Gyu-Bum</au><au>Roh, Jeongkyu</au><au>Cho, Dong-Ho</au><au>Cho, Kiho</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes</atitle><jtitle>Genomics (San Diego, Calif.)</jtitle><addtitle>Genomics</addtitle><date>2015-07</date><risdate>2015</risdate><volume>106</volume><issue>1</issue><spage>30</spage><epage>42</epage><pages>30-42</pages><issn>0888-7543</issn><eissn>1089-8646</eissn><abstract>The genomes of living organisms are populated with pleomorphic repetitive elements (REs) of varying densities. Our hypothesis that genomic RE landscapes are species/strain/individual-specific was implemented into the Genome Signature Imaging system to visualize and compute the RE-based signatures of any genome. Following the occurrence profiling of 5-nucleotide REs/words, the information from top-50 frequency words was transformed into a genome-specific signature and visualized as Genome Signature Images (GSIs), using a CMYK scheme. An algorithm for computing distances among GSIs was formulated using the GSIs' variables (word identity, frequency, and frequency order). The utility of the GSI-distance computation system was demonstrated with control genomes. GSI-based computation of genome-relatedness among 1766 microbes (117 archaea and 1649 bacteria) identified their clustering patterns; although the majority paralleled the established classification, some did not. The Genome Signature Imaging system, with its visualization and distance computation functions, enables genome-scale evolutionary studies involving numerous genomes with varying sizes. •Genomes of microbes are visualized by representative repetitive element (RE) sets.•The RE profile-based GSI system is developed for genome-scale evolutionary studies.•The GSI system enables an efficient computing and cataloging of big genome data.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>25918033</pmid><doi>10.1016/j.ygeno.2015.04.004</doi><tpages>13</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0888-7543
ispartof Genomics (San Diego, Calif.), 2015-07, Vol.106 (1), p.30-42
issn 0888-7543
1089-8646
language eng
recordid cdi_proquest_miscellaneous_1735913374
source MEDLINE; Elsevier ScienceDirect Journals; EZB-FREE-00999 freely available EZB journals
subjects Algorithms
Archaea
Cluster Analysis
DNA - chemistry
Evolution, Molecular
Genome distance
Genome signature
Genome visualization
Genome, Archaeal
Genome, Bacterial
Genome-scale classification
Genomics - methods
Microbial genomes
Mutation, Missense
Repetitive element
Repetitive Sequences, Nucleic Acid
title Repetitive element signature-based visualization, distance computation, and classification of 1766 microbial genomes
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T11%3A52%3A36IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Repetitive%20element%20signature-based%20visualization,%20distance%20computation,%20and%20classification%20of%201766%20microbial%20genomes&rft.jtitle=Genomics%20(San%20Diego,%20Calif.)&rft.au=Lee,%20Kang-Hoon&rft.date=2015-07&rft.volume=106&rft.issue=1&rft.spage=30&rft.epage=42&rft.pages=30-42&rft.issn=0888-7543&rft.eissn=1089-8646&rft_id=info:doi/10.1016/j.ygeno.2015.04.004&rft_dat=%3Cproquest_cross%3E1686414986%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1686414986&rft_id=info:pmid/25918033&rft_els_id=S0888754315000749&rfr_iscdi=true