Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations

Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog...

Detailed description

Bibliographic details
Published in: BMC bioinformatics 2018-11, Vol.19 (15), p.85-143
Main authors: Ambrosino, Luca; Ruggieri, Valentino; Bostan, Hamed; Miralto, Marco; Vitulo, Nicola; Zouine, Mohammed; Barone, Amalia; Bouzayen, Mondher; Frusciante, Luigi; Pezzotti, Mario; Valle, Giorgio; Chiusano, Maria Luisa
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page 143
container_issue 15
container_start_page 85
container_title BMC bioinformatics
container_volume 19
creator Ambrosino, Luca
Ruggieri, Valentino
Bostan, Hamed
Miralto, Marco
Vitulo, Nicola
Zouine, Mohammed
Barone, Amalia
Bouzayen, Mondher
Frusciante, Luigi
Pezzotti, Mario
Valle, Giorgio
Chiusano, Maria Luisa
description Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. 
Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.
doi_str_mv 10.1186/s12859-018-2420-y
format Article
fullrecord <record><control><sourceid>hal</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_02623377v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>oai_HAL_hal_02623377v1</sourcerecordid><originalsourceid>FETCH-hal_primary_oai_HAL_hal_02623377v13</originalsourceid><addsrcrecordid>eNqVTLtOwzAUtRCIlscHsHllMPiRxOmIEKgDbOyRCTftRY4d-TqW-vekgoG103kfxu6UfFCqbR5J6bbeCKlaoSstxeGMrVVlldBK1uf_-IpdEX1LqWwr60u2MrLaWNPYNZvfZ5_RQwHP-zhOLrmMBfgnRgxDTOMie-I5cgwFKOPOZeBQop8zxuDSgSfw7shpjxNxF744TdDjgD1mBFqGfAcBliTE_Nu8YReD8wS3f3jN7l9fPp63Yu98NyUcl98uOuy2T2_d0ZO60cZYW5Q5pfsD-nJbdw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</title><source>DOAJ Directory of Open Access Journals</source><source>SpringerLink Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>PubMed Central</source><creator>Ambrosino, Luca ; Ruggieri, Valentino ; Bostan, Hamed ; Miralto, Marco ; Vitulo, Nicola ; Zouine, Mohammed ; Barone, Amalia ; Bouzayen, Mondher ; Frusciante, Luigi ; Pezzotti, Mario ; Valle, Giorgio ; Chiusano, Maria Luisa</creator><creatorcontrib>Ambrosino, Luca ; Ruggieri, Valentino ; Bostan, Hamed ; Miralto, Marco ; Vitulo, Nicola ; Zouine, Mohammed ; Barone, Amalia ; Bouzayen, Mondher ; Frusciante, Luigi ; Pezzotti, Mario ; Valle, Giorgio ; Chiusano, Maria Luisa</creatorcontrib><description>Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. 
This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. 
Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-018-2420-y</identifier><identifier>PMID: 30497367</identifier><language>eng</language><publisher>BioMed Central</publisher><subject>Life Sciences ; Vegetal Biology</subject><ispartof>BMC bioinformatics, 2018-11, Vol.19 (15), p.85-143</ispartof><rights>Attribution</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0001-7630-1449 ; 0000-0001-7630-1449</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,777,781,861,882,27905,27906</link.rule.ids><backlink>$$Uhttps://hal.inrae.fr/hal-02623377$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Ambrosino, Luca</creatorcontrib><creatorcontrib>Ruggieri, Valentino</creatorcontrib><creatorcontrib>Bostan, Hamed</creatorcontrib><creatorcontrib>Miralto, Marco</creatorcontrib><creatorcontrib>Vitulo, Nicola</creatorcontrib><creatorcontrib>Zouine, Mohammed</creatorcontrib><creatorcontrib>Barone, Amalia</creatorcontrib><creatorcontrib>Bouzayen, 
Mondher</creatorcontrib><creatorcontrib>Frusciante, Luigi</creatorcontrib><creatorcontrib>Pezzotti, Mario</creatorcontrib><creatorcontrib>Valle, Giorgio</creatorcontrib><creatorcontrib>Chiusano, Maria Luisa</creatorcontrib><title>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</title><title>BMC bioinformatics</title><description>Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. 
All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. 
In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.</description><subject>Life Sciences</subject><subject>Vegetal Biology</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNqVTLtOwzAUtRCIlscHsHllMPiRxOmIEKgDbOyRCTftRY4d-TqW-vekgoG103kfxu6UfFCqbR5J6bbeCKlaoSstxeGMrVVlldBK1uf_-IpdEX1LqWwr60u2MrLaWNPYNZvfZ5_RQwHP-zhOLrmMBfgnRgxDTOMie-I5cgwFKOPOZeBQop8zxuDSgSfw7shpjxNxF744TdDjgD1mBFqGfAcBliTE_Nu8YReD8wS3f3jN7l9fPp63Yu98NyUcl98uOuy2T2_d0ZO60cZYW5Q5pfsD-nJbdw</recordid><startdate>201811</startdate><enddate>201811</enddate><creator>Ambrosino, Luca</creator><creator>Ruggieri, Valentino</creator><creator>Bostan, Hamed</creator><creator>Miralto, Marco</creator><creator>Vitulo, Nicola</creator><creator>Zouine, Mohammed</creator><creator>Barone, Amalia</creator><creator>Bouzayen, Mondher</creator><creator>Frusciante, Luigi</creator><creator>Pezzotti, Mario</creator><creator>Valle, Giorgio</creator><creator>Chiusano, Maria Luisa</creator><general>BioMed Central</general><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid></search><sort><creationdate>201811</creationdate><title>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</title><author>Ambrosino, Luca ; Ruggieri, Valentino ; Bostan, Hamed ; Miralto, Marco ; Vitulo, Nicola ; Zouine, Mohammed ; Barone, Amalia ; Bouzayen, Mondher ; Frusciante, Luigi ; Pezzotti, Mario ; Valle, Giorgio ; Chiusano, Maria 
Luisa</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-hal_primary_oai_HAL_hal_02623377v13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Life Sciences</topic><topic>Vegetal Biology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ambrosino, Luca</creatorcontrib><creatorcontrib>Ruggieri, Valentino</creatorcontrib><creatorcontrib>Bostan, Hamed</creatorcontrib><creatorcontrib>Miralto, Marco</creatorcontrib><creatorcontrib>Vitulo, Nicola</creatorcontrib><creatorcontrib>Zouine, Mohammed</creatorcontrib><creatorcontrib>Barone, Amalia</creatorcontrib><creatorcontrib>Bouzayen, Mondher</creatorcontrib><creatorcontrib>Frusciante, Luigi</creatorcontrib><creatorcontrib>Pezzotti, Mario</creatorcontrib><creatorcontrib>Valle, Giorgio</creatorcontrib><creatorcontrib>Chiusano, Maria Luisa</creatorcontrib><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ambrosino, Luca</au><au>Ruggieri, Valentino</au><au>Bostan, Hamed</au><au>Miralto, Marco</au><au>Vitulo, Nicola</au><au>Zouine, Mohammed</au><au>Barone, Amalia</au><au>Bouzayen, Mondher</au><au>Frusciante, Luigi</au><au>Pezzotti, Mario</au><au>Valle, Giorgio</au><au>Chiusano, Maria Luisa</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</atitle><jtitle>BMC bioinformatics</jtitle><date>2018-11</date><risdate>2018</risdate><volume>19</volume><issue>15</issue><spage>85</spage><epage>143</epage><pages>85-143</pages><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Background: “Omics” approaches may provide 
useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. 
Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.</abstract><pub>BioMed Central</pub><pmid>30497367</pmid><doi>10.1186/s12859-018-2420-y</doi><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC bioinformatics, 2018-11, Vol.19 (15), p.85-143
issn 1471-2105
1471-2105
language eng
recordid cdi_hal_primary_oai_HAL_hal_02623377v1
source DOAJ Directory of Open Access Journals; SpringerLink Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central Open Access; Springer Nature OA Free Journals; PubMed Central
subjects Life Sciences
Vegetal Biology
title Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T12%3A20%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-hal&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multilevel%20comparative%20bioinformatics%20to%20investigate%20evolutionary%20relationships%20and%20specificities%20in%20gene%20annotations&rft.jtitle=BMC%20bioinformatics&rft.au=Ambrosino,%20Luca&rft.date=2018-11&rft.volume=19&rft.issue=15&rft.spage=85&rft.epage=143&rft.pages=85-143&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-018-2420-y&rft_dat=%3Chal%3Eoai_HAL_hal_02623377v1%3C/hal%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/30497367&rfr_iscdi=true
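The url field above is an OpenURL-style link whose query string repeats the citation as key/value pairs (`rft.jtitle`, `rft.volume`, `rft_id`, and so on). As a minimal sketch of how those pairs could be read back out, the Python below parses a shortened copy of the link using only the standard library. The helper names `openurl_fields` and `identifiers` are hypothetical, introduced here for illustration, and the shortened URL keeps only a subset of the record's keys.

```python
from urllib.parse import urlsplit, parse_qs

# Shortened copy of the record's url field; only some of its keys are kept.
OPENURL = (
    "https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004"
    "&rft.genre=article"
    "&rft.jtitle=BMC%20bioinformatics"
    "&rft.au=Ambrosino,%20Luca"
    "&rft.date=2018-11&rft.volume=19&rft.issue=15"
    "&rft.spage=85&rft.epage=143"
    "&rft.issn=1471-2105"
    "&rft_id=info:doi/10.1186/s12859-018-2420-y"
    "&rft_id=info:pmid/30497367"
)


def openurl_fields(url):
    """Hypothetical helper: decode the query string into a dict of lists."""
    return parse_qs(urlsplit(url).query)


def identifiers(fields):
    """Hypothetical helper: map each rft_id scheme (doi, pmid) to its value."""
    ids = {}
    for value in fields.get("rft_id", []):
        if value.startswith("info:"):
            # "info:doi/10.1186/..." -> scheme "doi", ident "10.1186/..."
            scheme, _, ident = value[len("info:"):].partition("/")
            if ident:  # skip entries with an empty identifier
                ids[scheme] = ident
    return ids


fields = openurl_fields(OPENURL)
ids = identifiers(fields)
print(fields["rft.jtitle"][0], fields["rft.volume"][0], ids)
```

Because `rft_id` occurs more than once in the link, `parse_qs` is used so that every value of a repeated key is kept in a list instead of the last one overwriting the others.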