Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations
Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog...
Gespeichert in:
Veröffentlicht in: | BMC bioinformatics 2018-11, Vol.19 (15), p.85-143 |
---|---|
Hauptverfasser: | , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 143 |
---|---|
container_issue | 15 |
container_start_page | 85 |
container_title | BMC bioinformatics |
container_volume | 19 |
creator | Ambrosino, Luca Ruggieri, Valentino Bostan, Hamed Miralto, Marco Vitulo, Nicola Zouine, Mohammed Barone, Amalia Bouzayen, Mondher Frusciante, Luigi Pezzotti, Mario Valle, Giorgio Chiusano, Maria Luisa |
description | Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses b |
doi_str_mv | 10.1186/s12859-018-2420-y |
format | Article |
fullrecord | <record><control><sourceid>hal</sourceid><recordid>TN_cdi_hal_primary_oai_HAL_hal_02623377v1</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>oai_HAL_hal_02623377v1</sourcerecordid><originalsourceid>FETCH-hal_primary_oai_HAL_hal_02623377v13</originalsourceid><addsrcrecordid>eNqVTLtOwzAUtRCIlscHsHllMPiRxOmIEKgDbOyRCTftRY4d-TqW-vekgoG103kfxu6UfFCqbR5J6bbeCKlaoSstxeGMrVVlldBK1uf_-IpdEX1LqWwr60u2MrLaWNPYNZvfZ5_RQwHP-zhOLrmMBfgnRgxDTOMie-I5cgwFKOPOZeBQop8zxuDSgSfw7shpjxNxF744TdDjgD1mBFqGfAcBliTE_Nu8YReD8wS3f3jN7l9fPp63Yu98NyUcl98uOuy2T2_d0ZO60cZYW5Q5pfsD-nJbdw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</title><source>DOAJ Directory of Open Access Journals</source><source>SpringerLink Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>PubMed Central</source><creator>Ambrosino, Luca ; Ruggieri, Valentino ; Bostan, Hamed ; Miralto, Marco ; Vitulo, Nicola ; Zouine, Mohammed ; Barone, Amalia ; Bouzayen, Mondher ; Frusciante, Luigi ; Pezzotti, Mario ; Valle, Giorgio ; Chiusano, Maria Luisa</creator><creatorcontrib>Ambrosino, Luca ; Ruggieri, Valentino ; Bostan, Hamed ; Miralto, Marco ; Vitulo, Nicola ; Zouine, Mohammed ; Barone, Amalia ; Bouzayen, Mondher ; Frusciante, Luigi ; Pezzotti, Mario ; Valle, Giorgio ; Chiusano, Maria Luisa</creatorcontrib><description>Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/s12859-018-2420-y</identifier><identifier>PMID: 30497367</identifier><language>eng</language><publisher>BioMed Central</publisher><subject>Life Sciences ; Vegetal Biology</subject><ispartof>BMC bioinformatics, 2018-11, Vol.19 (15), p.85-143</ispartof><rights>Attribution</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0001-7630-1449 ; 0000-0001-7630-1449</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,777,781,861,882,27905,27906</link.rule.ids><backlink>$$Uhttps://hal.inrae.fr/hal-02623377$$DView record in HAL$$Hfree_for_read</backlink></links><search><creatorcontrib>Ambrosino, Luca</creatorcontrib><creatorcontrib>Ruggieri, Valentino</creatorcontrib><creatorcontrib>Bostan, Hamed</creatorcontrib><creatorcontrib>Miralto, Marco</creatorcontrib><creatorcontrib>Vitulo, Nicola</creatorcontrib><creatorcontrib>Zouine, Mohammed</creatorcontrib><creatorcontrib>Barone, Amalia</creatorcontrib><creatorcontrib>Bouzayen, Mondher</creatorcontrib><creatorcontrib>Frusciante, Luigi</creatorcontrib><creatorcontrib>Pezzotti, Mario</creatorcontrib><creatorcontrib>Valle, Giorgio</creatorcontrib><creatorcontrib>Chiusano, Maria Luisa</creatorcontrib><title>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</title><title>BMC bioinformatics</title><description>Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.</description><subject>Life Sciences</subject><subject>Vegetal Biology</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNqVTLtOwzAUtRCIlscHsHllMPiRxOmIEKgDbOyRCTftRY4d-TqW-vekgoG103kfxu6UfFCqbR5J6bbeCKlaoSstxeGMrVVlldBK1uf_-IpdEX1LqWwr60u2MrLaWNPYNZvfZ5_RQwHP-zhOLrmMBfgnRgxDTOMie-I5cgwFKOPOZeBQop8zxuDSgSfw7shpjxNxF744TdDjgD1mBFqGfAcBliTE_Nu8YReD8wS3f3jN7l9fPp63Yu98NyUcl98uOuy2T2_d0ZO60cZYW5Q5pfsD-nJbdw</recordid><startdate>201811</startdate><enddate>201811</enddate><creator>Ambrosino, Luca</creator><creator>Ruggieri, Valentino</creator><creator>Bostan, Hamed</creator><creator>Miralto, Marco</creator><creator>Vitulo, Nicola</creator><creator>Zouine, Mohammed</creator><creator>Barone, Amalia</creator><creator>Bouzayen, Mondher</creator><creator>Frusciante, Luigi</creator><creator>Pezzotti, Mario</creator><creator>Valle, Giorgio</creator><creator>Chiusano, Maria Luisa</creator><general>BioMed Central</general><scope>1XC</scope><scope>VOOES</scope><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid></search><sort><creationdate>201811</creationdate><title>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</title><author>Ambrosino, Luca ; Ruggieri, Valentino ; Bostan, Hamed ; Miralto, Marco ; Vitulo, Nicola ; Zouine, Mohammed ; Barone, Amalia ; Bouzayen, Mondher ; Frusciante, Luigi ; Pezzotti, Mario ; Valle, Giorgio ; Chiusano, Maria Luisa</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-hal_primary_oai_HAL_hal_02623377v13</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Life Sciences</topic><topic>Vegetal Biology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Ambrosino, Luca</creatorcontrib><creatorcontrib>Ruggieri, Valentino</creatorcontrib><creatorcontrib>Bostan, Hamed</creatorcontrib><creatorcontrib>Miralto, Marco</creatorcontrib><creatorcontrib>Vitulo, Nicola</creatorcontrib><creatorcontrib>Zouine, Mohammed</creatorcontrib><creatorcontrib>Barone, Amalia</creatorcontrib><creatorcontrib>Bouzayen, Mondher</creatorcontrib><creatorcontrib>Frusciante, Luigi</creatorcontrib><creatorcontrib>Pezzotti, Mario</creatorcontrib><creatorcontrib>Valle, Giorgio</creatorcontrib><creatorcontrib>Chiusano, Maria Luisa</creatorcontrib><collection>Hyper Article en Ligne (HAL)</collection><collection>Hyper Article en Ligne (HAL) (Open Access)</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Ambrosino, Luca</au><au>Ruggieri, Valentino</au><au>Bostan, Hamed</au><au>Miralto, Marco</au><au>Vitulo, Nicola</au><au>Zouine, Mohammed</au><au>Barone, Amalia</au><au>Bouzayen, Mondher</au><au>Frusciante, Luigi</au><au>Pezzotti, Mario</au><au>Valle, Giorgio</au><au>Chiusano, Maria Luisa</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations</atitle><jtitle>BMC bioinformatics</jtitle><date>2018-11</date><risdate>2018</risdate><volume>19</volume><issue>15</issue><spage>85</spage><epage>143</epage><pages>85-143</pages><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>Background: “Omics” approaches may provide useful information for a deeper understanding of speciation events, diversification and function innovation. This can be achieved by investigating the molecular similarities at sequence level between species, allowing the definition of ortholog and paralog genes. However, the spreading of sequenced genome, often endowed with still preliminary annotations, requires suitable bioinformatics to be appropriately exploited in this framework. Results: We presented here a multilevel comparative approach to investigate on genome evolutionary relationships and peculiarities of two fleshy fruit species of relevant agronomic interest, Solanum lycopersicum (tomato) and Vitis vinifera (grapevine). We defined 17,823 orthology relationships between tomato and grapevine reference gene annotations. The resulting orthologs are associated with the detected paralogs in each species, permitting the definition of gene networks, useful to investigate the different relationships. The reconciliation of the compared collections in terms of an updating of the functional descriptions was also exploited. All the results were made accessible in ComParaLogs, a dedicated bioinformatics platform available at biosrv.cab.unina.it/comparalogs/gene/search Conclusions: The aim of the work was to suggest a reliable approach to detect all similarities of gene loci between two species based on the integration of results from different levels of information, such as the gene, the transcript and the protein sequences, overcoming possible limits due to exclusive protein versus protein comparisons. This to define reliable ortholog and paralog genes, as well as species specific gene loci in the two species, overcoming limits due to the possible draft nature of preliminary gene annotations. Moreover, reconciled functional descriptions, as well as common or peculiar enzymatic classes and protein domains from tomato and grapevine, together with the definition of species-specific gene sets after the pairwise comparisons, contributed a comprehensive set of information useful to comparatively exploit the two species gene annotations and investigate on differences between species with climacteric and non-climacteric fruits. In addition, the definition of networks of ortholog genes and of associated paralogs, and the organization of web-based interfaces for the exploration of the results, defined a friendly computational bench-work in support of comparative analyses between two species.</abstract><pub>BioMed Central</pub><pmid>30497367</pmid><doi>10.1186/s12859-018-2420-y</doi><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid><orcidid>https://orcid.org/0000-0001-7630-1449</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1471-2105 |
ispartof | BMC bioinformatics, 2018-11, Vol.19 (15), p.85-143 |
issn | 1471-2105 1471-2105 |
language | eng |
recordid | cdi_hal_primary_oai_HAL_hal_02623377v1 |
source | DOAJ Directory of Open Access Journals; SpringerLink Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central Open Access; Springer Nature OA Free Journals; PubMed Central |
subjects | Life Sciences Vegetal Biology |
title | Multilevel comparative bioinformatics to investigate evolutionary relationships and specificities in gene annotations |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T12%3A20%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-hal&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Multilevel%20comparative%20bioinformatics%20to%20investigate%20evolutionary%20relationships%20and%20specificities%20in%20gene%20annotations&rft.jtitle=BMC%20bioinformatics&rft.au=Ambrosino,%20Luca&rft.date=2018-11&rft.volume=19&rft.issue=15&rft.spage=85&rft.epage=143&rft.pages=85-143&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/s12859-018-2420-y&rft_dat=%3Chal%3Eoai_HAL_hal_02623377v1%3C/hal%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/30497367&rfr_iscdi=true |