Gene Tree Discordance Causes Apparent Substitution Rate Variation

Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampan...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Systematic biology 2016-07, Vol.65 (4), p.711-721
Hauptverfasser: Mendes, Fábio K., Hahn, Matthew W.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 721
container_issue 4
container_start_page 711
container_title Systematic biology
container_volume 65
creator Mendes, Fábio K.
Hahn, Matthew W.
description Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampant in genome-wide data sets, often due to incomplete lineage sorting (ILS). This apparent substitution rate variation is caused when substitutions that occur on discordant gene trees are analyzed in the context of a single, fixed species tree. Such substitutions have to be resolved by proposing multiple substitutions on the species tree, and we therefore refer to this phenomenon as Substitutions Produced by ILS (SPILS). We use simulations to demonstrate that SPILS has a larger effect with increasing levels of ILS, and on trees with larger numbers of taxa. Specific branches of the species trees are consistently, but erroneously, inferred to be longer or shorter, and we show that these branches can be predicted based on discordant tree topologies. Moreover, we observe that fixing a species tree topology when performing tests of positive selection increases the false positive rate, particularly for genes whose discordant topologies are most affected by SPILS. Finally, we use data from multiple Drosophila species to show that SPILS can be detected in nature. Although the effects of SPILS are modest per gene, it has the potential to affect substitution rate variation whenever high levels of ILS are present, particularly in rapid radiations. The problems outlined here have implications for character mapping of any type of trait, and for any biological process that causes discordance. We discuss possible solutions to these problems, and areas in which they are likely to have caused faulty inferences of convergence and accelerated evolution.
doi_str_mv 10.1093/sysbio/syw018
format Article
fullrecord <record><control><sourceid>jstor_proqu</sourceid><recordid>TN_cdi_proquest_miscellaneous_1797871177</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><jstor_id>44028788</jstor_id><oup_id>10.1093/sysbio/syw018</oup_id><sourcerecordid>44028788</sourcerecordid><originalsourceid>FETCH-LOGICAL-c415t-9aa8b61fa0d743809b9f279996e958ce746881be03c38a9d6dfcdeffb4209e463</originalsourceid><addsrcrecordid>eNqFkEtLw0AUhQdRbK0uXSoBN26iM3nMY1mqVqEgaBV3YTK5gZQ2E-eB9N87IbWCG1fnXvg4fByEzgm-IVikt3Zry0aH-MKEH6AxwYzGPKUfh_1N0zgnORuhE2tXGBNCc3KMRgkVCRMUj9F0Di1ESwMQ3TVWaVPJVkE0k96CjaZdJw20Lnr1pXWN867RbfQiHUTv0jSyf0_RUS3XFs52OUFvD_fL2WO8eJ4_zaaLWGUkd7GQkpeU1BJXLEs5FqWog4MQFETOFbCMck5KwKlKuRQVrWpVQV2XWYIFZDSdoOuhtzP604N1xSYIw3otW9DeFoQJxhkhjAX06g-60t60wa6nBGc04TxQ8UApo601UBedaTbSbAuCi37bYti2GLYN_OWu1ZcbqPb0z5i_htp3_3ZdDOjKOm32cJbhhLPg9g0VFo5F</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1799876288</pqid></control><display><type>article</type><title>Gene Tree Discordance Causes Apparent Substitution Rate Variation</title><source>MEDLINE</source><source>Jstor Complete Legacy</source><source>Oxford University Press Journals All Titles (1996-Current)</source><source>Alma/SFX Local Collection</source><creator>Mendes, Fábio K. ; Hahn, Matthew W.</creator><creatorcontrib>Mendes, Fábio K. ; Hahn, Matthew W.</creatorcontrib><description>Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampant in genome-wide data sets, often due to incomplete lineage sorting (ILS). This apparent substitution rate variation is caused when substitutions that occur on discordant gene trees are analyzed in the context of a single, fixed species tree. Such substitutions have to be resolved by proposing multiple substitutions on the species tree, and we therefore refer to this phenomenon as Substitutions Produced by ILS (SPILS). We use simulations to demonstrate that SPILS has a larger effect with increasing levels of ILS, and on trees with larger numbers of taxa. Specific branches of the species trees are consistently, but erroneously, inferred to be longer or shorter, and we show that these branches can be predicted based on discordant tree topologies. Moreover, we observe that fixing a species tree topology when performing tests of positive selection increases the false positive rate, particularly for genes whose discordant topologies are most affected by SPILS. Finally, we use data from multiple Drosophila species to show that SPILS can be detected in nature. Although the effects of SPILS are modest per gene, it has the potential to affect substitution rate variation whenever high levels of ILS are present, particularly in rapid radiations. The problems outlined here have implications for character mapping of any type of trait, and for any biological process that causes discordance. We discuss possible solutions to these problems, and areas in which they are likely to have caused faulty inferences of convergence and accelerated evolution.</description><identifier>ISSN: 1063-5157</identifier><identifier>EISSN: 1076-836X</identifier><identifier>DOI: 10.1093/sysbio/syw018</identifier><identifier>PMID: 26927960</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Amino Acid Substitution ; Animals ; Drosophila ; Drosophila - genetics ; Evolution ; Evolution, Molecular ; Evolutionary biology ; Genes ; Genetic variation ; Genetics ; Genome ; Genomes ; Genomics ; Insects ; Models, Genetic ; Phylogeny ; Population size ; Positive selection ; Simulation ; Speciation ; Systematic biology ; Taxa ; Taxonomy ; Topology</subject><ispartof>Systematic biology, 2016-07, Vol.65 (4), p.711-721</ispartof><rights>Copyright © 2016 Society of Systematic Biologists</rights><rights>The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2016</rights><rights>The Author(s) 2016. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.</rights><rights>Copyright Oxford University Press, UK Jul 2016</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c415t-9aa8b61fa0d743809b9f279996e958ce746881be03c38a9d6dfcdeffb4209e463</citedby><cites>FETCH-LOGICAL-c415t-9aa8b61fa0d743809b9f279996e958ce746881be03c38a9d6dfcdeffb4209e463</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.jstor.org/stable/pdf/44028788$$EPDF$$P50$$Gjstor$$H</linktopdf><linktohtml>$$Uhttps://www.jstor.org/stable/44028788$$EHTML$$P50$$Gjstor$$H</linktohtml><link.rule.ids>314,777,781,800,1579,27905,27906,57998,58231</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26927960$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Mendes, Fábio K.</creatorcontrib><creatorcontrib>Hahn, Matthew W.</creatorcontrib><title>Gene Tree Discordance Causes Apparent Substitution Rate Variation</title><title>Systematic biology</title><addtitle>Syst Biol</addtitle><description>Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampant in genome-wide data sets, often due to incomplete lineage sorting (ILS). This apparent substitution rate variation is caused when substitutions that occur on discordant gene trees are analyzed in the context of a single, fixed species tree. Such substitutions have to be resolved by proposing multiple substitutions on the species tree, and we therefore refer to this phenomenon as Substitutions Produced by ILS (SPILS). We use simulations to demonstrate that SPILS has a larger effect with increasing levels of ILS, and on trees with larger numbers of taxa. Specific branches of the species trees are consistently, but erroneously, inferred to be longer or shorter, and we show that these branches can be predicted based on discordant tree topologies. Moreover, we observe that fixing a species tree topology when performing tests of positive selection increases the false positive rate, particularly for genes whose discordant topologies are most affected by SPILS. Finally, we use data from multiple Drosophila species to show that SPILS can be detected in nature. Although the effects of SPILS are modest per gene, it has the potential to affect substitution rate variation whenever high levels of ILS are present, particularly in rapid radiations. The problems outlined here have implications for character mapping of any type of trait, and for any biological process that causes discordance. We discuss possible solutions to these problems, and areas in which they are likely to have caused faulty inferences of convergence and accelerated evolution.</description><subject>Amino Acid Substitution</subject><subject>Animals</subject><subject>Drosophila</subject><subject>Drosophila - genetics</subject><subject>Evolution</subject><subject>Evolution, Molecular</subject><subject>Evolutionary biology</subject><subject>Genes</subject><subject>Genetic variation</subject><subject>Genetics</subject><subject>Genome</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Insects</subject><subject>Models, Genetic</subject><subject>Phylogeny</subject><subject>Population size</subject><subject>Positive selection</subject><subject>Simulation</subject><subject>Speciation</subject><subject>Systematic biology</subject><subject>Taxa</subject><subject>Taxonomy</subject><subject>Topology</subject><issn>1063-5157</issn><issn>1076-836X</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkEtLw0AUhQdRbK0uXSoBN26iM3nMY1mqVqEgaBV3YTK5gZQ2E-eB9N87IbWCG1fnXvg4fByEzgm-IVikt3Zry0aH-MKEH6AxwYzGPKUfh_1N0zgnORuhE2tXGBNCc3KMRgkVCRMUj9F0Di1ESwMQ3TVWaVPJVkE0k96CjaZdJw20Lnr1pXWN867RbfQiHUTv0jSyf0_RUS3XFs52OUFvD_fL2WO8eJ4_zaaLWGUkd7GQkpeU1BJXLEs5FqWog4MQFETOFbCMck5KwKlKuRQVrWpVQV2XWYIFZDSdoOuhtzP604N1xSYIw3otW9DeFoQJxhkhjAX06g-60t60wa6nBGc04TxQ8UApo601UBedaTbSbAuCi37bYti2GLYN_OWu1ZcbqPb0z5i_htp3_3ZdDOjKOm32cJbhhLPg9g0VFo5F</recordid><startdate>20160701</startdate><enddate>20160701</enddate><creator>Mendes, Fábio K.</creator><creator>Hahn, Matthew W.</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>K9.</scope><scope>7X8</scope></search><sort><creationdate>20160701</creationdate><title>Gene Tree Discordance Causes Apparent Substitution Rate Variation</title><author>Mendes, Fábio K. ; Hahn, Matthew W.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c415t-9aa8b61fa0d743809b9f279996e958ce746881be03c38a9d6dfcdeffb4209e463</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Amino Acid Substitution</topic><topic>Animals</topic><topic>Drosophila</topic><topic>Drosophila - genetics</topic><topic>Evolution</topic><topic>Evolution, Molecular</topic><topic>Evolutionary biology</topic><topic>Genes</topic><topic>Genetic variation</topic><topic>Genetics</topic><topic>Genome</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Insects</topic><topic>Models, Genetic</topic><topic>Phylogeny</topic><topic>Population size</topic><topic>Positive selection</topic><topic>Simulation</topic><topic>Speciation</topic><topic>Systematic biology</topic><topic>Taxa</topic><topic>Taxonomy</topic><topic>Topology</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mendes, Fábio K.</creatorcontrib><creatorcontrib>Hahn, Matthew W.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>MEDLINE - Academic</collection><jtitle>Systematic biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mendes, Fábio K.</au><au>Hahn, Matthew W.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Gene Tree Discordance Causes Apparent Substitution Rate Variation</atitle><jtitle>Systematic biology</jtitle><addtitle>Syst Biol</addtitle><date>2016-07-01</date><risdate>2016</risdate><volume>65</volume><issue>4</issue><spage>711</spage><epage>721</epage><pages>711-721</pages><issn>1063-5157</issn><eissn>1076-836X</eissn><abstract>Substitution rates are known to be variable among genes, chromosomes, species, and lineages due to multifarious biological processes. Here, we consider another source of substitution rate variation due to a technical bias associated with gene tree discordance. Discordance has been found to be rampant in genome-wide data sets, often due to incomplete lineage sorting (ILS). This apparent substitution rate variation is caused when substitutions that occur on discordant gene trees are analyzed in the context of a single, fixed species tree. Such substitutions have to be resolved by proposing multiple substitutions on the species tree, and we therefore refer to this phenomenon as Substitutions Produced by ILS (SPILS). We use simulations to demonstrate that SPILS has a larger effect with increasing levels of ILS, and on trees with larger numbers of taxa. Specific branches of the species trees are consistently, but erroneously, inferred to be longer or shorter, and we show that these branches can be predicted based on discordant tree topologies. Moreover, we observe that fixing a species tree topology when performing tests of positive selection increases the false positive rate, particularly for genes whose discordant topologies are most affected by SPILS. Finally, we use data from multiple Drosophila species to show that SPILS can be detected in nature. Although the effects of SPILS are modest per gene, it has the potential to affect substitution rate variation whenever high levels of ILS are present, particularly in rapid radiations. The problems outlined here have implications for character mapping of any type of trait, and for any biological process that causes discordance. We discuss possible solutions to these problems, and areas in which they are likely to have caused faulty inferences of convergence and accelerated evolution.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>26927960</pmid><doi>10.1093/sysbio/syw018</doi><tpages>11</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1063-5157
ispartof Systematic biology, 2016-07, Vol.65 (4), p.711-721
issn 1063-5157
1076-836X
language eng
recordid cdi_proquest_miscellaneous_1797871177
source MEDLINE; Jstor Complete Legacy; Oxford University Press Journals All Titles (1996-Current); Alma/SFX Local Collection
subjects Amino Acid Substitution
Animals
Drosophila
Drosophila - genetics
Evolution
Evolution, Molecular
Evolutionary biology
Genes
Genetic variation
Genetics
Genome
Genomes
Genomics
Insects
Models, Genetic
Phylogeny
Population size
Positive selection
Simulation
Speciation
Systematic biology
Taxa
Taxonomy
Topology
title Gene Tree Discordance Causes Apparent Substitution Rate Variation
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T06%3A39%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-jstor_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Gene%20Tree%20Discordance%20Causes%20Apparent%20Substitution%20Rate%20Variation&rft.jtitle=Systematic%20biology&rft.au=Mendes,%20F%C3%A1bio%20K.&rft.date=2016-07-01&rft.volume=65&rft.issue=4&rft.spage=711&rft.epage=721&rft.pages=711-721&rft.issn=1063-5157&rft.eissn=1076-836X&rft_id=info:doi/10.1093/sysbio/syw018&rft_dat=%3Cjstor_proqu%3E44028788%3C/jstor_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1799876288&rft_id=info:pmid/26927960&rft_jstor_id=44028788&rft_oup_id=10.1093/sysbio/syw018&rfr_iscdi=true