Five-leaf Generalizations of the D-statistic Reveal the Directionality of Admixture
Abstract Over the past 15 years, the D-statistic, a four-taxon test for organismal admixture (hybridization, or introgression) which incorporates single nucleotide polymorphism data with allelic patterns ABBA and BABA, has seen considerable use. This statistic seeks to discern significant deviation...
Gespeichert in:
Veröffentlicht in: | Molecular biology and evolution 2024-11, Vol.41 (11) |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 11 |
container_start_page | |
container_title | Molecular biology and evolution |
container_volume | 41 |
creator | Leppälä, Kalle da Silva Coelho, Flavio Augusto Richter, Michaela Albert, Victor A Lindqvist, Charlotte |
description | Abstract
Over the past 15 years, the D-statistic, a four-taxon test for organismal admixture (hybridization, or introgression) which incorporates single nucleotide polymorphism data with allelic patterns ABBA and BABA, has seen considerable use. This statistic seeks to discern significant deviation from either a given species tree assumption, or from the balanced incomplete lineage sorting that could otherwise defy this species tree. However, while the D-statistic can successfully discriminate admixture from incomplete lineage sorting, it is not a simple matter to determine the directionality of admixture using only four-leaf tree models. As such, methods have been developed that use five leaves to evaluate admixture. Among these, the DFOIL method (“FOIL”, a mnemonic for “First-Outer-Inner-Last”), which tests allelic patterns on the “symmetric” tree S=(((1,2),(3,4)),5), succeeds in finding admixture direction for many five-taxon examples. However, DFOIL does not make full use of all symmetry, nor can DFOIL function properly when ancient samples are included because of the reliance on singleton patterns (such as BAAAA and ABAAA). Here, we take inspiration from DFOIL to develop a new and completely general family of five-leaf admixture tests, dubbed Δ-statistics, that can either incorporate or exclude the singleton allelic patterns depending on individual taxon and age sampling choices. We describe two new shapes that are also fully testable, namely the “asymmetric” tree A=(((1,2),3),4),5) and the “quasisymmetric” tree Q=(((1,2),3),(4,5)), which can considerably supplement the “symmetric” S=(((1,2),(3,4)),5) model used by DFOIL. We demonstrate the consistency of Δ-statistics under various simulated scenarios, and provide empirical examples using data from black, brown and polar bears, the latter also including two ancient polar bear samples from previous studies. Recently, DFOIL and one of these ancient samples was used to argue for a dominant polar bear → brown bear introgression direction. However, we find, using both this ancient polar bear and our own, that by far the strongest signal using both DFOIL and Δ-statistics on tree S is actually bidirectional gene flow of indistinguishable direction. Further experiments on trees A and Q instead highlight what were likely two phases of admixture: one with stronger brown bear → polar bear introgression in ancient times, and a more recent phase with predominant polar bear → brown bear directionality. |
doi_str_mv | 10.1093/molbev/msae198 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_3107162553</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/molbev/msae198</oup_id><sourcerecordid>3107162553</sourcerecordid><originalsourceid>FETCH-LOGICAL-c310t-d8c043c8fce9b96f94be90a710202e4d4df4dfc5479d578fd728d70163b866e53</originalsourceid><addsrcrecordid>eNqFkc9LwzAUx4Mobk6vHqVHPXRLmrZpTjKmm4Ig-OMc0vTVRdpmNulw_vVmdA49CYEX3vu8b77ki9A5wWOCOZ3UpsphPamtBMKzAzQkCWUhYYQfoiFm_h5jmg3QibXvGJM4TtNjNKCc4ogkfIie53oNYQWyDBbQQCsr_SWdNo0NTBm4JQQ3oXW-Y51WwROsQVZ9W7egtqDfcJstPC1q_em6Fk7RUSkrC2e7OkKv89uX2V348Li4n00fQkUJdmGRKRxTlZUKeM7Tksc5cCwZwRGOIC7iovRHJTHjRcKysmBRVjBMUppnaQoJHaHrXnfV5TUUChrn_YtVq2vZboSRWvydNHop3sxaEMJwFlHiFS53Cq356MA6UWuroKpkA6azwvtkJI2ShHp03KOqNda2UO7fIVhsoxB9FGIXhV-4-O1uj__8vQeuesB0q__EvgH52ZdE</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3107162553</pqid></control><display><type>article</type><title>Five-leaf Generalizations of the D-statistic Reveal the Directionality of Admixture</title><source>Oxford Journals Open Access Collection</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>PubMed Central</source><source>Alma/SFX Local Collection</source><source>Free Full-Text Journals in Chemistry</source><creator>Leppälä, Kalle ; da Silva Coelho, Flavio Augusto ; Richter, Michaela ; Albert, Victor A ; Lindqvist, Charlotte</creator><contributor>Yoder, Anne</contributor><creatorcontrib>Leppälä, Kalle ; da Silva Coelho, Flavio Augusto ; Richter, Michaela ; Albert, Victor A ; Lindqvist, Charlotte ; Yoder, Anne</creatorcontrib><description>Abstract
Over the past 15 years, the D-statistic, a four-taxon test for organismal admixture (hybridization, or introgression) which incorporates single nucleotide polymorphism data with allelic patterns ABBA and BABA, has seen considerable use. This statistic seeks to discern significant deviation from either a given species tree assumption, or from the balanced incomplete lineage sorting that could otherwise defy this species tree. However, while the D-statistic can successfully discriminate admixture from incomplete lineage sorting, it is not a simple matter to determine the directionality of admixture using only four-leaf tree models. As such, methods have been developed that use five leaves to evaluate admixture. Among these, the DFOIL method (“FOIL”, a mnemonic for “First-Outer-Inner-Last”), which tests allelic patterns on the “symmetric” tree S=(((1,2),(3,4)),5), succeeds in finding admixture direction for many five-taxon examples. However, DFOIL does not make full use of all symmetry, nor can DFOIL function properly when ancient samples are included because of the reliance on singleton patterns (such as BAAAA and ABAAA). Here, we take inspiration from DFOIL to develop a new and completely general family of five-leaf admixture tests, dubbed Δ-statistics, that can either incorporate or exclude the singleton allelic patterns depending on individual taxon and age sampling choices. We describe two new shapes that are also fully testable, namely the “asymmetric” tree A=(((1,2),3),4),5) and the “quasisymmetric” tree Q=(((1,2),3),(4,5)), which can considerably supplement the “symmetric” S=(((1,2),(3,4)),5) model used by DFOIL. We demonstrate the consistency of Δ-statistics under various simulated scenarios, and provide empirical examples using data from black, brown and polar bears, the latter also including two ancient polar bear samples from previous studies. Recently, DFOIL and one of these ancient samples was used to argue for a dominant polar bear → brown bear introgression direction. However, we find, using both this ancient polar bear and our own, that by far the strongest signal using both DFOIL and Δ-statistics on tree S is actually bidirectional gene flow of indistinguishable direction. Further experiments on trees A and Q instead highlight what were likely two phases of admixture: one with stronger brown bear → polar bear introgression in ancient times, and a more recent phase with predominant polar bear → brown bear directionality.</description><identifier>ISSN: 0737-4038</identifier><identifier>ISSN: 1537-1719</identifier><identifier>EISSN: 1537-1719</identifier><identifier>DOI: 10.1093/molbev/msae198</identifier><identifier>PMID: 39302159</identifier><language>eng</language><publisher>UK: Oxford University Press</publisher><subject>Alleles ; Animals ; Genetic Introgression ; Hybridization, Genetic ; Methods ; Models, Genetic ; Phylogeny ; Polymorphism, Single Nucleotide</subject><ispartof>Molecular biology and evolution, 2024-11, Vol.41 (11)</ispartof><rights>The Author(s) 2024. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. 2024</rights><rights>The Author(s) 2024. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c310t-d8c043c8fce9b96f94be90a710202e4d4df4dfc5479d578fd728d70163b866e53</cites><orcidid>0000-0002-2304-7595 ; 0000-0002-0262-826X ; 0000-0001-7537-4902 ; 0000-0002-4190-727X ; 0000-0003-0236-7044</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708231/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC11708231/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,1598,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39302159$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Yoder, Anne</contributor><creatorcontrib>Leppälä, Kalle</creatorcontrib><creatorcontrib>da Silva Coelho, Flavio Augusto</creatorcontrib><creatorcontrib>Richter, Michaela</creatorcontrib><creatorcontrib>Albert, Victor A</creatorcontrib><creatorcontrib>Lindqvist, Charlotte</creatorcontrib><title>Five-leaf Generalizations of the D-statistic Reveal the Directionality of Admixture</title><title>Molecular biology and evolution</title><addtitle>Mol Biol Evol</addtitle><description>Abstract
Over the past 15 years, the D-statistic, a four-taxon test for organismal admixture (hybridization, or introgression) which incorporates single nucleotide polymorphism data with allelic patterns ABBA and BABA, has seen considerable use. This statistic seeks to discern significant deviation from either a given species tree assumption, or from the balanced incomplete lineage sorting that could otherwise defy this species tree. However, while the D-statistic can successfully discriminate admixture from incomplete lineage sorting, it is not a simple matter to determine the directionality of admixture using only four-leaf tree models. As such, methods have been developed that use five leaves to evaluate admixture. Among these, the DFOIL method (“FOIL”, a mnemonic for “First-Outer-Inner-Last”), which tests allelic patterns on the “symmetric” tree S=(((1,2),(3,4)),5), succeeds in finding admixture direction for many five-taxon examples. However, DFOIL does not make full use of all symmetry, nor can DFOIL function properly when ancient samples are included because of the reliance on singleton patterns (such as BAAAA and ABAAA). Here, we take inspiration from DFOIL to develop a new and completely general family of five-leaf admixture tests, dubbed Δ-statistics, that can either incorporate or exclude the singleton allelic patterns depending on individual taxon and age sampling choices. We describe two new shapes that are also fully testable, namely the “asymmetric” tree A=(((1,2),3),4),5) and the “quasisymmetric” tree Q=(((1,2),3),(4,5)), which can considerably supplement the “symmetric” S=(((1,2),(3,4)),5) model used by DFOIL. We demonstrate the consistency of Δ-statistics under various simulated scenarios, and provide empirical examples using data from black, brown and polar bears, the latter also including two ancient polar bear samples from previous studies. Recently, DFOIL and one of these ancient samples was used to argue for a dominant polar bear → brown bear introgression direction. However, we find, using both this ancient polar bear and our own, that by far the strongest signal using both DFOIL and Δ-statistics on tree S is actually bidirectional gene flow of indistinguishable direction. Further experiments on trees A and Q instead highlight what were likely two phases of admixture: one with stronger brown bear → polar bear introgression in ancient times, and a more recent phase with predominant polar bear → brown bear directionality.</description><subject>Alleles</subject><subject>Animals</subject><subject>Genetic Introgression</subject><subject>Hybridization, Genetic</subject><subject>Methods</subject><subject>Models, Genetic</subject><subject>Phylogeny</subject><subject>Polymorphism, Single Nucleotide</subject><issn>0737-4038</issn><issn>1537-1719</issn><issn>1537-1719</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>TOX</sourceid><sourceid>EIF</sourceid><recordid>eNqFkc9LwzAUx4Mobk6vHqVHPXRLmrZpTjKmm4Ig-OMc0vTVRdpmNulw_vVmdA49CYEX3vu8b77ki9A5wWOCOZ3UpsphPamtBMKzAzQkCWUhYYQfoiFm_h5jmg3QibXvGJM4TtNjNKCc4ogkfIie53oNYQWyDBbQQCsr_SWdNo0NTBm4JQQ3oXW-Y51WwROsQVZ9W7egtqDfcJstPC1q_em6Fk7RUSkrC2e7OkKv89uX2V348Li4n00fQkUJdmGRKRxTlZUKeM7Tksc5cCwZwRGOIC7iovRHJTHjRcKysmBRVjBMUppnaQoJHaHrXnfV5TUUChrn_YtVq2vZboSRWvydNHop3sxaEMJwFlHiFS53Cq356MA6UWuroKpkA6azwvtkJI2ShHp03KOqNda2UO7fIVhsoxB9FGIXhV-4-O1uj__8vQeuesB0q__EvgH52ZdE</recordid><startdate>20241101</startdate><enddate>20241101</enddate><creator>Leppälä, Kalle</creator><creator>da Silva Coelho, Flavio Augusto</creator><creator>Richter, Michaela</creator><creator>Albert, Victor A</creator><creator>Lindqvist, Charlotte</creator><general>Oxford University Press</general><scope>TOX</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-2304-7595</orcidid><orcidid>https://orcid.org/0000-0002-0262-826X</orcidid><orcidid>https://orcid.org/0000-0001-7537-4902</orcidid><orcidid>https://orcid.org/0000-0002-4190-727X</orcidid><orcidid>https://orcid.org/0000-0003-0236-7044</orcidid></search><sort><creationdate>20241101</creationdate><title>Five-leaf Generalizations of the D-statistic Reveal the Directionality of Admixture</title><author>Leppälä, Kalle ; da Silva Coelho, Flavio Augusto ; Richter, Michaela ; Albert, Victor A ; Lindqvist, Charlotte</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c310t-d8c043c8fce9b96f94be90a710202e4d4df4dfc5479d578fd728d70163b866e53</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Alleles</topic><topic>Animals</topic><topic>Genetic Introgression</topic><topic>Hybridization, Genetic</topic><topic>Methods</topic><topic>Models, Genetic</topic><topic>Phylogeny</topic><topic>Polymorphism, Single Nucleotide</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Leppälä, Kalle</creatorcontrib><creatorcontrib>da Silva Coelho, Flavio Augusto</creatorcontrib><creatorcontrib>Richter, Michaela</creatorcontrib><creatorcontrib>Albert, Victor A</creatorcontrib><creatorcontrib>Lindqvist, Charlotte</creatorcontrib><collection>Oxford Journals Open Access Collection</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Molecular biology and evolution</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Leppälä, Kalle</au><au>da Silva Coelho, Flavio Augusto</au><au>Richter, Michaela</au><au>Albert, Victor A</au><au>Lindqvist, Charlotte</au><au>Yoder, Anne</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Five-leaf Generalizations of the D-statistic Reveal the Directionality of Admixture</atitle><jtitle>Molecular biology and evolution</jtitle><addtitle>Mol Biol Evol</addtitle><date>2024-11-01</date><risdate>2024</risdate><volume>41</volume><issue>11</issue><issn>0737-4038</issn><issn>1537-1719</issn><eissn>1537-1719</eissn><abstract>Abstract
Over the past 15 years, the D-statistic, a four-taxon test for organismal admixture (hybridization, or introgression) which incorporates single nucleotide polymorphism data with allelic patterns ABBA and BABA, has seen considerable use. This statistic seeks to discern significant deviation from either a given species tree assumption, or from the balanced incomplete lineage sorting that could otherwise defy this species tree. However, while the D-statistic can successfully discriminate admixture from incomplete lineage sorting, it is not a simple matter to determine the directionality of admixture using only four-leaf tree models. As such, methods have been developed that use five leaves to evaluate admixture. Among these, the DFOIL method (“FOIL”, a mnemonic for “First-Outer-Inner-Last”), which tests allelic patterns on the “symmetric” tree S=(((1,2),(3,4)),5), succeeds in finding admixture direction for many five-taxon examples. However, DFOIL does not make full use of all symmetry, nor can DFOIL function properly when ancient samples are included because of the reliance on singleton patterns (such as BAAAA and ABAAA). Here, we take inspiration from DFOIL to develop a new and completely general family of five-leaf admixture tests, dubbed Δ-statistics, that can either incorporate or exclude the singleton allelic patterns depending on individual taxon and age sampling choices. We describe two new shapes that are also fully testable, namely the “asymmetric” tree A=(((1,2),3),4),5) and the “quasisymmetric” tree Q=(((1,2),3),(4,5)), which can considerably supplement the “symmetric” S=(((1,2),(3,4)),5) model used by DFOIL. We demonstrate the consistency of Δ-statistics under various simulated scenarios, and provide empirical examples using data from black, brown and polar bears, the latter also including two ancient polar bear samples from previous studies. Recently, DFOIL and one of these ancient samples was used to argue for a dominant polar bear → brown bear introgression direction. However, we find, using both this ancient polar bear and our own, that by far the strongest signal using both DFOIL and Δ-statistics on tree S is actually bidirectional gene flow of indistinguishable direction. Further experiments on trees A and Q instead highlight what were likely two phases of admixture: one with stronger brown bear → polar bear introgression in ancient times, and a more recent phase with predominant polar bear → brown bear directionality.</abstract><cop>UK</cop><pub>Oxford University Press</pub><pmid>39302159</pmid><doi>10.1093/molbev/msae198</doi><orcidid>https://orcid.org/0000-0002-2304-7595</orcidid><orcidid>https://orcid.org/0000-0002-0262-826X</orcidid><orcidid>https://orcid.org/0000-0001-7537-4902</orcidid><orcidid>https://orcid.org/0000-0002-4190-727X</orcidid><orcidid>https://orcid.org/0000-0003-0236-7044</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0737-4038 |
ispartof | Molecular biology and evolution, 2024-11, Vol.41 (11) |
issn | 0737-4038 1537-1719 1537-1719 |
language | eng |
recordid | cdi_proquest_miscellaneous_3107162553 |
source | Oxford Journals Open Access Collection; MEDLINE; DOAJ Directory of Open Access Journals; PubMed Central; Alma/SFX Local Collection; Free Full-Text Journals in Chemistry |
subjects | Alleles Animals Genetic Introgression Hybridization, Genetic Methods Models, Genetic Phylogeny Polymorphism, Single Nucleotide |
title | Five-leaf Generalizations of the D-statistic Reveal the Directionality of Admixture |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T13%3A16%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Five-leaf%20Generalizations%20of%20the%20D-statistic%20Reveal%20the%20Directionality%20of%20Admixture&rft.jtitle=Molecular%20biology%20and%20evolution&rft.au=Lepp%C3%A4l%C3%A4,%20Kalle&rft.date=2024-11-01&rft.volume=41&rft.issue=11&rft.issn=0737-4038&rft.eissn=1537-1719&rft_id=info:doi/10.1093/molbev/msae198&rft_dat=%3Cproquest_pubme%3E3107162553%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3107162553&rft_id=info:pmid/39302159&rft_oup_id=10.1093/molbev/msae198&rfr_iscdi=true |