The geometry of admixture in population genetics: the blessing of dimensionality

We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The in...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Genetics (Austin) 2024-10, Vol.228 (2)
Hauptverfasser: Oteo, José-Angel, Oteo-García, Gonzalo
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue 2
container_start_page
container_title Genetics (Austin)
container_volume 228
creator Oteo, José-Angel
Oteo-García, Gonzalo
description We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.
doi_str_mv 10.1093/genetics/iyae134
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3092871841</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3092871841</sourcerecordid><originalsourceid>FETCH-LOGICAL-c224t-96087e93a23acfb4169153795eb43d58b124216bd697807d9ac4f872ee4e521c3</originalsourceid><addsrcrecordid>eNpNkD1PwzAQhi0EoqWwM6GMLKH-SmKzoYovqRIMZbYc51KMkjjYjkT-PanaIqa74X3e0z0IXRN8R7Bkyy10EK0JSztqIIyfoDmRnKU0Z-T03z5DFyF8YYxzmYlzNGOSMCE4naP3zSckW3AtRD8mrk501dqfOHhIbJf0rh8aHa3rkuOp-yRORNlACLbb7ojKttCFKaMbG8dLdFbrJsDVYS7Qx9PjZvWSrt-eX1cP69RQymMqcywKkExTpk1dcpJLkrFCZlByVmWiJJRTkpdVLguBi0pqw2tRUAAOGSWGLdDtvrf37nuAEFVrg4Gm0R24ISiGJRUFEZxMUbyPGu9C8FCr3ttW-1ERrHYe1fE5dfA4ITeH9qFsofoDjuLYL1J8cgY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3092871841</pqid></control><display><type>article</type><title>The geometry of admixture in population genetics: the blessing of dimensionality</title><source>MEDLINE</source><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Oteo, José-Angel ; Oteo-García, Gonzalo</creator><contributor>Novembre, J</contributor><creatorcontrib>Oteo, José-Angel ; Oteo-García, Gonzalo ; Novembre, J</creatorcontrib><description>We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.</description><identifier>ISSN: 1943-2631</identifier><identifier>EISSN: 1943-2631</identifier><identifier>DOI: 10.1093/genetics/iyae134</identifier><identifier>PMID: 39138842</identifier><language>eng</language><publisher>United States</publisher><subject>Gene Flow ; Genetics, Population - methods ; Humans ; Models, Genetic ; Phylogeny</subject><ispartof>Genetics (Austin), 2024-10, Vol.228 (2)</ispartof><rights>The Author(s) 2024. Published by Oxford University Press on behalf of The Genetics Society of America.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c224t-96087e93a23acfb4169153795eb43d58b124216bd697807d9ac4f872ee4e521c3</cites><orcidid>0000-0002-0957-4014</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39138842$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Novembre, J</contributor><creatorcontrib>Oteo, José-Angel</creatorcontrib><creatorcontrib>Oteo-García, Gonzalo</creatorcontrib><title>The geometry of admixture in population genetics: the blessing of dimensionality</title><title>Genetics (Austin)</title><addtitle>Genetics</addtitle><description>We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.</description><subject>Gene Flow</subject><subject>Genetics, Population - methods</subject><subject>Humans</subject><subject>Models, Genetic</subject><subject>Phylogeny</subject><issn>1943-2631</issn><issn>1943-2631</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpNkD1PwzAQhi0EoqWwM6GMLKH-SmKzoYovqRIMZbYc51KMkjjYjkT-PanaIqa74X3e0z0IXRN8R7Bkyy10EK0JSztqIIyfoDmRnKU0Z-T03z5DFyF8YYxzmYlzNGOSMCE4naP3zSckW3AtRD8mrk501dqfOHhIbJf0rh8aHa3rkuOp-yRORNlACLbb7ojKttCFKaMbG8dLdFbrJsDVYS7Qx9PjZvWSrt-eX1cP69RQymMqcywKkExTpk1dcpJLkrFCZlByVmWiJJRTkpdVLguBi0pqw2tRUAAOGSWGLdDtvrf37nuAEFVrg4Gm0R24ISiGJRUFEZxMUbyPGu9C8FCr3ttW-1ERrHYe1fE5dfA4ITeH9qFsofoDjuLYL1J8cgY</recordid><startdate>20241007</startdate><enddate>20241007</enddate><creator>Oteo, José-Angel</creator><creator>Oteo-García, Gonzalo</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-0957-4014</orcidid></search><sort><creationdate>20241007</creationdate><title>The geometry of admixture in population genetics: the blessing of dimensionality</title><author>Oteo, José-Angel ; Oteo-García, Gonzalo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c224t-96087e93a23acfb4169153795eb43d58b124216bd697807d9ac4f872ee4e521c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Gene Flow</topic><topic>Genetics, Population - methods</topic><topic>Humans</topic><topic>Models, Genetic</topic><topic>Phylogeny</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oteo, José-Angel</creatorcontrib><creatorcontrib>Oteo-García, Gonzalo</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Genetics (Austin)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oteo, José-Angel</au><au>Oteo-García, Gonzalo</au><au>Novembre, J</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The geometry of admixture in population genetics: the blessing of dimensionality</atitle><jtitle>Genetics (Austin)</jtitle><addtitle>Genetics</addtitle><date>2024-10-07</date><risdate>2024</risdate><volume>228</volume><issue>2</issue><issn>1943-2631</issn><eissn>1943-2631</eissn><abstract>We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.</abstract><cop>United States</cop><pmid>39138842</pmid><doi>10.1093/genetics/iyae134</doi><orcidid>https://orcid.org/0000-0002-0957-4014</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1943-2631
ispartof Genetics (Austin), 2024-10, Vol.228 (2)
issn 1943-2631
1943-2631
language eng
recordid cdi_proquest_miscellaneous_3092871841
source MEDLINE; Oxford University Press Journals All Titles (1996-Current)
subjects Gene Flow
Genetics, Population - methods
Humans
Models, Genetic
Phylogeny
title The geometry of admixture in population genetics: the blessing of dimensionality
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T21%3A10%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20geometry%20of%20admixture%20in%20population%20genetics:%20the%20blessing%20of%20dimensionality&rft.jtitle=Genetics%20(Austin)&rft.au=Oteo,%20Jos%C3%A9-Angel&rft.date=2024-10-07&rft.volume=228&rft.issue=2&rft.issn=1943-2631&rft.eissn=1943-2631&rft_id=info:doi/10.1093/genetics/iyae134&rft_dat=%3Cproquest_cross%3E3092871841%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3092871841&rft_id=info:pmid/39138842&rfr_iscdi=true