The geometry of admixture in population genetics: the blessing of dimensionality
We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The in...
Gespeichert in:
Veröffentlicht in: | Genetics (Austin) 2024-10, Vol.228 (2) |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | 2 |
container_start_page | |
container_title | Genetics (Austin) |
container_volume | 228 |
creator | Oteo, José-Angel Oteo-García, Gonzalo |
description | We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases. |
doi_str_mv | 10.1093/genetics/iyae134 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3092871841</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3092871841</sourcerecordid><originalsourceid>FETCH-LOGICAL-c224t-96087e93a23acfb4169153795eb43d58b124216bd697807d9ac4f872ee4e521c3</originalsourceid><addsrcrecordid>eNpNkD1PwzAQhi0EoqWwM6GMLKH-SmKzoYovqRIMZbYc51KMkjjYjkT-PanaIqa74X3e0z0IXRN8R7Bkyy10EK0JSztqIIyfoDmRnKU0Z-T03z5DFyF8YYxzmYlzNGOSMCE4naP3zSckW3AtRD8mrk501dqfOHhIbJf0rh8aHa3rkuOp-yRORNlACLbb7ojKttCFKaMbG8dLdFbrJsDVYS7Qx9PjZvWSrt-eX1cP69RQymMqcywKkExTpk1dcpJLkrFCZlByVmWiJJRTkpdVLguBi0pqw2tRUAAOGSWGLdDtvrf37nuAEFVrg4Gm0R24ISiGJRUFEZxMUbyPGu9C8FCr3ttW-1ERrHYe1fE5dfA4ITeH9qFsofoDjuLYL1J8cgY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3092871841</pqid></control><display><type>article</type><title>The geometry of admixture in population genetics: the blessing of dimensionality</title><source>MEDLINE</source><source>Oxford University Press Journals All Titles (1996-Current)</source><creator>Oteo, José-Angel ; Oteo-García, Gonzalo</creator><contributor>Novembre, J</contributor><creatorcontrib>Oteo, José-Angel ; Oteo-García, Gonzalo ; Novembre, J</creatorcontrib><description>We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.</description><identifier>ISSN: 1943-2631</identifier><identifier>EISSN: 1943-2631</identifier><identifier>DOI: 10.1093/genetics/iyae134</identifier><identifier>PMID: 39138842</identifier><language>eng</language><publisher>United States</publisher><subject>Gene Flow ; Genetics, Population - methods ; Humans ; Models, Genetic ; Phylogeny</subject><ispartof>Genetics (Austin), 2024-10, Vol.228 (2)</ispartof><rights>The Author(s) 2024. Published by Oxford University Press on behalf of The Genetics Society of America.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c224t-96087e93a23acfb4169153795eb43d58b124216bd697807d9ac4f872ee4e521c3</cites><orcidid>0000-0002-0957-4014</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,780,784,27924,27925</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/39138842$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Novembre, J</contributor><creatorcontrib>Oteo, José-Angel</creatorcontrib><creatorcontrib>Oteo-García, Gonzalo</creatorcontrib><title>The geometry of admixture in population genetics: the blessing of dimensionality</title><title>Genetics (Austin)</title><addtitle>Genetics</addtitle><description>We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.</description><subject>Gene Flow</subject><subject>Genetics, Population - methods</subject><subject>Humans</subject><subject>Models, Genetic</subject><subject>Phylogeny</subject><issn>1943-2631</issn><issn>1943-2631</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpNkD1PwzAQhi0EoqWwM6GMLKH-SmKzoYovqRIMZbYc51KMkjjYjkT-PanaIqa74X3e0z0IXRN8R7Bkyy10EK0JSztqIIyfoDmRnKU0Z-T03z5DFyF8YYxzmYlzNGOSMCE4naP3zSckW3AtRD8mrk501dqfOHhIbJf0rh8aHa3rkuOp-yRORNlACLbb7ojKttCFKaMbG8dLdFbrJsDVYS7Qx9PjZvWSrt-eX1cP69RQymMqcywKkExTpk1dcpJLkrFCZlByVmWiJJRTkpdVLguBi0pqw2tRUAAOGSWGLdDtvrf37nuAEFVrg4Gm0R24ISiGJRUFEZxMUbyPGu9C8FCr3ttW-1ERrHYe1fE5dfA4ITeH9qFsofoDjuLYL1J8cgY</recordid><startdate>20241007</startdate><enddate>20241007</enddate><creator>Oteo, José-Angel</creator><creator>Oteo-García, Gonzalo</creator><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0002-0957-4014</orcidid></search><sort><creationdate>20241007</creationdate><title>The geometry of admixture in population genetics: the blessing of dimensionality</title><author>Oteo, José-Angel ; Oteo-García, Gonzalo</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c224t-96087e93a23acfb4169153795eb43d58b124216bd697807d9ac4f872ee4e521c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Gene Flow</topic><topic>Genetics, Population - methods</topic><topic>Humans</topic><topic>Models, Genetic</topic><topic>Phylogeny</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Oteo, José-Angel</creatorcontrib><creatorcontrib>Oteo-García, Gonzalo</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Genetics (Austin)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Oteo, José-Angel</au><au>Oteo-García, Gonzalo</au><au>Novembre, J</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>The geometry of admixture in population genetics: the blessing of dimensionality</atitle><jtitle>Genetics (Austin)</jtitle><addtitle>Genetics</addtitle><date>2024-10-07</date><risdate>2024</risdate><volume>228</volume><issue>2</issue><issn>1943-2631</issn><eissn>1943-2631</eissn><abstract>We present a geometry-based interpretation of the f-statistics framework, commonly used in population genetics to estimate phylogenetic relationships from genomic data. The focus is on the determination of the mixing coefficients in population admixture events subject to post-admixture drift. The interpretation takes advantage of the high dimension of the dataset and analyzes the problem as a dimensional reduction issue. We show that it is possible to think of the f-statistics technique as an implicit transformation of the genomic data from a phase space into a subspace where the mapped data structure is more similar to the ancestral admixture configuration. The 2-way mixing coefficient is, as a matter of fact, carried out implicitly in this subspace. In addition, we propose the admixture test to be evaluated in the subspace because the comparison with the conventional one provides an important assessment of the admixture model. The overarching geometric framework provides slightly more general formulas than the f-formalism by using a different rationale as a starting point. Explicitly addressed are 2- and 3-way admixtures. The mixture proportions are provided by suitable linear fits, in 2 or 3 dimensions, that can be easily visualized. The difficulties encountered with introgression and gene flow are also addressed. The developments and findings are illustrated with numerical simulations and real-world cases.</abstract><cop>United States</cop><pmid>39138842</pmid><doi>10.1093/genetics/iyae134</doi><orcidid>https://orcid.org/0000-0002-0957-4014</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1943-2631 |
ispartof | Genetics (Austin), 2024-10, Vol.228 (2) |
issn | 1943-2631 1943-2631 |
language | eng |
recordid | cdi_proquest_miscellaneous_3092871841 |
source | MEDLINE; Oxford University Press Journals All Titles (1996-Current) |
subjects | Gene Flow Genetics, Population - methods Humans Models, Genetic Phylogeny |
title | The geometry of admixture in population genetics: the blessing of dimensionality |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-23T21%3A10%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=The%20geometry%20of%20admixture%20in%20population%20genetics:%20the%20blessing%20of%20dimensionality&rft.jtitle=Genetics%20(Austin)&rft.au=Oteo,%20Jos%C3%A9-Angel&rft.date=2024-10-07&rft.volume=228&rft.issue=2&rft.issn=1943-2631&rft.eissn=1943-2631&rft_id=info:doi/10.1093/genetics/iyae134&rft_dat=%3Cproquest_cross%3E3092871841%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3092871841&rft_id=info:pmid/39138842&rfr_iscdi=true |