Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks

We outline two approaches to inference of neighbourhood size, N, and dispersal rate, σ2, based on either allele frequencies or on the lengths of sequence blocks that are shared between genomes. Over intermediate timescales (10–100 generations, say), populations that live in two dimensions approach a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Theoretical population biology 2013-08, Vol.87, p.105-119
Hauptverfasser: Barton, N.H., Etheridge, A.M., Kelleher, J., Véber, A.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 119
container_issue
container_start_page 105
container_title Theoretical population biology
container_volume 87
creator Barton, N.H.
Etheridge, A.M.
Kelleher, J.
Véber, A.
description We outline two approaches to inference of neighbourhood size, N, and dispersal rate, σ2, based on either allele frequencies or on the lengths of sequence blocks that are shared between genomes. Over intermediate timescales (10–100 generations, say), populations that live in two dimensions approach a quasi-equilibrium that is independent of both their local structure and their deeper history. Over such scales, the standardised covariance of allele frequencies (i.e. pairwise FST) falls with the logarithm of distance, and depends only on neighbourhood size, N, and a ‘local scale’, κ; the rate of gene flow, σ2, cannot be inferred. We show how spatial correlations can be accounted for, assuming a Gaussian distribution of allele frequencies, giving maximum likelihood estimates of N and κ. Alternatively, inferences can be based on the distribution of the lengths of sequence that are identical between blocks of genomes: long blocks (>0.1 cM, say) tell us about intermediate timescales, over which we assume a quasi-equilibrium. For large neighbourhood size, the distribution of long blocks is given directly by the classical Wright–Malécot formula; this relationship can be used to infer both N and σ2. With small neighbourhood size, there is an appreciable chance that recombinant lineages will coalesce back before escaping into the distant past. For this case, we show that if genomes are sampled from some distance apart, then the distribution of lengths of blocks that are identical in state is geometric, with a mean that depends on N and σ2.
doi_str_mv 10.1016/j.tpb.2013.03.001
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1413166540</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0040580913000233</els_id><sourcerecordid>1413166540</sourcerecordid><originalsourceid>FETCH-LOGICAL-c353t-38aee12412d6cf7323ab8442b4260e6b0f915d7e7da7eedd02cb6b2737a7ec383</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWqs_wIvk6GXrJNnNbvVUih-FghcFb2E3mbWp292a2Vb896ZUPQoDw8AzLzMPYxcCRgKEvl6O-nU1kiDUCGKBOGADAWOdgJLZIRsApJBkBYxP2CnREgAKodQxO5EqA52rdMBeZ22NAVuL3Le8_-y48ytsyXct3fBJ02CDvA74sYmMR-JbDLQh3mD71i-IdzWnRRnQcdozyKums-90xo7qsiE8_-lD9nJ_9zx9TOZPD7PpZJ5Ylak-UUWJKGQqpNO2zpVUZVWkqaxSqQF1BfVYZC7H3JU5onMgbaUrmas8zlYVasiu9rnr0MUDqDcrTxabpmyx25ARqVBC6yyFiIo9akNHFLA26-BXZfgyAsxOqFmaKNTshBqIFfuQXf7Eb6oVur-NX4MRuN0DGJ_cegyGoqfowfmAtjeu8__EfwMBvobd</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1413166540</pqid></control><display><type>article</type><title>Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Barton, N.H. ; Etheridge, A.M. ; Kelleher, J. ; Véber, A.</creator><creatorcontrib>Barton, N.H. ; Etheridge, A.M. ; Kelleher, J. ; Véber, A.</creatorcontrib><description>We outline two approaches to inference of neighbourhood size, N, and dispersal rate, σ2, based on either allele frequencies or on the lengths of sequence blocks that are shared between genomes. Over intermediate timescales (10–100 generations, say), populations that live in two dimensions approach a quasi-equilibrium that is independent of both their local structure and their deeper history. Over such scales, the standardised covariance of allele frequencies (i.e. pairwise FST) falls with the logarithm of distance, and depends only on neighbourhood size, N, and a ‘local scale’, κ; the rate of gene flow, σ2, cannot be inferred. We show how spatial correlations can be accounted for, assuming a Gaussian distribution of allele frequencies, giving maximum likelihood estimates of N and κ. Alternatively, inferences can be based on the distribution of the lengths of sequence that are identical between blocks of genomes: long blocks (&gt;0.1 cM, say) tell us about intermediate timescales, over which we assume a quasi-equilibrium. For large neighbourhood size, the distribution of long blocks is given directly by the classical Wright–Malécot formula; this relationship can be used to infer both N and σ2. With small neighbourhood size, there is an appreciable chance that recombinant lineages will coalesce back before escaping into the distant past. For this case, we show that if genomes are sampled from some distance apart, then the distribution of lengths of blocks that are identical in state is geometric, with a mean that depends on N and σ2.</description><identifier>ISSN: 0040-5809</identifier><identifier>EISSN: 1096-0325</identifier><identifier>DOI: 10.1016/j.tpb.2013.03.001</identifier><identifier>PMID: 23506734</identifier><language>eng</language><publisher>United States: Elsevier Inc</publisher><subject>[formula omitted]-statistics ; Gene flow ; Gene Frequency ; Identity in state ; Inference ; Likelihood Functions ; Models, Theoretical ; Recombination ; Recombination, Genetic ; Spatial structure</subject><ispartof>Theoretical population biology, 2013-08, Vol.87, p.105-119</ispartof><rights>2013 Elsevier Inc.</rights><rights>Copyright © 2013 Elsevier Inc. All rights reserved.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c353t-38aee12412d6cf7323ab8442b4260e6b0f915d7e7da7eedd02cb6b2737a7ec383</citedby><cites>FETCH-LOGICAL-c353t-38aee12412d6cf7323ab8442b4260e6b0f915d7e7da7eedd02cb6b2737a7ec383</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.tpb.2013.03.001$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,777,781,3537,27905,27906,45976</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/23506734$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Barton, N.H.</creatorcontrib><creatorcontrib>Etheridge, A.M.</creatorcontrib><creatorcontrib>Kelleher, J.</creatorcontrib><creatorcontrib>Véber, A.</creatorcontrib><title>Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks</title><title>Theoretical population biology</title><addtitle>Theor Popul Biol</addtitle><description>We outline two approaches to inference of neighbourhood size, N, and dispersal rate, σ2, based on either allele frequencies or on the lengths of sequence blocks that are shared between genomes. Over intermediate timescales (10–100 generations, say), populations that live in two dimensions approach a quasi-equilibrium that is independent of both their local structure and their deeper history. Over such scales, the standardised covariance of allele frequencies (i.e. pairwise FST) falls with the logarithm of distance, and depends only on neighbourhood size, N, and a ‘local scale’, κ; the rate of gene flow, σ2, cannot be inferred. We show how spatial correlations can be accounted for, assuming a Gaussian distribution of allele frequencies, giving maximum likelihood estimates of N and κ. Alternatively, inferences can be based on the distribution of the lengths of sequence that are identical between blocks of genomes: long blocks (&gt;0.1 cM, say) tell us about intermediate timescales, over which we assume a quasi-equilibrium. For large neighbourhood size, the distribution of long blocks is given directly by the classical Wright–Malécot formula; this relationship can be used to infer both N and σ2. With small neighbourhood size, there is an appreciable chance that recombinant lineages will coalesce back before escaping into the distant past. For this case, we show that if genomes are sampled from some distance apart, then the distribution of lengths of blocks that are identical in state is geometric, with a mean that depends on N and σ2.</description><subject>[formula omitted]-statistics</subject><subject>Gene flow</subject><subject>Gene Frequency</subject><subject>Identity in state</subject><subject>Inference</subject><subject>Likelihood Functions</subject><subject>Models, Theoretical</subject><subject>Recombination</subject><subject>Recombination, Genetic</subject><subject>Spatial structure</subject><issn>0040-5809</issn><issn>1096-0325</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kE1LAzEQhoMoWqs_wIvk6GXrJNnNbvVUih-FghcFb2E3mbWp292a2Vb896ZUPQoDw8AzLzMPYxcCRgKEvl6O-nU1kiDUCGKBOGADAWOdgJLZIRsApJBkBYxP2CnREgAKodQxO5EqA52rdMBeZ22NAVuL3Le8_-y48ytsyXct3fBJ02CDvA74sYmMR-JbDLQh3mD71i-IdzWnRRnQcdozyKums-90xo7qsiE8_-lD9nJ_9zx9TOZPD7PpZJ5Ylak-UUWJKGQqpNO2zpVUZVWkqaxSqQF1BfVYZC7H3JU5onMgbaUrmas8zlYVasiu9rnr0MUDqDcrTxabpmyx25ARqVBC6yyFiIo9akNHFLA26-BXZfgyAsxOqFmaKNTshBqIFfuQXf7Eb6oVur-NX4MRuN0DGJ_cegyGoqfowfmAtjeu8__EfwMBvobd</recordid><startdate>201308</startdate><enddate>201308</enddate><creator>Barton, N.H.</creator><creator>Etheridge, A.M.</creator><creator>Kelleher, J.</creator><creator>Véber, A.</creator><general>Elsevier Inc</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope></search><sort><creationdate>201308</creationdate><title>Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks</title><author>Barton, N.H. ; Etheridge, A.M. ; Kelleher, J. ; Véber, A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c353t-38aee12412d6cf7323ab8442b4260e6b0f915d7e7da7eedd02cb6b2737a7ec383</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>[formula omitted]-statistics</topic><topic>Gene flow</topic><topic>Gene Frequency</topic><topic>Identity in state</topic><topic>Inference</topic><topic>Likelihood Functions</topic><topic>Models, Theoretical</topic><topic>Recombination</topic><topic>Recombination, Genetic</topic><topic>Spatial structure</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Barton, N.H.</creatorcontrib><creatorcontrib>Etheridge, A.M.</creatorcontrib><creatorcontrib>Kelleher, J.</creatorcontrib><creatorcontrib>Véber, A.</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Theoretical population biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Barton, N.H.</au><au>Etheridge, A.M.</au><au>Kelleher, J.</au><au>Véber, A.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks</atitle><jtitle>Theoretical population biology</jtitle><addtitle>Theor Popul Biol</addtitle><date>2013-08</date><risdate>2013</risdate><volume>87</volume><spage>105</spage><epage>119</epage><pages>105-119</pages><issn>0040-5809</issn><eissn>1096-0325</eissn><abstract>We outline two approaches to inference of neighbourhood size, N, and dispersal rate, σ2, based on either allele frequencies or on the lengths of sequence blocks that are shared between genomes. Over intermediate timescales (10–100 generations, say), populations that live in two dimensions approach a quasi-equilibrium that is independent of both their local structure and their deeper history. Over such scales, the standardised covariance of allele frequencies (i.e. pairwise FST) falls with the logarithm of distance, and depends only on neighbourhood size, N, and a ‘local scale’, κ; the rate of gene flow, σ2, cannot be inferred. We show how spatial correlations can be accounted for, assuming a Gaussian distribution of allele frequencies, giving maximum likelihood estimates of N and κ. Alternatively, inferences can be based on the distribution of the lengths of sequence that are identical between blocks of genomes: long blocks (&gt;0.1 cM, say) tell us about intermediate timescales, over which we assume a quasi-equilibrium. For large neighbourhood size, the distribution of long blocks is given directly by the classical Wright–Malécot formula; this relationship can be used to infer both N and σ2. With small neighbourhood size, there is an appreciable chance that recombinant lineages will coalesce back before escaping into the distant past. For this case, we show that if genomes are sampled from some distance apart, then the distribution of lengths of blocks that are identical in state is geometric, with a mean that depends on N and σ2.</abstract><cop>United States</cop><pub>Elsevier Inc</pub><pmid>23506734</pmid><doi>10.1016/j.tpb.2013.03.001</doi><tpages>15</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0040-5809
ispartof Theoretical population biology, 2013-08, Vol.87, p.105-119
issn 0040-5809
1096-0325
language eng
recordid cdi_proquest_miscellaneous_1413166540
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects [formula omitted]-statistics
Gene flow
Gene Frequency
Identity in state
Inference
Likelihood Functions
Models, Theoretical
Recombination
Recombination, Genetic
Spatial structure
title Inference in two dimensions: Allele frequencies versus lengths of shared sequence blocks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T14%3A03%3A39IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Inference%20in%20two%20dimensions:%20Allele%20frequencies%20versus%20lengths%20of%20shared%20sequence%20blocks&rft.jtitle=Theoretical%20population%20biology&rft.au=Barton,%20N.H.&rft.date=2013-08&rft.volume=87&rft.spage=105&rft.epage=119&rft.pages=105-119&rft.issn=0040-5809&rft.eissn=1096-0325&rft_id=info:doi/10.1016/j.tpb.2013.03.001&rft_dat=%3Cproquest_cross%3E1413166540%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1413166540&rft_id=info:pmid/23506734&rft_els_id=S0040580913000233&rfr_iscdi=true