Detecting Heterogeneity of Substitution Along DNA and Protein Sequences

Regions of differing constraint, mutation rate or combination along a sequence of DNA or amino acids lead to a nonuniform distribution of polymorphism within species or fixed differences between species. The power of five tests to reject the null hypothesis of a uniform distribution is studied for f...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Genetics (Austin) 1996-05, Vol.143 (1), p.589-602
Hauptverfasser:	Goss, PJE, Lewontin, R. C
Format:	Artikel
Sprache:	eng
Schlagworte:	Amino acids Biological Evolution Deoxyribonucleic acid DNA DNA - chemistry DNA - genetics Genetics Heterozygote Investigations Models, Genetic Monte Carlo Method Mutation Phylogeny Polymorphism, Genetic Probability Proteins Proteins - chemistry Proteins - genetics Recombination, Genetic Software
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	602
container_issue	1
container_start_page	589
container_title	Genetics (Austin)
container_volume	143
creator	Goss, PJE Lewontin, R. C
description	Regions of differing constraint, mutation rate or combination along a sequence of DNA or amino acids lead to a nonuniform distribution of polymorphism within species or fixed differences between species. The power of five tests to reject the null hypothesis of a uniform distribution is studied for four classes of alternate hypothesis. The tests explored are the variance of interval lengths; a modified variance test, which includes covariance between neighboring intervals; the length of the longest interval; the length of the shortest third-order interval; and a composite test. Although there is no uniformly most powerful test over the range of alternate hypotheses tested, the variance and modified variance tests usually have the highest power. Therefore, we recommend that one of these two tests be used to test departure from uniformity in all circumstances. Tables of critical values for the variance and modified variance tests are given. The critical values depend both on the number of events and the number of positions in the sequence. A computer program is available on request that calculates both the critical values for a specified number of events and number of positions as well as the significance level of a given data set.
doi_str_mv	10.1093/genetics/143.1.589
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_1207291</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>15607110</sourcerecordid><originalsourceid>FETCH-LOGICAL-c486t-5387af4c0a757e72fa22b3c292d28169468906808e45bb7853fad00ddd659f8c3</originalsourceid><addsrcrecordid>eNqFkdFqFDEUhoModVt9AUEYvPButuckM0nmRlha2wpFhep1yGQyuymzSU0yLn17s3Rt1RuvEsh3Pv6cn5A3CEuEjp2urbfZmXSKDVvispXdM7LArmE15QyfkwUA8poLhi_JcUq3AMC7Vh6RIykolSAW5PLcZmuy8-vqqtxi2Dtdvq_CWN3Mfcouz9kFX62mUJjzz6tK-6H6GkO2zlc39sdsvbHpFXkx6inZ14fzhHy_-Pjt7Kq-_nL56Wx1XZtG8ly3TAo9Nga0aIUVdNSU9szQjg5UIu8aLjvgEqRt2r4XsmWjHgCGYeBtN0rDTsiHB-_d3G_tYKzPUU_qLrqtjvcqaKf-fvFuo9bhp0IKgnZYBO8PghhK9pTV1iVjp0l7G-akhEQJZUn_BbHlIBChgO_-AW_DHH3ZgqLYIEPa7G30ATIxpBTt-BgZQe3LVL_LVKVMhaqUWYbe_vnZx5FDe08RN2692bloVdrqaSo0qt1u9yT6BSsCqgY</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>214131248</pqid></control><display><type>article</type><title>Detecting Heterogeneity of Substitution Along DNA and Protein Sequences</title><source>Oxford University Press Journals All Titles (1996-Current)</source><source>MEDLINE</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Goss, PJE ; Lewontin, R. C</creator><creatorcontrib>Goss, PJE ; Lewontin, R. C</creatorcontrib><description>Regions of differing constraint, mutation rate or combination along a sequence of DNA or amino acids lead to a nonuniform distribution of polymorphism within species or fixed differences between species. The power of five tests to reject the null hypothesis of a uniform distribution is studied for four classes of alternate hypothesis. The tests explored are the variance of interval lengths; a modified variance test, which includes covariance between neighboring intervals; the length of the longest interval; the length of the shortest third-order interval; and a composite test. Although there is no uniformly most powerful test over the range of alternate hypotheses tested, the variance and modified variance tests usually have the highest power. Therefore, we recommend that one of these two tests be used to test departure from uniformity in all circumstances. Tables of critical values for the variance and modified variance tests are given. The critical values depend both on the number of events and the number of positions in the sequence. A computer program is available on request that calculates both the critical values for a specified number of events and number of positions as well as the significance level of a given data set.</description><identifier>ISSN: 0016-6731</identifier><identifier>ISSN: 1943-2631</identifier><identifier>EISSN: 1943-2631</identifier><identifier>DOI: 10.1093/genetics/143.1.589</identifier><identifier>PMID: 8722807</identifier><identifier>CODEN: GENTAE</identifier><language>eng</language><publisher>United States: Genetics Soc America</publisher><subject>Amino acids ; Biological Evolution ; Deoxyribonucleic acid ; DNA ; DNA - chemistry ; DNA - genetics ; Genetics ; Heterozygote ; Investigations ; Models, Genetic ; Monte Carlo Method ; Mutation ; Phylogeny ; Polymorphism, Genetic ; Probability ; Proteins ; Proteins - chemistry ; Proteins - genetics ; Recombination, Genetic ; Software</subject><ispartof>Genetics (Austin), 1996-05, Vol.143 (1), p.589-602</ispartof><rights>Copyright Genetics Society of America May 1996</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c486t-5387af4c0a757e72fa22b3c292d28169468906808e45bb7853fad00ddd659f8c3</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27903,27904</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/8722807$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Goss, PJE</creatorcontrib><creatorcontrib>Lewontin, R. C</creatorcontrib><title>Detecting Heterogeneity of Substitution Along DNA and Protein Sequences</title><title>Genetics (Austin)</title><addtitle>Genetics</addtitle><description>Regions of differing constraint, mutation rate or combination along a sequence of DNA or amino acids lead to a nonuniform distribution of polymorphism within species or fixed differences between species. The power of five tests to reject the null hypothesis of a uniform distribution is studied for four classes of alternate hypothesis. The tests explored are the variance of interval lengths; a modified variance test, which includes covariance between neighboring intervals; the length of the longest interval; the length of the shortest third-order interval; and a composite test. Although there is no uniformly most powerful test over the range of alternate hypotheses tested, the variance and modified variance tests usually have the highest power. Therefore, we recommend that one of these two tests be used to test departure from uniformity in all circumstances. Tables of critical values for the variance and modified variance tests are given. The critical values depend both on the number of events and the number of positions in the sequence. A computer program is available on request that calculates both the critical values for a specified number of events and number of positions as well as the significance level of a given data set.</description><subject>Amino acids</subject><subject>Biological Evolution</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>DNA - chemistry</subject><subject>DNA - genetics</subject><subject>Genetics</subject><subject>Heterozygote</subject><subject>Investigations</subject><subject>Models, Genetic</subject><subject>Monte Carlo Method</subject><subject>Mutation</subject><subject>Phylogeny</subject><subject>Polymorphism, Genetic</subject><subject>Probability</subject><subject>Proteins</subject><subject>Proteins - chemistry</subject><subject>Proteins - genetics</subject><subject>Recombination, Genetic</subject><subject>Software</subject><issn>0016-6731</issn><issn>1943-2631</issn><issn>1943-2631</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>1996</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkdFqFDEUhoModVt9AUEYvPButuckM0nmRlha2wpFhep1yGQyuymzSU0yLn17s3Rt1RuvEsh3Pv6cn5A3CEuEjp2urbfZmXSKDVvispXdM7LArmE15QyfkwUA8poLhi_JcUq3AMC7Vh6RIykolSAW5PLcZmuy8-vqqtxi2Dtdvq_CWN3Mfcouz9kFX62mUJjzz6tK-6H6GkO2zlc39sdsvbHpFXkx6inZ14fzhHy_-Pjt7Kq-_nL56Wx1XZtG8ly3TAo9Nga0aIUVdNSU9szQjg5UIu8aLjvgEqRt2r4XsmWjHgCGYeBtN0rDTsiHB-_d3G_tYKzPUU_qLrqtjvcqaKf-fvFuo9bhp0IKgnZYBO8PghhK9pTV1iVjp0l7G-akhEQJZUn_BbHlIBChgO_-AW_DHH3ZgqLYIEPa7G30ATIxpBTt-BgZQe3LVL_LVKVMhaqUWYbe_vnZx5FDe08RN2692bloVdrqaSo0qt1u9yT6BSsCqgY</recordid><startdate>19960501</startdate><enddate>19960501</enddate><creator>Goss, PJE</creator><creator>Lewontin, R. C</creator><general>Genetics Soc America</general><general>Genetics Society of America</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>4T-</scope><scope>4U-</scope><scope>7QP</scope><scope>7SS</scope><scope>7TK</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>K9.</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>19960501</creationdate><title>Detecting Heterogeneity of Substitution Along DNA and Protein Sequences</title><author>Goss, PJE ; Lewontin, R. C</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c486t-5387af4c0a757e72fa22b3c292d28169468906808e45bb7853fad00ddd659f8c3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>1996</creationdate><topic>Amino acids</topic><topic>Biological Evolution</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>DNA - chemistry</topic><topic>DNA - genetics</topic><topic>Genetics</topic><topic>Heterozygote</topic><topic>Investigations</topic><topic>Models, Genetic</topic><topic>Monte Carlo Method</topic><topic>Mutation</topic><topic>Phylogeny</topic><topic>Polymorphism, Genetic</topic><topic>Probability</topic><topic>Proteins</topic><topic>Proteins - chemistry</topic><topic>Proteins - genetics</topic><topic>Recombination, Genetic</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Goss, PJE</creatorcontrib><creatorcontrib>Lewontin, R. C</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Docstoc</collection><collection>University Readers</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Genetics (Austin)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Goss, PJE</au><au>Lewontin, R. C</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Detecting Heterogeneity of Substitution Along DNA and Protein Sequences</atitle><jtitle>Genetics (Austin)</jtitle><addtitle>Genetics</addtitle><date>1996-05-01</date><risdate>1996</risdate><volume>143</volume><issue>1</issue><spage>589</spage><epage>602</epage><pages>589-602</pages><issn>0016-6731</issn><issn>1943-2631</issn><eissn>1943-2631</eissn><coden>GENTAE</coden><abstract>Regions of differing constraint, mutation rate or combination along a sequence of DNA or amino acids lead to a nonuniform distribution of polymorphism within species or fixed differences between species. The power of five tests to reject the null hypothesis of a uniform distribution is studied for four classes of alternate hypothesis. The tests explored are the variance of interval lengths; a modified variance test, which includes covariance between neighboring intervals; the length of the longest interval; the length of the shortest third-order interval; and a composite test. Although there is no uniformly most powerful test over the range of alternate hypotheses tested, the variance and modified variance tests usually have the highest power. Therefore, we recommend that one of these two tests be used to test departure from uniformity in all circumstances. Tables of critical values for the variance and modified variance tests are given. The critical values depend both on the number of events and the number of positions in the sequence. A computer program is available on request that calculates both the critical values for a specified number of events and number of positions as well as the significance level of a given data set.</abstract><cop>United States</cop><pub>Genetics Soc America</pub><pmid>8722807</pmid><doi>10.1093/genetics/143.1.589</doi><tpages>14</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 0016-6731
ispartof	Genetics (Austin), 1996-05, Vol.143 (1), p.589-602
issn	0016-6731 1943-2631 1943-2631
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_1207291
source	Oxford University Press Journals All Titles (1996-Current); MEDLINE; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
subjects	Amino acids Biological Evolution Deoxyribonucleic acid DNA DNA - chemistry DNA - genetics Genetics Heterozygote Investigations Models, Genetic Monte Carlo Method Mutation Phylogeny Polymorphism, Genetic Probability Proteins Proteins - chemistry Proteins - genetics Recombination, Genetic Software
title	Detecting Heterogeneity of Substitution Along DNA and Protein Sequences
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-27T10%3A50%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Detecting%20Heterogeneity%20of%20Substitution%20Along%20DNA%20and%20Protein%20Sequences&rft.jtitle=Genetics%20(Austin)&rft.au=Goss,%20PJE&rft.date=1996-05-01&rft.volume=143&rft.issue=1&rft.spage=589&rft.epage=602&rft.pages=589-602&rft.issn=0016-6731&rft.eissn=1943-2631&rft.coden=GENTAE&rft_id=info:doi/10.1093/genetics/143.1.589&rft_dat=%3Cproquest_pubme%3E15607110%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=214131248&rft_id=info:pmid/8722807&rfr_iscdi=true