An empirical comparison of case-control and trio based study designs in high throughput association mapping
Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in differe...
Gespeichert in:
Veröffentlicht in: | Journal of medical genetics 2006-07, Vol.43 (7), p.617-624 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 624 |
---|---|
container_issue | 7 |
container_start_page | 617 |
container_title | Journal of medical genetics |
container_volume | 43 |
creator | Hintsanen, P Sevon, P Onkamo, P Eronen, L Toivonen, H |
description | Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled. |
doi_str_mv | 10.1136/jmg.2005.036020 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2564560</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>68600853</sourcerecordid><originalsourceid>FETCH-LOGICAL-b553t-2894987545b96bf7b10680623a2ee856671ae6cb7a7f3f28932c26268329b9c63</originalsourceid><addsrcrecordid>eNqF0s9v1iAcBvDGaNy76dmbITHuYNJ3_Chf2suS5dU5zaIHp1dCKe3LuxYqtMb99_Kmbzb1shMJfHgCPGTZK4LXhDA42w3dmmLM15gBpvhJtiIFlDnQoniarTCmNKe8YkfZcYw7jAkTBJ5nRwQoLzEWq-z2wiEzjDZYrXqk_TCqYKN3yLdIq2hy7d0UfI-Ua9AUrEd1mm1QnObmDjUm2s5FZB3a2m6Lpm3wc7cd5wmpGL22arIpa1DjaF33InvWqj6al4fxJPt--eFmc5Vff_34aXNxndecsymnZVVUpeAFryuoW1ETDCUGyhQ1puQAgigDuhZKtKxNmlFNgULJaFVXGthJdr7kjnM9mEabdAPVyzHYQYU76ZWV_644u5Wd_yUph4IDTgGnh4Dgf84mTnKwUZu-V874OUooAeOSs0chEVRQUuwT3_wHd34OLr1CMiUhUGAgSZ0tSgcfYzDt_ZkJlvu-Zepb7vuWS99px-u_r_rgDwUn8PYAVEwNt0E5beODExWu0mdJLl-cjZP5fb-uwq0EwQSXX35s5Ocr_v7bDQZ5mfy7xdfD7tFT_gEQ6s-h</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1781164061</pqid></control><display><type>article</type><title>An empirical comparison of case-control and trio based study designs in high throughput association mapping</title><source>MEDLINE</source><source>BMJ Journals - NESLi2</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><creator>Hintsanen, P ; Sevon, P ; Onkamo, P ; Eronen, L ; Toivonen, H</creator><creatorcontrib>Hintsanen, P ; Sevon, P ; Onkamo, P ; Eronen, L ; Toivonen, H</creatorcontrib><description>Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.</description><identifier>ISSN: 0022-2593</identifier><identifier>ISSN: 1468-6244</identifier><identifier>EISSN: 1468-6244</identifier><identifier>DOI: 10.1136/jmg.2005.036020</identifier><identifier>PMID: 16258007</identifier><identifier>CODEN: JMDGAE</identifier><language>eng</language><publisher>London: BMJ Publishing Group Ltd</publisher><subject>Algorithms ; Asthma ; Biological and medical sciences ; Cardiology. Vascular system ; case-control data ; Case-Control Studies ; Chromosome Mapping ; Databases, Factual ; Datasets ; EATDT ; exhaustive allelic transmission disequilibrium tests ; General aspects. Genetic counseling ; Genes ; Genetic Diseases, Inborn - genetics ; Genomics ; Genotype ; haplotype association ; Haplotypes ; high throughput methods ; Humans ; Letter to JMG ; linkage disequilibrium mapping ; Medical genetics ; Medical sciences ; Methods ; population-based haplotyping ; Research Design ; Sample size ; Simulation ; single nucleotide polymorphism ; SNP ; Statistical analysis</subject><ispartof>Journal of medical genetics, 2006-07, Vol.43 (7), p.617-624</ispartof><rights>Copyright 2006 Journal of Medical Genetics</rights><rights>2006 INIST-CNRS</rights><rights>Copyright: 2006 Copyright 2006 Journal of Medical Genetics</rights><rights>Copyright ©2006 BMJ Publishing Group Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b553t-2894987545b96bf7b10680623a2ee856671ae6cb7a7f3f28932c26268329b9c63</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://jmg.bmj.com/content/43/7/617.full.pdf$$EPDF$$P50$$Gbmj$$H</linktopdf><linktohtml>$$Uhttps://jmg.bmj.com/content/43/7/617.full$$EHTML$$P50$$Gbmj$$H</linktohtml><link.rule.ids>114,115,230,314,723,776,780,881,3183,23550,27901,27902,53766,53768,77342,77373</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17909244$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16258007$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hintsanen, P</creatorcontrib><creatorcontrib>Sevon, P</creatorcontrib><creatorcontrib>Onkamo, P</creatorcontrib><creatorcontrib>Eronen, L</creatorcontrib><creatorcontrib>Toivonen, H</creatorcontrib><title>An empirical comparison of case-control and trio based study designs in high throughput association mapping</title><title>Journal of medical genetics</title><addtitle>J Med Genet</addtitle><description>Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.</description><subject>Algorithms</subject><subject>Asthma</subject><subject>Biological and medical sciences</subject><subject>Cardiology. Vascular system</subject><subject>case-control data</subject><subject>Case-Control Studies</subject><subject>Chromosome Mapping</subject><subject>Databases, Factual</subject><subject>Datasets</subject><subject>EATDT</subject><subject>exhaustive allelic transmission disequilibrium tests</subject><subject>General aspects. Genetic counseling</subject><subject>Genes</subject><subject>Genetic Diseases, Inborn - genetics</subject><subject>Genomics</subject><subject>Genotype</subject><subject>haplotype association</subject><subject>Haplotypes</subject><subject>high throughput methods</subject><subject>Humans</subject><subject>Letter to JMG</subject><subject>linkage disequilibrium mapping</subject><subject>Medical genetics</subject><subject>Medical sciences</subject><subject>Methods</subject><subject>population-based haplotyping</subject><subject>Research Design</subject><subject>Sample size</subject><subject>Simulation</subject><subject>single nucleotide polymorphism</subject><subject>SNP</subject><subject>Statistical analysis</subject><issn>0022-2593</issn><issn>1468-6244</issn><issn>1468-6244</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNqF0s9v1iAcBvDGaNy76dmbITHuYNJ3_Chf2suS5dU5zaIHp1dCKe3LuxYqtMb99_Kmbzb1shMJfHgCPGTZK4LXhDA42w3dmmLM15gBpvhJtiIFlDnQoniarTCmNKe8YkfZcYw7jAkTBJ5nRwQoLzEWq-z2wiEzjDZYrXqk_TCqYKN3yLdIq2hy7d0UfI-Ua9AUrEd1mm1QnObmDjUm2s5FZB3a2m6Lpm3wc7cd5wmpGL22arIpa1DjaF33InvWqj6al4fxJPt--eFmc5Vff_34aXNxndecsymnZVVUpeAFryuoW1ETDCUGyhQ1puQAgigDuhZKtKxNmlFNgULJaFVXGthJdr7kjnM9mEabdAPVyzHYQYU76ZWV_644u5Wd_yUph4IDTgGnh4Dgf84mTnKwUZu-V874OUooAeOSs0chEVRQUuwT3_wHd34OLr1CMiUhUGAgSZ0tSgcfYzDt_ZkJlvu-Zepb7vuWS99px-u_r_rgDwUn8PYAVEwNt0E5beODExWu0mdJLl-cjZP5fb-uwq0EwQSXX35s5Ocr_v7bDQZ5mfy7xdfD7tFT_gEQ6s-h</recordid><startdate>20060701</startdate><enddate>20060701</enddate><creator>Hintsanen, P</creator><creator>Sevon, P</creator><creator>Onkamo, P</creator><creator>Eronen, L</creator><creator>Toivonen, H</creator><general>BMJ Publishing Group Ltd</general><general>BMJ</general><general>BMJ Publishing Group LTD</general><general>BMJ Group</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8AF</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>BTHHO</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20060701</creationdate><title>An empirical comparison of case-control and trio based study designs in high throughput association mapping</title><author>Hintsanen, P ; Sevon, P ; Onkamo, P ; Eronen, L ; Toivonen, H</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b553t-2894987545b96bf7b10680623a2ee856671ae6cb7a7f3f28932c26268329b9c63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Algorithms</topic><topic>Asthma</topic><topic>Biological and medical sciences</topic><topic>Cardiology. Vascular system</topic><topic>case-control data</topic><topic>Case-Control Studies</topic><topic>Chromosome Mapping</topic><topic>Databases, Factual</topic><topic>Datasets</topic><topic>EATDT</topic><topic>exhaustive allelic transmission disequilibrium tests</topic><topic>General aspects. Genetic counseling</topic><topic>Genes</topic><topic>Genetic Diseases, Inborn - genetics</topic><topic>Genomics</topic><topic>Genotype</topic><topic>haplotype association</topic><topic>Haplotypes</topic><topic>high throughput methods</topic><topic>Humans</topic><topic>Letter to JMG</topic><topic>linkage disequilibrium mapping</topic><topic>Medical genetics</topic><topic>Medical sciences</topic><topic>Methods</topic><topic>population-based haplotyping</topic><topic>Research Design</topic><topic>Sample size</topic><topic>Simulation</topic><topic>single nucleotide polymorphism</topic><topic>SNP</topic><topic>Statistical analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hintsanen, P</creatorcontrib><creatorcontrib>Sevon, P</creatorcontrib><creatorcontrib>Onkamo, P</creatorcontrib><creatorcontrib>Eronen, L</creatorcontrib><creatorcontrib>Toivonen, H</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>STEM Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>BMJ Journals</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of medical genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hintsanen, P</au><au>Sevon, P</au><au>Onkamo, P</au><au>Eronen, L</au><au>Toivonen, H</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An empirical comparison of case-control and trio based study designs in high throughput association mapping</atitle><jtitle>Journal of medical genetics</jtitle><addtitle>J Med Genet</addtitle><date>2006-07-01</date><risdate>2006</risdate><volume>43</volume><issue>7</issue><spage>617</spage><epage>624</epage><pages>617-624</pages><issn>0022-2593</issn><issn>1468-6244</issn><eissn>1468-6244</eissn><coden>JMDGAE</coden><abstract>Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.</abstract><cop>London</cop><pub>BMJ Publishing Group Ltd</pub><pmid>16258007</pmid><doi>10.1136/jmg.2005.036020</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0022-2593 |
ispartof | Journal of medical genetics, 2006-07, Vol.43 (7), p.617-624 |
issn | 0022-2593 1468-6244 1468-6244 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2564560 |
source | MEDLINE; BMJ Journals - NESLi2; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central |
subjects | Algorithms Asthma Biological and medical sciences Cardiology. Vascular system case-control data Case-Control Studies Chromosome Mapping Databases, Factual Datasets EATDT exhaustive allelic transmission disequilibrium tests General aspects. Genetic counseling Genes Genetic Diseases, Inborn - genetics Genomics Genotype haplotype association Haplotypes high throughput methods Humans Letter to JMG linkage disequilibrium mapping Medical genetics Medical sciences Methods population-based haplotyping Research Design Sample size Simulation single nucleotide polymorphism SNP Statistical analysis |
title | An empirical comparison of case-control and trio based study designs in high throughput association mapping |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T20%3A08%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20empirical%20comparison%20of%20case-control%20and%20trio%20based%20study%20designs%20in%20high%20throughput%20association%20mapping&rft.jtitle=Journal%20of%20medical%20genetics&rft.au=Hintsanen,%20P&rft.date=2006-07-01&rft.volume=43&rft.issue=7&rft.spage=617&rft.epage=624&rft.pages=617-624&rft.issn=0022-2593&rft.eissn=1468-6244&rft.coden=JMDGAE&rft_id=info:doi/10.1136/jmg.2005.036020&rft_dat=%3Cproquest_pubme%3E68600853%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1781164061&rft_id=info:pmid/16258007&rfr_iscdi=true |