An empirical comparison of case-control and trio based study designs in high throughput association mapping

Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in differe...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of medical genetics 2006-07, Vol.43 (7), p.617-624
Hauptverfasser: Hintsanen, P, Sevon, P, Onkamo, P, Eronen, L, Toivonen, H
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 624
container_issue 7
container_start_page 617
container_title Journal of medical genetics
container_volume 43
creator Hintsanen, P
Sevon, P
Onkamo, P
Eronen, L
Toivonen, H
description Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.
doi_str_mv 10.1136/jmg.2005.036020
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2564560</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>68600853</sourcerecordid><originalsourceid>FETCH-LOGICAL-b553t-2894987545b96bf7b10680623a2ee856671ae6cb7a7f3f28932c26268329b9c63</originalsourceid><addsrcrecordid>eNqF0s9v1iAcBvDGaNy76dmbITHuYNJ3_Chf2suS5dU5zaIHp1dCKe3LuxYqtMb99_Kmbzb1shMJfHgCPGTZK4LXhDA42w3dmmLM15gBpvhJtiIFlDnQoniarTCmNKe8YkfZcYw7jAkTBJ5nRwQoLzEWq-z2wiEzjDZYrXqk_TCqYKN3yLdIq2hy7d0UfI-Ua9AUrEd1mm1QnObmDjUm2s5FZB3a2m6Lpm3wc7cd5wmpGL22arIpa1DjaF33InvWqj6al4fxJPt--eFmc5Vff_34aXNxndecsymnZVVUpeAFryuoW1ETDCUGyhQ1puQAgigDuhZKtKxNmlFNgULJaFVXGthJdr7kjnM9mEabdAPVyzHYQYU76ZWV_644u5Wd_yUph4IDTgGnh4Dgf84mTnKwUZu-V874OUooAeOSs0chEVRQUuwT3_wHd34OLr1CMiUhUGAgSZ0tSgcfYzDt_ZkJlvu-Zepb7vuWS99px-u_r_rgDwUn8PYAVEwNt0E5beODExWu0mdJLl-cjZP5fb-uwq0EwQSXX35s5Ocr_v7bDQZ5mfy7xdfD7tFT_gEQ6s-h</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1781164061</pqid></control><display><type>article</type><title>An empirical comparison of case-control and trio based study designs in high throughput association mapping</title><source>MEDLINE</source><source>BMJ Journals - NESLi2</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><creator>Hintsanen, P ; Sevon, P ; Onkamo, P ; Eronen, L ; Toivonen, H</creator><creatorcontrib>Hintsanen, P ; Sevon, P ; Onkamo, P ; Eronen, L ; Toivonen, H</creatorcontrib><description>Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.</description><identifier>ISSN: 0022-2593</identifier><identifier>ISSN: 1468-6244</identifier><identifier>EISSN: 1468-6244</identifier><identifier>DOI: 10.1136/jmg.2005.036020</identifier><identifier>PMID: 16258007</identifier><identifier>CODEN: JMDGAE</identifier><language>eng</language><publisher>London: BMJ Publishing Group Ltd</publisher><subject>Algorithms ; Asthma ; Biological and medical sciences ; Cardiology. Vascular system ; case-control data ; Case-Control Studies ; Chromosome Mapping ; Databases, Factual ; Datasets ; EATDT ; exhaustive allelic transmission disequilibrium tests ; General aspects. Genetic counseling ; Genes ; Genetic Diseases, Inborn - genetics ; Genomics ; Genotype ; haplotype association ; Haplotypes ; high throughput methods ; Humans ; Letter to JMG ; linkage disequilibrium mapping ; Medical genetics ; Medical sciences ; Methods ; population-based haplotyping ; Research Design ; Sample size ; Simulation ; single nucleotide polymorphism ; SNP ; Statistical analysis</subject><ispartof>Journal of medical genetics, 2006-07, Vol.43 (7), p.617-624</ispartof><rights>Copyright 2006 Journal of Medical Genetics</rights><rights>2006 INIST-CNRS</rights><rights>Copyright: 2006 Copyright 2006 Journal of Medical Genetics</rights><rights>Copyright ©2006 BMJ Publishing Group Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b553t-2894987545b96bf7b10680623a2ee856671ae6cb7a7f3f28932c26268329b9c63</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://jmg.bmj.com/content/43/7/617.full.pdf$$EPDF$$P50$$Gbmj$$H</linktopdf><linktohtml>$$Uhttps://jmg.bmj.com/content/43/7/617.full$$EHTML$$P50$$Gbmj$$H</linktohtml><link.rule.ids>114,115,230,314,723,776,780,881,3183,23550,27901,27902,53766,53768,77342,77373</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=17909244$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/16258007$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hintsanen, P</creatorcontrib><creatorcontrib>Sevon, P</creatorcontrib><creatorcontrib>Onkamo, P</creatorcontrib><creatorcontrib>Eronen, L</creatorcontrib><creatorcontrib>Toivonen, H</creatorcontrib><title>An empirical comparison of case-control and trio based study designs in high throughput association mapping</title><title>Journal of medical genetics</title><addtitle>J Med Genet</addtitle><description>Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.</description><subject>Algorithms</subject><subject>Asthma</subject><subject>Biological and medical sciences</subject><subject>Cardiology. Vascular system</subject><subject>case-control data</subject><subject>Case-Control Studies</subject><subject>Chromosome Mapping</subject><subject>Databases, Factual</subject><subject>Datasets</subject><subject>EATDT</subject><subject>exhaustive allelic transmission disequilibrium tests</subject><subject>General aspects. Genetic counseling</subject><subject>Genes</subject><subject>Genetic Diseases, Inborn - genetics</subject><subject>Genomics</subject><subject>Genotype</subject><subject>haplotype association</subject><subject>Haplotypes</subject><subject>high throughput methods</subject><subject>Humans</subject><subject>Letter to JMG</subject><subject>linkage disequilibrium mapping</subject><subject>Medical genetics</subject><subject>Medical sciences</subject><subject>Methods</subject><subject>population-based haplotyping</subject><subject>Research Design</subject><subject>Sample size</subject><subject>Simulation</subject><subject>single nucleotide polymorphism</subject><subject>SNP</subject><subject>Statistical analysis</subject><issn>0022-2593</issn><issn>1468-6244</issn><issn>1468-6244</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2006</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNqF0s9v1iAcBvDGaNy76dmbITHuYNJ3_Chf2suS5dU5zaIHp1dCKe3LuxYqtMb99_Kmbzb1shMJfHgCPGTZK4LXhDA42w3dmmLM15gBpvhJtiIFlDnQoniarTCmNKe8YkfZcYw7jAkTBJ5nRwQoLzEWq-z2wiEzjDZYrXqk_TCqYKN3yLdIq2hy7d0UfI-Ua9AUrEd1mm1QnObmDjUm2s5FZB3a2m6Lpm3wc7cd5wmpGL22arIpa1DjaF33InvWqj6al4fxJPt--eFmc5Vff_34aXNxndecsymnZVVUpeAFryuoW1ETDCUGyhQ1puQAgigDuhZKtKxNmlFNgULJaFVXGthJdr7kjnM9mEabdAPVyzHYQYU76ZWV_644u5Wd_yUph4IDTgGnh4Dgf84mTnKwUZu-V874OUooAeOSs0chEVRQUuwT3_wHd34OLr1CMiUhUGAgSZ0tSgcfYzDt_ZkJlvu-Zepb7vuWS99px-u_r_rgDwUn8PYAVEwNt0E5beODExWu0mdJLl-cjZP5fb-uwq0EwQSXX35s5Ocr_v7bDQZ5mfy7xdfD7tFT_gEQ6s-h</recordid><startdate>20060701</startdate><enddate>20060701</enddate><creator>Hintsanen, P</creator><creator>Sevon, P</creator><creator>Onkamo, P</creator><creator>Eronen, L</creator><creator>Toivonen, H</creator><general>BMJ Publishing Group Ltd</general><general>BMJ</general><general>BMJ Publishing Group LTD</general><general>BMJ Group</general><scope>BSCLL</scope><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8AF</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>BTHHO</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20060701</creationdate><title>An empirical comparison of case-control and trio based study designs in high throughput association mapping</title><author>Hintsanen, P ; Sevon, P ; Onkamo, P ; Eronen, L ; Toivonen, H</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b553t-2894987545b96bf7b10680623a2ee856671ae6cb7a7f3f28932c26268329b9c63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Algorithms</topic><topic>Asthma</topic><topic>Biological and medical sciences</topic><topic>Cardiology. Vascular system</topic><topic>case-control data</topic><topic>Case-Control Studies</topic><topic>Chromosome Mapping</topic><topic>Databases, Factual</topic><topic>Datasets</topic><topic>EATDT</topic><topic>exhaustive allelic transmission disequilibrium tests</topic><topic>General aspects. Genetic counseling</topic><topic>Genes</topic><topic>Genetic Diseases, Inborn - genetics</topic><topic>Genomics</topic><topic>Genotype</topic><topic>haplotype association</topic><topic>Haplotypes</topic><topic>high throughput methods</topic><topic>Humans</topic><topic>Letter to JMG</topic><topic>linkage disequilibrium mapping</topic><topic>Medical genetics</topic><topic>Medical sciences</topic><topic>Methods</topic><topic>population-based haplotyping</topic><topic>Research Design</topic><topic>Sample size</topic><topic>Simulation</topic><topic>single nucleotide polymorphism</topic><topic>SNP</topic><topic>Statistical analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hintsanen, P</creatorcontrib><creatorcontrib>Sevon, P</creatorcontrib><creatorcontrib>Onkamo, P</creatorcontrib><creatorcontrib>Eronen, L</creatorcontrib><creatorcontrib>Toivonen, H</creatorcontrib><collection>Istex</collection><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>STEM Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>BMJ Journals</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Journal of medical genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hintsanen, P</au><au>Sevon, P</au><au>Onkamo, P</au><au>Eronen, L</au><au>Toivonen, H</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>An empirical comparison of case-control and trio based study designs in high throughput association mapping</atitle><jtitle>Journal of medical genetics</jtitle><addtitle>J Med Genet</addtitle><date>2006-07-01</date><risdate>2006</risdate><volume>43</volume><issue>7</issue><spage>617</spage><epage>624</epage><pages>617-624</pages><issn>0022-2593</issn><issn>1468-6244</issn><eissn>1468-6244</eissn><coden>JMDGAE</coden><abstract>Motivated by high throughput genotyping technology, our aim in this study was to experimentally compare the power and accuracy of case-control and family trio based approaches for haplotype based, large scale, association gene mapping. We compared trio based and case-control study designs in different disease models, and partitioned the performance differences into separate components: those from the sample ascertainment, the effective sample size, and the haplotyping approaches. For systematic and controlled tests, we simulated a rapidly expanding and relatively young isolated population. The experiments were also replicated with real asthma data. We used computationally efficient methods that scale up to large amounts of both markers and individuals. Mapping is based on a haplotype association test for haplotypes of 1–10 markers. For population based haplotype reconstruction, we use HaploRec, and compare it to both a simple trio based inference and true haplotypes. Firstly and surprisingly, statistically inferred population based haplotypes can be equally powerful as true haplotypes. Secondly, as expected, the effective sample size has a clear effect on both gene detection power and mapping accuracy. Thirdly, the sample ascertainment method does not have much effect on mapping accuracy. Finally, an interesting side result is that the simple haplotype association test clearly outperformed exhaustive allelic transmission disequilibrium tests. The results suggest that the case-control design is a powerful alternative to the more laborious family based ascertainment approach, especially for large datasets, and wherever population stratification can be controlled.</abstract><cop>London</cop><pub>BMJ Publishing Group Ltd</pub><pmid>16258007</pmid><doi>10.1136/jmg.2005.036020</doi><tpages>8</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0022-2593
ispartof Journal of medical genetics, 2006-07, Vol.43 (7), p.617-624
issn 0022-2593
1468-6244
1468-6244
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_2564560
source MEDLINE; BMJ Journals - NESLi2; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central
subjects Algorithms
Asthma
Biological and medical sciences
Cardiology. Vascular system
case-control data
Case-Control Studies
Chromosome Mapping
Databases, Factual
Datasets
EATDT
exhaustive allelic transmission disequilibrium tests
General aspects. Genetic counseling
Genes
Genetic Diseases, Inborn - genetics
Genomics
Genotype
haplotype association
Haplotypes
high throughput methods
Humans
Letter to JMG
linkage disequilibrium mapping
Medical genetics
Medical sciences
Methods
population-based haplotyping
Research Design
Sample size
Simulation
single nucleotide polymorphism
SNP
Statistical analysis
title An empirical comparison of case-control and trio based study designs in high throughput association mapping
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-10T20%3A08%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=An%20empirical%20comparison%20of%20case-control%20and%20trio%20based%20study%20designs%20in%20high%20throughput%20association%20mapping&rft.jtitle=Journal%20of%20medical%20genetics&rft.au=Hintsanen,%20P&rft.date=2006-07-01&rft.volume=43&rft.issue=7&rft.spage=617&rft.epage=624&rft.pages=617-624&rft.issn=0022-2593&rft.eissn=1468-6244&rft.coden=JMDGAE&rft_id=info:doi/10.1136/jmg.2005.036020&rft_dat=%3Cproquest_pubme%3E68600853%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1781164061&rft_id=info:pmid/16258007&rfr_iscdi=true