Parallel multiplicity and error discovery rate

In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, m...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BMC bioinformatics 2010-09, Vol.11, p.465
Hauptverfasser: Xu, Wayne Wenzhong, Carter, Clay J
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page 465
container_title BMC bioinformatics
container_volume 11
creator Xu, Wayne Wenzhong
Carter, Clay J
description In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, more than 20 different multiple test methods have been reported that compute overall Type I error rates in microarray experiments. However, these methods share the following dilemma: they have low power in cases where only a small number of DEGs exist among a large number of total genes on the array. This study contrasts parallel multiplicity of objectively related tests against the traditional simultaneousness of subjectively related tests and proposes a new assessment called the Error Discovery Rate (EDR) for evaluating multiple test comparisons in microarray experiments. Parallel multiple tests use only the negative genes that parallel the positive genes to control the error rate; while simultaneous multiple tests use the total unchanged gene number for error estimates. Here, we demonstrate that the EDR method exhibits improved performance over other methods in specificity and sensitivity in testing expression data sets with sequence digital expression confirmation, in examining simulation data, as well as for three experimental data sets that vary in the proportion of DEGs. The EDR method overcomes a common problem of previous multiple test procedures, namely that the Type I error rate detection power is low when the total gene number used is large but the DEG number is small. Microarrays are extensively used to address many research questions. However, there is potential to improve the sensitivity and specificity of microarray data analysis by developing improved multiple test comparisons. This study proposes a new view of multiplicity in microarray experiments and the EDR provides an alternative multiple test method for Type I error control in microarray experiments.
doi_str_mv 10.1186/1471-2105-11-465
format Article
fullrecord <record><control><sourceid>gale</sourceid><recordid>TN_cdi_gale_incontextgauss_ISR_A238878147</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A238878147</galeid><sourcerecordid>A238878147</sourcerecordid><originalsourceid>FETCH-gale_incontextgauss_ISR_A2388781473</originalsourceid><addsrcrecordid>eNqVjTsLwjAURoMoWB-7Y1aHaG6fWUUU3UTdy6W9lkhsJUnF_nsdRFydvsPhwMfYDOQCQKVLiDMQIchEAIg4TXos-Kr-Dw_ZyLmrlJApmQRscUCLxpDht9Z4fTe60L7jWJecrG0sL7UrmgfZjlv0NGGDCxpH08-O2Xy7Oa93okJDua6Lpvb09BW2zuX70zFfhZFSmXrfR_-0L2XLPPI</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Parallel multiplicity and error discovery rate</title><source>DOAJ Directory of Open Access Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><source>SpringerLink Journals - AutoHoldings</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><creator>Xu, Wayne Wenzhong ; Carter, Clay J</creator><creatorcontrib>Xu, Wayne Wenzhong ; Carter, Clay J</creatorcontrib><description>In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, more than 20 different multiple test methods have been reported that compute overall Type I error rates in microarray experiments. However, these methods share the following dilemma: they have low power in cases where only a small number of DEGs exist among a large number of total genes on the array. This study contrasts parallel multiplicity of objectively related tests against the traditional simultaneousness of subjectively related tests and proposes a new assessment called the Error Discovery Rate (EDR) for evaluating multiple test comparisons in microarray experiments. Parallel multiple tests use only the negative genes that parallel the positive genes to control the error rate; while simultaneous multiple tests use the total unchanged gene number for error estimates. Here, we demonstrate that the EDR method exhibits improved performance over other methods in specificity and sensitivity in testing expression data sets with sequence digital expression confirmation, in examining simulation data, as well as for three experimental data sets that vary in the proportion of DEGs. The EDR method overcomes a common problem of previous multiple test procedures, namely that the Type I error rate detection power is low when the total gene number used is large but the DEG number is small. Microarrays are extensively used to address many research questions. However, there is potential to improve the sensitivity and specificity of microarray data analysis by developing improved multiple test comparisons. This study proposes a new view of multiplicity in microarray experiments and the EDR provides an alternative multiple test method for Type I error control in microarray experiments.</description><identifier>ISSN: 1471-2105</identifier><identifier>EISSN: 1471-2105</identifier><identifier>DOI: 10.1186/1471-2105-11-465</identifier><language>eng</language><publisher>BioMed Central Ltd</publisher><subject>Computational biology ; DNA microarrays ; Gene expression</subject><ispartof>BMC bioinformatics, 2010-09, Vol.11, p.465</ispartof><rights>COPYRIGHT 2010 BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,860,27901,27902</link.rule.ids></links><search><creatorcontrib>Xu, Wayne Wenzhong</creatorcontrib><creatorcontrib>Carter, Clay J</creatorcontrib><title>Parallel multiplicity and error discovery rate</title><title>BMC bioinformatics</title><description>In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, more than 20 different multiple test methods have been reported that compute overall Type I error rates in microarray experiments. However, these methods share the following dilemma: they have low power in cases where only a small number of DEGs exist among a large number of total genes on the array. This study contrasts parallel multiplicity of objectively related tests against the traditional simultaneousness of subjectively related tests and proposes a new assessment called the Error Discovery Rate (EDR) for evaluating multiple test comparisons in microarray experiments. Parallel multiple tests use only the negative genes that parallel the positive genes to control the error rate; while simultaneous multiple tests use the total unchanged gene number for error estimates. Here, we demonstrate that the EDR method exhibits improved performance over other methods in specificity and sensitivity in testing expression data sets with sequence digital expression confirmation, in examining simulation data, as well as for three experimental data sets that vary in the proportion of DEGs. The EDR method overcomes a common problem of previous multiple test procedures, namely that the Type I error rate detection power is low when the total gene number used is large but the DEG number is small. Microarrays are extensively used to address many research questions. However, there is potential to improve the sensitivity and specificity of microarray data analysis by developing improved multiple test comparisons. This study proposes a new view of multiplicity in microarray experiments and the EDR provides an alternative multiple test method for Type I error control in microarray experiments.</description><subject>Computational biology</subject><subject>DNA microarrays</subject><subject>Gene expression</subject><issn>1471-2105</issn><issn>1471-2105</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2010</creationdate><recordtype>article</recordtype><recordid>eNqVjTsLwjAURoMoWB-7Y1aHaG6fWUUU3UTdy6W9lkhsJUnF_nsdRFydvsPhwMfYDOQCQKVLiDMQIchEAIg4TXos-Kr-Dw_ZyLmrlJApmQRscUCLxpDht9Z4fTe60L7jWJecrG0sL7UrmgfZjlv0NGGDCxpH08-O2Xy7Oa93okJDua6Lpvb09BW2zuX70zFfhZFSmXrfR_-0L2XLPPI</recordid><startdate>20100916</startdate><enddate>20100916</enddate><creator>Xu, Wayne Wenzhong</creator><creator>Carter, Clay J</creator><general>BioMed Central Ltd</general><scope>ISR</scope></search><sort><creationdate>20100916</creationdate><title>Parallel multiplicity and error discovery rate</title><author>Xu, Wayne Wenzhong ; Carter, Clay J</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-gale_incontextgauss_ISR_A2388781473</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Computational biology</topic><topic>DNA microarrays</topic><topic>Gene expression</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Wayne Wenzhong</creatorcontrib><creatorcontrib>Carter, Clay J</creatorcontrib><collection>Gale In Context: Science</collection><jtitle>BMC bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Wayne Wenzhong</au><au>Carter, Clay J</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Parallel multiplicity and error discovery rate</atitle><jtitle>BMC bioinformatics</jtitle><date>2010-09-16</date><risdate>2010</risdate><volume>11</volume><spage>465</spage><pages>465-</pages><issn>1471-2105</issn><eissn>1471-2105</eissn><abstract>In microarray gene expression profiling experiments, differentially expressed genes (DEGs) are detected from among tens of thousands of genes on an array using statistical tests. It is important to control the number of false positives or errors that are present in the resultant DEG list. To date, more than 20 different multiple test methods have been reported that compute overall Type I error rates in microarray experiments. However, these methods share the following dilemma: they have low power in cases where only a small number of DEGs exist among a large number of total genes on the array. This study contrasts parallel multiplicity of objectively related tests against the traditional simultaneousness of subjectively related tests and proposes a new assessment called the Error Discovery Rate (EDR) for evaluating multiple test comparisons in microarray experiments. Parallel multiple tests use only the negative genes that parallel the positive genes to control the error rate; while simultaneous multiple tests use the total unchanged gene number for error estimates. Here, we demonstrate that the EDR method exhibits improved performance over other methods in specificity and sensitivity in testing expression data sets with sequence digital expression confirmation, in examining simulation data, as well as for three experimental data sets that vary in the proportion of DEGs. The EDR method overcomes a common problem of previous multiple test procedures, namely that the Type I error rate detection power is low when the total gene number used is large but the DEG number is small. Microarrays are extensively used to address many research questions. However, there is potential to improve the sensitivity and specificity of microarray data analysis by developing improved multiple test comparisons. This study proposes a new view of multiplicity in microarray experiments and the EDR provides an alternative multiple test method for Type I error control in microarray experiments.</abstract><pub>BioMed Central Ltd</pub><doi>10.1186/1471-2105-11-465</doi><tpages>465</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1471-2105
ispartof BMC bioinformatics, 2010-09, Vol.11, p.465
issn 1471-2105
1471-2105
language eng
recordid cdi_gale_incontextgauss_ISR_A238878147
source DOAJ Directory of Open Access Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central; SpringerLink Journals - AutoHoldings; PubMed Central Open Access; Springer Nature OA Free Journals
subjects Computational biology
DNA microarrays
Gene expression
title Parallel multiplicity and error discovery rate
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-04T13%3A18%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Parallel%20multiplicity%20and%20error%20discovery%20rate&rft.jtitle=BMC%20bioinformatics&rft.au=Xu,%20Wayne%20Wenzhong&rft.date=2010-09-16&rft.volume=11&rft.spage=465&rft.pages=465-&rft.issn=1471-2105&rft.eissn=1471-2105&rft_id=info:doi/10.1186/1471-2105-11-465&rft_dat=%3Cgale%3EA238878147%3C/gale%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_galeid=A238878147&rfr_iscdi=true