Testing hypotheses about the microbiome using the linear decomposition model (LDM)

Abstract Motivation Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Bioinformatics 2020-08, Vol.36 (14), p.4106-4115
Hauptverfasser: Hu, Yi-Juan, Satten, Glen A
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 4115
container_issue 14
container_start_page 4106
container_title Bioinformatics
container_volume 36
creator Hu, Yi-Juan
Satten, Glen A
description Abstract Motivation Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR). Results We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations. Availability and implementation The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows. Contact yijuan.hu@emory.edu Supplementary information Supplementary data are available at Bioinformatics online.
doi_str_mv 10.1093/bioinformatics/btaa260
format Article
fullrecord <record><control><sourceid>proquest_TOX</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8453243</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bioinformatics/btaa260</oup_id><sourcerecordid>2393660437</sourcerecordid><originalsourceid>FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</originalsourceid><addsrcrecordid>eNqNkU1LxDAQhoMofv-FpUc9VJNMmm0ugvgNK4Ks55C2091I09SmFfz3Ztl10ZunTCbPvBPel5AJoxeMKrgsrLdt7XtnBluGy2Iwhku6Qw6ZkDTlNFO7sQY5TUVO4YAchfBOacaEEPvkADiwDBQcktc5hsG2i2T51flhiQFDYgo_Dkm8JM6WvY-rHCZjWFGrZmNbNH1SYeld54MdrG8T5ytskrPZ7fP5CdmrTRPwdHMek7f7u_nNYzp7eXi6uZ6lZUbVkBpZyYIKZAA1ilxWtco5pxKMMigUQM5RFmYqFeZVHquCMmlknSnGJRQUjsnVWrcbC4dVie3Qm0Z3vXWm_9LeWP33pbVLvfCfOhcZcAFR4Gwj0PuPMfqgnQ0lNo1p0Y9B8-iQlFTANKJyjUY_Quix3q5hVK8C0X8D0ZtA4uDk9ye3Yz8JRICtAT92_xX9BuBzoBA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2393660437</pqid></control><display><type>article</type><title>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</title><source>Oxford Journals Open Access Collection</source><creator>Hu, Yi-Juan ; Satten, Glen A</creator><contributor>Valencia, Alfonso</contributor><creatorcontrib>Hu, Yi-Juan ; Satten, Glen A ; Valencia, Alfonso</creatorcontrib><description>Abstract Motivation Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR). Results We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations. Availability and implementation The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows. Contact yijuan.hu@emory.edu Supplementary information Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btaa260</identifier><identifier>PMID: 32315393</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Linear Models ; Microbiota ; Original Papers</subject><ispartof>Bioinformatics, 2020-08, Vol.36 (14), p.4106-4115</ispartof><rights>The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2020</rights><rights>The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</citedby><cites>FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8453243/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8453243/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,723,776,780,881,1598,27901,27902,53766,53768</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bioinformatics/btaa260$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32315393$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Valencia, Alfonso</contributor><creatorcontrib>Hu, Yi-Juan</creatorcontrib><creatorcontrib>Satten, Glen A</creatorcontrib><title>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Abstract Motivation Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR). Results We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations. Availability and implementation The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows. Contact yijuan.hu@emory.edu Supplementary information Supplementary data are available at Bioinformatics online.</description><subject>Linear Models</subject><subject>Microbiota</subject><subject>Original Papers</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkU1LxDAQhoMofv-FpUc9VJNMmm0ugvgNK4Ks55C2091I09SmFfz3Ztl10ZunTCbPvBPel5AJoxeMKrgsrLdt7XtnBluGy2Iwhku6Qw6ZkDTlNFO7sQY5TUVO4YAchfBOacaEEPvkADiwDBQcktc5hsG2i2T51flhiQFDYgo_Dkm8JM6WvY-rHCZjWFGrZmNbNH1SYeld54MdrG8T5ytskrPZ7fP5CdmrTRPwdHMek7f7u_nNYzp7eXi6uZ6lZUbVkBpZyYIKZAA1ilxWtco5pxKMMigUQM5RFmYqFeZVHquCMmlknSnGJRQUjsnVWrcbC4dVie3Qm0Z3vXWm_9LeWP33pbVLvfCfOhcZcAFR4Gwj0PuPMfqgnQ0lNo1p0Y9B8-iQlFTANKJyjUY_Quix3q5hVK8C0X8D0ZtA4uDk9ye3Yz8JRICtAT92_xX9BuBzoBA</recordid><startdate>20200815</startdate><enddate>20200815</enddate><creator>Hu, Yi-Juan</creator><creator>Satten, Glen A</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20200815</creationdate><title>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</title><author>Hu, Yi-Juan ; Satten, Glen A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Linear Models</topic><topic>Microbiota</topic><topic>Original Papers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Yi-Juan</creatorcontrib><creatorcontrib>Satten, Glen A</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hu, Yi-Juan</au><au>Satten, Glen A</au><au>Valencia, Alfonso</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2020-08-15</date><risdate>2020</risdate><volume>36</volume><issue>14</issue><spage>4106</spage><epage>4115</epage><pages>4106-4115</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Abstract Motivation Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR). Results We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations. Availability and implementation The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows. Contact yijuan.hu@emory.edu Supplementary information Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>32315393</pmid><doi>10.1093/bioinformatics/btaa260</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1367-4803
ispartof Bioinformatics, 2020-08, Vol.36 (14), p.4106-4115
issn 1367-4803
1460-2059
1367-4811
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8453243
source Oxford Journals Open Access Collection
subjects Linear Models
Microbiota
Original Papers
title Testing hypotheses about the microbiome using the linear decomposition model (LDM)
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T03%3A09%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Testing%20hypotheses%20about%20the%20microbiome%20using%20the%20linear%20decomposition%20model%20(LDM)&rft.jtitle=Bioinformatics&rft.au=Hu,%20Yi-Juan&rft.date=2020-08-15&rft.volume=36&rft.issue=14&rft.spage=4106&rft.epage=4115&rft.pages=4106-4115&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btaa260&rft_dat=%3Cproquest_TOX%3E2393660437%3C/proquest_TOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2393660437&rft_id=info:pmid/32315393&rft_oup_id=10.1093/bioinformatics/btaa260&rfr_iscdi=true