Testing hypotheses about the microbiome using the linear decomposition model (LDM)
Abstract Motivation Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do...
Gespeichert in:
Veröffentlicht in: | Bioinformatics 2020-08, Vol.36 (14), p.4106-4115 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 4115 |
---|---|
container_issue | 14 |
container_start_page | 4106 |
container_title | Bioinformatics |
container_volume | 36 |
creator | Hu, Yi-Juan Satten, Glen A |
description | Abstract
Motivation
Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR).
Results
We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations.
Availability and implementation
The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows.
Contact
yijuan.hu@emory.edu
Supplementary information
Supplementary data are available at Bioinformatics online. |
doi_str_mv | 10.1093/bioinformatics/btaa260 |
format | Article |
fullrecord | <record><control><sourceid>proquest_TOX</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8453243</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1093/bioinformatics/btaa260</oup_id><sourcerecordid>2393660437</sourcerecordid><originalsourceid>FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</originalsourceid><addsrcrecordid>eNqNkU1LxDAQhoMofv-FpUc9VJNMmm0ugvgNK4Ks55C2091I09SmFfz3Ztl10ZunTCbPvBPel5AJoxeMKrgsrLdt7XtnBluGy2Iwhku6Qw6ZkDTlNFO7sQY5TUVO4YAchfBOacaEEPvkADiwDBQcktc5hsG2i2T51flhiQFDYgo_Dkm8JM6WvY-rHCZjWFGrZmNbNH1SYeld54MdrG8T5ytskrPZ7fP5CdmrTRPwdHMek7f7u_nNYzp7eXi6uZ6lZUbVkBpZyYIKZAA1ilxWtco5pxKMMigUQM5RFmYqFeZVHquCMmlknSnGJRQUjsnVWrcbC4dVie3Qm0Z3vXWm_9LeWP33pbVLvfCfOhcZcAFR4Gwj0PuPMfqgnQ0lNo1p0Y9B8-iQlFTANKJyjUY_Quix3q5hVK8C0X8D0ZtA4uDk9ye3Yz8JRICtAT92_xX9BuBzoBA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2393660437</pqid></control><display><type>article</type><title>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</title><source>Oxford Journals Open Access Collection</source><creator>Hu, Yi-Juan ; Satten, Glen A</creator><contributor>Valencia, Alfonso</contributor><creatorcontrib>Hu, Yi-Juan ; Satten, Glen A ; Valencia, Alfonso</creatorcontrib><description>Abstract
Motivation
Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR).
Results
We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations.
Availability and implementation
The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows.
Contact
yijuan.hu@emory.edu
Supplementary information
Supplementary data are available at Bioinformatics online.</description><identifier>ISSN: 1367-4803</identifier><identifier>EISSN: 1460-2059</identifier><identifier>EISSN: 1367-4811</identifier><identifier>DOI: 10.1093/bioinformatics/btaa260</identifier><identifier>PMID: 32315393</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Linear Models ; Microbiota ; Original Papers</subject><ispartof>Bioinformatics, 2020-08, Vol.36 (14), p.4106-4115</ispartof><rights>The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com 2020</rights><rights>The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</citedby><cites>FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8453243/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8453243/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,723,776,780,881,1598,27901,27902,53766,53768</link.rule.ids><linktorsrc>$$Uhttps://dx.doi.org/10.1093/bioinformatics/btaa260$$EView_record_in_Oxford_University_Press$$FView_record_in_$$GOxford_University_Press</linktorsrc><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32315393$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Valencia, Alfonso</contributor><creatorcontrib>Hu, Yi-Juan</creatorcontrib><creatorcontrib>Satten, Glen A</creatorcontrib><title>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</title><title>Bioinformatics</title><addtitle>Bioinformatics</addtitle><description>Abstract
Motivation
Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR).
Results
We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations.
Availability and implementation
The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows.
Contact
yijuan.hu@emory.edu
Supplementary information
Supplementary data are available at Bioinformatics online.</description><subject>Linear Models</subject><subject>Microbiota</subject><subject>Original Papers</subject><issn>1367-4803</issn><issn>1460-2059</issn><issn>1367-4811</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqNkU1LxDAQhoMofv-FpUc9VJNMmm0ugvgNK4Ks55C2091I09SmFfz3Ztl10ZunTCbPvBPel5AJoxeMKrgsrLdt7XtnBluGy2Iwhku6Qw6ZkDTlNFO7sQY5TUVO4YAchfBOacaEEPvkADiwDBQcktc5hsG2i2T51flhiQFDYgo_Dkm8JM6WvY-rHCZjWFGrZmNbNH1SYeld54MdrG8T5ytskrPZ7fP5CdmrTRPwdHMek7f7u_nNYzp7eXi6uZ6lZUbVkBpZyYIKZAA1ilxWtco5pxKMMigUQM5RFmYqFeZVHquCMmlknSnGJRQUjsnVWrcbC4dVie3Qm0Z3vXWm_9LeWP33pbVLvfCfOhcZcAFR4Gwj0PuPMfqgnQ0lNo1p0Y9B8-iQlFTANKJyjUY_Quix3q5hVK8C0X8D0ZtA4uDk9ye3Yz8JRICtAT92_xX9BuBzoBA</recordid><startdate>20200815</startdate><enddate>20200815</enddate><creator>Hu, Yi-Juan</creator><creator>Satten, Glen A</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20200815</creationdate><title>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</title><author>Hu, Yi-Juan ; Satten, Glen A</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c509t-a6d6b04e133fe486df9822063a9ae493382e6ba769e8d86bab016a6f591263b03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Linear Models</topic><topic>Microbiota</topic><topic>Original Papers</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Yi-Juan</creatorcontrib><creatorcontrib>Satten, Glen A</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Bioinformatics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Hu, Yi-Juan</au><au>Satten, Glen A</au><au>Valencia, Alfonso</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Testing hypotheses about the microbiome using the linear decomposition model (LDM)</atitle><jtitle>Bioinformatics</jtitle><addtitle>Bioinformatics</addtitle><date>2020-08-15</date><risdate>2020</risdate><volume>36</volume><issue>14</issue><spage>4106</spage><epage>4115</epage><pages>4106-4115</pages><issn>1367-4803</issn><eissn>1460-2059</eissn><eissn>1367-4811</eissn><abstract>Abstract
Motivation
Methods for analyzing microbiome data generally fall into one of two groups: tests of the global hypothesis of any microbiome effect, which do not provide any information on the contribution of individual operational taxonomic units (OTUs); and tests for individual OTUs, which do not typically provide a global test of microbiome effect. Without a unified approach, the findings of a global test may be hard to resolve with the findings at the individual OTU level. Further, many tests of individual OTU effects do not preserve the false discovery rate (FDR).
Results
We introduce the linear decomposition model (LDM), that provides a single analysis path that includes global tests of any effect of the microbiome, tests of the effects of individual OTUs while accounting for multiple testing by controlling the FDR, and a connection to distance-based ordination. The LDM accommodates both continuous and discrete variables (e.g. clinical outcomes, environmental factors) as well as interaction terms to be tested either singly or in combination, allows for adjustment of confounding covariates, and uses permutation-based P-values that can control for sample correlation. The LDM can also be applied to transformed data, and an ‘omnibus’ test can easily combine results from analyses conducted on different transformation scales. We also provide a new implementation of PERMANOVA based on our approach. For global testing, our simulations indicate the LDM provided correct type I error and can have comparable power to existing distance-based methods. For testing individual OTUs, our simulations indicate the LDM controlled the FDR well. In contrast, DESeq2 often had inflated FDR; MetagenomeSeq generally had the lowest sensitivity. The flexibility of the LDM for a variety of microbiome studies is illustrated by the analysis of data from two microbiome studies. We also show that our implementation of PERMANOVA can outperform existing implementations.
Availability and implementation
The R package LDM is available on GitHub at https://github.com/yijuanhu/LDM in formats appropriate for Macintosh or Windows.
Contact
yijuan.hu@emory.edu
Supplementary information
Supplementary data are available at Bioinformatics online.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>32315393</pmid><doi>10.1093/bioinformatics/btaa260</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 1367-4803 |
ispartof | Bioinformatics, 2020-08, Vol.36 (14), p.4106-4115 |
issn | 1367-4803 1460-2059 1367-4811 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8453243 |
source | Oxford Journals Open Access Collection |
subjects | Linear Models Microbiota Original Papers |
title | Testing hypotheses about the microbiome using the linear decomposition model (LDM) |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T03%3A09%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_TOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Testing%20hypotheses%20about%20the%20microbiome%20using%20the%20linear%20decomposition%20model%20(LDM)&rft.jtitle=Bioinformatics&rft.au=Hu,%20Yi-Juan&rft.date=2020-08-15&rft.volume=36&rft.issue=14&rft.spage=4106&rft.epage=4115&rft.pages=4106-4115&rft.issn=1367-4803&rft.eissn=1460-2059&rft_id=info:doi/10.1093/bioinformatics/btaa260&rft_dat=%3Cproquest_TOX%3E2393660437%3C/proquest_TOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2393660437&rft_id=info:pmid/32315393&rft_oup_id=10.1093/bioinformatics/btaa260&rfr_iscdi=true |