Metacoder: An R package for visualization and manipulation of community taxonomic diversity data

Community-level data, the type generated by an increasing number of metabarcoding studies, is often graphed as stacked bar charts or pie graphs that use color to represent taxa. These graph types do not convey the hierarchical structure of taxonomic classifications and are limited by the use of colo...

Bibliographic details
Published in: PLoS computational biology, 2017-02, Vol. 13 (2), p. e1005404
Main authors: Foster, Zachary S L; Sharpton, Thomas J; Grünwald, Niklaus J
Format: Article
Language: English
Online access: Full text
container_end_page
container_issue 2
container_start_page e1005404
container_title PLoS computational biology
container_volume 13
creator Foster, Zachary S L
Sharpton, Thomas J
Grünwald, Niklaus J
description Community-level data, the type generated by an increasing number of metabarcoding studies, is often graphed as stacked bar charts or pie graphs that use color to represent taxa. These graph types do not convey the hierarchical structure of taxonomic classifications and are limited by the use of color for categories. As an alternative, we developed metacoder, an R package for easily parsing, manipulating, and graphing publication-ready plots of hierarchical data. Metacoder includes a dynamic and flexible function that can parse most text-based formats that contain taxonomic classifications, taxon names, taxon identifiers, or sequence identifiers. Metacoder can then subset, sample, and order this parsed data using a set of intuitive functions that take into account the hierarchical nature of the data. Finally, an extremely flexible plotting function enables quantitative representation of up to 4 arbitrary statistics simultaneously in a tree format by mapping statistics to the color and size of tree nodes and edges. Metacoder also allows exploration of barcode primer bias by integrating functions to run digital PCR. Although it has been designed for data from metabarcoding research, metacoder can easily be applied to any data that has a hierarchical component such as gene ontology or geographic location data. Our package complements currently available tools for community analysis and is provided open source with an extensive online user manual.
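The idea in the description above, counting observations at every level of a taxonomic hierarchy and mapping that statistic to the size and color of tree nodes, can be sketched in a few lines. This is a minimal Python illustration under stated assumptions, not metacoder's R API: the lineage strings, the function names `count_taxa`, `scale`, and `node_aesthetics`, and the size and grey-level ranges are all hypothetical, and only one statistic is mapped to nodes (the package description mentions up to four statistics across nodes and edges).

```python
from collections import defaultdict

# Hypothetical input: one lineage string per observed sequence,
# with ranks separated by ';' (an assumed format for illustration).
lineages = [
    "Fungi;Ascomycota;Sordariomycetes",
    "Fungi;Ascomycota;Dothideomycetes",
    "Fungi;Basidiomycota;Agaricomycetes",
    "Fungi;Ascomycota;Sordariomycetes",
]


def count_taxa(lineages, sep=";"):
    """Count observations for every taxon, including internal (parent) taxa.

    Each taxon is keyed by its full path from the root, so taxa that share
    a name under different parents stay distinct.
    """
    counts = defaultdict(int)
    for lineage in lineages:
        ranks = [r.strip() for r in lineage.split(sep) if r.strip()]
        for depth in range(1, len(ranks) + 1):
            counts[tuple(ranks[:depth])] += 1
    return dict(counts)


def scale(value, lo, hi, out_lo, out_hi):
    """Linearly rescale value from [lo, hi] to [out_lo, out_hi]."""
    if hi == lo:
        return out_lo
    return out_lo + (value - lo) * (out_hi - out_lo) / (hi - lo)


def node_aesthetics(counts, size_range=(5.0, 20.0)):
    """Map one statistic (observation count) to node size and grey level."""
    lo, hi = min(counts.values()), max(counts.values())
    aesthetics = {}
    for path, n in counts.items():
        size = scale(n, lo, hi, *size_range)
        grey = round(scale(n, lo, hi, 220, 40))  # more observations -> darker
        aesthetics[path] = {
            "label": path[-1],
            "size": size,
            "color": "#{0:02x}{0:02x}{0:02x}".format(grey),
        }
    return aesthetics


counts = count_taxa(lineages)
# Print the tree as an indented outline: label, count, node size, node color.
for path, aes in sorted(node_aesthetics(counts).items()):
    indent = "  " * (len(path) - 1)
    print(indent + aes["label"], counts[path], aes["size"], aes["color"])
```

Keying each taxon by its full path is a design choice of this sketch: it keeps the hierarchy implicit in the keys, so parent counts are simply the sum of all observations that pass through them, which is the information a stacked bar chart or pie graph does not show.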
doi_str_mv 10.1371/journal.pcbi.1005404
format Article
fullrecord <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1878077120</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A493714522</galeid><doaj_id>oai_doaj_org_article_3df04e1e5b44428f8897820f25de1de8</doaj_id><sourcerecordid>A493714522</sourcerecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1878077120</pqid></control><display><type>article</type><title>Metacoder: An R package for visualization and manipulation of community taxonomic diversity data</title><source>DOAJ, Directory of Open Access Journals</source><source>PubMed Central (Open Access)</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>MEDLINE</source><source>EZB Electronic Journals Library</source><creator>Foster, Zachary S L ; Sharpton, Thomas J ; Grünwald, Niklaus J</creator><description>Community-level data, the type generated by an increasing number of metabarcoding studies, is often graphed as stacked bar charts or pie graphs that use color to represent taxa. These graph types do not convey the hierarchical structure of taxonomic classifications and are limited by the use of color for categories. As an alternative, we developed metacoder, an R package for easily parsing, manipulating, and graphing publication-ready plots of hierarchical data. Metacoder includes a dynamic and flexible function that can parse most text-based formats that contain taxonomic classifications, taxon names, taxon identifiers, or sequence identifiers. Metacoder can then subset, sample, and order this parsed data using a set of intuitive functions that take into account the hierarchical nature of the data. Finally, an extremely flexible plotting function enables quantitative representation of up to 4 arbitrary statistics simultaneously in a tree format by mapping statistics to the color and size of tree nodes and edges. Metacoder also allows exploration of barcode primer bias by integrating functions to run digital PCR. Although it has been designed for data from metabarcoding research, metacoder can easily be applied to any data that has a hierarchical component such as gene ontology or geographic location data. Our package complements currently available tools for community analysis and is provided open source with an extensive online user manual.</description><identifier>ISSN: 1553-7358</identifier><identifier>ISSN: 1553-734X</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1005404</identifier><identifier>PMID: 28222096</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Algorithms ; Bias ; Biology and Life Sciences ; Colleges &amp; universities ; Computer and Information Sciences ; Computer Graphics ; DNA - genetics ; DNA Barcoding, Taxonomic - methods ; Documentation ; Earth Sciences ; Funding ; Gene expression ; Genetic engineering ; Genetic research ; Genetic Variation - genetics ; High-Throughput Nucleotide Sequencing ; Methods ; Ontology ; Programming Languages ; Research and Analysis Methods ; Scientific software ; Software ; Taxonomy ; Technology application ; Trees ; User-Computer Interface</subject><ispartof>PLoS computational biology, 2017-02, Vol.13 (2), p.e1005404</ispartof><rights>COPYRIGHT 2017 Public Library of Science</rights><rights>2017 Public Library of Science. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Foster ZSL, Sharpton TJ, Grünwald NJ (2017) Metacoder: An R package for visualization and manipulation of community taxonomic diversity data. PLoS Comput Biol 13(2): e1005404. doi:10.1371/journal.pcbi.1005404</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0003-1656-7602</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340466/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC5340466/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,2096,2915,23845,27901,27902,53766,53768,79342,79343</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28222096$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery></record>
fulltext fulltext
identifier ISSN: 1553-7358
ispartof PLoS computational biology, 2017-02, Vol.13 (2), p.e1005404
issn 1553-7358
1553-734X
1553-7358
language eng
recordid cdi_plos_journals_1878077120
source DOAJ, Directory of Open Access Journals; PubMed Central (Open Access); Public Library of Science (PLoS) Journals Open Access; MEDLINE; EZB Electronic Journals Library
subjects Algorithms
Bias
Biology and Life Sciences
Colleges & universities
Computer and Information Sciences
Computer Graphics
DNA - genetics
DNA Barcoding, Taxonomic - methods
Documentation
Earth Sciences
Funding
Gene expression
Genetic engineering
Genetic research
Genetic Variation - genetics
High-Throughput Nucleotide Sequencing
Methods
Ontology
Programming Languages
Research and Analysis Methods
Scientific software
Software
Taxonomy
Technology application
Trees
User-Computer Interface
title Metacoder: An R package for visualization and manipulation of community taxonomic diversity data
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-07T15%3A17%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Metacoder:%20An%20R%20package%20for%20visualization%20and%20manipulation%20of%20community%20taxonomic%20diversity%20data&rft.jtitle=PLoS%20computational%20biology&rft.au=Foster,%20Zachary%20S%20L&rft.date=2017-02-01&rft.volume=13&rft.issue=2&rft.spage=e1005404&rft.pages=e1005404-&rft.issn=1553-7358&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1005404&rft_dat=%3Cgale_plos_%3EA493714522%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1878077120&rft_id=info:pmid/28222096&rft_galeid=A493714522&rft_doaj_id=oai_doaj_org_article_3df04e1e5b44428f8897820f25de1de8&rfr_iscdi=true