Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem

DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large M...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Molecular ecology resources 2021-10, Vol.21 (7), p.2546-2564
Hauptverfasser: Gold, Zachary, Curd, Emily E., Goodwin, Kelly D., Choi, Emma S., Frable, Benjamin W., Thompson, Andrew R., Walker, Harold J., Burton, Ronald S., Kacev, Dovi, Martz, Lucas D., Barber, Paul H.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2564
container_issue 7
container_start_page 2546
container_title Molecular ecology resources
container_volume 21
creator Gold, Zachary
Curd, Emily E.
Goodwin, Kelly D.
Choi, Emma S.
Frable, Benjamin W.
Thompson, Andrew R.
Walker, Harold J.
Burton, Ronald S.
Kacev, Dovi
Martz, Lucas D.
Barber, Paul H.
description DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross‐validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus‐specific validation efforts.
doi_str_mv 10.1111/1755-0998.13450
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2549692280</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2570263589</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3890-d8a804841c54672f41524b7b2e3403d99616bc3ef36fa3d910c1fa8144f00de43</originalsourceid><addsrcrecordid>eNqFkD1PwzAQhiMEEqUws1piYUlrJ05is1VVgUoFFpDYjOOci6skLnEC5N_jEMTAwi33oec93b1BcE7wjPiYkyxJQsw5m5GYJvggmPxODn9r9nwcnDi3wzjFPKOT4GVd7Rv7buotqqCVuWyULYaulZ-2tpVRSDpntnUFdXuFFkhJB8i1XdEjq5E27hUcMjWSqJTNFlAlG1MDAmVd71qoToMjLUsHZz95Gjxdrx6Xt-Hm4Wa9XGxCFTOOw4JJhimjRCU0zSJNSRLRPMsjiCmOC85TkuYqBh2nWvqeYEW0ZIRSjXEBNJ4Gl-Ne_85bB64VlXEKylLWYDsnooTylEcRwx69-IPubNfU_jpPZThK44RxT81HSjXWuQa02DfGf9cLgsXguBg8FYO_4ttxr0hGxYcpof8PF3er-1H3BSiegfY</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2570263589</pqid></control><display><type>article</type><title>Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem</title><source>Wiley Online Library All Journals</source><creator>Gold, Zachary ; Curd, Emily E. ; Goodwin, Kelly D. ; Choi, Emma S. ; Frable, Benjamin W. ; Thompson, Andrew R. ; Walker, Harold J. ; Burton, Ronald S. ; Kacev, Dovi ; Martz, Lucas D. ; Barber, Paul H.</creator><creatorcontrib>Gold, Zachary ; Curd, Emily E. ; Goodwin, Kelly D. ; Choi, Emma S. ; Frable, Benjamin W. ; Thompson, Andrew R. ; Walker, Harold J. ; Burton, Ronald S. ; Kacev, Dovi ; Martz, Lucas D. ; Barber, Paul H.</creatorcontrib><description>DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross‐validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus‐specific validation efforts.</description><identifier>ISSN: 1755-098X</identifier><identifier>EISSN: 1755-0998</identifier><identifier>DOI: 10.1111/1755-0998.13450</identifier><language>eng</language><publisher>Oxford: Wiley Subscription Services, Inc</publisher><subject>Accuracy ; Best practice ; California Current Large Marine Ecosystem ; Case studies ; Chemical analysis ; Classification ; Deoxyribonucleic acid ; DNA ; Ecological effects ; Ecosystems ; eDNA ; Environmental DNA ; Marine ecosystems ; Marine environment ; Marine fish ; metabarcoding ; MiFish primers ; Parameters ; Performance evaluation ; reference database ; Seawater ; Species ; Taxonomy ; Water analysis</subject><ispartof>Molecular ecology resources, 2021-10, Vol.21 (7), p.2546-2564</ispartof><rights>2021 John Wiley &amp; Sons Ltd</rights><rights>Copyright © 2021 John Wiley &amp; Sons Ltd</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3890-d8a804841c54672f41524b7b2e3403d99616bc3ef36fa3d910c1fa8144f00de43</citedby><cites>FETCH-LOGICAL-c3890-d8a804841c54672f41524b7b2e3403d99616bc3ef36fa3d910c1fa8144f00de43</cites><orcidid>0000-0003-4525-0671 ; 0000-0001-7769-5558 ; 0000-0001-7671-7487 ; 0000-0002-6995-5329 ; 0000-0003-0490-7630 ; 0000-0001-9583-8073 ; 0000-0003-0336-6852</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1111%2F1755-0998.13450$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1111%2F1755-0998.13450$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1416,27922,27923,45572,45573</link.rule.ids></links><search><creatorcontrib>Gold, Zachary</creatorcontrib><creatorcontrib>Curd, Emily E.</creatorcontrib><creatorcontrib>Goodwin, Kelly D.</creatorcontrib><creatorcontrib>Choi, Emma S.</creatorcontrib><creatorcontrib>Frable, Benjamin W.</creatorcontrib><creatorcontrib>Thompson, Andrew R.</creatorcontrib><creatorcontrib>Walker, Harold J.</creatorcontrib><creatorcontrib>Burton, Ronald S.</creatorcontrib><creatorcontrib>Kacev, Dovi</creatorcontrib><creatorcontrib>Martz, Lucas D.</creatorcontrib><creatorcontrib>Barber, Paul H.</creatorcontrib><title>Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem</title><title>Molecular ecology resources</title><description>DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross‐validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus‐specific validation efforts.</description><subject>Accuracy</subject><subject>Best practice</subject><subject>California Current Large Marine Ecosystem</subject><subject>Case studies</subject><subject>Chemical analysis</subject><subject>Classification</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>Ecological effects</subject><subject>Ecosystems</subject><subject>eDNA</subject><subject>Environmental DNA</subject><subject>Marine ecosystems</subject><subject>Marine environment</subject><subject>Marine fish</subject><subject>metabarcoding</subject><subject>MiFish primers</subject><subject>Parameters</subject><subject>Performance evaluation</subject><subject>reference database</subject><subject>Seawater</subject><subject>Species</subject><subject>Taxonomy</subject><subject>Water analysis</subject><issn>1755-098X</issn><issn>1755-0998</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><recordid>eNqFkD1PwzAQhiMEEqUws1piYUlrJ05is1VVgUoFFpDYjOOci6skLnEC5N_jEMTAwi33oec93b1BcE7wjPiYkyxJQsw5m5GYJvggmPxODn9r9nwcnDi3wzjFPKOT4GVd7Rv7buotqqCVuWyULYaulZ-2tpVRSDpntnUFdXuFFkhJB8i1XdEjq5E27hUcMjWSqJTNFlAlG1MDAmVd71qoToMjLUsHZz95Gjxdrx6Xt-Hm4Wa9XGxCFTOOw4JJhimjRCU0zSJNSRLRPMsjiCmOC85TkuYqBh2nWvqeYEW0ZIRSjXEBNJ4Gl-Ne_85bB64VlXEKylLWYDsnooTylEcRwx69-IPubNfU_jpPZThK44RxT81HSjXWuQa02DfGf9cLgsXguBg8FYO_4ttxr0hGxYcpof8PF3er-1H3BSiegfY</recordid><startdate>202110</startdate><enddate>202110</enddate><creator>Gold, Zachary</creator><creator>Curd, Emily E.</creator><creator>Goodwin, Kelly D.</creator><creator>Choi, Emma S.</creator><creator>Frable, Benjamin W.</creator><creator>Thompson, Andrew R.</creator><creator>Walker, Harold J.</creator><creator>Burton, Ronald S.</creator><creator>Kacev, Dovi</creator><creator>Martz, Lucas D.</creator><creator>Barber, Paul H.</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SN</scope><scope>7SS</scope><scope>8FD</scope><scope>C1K</scope><scope>FR3</scope><scope>M7N</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-4525-0671</orcidid><orcidid>https://orcid.org/0000-0001-7769-5558</orcidid><orcidid>https://orcid.org/0000-0001-7671-7487</orcidid><orcidid>https://orcid.org/0000-0002-6995-5329</orcidid><orcidid>https://orcid.org/0000-0003-0490-7630</orcidid><orcidid>https://orcid.org/0000-0001-9583-8073</orcidid><orcidid>https://orcid.org/0000-0003-0336-6852</orcidid></search><sort><creationdate>202110</creationdate><title>Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem</title><author>Gold, Zachary ; Curd, Emily E. ; Goodwin, Kelly D. ; Choi, Emma S. ; Frable, Benjamin W. ; Thompson, Andrew R. ; Walker, Harold J. ; Burton, Ronald S. ; Kacev, Dovi ; Martz, Lucas D. ; Barber, Paul H.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3890-d8a804841c54672f41524b7b2e3403d99616bc3ef36fa3d910c1fa8144f00de43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Accuracy</topic><topic>Best practice</topic><topic>California Current Large Marine Ecosystem</topic><topic>Case studies</topic><topic>Chemical analysis</topic><topic>Classification</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>Ecological effects</topic><topic>Ecosystems</topic><topic>eDNA</topic><topic>Environmental DNA</topic><topic>Marine ecosystems</topic><topic>Marine environment</topic><topic>Marine fish</topic><topic>metabarcoding</topic><topic>MiFish primers</topic><topic>Parameters</topic><topic>Performance evaluation</topic><topic>reference database</topic><topic>Seawater</topic><topic>Species</topic><topic>Taxonomy</topic><topic>Water analysis</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Gold, Zachary</creatorcontrib><creatorcontrib>Curd, Emily E.</creatorcontrib><creatorcontrib>Goodwin, Kelly D.</creatorcontrib><creatorcontrib>Choi, Emma S.</creatorcontrib><creatorcontrib>Frable, Benjamin W.</creatorcontrib><creatorcontrib>Thompson, Andrew R.</creatorcontrib><creatorcontrib>Walker, Harold J.</creatorcontrib><creatorcontrib>Burton, Ronald S.</creatorcontrib><creatorcontrib>Kacev, Dovi</creatorcontrib><creatorcontrib>Martz, Lucas D.</creatorcontrib><creatorcontrib>Barber, Paul H.</creatorcontrib><collection>CrossRef</collection><collection>Ecology Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Technology Research Database</collection><collection>Environmental Sciences and Pollution Management</collection><collection>Engineering Research Database</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Molecular ecology resources</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gold, Zachary</au><au>Curd, Emily E.</au><au>Goodwin, Kelly D.</au><au>Choi, Emma S.</au><au>Frable, Benjamin W.</au><au>Thompson, Andrew R.</au><au>Walker, Harold J.</au><au>Burton, Ronald S.</au><au>Kacev, Dovi</au><au>Martz, Lucas D.</au><au>Barber, Paul H.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem</atitle><jtitle>Molecular ecology resources</jtitle><date>2021-10</date><risdate>2021</risdate><volume>21</volume><issue>7</issue><spage>2546</spage><epage>2564</epage><pages>2546-2564</pages><issn>1755-098X</issn><eissn>1755-0998</eissn><abstract>DNA metabarcoding is an important tool for molecular ecology. However, its effectiveness hinges on the quality of reference sequence databases and classification parameters employed. Here we evaluate the performance of MiFish 12S taxonomic assignments using a case study of California Current Large Marine Ecosystem fishes to determine best practices for metabarcoding. Specifically, we use a taxonomy cross‐validation by identity framework to compare classification performance between a global database comprised of all available sequences and a curated database that only includes sequences of fishes from the California Current Large Marine Ecosystem. We demonstrate that the regional database provides higher assignment accuracy than the comprehensive global database. We also document a tradeoff between accuracy and misclassification across a range of taxonomic cutoff scores, highlighting the importance of parameter selection for taxonomic classification. Furthermore, we compared assignment accuracy with and without the inclusion of additionally generated reference sequences. To this end, we sequenced tissue from 597 species using the MiFish 12S primers, adding 252 species to GenBank's existing 550 California Current Large Marine Ecosystem fish sequences. We then compared species and reads identified from seawater environmental DNA samples using global databases with and without our generated references, and the regional database. The addition of new references allowed for the identification of 16 additional native taxa representing 17.0% of total reads from eDNA samples, including species with vast ecological and economic value. Together these results demonstrate the importance of comprehensive and curated reference databases for effective metabarcoding and the need for locus‐specific validation efforts.</abstract><cop>Oxford</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1111/1755-0998.13450</doi><tpages>0</tpages><orcidid>https://orcid.org/0000-0003-4525-0671</orcidid><orcidid>https://orcid.org/0000-0001-7769-5558</orcidid><orcidid>https://orcid.org/0000-0001-7671-7487</orcidid><orcidid>https://orcid.org/0000-0002-6995-5329</orcidid><orcidid>https://orcid.org/0000-0003-0490-7630</orcidid><orcidid>https://orcid.org/0000-0001-9583-8073</orcidid><orcidid>https://orcid.org/0000-0003-0336-6852</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1755-098X
ispartof Molecular ecology resources, 2021-10, Vol.21 (7), p.2546-2564
issn 1755-098X
1755-0998
language eng
recordid cdi_proquest_miscellaneous_2549692280
source Wiley Online Library All Journals
subjects Accuracy
Best practice
California Current Large Marine Ecosystem
Case studies
Chemical analysis
Classification
Deoxyribonucleic acid
DNA
Ecological effects
Ecosystems
eDNA
Environmental DNA
Marine ecosystems
Marine environment
Marine fish
metabarcoding
MiFish primers
Parameters
Performance evaluation
reference database
Seawater
Species
Taxonomy
Water analysis
title Improving metabarcoding taxonomic assignment: A case study of fishes in a large marine ecosystem
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T20%3A31%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Improving%20metabarcoding%20taxonomic%20assignment:%20A%20case%20study%20of%20fishes%20in%20a%20large%20marine%20ecosystem&rft.jtitle=Molecular%20ecology%20resources&rft.au=Gold,%20Zachary&rft.date=2021-10&rft.volume=21&rft.issue=7&rft.spage=2546&rft.epage=2564&rft.pages=2546-2564&rft.issn=1755-098X&rft.eissn=1755-0998&rft_id=info:doi/10.1111/1755-0998.13450&rft_dat=%3Cproquest_cross%3E2570263589%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2570263589&rft_id=info:pmid/&rfr_iscdi=true