V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets

Abstract Reverse complementary DNA sequences – sequences that are inadvertently given backwards with all purines and pyrimidines transposed – can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool –v-revcomp (http://www.cmde.sci...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:FEMS microbiology letters 2011-06, Vol.319 (2), p.140-145
Hauptverfasser: Hartmann, Martin, Howes, Charles G., Veldre, Vilmar, Schneider, Salome, Vaishampayan, Parag A., Yannarell, Anthony C., Quince, Christopher, Johansson, Per, Björkroth, K. Johanna, Abarenkov, Kessy, Hallam, Steven J., Mohn, William W., Nilsson, R. Henrik
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 145
container_issue 2
container_start_page 140
container_title FEMS microbiology letters
container_volume 319
creator Hartmann, Martin
Howes, Charles G.
Veldre, Vilmar
Schneider, Salome
Vaishampayan, Parag A.
Yannarell, Anthony C.
Quince, Christopher
Johansson, Per
Björkroth, K. Johanna
Abarenkov, Kessy
Hallam, Steven J.
Mohn, William W.
Nilsson, R. Henrik
description Abstract Reverse complementary DNA sequences – sequences that are inadvertently given backwards with all purines and pyrimidines transposed – can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool –v-revcomp (http://www.cmde.science.ubc.ca/mohn/software.html) – to detect and reorient reverse complementary entries of the small-subunit rRNA (16S) gene from sequencing datasets, particularly from environmental sources. The software supports sequence lengths ranging from full length down to the short reads that are characteristic of next-generation sequencing technologies. We evaluated the reliability of v-revcomp by screening all 406781 16S sequences deposited in release 102 of the curated SILVA database and demonstrated that the tool has a detection accuracy of virtually 100%. We subsequently used v-revcomp to analyse 1171646 16S sequences deposited in the International Nucleotide Sequence Databases and found that about 1% of these user-submitted sequences were reverse complementary. In addition, a nontrivial proportion of the entries were otherwise anomalous, including reverse complementary chimeras, sequences associated with wrong taxa, nonribosomal genes, sequences of poor quality or otherwise erroneous sequences without a reasonable match to any other entry in the database. Thus, v-revcomp is highly efficient in detecting and reorienting reverse complementary 16S sequences of almost any length and can be used to detect various sequence anomalies.
doi_str_mv 10.1111/j.1574-6968.2011.02274.x
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_proquest_miscellaneous_902381550</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><oup_id>10.1111/j.1574-6968.2011.02274.x</oup_id><sourcerecordid>902381550</sourcerecordid><originalsourceid>FETCH-LOGICAL-c5034-a98c5fab553048bf9ef4aea43b5212e9cab059d8c6922b358cd8db63804750a73</originalsourceid><addsrcrecordid>eNqFkl1rFDEUhgdRbK3-BQmIeDVrPmcSwYuytCpsrVTtbchkzuzOMpOsSaZuf4T_2Wx3raCIuUngPG--zlMUiOAZyeP1ekZEzctKVXJGMSEzTGnNZ9sHxfF94WFxjFktS4JVfVQ8iXGNMeYUV4-LI0q4YIzy4-LHdXl1dj2_vPj0Bpkp-dEkaNGqX67KtAp-Wq42U0ItJLCp9w75DgW4gRABWT9uBhjBJRNuEak-o3D18RQtwQGK8G0CZyGi3qHBhCUgcDd98O6OH5BxLUpm650fe4tak0yEFJ8WjzozRHh2mE-Kr-dnX-bvy8Xluw_z00VpBWa8NEpa0ZlGCIa5bDoFHTdgOGsEJRSUNQ0WqpW2UpQ2TEjbyrapmMS8FtjU7KR4td93E3y-aEx67KOFYTAO_BS1wpRJIvJh_yNlLbjCuGaZfPEHufZTcPkZmjIspMKKk0w9P1BTM0KrN6Ef8_fpXx3JwMsDYKI1QxeMs338zeUO1pKozL3dc9_7AW7v6wTrnSF6rXci6J0IemeIvjNEb_X5xWK3ynm2z_tp8490-Vea_QTJ772g</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2305890941</pqid></control><display><type>article</type><title>V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets</title><source>MEDLINE</source><source>Oxford Journals Online</source><source>Wiley Blackwell Journals</source><creator>Hartmann, Martin ; Howes, Charles G. ; Veldre, Vilmar ; Schneider, Salome ; Vaishampayan, Parag A. ; Yannarell, Anthony C. ; Quince, Christopher ; Johansson, Per ; Björkroth, K. Johanna ; Abarenkov, Kessy ; Hallam, Steven J. ; Mohn, William W. ; Nilsson, R. Henrik</creator><creatorcontrib>Hartmann, Martin ; Howes, Charles G. ; Veldre, Vilmar ; Schneider, Salome ; Vaishampayan, Parag A. ; Yannarell, Anthony C. ; Quince, Christopher ; Johansson, Per ; Björkroth, K. Johanna ; Abarenkov, Kessy ; Hallam, Steven J. ; Mohn, William W. ; Nilsson, R. Henrik</creatorcontrib><description>Abstract Reverse complementary DNA sequences – sequences that are inadvertently given backwards with all purines and pyrimidines transposed – can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool –v-revcomp (http://www.cmde.science.ubc.ca/mohn/software.html) – to detect and reorient reverse complementary entries of the small-subunit rRNA (16S) gene from sequencing datasets, particularly from environmental sources. The software supports sequence lengths ranging from full length down to the short reads that are characteristic of next-generation sequencing technologies. We evaluated the reliability of v-revcomp by screening all 406781 16S sequences deposited in release 102 of the curated SILVA database and demonstrated that the tool has a detection accuracy of virtually 100%. We subsequently used v-revcomp to analyse 1171646 16S sequences deposited in the International Nucleotide Sequence Databases and found that about 1% of these user-submitted sequences were reverse complementary. In addition, a nontrivial proportion of the entries were otherwise anomalous, including reverse complementary chimeras, sequences associated with wrong taxa, nonribosomal genes, sequences of poor quality or otherwise erroneous sequences without a reasonable match to any other entry in the database. Thus, v-revcomp is highly efficient in detecting and reorienting reverse complementary 16S sequences of almost any length and can be used to detect various sequence anomalies.</description><identifier>ISSN: 0378-1097</identifier><identifier>EISSN: 1574-6968</identifier><identifier>DOI: 10.1111/j.1574-6968.2011.02274.x</identifier><identifier>PMID: 21453324</identifier><identifier>CODEN: FMLED7</identifier><language>eng</language><publisher>Oxford, UK: Blackwell Publishing Ltd</publisher><subject>16S sequence ; Anomalies ; Bacteria - classification ; Bacteria - genetics ; Bacteria - isolation &amp; purification ; Biological and medical sciences ; Chimeras ; Complementary DNA ; Computer programs ; Databases, Nucleic Acid ; Datasets ; Deoxyribonucleic acid ; DNA ; Environmental Microbiology ; Fundamental and applied biological sciences. Psychology ; Gene sequencing ; Hidden Markov Models ; hmmer ; Microbiology ; Next-generation sequencing ; Nucleotide sequence ; Nucleotides ; Phylogeny ; Purines ; Pyrimidines ; Reliability analysis ; reverse complementary ; RNA, Ribosomal, 16S - genetics ; rRNA 16S ; Sequence Alignment ; Software ; Software development tools ; Source code ; SSU rRNA gene</subject><ispartof>FEMS microbiology letters, 2011-06, Vol.319 (2), p.140-145</ispartof><rights>2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved 2011</rights><rights>2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved</rights><rights>2015 INIST-CNRS</rights><rights>2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c5034-a98c5fab553048bf9ef4aea43b5212e9cab059d8c6922b358cd8db63804750a73</citedby></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1111%2Fj.1574-6968.2011.02274.x$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1111%2Fj.1574-6968.2011.02274.x$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,777,781,1412,27905,27906,45555,45556</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=24207819$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/21453324$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hartmann, Martin</creatorcontrib><creatorcontrib>Howes, Charles G.</creatorcontrib><creatorcontrib>Veldre, Vilmar</creatorcontrib><creatorcontrib>Schneider, Salome</creatorcontrib><creatorcontrib>Vaishampayan, Parag A.</creatorcontrib><creatorcontrib>Yannarell, Anthony C.</creatorcontrib><creatorcontrib>Quince, Christopher</creatorcontrib><creatorcontrib>Johansson, Per</creatorcontrib><creatorcontrib>Björkroth, K. Johanna</creatorcontrib><creatorcontrib>Abarenkov, Kessy</creatorcontrib><creatorcontrib>Hallam, Steven J.</creatorcontrib><creatorcontrib>Mohn, William W.</creatorcontrib><creatorcontrib>Nilsson, R. Henrik</creatorcontrib><title>V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets</title><title>FEMS microbiology letters</title><addtitle>FEMS Microbiol Lett</addtitle><description>Abstract Reverse complementary DNA sequences – sequences that are inadvertently given backwards with all purines and pyrimidines transposed – can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool –v-revcomp (http://www.cmde.science.ubc.ca/mohn/software.html) – to detect and reorient reverse complementary entries of the small-subunit rRNA (16S) gene from sequencing datasets, particularly from environmental sources. The software supports sequence lengths ranging from full length down to the short reads that are characteristic of next-generation sequencing technologies. We evaluated the reliability of v-revcomp by screening all 406781 16S sequences deposited in release 102 of the curated SILVA database and demonstrated that the tool has a detection accuracy of virtually 100%. We subsequently used v-revcomp to analyse 1171646 16S sequences deposited in the International Nucleotide Sequence Databases and found that about 1% of these user-submitted sequences were reverse complementary. In addition, a nontrivial proportion of the entries were otherwise anomalous, including reverse complementary chimeras, sequences associated with wrong taxa, nonribosomal genes, sequences of poor quality or otherwise erroneous sequences without a reasonable match to any other entry in the database. Thus, v-revcomp is highly efficient in detecting and reorienting reverse complementary 16S sequences of almost any length and can be used to detect various sequence anomalies.</description><subject>16S sequence</subject><subject>Anomalies</subject><subject>Bacteria - classification</subject><subject>Bacteria - genetics</subject><subject>Bacteria - isolation &amp; purification</subject><subject>Biological and medical sciences</subject><subject>Chimeras</subject><subject>Complementary DNA</subject><subject>Computer programs</subject><subject>Databases, Nucleic Acid</subject><subject>Datasets</subject><subject>Deoxyribonucleic acid</subject><subject>DNA</subject><subject>Environmental Microbiology</subject><subject>Fundamental and applied biological sciences. Psychology</subject><subject>Gene sequencing</subject><subject>Hidden Markov Models</subject><subject>hmmer</subject><subject>Microbiology</subject><subject>Next-generation sequencing</subject><subject>Nucleotide sequence</subject><subject>Nucleotides</subject><subject>Phylogeny</subject><subject>Purines</subject><subject>Pyrimidines</subject><subject>Reliability analysis</subject><subject>reverse complementary</subject><subject>RNA, Ribosomal, 16S - genetics</subject><subject>rRNA 16S</subject><subject>Sequence Alignment</subject><subject>Software</subject><subject>Software development tools</subject><subject>Source code</subject><subject>SSU rRNA gene</subject><issn>0378-1097</issn><issn>1574-6968</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2011</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNqFkl1rFDEUhgdRbK3-BQmIeDVrPmcSwYuytCpsrVTtbchkzuzOMpOsSaZuf4T_2Wx3raCIuUngPG--zlMUiOAZyeP1ekZEzctKVXJGMSEzTGnNZ9sHxfF94WFxjFktS4JVfVQ8iXGNMeYUV4-LI0q4YIzy4-LHdXl1dj2_vPj0Bpkp-dEkaNGqX67KtAp-Wq42U0ItJLCp9w75DgW4gRABWT9uBhjBJRNuEak-o3D18RQtwQGK8G0CZyGi3qHBhCUgcDd98O6OH5BxLUpm650fe4tak0yEFJ8WjzozRHh2mE-Kr-dnX-bvy8Xluw_z00VpBWa8NEpa0ZlGCIa5bDoFHTdgOGsEJRSUNQ0WqpW2UpQ2TEjbyrapmMS8FtjU7KR4td93E3y-aEx67KOFYTAO_BS1wpRJIvJh_yNlLbjCuGaZfPEHufZTcPkZmjIspMKKk0w9P1BTM0KrN6Ef8_fpXx3JwMsDYKI1QxeMs338zeUO1pKozL3dc9_7AW7v6wTrnSF6rXci6J0IemeIvjNEb_X5xWK3ynm2z_tp8490-Vea_QTJ772g</recordid><startdate>201106</startdate><enddate>201106</enddate><creator>Hartmann, Martin</creator><creator>Howes, Charles G.</creator><creator>Veldre, Vilmar</creator><creator>Schneider, Salome</creator><creator>Vaishampayan, Parag A.</creator><creator>Yannarell, Anthony C.</creator><creator>Quince, Christopher</creator><creator>Johansson, Per</creator><creator>Björkroth, K. Johanna</creator><creator>Abarenkov, Kessy</creator><creator>Hallam, Steven J.</creator><creator>Mohn, William W.</creator><creator>Nilsson, R. Henrik</creator><general>Blackwell Publishing Ltd</general><general>Wiley-Blackwell</general><general>Oxford University Press</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>3V.</scope><scope>7QL</scope><scope>7T7</scope><scope>7TK</scope><scope>7TM</scope><scope>7U9</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8C1</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>H94</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7N</scope><scope>M7P</scope><scope>P64</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>RC3</scope><scope>7X8</scope><scope>7ST</scope><scope>SOI</scope></search><sort><creationdate>201106</creationdate><title>V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets</title><author>Hartmann, Martin ; Howes, Charles G. ; Veldre, Vilmar ; Schneider, Salome ; Vaishampayan, Parag A. ; Yannarell, Anthony C. ; Quince, Christopher ; Johansson, Per ; Björkroth, K. Johanna ; Abarenkov, Kessy ; Hallam, Steven J. ; Mohn, William W. ; Nilsson, R. Henrik</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c5034-a98c5fab553048bf9ef4aea43b5212e9cab059d8c6922b358cd8db63804750a73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2011</creationdate><topic>16S sequence</topic><topic>Anomalies</topic><topic>Bacteria - classification</topic><topic>Bacteria - genetics</topic><topic>Bacteria - isolation &amp; purification</topic><topic>Biological and medical sciences</topic><topic>Chimeras</topic><topic>Complementary DNA</topic><topic>Computer programs</topic><topic>Databases, Nucleic Acid</topic><topic>Datasets</topic><topic>Deoxyribonucleic acid</topic><topic>DNA</topic><topic>Environmental Microbiology</topic><topic>Fundamental and applied biological sciences. Psychology</topic><topic>Gene sequencing</topic><topic>Hidden Markov Models</topic><topic>hmmer</topic><topic>Microbiology</topic><topic>Next-generation sequencing</topic><topic>Nucleotide sequence</topic><topic>Nucleotides</topic><topic>Phylogeny</topic><topic>Purines</topic><topic>Pyrimidines</topic><topic>Reliability analysis</topic><topic>reverse complementary</topic><topic>RNA, Ribosomal, 16S - genetics</topic><topic>rRNA 16S</topic><topic>Sequence Alignment</topic><topic>Software</topic><topic>Software development tools</topic><topic>Source code</topic><topic>SSU rRNA gene</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hartmann, Martin</creatorcontrib><creatorcontrib>Howes, Charles G.</creatorcontrib><creatorcontrib>Veldre, Vilmar</creatorcontrib><creatorcontrib>Schneider, Salome</creatorcontrib><creatorcontrib>Vaishampayan, Parag A.</creatorcontrib><creatorcontrib>Yannarell, Anthony C.</creatorcontrib><creatorcontrib>Quince, Christopher</creatorcontrib><creatorcontrib>Johansson, Per</creatorcontrib><creatorcontrib>Björkroth, K. Johanna</creatorcontrib><creatorcontrib>Abarenkov, Kessy</creatorcontrib><creatorcontrib>Hallam, Steven J.</creatorcontrib><creatorcontrib>Mohn, William W.</creatorcontrib><creatorcontrib>Nilsson, R. Henrik</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>ProQuest Central (Corporate)</collection><collection>Bacteriology Abstracts (Microbiology B)</collection><collection>Industrial and Applied Microbiology Abstracts (Microbiology A)</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Virology and AIDS Abstracts</collection><collection>ProQuest Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>ProQuest Public Health Database</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>ProQuest Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>AIDS and Cancer Research Abstracts</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>PML(ProQuest Medical Library)</collection><collection>Algology Mycology and Protozoology Abstracts (Microbiology C)</collection><collection>ProQuest Biological Science Journals</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>Environment Abstracts</collection><collection>Environment Abstracts</collection><jtitle>FEMS microbiology letters</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hartmann, Martin</au><au>Howes, Charles G.</au><au>Veldre, Vilmar</au><au>Schneider, Salome</au><au>Vaishampayan, Parag A.</au><au>Yannarell, Anthony C.</au><au>Quince, Christopher</au><au>Johansson, Per</au><au>Björkroth, K. Johanna</au><au>Abarenkov, Kessy</au><au>Hallam, Steven J.</au><au>Mohn, William W.</au><au>Nilsson, R. Henrik</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets</atitle><jtitle>FEMS microbiology letters</jtitle><addtitle>FEMS Microbiol Lett</addtitle><date>2011-06</date><risdate>2011</risdate><volume>319</volume><issue>2</issue><spage>140</spage><epage>145</epage><pages>140-145</pages><issn>0378-1097</issn><eissn>1574-6968</eissn><coden>FMLED7</coden><abstract>Abstract Reverse complementary DNA sequences – sequences that are inadvertently given backwards with all purines and pyrimidines transposed – can affect sequence analysis detrimentally unless taken into account. We present an open-source, high-throughput software tool –v-revcomp (http://www.cmde.science.ubc.ca/mohn/software.html) – to detect and reorient reverse complementary entries of the small-subunit rRNA (16S) gene from sequencing datasets, particularly from environmental sources. The software supports sequence lengths ranging from full length down to the short reads that are characteristic of next-generation sequencing technologies. We evaluated the reliability of v-revcomp by screening all 406781 16S sequences deposited in release 102 of the curated SILVA database and demonstrated that the tool has a detection accuracy of virtually 100%. We subsequently used v-revcomp to analyse 1171646 16S sequences deposited in the International Nucleotide Sequence Databases and found that about 1% of these user-submitted sequences were reverse complementary. In addition, a nontrivial proportion of the entries were otherwise anomalous, including reverse complementary chimeras, sequences associated with wrong taxa, nonribosomal genes, sequences of poor quality or otherwise erroneous sequences without a reasonable match to any other entry in the database. Thus, v-revcomp is highly efficient in detecting and reorienting reverse complementary 16S sequences of almost any length and can be used to detect various sequence anomalies.</abstract><cop>Oxford, UK</cop><pub>Blackwell Publishing Ltd</pub><pmid>21453324</pmid><doi>10.1111/j.1574-6968.2011.02274.x</doi><tpages>6</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0378-1097
ispartof FEMS microbiology letters, 2011-06, Vol.319 (2), p.140-145
issn 0378-1097
1574-6968
language eng
recordid cdi_proquest_miscellaneous_902381550
source MEDLINE; Oxford Journals Online; Wiley Blackwell Journals
subjects 16S sequence
Anomalies
Bacteria - classification
Bacteria - genetics
Bacteria - isolation & purification
Biological and medical sciences
Chimeras
Complementary DNA
Computer programs
Databases, Nucleic Acid
Datasets
Deoxyribonucleic acid
DNA
Environmental Microbiology
Fundamental and applied biological sciences. Psychology
Gene sequencing
Hidden Markov Models
hmmer
Microbiology
Next-generation sequencing
Nucleotide sequence
Nucleotides
Phylogeny
Purines
Pyrimidines
Reliability analysis
reverse complementary
RNA, Ribosomal, 16S - genetics
rRNA 16S
Sequence Alignment
Software
Software development tools
Source code
SSU rRNA gene
title V-REVCOMP: automated high-throughput detection of reverse complementary 16S rRNA gene sequences in large environmental and taxonomic datasets
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T17%3A47%3A13IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=V-REVCOMP:%20automated%20high-throughput%20detection%20of%20reverse%20complementary%2016S%20rRNA%20gene%20sequences%20in%20large%20environmental%20and%20taxonomic%20datasets&rft.jtitle=FEMS%20microbiology%20letters&rft.au=Hartmann,%20Martin&rft.date=2011-06&rft.volume=319&rft.issue=2&rft.spage=140&rft.epage=145&rft.pages=140-145&rft.issn=0378-1097&rft.eissn=1574-6968&rft.coden=FMLED7&rft_id=info:doi/10.1111/j.1574-6968.2011.02274.x&rft_dat=%3Cproquest_pubme%3E902381550%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2305890941&rft_id=info:pmid/21453324&rft_oup_id=10.1111/j.1574-6968.2011.02274.x&rfr_iscdi=true