Data for: Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques

This dataset on Zenodo accompanies the manuscript Salvatore et al. (2024), Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques. There are two files: metadata.tsv - plain text table as tab-separated variables raw...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Seddaiu, Salvatore, Riddell, Carolyn, Piras, Giovanni, Ruiu, Pino Angelo, Sarais, Luca, Mello, Antonietta, Brandano, Andrea, Cock, Peter J. A., Green, Sarah, Scanu, Bruno
Format: Dataset
Sprache:eng
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Seddaiu, Salvatore
Riddell, Carolyn
Piras, Giovanni
Ruiu, Pino Angelo
Sarais, Luca
Mello, Antonietta
Brandano, Andrea
Cock, Peter J. A.
Green, Sarah
Scanu, Bruno
description This dataset on Zenodo accompanies the manuscript Salvatore et al. (2024), Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques. There are two files: metadata.tsv - plain text table as tab-separated variables raw_data.tar.gz - compressed archive of 56 paired raw FASTQ files This represents a subset of one complete Illumina Nano MiSeq plate run at the James Hutton Institute also containing a small number of unrelated samples using the same protocol. To repeat the analysis described in the paper, first install THAPBI PICT. See https://github.com/peterjc/thapbi-pict/ for instructions. At the time of the paper, v1.0.16 was the current release. Next, decompress the raw data into a folder of paired gzipped FASTQ files. There is no need to decompress those: $ tar -zxvf raw_data.tar.gz $ ls -1 raw_data/ If you wish, verify the checksums to confirm the data integrity: $ cd raw_data/   $ md5sum -c MD5SUM.txt $ cd .. Setup output directories:   $ mkdir -p intermediate/ summary/ Run the THAPBI PICT pipeline:   $ thapbi_pict pipeline -m 1s3g -f 0 -a 15 -i raw_data/ \ -s intermediate/ -o summary/sardinia_20240912_v1.0.16 \ -t metadata.tsv -u -x 8 -c 4,5,3,2,7,6 The options here are as follows: -m - use the 1s3g classifier (see methods) -f - set to zero to disable the fractional abundance threshold -a - set a lower absolute abundance threshold -i - location of the input raw data -s - optional location to store intermediate files -o - output stem for reports -t - filename for tab-separated-variable metadata -u - show unsequenced samples defined in the metadata -x - which metadata column contains Illumina FASTQ filename stems -c - which metadata columns to include in the report. This leaves the -d option with the default provided ITS1 database. We are NOT taking advantage of the negative controls to automatically set a blanket minimum abundance as Control-Plate-3-Mix-3-Dry-P3-c_S56_L001 sadly has over 3000 Phytophthora reads: That takes under a minute to run, and classifies most of the samples. Opening the output file summary/sardinia_20240912_v1.0.16.ITS1.samples.1s3g.xlsx in Excel or similar should show you a table resembling Table 3 in the paper, without the baiting results, but with one row per sequencing sample, and additional columns with per-sample per-species read counts etc. The similarly named reads file as one row per unique amplicon sequence variant (ASV), and columns for ea
doi_str_mv 10.5281/zenodo.13753737
format Dataset
fullrecord <record><control><sourceid>datacite_PQ8</sourceid><recordid>TN_cdi_datacite_primary_10_5281_zenodo_13753737</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_5281_zenodo_13753737</sourcerecordid><originalsourceid>FETCH-datacite_primary_10_5281_zenodo_137537373</originalsourceid><addsrcrecordid>eNqVj89KxDAQxnvxIOrZ67yAu1vLUvG2bBVPouA9pMl0M9BmamYi1AfxeW1QH8DTwDd8f35VdV3vNvvbu3r7iZE9b-qm3Tdt055XX51VCwOne-hQ0SlxBBs9ePrAJKQL8AAvYVGegwZOFmRGRygwJJ7AoxspUjzBa8bksoDkHhOIriECWcqrZw3QPR9gQrW9TY59kUuNMI3QW9IirP0h0ntGuazOBjsKXv3ei2r7-PB2fLrx61xHimZONNm0mHpnCpn5ITN_ZM3_Hd_qpWHU</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>dataset</recordtype></control><display><type>dataset</type><title>Data for: Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques</title><source>DataCite</source><creator>Seddaiu, Salvatore ; Riddell, Carolyn ; Piras, Giovanni ; Ruiu, Pino Angelo ; Sarais, Luca ; Mello, Antonietta ; Brandano, Andrea ; Cock, Peter J. A. ; Green, Sarah ; Scanu, Bruno</creator><creatorcontrib>Seddaiu, Salvatore ; Riddell, Carolyn ; Piras, Giovanni ; Ruiu, Pino Angelo ; Sarais, Luca ; Mello, Antonietta ; Brandano, Andrea ; Cock, Peter J. A. ; Green, Sarah ; Scanu, Bruno</creatorcontrib><description>This dataset on Zenodo accompanies the manuscript Salvatore et al. (2024), Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques. There are two files: metadata.tsv - plain text table as tab-separated variables raw_data.tar.gz - compressed archive of 56 paired raw FASTQ files This represents a subset of one complete Illumina Nano MiSeq plate run at the James Hutton Institute also containing a small number of unrelated samples using the same protocol. To repeat the analysis described in the paper, first install THAPBI PICT. See https://github.com/peterjc/thapbi-pict/ for instructions. At the time of the paper, v1.0.16 was the current release. Next, decompress the raw data into a folder of paired gzipped FASTQ files. There is no need to decompress those: $ tar -zxvf raw_data.tar.gz $ ls -1 raw_data/ If you wish, verify the checksums to confirm the data integrity: $ cd raw_data/   $ md5sum -c MD5SUM.txt $ cd .. Setup output directories:   $ mkdir -p intermediate/ summary/ Run the THAPBI PICT pipeline:   $ thapbi_pict pipeline -m 1s3g -f 0 -a 15 -i raw_data/ \ -s intermediate/ -o summary/sardinia_20240912_v1.0.16 \ -t metadata.tsv -u -x 8 -c 4,5,3,2,7,6 The options here are as follows: -m - use the 1s3g classifier (see methods) -f - set to zero to disable the fractional abundance threshold -a - set a lower absolute abundance threshold -i - location of the input raw data -s - optional location to store intermediate files -o - output stem for reports -t - filename for tab-separated-variable metadata -u - show unsequenced samples defined in the metadata -x - which metadata column contains Illumina FASTQ filename stems -c - which metadata columns to include in the report. This leaves the -d option with the default provided ITS1 database. We are NOT taking advantage of the negative controls to automatically set a blanket minimum abundance as Control-Plate-3-Mix-3-Dry-P3-c_S56_L001 sadly has over 3000 Phytophthora reads: That takes under a minute to run, and classifies most of the samples. Opening the output file summary/sardinia_20240912_v1.0.16.ITS1.samples.1s3g.xlsx in Excel or similar should show you a table resembling Table 3 in the paper, without the baiting results, but with one row per sequencing sample, and additional columns with per-sample per-species read counts etc. The similarly named reads file as one row per unique amplicon sequence variant (ASV), and columns for each sequencing sample. All the Phytophthora classifications were to species level except a single ASV from E5BTB-DILUTE-Wet-P6_S17_L001 (S3  Bultei) with 15 reads, a perfect match to partial sequences MH588088.1 and MH593844.1, isolates Y1 and Y2, which is in the THAPBI PICT database but only at genus level: &gt;bad82a53fff502146e7ea00cbf2d9d3e PhytophthoraTTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCTAAAACTTTCCACGTGAACCGTTTCAAACCAAATAGTTGGGGGTCTTGTCTGGTGGCGGCTGCTGGCTTTATTGTTGGCGGCTGCTGCTGGGTGAGCCCTATCATGGCGAGCGTTTGGGCTTCGGCCTGAGCTAGTAGCATTTCTTTTAAACCCATTCCTTAATACTGATTATACT There are 9 samples left simply as "Unknown" (all six from S2 Buddusò, one each from S3 Bultei, S4 Nuoro, and S6 Tempio). A further two samples contained low levels of an unknown sequence in addition to knowns. The unknown ASV from S1BTC-DILUTE-Wet-P6_S9_L001 (S3 Bultei) looks to be a novel Phytophthora if real, similar to P. quercina and P. capsici: &gt;3cd110db0a0b8fa8b971ea6b61c53970 Phytophthora?TTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCTAAAAAACTTTCCACGTGAACCGTTTCAACCAATATTTTGGGGGTCTCGTCTGGCGTGCGGCTGTTGCTGTAAAAGGCGGCGGCTGTTGCTGGGTGAGCCCTATCATGGCAAACGTTTGGGCTTCGGTCTGAACAAGTAGCTCTTTTTTAAACCATTACTTATTACTGATTATACT The unknown ASV from S4OC-DILUTE-Wet-P6_S2_L001 (S4 Nuoro) looks to be a Pythium (a perfect match to partial sequences MN269744 and KY822489 from uncultured clones): &gt;eac8c1931b4f8c57803e6ad9ffb4eb56 Pythium?TTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCAAAAAACTATCCACGTGAACCGTTAAGCAAAAGTCTAGTTGGCTTGTGTTGTTCGGGAGTGTGTTGGGAAGAGCTTGGAGATGTCTTCGGATATTTCGATGCCTAGTACTGGACATCCTGGCGAGCGAGTCGGCTAGCAACGAAGGTCGGGAGTTCGCTTGCGGACTGATGTGCGCTTGTCGCATGTCGGTCGAAAGGCTTGAGCAAACGGCTGATCTATTACTTTTAAACCATACCATAACTACTGATGATACT The unknown ASV from S9OC-Wet-P5_S12_L001 (S4 Nuoro) at only 38 reads seems to be an artefact of some kind, possibly chimeric: &gt;74933d826f6077cd5f3dd036f894429d Artefact?TAGCCGTAGGGGAACCTGCGGCTGGATCACCTCCTTTCTGGATTCGGAAGGCAGGGATCAGTGATCAGTTATCAGAACCGATTGCTGCTCCTCTTCCGAGCATCCACAACGCCAGCTCTGGCAGGAATTCTGATATCTGATATCTGGTTTCTGCGATCTGGCAACGGCGCCGCCGTCTGCGCATCCCTTCTGCCGCGTTATTCCAGAGGGCAGTGATCAGTCATCAGAACCGATGGCGGATCCGCCCCCGGCTTCACGGCGAGCGCCCGAGCCTG The remaining unknown ASVs were likely Plasmopara.  </description><identifier>DOI: 10.5281/zenodo.13753737</identifier><language>eng</language><publisher>Zenodo</publisher><creationdate>2024</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><orcidid>0000-0003-4569-5996 ; 0000-0002-8672-3108 ; 0000-0002-6311-377X ; 0000-0003-0089-9135 ; 0000-0002-0690-580X ; 0000-0002-5691-6267 ; 0009-0001-5385-0709 ; 0000-0001-9513-9993 ; 0000-0003-2364-4311 ; 0000-0003-4546-6368</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>776,1888</link.rule.ids><linktorsrc>$$Uhttps://commons.datacite.org/doi.org/10.5281/zenodo.13753737$$EView_record_in_DataCite.org$$FView_record_in_$$GDataCite.org$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>Seddaiu, Salvatore</creatorcontrib><creatorcontrib>Riddell, Carolyn</creatorcontrib><creatorcontrib>Piras, Giovanni</creatorcontrib><creatorcontrib>Ruiu, Pino Angelo</creatorcontrib><creatorcontrib>Sarais, Luca</creatorcontrib><creatorcontrib>Mello, Antonietta</creatorcontrib><creatorcontrib>Brandano, Andrea</creatorcontrib><creatorcontrib>Cock, Peter J. A.</creatorcontrib><creatorcontrib>Green, Sarah</creatorcontrib><creatorcontrib>Scanu, Bruno</creatorcontrib><title>Data for: Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques</title><description>This dataset on Zenodo accompanies the manuscript Salvatore et al. (2024), Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques. There are two files: metadata.tsv - plain text table as tab-separated variables raw_data.tar.gz - compressed archive of 56 paired raw FASTQ files This represents a subset of one complete Illumina Nano MiSeq plate run at the James Hutton Institute also containing a small number of unrelated samples using the same protocol. To repeat the analysis described in the paper, first install THAPBI PICT. See https://github.com/peterjc/thapbi-pict/ for instructions. At the time of the paper, v1.0.16 was the current release. Next, decompress the raw data into a folder of paired gzipped FASTQ files. There is no need to decompress those: $ tar -zxvf raw_data.tar.gz $ ls -1 raw_data/ If you wish, verify the checksums to confirm the data integrity: $ cd raw_data/   $ md5sum -c MD5SUM.txt $ cd .. Setup output directories:   $ mkdir -p intermediate/ summary/ Run the THAPBI PICT pipeline:   $ thapbi_pict pipeline -m 1s3g -f 0 -a 15 -i raw_data/ \ -s intermediate/ -o summary/sardinia_20240912_v1.0.16 \ -t metadata.tsv -u -x 8 -c 4,5,3,2,7,6 The options here are as follows: -m - use the 1s3g classifier (see methods) -f - set to zero to disable the fractional abundance threshold -a - set a lower absolute abundance threshold -i - location of the input raw data -s - optional location to store intermediate files -o - output stem for reports -t - filename for tab-separated-variable metadata -u - show unsequenced samples defined in the metadata -x - which metadata column contains Illumina FASTQ filename stems -c - which metadata columns to include in the report. This leaves the -d option with the default provided ITS1 database. We are NOT taking advantage of the negative controls to automatically set a blanket minimum abundance as Control-Plate-3-Mix-3-Dry-P3-c_S56_L001 sadly has over 3000 Phytophthora reads: That takes under a minute to run, and classifies most of the samples. Opening the output file summary/sardinia_20240912_v1.0.16.ITS1.samples.1s3g.xlsx in Excel or similar should show you a table resembling Table 3 in the paper, without the baiting results, but with one row per sequencing sample, and additional columns with per-sample per-species read counts etc. The similarly named reads file as one row per unique amplicon sequence variant (ASV), and columns for each sequencing sample. All the Phytophthora classifications were to species level except a single ASV from E5BTB-DILUTE-Wet-P6_S17_L001 (S3  Bultei) with 15 reads, a perfect match to partial sequences MH588088.1 and MH593844.1, isolates Y1 and Y2, which is in the THAPBI PICT database but only at genus level: &gt;bad82a53fff502146e7ea00cbf2d9d3e PhytophthoraTTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCTAAAACTTTCCACGTGAACCGTTTCAAACCAAATAGTTGGGGGTCTTGTCTGGTGGCGGCTGCTGGCTTTATTGTTGGCGGCTGCTGCTGGGTGAGCCCTATCATGGCGAGCGTTTGGGCTTCGGCCTGAGCTAGTAGCATTTCTTTTAAACCCATTCCTTAATACTGATTATACT There are 9 samples left simply as "Unknown" (all six from S2 Buddusò, one each from S3 Bultei, S4 Nuoro, and S6 Tempio). A further two samples contained low levels of an unknown sequence in addition to knowns. The unknown ASV from S1BTC-DILUTE-Wet-P6_S9_L001 (S3 Bultei) looks to be a novel Phytophthora if real, similar to P. quercina and P. capsici: &gt;3cd110db0a0b8fa8b971ea6b61c53970 Phytophthora?TTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCTAAAAAACTTTCCACGTGAACCGTTTCAACCAATATTTTGGGGGTCTCGTCTGGCGTGCGGCTGTTGCTGTAAAAGGCGGCGGCTGTTGCTGGGTGAGCCCTATCATGGCAAACGTTTGGGCTTCGGTCTGAACAAGTAGCTCTTTTTTAAACCATTACTTATTACTGATTATACT The unknown ASV from S4OC-DILUTE-Wet-P6_S2_L001 (S4 Nuoro) looks to be a Pythium (a perfect match to partial sequences MN269744 and KY822489 from uncultured clones): &gt;eac8c1931b4f8c57803e6ad9ffb4eb56 Pythium?TTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCAAAAAACTATCCACGTGAACCGTTAAGCAAAAGTCTAGTTGGCTTGTGTTGTTCGGGAGTGTGTTGGGAAGAGCTTGGAGATGTCTTCGGATATTTCGATGCCTAGTACTGGACATCCTGGCGAGCGAGTCGGCTAGCAACGAAGGTCGGGAGTTCGCTTGCGGACTGATGTGCGCTTGTCGCATGTCGGTCGAAAGGCTTGAGCAAACGGCTGATCTATTACTTTTAAACCATACCATAACTACTGATGATACT The unknown ASV from S9OC-Wet-P5_S12_L001 (S4 Nuoro) at only 38 reads seems to be an artefact of some kind, possibly chimeric: &gt;74933d826f6077cd5f3dd036f894429d Artefact?TAGCCGTAGGGGAACCTGCGGCTGGATCACCTCCTTTCTGGATTCGGAAGGCAGGGATCAGTGATCAGTTATCAGAACCGATTGCTGCTCCTCTTCCGAGCATCCACAACGCCAGCTCTGGCAGGAATTCTGATATCTGATATCTGGTTTCTGCGATCTGGCAACGGCGCCGCCGTCTGCGCATCCCTTCTGCCGCGTTATTCCAGAGGGCAGTGATCAGTCATCAGAACCGATGGCGGATCCGCCCCCGGCTTCACGGCGAGCGCCCGAGCCTG The remaining unknown ASVs were likely Plasmopara.  </description><fulltext>true</fulltext><rsrctype>dataset</rsrctype><creationdate>2024</creationdate><recordtype>dataset</recordtype><sourceid>PQ8</sourceid><recordid>eNqVj89KxDAQxnvxIOrZ67yAu1vLUvG2bBVPouA9pMl0M9BmamYi1AfxeW1QH8DTwDd8f35VdV3vNvvbu3r7iZE9b-qm3Tdt055XX51VCwOne-hQ0SlxBBs9ePrAJKQL8AAvYVGegwZOFmRGRygwJJ7AoxspUjzBa8bksoDkHhOIriECWcqrZw3QPR9gQrW9TY59kUuNMI3QW9IirP0h0ntGuazOBjsKXv3ei2r7-PB2fLrx61xHimZONNm0mHpnCpn5ITN_ZM3_Hd_qpWHU</recordid><startdate>20240912</startdate><enddate>20240912</enddate><creator>Seddaiu, Salvatore</creator><creator>Riddell, Carolyn</creator><creator>Piras, Giovanni</creator><creator>Ruiu, Pino Angelo</creator><creator>Sarais, Luca</creator><creator>Mello, Antonietta</creator><creator>Brandano, Andrea</creator><creator>Cock, Peter J. A.</creator><creator>Green, Sarah</creator><creator>Scanu, Bruno</creator><general>Zenodo</general><scope>DYCCY</scope><scope>PQ8</scope><orcidid>https://orcid.org/0000-0003-4569-5996</orcidid><orcidid>https://orcid.org/0000-0002-8672-3108</orcidid><orcidid>https://orcid.org/0000-0002-6311-377X</orcidid><orcidid>https://orcid.org/0000-0003-0089-9135</orcidid><orcidid>https://orcid.org/0000-0002-0690-580X</orcidid><orcidid>https://orcid.org/0000-0002-5691-6267</orcidid><orcidid>https://orcid.org/0009-0001-5385-0709</orcidid><orcidid>https://orcid.org/0000-0001-9513-9993</orcidid><orcidid>https://orcid.org/0000-0003-2364-4311</orcidid><orcidid>https://orcid.org/0000-0003-4546-6368</orcidid></search><sort><creationdate>20240912</creationdate><title>Data for: Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques</title><author>Seddaiu, Salvatore ; Riddell, Carolyn ; Piras, Giovanni ; Ruiu, Pino Angelo ; Sarais, Luca ; Mello, Antonietta ; Brandano, Andrea ; Cock, Peter J. A. ; Green, Sarah ; Scanu, Bruno</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-datacite_primary_10_5281_zenodo_137537373</frbrgroupid><rsrctype>datasets</rsrctype><prefilter>datasets</prefilter><language>eng</language><creationdate>2024</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Seddaiu, Salvatore</creatorcontrib><creatorcontrib>Riddell, Carolyn</creatorcontrib><creatorcontrib>Piras, Giovanni</creatorcontrib><creatorcontrib>Ruiu, Pino Angelo</creatorcontrib><creatorcontrib>Sarais, Luca</creatorcontrib><creatorcontrib>Mello, Antonietta</creatorcontrib><creatorcontrib>Brandano, Andrea</creatorcontrib><creatorcontrib>Cock, Peter J. A.</creatorcontrib><creatorcontrib>Green, Sarah</creatorcontrib><creatorcontrib>Scanu, Bruno</creatorcontrib><collection>DataCite (Open Access)</collection><collection>DataCite</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Seddaiu, Salvatore</au><au>Riddell, Carolyn</au><au>Piras, Giovanni</au><au>Ruiu, Pino Angelo</au><au>Sarais, Luca</au><au>Mello, Antonietta</au><au>Brandano, Andrea</au><au>Cock, Peter J. A.</au><au>Green, Sarah</au><au>Scanu, Bruno</au><format>book</format><genre>unknown</genre><ristype>DATA</ristype><title>Data for: Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques</title><date>2024-09-12</date><risdate>2024</risdate><abstract>This dataset on Zenodo accompanies the manuscript Salvatore et al. (2024), Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques. There are two files: metadata.tsv - plain text table as tab-separated variables raw_data.tar.gz - compressed archive of 56 paired raw FASTQ files This represents a subset of one complete Illumina Nano MiSeq plate run at the James Hutton Institute also containing a small number of unrelated samples using the same protocol. To repeat the analysis described in the paper, first install THAPBI PICT. See https://github.com/peterjc/thapbi-pict/ for instructions. At the time of the paper, v1.0.16 was the current release. Next, decompress the raw data into a folder of paired gzipped FASTQ files. There is no need to decompress those: $ tar -zxvf raw_data.tar.gz $ ls -1 raw_data/ If you wish, verify the checksums to confirm the data integrity: $ cd raw_data/   $ md5sum -c MD5SUM.txt $ cd .. Setup output directories:   $ mkdir -p intermediate/ summary/ Run the THAPBI PICT pipeline:   $ thapbi_pict pipeline -m 1s3g -f 0 -a 15 -i raw_data/ \ -s intermediate/ -o summary/sardinia_20240912_v1.0.16 \ -t metadata.tsv -u -x 8 -c 4,5,3,2,7,6 The options here are as follows: -m - use the 1s3g classifier (see methods) -f - set to zero to disable the fractional abundance threshold -a - set a lower absolute abundance threshold -i - location of the input raw data -s - optional location to store intermediate files -o - output stem for reports -t - filename for tab-separated-variable metadata -u - show unsequenced samples defined in the metadata -x - which metadata column contains Illumina FASTQ filename stems -c - which metadata columns to include in the report. This leaves the -d option with the default provided ITS1 database. We are NOT taking advantage of the negative controls to automatically set a blanket minimum abundance as Control-Plate-3-Mix-3-Dry-P3-c_S56_L001 sadly has over 3000 Phytophthora reads: That takes under a minute to run, and classifies most of the samples. Opening the output file summary/sardinia_20240912_v1.0.16.ITS1.samples.1s3g.xlsx in Excel or similar should show you a table resembling Table 3 in the paper, without the baiting results, but with one row per sequencing sample, and additional columns with per-sample per-species read counts etc. The similarly named reads file as one row per unique amplicon sequence variant (ASV), and columns for each sequencing sample. All the Phytophthora classifications were to species level except a single ASV from E5BTB-DILUTE-Wet-P6_S17_L001 (S3  Bultei) with 15 reads, a perfect match to partial sequences MH588088.1 and MH593844.1, isolates Y1 and Y2, which is in the THAPBI PICT database but only at genus level: &gt;bad82a53fff502146e7ea00cbf2d9d3e PhytophthoraTTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCTAAAACTTTCCACGTGAACCGTTTCAAACCAAATAGTTGGGGGTCTTGTCTGGTGGCGGCTGCTGGCTTTATTGTTGGCGGCTGCTGCTGGGTGAGCCCTATCATGGCGAGCGTTTGGGCTTCGGCCTGAGCTAGTAGCATTTCTTTTAAACCCATTCCTTAATACTGATTATACT There are 9 samples left simply as "Unknown" (all six from S2 Buddusò, one each from S3 Bultei, S4 Nuoro, and S6 Tempio). A further two samples contained low levels of an unknown sequence in addition to knowns. The unknown ASV from S1BTC-DILUTE-Wet-P6_S9_L001 (S3 Bultei) looks to be a novel Phytophthora if real, similar to P. quercina and P. capsici: &gt;3cd110db0a0b8fa8b971ea6b61c53970 Phytophthora?TTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCTAAAAAACTTTCCACGTGAACCGTTTCAACCAATATTTTGGGGGTCTCGTCTGGCGTGCGGCTGTTGCTGTAAAAGGCGGCGGCTGTTGCTGGGTGAGCCCTATCATGGCAAACGTTTGGGCTTCGGTCTGAACAAGTAGCTCTTTTTTAAACCATTACTTATTACTGATTATACT The unknown ASV from S4OC-DILUTE-Wet-P6_S2_L001 (S4 Nuoro) looks to be a Pythium (a perfect match to partial sequences MN269744 and KY822489 from uncultured clones): &gt;eac8c1931b4f8c57803e6ad9ffb4eb56 Pythium?TTTCCGTAGGTGAACCTGCGGAAGGATCATTACCACACCAAAAAACTATCCACGTGAACCGTTAAGCAAAAGTCTAGTTGGCTTGTGTTGTTCGGGAGTGTGTTGGGAAGAGCTTGGAGATGTCTTCGGATATTTCGATGCCTAGTACTGGACATCCTGGCGAGCGAGTCGGCTAGCAACGAAGGTCGGGAGTTCGCTTGCGGACTGATGTGCGCTTGTCGCATGTCGGTCGAAAGGCTTGAGCAAACGGCTGATCTATTACTTTTAAACCATACCATAACTACTGATGATACT The unknown ASV from S9OC-Wet-P5_S12_L001 (S4 Nuoro) at only 38 reads seems to be an artefact of some kind, possibly chimeric: &gt;74933d826f6077cd5f3dd036f894429d Artefact?TAGCCGTAGGGGAACCTGCGGCTGGATCACCTCCTTTCTGGATTCGGAAGGCAGGGATCAGTGATCAGTTATCAGAACCGATTGCTGCTCCTCTTCCGAGCATCCACAACGCCAGCTCTGGCAGGAATTCTGATATCTGATATCTGGTTTCTGCGATCTGGCAACGGCGCCGCCGTCTGCGCATCCCTTCTGCCGCGTTATTCCAGAGGGCAGTGATCAGTCATCAGAACCGATGGCGGATCCGCCCCCGGCTTCACGGCGAGCGCCCGAGCCTG The remaining unknown ASVs were likely Plasmopara.  </abstract><pub>Zenodo</pub><doi>10.5281/zenodo.13753737</doi><orcidid>https://orcid.org/0000-0003-4569-5996</orcidid><orcidid>https://orcid.org/0000-0002-8672-3108</orcidid><orcidid>https://orcid.org/0000-0002-6311-377X</orcidid><orcidid>https://orcid.org/0000-0003-0089-9135</orcidid><orcidid>https://orcid.org/0000-0002-0690-580X</orcidid><orcidid>https://orcid.org/0000-0002-5691-6267</orcidid><orcidid>https://orcid.org/0009-0001-5385-0709</orcidid><orcidid>https://orcid.org/0000-0001-9513-9993</orcidid><orcidid>https://orcid.org/0000-0003-2364-4311</orcidid><orcidid>https://orcid.org/0000-0003-4546-6368</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.5281/zenodo.13753737
ispartof
issn
language eng
recordid cdi_datacite_primary_10_5281_zenodo_13753737
source DataCite
title Data for: Detection and diversity of Phytophthora species from declining Quercus suber stands using both DNA metabarcoding and soil baiting techniques
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-19T02%3A37%3A42IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-datacite_PQ8&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=unknown&rft.au=Seddaiu,%20Salvatore&rft.date=2024-09-12&rft_id=info:doi/10.5281/zenodo.13753737&rft_dat=%3Cdatacite_PQ8%3E10_5281_zenodo_13753737%3C/datacite_PQ8%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true