High‐quality genome assemblies for nine non‐model North American insect species representing six orders (Insecta: Coleoptera, Diptera, Hemiptera, Hymenoptera, Lepidoptera, Neuroptera)

Field‐collected specimens were used to obtain nine high‐quality genome assemblies from a total of 10 insect species native to prairies and savannas of central Illinois (USA): Mellilla xanthometata (Lepidoptera: Geometridae), Stenolophus ochropezus (Coleoptera: Carabidae), Forcipata loca (Hemiptera:...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Molecular ecology resources 2024-11, Vol.24 (8), p.e14010-n/a
Hauptverfasser:	Walden, Kimberly K. O., Cao, Yanghui, Fields, Christopher J., Hernandez, Alvaro G., Rendon, Gloria A., Robinson, Gene E., Skinner, Rachel K., Stein, Jeffrey A., Dietrich, Christopher H.
Format:	Artikel
Sprache:	eng
Schlagworte:	Animals annotation Annotations Assemblies Assembly Bioinformatics Cicadellidae Coleoptera Completeness Conserved sequence Cost analysis Deoxyribonucleic acid Diptera DNA DNA sequencing Earth BioGenome project Gene sequencing genome assembly Genome, Insect - genetics Genomes Hemiptera Hymenoptera Illinois Indigenous species Insecta Insecta - classification Insecta - genetics Insects Lepidoptera Neuroptera non‐model species Nucleotide sequence Prairies Scaffolding Scaffolds Sequence Analysis, DNA - methods sequencing
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Field‐collected specimens were used to obtain nine high‐quality genome assemblies from a total of 10 insect species native to prairies and savannas of central Illinois (USA): Mellilla xanthometata (Lepidoptera: Geometridae), Stenolophus ochropezus (Coleoptera: Carabidae), Forcipata loca (Hemiptera: Cicadellidae), Coelinius sp. (Hymenoptera: Braconidae), Thaumatomyia glabra (Diptera: Chloropidae), Brachynemurus abdominalus (Neuroptera: Myrmeleontidae), Catonia carolina (Hemiptera: Achilidae), Oncometopia orbona (Hemiptera: Cicadellidae), Flexamia atlantica (Hemiptera: Cicadellidae) and Stictocephala bisonia (Hemiptera: Membracidae). Sequencing library preparation from single specimens was successful despite extremely small DNA yields (13,000, with the longest scaffold per assembly ranging from ~23 to 439 Mb. Genome completeness was high, with BUSCO scores ranging from 85.5% completeness for the largest genome (Stictocephala bisonia) to 98.8% completeness for the smallest genome (Coelinius sp.). The unique content was estimated using RepeatMasker and GenomeScope2, which ranged from 50.7% to 75.8% and roughly decreased with increasing genome size. Structural annotation predicted a range of 19,281–72,469 protein models for sequenced species. Sequencing costs per genome at the time ranged from US$3–5k, averaged ~1600 CPU‐hours on a high‐performance cluster and required approximately 14 h of bioinformatics analyses with samples using PacBio HiFi data. Most assemblies would benefit from further manual curation to correct possible scaffold misjoins and translocations suggested by off‐diagonal or depleted signals in Omni‐C contact maps.
ISSN:	1755-098X 1755-0998 1755-0998
DOI:	10.1111/1755-0998.14010