The confounding effects of high genetic diversity on the determination and interpretation of differential gene expression analysis in the parasitic nematode Haemonchus contortus
[Display omitted] •Sequence polymorphism can counfound RNAseq analysis in genetically diverse organisms due to read mapping biases.•Optimising read mapping and excluding highly polymorphic genes reduces differential gene expression analysis biases.•Genetically divergent strains of Haemonchus contort...
Gespeichert in:
Veröffentlicht in: | International journal for parasitology 2019-10, Vol.49 (11), p.847-858 |
---|---|
Hauptverfasser: | , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | [Display omitted]
•Sequence polymorphism can counfound RNAseq analysis in genetically diverse organisms due to read mapping biases.•Optimising read mapping and excluding highly polymorphic genes reduces differential gene expression analysis biases.•Genetically divergent strains of Haemonchus contortus have very high levels of inter-strain transcriptional diversity.•Analyses of inter-strain differential expression must consider sequence polymorphism and overall transcriptional diversity.
Differential expression analysis between parasitic nematode strains is commonly used to implicate candidate genes in anthelmintic resistance or other biological functions. We have tested the hypothesis that the high genetic diversity of an organism such as Haemonchus contortus could complicate such analyses. First, we investigated the extent to which sequence polymorphism affects the reliability of differential expression analysis between the genetically divergent H. contortus strains MHco3(ISE), MHco4(WRS) and MHco10(CAVR). Using triplicates of 20 adult female worms from each population isolated under parallel experimental conditions, we found that high rates of sequence polymorphism in RNAseq reads were associated with lower efficiency read mapping to gene models under default TopHat2 parameters, leading to biased estimates of inter-strain differential expression. We then showed it is possible to largely compensate for this bias by optimising the read mapping single nucleotide polymorphism (SNP) allowance and filtering out genes with particularly high single nucleotide polymorphism rates. Once the sequence polymorphism biases were removed, we then assessed the genuine transcriptional diversity between the strains, finding ≥824 differentially expressed genes across all three pairwise strain comparisons. This high level of inter-strain transcriptional diversity not only suggests substantive inter-strain phenotypic variation but also highlights the difficulty in reliably associating differential expression of specific genes with phenotypic differences. To provide a practical example, we analysed two gene families of potential relevance to ivermectin drug resistance; the ABC transporters and the ligand-gated ion channels (LGICs). Over half of genes identified as differentially expressed using default TopHat2 parameters were shown to be an artifact of sequence polymorphism differences. This work illustrates the need to account for sequence polymorphism in differential expressi |
---|---|
ISSN: | 0020-7519 1879-0135 |
DOI: | 10.1016/j.ijpara.2019.05.012 |