In silico identification of conserved intercoding sequences in Leishmania genomes: Unraveling putative cis-regulatory elements
In silico identification of conserved intercoding sequences (CICS) in three Leishmania genomes reveals potential cis-acting elements on the control of gene expression. [Display omitted] ► In silico identification of conserved intercoding sequences in Leishmania genomes. ► Data will be made available...
Gespeichert in:
Veröffentlicht in: | Molecular and biochemical parasitology 2012-06, Vol.183 (2), p.140-150 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In silico identification of conserved intercoding sequences (CICS) in three Leishmania genomes reveals potential cis-acting elements on the control of gene expression. [Display omitted]
► In silico identification of conserved intercoding sequences in Leishmania genomes. ► Data will be made available to the community. ► Some CICS are potential regulatory cis-elements. ► Common CICS are placed within transcripts sharing similar pattern of expression.
In silico analyses of Leishmania spp. genome data are a powerful resource to improve the understanding of these pathogens’ biology. Trypanosomatids such as Leishmania spp. have their protein-coding genes grouped in long polycistronic units of functionally unrelated genes. The control of gene expression happens by a variety of posttranscriptional mechanisms. The high degree of synteny among Leishmania species is accompanied by highly conserved coding sequences (CDS) and poorly conserved intercoding untranslated sequences. To identify the elements involved in the control of gene expression, we conducted an in silico investigation to find conserved intercoding sequences (CICS) in the genomes of L. major, L. infantum, and L. braziliensis.
We used a combination of computational tools, such as Linux-Shell, PERL and R languages, BLAST, MSPcrunch, SSAKE, and Pred-A-Term algorithms to construct a pipeline which was able to: (i) search for conservation in target-regions, (ii) eliminate CICS redundancy and mask repeat elements, (iii) predict the mRNA's extremities, (iv) analyze the distribution of orthologous genes within the generated LeishCICS-clusters, (v) assign GO terms to the LeishCICS-clusters, and (vi) provide statistical support for the gene-enrichment annotation. We associated the LeishCICS-cluster data, generated at the end of the pipeline, with the expression profile of L. donovani genes during promastigote–amastigote differentiation, as previously evaluated by others (GEO accession: GSE21936). A Pearson's correlation coefficient greater than 0.5 was observed for 730 LeishCICS-clusters containing from 2 to 17 genes. The designed computational pipeline is a useful tool and its application identified potential regulatory cis elements and putative regulons in Leishmania. |
---|---|
ISSN: | 0166-6851 1872-9428 |
DOI: | 10.1016/j.molbiopara.2012.02.009 |