Identification and bioinformatics analysis of pseudogenes from whole genome sequence of Phaeodactylum tricornutum

Pseudogenes share sequence similarities with functional genes, but in general they have lost their protein-coding ability. The identification of pseudogenes is a very important step in genome annotation. Phaeodactylum tricornutum is a marine diatom that is rich in polyunsaturated fatty acids (PUFAs)...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Chinese science bulletin 2013-03, Vol.58 (9), p.1010-1017
Hauptverfasser: Ji, ChangMian, Huang, AiYou, Liu, WenLing, Pan, GuangHua, Wang, GuangCe
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Pseudogenes share sequence similarities with functional genes, but in general they have lost their protein-coding ability. The identification of pseudogenes is a very important step in genome annotation. Phaeodactylum tricornutum is a marine diatom that is rich in polyunsaturated fatty acids (PUFAs). The genome of P. tricornutum has been completely sequenced. To identify pseudogenes in P. tricornutum, we developed a pipeline to discover and characterize pseudogenes. We identified a total of 1654 'true' processed pseudogenes, 714 duplicated pseudogenes and 4729 pseudogene fragments. The results of the bioinformatics analysis indicated that the genome sequence of P. tricornutum contained many pseudogenes and pseudogene fragments.
ISSN:1001-6538
1861-9541
DOI:10.1007/s11434-012-5174-3