International Journal of Molecular Sciences, Vol.16, No.5, 10376-10388, 2015
Identification and Validation of Evolutionarily Conserved Unusually Short Pre-mRNA Introns in the Human Genome
According to the length distribution of human introns, there is a large population of short introns with a threshold of 65 nucleotides (nt) and a peak at 85 nt. Using human genome and transcriptome databases, we investigated the introns shorter than 66 nt, termed ultra-short introns, the identities of which are scarcely known. Here, we provide for the first time a list of bona fide human ultra-short introns, which have never been characterized elsewhere. By conducting BLAST searches of the databases, we screened 22 introns (37-65 nt) with conserved lengths and sequences among closely related species. We then provide experimental and bioinformatic evidence for the splicing of 15 introns, of which 12 introns were remarkably G-rich and 9 introns contained completely inefficient splice sites and/or branch sites. These unorthodox characteristics of ultra-short introns suggest that there are unknown splicing mechanisms that differ from the well-established mechanism.
Keywords:G-rich sequence;H-InvDB;human genome;human transcriptome;intron length;pre-mRNA splicing;splice sites;ultra-short intron