Table 1 |
|||||||
|
U12 type of introns identified in T. spiralis. |
|||||||
|
EST |
Genomic sequence |
Intron length |
B |
R |
Relationship to C. elegans |
C. elegans orthologue |
Gene function |
|
ATAC introns |
|||||||
|
|
|||||||
|
EX501652.1 |
AGA|ATATCCTTTC...TTGGGCATTGTTATATTTCCTTAACGGGTATGGTTTAC|GTT |
2095 |
+ |
+ |
U12->U2 |
EEED8.7 |
SR (splicing factor) |
|
TyrIleGl nPheGlnGl nLeuLysAspAla |
|||||||
|
Ts TACATAGAAT...ACGTTCGAAGA GCTGAAAGATGCT |
|||||||
|
PheValAr gPheTyrGl nArgArgAspAla |
|||||||
|
Ce TTCGTTAG ATTTTATGAGT...AGACGTCGTGCTGCT |
|||||||
|
EX500683.1 |
ATT|ATATCCTTTC...AATTTCATTTCCTTAACGTTAGATTTTTGTTGTTTTAC|TGA |
94 |
+ |
+ |
U12 lost |
E04D5.1a |
NA |
|
ES570647.1 |
ATC|ATATCCTTTC...GTATTGTTTGTATTTTCCTTAACTTCATATGTTTTTAC|GTA |
182 |
+ |
+ |
U12 lost |
Y37D8A.10 |
Signal peptidase complex, subunit SPC25 |
|
|
|||||||
|
GTAG introns |
|||||||
|
|
|||||||
|
EX499999.1 |
Ts GAG|GTATCCTTTG...TTTTGTTTTTCTCTCTTTTTACAATTATTATACAG|GCC |
90 |
- |
+ |
U12->U2 |
F10F2.1 |
PH, BEACH and WD40 domain-containing protein |
|
Ce GAG|GTTTGAAACA...TTTTAATATTGAACTAAAATTTTTGAATTTTCCAG|GCG |
64 |
||||||
|
ES570692.1 |
Ts TAG|GTATTGTTTT...TGCTACAAGGAATTTTTTTATTGCTTTGATTTTAG|AGT |
617 |
- |
- |
U12->U2 |
F40F8.10 |
Small ribosomal subunit S9 protein |
|
Ce CCG|GTTTAGTTTT...AAGATTAGTATCGACTTCAAATTCTTCTCTTTCAG|TGT |
291 |
||||||
|
ES561213.1 |
Ts TCG|GTATTATTTT...CATATTAATCGTTTCATTTCTTAATGTATTTTTAG|TGG |
54 |
- |
- |
U12->U2 |
ZC395.10 |
NA |
|
Ce CCA|GTACGTTTCG...ACATAGAATGAGTCGTAATTCGTAAATTTTCAGAG|GAA |
150 |
||||||
|
EX500486.1 |
Ts TCG|GTATTCTTTC...TAATATGTTTTTCTTTTTTTTCAACTTATTTTAAG|ATT |
87 |
- |
+ |
U12->U2 |
ZC328.3a |
NA |
|
Ce CAT|GTGAGTTTCA...TCCTGAATTTATTCAAGTTTCAACCACATTTCCAG|CAT |
758 |
||||||
|
ES569928.1 |
ATG|GTATTCTTTT...ATTTCCATTACAAAATTACAACCGCGTTGTTCTTTCAG|TGC |
107 |
+ |
+ |
Not known |
Y82E9BR.15 |
Transcription elongation factor B |
|
ES565768.1 |
CAG|GTATTCTTTT...CAAATTTTGGAAAAATTCTTTTTTTTTAATCCGAACAG|GTA |
94 |
- |
+ |
Not known |
C34D4.4a |
NA |
|
ES562099.1 |
AAT|GTATCCTTAA...TGTATGAGGTTTGGTATTTCTGATTTTAATCATTTTAG|TGT |
50 |
- |
+ |
Not known |
R07E5.14 |
RRM RNA binding domain) containing |
|
ES563059.1 |
GCG|GTATCTTTTC...TATTTATAACTGAATCGTTTTTATTAATAATTTTTTAG|AGT |
54 |
- |
+ |
Not known |
M04F3.4 |
NA |
|
BQ738460.1 |
ACG|GTATCGTTCA...TCAATTTTTTTAAAAGTAATTTTCTTCATATATTTTAG|AAC |
72 |
- |
- |
U12 lost/Not known |
Y56A3A.36 |
NA |
|
ES566079.1 |
TGG|GTATCGTTCG...ATTAACTAACACTTTGAAGTTGACAAGTGAATGTTTAG|GAT |
140 |
- |
- |
Not known |
M02B7.4 |
beta-1,4-N-acetylglucosaminyl transferase |
|
EX500543.1 |
TCG|GTATTCTTTG...TTATTATTAATTTCTGTTTTTTTTGGTTTTCTAAACAG|AGA |
86 |
- |
+ |
None |
||
|
ES561535.1 |
GGG|GTATTATTTT...TTTTCTGTGATTTAATTGCATTTTAATGTTCTATCTAG|TGA |
71 |
- |
- |
None |
||
|
BQ738918.1 |
GAA|GTATCTTTTA...TGAATTTTGCTAAATTGTACTTAACAGGTTGTTTTTAG|AAA |
153 |
- |
+ |
None |
||
|
|
|||||||
|
Table shows all introns identified by the Sheth et al method [11]. Regions with best match to branch site PWM is underlined. For one of the ATAC introns (EST EX501652.1) is shown the shift in 5' splice site observed between T. spiralis and C. elegans. Rule of 5' splice site (R) is that 5' splice site sequence is RTATCCTT where one of the Cs in positions +5 and +6 may be converted into a T. Burge et al method (B) is described in [5]. NA = not available. For sequences of U2 and U12 introns listed in table, see additional file 2. |
|||||||
|
Bartschat and Samuelsson BMC Genomics 2010 11:106 doi:10.1186/1471-2164-11-106 |
|||||||