Ascidian and human HEAT repeats mapped on the protein sequence of the corresponding species.

| Species | HEAT name | REP E-value | Htt region | Location | Sequence |
|---|---|---|---|---|---|
| C. intestinalis | | | | | |
| | A1 | 0.0005 | N-term | 58–96 | PGLLAVSVETLLQSCADDNADVRLNANECLNRLIKGLYE |
| | A2 | 5.96E-06 | N-term | 139–177 | RPYILNLLPCLCRISQREEDGVQETLGLSLVKIFKILGP |
| | A3 | 1.35E-06 | N-term | 181–219 | ESEIQGLLASFLKNLSHKSATMRRTACVCLHSVILNCRK |
| | B4 | 6.19E-06 | N-term | 682–720 | QSLSHQALSIALKCLCDDDLRLRKTAAATIVTMPTSFPT |
| | c | 2.30E-06 | Central | 867–905 | SQQQFGILPFVMSLLHSAWLPLDVTAHSDALVLAGNLVA |
| | E1 | 1.26E-06 | Central | 1341–1378 | QGSASHVIPAMQPIIHDI.YVVRASSKNEPPEVTTQREV |
| | g1 | 9.05E-06 | C-term | 2771–2809 | ARVMSKVLPSMLDDFFPAQDIMNKIIAEFISTLQPFPAS |
| | g2 | 1.46E-06 | C-term | 2864–2904 | NRWISSMVPLIISRVHDPTLDVDWTCFCKAAVDFYTCQLSE |
| C. savignyi | | | | | |
| | A1 | 2.92E-07 | N-term | 58–96 | PGLLAVSVETLLQSCADENADVRLNSNECLNRVIKGLYD |
| | A2 | 0.0001 | N-term | 139–177 | RPYILNLLPCLCRISQREEDAVQEVLSSSLAKIFIVLGA |
| | A3 | 2.52E-06 | N-term | 181–219 | ESEIQGLLASFLKNLSHKSPTVRRTACICLHSILTNSRK |
| | B4 | 1.53E-06 | N-term | 692–730 | KSIAQKALSIALECLCDEDTRLRKTSSAAIVSMATSYPT |
| | c | 1.46E-06 | Central | 876–914 | AQQQFGILPIVMSLLRSAWLPLDVTAHSDALVLAGNLIA |
| | E1 | - | Central | 1352–1389 | QGSASHVIPAMQPITHDI.FVVRGSLKNEPPEVTTQREV |
| | g1 | 1.27E-06 | C-term | 2770–2808 | ARVMSKILPSMLDDFFPAQEIMNKIIAEFISTLQPFPGS |
| | g2 | - | C-term | 2864–2903 | RWISSMVPLIISRSHDPSLDRNWTCFCKSAVDFYTCQLSE |
| Homo sapiens | | | | | |
| | A1 | 4.75E-07 | N-term | 124–162 | QKLLGIAMELFLLCSDDAESDVRMVADECLNKVIKALMD |
| | A2 | 0.0001 | N-term | 205–243 | RPYLVNLLPCLTRTSKRPEESVQETLAAAVPKIMASFGN |
| | A3 | 5.48E-07 | N-term | 247–285 | DNEIKVLLKAFIANLKSSSPTIRRTAAGSAVSICQHSRR |
| | a4 | * | N-term | 291–329 | SWLLNVLLGLLVPVEDEHSTLLILGVLLTLRYLVPLLQQ |
| | a5 | 7.77E-06 | N-term | 318–362 | LTLRYLVPLLQQQVKDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL |
| | b1 | * | N-term | 745–783 | EYPEEQYVSDILNYIDHGDPQVRGATAILCGTLICSILS |
| | b2 | 1.04E-06 | N-term | 803–841 | TFSLADCIPLLRKTLKDESSVTSKLACTAVRNCVMSLCS |
| | b3 | * | N-term | 842–880 | SSYSELGLQLIIDVLTLRNSSYWLVRTELLETLAEIDFR |
| | B4 | 6.69E-08 | N-term | 904–942 | KLQERVLNNVVIHLLGDEDPRVRHVAAASLIRLVPKLFY |
| | b5 | 9.05E-06 | N-term | 984–1025 | RIYRGYNLLPSITDVTMENNLSRVIAAVSHELITSTTRALTF |
| | d | 5.62E-06 | Central | 1425–1463 | RLFEPLVIKALKQYTTTTCVQLQKQVLDLLAQLVQLRVN |
| | E1 | * | Central | 1534–1575 | RKAVTHAIPALQPIVHDLFVLRGTNKADAGKELETQKEVVVS |
| | e2 | * | Central | 1610–1648 | RQIADIILPMLAKQQMHIDSHEALGVLNTLFEILAPSSL |
| | e3 | * | Central | 1670–1710 | TVQLWISGILAILRVLISQSTEDIVLSRIQELSFSPYLISC |
| | f | 3.51E-06 | C-term | 2798–2836 | DDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCA |

HEAT repeats are named according to their relative position along the chordate aligned sequences, using the same letter for repeats closer than 45 amino acids. Orthologous HEAT repeats conserved in ascidians and human share the same name and are reported in upper case. Expectation values (E-values) were calculated by the REP program [62]. Htt regions are defined as in Methods. The absolute position of each HEAT repeat in the corresponding protein sequence is reported in the "Location" column. Dash: REP E-value not statistically significant. Asterisk: HEAT repeat originally described in Andrade and Bork [18] but not identified by the REP program as statistically significant [62].
Gissi et al. BMC Genomics 2006 7:288 doi:10.1186/1471-2164-7-288
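
The letter-sharing rule in the table footnote (same letter for repeats closer than 45 amino acids) can be illustrated with a small sketch. This is not the authors' procedure: the function name `cluster_repeats`, the `max_gap` parameter, and the choice to measure distance as the next repeat's start minus the furthest end seen so far in the current group are all assumptions made for illustration. The paper applies the rule to positions in the chordate alignment, whereas the sketch uses the absolute human positions from the "Location" column as a stand-in.

```python
from typing import List, Tuple

# Human HEAT repeat locations (start, end), copied from the "Location" column.
HUMAN_SPANS: List[Tuple[int, int]] = [
    (124, 162), (205, 243), (247, 285), (291, 329), (318, 362),    # A1..a5
    (745, 783), (803, 841), (842, 880), (904, 942), (984, 1025),   # b1..b5
    (1425, 1463),                                                  # d
    (1534, 1575), (1610, 1648), (1670, 1710),                      # E1..e3
    (2798, 2836),                                                  # f
]


def cluster_repeats(spans: List[Tuple[int, int]], max_gap: int = 45) -> List[int]:
    """Return a group index for each span, in order of start position.

    Assumption: two consecutive repeats fall in the same group when the
    distance from the furthest end seen so far in the group to the next
    start is below max_gap residues.
    """
    groups: List[int] = []
    current = 0
    last_end = None
    for start, end in sorted(spans):
        if last_end is None or start - last_end < max_gap:
            # Same group: extend it (handles overlapping repeats such as a4/a5).
            last_end = end if last_end is None else max(last_end, end)
        else:
            # Gap of max_gap residues or more: open a new group.
            current += 1
            last_end = end
        groups.append(current)
    return groups


print(cluster_repeats(HUMAN_SPANS))
# [0, 0, 0, 0, 0, 1, 1, 1, 1, 1, 2, 3, 3, 3, 4]
```

On the human positions this yields five groups that coincide with the a, b, d, e and f names in the table; the letter c does not appear because that repeat is listed for the ascidians only. Absolute positions do not always reproduce the names: C. intestinalis g1 (2771–2809) and g2 (2864–2904) are 55 residues apart in the protein sequence yet share a letter, which is consistent with the footnote's statement that naming follows the aligned sequences.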