Table 3

Ascidian and human HEAT repeats mapped on the protein sequence of the corresponding species.

Species
HEAT name
REP E-value
Htt region
Location
Sequence

C. intestinalis






A1
0.0005
N-term
58–96
PGLLAVSVETLLQSCADDNADVRLNANECLNRLIKGLYE

A2
5.96E-06
N-term
139–177
RPYILNLLPCLCRISQREEDGVQETLGLSLVKIFKILGP

A3
1.35E-06
N-term
181–219
ESEIQGLLASFLKNLSHKSATMRRTACVCLHSVILNCRK

B4
6.19E-06
N-term
682–720
QSLSHQALSIALKCLCDDDLRLRKTAAATIVTMPTSFPT

c
2.30E-06
Central
867–905
SQQQFGILPFVMSLLHSAWLPLDVTAHSDALVLAGNLVA

E1
1.26E-06
Central
1341–1378
QGSASHVIPAMQPIIHDI.YVVRASSKNEPPEVTTQREV

g1
9.05E-06
C-term
2771–2809
ARVMSKVLPSMLDDFFPAQDIMNKIIAEFISTLQPFPAS

g2
1.46E-06
C-term
2864–2904
NRWISSMVPLIISRVHDPTLDVDWTCFCKAAVDFYTCQLSE
C. savignyi






A1
2.92E-07
N-term
58–96
PGLLAVSVETLLQSCADENADVRLNSNECLNRVIKGLYD

A2
0.0001
N-term
139–177
RPYILNLLPCLCRISQREEDAVQEVLSSSLAKIFIVLGA

A3
2.52E-06
N-term
181–219
ESEIQGLLASFLKNLSHKSPTVRRTACICLHSILTNSRK

B4
1.53E-06
N-term
692–730
KSIAQKALSIALECLCDEDTRLRKTSSAAIVSMATSYPT

c
1.46E-06
Central
876–914
AQQQFGILPIVMSLLRSAWLPLDVTAHSDALVLAGNLIA

E1
-
Central
1352–1389
QGSASHVIPAMQPITHDI.FVVRGSLKNEPPEVTTQREV

g1
1.27E-06
C-term
2770–2808
ARVMSKILPSMLDDFFPAQEIMNKIIAEFISTLQPFPGS

g2
-
C-term
2864–2903
RWISSMVPLIISRSHDPSLDRNWTCFCKSAVDFYTCQLSE
Homo sapiens






A1
4.75E-07
N-term
124–162
QKLLGIAMELFLLCSDDAESDVRMVADECLNKVIKALMD

A2
0.0001
N-term
205–243
RPYLVNLLPCLTRTSKRPEESVQETLAAAVPKIMASFGN

A3
5.48E-07
N-term
247–285
DNEIKVLLKAFIANLKSSSPTIRRTAAGSAVSICQHSRR

a4
*
N-term
291–329
SWLLNVLLGLLVPVEDEHSTLLILGVLLTLRYLVPLLQQ

a5
7.77E-06
N-term
318–362
LTLRYLVPLLQQQVKDTSLKGSFGVTRKEMEVSPSAEQLVQVYEL

b1
*
N-term
745–783
EYPEEQYVSDILNYIDHGDPQVRGATAILCGTLICSILS

b2
1.04E-06
N-term
803–841
TFSLADCIPLLRKTLKDESSVTSKLACTAVRNCVMSLCS

b3
*
N-term
842–880
SSYSELGLQLIIDVLTLRNSSYWLVRTELLETLAEIDFR

B4
6.69E-08
N-term
904–942
KLQERVLNNVVIHLLGDEDPRVRHVAAASLIRLVPKLFY

b5
9.05E-06
N-term
984–1025
RIYRGYNLLPSITDVTMENNLSRVIAAVSHELITSTTRALTF

d
5.62E-06
Central
1425–1463
RLFEPLVIKALKQYTTTTCVQLQKQVLDLLAQLVQLRVN

E1
*
Central
1534–1575
RKAVTHAIPALQPIVHDLFVLRGTNKADAGKELETQKEVVVS

e2
*
Central
1610–1648
RQIADIILPMLAKQQMHIDSHEALGVLNTLFEILAPSSL

e3
*
Central
1670–1710
TVQLWISGILAILRVLISQSTEDIVLSRIQELSFSPYLISC

f
3.51E-06
C-term
2798–2836
DDTAKQLIPVISDYLLSNLKGIAHCVNIHSQQHVLVMCA

HEAT repeats are named according to their relative position along the chordate aligned sequences, using the same letter for repeats closer than 45 amino acids. Orthologous HEAT repeats conserved in ascidians and human share the same name, and are reported in upper case. The Expectation values (E-value) was calculated by the REP program [62]. Htt regions defined as in Methods. Absolute position of the HEAT repeats in the corresponding protein sequence is reported in the "Location" column. Dash: REP E-value not statistically significant. Asterisk: HEAT repeats originally described in Andrade and Bork [18] but not identified by the REP program as statistically significant [62].

Gissi et al. BMC Genomics 2006 7:288   doi:10.1186/1471-2164-7-288