Table 5

Amino acid repeat frequency in protozoan parasitic proteomes.

Species

Total Predicted Coding Sequences

SAARs (10+)

DPRs (4+)

SRR (3+ Repeats)

Total amino acid repeat containing proteins *

Total % repeat containing proteins


L.braziliensis

7046

34

40

123

190

2.70%

L.infantum

8183

60

60

158

259

3.17%

L.major

8302

80

85

174

315

3.79%

T.brucei

8758

86

97

177

346

3.95%

T.congolense

17203

105

60

504

643

3.73%

T.cruzi

25401

594

245

514

1264

4.98%

D.discoideum

13498

3741

1003

2060

4627

34.28%

E.histolytica

9766

10

7

257

272

2.79%

P.berghei

12235

58

104

346

496

4.05%

P.chabaudi

15007

37

45

249

328

2.19%

P.falciparum

5479

853

256

1490

1835

33.49%

P.vivax

5352

111

113

1050

1157

21.62%

P.yoelii

8761

103

155

1024

1182

13.49%


* Some proteins contain several individual repeats. These are taken into account here.

Depledge et al. BMC Bioinformatics 2007 8:122   doi:10.1186/1471-2105-8-122

Open Data