Skip to main content

Table 3 A selection of the most repetitive proteins from pathogens

From: Surface antigens and potential virulence factors from parasites detected by comparative genomics of perfect amino acid repeats

Name, accession

Sp

L

Repeat

pP

Hypothetical protein, Tb927.1.1740

Tb

7154

132 × LAEESQQHTARSEADIDE

2806

Gene 11-1 protein*, Q8I6U6

Pf

10589

967 × EEV

2457

Conserved protein, LmjF29.0110

Lm

3418

146 × AEEQARR

1080

Proteophosphoglycan-like, LmjF35.0550

Lm

2425

105 × SSSSSAPSA

1052

Putative antigen*, Tb04.29M18.750

Tb

4455

66 × NEQYETLQRTNAA

958

Gb4*, Tb09.160.1200

Tb

8214

35 × VVIIDCRLGSLLIDYKVI

701

Hypothetical protein, Chro.50162

Ch

1589

84 × KKDAP

407

Hypothetical protein, Q8I455

Pf

2349

67 × LKEEER

389

Interspersed repeat antigen*, Q8I486

Pf

1720

67 × QEPVT

313

Putative antigen 332*, Q8IHN3

Pf

5507

144 × EEI

274

Cell wall surface anchor family, Q97P71

Spn

4776

1074 × SAS

3418

Cell surface SD repeat protein, Q88XB6

Lpl

3360

796 × DS

1619

Hypothetical protein, Q8E473

Sag

1310

106 × TSAS

447

Putative peptidoglycan-bound, Q8Y697

Lmo

903

78 × ADADA

403

Avirulence protein, Q5GYF3

Xor

1790

20 × ETVQRLLPVLCQDHGLTP

401

Serine/threonine-rich antigen, Q99QY4

Sau

2271

163 × STS

391

PE-PGRS family, PG54_MYCTU

Mt

1901

136 × GAG

326

Structural toxin RtxA, Q5X7A6

Lpn

7679

29 × RFEDDGPVV

247

Ice nucleation protein, Q8PD38

Xca

1333

52 × GYGST

242

PPE family protein, Q6MX44

Mtu

3300

95 × NTG

184

  1. Eukaryotic proteins (top) whose expression is confirmed by the presence of expressed sequence tags (EST) in GenBank are marked with an asterisk. L, length; pP, negative logarithm of the P-value; Sp, species (Ch, C. hominis; Lm, L. major; Pf, P. falciparum; Tb, T. brucei; Lmo, Listeria monocytogenes; Lpl, Lactobacillus plantarum; Lpn, Legionella pneumophila; Mtu, M. tuberculosis; Sau, S. aureus; Spn, S. pneumoniae; Sag, Streptococcus agalactiae; Xca, Xanthomonas campestris; Xor, X. oryzae).