Skip to main content

Table 3 A selection of the most repetitive proteins from pathogens

From: Surface antigens and potential virulence factors from parasites detected by comparative genomics of perfect amino acid repeats

Name, accession Sp L Repeat pP
Hypothetical protein, Tb927.1.1740 Tb 7154 132 × LAEESQQHTARSEADIDE 2806
Gene 11-1 protein*, Q8I6U6 Pf 10589 967 × EEV 2457
Conserved protein, LmjF29.0110 Lm 3418 146 × AEEQARR 1080
Proteophosphoglycan-like, LmjF35.0550 Lm 2425 105 × SSSSSAPSA 1052
Putative antigen*, Tb04.29M18.750 Tb 4455 66 × NEQYETLQRTNAA 958
Gb4*, Tb09.160.1200 Tb 8214 35 × VVIIDCRLGSLLIDYKVI 701
Hypothetical protein, Chro.50162 Ch 1589 84 × KKDAP 407
Hypothetical protein, Q8I455 Pf 2349 67 × LKEEER 389
Interspersed repeat antigen*, Q8I486 Pf 1720 67 × QEPVT 313
Putative antigen 332*, Q8IHN3 Pf 5507 144 × EEI 274
Cell wall surface anchor family, Q97P71 Spn 4776 1074 × SAS 3418
Cell surface SD repeat protein, Q88XB6 Lpl 3360 796 × DS 1619
Hypothetical protein, Q8E473 Sag 1310 106 × TSAS 447
Putative peptidoglycan-bound, Q8Y697 Lmo 903 78 × ADADA 403
Avirulence protein, Q5GYF3 Xor 1790 20 × ETVQRLLPVLCQDHGLTP 401
Serine/threonine-rich antigen, Q99QY4 Sau 2271 163 × STS 391
PE-PGRS family, PG54_MYCTU Mt 1901 136 × GAG 326
Structural toxin RtxA, Q5X7A6 Lpn 7679 29 × RFEDDGPVV 247
Ice nucleation protein, Q8PD38 Xca 1333 52 × GYGST 242
PPE family protein, Q6MX44 Mtu 3300 95 × NTG 184
  1. Eukaryotic proteins (top) whose expression is confirmed by the presence of expressed sequence tags (EST) in GenBank are marked with an asterisk. L, length; pP, negative logarithm of the P-value; Sp, species (Ch, C. hominis; Lm, L. major; Pf, P. falciparum; Tb, T. brucei; Lmo, Listeria monocytogenes; Lpl, Lactobacillus plantarum; Lpn, Legionella pneumophila; Mtu, M. tuberculosis; Sau, S. aureus; Spn, S. pneumoniae; Sag, Streptococcus agalactiae; Xca, Xanthomonas campestris; Xor, X. oryzae).