- Open Access
Identification and bioinformatic analysis of the membrane proteins of synechocystis sp. PCC 6803
Proteome Sciencevolume 7, Article number: 11 (2009)
The membranes of Synechocystis sp. PCC 6803 play a central role in photosynthesis, respiration and other important metabolic pathways. Comprehensive identification of the membrane proteins is of importance for a better understanding of the diverse functions of its unique membrane structures. Up to date, approximately 900 known or predicted membrane proteins, consisting 24.5% of Synechocystis sp. PCC 6803 proteome, have been indentified by large-scale proteomic studies.
To resolve more membrane proteins on 2-D gels for mass spectrometry identification, we separated integral proteins from membrane associated proteins and collected them as the integral and peripheral fractions, respectively. In total, 95 proteins in the peripheral fraction and 29 proteins in the integral fraction were identified, including the 5 unique proteins that were not identified by any previous studies. Bioinformatic analysis revealed that the identified proteins can be functionally classified into 14 distinct groups according to the cellular functions annotated by Cyanobase, including the two largest groups hypothetical and unknown, and photosynthesis and respiration. Homology analysis indicates that the identified membrane proteins are more conserved than the rest of the proteome.
The proteins identified in this study combined with other published proteomic data provide the most comprehensive Synechocystis proteome catalog, which will serve as a useful reference for further detailed studies to address protein functions through both traditional gene-by-gene and systems biology approaches.
The membrane system of Synechocystis sp. PCC 6803 (thereafter referred as Synechocystis) is one of the best systems for performing functional membrane proteomic analysis because of its unique membrane organization. Synechocystis, a widely used model strain of gram-negative unicellular cyanobacterium for the studies of photosynthesis and other metabolic processes, has the outer and plasma membranes as well as an intracellular membrane system, called the thylakoid [1, 2]. The outer and plasma membranes of Synechocystis contain important proteins involved in a variety of functions, such as nutrient uptake, secretion, and multidrug efflux pumps and energy transduction while the thylakoid membrane enriches photosynthetic and respiratory proteins [3–6]. A large-scale functional proteomic analysis can help to identify novel proteins involved in photosynthesis, respiration and other cellular processes to extend the current understanding of the fundamental signal transduction and metabolic pathways. In addition, the 3.57 Mb-genome of Synechocystis was completely sequenced  and a total of 3,673 genes including 3,168 genomic genes and 505 plasmid genes were annotated in Cyanobase http://bacteria.kazusa.or.jp/cyanobase/, making it feasible for the large scale proteomic analysis. Furthermore, Synechocystis can be easily transformed and has a homologous recombination system, enabling the further functional study of proteins identified by proteome using reverse genetics approaches. Although multiple studies have been performed to catalog Synechocystis membrane proteome [3–6, 8–10], it is still a challenge to identify all the membrane proteins due to their low abundance and low solubility.
Two-dimensional gel electrophoresis in conjunction with mass spectrometry has been widely used for global analysis of proteins. However, membrane proteins, especially integral membrane proteins, remain as difficult for 2-DE analysis due to their insolubility, high hydrophobicity, low abundance and/or aggregation during IEF [4, 11–13]. Use of different combinations of strong nonionic detergents and chaotropes can increase the solubility of membrane proteins [14, 15], whereas separation of membrane proteins into different compartments can help to enrich low-abundance membrane proteins [3–5, 8]. In addition, protein prefractionation that reduces sample complexity and enriches low abundant proteins, combined with the use of stronger denaturing and reducing reagents compatible for 2-DE running conditions, could be an effective method to resolve membrane proteins on 2-D gels.
Previously, we identified 51 proteins from the membranes of Synechocystis by TCA/acetone precipitation and 2-DE, most of which are membrane associated proteins . Here, we describe a different approach to separate and enrich integral proteins from membrane associated proteins using high concentration of urea. This approach has been reported to improve identification of integral proteins from the purified Synechocystis plasma membrane  and thylakoid membrane . Proteins from each fraction were resolved by 2-DE with multiple pH ranges in the first dimension and were identified by matrix-assisted laser-desorption and ionization time-of-flight (MALDI-TOF) mass spectrometry. The results along with other published proteome data serve as a reference for further studies to address detailed functions of membrane proteins in a specific physiological context. Bioinformatic analysis suggested that the identified proteins are more evolutionally conserved than the rest of the proteins in the Synechocystis proteome. Functional classification of these proteins revealed that the identified membrane proteins are implicated in a wide spectrum of cellular processes, including the highly represented process, i.e., photosynthesis and respiration.
Growth of Synechocystis sp. PCC 6803 and preparation of membranes
The wild-type strain of Synechocystis was cultured in BG-11 medium  with 5.0 mM glucose under ~40 μmol·m-2·s-1 light intensity at 30°C. For membrane preparation, cells at a late exponential phase were harvested and resuspended in a buffer containing 0.4 M sucrose, 50 mM MOPS, pH 7.0, 10 mM NaCl, 5 mM EDTA, and 0.5 mM PMSF. Cells were broken using a bead beater and the membranes were isolated by differential centrifugation [17, 18]. The chlorophyll concentration of each membrane preparation was measured in 80% acetone using a UV-160 U spectrophotometer (Shimadzu Scientific Instruments, Columbia, MD, USA) [19, 20].
Isolation of integral and membrane associated proteins
The membranes were further purified through washing with 20 mM MOPS, pH 7.0, 50 mM EDTA for 5 times to remove cytoplasmic proteins. The purified membranes were first extracted with 8.0 M urea to release the membrane associated proteins, and then centrifuged at 75,600 × g to pellet the insoluble fraction that was enriched with the integral membrane proteins. The supernatant and the pellet were collected, respectively. The pellet was further extracted with 8.0 M urea, and the supernatant of this extraction was combined with the supernatant from the first extraction and labeled as the peripheral fraction. Similarly, the insoluble pellet from the second extraction was labeled as the integral fraction. The peripheral fraction was diluted four times with deionized water and centrifuged at 75,600 × g to collect the carry-over insoluble fraction; the latter was combined with the previously collected integral fraction. The proteins in the peripheral fraction were precipitated by 10% TCA for 30 minutes on ice. The precipitated proteins were spun down and subsequently extracted with 100% ice-cold acetone to remove lipids and pigments. The integral fraction was also washed with ice-cold acetone multiple times until the wash acetone was colorless. Both fractions were dried under a vacuum, solubilized with the multiple surfactant solution (5.0 M urea, 2.0 M thiourea, 2.0 mM TBP, 2% CHAPS, 2% sulfobetaine 3–10, 0.5% carrier ampholytes, 40 mM Tris, 0.001% orange G dye), and sonicated for 15 minutes in a waterbath at 4°C. During this step, nearly all proteins in the peripheral fraction were dissolved, whereas the integral proteins were only partially solubilized. The insoluble parts in both fractions were removed by centrifugation at 75,600 × g. Protein concentration of both fractions was measured with Bio-Rad Dc Protein assay kit (Bio-Rad, Richmond, CA, USA).
2-DE of membrane proteins
The immobilized pH gradient strips with different pH ranges (18 cm, pH 3–10, non-linear, pH 4–7, pH 4–5, pH 5–6, Pharmacia Biotech, Uppsala, Sweden) were rehydrated by 320 μl sample solutions containing approximately 500 μg of proteins from the corresponding fractions. Active rehydration was accomplished by applying low voltage  for 10 hours after 2-hour rehydration without voltage at 20°C. The first-dimensional IEF for pH 3–10 and pH 4–7 was performed with an IPGphor instrument (Pharmacia Biotech, Uppsala, Sweden) using the following voltage settings: 100 V for 0.5 h, 300 V for 0.5 h, 1,000 V for 0.5 h, 2,500 V for 0.5 h, 5,000 V for 0.5 h, and then 8,000 V until a total of 80,000 Vh was reached. For narrow pH range IPG (pH 4–5 and pH 5–6), the settings were the same except that a total of 12,000 Vh was reached. Upon electrophoresis, the proteins on the strips were denatured and cysteinyl residues were reduced by equilibrating the IPG strips with a buffer containing 6.0 M urea, 2% SDS, 0.375 M Tris/HCl, pH 8.8, 20% glycerol, 5.0 mM TBP, and 2.5% acrylamide monomer for 20 minutes. The second-dimensional electrophoresis was performed using 12–18% gradient SDS-PAGE gels. Upon electrophoresis, the protein spots on the SDS-PAGE gel were stained with colloidal Coomassie Brilliant Blue (CBB) and the gels were scanned using GS-800 Calibrated Imaging Densitometer (Bio-Rad, Richmond, CA, USA) to obtain images for analysis by Melanie II software [22, 23].
Protein spots that were visualized with CBB were excised manually and incubated at 37°C with 2.5 mM Tris HCl (pH 8.5) in 50% acetonitrile to remove the dye bound to the proteins. The gel pieces were dried under a vacuum followed by incubation with 10 μl of 10 μg/ml trypsin in 2.5 mM Tris-HCl, pH 8.5 at 37°C for 18 h. The resulting tryptic fragments were eluted by diffusion into 50% acetonitrile and 0.5% trifluoroacetic acid (TFA). Diffusion of peptide fragments was facilitated by ultrasonication in a waterbath at 4°C. One microliter of the tryptic peptides of each sample was mixed with 1.0 μl of α-cyano-4-hydroxy-cinnamic acid matrix prior to be transferred to a 100-well plate for MALDI-TOF. A Voyager-DE PRO Biospectrometry Workstation was used to acquire mass spectra in a reflection-delayed extraction mode over a mass range of 600–4,000 Da. The final mass spectra were the accumulation of the spectra obtained from 3–6 positions with 64 shots (total 192–384 shots). If high resolved mass spectra could not be obtained for a spot, the sample would be concentrated by drying again using a vacuum followed by resuspension in 2.5 μl of elution buffer and 2.5 μl of 2.5 mM Tris HCl (pH 8.5). The increased peptide concentrations allowed mass spectral detection of any protein spots that could be visualized with CBB. Some proteins had fewer tryptic sites and did not produce enough tryptic peptides within the mass range 600–4,000 Da for the identification. For such proteins, spectra within the mass range 3,000–6,000 Da were also acquired to ensure good resolution for the spectra between 3,000–5,000 Da. Subsequently, the peptide mass fingerprints (PMF) generated from the two mass ranges were combined as one PMF file to search the database. The peptide ions generated by autolysis of trypsin (with m/z 832.33+, 842.51+, 1045.56+, 2211.10+) were used as the internal standard peaks for mass calibration. The mass spectra were analyzed and the PMF for each sample was generated with the Data Explorer software.
Peptide masses were used to search NCBInr databases with the MS-Fit program http://prospector.ucsf.edu using the following parameters: mass tolerance of 25 ppm, a minimum of two peptides match with one missed cleavage. The identity of proteins with high scores in the MS-Fit analysis was further validated by three other criteria: mass, pI and amino acid sequence coverage. The deduced mass of the putatively identified proteins should match the apparent mass estimated from the corresponding spots on the 2-D gel. In the mass comparisons, we considered possible post-translational modifications (PTMs) which could increase or decrease the mass. The second criterion is the deduced pI of proteins, which should be close to the pI estimated from the 2-D gel. Again, the possibility of PTMs that may change pI was considered. The last criterion is the amino acid sequence coverage which delineates the ratio of the number of identified amino acid residues to the total number of amino acid residues for each individual protein. Generally the higher the sequence coverage, the more confident the identification is. However, it should be noted that the sequence coverage of a protein strongly depends on the chemophysical property of its amino acid sequence and the abundance. Sequence coverage information is particularly important for the identification of low molecular weight proteins, e.g. photosystem I PsaE subunit [82% coverage, see additional file 1], because their MOWSE scores are usually lower compared with those of high molecular weight proteins due to the smaller number of distinct tryptic peptides. In addition to these criteria, correspondence to each of the identified proteins in the Cyano2Dbase was also used as a positive control. If a protein spot is identified as the same protein represented by a spot in Cyano2Dbase and their coordinates (apparent protein mass and pI) are also matched, then the identification for this particular protein is considered to be further confirmed.
The motif and domain of hypothetical proteins were predicted by InterPro program http://www.expasy.ch. Hydropathy analysis for deduced protein sequences was performed by predicting transmembrane (TM) helices using TopPred program http://www.expasy.ch. The cut-off for TopPred scores was set to 0.6 to include only high confident TM prediction .
Conservative analysis of Synechocystis proteins
The amino acid sequences of Synechocystis proteins were retrieved from Cyanobase automatically and saved into a file in fasta format using an in-house software. The entire proteome sequences of Arabidopsis thaliana (thereafter Arabidopsis) were retrieved from The Arabidopsis Information Resource (TAIR, http://www.arabidopsis.org). Synechocystis proteins and their Arabidopsis homologs were analyzed using the software Blastpro , which automatically searches the Arabidopsis proteome for the homolog of each Synechocystis protein.
Prefractionation of membrane proteins
The low abundance of some membrane proteins is one of the major hindrances to their identification and functional characterization. To analyze the membrane proteome of Synechocystis, we first enriched low abundant proteins by separating membranes into the peripheral and integral fractions. The peripheral fraction mainly contains the membrane associated proteins that were released from the lipid-bilayer by 8.0 M urea extraction, whereas the integral fraction mainly contains the integral membrane proteins that are refractory to the urea extraction. The fractionated proteins were separated by SDS-PAGE and stained with CBB. The result revealed that the protein migration patterns of the two fractions are not only different from each other, but also different from those of the total membrane proteins (Figure 1). The difference of the protein patterns was exhibited by the different dominant protein bands on the SDS-PAGE gel that are specific to either fraction, but not both (Figure 1), indicating that the fractionation is an efficient way to separate membrane associated proteins from integral proteins, which is in agree with the literature [4, 5]. The 68 kDa major bands representing high abundant integral proteins PsaA and PsaB were present only in the lane for the total membrane proteins but not in the other two lanes, indicating that these two 11-TM containing highly hydrophobic proteins were not solubilized by the extraction buffer. In fact, no report has shown that these proteins can be solubilized in a buffer compatible for 2-DE. Both the peripheral and integral fractions contain many high molecular weight proteins that were not detected in our previous study using TCA/acetone precipitation of total membranes (Figure 1) . Although the multiple surfactant solution used here is stronger than the rehydration buffer used in our previous study , detection of more high molecular weight proteins is more likely due to enrichment of low abundant proteins by serial extraction rather than the difference of solubilizing solution. This is supported by the observation that the serial extraction resolved more high molecular weight proteins than TCA/acetone precipitation did even we used the same solution for protein solubilization (unpublished data). Therefore, the serial extraction is an effective way for enriching low abundant membrane proteins.
2-D separation and identification of the fractionated proteins
To further enrich low abundant membrane proteins for the identification, we separated proteins on 2-D gels with multiple pH ranges. For the peripheral fraction, we used four 2-D gels covering pH ranges 3–10, 4–7, 4–5 and 5–6 (Figure 2). For the integral fraction, we used only two 2-D gels with pH range 3–10 and 4–7 (Figure 3) because the sample complexity in this fraction was relatively low compared with the peripheral fraction (Figure 1). Each gel had three replicates and only the protein spots presented in all replicates were chosen for PMF identification. In total, more than 600 protein spots were observed on 2-D gels for the peripheral fraction, and more than 200 protein spots were observed on 2-D gels for the integral fraction.
Proteins were identified as previously described . A total of 112 proteins were identified from 312 spots, including 95 proteins represented by 249 spots in the peripheral fraction, and 29 proteins represented by 63 spots in the integral fraction [see additional file 1]. It is notable that 12 proteins were present in both the fractions. These proteins may contain differential PTMs that allow different subpopulations of these proteins to differentially associate with the membranes, because PTM appears to be common for membrane proteins as evidenced by many proteins with multiple isoforms [Figure 2 and 3; see additional file 1]. Nevertheless, the majority of the identified proteins in the integral fraction were not found in the peripheral fraction or vice versa, indicating that the fractionation method is effective. Forty of these proteins contain multi-isoforms that appeared as multiple-spots with equal mass but different pI values on 2-D gels, an indicative of PTMs causing the protein pI shift. The topology analysis using the software TopPred [6, 24] revealed that 20 out of 29 (69.0%) proteins in the integral fraction contain at least one TM [see additional file 2], including the 4-TM containing protein Slr0891. The 9 predicted non-TM containing proteins may either tightly interact with the membrane embedded proteins, or may associate with the membranes through post-translationally attached lipid components. For example, the protein Sll1450 was predicted to be a lipoprotein by the algorithm LipoP, which correctly predicts 96.8% of lipoproteins in Gram-negative bacteria . In contrast, 16 out of 95 (16.8%) proteins in the peripheral fraction were predicted to contain one TM (data not shown). The presence of TM containing proteins in the peripheral fraction is possibly due to the cross contamination caused by incompletely spinning down of insoluble fraction during fractionation. However, the higher ratio of TM-containing proteins in the integral fraction strongly suggests that integral membrane proteins were specifically enriched by the serial extraction approach. The majority of the proteins (77 out of 112) identified here were not identified in our previous study, where 51 proteins were identified from the total membranes without prefractionation . Among the 128 proteins identified by these two studies, 41 proteins were also described in Cyano2Dbase [27, 28]. It should be mentioned that only 5 proteins were exclusively identified by this study, and the rest of the proteins have been previously identified by others using either 2-DE based proteomic approach or shotgun proteomic approach (see the discussion).
Analysis of the subcellular localization of the identified membrane proteins
The Synechocystis is a Gram-negative cyanobacterium containing a plasma membrane, an outer membrane, and a photosynthetic thylakoid membrane. Seventy-two, 29, and 76 proteins have been identified from each purified membrane by several consecutive studies [3–5, 8], respectively. Because we did not separate the membranes into such three fractions, an identified protein in additional file 1 may come from any of the three distinct membrane fractions mentioned above. It should be mentioned that the proteins identified from our previously work might come from any of the three distinct membrane fractions as well . Comparisons of the proteins in additional file 1 and those identified by the above studies revealed that 34 proteins were matched as the thylakoid membrane proteins , 33 proteins were matched as the plasma membrane proteins [4, 8], and 15 proteins were matched as the outer membrane proteins . Several interesting observations were also obtained from the comparisons. First, 11 proteins were matched as both the thylakoid and plasma membrane proteins, e.g., the subunits of the photosystem I/II complexes and the subunits of ATP synthase [see additional file 1]. This is not surprising because accumulating evidence suggested that the plasma membrane is involved in the early steps of the biogenesis of the photosystem [8, 29]. Furthermore, membrane vesicles transport from the plasma membrane to the thylakoid membrane or vice versa could also facilitate the exchange of the protein components between the two membrane systems [8, 30, 31]. Second, 6 proteins were matched as both the plasma membrane and outer membrane proteins, including two porins (Slr1841, Slr1908) and four hypothetical proteins (Slr1506, Sll1835, Slr0431 and Slr1270). Finally, 2 proteins were present in all of the three membrane systems, including photosystem II subunit PsbQ (Sll1638) [32–34] and the iron transport system substrate-binding protein Slr1295. Both are lipoproteins predicted by LipoP . The last two observations are intriguing and cannot simply be explained by the cross contamination of distinct membrane fractions because the data used for the comparison were generated from the completely separated membrane fractions [3–5, 8]. A more reasonable explanation is that there are interactions or transport processes between these membrane systems causing the exchange of the protein components, yet the type and function of the interactions and transport processes remain to be further characterized [3, 5, 8].
In addition to the overlapping proteins whose cellular locations were determined by aforementioned studies, 54 proteins identified by the current study were not assigned a subcellular location because they were not identified by the above studies. Some of these proteins are known to be the thylakoid membrane associated proteins including multiple photosystem I and II related proteins, e.g., the photosystem II subunit PsbU (Sll1194), photosystem II oxygen-evolving complex 23 K protein PsbP homolog (Sll1418), and photosystem I assembly related protein (Slr0823) [see additional file 1]. The data suggest that the serial extraction of membrane proteins combined with narrow pH range 2-DE separation helped to resolve and identify more Synechocystis membrane proteins.
Functions of the identified membrane proteins
To better understand the functional diversity and importance of the membrane proteins, we analyzed the cellular function of each identified protein by searching the gene annotations in Cyanobase . The identified proteins can be categorized into 14 different functional groups, which is 87.5% (14 out of 16) of all functional categories of Synechocystis proteins described in Cyanobase . Except the hypothetical and unknown proteins, the largest group of the identified proteins is photosynthesis and respiration (Figure 4), which consists of 26.56% (34 out of 128) of the total identified proteins. In this functional group, some components of the major functional protein complexes in photosynthetic and respiratory electron transport chains were identified. For example, in the photosynthetic process, proteins from photosystem I (Ssl0563, Ssr2831, and Slr0737), photosystem II (Slr2034, Sll0427, Sll1418, Sll1194, Sll1398, and Sll1638), cytochrome b 6 f complex (Sll136, Ssr2998), and ATP synthase subunits (Sll1326, Sll1325, Sll1324, Sll1323, and Slr1330) were identified [32–34, 36, 37], whereas in the respiratory process, NADH dehydrogenase subunits (Slr0261, Slr1280, Slr1281, Slr1623, and Ssl1690) were also identified [21, 38–40]. Moreover, multiple phycobilisome subunits were identified, including the proteins Sll1580, Sll1577, Slr2067, and Slr1986. These proteins are known to be the thylakoid membrane associated proteins involved in photosynthesis .
Conservative analysis of the identified membrane proteins
Cyanobacteria are considered to be the ancestor of chloroplast in higher plants . In this context, many proteins are expected to be conserved between Synechocystis and higher plants, e.g., Arabidopsis, for performing conserved functions. However, Synechocystis is an independent organism requiring diversified functions to support life cycles, whereas the chloroplast of Arabidopsis is a subcellular organelle specialized for a certain type of functions, e.g., photosynthesis. Therefore, some proteins that are essential in Synechocystis may not be necessary for the proper functioning of chloroplast in plants. This functional redundancy may facilitate the release of the selection pressure that favors the conservation of these proteins, which will gain diversified functions and amino acid sequences through evolution. To investigate which proteins are more phylogenically conserved between Synechocystis and Arabidopsis, we performed the conservative analysis for all Synechocystis proteins by automatically searching the Arabidopsis proteome using Blastpro, an automatic Blast software for the analysis of the homology between two lists of proteins [25, 43]. In total, 393 (12.0%), 206 (6.3%), and 81 (2.5%) Synechocystis proteins have homologs in Arabidopsis with minimal sequence similarity 40%, 50%, and 60% respectively (Figure 5). To investigate the conservation of the membrane proteins, we performed a similar analysis for the membrane proteins using Blastpro . Of the identified 128 proteins, 30 (23.4%), 17 (13.3%), and 9 (7.0%) proteins have homologs in Arabidopsis with minimal sequence similarity 40%, 50%, and 60%, respectively (Figure 5). Interestingly, the ratio of the highly conserved proteins, that is, proteins with minimal sequence similarity 40%, 50% or 60% in the membranes, is much higher than that in the total proteins (Figure 5). This result suggests that the identified membrane proteins are more evolutionally conserved than the rest of the proteins in the Synechocystis proteome, probably due to the necessity of the functional conservation of the corresponding proteins.
Products of the identified hypothetical genes
The genome of Synechocystis contains approximately 50% hypothetical or unknown genes that encode proteins with unknown functions. From our current and previous studies , we identified a total of 35 (27.34%) hypothetical and unknown proteins based on the information in Cyanobase and the literature (Figure 4). The majority of hypothetical proteins identified are low abundant because they were shown as weakly-stained spots on 2-D gels. The low abundance might prevent the previous discovery and functional analysis of these proteins. Several hypothetical proteins were present as major spots in 2-D gels with multiple isoforms, for example, the hypothetical protein encoded by the ORF slr1506, which was shown as the dominant spots at 68 kDa with 4 isoforms in the 2-D gel with pH 3–10 [see additional file 1 and Figure 3]. The unknown functions of these abundant hypothetical proteins may be due to their insolubility in normal aqueous solutions, which hampers their functional analysis.
To gain preliminary functional information of these hypothetical proteins, we performed the functional domain and motif analysis using the computational software InterProScan http://www.expasy.ch. The results revealed that 12 hypothetical proteins contain one or more domains and motifs [see additional file 3]. For examples, the protein encoded by ORF slr1506 has two domains, including the esterase/lipase/thioesterase domain and the ATP/GTP-binding site motif A. The esterase/lipase/thioesterase domain indicates that the protein has the hydrolase activity whereas the ATP/GTP-binding site motif A suggests that the activity is driven by the energy from ATP or GTP. The protein encoded by ORF slr0038 belongs to the mitochondrial energy transfer protein (carrier proteins) family that are found in the inner mitochondrial membrane [44–48]. In Synechocystis, the thylakoid membrane contains the machinery for energy metabolism such as the photosynthesis and respiration electron transport chains . Therefore, it is highly possible that the hypothetical protein encoded by slr0038 is a substrate carrier protein functionally related with the thylakoids.
Functional characterization of these proteins remains a challenge. Although computational prediction of functional domains of these proteins could partially alleviate this problem by gaining functional information from the domains themselves and the proteins that they may interact with, reverse genetic approach to generate gene knockout mutants and relevant biochemical and physiological characterizations are necessary to understand the detailed functions of these hypothetical proteins.
Membrane proteins contain integral proteins as well as membrane associated proteins linked to the lipid bilayer through direct protein-protein interactions or PTMs (e.g., lipid-anchored proteins). The membrane associated proteins can be released using organic solvents , high pH solutions [11, 50], or chaotropes [51, 52]. Here we used a high concentration of urea to separate the Synechocystis membranes into the integral and peripheral fractions; the latter mainly contains membrane associated proteins. Detection and identification of many proteins that are weakly stained on 2-D gels suggest that the fractionation of the membranes effectively enriched low abundant membrane proteins [Figure 1, 2, and 3; see additional file 1] probably because these proteins were unable to be detected in our previous study . Furthermore, the ratio of TM-containing proteins in the integral fraction is much higher than in the peripheral fraction, indicating that the fractionation approach specifically enriched integral membrane proteins. As stated previously, high concentration of urea has been successfully used to enrich integral membrane proteins in other studies [4, 5]. Collectively, the data suggest that the high concentration of urea is an effective way to separate membrane associated proteins and integral proteins for proteomic studies.
Synechocystis has its unique membrane and soluble compartments. The membrane compartments can be separated into the outer, plasma and thylakoid membranes whereas the soluble compartments can be separated into the periplasm, cytoplasm and lumen . All the membrane and lumenal spaces have their unique protein compositions, which are related to their specific functions in each subcellular location. It is of importance to precisely identify the subcellular locations of proteins, where their functions can be predicted. A number of studies have been recently reported to analyze the Synechocystis proteome. These studies were focused on proteins either in specific subcellular locations including the outer membrane [3, 53], plasma membrane [4, 8, 10, 53, 54], thylakoid membrane [5, 53], periplasm [53, 55, 56], or cytoplasm , or in the more general locations  such as membrane fraction [6, 58, 59], soluble fraction [60–62], or both [27, 28, 63–67]. To summarize these works and catalog the proteins with known or predicted subcellular locations, we generated a table [see additional file 4] that contains all the Synechocystis proteins identified based on the references above and the proteins identified in this study. In total, about 47.3% of Synechocystis proteome (1,738 proteins) have been identified [see additional file 4]. Specific subcellular locations of 173 identified proteins (10.0%) are determined (outer membrane: 10 proteins; plasma membrane: 75 proteins; periplasm: 52 proteins; thylakoid membrane: 36). For the rest of 1,565 proteins, they either have more than one locations in a cell or their exact location(s) has (have) not been determined by experiments. The proteins with undetermined cellular localizations were assigned as predicted membrane (PM)- or predicted soluble (PS)-proteins based on the TM prediction by TopPred [see additional file 4] . Among the 1,738 proteins, 1,291 proteins (74.3%) were uniquely identified by LC-MS/MS based shotgun proteomic approach [63–67]. The remaining 447 proteins were identified either uniquely by 2-DE based proteomic approach, or by both. Shotgun proteomic approach provides unprecedented power in protein identification and the majority of the proteins in additional file 1 were also identified by this approach. However, five new proteins identified in the current study have never been identified in any previous proteomic studies, including phosphate transport ATP-binding protein PstB (Sll0684), transcriptional repressor SmtB (Sll0792), and three hypothetical proteins (Sll1630, Sll1862, and Slr1053) [see additional file 1]. Identification of these new proteins is more likely due to the protein prefractionation that reduces sample complexity and enriches low abundant proteins. Therefore, we expect that sample prefractionation combined with LC-MS/MS based shotgun proteomics would provide higher coverage in proteome identification in the future.
In spite of the collective efforts in cataloging the Synechocystis proteome, 52.7% of the proteome (1,935 proteins) has not been identified so far [see additional file 5]. Multiple possibilities could account for the failure of identification for these proteins (i) some proteins are transiently expressed responding to a specific internal or external signal; (ii) proteins with high hydrophobicity or low abundance are generally difficult to be identified (iii) some proteins may not even express. The identified Synechocystis proteome contains 38.7% hypothetical and unknown proteins (673/1,738) [see additional file 6]. In contrast, the unidentified Synechocystis proteome contains much higher percent of hypothetical and unknown proteins (1,284/1,935 = 66.4%) [see additional file 7], suggesting that some of hypothetical and unknown proteins might not express. However, experimental evidences are still needed to verify their expression.
Collectively, the information on these identified proteins can be used as a reference for any studies targeting at mechanisms of important biological processes of Synechocystis, such as signal transduction cascades stimulated by a specific stimulus. In fact, some proteins were identified from Synechocystis under such physiological conditions including heat shock , salt stress [10, 64], acid stress , or heterotrophic condition . In addition, this information can be used for bioinformatic and computational analyses of Synechocystis proteome. For example, N-terminal features have been predicted for the outer and plasma membranes, the periplasm, and the thylakoid lumenal proteins using a combined proteomic and multivariate sequence analysis .
As mentioned earlier, the purified membranes in the current study contain the thylakoid membrane as well as the plasma membrane and outer membrane since we did not specifically isolate each type of the membrane. However, the abundance of the thylakoid membrane is orders of magnitude higher than that of the plasma membrane and outer membrane . Therefore, the majority of the membrane proteins are predicted to be associated with the thylakoid membrane instead of the plasma or outer membranes. In fact, more than 50% of the proteins with the assigned subcellular locations [3–5, 8] in additional file 1 are specific to the thylakoid membrane. This ratio allows us to deduce that the majority of the proteins in additional file 1 without the assigned subcellular locations may be also specific to the thylakoid membrane.
Among the proteins in additional file 1, 34 identified proteins are involved in the processes of photosynthesis and respiration. This group consists of 26.56% of the total identified membrane proteins, whereas only 3.89% of the whole Synechocystis proteome belong to this group. The data suggest that these proteins are highly abundant in the Synechocystis membranes, indicating that one of the major functions of the Synechocystis membranes is photosynthesis and respiration. Besides these proteins, the proteins involved in the process of translation are also highly represented, including the elongation factor EF-Tu and ribosomal proteins [see additional file 1 and Figure 4]. This is not surprising because ribosomes target to the thylakoid membrane when membrane-targeting proteins are synthesized in both cyanobacteria and chloroplasts of higher plants [69, 70].
One of the interesting findings from our study is that the identified membrane proteins are more conserved than the other proteins in Synechocystis, suggesting that the membrane proteome is more conserved than the rest of the proteome. The protein sequence conservation through evolution ensured the conservation of one of the major functions of the membranes, photosynthesis, between Synechocystis and the chloroplasts of higher plants that shares the nearly identical mechanism for the photosynthetic electron transport on the thylakoid membrane .
By using the serial extraction approach to separate membranes into the peripheral and integral fractions, we identified 112 membrane proteins from Synechocystis. The identified proteins are involved in 14 distinct groups of functions, which is an indicative of the diversified functions of the Synechocystis membrane system. Conserved analysis revealed that membrane proteins are evolutionally more conserved than soluble proteins, suggesting that membranes may perform more conserved functions through evolution.
Although the 2-DE based proteomic approach used here is less robust compared with the LC-MS/MS based shotgun proteomic approach, five of the proteins identified in this study were not identified previously by any shotgun proteomic studies, indicating that sample prefractionation is important for the identification of low abundant proteins. Therefore, we suggest that sample prefractionation combined with LC-MS/MS shotgun proteomic approach would achieve higher proteome coverage in protein identification.
The proteins identified in this study, together with previously identified proteins by others [3–5, 8, 10, 27, 28, 53–67], will provide the most comprehensive database to date for the identified Synechocystis proteins. This database will serve as a useful reference for future studies to understand the mechanisms of basic biological processes of this photosynthetic organism.
Coomassie Brilliant Blue
matrix-assistant laser-desporption ionization time-of-flight
open reading frame
polyacrylamide gel electrophoresis
peptide mass fingerprinting
sodium dodecyl sulphate
Gantt E: Supramolecular Membrane Organization. In Molecular Biology of Cyanobacteria. Edited by: Bryant DA. Dordrecht, The Netherlands: Kluwer; 1994:119–138.
Stanier RY, Cohen-Bazire G: Phototrophic prokaryotes: the cyanobacteria. Annu Rev Microbiol 1977, 31: 225–274.
Huang F, Hedman E, Funk C, Kieselbach T, Schroder WP, Norling B: Isolation of outer membrane of Synechocystis sp. PCC 6803 and its proteomic characterization. Mol Cell Proteomics 2004, 3: 586–595.
Pisareva T, Shumskaya M, Maddalo G, Ilag L, Norling B: Proteomics of Synechocystis sp. PCC 6803. Identification of novel integral plasma membrane proteins. Febs J 2007, 274: 791–804.
Srivastava R, Pisareva T, Norling B: Proteomic studies of the thylakoid membrane of Synechocystis sp. PCC 6803. Proteomics 2005, 5: 4905–4916.
Wang Y, Sun J, Chitnis PR: Proteomic study of the peripheral proteins from thylakoid membranes of the cyanobacterium Synechocystis sp. PCC 6803. Electrophoresis 2000, 21: 1746–1754.
Kaneko T, Sato S, Kotani H, Tanaka A, Asamizu E, Nakamura Y, Miyajima N, Hirosawa M, Sugiura M, Sasamoto S, et al.: Sequence analysis of the genome of the unicellular cyanobacterium Synechocystis sp. strain PCC6803. II. Sequence determination of the entire genome and assignment of potential protein-coding regions. DNA Res 1996, 3: 109–136.
Huang F, Parmryd I, Nilsson F, Persson AL, Pakrasi HB, Andersson B, Norling B: Proteomics of Synechocystis sp. Strain PCC 6803: Identification of Plasma Membrane Proteins. Mol Cell Proteomics 2002, 1: 956–966.
Norling B, Zak E, Andersson B, Pakrasi H: 2D-isolation of pure plasma and thylakoid membranes from the cyanobacterium Synechocystis sp. PCC 6803. FEBS Lett 1998, 436: 189–192.
Huang F, Fulda S, Hagemann M, Norling B: Proteomic screening of salt-stress-induced changes in plasma membranes of Synechocystis sp. strain PCC 6803. Proteomics 2006, 6: 910–920.
Santoni V, Rabilloud T, Doumas P, Rouquie D, Mansion M, Kieffer S, Garin J, Rossignol M: Towards the recovery of hydrophobic proteins on two-dimensional electrophoresis gels. Electrophoresis 1999, 20: 705–711.
Wilkins MR, Gasteiger E, Sanchez JC, Bairoch A, Hochstrasser DF: Two-dimensional gel electrophoresis for proteome projects: the effects of protein hydrophobicity and copy number. Electrophoresis 1998, 19: 1501–1505.
Wilkins MR, Sanchez JC, Williams KL, Hochstrasser DF: Current challenges and future applications for protein maps and post-translational vector maps in proteome projects. Electrophoresis 1996, 17: 830–838.
Chevallet M, Santoni V, Poinas A, Rouquie D, Fuchs A, Kieffer S, Rossignol M, Lunardi J, Garin J, Rabilloud T: New zwitterionic detergents improve the analysis of membrane proteins by two-dimensional electrophoresis. Electrophoresis 1998, 19: 1901–1909.
Rabilloud T, Adessi C, Giraudel A, Lunardi J: Improvement of the solubilization of proteins in two-dimensional electrophoresis with immobilized pH gradients. Electrophoresis 1997, 18: 307–316.
Mary Mennes A: Simple conditions for grwoth of unicellular blue-green algae on plates. Journal of phycology 1968, 4: 1–4.
Sun J, Ke A, Jin P, Chitnis VP, Chitnis PR: Isolation and functional study of photosystem I subunits in the cyanobacterium Synechocystis sp. PCC 6803. Methods Enzymol 1998, 297: 124–139.
Sun J, Xu W, Hervas M, Navarro JA, Rosa MA, Chitnis PR: Oxidizing side of the cyanobacterial photosystem I. Evidence for interaction between the electron donor proteins and a lumenal surface helix of the PsaB subunit. J Biol Chem 1999, 274: 19048–19054.
Arnon D: Copper enzyme in isolated chloroplasts. Polyphenol oxidase in Beta vulgaris. Plant Physiol 1949, 24: 1–14.
Chitnis VP, Xu Q, Yu L, Golbeck JH, Nakamoto H, Xie DL, Chitnis PR: Targeted inactivation of the gene psaL encoding a subunit of photosystem I of the cyanobacterium Synechocystis sp. PCC 6803. J Biol Chem 1993, 268: 11678–11684.
Battchikova N, Zhang P, Rudd S, Ogawa T, Aro EM: Identification of NdhL and Ssl1690 (NdhO) in NDH-1L and NDH-1M complexes of Synechocystis sp. PCC 6803. J Biol Chem 2005, 280: 2587–2595.
Appel RD, Palagi PM, Walther D, Vargas JR, Sanchez JC, Ravier F, Pasquali C, Hochstrasser DF: Melanie II – a third-generation software package for analysis of two-dimensional electrophoresis images: I. Features and user interface. Electrophoresis 1997, 18: 2724–2734.
Appel RD, Vargas JR, Palagi PM, Walther D, Hochstrasser DF: Melanie II – a third-generation software package for analysis of two-dimensional electrophoresis images: II. Algorithms. Electrophoresis 1997, 18: 2735–2748.
von Heijne G: Membrane protein structure prediction. Hydrophobicity analysis and the positive-inside rule. J Mol Biol 1992, 225: 487–494.
Wang Y, Hanley R, Klemke RL: Computational methods for comparison of large genomic and proteomic datasets reveal protein markers of metastatic cancer. J Proteome Res 2006, 5: 907–915.
Juncker AS, Willenbrock H, Von Heijne G, Brunak S, Nielsen H, Krogh A: Prediction of lipoprotein signal peptides in Gram-negative bacteria. Protein Sci 2003, 12: 1652–1662.
Sazuka T, Ohara O: Towards a proteome project of cyanobacterium Synechocystis sp. strain PCC6803: linking 130 protein spots with their respective genes. Electrophoresis 1997, 18: 1252–1258.
Sazuka T, Yamaguchi M, Ohara O: Cyano2Dbase updated: linkage of 234 protein spots to corresponding genes through N-terminal microsequencing. Electrophoresis 1999, 20: 2160–2171.
Zak E, Norling B, Maitra R, Huang F, Andersson B, Pakrasi HB: The initial steps of biogenesis of cyanobacterial photosystems occur in plasma membranes. Proc Natl Acad Sci USA 2001, 98: 13443–13448.
Kroll D, Meierhoff K, Bechtold N, Kinoshita M, Westphal S, Vothknecht UC, Soll J, Westhoff P: VIPP1, a nuclear gene of Arabidopsis thaliana essential for thylakoid membrane formation. Proc Natl Acad Sci USA 2001, 98: 4238–4242.
Westphal S, Heins L, Soll J, Vothknecht UC: Vipp1 deletion mutant of Synechocystis: a connection between bacterial phage shock and thylakoid biogenesis? Proc Natl Acad Sci USA 2001, 98: 4243–4248.
Kashino Y, Lauber WM, Carroll JA, Wang Q, Whitmarsh J, Satoh K, Pakrasi HB: Proteomic analysis of a highly active photosystem II preparation from the cyanobacterium Synechocystis sp. PCC 6803 reveals the presence of novel polypeptides. Biochemistry 2002, 41: 8004–8012.
Summerfield TC, Shand JA, Bentley FK, Eaton-Rye JJ: PsbQ (Sll1638) in Synechocystis sp. PCC 6803 is required for photosystem II activity in specific mutants and in nutrient-limiting conditions. Biochemistry 2005, 44: 805–815.
Thornton LE, Ohkawa H, Roose JL, Kashino Y, Keren N, Pakrasi HB: Homologs of plant PsbP and PsbQ proteins are necessary for regulation of photosystem ii activity in the cyanobacterium Synechocystis 6803. Plant Cell 2004, 16: 2164–2175.
Nakamura Y, Kaneko T, Hirosawa M, Miyajima N, Tabata S: CyanoBase, a www database containing the complete nucleotide sequence of the genome of Synechocystis sp. strain PCC6803. Nucleic Acids Res 1998, 26: 63–67.
Chitnis PR: PHOTOSYSTEM I: Function and Physiology. Annu Rev Plant Physiol Plant Mol Biol 2001, 52: 593–626.
Xu W, Tang H, Wang Y, Chitnis PR: Proteins of the cyanobacterial photosystem I. Biochim Biophys Acta 2001, 1507: 32–40.
Cooley JW, Howitt CA, Vermaas WF: Succinate:quinol oxidoreductases in the cyanobacterium synechocystis sp. strain PCC 6803: presence and function in metabolism and electron transport. J Bacteriol 2000, 182: 714–722.
Cooley JW, Vermaas WF: Succinate dehydrogenase and other respiratory pathways in thylakoid membranes of Synechocystis sp. strain PCC 6803: capacity comparisons and physiological function. J Bacteriol 2001, 183: 4251–4258.
Prommeenate P, Lennon AM, Markert C, Hippler M, Nixon PJ: Subunit composition of NDH-1 complexes of Synechocystis sp. PCC 6803: identification of two new ndh gene products with nuclear-encoded homologues in the chloroplast Ndh complex. J Biol Chem 2004, 279: 28165–28173.
Sarcina M, Tobin MJ, Mullineaux CW: Diffusion of phycobilisomes on the thylakoid membranes of the cyanobacterium Synechococcus 7942. Effects of phycobilisome size, temperature, and membrane lipid composition. J Biol Chem 2001, 276: 46830–46834.
Goksoyr J: Evolution of eucaryotic cells. Nature 1967, 214: 1161.
Wang Y, Ding SJ, Wang W, Jacobs JM, Qian WJ, Moore RJ, Yang F, Camp DG II, Smith RD, Klemke RL: Profiling signaling polarity in chemotactic cells. Proc Natl Acad Sci U S A 2007.
Jank B, Habermann B, Schweyen RJ, Link TA: PMP47, a peroxisomal homologue of mitochondrial solute carrier proteins. Trends Biochem Sci 1993, 18: 427–428.
Klingenberg M: Mechanism and evolution of the uncoupling protein of brown adipose tissue. Trends Biochem Sci 1990, 15: 108–112.
Kuan J, Saier MH Jr: Expansion of the mitochondrial carrier family. Res Microbiol 1993, 144: 671–672.
Nelson DR, Lawson JE, Klingenberg M, Douglas MG: Site-directed mutagenesis of the yeast mitochondrial ADP/ATP translocator. Six arginines and one lysine are essential. J Mol Biol 1993, 230: 1159–1170.
Palmieri F: Mitochondrial carrier proteins. FEBS Lett 1994, 346: 48–54.
Seigneurin-Berny D, Rolland N, Garin J, Joyard J: Technical Advance: Differential extraction of hydrophobic proteins from chloroplast envelope membranes: a subcellular-specific proteomic approach to identify rare intrinsic membrane proteins. Plant J 1999, 19: 217–228.
Fujiki Y, Hubbard AL, Fowler S, Lazarow PB: Isolation of intracellular membranes by means of sodium carbonate treatment: application to endoplasmic reticulum. J Cell Biol 1982, 93: 97–102.
Herskovits TT, Jaillet H, Gadegbeku B: On the structural stability and solvent denaturation of proteins. II. Denaturation by the ureas. J Biol Chem 1970, 245: 4544–4550.
Rabilloud T: Solubilization of proteins for electrophoretic analyses. Electrophoresis 1996, 17: 813–829.
Rajalahti T, Huang F, Klement MR, Pisareva T, Edman M, Sjostrom M, Wieslander A, Norling B: Proteins in different Synechocystis compartments have distinguishing N-terminal features: a combined proteomics and multivariate sequence analysis. J Proteome Res 2007, 6: 2420–2434.
Srivastava R, Battchikova N, Norling B, Aro EM: Plasma membrane of Synechocystis PCC 6803: a heterogeneous distribution of membrane proteins. Arch Microbiol 2006, 185: 238–243.
Fulda S, Huang F, Nilsson F, Hagemann M, Norling B: Proteomics of Synechocystis sp. strain PCC 6803. Identification of periplasmic proteins in cells grown at low and high salt concentrations. Eur J Biochem 2000, 267: 5900–5907.
Kurian D, Phadwal K, Maenpaa P: Proteomic characterization of acid stress response in Synechocystis sp. PCC 6803. Proteomics 2006, 6: 3614–3624.
Perez-Perez ME, Florencio FJ, Lindahl M: Selecting thioredoxins for disulphide proteomics: target proteomes of three thioredoxins from the cyanobacterium Synechocystis sp. PCC 6803. Proteomics 2006,6(Suppl 1):S186–195.
Herranen M, Battchikova N, Zhang P, Graf A, Sirpio S, Paakkarinen V, Aro EM: Towards functional proteomics of membrane protein complexes in Synechocystis sp. PCC 6803. Plant Physiol 2004, 134: 470–481.
Mata-Cabana A, Florencio FJ, Lindahl M: Membrane proteins from the cyanobacterium Synechocystis sp. PCC 6803 interacting with thioredoxin. Proteomics 2007, 7: 3953–3963.
Kurian D, Jansen T, Maenpaa P: Proteomic analysis of heterotrophy in Synechocystis sp. PCC 6803. Proteomics 2006, 6: 1483–1494.
Simon WJ, Hall JJ, Suzuki I, Murata N, Slabas AR: Proteomic study of the soluble proteins from the unicellular cyanobacterium Synechocystis sp. PCC6803 using automated matrix-assisted laser desorption/ionization-time of flight peptide mass fingerprinting. Proteomics 2002, 2: 1735–1742.
Slabas AR, Suzuki I, Murata N, Simon WJ, Hall JJ: Proteomic analysis of the heat shock response in Synechocystis PCC6803 and a thermally tolerant knockout strain lacking the histidine kinase 34 gene. Proteomics 2006, 6: 845–864.
Chong PK, Gan CS, Pham TK, Wright PC: Isobaric tags for relative and absolute quantitation (iTRAQ) reproducibility: Implication of multiple injections. J Proteome Res 2006, 5: 1232–1240.
Fulda S, Mikkat S, Huang F, Huckauf J, Marin K, Norling B, Hagemann M: Proteome analysis of salt stress response in the cyanobacterium Synechocystis sp. strain PCC 6803. Proteomics 2006, 6: 2733–2745.
Gan CS, Chong PK, Pham TK, Wright PC: Technical, experimental, and biological variations in isobaric tags for relative and absolute quantitation (iTRAQ). J Proteome Res 2007, 6: 821–827.
Gan CS, Reardon KF, Wright PC: Comparison of protein and peptide prefractionation methods for the shotgun proteomic analysis of Synechocystis sp. PCC 6803. Proteomics 2005, 5: 2468–2478.
Ishino Y, Okada H, Ikeuchi M, Taniguchi H: Mass spectrometry-based prokaryote gene annotation. Proteomics 2007, 7: 4053–4065.
Bryant DA: The cyanobacterial photosynthetic apparatus: comparison to those of higher plants and photosynthetic bacteria. In Photosynthetic Picoplankton Can Bull Fish Aquat Sci Edited by: Platt T, Li WKW. 1986, 214: 423–500.
Houben E, de Gier JW, van Wijk KJ: Insertion of leader peptidase into the thylakoid membrane during synthesis in a chloroplast translation system. Plant Cell 1999, 11: 1553–1564.
Tyystjarvi T, Herranen M, Aro EM: Regulation of translation elongation in cyanobacteria: membrane targeting of the ribosome nascent-chain complexes controls the synthesis of D1 protein. Mol Microbiol 2001, 40: 476–484.
Douglas SE: Chloroplast origin and evolution. In Molecular Biology of Cyanobacteria. Edited by: Bryant DA. Dordrecht, The Netherlands: Kluwer; 1994:91–118.
This work was supported in part by grants to PRC from Iowa Sate University. The authors thank the start-up fund awarded to WX from the University of Louisiana at Lafayette.
The authors declare that they have no competing interests.
YW, WX, and PRC conceived the study, YW and WX carried out the proteomics experiments and bioinformatic analysis. PRC and WX provided all the reagents and instruments. YW and WX drafted the manuscript, and all the authors read and approved the final manuscript.