- Open Access
An interactomics overview of the human and bovine milk proteome over lactation
Proteome Sciencevolume 15, Article number: 1 (2017)
Milk is the most important food for growth and development of the neonate, because of its nutrient composition and presence of many bioactive proteins. Differences between human and bovine milk in low abundant proteins have not been extensively studied. To better understand the differences between human and bovine milk, the qualitative and quantitative differences in the milk proteome as well as their changes over lactation were compared using both label-free and labelled proteomics techniques. These datasets were analysed and compared, to better understand the role of milk proteins in development of the newborn.
Human and bovine milk samples were prepared by using filter-aided sample preparation (FASP) combined with dimethyl labelling and analysed by nano LC LTQ-Orbitrap XL mass spectrometry.
The human and bovine milk proteome show similarities with regard to the distribution over biological functions, especially the dominant presence of enzymes, transport and immune-related proteins. At a quantitative level, the human and bovine milk proteome differed not only between species but also over lactation within species. Dominant enzymes that differed between species were those assisting in nutrient digestion, with bile salt-activated lipase being abundant in human milk and pancreatic ribonuclease being abundant in bovine milk. As lactation advances, immune-related proteins decreased slower in human milk compared to bovine milk. Notwithstanding these quantitative differences, analysis of human and bovine co-expression networks and protein-protein interaction networks indicated that a subset of milk proteins displayed highly similar interactions in each of the different networks, which may be related to the general importance of milk in nutrition and healthy development of the newborn.
Our findings promote a better understanding of the differences and similarities in dynamics of human and bovine milk proteins, thereby also providing guidance for further improvement of infant formula.
Milk is one of the richest foods, as it provides complete nutrition and bioactive components for healthy development of the newborn. These nutritional and bioactive components are essential for the neonate, for example for cognitive development, pathogen prevention, intestinal microflora modulation, and development of the immune system [1, 2]. Of these bioactive components, proteins have attracted great attention because of their importance in the protection of the neonate. With the development of proteomics techniques, more and more proteins, including both high and low abundant proteins, were characterized in the last few decades [3–5].
However, milk proteins are variable in presence and concentration due to many factors. One of the most obvious factors causing differences in protein concentration is species differences . Caseins accounts for 80% (w/w) of the bovine milk proteins, and for 50% of human milk proteins . In addition, β-lactoglobulin exists in bovine milk but cannot be found in human milk [6, 7]. Human and bovine milk diverge not only in their high abundant protein composition, but also in their low abundant protein composition. A total of 268 and 269 proteins were previously identified in human and bovine milk, respectively, in our previous study . Of these proteins, 44 from human milk and 51 from bovine milk were related to the host defense system. Specifically, the concentration of proteins involved in the mucosal immune system, immunoglobulin A, CD14, lactoferrin, and lysozyme, were present in much higher concentration in human milk than bovine milk .
Furthermore, milk proteins also differ in concentration over lactation. Immunoglobulins have been reported to change rapidly in concentration from colostrum to mature milk in both human [9, 10] and bovine milk [11–13]. Moreover, the low abundant proteins, such as complement proteins, lipid synthesis and transport proteins, and enzymes were also reported to change as lactation advances [14, 15]. However, the differences in changes of proteins over lactation has not been reported between human milk and bovine milk directly, although we reported the changes in the species separately [13, 16–18].
As human milk is used as reference and bovine milk is used as protein source for producing infant formula , the differences in the health outcomes between breastfed and formula-fed infants could be related to the differences in the nutrient intake . Breastfed infants were reported to have fewer infections (gastrointestinal infections, acute otitis media), reduced risk for celiac disease, obesity, and diabetes compared to formula-fed infants . Therefore, the aim of this study is to better understand the role of different proteins, especially those involved in immune activity, in both human milk and bovine milk through elaborating the existing data in qualitative and quantitative proteome  and their changes over lactation [13, 16–18]. Separate interactomics studies of human and bovine milk proteins have previously been performed, using published data collected from many different sources [20, 21]. In this study, the analysis is a comparative data analysis on both species simultaneously, where data has been collected on a single instrument [8, 13, 16–18], throughout lactation, allowing a better comparison between species.
In the current study, the human and bovine milk data in Data set 1  was reanalysed by Maxquant to give a more precise comparison in the quantitative differences between human and bovine milk proteins. The changes of both human and bovine milk proteome over lactation in Data set 2 [13, 16–18] were reanalysed using a co-expression (expression meaning the relative abundance) network approach and integrated with protein-protein interaction network data. The additional analysis enhances the comparison between human and bovine milk proteome from both qualitative and quantitative differences in milk proteome and their differences in changes over lactation. This should contribute to better understanding of the differences and similarities in biological functions networks of proteins, especially with regard to immune activity, in both the human and bovine milk proteome.
A total of 379 proteins were quantified through reanalyzing the human and bovine milk of data set 1 prepared by filter-aided sample preparation (FASP) and LC-MS/MS. The specific number of identified proteins in milk fat globule membrane (MFGM) and milk serum proteins for both human and bovine species are shown in Fig. 1. Of these quantified proteins, 93 proteins present in both species. Figure 2 shows that both human milk and bovine milk have similar distribution over biological functions in quantified MFGM and milk serum proteins. Transport proteins, enzymes, and immune-related proteins were the three dominant biological function groups in both human and bovine milk (Fig. 2). The biological enrichment of these three protein groups were shown in Additional file 1: Table S1. However, the number of proteins in these three dominant groups was different between human and bovine milk. Bovine milk contained a higher number of transport proteins than human milk (Fig. 2), which was dominated by lipid and protein transporters. Although the number of enzymes were similar, they were quite different in the type between human and bovine milk. The enzymes assisting nutrient digestion were bile salt-activated lipase (CEL), and lipoprotein lipase (LPL) alpha-trypsin chain 1 (PRSS1) in human milk (Table 1) [16, 18], whereas pancreatic ribonuclease 1 (RNASE1), LPL, and ribonuclease 4 (RNASE4) were dominant in bovine milk [13, 17].
Tables 1 and 2 show the quantitative differences of common MFGM and milk serum proteins between human and bovine milk. Lipid synthesis and transport proteins, including fatty acid-binding protein, heart (FABP3), perilipin-2 (PLIN2), butyrophilin subfamily 1 member A1 (BTN1A1), lactadherin (MFGE8), and platelet glycoprotein 4 (CD36), were present at approximately 10–100 times higher abundance in bovine MFGM (p < 0.05). Serum albumin (ALB), monocyte differentiation antigen CD14 (CD14), alpha-lactalbumin (LALBA), lactoferrin (LTF), toll-like receptor 2 (TLR2), alpha-1-antitrypsin (SERPINA1), alpha-1-antichymotrypsin (SERPINA3), clusterin (CLU), and polymeric immunoglobulin receptor (PIGR) showed higher concentrations in human milk, especially for ALB, LTF, SERPINA3, and CD14, which were around 20–100 times higher in human milk serum (p < 0.05).
Since milk serum protein content is far higher than MFGM protein content , the quantitative changes over lactation were only determined for milk serum. A total of 299 proteins were quantified in bovine milk serum [13, 17] and 247 in human milk serum [16, 18] by FASP and dimethyl labelling combined with LC-MS/MS. There were 71 common proteins quantified in human and bovine milk serum, with 34 of them quantified in every time point over lactation. In addition to the high number of transport proteins in bovine milk serum, the concentration of the transport proteins (calculated based on the summed intensity based absolute quantification (iBAQ values)) was higher in bovine milk serum than human milk serum, whereas enzymes were higher in human milk serum (Figs. 2 and 3).
Although the biological function distribution were similar in the identified proteins between human and bovine (Fig. 2), the quantitative changes of these protein groups differed over lactation (Fig. 3). Immune-related protein group decreased during the course of lactation, whereas transport protein and enzymes increased (Fig. 3). Moreover, the changing rate of the protein with the same functionality differed between species (Fig. 3); for instance, immune-related proteins, LTF, complement C3 (C3), PIGR, and osteopontin (SPP1) decreased much faster in bovine milk serum compared to human milk serum (Fig. 4). The changes in immune-related proteins over lactation are important for two reasons. Firstly, immune-related proteins had relatively higher concentration in human milk than bovine milk. Secondly, these proteins play important roles in the protection of the neonate, which may therefore be proteins of interest for application in infant formula. Hierarchical clustering (Fig. 4) shows that these immune-related proteins are correlated to each other. In addition to the correlation of proteins related to complement and coagulation cascades, such as C3, complement factor I (CFI), complement factor B (CFB), SERPINA1, antithrombin-III (SERPINC1), and alpha-2-HS-glycoprotein (AHSG) discussed before , CLU, alpha-1-acid glycoprotein 1 (ORM1), actin, cytoplasmic 1 (ACTB), LTF, SPP1, and PIGR also showed close interactions in both human and bovine milk serum (Fig. 4).
In order to compare the common human and bovine milk serum proteome at the network level, we converted our expression data to co-expression networks, and obtained available protein-protein interaction data for both species. Analysis of protein-protein interaction data indicated that the milk serum proteins quantified in our study are highly connected. For example, 310 interactions were observed for 66 human milk serum proteins, which is roughly 50 times higher than the number of interactions expected for randomly chosen proteins. The observed high interaction density was statistically significant according to the statistical test provided by STRING (p < 10−6).
Comparing the co-expression networks to each other, for 34 proteins quantified in every time point in both human and bovine milk serum, 18 were aligned to the equivalent protein in the other species. For these proteins, if they have expression similarity with another protein in human milk, it is likely that they also have expression similarity with that protein in bovine milk, and vice versa. For the other 16 proteins, network alignment indicated that this was not the case. In other words, these proteins have expression similarities with different proteins in human milk than in bovine milk, and are indicative of changes in the expression network between the two species (Fig. S1). The similarity between the human and bovine expression networks was also quantified using the correlation between the expression correlation coefficients. This resulted in a Pearson correlation coefficient of R = 0.23 (p < 10−7) between the expression Pearson correlation coefficients in human and bovine milk serum proteome. Comparing the human co-expression network with the protein interaction network, for 34 proteins, 17 were aligned to themselves. For these proteins, if they have expression similarity with another protein, it is likely that they also have protein interaction with that protein. Out of these, 13 proteins were among the above-mentioned 18 proteins which were aligned to the equivalent protein in the human-bovine co-expression network alignment. This indicates a common core of 13 proteins with relatively highly conserved interaction in each of the networks (Fig. 5). These include the immune-related C3, CLU, ACTB, SERPINA1, SPP1, PIGR, and LTF.
The large agreement between co-expression networks and protein interaction networks observed based on the network alignment (Additional file 2: Figure S1 and Additional file 3: Table S2) was confirmed by analysing the relation between interaction status in the protein-protein interaction network, and expression correlation (both in human and bovine milk, Additional file 4: Table S3). The average expression correlation coefficient of non-interacting proteins is −0.06 +/−0.37, whereas for interacting proteins it is 0.18+/−0.37 (human) and 0.14+/−0.51 (bovine) respectively (Fig. 6). According to a Kolmogorov-Smirnov test, the differences between the distribution of correlation coefficients for interacting and for non-interacting proteins is significant: p ~ 10−5 (human interacting vs non-interacting) and p ~ 10−3 (bovine interacting vs non-interacting), respectively. Similarly, a Mann–Whitney U Test indicated that the means are significantly different (p ~ 10−5 for human interacting vs non-interacting and p ~ 0.005 for bovine interacting vs non-interacting).
Previous studies described some comparisons of the milk proteome between species [20–22]; however, they only used single samples, either mature milk collected at certain lactation stages or a pooled samples from different lactation stage. Also some reviews [23, 24] on milk proteome were based on single species, with no comparisons between different species. This is because the data they used are from different studies. Differences in lactation stage, differences in sample preparation methods, and differences in instruments make it difficult to compare the proteome between species at the same time points over lactation. This study was the first one to compare the changes of milk protein profile between human and bovine species at the same time points from colostrum to 6 months lactation by using the same sample preparation method and the same instrument. Our comparative analysis between the human and bovine lactation proteome was performed by reanalysing data from several of our previous studies [8, 13, 16–18]. The time-based comparison between human and bovine milk proteins, may help us to know better the differences in the needs between infants and calves. This may also provide guidance on the improvement of infant formula composition on different stages. Although the data interpretation of the lactation stage studies is limited by the small sample size (n = 4) for both species, the separate results for bovine and human milk are similar to previously published studies on the biological functions of bovine and human milk protein, with many proteins in both species contributing to nutrient transport and immune protection [23, 24]. The annotation in this study gives a first insight in the comparison in the milk proteomes between human and bovine and their changes over lactation. The network analysis indicates that both the biological functions and the concentration of proteins have similarities between human and bovine milk. The reanalysed results in the current study should contribute to better understanding of the differences and similarities in the biological functions and micronutrients between human and bovine milk proteome.
A total of 390 proteins were quantified using Maxquant in both human and bovine milk (Fig. 1), which is higher compared to our previous study . However, the number of identified proteins were lower than that reported in previous studies [10, 20, 21, 23, 24]. First, this comparison is based on one study not on a large number of reviewed studies [23, 24]. Second, the lower number of identified proteins can be related to both the identification criteria (reducing identification confidence) and the extensive protein fractionation (increasing the proteome coverage but decreasing the precision of protein quantification), as discussed in our previous paper . Moreover, Maxquant was time cost-efficient in protein quantification. This indicates the advantages of Maxquant in quantifying milk proteins. The higher number of quantified proteins in data set 1 than data set 2 can be related to the differences in the preparation methods. Label free was used for dataset 1 and dimethyl labelling was used for dataset 2. The shift from label free to dimethyl labelling in two studies is because dimethyl labelling is much more sensitive and precise to pick up small differences between two samples . The lower number of quantified proteins in our studies compared with previous studies (e.g. 573 proteins from bovine milk , 1606  and 976  proteins from human milk) can be related to the extensive protein fractionation in these previous studies and less strict identification criteria as discussed in our previous paper .
The higher number of quantified MFGM proteins than milk serum proteins in both human and bovine (data set 1) is consistent with the numbers of identified proteins reported previously . It is not surprising, as MFGM represent the epithelial cell, the place where the milk fat is synthesised and secreted [26, 27]. The low amount of transport proteins in human milk can be mainly related to the absence of the major transport protein β-lactoglobulin (LGB) in human milk , which is the most abundant protein in bovine milk serum. In addition, the lower concentration of lipid synthesis and secretion proteins in human milk (Table 1 and 2) also contributes to the relatively low amount of transport proteins in human milk.
The relative high amount of enzymes (Fig. 3) and the high biological enrichment (Additional file 1: Table S1) in human milk can probably be attributed to the immature gastrointestinal tract of infants at birth. Although the development of the gastrointestinal tract starts from the fetal stage, the maturation of the gastrointestinal digestive function is not complete at birth . It experiences a dramatic switch in the nutrients from amniotic fluid before birth to colostrum after birth and the energy supply switches from glucose-dominated to lipid-dominated . This transition requires the digestion of lipids and proteins prior to their absorption in the gastrointestinal tract . The high abundant enzymes related to lipid and protein degradation in human milk, such as bile salt-activated lipase, lipoprotein lipase, trypsin, and cathepsin D , suggests that human milk itself contributes to the digestive capacity, thereby being able to more effectively deal with immature luminal digestion . The differences in the dominant digestive enzymes between human milk (bile salt-activated lipase) and bovine milk (ribonuclease pancreatic), which have been discussed in our previous papers  may thus reflect the differences in the needs for support of the digestion system between infants and calves.
Previous studies have reported that calves develop their own immune system in a few weeks , whereas infants produce their own immunoglobulins only after 2 or 3 months . The relatively higher amount and slower decrease of immune-related proteins in human milk (Fig. 3) may be related to the slower maturation of immune system in infants than calves, as hypothesized before . This hypothesis is consistent with the in-depth comparison between human and bovine milk proteome (Tables 1 and 2, Figs. 3 and 4).
However, the common proteins present in human and bovine milk (Fig. 1) suggest the similarity in the milk proteome between human and bovine. Several common immune-related proteins in the network analysis of both biological functions and co-expression levels (Fig. 5) indicate the comparable immunological functions of milk proteins in protecting the neonate. In addition to the importance of dominant immune-related proteins, such as LTF and immunoglobulins discussed previously [14, 15], the low abundant immune-related proteins, including C3, CFB, SERPINA1, ACTB, and SPP1 (Fig. 5), play important roles in the immune system, especially innate immune system [10, 15]. The high abundance of innate immune-related proteins in early lactation (Fig. 4) may be due to its rapid reaction against broad groups of pathogens in the gastrointestinal tract of the neonate [8, 34], especially just after birth. SERPINA1 plays a dual role in regulating the complement and coagulation pathway , but also protecting the immune-related proteins against degradation during digestion. ACTB not only plays a role in the cell cytoskeleton but is also involved in innate immune response, according to research using a mice model . SPP1 could protect the intestinal tract of infants against pathogens or bacteria, due to its cytokine-like properties and it being a key factor in the initiation of T helper 1 immune responses . PIGR is the receptor of immunoglobulins A and M, facilitating their secretion in the mammary gland. The high correlation between SERPINA1, LTF, C3, ACTB, SPP1, and PIGR (Fig. 5) in both human and bovine milk reflects the interactions between innate and adaptive immune system and the complex nature of biological interrelationships between milk proteins in protecting the neonate.
The other common proteins in Fig. 5, LTF, TF, ALB, vitamin D-binding (GC), play roles in transport and delivery of nutrients through binding minerals, vitamins, fatty acid, steroids, glucocorticoid/progestin, and heme derivatives, and thus facilitate their uptake in the intestinal tract . The correlation of these proteins in both human and bovine milk (Fig. 5) could be related to need for providing this range of micronutrients that are necessary for the growth of the neonate.
The distribution of expression correlation coefficients (Fig. 6) over lactation in both human and bovine milk proteome for protein pairs not interacting in the protein interaction network is shifted towards negative values compared to the distribution for protein pairs that are interacting. This suggests an interplay between protein-protein interactions and expression similarity. Such similarity between these different types of networks was also observed based on network alignment. In all mammals, milk provision is a complex process with changes in milk composition and interactions between parent and young beyond the straightforward nutritional function . The similarity in the milk proteome may be related to their main functions in providing nutrients and protection to the neonate. The differences in the milk proteome between species may be due to their unique lactation strategies to accommodate reproductive success and adapt to the specific environment. This suggests an interplay between protein-protein interactions and expression similarity.
The comparison of the milk proteome between human and bovine over lactation provides more information on the similarity and differences of milk protein profile over lactation. This study can be used as a start point for further biological function investigation of proteins discussed in the paper. Proteins differing between human and bovine are interesting from an infant nutrition point-of-view. Further evaluation of the biological significance of these proteins, and on the feasibility of the application of such proteins in infant formula can be conducted. With respect to the proteins with high similarity based on the network alignment, they may still differ in digestibility or have different nutritional values due to the differences in amino acid sequence and post-translation modifications between species. Further studying this will contribute to a better understanding of protein functionality in human and bovine milk, and may provide guidance on the improvement of infant formula.
The qualitative and quantitative differences between human and bovine milk proteome as well as the differences in the concentration changes over lactation help us to better understand the role of milk proteins in the development of the digestive and immune system of the neonates in general, including differences between infants and calves. The similarities in both protein-protein interaction network and expression correlation between human and bovine milk proteome indicates the importance of milk proteins in providing nutrients and protection to the neonate. This in-depth comparison between human and bovine milk contributes to a better understanding on the biological functions, especially immunological functions, of milk proteins between human and bovine.
Data set 1-Qualitative and quantitative differences between human and bovine milk proteome study
This data is based on the study of Hettinga, et al. . Human milk was collected from 10 healthy mothers between 3 and 10 months in lactation. Samples of 10 mL were collected and frozen for later analysis. After thawing, the 10 samples were pooled. One bovine tank milk sample was collected from the university farm “De Ossekampen” in Wageningen, The Netherlands, which was milk from 30 clinically healthy cows which were between 3 weeks and 10 months in lactation.
Data set 2-The comparison in the changes of human and bovine milk proteome over lactation
This data set is based on our previous studies [13, 16–18]. Human milk samples were collected from women who gave birth at the obstetric department in VU medical center (VUmc) in Amsterdam. All women who delivered singleton term infants (gestational age 37–42 weeks) were eligible for this study. Women with haemolysis elevated liver enzymes, low platelet syndrome, history of breast surgery, and (gestational) diabetes mellitus were excluded. The samples collected at week 1, 2, 3, 4, 8, 16, 24 were used for this study. Approximately 5–10 mL was collected in a polypropylene bottle after 1 min of pumping for every sample. and stored at −18 °C immediately afterwards.
Bovine milk was collected from four healthy cows in a farm in Zaffelaere, Belgium. The cows were milked using an automatic milking system. Samples were collected from day 0 to the end of lactation. Samples collected at day 0, 0.5, 1, 2, 3, 5, 9, 14, month 1, 2, 3, 6, 9 and the latest time point of the lactation (10 months for cow 1, 11 months for cow 2 and 12 months for cow 3, the latest time point was missed for cow 4) were used for this study. The samples were frozen immediately at −20 °C after collection and transferred frozen to the laboratory for further analysis.
Milk serum separation
The separation of milk serum was performed according to a previous study . The samples were centrifuged at 1,500 × g for 10 min at 10 °C (Beckman coulter Avanti J-26 XP centrifuge, rotor JA-25.15). The milk fat was removed and the obtained supernatant was transferred to the ultracentrifuge tubes followed by ultracentrifugation at 100,000 × g for 90 min at 4 °C (Beckman L-60, rotor 70 Ti). After ultracentrifugation, samples were separated into three phases. The top layer was remaining milk fat, the middle layer was milk serum (with some free soluble caseins), and the bottom layer (pellet) was casein. Milk serum was used for filter aided sample preparation as described below after the measurement of protein content by the BCA protein assay (Fisher Scientific).
Filter aided sample preparation
Filter aided sample preparation (FASP) was performed as previously described . Milk serum samples (20 μL), including samples of each time point and pooled samples of each included woman, were diluted in 100 mM Tris/HCl pH 8.0 + 4% SDS + 0.1 M Dithiotreitol (SDT-lysis buffer) to get a 1 μg/μL protein solution. Samples were then incubated for 10 min at 95 °C, and centrifuged at 18407 g for 10 min, after cooling down to room temperature. Twenty μL of each sample were directly added to the middle of 180 μL 0.05 M iodoacetamide/100 mM Tris/HCl pH 8.0 + 8 M urea (UT) in a low binding Eppendorf tube and incubated for 10 min while mildly shaking at room temperature. The sample was transferred to a Pall 3 K omega filter (10–20 kDa cutoff, OD003C34; Pall, Washington, NY, USA) and centrifuged at 15871 g for 30 min. Three repeated centrifugations at 15871 g for 30 min were carried out after adding three times 100 μL UT. After that, 110 μL 0.05 M NH4HCO3 in water (ABC) were added to the filter unit and the samples were centrifuged again at 15871 g for 30 min. Then, the filter was transferred to a new low-binding Eppendorf tube. One hundred μL ABC containing 0.5 μg trypsin were added followed by overnight incubation at room temperature. Finally, the sample was centrifuged at 15871 g for 30 min, and 3.5 μL 10% trifluoroacetic acid (TFA) were added to the filtrate to adjust the pH value of the sample to around 2. These samples were ready for dimethyl labeling.
The dimethyl labeling was carried out by on-column dimethyl labeling according to . The trypsin digested samples of pooled milk serum from each individual mothers and cows collected at the different time points were labeled with light reagent (the mix of CH2O and cyanoborohydride), whereas trypsin digested milk serum samples of the individual mothers and cows at each time point were labeled with heavy reagent (the mix of CD2O and cyanoborohydride). Stage tips containing 2 mg Lichroprep C18 (25 um particles) column material (C18+ Stage tip) were made in-house. The C18+ Stage tip column was washed 2 times with 200 μL methanol. The column was conditioned with 100 μL of 1 mL/L formic acid in water (HCOOH) after which samples were loaded on the C18+ Stage tip column. The column was washed with 100 μL 1 mL/L HCOOH, and then slowly flushed with 100 μL labeling reagent (0.2% CH2O or CD2O and 30 mM cyanoborohydride in 50 mM phosphate buffer pH 7.5) in about 10 min. The column was washed again with 200 μL 1 mL/L HCOOH. Finally, the labeled peptides were eluted with 50 μL of 70% acetonitrile/30% 1 mL/L HCOOH from the C18+ Stage tip columns. The samples were then dried in a vacuum concentrator (Eppendorf Vacufuge®) at 45 °C for 20 to 30 min until the volume of each sample decreased to 15 μL or less. The pairs of light dimethyl label and heavy dimethyl label were then mixed up and the volume was adjusted to exactly 100 μL by adding 1 mL/L HCOOH. These samples were ready for analysis by LC-MS/MS.
Eighteen μL of the trypsin digested and dimethyl labeled milk fractions were injected on a 0.10 × 30 mm Magic C18AQ 200A 5 μm beads (Michrom Bioresources Inc., USA) pre-concentration column (prepared in house) at a maximum pressure of 270 bar. Peptides were eluted from the pre-concentration column onto a 0.10 × 200 mm Prontosil 300-3-C18H Magic C18AQ 200A 3 μm analytical column with an acetonitrile gradient at a flow of 0.5 μL/min, using gradient elution from 8 to 33% acetonitrile in water with 0.5 v/v% acetic acid in 50 min. The column was washed using an increase in the percentage acetonitrile to 80% (with 20% water and 0.5 v/v% acetic acid in the acetonitrile and the water) in 3 min. A P777 Upchurch microcross was positioned between the pre-concentration and analytical column. An electrospray potential of 3.5 kV was applied directly to the eluent via a stainless steel needle fitted into the waste line of the microcross. Full scan positive mode FTMS spectra were measured between m/z 380 and 1400 on a LTQ-Orbitrap XL (Thermo electron, San Jose, CA, USA). CID fragmented MS/MS scans of the four most abundant doubly- and triply-charged peaks in the FTMS scan were recorded in data-dependent mode in the linear trap (MS/MS threshold = 5.000).
The acquired datasets were analyzed by using MaxQuant (Version 22.214.171.124, http://www.maxquant.org/) and the built-in Andromeda search engine with a UniProt human and bovine database (http://www.uniprot.org/; accessed March 2012). The search parameters were as follows: variable modifications of protein N-terminal acetylation and methionine oxidation, and fixed modification of cysteine carbamidomethylation. The minimum peptide length was set to 7 amino acids and a maximum of 2 missed cleavages was allowed for the search. Trypsin/P was selected as the semi-specific proteolytic enzyme. The global false discovery rate (FDR) cut off used for both peptides and proteins was 0.01 . Label-free quantitation was performed in MaxQuant. To further improve the quantification accuracy, only the razor/unique peptides were used for quantitative calculations. The other parameters used were the default settings in MaxQuant software for processing MS/MS data.
All known contaminants (i.e. keratins, trypsin), and proteins detected in less than half of the samples, were removed from each sample set of proteins identified. The origin and function of the identified proteins was taken from UniProtKB (http://www.uniprot.org/; accessed March 2012) for recommended protein name, gene name, and protein function. It was verified that the human and bovine proteins with the same protein name were orthologous using a reciprocal best BLAST hit approach. DAVID Bioinformatics Resource 6.7 (https://david.ncifcrf.gov/) was used for protein biological function classification and protein group enrichment. Protein concentrations were calculated as the average of all peptide peak intensities from five replicates divided by the number of theoretically observable tryptic peptides (intensity based absolute quantification, or iBAQ, [42, 43]). Perseus software v.126.96.36.199 (Martinsreid, Germany) was used to test for hierarchical clustering and significant differences between species. Hierarchical clustering in Perseus software was used for clustering proteins identified in both human and bovine milk based on their relative abundance. This procedure is performing hierarchical clustering of rows (proteins) and columns (samples) and produces a visual heat map representation of the clustered matrix. The ratios between the concentration found in human milk (milk fat globulin membrane-MFGM and serum) and bovine milk (MFGM and serum) were calculated as the difference (on 10log scale) of the iBAQ value of the human MFGM versus the bovine MFGM and human serum versus bovine serum. ANOVA was applied to compare MFGM and serum in both species, and the p-values obtained were adjusted with false discovery rate (FDR)-based correction in order to account for the effect of multiple comparisons.
Protein-protein interactions for proteins in both human and bovine milk proteome were obtained from STRING . In order to interpret the interaction density (number of observed interactions divided by total possible number of interactions) of milk proteins, this density was compared with the interaction density of all human/bovine STRING proteins. A statistical test for the significance of the observed high density in the milk proteome was performed using the approach provided by STRING .
For co-expression network analysis, a cutoff of 0.3 on the absolute value of the Pearson correlation was applied, in order to get a number of interactions in the co-expression networks that would be comparable to that in the STRING interaction networks. Pinalog  was used to align different networks to each other, taking into account both sequence similarity between proteins and topological similarity (i.e. similarity of interaction partners for each protein). For visualization, VANLO  and Cytoscape  were applied. Comparison of distributions with Kolmogorov-Smirnov test was performed using the R-function ks.test.
Actin, cytoplasmic 1
Butyrophilin subfamily 1 member A1
Monocyte differentiation antigen CD14
Platelet glycoprotein 4
Bile salt-activated lipase
Complement factor B
Complement factor I
Fatty acid-binding protein, heart
Filter-aided sample preparation
False discovery rate
- iBAQ Value:
intensity based absolute quantification
Milk fat globule membrane
Alpha-1-acid glycoprotein 1
Polymeric immunoglobulin receptor
Alpha-trypsin chain 1
Pancreatic ribonuclease 1
Toll-like receptor 2
Casado B, Affolter M, Kussmann M. OMICS-rooted studies of milk proteins, oligosaccharides and lipids. J Proteomics. 2009;73(2):196–208.
German JB, Dillard CJ, Ward RE. Bioactive components in milk. Curr Opin Clin Nutr Metab Care. 2002;5(6):653–8.
Reinhardt TA, Lippolis JD. Bovine milk fat globule membrane proteome. J Dairy Res. 2006;73(4):406–16.
Séverin S, Wenshui X. Milk biologically active components as nutraceuticals: Review. Crit Rev Food Sci Nutr. 2005;45(7–8):645–56.
Smolenski G, Haines S, Kwan FYS, Bond J, Farr V, Davis SR, Stelwagen K, Wheeler TT. Characterisation of host defence proteins in milk using a proteomic approach. J Proteome Res. 2007;6(1):207–15.
D’Auria E, Agostoni C, Giovannini M, Riva E, Zetterstrom R, Fortin R, Greppi GF, Bonizzi L, Roncada P. Proteomic evaluation of milk from different mammalian species as a substitute for breast milk. Acta Paediatr Int J Paediatr. 2005;94(12):1708–13.
Mercier JC, Vilotte JL. Structure and function of milk protein genes. J Dairy Sci. 1993;76(10):3079–98.
Hettinga K, van Valenberg H, de Vries S, Boeren S, van Hooijdonk T, van Arendonk J, Vervoort J. The host defense proteome of human and bovine milk. PLoS. One 2011;6(4):e19433.
Politis I, Chronopoulou R. Milk peptides and immune response in the neonate. Adv Exp Med Biol. 2008;606:253–69.
Zhang Q, Cundiff J, Maria S, McMahon R, Woo J, Davidson B, Morrow A. Quantitative analysis of the human milk whey proteome reveals developing milk and mammary-gland functions across the first year of lactation. Proteomes. 2013;1(2):128–58.
Stelwagen K, Carpenter E, Haigh B, Hodgkinson A, Wheeler TT. Immune components of bovine colostrum and milk. J Anim Sci. 2009;87(13 Suppl):3–9.
Senda A, Fukuda K, Ishii T, Urashima T. Changes in the bovine whey proteome during the early lactation period. Anim Sci J. 2011;82(5):698–706.
Zhang L, Boeren S, Hageman JA, van Hooijdonk T, Vervoort J, Hettinga K. Bovine milk proteome in the first 9 days: protein interactions in maturation of the immune and digestive system of the newborn. PLoS One. 2015;10(2):e0116710.
Liao Y, Alvarado R, Phinney B, Lonnerdal B. Proteomic characterization of human milk whey proteins during a twelve-month lactation period. J Proteome Res. 2011;10(4):1746–54.
Gao X, McMahon RJ, Woo JG, Davidson BS, Morrow AL, Zhang Q. Temporal changes in milk proteomes reveal developing milk functions. J Proteome Res. 2012;11(7):3897–907.
Zhang L, de Waard M, Verheijen H, Boeren S, Hageman JA, van Hooijdonk T, Vervoort J, van Goudoever JB, Hettinga K. Changes over lactation in breast milk serum proteins involved in the maturation of immune and digestive system of the infant. J Proteomics. 2016;147:40–7. http://dx.doi.org/10.1016/j.jprot.2016.02.005.
Zhang L, Boeren S, Hageman JA, van Hooijdonk T, Vervoort J, Hettinga K. Perspective on calf and mammary gland development through changes in the bovine milk proteome over a complete lactation. J Dairy Sci. 2015;98(8):5362–73.
Zhang L, de Waard M, Verheijen H, et al. Changes over lactation in breast milk serum proteins involved in the maturation of immune and digestive system of the infant. Data Brief. 2016;7:362–5. doi:10.1016/j.dib.2016.02.046.
Hernell O. Human milk vs. cow’s milk and the evolution of infant formulas. In: Nestle Nutrition Workshop Series: Pediatric Program. 67th ed. 2011. p. 17–28.
Reinhardt TA, Lippolis JD, Nonnecke BJ, Sacco RE. Bovine milk exosome proteome. J Proteomics. 2012;75(5):1486–92.
Reinhardt TA, Lippolis JD. Developmental changes in the milk fat globule membrane proteome during the transition from colostrum to milk. J Dairy Sci. 2008;91(6):2307–18.
Beck KL, Weber D, Phinney BS, Smilowitz JT, Hinde K, Lonnerdal B, Korf I, Lemay DG. Comparative proteomics of human and macaque milk reveals species-specific nutrition during postnatal development. J Proteome Res. 2015;14:2143–57.
D’Alessandro A, Scaloni A, Zolla L. Human milk proteins: an interactomics and updated functional overview. J Proteome Res. 2010;9(7):3339–73.
D’Alessandro A, Zolla L, Scaloni A. The bovine milk proteome: cherishing, nourishing and fostering molecular complexity. An interactomics and functional overview. Mol BioSyst. 2011;7:579–97.
Lu J, Boeren S, de Vries SC, van Valenberg HJ, Vervoort J, Hettinga K. Filter-aided sample preparation with dimethyl labeling to identify and quantify milk fat globule membrane proteins. J Proteomics. 2011;75(1):34–43.
McManaman JL, Neville MC. Mammary physiology and milk secretion. Adv Drug Deliv Rev. 2003;55(5):629–41.
Lu J, van Hooijdonk T, Boeren S, Vervoort J, Hettinga K. Identification of lipid synthesis and secretion proteins in bovine milk. J Dairy Res. 2014;81(1):65–72.
Hinz K, O’Connor PM, Huppertz T, Ross RP, Kelly AL. Comparison of the principal proteins in bovine, caprine, buffalo, equine and camel milk. J Dairy Res. 2012;79(2):185–91.
Lindquist S, Hernell O. Lipid digestion and absorption in early life: an update. Curr Opin Clin Nutr Metab Care. 2010;13(3):314–20.
Abrahamse E, Minekus M, van Aken GA, van de Heijning B, Knol J, Bartke N, Oozeer R, van der Beek EM, Ludwig T. Development of the digestive system-experimental challenges and approaches of infant lipid digestion. Food Digestion. 2012;3(1–3):63–77.
Khaldi N, Vijayakumar V, Dallas DC, Guerrero A, Wickramasinghe S, Smilowitz JT, Medrano JF, Lebrilla CB, Shields DC, German JB. Predicting the important enzymes in human breast milk digestion. J Agric Food Chem. 2014;62(29):7225–32.
Dallas DC, Smink CJ, Robinson RC, Tian T, Guerrero A, Parker EA, Smilowitz JT, Hettinga KA, Underwood MA, Lebrilla CB, et al. Endogenous human milk peptide release is greater after preterm birth than term birth. J Nutr. 2015;145(3):425–33.
Chase CC, Hurley DJ, Reber AJ. Neonatal immune development in the calf and its impact on vaccine response. Vet Clin North Am Food Anim Pract. 2008;24(1):87–104.
Jensen GS, Patel D, Benson KF. A novel extract from bovine colostrum whey supports innate immune functions. II. Rapid changes in cellular immune function in humans. Prev Med. 2012;54:124–9.
Law RH, Zhang Q, McGowan S, Buckle AM, Silverman GA, Wong W, Rosado CJ, Langendorf CG, Pike RN, Bird PI, Whisstock JC. An overview of the serpin superfamily. Genome Biol. 2006;7(5):216.
Man SM, Ekpenyong A, Tourlomousis P, Achouri S, Cammarota E, Hughes K, Rizzo A, Ng G, Wright JA, Cicuta P, et al. Actin polymerization as a key innate immune effector mechanism to control Salmonella infection. Proc Natl Acad Sci U S A. 2014;111(49):17588–93.
Schack L, Lange A, Kelsen J, Agnholt J, Christensen B, Petersen TE, Sørensen ES. Considerable variation in the concentration of osteopontin in human milk, bovine milk, and infant formulas. J Dairy Sci. 2009;92(11):5378–85.
Lonnerdal B. Nutritional and physiologic significance of human milk proteins. Am J Clin Nutr. 2003;77(6):1537s–43s.
Lefèvre CM, Sharp JA, Nicholas KR. Evolution of lactation: Ancient origin and extreme adaptations of the lactation system. In: Annual Review of Genomics and Human Genetics. 11th ed. 2010. p. 219–38.
Wisniewski JR, Zougman A, Nagaraj N, Mann M. Universal sample preparation method for proteome analysis. Nat Methods. 2009;6:359–62.
Michalski A, Cox J, Mann M. More than 100,000 detectable peptide species elute in single shotgun proteomics runs but the majority is inaccessible to data-dependent LC-MS/MS. J Proteome Res. 2011;10:1785–93.
Malmström J, Beck M, Schmidt A, Lange V, Deutsch EW, Aebersold R. Proteome-wide cellular protein concentrations of the human pathogen Leptospira interrogans. Nature. 2009;460(7256):762–5.
Schwanhausser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M. Global quantification of mammalian gene expression control. Nature. 2011;473(7347):337–42.
Szklarczyk D, Franceschini A, Wyder S, Forslund K, Heller D, Huerta-Cepas J, Simonovic M, Roth A, Santos A, Tsafou KP, et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic Acids Res. 2015;43(Database issue):D447–452.
Franceschini A, Szklarczyk D, Frankild S, Kuhn M, Simonovic M, Roth A, Lin J, Minguez P, Bork P, von Mering C, Jensen LJ. STRING v9.1: protein-protein interaction networks, with increased coverage and integration. Nucleic Acids Res. 2013;41(Database issue):D808–15.
Phan HT, Sternberg MJ. PINALOG: a novel approach to align protein interaction networks--implications for complex detection and function prediction. Bioinformatics. 2012;28(9):1239–45.
Brasch S, Linsen L, Fuellen G. VANLO--interactive visual exploration of aligned biological networks. BMC bioinformatics. 2009;10:327.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504.
We thank Marita de Waard, Hester Verheijen and Hans van Goudoever for collecting the human milk samples. We thank Jeroen Heck for collecting the bovine milk samples. We thank Jos A. Hageman for the statistical support in the primary data analysis. We thank Sjef Boeren for performing the LC-MS/MS on all samples.
Availability of data and materials
LZ designed the experiment, performed sample preparation and preliminary data analysis, and drafted the manuscript. AvD performed the interactomics data analysis, participated in discussion on result interpretation, and wrote the data analysis section and part of the results and discussion of the manuscript. KH participated in the experiment design and data interpretation discussion, and revised the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
Bovine milk: No specific permissions were required for this sample collection, as samples were taken from the milk collected during regular milking.
Human milk data set 1: Milk samples were donated anonymously for this study and pooled before use, so IRB approval was not required.
Human milk data set 2: The study was approved by the institutional medical ethical review board of VU medical center and written informed consent was obtained from all participants.
The biological functional enrichment of immunity, transport and enzyme protein groups in both human and bovine milk. (DOCX 13 kb)
Network alignment between bovine (red) and human (green) co-expression networks. Equivalent nodes are connected by thin straight lines and are at comparable positions in the two networks. (TIF 18148 kb)
Alignment between bovine and human co-expression networks. (DOCX 14 kb)
Alignment between protein interaction network and human co-expression network. (DOCX 15 kb)