Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique

Boschetti, Egisto; Righetti, Pier Giorgio

doi:10.3390/ijms241210329

Open AccessReview

Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique

by

Egisto Boschetti

^1,* and

Pier Giorgio Righetti

²

¹

JAM Conseil, 92200 Neuilly-sur-Seine, France

²

Department of Chemistry, Politecnico di Milano, 20133 Milan, Italy

^*

Author to whom correspondence should be addressed.

Int. J. Mol. Sci. 2023, 24(12), 10329; https://doi.org/10.3390/ijms241210329

Submission received: 5 May 2023 / Revised: 9 June 2023 / Accepted: 16 June 2023 / Published: 19 June 2023

(This article belongs to the Special Issue Recent Advances of Proteomics in Human Health and Disease)

Download

Browse Figures

Review Reports Versions Notes

Abstract

The discovery of low- and very low-abundance proteins in medical applications is considered a key success factor in various important domains. To reach this category of proteins, it is essential to adopt procedures consisting of the selective enrichment of species that are present at extremely low concentrations. In the past few years pathways towards this objective have been proposed. In this review, a general landscape of the enrichment technology situation is made first with the presentation and the use of combinatorial peptide libraries. Then, a description of this peculiar technology for the identification of early-stage biomarkers for well-known pathologies with concrete examples is given. In another field of medical applications, the determination of host cell protein traces potentially present in recombinant therapeutic proteins, such as antibodies, is discussed along with their potentially deleterious effects on the health of patients on the one hand, and on the stability of these biodrugs on the other hand. Various additional applications of medical interest are disclosed for biological fluids investigations where the target proteins are present at very low concentrations (e.g., protein allergens).

Keywords:

proteomics; enrichment; medical applications; low-abundance proteins

1. Introduction

In most biological fluids or extracts, proteins are present in a large number and their individual concentration difference can span over several orders of magnitude. Typical examples are given by blood serum where the dynamic concentration range is estimated at 12–15 orders of magnitude with albumin which is a largely dominant protein [1]. However, this is not the only biological sample presenting such a characteristic. The lysate of red blood cells is another example where the local concentration of the main protein, hemoglobin, is about 300 mg/mL (representing approximately 95% of the total proteome) and the least concentrated proteins are well below the ng value [2]. Plant fluids and extracts are not in a very different situation since the most abundant protein by far in leaves is RuBisCO (ribulose-1,5-bisphosphate carboxylase/oxygenase) [3] that represents more than 40% of the total protein mass and in seeds extracts are storage proteins such as β-conglycinin and glycinin accounting for about 80% of the proteins [4]. This situation creates difficulties when considering the detection and identification of the most dilutes polypeptides.

Proteins that are of high abundance mask the signal of many others of very low abundance escaping thus detection by current available instrumentation and analytical methods. This is why the exploration of low-abundance proteins (LAP) cannot be performed directly, but rather using dedicated tricks. Chemical depletion, immunosubtraction, affinity depletion, molecular filtration and precipitation are the most common. Thus, the perfection of methods allowing the detection of low-abundance proteins is essential for targeted applications. They allow for establishing a better knowledge of proteomes and the understanding of the protein interactions. The detection of protein traces also helps the design of purification procedures of isolated biopharmaceuticals such as, for instance, purified vaccines or recombinant monoclonal antibodies. Spontaneous or induced subtle modifications of protein expression are also another aspect where reaching low- or very low-abundance proteins is critical.

This review is mostly dedicated to one of the major enrichment methods based on the use of combinatorial peptide ligand libraries.

2. Current Methods for Low-Abundance Protein Enrichment

The large dynamic concentration range of most protein extracts is a hard technical obstacle when attempting to visualize and identify proteins of low abundance. The number of proteins present is extremely large and can reach several thousand or more. In addition, their individual concentration is very different; among a few concentrated proteins are a large number of others with a concentration close to ng/mL or lower. Useless to say that with such a low concentration it is almost impossible to detect all proteins at once for at least two reasons: (i) the capabilities of most current analytical methods are restricted to about 4–5 orders of magnitude and (ii) the signal of high abundance proteins obscures the signal of the most dilutes forms. It is within this context that the interest of scientists is evidenced by the appearance of a growing number of published reports as illustrated in Figure 1.

To circumvent this difficulty, the most obvious option is to eliminate the high-abundance protein (HAP) group. Historically, this approach started in 2003 with Pieper et al. [5] with the proposal of using a solid phase carrying few immobilized antibodies. This approach was intended to subtract a few of the most abundant proteins from human serum. Later on, immunosubtraction has been extended to a larger number of plasma proteins [6] up to about 60 [7] with a relatively modest advantage compared to the initial proposal. Close to the advantage of removing high-abundance proteins, problems have been identified such as massive co-depletion phenomena of unexpected other proteins. In some cases, the co-depletion was so massive at a point that the number of removed proteins was larger than what was found in the depleted sample [8]. In immunosubtraction, another major drawback observed is a dilution effect that, without a problematic concentration step, renders even more complicated the detection of most low-abundance proteins. Moreover, this enrichment process involving antibodies is so specific that can only be applicable to restricted biological samples. Alternative methods have been described for the enrichment of protein groups such as glycoproteins [9] and phosphoproteins [10]. Glycoproteins are the result of post-translational modifications with the addition of various glycan structures conferring singular properties of recognition and other functions. Representing a large diversity and number, they constitute a homogeneous group comprising high- and low-abundance species. They have aroused particular interest insofar as certain unexpected antigens are expressed in carcinogenesis processes and may constitute a possible path toward the discovery of biomarkers of interest [11]. Thus, dedicated glycan-based enrichment technologies are applied, such as affinity adsorption on immobilized lectins [12] and boronate-based affinity ligands [13].

Concerning phosphoproteins, about 30% of human proteins are phosphorylated; they participate in many biological processes such as signaling and many of them are of low abundance. Their detection, identification and analytical determinations are challenging; thus, they constitute a group of products where it makes sense to find methods of enrichment. Various approaches have been described such as hydroxyapatite adsorption [14], metal chelating chromatography [15] and immunoaffinity chromatography [16] with specific antibodies. The adsorption on these solid phases is not always bispecific and consequently contributes to the co-adoption of other proteins. Nevertheless, the associated concentration effect of these methods contributes to detecting low-abundance phosphoproteins by mass spectrometry.

In essence, to enrich for LAP four main approaches are described: fractionation methods such as liquid chromatography, precipitation, subtraction of high-abundance proteins, targeted capture of low-abundance proteins and reduction of dynamic concentration range by CPLLs (combinatorial peptide ligand library). The main characteristics of each method are summarized in Table 1.

These enrichment method proposals have been described for animal proteomes [17], plant proteomes [18,19] and even recombinant therapeutic proteins when attempting to detect contaminating proteins from the host cells. In this domain, dedicated methods are described. They are the depletion of the largely dominant recombinant protein [20], the immunoprecipitation of protein impurity traces [21], cutoff filtration [22], size exclusion [23], various chromatographic procedures and CPLL. These approaches are described in a few published application reviews [24,25,26,27].

3. Combinatorial Peptide Ligand Library Technology

The enrichment process is in essence a separation method able to isolate a group of proteins from a large ballast of other polypeptides. The group of proteins in question must, thus, comprise a common property or similarities in structure exploited for the fractionation process. The removal of a large proportion of proteins from the sample greatly facilitates further analytical operations. The major feature of the CPLL-based procedures is not the separation of different species of proteins but rather the reduction of the concentration of the most represented proteins while increasing the concentration of proteins of low abundance [25].

Since the development of the CPLL technology, thousands of scientific publications report data on proteomics investigations when low-abundance proteins are the objective of the study. The diversity of captured protein, the diversity of biological fluids and the diversity of species (extracts from animals, plants, and bacteria) are witnesses of the large applicability of the principle as described below. This technology demonstrated a relatively easy way to identify proteins that are normally undetectable without any enrichment effect.

The solid phase combinatorial peptide library is a mixed bed of affinity sorbents for the capture of proteins. Each bead supports millions of copies of a unique hexapeptide structure made using combinatorial synthesis. Considering that for the synthesis of hexapeptides 16 natural amino acids are used, the library comprises a population of linear hexapeptides amounting to 16–17 million different structures. Thus, in principle, an appropriate volume of beads contains enough ligand partners to interact with just about each protein present in a complex proteome. The density of these ligands is about 40–60 µmol per mL of bead volume.

When CPLL is put in contact with a protein extract or a biological fluid, proteins are adsorbed by affinity interaction on their corresponding hexapeptide beads. This reaction occurs up to the bead saturation. Saturation is very rapidly reached with high-abundance proteins while for the very dilutes ones (the low-abundance proteins) it is only reached when the sample volume is large enough to meet the bead binding capacity. In that way, the amount of high-abundance proteins captured by the beads is limited as a consequence of bead saturation; the excess of these abundant proteins remains as a consequence in the supernatant. On the contrary proteins of low abundance are progressively concentrated as long as the offered volume of sample to the solid phase is increased.

The interaction between the captured protein and the hexapeptide ligands is affected by the composition of the medium. Most generally, the process of protein loading is conducted under physiological conditions. On the contrary, the protein collection is operated by using chemical agents capable to annihilate the molecular interactions. All proteins captured are thus collected for further analysis. Several possibilities have been described over the years [25] for conducting a single desorption step or a sequential elution mode. An alternative to the elution procedures is the digestion of captured proteins directly on beads and collected peptides analyzed by mass spectrometry [28,29]. The on-bead protein trypsination prior to mass spectrometry was recently further optimized and improved by extending the digestion time and also by making a pre-digestion with Lys-C endopeptidase [30] producing a large number of peptides from each protein. It is here important to say that even if some peptides are still strongly attached to the beads or lost after digestion, the mass spectrometry analysis results are not degraded because the minimum number of peptides needed for reliable protein identification is limited to two. The increased number of detected proteins, the lower risk of protein losses due to too hard elution conditions and the simplified protocol with better reproducibility, should play in favor of the on-bead protein digestion option, especially when looking for protein markers discovery and for the detection of protein impurities present in biopharmaceuticals.

The enrichment process based on solid phases combinatorial peptide ligand library has been repeatedly compared to the popular immunosubtraction where applicable, especially in proteomics investigations involving human serum or plasma. Frequently, the advantage of the number of gene products discovered has been attributed to CPLL [31,32,33]. Recently, a paper comparing several approaches has been published where experimental results indicated remarkably better performance than immunosubtraction based on 14 major antibodies [34]. Figure 2 reports the two-dimensional electrophoresis results of one comparative experiment on human serum.

Spectacular demonstrations of CPLL performance are given by three pioneering papers. The first was centred on human plasma proteins [35] where the number of gene products found after low-abundance proteins enrichment with CPLL reached 3869, a number much larger than what was known earlier in this domain at these early days of proteomics investigation and with a remarkable reproducibility.

A second report involved the findings in red blood cell proteins [36]. In this particular case, the number of proteins found was 1578 while what was known before was limited to a few hundred. This was the first ever deep exploration of such a proteome starting from a cytoplasmic fraction of a highly purified human red blood cell preparation where unexpected minor proteins were identified.

A third significant example is reported with the exploration of hemolymph proteins from Limulus polyphemus, a living fossil arthropod from the east coast of North America [37]. The direct proteome analysis by two-dimensional electrophoresis evidenced 200 protein spots while after treatment with CPLL 890 protein spots were present throughout a pH range of 3–10 and a molecular mass between 8 and 240 kDa. The following mass spectrometry analysis gave an extremely large number of proteins (around 7500), most of them unmatched because they could not be attributed to gene products due to the absence of proper annotations. A few years after, another study [38] identified 1100 unique proteins with a good confidence value upon treatment of the hemolymph with CPLL.

4. Identification of Early-Stage Biomarkers of Human Diseases

It is a general consensus that the discovery of early biomarkers allows for better management of diseases based on the concept that a therapeutic treatment may be more effective at the initial stage of a given pathology [39]. This is particularly true for pathologies having a low survival rate. Within this context, the major obstacle to overcome is to devise a discovery method for relevant biomarkers when their expression is still extremely limited. Their detection is very challenging because of the massive presence of all other current abundant proteins that mask the signal of very dilute disease-related proteins. It is really like “looking for a needle in a haystack”, as indicated earlier [40].

A single biomarker (up- or down-regulated) is rarely pertinent for the formal designation of a pathology; hence, it is generally admitted that the use of combinations of several of them would improve the predictivity of a given disease. The more numerous the biomarkers considered as a whole, the better the diagnosis value; however, even this approach is not a full guarantee of a correct diagnosis without adding other biological variables such as sex, age and predispositions.

The question is how to approach the search for early biomarkers. One way would be to target a given organ where the disease is presumably located with the presumption that the misregulated expression of pathology-related proteins occurs and where their concentration is supposed to be significant [41]. This principle has been used a number of times with interesting results. Nevertheless, high-abundance proteins are still present, perturbating the detection of potential biomarkers.

Among other interesting approaches is targeting circulating exosomes in biological fluids. They are about 100 nm membrane vesicles released by mammalian cells, including malignant ones, capable to transfer proteins in charge of cellular communication functions. They are potential reservoirs of protein markers of interest, especially when they come from malignant cells. Although of high interest, this option is quite laborious because it is contingent upon the isolation of these vesicles among numerous other similar bodies [42]. The interest of such an approach has been shown with urinary exosomes where the presence of a few low-abundance antigens was directly correlated with prostate cancer [43]. Nonetheless, other authors are less optimistic, indicating that in spite of useful clinical information, results are still preliminary and not fully conclusive [44].

At early disease stages of protein expression, the signs of proteome difference between a disease against a control are still extremely hard to recognize. Here, the enrichment methods are essential to visualize the weak signs of modified protein expression. Since the beginning of its development, CPLL technology was recognized as a confident and reliable approach for biomarker discovery. To evidence the presence of a potential biomarker, a direct comparison between a pathological situation must be made against a control (see scheme in Figure 3). Variations are possible as extensively described [25] according to the objectives of the application with many discovered biomarkers. It is out of the scope of this paper to make an exhaustive list of the described applications; thus, only a few representative examples of major diseases, such as cancers, are summarized first.

Identifying early biological signs to improve the survival rates of certain cancers is critical. This is the case for pancreatic tumors, a highly invasive malignant disease with a preponderant lethal outcome [45]. The most common biomarker used is the carbohydrate antigen CA 19–9, but it has insufficient ability to detect pancreatic cancer because of its poor specificity and sensitivity. Relevant papers have reported the early discovery of pertinent markers using combinatorial peptide ligand libraries using a CPLL-treated supernatant of pancreatic cancer cultured cells [46]. Among differentially regulated proteins found, six of them were associated with survival. The most relevant were glucagon-like peptide-1, apolipoproteins CII and CIII and zinc-alpha-2-glycoprotein.

Another aggressive cancer is HCC (hepatocellular carcinoma) [47] for which only limited therapeutic options are currently available. The biochemical diagnosis is classically based on the determination of the level of AFP (α-fetoprotein), a non-reliable marker in terms of sensitivity and specificity, present in many other cancers. A promising advancement is the detection of early-stage markers using enrichment methodologies. As suggested by Mustafa et al. [48], a biological sample (e.g., blood serum) treated with CPLL contributed significantly to this undertaking. The enriched serum using CPLL is resolved by 2D-DIGE (two-dimensional difference gel electrophoresis) and differentially expressed proteins identified by mass spectrometry, followed by a targeted quantitation using SRM (selected reaction monitoring) method. From a quite large study [49] using CPLL followed by analysis 2D (two-dimensional) electrophoresis, 4000 spots have been investigated. Among the 24 misregulated proteins found, the authors centred their attention on ApoA1, which is down-regulated by 2.45 fold. The reason for this choice was its known ability to suppress the expression of adhesion molecules and to inhibit tumor necrosis factor-α, two important factors of cancer development. No good correlation was found with AFP, as described above. This study suggests monitoring ApoA1 to follow the effectiveness of therapeutic treatments. Beyond cancers, other pathological situations are at the centre of early biomarker discovery of diagnosis/prognosis interest. Significant recent examples starting from various body fluids are illustrated hereafter.

Blood serum is the most investigated biological fluid due to its continuous circulation throughout the body from where it is supposed to collect all possible proteins from various organs, including low-abundance protein markers expressed within localized organs. Unexpected applications are progressively published. An example is given by the identification of prognostic markers of severe COVID-19 [50] from CPLL-treated serum samples from infected patients (see below for more details).

Another serum-based biomarker discovery worthy of interest is exemplified by RA (rheumatoid arthritis). Upon CPLL serum treatments at least two routes have been followed to the identification of specific signatures of the pathology. According to the classical way of identification of biomarkers, more than two dozen proteins were differentially expressed. Some were down-regulated, such as paraoxonase/arylesterase 1, proteoglycan 4 and plasminogen, while others were up-regulated, such as, for example, apolipoprotein E, thrombospondin-1 and ficolin-2. The latter represented a serious potential marker for diagnostic applications. Dedicated quantifications by ELISA assay (enzyme-linked immunosorbent assay) indicated the extent of the quantitative expression modification [51].

The second route followed for the discovery of specific signatures of RA is based on a strong presence of antibodies against citrullinated proteins that appear long before the development of the pathology. On these bases, a study has been established to determine the presence of citrullinated autoantigens among the pool of overexpressed proteins [52]. Serum samples from patients were subjected to treatment with CPLL and protein abnormalities were determined by mass spectrometry. A set of 135 misexpressed proteins was found, 11 of which contained citrulline. Exposed to anti-citrulline antibodies in patients with RA, autoantibodies against citrullinated lipopolysaccharide-binding protein were found to be particularly increased. This study concluded that the presence of anti-lipopolysaccharide binding protein antibodies could become markers not only for the diagnosis of rheumatoid arthritis, but also for defining its severity.

Still, within the discovery of markers of importance, atherosclerosis, a devastating cause of cerebrovascular and cardiovascular events, has been considered a subject of investigation. To get a global view of protein expression including low-abundance proteins several projects were centred on the use of CPLL as a means to enrich rare proteins. Enriched carotid tissue protein extracts showed important differences in low-abundance proteins between non-complicated and complicated plaques [53]. Major alterations were essentially found for aldehyde dehydrogenase, heat-shock protein 27, protein kinase C delta-binding and moesin. More recently, protein signatures of atherosclerotic plaques have been searched in the blood [54]. It appeared that the complexity of plaque rupture as the initial cause of stroke could probably be due to several proteins. A set of 76 proteins have been identified while analyzing unstable atherosclerotic plaques. Part of such a panel could be considered as prognostic signs of plaque instability.

On another subject, to predict the outcome of osteotomy and to prepare for the best treatment, dedicated protein multi-markers have been identified from CPLL-treated synovial fluid. Findings were defined as very useful to help classify patients suited to treatment with osteotomy [55].

Other body fluids such as saliva [56], urine [57] and cerebrospinal fluid [58] have been explored repeatedly with and without enrichment effects in order to detect low-abundance protein markers with benefits related to outcome and to determine the best therapeutic treatment before the advent of critical situations.

5. Detection of Protein Impurity Traces from Recombinant Biopharmaceuticals

Several bioproducts for therapeutic applications are presently produced by recombinant procaryotic or eucaryotic cell cultures. The most representative are monoclonal antibodies. All these recombinant products need to be purified in order to eliminate the components from either the cell culture medium or from co-expressed proteins, including proteins from the cell lysis during their life cycle. While purification processes are very efficient, some impurities are always present in trace amounts in the purified protein. The nature of these impurities is very diverse and can adversely affect the stability of recombinant proteins and induce immunogenic reactions in patients. These protein impurities called HCP are tolerated all together within the limit of 100 ppm [59], but this figure does not give a good picture of the reality because it covers an unidentified number of HCP. Protein impurities may not always be the same and their relative concentration may be different even if the total amount is similar. Commonly, quantification is performed by global immunochemical assays using polyclonal antibodies against all host cell proteins [60]. While formal identification is a priority objective, it comes up against their very low concentration and the massive presence of the main recombinant therapeutic protein. Very often the dynamic concentration range spans over several orders of magnitude, which suggests using enrichment techniques developed in proteomic studies. Various approaches have been proposed; the most current are the removal of the recombinant protein [20,61,62], the immunoprecipitation of HCP [63], molecular size discriminating methods (cutoff filtration [22], size exclusion [23]) and various chromatographic procedures. In addition to these general enrichment technologies, CPLLs appear to be a very effective way to enhance the presence of low-abundance proteins while largely diminishing the recombinant dominant recombinant protein (see previous sections). A pioneering work on this principle was published in 2006–2007 and demonstrated the usefulness of highlighting the presence of host impurities in highly purified recombinant proteins [64,65] (see examples in Figure 4). It took several years before it was formally applied as a solution to concrete problems such as biodrug stability and patient safety. In a demonstration example [66], a recombinant protein expressed in CHO (Chinese hamster ovary) cells was spiked with several foreign proteins in a range of concentration between 10 and 1000 ppm. The sample was submitted to LC-MS/MS analysis before and after treatment with CPLL. While no spiked proteins were detected in the control, the CPLL-treated sample showed all the spiked proteins and, in addition, revealed the presence of 30 other host cell proteins with an enrichment factor estimated between 80 and 700 fold.

In another study [63], the enrichment with a similar technology reached 1000 fold from a sample of purified monoclonal antibodies. When applied to a commercial recombinant monoclonal antibody, 527 proteins from engineered CHO cells appeared with an enrichment rate of 100–400 fold.

In an attempt to improve the effectiveness of the enrichment technology, the capture process of HCP from monoclonal antibodies has been optimized by limited trypsin digestion [67]. Here, the number of identified proteins was in the best case 850 with a reproducibility of more than 80%; the enrichment was up to 7694 fold with the capability to detect 0.05 ppm.

More recently [68], the enrichment of foreign proteins from the host cells by CPLL was followed by an improved method of quantitation. It is actually important to have accurate quantitative data about certain HCP that are particularly detrimental to biodrug stability and have the certainty that they are present at a concentration below a critical level. This is the case for several degrading enzymes, such as esterases, thioesterases, lipases and carboxypeptidases, that could be reliably quantified to sub-ppm level with good accuracy and precision.

At this stage, the detection, identification and correct quantitation of each critical protein impurity can become a current practice with the support of enrichment techniques.

6. Discovery of Low-Concentration Allergens

Allergy is an immune response of an organism against a foreign molecule called allergen. The latter are usually proteins of different origins that can activate a cascade of events producing undesirable effects that can go as far as the death of the organism. Not all heterologous proteins are recognized by the body as allergens, but when they are, there is a specific interaction with the body’s immunoglobulin E with consequent reactions. Without going into the details of the biochemical mechanisms, it is important to mention here that there are no well-identified allergenic peptide structures, although the replacement of an epitope responsible for an allergic reaction can be neutralized just by the replacement of an amino acid [69]. While IgE can recognize certain epitopes of foreign proteins without inducing allergic reactions [70], there are situations where the same IgE recognizes proteins of different origins that share the same sequential epitope of an allergenic antigen. This is the reason why there are common families of allergens from different species [71].

There are allergens of plant origin (e.g., pollen, seeds, fruits), animal allergens (mites, fish) and certain components of biological fluids such as milk and eggs. The number of protein allergens is constantly increasing in relation to the improvement of technologies capable of detecting those of low abundance. It is in this context that enrichment methods become of interest [72,73].

While the quantitation of allergens is operated by immunochemical methods (e.g., ELISA) [74], the discovery of new protein allergens is mostly performed by immunoblot techniques [75]. CPLL and ELISA methodologies are not competitive but, rather, complementary techniques since the former allows novel biomarker discoveries and the latter is then designed for their quantification. In short (Figure 5), the protein extract that may contain an unknown allergen is fractionated by electrophoretic techniques and subjected to the serum of an identified allergic patient. IgE from the patient interacts with the allergens and forms a complex. A second reaction follows using labelled IgG against the human IgE and the super-complex obtained revealed by histochemical reactions. Once the right positioning of the allergen is found on the electrophoretic plate, it is extracted and submitted to LC-MS/MS identification. This field of investigation lends itself well to the use of enrichment methods, in particular using CPLL.

Allergens from animal sources are probably less representative than those from plants. The most common animal allergens are from milk, eggs, fish and sea products, and small organisms such as arthropods and mollusks. The CPLL treatment of animal biological extracts contributed to the identification of several allergenic proteins from native [76] and sterilization-treated milk [77]. Eggs are also well known to comprise allergens that can be evidenced by CPLL [78]. Sometimes they are dominant proteins directly detected; in other instances, low-abundance proteins are responsible for allergic reactions and need amplification processes [76]. Among allergens discovered with the assistance of CPLL, the analysis of egg white revealed the allergenic nature of clusterin and an ovoinhibitor [78]. Beyond lactalbumin, caseins, lactoglobulin and lactoferrin, traces of polymorphic immunoglobulins have been evidenced using a serum from a selected patient [76,79]. Insect venoms are also intensively investigated for their content of allergic polypeptides that can also be of low abundance [80].

Plant allergens occupy a large place among current studies. They come from several distinct organs and can cause significant damage to human health. However, proteins in plants are very dilute compared to animals and allergens represent only a little fraction of them rendering their detection challenging. For easy investigations, enrichment methods applied to plant extracts are mandatory [81,82]. Among the most adopted enrichment techniques, CPLL plays a central role because of its general applicability. It has been described for the detection of allergens in various plant organs. For instance, allergens have been found in many fruits, such as bananas [83] and mango [84], where some of them are in common. Allergens from cypress pollens have recently been investigated with the adoption of CPLL enrichment techniques [85,86,87]. The analysis of enriched pollen extracts evidenced allergic low-abundance proteins such as chaperon protein HSP104, sigb-regulated protein, glyoxalase 1 and malate dehydrogenase. Interestingly, allergens from animals and plants can also be found in food by the composition ingredients. In this particular context, the presence of traces of allergen is really critical since they are frequently present as traces well below the sensitivity of current analytical methods necessitating the use of enrichment techniques [88]. A variety of food ingredients from plants and animals carrying proteinaceous allergens are currently used (seeds, nuts, fruits, wheat, flour, milk and eggs). Some of them require attention for very serious consequences on the health of consumers. This is the case with peanuts, which commonly carry allergens. Unfortunately, there are polypeptides not yet listed as allergens and their detection and identification in food is complicated by huge amounts of food matrix, masking their presence and delivering negative or false results if enrichment procedures are not used [89].

At present, great efforts are still ongoing to enlarge the knowledge in allergomics via proteomics studies with the help of enrichment techniques and high-throughput mass spectrometry. Although novel allergens will implement the long existing list from plants and animal biological extracts, the detection of new or known allergenic polypeptides from food remains a challenge. The detection of allergens resulting from post-translational modifications (glycation, citrullination, carbonylation and many others) is another field of investigation. On this matter, enrichment procedures accompanied by highly specific group capture methods could represent a path in which to follow.

7. Other Medical Involvements

Environmental contingencies can induce physiological negative consequences whose knowledge is still limited. In this area, the analysis of gene expression can be an important step forward in prevention. This is precisely the case of space exploration by human beings or living beings in general for which it is in practice impossible to predict the influence of microgravity, exposure to various radiations and confinement. These influences operate at low and progressive levels of gene expression with adaptation or deregulation. The analysis of low-abundance proteins is one way that can provide information on the level of changes. This field of studies started before 2010 with simulations and was gradually developed. In 2014 a study was engaged on the influence of sodium chloride balance on human inflammatory processes during the Mars105 isolation period program [90]. To this end, two thousand low-abundance proteins from the urine samples of six volunteers were investigated. It has been concluded that a reduction of sodium chloride consumption probably limits the activation of an inflammatory process.

In 2019 another study was performed on a small cohort of astronauts to understand the effects of a prolonged stay in space [91]. Significant changes in proteins involved in the hemostasis system and post-translational changes, particularly in phosphorylation, have been demonstrated. The same scientific team studied the effects of microgravity on an endothelial cell culture [92] and found changes in certain elements of the cytoskeletal structure by a complex regulatory system involving Rho proteins. All these results were obtained after submitting cell extracts with CPLL to enrich low-abundance proteins.

This type of study was extended to human cells grown in microgravity [93] and protein extracts treated with CPLL before analysis. The results confirmed the previous conclusions with an increase in filamin-A, alpha-actinin and myosin light polypeptide 6. It was also found that other mechanisms were involved at the cytoskeleton level and adhesion phenomena. Long stays in space also induce general oxidative stress, altering several biological functions with modified protein expression with an increased risk of thrombosis, as reported [94] and evidenced by enrichment processes.

In another medical domain and closer to the recent reality, articles reported the use of CPLL along with immunosubtraction to evidence protein expression differences with the identification of prognostic biomarkers of severe COVID-19 [50]. More than two dozen differentially expressed proteins were found implicated in cardiovascular disorders and inflammation mechanisms. Acid-labile subunit of insulin-like growth factor binding protein and chitinase-3-like protein 1 were found as powerful prognostic markers contributing to providing adapted therapeutic treatments. In this hot domain, various experimental data have been produced thanks to the CPLL enrichment effect, such as the identification of biomarker candidates of acute respiratory distress syndrome contingent upon COVID-19 infection, which is useful for specific therapeutic targets [95]. Prior to analysis here, the blood serum was either submitted to an immunosubtraction with 14 multiple affinity removal systems or to enrichment with combinatorial peptide ligand libraries. From this study, more than a dozen of mis-expressed proteins seemed involved in cytokine signaling such as tumor necrosis factor, interleukin-1β and IL-6 and implicated in systemic inflammatory processes.

8. Conclusions

A multi-year survey of scientific activity around the use of low-abundance protein enrichment indicates a clear evolution of the use of CPLL technology. While adopted initially for the elucidation of proteomes, the most recent years show a clear trend toward the detection and identification of rare proteins in various other domains. In the last 8–10 years the number of publications devoted to low-abundance proteins has been constantly increasing [96] and, essentially, applied in three domains of applications. The first was and still is the detection of biomarkers of diagnostic interest that appear at the beginning of a disease not only for major pathologies such as cancer but also relative to numerous other diseases. The second application domain is to find foreign proteins from recombinant biopharmaceuticals. The third is the discovery of unknown allergens, another medical domain where the adverse effects of foreign environmental materials are constantly growing.

These trends are not dissociated from the improvement of specific applications as repeatedly reported in the last few years [97,98,99]. Beyond the current CPLL products which have been commercially available for several years, technological developments are probably not going to be slowed down. In fact, it is anticipated that in order to make the detection and identification operations more effective and sensitive, two or more protein enrichment and complementary operations would be assembled. For example, phosphoprotein enrichment with dedicated solid phases could be followed by treatments with CPLL. In this case, even the most dilute phosphoproteins would be enriched, and thus, in a position to be easily detected. These operations are not a vision of the mind since already the first combinations of enrichment methods have been described [100,101]. Blends of simple or more complicated libraries are envisioned with better specificities for target proteins on the attention to post-translational modifications will be developed in order to better understand the influence of the environment on cell communication and adaptation. The question does not seem prevented by the difficulty of discovering unknown signatures of physiological changes, but rather to elucidate the biological significance of the protein markers that are currently discovered.

Within the domain of HCP, the standardization of recombinant cell cultures will simplify the task of finding traces of proteins. It is then foreseen that only foreign dangerous proteins, for the stability of biopharma products and the protection of a patient’s health, will specifically be defined and quantified on each production lot by dedicated assays.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

Anderson, N.L.; Anderson, N.G. The human plasma proteome: History, character, and diagnostic prospects. Mol. Cell. Proteom. 2002, 1, 845–867. [Google Scholar] [CrossRef] [PubMed]
D’Alessandro, A.; Dzieciatkowska, M.; Nemkov, T.; Hansen, K.C. Red blood cell proteomics update: Is there more to discover? Blood Transfus. 2017, 15, 182–187. [Google Scholar] [PubMed]
Bracher, A.; Whitney, S.M.; Hartl, F.U.; Hayer-Hartl, M. Biogenesis and metabolic maintenance of RuBisCO. Annu. Rev. Plant Biol. 2017, 68, 29–60. [Google Scholar] [CrossRef] [PubMed]
Xi, J.; Wang, X.; Li, S.; Zhou, X.; Yue, L.; Fan, J.; Hao, D. Polyethylene glycol fractionation improved detection of low-abundant proteins by two-dimensional electrophoresis analysis of plant proteome. Phytochemistry 2006, 67, 2341–2348. [Google Scholar] [CrossRef]
Pieper, R.; Su, Q.; Gatlin, C.L.; Huang, S.-T.; Anderson, N.L.; Steiner, S. Multi-component immunoaffinity subtraction chromatography: An innovative step towards a comprehensive survey of the human plasma proteome. Proteomics 2003, 3, 422–432. [Google Scholar] [CrossRef]
Levin, Y.; Schwarz, E.; Wang, L.; Leweke, F.M.; Bahn, S. Label-free LC-MS/MS quantitative proteomics for large-scale biomarker discovery in complex samples. J. Sep. Sci. 2007, 30, 2198–2203. [Google Scholar] [CrossRef]
Gao, M.; Deng, C.; Yu, W.; Zhang, Y.; Yang, P.; Zhang, X. Large scale depletion of the high-abundance proteins and analysis of middle- and low-abundance proteins in human liver proteome by multidimensional liquid chromatography. Proteomics 2008, 8, 939–947. [Google Scholar] [CrossRef]
Shen, Y.; Kim, J.; Strittmatter, E.F.; Jacobs, J.M.; Camp, D.G., II; Fang, R.; Tolié, N.; Moore, R.J.; Smith, R.D. Characterization of the human blood plasma proteome. Proteomics 2005, 5, 4034–4045. [Google Scholar] [CrossRef]
Huang, B.Y.; Yang, C.K.; Liu, C.P.; Liu, C.Y. Stationary phases for the enrichment of glycoproteins and glycopeptides. Electrophoresis 2014, 35, 2091–2107. [Google Scholar]
Low, T.Y.; Mohtar, M.A.; Lee, P.Y.; Omar, N.; Zhou, N.; Ye, M. Widening the bottleneck of phosphoproteomics: Evolving strategies for phosphopeptide enrichment. Mass Spectrom. Rev. 2021, 40, 309–333. [Google Scholar] [CrossRef]
Llop, E.; Peracaula, R. Lectin affinity chromatography for the discovery of novel cancer glycobiomarkers: A case study with PSA glycoforms and prostate cancer. Methods Mol. Biol. 2022, 2370, 301–313. [Google Scholar]
Nauom, S.; da Silva Neto, B.R.; Ribeiro, M.S.; Pedersoli, W.R.; Ulhoa, C.J.; Silva, R.N.; Monteiro, V.N. Biochemical and molecular study of Trichoderma harzianum enriched secretome protein profiles using lectin affinity chromatography. Appl. Biochem. Biotechnol. 2019, 187, 1–13. [Google Scholar]
Chen, J.; Li, X.; Feng, M.; Luo, K.; Yang, J.; Zhang, B. Novel boronate material affords efficient enrichment of glycopeptides by synergized hydrophilic and affinity interactions. Anal. Bioanal. Chem. 2017, 409, 519–528. [Google Scholar]
Pinto, G.; Caira, S.; Cuollo, M.; Lilla, S.; Fierro, O.; Addeo, F. Hydroxyapatite as a concentrating probe for phosphoproteomic analyses. J. Chromatogr. B 2010, 878, 2669–2678. [Google Scholar]
Lin, H.; Deng, C. Development of immobilized Sn4+ affinity chromatography material for highly selective enrichment of phosphopeptides. Proteomics 2016, 16, 2733–2741. [Google Scholar]
Wang, M.C.; Lee, Y.H.; Liao, P.C. Optimization of titanium dioxide and immunoaffinity-based enrichment procedures for tyrosine phosphopeptide using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. Anal. Bioanal. Chem. 2015, 407, 1343–1356. [Google Scholar] [PubMed]
Di Girolamo, F.; D’Amato, A.; Lante, I.; Signore, F.; Muraca, M.; Putignani, L. Farm animal serum proteomics and impact on human health. Int. J. Mol. Sci. 2014, 15, 15396–15411. [Google Scholar] [CrossRef] [PubMed]
Subba, P.; Narayana Kotimoole, C.; Prasad, T.S.K. Plant Proteome Databases and Bioinformatic Tools: An Expert Review and Comparative Insights. OMICS 2019, 23, 190–206. [Google Scholar] [PubMed]
Righetti, P.G.; Fasoli, E.; D’Amato, A.; Boschetti, E. Making progress in plant proteomics for improved food safety. In Comprehensive Analytical Chemistry; Elsevier: Waltham, MA, USA, 2014; Volume 64, pp. 131–155. [Google Scholar]
Johnson, R.O.; Greer, T.; Cejkov, M.; Zheng, X.; Li, N. Combination of FAIMS, Protein A depletion, and native digest conditions enables deep proteomic profiling of host cell proteins in monoclonal antibodies. Anal. Chem. 2020, 92, 10478–10484. [Google Scholar]
Zhang, S.; Xiao, H.; Li, N. Degradation of polysorbate 20 by sialate O-acetylesterase in monoclonal antibody formulations. J. Pharm. Sci. 2021, 110, 3866–3873. [Google Scholar]
Chen, I.H.; Xiao, H.; Daly, T.; Li, N. Improved host cell protein analysis in monoclonal antibody products through molecular weight cutoff enrichment. Anal. Chem. 2020, 92, 3751–3757. [Google Scholar] [CrossRef] [PubMed]
Zhao, B.; Abdubek, P.; Zhang, S.; Xiao, H.; Li, N. Analysis of host cell proteins in monoclonal antibody therapeutics through size exclusion chromatography. Pharm. Res. 2022, 39, 3029–3037. [Google Scholar] [CrossRef] [PubMed]
Boschetti, E.; Righetti, P.G. Host cell proteins enrichment for an in-depth analytical assessment of biopharmaceuticals quality. 2023; submitted for publication. [Google Scholar]
Boschetti, E.; Righetti, P.G. Low-Abundance Protein Discovery: State of the Art and Protocols; Elsevier: Waltham, MA, USA, 2013. [Google Scholar]
Boschetti, E.; Hernandez-Castellano, L.E.; Righetti, P.G. Progress in farm animal proteomics: The contribution of combinatorial peptide ligand libraries. J. Proteom. 2019, 197, 1–13. [Google Scholar]
Righetti, P.G.; Boschetti, E. Low-abundance plant protein enrichment with peptide libraries to enlarge proteome coverage and related applications. Plant Sci. 2020, 290, 110302. [Google Scholar]
Meng, R.; Gormley, M.; Bhat, V.B.; Rosenberg, A.; Quong, A.A. Low abundance protein enrichment for discovery of candidate plasma protein biomarkers for early detection of breast cancer. J. Proteom. 2011, 75, 366–374. [Google Scholar] [CrossRef]
Fonslow, B.R.; Carvalho, P.C.; Academia, K.; Freeby, S.; Xu, T.; Nakorchevsky, A.; Paulus, A.; Yates, J.R. Improvements in proteomic metrics of low abundance proteins through proteome equalization using ProteoMiner prior to MudPIT. J. Proteome Res. 2011, 10, 3690–3700. [Google Scholar] [CrossRef]
Anderson, J.R.; Phelan, M.M.; Rubio-Martinez, L.M.; Fitzgerald, M.M.; Jones, S.W.; Clegg, P.D.; Peffers, M.J. Optimization of synovial fluid collection and processing for nmr metabolomics and LC-MS/MS proteomics. J. Proteome Res. 2020, 19, 2585–2597. [Google Scholar] [CrossRef]
Beseme, O.; Fertin, M.; Drobecq, H.; Amouyel, P.; Pinet, F. Combinatorial peptide ligand library plasma treatment: Advantages for accessing low-abundance proteins. Electrophoresis 2010, 31, 2697–2704. [Google Scholar] [CrossRef]
Fahiminiya, S.; Labas, V.; Roche, S.; Dacheux, J.-L.; Gérard, N. Proteomic analysis of mare follicular fluid during late follicle development. Proteome Sci. (Short Commun.) 2011, 9, 54–56. [Google Scholar]
Pisanu, S.; Biosa, G.; Carcangiu, L.; Uzzau, S.; Pagnozzi, D. Comparative evaluation of seven commercial products for human serum enrichment/depletion by shotgun proteomics. Talanta 2018, 185, 213–220. [Google Scholar]
Palstrøm, N.B.; Rasmussen, L.M.; Beck, H.C. Affinity capture enrichment versus affinity depletion: A comparison of strategies for increasing coverage of low-abundant human plasma proteins. Int. J. Mol. Sci. 2020, 21, 5903. [Google Scholar]
Sennels, L.; Salek, M.; Lomas, L.; Boschetti, E.; Righetti, P.G.; Rappsilber, J. Proteomic analysis of human blood serum using peptide library beads. J. Proteome Res. 2007, 6, 4055–4062. [Google Scholar] [CrossRef]
Roux-Dalvai, F.; Gonzalez de Peredo, A.; Simó, C.; Guerrier, L.; Bouyssie, D.; Zanella, A.; Citterio, A.; Burlet-Schiltz, O.; Boschetti, E.; Righetti, P.G.; et al. Extensive analysis of the cytoplasmic proteome of human erythrocytes using the peptide ligand library technology and advanced spectrometry. Mol. Cell. Proteom. 2008, 7, 2254–2269. [Google Scholar] [CrossRef]
D’Amato, A.; Cereda, A.; Bachi, A.; Pierce, J.C.; Righetti, P.G. In-depth exploration of the hemolymph of Limulus polyphemus via combinatorial peptide ligand libraries. J. Proteome Res. 2010, 9, 3260–3269. [Google Scholar] [CrossRef]
Qu, Z.; Leung, T.C.N.; Nong, W.; Yip, H.Y.; Lee, I.H.T.; Cheung, S.G.; Ming, N.S.; So, W.L.; Bendena, W.G.; Tobe, S.S.; et al. Hemolymph proteomics and gut microbiota of horseshoe crabs Tachypleus tridentatus and Carcinoscorpius rotundicauda. Front. Mar. Sci. 2020, 7, 579706. [Google Scholar]
Rodríguez, M.; Ajona, D.; Seijo, L.M.; Sanz, J.; Valencia, K.; Corral, J.; Mesa-Guzmán, M.; Pío, R.; Calvo, A.; Lozano, M.D.; et al. Molecular biomarkers in early stage lung cancer. Transl. Lung Cancer Res. 2021, 10, 1165–1185. [Google Scholar]
Veenstra, T.D. Global and targeted quantitative proteomics for biomarker discovery. J. Chromatogr. B. 2007, 847, 3–11. [Google Scholar]
Liu, N.Q.; Braakman, R.B.; Stingl, C.; Luider, T.M.; Martens, J.W.; Foekens JAUmar, A. Proteomics pipeline for biomarker discovery of laser capture microdissected breast cancer tissue, J. Mammary Gland Biol. Neoplasia 2012, 17, 155–164. [Google Scholar]
Yu, W.; Hurley, J.; Roberts, D.; Chakrabortty, S.K.; Enderle, D.; Noerholm, M.; Breakefield, X.O.; Skog, J.K. Exosome-based liquid biopsies in cancer: Opportunities and challenges. Ann. Oncol. 2021, 32, 466–477. [Google Scholar]
Sequeiros, T.; Rigau, M.; Chiva, C.; Montes, M.; Garcia-Grau, I.; Garcia, M.; Diaz, S.; Celma, A.; Bijnsdorp, I.; Campos, A.; et al. Targeted proteomics in urinary extracellular vesicles identifies biomarkers for diagnosis and prognosis of prostate cancer. Oncotarget 2017, 8, 4960–4976. [Google Scholar] [PubMed]
Salciccia, S.; Capriotti, A.L.; Laganà, A.; Fais, S.; Logozzi, M.; De Berardinis, E.; Busetto, G.M.; Di Pierro, G.B.; Ricciuti, G.P.; Del Giudice, F.; et al. Biomarkers in Prostate Cancer Diagnosis: From current knowledge to the role of metabolomics and exosomes. Int. J. Mol. Sci. 2021, 22, 4367. [Google Scholar] [PubMed]
Zhao, Z.; Liu, W. Pancreatic cancer: A review of risk factors, diagnosis, and treatment. Technol. Cancer Res. Treat. 2020, 19, 1–13. [Google Scholar] [CrossRef] [PubMed]
Liu, P.; Weng, Y.; Sui, Z.; Wu, Y.; Meng, X.; Wu, M.; Jin, H.; Tan, X.; Zhang, L.; Zhang, Y. Quantitative secretomic analysis of pancreatic cancer cells in serum-containing conditioned medium. Sci. Rep. 2016, 6, 37606. [Google Scholar] [CrossRef] [PubMed]
Chidambaranathan-Reghupaty, S.; Fisher, P.B.; Sarkar, D. Hepatocellular carcinoma (HCC): Epidemiology, etiology and molecular classification. Adv. Cancer Res. 2021, 149, 1–61. [Google Scholar]
Mustafa, G.M.; Larry, D.; Petersen, J.R.; Elferink, C.J. Targeted proteomics for biomarker discovery and validation of hepatocellular carcinoma in hepatitis C infected patients. World J. Hepatol. 2015, 7, 1312–1324. [Google Scholar]
Mustafa, M.G.; Petersen, J.R.; Ju, H.; Cicalese, L.; Snyder, N.; Haidacher, S.J.; Denner, L.; Elferink, C. Biomarker discovery for early detection of hepatocellular carcinoma in hepatitis C-infected patients. Mol. Cell. Proteom. 2013, 12, 3640–3652. [Google Scholar]
Kimura, Y.; Nakai, Y.; Shin, J.; Hara, M.; Takeda, Y.; Kubo, S.; Jeremiah, S.S.; Ino, Y.; Akiyama, T.; Moriyama, K.; et al. Identification of serum prognostic biomarkers of severe COVID-19 using a quantitative proteomic approach. Sci. Rep. 2021, 11, 20638. [Google Scholar] [CrossRef]
Cheng, Y.; Chen, Y.; Sun, X.; Li, Y.; Huang, C.; Deng, H.; Li, Z. Identification of potential serum biomarkers for rheumatoid arthritis by high-resolution quantitative proteomic analysis. Inflammation 2014, 37, 1459–1467. [Google Scholar]
Zhao, X.; Chen, Y.; Wen, W.; Cheng, Y.; Li, R.; Liu, X.; Li, Y.; Jia, R.; Deng, H.; Li, Z.; et al. Identification of lipopolysaccharide-binding protein as a novel citrullinated autoantigen in rheumatoid arthritis. Rheumatol. Autoimmun. 2022, 2, 5–14. [Google Scholar]
Malaud, E.; Piquer, D.; Merle, D.; Molina, F.; Guerrier, L.; Boschetti, E.; Saussine, M.; Marty-Ané, C.; Albat, B.; Fareh, J. Carotid atherosclerotic plaques: Proteomics study after a low-abundance protein enrichment step. Electrophoresis 2012, 33, 470–482. [Google Scholar]
Eslava-Alcon, S.; Extremera-García, M.J.; González-Rovira, A.; Rosal-Vela, A.; Rojas-Torres, M.; Beltran-Camacho, L.; Sanchez-Gomar, I. Molecular signatures of atherosclerotic plaques: An up-dated panel of protein related markers. J Proteom. 2020, 221, 103757. [Google Scholar]
Ulme, C.H.; Peffers, M.J.; Harrington, G.M.B.; Wilson, E.; Perry, J.; Roberts, S.; Gallacher, P.; Jermin, P.; Wright, K.T. Identification of candidate synovial fluid biomarkers for the prediction of patient outcome after microfracture or osteotomy. Am. J. Sports Med. 2021, 49, 1512–1523. [Google Scholar]
Sivadasan, P.; Gupta, M.K.; Sathe, G.J.; Balakrishnan, L.; Palit, P.; Gowda, H.; Suresh, A.; Kuriakose, M.A.; Sirdeshmukh, R. Data from human salivary proteome—A resource of potential biomarkers for oral cancer. Data Brief. 2015, 4, 374–378. [Google Scholar]
Celsi, F.; Monasta, L.; Arrigoni, G.; Battisti, I.; Licastro, D.; Aloisio, M.; Di Lorenzo, G.; Romano, F.; Ricci, G.; Ura, B. Gel-based proteomic identification of suprabasin as a potential new candidate biomarker in endometrial cancer. Int. J. Mol. Sci. 2022, 23, 2076. [Google Scholar]
Jankovska, E.; Svitek, M.; Holada, K.; Petrak, J. Affinity depletion versus relative protein enrichment: A side-by-side comparison of two major strategies for increasing human cerebrospinal fluid proteome coverage. Clin. Proteom. 2019, 16, 9. [Google Scholar]
Champion, K.; Madden, H.; Dougherty, J.; Shacter, E. Defining your product profile and maintaining control over it, part 2. Bioprocess Int. 2005, 9, 52–57. [Google Scholar]
Zhu-Shimoni, J.; Yu, C.; Nishihara, J.; Wong, R.M.; Gunawan, F.; Lin, M.; Krawitz, D.; Liu, P.; Sandoval, W.; Vanderlaan, M. Host cell protein testing by ELISAs and the use of orthogonal methods. Biotechnol. Bioeng. 2014, 111, 2367–2379. [Google Scholar]
Soderquist, R.G.; Trumbo, M.; Hart, R.A.; Zhang, Q.; Flynn, G.C. Development of advanced host cell protein enrichment and detection strategies to enable process relevant spike challenge studies. Biotechnol. Prog. 2015, 31, 983–989. [Google Scholar] [CrossRef]
Madsen, J.A.; Farutin, V.; Carbeau, T.; Wudyka, S.; Yin, Y.; Smith, S.; Anderson, J.; Capila, I. Toward the complete characterization of host cell proteins in biotherapeutics via affinity depletions, LC-MS/MS, and multivariate analysis. MAbs 2015, 7, 1128–1137. [Google Scholar] [CrossRef]
Chen, I.-H.; Xiao, H.; Li, N. Improved host cell protein analysis in monoclonal antibody products through ProteoMiner. Anal. Biochem. 2020, 610, 113972. [Google Scholar] [CrossRef]
Fortis, F.; Guerrier, G.; Areces, L.; Antonioli, P.; Hayes, T.; Carrick, K.; Hammond, D.; Boschetti, E.; Righetti, P.G. A new approach for the detection and identification of protein impurities using combinatorial solid phase ligand libraries. J. Proteome Res. 2006, 5, 2577–2585. [Google Scholar] [CrossRef] [PubMed]
Antonioli PFortis, F.; Guerrier, L.; Rinalducci, S.; Zolla, L.; Righetti, P.G.; Boschetti, E. Capturing and amplifying impurities from purified recombinant monoclonal antibodies via peptide library beads: A proteomic study. Proteomics 2007, 7, 1624–1633. [Google Scholar] [CrossRef] [PubMed]
Mörtstedt, H.; Makower, A.; Edlund, P.O.; Sjöberg, K.; Tjernberg, A. Improved identification of host cell proteins in a protein biopharmaceutical by LC-MS/MS using the ProteoMiner enrichment kit. J. Pharm. Biomed. Anal. 2020, 185, 113256. [Google Scholar] [CrossRef] [PubMed]
Zhang, S.; Xiao, H.; Li, N. Ultrasensitive method for profiling host cell proteins by coupling limited digestion to ProteoMiner technology. Anal. Biochem. 2022, 657, 114901. [Google Scholar] [CrossRef]
Zhang, S.; Zhao, B.; Adaniya, S.; Xiao, H.; Li, N. Ultrasensitive quantification method for understanding biologically relevant concentrations of host cell proteins in therapeutics. Anal. Chem. 2023, 95, 6002–6008. [Google Scholar] [CrossRef]
Bredehorst, R.; David, K. What establishes a protein as an allergen. J. Chromatogr. B. 2001, 756, 33–40. [Google Scholar] [CrossRef]
Bousquet, J.; Anto, J.M.; Bachert, C.; Bousquet, P.J.; Colombo, P.; Crameri, R.; Daëron, M.; Fokkens, W.; Leynaert, B.; Lahoz, C.; et al. Factors responsible for differences between asymptomatic subjects and patients presenting an IgE sensitization to allergens. Allergy 2006, 61, 671–680. [Google Scholar] [CrossRef]
Yagami, T. Allergies to cross-reactive plant proteins. Latex-fruit syndrome is comparable with pollen-food allergy syndrome. Int. Arch. Allergy Immunol. 2002, 128, 271–279. [Google Scholar] [CrossRef]
Ramachandran, B.; Yang, C.T.; Downs, M.L. Parallel reaction monitoring mass spectrometry method for detection of both casein and whey milk allergens from a baked food matrix. J. Proteome Res. 2020, 19, 2964–2976. [Google Scholar] [CrossRef] [PubMed]
Torres-Arroyo, A.; Martínez-Aguilar, J.; Castillo-Villanueva, A.; Zárate-Mondragón, F.; Cervantes-Bustamante, R.; Patiño-López, G.; Medina-Contreras, O.; Espinosa-Padilla, S.E.; Valencia-Rojas, S.; Romero-Guzmán, L.; et al. Immunoproteomics of cow’s milk allergy in Mexican pediatric patients. J. Proteom. 2023, 273, 104809. [Google Scholar] [CrossRef] [PubMed]
Alisi, C.; Afferni, C.; Iacovacci, P.; Barletta, B.; Tinghino, R.; Butteroni, C.; Puggioni, E.M.; Wilson, I.B.; Federico, R.; Schininà, M.E.; et al. Rapid isolation, characterization, and glycan analysis of Cup a 1, the major allergen of Arizona cypress (Cupressus arizonica) pollen. Allergy 2001, 56, 978–984. [Google Scholar] [CrossRef] [PubMed]
Stott, D.I. Immunoblotting, dot-blotting, and ELISPOT assays: Methods and applications. J. Immunoass. 2000, 21, 273–296. [Google Scholar] [CrossRef] [PubMed]
D’Amato, A.; Bachi, A.; Fasoli, E.; Boschetti, E.; Peltre GSénéchal, H.; Righetti, P.G. In-depth exploration of cow’s whey proteome via combinatorial peptide ligand libraries. J. Proteome Res. 2009, 8, 3925–3936. [Google Scholar] [CrossRef] [PubMed]
Siciliano, R.A.; Mazzeo, M.F.; Arena, S.; Renzone, G.; Scaloni, A. Mass spectrometry for the analysis of protein lactosylation in milk products. Food Res. Int. 2013, 54, 988–1000. [Google Scholar] [CrossRef]
Martos, G.; López-Fandiño, R.; Molina, E. Immunoreactivity of hen egg allergens: Influence on in-vitro gastrointestinal digestion of the presence of other egg white proteins and of egg yolk. Food Chem. 2013, 136, 775–781. [Google Scholar] [CrossRef]
Coscia, A.; Orrù, S.; Di Nicola, P.; Giuliani, F.; Varalda, A.; Peila, C.; Fabris, C.; Conti, A.; Bertino, E. Detection of cow’s milk proteins and minor components in human milk using proteomics techniques. J. Matern. Fetal. Neonatal. Med. 2012, 25 (Suppl. S4), 54–56. [Google Scholar] [CrossRef]
de Graaf, D.C.; Brochetto Braga, M.R.; Magalhães de Abreu, R.M.; Blank, S. Standard methods for Apis mellifera venom research. J. Apic. Res. 2020, 60, 1–31. [Google Scholar] [CrossRef]
López-Pedrouso, M.; Lorenzo, J.M.; Gagaoua, M.; Franco, D. Current Trends in Proteomic Adv. Food Allerg. Anal. Biol. 2020, 9, 247. [Google Scholar]
Zimmermann, J.; Hubel, P.; Pfannstiel, J.; Afzal, M.; Longin, C.F.H.; Hitzmann, H.; Götz, H.; Bischoff, S.C. Comprehensive proteome analysis of bread deciphering the allergenic potential of bread wheat, spelt and rye. J. Proteom. 2021, 247, 104318. [Google Scholar] [CrossRef]
Nikolić, J.; Nešić, A.; Kull, S.; Schocker, F.; Jappe, U.; Gavrović-Jankulović, M. Employment of proteomic and immunological based methods for the identification of catalase as novel allergen from banana. J. Proteom. 2018, 175, 87–94. [Google Scholar] [CrossRef]
Gomez-Cardona, E.E.; Heathcote, K.; Teran, L.M.; Righetti, P.G.; Boschetti, E.; D’Amato, A. Novel low-abundance allergens from mango via combinatorial peptide libraries: A proteomics study. Food Chem. 2018, 269, 652–660. [Google Scholar] [CrossRef]
Charpin, D.; Pichot, C.; Belmonte, J.; Sutra, J.P.; Zidkova, J.; Chanez, P.; Shahali, Y.; Sénéchal, H.; Poncet, P. Cypress pollinosis: From tree to clinic. Clinic Rev. Allerg. Immunol. 2019, 56, 174–195. [Google Scholar] [CrossRef] [PubMed]
Shahali, Y.; Sénéchal, H.; Poncet, P. The use of combinatorial hexapeptide ligand library (CPLL) in allergomics. Funct. Proteom. 2019, 1871, 393–403. [Google Scholar]
Poncet, P.; Sénéchal, H.; Charpin, D. Update on pollen-food allergy syndrome. Expert Rev. Clin. Immunol. 2020, 16, 561–578. [Google Scholar] [CrossRef] [PubMed]
Ortea, I.; O’Connor, G.; Maquet, A. Review on proteomics for food authentication. J. Proteom. 2016, 147, 212–225. [Google Scholar] [CrossRef]
Pedreschi, R.; Nørgaard, J.; Maquet, A. Current challenges in detecting food allergens by shotgun and targeted proteomic approaches: A case study on traces of peanut allergens in baked cookies. Nutrients 2012, 4, 132–150. [Google Scholar] [CrossRef]
Binder, H.; Wirth, H.; Arakelyan, A.; Lembcke, K.; Tiys, E.S.; Ivanisenko, V.A.; Kolchanov, N.A.; Kononikhin, A.; Popov, I.; Nikolaev, E.N.; et al. Time-course human urine proteomics in space-flight simulation experiments. BMC Genom. 2014, 15, S2. [Google Scholar] [CrossRef]
Brzhozovskiy, A.G.; Kononikhin, A.S.; Pastushkova, L.C.; Kashirina, D.N.; Indeykina, M.I.; Popov, I.A.; Custaud, M.A.; Larina, I.M.; Nikolaev, E.N. The effects of spaceflight factors on the human plasma proteome, including both real space missions and ground-based experiments. Int. J. Mol. Sci. 2019, 20, 3194–3210. [Google Scholar] [CrossRef]
Kashirina, D.N.; Kononikhin, A.S.; Larina, I.M.; Buravkova, L.B. Secretome of cultured human endothelial cells in simulated microgravity. Exp. Bull. Biol. Med. 2019, 167, 35–38. [Google Scholar] [CrossRef]
Kashirina, D.N.; Kononikhin, A.S.; Ratushnyy, A.Y.; Nikolaev, E.N.; Larina, I.M.; Buravkova, L.B. Proteomic profile of cultured human endothelial cells after exposition to simulated microgravity. Acta Astronaut. 2021, 179, 11–19. [Google Scholar] [CrossRef]
Larina, I.M.; Brzhzovsky, A.G.; Nosovsky, A.M.; Kononikhin, A.S.; Orlov, O.I. Post-translational oxidation modifications of blood plasma proteins of cosmonauts after a long-term flight: Part I. Hum. Physiol. 2020, 46, 531–539. [Google Scholar] [CrossRef]
Xia, Y.; Gao, L.; Guo, L.; Li, H.; Shao, M.; Yang, Q.; Liu, N.; Fang, M.; Xu, X.; Li, J.; et al. Identification of RPSA as a potential biomarker in bronchoalveolar lavage fluid for acute respiratory distress syndrome. Res. Sq. 2021. [Google Scholar] [CrossRef]
Boschetti, E.; Zilberstein, G.; Righetti, P.G. Combinatorial peptides: A library that continuously probes low-abundance proteins. Electrophoresis 2022, 43, 355–369. [Google Scholar] [CrossRef] [PubMed]
Boschetti, E.; D’Amato, A.; Candiano, G.; Righetti, P.G. Protein biomarkers for early detection of diseases: The decisive contribution of combinatorial peptide ligand libraries. J. Proteom. 2018, 188, 1–14. [Google Scholar] [CrossRef]
Bangy-Letheule, A.; Souab, F.; Bourgoin, S.; Michelland, S.; Cunin, V.; Seve, M.; Aillerie, V.; Dhot, J.; Montnach, J.; Persello, A.; et al. A non-targeted quantitative mass spectrometry approach for the identification of new blood biomarkers of septic shock in the secretory of a rat model of endotoxemic shock. Arch. Cardiovasc. Dis. Suppl. 2020, 12, 229–238. [Google Scholar] [CrossRef]
Zhou, Y.; Qin, S.; Sun, M.; Tang, L.; Yan, X.; Kim, T.-K.; Caballero, J.; Glusman, G.; Brunkow, M.E.; Soloski, M.J.; et al. Measurement of organ-specific and acute-phase blood protein levels in early Lyme disease. J. Proteome Res. 2020, 19, 346–359. [Google Scholar] [CrossRef]
Zhang, Y.; Lin, Z.; Tan, Y.; Bu, F.; Hao, P.; Zhang, K.; Yang, H.; Liu, S.; Ren, Y. Exploration of missing proteins by a combination approach to enrich the low-abundance hydrophobic proteins from four cancer cell lines. J. Proteome Res. 2020, 19, 401–408. [Google Scholar] [CrossRef]
Gjoka, X.; Schofield, M.; Cvetkovic, A.; Gantier, R. Combined Protein A and size exclusion high performance liquid chromatography for the single-step measurement of mAb, aggregates and host cell proteins. J. Chromatogr B 2014, 972, 48–52. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Evolution of the number of enrichment-based peer-reviewed publications over the last 40 years with more than 2600 in total. The number of scientific papers increases rapidly during the last decade. Compilation of data from PubMed (NIH, National Library of Medicine).

Figure 2. Two-dimensional electrophoresis analysis of human blood serum before (native (A)) and after enrichment (immunosubtraction on the left (B) and combinatorial peptide library on the right (C)). pH gradient 3 to 10; molecular mass range 250 kDa and 10 kDa; Coomassie staining). Adapted from Boschetti and Righetti [25].

Figure 3. Example of bloc diagram used for the discovery of biomarkers of low abundance (early stage) with the involvement of the enrichment procedure. The sample is first enriched followed by a complete trypsination to produce peptides. The latter are then fractionated and protein identified by mass spectrometry. Comparisons are performed against a control sample. The central part of the figure (dotted frame) schematically represents the CPLL treatment of each sample. HAP: high-abundance proteins; LAP: low-abundance proteins.

Figure 4. SDS-polyacrylamide gel electrophoresis of purified recombinant proteins before and after enrichment to evidence host cell protein impurity traces (for details see Refs. [64,65] from where it is adapted). (A) Purified recombinant proteins (1: purified monoclonal antibodies by mixed-mode chromatography; 2: purified monoclonal antibody by anion exchange chromatography; 3: purified Staphylococcus aureus protein A by IgG affinity chromatography). (B) Same protein samples, in the same order after treatment with combinatorial peptide ligand library to enrich for low-abundance proteins.

Figure 5. Experimental setup for the identification of allergens from a plant extract (for example, cypress pollen). After protein extraction and enrichment (e.g., by CPLL) the sample is submitted to 2D electrophoresis (two plates). One gel plate is classically stained to visualize each protein spot (plate on the left). The other gel plate (plate on the right) is submitted to an immunoblot with blood serum from an allergic patient (comprising IgE antibodies). Allergens are then visualized by immunochemical reactions. The two plates are then compared. Considered positive spots from the first plate are excised, extracted and submitted to the current procedure of protein identification by LC-MS/MS. The arrows indicate the allergen spots that are extracted for identification.

Table 1. Comparison of main enrichment methods used in proteomics investigations.

Method	Principle	Advantages	Drawbacks
Fractionation	Chromatography	High binding capacity Cheap Various conditions	Fraction overlapping Non specific Large dilution
Precipitation	Differential solubility	Easy handling Cheap Large and small samples Large applications	Non specific Protein entrapping Rough method Fraction overlapping
Immunosubtraction	Antibodies against HAP	High specificity Easy handling Small samples	Restricted samples Large co-subtraction Large dilution Expensive Low binding capacity
Capture of LAP groups	Various affinity ligands	Group specific Large choice Concentration of LAP	Non-specific binding Restricted to protein groups
Reduction of dynamic range with CPLL	Multiple affinity-like overloading.	Concentration of LAP Reduction of HAP No sample restriction Possible fractionated harvesting	Large samples Expensive Single use

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Boschetti, E.; Righetti, P.G. Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique. Int. J. Mol. Sci. 2023, 24, 10329. https://doi.org/10.3390/ijms241210329

AMA Style

Boschetti E, Righetti PG. Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique. International Journal of Molecular Sciences. 2023; 24(12):10329. https://doi.org/10.3390/ijms241210329

Chicago/Turabian Style

Boschetti, Egisto, and Pier Giorgio Righetti. 2023. "Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique" International Journal of Molecular Sciences 24, no. 12: 10329. https://doi.org/10.3390/ijms241210329

APA Style

Boschetti, E., & Righetti, P. G. (2023). Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique. International Journal of Molecular Sciences, 24(12), 10329. https://doi.org/10.3390/ijms241210329

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Low-Abundance Protein Enrichment for Medical Applications: The Involvement of Combinatorial Peptide Library Technique

Abstract

1. Introduction

2. Current Methods for Low-Abundance Protein Enrichment

3. Combinatorial Peptide Ligand Library Technology

4. Identification of Early-Stage Biomarkers of Human Diseases

5. Detection of Protein Impurity Traces from Recombinant Biopharmaceuticals

6. Discovery of Low-Concentration Allergens

7. Other Medical Involvements

8. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI