1. Introduction
Scientists usually understand proteins to be more extensively in the native (N) state with normal molecular conformations than in an unfolded (U) or intermediate (M) state with abnormal protein folding molecular conformations, because protein drugs exist in N state and this is more useful for human life. However, the appearance of bovine spongiform encephalopathy (BSE) in England required scientists to investigate proteins in an abnormal (non-native) state and thus it was found that BSE disease is also dependent on the molecular conformation of proteins. The molecular conformations of protein in buffers appear flexible, but still in N state, while proteins in U and/or M states, are actually in a very stable molecular conformation.
How to prevent the formation of U and M states of a protein? Also, if both exist already, how to convert them to their corresponding N state, or how to increase the efficiency of protein folding? This has became an important field in both molecular biology and liquid chromatography (LC), and also the renaturation for producing recombinant protein drugs in biotechnology using 
E coli. According to Anfinsen’s theory [
1], the three-, or four-dimensional structure of protein molecule only depends on its primary structure, or amino acid sequence. In other words, each of the N, U, and M states has the same amino acid sequence, but various molecular conformations. An intensive understanding and characterization of the nature of the U and M states would be very helpful to find a better way to accomplish this conversion from U and/or M state into the corresponding N state.
Many methods, such as optical or calorimetric methods or NMR, can be employed to investigate the character and the changes in molecular conformation over a period of time [
2–
4]. All of them need a pure sample. Many kinds of LC can be employed to achieve this by a chromatographic separation process. Hydrophobic interaction chromatography (HIC) was previously reported as being not only a very efficient tool for investigating protein folding, but also for protein renaturation with simultaneous purification in biotechnology and this method is called protein folding liquid chromatography(PFLC) [
5].
In some cases unfolded proteins cannot, or cannot completely be refolded by HIC, so the obtained one or more M state species should form at the interface between the stationary and mobile phases and have various hydrophobic strengths, and thus they can be separated from each other during the chromatographic process. This may provide a theoretical basis and methodology for on-line isolation, capture and investigation of the M state that originally existed in a complex mixture solution, without a priori purification of the N and/or M state of the target proteins.
The two linear parameters log
I and 
Z(S) (for their physical meanings, see later) of the stoichiometric displacement theory for retention (SDT-R) were widely employed for characterizing the changes in molecular conformation of proteins and molecular structure of small solutes and biopolymers [
6,
7]. The M and N states of a protein must have a different magnitude of log
I and 
Z (S) [
8]. This difference should be employed to distinguish them from each other.
In this study, an active protein, α-chymotrypsin (α-Chy), was selected as a standard protein (N state) and the stable intermediate of α-Chy as the typical M state of a protein. HIC was selected as a typical HPLC method for the on-line preparation and purification of the prepared stable M state of α-Chy and subsequently identifying its N and M states by optical, bioactivity, and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS). With the measurment and comparison of the three linear parameters of the SDT-R for α-Chy in the N and M states, a new on-line chromatographic characterization of the α-Chy in M state is established.
 2. Principle
A protein in N state has a correct three-, or four-dimensional molecular structure that allow flexible changes in molecular conformation in buffers, and the hydrophobic amino acid residues exist inside the protein molecules. Proteins in the M and U states have a very compact molecular, or random coil molecular structure, do not present flexible changes in molecular conformation and the hydrophobic amino acid residues are exposed on the surface of protein molecules. When the three species of a same protein contact with the region of the interface of a liquid-solid, their contact surface areas, the type and magnitude of the interaction forces of the target protein with the STHIC (stationary phase of hydrophobic interaction chromatography) should be quite different.
The hydrophobicity of the surface of the LC stationary phase plays here an important role. For reverse phase liquid chromatography (RPLC), the strong hydrophobicity makes a protein partially, or completely lose its N state and convert to its U state. If the hydrophobicity is moderate, such as in HIC, the protein can not only retain its N state, but some protein which had been originally denatured can now be refolded by HIC [
9]. So long as a protein in U state cannot be refolded, or only partially refolded by HIC, one, or several stable M states shall exist. When the molecular structure of the protein changes to a large extent, such as protein in the U state converts to an N state, the characterization parameters to express the changes in surface area, interaction forces, affinity of the protein to STHIC vary tremendously also. On the contrary, proteins in the U and/or M states cannot be refolded by HIC, so their molecular structure and corresponding characterization parameters will not change significantly. So long as these characterized parameters can be found and exactly measured, these different changes sourced from the molecular conformation of a protein can be employed to distinguish them from each other.
Two linear parameters of the stoichiometric displacement theory for retention (SDT-R), 
Z (
S) and log
I have been widely employed to express the molecular conformation and also the affinity of proteins towards stationary phases by many scientists. The topic has been reviewed [
6–
8]. For HIC, the expression of the SDT-R is gven by 
Equation (1) [
10]:
where, 
k’ is the capacity factor of protein, 
Z denotes the moles of water released from the contact surface region of one mole of solvated protein adsorbed onto the STHIC, log
I is a constant relating to the affinity of one mole of the protein for the stationary phase. The term 
aH2O represents the water activity in the mobile phase. When the HIC stationary phase and mobile phase, as well as other chromatographic conditions, such as column temperature, are given, both of 
Z and log
I are constants and thus 
Equation (1) is a linear equation and easily measured by experiment by plotting log
k’ versus log
aH2O. 
S, presented as an identical parameter to 
Z by Snyder’s group, can be obtained by:
Here 
Kw denotes the partition coefficient of protein in stationary phase and pure water [
6–
8], 
Φ is the volume fraction of displacer in mobile phase. Because 
Equation (2) is an approximately linear equation, thus 
S is an empirical constant. The relationship between Z and S was connected by 
Equation (3) as [
8]:
By using 
Equation (3) and the physical meaning of 
Z, both have been widely employed to characterize the changes in the molecular conformation of proteins. With 
Equation (3), both 
S and 
Z can be converted to each other. According to a previous report the 
Z value measured by isocratic elution is more accurate than the 
S obtained from gradient elution [
11] and thus 
Z together with log
I were employed to investigate the molecular conformation of proteins in this study.
An excellent linear relationship between log
I and 
Z of small solutes (non-polar and polar) and proteins in N state exist [
12], but it is only valid for solute retention dominated by non-specific interactions, such as RPLC and HIC. A linear relationship between log
Kw and 
S was also reported, but is only valid in methanol-water as the mobile phase [
6]. This relationship can be expressed as 
Equation (4).
The physical meaning of 
j here denotes the affinity of one mole of displacer (water in HIC) to the stationary phase of LC, 
ϕ is the column phase ratio defined by thermodynamics. Because the stoichiometric displacement process between solute and solvent is a reversed process, in other words, one mole of solute displacing Z moles of solvent is equivalent to one mole of solvent displacing 1/
Z solute. The stoichiometric parameter 
j shown in 
Equation (4) just describes the latter process. Thus, the physical meaning of 
j as solvent as displacer actually corresponds to that log
I as solute as displacer. It is actually 
Equation (4) that has been experimentally found to have a very good linear relation for protein separation by HIC, either under various proteins at a same composition of mobile phase, or a given protein with various salts, pH, even denaturants [
10]. With the new concept of molecular interaction about the origin of long-range attraction between hydrophobes in water presented by Despa 
et al. [
13], this linear relationship between proteins and the stationary phase of HIC (STHIC) can be theoretically explained as a non-selective interaction that dominates protein separation in this instance. Thus, the goodness of the linearity can be employed to justify the kind of interaction forces between solute and stationary phase of LC.
 4. Results and Discussion
Supposing the activity coefficient of the employed mobile phase under all conditions is unity and thus 
Equation (1) becomes its concentration form 
Equation(1a) as:
where [
H2O] is water mole concentration, mol/L.
 4.1. Preparation, identification and stability of M state of α-Chy
One of advantages of PFLC is that the refolding and separation of proteins can be carried out simultaneously. Based on this advantage, the equilibrium among N, M, U states, even some broken peptide chains, can easily be investigated by measuring the changes in the retention and/or peak height, or by isolating one of them to investigate its character alone, providing a lot of information for understanding the folding mechanism of a target protein. The selected commercial α-Chy is an ideal model protein for accomplishing this purpose. Solid α-Chy in the N state can be bought from commercial sources but it is a kind of self-digesting protein. If one desires to study a feature in a particular state, it has to be prepared from a commercial α-Chy 
in situ. The commercial α-Chy was firstly denatured with various urea concentrations, 
Curea for 24 hours and then was injected into a HIC column for refolding under the same chromatographic conditiond as that for the purification of α-Chy in the N state [
16]. 
Figure 1 only shows the chromatogram of the α-Chy unfolding when 
Curea was 1.0, 3.0, 6.0 and 8.0 mol/L.
From this figure, depending on Curea, different numbers of peaks were obtained. Compared to other Curea, the largest number of the components were obtained when Curea was 3.0 mol/L. It implies that it is possible for several M states to exist. With the combination of MALDI-TOF MS and bioactivity measurement, only two components have the same molecular mass as the α-Chy, one of them marked with N has high bioactivity and other one, with no bioactivity (actually very low bioactivity) was marked as an M state. When the Curea are separately 6.0 and 8.0 mol/L, although the retention times of their last two peaks are the same as the M state at Curea 1.0 and 3.0 mol/L, with MALDI-TOF MS identification, they are actually a mixture of many kinds of peptides. This fact indicates a high concentration urea is needed to completely digest the α-Chy. Other components shown in this figure were confirmed to be peptides having various molecular masses. They come either from the commercial product, or from the digestion product during the denaturing and/or renaturating processes. Each of the collected fractions of the α-Chy in the M and N states was lyophilized and stored.
Because the half-life of many intermediates during protein folding is only seconds in duration, sometimes even less, this makes it be very difficult to capture and detect them [
17]. The prepared α-Chy in the M state must be stable enough for the subsequent investigations, otherwise, it either is refolded by PFHIC, or self-digested already. The stability of the obtained α-Chy in the M state needs to be initially confirmed by experiment. Re-dissolving each of the lyophilized pellets with either 3.0 mol/L urea (denaturing condition), or water and incubating for 0, 1, 3, 8, 24, and 28 h. The resulting samples were injected into the same HIC column; only one peak having the same retention with the original M state was obtained (Figure not shown here), and also without bioactivity according to the biological assay.
 4.2. Distinguishing M from U state
This study is required to investigate the character of the α-Chy in the M state, not U state. We firstly need define the protein in M and U states in PFLC. The two states are defined as follows: U state is some species of protein in a non-N state under unfolded conditions, or before PFLC, while the M state is that under refold conditions, or after PFLC. It still needs to be confirmed whether the collection contains to M state or U state. Because the obtained stable α-Chy in the U state exists only under an unfolding condition, i.e., the presence of a suitable urea concentration, otherwise, the U state either refolds to its N state, or converts to a stable M state, or an associated state. In this study, enough urea has to be the present in the sample solution for its identification. In other words, any kind of LC method by which urea from the α-Chy solution is removed, can never be employed to distinguish M from U, or N states.
Figure 2 shows the comparison of FE spectrum (2A) and UV absorption spectrometry (2B) of the α-Chy in different environments. From this figure, the two kinds of absorption spectroscopy of U state (2) are quite different from that of M state (3, 4) in either water, or 3.0 mol/L urea solution, also different from N state(1), indicating that the obtained M state(after HIC) shown in 
Figure 1 is really different from either its U state, or N state.
  4.3. Z (S) and log I of the α-Chy in N and M states
With isocratic elution, both 
Z and log 
I of proteins in N state can be exactly measured with 
Equation (1).
For those in an M state, they need testing whether they follow 
Equation (1), or not. 
Figure 3 shows this plot of the α-Chy in different states and all parameters for α-Chy in the N and M states at a 
Curea of 0.0 and 3.0 mol/L are separately shown in 
Table I. It can be seen that although the α-Chy exists in different molecular conformations, the linear correlation coefficient 
R is greater than 0.997, indicating that α-Chy in both of its N and M obeys 
Equation (1) well. The most relative mean deviation for log 
I and 
Z at two continuously parallel measurements is less than ±3%, providing an experimental basis for characterizing the natures of the N and M states accurately by using both of the log 
I and 
Z values of protein.
 4.4. Dependence of logI and Z on Curea
Figure 4 shows the effect of 
Curea on log
I and 
Z, as the 
Curea covers the range from 0 to 5.0 mol/L. It can be seen that the changes in log
I and 
Z are quite different. For α-Chy as an N state, with the increases in the 
Curea, the two values decrease in a tremendous and discontinuous manner, while in that for the M state, they also decrease in the same direction, but only in a small and continuous manner. This phenomenon fits the expectation in the theoretical part that due to the presence of urea the native α-Chy loses significantly its three-, or four-dimensional molecular structure in a discontinuous manner [
18]. The part of hydrophobic amino acid residues which are originally buried within the molecules and its surface only has residue with weak hydrophobicity, requiring a large surface area to contact and adsorb by the STHIC. With the increasing the 
Curea, more hydrophobic amino residues are exposed to the mobile phase, resulting in contact on the surface of STHIC at a minority of strong hydrophobic amino acid residues. This leads to a decrease in the contact surface area between the α-Chy molecules and the stationary phase in a discontinuous manner. In contrast, because the molecular conformation of α-Chy in the M state had changed a lot already, the changes in log
I and 
Z are only about a fifth of that in the N state, respectively. This fact indicates that these changes in M state are not a result of molecular conformation changes, but of urea as a secondary displacer (water is the first displacer) to participate in the stoichiometric displacement process [
19]. The fact further shows that the α-Chy in the M state is very stable as urea changes, further indicating its M state is very difficult to convert to the N state as urea decreases. It is also seen from 
Figure 3 that when 3.0 mol/L ≤ 
Curea ≤ 4.0mol/L, the molecular conformation of α-Chy very tremendously changes, and when ≥ 4.0 mol/L, the two values for N state is almost the same as that its M state. Compared to 
Figure 2 where the optical spectrometry only qualitatively provides the differences between N and M states, the 
Z and log
I here can provide quantitative information and characterization.
  4.5. Linear relationship between logI and Z under various Curea
From 
Figure 4, the curve profile of the plot of 
Z of α-Chy in N state 
vs Curea has almost the same style as that of of log
I of α-Chy 
vs Curea. The same instance is also seen for α-Chy in M state. It should be tested whether a quantitative relationship really to exist between log 
I and 
Z for a protein in M state as 
Curea changes. It is seen that for its N state shown in 
Figure 5, a good linear relation with R = 0.9982 exists in 
Figure 5A and the quantitative expression is indicated by 
Equation (6). However, for M state, as shown in 
Figure 5B, there is not this linear relation.
The obtained 
j value 1.69 is very close to its theoretical value 1.73 (25 °C) [
10,
12]. This fact indicates that although the changes in molecular conformation of protein in N state are large as 
Curea changes from 0 to 5.0 mol/L, this change is flexible and reversible, resulting in no variation of the non-selective force character between α-Chy molecule and STHIC. In other words, hydrophobic amino acid residues still contact to the STHIC with a moderate hydrophobicity. By contrast, in the same chromatographic condition, the retention of α-Chy in M state is a different situation. Suppose the curve shown in 
Figure 4B is divided into two parts: (1) the four points continuous on the bottom of this curve which correspond to when 
Curea is 6.0, 5.0, 4.0, 3.0 mol/L; (2) the top four points of this curve when 
Curea is 3.0, 2.0, 1.0, and 0.0 mol/L. It is shown from 
Figure 4B, the part (1) shows a straight line with R 0.9984 marked with a dash line and it can be expressed by 
Equation (7):
Compared to the 
j value of 1.69 for the N state indicated in 
Equation (6), the obtained 
j value 1.79 for the M state here is also close to 1.73. The former is 0.04 less than 1.73, indicating the affinity of protein to STHIC (actually the ratio of log
I/
Z) becomes weak due to the presence of the second displacer, urea, while the latter is 0.06 greater than 1.73, showing the affinity of the protein to STHIC becomes greater due to the stronger non-selective interaction between them. However, the fact that both close to its theoretical value indicates that the interaction forces between STHIC and α-Chy are totally dominated by the same interaction force. For the non-linear part of the curve shown in 
Figure 5B, as 
Curea from 0.0~3.0 mol/L (top three points in 
Figure 5B), α-Chy in the M state interacts with the STHIC to be selective.
This phenomenon suggests that α-Chy in the M state at low urea concentrations tends to refold to its N state and to re-change its molecular structure, making some strong amino acid residues on the surface of α-Chy molecules enter its inside. However, the refolding of α-Chy only proceeds half-way, both hydrophilic and hydrophobic amino acid residues contact to the STHIC and establishes a new equilibrium between non-selective and selective interaction forces at the liquid-solid interface, resulting in not making protein refolding successfully.
 5. Conclusions
(1) With a hydrophobic interaction chromatography (HIC), a new approach for distinguishing the native (N) state of a protein from its unfold (M) state is presented.
(2) With HIC, a stable M state of the urea-denatured α-chymotrypsin (α-Chy) is prepared and characterized by three parameters of contact surface area (Z, S), affinity (logI), and the character of interaction force (j), of stoichiometric displacement theory for retention (SDT-R).
(3) By comparing the magnitude and type of the three parameters of SDT-R the N and M states some very useful information in proteomics and protein separation in nature may be provided and the existing states of proteins, N, M, U states, may also be distinguished with each other.