You are currently viewing a new version of our website. To view the old version click .
Biomolecules
  • Article
  • Open Access

21 November 2025

Production, Purification, and Characterization of a Novel Cysteine-Rich Anticoagulant from the Medicinal Leech and the Functional Role of Its C-Terminal Motif

,
,
,
,
,
,
,
,
and
1
Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Malaya Pirogovskaya, 1a, 119435 Moscow, Russia
2
Center for Genetic Reprogramming and Gene Therapy, Lopukhin Federal Research and Clinical Center of Physical-Chemical Medicine of Federal Medical Biological Agency, Malaya Pirogovskaya, 1a, 119435 Moscow, Russia
3
Moscow Center for Advanced Studies, 20 Kulakova Str., 123592 Moscow, Russia
*
Author to whom correspondence should be addressed.
This article belongs to the Section Biomacromolecules: Proteins, Nucleic Acids and Carbohydrates

Abstract

The saliva of the medicinal leech Hirudo medicinalis contains a wide range of biologically active compounds, including multiple anticoagulants. Previously, we identified a novel cysteine-rich anticoagulant protein (CRA) from leech saliva and produced it recombinantly in Escherichia coli, demonstrating its potential as a basis for new anticoagulant drugs. In this study, we developed an optimized procedure for scalable production and purification of recombinant CRA. The purified protein was analyzed for common contaminants originating from E. coli, such as endotoxins, bacterial proteins, and DNA, and its anticoagulant properties were evaluated using standard clotting assays. Across three independent experiments, the yield of purified CRA ranged from 3.7 to 5.5 mg per liter of bacterial culture, with impurity levels per milligram of protein ranging from 7.1–31.2 ng of bacterial proteins, 1.2–15.1 ng of DNA, and 60–1445 EU of endotoxins. The purified CRA displayed electrophoretic and chromatographic homogeneity and retained strong anticoagulant activity. Additionally, a truncated form of CRA lacking the C-terminal region was produced and characterized. This variant lost membrane affinity and showed altered activity profiles, with higher thrombin time activity but reduced prothrombin time and activated partial thromboplastin time activities compared with the full-length protein.

1. Introduction

Cardiovascular diseases (CVDs) are a leading cause of death worldwide, with an estimated ~19.8 million deaths attributable to CVDs in 2022; the majority of these deaths are due to ischemic heart disease and stroke [,]. A substantial proportion of CVDs result from thrombotic processes (for example, occlusive arterial thrombosis leading to myocardial infarction or ischemic stroke), and microvascular thrombosis has also been implicated in severe forms of disease such as COVID-19–associated coagulopathy [,,]. Currently, clinical practice employs several groups of effective anticoagulants, including indirect thrombin inhibitors such as warfarin, direct oral anticoagulants (DOACs) such as apixaban and edoxaban, as well as heparins and their low-molecular-weight derivatives []. Although these drugs are widely used and have proven efficacy, each class has notable limitations. Warfarin requires strict dietary control and frequent monitoring of coagulation parameters [,]; heparins can cause heparin-induced thrombocytopenia and bleeding complications []; and DOACs, while more convenient, may still lead to bleeding risks and are costly or contraindicated in patients with impaired renal or hepatic function [,]. These limitations highlight the need for new anticoagulant agents with improved safety profiles, novel mechanisms of action, or better pharmacokinetic properties. One potential source of such molecules is hematophagous animals, whose saliva typically contains a set of anticoagulants required for blood feeding. The therapeutic use of the medicinal leech (Hirudo medicinalis), a prominent blood-feeding species, dates back several thousand years. The saliva of the medicinal leech contains several anticoagulants, the most well-known of which is hirudin, a polypeptide competitive thrombin inhibitor [,]. In addition, leech saliva contains other polypeptide anticoagulants. Previously, we performed an omics analysis of the salivary gland cells of the medicinal leech and identified several novel potential anticoagulants []. Based on these data, we obtained a set of recombinant proteins and demonstrated that one of them—a cysteine-rich anticoagulant protein (CRA)—exhibited strong hemostasis inhibition in in vitro assays [].
The mature form of CRA, after signal peptide cleavage, consists of 139 amino acid residues, 22 of which are cysteines. The closest known homolog of CRA is antistasin from the Mexican leech (Haementeria officinalis) []. Antistasin (ATS) and CRA share 39 identical amino acid residues. Both proteins display a characteristic pattern of cysteine residue distribution []. In the case of ATS, all 20 cysteine residues are involved in disulfide bond formation []. It is reasonable to assume that in CRA, all or nearly all cysteine residues also participate in disulfide bond formation.
We previously established that CRA exhibits a pronounced ability to inhibit coagulation in in vitro clotting assays [], which are commonly used in clinical practice to evaluate the hemostatic system based on fibrin clot formation time []. To more precisely assess this effect, we modified the standard methodology of three clinical clotting assays for testing anticoagulant solutions rather than patient serum parameters. We employed activated partial thromboplastin time (aPTT), prothrombin time (PT), and thrombin time (TT) assays []. In all three assays, CRA demonstrated a clear dose-dependent inhibition of coagulation. Notably, in the TT assay, its effect was weaker than that of hirudin (Hir), whereas in PT and especially aPTT assays, CRA was significantly more potent than Hir. This suggests a difference in the mechanism of hemostasis inhibition between the two proteins.
Previously, we obtained CRA using a preliminary simplified protocol []. It included bacterial cell lysis, washing of the insoluble fraction, solubilization in buffer containing 8 M urea, metal chelate chromatography under denaturing conditions, and dialysis. This approach allowed isolation of small amounts of protein sufficient for preliminary analysis. The purity of the preparation was not controlled, and electrophoretic analysis revealed numerous impurities. Gel filtration analysis indicated a high content of high-molecular-weight aggregates. Protein losses during dialysis reached 80–90%, and the total yield did not exceed 1–2 mg per liter of bacterial culture. Moreover, purification from such contaminants as bacterial endotoxins, E. coli DNA, and host proteins was insufficient, which is critical for testing CRA in in vivo experiments and for its potential use in medical formulations. In this study, we describe a high-yield method for obtaining the cysteine-rich anticoagulant from the medicinal leech with a high degree of purity.

2. Materials and Methods

2.1. The Construction of the Expression Plasmid

The plasmid carrying the recombinant gene encoding CRA was constructed using the PIPE method []. The plasmid pET-22b(+) (Novagen, Darmstadt, Germany) was amplified by PCR in its entirety, except for the region encoding the pelB signal peptide. The oligonucleotides PETPIPE-1N (5′-CATATGTATATCTCCTTCTTAAAGTTAAAC-3′) and PETPIPE-2N (5′-CTCGAGCACCACCACCACCACCACTGAGATC-3′) were used for this purpose. The DNA fragment encoding the mature form of CRA without the signal peptide was amplified by PCR using oligonucleotides CRApipf (5′-GGAGATATACATATGGACGAAGAAAACTATGAAGATG-3′) and CRApipr (5′-GTGGTGGTGCTCGAGATTCTTTTTCCCTTTGAAGAAATTC-3′). The cDNA from H. medicinalis salivary gland cells, previously obtained from laser-microdissected material [], served as the amplification template. The resulting amplicons were purified by agarose gel electrophoresis and mixed in a 1:1 mass ratio. The resulting mixture, without ligation, was used to transform Escherichia coli TOP10 cells (Invitrogen, Thermo Fisher Scientific, Waltham, MA, USA), which were plated on LB agar plates containing ampicillin (150 mg/L). Colonies carrying plasmids with the target DNA fragment were identified by PCR. Selected colonies were cultured, and plasmid DNA was isolated from the biomass. The structure of the DNA insert in the plasmids was verified by Sanger sequencing on an automated sequencer. As a result, the plasmid pET-hmCRA was obtained. This plasmid carries the recombinant gene under the control of the T7 promoter, encoding the mature form of CRA (without the signal peptide) fused with a C-terminal His-Tag (Figure S1).
The DNA fragment encoding the truncated form of CRA lacking the C-terminal region (CRA-cut) was obtained by PCR using oligonucleotides T7 (5′-TAATACGACTCACTATAGGG-3′) and CRAcutR (5′-GTGCTCGAGATTATTTTCTCCTTTGCAGATGC-3′). The plasmid pET-hmCRA served as the amplification template. The resulting fragment was sequentially treated with the restriction endonucleases XbaI (Thermo Fisher Scientific, Waltham, MA, USA) and XhoI (Thermo Fisher Scientific, Waltham, MA, USA). The plasmid pET-22b(+) was treated in the same manner. After agarose gel purification, the obtained vector and insert fragments were mixed at a molar ratio of 1:8 in ligation buffer supplemented with T4 bacteriophage ligase (Thermo Fisher Scientific, Waltham, MA, USA) and incubated at room temperature for one hour. The resulting mixture was used to transform E. coli TOP10 cells, which were plated on LB agar containing ampicillin (150 mg/L). Similarly to the construction of the pET-hmCRA plasmid, clones carrying plasmids with the target DNA fragment were selected, and the inserted DNA encoding the protein of interest was sequenced. As a result, the plasmid pET-hmCRA-cut was obtained, which encodes CRA lacking 28 C-terminal amino acid residues. In all other respects, it is identical to pET-hmCRA (Figure 1).
Figure 1. Full amino acid sequence of the recombinant CRA. Amino acid residues additionally introduced relative to the native protein are underlined; residues deleted in the truncated CRA-cut protein are shown in bold; cysteine residues are highlighted in green.

2.2. Cultivation of the Producer Strain

E. coli BL21(DE3) gold cells (Novagen, Darmstadt, Germany) were transformed with the plasmid pET-hmCRA or pET-hmCRAcut. Transformants were plated on LB agar plates containing ampicillin (150 mg/L) and incubated for 16 h at 37 °C. A single bacterial colony was transferred into an LB medium containing ampicillin (150 mg/L) and cultivated in a shaker incubator at 37 °C for six hours. The resulting culture was used to inoculate an autoinduction TB medium (24 g/L yeast extract, 12 g/L tryptone, 4 g/L glycerol, 0.17 M KH2PO4, 0.72 M K2HPO4, 150 mg/L ampicillin, 1 g/L glucose, 0.5 g/L lactose) at a ratio of 1:100. Culture flasks (1.5 L) containing 250 mL of inoculated medium were incubated for 18 h in a shaker incubator at 220 rpm and 37 °C. Cells from the overnight culture were harvested by centrifugation for 15 min at 5000× g. The pellet was stored at −20 °C.

2.3. Isolation of CRA

The cell pellet obtained in the previous step was resuspended in US buffer (PBS, 0.1% Triton X-100, 0.1 g/L lysozyme, 5 mM EDTA) at a ratio of 100 mL of US buffer per 1 L of initial culture and disrupted using a Branson 250 Sonifier (Branson Ultrasonics, Danbury, CT, USA) at 22 kHz for 10 min on ice. The lysate was centrifuged for 15 min at 15,000× g. The resulting pellet was resuspended in 2% Triton X-100 using a sonifier for 1 min on ice and centrifuged again for 5 min at 10,000× g. The detergent washing step was repeated twice. The washed pellet was solubilized in buffer AM (8 M urea, 20 mM Tris-Cl, 500 mM NaCl, 10 mM imidazole, pH 7.5) at a volume of 1/10 of the initial bacterial culture using sonication at 22 kHz for 5 min on ice. The insoluble residue was removed by centrifugation for 30 min at 50,000× g. The obtained solution was loaded onto an HR16/20 column containing 10 mL of Ni Sepharose Fast Flow (Cytiva, Marlborough, MA, USA) equilibrated with buffer AM. After loading, the column was washed with 10 mL of AM buffer, then with 50 mL of wash buffer BM (8 M urea, 20 mM Tris-Cl, 500 mM NaCl, 25 mM imidazole, pH 7.5), and eluted with buffer EM (8 M urea, 20 mM Tris-Cl, 500 mM NaCl, 500 mM imidazole, pH 7.5).
Refolding of CRA was performed by rapid 20-fold dilution in 20 mM Tris-Cl, pH 7.5, at 4 °C. The cooled Tris-Cl solution was poured into a beaker and placed on a magnetic stirrer for vigorous mixing. Then, the CRA protein solution in EM buffer obtained in the previous step was quickly added. The beaker was sealed with parafilm and incubated for 16–18 h at 4 °C without stirring. The precipitate formed was removed by centrifugation for 30 min at 50,000× g.
For purification, the refolded protein solution was applied to a Tricorn10/50 column containing 4.5 mL of DEAE Sepharose FF (Cytiva, Marlborough, MA, USA). The column was pre-equilibrated with 20 mM Tris-Cl, pH 7.5, and washed with 10 mL of the same buffer after sample loading. Elution was carried out with a linear NaCl gradient from 20 mM Tris-Cl, pH 7.5, to 20 mM Tris-Cl, 1 M NaCl, pH 7.5, over 50 mL. Chromatography was performed using an NGC chromatograph (Bio-Rad, Hercules, CA, USA) at a flow rate of 2 mL/min. Process control and fraction collection were based on absorbance at 280 nm. The presence of the target protein in the fractions was confirmed by SDS-PAGE with Coomassie G-250 staining. Fractions containing CRA were combined and dialyzed against 100 volumes of water. The resulting protein solution was aliquoted, frozen, and lyophilized for 20 h. Dried samples were stored at −20 °C.

2.4. Isolation of CRA-Cut

The cell pellet was resuspended in buffer AN (20 mM Tris-Cl, 500 mM NaCl, 10 mM imidazole, pH 7.5) at a ratio of 100 mL per 1 L of culture and disrupted using a Branson 250 Sonifier (Branson Ultrasonics, Danbury, CT, USA) at 22 kHz for 10 min on ice. The lysate was centrifuged for 15 min at 15,000× g. The pellet was discarded, and the supernatant was heated to 65 °C and centrifuged for 10 min at 15,000× g. The pellet of denatured proteins was discarded, and the supernatant was centrifuged again for 30 min at 50,000× g. The resulting solution was applied to an HR16/20 column containing 10 mL of Ni Sepharose Fast Flow (Cytiva, Marlborough, MA, USA) equilibrated with buffer AN. After loading, the column was washed with 10 mL of AN buffer, then 50 mL of wash buffer BN (20 mM Tris-Cl, 500 mM NaCl, 25 mM imidazole, pH 7.5), and eluted with buffer EN (20 mM Tris-Cl, 500 mM NaCl, 500 mM imidazole, pH 7.5). Fractions containing the protein of interest (POI) were combined and diluted fivefold with 20 mM Tris-Cl, pH 7.5. CRA-cut was further purified using IEX chromatography as described for full-length CRA. The purified CRA-cut solution was stored at −20 °C.

2.5. Protein Concentration Determination

Lyophilized protein was dissolved in water for 30 min with stirring. CRA concentration was determined by measuring absorbance at 280 nm. The calculated molar and mass extinction coefficients for CRA were 19,140 M−1cm−1 and 1.11 mg−1cm−1, respectively. For CRA-cut, the calculated molar and mass extinction coefficients were 13,210 M−1cm−1 and 0.96 mg−1cm−1, respectively.

2.6. Clotting Assays

The APG4-03-Ph (EMCO LLC, Moscow, Russia) hematology analyzer was used for coagulation assays. All measurements were performed using control plasma (Renam, Moscow, Russia). Activated partial thromboplastin time (aPTT), prothrombin time (PT), and thrombin time (TT) were determined as described previously []. Each sample was measured in four replicates in parallel using four coagulometer measurement cells. CRA concentrations of 1 µM were used for aPTT and PT assays, and 1 µM and 5 µM for TT assays. Concentrations of anticoagulant proteins are indicated for the initial sample. In the reaction mixture, the concentration is six times lower in the case of aPTT (25 µL from 150 µL), four times lower in the case of TT (50 µL from 200 µL), and three times lower in the case of PT (50 µL from 150 µL).

2.7. Determination of Impurities in Protein Samples

The content of E. coli proteins in CRA samples was determined by ELISA using the Enzyme-linked Immunosorbent Assay Kit for Escherichia coli Host Cell Protein Residue (Cloud Clone Corp., Katy, TX, USA). E. coli DNA content was determined by quantitative real-time PCR using the E. coli Host Cell DNA Quantitation Kit (Biolabmix, Moscow, Russia). Endotoxin levels were measured using the chromogenic endpoint LAL assay (BioEndo, Chengdu, China). All measurements were performed according to the manufacturers’ instructions.

2.8. Analytical Gel Filtration

Analytical gel filtration was performed using an NGC chromatograph (Bio-Rad, Hercules, CA, USA) on a Tricorn 10/300 column (Cytiva, Marlborough, MA, USA) packed with Superdex 200 (Cytiva, Marlborough, MA, USA). The column was equilibrated with PBS (20 mM sodium phosphate, 8 g/L NaCl, pH 7.4). Calibration was performed using the LMW Gel Filtration Calibration Kit (Cytiva, Marlborough, MA, USA). The flow rate was 1 mL/min, and protein elution was monitored at 280 nm.

2.9. Protein Binding with Lipid Vesicles

Giant unilamellar vesicles consisting of Chol:PC:PE:PS (1:3:2:2) were used to mimic endothelial cell membranes []. The vesicles were prepared and kindly provided by Dr. P.V. Bashkirov. 150 µL of liposomes (1 mg/mL) were mixed with 50 µL of protein solution (1 mg/mL) and incubated for 15 min at room temperature. The mixture was centrifuged for 10 min at 12,000× g, and the supernatant was transferred to another tube. The pellet was washed with saline and dissolved in 15 µL of 1% SDS. Samples were analyzed by SDS-PAGE and stained with Coomassie G-250. From the supernatant fraction, 5 µL was loaded onto the gel, while the entire pellet fraction was loaded in full.

2.10. Modeling of the CRA Spatial Structure

The AlphaFold 3 [] software was used for protein structure prediction. The AlphaFold protein model was visualized with UCSF ChimeraX 1.7 [].

2.11. Circular Dichroism (CD) Spectroscopy

For CD spectroscopy-based analysis, the proteins were diluted to 0.2 mg/mL in H2O. The CD spectra were registered on a Chirascan VX Spectrophotometer (Applied Photophysics Ltd., Leatherhead, UK) in cuvettes of 0.05 cm optical path in two repeats and averaged after baseline subtraction. The spectra were analyzed using CDNN software version 2.1 (Applied Photophysics Ltd., Leatherhead, UK) with default settings.

3. Results

3.1. Isolation of Recombinant Proteins

During heterologous expression in E. coli, CRA actively accumulated in the cells and remained in the pellet fraction after lysis. We performed three completely independent CRA isolations—from bacterial transformation to lyophilization. Each isolation was carried out from 1.5 L of bacterial culture. The resulting samples were analyzed for anticoagulant activity, as well as for the content of producer strain proteins, DNA, and endotoxins. In addition, the oligomeric status of the proteins was determined using analytical gel filtration.
As a result of three independent isolations, 5.62 mg, 8.2 mg, and 8.3 mg of CRA were obtained. SDS-PAGE analysis demonstrated electrophoretic homogeneity of the samples (Figure 2). Upon dissolution of the lyophilized protein, it was completely solubilized, which was confirmed by the absence of precipitate after centrifugation and by the equality of protein concentrations before lyophilization and after reconstitution.
Figure 2. Purification of CRA by ion-exchange chromatography. (A)—chromatographic profile: the x-axis represents elution volume, left y-axis represents optical density at 280 nm, and right y-axis represents conductivity. Fractions are indicated by shaded areas and labeled above. (B)—SDS-PAGE analysis of fractions with Coomassie G-250 staining. L—washed cell pellet lysate; 1–8—chromatographic fractions, numbering corresponds to the chromatogram above. The predicted molecular weight of the CRA protein is 17.2 kDa.
CRA-cut, in contrast to CRA, was detected in the soluble fraction of the E. coli lysate. After optimizing the purification conditions, we carried out a preparative purification of CRA-cut from 1.5 L of bacterial culture (Figure S2). As a result, 12.7 mg of purified CRA-cut was obtained.

3.2. Determination of Activity

The results of aPTT, PT, and TT assays are shown in Table 1. For each point, four independent measurements are conducted; the table presents the mean value with the standard deviation. The p-values for multiple group comparisons are provided in the Supplementary Materials (Table S1). In all tests, the time of the measured groups significantly differs from the control groups without anticoagulants. It is also worth noting the presence, in a number of cases, of significant differences in activity between the three CRA preparations, although ideally the values should be identical. The most active protein preparation, CRA P1, showed significantly longer clotting times than the other two preparations in both the aPTT and PT assays. Only in the TT assay the difference between CRA P1 and CRA P3 was not statistically significant. In the TT assay preparation CRA P3 also showed no significant difference from CRA P2. These results may indicate the presence of real differences in specific activity between individual protein preparations.
Table 1. Determination of the activity of purified CRA using aPTT, PT, and TT tests.

3.3. Determination of Impurities

Three independent CRA preparations (P1–P3) were obtained under identical expression and purification conditions, and the levels of E. coli host cell proteins, DNA, and lipopolysaccharides (LPS, endotoxins) were determined (Table 2).
Table 2. Content of E. coli proteins, DNA, and lipopolysaccharides (endotoxins) in purified CRA samples.
The total yield of CRA ranged from 5.6 mg to 8.3 mg per preparation, indicating reproducible expression and purification efficiency. The content of E. coli host cell proteins was low, varying between 7.1 ± 0.4 ng/mg and 31.2 ± 1.9 ng/mg, which corresponds to a high level of purification. Residual E. coli DNA was detected in the range of 1.2 ± 0.1–15.1 ± 2.4 ng/mg, with some variation between preparations. The endotoxin (LPS) content showed the largest differences, from 60.0 ± 2.8 EU/mg to 1445 ± 102 EU/mg, despite identical purification conditions. This variability likely reflects biological differences between production batches rather than procedural inconsistencies. Overall, the obtained CRA preparations demonstrated consistent yields and a high degree of purity, although the variation in endotoxin and DNA levels indicates the need for careful monitoring of these parameters during production.

3.4. Analytical Gel Filtration

Gel filtration of all three samples yielded a single peak (Figure 3), with apparent molecular masses of 24.5 kDa, 24.9 kDa, and 25.1 kDa, respectively.
Figure 3. Gel filtration profile of CRA refolded by rapid dilution and purified by DEAE-Sepharose chromatography using a Superdex 200 column. The x-axis represents elution volume; the y-axis shows optical density at 280 nm.

3.5. Binding of Proteins to Lipid Vesicles

The binding of CRA, CRA-cut, and BSA (control) to lipid vesicles was investigated. It was found that CRA effectively binds to vesicles and sediments with them during centrifugation, whereas CRA-cut was not detected in the pellet and remained entirely in the supernatant (Figure 4).
Figure 4. Binding of proteins to lipid vesicles. SDS-PAGE analysis (Laemmli) with Coomassie G-250 staining. M—molecular weight markers; 1—vesicles without protein; 2—BSA; 3—CRA; 4—CRA-cut; supernatant and pellet fractions are labeled on the gel. The predicted molecular weight of the CRA protein is 17.2 kDa, and that of CRA-cut is 13.8 kDa.

3.6. Modeling of the CRA Spatial Structure

The overall architecture of CRA is similar to that of antistasin (Figure 5A). The pLDDT plots and PAE matrix are presented in the Supplementary Materials (Figure S5). It has two clearly defined homologous structural domains, each cross-linked by five disulfide bonds (Figure 5B). However, there are two notable differences. First, the interdomain spacer in CRA is much shorter than in antistasin and consists of only three amino acid residues with proline in the center—Ala-Pro-Arg. In antistasin, the interdomain spacer also contains a proline residue in the middle but consists of eight residues (Arg-Leu-Glu-Pro-Met-Lys-Ala-Thr). This likely results in a more rigid relative fixation of the domains in the CRA molecule. Second, CRA possesses a large helical C-terminal motif (CTM). Furthermore, due to the alternation of residues in the primary structure, the sides of the helix have different properties. Predominantly Lys residues are grouped on one side, giving it a pronounced positive charge. On the opposite side, primarily aromatic hydrophobic groups are clustered (Figure 5C,D). The base of the CTM contains a disulfide bond (Cys113-Cys116) and Pro125. Such a motif apparently ensures a specific orientation of the CTM relative to the CRA molecule.
Figure 5. Visualization of the CRA spatial structure modeling using AlphaFold3 []. (A)—Overall polypeptide chain fold. Two well-defined domains and a separate C-terminal helical motif (CTM) are visible. (B)—Comparison of the structure of the two domains. (C,D)—Structure of the CTM in two projections. Protein model was visualized with UCSF ChimeraX 1.7 [].

3.7. Circular Dichroism (CD) Spectroscopy

CRA and CRA-cut show similar CD spectra, which are provided in the Supplementary Materials (Figure S3). Both spectra have a single negative peak at a wavelength of approximately 200 nm. This indicates a predominance of loops and coils in the protein structure, with a lack of significant amounts of regular secondary structure elements such as alpha-helices and beta-sheets [,]. This result is entirely consistent with the structure of anistasin, which lacks helices and sheets and consists mainly of loops stabilized by numerous disulfide bonds []. Bdellastasin, a single-domain homolog of antistasin, has a CD spectrum very similar to that of CRA, for both the native and recombinant protein [].

4. Discussion

The cysteine-rich anticoagulant (CRA) of the medicinal leech contains a large number of cysteine residues, which is reflected in its name. Antistasin, the closest homolog of CRA for which a spatial structure has been determined [], also contains a high number of cysteine residues, all of which participate in disulfide bond formation. ANS possesses a unique fold that lacks canonical regular elements of secondary structure, but has numerous loops stabilized by disulfide bridges at their bases, which maintain its structure. Such structural features do not inspire optimism when expressing the recombinant protein in E. coli. The intracellular environment of E. coli has a reducing redox potential, which greatly impedes disulfide bond formation [], and cytoplasmic proteins with disulfide bonds are virtually absent in E. coli []. In general, obtaining active recombinant proteins containing multiple disulfide bonds in E. coli is a rather challenging task [,,]. However, in the case of CRA, we were able to obtain an active protein without employing sophisticated methods or complicated approaches. The reason undoubtedly lies in the strong intrinsic ability of CRA to form its tertiary structure during in vitro folding. Previously, we only described a crude isolation of CRA using metal-chelate chromatography without additional purification []. We performed refolding by dialysis, which led to very high losses (80–90%) and a final yield of the target protein of no more than 1–2 mg per liter of bacterial culture. First, we optimized new refolding conditions. We used dialysis and rapid dilution at pH values from 5 to 10 at room temperature and +4 °C. As a result, we found that the best result is achieved with rapid dilution using a cold 20 mM TrisCl pH 7.5 solution. Under these conditions, no visible protein precipitate forms and almost all the protein remains in solution. Second, we introduced an additional purification step using ion-exchange chromatography (IEX). The calculated isoelectric point for CRA is 5.7. Based on this, we performed chromatography on two anion-exchange sorbents—DEAE-sepharose and Q-sepharose—with elution by a linear NaCl gradient. The results were similar, but in the case of DEAE-sepharose, elution occurred at lower NaCl concentrations, so we used this option thereafter. The IEX step allowed us to combine additional protein purification with its concentration after refolding by rapid dilution. Following this, we performed three independent purifications of CRA and measured the activity of the obtained samples as well as the levels of three types of impurities: E. coli proteins, E. coli DNA, and endotoxins. The level of contaminating proteins was low and generally within acceptable pharmacological limits (which is commonly targeted below 100 ng per mg of final product), although this parameter did not strictly indicate by FDA and depend in each case on host cell type, the immunogenicity of specific host cell proteins and the administered dose []. The level of lipopolysaccharides was relatively low but showed significant variation among the samples. This parameter requires specific monitoring during CRA purification and, most likely, additional purification steps, for example, using polymyxin-Sepharose [,]. The level of host DNA was relatively high [], which may also require further purification or adjustment of chromatographic conditions if CRA is to be used internally as an anticoagulant. For research purposes, however, the level of purity can be considered satisfactory. Thus, the proposed method of CRA production provides a high degree of purity and allows additional purification steps to be planned depending on the intended use of the protein.
In all three CRA preparations, the clotting assays demonstrated a high level of anticoagulant activity. However, the variability among samples was quite significant (Table 1, Supplementary Materials (Table S1)). We assume this variability is related to protein refolding conditions. Although we attempted to reproduce the same conditions in all three cases as precisely as possible, a number of uncontrolled parameters remain. One of them may be the degree of oxygen saturation of the dilution buffer used for rapid dilution. Other factors that could affect the results include the duration of purification, storage time of the frozen biomass (ranging from several hours to several days), and other aspects that became apparent only after analyzing the anticoagulant activity of the samples. Therefore, in the case of large-scale production, we believe that the refolding stage requires particularly careful control.
During optimization of the cultivation and purification procedures for CRA, we noticed some peculiarities. When analyzing a fresh culture during growth by SDS-PAGE, CRA was always detected entirely in the soluble fraction of the cell lysate. However, during preparative lysis, the protein shifted to the pellet fraction. This may indicate either protein denaturation or its association with sedimented cell membranes, as we previously hypothesized []. Supporting the latter assumption is the fact that washing the pellet with Triton X-100 gradually released CRA into the soluble fraction. For this reason, we chose two washing steps, at which point target protein losses were still minimal. Moreover, although difficult to formalize, the washed pellet from the lysate did not resemble classical inclusion bodies, which typically form a milky-white suspension. In this case, the pellet was much more similar to a membrane fraction.
We made several attempts to isolate a soluble form of CRA to avoid the denaturation–renaturation process. CRA could not be purified from the soluble lysate fraction at all. CRA could be purified from the Triton X-100 wash fractions, but with low and unstable yield. The average activity of the obtained protein in aPTT assays did not differ from that of the refolded protein. Therefore, we ultimately used the purification scheme described above.
Unlike antistasin, CRA possesses, in addition to two cysteine-rich domains, an additional C-terminal segment consisting of 28 amino acid residues, counting from the first of two cysteines present in this region []. This motif is composed mainly of alternating hydrophobic residues and ten lysine residues. We hypothesized that this motif mediates CRA binding to cell membrane lipids. The results of CRA folding modeling using AlphaFold are consistent with our predictions based on the analysis of its amino acid sequence. The protein’s structure is similar to antistasin (Figure 5A). It is intriguing that its two domains exhibit mirror symmetry (Figure 5B). However, it is important to remember that this is the result of in silico modeling, not supported by experimental data, and should be treated with a certain degree of caution. The CTM forms a structure distinct from the core of the protein molecule. Its orientation is determined by the disulfide bond and the proline residue present at the base of the motif. The asymmetry of the CTM helix sides results in one surface carrying a positive charge due to Lys residues, while hydrophobic aromatic side groups are clustered on the opposite surface. We hypothesize that this particular feature of the CTM enables its binding to membrane phospholipids. The logical next step was to obtain a truncated form of CRA lacking the C-terminal motif (CRA-cut) and to examine how this modification affected its properties. The results were somewhat surprising. Regarding membrane affinity, our hypothesis was confirmed: CRA-cut accumulated and was purified from the water-soluble fraction of the E. coli lysate. The yield of the protein was higher than that of the full-length form, mainly due to product losses during refolding. The full-length CRA bound efficiently to lipid vesicles, whereas CRA-cut did not show such affinity (Figure 4). Unexpectedly, the results of the coagulation assays revealed that CRA-cut performed worse than CRA in aPTT and PT tests, but significantly better in the TT test (Table 1).
We assume that this intriguing effect may be due to the C-terminal motif acting as a modifier of CRA binding specificity toward inhibited proteases of the coagulation cascade, similar to how staphylokinase alters plasmin substrate specificity []. Certain difficulties arise in the interpretation because coagulation assays are integral tests. The aPTT test is used to measure coagulation parameters during activation of the intrinsic pathway, the PT test assesses the extrinsic pathway, and the TT test measures fibrinogen activation by thrombin. A thrombin inhibitor will show activity in all three tests, as observed for hirudin. CRA-cut increases TT while simultaneously decreasing aPTT and PT compared to CRA. This effect can only be explained by the increased affinity of the truncated form for thrombin and a concomitant strong decrease or loss of affinity for other proteases upstream in the coagulation cascade. Unfortunately, the specific components of the hemostasis cascade, apart from thrombin, that are affected by CRA remain unknown. To clarify this issue, we performed chromogenic assays for the inhibition of factors Xa, XIa, XIIa, and IIa (thrombin). The measurement conditions are described in the Supplementary Materials (Figure S4). It was completely unexpected that CRA showed no inhibition in any of the tests. One could conclude from this that CRA is not a direct inhibitor of these factors; however, in the case of thrombin, there is a strong argument against such a conclusion. We have previously shown that in a system containing purified thrombin and fibrinogen, CRA effectively inhibits clot formation []. We repeated this experiment, as well as the clotting test for factor Xa inhibition (Supplementary Materials (Figure S4A)). The obtained results confirmed our previous findings. In clotting tests, CRA demonstrates inhibition of both thrombin and factor Xa. In the thrombin inhibition clotting test, CRA and CRA-cut showed identical results (Supplementary Materials (Figure S4B)). In the factor Xa inhibition test, CRA showed higher activity (Supplementary Materials (Figure S4A)). However, it is important to consider that these tests are not direct. In the factor Xa inhibition test, the effector is also thrombin, which is itself inhibited by CRA. Furthermore, these tests show weak dose-dependence (Supplementary Materials (Figure S4)). Consequently, the informativeness of such measurements is extremely limited. Thus, two hypotheses are possible. The first is that the chromogenic substrates used are able to access the active site of the coagulation factors during their competitive inhibition by CRA. This could be explained by the much lower molecular mass of these substrates compared to the mass of the proteins cleaved by the factors. If the contact between the inhibited enzyme and CRA is not sufficiently tight, the low-molecular-weight substrate could diffuse into the enzyme’s active site. If this is the case, then chromogenic tests using short peptide substrates are inapplicable here. The other possibility is that the inhibition mechanism is more complex than direct binding. Perhaps blood plasma or insufficiently purified fibrinogen contains some additional components necessary for CRA’s function. We currently have no experimental evidence to favor either version.
The ability of the CTM to bind membranes is also of interest. Coagulation factors themselves exhibit affinity for vesicles []. The extrinsic coagulation pathway is initiated upon vessel wall injury, partly due to the damage of epithelial cell membranes []. All these observations theoretically allow for several functional possibilities for CRA. For instance, the CTM may act as a targeting element, increasing the local concentration of CRA at the site of vessel injury or on vesicles associated with coagulation factors. It is also possible that CRA binds to platelets. Experimental confirmation of which of these scenarios is realized in vivo is currently lacking and remains a topic for further investigation.

5. Conclusions

We obtained a novel recombinant cysteine-rich anticoagulant from the leech Hirudo medicinalis and described a method for its expression and purification. The recombinant protein exhibits high anticoagulant activity in standard assays for activated partial thromboplastin time (aPTT), prothrombin time (PT), and thrombin time (TT). We also obtained a truncated form of the protein lacking the protruding C-terminal motif. The truncated form lost its affinity for lipid membranes. At the same time, its activity in aPTT and PT assays decreased, while its TT activity was significantly higher than that of the full-length protein. We assume that the C-terminal motif acts as a modifier of the binding specificity of CRA toward the proteases it inhibits.

Supplementary Materials

The following supporting information can be downloaded at https://www.mdpi.com/article/10.3390/biom15121633/s1, Figure S1: pET-hmCRA plasmid map; Figure S2: Purification of CRA-cut by ion-exchange chromatography; Figure S3: Far-ultraviolet CD-spectrum of recombinant CRA and CRA-cut; Figure S4: Activity assays of recombinant CRA and CRA-cut; Figure S5: Quality assessment of the AlphaFold-predicted CRA model: pLDDT and PAE analysis; Table S1: Statistical significance of pairwise coagulation test (aPTT, PT, TT) comparisons between CRA protein variants and control.

Author Contributions

Conceptualization, V.A.M.; methodology, V.A.M.; investigation, V.A.M., K.A.B., V.V.B., P.A.B., D.D.K., E.N.G., M.Y.S., N.R.R. and N.F.P.; writing—original draft preparation, V.A.M.; writing—review and editing, P.A.B. and V.V.B.; visualization, V.V.B., P.A.B. and V.A.M.; supervision, V.N.L.; project administration, V.N.L.; funding acquisition, V.N.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Ministry of Science and Higher Education of the Russian Federation (the Federal Scientific-technical programme for genetic technologies development for 2019–2030, agreement N◦075-15-2025-518).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Materials; further inquiries can be directed to the corresponding authors.

Acknowledgments

The authors thank Pavel V. Bashkirov for obtaining and providing the lipid vesicles. The authors thank Anna M. Varizhuk for her kind help in measuring circular dichroism spectra. This work was performed using the core facilities of the LOPUKHIN FRCC PCM “Genomics, proteomics, metabolomics” (http://rcpcm.org/?p=2806) (accessed on 12 November 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
CRACysteine-Rich Anticoagulant Protein 
EUEndotoxin Unit
CTMC-terminal motif
CRA-cutA Truncated Form of CRA Lacking the C-Terminal Motif 
ATSAntistasin
TTThrombin Time
PTProthrombin Time
aPTTActivated Partially Thromboplastin Time
HirHirudin
CholCholesterol
PCPhosphatidylcholine
PEPhosphatidylethanolamine
PSPhosphatidylserine
CDCircular dichroism
CVDCardiovascular disease
IEXIon-exchange chromatography

References

  1. Mensah, G.A.; Fuster, V.; Murray, C.J.L.; Roth, G.A. Global Burden of Cardiovascular Diseases and Risks Collaborators Global Burden of Cardiovascular Diseases and Risks, 1990-2022. J. Am. Coll. Cardiol. 2023, 82, 2350–2473. [Google Scholar] [CrossRef] [PubMed]
  2. Cardiovascular Diseases (CVDs). Available online: https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds) (accessed on 14 November 2025).
  3. Osuchowski, M.F.; Winkler, M.S.; Skirecki, T.; Cajander, S.; Shankar-Hari, M.; Lachmann, G.; Monneret, G.; Venet, F.; Bauer, M.; Brunkhorst, F.M.; et al. The COVID-19 Puzzle: Deciphering Pathophysiology and Phenotypes of a New Disease Entity. Lancet Respir. Med. 2021, 9, 622–642. [Google Scholar] [CrossRef] [PubMed]
  4. Conway, E.M.; Mackman, N.; Warren, R.Q.; Wolberg, A.S.; Mosnier, L.O.; Campbell, R.A.; Gralinski, L.E.; Rondina, M.T.; van de Veerdonk, F.L.; Hoffmeister, K.M.; et al. Understanding COVID-19-associated coagulopathy. Nat. Rev. Immunol. 2022, 22, 639–649. [Google Scholar] [CrossRef]
  5. Wadowski, P.P.; Panzer, B.; Józkowicz, A.; Kopp, C.W.; Gremmel, T.; Panzer, S.; Koppensteiner, R. Microvascular Thrombosis as a Critical Factor in Severe COVID-19. Int. J. Mol. Sci. 2023, 24, 2492. [Google Scholar] [CrossRef]
  6. Kholmukhamedov, A.; Subbotin, D.; Gorin, A.; Ilyassov, R. Anticoagulation Management: Current Landscape and Future Trends. J. Clin. Med. 2025, 14, 1647. [Google Scholar] [CrossRef] [PubMed]
  7. Tan, C.S.S.; Lee, S.W.H. Warfarin and Food, Herbal or Dietary Supplement Interactions: A Systematic Review. Br. J. Clin. Pharmacol. 2021, 87, 352–374. [Google Scholar] [CrossRef]
  8. Holbrook, A.M.; Pereira, J.A.; Labiris, R.; McDonald, H.; Douketis, J.D.; Crowther, M.; Wells, P.S. Systematic Overview of Warfarin and Its Drug and Food Interactions. Arch. Intern. Med. 2005, 165, 1095–1106. [Google Scholar] [CrossRef]
  9. Marchetti, M.; Zermatten, M.G.; Bertaggia Calderara, D.; Aliotta, A.; Alberio, L. Heparin-Induced Thrombocytopenia: A Review of New Concepts in Pathogenesis, Diagnosis, and Management. J. Clin. Med. 2021, 10, 683. [Google Scholar] [CrossRef]
  10. Gunawardena, T. Direct Oral Anticoagulants: A Review for the Non-Specialist. Hematol. Rep. 2021, 13, 9239. [Google Scholar] [CrossRef]
  11. Chen, A.; Stecker, E.A.; Warden, B. Direct Oral Anticoagulant Use: A Practical Guide to Common Clinical Challenges. J. Am. Heart Assoc. 2020, 9, e017559. [Google Scholar] [CrossRef]
  12. Müller, C.; Haase, M.; Lemke, S.; Hildebrandt, J.-P. Hirudins and Hirudin-like Factors in Hirudinidae: Implications for Function and Phylogenetic Relationships. Parasitol. Res. 2017, 116, 313–325. [Google Scholar] [CrossRef]
  13. Jo, B. The Action of Hirudin upon Thrombin. J. Physiol. 1927, 64, 47–53. [Google Scholar] [CrossRef]
  14. Babenko, V.V.; Podgorny, O.V.; Manuvera, V.A.; Kasianov, A.S.; Manolov, A.I.; Grafskaia, E.N.; Shirokov, D.A.; Kurdyumov, A.S.; Vinogradov, D.V.; Nikitina, A.S.; et al. Draft Genome Sequences of Hirudo Medicinalis and Salivary Transcriptome of Three Closely Related Medicinal Leeches. BMC Genomics 2020, 21, 331. [Google Scholar] [CrossRef] [PubMed]
  15. Manuvera, V.A.; Kharlampieva, D.D.; Bobrovsky, P.A.; Grafskaia, E.N.; Brovina, K.A.; Lazarev, V.N. New Anticoagulant Protein from Medicinal Leech. Biochem. Biophys. Res. Commun. 2024, 696, 149473. [Google Scholar] [CrossRef]
  16. Nutt, E.M.; Jain, D.; Lenny, A.B.; Schaffer, L.; Siegl, P.K.; Dunwiddie, C.T. Purification and Characterization of Recombinant Antistasin: A Leech-Derived Inhibitor of Coagulation Factor Xa. Arch. Biochem. Biophys. 1991, 285, 37–44. [Google Scholar] [CrossRef] [PubMed]
  17. Lapatto, R.; Krengel, U.; Schreuder, H.A.; Arkema, A.; de Boer, B.; Kalk, K.H.; Hol, W.G.; Grootenhuis, P.D.; Mulders, J.W.; Dijkema, R.; et al. X-Ray Structure of Antistasin at 1.9 A Resolution and Its Modelled Complex with Blood Coagulation Factor Xa. EMBO J. 1997, 16, 5151–5161. [Google Scholar] [CrossRef]
  18. Zaidi, S.R.H.; Rout, P. Interpretation of Blood Clotting Studies and Values (PT, PTT, aPTT, INR, Anti-Factor Xa, D-Dimer). In StatPearls; StatPearls Publishing: Treasure Island, FL, USA, 2025. [Google Scholar]
  19. Manuvera, V.A.; Brovina, K.A.; Bobrovsky, P.A.; Grafskaia, E.N.; Kharlampieva, D.D.; Lazarev, V.N. Method to assess the effects of bioactive compounds solutions on blood clotting. Extrem. Med. 2024, 26, 120–124. [Google Scholar] [CrossRef]
  20. Klock, H.E.; Lesley, S.A. The Polymerase Incomplete Primer Extension (PIPE) Method Applied to High-Throughput Cloning and Site-Directed Mutagenesis. Methods Mol. Biol. 2009, 498, 91–103. [Google Scholar] [CrossRef]
  21. Cansell, M.; Gouygou, J.P.; Jozefonvicz, J.; Letourneur, D. Lipid Composition of Cultured Endothelial Cells in Relation to Their Growth. Lipids 1997, 32, 39–44. [Google Scholar] [CrossRef]
  22. Jumper, J.; Evans, R.; Pritzel, A.; Green, T.; Figurnov, M.; Ronneberger, O.; Tunyasuvunakool, K.; Bates, R.; Žídek, A.; Potapenko, A.; et al. Highly Accurate Protein Structure Prediction with AlphaFold. Nature 2021, 596, 583–589. [Google Scholar] [CrossRef]
  23. Pettersen, E.F.; Goddard, T.D.; Huang, C.C.; Meng, E.C.; Couch, G.S.; Croll, T.I.; Morris, J.H.; Ferrin, T.E. UCSF ChimeraX: Structure Visualization for Researchers, Educators, and Developers. Protein Sci. 2021, 30, 70–82. [Google Scholar] [CrossRef]
  24. Linhares, L.A.; Ramos, C.H.I. Unlocking Insights into Folding, Structure, and Function of Proteins through Circular Dichroism Spectroscopy—A Short Review. Appl. Biosci. 2023, 2, 639–655. [Google Scholar] [CrossRef]
  25. Martin, S.R.; Schilstra, M.J. Circular Dichroism and Its Application to the Study of Biomolecules. Methods Cell Biol. 2008, 84, 263–293. [Google Scholar] [CrossRef]
  26. Moser, M.; Auerswald, E.; Mentele, R.; Eckerskorn, C.; Fritz, H.; Fink, E. Bdellastasin, a Serine Protease Inhibitor of the Antistasin Family from the Medical Leech (Hirudo Medicinalis)–Primary Structure, Expression in Yeast, and Characterisation of Native and Recombinant Inhibitor. Eur. J. Biochem. 1998, 253, 212–220. [Google Scholar] [CrossRef]
  27. Messens, J.; Collet, J.-F. Pathways of Disulfide Bond Formation in Escherichia coli. Int. J. Biochem. Cell Biol. 2006, 38, 1050–1062. [Google Scholar] [CrossRef] [PubMed]
  28. Stewart, E.J.; Aslund, F.; Beckwith, J. Disulfide Bond Formation in the Escherichia coli Cytoplasm: An in Vivo Role Reversal for the Thioredoxins. EMBO J. 1998, 17, 5543–5550. [Google Scholar] [CrossRef]
  29. Ke, N.; Berkmen, M. Production of Disulfide-Bonded Proteins in Escherichia coli. Curr. Protoc. Mol. Biol. 2014, 108, 16.1B.1–16.1B.21. [Google Scholar] [CrossRef]
  30. Salinas, G.; Pellizza, L.; Margenat, M.; Fló, M.; Fernández, C. Tuned Escherichia coli as a Host for the Expression of Disulfide-Rich Proteins. Biotechnol. J. 2011, 6, 686–699. [Google Scholar] [CrossRef] [PubMed]
  31. Berkmen, M. Production of Disulfide-Bonded Proteins in Escherichia coli. Protein Expr. Purif. 2012, 82, 240–251. [Google Scholar] [CrossRef] [PubMed]
  32. Bracewell, D.G.; Francis, R.; Smales, C.M. The Future of Host Cell Protein (HCP) Identification during Process Development and Manufacturing Linked to a Risk-Based Management for Their Control. Biotechnol. Bioeng. 2015, 112, 1727–1737. [Google Scholar] [CrossRef]
  33. Petsch, D.; Anspach, F.B. Endotoxin Removal from Protein Solutions. J. Biotechnol. 2000, 76, 97–119. [Google Scholar] [CrossRef] [PubMed]
  34. Anspach, F.B.; Hilbeck, O. Removal of Endotoxins by Affinity Sorbents. J. Chromatogr. A 1995, 711, 81–92. [Google Scholar] [CrossRef] [PubMed]
  35. Yang, H. Establishing Acceptable Limits of Residual DNA. PDA J. Pharm. Sci. Technol. 2013, 67, 155–163. [Google Scholar] [CrossRef]
  36. Collen, D. Staphylokinase: A Potent, Uniquely Fibrin-Selective Thrombolytic Agent. Nat. Med. 1998, 4, 279–284. [Google Scholar] [CrossRef] [PubMed]
  37. Zwaal, R.F.; Comfurius, P.; Bevers, E.M. Lipid-Protein Interactions in Blood Coagulation. Biochim. Biophys. Acta 1998, 1376, 433–453. [Google Scholar] [CrossRef]
  38. Mackman, N.; Tilley, R.E.; Key, N.S. Role of the Extrinsic Pathway of Blood Coagulation in Hemostasis and Thrombosis. Arterioscler. Thromb. Vasc. Biol. 2007, 27, 1687–1693. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Article metric data becomes available approximately 24 hours after publication online.