1. Introduction
Inclusion bodies (IBs) formed during recombinant protein production in
Escherichia coli were historically viewed as aggregates of misfolded, inactive protein. However, over the past two decades, their perception has evolved substantially. IBs are now recognised as nanostructured protein assemblies with ordered intermolecular architecture and, in many cases, retained biological function [
1,
2,
3,
4,
5]. These aggregates are often enriched in β-sheet structures and display amyloid-like features, and can be engineered to act as functional catalysts, immobilised biocatalysts, or delivery materials. Their mechanical robustness, high protein density, and ease of recovery make them attractive as catalytic IBs (CatIBs) for applications ranging from enzyme recycling to nanobiomaterial development [
6,
7]. Recent studies have further demonstrated the utility of IBs as immobilised enzyme systems for robust continuous-flow biocatalysis [
8] and for sustainable process design in flow-based bioprocesses [
9]. Their reusability and physical stability position them as promising alternatives to conventional immobilisation matrices, expanding their potential well beyond laboratory systems.
Advances in the past decade have shown that IB formation is not solely a random consequence of overexpression stress, but sequence-encoded aggregation can strongly contribute to reproducible aggregation behaviour [
1,
2,
10]. Aggregation-prone sequence motifs, particularly those promoting intermolecular β-strand interactions, contribute to ordered IB formation and can be predicted computationally using bioinformatic prediction tools such as AGGRESCAN [
10]. The ability to rationally modulate or exploit these intrinsic aggregation propensities has opened opportunities to design functional IBs with engineered properties [
4,
11]. This trend has stimulated renewed interest in exploiting aggregation-prone domains as designable scaffolds for biocatalytic nanomaterials [
11].
Parallel to this development, lytic polysaccharide monooxygenases (LPMOs) have emerged as a major enzyme class in oxidative biocatalysis. These copper-dependent monooxygenases cleave glycosidic bonds in recalcitrant polysaccharides such as cellulose and chitin through oxidative mechanisms [
12,
13,
14,
15,
16,
17]. LPMOs are classified within the auxiliary activity (AA) families of the CAZy database, with AA10 comprising predominantly bacterial enzymes. The AA10 active site contains a conserved N-terminal histidine brace coordinating a single copper ion that catalyses substrate oxidation [
14,
17]. Despite their mechanistic sophistication and industrial relevance, heterologous expression of LPMOs in
E. coli is often challenging. LPMO folding is strongly influenced by coordinated copper loading and periplasmic targeting, and disruption of these processes has been linked to misfolding, instability, or aggregation in heterologous hosts [
17,
18,
19,
20].
Our previous work showed that the N-terminal AA10 domain of
Caldibacillus cellulovorans (Ccel
p40), originally described as part of a multidomain β-1,4-mannanase [
21], consistently forms IBs when expressed in
E. coli, even when fused to diverse proteins, enzymes, and bioactive peptides [
22]. Remarkably, these fusion constructs yield functional inclusion bodies retaining measurable catalytic or biological activity under non-denaturing conditions. The ability of Ccel
p40 to drive the formation of ordered and active IBs suggested that its strong aggregation propensity is intrinsic to the domain; however, whether this behaviour extends to other AA10 LPMOs remains untested.
Building on this hypothesis, the present study investigates whether the aggregation behaviour observed for Ccelp40 is conserved among other AA10 members. To test this, four phylogenetically distinct homologues, Kallotenue papyrolyticum (Kpapp40), Kibdelosporangium aridum (Karip40), Archangium lipolyticum (Alipp40), and Phytohabitans suffuscus (Psufp40), were expressed in E. coli. Comparative analyses of their solubility, morphology, and structural features were performed using SDS-PAGE, dynamic light scattering (DLS), field-emission scanning electron microscopy (FESEM), and Fourier-transform infrared (FTIR) spectroscopy. The results presented here reveal that in this study high-yield, β-sheet-rich IB formation is a shared property among the AA10 LPMO homologues examined, supporting the view that aggregation may represent an intrinsic property of these examined AA10s rather than a host-specific effect, although host physiology likely modulates the magnitude of aggregation.
3. Discussion
The present study demonstrates that aggregation into IBs is a recurring feature among the AA10-family homologs examined here when heterologously expressed in
E. coli. Although only five AA10 catalytic domains have been analysed to date, the consistent observation of predominant insolubility across phylogenetically diverse homologues suggests that strong aggregation propensity may not be unique to Ccel
p40, but may instead reflect a more widespread, sequence-encoded characteristic of AA10 catalytic domains when expressed outside their native secretion and metal-loading context. The four homologs Kpap
p40, Kari
p40, Alip
p40, and Psuf
p40 all accumulated predominantly as IBs, with yields exceeding 85% of the total expressed protein, showing little dependence on induction strength. These findings extend earlier observations on
C. cellulovorans Ccel
p40, previously shown to possess an intrinsic aggregating property and a tendency to form IBs in
E. coli [
22]. They further support the view that insoluble aggregate formation in this system is not simply an artefact of host stress or overexpression but may reflect sequence-encoded features of these AA10 catalytic domains.
Consistent with this view, several bacterial AA10 LPMOs are difficult to obtain as soluble, correctly folded proteins in
E. coli and are often produced as IBs, with solubility frequently improved only through the use of appropriate secretion signals or folding aids [
18,
19,
24]. A recent review by Zhang et al. [
24] summarised heterologous expression of AA10s from diverse bacterial genera, including
Bacillus,
Enterococcus,
Cellulomonas,
Cellvibrio,
Chitinolyticbacter,
Hahella,
Streptomyces,
Thermobifida, and
Kitasatospora, and highlighted common challenges such as incorrect folding, IB formation, misselection of signal peptides, and low extracellular yields in
E. coli. Qin et al. [
25] further showed that the
Streptomyces megaspores AA10 SmLpmo10A initially accumulated as IBs in
E. coli Transetta (DE3) and required co-expression of molecular chaperones in Origami (DE3) and Shuffle T7-B strains to achieve soluble expression, directly illustrating the need for chaperone-assisted folding for some AA10s. In line with this, Gaber et al. [
18] reported that the AA10 TtAA10A from the shipworm symbiont
Teredinibacter turnerae could only be produced efficiently in
E. coli after co-expression of the pGro7 chaperone plasmid, following multiple unsuccessful attempts using different solubility tags and secretion signals. Together, these studies show that AA10 LPMOs from a range of bacterial genera may require, or substantially benefit from, secretion pathways and/or chaperone systems to obtain properly folded, functional enzymes.
Sequence analysis showed that all homologs share hallmark AA10 motifs within the catalytic core, including conserved residues associated with copper coordination and oxygen activation in mature AA10 LPMOs [
12,
13]. Structural studies have established that this region, together with adjacent β-strand loops surrounding the metal site, forms an exposed hydrophobic platform essential for polysaccharide depolymerisation [
12,
15]. These loops, particularly the flexible L2 and neighbouring surface segments, constitute a structurally diverse region that forms a major part of the substrate-binding surface [
26] and, in cellulose-active AA10s, frequently contain hydrophobic residues that contribute to substrate engagement [
13,
14,
27]. Although this hydrophobicity and mobility are functionally important, any contribution to self-association under incomplete folding or metal-loading conditions remains a plausible interpretation rather than a demonstrated mechanism.
Our in silico aggregation hotspot analysis is consistent with this interpretation. Aggregation-prone sequences were consistently located immediately downstream of the region corresponding to the histidine-brace motif in mature AA10 LPMOs, and within β-sheet regions bordering the catalytic pocket. These predicted hotspot regions overlap with the L2 and neighbouring β-strand-associated segments identified as variable and surface-exposed in AA10s crystal structures [
14]. The recurrence of aggregation-prone motifs in corresponding regions across multiple homologues supports the hypothesis that aggregation propensity may represent an intrinsic feature of the AA10 catalytic fold itself, rather than an isolated anomaly of individual sequences. This correspondence supports the view that, in these AA10s, residues involved in catalysis occupy spatial regions that also exhibit features commonly associated with aggregation propensity. This aligns with structural reviews highlighting the close relationship between catalytic function and surface hydrophobicity of AA10s [
12,
13,
27].
A notable contrast arises when comparing the present cytoplasmic expression results with those of Forsberg et al. [
19], who examined the same N-terminal AA10 catalytic domain from the multi-modular
C. cellulovorans ManA enzyme originally described by Sunna et al. [
21]. In their study, soluble and catalytically active enzyme was obtained only when the construct was directed to the periplasm using a signal peptide, where cleavage generated the mature N-terminal histidine essential for copper binding. Periplasmic expression therefore enables correct N-terminal processing [
18,
19,
24,
28] and thereby exposes the N-terminal histidine that forms the histidine braces required for copper coordination [
13,
14,
18,
24,
29], and supports efficient copper loading at this site [
18,
19]. When expressed without a signal peptide, as in the present work, the domain accumulates in the reducing cytoplasm, where incomplete copper coordination likely contributes to misfolding and aggregation, while any involvement of off-pathway oxidation remains a plausible possibility rather than a demonstrated mechanism. This interpretation is consistent with the folding–redox coupling model, which proposed that copper coordination and oxidative maturation are tightly linked and that redox imbalance can lead to self-oxidation and irreversible inactivation [
16,
20,
30,
31,
32]. These observations support the view that cytoplasmic aggregation of AA10 domains reflects factors normally stabilised by secretion-associated processing and copper loading, while evidence from heterologous systems indicates that some AA10s may also depend on chaperone-assisted folding [
18,
24].
The physicochemical properties of the resulting IBs further substantiate their ordered nature. DLS revealed sub-micron aggregates ranging from 550 to 863 nm with moderate polydispersity (PDI 0.45–0.54), values that fall within the nanoscale size range reported for bacterial IBs and functional IB-derived nanoparticles [
33,
34,
35,
36]. Polydispersity differed among the homologues, following the trend Alip
p40 > Psuf
p40 > Kari
p40 ≈ Kpap
p40, indicating homolog-dependent differences in particle heterogeneity. These observations support their classification as ordered nanostructures rather than amorphous precipitates [
4,
6,
37,
38]. FESEM showed densely packed spherical to ellipsoidal particles with homolog-specific variation in compactness, which we interpret as reflecting sequence-dependent differences in physicochemical properties. The combined FESEM and FTIR signatures agree with those reported for ordered amyloid-like IBs [
4,
6,
7,
36,
39,
40,
41]. FTIR spectra displayed prominent Amide I envelope centred around 1650 cm
−1, and deconvolution revealed a β-sheet-associated component in the 1620–1630 cm
−1 region, consistent with our calculated β-sheet contents of 41–52%. This aligns with the β-sheet-rich and partially folded conformations characteristic of functional IBs [
4,
6,
7,
36,
39,
40,
41].
Homolog-specific differences in particle morphology correlate with predicted aggregation propensities. AGGRESCAN analysis identified a greater number and cumulative extent of aggregation hotspots in Alip
p40 and Psuf
p40, whereas Kpap
p40 and Kari
p40 contained fewer predicted aggregation hotspots. This trend is consistent with experimental observations showing greater heterogeneity and less compact IB morphologies for Alip
p40 and Psuf
p40, compared with the more compact IBs formed by Kpap
p40 and Kari
p40. This correspondence between sequence-encoded aggregation potential and experimental morphology supports the view that IB formation reflects, at least in part, intrinsic biophysical properties of these domains. Similar protein-dependent relationships between IB morphology and sequence or folding behaviour have been reported for other recombinant proteins, including catalytically active and GFP-based IBs [
4,
7,
34,
36,
41], reinforcing that the observed aggregation behaviour is primarily sequence- and folding-dependent rather than solely an artefact of overexpression or host stress.
From a mechanistic perspective, AA10 LPMOs possess an immunoglobulin-like β-sandwich core with surface-exposed loops surrounding the copper centre [
14,
28,
42,
43]. The β-sandwich core forms a structurally rigid scaffold, while the substrate-binding loops on the catalytic surface exhibit greater intrinsic mobility [
29,
43] and are central to catalytic function and stability [
42]. Although no studies have directly examined how these architectural features affect folding during heterologous expression, the conserved β-sandwich topology is known to impose characteristic folding constraints in Ig-like domains [
44], providing a plausible structural basis for differences in folding behaviour when the enzyme is expressed outside its native secretion pathway and metal-loading environment. In native hosts, AA10s are secreted and frequently occur as modular enzymes with carbohydrate-binding modules (CBMs), which orient the catalytic domain on crystalline polysaccharides and modulate catalytic performance [
12,
28]. CBMs have not been shown to confer direct structural stabilisation, but modular LPMOs display lower susceptibility to rapid oxidative inactivation compared with isolated catalytic domains [
45,
46,
47], indicating a functional protective role. Cytoplasmic expression of isolated AA10 catalytic domains, as observed for CcelAA10 in Forsberg et al. [
19], is associated with insolubility, supporting the view that secretion-associated processing and auxiliary modules contribute to proper folding and copper incorporation. The copper redox cycle of LPMOs can generate reactive oxygen species even in the absence of substrate, leading to oxidative modification and inactivation [
14,
15,
20]. Such off-pathway reactions perturb the catalytic site and cause oxidative damage, but a direct mechanistic link to aggregation has not been demonstrated. Consistent with this, Forsberg et al. [
19] showed that non-substrate-bound LPMOs undergo oxidative damage and loss of activity. Although the precise relationship between redox chemistry and aggregation remains unresolved, the combined evidence supports a model in which oxidative perturbation, together with the absence of stabilising native interactions, increases the tendency of isolated catalytic domains to misfold and aggregate under reducing cytoplasmic conditions.
Removal of these auxiliary elements and cytoplasmic expression are consistent with the increased aggregation observed for isolated AA10 catalytic domains [
18,
19,
24]. The copper redox cycle of LPMOs can generate reactive oxygen species even in the absence of substrate, leading to oxidative modification and inactivation [
14,
15,
20]. Such off-pathway reactions can perturb the catalytic domain and may create conditions that favour aggregation. Consistent with this, Forsberg et al. [
19] demonstrated that off-pathway redox events cause oxidative damage and loss of activity when LPMOs are not substrate-bound. Although the precise relationship between redox chemistry and aggregation remains unresolved, these observations are compatible with a model in which structural perturbation increases aggregation propensity.
These insights also underscore the critical influence of the signal peptide and expression compartment on solubility and activity. Forsberg et al. [
19] showed that the same N-terminal AA10 catalytic domain is recovered as a monomeric, copper-loaded, catalytically active enzyme only when exported to the periplasm. In contrast, our cytoplasmic expression experiments resulted in insoluble material that lacked proper N-terminal processing and copper coordination. Our findings additionally show that aggregation behaviour correlates with sequence features within the catalytic domain, and the alignment of predicted aggregation hotspots with regions surrounding the copper-binding site suggests a spatial overlap between residues involved in catalysis and those contributing to aggregation.
The IBs produced here share several characteristics with engineered CatIB systems that have been intentionally developed for biocatalyst immobilisation [
3,
7,
35]. Fusion of aggregation-prone peptides has been shown to generate IBs that are mechanically stable, catalytically active, and recyclable [
3,
7]. The AA10-derived IBs exhibit similar properties, including uniform nanoscale morphology, ordered secondary structure, and high yield under standard culture conditions. These features, together with their β-sheet order and morphological uniformity, suggest that AA10 domains could be repurposed as self-assembling scaffolds for constructing catalytic IBs or hybrid protein-based nanocomposites.
While IB formation is often viewed as a limitation in recombinant production, the concept of functional IBs reframes aggregation as a useful self-assembly process [
35]. The present results provide comparative evidence that this behaviour extends across multiple AA10 family homologs, identifying structural and sequence determinants that govern aggregation propensity. This knowledge offers opportunities for rational design of LPMO-derived scaffolds and for tuning solubility through signal-peptide engineering, domain fusion, or cofactor supplementation.
Functional IBs have been extensively described for a wide range of recombinant proteins [
4,
6,
7]. We have also demonstrated that Ccel
p40 fusion constructs can generate functional or catalytically active IBs [
22]. As the present study focuses specifically on structural and aggregation behaviour, the detailed functional characterisation of these newly generated AA10-derived IBs will be presented elsewhere. Overall, our data establish that AA10 catalytic domains consistently form IBs in the
E. coli cytoplasm, indicating that aggregation is an intrinsic, sequence-encoded property shared across these homologues. The observed aggregation correlates with conserved hydrophobic and β-sheet features, homolog-specific particle morphology, and the nanoscale uniformity typical of structured protein assemblies. In combination with prior evidence from catalytic IBs [
3,
4,
7] and soluble LPMOs expressed in the periplasm [
19], these results identify AA10 domains as promising natural templates for engineering self-assembling biocatalysts and protein-based nanomaterials.
4. Materials and Methods
4.1. Chemicals and Reagents
All reagents were of analytical grade and purchased from Sigma-Aldrich (St. Louis, MO, USA) unless otherwise specified.
4.2. Gene Selection and Sequence Analysis
Four AA10 LPMO homologues were selected to evaluate whether the strong aggregation behaviour previously observed for
Caldibacillus cellulovorans Ccel
p40 is widespread within this enzyme family. The sequences were retrieved from the NCBI GenBank database:
Kallotenue papyrolyticum (Kpap
p40),
Kibdelosporangium aridum (Kari
p40),
Archangium lipolyticum (Alip
p40), and
Phytohabitans suffuscus (Psuf
p40). Amino-acid sequences corresponding to the expressed catalytic domains (following removal of native signal peptides and construct-specific N-terminal residues) were aligned with the reference Ccel
p40 using Clustal Omega (EMBL-EBI) (
https://www.ebi.ac.uk/jdispatcher/msa/clustalo, accessed on 15 December 2025) to assess conservation of motifs within the AA10 catalytic core, including residues associated with copper coordination and substrate binding. Aggregation-propensity profiles were generated with AGGRESCAN [
10] using default parameters, and the total hotspot area (THSA), number of aggregation hotspots (nHS), and average aggregation-propensity score (Na4vSS) were recorded.
4.3. Construction of Expression Plasmids
DNA manipulations and restriction digestions were performed following standard molecular biology protocols [
48]. The pETDuet-1-p533 expression vector, harbouring the gene encoding the p40 domain of
C. cellulovorans ManA preceded by a 3×(GGGGS) flexible linker, was first transformed into
E. coli DH5α competent cells. Positive transformants were identified by colony PCR (cPCR) and confirmed by agarose gel electrophoresis. The p533 vector was subsequently digested with EcoRI and BamHI, and the linearised fragment was separated by agarose gel electrophoresis, excised, and purified using the Monarch DNA Gel Extraction Kit (New England Biolabs, NEB, Ipswich, MA, USA). To prevent self-ligation, the linearised vector was dephosphorylated with Antarctic Phosphatase (NEB) and further purified using the Monarch PCR & DNA Cleanup Kit (NEB), ensuring complete removal of the original p40 insert.
Genes encoding the AA10-LPMO p40 homologues (Alipp40, Kpapp40, Psufp40, and Karip40) were synthesised as gBlock Gene Fragments (Integrated DNA Technologies, Singapore). Each insert was digested with EcoRI and BamHI to match the restriction sites of the prepared vector and subsequently purified from agarose gels using the Monarch DNA Gel Extraction Kit. Ligation reactions were performed using the Quick Ligation Kit (NEB), and ligation mixtures were transformed into E. coli DH5α competent cells. Positive clones were confirmed by cPCR and agarose gel electrophoresis. Verified plasmids were purified and transformed into E. coli BL21(DE3) competent cells for heterologous protein expression. Construct identity was further confirmed by DNA sequencing (Macrogen Inc., Seongnam-si, Republic of Korea). The resulting expression plasmids, pETDuet-1-Alipp40, pETDuet-1-Kpapp40, pETDuet-1-Psufp40, and pETDuet-1-Karip40, were used for subsequent protein expression experiments.
4.4. Protein Expression
Recombinant expression of p40-based IBs was performed in E. coli BL21(DE3) using Luria–Bertani (LB) medium supplemented with ampicillin (100 µg mL−1). An overnight seed culture (5 mL) of cells harbouring the appropriate expression plasmid was used to inoculate 100 mL of fresh LB medium at a 2% (v/v) inoculation ratio. Cultures were incubated aerobically at 37 °C and 200 rpm in a shaking incubator until reaching mid-logarithmic growth phase (OD600 = 0.6–0.8). Protein expression was induced with isopropyl β-D-1-thiogalactopyranoside (IPTG) at final concentrations of 0.1 mM, 0.5 mM, and 1 mM as part of a single IPTG titration experiment to compare IB formation across standard induction conditions. For this titration, all induction conditions were performed using the same recombinant E. coli BL21(DE3) clone to minimise inter-clonal variability. A single colony was used to inoculate a pre-culture, which was subsequently split into parallel expression cultures induced with different IPTG concentrations. Following induction, cultures were incubated for an additional 4 h at 37 °C and 250 rpm. Cells were harvested by centrifugation (10,000× g, 15 min, 4 °C) and washed once with 1× phosphate-buffered saline (PBS, pH 7.5). The resulting cell pellets were centrifuged again under the same conditions and stored at −30 °C until further use.
For the isolation of IBs, cells were resuspended in 5 mL lysis buffer (50 mM Tris-HCl, 100 mM NaCl, 1 mM EDTA, 1 mM serine protease inhibitor Pefabloc
® SC, and 2% glycerol, pH 7.5). The suspension was homogenised by gentle vortexing and subjected to ultrasonic disruption using a Branson 450 Digital Sonifier (Marshall Scientific, Hampton, NH, USA) for 8 min with a 5 s on/10 s off duty cycle at 35% amplitude. After sonication, a small aliquot of the total protein (TP) fraction was collected for subsequent qualitative analysis. The cell lysate was centrifuged at 10,000×
g for 15 min at 4 °C to separate soluble and insoluble fractions. The supernatant containing the soluble protein (SP) and the pellet containing the insoluble fraction (P) were collected separately. Both fractions were analysed by SDS–PAGE to assess protein expression and solubility, and densitometric quantification of band intensity was performed using ImageJ software (version 1.54p, NIH, Bethesda, MD, USA). All densitometric analyses were performed with background subtraction. The relative yield (%) of each fraction was calculated according to:
where
An represents the densitometric area of the band corresponding to either the soluble or insoluble protein fraction, and
At represents the densitometric area of the total protein fraction as measured in ImageJ.
4.5. Sodium Dodecyl Sulfate–Polyacrylamide Gel Electrophoresis (SDS–PAGE)
Protein samples were mixed with an equal volume of Laemmli sample buffer (2×) and incubated at 95 °C for 5 min to achieve complete denaturation. Denatured samples were briefly centrifuged to remove condensate and then loaded onto precast Mini-PROTEAN® TGX gels (4–20%) (Bio-Rad Laboratories, Hercules, CA, USA). Electrophoresis was performed at a constant voltage of 100 V for 60 min using running buffer composed of 25 mM Tris, 192 mM glycine, and 1% (w/v) sodium dodecyl sulfate (SDS), adjusted to pH 8.3. After electrophoresis, the gels were stained with Coomassie Brilliant Blue G-250 Bio-Safe Stain (Bio-Rad Laboratories, Hercules, CA, USA) and destained with Milli-Q water until clear band visualisation was achieved. Protein bands were imaged using the G:BOX F3 Gel Documentation System (Syngene, Cambridge, UK). Gel images were acquired under non-saturating conditions, with exposure settings adjusted to ensure that pixel intensities remained within the linear dynamic range of the detector, thereby avoiding saturation or blooming effects that could compromise densitometric quantification. The Precision Plus Protein™ Dual Color Standard (Bio-Rad Laboratories, Hercules, CA, USA) was included in all gels as a molecular weight reference.
4.6. Partial Purification of IBs
Following cell lysis, the resulting insoluble pellet was initially washed with 1× PBS (pH 7.5) and centrifuged at 10,000× g for 15 min at 4 °C to remove loosely associated contaminants. The pellet was then subjected to a detergent wash using 5 mL of 0.1% (v/v) Triton X-100 (Thermo Fisher Scientific, Scoresby, Victoria, Australia), followed by centrifugation under the same conditions. To eliminate residual detergent, the pellet was rinsed twice with sterile Milli-Q water. For further purification, approximately 1 g of the washed insoluble fraction was incubated with 4 mL B-PER™ reagent (Thermo Fisher Scientific, Scoresby, Victoria, Australia) supplemented with 5 units of Benzonase® nuclease (Sigma-Aldrich, St. Louis, MO, USA) at room temperature for 15 min with gentle mixing. After incubation, the solubilised protein was separated from the IB-rich pellet by centrifugation at 10,000× g for 15 min at 4 °C. The pellet, representing the purified insoluble protein fraction, was subsequently washed twice with Milli-Q water and stored at −30 °C for downstream analyses. Protein samples collected at each washing step, including the final partially purified IB fraction, were resuspended in sterile Milli-Q water or 1× PBS and analysed by SDS–PAGE to evaluate protein size, purity, and relative yield following purification.
4.7. Protein Quantification
The molar extinction coefficient and molecular weight of each IB were calculated from the deduced amino acid sequence using the ExPASy ProtParam tool (
https://web.expasy.org/protparam/, accessed on 4 February 2025). A total of 3 mg of each IB preparation was resuspended in 300 µL of solubilisation buffer (50 mM Tris-HCl, pH 8.5, 5 mM dithiothreitol). Suspensions were pipetted and vortexed vigorously until no visible clumps remained. Aliquots of 100 µL were transferred into three tubes to enable triplicate measurement. Samples were further solubilised by addition of solubilisation buffer containing 1–3 M guanidinium hydrochloride (GdmCl) to yield a final IB concentration of 1 mg/mL. After incubation at room temperature for 2 h with gentle agitation, samples were centrifuged (12,000×
g, 20 min, 4 °C) to remove residual insoluble material. The absorbance at 280 nm (A
280) was measured using a NanoDrop One/One
c spectrophotometer (Thermo Fisher Scientific) after blanking against the solubilisation buffer containing the same GdmCl concentration, with baseline correction at 350 nm. Protein concentration was calculated using the Beer–Lambert law:
where C
protein is the molar concentration of protein, A
280 is the absorbance at 280 nm, ε
protein280 is the theoretical molar extinction coefficient (calculated using ProtParam), and l is the optical pathlength (cm).
The protein content (%) of each IB preparation was calculated as:
All measurements were performed in triplicate, and results are reported as mean ± SD values.
4.8. Dynamic Light Scattering (DLS)
The hydrodynamic diameter of IBs was determined by dynamic light scattering (DLS) using a Zetasizer Nano ZSP (Malvern Instruments Ltd., Worcestershire, UK) equipped with a 50 µL quartz cuvette. IB suspensions were diluted 50–100 fold with sterile Milli-Q water or 1× PBS (pH 7.0) prior to measurement. Analyses were performed at 25 °C using disposable cuvettes, after 1 min equilibration, with the refractive index of wate r (dispersant) set to 1.33 and material to 1.45. Measurements were collected in normal resolution mode, and particle size distributions were expressed as the diameter of an equivalent hydrodynamic sphere. Each sample was measured in triplicate, and the reported hydrodynamic diameter represents the mean ± standard deviation. Water and 1× PBS were used as dispersants in all measurements.
To evaluate IB stability, the scattering intensity and hydrodynamic diameter were monitored over time. Data analysis followed the methodology described by Castellanos-Mendoza et al. [
49].
4.9. Field Emission Scanning Electron Microscopy (FESEM)
Purified AA10 LPMO IBs were diluted 1:20 in ultrapure Milli-Q water and gently vortexed to achieve a uniform suspension. A 1 mL aliquot of each sample was filtered through a 0.22 µm polycarbonate isopore membrane (Merck, Bayswater, Victoria, Australia) mounted on a 13 mm Swinnex filter holder (Merck, Bayswater, Victoria, Australia). Residual particles were removed by rinsing with 500 µL ultrapure water, followed by gentle drying with compressed air.
The membranes were air-dried under laminar flow, mounted on SEM stubs using graphite carbon adhesive tape, and gold-sputter coated with an Emitech K550 coater (Quorum Technologies, Lewes, UK) under an argon atmosphere. Imaging was performed at an accelerating voltage of 5–15 kV using a JSM-7100F field-emission scanning electron microscope (FESEM) (JEOL, Tokyo, Japan). Particle dimensions (length and width) were determined from 20–50 individual IBs per micrograph using ImageJ software (NIH, Bethesda, MD, USA).
4.10. Fourier-Transform Infrared (FTIR) Spectroscopy
The secondary structure of IB proteins was analysed by Fourier-transform infrared (FTIR) spectroscopy using a JASCO FT/IR-4700 spectrometer (JASCO, Hachioji, Japan). Aliquots of 5–10 µL of resuspended IBs in Milli-Q water or 1× PBS were deposited onto a zinc selenide (ZnSe) ATR crystal and air-dried for 10–15 min to remove residual moisture. Spectra were collected over 4000–600 cm−1 with a resolution of 4 cm−1, averaging 32 scans per sample to enhance the signal-to-noise ratio. Background spectra were recorded under identical conditions and subtracted from sample spectra. The crystal was cleaned with 80% ethanol before each run, and reference spectra for Milli-Q water were recorded to remove water interference.
IR spectral deconvolution and amide I band analysis (1700–1600 cm
−1) were conducted in OriginPro 2025 (OriginLab Corp., Northampton, MA, USA) using Gaussian fitting after baseline correction of the amide I region by applying a straight baseline between 1600 and 1700 cm
−1. Peak positions were assigned based on established literature ranges and verified using the deconvolution procedure described in the
Supplementary Material. Secondary-structure content was determined by normalising the total integrated area of the amide I band to 1 and subsequently summing the areas corresponding to established wavenumber ranges for each structural motif. Bands between 1620–1640 cm
−1, 1641–1649 cm
−1, and 1650–1660 cm
−1 were assigned to β-sheets, random coils, and α-helices, respectively, while β-turns were attributed to 1661–1691 cm
−1, excluding 1672–1678 cm
−1 sub-region associated with antiparallel β-sheet contributions [
50].
The proportion of each secondary-structure element was calculated as:
where
Ai is the area of each fitted band within the amide I region and
At is the total area of all fitted components. Details of the fitting procedure are provided in the
Supplementary Material.