1. Introduction
Compounded semisolid pharmaceutical preparations (creams, ointments, and gels) require rigorous control of process parameters to ensure quality, uniformity, and stability. In traditional practice, these dosage forms are prepared manually with a mortar and pestle, a technique that, despite long-standing use, exhibits substantial variability and limited reproducibility. The lack of precise control over critical variables (mixing time, shear rate, and applied force) can lead to inconsistent particle dispersion and non-uniform active pharmaceutical ingredient (API) distribution, with implications for therapeutic performance and patient safety. Moreover, manual workflows make it difficult to document a consistent shear history and mixing endpoint across operators and sites, which complicates batch-to-batch comparability and can undermine confidence in assigned beyond-use dates (BUDs) when product-specific stability data are limited [
1,
2,
3].
Demand for standardized, reproducible compounding has increased, driven by quality expectations for extemporaneous preparations and the need to comply with United States Pharmacopeia (USP) and Good Manufacturing Practices (GMP) requirements [
4,
5]. In the U.S., USP <795> establishes minimum standards for nonsterile compounding, including requirements related to suitable materials and equipment, documentation and control of compounding processes, and the practical assignment of BUDs when stability information is absent or limited [
5]. More broadly, compendia/formularies and regulatory frameworks increasingly expect GMP-aligned controls (traceability, defined procedures, and hygiene controls) even where full industrial GMP does not apply [
6,
7]. Additionally, compendial standards for nonsterile compounding emphasize documented procedures, appropriate equipment, and reproducible processes as foundational controls for quality in extemporaneous preparations [
5].
Additionally, automated devices adapted from non-pharmaceutical contexts (e.g., kitchen or industrial mixers) and commonly used by compounding pharmacies generally fail to meet pharmaceutical-grade specifications; in particular, contact materials may not be demonstrated as non-reactive, non-adsorptive, or non-additive as required by USP <795> and GMP, increasing risks of incompatibility or contamination [
6,
7]. To address these limitations, FagronLab™ developed the Unguator™ specifically for use in compounding pharmacies, as an Electronic Mortar and Pestle (EMP) system (software version 4.8.0-FW-5.2), is purpose-built to support standardized, hygienic, and reproducible semisolid preparation. The device combines a closed mixing environment, programmable control of speed and duration, and patented vertical–horizontal blade motion to promote uniform dispersion and homogenization across semisolid matrices [
7]. Pre-validated mixing programs and batch-level parameter recording support standardization and traceability and enable evaluation of selected Quality by Design (QbD)-aligned elements (identification and control of critical process parameters, DOE-informed operating windows, and hygiene controls) by reducing operator dependence and inter-batch variability. Consistent with ICH Q8(R2) pharmaceutical development principles and supported by ICH Q9(R1) quality risk management and ICH Q10 pharmaceutical quality system concepts, the present work applies a QbD-aligned framework to compounding-relevant, measurable quality attributes; however, it does not constitute a full lifecycle QbD program [
8,
9,
10].
Electronic mortar-and-pestle systems have previously been evaluated for dermatological compounding, reporting generally reproducible outcomes for several cream/ointment bases but limitations for some gel systems [
6,
11]. However, to our knowledge, this is the first systematic performance evaluation of an EMP technology designed for compounding pharmacy practice, covering analytical method validation, forced degradation, content uniformity by stratified sampling, process optimization via design of experiments, and cleaning validation. Five representative formulations were selected to probe distinct physicochemical challenges: (i) urea 40% cream (high solid content), (ii) clobetasol propionate 0.05% cream (low-dose API), (iii) diclofenac sodium 2.5% in 3% hyaluronic acid gel (polymeric matrix), (iv) urea 10% with salicylic acid 1% cream (multicomponent system), and (v) hydroquinone 5% cream (oxidation-sensitive API). The objective was to determine whether the Unguator™ can deliver reproducible, pharmaceutically acceptable semisolid preparations under standardized and practically implementable conditions suitable for routine use in compounding pharmacies. In this work, the DOE-driven optimization endpoint was confined to assay-based content uniformity across jar strata. Importantly, this study was designed as a device-focused performance and QbD evaluation of the Unguator™ EMP; no direct comparison with manual mortar-and-pestle or other mixers was performed.
2. Materials and Methods
2.1. Materials
APIs and semisolid vehicles (Versatile™ and Nourivan Antiox™) were obtained from Fagron (São Paulo, Brazil). All reagents were of analytical grade and used as received. Water was purified by a Milli-Q® system from Merck Millipore (Burlington, MA, USA). Reference standard stock and working solutions for each API were prepared in accordance with the relevant European Pharmacopoeia monograph recommendations for assay-related preparations and were used for UHPLC calibration and quantification.
The compounding and analytical procedures were conducted using the FagronLab™ Unguator™ Electronic Mortar and Pestle (EMP) system (Scheßlitz, Germany), equipped with programmable speed and time control, closed-system mixing jars (20–500 mL), and standard titanium-nitride-coated blades, and operated with software version 4.8.0-FW-5.2.
2.2. QTPP, CQAs, and CPP/CMA Identification (QbD-Aligned Framework)
For clarity, this study was planned using a QbD-aligned structure, consistent with ICH Q8(R2), focused on quality attributes that can be directly measured and controlled in compounding practice [
9]. The Quality Target Product Profile (QTPP) for the model semisolid preparations is: (i) topical semisolid in a closed mixing jar and final container; (ii) labeled strength as compounded; (iii) layer-to-layer content uniformity assessed by stratified sampling (acceptance: 90–110% label claim in all layers and strata CV ≤ 5%); (iv) chemical stability over the assigned beyond-use period assessed via stability-indicating assay capability; (v) acceptable physical appearance and usability (no gross agglomeration and no excessive air entrapment); and (vi) minimized cross-contamination risk between batches.
Based on the QTPP, the Critical Quality Attributes (CQAs) explicitly addressed in this work were: (a) assay/content uniformity across jar strata (primary CQA); (b) stability-indicating assay capability and degradation behavior (supporting CQA); and (c) post-cleaning residue/cross-contamination risk (hygiene CQA). Other CQAs relevant to semisolids (e.g., rheology/viscosity, droplet/particle size distribution, pH, microbial quality, and in vitro release) are acknowledged but were not quantified in the present study.
The Critical Process Parameters (CPPs) experimentally evaluated were mixing rotation speed (rpm) and mixing time (min), as defined in the DOE (
Section 2.9). Additional candidate CPPs identified through process mapping include incorporation order, scraping/handling steps, jar fill volume/headspace, and selection of pre-validated device programs (Emulsion/Emulsion+). Candidate Critical Material Attributes (CMAs) include API particle size/wettability and density mismatch, solids loading, and vehicle rheology. A condensed FMEA-style CPP/CMA–CQA mapping and risk ranking, together with a proposed control strategy informed by the present data, is provided in
Section 4.
2.3. Formulation Design
Five representative semisolid formulations were selected to challenge the mixing and homogenization performance of the EMP system (
Table 1). The set intentionally spans diverse rheological and physicochemical profiles: high solid content, low-dose API, polymeric gel, multicomponent system, and an oxidation-sensitive API.
2.4. Compounding Procedure
All formulations were prepared as 100 g batches using the Unguator™ EMP under ambient laboratory conditions (22 ± 2 °C; 45–55% RH). Unless otherwise specified, raw materials were weighed gravimetrically to ±0.01 g using a PioneerTM balance from Ohaus (Parsippany, NJ, USA).
2.4.1. Standard Single-API Protocol (F1, F2, F5)
Single-API creams were prepared in the Unguator™ EMP under ambient conditions. For each batch, 50% of the target vehicle mass was first charged into the mixing jar, after which the active pharmaceutical ingredient was added and covered with the remaining vehicle. The jar was then sealed and mixed according to the assigned experimental condition (
Section 2.4.3). Upon completion, the mixing blade was removed, the preparation was visually inspected for uniformity and absence of agglomerates, and the product was packaged in opaque white containers.
2.4.2. Multicomponent Cream Protocol (F4)
The multicomponent keratolytic cream was compounded using the same sequence as above to promote progressive wetting and dispersion. The lower-dose component was introduced first, followed by the higher-load component, before bulk homogenization under the designated mixing condition. After mixing, blades were removed, the product was inspected for homogeneity, and it was packaged in opaque white containers.
2.4.3. Mixing Programs and Experimental Factors (All Formulations)
Each formulation was prepared across a predefined matrix of EMP conditions comprising mixing speeds of 600, 1500, and 2400 rpm and mixing times of 2, 6, and 10 min. In addition to these manual settings, the built-in Emulsion and Emulsion+ programs were applied to emulate standardized, GMP-like mixing profiles. All preparations were performed in triplicate (n = 3 per condition).
2.4.4. Two-Step Gel Preparation and API Incorporation (F3)
The polymeric gel was produced in two stages. First, the hyaluronic acid gel base was prepared by weighing hyaluronic acid, phenoxyethanol (0.5%
w/
w), and purified water to q.s. 100%, transferring the components to a 2 kg Unguator-compatible FagronLab
TM jar, and homogenizing using the Gel program until a uniform dispersion was obtained. The gel was then allowed to rest for 24 h at room temperature to complete hydration and reach the target viscosity, after which it was visually inspected and stored in opaque white bottles. In the second stage, diclofenac sodium (at the target strength specified in
Table 1) and ethoxydiglycol (10%
w/
w) were accurately weighed. Approximately half of the prehydrated gel base was charged into a 100 mL FagronLab
TM bottle, the diclofenac and ethoxydiglycol were incorporated, and the remaining gel base was added. The mixture was homogenized in the EMP using the assigned rpm–time setting or preset program outlined in
Section 2.4.3. Following mixing, blades were removed, the gel was inspected for uniformity and absence of visible particulates, and the final product was packaged in opaque white bottles.
2.5. Chromatographic Analysis
Quantification of APIs was performed using Ultra-High Performance Liquid Chromatography (UHPLC) in a Vanquish Flex System from ThermoFischer Scientific (Basel, Switzerland) with a Chromeleon™ software v. 7.3 (same manufacturer) integrated. Individual chromatographic conditions (mobile phase composition, flow rate, detection wavelength, and column type) were optimized for each API and are summarized in
Table 2. The methods were designed to be stability-indicating and validated according to ICH Q2(R2) guidelines [
16].
2.6. Methods Validation
The analytical methods were validated for specificity, linearity, precision, accuracy, limit of detection (LOD), and limit of quantification (LOQ) as per ICH Q2(R2) recommendations [
16].
Specificity was assessed using the solutions described above by performing UHPLC analyses of (i) API standard solution, (ii) vehicle blank, and (iii) mobile phase (diluent) blank. Forced-degradation samples were prepared using the complete formulation matrix (API in the corresponding vehicle/excipient system), rather than API-only solutions, to capture matrix-related interferences and degradation behavior relevant to compounded products. Interference was considered acceptable when no co-eluting peaks were observed at the API retention time and the relative difference in peak area between standard and matrix-added chromatograms was <2.00%. In accordance with ICH Q2(R2) [
9] expectations for stability-indicating methods, forced-degradation studies were also performed as part of the specificity assessment under acidic hydrolysis (6 M HCl), basic hydrolysis (10 M NaOH), oxidative conditions (35% H
2O
2), thermal stress (70 °C for 24 h), and photolysis (UV light at 365 nm for 24 h); chromatograms were evaluated for adequate separation of degradation peaks from the parent analyte (target Rs ≥ 1.5). Each forced-degradation condition was analyzed in sextuplicate (
n = 6). Additionally, photolysis was applied as a forced-degradation stress condition (UV 365 nm for 24 h) to support the specificity/stability-indicating assessment; photolysis was applied as a forced-degradation stress condition to support specificity (stability-indicating capability), not as an ICH Q1B-compliant photostability study (no controlled/reportable cumulative light exposure in lux·h and W·h/m
2). Photolysis-stressed samples were compared to an unstressed control prepared in parallel but not submitted to photolysis (fresh/unexposed sample of the same matrix and concentration), with both samples processed using the same extraction/dilution protocol prior to UHPLC–PDA analysis. Evaluation was based on the appearance of additional peaks and/or change in the parent peak area relative to the unstressed control, together with PDA peak-purity assessment and chromatographic separation. Specificity assessments were performed in triplicate, and forced-degradation samples were analyzed under each stress condition. Peak purity of the parent API peak in forced-degradation samples was assessed using PDA spectral analysis in Chromeleon™ software.
Precision was evaluated in terms of repeatability and intermediate precision. Repeatability was determined by consecutive analysis of six replicate injections of the API solution at working concentration by a single analyst on the same day. Intermediate precision was assessed by two different analysts on two separate days, each performing six replicate analyses under the same experimental conditions. Precision was expressed as the coefficient of variation (CV), with an acceptance criterion of ≤5%.
Accuracy was determined by spiking the matrix with known amounts of the API at the same concentration levels used in the linearity assessment (n = 3 for each level). Analyses were performed by the same analyst, and results were expressed as percent recovery relative to the calibration curve obtained from the linearity study.
Linearity was established from three independent calibration curves constructed using the API concentrations described in
Table 1. The curves used a minimum of five calibration levels (
n = 5) spanning the intended quantification range (from approximately 70% to 130% of the target assay concentration for each API). The relationship between analyte concentration and peak area was evaluated using the ordinary least-squares regression (unweighted) method, using peak area versus nominal concentration. The linearity of each calibration curve was assessed by analysis of variance (ANOVA), and correlation coefficients (r
2) were determined to quantify the strength of the linear relationship.
The limits of detection (LOD) and quantification (LOQ) were determined from three standard calibration curves and calculated according to Equations (1) and (2), respectively:
where S is the mean slope of the analytical curves and σ is the standard deviation of the response (noise) estimated from at least ten blank sample analyses.
Robustness (e.g., deliberate small variations in flow rate, column temperature, mobile-phase composition, or detection wavelength) was not formally assessed in this study.
2.7. Homogeneity and Content Uniformity
Homogeneity testing was performed by stratified sampling from the top, middle, and bottom portions of each compounded jar. For each experimental condition, three independent jars were prepared (n = 3), and, for each jar, each layer was sampled and analyzed in triplicate using validated UHPLC methods. Acceptance criteria were applied per jar: the mean API content of each layer had to be 90–110% of label claim, and the coefficient of variation (CV) of the three-layer means (top/middle/bottom) had to be ≤5%. These internal criteria were selected to reflect pharmacopeial expectations for assay accuracy and low variability in compounded preparations, noting that pharmacopeias do not prescribe a specific stratified “content-uniformity” test for semisolid jars. This evaluation assessed vertical distribution and mixing efficiency across the EMP’s operating parameters.
2.8. Design of Experiments (DOE)
A 3
2 factorial design was employed to identify mixing conditions that optimize assay-based content uniformity (mean % label claim and variability), with mixing time (X
1: 2, 6, 10 min) and rotation speed (X
2: 600, 1500, 2400 rpm) as independent variables (
Table 3). These two critical process parameters were selected based on prior knowledge of their direct influence on the homogenization efficiency of semisolid formulations. The DOE was intentionally restricted to these two factors to maintain feasibility across five formulation matrices (9 conditions × triplicate jars per condition) and to isolate the primary, directly programmable energy-input variables of the EMP. Other potentially influential parameters (e.g., jar size/fill level and headspace, blade configuration/orientation, initial product temperature, and staging of component addition) were standardized. DoE model calculations and regression analyses were performed in Microsoft Excel (Microsoft Corporation, Redmond, WA, USA; version 2510), and response surfaces were generated from the fitted quadratic equations.
The design enabled simultaneous investigation of main effects and second-order interactions on the response variables, defined as (i) API content uniformity (%) and (ii) relative standard deviation (RSD, %). The experiments were conducted in randomized order to minimize systematic error, and each combination was performed in triplicate for each formulation type (n = 9 per DOE matrix). The DOE approach facilitated the construction of quadratic polynomial response surface models, allowing prediction and optimization of the most suitable process parameters for each formulation matrix. Accordingly, the predicted ‘optima’ reported from the response-surface models should be interpreted as optima for content uniformity outcomes only.
2.8.1. Phase 1—Performance Analysis
In the first phase, each formulation described in
Section 2.2 was analyzed in triplicate to assess intra-batch homogeneity. Samples were collected from the top, middle, and bottom layers of the compounded preparations to evaluate vertical uniformity. The API content in each section was determined using the validated UHPLC methods described previously. The acceptance criteria were defined as an API content between 90–110% of label claim and a coefficient of variation ≤ 5% among layers. The ≤5% CV threshold across the three-layer means (top/middle/bottom) was defined as an internal, fit-for-purpose homogeneity target to indicate minimal vertical stratification in compounded semisolids, recognizing that pharmacopeias do not prescribe a specific stratified ‘content-uniformity’ test for jarred semisolids. This value was selected as a conservative benchmark consistent with typical expectations for low assay variability in quantitative methods and provides a practical, stringent criterion for standardized mixing performance in heterogeneous semisolid matrices.
2.8.2. Phase 2—Response Surface Evaluation
In the second phase, the factorial design was applied to model and optimize the relationship between the process parameters (mixing time and rotation speed) and the resulting homogeneity of the formulations.
The coded levels (−1, 0, +1) represented the low, central, and high values of each factor, respectively. A total of 90 experimental runs were performed across all formulations, including 10 replicates at the central point to estimate experimental error and assess model reproducibility (
Table 3).
Experimental data were used to fit a second-order polynomial equation of the form:
where Y is the predicted response (content uniformity or RSD), β
0 is the intercept, β
1–β
22 are the regression coefficients, and ε is the residual error term.
The resulting response surfaces and contour plots were generated to visualize the effects and interactions of the independent variables on the measured responses. The optimal mixing conditions were defined based on the criteria of maximum homogeneity (highest API uniformity, lowest RSD) while maintaining practical compounding time and operational feasibility.
2.9. Cleaning and Cross-Contamination Validation
The workbench, equipment, and mixing blades were cleaned following laboratory hygiene standards. The workbench was sanitized with a 2.2% sodium hypochlorite solution, applied using a squeeze bottle and spread with paper towels in unidirectional, uniform movements. The surface was left to air dry before use. The equipment was cleaned using a non-woven cloth moistened with 70% alcohol, ensuring proper disinfection without the use of sharp tools or abrasive agents to avoid damage. Mixing blades were cleaned with a non-woven cloth, rinsed under hot running water, and dried with the same cloth.
To assess environmental and cross-contamination risks after completion of Phase 2, the formulation with the lowest detection limit, hydroquinone, was selected as a tracer compound. Its high analytical sensitivity allowed the detection of even trace residues, ensuring accurate evaluation of potential contamination. Swab samples were collected from three critical surfaces—the workbench, the mixing paddle, and the external housing of the equipment—at three time points: before compounding, after compounding, and after sanitization. Swab sampling was performed using sterile Olen® swabs with plastic shafts and cotton tips (ethylene oxide sterilized, individually packaged). Samples were collected from three critical surfaces: the workbench (40 cm2 area surrounding the equipment), the mixing paddle (6 cm of the blade), and the paddle insertion point (7 cm). Each swab was pre-moistened with the hydroquinone diluent solution and applied using unidirectional, uniform strokes with firm pressure while rotating the swab to maximize contact. Following sampling, each swab was placed into 5 mL of hydroquinone diluent solution for extraction, and the resulting extract solutions were analyzed by UHPLC using the validated method. An acceptance criterion of <10 ppm hydroquinone in the 5 mL swab extract (≤50 µg recovered per swab) was adopted as a pragmatic, conservative proof-of-concept threshold for routine compounding hygiene; toxicology-derived Health-Based Exposure Limit/Permitted Daily Exposure (HBEL/PDE) limits and full ‘worst-case’ bracketing across APIs/vehicles were outside the scope of this study.
3. Results
3.1. Analytical Method Validation
All analytical methods developed for quantifying the active pharmaceutical ingredients (APIs) demonstrated adequate performance according to ICH Q2(R2) criteria (
Table 4). Acceptance criteria were defined a priori in an internal analytical-validation SOP, following the fitness-for-purpose principle and the expectation that laboratories predefine method performance criteria prior to validation. For the intended use (assay of APIs in compounded semisolid matrices), the following criteria were applied: linearity r ≥ 0.99; significant regression by ANOVA (
p < 0.05; F
reg > F
crit); specificity as discrepancy < 2.0%; repeatability and intermediate precision CV ≤ 5.0%; and accuracy (recovery) 100% ± 2%. The predefined acceptance criteria are shown in
Table 4 together with the corresponding validation results [
17,
18,
19].
The chromatographic methods showed high specificity, with no interference from vehicle or excipient peaks at the analyte retention times. The relative difference in peak areas between standard and matrix-added samples was consistently below 2%, confirming the absence of co-eluting components. Specificity (stability-indicating capability) was further supported through forced-degradation studies under acidic, basic, oxidative, photolytic, and thermal conditions, which generated degradation peaks that were adequately separated from the parent analyte (Rs ≥ 1.5;
Table 5). Hydroquinone and clobetasol propionate showed the greatest susceptibility to oxidation and photolysis, respectively, whereas diclofenac sodium and urea were chemically stable under the tested conditions, confirming that each API can be specifically quantified in the presence of degradation products. Because photolysis exposure was not calibrated to ICH Q1B requirements, photolysis results are interpreted only as qualitative forced-stress outcomes supporting method specificity. PDA peak-purity outcomes are summarized as Pass/Flag in
Table 5, with the corresponding numeric peak-purity match-factor outputs provided in
Supplementary Table S1. Flag outcomes observed under extreme stress were interpreted conservatively in conjunction with the forced-degradation chromatograms, which still demonstrated separation of degradant peaks from the parent analyte (Rs ≥ 1.5), supporting stability-indicating specificity for assay.
Linearity was established over the tested concentration ranges, with correlation coefficients (R2) exceeding 0.999 for all APIs. Precision was confirmed by repeatability and intermediate precision assessments, yielding CV values below 5%, while accuracy results ranged from 98.2 to 101.4% recovery across concentration levels.
The calculated LOD and LOQ values confirmed the high sensitivity of the methods, enabling reliable detection of even minimal residual concentrations, particularly for Hydroquinone. Overall, these results validated that all analytical methods were specific, precise, accurate, and sensitive, ensuring robustness for subsequent performance and contamination assessments.
Formal robustness testing by deliberate variation of method parameters was not performed in this study; consistent with ICH Q2(R2)/ICH Q14 [
17,
18,
19], robustness is typically established during analytical procedure development, and system suitability plus intermediate precision provide operational assurance for the intended use reported here.
3.2. Content Uniformity (Homogeneity)
Across formulations, mean recoveries fell within 90–110% and strata CVs were generally ≤5%, indicating effective vertical mixing under the evaluated conditions (
Table 6). Preparations containing urea (F1, F4-urea) and hydroquinone (F5) showed tight dispersion (low SD and CV), consistent with efficient dispersion of either high-solid content (urea) or dissolved APIs (hydroquinone) in the tested vehicles. The diclofenac in hyaluronic acid gel (F3) maintained recoveries near 100% with low variability, suggesting adequate API distribution within the polymeric matrix.
By contrast, clobetasol 0.05% (F2) exhibited higher between-strata variability in some runs, with the bottom layer occasionally trending away from the other strata, consistent with partial sedimentation at very low dose levels. In these cases, settings employing longer mixing times and/or higher RPM (or the device’s Emulsion/Emulsion+ programs where applicable) reduced variability, supporting the DOE findings that increased process energy improves dispersion for low-dose suspensions.
Overall, the results demonstrate that the electronic mortar and pestle achieved uniform API distribution throughout the jar for the majority of formulations and settings, and that parameter optimization (time/RPM or program selection) is particularly relevant for low-dose, poorly wetting, or higher-density APIs to fully meet the ≤5% CV target across strata.
3.3. DOE Optimization of Mixing Parameters
DoE models were fitted using a full quadratic response surface (coded factors: X1 = time; X2 = rpm), including linear terms, the interaction (X1X2), and curvature (X12, X22). For each formulation, model adequacy was assessed by regression significance (ANOVA) and goodness-of-fit (R2). Factor significance was evaluated from the regression/ANOVA outputs (p-values), and effect directionality was interpreted from the fitted coefficients and response-surface contours. Full regression outputs, ANOVA tables, and diagnostic plots are available in the Excel DoE files (Data Availability Statement).
Across formulations, factorial models were adequate by ANOVA (significant regression; non-significant lack-of-fit where tested) and residual diagnostics showed no systematic trends, supporting the use of quadratic terms at 3 levels. Time (X
1) and speed (X
2) exerted positive main effects on uniformity (higher % label claim convergence and lower RSD), with magnitude depending on formulation rheology and dose. Curvature was present in several matrices (notably viscous or polymeric systems), indicating diminishing returns beyond mid-to-high settings (
Table 7).
A modest X
1 × X
2 interaction was observed in low-dose emulsions (e.g., clobetasol 0.05%) where simultaneous increases in time and rpm most effectively reduced between-strata variability (mitigating bottom-layer drift noted in
Section 3.2). In contrast, polymeric gel (diclofenac in HA 3%) benefited from moderate rpm with adequate time, as excessive speed increased air entrainment risk (manifested as slightly higher RSD without improving mean recovery).
Formulation-specific operating windows (empirical) would be: (i) for low-dose, poorly wetting APIs (F2: clobetasol 0.05%): favor higher rpm (≥1500–2400) and longer time (≥6–10 min); device programs analogous to Emulsion/Emulsion+ reduced RSD across strata; (ii) for high-solid content (F1: urea 40%; F4-urea 10%): mid-to-high rpm with mid time (1500–2400 rpm; ~6 min) achieved tight dispersion; further time gave limited benefit (curvature); (iii) for polymeric gel (F3: diclofenac/HA): moderate rpm (~1500–1900) with sufficient time (≥6 min) balanced shear and bubble control; very high rpm offered no additional uniformity gains (curvature increases); and (iv) for dissolved/antioxidant-protected API (F5: hydroquinone 5%): broad robustness across the design space; mid settings were sufficient to maintain low RSD.
Response-surface desirability analysis (maximize uniformity and minimize RSD) identified formulation-specific optima clustered around mid-high rpm with mid-long time (
Figure 1). Confirmation runs at the predicted optima reproduced model expectations within experimental error, supporting transferable set-points that can be encoded as standard device programs (and scaled by jar volume) to reduce operator variability.
In practice, the EMP’s programmable control enables recipe-level standardization: (i) select a default window (e.g., 1500–1900 rpm, 6–10 min) for most creams; (ii) elevate energy (≥1900 rpm and/or 10 min) for low-dose suspensions; (iii) cap rpm around 1500–1900 for polymeric gels to limit entrapped air; and (iv) use pre-validated programs (Emulsion/Emulsion+) when thermal steps or multi-phase incorporation are required. These findings mechanistically explain the homogeneity outcomes in
Section 3.2 and provide DOE-supported set-points that can be incorporated into a QbD-aligned control strategy for routine compounding.
3.4. Translation of DOE Optima to Device Set-Points
To make the design space explicit,
Table 8 consolidates the DOE-defined operational design space (within the studied rpm/time ranges) for each formulation, i.e., combinations predicted to meet the homogeneity criterion (90–110% label claim; CV ≤ 5%), meaning that the mixing conditions predicted by the response-surface models to optimize content uniformity were compared with the practically achievable settings on the Unguator™. Predicted model optima and corresponding practical device set-points are summarized in
Table 8, and confirmation batches at the practical set-points met the predefined homogeneity criteria.
Minor offsets between model optima and practical set-points reflect the Unguator™’s discrete speed increments and programmed time steps, as well as operational considerations (e.g., air incorporation risk in viscous gels at very high rpm; thermal profiles for Emulsion/Emulsion+). For F2 (low-dose clobetasol), the nearest practicable speed (1650 rpm) paired with 10 min sustained mixing reproduced the uniformity gains predicted at 1860 rpm, consistent with the DOE-identified dominance of process energy. For F3 (diclofenac/HA gel), extending the mixing time at a slightly lower rpm (800 rpm/10 min) achieved the same endpoint while mitigating bubble formation, in line with the curvature observed in viscous systems. F4 demonstrated two viable operating windows; the Unguator™ settings chosen map closely to both modeled optima and support workflow flexibility. F5 (hydroquinone) remained robust across the mid-high region, allowing a small downward adjustment in rpm without loss of content uniformity.
These findings indicate that model-informed set-points can be faithfully implemented on the Unguator™ within its native control scheme, and that equivalent quality outcomes can be achieved through rational trade-offs between time and speed when exact numeric optima are not available. This alignment supports routine deployment of device-encoded recipes for standardized, reproducible compounding.
3.5. Cleaning and Cross-Contamination Test
Extracts were analyzed by UHPLC using the validated method; hydroquinone identity was confirmed by retention-time concordance with a reference standard. The acceptance criterion for cleaning validation was <10 ppm (0.001%) hydroquinone. At baseline, no hydroquinone was detected. Immediately after compounding, hydroquinone was detectable but remained below the limit of quantification (LOQ) across sampled surfaces, and in all cases well below 10 ppm. Following the cleaning procedure, no hydroquinone was detected on any surface.
This ≤10 ppm threshold was adopted as a conservative, practical proof-of-concept hygiene criterion for compounding equipment verification and corresponds to ≤50 µg hydroquinone per swab in our 5 mL extraction setup; it was not intended to represent a universal HBEL/PDE-based limit across all APIs and vehicles. Risk-based approaches linking residue limits to dose/toxicology and worst-case bracketing are described in established cleaning-validation guidance documents [
18,
19,
20].
These results demonstrate that, when the prescribed cleaning procedure is applied, the Unguator™ does not retain detectable hydroquinone residues and presents a minimal risk of cross-contamination between batches, supporting compliance with pharmaceutical quality and safety requirements.
4. Discussion
The validation and forced-degradation studies confirmed that the UHPLC assays were stability-indicating and suitable for quantifying APIs in semisolid matrices compounded with the Unguator™. Peak purity and baseline resolution of degradants established method specificity, and the predefined performance thresholds for linearity, precision, and accuracy were consistently met. Consequently, subsequent observations on content uniformity and mixing performance can be ascribed to process and formulation factors rather than analytical bias.
Across chemically and rheologically diverse systems, the Unguator™ delivered uniform products for most formulations under the evaluated conditions. Single-API creams with urea and hydroquinone showed tight layer-to-layer agreement and low variability, indicating efficient dispersion in both high-solid and dissolved-API scenarios. By contrast, clobetasol 0.05% and the salicylic-acid component of the urea–salicylic acid combination exhibited greater between-strata variability in some settings, consistent with partial sedimentation or incomplete deagglomeration at low dose or in multicomponent matrices. These observations are coherent with prior reports showing that mixing method and energy input influence microstructure and performance of semisolid systems, including adhesion, rheology, and drug delivery outcomes; bead-mill homogenization, for example, has been associated with drug crystallization and loss of adhesive properties, ultimately reducing transdermal delivery, and dilution effects can modify rheology and skin penetration [
21]. Similarly, the type of base and incorporation method can alter stability and efficacy [
22]. Topical products also undergo “in-use” metamorphosis (evaporation, crystallization, viscosity drift) affecting release and permeation [
23]. Together, these factors rationalize the formulation-dependent uniformity seen here and underscore the need to tailor mixing conditions to API dose, wetting behavior, density mismatch, and vehicle rheology. In the present study, these functional CQAs (e.g., rheology/spreadability/adhesiveness, microstructure, and release/permeation) were not directly measured; thus, the DOE and set-points reported here are optimized with content uniformity as the primary endpoint.
From a mechanistic standpoint, formulation-dependent sensitivity to EMP settings is consistent with established drivers of uniformity in semisolid dosage forms: the physical state of the API (molecularly dissolved versus dispersed as a suspension), particle wetting/agglomeration behavior, density mismatch between dispersed phase and vehicle, and the vehicle’s microstructure and rheology. Regulatory and scientific frameworks for topical products emphasize that process parameters can change microstructure and, consequently, functional performance measures, such as drug release, even when composition remains constant [
24,
25].
Low-dose, poorly soluble APIs incorporated as suspended solids are generally more sensitive to shear history because adequate mixing energy is required to de-agglomerate particles, improve wetting, and maintain homogeneous dispersion, thereby limiting stratification during and after compounding. This behavior aligns with the literature in which manufacturing variables (including homogenization speed and time) materially influence the viscosity/microstructure and in vitro release behavior of clobetasol propionate creams, illustrating that processing changes can alter semisolid structure and performance [
26].
Similarly, petrolatum-based matrices can exhibit strong shear sensitivity because the semicrystalline hydrocarbon network is mechanically disrupted by mixing, which can alter viscosity, thixotropy, and spreadability, and can also improve dispersion of suspended solids such as salicylic acid. Mixing-induced microstructural changes in white petrolatum and salicylic-acid ointment have been shown to modify rheology and can measurably affect drug permeation, underscoring why rpm/time effects may be formulation-specific for ointment-like systems [
27].
In contrast, systems in which the API is largely dissolved in the vehicle (or rapidly dissolves during processing) are expected to show reduced sensitivity to shear-driven dispersion effects, because uniformity becomes less dependent on particle break-up and more dependent on ensuring complete dissolution and adequate macro-mixing. This provides a plausible basis for the comparatively modest sensitivity observed in the highly water-soluble urea system within the studied mixing window, where time primarily ensures dissolution while speed mainly influences entrained air and handling properties.
For oxidation- and light-sensitive actives, such as hydroquinone, the compounding process is also relevant to chemical stability because oxygen incorporation, metal contact, and light exposure can accelerate degradation in topical preparations. Published studies on compounded hydroquinone formulations highlight stability vulnerabilities and the influence of formulation choices on delivery and stability; accordingly, minimizing avoidable oxygen/light exposure during preparation is a prudent operational consideration. In the present study, forced degradation was used to support assay specificity rather than to claim product photostability, but these literature considerations contextualize why controlled, closed-jar processing and appropriate packaging are relevant when working with oxidation-prone APIs [
28,
29].
Forced-degradation results further contextualize process selection. Clobetasol showed pronounced base lability, whereas the diclofenac–hyaluronic acid gel was highly unstable under acidic conditions, aligning with known degradation pathways. Stress testing is a critical element of pharmaceutical development because it reveals intrinsic stability, degradation routes, and potential degradants, thereby underpinning stability-indicating method validation and risk control [
30,
31]. Within the neutral, ambient, closed-system conditions typical of compounding with the Unguator™, degradation effects are unlikely to confound content-uniformity outcomes; rather, dispersion efficiency is the primary driver of between-strata differences for the low-dose emulsion.
In this study, the DoE evaluated only two process factors (mixing time and rotation speed) because these are the primary, directly programmable EMP parameters governing mixing energy input and are readily standardized across sites. Other potentially relevant variables (jar size/fill level and headspace, blade configuration/orientation, staging of component addition, and initial product temperature) were standardized by protocol to avoid confounding and to maintain feasibility across five formulation matrices; these factors will be prioritized in follow-up studies, starting with fill level/headspace and staged addition.
For clobetasol, the response-surface analysis indicated that rotation speed (rpm) had a stronger influence than mixing time on content uniformity, consistent with the need for higher mixing energy to disrupt agglomerates and reduce stratification in low-dose suspensions. For urea, rpm also predominated, although all conditions remained close to the target assay range. Diclofenac similarly showed a stronger dependence on rpm than time, while the urea–salicylic acid system exhibited formulation-dependent behavior: the urea assay showed only modest sensitivity within the studied range, whereas the salicylic-acid assay was more time-sensitive, consistent with a wetting/dissolution-limited dispersion step under these conditions. These patterns are consistent with mixing theory and published observations that higher rotation speeds shorten the path to blend homogeneity when particle size heterogeneity and wetting limitations are present, albeit with a practical ceiling to avoid over-mixing or air entrapment [
32,
33,
34]. Particle size and solubility further modulate outcomes; smaller particles can improve distribution yet increase cohesive agglomeration at low shear, extending the time to homogeneity [
35], and differences in dissolution kinetics across particle sizes can translate to altered bioavailability and distribution [
36]. In practice, this means that low-dose, poorly wetting, or density-mismatched APIs profit from higher shear and adequate time, whereas polymer-rich gels may require capped rpm with sufficient duration to balance shear against bubble formation.
Translating model optima to device set-points demonstrated good concordance despite discrete speed/time increments in the Unguator™ control scheme. As summarized in
Table 8, F1 mapped exactly (600 rpm/2 min). F2 required a modest reduction from the modeled 1860 rpm to the practical 1650 rpm at 10 min without loss of uniformity. F3 favored a longer time at slightly lower rpm (800 rpm/10 min) relative to the modeled 960 rpm/2.8 min, consistent with curvature observed in viscous gels. F4 supported two windows (mid-rpm/long-time or higher-rpm/shorter-time) both of which were closely approximated by the available device levels (1400 rpm/10 min; 1900 rpm/7:15 min). F5 remained robust across the mid-high region, allowing a small downward rpm adjustment. Confirmation runs at these practicable set-points met the prespecified uniformity criteria, indicating that rational trade-offs between time and speed can reproduce response-surface performance within routine device constraints.
Operationally, both pre-programmed cycles (“Emulsion,” “Emulsion+”) and customized parameter sets achieved reproducible outcomes. The former reduces operator variability and streamlines routine work; the latter permits adaptation to challenging systems where dose, particle size distribution, or excipient interactions demand targeted shear-time combinations. Coupled with the closed, disposable-jar workflow, these features support GMP-aligned hygienic control and minimize cross-contamination risk, as corroborated by the cleaning validation where post-sanitization residues were below detection and well under the 10-ppm acceptance limit [
5,
16].
QbD is typically implemented as a lifecycle program; however, in compounding practice, the most operationalizable elements are explicit definition of target attributes, identification of the process parameters that drive those attributes, and establishment of actionable operating ranges and controls. In this study, we, therefore, focused on: (i) QTPP and CQA definition for semisolid compounding, (ii) experimental evaluation of CPPs (rpm and time) via DOE against the primary CQA (content uniformity), (iii) definition of an operational design space bounded to the studied rpm/time ranges (
Table 9), and (iv) verification of hygiene controls via cleaning/cross-contamination validation. This condensed risk-mapping approach is aligned with the tool-based risk assessment concepts described in ICH Q9(R1) and is presented here in a pragmatic format suitable for routine compounding implementation [
8].
To make the CPP/CMA–CQA linkage and risk assessment explicit,
Table 9 provides a condensed FMEA-style mapping and risk ranking, together with controls informed by the present data. Based on this mapping, a practical control strategy for routine EMP compounding is: (a) standardize input handling (weighing accuracy, incorporation order, and jar fill volume/headspace) and record device set-points batch-to-batch; (b) control CPPs using the formulation-appropriate operational windows in
Table 8 (e.g., high energy/adequate time for low-dose suspensions; capped rpm with sufficient time for polymeric gels to minimize air entrapment); (c) apply pre-validated device programs (Emulsion/Emulsion+) when multi-phase or thermally influenced steps are required; and (d) maintain hygiene controls through the validated cleaning/sanitization procedure and periodic verification against the acceptance limit. This control strategy should be viewed as a step toward broader lifecycle QbD implementation; additional work (e.g., multi-operator/multi-site robustness, broader CMA variation, and expanded CQAs, such as rheology and release) is needed for comprehensive lifecycle validation.
From an implementation perspective, for several formulations (e.g., F1) the device permitted exact implementation of the predicted set-point, whereas for other formulations, minor parameter rounding to the nearest available rpm/time step or programmatic adjustments were required due to equipment granularity and predefined mixing levels. Importantly, confirmation batches compounded at the practical set-points met the predefined homogeneity criteria (90–110% label claim; CV ≤ 5%), indicating that modest deviations from the model optima did not compromise content uniformity.
It is also important to mention that this study has limitations. First, all APIs, semisolid vehicles (Versatile™ and Nourivan Antiox™), and the EMP device/jar/blade system were sourced from a single manufacturer. Second, while the selected formulations are representative in terms of physicochemical challenge (high solids, low-dose suspension, polymeric gel, multicomponent system, oxidation-sensitive API), they do not encompass the full range of semisolid microstructures used in practice, and the DOE levels reflect a practical operating space rather than an exhaustive exploration of shear history or temperature effects. A further limitation is that impurity formation in the compounded products was not comprehensively profiled (e.g., identification/qualification of degradants or impurity mass balance) beyond the forced-degradation work used to demonstrate assay specificity. Although the validated UHPLC methods were designed to be stability-indicating and closed-jar processing was used to limit avoidable oxygen/light exposure, oxidation- or light-sensitive APIs (e.g., hydroquinone) warrant targeted impurity monitoring and formal photostability/stability studies in future work if longer BUDs or product-specific stability claims are intended. Future work should also include broader base classes (including anhydrous ointments and other gel networks), rheometry and air-content quantification, and particle-size/shape analytics. Additionally, the DOE-driven optimization and the reported operating windows were defined using assay-based content uniformity outcomes (mean % label claim and strata variability). Importantly, the present study did not include a head-to-head comparison with conventional manual mortar-and-pestle compounding or with other automated mixing devices used in routine practice. Comparative studies are warranted to benchmark homogeneity, operator-to-operator variability, and process robustness against manual and alternative devices. Finally, a pilot microplastics screen (single determinations with FTIR-based polymer identification) suggested polyethylene particles in petrolatum-based materials; this observation is hypothesis-generating and was not used for quantitative risk assessment
From a transferability perspective, the qualitative trends observed here are expected to hold across many semisolid systems, even if the quantitative rpm/time values do not. Specifically, low-dose APIs and poorly wetting or density-mismatched powders generally require higher specific mixing energy and/or longer effective mixing time to avoid agglomeration and stratification, whereas structured polymeric gels may benefit from capped rpm with longer duration to balance dispersion against air entrapment. Multicomponent systems may exhibit more than one viable operating window depending on which component is rate-limiting for wetting and dispersion. However, the numeric set-points reported in this study are device- and base-dependent because shear field, flow pattern, and energy dissipation vary with mixer geometry, blade design, jar dimensions, fill volume/headspace, and vehicle rheology.
Nevertheless, the combined validation, degradation, homogeneity, and DOE findings provide a coherent, QbD-aligned step toward model-informed, device-encoded recipes that can improve parameter control and reproducibility while respecting formulation-specific constraints, reinforcing the Unguator™ as a versatile platform for semisolid preparation in compounding practice.