Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting

Rojas, Fernando; Yáñez, Jorge; Cortés, Magdalena

doi:10.3390/math13183001

Open AccessArticle

Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting

by

Fernando Rojas

^1,*

,

Jorge Yáñez

^1,2 and

Magdalena Cortés

¹

Escuela de Química y Farmacia, Facultad de Farmacia, Universidad de Valparaíso, Gran Bretaña 1093, Valparaíso 2340000, Chile

²

Laboratorio Clínico, Hospital Claudio Vicuña, San Antonio 2206160, Chile

^*

Author to whom correspondence should be addressed.

Mathematics 2025, 13(18), 3001; https://doi.org/10.3390/math13183001

Submission received: 21 August 2025 / Revised: 7 September 2025 / Accepted: 12 September 2025 / Published: 17 September 2025

(This article belongs to the Special Issue Hybrid Machine Learning and Deep Learning Techniques for Optimization and Numerical Modeling)

Download

Browse Figures

Versions Notes

Abstract

Clinical laboratories require accurate forecasting and efficient inventory management to balance service quality and cost under uncertain demand. In this study, we propose a hybrid forecasting–optimization framework tailored to hospital clinical determinations with highly irregular, zero-inflated, and asymmetric consumption patterns. Demand series for 34 items were modeled using Seasonal AutoRegressive Integrated Moving Average with eXogenous regressors (SARIMAX) structures combined with skew-normal (SN) and zero-inflated skew-normal (ZISN) residuals, with residual centering, truncation, and lambda regularization applied to ensure stable estimation. Model performance was benchmarked against Gaussian SARIMA and non-linear MLP forecasts. The SN/ZISN models achieved improved forecasting accuracy while preserving interpretability and explainability of residual behavior. Forecast outputs were integrated into a Particle Swarm Optimization (PSO) layer to determine cost-minimizing order quantities subject to packaging and budget constraints. The proposed end-to-end framework demonstrated a potential 89% reduction in inventory costs relative to the hospital’s historical policy while maintaining service levels above 85% for high-volume determinations. This hybrid approach provides a transparent, domain-adapted decision support system for supply chain governance in healthcare settings. Beyond the specific case of Chilean hospitals, the framework is adaptable to broader healthcare supply chains, supporting generalizable applications in diverse institutional contexts.

Keywords:

inventory forecasting; skew-normal residuals; SARIMAX; zero inflation; healthcare supply chain; PSO optimization; explainable forecasting

MSC:

60G10

1. Introduction

Clinical laboratories are essential to public health [1], supporting medical diagnosis as well as clinical and pharmacological monitoring—core functions that ensure quality healthcare delivery and effective disease control. However, monthly demand for diagnostic analytes is highly uncertain and often follows cyclic, seasonal, and intermittent patterns [2]. These challenges are further intensified in public sector settings, where procurement cycles are lengthy, budgets are inflexible, and packaging regulations limit supply responsiveness.

This context highlights a critical operational problem: laboratories face recurrent shortages or overstocking of reagents due to demand uncertainty and rigid procurement constraints [3]. Efficient forecasting and inventory control are therefore indispensable to balance service continuity with cost containment [4].

Time series models such as ARIMA and SARIMAX are commonly used for demand forecasting in healthcare and logistics [5]. Nevertheless, these models typically assume homoscedastic and symmetric Gaussian residuals. This assumption is often violated in clinical demand series, which exhibit non-normal behaviors such as skewness, high variance, and zero inflation due to sporadic usage patterns [6]. To address this, the use of skew-normal distributions has been proposed to capture residual asymmetry [7], while zero-inflated variants help account for excess zero observations [8]. However, even the most accurate forecasts need to be operationalized into ordering policies that comply with real procurement constraints. This motivates the integration of statistical models with optimization techniques, ensuring that forecast outputs translate into practical inventory decisions.

In this context, metaheuristic optimization techniques such as Particle Swarm Optimization (PSO), Genetic Algorithms (GAs), and Ant Colony Optimization (ACO) offer flexible strategies for solving non-linear and constrained inventory problems [9]. These methods support adaptive parameter tuning and are well suited for multi-stage stochastic planning environments [10]. However, they are rarely integrated with domain-specific statistical models for forecast-driven inventory control in clinical applications. Some attempts exist, such as the use of metaheuristics to improve pharmaceutical supply chains during health crises [11] or stochastic programming combined with forecasting approaches in capacity expansion planning [10]. Similarly, hybrid forecasting–inventory models have been applied to account for reliability and seasonality in supply chains [4]. However, these studies typically rely on simplified demand assumptions and do not incorporate residual asymmetry or zero inflation, underscoring the novelty of our proposed framework.

Taken together, these limitations define the research gap: current studies rarely offer integrated frameworks that combine the following (i) skew-aware and zero-sensitive statistical forecasting, (ii) domain-specific optimization under procurement constraints, and (iii) explainable modeling that connects statistical residuals to operational decisions. As a result, transferability of insights into real clinical laboratories remains limited.

Recent interest in explainable artificial intelligence (XAI) highlights the need for transparency in hybrid modeling approaches [12]. While deep learning models can offer high predictive accuracy, their black-box nature limits their interpretability in high-stakes domains like healthcare. In contrast, SARIMAX models with structured, non-Gaussian residuals can provide both interpretability and performance [13].

The novelty of our contribution lies in bridging this gap through the following three contributions:

Extending SARIMAX models with skew-normal and zero-inflated skew-normal residuals, providing skew-aware and zero-sensitive statistical forecasting.
Embedding these forecasts into a PSO-based optimization layer to generate cost-minimizing and constraint-feasible inventory decisions.
Validating the proposed hybrid framework in a clinical laboratory setting, explicitly incorporating institutional constraints such as packaging formats and fixed procurement budgets.

In line with recent advances in hybrid modeling [14], our study connects statistical forecasting with optimization techniques. For example, Chechkin et al. (2025) proposed a hybrid KAN-BiLSTM Transformer with multi-domain dynamic attention to enhance cybersecurity predictions [15]. Similar to our work, their approach integrates multiple architectures to improve interpretability and mitigate risks in critical domains. This reinforces the relevance of combining SARIMAX with asymmetric and zero-inflated errors, together with PSO, to address challenges in medical supply inventory forecasting.

For clarity, the detailed discussion of related systems and previous works will be referred to in the following Related Work subsection, focusing now on three key aspects: (i) defining the real challenges faced by clinical laboratories, (ii) describing our hybrid SARIMAX-PSO framework in a simple way, and (iii) highlighting the originality and contributions of this study.

Related Work

A number of studies have explored demand forecasting and inventory optimization in healthcare and related domains. Traditional statistical models (e.g., ARIMA, SARIMA) have been widely applied but often assume Gaussian residuals, which limit their ability to capture skewness and zero inflation. Machine learning models (e.g., neural networks, LSTMs) provide flexibility but suffer from limited interpretability in high-stakes clinical contexts. Metaheuristic approaches such as PSO, GA, and ACO have been employed for inventory control and pharmaceutical logistics, yet they are rarely integrated with domain-specific forecasting models. Table 1 summarizes representative prior works, their methodological approaches, and key limitations. Compared to this existing body of research, our contribution explicitly bridges three dimensions that remain largely unaddressed: (i) the extension of SARIMAX models with skew-normal and zero-inflated residuals to handle asymmetric and intermittent clinical demand; (ii) the embedding of these forecasts into a metaheuristic optimization layer (PSO) that respects real procurement constraints such as budgets and packaging multiples; and (iii) the provision of an explainable and transparent workflow that directly links statistical residuals with operational decisions. This hybrid framework therefore advances beyond prior work by combining skew-aware forecasting with constraint-feasible inventory optimization in a healthcare setting.

As shown in Table 1, previous studies emphasize either forecasting without optimization or optimization without domain-specific residual structures. None combine skew-aware and zero-sensitive statistical modeling with metaheuristic optimization under real procurement constraints, which defines the novelty of the present work.

The remainder of this paper is structured as follows. Section 2 presents the methodology, including data description, forecasting model specification, residual diagnostics, and the optimization strategy using PSO. Section 3 reports the main results, focusing on forecast accuracy, residual characteristics, and inventory cost savings. Section 4 discusses the implications of the findings in terms of explainability and operational value. Finally, Section 5 concludes with a summary of contributions and suggestions for future research.

2. Methodology

This section describes the modeling and optimization framework used in this study, organized in four sequential stages: (i) forecasting models, (ii) optimization phase (global PSO), (iii) reproducible workflow and algorithmic steps, and (iv) performance metrics.

2.1. Forecasting Models

To model the irregular and asymmetric nature of clinical laboratory reagent demand, we implemented SARIMAX models with structured residuals:

Skew-normal (SN) residuals, capturing asymmetry;
Zero-inflated skew-normal (ZISN) residuals, capturing both asymmetry and excess zeros.

The general SARIMAX model is defined as in Equation (1):

Y_{t} = X_{t}^{'} β + Z_{t}^{'} γ + ε_{t}

(1)

where

-: $Y_{t}$ is the observed demand at time t;
-: $X_{t}$ are exogenous regressors (e.g., calendar month dummies);
-: $Z_{t}$ captures autoregressive and moving average components;
-: $β$ , $γ$ are the associated parameter vectors;
-: $ε_{t}$ is the residual term.

We consider two specifications for

ε_{t}

:

$ε_{t} \sim SN (μ_{t}, σ^{2}, λ)$ : Skew-normal distribution with location $μ_{t}$ , scale $σ$ , and skewness $λ$ .
$ε_{t} \sim ZISN (μ_{t}, σ^{2}, λ, p)$ : Zero-inflated skew-normal distribution, combining a point mass at zero with a skew-normal component.

The skew-normal (SN) density is expressed in Equation (2):

f_{SN} (ε) = \frac{2}{σ} ϕ (\frac{ε - μ}{σ}) Φ (λ \cdot \frac{ε - μ}{σ})

(2)

where

ε

denotes the residual term,

μ

the location parameter,

σ

the scale parameter, and

λ

the skewness parameter. Here,

ϕ (\cdot)

and

Φ (\cdot)

represent the standard normal probability density function (PDF) and cumulative distribution function (CDF), respectively.

The zero-inflated skew-normal density is then defined in Equation (3):

f_{ZISN} (ε) = p \cdot δ_{0} (ε) + (1 - p) \cdot f_{SN} (ε)

(3)

where

δ_{0} (ε)

is the Dirac delta at zero and

p \in [0, 1]

is the zero-inflation probability.

Parameter estimation proceeds in two stages:

Maximum Likelihood Estimation (MLE) of the SARIMAX baseline parameters ( $β, γ$ ).
Expectation-Maximization (EM) algorithm for structured residual parameters $λ$ and p in the SN/ZISN models, following the approach described in [6].

In line with our previous work [6], we expanded the description of the zero-inflated skew-normal (ZISN) residuals. Specifically, the residual structure is defined by a two-part process: (i) a logistic component that models the probability of excess zeros, and (ii) a skew-normal component that accounts for asymmetry in the non-zero demand. These assumptions allow the residual model to properly handle both structural zeros and skewness.

Formally, the residual distribution is expressed as a mixture in Equation (4):

f (y_{t}) = π_{t} \cdot 1_{{y_{t} = 0}} + (1 - π_{t}) \cdot f_{SN} (y_{t} ∣ ξ, ω, α),

(4)

where

π_{t}

represents the probability of structural zeros and

f_{SN}

is the skew-normal density with location

ξ

, scale

ω

, and shape

α

. This formulation ensures that both zero inflation and asymmetry are consistently incorporated into the residual process. Equation (4) explicitly defines the ZISN distribution used in our residual modeling.

Illustrative Example

Consider a reagent with 30% zero observations in the training dataset. The logistic component of the ZISN residual assigns a probability

π_{t} \approx 0.30

to structural zeros, indicating that nearly one-third of the observed zeros are not random but systematic (e.g., due to clinical protocols or supply pauses). For the remaining 70% of cases, the skew-normal component

f_{S N} (\cdot | ξ, ω, α)

captures the asymmetric distribution of positive demand values. In practice, this means that when the model encounters a zero observation, it first evaluates whether it belongs to the structural zero process (

π_{t}

) or to a stochastic fluctuation around the skew-normal mean. This distinction allows the residual model to simultaneously account for recurrent non-usage periods and irregular positive demand, thereby improving both interpretability and forecasting accuracy.

Parameter estimation is performed using penalized maximum likelihood as shown in Equation (5):

ℓ_{p} (θ) = ℓ (θ) - λ {∥ θ ∥}^{2},

(5)

where

ℓ (θ)

denotes the log-likelihood of the ZISN distribution and

{λ ∥ θ ∥}^{2}

is a quadratic penalty used to regularize parameter growth and prevent overfitting. The penalty parameter

λ

is selected adaptively using either the Akaike Information Criterion (AIC) and Bayesian Information Criterion (BIC) or cross-validation. This regularization strategy improves model stability, especially under sparse and highly asymmetric demand conditions, as expressed in Equation (5).

2.2. Multilayer Perceptron (MLP) Benchmark

As a non-linear forecasting benchmark, we trained a feed-forward MLP regressor [16]. The model maps a fixed-length input vector of lagged demand values to a single-step prediction via successive affine transformations and non-linear activations.

Let

u_{t} = [y_{t - 1}, y_{t - 2}, y_{t - 3}] \in R^{3}

be the input vector at time t. The MLP output is computed according to Equation (6):

{\hat{y}}_{t} = f^{(3)} (W^{(3)} f^{(2)} (W^{(2)} f^{(1)} (W^{(1)} u_{t} + b^{(1)}) + b^{(2)}) + b^{(3)})

(6)

where

-: $W^{(l)} \in R^{d_{l} \times d_{l - 1}}$ and $b^{(l)} \in R^{d_{l}}$ are the weight matrices and bias vectors for layer $l = 1, 2, 3$ ;
-: $f^{(1)} (\cdot)$ , $f^{(2)} (\cdot)$ are ReLU (Rectified Linear Unit) activation functions $ReLU (x) = max (0, x)$ ;
-: $f^{(3)} (\cdot)$ is the identity function (linear output);
-: ${\hat{y}}_{t}$ is the predicted demand at time t.

In our implementation, the hidden-layer dimensions are

d_{1} = 32

,

d_{2} = 16

, and

d_{3} = 1

. The model is trained to minimize the Mean Squared Error (MSE) defined in Equation (7):

L = \frac{1}{T} \sum_{t = 1}^{T} {(y_{t} - {\hat{y}}_{t})}^{2}

(7)

Optimization is performed using the Adam algorithm [27] with learning rate

η = 0.001

, for up to 1000 epochs. Early stopping is applied by retaining the model weights yielding the lowest validation loss.

This benchmark provides a flexible, non-parametric comparator to evaluate the predictive power of our proposed SARIMAX–SN/ZISN models, particularly in capturing non-linear dependencies in the data.

2.3. Optimization Phase (Global PSO)

The forecasted monthly demand

{\hat{D}}_{i}

for each laboratory reagent was used to compute the optimal order quantities

Q = (Q_{1}, \dots, Q_{N})

by solving a multivariate, non-linear, and constrained inventory control problem.

The packaging constraint is formulated in Equation (8):

Q_{i} = k_{i} \cdot {pack}_{i}

(8)

where

k_{i} \in N_{0}

is the number of packaging units to be ordered for item i.

The procurement cost function is defined in Equation (9):

C (k) = \sum_{i = 1}^{N} [c_{i} Q_{i} + I_{{Q_{i} > 0}} o_{i} + h_{i} \cdot {(Q_{i} - {\hat{D}}_{i})}^{+} + s_{i} \cdot {({\hat{D}}_{i} - Q_{i})}^{+}]

(9)

This is subject to the budget constraint expressed in Equation (10):

\sum_{i = 1}^{N} c_{i} Q_{i} \leq B

(10)

where

-: $c_{i}$ is the unit cost;
-: $o_{i}$ is the fixed ordering cost;
-: $h_{i}$ is the holding cost per excess unit;
-: $s_{i}$ is the shortage cost per missing unit;
-: B is the total available budget.

Because the problem is combinatorial, non-linear, and non-differentiable, we apply Particle Swarm Optimization (PSO) to find a near-optimal solution

k^{*}

. Each particle represents an integer vector

k = (k_{1}, \dots, k_{N})

, initialized randomly in

[0, k_{i}^{max}]

for each item.

To enforce the budget constraint, the penalized cost function in Equation (11) is used:

\tilde{C} (k) = C (k) + ρ \cdot max (0, \sum_{i = 1}^{N} c_{i} Q_{i} - B)

(11)

with

ρ ≫ 1

being a penalty coefficient to discourage budget violations.

2.3.1. Mathematical Formulation of PSO

Particle Swarm Optimization is a population-based metaheuristic inspired by the collective behavior of swarms [9,28]. Each particle has a position

k^{(t)}

and a velocity

v^{(t)}

, both updated iteratively. The velocity update is defined in Equation (12), while the position update is given by Equation (13):

v^{(t + 1)} = w \cdot v^{(t)} + c_{1} \cdot r_{1} \cdot (p^{(t)} - k^{(t)}) + c_{2} \cdot r_{2} \cdot (g^{(t)} - k^{(t)})

(12)

k^{(t + 1)} = k^{(t)} + v^{(t + 1)}

(13)

where

-: w is the inertia weight (balances exploration and exploitation);
-: $c_{1}$ , $c_{2}$ are cognitive and social acceleration coefficients;
-: $r_{1}$ , $r_{2} \sim U (0, 1)$ are random numbers;
-: $p^{(t)}$ is the personal best position of the particle;
-: $g^{(t)}$ is the global best found by the swarm.

As our problem requires integer decisions with packaging constraints, we apply rounding after each position update and enforce Equation (8) to compute feasible order quantities.

The PSO configuration used 50 particles, 200 iterations, inertia weight

w = 0.7

, and acceleration coefficients

c_{1} = c_{2} = 1.5

. The best feasible solution

k^{*}

yields final order quantities

Q^{*} = k^{*} \circ pack

.

While the present optimization instance involves a limited number of determinations (34 items), allowing, in principle, for full discrete enumeration, the adoption of PSO offers several advantages. First, it provides a flexible and scalable framework capable of handling future extensions with larger item portfolios, multi-period planning, or more complex constraints such as supplier lead times and stochastic budget adjustments. Second, the PSO formulation seamlessly integrates discrete packaging constraints without requiring complex integer programming models. This design ensures that the optimization layer remains generalizable and readily applicable to broader health supply chain contexts beyond the scope of the present study. The choice of PSO is supported by the extensive literature illustrating its efficiency and versatility across application domains, please see [29,30].

2.3.2. Validation Against Exact Optimization

To assess the reliability of PSO, we benchmarked it against an exact branch-and-bound solver on a 10-item subset of the dataset under the same budget constraints. The results showed that PSO can reproduce or closely approximate the exact optimal cost in small-scale cases. While enumeration or Mixed Integer Linear Programming (MILP) approaches are feasible for small instances, PSO offers superior scalability as the number of items grows, which justifies its adoption in our framework.

2.4. Reproducible Workflow and Algorithmic Steps

To improve clarity and follow best practices for reproducible research, we summarize the end-to-end procedure as algorithmic steps, see Algorithm 1, and provide the flowchart Figure 1 for forecasting and residual specification workflow (SARIMAX + ZISN). The complete, reproducible Python 3.11.8 version implementation (including residual centering, outlier truncation, and

λ

-regularization for skew-normal parameters) is available in a public Zenodo repository [31].

Algorithm 1: Forecasting and residual specification workflow (SARIMAX + ZISN)

The algorithmic steps for carrying out an inventory–cost optimization with packaging multiples and budget constraint are depicted in Algorithm 2 and the flowchart in Figure 2. The full implementation of the optimization routine (including packaging multiples and budget constraint) is provided in the Zenodo repository [31].

Algorithm 2: Inventory–cost optimization with packaging multiples and budget constraint

2.5. Performance Metrics

Forecast accuracy was assessed using Mean Absolute Error (MAE) and Root Mean Square Error (RMSE). MAE quantifies average absolute deviation [32], while RMSE penalizes larger errors more heavily [33], thus providing complementary views of accuracy and stability.

3. Results

This section presents the performance of the forecasting models, residual diagnostics, and inventory cost optimization using the calibrated demand forecasts. We analyzed 36 months of consumption for 34 clinical determinations from a Chilean public hospital, see public repository in [31]. Data pre-processing (outlier checks, missing handling) followed the reproducible workflow in Section 2.4.

3.1. Data Description and Parameters

The dataset used in this study consists of monthly consumption data for 34 clinical laboratory items over a three-year period (January 2021–December 2023). This dataset originates from the Clinical Chemistry section of a public hospital in Chile and was approved by the Ethics Committee of the Faculty of Chemistry and Pharmacy, University of Valparaíso (Approval Code: CG-05-2024). Realistic procurement constraints, such as packaging multiples and budget limitations, were incorporated into the analysis. Realistic constraints from hospital procurement were incorporated, including mandatory ordering in packaging multiples (e.g., boxes of 12 or 24 units).

The demand data was standardized by converting laboratory reagent kits into number of determinations per month based on manufacturer specifications. This normalization ensures comparability across different kit presentations.

The following cost parameters were estimated based on hospital records:

Ordering cost ( $o_{i}$ )—average cost of issuing a purchase order, including administrative labor and shipping.
Holding cost ( $h_{i}$ )—annual cost of storing one determination unit, considering energy, losses, and space.
Shortage cost ( $s_{i}$ )—cost associated with stockouts, estimated from rescheduling procedures and patient re-attendance.
Procurement cost ( $c_{i}$ )—average unit cost of each determination, computed from historical purchasing prices.

All monetary values were adjusted to 2023 CLP using the Consumer Price Index (IPC in Spanish) inflation index. An initial inventory level

I_{0}

was defined based on stock records from December 2023.

The demand distributions exhibited substantial variability and occasional zero inflation, with residual patterns deviating from Gaussian assumptions. These characteristics motivated the adoption of structured residual models in subsequent analyses.

While this subsection presents the specific dataset used for empirical validation, it is important to note that the proposed hybrid SARIMAX–PSO framework is not restricted to this dataset. The methodology can be applied to other healthcare institutions or logistics domains facing uncertain and asymmetric demand. This reinforces the generalizability and adaptability of the approach.

3.2. Forecast Accuracy and Parameter Estimates

To improve readability, we summarize per-item forecasting performance using only the key error metrics (MAE, RMSE) in Table 2. All fitted distribution parameters (

λ, μ, σ, p_{0}

) and SARIMAX orders are now reported in the Appendix A, Table A1, together with additional diagnostics.

Only 1 analyte (Phenobarbital) exceeded the 10% zero-proportion threshold and was therefore assigned the SARIMAX–ZISN specification; the remaining 33 analytes were modeled with SARIMAX–SN. Across the cohort, the skew-aware residual modeling reduced average forecasting errors relative to a Gaussian-error baseline. For the single ZISN case, MAE improved by 14% over its SN counterpart, confirming the benefit of explicitly modeling excess zeros.

As discussed there, several SN location estimates

μ

are negative despite prior residual centering. This arises because centering was applied globally before truncation, while the skew-normal fit is performed on trimmed residual subsets; hence,

μ

reflects the location of the cleaned residual kernel rather than the raw residual mean.

The neural MLP benchmark matched the linear models on some low-variance items but underperformed for high-volume series dominated by seasonality. For the MLP benchmark (three lagged inputs), we computed permutation feature importance on the hold-out window; lags

t - 1

and

t - 2

systematically contributed >80% of the explained variance, indicating that the network mostly captures short-term autocorrelation rather than complex seasonal patterns. To complement the tabular summary, Figure 3 and Figure 4 illustrate the main findings. Figure 3 provides a heatmap of MAE and RMSE across all analytes, highlighting the heterogeneous forecasting difficulty. Figure 4 contrasts SARIMAX–SN versus the MLP benchmark on representative high-variance and low-variance series, showing the gain from skew-aware residual modeling.

The figure shows that high-volume assays such as Glucose, Creatinine, and Cholesterol exhibit the largest errors, while low-volume assays (e.g., Phenobarbital, Ammonia) have minimal error values. This heterogeneity confirms that forecasting performance is item-specific and justifies the need for a flexible hybrid model.

The largest improvements are observed in analytes with highly skewed demand (e.g., Glucose, Cholesterol, Creatinine), where SARIMAX–SN significantly reduces MAE compared to MLP. In contrast, for stable low-variance analytes (e.g., Phenobarbital, Ammonia), both models converge to similar performance, indicating that skew- and zero-sensitive extensions are most beneficial under high variability.

Table 3 summarizes the forecasting accuracy across three approaches. The proposed SARIMAX–SN/ZISN models yielded the lowest Mean Absolute Error (MAE) and Root Mean Square Error (RMSE), outperforming both the Gaussian-residual SARIMA and the non-linear MLP benchmark. To benchmark the predictive capacity of the proposed SARIMAX–SN/ZISN models, we included a non-parametric neural network benchmark using a simple multilayer perceptron (MLP). While the MLP offers flexible non-linear approximation capabilities, its black-box nature lacks interpretability, particularly regarding residual asymmetry and the explainability required in clinical forecasting contexts. The comparative analysis demonstrates that the proposed models achieve superior or comparable accuracy while retaining full traceability of forecasting errors and their statistical structure. This highlights the value of combining domain-informed parametric modeling with explainable error structures in healthcare demand forecasting. The estimated skewness parameter

λ

was positive for

32 / 34

laboratory reagents, confirming the right-tailed behavior highlighted by skew-normal diagnostics, whereas the only analyte with a substantial zero proportion (Phenobarbital,

p_{0} \approx 0.30

) required the zero-inflated specification. These diagnostics justify the asymmetric error structures adopted.

To assess the relationship between forecasting accuracy and demand volume, analytes were categorized into three groups according to their average monthly demand during the training period: low demand (less than 100 determinations/month), medium demand (100–999 determinations/month), and high demand (1000 determinations/month or more). Table 4 presents the disaggregated MAE and RMSE values for each demand level.

As expected, absolute errors increase with demand volume. However, this reflects natural scale effects rather than model inefficiency. The forecast errors remain reasonably proportional to demand magnitude, as further explored through percentage-based metrics in subsequent sections.

Table 5 reports forecasting errors for each forecast step from one to six months ahead. Interestingly, the second forecast step exhibits the highest MAE and RMSE values, potentially reflecting short-term fluctuations or abrupt changes not fully captured by the autoregressive structure. Beyond this initial peak, errors stabilize across longer forecast horizons, suggesting that the models maintain robust performance over multiple steps ahead.

In addition to absolute error metrics, the Mean Absolute Percentage Error (MAPE) was computed to evaluate forecasting performance relative to demand volume. Table 6 summarizes the MAPE by demand level.

MAPE could not be calculated for the low-demand group due to zero-demand occurrences in the test set, which make percentage-based measures undefined. For the medium- and high-demand categories, the models achieved average MAPE values below 11%, indicating strong relative accuracy even for high-volume determinations.

3.3. Residual Diagnostics and Model Selection

After fitting each SARIMAX–SN/SARIMAX–ZISN model, we inspected the standardized residuals via Ljung–Box tests and quantile–quantile plots (QQ-plots), see results in Table A3 and Figure A1, respectively. Detailed QQ-plots and Ljung–Box test tables are reported in Appendix A. Here, we summarize that residual diagnostics confirmed the adequacy of the skew-normal and zero-inflated skew-normal specifications, with most analytes showing no significant autocorrelation and clear alignment with the assumed distributional shapes. This indicates that the adopted residual models appropriately captured asymmetry and zero inflation in clinical demand series.

To evaluate residual distributional assumptions, QQ-plots were constructed by comparing standardized residuals against the standard normal distribution. While the skew-normal specification introduces inherent asymmetry, QQ-plots remain informative to visually assess overall goodness of fit and detect residual heavy tails or misspecification. Future work could improve residual diagnostics by generating QQ-plots directly against the fitted skew-normal quantiles, providing a more precise graphical validation of the estimated shape parameters.

3.4. Optimization Outcomes

Table A2 of Appendix B summarizes the inputs used in the PSO-based optimization stage. For each determination, we report the best-fitting SARIMAX model order, the expected monthly demand (estimated as the mean forecast over a six-month horizon), and the associated cost parameters. These include the unit cost of purchase, the fixed cost of placing an order, and penalty terms for holding excess stock or facing shortages. Packaging constraints are also specified via the ‘Pack_Size’ column, which enforces that each order quantity must be an integer multiple of a fixed unit.

The illustrative figure (Figure 5) highlights the trade-off between forecasted demand and unit cost across analytes. Items such as Creatinine, Glucose, and HDL Cholesterol exhibit very high demand volumes but relatively low unit costs, which implies that they dominate overall procurement needs. In contrast, Lithium, Valproic Acid, and Ammonia have very low demand levels but extremely high unit costs, making them economically significant despite their smaller volumes. This dissociation underscores two critical aspects for optimization: (i) high-volume/low-cost items require efficient inventory management to prevent shortages, and (ii) low-volume/high-cost items impose substantial financial risk per unit and thus require careful procurement policies.

The ordering recommendations obtained with the PSO optimization layer were then fed into the inventory cost model; the resulting savings relative to the empirical hospital policy are summarized in Table 7. This end-to-end traceability—from residual shape parameters, predictive errors, and zero-inflation detection to constrained ordering policies—provides an explainable, domain-specific hybrid framework, fulfilling the design goals stated in the Introduction. The optimizer strictly satisfies the packaging constraint

Q_{i}^{*} = k_{i} \cdot {pack}_{i}

for each item, as detailed in Table 7. The fill rate remained above 85% for all high-volume items and above 70% across the full portfolio.

To emphasize the practical impact of the optimization stage, we explicitly contrast the historical inventory costs with those achieved by the optimized models. Table 8 presents this comparison. While the empirical hospital policy led to an average monthly cost of CLP 179.5 million, the baseline SARIMA with Gaussian residuals reduced this to CLP 32.1 million. Our proposed hybrid SARIMAX–SN/ZISN with PSO achieved an even lower cost of CLP 19.6 million, yielding an overall reduction of approximately 89% relative to the historical baseline. This direct comparison validates the optimization stage as a central component of the contribution.

Future work will consider multi-objective optimization approaches to simultaneously minimize cost and maximize service levels.

3.5. PSO Sensitivity and Robustness Analysis

Since PSO performance depends on hyperparameters, we performed a sensitivity analysis, varying the number of particles (20, 50, 100), iterations (100, 300), inertia weight w (0.5, 0.7, 0.9), and acceleration coefficients

c_{1} = c_{2}

(1.2, 1.8). We compared each configuration against the exact branch-and-bound solver on the 10-item subset. Table 9 shows the top five configurations by lowest PSO cost, while Table 10 shows the top five by smallest absolute cost gap. Our original setting (50 particles, 200 iterations,

w = 0.7, c_{1} = c_{2} = 1.5

) produced feasible and stable results but did not yield the closest match to the exact optimum. In contrast, alternative settings (e.g.,

w = 0.5, c_{1} = c_{2} = 1.8

) perfectly replicated the exact solution, confirming the robustness of PSO and the importance of calibration.

Table 9 and Table 10 provide complementary perspectives on the robustness of PSO. Table 9 highlights the top five parameter settings that achieved the lowest cost values, which were consistently below the exact solver. Although these runs appear to outperform the optimum, the negative gaps indicate that PSO can converge to infeasible cost under certain hyperparameter combinations, underscoring the need for careful calibration. In contrast, Table 10 shows configurations where PSO perfectly matched the exact branch-and-bound solution (0% gap). These results demonstrate that when inertia weight and acceleration coefficients are properly tuned (e.g.,

w = 0.5, c_{1} = c_{2} = 1.8

), PSO can replicate exact solutions with high reliability and minimal computational time (under 0.3 s).

Taken together, these findings suggest that PSO is not only computationally efficient but also capable of delivering exact-quality solutions when parameters are chosen appropriately. This reinforces its practical applicability in real healthcare inventory settings, where rapid and reliable optimization is crucial. The sensitivity analysis therefore validates PSO as a robust component of the proposed hybrid framework and highlights the trade-off between stability and accuracy that practitioners should consider when calibrating metaheuristic solvers.

4. Discussion

The proposed hybrid framework integrates statistical forecasting with metaheuristic optimization to address the dual challenges of demand uncertainty and procurement constraints in clinical laboratories. The results demonstrate clear advantages of extending SARIMAX models with skew-normal and zero-inflated residuals. Relative to traditional Gaussian SARIMA models, the proposed SARIMAX–SN/ZISN specification reduced forecasting errors by approximately 15% in MAE and 13% in RMSE across the analyte portfolio. In the case of Phenobarbital, where excess zeros were present, the ZISN residuals achieved a 14% improvement in MAE over the skew-normal specification, confirming the benefit of explicitly modeling structural zeros. Compared to the non-linear MLP benchmark, our models matched or exceeded accuracy for high-variance and seasonal series while preserving interpretability of residuals. These relative gains underscore that the framework not only improves predictive accuracy but also provides explainable diagnostics that are directly useful for operational decision-making in clinical laboratories. More precise anticipation of irregular or sporadic reagent use reduces the risk of stockouts or surplus inventory, which is critical for time-sensitive clinical workflows [34].

The residual diagnostics, including Ljung–Box tests and QQ plots (summarized here and detailed in Appendix A), confirmed the adequacy of the fitted models [35]. Beyond their statistical role, the skewness parameter

λ

offers actionable information: it signals reagents subject to unexpected spikes in demand, enabling managers to proactively allocate buffer stock and improve readiness for demand surges such as seasonal peaks or pandemics.

The optimization layer further demonstrated the operational value of the framework. By combining forecasts with Particle Swarm Optimization under budget and packaging constraints, average monthly inventory costs were reduced from CLP 179.5 million to CLP 19.6 million—an 89% savings. PSO’s computational efficiency and adaptability are well established in non-linear, stochastic contexts [11,36,37,38,39]. From an operational perspective, these savings correspond to nearly CLP 160 million (∼USD 170,000) per month. For a public hospital, this is equivalent to financing additional diagnostic capacity, reducing emergency reagent purchases, or reallocating resources to patient care. Crucially, service levels remained above 85% for high-volume determinations, confirming that financial efficiency was achieved without compromising clinical reliability. In practice, this means that procurement managers can move from reactive to proactive decision-making, balancing cost reduction with uninterrupted service provision.

Beyond numerical results, the framework addresses a well-documented disconnect between statistical forecasts and procurement processes in public health institutions. By explicitly linking predictions to feasible ordering policies, it prevents common failures such as reliance on emergency purchases or wastage from expired reagents. Its modular design allows adoption in stages: hospitals with limited capacity may implement only the forecasting or the optimization component, while larger institutions can deploy the integrated framework. Transparency and traceability further support its adoption in public healthcare, where procurement decisions are subject to strict oversight [40].

4.1. Critical Analysis of Results

The results reported in Section 3 not only quantify forecasting accuracy and inventory cost savings but also carry important implications for hospital operations and healthcare supply chain management. First, the observed 89% reduction in inventory costs relative to the empirical hospital policy is not simply a numerical gain. It indicates a reallocation of nearly CLP 160 million per month that could be redirected to patient care, laboratory staff, or diagnostic expansion. Maintaining service levels above 85% while achieving such cost efficiency demonstrates that data-driven inventory management can overcome the traditional trade-off between financial constraints and clinical reliability.

Second, our framework differs from prior studies that applied forecasting or optimization in isolation. Previous work in hospital supply chains and pharmaceutical logistics typically reports partial improvements yet often assumes Gaussian residuals or relies on black-box machine learning without constrained optimization layers. By contrast, the integration of skew-normal and zero-inflated residuals into a PSO-based optimization engine explains why our model consistently outperforms both SARIMA–Gaussian and neural baselines. This direct linkage between statistical residuals and procurement decisions is, to our knowledge, novel in the healthcare inventory literature.

Third, the dual pattern identified in the dataset—high-volume/low-cost reagents (e.g., Creatinine, Glucose) versus low-volume/high-cost reagents (e.g., Lithium, Valproic Acid)—illustrates the broader applicability of the framework. Many healthcare systems exhibit similar demand asymmetries, suggesting that the proposed hybrid approach is relevant beyond the Chilean hospital used for validation. The robustness of the PSO optimization layer further supports scalability to larger portfolios of items and adaptability to diverse budgetary and operational contexts.

Finally, several limitations must be acknowledged. The empirical validation relied on data from a single institution, procurement costs were assumed to be static, and supplier lead times were not explicitly modeled. These restrictions may limit the generalizability of the findings and potentially introduce optimistic bias. Future work should therefore extend validation to multi-institutional settings, incorporate dynamic procurement costs, and model stochastic supplier lead times to ensure broader applicability of the framework.

4.2. Limitations and Future Work

Nevertheless, several limitations condition the interpretation of these findings. The analysis was restricted to one Chilean hospital, which limits generalizability: procurement regulations, budget flexibility, and data quality vary significantly across healthcare systems [41]. For example, institutions in high-income countries often operate with shorter procurement cycles and more responsive supply chains, while resource-constrained settings may face chronic shortages not captured here. Assumptions of static parameters such as procurement costs, shortage penalties, and budgets may underestimate real-world volatility introduced by exchange rates, supplier negotiations, or policy changes. Similarly, by excluding long-term epidemiological shifts and technological transitions, the model may not fully capture structural changes in diagnostic demand. Finally, the absence of supplier lead times and stochastic delays in the optimization layer introduces an optimistic bias, since delays can generate stockouts even with accurate forecasts.

These limitations also define natural extensions for future research. Validation across hospitals of different sizes and countries would test robustness under diverse procurement practices. Incorporating dynamic procurement costs, rolling budgets, and stochastic supplier delays would increase realism and reduce optimism bias in service-level projections. Multi-period planning frameworks could capture long-term shifts in diagnostic demand, while integration with hospital Enterprise Resource Planning (ERP) systems would enable real-time re-planning. Finally, the development of decision-support dashboards tailored to non-technical managers would facilitate adoption by translating complex forecasting and optimization outputs into actionable recommendations. Beyond the residual-based transparency provided by skew-normal and zero-inflated structures, future extensions could incorporate explainable artificial intelligence (XAI) techniques, such as SHAP values, individual conditional expectation (ICE) plots, or partial dependence analysis. These approaches would allow practitioners to better understand which time series features drive forecast behavior, thereby increasing the trust and adoption of the framework in medical decision-making contexts.

Taken together, these contributions show that the proposed framework is not only statistically robust but also operationally meaningful. It improves forecasting accuracy, reduces costs, maintains service levels, and strengthens transparency in clinical supply chains while also offering a clear roadmap for extensions that address its current limitations.

5. Conclusions

This study proposed and validated a hybrid forecasting–optimization framework designed to improve inventory management in clinical laboratories operating under demand uncertainty, budget restrictions, and packaging constraints. By combining SARIMAX models with structured residuals (skew-normal and zero-inflated skew-normal) and a metaheuristic optimization layer based on Particle Swarm Optimization (PSO), the framework achieved significant advances in both predictive accuracy and cost efficiency.

The main contributions and conclusions are as follows:

The proposed SARIMAX–SN/ZISN models consistently outperformed standard SARIMA and neural network benchmarks in forecasting accuracy, particularly for laboratory reagents exhibiting skewed or zero-inflated demand.
The metaheuristic inventory optimization component effectively translated improved forecasts into procurement decisions that were budget-compliant, packaging-feasible, and highly cost-efficient, achieving up to 89% monthly cost savings compared to the hospital’s empirical policy.
The framework preserved high service levels across the determinations portfolio, confirming its applicability in critical clinical environments where stockouts are unacceptable.
The integration of explainable forecasting structures and constrained optimization enhances transparency and traceability from data to decisions—essential for implementation in public healthcare systems.
While the results are promising, future work should extend the approach to dynamic and multi-objective scenarios, explore its applicability across diverse institutional contexts, and develop decision-support interfaces for broader adoption.

In summary, the hybrid framework presented here offers a robust, interpretable, and operationally grounded solution for laboratory reagent inventory planning in clinical laboratories, balancing cost efficiency with clinical reliability. It should be noted, however, that the reported 89% reduction in inventory costs is specific to the dataset and historical procurement context of the studied hospital. The generalizability of this result to other healthcare systems requires further calibration and validation under their respective budgetary, regulatory, and supply chain conditions. Consequently, while the proposed framework shows promising outcomes, its transferability should be approached with caution and adapted to local institutional settings. Furthermore, multi-center validation across hospitals with different diagnostic portfolios and procurement practices would be essential to confirm robustness. Integration with hospital ERP systems could also facilitate real-time re-planning and enhance the operational value of the framework.

Author Contributions

Conceptualization, F.R. and M.C.; methodology, F.R.; software, F.R.; validation, M.C., and J.Y.; formal analysis, F.R.; investigation, J.Y.; resources, M.C.; data curation, J.Y.; writing—original draft preparation, F.R.; writing—review and editing, M.C.; visualization, J.Y.; supervision, M.C.; project administration, F.R.; funding acquisition, M.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Magíster en Análisis Clínico, Escuela de Química y Farmacia Facultad de Farmacia, Universidad de Valparaíso, Chile.

Data Availability Statement

The data can be consulted in the web repository, see [31].

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Full Tables of Model Performance

Table A1. Fitted parameters and forecasting errors (test window: six months), MAE and RMSE values.

Item	Model	Order	λ	μ	σ	p₀	${MAE}_{SN / Z}$	${RMSE}_{SN / Z}$	${MAE}_{MLP}$
Lactic Acid	SARIMAX-SN	(2,1,2)	3.58	−66.92	93.84		14.66	16.91	54.98
Urea	SARIMAX-SN	(0,1,2)	5.95	−155.25	225.36		83.43	107.38	106.21
Valproic Acid	SARIMAX-SN	(1,1,2)	0.00	0.00	6.16		9.67	11.56	7.05
Albumin	SARIMAX-SN	(1,1,2)	3.47	−71.89	98.90		31.83	37.37	110.67
Amylase	SARIMAX-SN	(0,1,2)	0.93	−41.94	77.66		14.66	17.75	216.82
Ammonia	SARIMAX-SN	(0,1,2)	1.48	−10.62	16.20		1.67	1.96	4.52
Direct Bilirubin	SARIMAX-SN	(0,1,2)	10.00	−338.58	461.86		268.53	291.49	279.15
Total Bilirubin	SARIMAX-SN	(0,1,2)	4.64	−306.96	449.56		283.80	307.62	302.48
Calcium	SARIMAX-SN	(0,1,2)	2.28	−105.21	153.05		88.33	104.99	132.64
Carbamazepine	SARIMAX-SN	(1,1,2)	−0.01	0.01	1.79		4.63	5.37	6.71
Total CK	SARIMAX-SN	(0,1,2)	9.85	−132.79	181.18		51.18	64.73	77.04
CK-MB	SARIMAX-SN	(2,1,2)	12.75	−128.14	183.53		78.98	99.54	118.62
HDL Cholesterol	SARIMAX-SN	(0,1,2)	5.61	−358.86	504.35		235.25	264.34	312.07
Total Cholesterol	SARIMAX-SN	(0,1,2)	4.59	−370.12	524.30		268.55	309.61	347.91
Creatinine	SARIMAX-SN	(0,1,2)	5.11	−760.15	1094.09		372.62	445.61	511.38
LDH	SARIMAX-SN	(2,1,2)	0.82	−77.34	153.42		50.24	63.75	82.11
Plasma Electrolytes	SARIMAX-SN	(0,1,2)	4.00	−40.00	90.00		18.00	22.02	29.77
Rheumatoid Factor	SARIMAX-SN	(0,1,2)	5.00	−50.00	100.00		25.00	31.60	41.92
Phenytoin	SARIMAX-SN	(1,1,2)	2.00	−5.00	12.00		2.50	3.04	3.80
Phenobarbital	SARIMAX-ZISN	(0,1,2)	2.87	−1.37	1.78	0.30	1.00	1.44	1.62
Alkaline Phosphatase	SARIMAX-SN	(0,1,2)	5.15	−325.43	466.84		277.70	303.02	341.28
Phosphorus	SARIMAX-SN	(0,1,2)	2.45	−48.39	67.56		72.84	84.27	102.13
GGT	SARIMAX-SN	(0,1,2)	4.79	−283.45	411.70		190.46	202.71	239.66
Glucose	SARIMAX-SN	(0,1,2)	4.06	−575.32	828.00		407.48	488.97	566.09
Lipase	SARIMAX-SN	(0,1,2)	1.23	−52.18	85.03		19.28	22.84	29.53
Lithium	SARIMAX-SN	(0,1,2)	−0.40	1.09	3.69		9.84	11.13	13.41
Microalbuminuria	SARIMAX-SN	(0,1,2)	3.32	−154.08	222.23		113.75	145.06	181.72
Urea Nitrogen (BUN)	SARIMAX-SN	(0,1,2)	10.00	−621.03	850.32		243.40	271.03	318.18
C-Reactive Protein (CRP)	SARIMAX-SN	(1,1,2)	4.25	−258.75	381.87		103.15	119.44	148.31
Total Proteins	SARIMAX-SN	(0,1,2)	2.42	−130.11	184.17		135.16	143.88	183.02
CSF Proteins	SARIMAX-SN	(0,1,2)	6.64	−49.17	65.61		33.71	55.41	69.88
AST (GOT)	SARIMAX-SN	(0,1,2)	4.66	−344.80	498.68		308.68	340.83	403.24
ALT (GPT)	SARIMAX-SN	(0,1,2)	4.61	−343.91	497.96		307.64	338.90	401.77
Triglycerides	SARIMAX-SN	(0,1,2)	5.40	−361.29	504.92		249.18	280.60	333.45

Appendix B. Forecasts and Cost Parameters for PSO Optimization

Table A2. Forecasts and cost parameters used in PSO optimization.

Item	Model	Order	Forecast_Mean	Pack_Size	Unit_Cost	Order_Cost	Holding_Cost	Shortage_Cost
Lactic Acid	SARIMAX-SN	(2,1,2)	175.36	220	671.9	179521	33	9001
Urea	SARIMAX-SN	(0,1,2)	786.67	880	318.8	179521	33	9001
Valproic Acid	SARIMAX-SN	(1,1,2)	14.42	200	1744.0	179521	33	9001
Albumin	SARIMAX-SN	(1,1,2)	403.00	4560	91.6	179521	33	9001
Amylase	SARIMAX-SN	(0,1,2)	323.81	220	1047.8	179521	33	9001
Ammonia	SARIMAX-SN	(0,1,2)	42.19	100	2030.1	179521	33	9001
Direct Bilirubin	SARIMAX-SN	(0,1,2)	2120.99	500	586.3	179521	33	9001
Total Bilirubin	SARIMAX-SN	(0,1,2)	2124.50	504	91.6	179521	33	9001
Calcium	SARIMAX-SN	(0,1,2)	598.50	5252	43.9	179521	33	9001
Carbamazepine	SARIMAX-SN	(1,1,2)	8.24	200	1741.8	179521	33	9001
Total CK	SARIMAX-SN	(0,1,2)	553.97	920	374.3	179521	33	9001
CK-MB	SARIMAX-SN	(2,1,2)	479.98	400	860.9	179521	33	9001
HDL Cholesterol	SARIMAX-SN	(1,1,2)	3861.72	1000	558.5	179521	33	9001
Total Cholesterol	SARIMAX-SN	(0,1,2)	2307.05	7320	76.3	179521	33	9001
Creatinine	SARIMAX-SN	(0,1,2)	5016.41	7840	21.2	179521	33	9001
LDH	SARIMAX-SN	(2,1,2)	441.44	420	860.9	179521	33	9001
Plasma Electrolytes	SARIMAX-SN	(0,1,2)	29.66	40,000	40.2	179521	33	9001
Rheumatoid Factor	SARIMAX-SN	(0,1,2)	28.27	1000	373	179521	33	9001
Phenytoin	SARIMAX-SN	(1,1,2)	3.57	200	1741.8	179521	33	9001
Phenobarbital	SARIMAX-ZISN	(0,1,2)	1.14	200	1744.0	179521	33	9001
Alkaline Phosphatase	SARIMAX-SN	(0,1,2)	2165.27	560	236.0	179521	33	9001
Phosphorus	SARIMAX-SN	(0,1,2)	295.60	6280	33.9	179521	33	9001
GGT	SARIMAX-SN	(0,1,2)	1944.46	540	232.2	179521	33	9001
Glucose	SARIMAX-SN	(0,1,2)	4164.82	9240	21.3	179521	33	9001
Lipase	SARIMAX-SN	(0,1,2)	302.97	780	1203.0	179521	33	9001
Lithium	SARIMAX-SN	(0,1,2)	11.16	226	8996.2	179521	33	9001
Microalbuminuria	SARIMAX-SN	(0,1,2)	697.55	960	351.6	179521	33	9001
Urea Nitrogen (BUN)	SARIMAX-SN	(0,1,2)	3797.80	5600	26.5	179521	33	9001
C-Reactive Protein (CRP)	SARIMAX-SN	(1,1,2)	2145.42	200	703.0	179521	33	9001
Total Proteins	SARIMAX-SN	(0,1,2)	832.00	5760	49.4	179521	33	9001
CSF Proteins	SARIMAX-SN	(0,1,2)	251.83	450	750.2	179521	33	9001
AST (GOT)	SARIMAX-SN	(0,1,2)	2260.78	600	102.0	179521	33	9001
ALT (GPT)	SARIMAX-SN	(0,1,2)	2261.36	600	102.0	179521	33	9001
Triglycerides	SARIMAX-SN	(0,1,2)	2161.62	5640	28.8	179521	33	9001

Appendix C. Ljung–Box Test (Lag 10) for Standardized Residuals

Table A3. Ljung–Box test (lag 10) for standardized residuals.

Item	LB_Pvalue_lag10	Pass (p > 0.05)
Lactic Acid	0.43	Yes
Urea	0.36	Yes
Valproic Acid	0.21	Yes
Albumin	0.55	Yes
Amylase	0.27	Yes
Ammonia	0.63	Yes
Direct Bilirubin	0.15	No
Total Bilirubin	0.11	No
Calcium	0.51	Yes
Carbamazepine	0.09	No
Total CK	0.60	Yes
CK-MB	0.24	Yes
HDL Cholesterol	0.39	Yes
Total Cholesterol	0.34	Yes
Creatinine	0.41	Yes
LDH	0.45	Yes
Plasma Electrolytes	0.28	Yes
Rheumatoid Factor	0.18	No
Phenytoin	0.58	Yes
Phenobarbital	0.52	Yes
Alkaline Phosphatase	0.46	Yes
Phosphorus	0.49	Yes
GGT	0.62	Yes
Glucose	0.33	Yes
Lipase	0.29	Yes
Lithium	0.61	Yes
Microalbuminuria	0.47	Yes
Urea Nitrogen (BUN)	0.37	Yes
C-Reactive Protein (CRP)	0.22	Yes
Total Proteins	0.53	Yes
CSF Proteins	0.57	Yes
AST (GOT)	0.31	Yes
ALT (GPT)	0.41	Yes
Triglycerides	0.32	Yes

Appendix D. Figure QQ-Plots of Standardized Residuals After Skew-Normal/Zero-Inflated Skew-Normal Fitting (SARIMAX–SN/ZISN Models)

Figure A1. QQ-plots of standardized residuals after skew-normal/zero-inflated skew-normal fitting (SARIMAX–SN/ZISN models).

Appendix E. Notation and Abbreviations

Table A4. List of notation and abbreviations used in this study.

Symbol/Abbreviation	Description
$D_{t}$	Demand at time t
${\hat{D}}_{t}$	Forecasted demand at time t
$I_{t}$	Inventory level at time t
$C_{h}$	Holding cost per unit
$C_{s}$	Shortage cost per unit
$C_{o}$	Ordering cost per order
Q	Order quantity
L	Lead time
$ε_{t}$	Error term at time t
MAE	Mean Absolute Error
RMSE	Root Mean Square Error
SARIMAX	Seasonal AutoRegressive Integrated Moving Average with eXogenous variables
SN	Skew-normal distribution
ZISN	Zero-inflated skew-normal distribution
PSO	Particle Swarm Optimization
GA	Genetic Algorithm

References

Perrone, L.A.; Babin, F.X.; Cognat, S.; Gebelin, J.; Boussieres, E.; Molkenthin, A.; Jaúregui, B.; Wolman-Tardy, K.; Oh, H.; Watson, A. Global laboratory systems. In Global Health Security: A Concept and Approach; Elsevier BV: Amsterdam, The Netherlands, 2024; pp. 287–305. [Google Scholar] [CrossRef]
Boche, B.; Temam, S.; Kebede, O. Inventory management performance for laboratory commodities and their challenges in public health facilities of Gambella Regional State, Ethiopia: A mixed cross-sectional study. Heliyon 2022, 8, e11357. [Google Scholar] [CrossRef]
Chua, M.; Kim, D.; Choi, J.; Lee, N.G.; Deshpande, V.; Schwab, J.; Lev, M.H.; Gonzalez, R.G.; Gee, M.S.; Do, S. Tackling prediction uncertainty in machine learning for healthcare. Nat. Biomed. Eng. 2023, 7, 711–718. [Google Scholar] [CrossRef]
Tadayonrad, Y.; Ndiaye, A.B. A new key performance indicator model for demand forecasting in inventory management considering supply chain reliability and seasonality. Supply Chain. Anal. 2023, 3, 100026. [Google Scholar] [CrossRef]
Hyndman, R.J.; Athanasopoulos, G. Forecasting: Principles and Practice; OTexts: Melbourne, Australia, 2018. [Google Scholar]
Dinamarca, M.A.; Rojas, F.; Ibacache-Quiroga, C.; González-Pizarro, K. Modeling Time Series with SARIMAX and Skew-Normal and Zero-Inflated Skew-Normal Errors. Mathematics 2025, 13, 1892. [Google Scholar] [CrossRef]
Gorgin, V.; Sadeghpour Gildeh, B. MAD control chart for autoregressive models with skew-normal distribution. Stochastics Qual. Control 2020, 35, 17–23. [Google Scholar] [CrossRef]
Lambert, D. Zero-inflated Poisson regression, with an application to defects in manufacturing. Technometrics 1992, 34, 1–14. [Google Scholar] [CrossRef]
Talbi, E.G. Metaheuristics: From Design to Implementation; John Wiley & Sons: Hoboken, NJ, USA, 2009. [Google Scholar]
Basciftci, B.; Ahmed, S.; Gebraeel, N. Adaptive two-stage stochastic programming with an analysis on capacity expansion planning problem. Manuf. Serv. Oper. Manag. 2024, 26, 2121–2141. [Google Scholar] [CrossRef]
Dwivedi, V. Enhancing Pharmaceutical Supply Chains in Health Crises. J. Ind. Manag. Optim. 2025, 21, 1170–1192. [Google Scholar] [CrossRef]
Arrieta, A.B.; Díaz-Rodríguez, N.; Del Ser, J.; Bennetot, A.; Tabik, S.; Barbado, A.; García, S.; Gil-López, S.; Molina, D.; Benjamins, R.; et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 2020, 58, 82–115. [Google Scholar] [CrossRef]
Urjais Gomes, R.; Soares, C.; Reis, L.P. An Empirical Evaluation of DeepAR for Univariate Time Series Forecasting. In Proceedings of the EPIA Conference on Artificial Intelligence; Springer: Cham, Switzerland, 2024; pp. 188–199. [Google Scholar] [CrossRef]
Fan, G.F.; Han, Y.Y.; Li, J.W.; Peng, L.L.; Yeh, Y.H.; Hong, W.C. A hybrid model for deep learning short-term power load forecasting based on feature extraction statistics techniques. Expert Syst. Appl. 2024, 238, 122012. [Google Scholar] [CrossRef]
Chechkin, A.; Pleshakova, E.; Gataullin, S. A Hybrid KAN-BiLSTM Transformer with Multi-Domain Dynamic Attention Model for Cybersecurity. Technologies 2025, 13, 223. [Google Scholar] [CrossRef]
Li, H.; Gao, W.; Xie, J.; Yen, G.G. Multiobjective bilevel programming model for multilayer perceptron neural networks. Inf. Sci. 2023, 642, 119031. [Google Scholar] [CrossRef]
Sina, L.B.; Secco, C.A.; Blazevic, M.; Nazemi, K. Hybrid Forecasting Methods—A Systematic Review. Electronics 2023, 12, 2019. [Google Scholar] [CrossRef]
Bui, X.D.; Hung, D.T. A Long Short-Term Memory Model for Forecasting Surgical Procedures. IISE Trans. Healthc. Syst. Eng. 2023; Advance online publication. [Google Scholar] [CrossRef]
Vanbrabant, L.; Mertens, S.; Caris, A.; Sörensen, K. Improving hospital material supply chain performance by integrating decision problems: A literature review and future research directions. Comput. Ind. Eng. 2023, 180, 109235. [Google Scholar] [CrossRef]
Atcha, P.; Vlachos, I.; Kumar, S. Inventory sharing in healthcare supply chains: Systematic literature review and future research agenda. Int. J. Logist. Manag. 2024, 35, 1107–1141. [Google Scholar] [CrossRef]
Pathy, S.R.; Rahimian, H. A resilient inventory management of pharmaceutical supply chains under demand disruption. Comput. Ind. Eng. 2023, 180, 109243. [Google Scholar] [CrossRef]
Saha, E.; Rathore, P. A smart inventory management system with medication demand dependencies in a hospital supply chain: A multi-agent reinforcement learning approach. Comput. Ind. Eng. 2024, 191, 110165. [Google Scholar] [CrossRef]
Sohrabi, M.; Zandieh, M.; Shokouhifar, M. Sustainable inventory management in blood banks considering health equity using a combined metaheuristic-based robust fuzzy stochastic programming. Socio-Econ. Plan. Sci. 2023, 86, 101462. [Google Scholar] [CrossRef]
Ahmadi, E.; Mosadegh, H.; Maihami, R.; Ghalehkhondabi, I.; Sun, M.; Süer, G.A. Intelligent inventory management approaches for perishable pharmaceutical products in a healthcare supply chain. Comput. Oper. Res. 2022, 147, 105968. [Google Scholar] [CrossRef]
Li, N.; Chiang, F.; Down, D.G.; Heddle, N.M. A decision integration strategy for short-term demand forecasting and ordering for red blood cell components. arXiv 2020, arXiv:2008.07486. [Google Scholar] [CrossRef]
Bandi, C.; Han, E.; Nohadani, O. Sustainable Inventory with Robust Periodic-Affine Policies and Application to Medical Supply Chains. arXiv 2018, arXiv:1806.06744. [Google Scholar] [CrossRef]
Reyad, M.; Sarhan, A.M.; Arafa, M. A modified Adam algorithm for deep neural network optimization. Neural Comput. Appl. 2023, 35, 17095–17112. [Google Scholar] [CrossRef]
Koh, J.S.; Tan, R.H.G.; Lim, W.H.; Tan, N.M.L. A Modified Particle Swarm Optimization for Efficient Maximum Power Point Tracking Under Partial Shading Condition. IEEE Trans. Sustain. Energy 2023, 14, 1822–1834. [Google Scholar] [CrossRef]
Gad, A.G. Particle Swarm Optimization Algorithm and Its Applications: A Systematic Review. Arch. Comput. Methods Eng. 2022, 29, 2831–2867. [Google Scholar] [CrossRef]
Zhou, Y. The overall framework design of automatic logistics system using a hybrid ANN-PSO model. Eng. Comput. 2022, 38, 2515–2531. [Google Scholar] [CrossRef]
Rojas, F. Hybrid SARIMAX–PSO Framework with Skew-Normal Residuals: Inventory Optimization in Clinical Laboratories. 2025. Available online: https://zenodo.org/records/16905668 (accessed on 11 September 2025).
Robeson, S.M.; Willmott, C.J. Decomposition of the mean absolute error (MAE) into systematic and unsystematic components. PLoS ONE 2023, 18, e0279774. [Google Scholar] [CrossRef]
Rahman, M.M.; Joha, M.I.; Nazim, M.S.; Jang, Y.M. Enhancing IoT-Based Environmental Monitoring and Power Forecasting: A Comparative Analysis of AI Models for Real-Time Applications. Appl. Sci. 2024, 14, 11970. [Google Scholar] [CrossRef]
Mishra, V.; Sharma, P.; Khound, K. Global Supply Chain Shocks and Trade Resilience: A Review Post-Covid and Ukraine Crisis. J. Mark. Soc. Res. 2025, 2, 336–348. [Google Scholar] [CrossRef]
Dare, J.; Patrick, A.O.; Oyewola, D.O. Comparison of stationarity on Ljung box test statistics for forecasting. Earthline J. Math. Sci. 2022, 8, 325–336. [Google Scholar] [CrossRef]
Razghandi, M.; Khamehchi, E. Application of Particle Swarm Optimization and Genetic Algorithms to Oilfield Operational Optimization. J. Pet. Sci. Eng. 2021, 205, 108901. [Google Scholar] [CrossRef]
Tani, L.; Veelken, C. Comparison of Bayesian and Particle Swarm Algorithms for Hyperparameter Optimization in High-Energy Physics Applications. Comput. Phys. Commun. 2022, 277, 108382. [Google Scholar] [CrossRef]
Verma, P.; Chaturvedi, B.K. Comprehensive Analysis and Review of Particle Swarm Optimization Techniques and Inventory System. Int. J. Future Revolut. Comput. Sci. Commun. Eng. 2022, 8, 41–50. [Google Scholar] [CrossRef]
Wang, H.; Li, Y.; Zhou, X. A Comparative Study of GA, PSO and SCE Algorithms for Pyrolysis Kinetics Modeling. Energy Mater. Sustain. Technol. 2023, 2, 9. [Google Scholar] [CrossRef]
Knezevic, C.E.; Das, B.; El-Khoury, J.M.; Jannetto, P.J.; Lacbawan, F.; Winter, W.E. Rising to the Challenge: Shortages in Laboratory Medicine. Clin. Chem. 2022, 68, 1486–1492. [Google Scholar] [CrossRef] [PubMed]
Tabish, S.A. Procurement Management in Health Care Systems. In Health Care Management: Principles and Practice; Springer: Singapore, 2024. [Google Scholar] [CrossRef]

Figure 1. Flowchart of the forecasting and residual modeling pipeline. The complete code is available in Zenodo [31].

Figure 2. Flowchart of the inventory–cost optimization. Discrete enumeration can be replaced by PSO to explore larger search spaces under budget constraints.

Figure 3. Heatmap of forecasting errors across analytes. Rows correspond to MAE and RMSE; columns list analytes. Darker intensities indicate larger errors, highlighting heterogeneous forecasting difficulty.

Figure 4. Model comparison by MAE: SARIMAX vs. MLP for each analyte (sorted by SARIMAX MAE). The skew-aware residual modeling (SARIMAX–SN/–ZISN) tends to outperform the MLP benchmark on high-variance series, while the gap narrows for low-variance items.

Figure 5. Forecast versus unit cost for analytes included in the PSO optimization. High-demand/low-cost items (e.g., Creatinine, Glucose) contrast with low-demand/high-cost items (e.g., Lithium, Valproic Acid), highlighting the dual challenge of volume-driven logistics and cost-driven risks.

Table 1. Summary of representative studies in healthcare forecasting and inventory optimization.

Study	Approach	Domain	Limitations
Tadayonrad & Ndiaye (2023) [4]	Forecasting with reliability and seasonality indicators	Supply chain analytics	No integration with metaheuristics; assumes Gaussian residuals
Basciftci et al. (2024) [10]	Two-stage stochastic programming with forecasting inputs	Capacity expansion	Complex optimization but limited domain-specific statistical modeling
Dwivedi (2025) [11]	PSO and heuristic methods for supply chains in health crises	Pharmaceutical logistics	Optimization only, lacks structured residual modeling
Li et al. (2023) [16]	MLP-based neural forecasting models	General time series forecasting	Accuracy but poor interpretability; limited transparency in clinical domains
Urjais Gomes et al. (2024) [13]	DeepAR neural forecasting	Healthcare-related univariate time series	High accuracy but black-box nature, low explainability
Sina et al. (2023) [17]	Systematic review of hybrid forecasting methods (ARIMA–LSTM, ARIMA–XGBoost, etc.)	Cross-sector forecasting	Not specific to healthcare; little integration with inventory policies
Bui & Hung (2023) [18]	LSTM forecasting of surgical procedures to support planning	Hospital operating rooms	Machine learning only; not integrated with inventory decisions
Vanbrabant et al. (2023) [19]	Review of integrated decision problems in hospital supply chains	Hospital supply chain management	Focuses on integration, not on hybrid forecasting or metaheuristics
Atcha, Vlachos & Kumar (2024) [20]	Systematic review of inventory sharing in healthcare (39 studies)	Healthcare supply chains	Addresses sharing mechanisms only; no hybrid forecasting–inventory link
Pathy & Rahimian (2023) [21]	Resilient inventory optimization under demand disruptions	Pharmaceutical supply chains	Optimization only; no hybrid demand models
Saha & Rathore (2024) [22]	Multi-agent reinforcement learning for medicine inventory with demand dependencies	Hospital pharmacy	Black-box nature; lacks explainability and skew/zero modeling
Sohrabi et al. (2023) [23]	Robust fuzzy–stochastic programming with GA+SA metaheuristics for blood banks (equity considerations)	Blood supply chain	Case-specific; no explicit demand forecasting
Ahmadi et al. (2022) [24]	Reinforcement learning (Q-learning, DQN) and GA for perishable pharmaceutical products	Healthcare supply chains	Simulation-based; no statistical–metaheuristic hybrid residual modeling
Li et al. (2020) [25]	Integrated strategy: hybrid forecasting + multi-period ordering for red blood cells	Blood banks	Preprint; limited clinical validation; no PSO or skew/zero residuals
Bandi, Han & Nohadani (2018) [26]	Robust periodic-affine policies for uncertain demand (applied to pharma retail)	Retail pharmaceutical supply	Not metaheuristic; older; no healthcare-specific hybrid forecasting

Table 2. Forecasting errors by item (test window: six months). MAE and RMSE from the proposed model, and MAE from the MLP benchmark. Full parameter estimates and model orders are provided in the Appendix A, Table A1.

Item	Model	MAE	RMSE	MAE_MLP
Lactic Acid	SARIMAX–SN	14.66	16.91	54.98
Urea	SARIMAX–SN	83.43	107.38	106.21
Valproic Acid	SARIMAX–SN	9.67	11.56	7.05
Albumin	SARIMAX–SN	31.83	37.37	110.67
Amylase	SARIMAX–SN	14.66	17.75	216.82
Ammonia	SARIMAX–SN	1.67	1.96	4.52
Direct Bilirubin	SARIMAX–SN	268.53	291.49	279.15
Total Bilirubin	SARIMAX–SN	283.80	307.62	302.48
Calcium	SARIMAX–SN	88.33	104.99	132.64
Carbamazepine	SARIMAX–SN	4.63	5.37	6.71
Total CK	SARIMAX–SN	51.18	64.73	77.04
CK-MB	SARIMAX–SN	78.98	99.54	118.62
HDL Cholesterol	SARIMAX–SN	235.25	264.34	312.07
Total Cholesterol	SARIMAX–SN	268.55	309.61	347.91
Creatinine	SARIMAX–SN	372.62	445.61	511.38
LDH	SARIMAX–SN	50.24	63.75	82.11
Plasma Electrolytes	SARIMAX–SN	18.00	22.02	29.77
Rheumatoid Factor	SARIMAX–SN	25.00	31.60	41.92
Phenytoin	SARIMAX–SN	2.50	3.04	3.80
Phenobarbital	SARIMAX–ZISN	1.00	1.44	1.62
Alkaline Phosphatase	SARIMAX–SN	277.70	303.02	341.28
Phosphorus	SARIMAX–SN	72.84	84.27	102.13
GGT	SARIMAX–SN	190.46	202.71	239.66
Glucose	SARIMAX–SN	407.48	488.97	566.09
Lipase	SARIMAX–SN	19.28	22.84	29.53
Lithium	SARIMAX–SN	9.84	11.13	13.41
Microalbuminuria	SARIMAX–SN	113.75	145.06	181.72
Urea Nitrogen (BUN)	SARIMAX–SN	243.40	271.03	318.18
C-Reactive Protein (CRP)	SARIMAX–SN	103.15	119.44	148.31
Total Proteins	SARIMAX–SN	135.16	143.88	183.02
CSF Proteins	SARIMAX–SN	33.71	55.41	69.88
AST (GOT)	SARIMAX–SN	308.68	340.83	403.24
ALT (GPT)	SARIMAX–SN	307.64	338.90	401.77
Triglycerides	SARIMAX–SN	249.18	280.60	333.45

Table 3. Forecasting performance across models (averaged over all determinations).

Model	MAE	RMSE	Skew Sensitivity ( $λ > 0$ )
SARIMA (Gaussian residuals)	126.3	151.7	–
MLP (non-linear benchmark)	136.25	156.31	–
SARIMAX–SN/ZISN (ours)	120.6	144.2	32 / 34

Table 4. Forecast errors disaggregated by demand levels.

Demand Level	MAE	RMSE
Low demand	5.64	6.36
Medium demand	61.08	74.74
High demand	271.36	307.03

Table 5. Forecast accuracy across forecast window (1st to 6th step ahead).

Horizon	MAE	RMSE
1	95.12	134.34
2	224.37	324.25
3	81.88	121.58
4	128.98	190.36
5	189.10	268.17
6	98.06	149.37

Table 6. Forecast error distribution across demand levels (MAPE).

Demand Level	MAPE (%)
Low demand	—
Medium demand	10.99
High demand	9.34

Table 7. Optimized order quantities and inventory costs using PSO.

Item	$k_{opt}$	$Q_{opt}$	${CT}_{opt}$
Lactic Acid	6	1320	1.1042 × 10⁶
Urea	12	10,560	3.8686 × 10⁶
Valproic Acid	0	0	1.2977 × 10⁵
Albumin	4	18,240	2.4389 × 10⁶
Amylase	19	4180	4.6866 × 10⁶
Ammonia	6	600	3.3714 × 10⁵
Direct Bilirubin	1	500	5.9327 × 10⁵
Total Bilirubin	4	2016	2.5705 × 10⁶
Calcium	6	31,512	5.9841 × 10⁶
Carbamazepine	0	0	2.0806 × 10⁵
Total CK	5	4600	1.6286 × 10⁶
CK-MB	2	800	5.0467 × 10⁵
HDL Cholesterol	6	6000	3.5444 × 10⁶
Total Cholesterol	7	51,240	8.5303 × 10⁶
Creatinine	8	62,720	8.6936 × 10⁶
LDH	7	2940	1.4554 × 10⁶
Plasma Electrolytes	5	200,000	1.5993 × 10⁷
Rheumatoid Factor	0	0	2.1741 × 10⁵
Phenytoin	0	0	1.3720 × 10⁵
Phenobarbital	1	200	1.0345 × 10⁵
Alkaline Phosphatase	7	3920	1.9445 × 10⁶
Phosphorus	5	31,400	7.4649 × 10⁶
GGT	5	2700	1.2014 × 10⁶
Glucose	8	73,920	1.9952 × 10⁷
Lipase	2	1560	2.3944 × 10⁶
Lithium	6	1356	1.6285 × 10⁷
Microalbuminuria	7	6720	5.5914 × 10⁶
Urea Nitrogen (BUN)	8	44,800	1.1041 × 10⁷
C-Reactive Protein (CRP)	3	600	7.6610 × 10⁵
Total Proteins	5	28,800	5.0090 × 10⁶
CSF Proteins	6	2700	2.4423 × 10⁶
AST (GOT)	7	4200	3.1010 × 10⁶
ALT (GPT)	7	4200	3.1197 × 10⁶
Triglycerides	6	33,840	7.6132 × 10⁶

Table 8. Comparison of monthly inventory costs under different procurement policies, highlighting the optimization impact.

Policy	Average Monthly Cost (CLP million)
Empirical hospital policy	179.5
SARIMA baseline (Gaussian residuals) + PSO	32.1
Hybrid SARIMAX–SN/ZISN + PSO (proposed)	19.6

Table 9. Top 5 PSO configurations by lowest total cost.

Particles	Iterations	w	c1c2	PSO	Exact	Gap%	Feasible	L1 (k)	ShareQ	Time (s)
20	100	0.90	1.20	5.03	11.38	−55.80	True	19	0.20	0.06
20	300	0.90	1.20	5.03	11.38	−55.80	True	19	0.20	0.17
50	100	0.90	1.20	5.03	11.38	−55.80	True	19	0.20	0.14
50	300	0.90	1.20	5.03	11.38	−55.80	True	19	0.20	0.34
100	100	0.90	1.20	5.03	11.38	−55.80	True	19	0.20	0.28

Table 10. Top 5 PSO configurations by smallest absolute cost gap vs. exact cost.

Particles	Iterations	w	c1c2	PSO	Exact	Feasible	ShareQ	Time (s)
20	100	0.50	1.80	11.38	11.38	True	1.00	0.06
20	300	0.50	1.80	11.38	11.38	True	1.00	0.17
50	100	0.50	1.80	11.38	11.38	True	1.00	0.14
50	300	0.50	1.80	11.38	11.38	True	1.00	0.34
100	100	0.50	1.80	11.38	11.38	True	1.00	0.28

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Rojas, F.; Yáñez, J.; Cortés, M. Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting. Mathematics 2025, 13, 3001. https://doi.org/10.3390/math13183001

AMA Style

Rojas F, Yáñez J, Cortés M. Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting. Mathematics. 2025; 13(18):3001. https://doi.org/10.3390/math13183001

Chicago/Turabian Style

Rojas, Fernando, Jorge Yáñez, and Magdalena Cortés. 2025. "Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting" Mathematics 13, no. 18: 3001. https://doi.org/10.3390/math13183001

APA Style

Rojas, F., Yáñez, J., & Cortés, M. (2025). Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting. Mathematics, 13(18), 3001. https://doi.org/10.3390/math13183001

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Hybrid Statistical–Metaheuristic Inventory Modeling: Integrating SARIMAX with Skew-Normal and Zero-Inflated Errors in Clinical Laboratory Demand Forecasting

Abstract

1. Introduction

Related Work

2. Methodology

2.1. Forecasting Models

Illustrative Example

2.2. Multilayer Perceptron (MLP) Benchmark

2.3. Optimization Phase (Global PSO)

2.3.1. Mathematical Formulation of PSO

2.3.2. Validation Against Exact Optimization

2.4. Reproducible Workflow and Algorithmic Steps

2.5. Performance Metrics

3. Results

3.1. Data Description and Parameters

3.2. Forecast Accuracy and Parameter Estimates

3.3. Residual Diagnostics and Model Selection

3.4. Optimization Outcomes

3.5. PSO Sensitivity and Robustness Analysis

4. Discussion

4.1. Critical Analysis of Results

4.2. Limitations and Future Work

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A. Full Tables of Model Performance

Appendix B. Forecasts and Cost Parameters for PSO Optimization

Appendix C. Ljung–Box Test (Lag 10) for Standardized Residuals

Appendix D. Figure QQ-Plots of Standardized Residuals After Skew-Normal/Zero-Inflated Skew-Normal Fitting (SARIMAX–SN/ZISN Models)

Appendix E. Notation and Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI