Article

Ensemble Learning-Based Approach for Forecasting Inventory Data in Prefabricated Component Warehousing

School of Electrical and Control Engineering, Shenyang Jianzhu University, Shenyang 110016, China
* Author to whom correspondence should be addressed.
Processes 2025, 13(5), 1443; https://doi.org/10.3390/pr13051443
Submission received: 12 March 2025 / Revised: 25 April 2025 / Accepted: 1 May 2025 / Published: 8 May 2025
(This article belongs to the Special Issue Design and Analysis of Adaptive Identification and Control)

Abstract

Accurately predicting the storage area of prefabricated components facilitates transshipment scheduling and prevents the waste of storage space. Due to the influence of numerous factors, precise prediction remains challenging. Currently, limited research has addressed the prediction of storage areas for prefabricated components, and effective solutions are lacking. To address this issue, a GRU model with an attention mechanism based on ensemble learning was proposed. The model employed the ATT-Bo-Bi-GRU approach to address the time series prediction of storage areas. A Bayesian optimization algorithm was utilized to enhance parameter tuning and training efficiency, while an ensemble learning framework improved model stability. In this study, a port container dataset was used for experimentation, with root mean square error (RMSE), mean absolute percentage error (MAPE), and the coefficient of determination (R2) as evaluation metrics. Compared with the GM model, the R2 of the proposed model improved by 3.38%. Experimental results demonstrated that the ensemble learning-based prediction model offered superior performance in forecasting the storage area of prefabricated components.

1. Introduction

Prefabricated construction—a modern building technique involving the factory production of components followed by on-site assembly—is now widely adopted in the construction industry. Prefabricated components represent a critical element of such construction [1,2,3,4].
Unlike traditional warehousing, the storage of prefabricated components is more time-consuming, labor-intensive, and prone to error. Due to their large size and heavy weight, these components require careful handling during transportation to avoid damage. If damaged, an entire component must be reconstructed, necessitating stricter environmental conditions during storage. Prefabricated components are typically stacked, with their production progressing in line with project schedules. The improper allocation of storage space may result in newly produced components being stacked atop earlier ones, complicating retrieval and hindering project execution. Therefore, accurately forecasting the storage area required for prefabricated components is essential to mitigate the complexity of transportation and scheduling caused by disorganized stacking. Given the numerous dynamic factors influencing storage area adjustments, real-time area allocation necessitates the application of deep learning models [5].
In recent years, the exponential growth of data and rapid advances in artificial intelligence (AI) and computing technologies have enabled their application across various domains. These developments support a more refined analysis of prefabricated components, enhancing predictive accuracy and aligning better with the demands of prefabricated construction. Deep learning models excel at processing complex data and extracting key features from diverse characteristics through training, although they require more computational resources than traditional intelligent algorithms. Neural network models can predict fluctuations in storage area needs by analyzing various influencing factors, such as production scheduling, weather conditions, and raw material availability. Consequently, deep learning models demand substantial datasets to accurately forecast warehouse storage areas and improve prediction precision.
Common deep learning approaches for prediction include Convolutional Neural Networks (CNNs) [6,7], which perform predictive analysis through feature extraction, as well as the integration of deep learning with intelligent optimization algorithms to improve predictive accuracy. Long Short-Term Memory (LSTM) networks are particularly effective in time series analysis. However, optimizing their hyperparameters is essential to enhance predictive accuracy and training efficiency, thereby reducing the discrepancy between predicted outputs and actual targets [8,9,10]. A single LSTM model generally offers slightly lower accuracy than ensemble learning models. Ensemble learning combines multiple base models to analyze data from different perspectives and make joint predictions, thus improving precision. Most current predictive studies employ machine learning models such as Random Forest (RF), Support Vector Machine (SVM), and LSTM, often in hybrid forms. The selection of base models can be tailored to the specific requirements of the prediction task.
Given the limited research on dynamic prediction and adjustment of prefabricated component storage areas, this study analyzed fluctuations in inventory data. A novel approach was proposed to improve the reliability of time series data processing by leveraging the enhanced learning capabilities of LSTM networks and the robustness of ensemble learning. The model incorporated an attention mechanism into bidirectional LSTM networks and used Bayesian algorithms for hyperparameter optimization, thereby increasing the network’s sensitivity to data features and accelerating training. The ensemble model used this network as the base learner, with linear regression as the meta-learner. To further enhance training speed without compromising predictive accuracy, the bidirectional LSTM networks in the base learner were replaced with bidirectional Gated Recurrent Units (GRUs).
The major contributions of this paper are as follows:
(1)
A novel ensemble learning model based on the Stacking framework was proposed. This model predicted fluctuations in the quantity of prefabricated components, thereby enabling dynamic adjustments in the allocated storage area.
(2)
The K-means algorithm was applied for data preprocessing, reducing complexity and accelerating model training. To handle time-series data, the ATT-Bo-Bi-LSTM (ABL) model was introduced, which integrated an attention mechanism into Bi-LSTM to enhance the model’s focus on relevant features. Bayesian optimization was used to tune numerous hyperparameters efficiently. Subsequently, Bi-LSTM was replaced with Bi-GRU to further improve training speed, resulting in the ATT-Bo-Bi-GRU (ABG) model.
(3)
A predictive ensemble learning model based on ABG was developed. By aggregating multiple models, the ensemble improved prediction accuracy and model stability. Evaluation using port container data—possessing attributes analogous to prefabricated component warehousing data—demonstrated that the ensemble model outperformed both the single ABG model and classical predictive models. Furthermore, compared to the Stack-XGBoost model, it achieved faster training.
(4)
A series of benchmark tests based on various sensitivity indicators confirmed the superior robustness and effectiveness of the proposed ensemble model.
The structure of this paper is organized as follows: Section 2 details the components and architecture of the ensemble learning model; Section 3 explains the mathematical models optimized for constructing the proposed model; Section 4 presents experimental comparisons between the proposed model and alternative models; and Section 5 concludes the study by summarizing key findings and contributions.

2. Related Work

In recent years, an increasing number of scholars have focused on issues related to the inventory management and production of prefabricated components. For example, Xin et al. [11] adopted a dual-layer LSTM prediction method combined with Random Forest and Recursive Feature Elimination for feature selection, enhancing data relevance. This approach achieved a mean absolute percentage error (MAPE) of just 1.38%, outperforming traditional LSTM algorithms. Peixiao et al. [12] integrated VMD and CNNs into the LSTM framework to improve forecasting accuracy. VMD helped reduce load fluctuations, while CNNs extracted key data features, resulting in superior short-term load prediction performance compared to conventional LSTM models.
Due to the limited literature on predictive modeling specific to prefabricated component warehousing—which involves challenges similar to those encountered in large-volume storage and transportation—analogous research in port forecasting has been referenced. Xiaocong et al. [13] proposed a prediction model based on a Random Forest and bidirectional LSTM architecture for container data, demonstrating improvements across multiple evaluation metrics compared to backpropagation neural network models. Fengwu et al. [14] developed an LSTM-based model for throughput forecasting that exhibited higher accuracy than the Autoregressive Integrated Moving Average (ARIMA) method. Xia et al. [15] introduced a fusion-attention Bi-LSTM network that enhanced LSTM’s feature learning capacity by focusing on features at different time intervals, outperforming traditional attention-based models. Dang et al. [16] combined CNN with LSTM to improve deep-level data representation and optimize network structure, achieving reductions of 9.43% in MAPE and 23.81% in mean square error (MSE) compared to standard LSTM networks. Yiqin et al. [17] employed Bayesian optimization to simplify the tuning of LSTM hyperparameters for time-series forecasting. Similarly, Chun’an et al. [18] and Liangjun et al. [19] applied Bayesian algorithms to anticipate the next stage of significant growth in predictive quantities using LSTM networks. Yanyan et al. [20] incorporated dropout technology into LSTM models to mitigate neuron co-adaptation, thereby enhancing generalization and predictive performance. Lin et al. [21] proposed a Bayesian-optimized VMD-LSTM model, which demonstrated higher precision in handling time-series problems and improved adaptability.
Research has also explored the application of ensemble learning models in data prediction. Jianji et al. [22] integrated various neural network models, significantly improving both the generalizability and accuracy of the predictions. Liu et al. [23] and Lin et al. [24] employed ensemble learning techniques for time-series forecasting, achieving favorable results. Shafqat and colleagues [25] demonstrated that ensemble models outperformed single-model approaches in terms of prediction accuracy. Wang et al. [26] conducted predictive analysis on multi-feature data using an enhanced boosting ensemble model, resulting in RMSE improvements ranging from 0.75% to 11.54%. Furthermore, Baihai et al. [27] integrated attention mechanisms into both GRU and LSTM models, combining them through linear weighting to enhance prediction performance. Huiqing et al. [28] proposed a composite model of LSTM and GRU, which outperformed either model alone in terms of prediction accuracy. Mubarak et al. [29] introduced a novel ensemble learning model that aggregated multiple diverse base learners, demonstrating strong predictive capability. Choi et al. [30] confirmed the superior efficiency of ensemble learning models in predictive tasks. Wang et al. [31] developed an LSTM-Informer model based on ensemble learning for long-term forecasting, where LSTM captured sequential correlations and the Informer mitigated gradient vanishing issues, thereby improving long-range forecasting accuracy.
These studies collectively indicate that while LSTM is well-suited for time-series prediction tasks, ensemble learning enhances model robustness and predictive precision. In this study, we adopt a hybrid approach that combines LSTM with ensemble learning strategies to achieve superior forecasting outcomes.

3. The Construction of an Ensemble Learning Model

The types of prefabricated components correspond directly to their respective storage areas. Therefore, by predicting the quantity of prefabricated components, necessary adjustments to the allocated storage area can be determined. To this end, we proposed a Stacking-based ensemble learning model for forecasting the number of prefabricated components, selecting an appropriate number of base learners based on data partitioning. Unlike traditional Stacking ensemble models—which utilize various base learners with different architectures to make predictions on the same dataset before combining outputs via a meta-learner—our approach adopts identical base learners but trains each on distinct subsets of the same dataset. This strategy enables the model to capture diverse feature sensitivities, allowing each base learner to specialize in specific feature patterns. As a result, these base learners can predict similar-feature data with improved efficiency compared to conventionally trained learners.
In our framework, Bi-LSTM with an integrated attention mechanism was initially selected as the base learner. Bayesian optimization was applied to fine-tune its hyperparameters, enhancing the model’s focus on salient data features and reducing training time. Given the large number of hyperparameters in the base learners, linear regression (LR) was employed as the meta-learner in the second layer of the Stacking framework to prevent overfitting.
To further reduce computational time during training, Bi-LSTM was subsequently replaced by Bi-GRU. This modification led to a modest reduction in training duration without a significant compromise in predictive accuracy. Accordingly, the final ensemble learning model adopted Bi-GRU as the base learner. The technical roadmap is presented in Figure 1.

3.1. Long Short-Term Memory (LSTM) Network

As inventory fluctuation prediction is inherently a time series forecasting task, an LSTM network was employed for data analysis. Compared to classical Recurrent Neural Networks (RNNs), LSTM networks incorporate forget gates, input gates, and output gates, which collectively address the limitations associated with long-term information transmission—specifically, the loss of relevant historical data. Among these components, the forget gate plays a particularly critical role. It processes both the current input and the hidden state from the previous time step, outputting a value between 0 and 1 to determine the extent to which past information should be retained or discarded. This mechanism effectively mitigates issues related to vanishing and exploding gradients, enabling more stable and reliable training over long sequences. Figure 2 shows the LSTM structure diagram.
The improved Mish activation function used in this work is defined as:

\[ \mathrm{mish}^{*} = a\,x \times \tanh\!\left( \log\left( 1 + e^{x} \right)^{r} \right) \]
Replacing the traditional activation function with the improved Mish function yields a smoother activation curve, helping to alleviate issues such as gradient explosion and vanishing gradients. This enables the Mish function to better capture complex data patterns and relationships. A comparison between the improved Mish function and other activation functions is shown in Figure 3.
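As a minimal sketch of this activation (assuming default values a = r = 1, since the tuned parameter values are not reported), the improved Mish can be written in NumPy as follows:

```python
import numpy as np

def mish_improved(x, a=1.0, r=1.0):
    """Improved Mish sketch: a * x * tanh(log(1 + e^x) ** r).
    With a = r = 1 this reduces to the standard Mish,
    x * tanh(softplus(x)); a and r are assumed defaults here."""
    softplus = np.logaddexp(0.0, x)  # numerically stable log(1 + e^x)
    return a * x * np.tanh(softplus ** r)

# Quick check at a few sample points
xs = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
print(mish_improved(xs))
```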

3.2. Bidirectional Long Short-Term Memory (Bi-LSTM) Model

To address the limitations of traditional LSTM in inventory forecasting, particularly its inability to incorporate future information, a Bi-LSTM model was employed. By processing input sequences in both forward and backward directions, Bi-LSTM can capture contextual information from both past and future time steps. This bidirectional memory structure has demonstrated high predictive accuracy, especially when applied to highly stochastic and intermittent inventory data, making it well-suited for inventory stock forecasting tasks.

3.3. Bayesian Optimization

Bayesian optimization is a sequential optimization method based on probabilistic models, with its core principle centered on the “exploration-exploitation trade-off”. It constructs a probabilistic surrogate model of the objective function and strategically selects the next set of hyperparameters to evaluate by leveraging both the model and prior evaluation results. This approach allows for efficient convergence toward optimal solutions with minimal experimental runs, substantially reducing computational overhead. Compared to grid search and random search methods, Bayesian optimization minimizes inefficient sampling and typically identifies better-performing hyperparameters in fewer iterations, making it particularly suitable for deep learning models with long training times and large hyperparameter spaces. Given the numerous hyperparameters involved in our model, Bayesian optimization was employed to accelerate the training process and efficiently identify a well-performing network configuration. The Bayesian optimization flow chart is shown in Figure 4.
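As an illustrative sketch of this tuning loop (assuming scikit-optimize as the BO library, which the paper does not specify, and a hypothetical train_and_validate helper that trains the network with the given hyperparameters and returns its validation RMSE):

```python
# Hedged Bayesian-optimization sketch using scikit-optimize (an assumption;
# the paper does not name its BO implementation).
from skopt import gp_minimize
from skopt.space import Integer, Real

# Search space mirroring the hyperparameters tuned in Tables 2 and 5.
space = [
    Integer(32, 256, name="units"),                    # neurons per layer
    Real(1e-4, 1e-1, prior="log-uniform", name="lr"),  # learning rate
    Real(0.1, 0.5, name="dropout"),                    # dropout probability
]

def objective(params):
    units, lr, dropout = params
    # train_and_validate is a hypothetical helper: it trains the
    # Bi-LSTM/Bi-GRU with these hyperparameters and returns validation RMSE.
    return train_and_validate(units, lr, dropout)

result = gp_minimize(objective, space, n_calls=30, random_state=42)
print("best hyperparameters:", result.x, "best validation RMSE:", result.fun)
```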

3.4. K-Means Clustering

The fluctuation in inventory levels of prefabricated components is influenced by several factors, including the progress of component production and construction activities, which directly impact the quantity of inventory in storage. By employing the K-means clustering method, data points with similar influencing factors can be grouped, enabling more accurate predictions when training the model on clustered data.
Integrating K-means clustering with LSTM enhances the model’s effectiveness by exploring data from multiple dimensions. K-means categorizes data based on the similarity of influencing factors, while LSTM captures the correlation between these factors and the target variable, leading to improved prediction accuracy.
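A minimal sketch of this clustering step follows (scikit-learn assumed, with a placeholder feature matrix; the paper's rescaled CH criterion that "approximates 1" is replaced here by simply maximizing the raw CH score, which is its standard usage):

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.metrics import calinski_harabasz_score

X = np.random.rand(200, 7)  # placeholder for the 7 influencing-factor features

scores = {}
for k in range(2, 8):
    labels = KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(X)
    scores[k] = calinski_harabasz_score(X, labels)

best_k = max(scores, key=scores.get)  # raw CH score: higher is better
print(scores, "chosen K:", best_k)
```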

3.5. ABL Model

The improved LSTM algorithms, specifically the ATT-Bo-Bi-LSTM (ABL) model, were primarily used to extract information from input feature data. The attention mechanism aids in efficiently selecting key information from a vast array of features, focusing the model on the most relevant data for the task at hand. After passing through the Bi-LSTM, optimized through Bayesian methods and linked to the attention mechanism layer, the model filters out less impactful features. The refined feature information is then passed into a fully connected layer, where the results from the attention mechanism are aggregated to generate the final prediction.
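A minimal Keras sketch of this base learner is given below. The exact attention design is an assumption (a learned softmax weighting over time steps); layer sizes follow Tables 2 and 5, and swapping the recurrent cell yields the ABG variant described in the next subsection:

```python
# Sketch of the ABL/ABG base learner: bidirectional RNN -> attention
# weighting -> fully connected output (attention design assumed).
import tensorflow as tf
from tensorflow.keras import layers, Model

def build_base_learner(time_steps=15, n_features=7, units=128, cell="LSTM"):
    rnn = layers.LSTM if cell == "LSTM" else layers.GRU
    inputs = layers.Input(shape=(time_steps, n_features))
    x = layers.Bidirectional(rnn(units, return_sequences=True))(inputs)
    x = layers.Dropout(0.3)(x)
    # Attention: score each time step, softmax-normalize, weighted sum.
    scores = layers.Dense(1, activation="tanh")(x)
    weights = layers.Softmax(axis=1)(scores)
    context = layers.Lambda(lambda t: tf.reduce_sum(t[0] * t[1], axis=1))([x, weights])
    outputs = layers.Dense(1)(context)  # predicted component quantity
    return Model(inputs, outputs)

abl = build_base_learner(cell="LSTM")  # ABL variant
abg = build_base_learner(cell="GRU")   # ABG variant (faster training)
abl.compile(optimizer=tf.keras.optimizers.Adam(1e-2), loss="mse")
```

In practice, Bayesian optimization would supply the unit count, dropout rate, and learning rate in place of the fixed values shown here.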

3.6. ABG Model

The ABL model was replaced with the ATT-Bo-Bi-GRU (ABG) model to improve training speed. Bi-GRU, a variant of Bi-LSTM, simplifies the internal structure while retaining the effectiveness of Bi-LSTM in addressing time series problems. With fewer hyperparameters, Bi-GRU offers faster training compared to Bi-LSTM. Using ABG as the base learner in the ensemble learning model results in a significant increase in training speed, thus reducing the overall model training time.

3.7. Ensemble Learning Model

The effectiveness of a single model in inventory prediction is often limited, as it tends to focus on only one aspect of the data, frequently overlooking other important influencing factors. This can lead to inaccurate predictions or suboptimal model performance. To address these limitations, this paper proposes a multi-model overlay inventory prediction model based on ensemble learning. By employing the Stacking algorithm, various models’ predictions are combined, and predictions from single models serve as new training set data for secondary training, thereby enhancing the overall predictive capability of the model.
The Stacking prediction model is composed of two layers. The first layer consists of base learners, determined by clustering results. After training, the different prediction results from the base learners undergo weighted analysis and are input into a second-layer meta-learner, which generates the final prediction. This multi-model overlay inventory prediction model using the Stacking algorithm overcomes the bias and limitations associated with single models, improving overall predictive performance and robustness across various scenarios.
The ensemble learning model presented in this paper (LSTM–ensemble learning (LEL)) uses the ATT-Bo-Bi-LSTM (ABL) model as its base learner. The original data are clustered, and the resulting data groups are input separately into the base learners, ensuring their diversity and independence. Since using deep learning algorithms in the first layer may lead to overfitting, a simpler linear regression algorithm is chosen as the meta-learner in the second layer; this minimizes overfitting while keeping the overall complexity of the model under control. The general flow chart is shown in Figure 5.
In this study, the ABL model was replaced with the ATT-Bo-Bi-GRU (ABG), followed by retraining. Comparative experiments demonstrated that both ensemble learning models achieved comparable predictive accuracy. However, the ensemble learning model using ABG as its base learner (GRU–ensemble learning (GEL)) exhibited shorter training times than LEL. As a result, GEL is better suited for the real-time dynamic adjustments required in prefabricated component warehousing. The improved overall process flowchart is shown in Figure 6.
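A compact sketch of this arrangement follows (scikit-learn assumed for the meta-learner; build_base_learner refers to the hypothetical Keras constructor sketched in Section 3.5, and clusters holds the K-means data groups):

```python
# Sketch of the Stacking arrangement: identical base learners, each
# trained on one K-means cluster, with linear regression as meta-learner.
import numpy as np
from sklearn.linear_model import LinearRegression

def train_stacking(clusters, X_val, y_val):
    """clusters: list of (X_train, y_train) arrays, one per K-means group."""
    base_learners = []
    for X_c, y_c in clusters:
        model = build_base_learner(cell="GRU")  # hypothetical constructor
        model.compile(optimizer="adam", loss="mse")
        model.fit(X_c, y_c, epochs=50, verbose=0)
        base_learners.append(model)
    # Base-learner predictions become the meta-learner's input features.
    meta_X = np.column_stack([m.predict(X_val, verbose=0).ravel()
                              for m in base_learners])
    meta = LinearRegression().fit(meta_X, y_val)
    return base_learners, meta
```

Each base learner sees only its own cluster, so the linear meta-learner effectively learns how much to trust each feature-specialized model.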

4. Mathematical Model for Predicting Prefabricated Component Data

In the practical scheduling of prefabricated component warehousing, minimizing the time required for adjustments in the storage area is crucial for enhancing operational efficiency. Therefore, a mathematical model was developed to optimize the time needed for scheduling adjustments in prefabricated component warehousing.
The optimization model for precast component storage scheduling focuses on the prefabricated components that need to be adjusted, represented mathematically as follows:
\[ \min\left( Mo_{1} + Mo_{2} + Mo_{3} \right) \]
Symbols and their explanations are listed in Table 1. To optimize and adjust the current precast component storage area, the following steps were applied: the current working time period, denoted as \(k\), was determined, and the in-stock and out-stock status of precast components for the next time period, \(k+1\), was assessed.
\[ S_{next} = T_{k+1} - T_{k} = \sum_{i \in \{1, 2, \ldots, n\}} \left( M_{i} - N_{i} \right) \times S_{i} \]
Various influencing factors were set as \(C_{q}\), \(q \in \{1, 2, \ldots, n\}\).
Let us assume there are n types of prefabricated components transported to m different project storage areas. \(O\) is the set of prefabricated components, \(O = \{1, 2, \ldots, n\}\), where \(i\) is the index of the type of prefabricated component, \(i \in O\), and \(j\) is the storage area number for storing these prefabricated components, \(j \in \{1, 2, \ldots, m\}\). If prefabricated component \(i\) is stored in area \(j\), then \(x_{ij}\) is 1; otherwise, it is 0. For prefabricated component \(i\), the scheduling time for warehousing transfer to area \(j\) is denoted \(r_{ij}\), and the time consumed searching for it within the area is denoted \(c_{ij}\). \(w\) represents the influencing factor that changes the number of prefabricated components in the warehouse, and \(d_{i}\) represents the time consumed by re-planning for prefabricated component \(i\) in the warehouse.
Based on the information provided above, the problem of dynamically adjusting the prefabricated component storage area can be described as \(\min\left( Mo_{1} + Mo_{2} + Mo_{3} \right)\); the specific objective function is formulated as follows:
\[ \min\left( \lambda_{1} \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} r_{ij} + \lambda_{2} \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} c_{ij} + \lambda_{3}\, w\, d_{i} \right) \]
In the objective function model, the first term \(\lambda_{1} \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} r_{ij}\) represents the total time spent on the storage scheduling of each prefabricated component in the warehouse, aiming to maximize the rational utilization of each project area. The second term \(\lambda_{2} \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} c_{ij}\) represents the total time spent searching for the storage areas of prefabricated components, aiming to minimize search operations. The third term \(\lambda_{3} w d_{i}\) accounts for the time spent on reassigning storage areas during the transportation process, ensuring that the scheduling proceeds according to plan. Here, \(\lambda_{i}\), \(i \in \{1, 2, 3\}\), represents the weights of the objective function terms, with \(\sum_{i} \lambda_{i} = 1\). A small numerical sketch of this objective follows the constraint list below. The constraints are as follows:
(1)
The constraints ensure that each prefabricated component is stored in its corresponding project area, and each component is assigned to one and only one area:
\[ \sum_{j=1}^{m} x_{ij} = 1, \quad i = 1, 2, \ldots, n \]
(2)
Ensure that the production rate is more than the consumption rate when the decrease in the number of prefabricated components is caused by external factors:
\[ \sum_{i=1}^{n} d_{i} N_{i} < \sum_{i=1}^{n} d_{i} M_{i} \]
(3)
Ensure that the adjusted storage area for prefabricated components is more than the sum of the areas of all prefabricated components within the region:
\[ \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} S_{i} \leq \sum_{j=1}^{m} T_{k+1}\, x_{ij}, \quad i = 1, 2, \ldots, n, \quad j = 1, 2, \ldots, m \]
(4)
Ensure that the total time during the dynamic partitioning of storage areas is more than the optimization time:
\[ \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} r_{ij} \geq \sum_{i=1}^{n} \sum_{j=1}^{m} x_{ij} c_{ij} + w\, d_{i} \]
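To make the objective concrete, the following toy evaluation (entirely hypothetical numbers, with the third term summed over components as one plausible reading) computes the weighted total time for a random feasible assignment:

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 4, 3                       # component types, storage areas
x = np.zeros((n, m), dtype=int)
x[np.arange(n), rng.integers(0, m, size=n)] = 1  # each component in exactly one area
r = rng.uniform(1, 5, (n, m))     # transfer scheduling times
c = rng.uniform(0.5, 2, (n, m))   # in-area search times
w = 0.2                           # re-planning influence factor (hypothetical)
d = rng.uniform(1, 3, n)          # re-planning times per component
lam = np.array([0.5, 0.3, 0.2])   # objective weights, summing to 1

objective = (lam[0] * np.sum(x * r)
             + lam[1] * np.sum(x * c)
             + lam[2] * w * d.sum())
print("objective value:", round(objective, 3))
```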
Use the K-means algorithm to cluster the original data into groups, resulting in \(G_{1}, G_{2}, \ldots, G_{n}\). Split the clustered data into a 70% training set \(D_{train}\) and a 30% test set \(D_{test}\). Input \(G_{1}, G_{2}, \ldots, G_{n}\) into the ATT-Bo-Bi-LSTM model separately for training, resulting in n base learners. To improve the model’s generalization ability and avoid the situation where the model performs well on known data but poorly on unknown data, 5-fold cross-validation is used during training. The training set is divided into \(D_{train1}\), \(D_{train2}\), \(D_{train3}\), \(D_{train4}\), and \(D_{train5}\); in sequence, each portion serves as the validation set across five iterations of training. The trained model equation is as follows:
\[ M_{ik} = N_{i}\left( D_{train} - D_{train\,k} \right) \]
where \(M_{ik}\) represents the model obtained by the i-th algorithm in the k-th cross-validation training, and \(N_{i}\) represents the i-th algorithm:
\[ Y_{ik} = M_{ik}\left( D_{train\,k} \right) \]
\[ Y_{i} = \left( Y_{i1}, Y_{i2}, Y_{i3}, Y_{i4}, Y_{i5} \right) \]
where \(Y_{ik}\) represents the predictions made by the i-th base learner on the validation set \(D_{train\,k}\) during the k-th cross-validation iteration, and \(Y_{i}\) represents the predictions made by the i-th base regressor on the n samples from all the validation sets after the 5-fold cross-validation.
\[ Z_{ki} = M_{ik}\left( D_{test} \right) \]
\[ Z_{i} = \frac{1}{5} \sum_{k=1}^{5} Z_{ki} \]
where \(Z_{ki}\) represents the predictions made on the test set \(D_{test}\) by base learner \(M_{ik}\) during the k-th cross-validation iteration, and \(Z_{i}\) represents the average of all the predictions made by the i-th base learner during cross-validation.
In the second-layer linear regression model, a new training set \(S_{train}\) and a new testing set \(S_{test}\) are constructed using the results from the first-layer base learners. The predictions are then made based on these new sets:
\[ M_{stacking} = LR\left( S_{train} \right) \]
\[ Pre_{SetS} = M_{stacking}\left( S_{test} \right) \]
\[ Pre_{SetD} = M_{stacking}\left( D_{test} \right) \]
where \(D_{test}\) is input into the \(M_{stacking}\) model to validate the accuracy of the model.
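A minimal sketch of this 5-fold stacking procedure is shown below (scikit-learn assumed; make_model is a hypothetical constructor standing in for the ATT-Bo-Bi-LSTM/GRU builder):

```python
import numpy as np
from sklearn.model_selection import KFold
from sklearn.linear_model import LinearRegression

def stack_one_learner(make_model, X_train, y_train, X_test):
    """Out-of-fold (Y_i) and averaged test (Z_i) predictions for one base learner."""
    kf = KFold(n_splits=5, shuffle=True, random_state=0)
    oof = np.zeros(len(X_train))   # Y_i: out-of-fold validation predictions
    test_preds = []                # Z_ki: per-fold predictions on the test set
    for train_idx, val_idx in kf.split(X_train):
        model = make_model()  # hypothetical ATT-Bo-Bi-LSTM/GRU constructor
        model.fit(X_train[train_idx], y_train[train_idx], epochs=50, verbose=0)
        oof[val_idx] = model.predict(X_train[val_idx], verbose=0).ravel()
        test_preds.append(model.predict(X_test, verbose=0).ravel())
    return oof, np.mean(test_preds, axis=0)  # (Y_i, Z_i)

# The stacked columns of out-of-fold predictions form S_train, and the
# averaged test predictions form S_test; linear regression is the meta-learner:
# S_train = np.column_stack([oof_1, ..., oof_n])  # one column per base learner
# meta = LinearRegression().fit(S_train, y_train)
```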
The initial area for component storage is \(S_{origin}\), and the number of components in the current stage in the area is \(N_{origin}\). The area for the next stage is adjusted to \(S_{next}\):
\[ S_{next} = \frac{S_{origin}}{N_{origin}} \times Pre_{SetD} \]
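As an illustrative calculation with hypothetical numbers: if the current area \(S_{origin} = 500\ \text{m}^2\) holds \(N_{origin} = 200\) components and the stacked model predicts \(Pre_{SetD} = 240\) components for the next stage, the adjusted area is \(S_{next} = (500 / 200) \times 240 = 600\ \text{m}^2\).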

5. Experimental Simulation Comparison

The previous management and information system for stacking prefabricated components was inadequate, resulting in incomplete historical data. In recent years, with the advancement of informatization, the information system has been better managed. However, the dataset for prefabricated components remains relatively small. To validate the effectiveness of our method, we selected the Chinese port container throughput dataset, as it shares similarities with our prefabricated component data. Both involve large-volume items, making the port dataset suitable for our validation purposes.
Therefore, transportation scheduling data from port container logistics, which closely resembles the structure of prefabricated component transportation and scheduling, was utilized for training. Over time, the scheduling data will be enhanced in collaboration with prefabricated component production companies. The proposed model will then be retrained using the prefabricated component dataset. To demonstrate the effectiveness of the proposed dynamic prediction method for prefabricated component storage, a simulation experiment environment was set up for validation. The validation was conducted on a system with an Intel Core i5-8300H processor, Windows 10 OS, and 16 GB of memory. All experiments in this study were carried out using Python 3.8, with analyses performed on PyCharm 2019.

5.1. Data Preprocessing

This paper utilized the Chinese port container throughput data for model training. Seven factors influencing container storage fluctuations—container outbound volume, berth capacity, container inbound volume, current container storage volume, container cargo type, vessel transport time to the site, and container inspection results—were used as inputs. These factors were grouped using the K-means algorithm. As shown in Figure 7, the initial K value is randomly defined, and the variance ratio criterion (Calinski-Harabasz, CH) is used to assess its reasonableness. The K value is adjusted iteratively, and the value whose CH index is closest to 1 is selected as the optimal number of clusters.
According to the CH criterion, the optimal number of clusters is 3, as this K value yields the CH index closest to 1. The current influencing factors are therefore categorized into three groups, and the clustering results are visualized in Figure 8. As a result, the overall data are divided into three clusters.

5.2. Comparison Between ABL and Unimproved LSTM Prediction

A Bo-Bi-LSTM neural network prediction algorithm was developed, incorporating an attention mechanism. The overall model consists of an input layer, a Bi-LSTM layer, an attention mechanism layer, a fully connected layer, and an output layer. Bayesian optimization is used as the learning algorithm for the Bi-LSTM network. Unlike traditional hyperparameter optimization methods, Bayesian optimization can automatically adjust the learning rate for weights. Compared to the classic LSTM network, this approach allows for the quicker identification of suitable hyperparameters, facilitating the convergence of the short-term prediction model. Additionally, it helps avoid the issue of the model getting stuck in local optima during training. The model was trained for a total of 1000 iterations.
Based on the Bayesian optimization algorithm for hyperparameter tuning, and after iterative evaluations of the objective function, Table 2 presents the hyperparameter combination that resulted in the minimum loss of the objective function. The optimized hyperparameter values are as follows (see Table 2): number of neurons = 128; learning rate = 0.016; dropout probability = 0.3; L2 regularization = 0.0001; training time window = 24. To balance convergence speed and computational efficiency, the maximum number of iterations is set at 1000, a value that has been experimentally validated in the previous literature for effective error convergence.
Figure 9 and Figure 10 show the comparison of the prediction curves between the ABL model proposed in this paper and the classic LSTM using the same dataset. From the comparison of these two sets of images, it can be observed that the predicted values from the ABL model align more closely with the actual values, indicating that the ABL model performs better than the LSTM model in predicting time series data. The Bayesian optimization process chart is shown in Figure 11.

5.3. Comparison Between the LEL Model and Single ABL Model

This paper uses the Stacking ensemble learning model. During training, each base learner was trained using a 5-fold cross-validation method. This approach divides the training set into five equal parts, iteratively using each part as the validation set across five iterations. The predictions from these five iterations on the validation set are used as the training set for the second layer, while the test set results from the base learners are used as the test set for the second layer. Finally, a linear regression (LR) algorithm serves as the meta-learner to train and obtain the final prediction results.
Figure 12 and Figure 13 show the comparison of the prediction curves between the LEL ensemble learning model proposed in this paper and the single ABL model using the same dataset. From the figures, it is evident that the ensemble learning model offers better prediction accuracy than the single model.

5.4. Indicators for the Evaluation of Predictive Models

To evaluate the performance of the models, this study employs several evaluation metrics to assess the effectiveness of the short-term prediction model. These metrics include the mean absolute error (MAE), coefficient of determination (R2), root mean square error (RMSE), and mean absolute percentage error (MAPE). The mathematical expressions for these metrics are shown in Equations (15)–(18).
\[ MAE = \frac{1}{n_{0}} \sum_{g=1}^{n_{0}} \left| y_{g} - \hat{y}_{g} \right| \]
\[ R^{2} = 1 - \frac{\sum_{g} \left( \hat{y}_{g} - y_{g} \right)^{2}}{\sum_{g} \left( \bar{y} - y_{g} \right)^{2}} \]
\[ RMSE = \sqrt{ \frac{1}{n_{0}} \sum_{g=1}^{n_{0}} \left( y_{g} - \hat{y}_{g} \right)^{2} } \]
\[ MAPE = \frac{1}{n_{0}} \sum_{g=1}^{n_{0}} \frac{\left| y_{g} - \hat{y}_{g} \right|}{y_{g}} \]
where \(y_{g}\) denotes the actual value, \(\hat{y}_{g}\) the predicted value, \(\bar{y}\) the mean of the actual values, and \(n_{0}\) the number of samples.
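For reference, a direct NumPy transcription of these four metrics (a minimal sketch; y is the ground-truth series and p the prediction):

```python
import numpy as np

def mae(y, p):
    return np.mean(np.abs(y - p))

def rmse(y, p):
    return np.sqrt(np.mean((y - p) ** 2))

def mape(y, p):
    return np.mean(np.abs((y - p) / y))  # assumes y contains no zeros

def r2(y, p):
    ss_res = np.sum((y - p) ** 2)
    ss_tot = np.sum((y - np.mean(y)) ** 2)
    return 1.0 - ss_res / ss_tot

# Toy example with made-up values
y = np.array([10.0, 12.0, 15.0, 14.0])
p = np.array([9.5, 12.5, 14.0, 14.5])
print(mae(y, p), rmse(y, p), mape(y, p), r2(y, p))
```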

5.5. Experimental Comparison

To evaluate the short-term prediction performance and robustness of the model, an experiment was conducted comparing various models, including the classical LSTM network, bidirectional LSTM network, BP neural network, gray prediction model, and ABL. To demonstrate the effectiveness of the ensemble learning model, comparisons were made between the model without ensemble learning (ABL) and the ensemble learning model proposed in this paper (LSTM–ensemble learning (LEL)). The evaluation metrics are presented in Table 3. The graph illustrates the mean absolute percentage error results from the table, providing a more intuitive comparison.
When comparing the predictive accuracy of different models, the R2 value of the poorest-performing GM model was used as a benchmark. The improvement percentage of other models was calculated by subtracting the benchmark value from the evaluated value, obtaining the absolute difference, dividing this difference by the benchmark value, and then multiplying by 100% to express the result as a percentage. This percentage reflects the improvement of the evaluated model over the benchmark.
The results showed that the LEL model exhibited an improvement of 3.38%, significantly outperforming the other comparative models. This demonstrates that the LEL model is more effective and better suited for predicting the quantity of prefabricated components.
Ablation experiments were conducted on the ABL model to verify its superior predictive performance. Additionally, to validate the faster training speed of ABG, a comparative analysis of the training times between ABG and ABL was performed.
From Table 4, it can be concluded that both ABL and ABG perform well in forecasting time series data, with ABL’s evaluation indices improving on those of the unimproved LSTM and Bi-LSTM models. Compared to the ABL model, ABG reduced the training time by 1.3%, indicating that ABG offers faster training speed.
Figure 14 illustrates the forecast results from traditional models, such as BP neural networks, alongside the proposed ensemble learning model applied to the same dataset.
The x-axis represents months, while the y-axis represents the number of containers. The green curve in Figure 14 represents the LEL ensemble learning model proposed in this paper, and the red curve represents the true values from the dataset.
Other curves represent predicted values from various models. As seen in the figure, the green LEL curve exhibits the highest degree of alignment with the true value curve, while the other curves show considerable deviations at different time points. This demonstrates that the LEL model proposed in this study achieves higher accuracy.

5.6. Algorithm Improvement Based on Training Time

During the experimentation, it was observed that LSTM requires a longer training time due to the large number of hyperparameters. To expedite the process and obtain predictions more rapidly, LSTM was replaced with GRU. Unlike LSTM, GRU maintains similar functionality but has a simpler structure and fewer hyperparameters.
The ABG model proposed in this paper is compared with the Stack-XGBoost ensemble learning model introduced in reference [29]. The optimized hyperparameter values for the GRU neural network are shown in Table 5.
From Table 6 and Table 7, it can be observed that LEL and GRU–ensemble learning (GEL) exhibit similar performance across various evaluation metrics. However, GEL shows a 2.32% faster training time than LEL and a reduction in memory usage by 6.61%. Compared to the Stack-XGBoost ensemble learning model, although GEL’s performance on some evaluation metrics is slightly inferior, it excels in training time and memory utilization.
As shown in Figure 15, Methods 3 and 4 exhibit more stable errors compared to the ensemble learning model of Method 1. Method 3 shows 14 instances with absolute errors exceeding 5%, constituting 7% of the total samples, while Method 4 exhibits 12 instances with absolute errors exceeding 5%, accounting for 6% of the total samples. Only one instance in Method 4 exceeds 10% in absolute error. The number of Method 1 errors exceeding 10% is 1.42 times that of Methods 3 and 4. Therefore, it can be concluded that Methods 3 and 4 demonstrate better data analysis and prediction capabilities than Method 1, showcasing higher stability.
Method 2 outperforms Methods 3 and 4 on several prediction indicators. However, according to Table 7, Methods 3 and 4 excel in terms of model training time and memory usage; in particular, Method 4 shows clear advantages in both. In summary, the proposed dynamic prediction method for prefabricated component storage areas applies to various samples, provides more accurate predictions than typical ensemble learning models, exhibits smoother prediction errors, and demonstrates higher reliability and stability.
In conclusion, by designing comparative experiments, it can be observed that the ensemble model proposed in this paper has stronger general applicability, certain advantages in various indicators, and high reliability and stability. Due to the limited data on prefabricated component storage currently available, this paper uses similar datasets for training. The model’s performance would improve with more comprehensive long-term data. Additionally, better hardware configurations would also enhance the results.

6. Conclusions

Planning prefabricated component storage areas is crucial for component production planning and construction schedule management. This paper focuses on the dynamic adjustment of prefabricated component inventory storage areas. The comparison of the improved ensemble learning model’s prediction curve with other classic prediction models demonstrates a higher degree of overlap between the improved ensemble learning model and the real value curve. The GEL model is proposed to enhance model training speed, showing a 2.32% improvement in training speed compared to the LEL model. Compared to the LEL and AdaBoost models, GEL exhibits reduced error fluctuations and improved prediction accuracy; compared to Stack-XGBoost, it achieves comparable accuracy with notably more efficient training time and memory usage.
This study is particularly suited for real-time adjustments in prefabricated component storage areas. It confirms that the improved ensemble learning model is more accurate in forecasting than single models. This approach can be applied to real-time inventory management systems in enterprises, integrated with other management systems to optimize production plans. Ultimately, we aim to build an Internet-plus-cloud platform for the industrial Internet, deeply integrating production and management processes. This platform will enhance coordination between inventory forecasting models and the entire enterprise production control system. In the future, it could be combined with large language models to further improve adaptability to production and operational changes and enhance the ability to handle unexpected situations, thereby improving prediction accuracy.

Author Contributions

S.L. and Z.H.: Contributed to the conception, the evaluation, and the improvement of the approach, and to the revision of the manuscript. X.H. and S.Z.: Contributed to the problem formulation, the algorithm design, the numerical simulations and results analysis, and to the preparation of the first draft of the manuscript. All authors contributed to the article and approved the submitted version. All authors have read and agreed to the published version of the manuscript.

Funding

The work was supported by the Key R&D Plan Project in Liaoning Province (2020JH2/10100039), Key Project of Basic Research Projects in Higher Education Institutions of Liaoning Province (LJKZ0583), and Liaoning Provincial Department of Science and Technology Applied Basic Research Program (2022JH2/101300253).

Data Availability Statement

The data in this article are used for research purposes only and may not be shared.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Zhang, J.; Tao, F.; Su, T. Research on assembly building integration system base on BIM technology. Build. Sci. 2018, 34, 97–102+129.
  2. Li, Y.; Liu, M.; Wang, F.; Li, R. Evaluation method of safety performance cloud modeling for assembled building projects. China Saf. Sci. J. 2017, 27, 115–120.
  3. Zhao, L.; Han, Q. Research on Evaluation of Cost Influencing Factors of Assembled Buildings. Constr. Econ. 2018, 39, 25–29.
  4. Wang, G.; Wu, Z. Analysis of incremental cost of assembled concrete building and research on countermeasures. Constr. Econ. 2017, 38, 15–21.
  5. Shen, J.; Hua, Y.; Yuan, M. Research on Lean Cost Management of Assembly Building. Constr. Econ. 2019, 40, 45–49.
  6. Li, H.; Huang, T.; Ding, X.; Luo, H.; Huang, L. Public transportation travel demand prediction based on multi-scale spatio-temporal graphical convolutional networks—An example of cab and shared bicycle. J. Comput. Appl. 2024, 44, 2065–2072.
  7. Wu, H.; Chen, Y.; Zhu, Z.; Li, X.; Yue, Q. Improved one-dimensional convolutional neural network for hierarchical prediction of convergent deformation of tunnel surrounding rock. J. Basic Sci. Eng. 2024, 32, 145–159.
  8. Baniasadi, S.; Salehi, R.; Soltani, S.; Martín, D.; Pourmand, P.; Ghafourian, E. Optimizing Long Short-Term Memory Network for Air Pollution Prediction Using a Novel Binary Chimp Optimization Algorithm. Electronics 2023, 12, 3985.
  9. Zhu, J.; Gu, W.; Ren, M.; Zhang, Z.; Zhang, W. Predictive modeling of subway air quality based on SWT-ISSA-LSTM. Foreign Electron. Meas. Technol. 2023, 42, 164–174.
  10. Shi, H.; Shang, Y.; Bai, X.; Guo, L.; Ma, H. Research on SWDAE-LSTM rolling bearing early fault prediction method based on Bayesian optimization. J. Vib. Shock 2021, 40, 286–297.
  11. Li, X.; Li, H.; Ma, J. Short-term load forecasting model based on single-step prediction LSTM. Comput. Simul. 2022, 39, 98–102+117.
  12. Zhang, P.; Yin, X.; Li, S.; Wang, X. Short-term forecasting of electricity load in agricultural greenhouse park based on VMD-CNN-LSTM. Inf. Control. 2024, 53, 238–249.
  13. Sun, X.; Fu, Y. Container throughput prediction based on RF-bidirectional LSTM. J. Shanghai Marit. Univ. 2022, 43, 60–65.
  14. Wang, F.; Zhang, X.; Yan, J.; Ji, Z. Container throughput prediction of Shanghai harbor based on LSTM. Navig. China 2022, 45, 109–114.
  15. Shao, X.; Duan, C.; Luo, W.; Xu, L.; Zhong, Y. A fusion-attention Bi-LSTM-based blocking prediction method for V2X communications. Radio Eng. 2024, 54, 1277–1285.
  16. Dang, Z.; Sun, B.; Li, C.; Yuan, S.; Huang, X.; Zuo, Z. CA-LSTM: An Improved LSTM Trajectory Prediction Method Based on Infrared UAV Target Detection. Electronics 2023, 12, 4081.
  17. Tang, Y.; Zou, H.; Jiang, X.; Tang, J.; Zhao, J. A bus load forecasting method based on VMD and Bayesian optimization LSTM. Power Syst. Clean Energy 2023, 39, 46–52+59.
  18. Hu, C.; Jiang, W. Pork price prediction model based on VMD-BO-BiLSTM. J. Appl. Sci. 2023, 41, 692–704.
  19. Gao, L.; Tang, Y.; Chen, L.; Wang, B. Bayes-LSTM prediction method for lift and sink motion of crude oil tanker sailing at sea. Oil Gas Storage Transp. 2023, 42, 1291–1296.
  20. Guo, Y.; Chen, Y.; He, C.; Xu, Y.; He, Z. Seismic response prediction of electrical equipment coupling system in traction station based on LSTM neural network. J. Railw. Sci. Eng. 2024, 21, 1602–1612.
  21. Yao, L.; Zhang, Y.; Chen, L.; Han, Z. Time series prediction based on adaptive VMD-attention mechanism LSTM. Control Eng. China 2022, 29, 1337–1344.
  22. Ren, J.; Wei, H.; Zou, Z.; Hou, T.; Yuan, Y. Ultra-short-term power load forecasting based on CNN-BiLSTM-Attention. Power Syst. Prot. Control 2022, 50, 108–116.
  23. Liu, L.; Chen, J.; Liu, X.; Yang, J. An Improved Method for Photovoltaic Forecasting Model Training Based on Similarity. Electronics 2023, 12, 2119.
  24. Lin, W.; Xie, L.; Xu, H. Deep-Reinforcement-Learning-Based Dynamic Ensemble Model for Stock Prediction. Electronics 2023, 12, 4483.
  25. Shafqat, W.; Malik, S.; Lee, K.T.; Kim, D.H. PSO Based Optimized Ensemble Learning and Feature Selection Approach for Efficient Energy Forecast. Electronics 2021, 10, 2188.
  26. Wang, Y.; Zhang, H.; An, Y.; Ji, Z.; Ganchev, I. RG Hyperparameter Optimization Approach for Improved Indirect Prediction of Blood Glucose Levels by Boosting Ensemble Learning. Electronics 2021, 10, 1797.
  27. Mao, B.; Qin, W.; Xiao, X.; Zheng, Z. SOH estimation of lithium-ion battery based on LSTM&GRU-Attention multi-joint model. Energy Storage Sci. Technol. 2023, 12, 3519–3527.
  28. Lian, H.; Li, Q.; Wang, R.; Xia, X.; Zhang, Q. Research on deep learning-based LSTM-GRU composite modeling mine influx prediction method. Saf. Coal Mines 2024, 55, 166–172.
  29. Mubarak, H.; Sanjari, M.; Stegen, S.; Abdellatif, A. Improved Active and Reactive Energy Forecasting Using a Stacking Ensemble Approach: Steel Industry Case Study. Energies 2023, 16, 7252.
  30. Choi, S.; Kim, S.; Jung, H. Ensemble Prediction Model for Dust Collection Efficiency of Wet Electrostatic Precipitator. Electronics 2023, 12, 2579.
  31. Wang, K.; Zhang, J.; Li, X.; Zhang, Y. Long-Term Power Load Forecasting Using LSTM-Informer with Ensemble Learning. Electronics 2023, 12, 2175.
Figure 1. Technical roadmap.
Figure 2. LSTM structure diagram.
Figure 3. Comparison between the improved Mish function and other activation functions.
Figure 4. Bayes optimization flow chart.
Figure 5. General flow chart.
Figure 6. Improved overall process flowchart.
Figure 7. K value selection.
Figure 8. Clustering results.
Figure 9. The comparative graph of test sets.
Figure 10. The comparative graph of training sets.
Figure 11. Bayesian optimization process chart.
Figure 12. Comparative graph of ensemble learning algorithms and single algorithms on the test set.
Figure 13. Comparative graph of ensemble learning algorithms and single algorithms on the training set.
Figure 14. Comparison of forecast results between true value and prediction models.
Figure 15. Prediction errors of 4 approaches.
Table 1. Symbols and explanations.

Symbol | Explanation
Mo1 | The sum of the maximum times for precast components during the storage scheduling process
Mo2 | The sum of the additional time incurred due to disordered component placement during precast component storage scheduling
Mo3 | The sum of the time spent on re-dividing areas during precast component storage scheduling
Mi | The average daily production quantity of each type of precast component by the manufacturing company
Ni | The daily average construction consumption quantity of each type of precast component
Tk | The current project's storage area for precast components
Si | The area occupied by each type of precast component
Tk+1 | The new area
Snext | The adjusted area
Table 2. Optimized hyperparameter values of the LSTM neural network.

Parameter | Value
LSTM layers | 2
Number of neurons per LSTM layer | 128
Probability of discarding neurons (dropout) | 0.3
Ergodic times (iterations) | 1000
Standardization rule | Min–max normalization (0, 1)
Error parameter | MSE
L2 regularization parameter | 0.0001
Learning rate | 0.016
Time step | 15
Training time window | 24
Table 3. Comparison of methods.

Method | RMSE | MAE | MAPE | R2 | Growth Rate
BP | 2.4965 | 2.1654 | 0.1751 | 0.9537 | +0.26%
GM | 2.2507 | 1.7198 | 0.1396 | 0.9512 | 0%
LSTM | 2.5897 | 2.1833 | 0.1851 | 0.9572 | +0.63%
Bi-LSTM | 1.5852 | 1.3759 | 0.1083 | 0.9608 | +1.01%
ABL | 1.1323 | 0.9924 | 0.0769 | 0.9626 | +1.20%
LEL | 1.0773 | 0.6245 | 0.0443 | 0.9834 | +3.38%
Table 4. Sensitivity analysis experiment.

Method | RMSE | MAE | MAPE | R2 | Training Time t/s
LSTM | 2.5897 | 2.1833 | 0.1851 | 0.9572 | 133.65
Bi-LSTM | 1.5852 | 1.3759 | 0.1083 | 0.9608 | 274.12
Bo-Bi-LSTM | 1.4755 | 1.1294 | 0.0812 | 0.9614 | 243.25
ABL | 1.1323 | 0.9924 | 0.0769 | 0.9626 | 302.46
ABG | 1.1331 | 0.9875 | 0.0763 | 0.9640 | 298.45
Table 5. Optimized hyperparameter values for the GRU neural network.

Parameter | Value
GRU layers | 2
Number of neurons per GRU layer | 256
Probability of discarding neurons (dropout) | 0.25
Ergodic times (iterations) | 1000
Standardization rule | Min–max normalization (0, 1)
Error parameter | MSE
L2 regularization parameter | 0.0001
Learning rate | 0.005
Time step | 15
Training time window | 24
Table 6. Evaluation indices of different ensemble learning models.

Method | Ensemble Model | Base Learners | MAPE | R2 | RMSE | MAE
1 | AdaBoost | ABL | 0.0984 | 0.9696 | 1.1033 | 0.8575
2 | Stack-XGBoost | ETR, RFR, and AdaBoost | 0.0422 | 0.9845 | 1.0765 | 0.6134
3 | LEL | ABL | 0.0443 | 0.9834 | 1.0773 | 0.6345
4 | GEL | ABG | 0.0436 | 0.9836 | 1.0769 | 0.6179
Table 7. Training time and memory occupation of different ensemble learning models.

Method | Ensemble Model | Base Learners | Training Time t/s | Memory Usage/MB
1 | AdaBoost | ABL | 643.44 | 657
2 | Stack-XGBoost | ETR, RFR, and AdaBoost | 610.56 | 1045
3 | LEL | ABL | 601.25 | 564
4 | GEL | ABG | 587.58 | 529