1. Introduction
Converter steelmaking offers a series of advantages, including high efficiency and low cost, making it one of the mainstream steelmaking methods in the steel industry [
1]. This method effectively reduces phosphorus content—a detrimental element in molten iron—through both physical heating and exothermic chemical reactions between components. The endpoint phosphorus content of molten steel serves as a critical indicator for assessing the quality of molten steel, which directly influences the steel’s plasticity and toughness [
2]. Consequently, the reasonable regulation of converter steelmaking process parameters, effectively reducing the endpoint phosphorus content of molten steel is one of the significant challenges that the steel industry needs to overcome to meet the growing market demand and achieve the goal of high-quality and low-cost steel production.
Scholars globally have conducted extensive research on process optimization to improve steel quality. Early studies by Deng Yong [
3] and Deng Nanyang [
4] focused on adjusting slag-forming and scrap material ratios to reduce endpoint phosphorus content under fixed furnace temperature conditions. However, the inherent complexity of in-furnace chemical reactions limits the effectiveness of purely dosage-based control strategies. Subsequently, the literature [
5] proposes a synergistic modeling strategy integrating principal component analysis (PCA) and neural networks (NNs) based on the characteristics of converter metallurgical processes and production data. This method employs PCA to perform linear orthogonal transformations on process parameters, thereby eliminating multicollinearity among variables. However, PCA—a linear projection technique rooted in variance maximization—fails to adequately address nonlinear coupling relationships inherent in metallurgical processes, resulting in the forced decoupling of critical nonlinear features. While the subsequently developed neural network prediction model enhances fitting capabilities through nonlinear activation functions, its fully connected architecture lacks embedded metallurgical thermodynamic constraints. Consequently, the weights of the hidden layer remain disconnected from actual physical processes, creating a fundamental disconnect between mechanistic principles and data-driven modeling. To enhance model performance, Han Kaifeng [
6], Yang Rongzheng [
7], and Azzaz [
8], R. introduced optimization algorithms to refine neural networks and extreme learning machines (ELMs), aiming to improve prediction accuracy for endpoint phosphorus content in molten steel. Nevertheless, these approaches retain inherent limitations: neural networks remain prone to overfitting, while ELM suffers from random weight initialization defects. Furthermore, such methods rely on engineers to preset static combinations of process parameters for pre-production simulations, effectively reducing the dynamic smelting process to a discrete parameter search. This simplification inherently limits real-time responsiveness to raw material composition fluctuations and fails to capture time-varying couplings among process parameters. To address steel quality optimization challenges, researchers have adopted intelligent optimization algorithms such as particle swarm optimization [
9,
10], genetic algorithm [
11,
12], and grey wolf algorithm [
13], among others. These algorithms iteratively adjust process parameters by holistically considering multiple influencing factors during smelting. However, conventional methods frequently converge to local optima, overlook critical process constraints, and struggle to balance model accuracy with industrial feasibility—particularly when simultaneously optimizing endpoint phosphorus content and production costs.
In addressing the optimization problem in the dephosphorization process paramters of the converter steelmaking production, this study employs the actual production data from an industrial steel plant as a case exemple. This paper describes the converter steelmaking process, numerically analyzing the objectives to be optimized along with their constraints, establishing the relationship model between process parameters and objective function through machine learning methods, and proposing the improved multi-objective whale optimization algorithm (IMOWOA). The scientific novelty manifested in three key innovations:
A hybrid metallurgical-mechanism modeling framework is constructed to enhance the interpretability of data-driven models by embedding thermodynamic constraints directly into the optimization architecture;
A Sobol sequence initialization and grey wolf collaborative search mechanism is designed to dynamically balance global exploration and local exploitation capabilities, addressing premature convergence in high-dimensional spaces;
A reward function is introduced to enable multi-objective synergy, ensuring economically optimal solutions while rigorously satisfying process constraints.
Experimental validation demonstrates that the IMOWOA achieves a 12.6% improvement in hypervolume (HV) compared to the second-best performing MOBWO, confirming the enhanced quality and stability of the solution set. In full-scale production trials, parameters optimized by IMOWOA reduce endpoint phosphorus content by 10.8% and lower production costs by 5.79% per ton of steel relative to current industrial benchmarks. These results validate its capability to deliver a framework for metallurgical process optimization that harmonizes physical rationality with engineering applicability.
The subsequent sections of this paper are structured as follows:
Section 1 research background and problem definition.
Section 2 delves into the analysis of the objective to be optimized.
Section 3 design principles of the improved multi-objective whale optimization algorithm.
Section 4 encompasses experiments and analysis.
Section 5 concludes and future directions.
2. Materials and Methods
This study utilizes the converter steelmaking production process of a steel mill (Shanxi Province, China) manufacturing Q235B-Z grade steel as a case study, with the process flow presented in
Figure 1. The production process encompasses out-of-furnace batching, in-furnace blowing, and a single-stage deoxidizing alloying process, which is aimed at initially refining the molten steel.
In the context of the industrial production process, dephosphorization predominantly occurs during the furnace-blowing stage. During this stage, the analytical assessment and monitoring of detection data from the batching and blowing stages of the furnace are critical for achieving precise control over the dephosphorization process. The test data encompass various parameters, including the composition of raw materials, temperature, and oxygen supply. Following the completion of the blowing process, a detailed evaluation of the process is conducted by measuring the steel composition data (i.e., endpoint composition). By meticulously adjusting the detection data to optimize the dephosphorization reaction, it is possible to effectively mitigate the occurrence of phosphorus reversion, thereby enhancing dephosphorization efficiency and reducing production costs [
2]. Consequently, to tackle the optimization problem associated with the dephosphorization process parameters during the furnace blowing stage, the optimization objectives are established based on numerical analysis of the smelting process, with the focus being on minimizing the endpoint phosphorus content (f
1) and the steel production cost per ton (f
2). The selected process parameters include the detection data of both the incoming batch and the furnace blowing stage.
Through a comprehensive analysis of the physicochemical mechanism underlying the dephosphorization reaction within the furnace, as documented in the literature [
2], several critical factors that influence the endpoint phosphorus content in steel were identified. These factors encompass the carbon, silicon, and phosphorus content in the molten iron, the quantity and temperature of the molten iron, the quantity of scrap utilized, the amounts of lime and ore input, the initial pouring temperature, and the volume of oxygen blowing, among other experimental parameters. Consequently, the relationship function for the phosphorus content at the endpoint was derived as follows:
In the formula, f1 denotes the endpoint phosphorus content, while wc0, wsi0, and wp0, respectively, represent the iron-carbon, silicon, and phosphorus content. Wiron, Wresteel, Wlime, and Wore, respectively, refer to the weight of iron, scrap, lime, and ore. Tiron represents the temperature of iron. O and T signify the amount of oxygen blowing and the temperature of a pour, and f signifies the function for calculating endpoint phosphorus content.
The primary constituent of lime is calcium oxide. To decrease the production cost per tonne of steel, limestone and raw dolomite are partially substituted for lime in practical production processes. The production cost per ton of steel can be articulated as follows:
In the formula, C represents the total cost of producing all tonnes of steel, Cm denotes the cost of raw materials, Cs signifies the cost of auxiliary materials, and Ct stands for the cost of process control. Wls, Wrd, and Wt represent the weights of limestone, raw dolomite, and molten steel respectively. Qcover refers to the smelting of molten steel with iron and scrap steel to achieve the conversion rate, typically around 90%. The parameters k0 to k8 denote the corresponding parameter unit price.
Based on the above, the relationship function for the production cost of tonnes of steel can be articulated as follows:
In this equation, f2 represents the total production cost of steel in tonnes, and a0 to a9 signify correlation coefficients.
The multi-objective function is constructed with the optimization goals of minimizing the endpoint phosphorus content (f
1) and minimizing the production cost per tonne of steel (f
2). The formulation of this multi-objective function is presented as depicted in the following equation:
According to the overall dephosphorization reaction equation [
13], it is evident that the phosphorus content in molten iron undergoes chemical reactions with the composition of auxiliary materials, resulting in the formation of stable precipitates. To ensure that the phosphorus content in the molten iron is adequately reduced to meet practical production requirements while simultaneously controlling the production cost per tonne of steel, constraints have been introduced in the optimization study of process parameters, specifically aimed at the input materials during the charging phase.
3. Proposed Methods
In the aforementioned multi-objective optimization problems, the multi-objective whale optimization algorithm (MOWOA) [
14] simulates the foraging behavior of whales, employing global search and spiral encirclement techniques to strategy significantly enhance the acquisition of globally optimal solutions within the solution space. However, in addressing the optimization of dephosphorization process parameters, due to the conflicting relationships among multiple objectives, which may necessitate compromising the endpoint phosphorus content to achieve a globally optimal solution. To tackle this issue, an improved multi-objective whale optimization algorithm (IMOWOA) will be designed by refining the MOWOA algorithm through enhancements in the population initialization strategy, incorporation of a reward function, and enhancement of search capabilities.
3.1. Optimisation of the Initialization Population
The distribution of the initialized population significantly influences both the convergence speed and accuracy of the optimization algorithm [
15]. In addressing the optimization problem of the dephosphorization process parameter, the initial set of process parameters is distributed as uniformly as possible within the search space. This uniform distribution enhances the search range of the algorithm in the solution space, thereby improving its exploration capabilities and diversity, ultimately obtaining a more optimal set of process parameters [
16]. The Sobol sequence, recognized for its low-variance, high-efficiency characteristics, serves as a deterministic sequence suitable for generating uniformly distributed points within multidimensional spaces. By employing Sobol sequences to define the set of process parameters, one can ensure that the initial distribution of the population is more uniform in the solution space. This uniformity is instrumental in accelerating the algorithm’s convergence and enhancing the accuracy of the final solution. The locations of the initial process parameters, generated using Sobol sequences, can be expressed as follows:
In the aforementioned formula, xmin and xmax, respectively, represent the lower and upper limits of the process parameter values; Kn denotes the random number generated by the Sobol sequence constrained within the interval [0, 1].
3.2. Addition of Reward Function
In the multi-objective optimization algorithm, there often exists a certain degree of mutual exclusivity among the objective functions, necessitating the optimization of the overall objective at the expense of some individual objectives. In the dephosphorization process parameter optimization problem, the primary optimization objective is to enhance dephosphorization efficiency. Consequently, it is imperative to minimize the necessity of compromising endpoint phosphorus content in the pursuit of obtaining the optimal solution set. To address this challenge, the introduction of the reward function serves to realign the search direction of the algorithm, thus prompting the optimization algorithm to greater emphasis search and optimization of the endpoint phosphorus content for improved dephosphorization results.
The configuration of the reward function directly influences the optimization process and the resultant optimal solution set of the algorithm. When the reward value is set excessively high, the algorithm may disproportionately prioritize the optimization of the rewarded objectives, thereby compromising the potentially diminishing overall performance. Conversely, if the reward value is set too low, the reward function may fail to significantly affect the search direction of the algorithm, resulting in negligible optimization effect and difficulties in achieving the intended improvements in dephosphorization efficiency. Consequently, establishing a reasonable reward function is essential. By balancing the magnitude of the reward value, it can ensure that the algorithm adequately addresses the relationship between the endpoint phosphorus content and the production cost per tonne of steel throughout the optimization process, ultimately achieving enhanced optimization results. The increased reward items are delineated in the following formula:
In the formula, the term reward refers to the reward factor, while beta denotes the number of normals employed to modulate the rate of increase of the reward factor.
Consequently, the objective function f
1 has been updated to reflect these adjustments.
In the context of this formulation, alpha represents the learning rate utilized for iterative updating of the objective function f1.
3.3. Improvement of Search Capabilities
When addressing the multi-objective dephosphorization process parameter optimization problem, the MOWOA exhibits several inherent limitations, such as a propensity to converge prematurely to local optima and a relatively slow convergence speed. In contrast, the multi-objective grey wolf optimization (MOGWO) algorithm [
17] emulates the collaborative foraging structure observed in natural grey wolf packs by employing a leader–follower structure along with a flexible individual update mechanism. This architecture facilitates effective information sharing and extensive exploration of the solution space. The synergistic integration of these algorithmic components markedly improves global exploration capabilities and diversity maintenance ability of the MOWOA algorithm, thereby effectively mitigating the solutions of converging to local optimal and improving the algorithm’s running efficiency.
Following the incorporation of the MOGWO algorithm, the MOWOA utilizes three distinct populations to identify the individual exhibiting the best fitness across the current populations, designating this individual as the target prey. The remaining individuals undergo updates based on the position of this target prey. The selection probability during the encircling prey phase and the bubble net predation phase of the algorithm is set at 50%. Furthermore, when the condition |A
i| < 1 is satisfied, the mathematical model is updated as follows:
In the current, X(t) denotes the position vector of the current individual whale, while Xi*(t) represents the position of the current optimal solution among various groups. The Di signifies the distance between the individual whale and the optimal position, and Di′ refers to the distance between each prey and the optimal position. The variable t indicates the current iteration number, while Ai and Ci are coefficient vectors that influence the algorithm’s search dynamics, Additionally, p is a randomly generated number within the interval [0, 1].
During the stochastic search phase, the algorithm randomly searches for prey throughout the search space, thereby increasing the likelihood of escaping from the local optimal. When the condition |A
i| ≥ 1 is satisfied, the mathematical model is subsequently updated as follows:
where X
rand(t) denotes the position of the randomly selected individual whales within each population. The term D
i″ represents the distance between the randomly updated individual and the optimal solution in the search space.
The optimal individual of the population is updated by taking the arithmetic mean of the optimal individuals from the three distinct populations. This procedure can be mathematically represented as follows:
3.4. Optimization Algorithm Steps
The incremental multi-objective whale optimization algorithm (IMOWOA) first employs the Sobol sequence to initialize the population, thereby enhancing diversity. A reward factor is then integrated to refine the search direction, followed by the incorporation of the grey wolf optimization (GWO) algorithm to perturb the current optimal solution, thereby augmenting global search capabilities.
Step 1: Initialize algorithm parameters, including population size, data dimensions, number of iterations, alpha, beta, etc.
Step 2: The regression function was employed to formulate a relational model delineating the association between the objective variable and the process parameters. Subsequently, set objective function by Equation (5).
Step 3: Set the constraints according to Equation (6).
Step 4: Initialize three populations using Sobol in Equation (7).
Step 5: Select individuals exhibiting high adaptive capabilities among the whale population as reference points. Assess the value of the acceleration coefficient Ai. If the |Ai| < 1, incorporate a random perturbation (p) and update the optimal positions of the different groups following the Equation (11). Conversely, the optimal positions of the various groups will be updated following Equation (12) provided.
Step 6: The optimal position of the overall population is updated under Equation (13), and the validity of this optimal position is assessed against the established constraints utilizing Equation (6).
Step 7: Following a single iteration, the reward factor is incorporated, and the objective function f1 is modified following Equation (9) to determine the current optimal position.
Step 8: the optimal position and its corresponding fitness value are updated based on the prevailing state of the whale population.
Step 9: The convergence criteria are evaluated using three termination conditions: (1) the relative change rate of the best fitness value F remains below 0.1% over five consecutive iterations, (2) the total iteration count reaches 800, (3) the relative fluctuations of both HV and Spacing metrics are maintained below 0.1% across five consecutive iterations. To ensure computational efficiency without compromising solution quality, the algorithm employs low-frequency evaluation and dynamic thresholds derived from sensitivity analysis. The optimization process terminates immediately upon satisfying any criterion; otherwise, it returns to Step 4 for continued iteration.
The flowchart of the IMOWOA is illustrated in
Figure 2.
4. Experiment and Results
The objective of this study is to identify the optimal process parameters for converter steelmaking with reduced phosphorus content at the endpoint while minimizing the production cost per tonne of steel, utilizing an optimization algorithm. The original dataset comprises 14 process parameters recorded throughout the steelmaking process, including molten iron composition (C, Si, P content), operational parameters (oxygen blowing volume, temperature), and flux proportions (lime, dolomite, etc.). During preprocessing, heats failing to meet molten steel quality standards, lacking critical parameters (e.g., endpoint phosphorus content), or exhibiting statistical outliers were removed, resulting in 4000 validated data groups retained for subsequent analysis. A subset of the processed sample data is presented in
Table 1. The preprocessed sample data that meet the specified requirements is utilized to construct a regression model that establishes the mapping relationship between process parameters (e.g., lime addition amount, oxygen blowing volume) and the endpoint phosphorus content of molten steel as well as the production cost per ton of steel. A dual-objective optimization problem is formulated by defining the objective function, and an improved multi-objective whale optimization algorithm is employed to optimize the process parameters. The parameter adjustment is specifically implemented through the following mechanisms.
To ensure physical feasibility, the optimization process integrates domain-specific constraints derived from metallurgical thermodynamics. For instance, lime dissolution exhibits nonlinear saturation when the addition exceeds 80 kg/ton (
Section 2), and temperature thresholds govern reaction kinetics. These constraints are explicitly enforced via Equation (6), ensuring that optimized parameters align with industrial practice rather than arbitrary combinations. Parameter encoding and search space definition involve encoding the process parameters as 12-dimensional decision variables, with upper and lower bounds determined based on historical data ranges. Dynamic weight allocation adaptively adjusts the optimization weights of phosphorus content and cost according to the distribution state of the current solution set. A hybrid search strategy is designed based on social hierarchy cooperation, deeply integrating the hierarchical population structure of the MOGWO with the spiral contraction mechanism of the MOWOA to achieve collaborative optimization of process parameters in high-dimensional space. Constraint handling imposes an exponential penalty on solutions that violate process boundaries, ensuring that parameter adjustments comply with industrial feasibility. Finally, the optimal parameter combination satisfying production conditions is selected from the Pareto solution set.
4.1. Modeling Effects Analysis
The complex nonlinear relationship between process parameters and endpoint phosphorus content originates from fundamental metallurgical mechanisms: the temperature-dependent competitive reaction mechanism where lower temperatures promote dephosphorization through CaO-P
2O
5 formation while simultaneously reducing slag fluidity and reaction kinetics; the threshold effect of lime dissolution demonstrating nonlinear saturation characteristics when addition exceeds 80 kg/ton; and the time-dependent reaction kinetics exhibiting distinct inflection points during different blowing stages. These physicochemical complexities manifest mathematically as strong interaction effects and non-Gaussian residual distributions in linear models. The extremely random forest (ERT) [
18] algorithm demonstrates superior capability in capturing these nonlinear relationships through three principal mechanisms: the ensemble of randomized decision trees intrinsically models multiplicative parameter interactions; the automatic feature selection weights reveal dominant nonlinear parameters; and the algorithm’s inherent robustness to collinear inputs effectively handles the strong correlation between slag-forming materials and furnace temperature. Consequently, the ERT algorithm was employed to construct a regression model for predicting the endpoint phosphorus content. To evaluate the predictive performance of this model, a random selection of 600 sets of sample data was utilized as test samples for experimentation, yielding the prediction results depicted in
Figure 3. The results indicate that the fluctuations between the model-predicted values and the actual values exhibit a similar trend.
The root mean square error (RMSE) of the prediction model for endpoint phosphorus content was measured at 0.09, with an R2 value of 0.79. Notably, the proportion of furnaces exhibiting a prediction error of endpoint phosphorus content in the range of −0.001% to 0.001% constituted 72.30% of the total furnaces, while those with a prediction error ranging from −0.003% to 0.003% accounted for 85.05% of the total furnaces. The results confirm that the model demonstrates a relatively low average prediction error and high precision, indicating its efficacy and reliability in predicting the endpoint phosphorus content.
4.2. Indicator Evaluation
Three evaluation metrics, inverted generational distance (IGD), hypervolume metric (HV), and spacing evaluation method (Spacing), are used to evaluate the convergence, diversity, and uniformity of the algorithm.
The IGD metric [
19] serves as an evaluation criterion for assessing the convergence and distribution performance of optimization algorithms. It does so by computing the minimum sum of distances between each point on the true Pareto front and the set of solutions produced by the algorithm. A smaller IGD value indicates that the algorithm achieves faster convergence to the true Pareto front with fewer iterations while preserving comprehensive coverage of the solution set. The formula for calculation IGD is as follows:
In the expression, P denotes the set of points uniformly distributed across the real Pareto front, with |P| representing the number of individuals in the set. Q refers to the Pareto optimal solution set generated by the algorithm, while d(x, Q) signifies the minimum Euclidean distance from an individual point x in P to the biological minimum of the population Q.
The HV metric [
20] quantifies the volume enclosed by the Pareto optimal solution set and serves as an indicator of both the diversity and extent of coverage provided by the algorithm’s output. The growth rate of the HV metric across iterations serves as a critical indicator of convergence efficiency. A significant early-stage increase in HV values demonstrates the algorithm’s ability to rapidly identify high-quality solutions during initial iterations, while higher final HV values quantitatively reflect the enhanced accuracy of the Pareto optimal set. The method for computing HV is illustrated as follows:
In this context, v(x, P) represents the hypervolume of the region delineated by the solution x in the optimal solution set X and the reference point P.
The Spacing metric [
21] is defined as an assessment of the standard deviation of the minimum distance between each solution and its respective neighbors within the solutions set generated by her algorithm. This metric is employed to evaluate both the distribution and uniformity of solutions in the set. A smaller Spacing metric value indicates that the algorithm effectively avoids premature convergence to local optima during the search process while achieving uniform exploration of the objective space, thereby minimizing the generation of redundant or overlapping solutions. The calculation is expressed as follows:
where P denotes the total number of solutions in the solution set, d
i represents the distance between the ith solution and its nearest neighbor, and
signifies the mean of all di values.
4.3. Pareto Solution Set Analysis
In the context of the dephosphorization optimization problem, it is imperative to concurrently optimize two objectives to attain outcomes. Typically, the optimal solution is represented by a set, known as the Pareto optimal solution set, which characterizes the trade-offs between the competing objectives.
Table 2 delineates the specific parameter configurations within the IMOWOA.All Pareto optimal solutions (
Table 3) were validated against historical production data, confirming that the proposed parameter adjustments (e.g., increased limestone substitution) adhere to known physicochemical limits (e.g., slag basicity requirements) while achieving measurable quality-cost improvements.
The IMOWOA was employed to address the multi-objective optimization model concerning the production process parameters of Q235B-Z steel grades, obtaining Pareto feasible solutions, as depicted in
Figure 4. In this multi-objective optimization framework, the dual objectives are articulated as minimizing the endpoint phosphorus content and minimizing the production cost per tonne of steel. For a comprehensive analysis, the three most optimal solutions from further examination.
The Pareto feasible solution graph presented in
Figure 4 illustrates that the IMOWOA effectively achieves a favorable trade-off between the dual objectives of minimizing endpoint phosphorous content and production cost per tonne of steel. The presence of multiple solutions in the feasible solution set makes the objectives of f
1 and f
2 research a better solution, reflecting the algorithm’s efficacy in navigating the iterative optimization process. In the optimal solution set displayed in
Table 3, the endpoint phosphorus content of the first optimized solutions is recorded as 0.0107%, alongside the consumed production cost of 3217.03 CNY per tonne of steel. Compared with the actual production data with analogous processing parameters, the endpoint phosphorus content and the production cost of a tonne of steel have been reduced by 10.8% and 5.79%, respectively. The third optimized solution yields an endpoint phosphorous content of 0.0109%, and a steel production cost of 3216.52 CNY per tonne of steel. Compared with the actual production data, two objectives are reduced by 9.17% and 5.80%, respectively.
To evaluate the stability of the IMOWOA algorithm, the Pareto optimal solution set delineated in
Table 3 underwent further verification. The first optimal objective function was selected as the initial solution, and the same parameter configurations were applied to conduct multiple experimental runs. The optimal endpoint phosphorus content and production cost per tonne of steel attained in each trial were systematically recorded to assess the robustness and stability of the algorithm.
Table 4 presents the findings from these multiple experimental evaluations, revealing the optimal endpoint phosphorus content consistently falls within the range of 0.0108% to 0.0110%, while the optimal production cost per tonne of steel varies approximately between 3210.48 CNY/t and 3215.97 CNY/t. In comparison to the initial solution, the maximum error of the endpoint phosphorus content is 0.0004%, whereas the corresponding maximum error of the production cost of a tonne of steel reaches 6.55 CNY/t. These experimental outcomes substantiate the relative stability and reliability of the IMOWOA.
4.4. Ablation Experiment Analysis
To evaluate the efficacy of the three proposed enhancement strategies for the IMOWAO, ablation experiments were conducted for each improvement strategy. Following the execution of the algorithm 30 times for each strategy, the mean and standard deviation of the results were calculated and analyzed. The findings of the analysis are presented in
Table 5.
The impact of each strategic approach on the endpoint phosphorus content and the production cost per tonne of steel has been systematically analyzed, as detailed in
Table 5. This analysis comprehensively evaluates how the different strategies influence both key performance metrics, thereby providing insights into their respective effectiveness in optimizing the steelmaking parameter process. The enhanced strategy that incorporates the Sobol sequence and reward function demonstrates notable superiority regarding endpoint phosphorus content, achieving the mean value reduction to 0.0129% with a standard deviation of merely 0.0002%. Additionally, this strategy results in a slight decrease in the production cost per tonne of steel, characterized by a standard deviation of 1.54 CNY/t, indicating its high stability and efficiency. In contrast, the improved strategy that integrates the Sobol sequence with the MOGWO algorithm results in a 0.0005% decrease in the mean endpoint phosphorus content. However, this approach led to an increase in production cost, rising to 3215.81 CNY/t with a corresponding standard deviation of 8.52 CNY/t. This observation suggests a relatively weak stability in cost control when adopting this particular strategy. When combining the reward function with the enhanced MOGWO algorithm, a significant reduction in the endpoint phosphorus content is achieved, reducing to 0.0123%. This improvement is realized without significant sacrifice in the production cost, showcasing a more favorable overall optimization effect. The IMOWOA proves particularly effective in reducing the endpoint phosphorus content, with an average value of only 0.0110%. Simultaneously, it maintains effective cost control over steel production, with an average cost noted at 3214.51 CNY/t, demonstrating a relatively small standard deviation, and indicating good stability. In summary, the IMOWOA successfully establishes an optimal balance between the endpoint phosphorus content and the production cost per tonne of steel, thereby validating its effectiveness and reasonableness in optimizing phosphorus removal during the converter steelmaking process.
4.5. Algorithm Comparison
To further substantiate the effectiveness of the IMOWOA in addressing the dephosphorization optimization problem in converter steelmaking, a comparison was conducted with several algorithms, including the mature multi-objective particle swarm algorithm [
22] (MOPSO), the multi-objective grey wolf optimization (MOGWO) algorithm, known for its strong optimization capabilities with fewer parameters, the novel multi-objective beluga whale optimization (MOBWO) algorithm, distinguished by its distinctive search paradigm [
23], the multi-objective crayfish optimization algorithm [
24] (MOCOA), and the original MOWOA. The Pareto feasible solution graph for each algorithm is illustrated in
Figure 5, while the first optimal solution in the respective solution sets is presented in
Table 6.
As illustrated in
Figure 5, the MOPSO, MOBWO, and MOCOA algorithms exhibit relatively poor Pareto optimal solution sets. During the iterative process, these algorithms are trapped in more local optimal solution cases, leading to an insufficient exploration of the search space. On the contrary, the MOGWO and MOWOA achieve a more favorable trade-off between the two objective functions; however, these algorithms compromise the performance of the objective function f
1 during the optimization search process. According to the data presented in
Table 6, the MOGWO algorithm successfully obtains a lower endpoint phosphorus content of 0.0128%, while the MOWOA achieves a reduced cost of production cost at 3205.89 CNY per tonne of steel during the iterative process.
4.6. Evaluation Indicators Analysis
To evaluate the convergence, diversity, and uniformity of the algorithm, IGD, HV, and Spacing metrics are employed. Recognizing the stochastic nature inherent in optimization algorithms, each algorithm was executed independently 30 times for the dephosphorization process parameter optimization problem. Subsequently, the mean and standard deviation of each algorithm’s solution set indices were calculated, and the results of these evaluation metrics are presented in
Table 7.
Following extensive empirical evaluations, the mean IGD of the IMOWOA is approximately 0.009, which is close to zero, The mean HV attained is around 7.80, while the mean Spacing value stands at about 0.033. In terms of the IDG metric, although the IMOWOA slightly lags behind the 0.007 of the MOGWO algorithm, it is noteworthy that the IGD produced by the IMOWOA is relatively low, with a standard deviation of 0.0015. The statistically consistent convergence patterns across multiple independent runs demonstrate that the algorithm maintains stable convergence rates without succumbing to local optima entrapment caused by stochastic factors. Comparative analysis of the HV metric reveals that IMOWOA achieves broader coverage of the Pareto optimal front compared to benchmark algorithms, indicating superior solution diversity. Although IMOWOA exhibits a slightly less uniform distribution (Spacing metric) relative to MOPSO, its near-zero Spacing values (approximating 0.033) still confirm advantageous solution spread characteristics. Experimental results collectively demonstrate that IMOWOA’s lower IGD values, higher HV values, and near-zero Spacing metrics reflect dual strengths in computational efficiency and optimization accuracy. These findings validate the algorithm’s robust performance across convergence stability, solution quality, and distribution uniformity, thereby confirming the efficacy of its architectural design. Consequently, the IMOWOA shows significant potential and effectiveness when applied to the optimization of dephosphorization process parameters.
The experimental results indicate that the solutions derived from the IMOWOA are more effective in addressing the optimization problem associated with dephosphorization process parameters in converter steelmaking. This algorithm facilitates a more favorable tradeoff between the endpoint phosphorus content and production cost per tonne of steel, leading to improved steel quality with only a marginal increase in the production cost. Therefore, the IMOWOA evidences a remarkable optimization efficacy for this specific category of dephosphorization process parameter optimization problem.
5. Conclusions
This study addresses the multi-objective optimization challenges in dephosphorization processes during converter steelmaking by proposing an integrated solution that combines metallurgical mechanisms with intelligent algorithms. A thermodynamically constrained extreme random forest model is developed, where domain knowledge is embedded through feature selection prioritization, constructing an interpretable hybrid modeling framework. To enhance search capability and balance endpoint phosphorus content with production cost objectives, three key enhancements are implemented: an optimized population initialization strategy using quasi-random sampling, a dynamic reward-penalty function for objective equilibrium, and hybridized GWO social hierarchy mechanisms to reinforce global-local search coordination. Full-scale production trials validate the approach, demonstrating a 10.8% reduction in endpoint phosphorus content and a 5.79% decrease in production costs per ton of steel while maintaining minimum cost thresholds. The algorithm’s efficacy is further corroborated by superior hypervolume and spacing metrics compared to conventional methods, establishing a robust framework for industrial-scale metallurgical optimization.
The present research investigates the application of intelligent optimization algorithms in refining the process parameters of converter steelmaking, thereby providing an effective tool for technicians. However, the proposed approach still has some limitations, for example, the study focuses solely on the aggregate input quantities of auxiliary materials without delving into the optimization of input patterns as process parameters. For future work, we plan to extend the dataset to encompass a broader range of steel grades and variations in auxiliary material input patterns. Furthermore, we intend to explore advanced intelligent optimization algorithms to further enhance optimization outcomes. Additionally, we are keen on investigating the application of neural network models to improve the predictive accuracy of endpoint phosphorus content.