Balanced Grey Wolf Optimizer Algorithm for Backpropagation Neural Networks
Abstract
1. Introduction
- (1)
- A novel stochastic position update formula is proposed, which reduces the dependence of the wolf pack on the leading wolves and enhances the global search capability of the GWO.
- (2)
- A novel nonlinear convergence factor is proposed, enabling the GWO to achieve rapid exploration in the initial iterations and meticulous exploitation in the later iterations. The global and local search performance of the BGWO is validated using six benchmark functions.
- (3)
- Based on the BGWO, a training method for a BPNN is constructed. Three widely used public datasets are used to train and test BPNNs under different training methods. The results show that, under the experimental settings of this paper, the BPNN trained with the BGWO (BGWO-BPNN) demonstrates superior predictive performance.
2. Traditional Grey Wolf Optimizer
3. Balanced Grey Wolf Optimizer
3.1. Selection of Benchmark Functions
3.2. Novel Stochastic Position Update Formula
3.3. Novel Nonlinear Convergence Factor
3.4. Algorithm Performance Validation
4. BPNN Based on BGWO
4.1. Construction of BGWO-BPNN
4.2. Test Results and Discussion
5. Conclusions
- (1)
- This paper proposes a Balanced Grey Wolf Optimizer algorithm. First, we introduce a novel stochastic position update formula to reduce the dependence of the wolf pack on the leading wolves, thereby enhancing the global search capability of the GWO. Then, a novel nonlinear convergence factor is proposed, and the optimal convergence coefficient is explored, enabling GWO to achieve rapid exploration in the initial iterations and meticulous exploitation in the later iterations.
- (2)
- Six commonly used benchmark functions are selected as the objective functions to test the BGWO (which incorporates both the stochastic position update formula and the nonlinear convergence factor), the GWO with only the stochastic position update formula (SPU-GWO), and the traditional GWO. The test results validate that the BGWO demonstrates superior global exploration and local exploitation capabilities.
- (3)
- A BPNN training method based on the BGWO is constructed. Following the vector encoding strategy, the weights and biases in a BPNN are mapped to the position vectors of grey wolves in the BGWO, and the calculation method for the fitness function is provided. Finally, a complete BPNN training process is designed.
- (4)
- Three public datasets—Abalone, Boston House Prices, and Energy Efficiency—are selected to train and test BPNNs under different training methods. Furthermore, we use the Wilcoxon test to examine its significance. Under the experimental settings of this paper, the test results demonstrate an absolute reduction of 1.976, 3.120, and 5.662 percentage points in the MAPE achieved by BGWO-BPNN over the traditional BPNN. Compared to LM-BPNN, BGWO-BPNN also achieves a certain reduction in MAPE. This indicates that, under the conditions established in this paper, BGWO shows certain competitiveness compared to LM and GD.
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Chen, S.; Cowan, C.; Grant, P. Orthogonal least squares learning algorithm for radial. IEEE Trans. Neural Netw. 1991, 2, 302–309. [Google Scholar] [CrossRef]
- Pollack, J. Recursive auto-associative memory: Devising compositional distributed representations. In Proceedings of the 10th Annual Conference Cognitive Science Society Pod, Portland, OR, USA, 11–14 August 2010; Taylor & Francis Group: Oxford, UK, 2019; pp. 33–39. [Google Scholar]
- LeCun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 2002, 86, 2278–2324. [Google Scholar] [CrossRef]
- Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536. [Google Scholar] [CrossRef]
- Dauphin, Y.N.; Pascanu, R.; Gulcehre, C.; Cho, K.; Ganguli, S.; Bengio, Y. Identifying and attacking the saddle point problem in high-dimensional non-convex optimization. Adv. Neural Inf. Process Syst. 2014, 27, 1–14. [Google Scholar] [CrossRef]
- Bengio, Y.; Simard, P.; Frasconi, P. Learning long-term dependencies with gradient descent is difficult. IEEE Trans. Neural Netw. 1994, 5, 157–166. [Google Scholar] [CrossRef]
- Patrick, P. Minimisation methods for training feedforward Neural Networks. Neural Netw. 1994, 7, 1–11. [Google Scholar] [CrossRef]
- Gao, X.; Yang, F. A new method for training MLP neural networks. Chin. J. Comput. 1996, 9, 687–694. (In Chinese) [Google Scholar]
- Peng, S.; Fang, Z. The Joint optimization of BP Learning Algorithm. J. Circuits Syst. 2000, 5, 26–30, (In Chinese with English Abstract). [Google Scholar]
- Mirjalili, S. How effective is the Grey Wolf optimizer in training multi-layer perceptrons. Appl. Intell. 2015, 43, 150–161. [Google Scholar] [CrossRef]
- Leema, N.; Nehemiah, H.K.; Kannan, A. Neural network classifier optimization using differential evolution with global information and back propagation algorithm for clinical datasets. Appl. Soft Comput. 2016, 49, 834–844. [Google Scholar] [CrossRef]
- Zhang, R.; Tao, J. A nonlinear fuzzy neural network modeling approach using an improved genetic algorithm. IEEE Trans. Ind. Electron. 2017, 65, 5882–5892. [Google Scholar] [CrossRef]
- Xue, J.; Shen, B. A novel swarm intelligence optimization approach: Sparrow search algorithm. Syst. Sci. Control Eng. 2020, 8, 22–34. [Google Scholar] [CrossRef]
- Zhao, W.; Wang, L.; Zhang, Z. A novel atom search optimization for dispersion coefficient estimation in groundwater. Future Gener. Comput. Syst. 2019, 91, 601–610. [Google Scholar] [CrossRef]
- Dorigo, M.; Birattari, M.; Stützle, T. Ant colony optimization-artificial ants as a computational intelligence technique. IEEE Comput. Intell. Mag. 2006, 1, 28–39. [Google Scholar] [CrossRef]
- Tang, Y.; Cui, M.; Hua, C.; Li, L.; Yang, Y. Optimum design of fractional order PIλDμ controller for AVR system using chaotic ant swarm. Expert Syst. Appl. 2012, 39, 6887–6896. [Google Scholar] [CrossRef]
- Zhang, Y.; Huang, D.; Ji, M.; Xie, F. Image segmentation using PSO and PCM with Mahalanobis distance. Expert Syst. Appl. 2011, 38, 9036–9040. [Google Scholar] [CrossRef]
- Zou, D.; Gao, L.; Li, S.; Wu, J. An effective global harmony search algorithm for reliability problems. Expert Syst. Appl. 2011, 38, 4642–4648. [Google Scholar] [CrossRef]
- Mirjalili, S.; Mirjalili, S.M.; Lewis, A. Grey wolf optimizer. Adv. Eng. Softw. 2014, 69, 46–61. [Google Scholar] [CrossRef]
- Agarwal, J.; Parmar, G.; Gupta, R.; Sikander, A. Analysis of grey wolf optimizer based fractional order PID controller in speed control of DC motor. Microsyst. Technol. 2018, 24, 4997–5006. [Google Scholar] [CrossRef]
- Zhang, H.; Cai, Z.; Xiao, L.; Heidari, A.A.; Chen, H.; Zhao, D.; Wang, S.; Zhang, Y. Face image segmentation using boosted grey wolf optimizer. Biomimetics 2023, 8, 484. [Google Scholar] [CrossRef]
- Gan, W.; Zhou, X.; Wu, W.; Xu, C.-A. Research on the Optimization of Uncertain Multi-Stage Production Integrated Decisions Based on an Improved Grey Wolf Optimizer. Biomimetics 2025, 10, 775. [Google Scholar] [CrossRef]
- Cui, J.; Zhang, N.; Cui, X.; Wang, J.; Yu, M.; Liu, D.; Jiang, L. Fault diagnosis method of aircraft anti-skid brake system based on GWO-PNN. In Proceedings of the 2021 33rd Chinese Control and Decision Conference (CCDC), Kunming, China, 22–24 May 2021; pp. 5402–5406. [Google Scholar] [CrossRef]
- Zhou, L.; Yao, G. An Optimized GWO-BPNN Model for Predicting Corrosion Fatigue Performance of Stay Cables in Coastal Environments. J. Mar. Sci. Eng. 2024, 12, 2308. [Google Scholar] [CrossRef]
- Zhang, H.-T.; Yang, T.-T.; Wang, W.-T. A novel hybrid model for species distribution prediction using neural networks and Grey Wolf Optimizer algorithm. Sci. Rep. 2024, 14, 11505. [Google Scholar] [CrossRef]
- Luo, J.; Liu, Z. Novel grey wolf optimization based on modified differential evolution for numerical function optimization. Appl. Intell. 2020, 50, 468–486. [Google Scholar] [CrossRef]
- Nadimi-Shahraki, M.H.; Taghian, S.; Mirjalili, S. An improved grey wolf optimizer for solving engineering problems. Expert Syst. Appl. 2021, 166, 113917. [Google Scholar] [CrossRef]
- Simon, D. Biogeography-based optimization. IEEE Trans. Evol. Comput. 2008, 12, 702–713. [Google Scholar] [CrossRef]
- Wang, G.-G.; Deb, S.; Gandomi, A.H.; Zhang, Z.; Alavi, A.H. Chaotic cuckoo search. Soft Comput. 2016, 20, 3349–3362. [Google Scholar] [CrossRef]
- Padhy, S.; Panda, S.; Mahapatra, S. A modified GWO technique based cascade PI-PD controller for AGC of power systems in presence of Plug in Electric Vehicles. Eng. Sci. Technol. 2017, 20, 427–442. [Google Scholar] [CrossRef]
- Ali, M.H.; Al Mohammed, B.A.D.; Ismail, A.; Zolkipli, M.F. A new intrusion detection system based on fast learning network and particle swarm optimization. IEEE Access 2018, 6, 20255–20261. [Google Scholar] [CrossRef]
- Zhang, J.-R.; Zhang, J.; Lok, T.-M.; Lyu, M.R. A hybrid particle swarm optimization–back-propagation algorithm for feedforward neural network training. Appl. Math. Comput. 2007, 185, 1026–1037. [Google Scholar] [CrossRef]











| Fun | Name | Multimodal | Mathematical Expression | Range | fmin |
|---|---|---|---|---|---|
| F1 | Sphere | No | [−30, 30] | 0 | |
| F2 | Rosenbrock | No | [−30, 30] | 0 | |
| F3 | Schwefel 1.2 | No | [−30, 30] | 0 | |
| F4 | Schwefel 2.22 | Yes | [−30, 30] | 0 | |
| F5 | Griewank | Yes | [−600, 600] | 0 | |
| F6 | Michalewicz | Yes | [0, π] | When d = 30, fmin = −29.63 |
| Fun | Metric | BGWO | SPU-GWO | GWO |
|---|---|---|---|---|
| F1 | MEAN | 8.273 × 10−4 | 0.001 | 61.782 |
| STD | 9.247 × 10−4 | 0.001 | 23.159 | |
| Runtime (s) | 0.046 | 0.040 | 0.043 | |
| F2 | MEAN | 28.932 | 29.187 | 7.002 × 104 |
| STD | 0.0624 | 0.196 | 5.617 × 104 | |
| Runtime (s) | 0.038 | 0.035 | 0.035 | |
| F3 | MEAN | 0.006 | 0.092 | 680.283 |
| STD | 0.005 | 0.119 | 241.96 | |
| Runtime (s) | 0.043 | 0.040 | 0.042 | |
| F4 | MEAN | 0.095 | 0.166 | 178.879 |
| STD | 0.065 | 0.098 | 1.021 × 103 | |
| Runtime (s) | 0.046 | 0.042 | 0.044 | |
| F5 | MEAN | 0.143 | 0.247 | 7.305 |
| STD | 0.351 | 0.367 | 2.131 | |
| Runtime (s) | 0.042 | 0.039 | 0.042 | |
| F6 | MEAN | −9.723 | −9.344 | −9.178 |
| STD | 1.242 | 0.870 | 0.846 | |
| Runtime (s) | 0.038 | 0.037 | 0.036 |
| Dataset Name | Input/Output | Metric |
|---|---|---|
| Abalone | Input Values | Sex, Length, Diameter, Height, Whole weight, Shucked weight, Viscera weight, Shell weight, and Rings |
| Output Values | Age of the Abalone | |
| Boston House Prices | Input Values | CRIM, ZN, INDUS, NOX, RM, AGE, DIS, RAD, TAX, PIRATIO, B, and LSTAT |
| Output Values | Median House Price | |
| Energy Efficiency | Input Values | Relative Compactness, Surface Area, Wall Area, Roof Area, Overall Height, Orientation, Glazing Area, and Glazing Area Distribution |
| Output Values | Heating Load |
| Dataset | Number of Inputs | Number of Neurons in Hidden Layer 1 | Number of Neurons in Hidden Layer 2 | Number of Outputs |
|---|---|---|---|---|
| Abalone | 9 | 6 | 5 | 1 |
| Boston House Prices | 12 | 8 | 7 | 1 |
| Energy Efficiency | 8 | 6 | 5 | 1 |
| Algorithm | Parameter | Value |
|---|---|---|
| BGWO | Population Size | 30 |
| Runtime (s) | 3 | |
| LM | Learning Rate | 10−5 |
| Runtime (s) | 3 | |
| GD | Learning Rate | 10−5 |
| Runtime (s) | 3 |
| Dataset | Neural Network Type | MAPE | Absolute Reduction in Percentage Points Achieved by BGWO-BPNN |
|---|---|---|---|
| Abalone | BGWO-BPNN | 15.397 | 0 |
| LM-BPNN | 15.850 | 0.453 | |
| BPNN | 17.373 | 1.976 | |
| Boston House Prices | BGWO-BPNN | 12.136 | 0 |
| LM-BPNN | 13.922 | 1.786 | |
| BPNN | 15.256 | 3.120 | |
| Energy Efficiency | BGWO-BPNN | 5.689 | 0 |
| LM-BPNN | 6.462 | 0.773 | |
| BPNN | 11.351 | 5.662 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Chen, J.; Zhu, H.; Shu, T.; Cao, C.; Deng, Y.; Cheng, Q. Balanced Grey Wolf Optimizer Algorithm for Backpropagation Neural Networks. Mathematics 2026, 14, 554. https://doi.org/10.3390/math14030554
Chen J, Zhu H, Shu T, Cao C, Deng Y, Cheng Q. Balanced Grey Wolf Optimizer Algorithm for Backpropagation Neural Networks. Mathematics. 2026; 14(3):554. https://doi.org/10.3390/math14030554
Chicago/Turabian StyleChen, Jiashuo, Hao Zhu, Tanjile Shu, Chengkun Cao, Yuanwang Deng, and Qing Cheng. 2026. "Balanced Grey Wolf Optimizer Algorithm for Backpropagation Neural Networks" Mathematics 14, no. 3: 554. https://doi.org/10.3390/math14030554
APA StyleChen, J., Zhu, H., Shu, T., Cao, C., Deng, Y., & Cheng, Q. (2026). Balanced Grey Wolf Optimizer Algorithm for Backpropagation Neural Networks. Mathematics, 14(3), 554. https://doi.org/10.3390/math14030554
