Evolutionary Convolutional Neural Network Optimization with Cross-Tasks Transfer Strategy
Abstract
:1. Introduction
- (1)
- We propose a simple and effective cross-tasks transfer strategy, which can select the valuable knowledge from the original task to transfer for improving the performance of the target task. Especially at the early generations, our method can increase the optimization speed significantly, which is important when the learning time or computing resource is limited;
- (2)
- Within the case study of image classification tasks, it is demonstrated that the proposed CTS-ECNN can obtain better results than the ECNN that starts from scratch and some manually-designed state-of-the-art methods do;
- (3)
- In the framework of the proposed CTS-ECNN, when a new task is encountered, we can extract knowledge from the optimized tasks. With more knowledge achieved from related tasks, the proposed method can be applied to more tasks rapidly without considering the sequence of tasks.
2. Related Works
2.1. Evolutionary CNN Optimization
| Algorithm 1 Evolutionary convolutional neural network. | 
| Input:N: the max number of generations, k: the size of each generation, pm: the mutation probability, pc: the crossover probability. Output: Individuals of the last generation with their fitness values. 
 | 
| Algorithm 2 Performance evaluation of CNN. | 
| Input:p: the individual, Dtrain: the training dataset, Dvalid: the validation dataset, T: the epoch number, B: the training batch size, L: the loss function, η: the learning rate. Output: The classification accuracy. 
 | 
2.2. Transfer Learning
3. Materials and Method
| Algorithm 3 The pseudocode of CTS-ECNN. | 
| Input:N: the max number of generation, k: the size of each generations, DM: the target dataset. Output: The best CNN architecture for DM. 
 | 
3.1. Neural Architecture Evolution
3.2. Encoding and Extraction of Feature
| Algorithm 4 Cross-task transfer strategy (CTS). | 
| Input: {|r = 1, 2, …, k, c = 1, 2, …, M − 1}: individuals with top fitness values from the preceding (M − 1) datasets, qm: the mutation probability, TP: the transfer parameter. Output: The parent PTrans for DM. 
 | 
3.3. Construction of Subpopulation
3.4. Training and Prediction
4. Experimental Results and Discussion
4.1. Datasets
4.2. Results of the First Experimental Scenario
4.3. Results of the Second Experimental Scenario
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
Abbreviations
| CNNs | Convolutional neural networks | 
| ECNN | Evolutionary convolutional neural network | 
| EAs | Evolutionary algorithms | 
| CTS | Cross-tasks transfer strategy | 
| NAS | Neural architecture search | 
| RL | Reinforcement learning | 
| RNN | Recurrent neural network | 
| AP | Affinity propagation | 
| SGD | Stochastic gradient descent | 
References
- Krizhevsky, A.; Sutskever, I.; Hinton, G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012, 25, 1097–1105. [Google Scholar] [CrossRef]
- Jamshidi, M.; Lalbakhsh, A.; Lotfi, S.; Siahkamari, H.; Mohamadzade, B.; Jalilian, J. A neuro-based approach to designing a wilkinson power divider. Int. J. Microw. Comput. Aided Eng. 2020, 30, e22091. [Google Scholar] [CrossRef]
- Roshani, S.; Jamshidi, M.B.; Mohebi, F.; Roshani, S. Design and modeling of a compact power divider with squared resonators using artificial intelligence. Wirel. Pers. Commun. 2021, 117, 2085–2096. [Google Scholar] [CrossRef]
- Jamshidi, M.B.; Lalbakhsh, A.; Talla, J.; Peroutka, Z.; Roshani, S.; Matousek, V.; Roshani, S.; Mirmozafari, M.; Malek, Z.; Spada, L.L.; et al. Deep learning techniques and covid-19 drug discovery: Fundamentals, state-of-the-art and future directions. Emerg. Technol. During Era COVID Pandemic 2021, 348, 9. [Google Scholar]
- Jamshidi, M.B.; Alibeigi, N.; Rabbani, N.; Oryani, B.; Lalbakhsh, A. Artificial neural networks: A powerful tool for cognitive science. In Proceedings of the 2018 IEEE 9th Annual Information Technology, Electronics and Mobile Communication Conference (IEMCON), Vancouver, BC, Canada, 1–3 November 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 674–679. [Google Scholar]
- Bello, I.; Zoph, B.; Vasudevan, V.; Le, Q.V. Neural optimizer search with reinforcement learning. In Proceedings of the International Conference on Machine Learning, Sydney, Australia, 6–11 August 2017; pp. 459–468. [Google Scholar]
- Snoek, J.; Larochelle, H.; Adams, R.P. Practical bayesian optimization of machine learning algorithms. In Proceedings of the Neural Information Processing Systems, Lake Tahoe, NV, USA, 3–6 December 2012. [Google Scholar]
- Pontes, F.; Amorim, G.; Balestrassi, P.; Paiva, A.; Ferreira, J. Design of experiments and focused grid search for neural network parameter optimization. Neurocomputing 2016, 186, 22–34. [Google Scholar] [CrossRef]
- Bergstra, J.; Bengio, Y. Random search for hyper-parameter optimization. J. Mach. Learn. Res. 2012, 13, 281–305. [Google Scholar]
- Bengio, Y. Gradient-based optimization of hyperparameters. Neural Comput. 2000, 12, 1889–1900. [Google Scholar] [CrossRef] [PubMed]
- Bäck, T.; Schwefel, H.-P. An overview of evolutionary algorithms for parameter optimization. Evol. Comput. 1993, 1, 1–23. [Google Scholar] [CrossRef]
- Ramakurthi, V.B.; Manupati, V.; Machado, J.; Varela, L. A hybrid multi-objective evolutionary algorithm-based semantic foundation for sustainable distributed manufacturing systems. Appl. Sci. 2021, 11, 6314. [Google Scholar] [CrossRef]
- Abualigah, L.; Diabat, A.; Sumari, P.; Gandomi, A.H. A novel evolutionary arithmetic optimization algorithm for multilevel thresholding segmentation of covid-19 ct images. Processes 2021, 9, 1155. [Google Scholar] [CrossRef]
- Guerraiche, K.; Dekhici, L.; Chatelet, E.; Zeblah, A. Multi-objective electrical power system design optimization using a modified bat algorithm. Energies 2021, 14, 3956. [Google Scholar] [CrossRef]
- Yılmaz, E.M.; Güntert, P.; Etaner-Uyar, Ş. Evaluation of multi-objective optimization algorithms for nmr chemical shift assignment. Molecules 2021, 26, 3699. [Google Scholar] [CrossRef]
- Ponti, A.; Candelieri, A.; Archetti, F. A new evolutionary approach to optimal sensor placement in water distribution networks. Water 2021, 13, 1625. [Google Scholar] [CrossRef]
- Zhou, L.; Feng, L.; Gupta, A.; Ong, Y.; Liu, K.; Chen, C.; Sha, E.; Yang, B.; Yan, B.W. Solving dynamic vehicle routing problem via evolutionary search with learning capability. In Proceedings of the IEEE Congress on Evolutionary Computation, San Sebastián, Spain, 5–8 June 2017; pp. 890–896. [Google Scholar]
- Feng, L.; Ong, Y.; Jiang, S.; Gupta, A. Autoencoding evolutionary search with learning across heterogeneous problems. IEEE Trans. Evol. Comput. 2017, 21, 760–772. [Google Scholar] [CrossRef]
- Iqbal, M.; Xue, B.; Al-Sahaf, H.; Zhang, M. Cross-domain reuse of extracted knowledge in genetic programming for image classification. IEEE Trans. Evol. Comput. 2017, 21, 569–587. [Google Scholar] [CrossRef]
- Xu, Q.; Wang, N.; Wang, L.; Li, W.; Sun, Q. Multi-task optimization and multi-task evolutionary computation in the past five years: A brief review. Mathematics 2021, 9, 864. [Google Scholar] [CrossRef]
- Dumitru, D.; Dioșan, L.; Andreica, A.; Bálint, Z. A transfer learning approach on the optimization of edge detectors for medical images using particle swarm optimization. Entropy 2021, 23, 414. [Google Scholar] [CrossRef]
- Chu, S.-C.; Zhuang, Z.; Li, J.; Pan, J.-S. A novel binary quasi-affine transformation evolutionary (quatre) algorithm. Appl. Sci. 2021, 11, 2251. [Google Scholar] [CrossRef]
- Xie, L.; Yuille, A. Genetic CNN. In Proceedings of the IEEE International Conference on Computer Vision ICCV, Venice, Italy, 22–29 October 2017; pp. 1388–1397. [Google Scholar]
- Suganuma, M.; Shirakawa, S.; Nagao, T. A genetic programming approach to designing convolutional neural network architectures. In Proceedings of the Genetic and Evolutionary Computation Conference, Berlin, Germany, 15–19 July 2017; pp. 497–504. [Google Scholar]
- Real, E.; Aggarwal, A.; Huang, Y.; Le, Q. Regularized evolution for image classifier architecture search. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 4780–4789. [Google Scholar]
- Sun, Y.; Xue, B.; Zhang, M.; Yen, G.; Lv, J. Automatically designing cnn architectures using the genetic algorithm for image classification. IEEE Trans. Cybern. 2020, 50, 3840–3854. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Lu, Z.; Whalen, I.; Boddeti, V.; Dhebar, Y.; Deb, K.; Goodman, E.; Banzhaf, W. Nsga-net: Neural architecture search using multi-objective genetic algorithm. In Proceedings of the Genetic and Evolutionary Computation Conference, Prague, Czech Republic, 13–17 July 2019; pp. 419–427. [Google Scholar]
- Gutstein, O.F.S.; Freudenthal, E. Knowledge transfer in deep convolutional neural nets. Int. J. Artif. Intell. Tools 2008, 17, 555–567. [Google Scholar] [CrossRef] [Green Version]
- Terekhov, A.V.; Montone, G.; O’Regan, J. Knowledge transfer in deep block-modular neural networks. arXiv 2015, arXiv:abs/1908.08017. [Google Scholar]
- Yosinski, J.; Clune, J.; Bengio, Y.; Lipson, H. How Transferable Are Features in Deep Neural Networks? In Advances in Neural Information Processing Systems 27; Curran Associates, Inc.: New York, NY, USA, 2014; pp. 3320–3328. [Google Scholar]
- Gupta, A.; Ong, Y.; Feng, L. Multifactorial evolution: Toward evolutionary multitasking. IEEE Trans. Evol. Comput. 2016, 20, 343–357. [Google Scholar] [CrossRef]
- Feng, L.; Zhou, L.; Zhong, J.; Gupta, A.; Ong, Y.; Tan, K.; Qin, A.K. Evolutionary multitasking via explicit autoencoding. IEEE Trans. Cybern. 2019, 49, 3457–3470. [Google Scholar] [CrossRef]
- Gong, M.; Tang, Z.; Li, H.; Zhang, J. Evolutionary multitasking with dynamic resource allocating strategy. IEEE Trans. Evol. Comput. 2019, 23, 858–869. [Google Scholar] [CrossRef]
- Frey, B.J.; Dueck, D. Clustering by passing messages between data points. Science 2007, 315, 972–976. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Delving deep into rectifiers: Surpassing human-level performance on imagenet classification. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), Boston, MA, USA, 7–12 June 2015; pp. 1026–1034. [Google Scholar]
- He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
- Goodfellow, I.; Warde-Farley, D.; Mirza, M.; Courville, A.; Bengio, Y. Maxout networks. In Proceedings of the 30th International Conference on Machine Learning, Atlanta, GA, USA, 16–21 June 2013; pp. 1319–1327. [Google Scholar]
- Lin, M.; Chen, Q.; Yan, S. Network in network. In Proceedings of the International Conference on Learning Representations, Banff, AB, Canada, 14–16 April 2014; pp. 1–10. [Google Scholar]
- Srivastava, R.K.; Greff, K.; Schmidhuber, J. Highway networks. In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015; pp. 1–6. [Google Scholar]
- Liu, H.; Simonyan, K.; Vinyals, O.; Fernando, C.; Kavukcuoglu, K. Hierarchical representations for efficient architecture search. arXiv 2017, arXiv:1711.00436. [Google Scholar]





| Gen | Max % | Diff % | Avg % | Diff % | Med % | Diff % | 
|---|---|---|---|---|---|---|
| 01 | 93.52 | 2.43 | 91.49 | 1.72 | 91.73 | 2.13 | 
| 05 | 93.52 | 1.48 | 91.76 | 1.26 | 91.67 | 0.73 | 
| 10 | 93.52 | 1.00 | 91.72 | 1.17 | 91.95 | 1.03 | 
| 15 | 93.75 | 1.23 | 91.93 | 0.37 | 91.93 | 0.38 | 
| 20 | 93.81 | 1.12 | 92.12 | 1.06 | 92.45 | 1.52 | 
| 25 | 93.81 | 1.03 | 92.70 | 1.10 | 92.89 | 1.33 | 
| 30 | 94.12 | 1.01 | 92.58 | 0.60 | 92.42 | 0.30 | 
| 35 | 94.15 | 0.88 | 92.20 | 1.61 | 92.13 | 0.72 | 
| 40 | 94.15 | 0.67 | 92.35 | 1.79 | 92.50 | 1.33 | 
| 45 | 94.37 | 0.88 | 93.38 | 1.78 | 93.28 | 1.73 | 
| 50 | 94.37 | 0.88 | 93.10 | 1.22 | 93.13 | 1.38 | 
| Method | Acc % | Gen | |
|---|---|---|---|
| Manually Designed | ResNet (depth = 101) | 93.57 | – | 
| ResNet (depth = 1202) | 92.07 | – | |
| Maxout | 90.70 | – | |
| Network in Network | 91.19 | – | |
| Highway Network | 92.40 | – | |
| Automatically Designed | Hierarchical Evolution | 96.37 | 7000 | 
| CGP-CNN | 94.02 | 300 | |
| Genetic CNN | 92.90 | 50 | |
| CTS-ECNN (G-10) | 91.46 | 10 | |
| CTS-ECNN (G-30) | 92.06 | 30 | |
| CTS-ECNN (G-50) | 92.42 | 50 | 
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. | 
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Wang, Z.; Lu, D.; Wang, H.; Liu, T.; Li, P. Evolutionary Convolutional Neural Network Optimization with Cross-Tasks Transfer Strategy. Electronics 2021, 10, 1857. https://doi.org/10.3390/electronics10151857
Wang Z, Lu D, Wang H, Liu T, Li P. Evolutionary Convolutional Neural Network Optimization with Cross-Tasks Transfer Strategy. Electronics. 2021; 10(15):1857. https://doi.org/10.3390/electronics10151857
Chicago/Turabian StyleWang, Zhao, Di Lu, Huabing Wang, Tongfei Liu, and Peng Li. 2021. "Evolutionary Convolutional Neural Network Optimization with Cross-Tasks Transfer Strategy" Electronics 10, no. 15: 1857. https://doi.org/10.3390/electronics10151857
APA StyleWang, Z., Lu, D., Wang, H., Liu, T., & Li, P. (2021). Evolutionary Convolutional Neural Network Optimization with Cross-Tasks Transfer Strategy. Electronics, 10(15), 1857. https://doi.org/10.3390/electronics10151857
 
         
                                                


 
       