A Hybrid Optimization Approach for Multi-Generation Intelligent Breeding Decisions
Abstract
1. Introduction
2. Methods
2.1. Problem Description and Modeling
2.1.1. Defining the Problem of MGIB
2.1.2. Modeling for MGIB
2.1.3. Modeling with Markov Decision Process
- (1)
- State Representation
- •
- I: SNP locus index;
- •
- J: individual index;
- •
- C: chromosome copy (1: maternal, 2: paternal);
- •
- : additive effect of locus i;
- •
- : allele at locus i, copy c of individual j in generation t (0: ancestral, 1: favorable);
- •
- : recombination frequency between loci i and i + 1;
- •
- : remaining budget (number of offspring that can be produced);
- •
- : cost per cross (set to 1 budget unit per offspring);
- •
- : maximum allowable cumulative recombination probability;
- •
- : equals 1 if allele at locus i, copy c of individual j is selected;
- •
- : equals 1 if individual j is selected as a parent;
- •
- : equals 1 if a recombination event occurs between loci i and i + 1 for parent j.
- (2)
- Action Space
- (3)
- Reward Function
2.2. Overall Framework
2.3. ILAS
2.3.1. The Workflow of ILAS
2.3.2. Global Search Optimization Based on EDA
- Selection repair: If the sampled vector contains more than S ones, the excess is randomly removed; if fewer, additional individuals are randomly selected among those with the highest marginal probability.
- Mating repair: If the number of planned crosses exceeds , pairs are randomly dropped until the limit is satisfied; if the matrix violates the per-individual mating limit, the exceeding pairs are reassigned to other feasible individuals.
| Component | Parameter | Value | Description |
|---|---|---|---|
| EDA | Population size (P) | 100 | Number of candidate solutions per generation |
| Elite ratio (η) | 0.2 | Proportion of top solutions used to update probability model | |
| Learning rate (ε) | 0.05 | Smoothing factor for probability vector update | |
| Stopping criterion | 100 iterations | Maximum number of EDA generations | |
| SA | Initial temperature () | 100 | Starting temperature for acceptance probability |
| Cooling factor (α) | 0.95 | Temperature reduction multiplied per iteration | |
| Moves per temperature (L) | 50 | Number of neighborhood attempts at each temperature | |
| Minimum temperature () | 0.01 | Lower bound for temperature termination | |
| Selection & Mating | Selected parents (S) | 20 | Fixed number of individuals chosen per generation |
| Max crosses per individual | 10 | Maximum number of mating pairs an individual can participate in | |
| Total crosses () | 10 | Upper limit on the number of mating pairs per generation |
2.4. DQN for Resource Allocation in MGIB
| Algorithm 1 DON-based learning data generation |
| Input: T;
; A; β; r;
Output: multiple complete episodes of data Initialization: DQN network; Target network; Experience replay pool Set hyperparameters: γ, ε, batch size, buffer capacity
|
| Algorithm 2 Genome resource allocation based on DQN |
|
3. Results
3.1. Simulation Settings
3.1.1. Datasets
- (1)
- Corn2019 Dataset (Table 3)
| SNP | R001 | R002 | R003 | R004 | R005 |
|---|---|---|---|---|---|
| SNP1 | 0/0 | 0/0 | 1/1 | 0/0 | 0/0 |
| SNP2 | 0/0 | 0/1 | 0/0 | 0/0 | 0/1 |
| SNP3 | 0/0 | 0/0 | 0/1 | 1/1 | 1/0 |
| SNP4 | 0/1 | 1/1 | 0/0 | 1/1 | 0/0 |
| SNP5 | 1/1 | 1/0 | 1/1 | 0/0 | 0/0 |
- (2)
- Cubic Dataset
3.1.2. CUBIC Dataset Preprocessing and BLUE Imputation
- Conservative filtering: loci > 15% missing, individuals > 40% missing;
- Liberal filtering: loci > 25% missing, individuals > 60% missing;
- Alternative imputation: mean imputation instead of BLUE.
3.1.3. Simulation Parameter Settings
- In the process from t to t + 1, each selected pair of parents produces multiple offspring.
- In the look-ahead simulation from t = 1 to T−1, all individuals undergo completely random mating (including selfing).
3.1.4. Computational Requirements and Scalability Analysis
3.2. Simulation Results
3.2.1. Performance Comparison of ILAS in MGIB
3.2.2. Function Approximator Selection and Performance Analysis
3.2.3. Statistical Analysis of ILAS-DQN Performance
4. Discussion
4.1. Effectiveness and Advantages of Algorithm Fusion
4.2. Limitations of the Study
4.3. Practical Implications and Future Outlook
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Meuwissen, T.H.; Hayes, B.J.; Goddard, M. Prediction of total genetic value using genome-wide dense marker maps. Genetics 2001, 157, 1819–1829. [Google Scholar] [CrossRef] [PubMed]
- Armañanzas, R.; Inza, I.; Santana, R.; Saeys, Y.; Flores, J.L.; Lozano, J.A.; Peer, Y.V.D.; Blanco, R.; Robles, V.; Bielza, C. A review of estimation of distribution algorithms in bioinformatics. BioData Min. 2008, 1, 6. [Google Scholar] [CrossRef] [PubMed]
- Atanda, S.A.; Bandillo, N. Genomic-inferred cross-selection methods for multi-trait improvement in a recurrent selection breeding program. Plant Methods 2024, 20, 133. [Google Scholar] [CrossRef]
- Crossa, J.; Pérez-Rodríguez, P.; Cuevas, J.; Montesinos-López, O.; Jarquín, D.; De Los Campos, G.; Burgueño, J.; González-Camacho, J.M.; Pérez-Elizalde, S.; Beyene, Y. Specifically, the workflow of this framework in plant breeding: Methods, models, and perspectives. Trends Plant Sci. 2017, 22, 961–975. [Google Scholar] [CrossRef]
- Krishnappa, G.; Savadi, S.; Tyagi, B.S.; Singh, S.K.; Mamrutha, H.M.; Kumar, S.; Mishra, C.N.; Khan, H.; Gangadhara, K.; Uday, G. Integrated genomic selection for rapid improvement of crops. Genomics 2021, 113, 1070–1086. [Google Scholar] [CrossRef]
- Cui, Y.; Li, R.; Li, G.; Zhang, F.; Zhu, T.; Zhang, Q.; Ali, J.; Li, Z.; Xu, S. Hybrid breeding of rice via genomic selection. Plant Biotechnol. J. 2020, 18, 57–67. [Google Scholar] [CrossRef]
- Hammer, G.L.; McLean, G.; van Oosterom, E.; Chapman, S.; Zheng, B.; Wu, A.; Doherty, A.; Jordan, D. Designing crops for adaptation to the drought and high-temperature risks anticipated in future climates. Crop Sci. 2020, 60, 605–621. [Google Scholar] [CrossRef]
- Gorjanc, G.; Gaynor, R.C.; Hickey, J.M. Optimal cross selection for long-term genetic gain in two-part programs with rapid recurrent genomic selection. Theor. Appl. Genet. 2018, 131, 1953–1966. [Google Scholar] [CrossRef] [PubMed]
- Moeinizade, S.; Hu, G.; Wang, L.; Schnable, P.S. Optimizing selection and mating in genomic selection with a look-ahead approach: An operations research framework. G3 Genes Genomes Genet. 2019, 9, 2123–2133. [Google Scholar] [CrossRef]
- Liu, H.J.; Wang, X.; Xiao, Y.; Luo, J.; Qiao, F.; Yang, W.; Zhang, R.; Meng, Y.; Sun, J.; Yan, S.; et al. CUBIC: An atlas of genetic architecture promises directed maize improvement. Genome Biol. 2020, 21, 20. [Google Scholar] [CrossRef]
- Chen, L.; Li, C.; Sargolzaei, M.; Schenkel, F. Impact of genotype imputation on the performance of GBLUP and Bayesian methods for genomic prediction. PLoS ONE 2014, 9, e101544. [Google Scholar] [CrossRef]
- Liu, Y.; Lu, S.; Liu, F.; Shao, C.; Zhou, Q.; Wang, N.; Li, Y.; Yang, Y.; Zhang, Y.; Sun, H. Genomic selection using BayesCπ and GBLUP for resistance against Edwardsiella tarda in Japanese flounder (Paralichthys olivaceus). Mar. Biotechnol. 2018, 20, 559–565. [Google Scholar] [CrossRef]
- Mahendran, N.; Durai Raj Vincent, P.; Srinivasan, K.; Chang, C.-Y. Machine learning based computational gene selection models: A survey, performance evaluation, open issues, and future research directions. Front. Genet. 2020, 11, 603808. [Google Scholar] [CrossRef] [PubMed]
- Gorjanc, G.; Jenko, J.; Hearne, S.J.; Hickey, J.M. Initiating maize pre-breeding programs using genomic selection to harness polygenic variation from landrace populations. BMC Genom. 2016, 17, 30. [Google Scholar] [CrossRef] [PubMed]
- Akdemir, D.; Beavis, W.; Fritsche-Neto, R.; Singh, A.K.; Isidro-Sánchez, J. Multi-objective optimized genomic breeding strategies for sustainable food improvement. Heredity 2019, 122, 672–683. [Google Scholar] [CrossRef]
- Moeinizade, S.; Han, Y.; Pham, H.; Hu, G.; Wang, L. A look-ahead Monte Carlo simulation method for improving parental selection in trait introgression. Sci. Rep. 2021, 11, 3918. [Google Scholar] [CrossRef] [PubMed]
- Moeinizade, S.; Hu, G.; Wang, L. A reinforcement Learning approach to resource allocation in genomic selection. Intell. Syst. Appl. 2022, 14, 200076. [Google Scholar] [CrossRef]
- Mazyavkina, N.; Sviridov, S.; Ivanov, S.; Burnaev, E. Reinforcement learning for combinatorial optimization: A survey. Comput. Oper. Res. 2021, 134, 105400. [Google Scholar] [CrossRef]
- Shakya, A.K.; Pillai, G.; Chakrabarty, S. Reinforcement learning algorithms: A brief survey. Expert Syst. Appl. 2023, 231, 120495. [Google Scholar] [CrossRef]
- Gambella, C.; Ghaddar, B.; Naoum-Sawaya, J. Optimization problems for machine learning: A survey. Eur. J. Oper. Res. 2021, 290, 807–828. [Google Scholar] [CrossRef]
- Younis, O.G.; Corinzia, L.; Athanasiadis, I.N.; Krause, A.; Buhmann, J.M.; Turchetta, M. Breeding programs optimization with reinforcement learning. arXiv 2024, arXiv:2406.03932. [Google Scholar] [CrossRef]
- Hao, J.; Li, B.; Tang, W.; Liu, S.; Chang, Y.; Pan, J.; Tao, Y.; Lv, C. A Reinforcement Learning-Driven UAV-Based Smart Agriculture System for Extreme Weather Prediction. Agronomy 2025, 15, 964. [Google Scholar] [CrossRef]
- Fu, H.; Li, Z.; Zhang, W.; Feng, Y.; Zhu, L.; Long, Y.; Li, J. Path Planning for Agricultural UAVs Based on Deep Reinforcement Learning and Energy Consumption Constraints. Agriculture 2025, 15, 943. [Google Scholar] [CrossRef]
- Meshram, R.A.; Alvi, A.S. Design of an Iterative Method for Crop Disease Analysis Incorporating Graph Attention with Spatial-Temporal Learning and Deep Q-Networks. Int. J. Intell. Eng. Syst. 2024, 17, 1301–1321. [Google Scholar] [CrossRef]
- Chavan, Y.R.; Swamikan, B.; Gupta, M.V.; Bobade, S.; Malhan, A. Enhanced Crop Yield Forecasting Using Deep Reinforcement Learning and Multi-source Remote Sensing Data. Remote Sens. Earth Syst. Sci. 2024, 7, 426–442. [Google Scholar] [CrossRef]
- Arulkumaran, K.; Deisenroth, M.P.; Brundage, M.; Bharath, A.A. Deep reinforcement learning: A brief survey. IEEE Signal Process. Mag. 2017, 34, 26–38. [Google Scholar] [CrossRef]
- Akdemir, D.; Sanchez, J.I.; Jannink, J.-L. Optimization of genomic selection training populations with a genetic algorithm. Genet. Sel. Evol. 2015, 47, 38. [Google Scholar] [CrossRef]
- Alemu, A.; Åstrand, J.; Montesinos-Lopez, O.A.; y Sanchez, J.I.; Fernandez-Gonzalez, J.; Tadesse, W.; Vetukuri, R.R.; Carlsson, A.S.; Ceplitis, A.; Crossa, J. Genomic selection in plant breeding: Key factors shaping two decades of progress. Mol. Plant 2024, 17, 552–578. [Google Scholar] [CrossRef] [PubMed]
- Larrañaga, P.; Bielza, C. Estimation of distribution algorithms in machine learning: A survey. IEEE Trans. Evol. Comput. 2023, 28, 1301–1321. [Google Scholar] [CrossRef]
- Hauschild, M.; Pelikan, M. An introduction and survey of estimation of distribution algorithms. Swarm Evol. Comput. 2011, 1, 111–128. [Google Scholar] [CrossRef]
- Liu, X.; Li, P.; Meng, F.; Zhou, H.; Zhong, H.; Zhou, J.; Mou, L.; Song, S. Simulated annealing for optimization of graphs and sequences. Neurocomputing 2021, 465, 310–324. [Google Scholar] [CrossRef]
- Molina, D.; Poyatos, J.; Ser, J.D.; García, S.; Hussain, A.; Herrera, F. Comprehensive taxonomies of nature-and bio-inspired optimization: Inspiration versus algorithmic behavior, critical analysis recommendations. Cogn. Comput. 2020, 12, 897–939. [Google Scholar] [CrossRef]
- Guilmeau, T.; Chouzenoux, E.; Elvira, V. Simulated annealing: A review and a new scheme. In Proceedings of the 2021 IEEE Statistical Signal Processing Workshop (SSP), Rio de Janeiro, Brazil, 11–14 July 2021; pp. 101–105. [Google Scholar]
- Henderson, D.; Jacobson, S.H.; Johnson, A.W. The theory and practice of simulated annealing. In Handbook of Metaheuristics; Springer: Berlin/Heidelberg, Germany, 2003; pp. 287–319. [Google Scholar]
- Gonzalez, T.F. Handbook of Approximation Algorithms and Metaheuristics; Chapman and Hall/CRC: Boca Raton, FL, USA, 2007. [Google Scholar]
- Mnih, V.; Kavukcuoglu, K.; Silver, D.; Rusu, A.A.; Veness, J.; Bellemare, M.G.; Graves, A.; Riedmiller, M.; Fidjeland, A.K.; Ostrovski, G. Human-level control through deep reinforcement learning. Nature 2015, 518, 529–533. [Google Scholar] [CrossRef] [PubMed]











| Hyperparameter | Value | Description |
|---|---|---|
| Network architecture | MLP: 256-128-64-1 | Hidden layer sizes, ReLU activations |
| Input dimension | Flattened state–action feature vector | |
| Optimizer | Adam | Gradient-based optimization algorithm |
| Learning rate (η) | Step size for weight updates | |
| Discount factor (γ) | 0.99 | Reward discounting for future returns |
| Replay buffer capacity | 1 × 105 | Maximum number of stored transitions |
| Batch size | 64 | Number of transitions sampled per update |
| Target update frequency | Every 1000 steps | Synchronization interval between main and target networks |
| Exploration rate (ε) | Linear decay 1.0→0.01 (5000 steps) | ε-greedy exploration schedule |
| Training rollouts per generation | 300 | Number of complete breeding simulations for data collection |
| Training iterations per generation | 5000 | Gradient updates performed for each generation’s model |
| Configuration | Filtering Thresholds | Imputation Method | % Change from Default | |
|---|---|---|---|---|
| Default | Loci: 20%, Ind: 50% | BLUE | 17.95 ± 0.78 | — |
| Conservative | Loci: 15%, Ind: 40% | BLUE | 17.68 ± 0.82 | −1.5% |
| Liberal | Loci: 25%, Ind: 60% | BLUE | 18.12 ± 0.85 | +0.9% |
| Mean Imputation | Loci: 20%, Ind: 50% | Mean | 17.59 ± 0.91 | −2.0% |
| Method | Min | Mean | Max | Diversity |
|---|---|---|---|---|
| LAS | 2.92 ± 2.50 | 70.70 ± 5.96 | 151.84 ± 4.72 | 1493.66 |
| LAS_SA | 1.54 ± 2.09 | 73.56 ± 7.58 | 155.49 ± 5.14 | 1344.96 |
| ILAS | 2.76 ± 5.12 | 76.16 ± 6.82 | 165.60 ± 6.53 | 1521.28 |
| Dataset | Method | ) | Statistical Significance | |
|---|---|---|---|---|
| Corn2019 | Uniform | 151.85 ± 1.37 [151.92, 2.45] | 137.43 ± 2.11 [137.62, 3.86] | p < 0.001 *** |
| LAS-RF | 151.70 ± 1.47 [151.65, 2.67] | 136.85 ± 1.38 [136.91, 2.51] | p < 0.001 *** | |
| ILAS-RF | 158.93 ± 1.31 [158.88, 2.38] | 157.10 ± 1.23 [157.05, 2.24] | p = 0.012 * | |
| ILAS-DQN | 154.40 ± 1.29 [154.35, 2.35] | 155.85 ± 1.51 [155.90, 2.75] | — | |
| Cubic | Uniform | 15.74 ± 0.10 [15.73, 0.18] | 14.88 ± 0.09 [14.87, 0.16] | p < 0.001 *** |
| LAS-RF | 15.81 ± 0.11 [15.80, 0.20] | 14.46 ± 0.09 [14.45, 0.16] | p < 0.001 *** | |
| ILAS-RF | 17.18 ± 0.10 [17.17, 0.18] | 16.98 ± 0.10 [16.97, 0.18] | p < 0.001 *** | |
| ILAS-DQN | 17.95 ± 0.09 [17.94, 0.16] | 17.74 ± 0.08 [17.73, 0.15] | — |
| Comparison | Dataset | (95% CI) | (95% CI) | Practical Significance |
|---|---|---|---|---|
| ILAS-DQN vs. Uniform | Corn2019 | 2.55 (1.83, 3.27) | 18.42 (17.15, 19.69) | Large |
| Cubic | 2.21 (2.03, 2.39) | 2.86 (2.69, 3.03) | Large | |
| ILAS-DQN vs. LAS-RF | Corn2019 | 2.70 (1.98, 3.42) | 19.00 (17.73, 20.27) | Large |
| Cubic | 2.14 (1.96, 2.32) | 3.28 (3.11, 3.45) | Large | |
| ILAS-DQN vs. ILAS-RF | Corn2019 | 4.53 (5.25, 3.81) | 1.25 (2.52, 0.02) | Large |
| Cubic | 0.77 (0.59, 0.95) | 0.76 (0.59, 0.93) | Small–Medium |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Yang, M.; Li, Z.; Li, J.; Huang, B.; Niu, X.; Lu, X.; Li, X. A Hybrid Optimization Approach for Multi-Generation Intelligent Breeding Decisions. Information 2026, 17, 106. https://doi.org/10.3390/info17010106
Yang M, Li Z, Li J, Huang B, Niu X, Lu X, Li X. A Hybrid Optimization Approach for Multi-Generation Intelligent Breeding Decisions. Information. 2026; 17(1):106. https://doi.org/10.3390/info17010106
Chicago/Turabian StyleYang, Mingxiang, Ziyu Li, Jiahao Li, Bingling Huang, Xiaohui Niu, Xin Lu, and Xiaoxia Li. 2026. "A Hybrid Optimization Approach for Multi-Generation Intelligent Breeding Decisions" Information 17, no. 1: 106. https://doi.org/10.3390/info17010106
APA StyleYang, M., Li, Z., Li, J., Huang, B., Niu, X., Lu, X., & Li, X. (2026). A Hybrid Optimization Approach for Multi-Generation Intelligent Breeding Decisions. Information, 17(1), 106. https://doi.org/10.3390/info17010106

