Next Article in Journal
A Multifunctional Cationic Waterborne Polyurethane System with High Fire-Safety and Antibacterial Performance Enabled by Phosphorous Acid-Protonated Chitosan
Previous Article in Journal
Effect of Composite Viscosity, Cavity Type, and Placement Technique on Microleakage in Stamp Technique Restorations: An In Vitro Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Framework for Integrating Large Language Models into Memetic Algorithms

Independent Researcher, Moscow 107143, Russia
Biomimetics 2026, 11(6), 383; https://doi.org/10.3390/biomimetics11060383
Submission received: 1 May 2026 / Revised: 23 May 2026 / Accepted: 26 May 2026 / Published: 1 June 2026
(This article belongs to the Section Biological Optimisation and Management)

Abstract

Memetic algorithms achieve strong optimization performance by combining population-based global search with local refinement operators, yet their effectiveness critically depends on the design and management of memes. Local search strategies are typically handcrafted, problem-specific, and fixed prior to execution. This paper proposes a fourth-generation memetic framework in which Large Language Models (LLMs) are embedded directly into the optimization loop as adaptive generators of local search operators. At each triggering point, the LLM receives a structured state vector encoding the current search dynamics and generates a candidate meme in the form of an executable Python function. Generated operators are subject to a two-stage validation procedure combining semantic similarity assessment and implementation-level comparison, ensuring that only sufficiently novel and syntactically correct operators are admitted to a dynamically growing meme library. A cooldown-regulated triggering mechanism balances periodic and stagnation-based generation, while a probability-weighted selection policy prioritizes newly generated memes without discarding previously validated ones. The proposed framework is evaluated on the CEC 2017 benchmark suite for continuous black-box optimization and compared against classical memetic algorithms. Experimental results demonstrate that the LLM-driven approach consistently outperforms non-LLM memetic baselines, confirming the viability of generative language models as adaptive heuristic components within population-based optimization.

Graphical Abstract

1. Introduction

In recent decades, evolutionary population-based algorithms have become powerful and widely adopted tools for solving complex optimization problems. One of the key advantages of this class of methods, in addition to their conceptual simplicity and inherent population diversity, is their high probability of locating so-called suboptimal solutions, that is, solutions situated in the vicinity of the global optimum. In many real-world optimization scenarios, such solutions are sufficient from a practical standpoint [1,2,3].
Despite these strengths, population-based algorithms also exhibit well-known limitations. In particular, their convergence rate often decreases significantly near high-quality solutions, as they generally lack mechanisms for exploiting fine-grained local information about the underlying fitness landscape [4]. This limitation becomes especially important in large-scale and computationally demanding real-world problems, where evaluation budgets are restricted and high solution quality must be achieved within strict time constraints, for example in optical design problems [5,6]. Simply increasing computational power is not sufficient to cope with the growing complexity of modern optimization tasks; instead, qualitatively new algorithmic paradigms are required [7].
Additionally, it has been shown that relying on a single optimization method is often insufficient to obtain high-quality solutions across diverse problem classes [8]. The need for hybrid strategies is well-recognized beyond purely mathematical benchmarks: recent work has demonstrated that combining global metaheuristic search with domain-specific local refinement consistently yields superior results across a range of engineering design and applied optimization problems [9,10,11].
Those observations have motivated the development of hybrid optimization techniques that combine complementary search strategies. Among these, memetic algorithms (MAs) represent one of the most promising and conceptually rich approaches [12]. Memetic algorithms are population-based metaheuristics inspired by neo-Darwinian evolution and the concept of a meme introduced by Dawkins [13], where a meme denotes a unit of cultural information transmitted through imitation and learning. In the context of optimization, a meme corresponds to a local improvement procedure that refines candidate solutions during the evolutionary process. By combining global exploration with local exploitation, memetic algorithms can achieve a synergistic effect that often surpasses the performance of their individual components.
Memetic algorithms have been successfully applied to a wide range of optimization problems in science and engineering [14,15,16]. However, their effectiveness strongly depends on the choice and management of memes, including the selection of local search operators, their frequency of application, and the conditions under which they are activated. Previous studies have demonstrated that meme selection has a decisive impact on the overall performance of memetic algorithms [17].
In this paper, we propose a formal framework aimed at addressing these design challenges by integrating large language models (LLMs) into the memetic search process. The underlying idea was initially introduced in [18] and is supported by numerical results. The present work extends that study by developing a systematic and formal framework that rigorously defines the role of LLMs within memetic algorithms.
The present study addresses the following research questions: (i) whether large language models can be systematically embedded into memetic algorithms as generators of local search operators; (ii) whether such integration leads to measurable improvements in optimization performance; (iii) which components of the proposed framework are critical to its effectiveness; (iv) how generated memes evolve under online selection pressure; and (v) whether the framework generalizes across different population-based optimization algorithms.
The remainder of the paper is organized as follows. Section 2 provides the theoretical background on evolutionary, swarm-based, and memetic algorithms, tracing their development across generations. Section 3 reviews recent advances in the integration of large language models into optimization and algorithm design. Section 4 formulates the optimization problem and introduces the necessary notations and concepts. Section 5 presents the proposed fourth-generation memetic framework in detail, including state vector design, meme generation and novelty filtering, library management, and the complete algorithmic procedure. Section 6 reports the results of computational experiments on the CEC 2017 benchmark suite. Section 7 discusses the findings, examines the sensitivity of the framework to key design choices, and identifies open questions and limitations. Section 8 summarizes the main contributions and outlines directions for future research.

2. Memetic Algorithms

Evolutionary population-based algorithms form a broad class of metaheuristics inspired by natural evolution and collective behavior. Representative examples include genetic algorithms, differential evolution, particle swarm optimization (PSO) [19], whale optimization algorithm (WOA) [20] and moth–flame optimization (MFO) [21]. Despite differences in their update mechanisms and metaphors, these methods share a common structure: a population of candidate solutions, iterative fitness evaluation, and stochastic variation operators. Population-based methods are particularly valued for their robustness, flexibility, and ability to explore complex, multimodal search spaces. However, their reliance on indirect information exchange and population-level dynamics often limits their capacity to exploit fine-grained local structure of the objective function. As a consequence, convergence tends to slow significantly in the vicinity of high-quality solutions, especially in high-dimensional or ill-conditioned landscapes.
Memetic algorithms (MAs) extend evolutionary and swarm-based optimization by incorporating local search procedures into the population-based search process [12]. The notion of a meme, originating from Dawkins’ theory of cultural evolution [13], denotes a unit of information transmitted and refined through imitation and learning. In optimization, a meme corresponds to a local improvement operator that refines candidate solutions during the search. By combining global exploration with local exploitation, memetic algorithms often achieve superior convergence behavior and solution quality compared to their non-hybrid counterparts, and their effectiveness has been demonstrated across a wide range of optimization problems. Recent applications span domains including scheduling and manufacturing, structural and mechanical engineering design [22] and energy systems [10].
Nevertheless, their performance critically depends on the design and management of memes including operator selection, application frequency, and activation criteria. A structured taxonomy of memetic algorithms was presented in [23], distinguishing four generations based on increasing levels of algorithmic autonomy. The first generation belongs to the class of hybrid algorithms that couple a population-based global search with a local refinement procedure. These algorithms cannot be unambiguously classified as evolutionary in the Darwinian sense, as they lack mechanisms for transmitting memetic information across generations, selection and mutation of memes, and related principles. Key design questions were how often local search should be applied; to which individuals; for how long; at what computational cost; and which memes are appropriate for a given problem.
The second generation addresses several of these issues through multi-memetic designs that incorporate operations for transmitting and selecting memes. In some variants, memetic information is inherited from parents by offspring; others employ hyper-heuristics or Lamarckian meta-learning [24] to select memes from a competing pool based on past performance.
The third generation introduces self-generation and self-adaptation as local search engines co-evolve with candidate solutions, tracking structural patterns in the search space. Across all three generations, memes function as mechanisms for controlling the evolution of individuals, with their degree of engagement governed by performance indicators and parameters that balance global and local search.
The fourth generation of memetic algorithms is characterized by higher-level cognitive capabilities, including recognition, generalization, and long-term memory, in which memes are not confined to a single problem instance but evolve across runs and adapt to broader contextual information. The concept of a fourth-generation memetic algorithm realized through large language models was introduced by the authors in [18]. In that preliminary study, the primary focus was placed on exploring the feasibility of LLM-driven generation of local search strategies and demonstrating the practical potential of the approach through several proof-of-concept experiments. However, many important methodological and architectural aspects remained open, including the formal organization of meme generation, integration with the optimization process, and systematic evaluation methodology. The present work extends that contribution by providing a complete formal framework, a rigorous algorithmic specification, and an extensive computational validation on standard continuous optimization benchmarks. Beyond empirical evaluation, the paper identifies key design dimensions, open theoretical questions, and concrete directions for future development of this paradigm.

3. Related Works

Recent advances in large language models have opened new directions for mathematical optimization, with a growing body of work exploring how LLMs can be embedded into optimization workflows. Rather than a single unified paradigm, this literature reflects several distinct but complementary integration strategies, each illuminating a different facet of the LLM–optimization relationship.
The most foundational question in this area concerns what role an LLM should play within an optimization process. The survey by the authors of [25] provides a systematic answer, positioning LLMs not merely as coding assistants but as integral components that can act simultaneously as optimizers generating and refining candidate solutions, as surrogate predictors estimating solution quality, and as designers proposing entire algorithms or heuristics. The authors argue that the most successful approaches adopt a hybrid paradigm in which LLMs provide semantic reasoning and creative exploration while traditional optimization methods ensure structured search and convergence guarantees—a perspective directly aligned with the use of LLMs for meme generation proposed in the present work. This framing is reinforced by [26], which specifically addresses the limitations of standalone LLM reasoning, such as hallucinations and partial solutions, by grounding generation within an evolutionary framework that provides diversity and convergence control. Taken together, these works establish that neither LLMs nor classical optimizers alone are sufficient: the productive design space lies in their structured combination.
A second cluster of studies moves beyond general principles to demonstrate concrete integrations. Studies [27,28] are particularly relevant to the present work because they embed LLMs directly into memetic algorithm architectures. The framework in [27] integrates LLM-driven heuristic exploration, domain-specific local refinement, and memory-aware reflection, demonstrating that LLMs can serve as interpretable heuristic designers that dynamically evolve strategies by synthesizing natural language and program representations. The work in [28] takes a similar approach for constrained scheduling, embedding LLM-generated heuristic rules and memetic operators within a classical evolutionary local search loop and demonstrating competitive benchmark performance. Both studies confirm that LLMs can reduce the manual effort traditionally required to construct effective memetic operators, but both generate operators in a partially offline or externally guided manner rather than under continuous optimization pressure.
More general evolutionary integrations are explored in [29], which empirically investigates LLMs participating directly in the evolutionary workflow by proposing candidate solutions and variation operators, and in [30], which introduces a bidirectional coupling where evolutionary experience is fed back into the LLM when the search stagnates to produce seed solutions that escape local optima. The latter is notable for its explicit feedback mechanism as the LLM is not merely queried but informed by the state of the search, enabling adaptive behavior across iterations. This bidirectional structure is closest in spirit to the architecture proposed here, though the present work extends it to the generation and online selection of local search operators rather than solution candidates.
A third strand of research elevates the LLM from an operator within an algorithm to a generator of complete algorithms. In [31], the LLM is embedded in an evolutionary loop that generates, mutates, evaluates, and selects entire metaheuristics based on performance criteria, with evolutionary feedback iteratively improving these designs and reducing dependence on human expertise. The study [32] demonstrates a related capability, using GPT-4 Turbo to automatically generate and evolve variants of the Self-Organizing Migrating Algorithm without expert intervention; one LLM-derived variant substantially outperformed the baseline while deviating meaningfully from traditional designs, illustrating that LLM-generated constructs can diverge productively from human-engineered heuristics.
These results are significant because they establish that LLMs are capable of genuine algorithmic creativity rather than mere recombination of known patterns. At the same time, the offline or population-level nature of algorithm generation in these works means the LLM receives limited real-time feedback from individual optimization runs—a constraint the present framework explicitly removes.
Underlying these empirical contributions are a set of theoretical and survey works that provide conceptual grounding. The work [33] draws explicit parallels between LLMs and evolutionary algorithms, comparing token representation to individual encoding and transformer training to parameter adaptation, and argues that shared generative-iterative structures provide a principled basis for hybrid methods. Surveys [25,34] synthesize the broader landscape, analyzing prompt design, automated strategy evolution, and representation choices as central technical challenges. These contributions contextualize the empirical results and support the view that LLM–EA hybrids constitute a coherent and theoretically motivated research direction rather than an ad hoc engineering choice.
Across this body of literature, a common pattern emerges: LLMs are either used offline to design operators or algorithms before optimization begins, or they are embedded as solution proposers that operate at the population level. What has not been demonstrated is a closed-loop architecture in which the LLM generates local search operators during a run, those operators are evaluated under real fitness pressure, and the resulting performance signal is used to guide subsequent LLM proposals. This is precisely the gap the present work addresses.
Unlike approaches in which the LLM generates algorithms offline or serves as a standalone solution proposer, the framework developed here places the LLM inside a continuous feedback loop with the fitness function: it receives structured representations of the current search state, proposes meme operators, and adapts subsequent proposals based on empirical performance. This architecture positions the LLM simultaneously as a creative generator and an adaptive optimization component, enabling the accumulation of transferable meme knowledge across runs which is a direct practical realization of the fourth-generation memetic algorithm paradigm that has previously remained conceptual.

4. Problem Formulation

Initially, memetic algorithms were proposed as one of the ways to improve the efficiency of genetic algorithms. However, the combination scheme of global and local search used in MA can be used to construct search algorithms also based on other population algorithms. The term “memetic algorithms” was first proposed by P. Moscato [12] to denote a class of stochastic global search methods that combine the strengths of local search methods, tailored for specific practical problems, and population global search methods.
In this study we use memetic algorithms to solve a global constrained nonlinear minimization problem
min X D R n F ( X ) = F X * = F * .
Here F ( X ) is the scalar objective function, X = ( x 1 , x 2 , , x n ) is the n-dimensional vector of variables, F X * = F * is the required minimum value, R n is the n-dimensional arithmetical space, and D is the constrained search domain
D = { X | x i m i n x i x i m a x , i [ 1 : n ] } R n .
In this work F ( X ) is treated as a black-box function, and the feasible set D is defined by simple bound constraints.
As discussed in [23], memetic algorithms can be organized into four generations of increasing algorithmic autonomy, the first three of which were outlined in the preceding section. Across all three generations, memes serve as mechanisms for controlling individual evolution, with their engagement governed by performance indicators and parameters that balance global and local search.
The fourth generation departs fundamentally from this paradigm. Rather than drawing memes from a predefined pool, memes emerge organically through information exchange among groups of candidate solutions. They can be retained in memory as patterns accumulated from solving prior problems and retrieved through associations between stored patterns and new tasks. This constitutes a qualitatively higher order of learning than local search alone, and it is this generation that the present work formally develops and computationally investigates.

5. The Proposed General Framework for LLM-Enhanced Memetic Algorithms

This section presents the formal framework for LLM-enhanced memetic algorithms, in which the on-demand synthesis of memes via external intelligent systems is realized through large language models. Importantly, the LLM does not replace the optimization algorithm but augments it by providing candidate memes conditioned on the current search state. The global search component can be implemented by an arbitrary population-based algorithm, which retains full control over the optimization trajectory, while the LLM serves as a source of adaptive heuristic variations. From this perspective, the framework implements a co-evolutionary mechanism between solutions and memes: the population evolves in the solution space, and the meme library evolves in the space of local search strategies. This distinguishes the fourth generation from earlier approaches, where meme evolution is either absent or restricted to selection among predefined operators. At the same time, the framework preserves a key empirical principle that all generated memes must be validated under real optimization pressure, as there is no guarantee that the LLM produces fundamentally new or effective strategies.

5.1. State Representation and Prompt Construction

Within the proposed framework, the state representation plays a fundamentally more important role than in earlier generations of memetic algorithms. In classical and multi-memetic approaches, the choice of local search operator is either deterministic or based on simple heuristics. In contrast, the state vector here becomes the primary source of information driving meme generation: the ability of the LLM to produce meaningful local search operators is directly constrained by the informativeness of this representation.
In the present framework, the state vector captures the current behavior of the optimization process in an online setting, encoding convergence dynamics, population diversity, and search regime characteristics. This formulation allows the LLM to implicitly condition generated memes on different search phases, for example, distinguishing stagnation from high-exploration regimes. It also introduces a potential limitation: if important characteristics of the problem or search dynamics are absent from the state vector, the generated memes may resemble generic local search heuristics, such as hill climbing or adaptive step-size methods [35], rather than strategies tailored to the current landscape. It’s also important to mention that the state vector doesn’t contain any information on the problem under optimization and focuses only on the optimization process and its main features.
The components of s t were selected to provide the LLM with a compact but sufficiently informative characterization of the search process across three complementary axes: convergence dynamics ( I t , κ t ), population geometry ( D t , S t , E t ), and fitness landscape structure ( V t , L t , M t ). Each component was chosen on the basis of computational tractability in an online setting, requiring no additional function evaluations. Alternative features such as gradient approximations were excluded as the framework targets black-box problems where gradients are unavailable. The grouping of components into these three axes also reflects a deliberate design choice: each axis captures a distinct aspect of search behavior that may call for qualitatively different meme strategies.
Let P t = { X 1 ( t ) , , X N ( t ) } denote the population of N candidate solutions at iteration t and let F t * denote the best fitness value observed up to iteration t . The state vector s t R 8 is defined as:
s t = ( I t ,   κ t ,   D t ,   S t ,   E t ,   V t ,   L t ,   M t )
The improvement rate I t measures the relative change in best fitness between consecutive iterations:
I t = F t 1 * F t * F t 1 * + ε
where ε > 0 is a small constant ensuring numerical stability. Positive values indicate improvement; zero indicates stagnation. The stagnation counter κ t records the number of consecutive iterations without improvement:
κ t = 0 ,   i f   F t * < F t 1 * κ t 1 + 1 ,   o t h e r w i s e
Together I t and κ t provide complementary information about convergence dynamics. I t captures the instantaneous rate of progress, while κ t accumulates evidence of prolonged stagnation that a single-iteration metric would fail to detect.
The population diversity D t quantifies the spatial spread of the population as the mean Euclidean distance from each individual to the population centroid
D t = 1 N i = 1 N X i t X ( t ) ¯ 2
The mean step size S t measures the average displacement of individuals between consecutive iterations:
S t = 1 N i = 1 N X i t X i ( t 1 ) ¯ 2
The exploration-exploitation ratio E t characterizes the balance between spatial spread and movement dynamics:
E t = D t D t + S t + ε
Values of E t close to 1 indicate that the population is spread widely relative to its movement, suggesting an exploration-dominated regime; values close to 0 indicate convergence or exploitation-dominated dynamics. Jointly, D t ,   S t and E t provide a multi-scale description of population geometry: D t captures the global extent of the search cloud, S t reflects the velocity of individual trajectories, and E t summarizes their relative balance.
The fitness variance V t measures the dispersion of objective values across the population:
V t = 1 N i = 1 N ( F X i t F t ¯ ) 2 ,   F t ¯ = 1 N i = 1 N F X i t
The local fitness dispersion L t captures fine-grained fitness heterogeneity by computing the mean absolute fitness difference between each individual and its nearest neighbor:
L t = 1 N i = 1 N | F X i t F X n n ( i ) t |
where n n i = arg min i j X i t X j t 2 denotes the nearest neighbor of individual i in the population. The pair V t and L t provides complementary views of fitness heterogeneity: V t reflects global spread across the entire population, whereas L t is sensitive to local fitness ruggedness in the immediate neighborhood of each individual.
Finally, the multimodality proxy M t estimates the degree of multimodality in the current population as the fraction of individuals that are locally non-minimal, in other words, worse than at least one of their k nearest neighbors:
M t = 1 N i = 1 N 1   j N k i : F X j t < F X i t
where N k i denotes the set of k nearest neighbors of individual i , and k is a small, fixed integer (here k = 3 ). High values of M t suggest that the population spans multiple distinct basins of attraction, which in turn informs the LLM that escape-from-basin strategies may be more appropriate than fine-grained local refinement.
The state vector s t serves as the basis for constructing the meme generation prompt. This step is critical: the prompt content directly conditions the LLM’s output, and its quality determines whether the generated operator reflects the current search context or defaults to generic heuristic patterns. The meme generation mechanism is embedded directly into the optimization loop through a fixed prompt template. The LLM is tasked with generating a local search operator in the form of a deterministic Python function that performs a fixed number of improvement steps on a single candidate solution.
A key distinction of the fourth-generation framework is that meme generation is not a one-time offline process but an iterative, state-dependent procedure: at each triggering point, the LLM produces a candidate operator conditioned on the current s t . This significantly increases the diversity of potential memes, while also raising the question of whether the generated operators represent genuinely novel strategies or variations in known methods. This question is also addressed through empirical validation within the optimization loop. The prompt template used in the present framework takes the following form:
We are running a population-based continuous optimization algorithm.
A local search operator (meme) is required.
Current optimization state:
 Improvement rate   (I_t): {I_t}
 Stagnation counter  (κ_t): {κ_t}
 Population diversity  (D_t): {D_t}
 Mean step size    (S_t): {S_t}
 Exploration-exploitation ratio (E_t): {E_t}
 Fitness variance    (V_t): {V_t}
 Local fitness dispersion (L_t): {L_t}
 Multimodality proxy  (M_t): {M_t}
Goal:
Generate a Python function that performs 30 local improvement steps
on a single solution vector x.
Requirements:
1. Function signature must be:
  def generated_meme(x, objective_function, lower_bound, upper_bound):
    …
2. Input:
  - x: numpy array of shape (d,)
  - objective_function: callable f(x)
  - lower_bound: numpy array (d,)
  - upper_bound: numpy array (d,)
3. The function must:
  - Modify x locally (small perturbation or structured step)
  - Respect bounds
  - Return (x_new, f_new)
4. Do NOT use:
  - external libraries except numpy
  - randomness without numpy
  - recursion
  - printing
  - global variables
Return only valid Python code.
Do not include explanations.

5.2. Novelty Assessment and Meme Library Management

The generated meme is evaluated in terms of structural and semantic novelty prior to integration into the optimization loop. Since there is no guarantee that the LLM does not reproduce previously generated algorithms or their close variations, the framework explicitly incorporates a similarity filtering stage. The concept of structural similarity control was introduced in [36].
Novelty is assessed through a hybrid similarity measure combining two complementary methods. Semantic similarity is computed using GraphCodeBERT [37], which produces code embeddings that capture high-level algorithmic intent. Implementation-level similarity is measured using the Jaccard coefficient [38] over tokenized operator representations. This dual approach addresses a challenge in optimization algorithm assessment; specifically, while GraphCodeBERT provides robust general-purpose code understanding, its embeddings may not always capture mathematically significant variations characteristic of optimization routines, for example, differences in weight update schemes or step-size adaptation logic. Conversely, Jaccard similarity at the token level is particularly effective at detecting trivial modifications that preserve overall structure while altering surface syntax.
Formally, let σ s e m ( m , m ) and σ i m p l ( m , m ) denote the GraphCodeBERT cosine similarity and Jaccard similarity between candidate meme m and library meme m respectively. A candidate meme m is considered similar to an existing library meme m if either similarity exceeds a fixed threshold τ = 0.9 . This threshold was selected empirically [36] as a conservative value that permits structural variation while filtering near-duplicate operators.
s i m m , m = 1 σ s e m m , m > τ σ i m p l m , m > τ
If s i m m , m = 1 for any m L t , where L t denotes the current meme library, the candidate is rejected and the selection falls back to the existing library. Otherwise, the candidate is accepted and added to L t . It should be noted that structural differences do not necessarily imply conceptual novelty. An LLM-generated operator may be distinct in implementation while still being interpretable as a variant of a known method, for example, hill climbing with adaptive step size. The evaluation mechanism partially mitigates this issue by focusing on empirical performance rather than theoretical novelty: each candidate meme is tested within the optimization loop, and only those demonstrating measurable improvement over the current best solution are retained in L t .
At each triggering point (every Δ iterations), the active meme is selected according to the following policy. If a novel meme m n e w has been accepted, it is assigned selection probability 1 | L t | · θ , while each meme m L t retains a small probability θ :
P m = 1 L t · θ ,   i f   m = m n e w θ ,   i f   m L t
where θ is a fixed exploration probability. If no novel meme was generated at the current trigger, either due to similarity rejection or LLM unavailability, selection is performed uniformly over L t . This policy ensures that newly generated operators are prioritized while preserving access to previously validated strategies, maintaining both adaptability and continuity throughout the optimization process.
The proposed framework maintains a dynamically growing library L t that accumulates operators generated and validated during the optimization run. This mechanism can be interpreted as within-run learning: rather than discarding memes after use, the framework retains those that proved effective and makes them available for reuse in subsequent iterations. The library thus gradually captures a problem-specific portfolio of local search strategies that are empirically suited to the current objective landscape.
An important design decision is that the meme library is not intended to persist across optimization runs. Since different problem instances induce different search dynamics, memes effective for one instance are not assumed to transfer to another. Each run begins with an empty library and constructs its own portfolio from scratch, driven by the evolving state representation. This distinguishes the present framework from transfer optimization or algorithm selection approaches, where cross-instance knowledge reuse is an explicit goal.

5.3. General Framework

The integration of LLMs introduces inference latency and computational overhead not presented in earlier generations of memetic algorithms. The proposed framework mitigates this through the reuse of previously generated memes: as optimization progresses and L t grows, the frequency of LLM calls decreases and the algorithm increasingly operates using the accumulated library. This behavior represents a form of knowledge consolidation, where the upfront cost of meme generation is amortized over repeated reuse. Practical challenges remain, including the handling of syntactically invalid or semantically degenerate operators and the latency associated with LLM inference. These issues highlight the need for robust validation mechanisms and efficient integration strategies. From the perspective of memetic algorithm evolution, the proposed approach represents a shift from static or manually designed hybridization toward autonomous construction of hybrid strategies, in which the practitioner defines the overall framework while specific local search behaviors emerge through generation, evaluation, and selection. At the same time, the empirical foundation of memetic algorithms is fully preserved: all candidate strategies must demonstrate their effectiveness through direct evaluation before contributing to the optimization process.
Algorithm 1 summarizes the complete execution flow of the proposed framework. The algorithm proceeds in three interleaved phases at each iteration: global search update, meme generation and filtering, and local refinement. The global search component is treated as a black box, allowing any population-based algorithm to serve as the underlying optimizer. Meme generation is triggered periodically or upon stagnation detection, subject to the library capacity constraint L m a x . Generated memes pass through a two-stage validity check (syntactic correctness followed by novelty filtering) before being admitted to the library L . Local refinement is applied to the top 0.15 · N individuals ranked by fitness, with each selected individual undergoing up to η = 30 improvement steps or until local stagnation is detected. The meme applied to each individual is drawn from the current library according to a selection policy that prioritizes the most recently accepted meme while preserving access to all previously validated operators.
Algorithm 1. Fourth-Generation Memetic Algorithm Framework with LLM-driven Meme Generation
Input: objective function F , bounds D , population size N , max iterations T , trigger interval Δ , stagnation threshold λ , library capacity L m a x , meme steps η , meme stagnation limit ρ , similarity threshold τ , library exploration parameter θ
 
Output: best solution X * , meme library L
 
P 0 InitializePopulation   N , D
F * m i n i F X i 0 , x * arg m i n i F X i 0
L , κ 0 , m n e w
t l a s t δ
 
for  t = 1 , , T  do
    P t GlobalSearchUpdate P t 1
   Update F * , X * , κ t
   Compute state vector s t = I t , κ t , D t , S t , E t , V t , L t , M t
 
   // Meme Generation Trigger
 
   if  t   m o d   Δ = 0  or  κ t > λ  and  L < L m a x  and  t t l a s t δ  then
        m c a n d LLM - GenerateMeme s t
        t l a s t t
       if  m c a n d is syntactically valid and executes without runtime error then
         if    m L s.t. σ s e m ( m c a n d , m ) > τ  or  σ i m p l ( m c a n d , m ) > τ
             L L m c a n d ,
             m n e w m c a n d
         else
             m n e w // rejected due to similarity
         end if
       else
          m n e w // rejected due to invalid or erroneous code
       end if
   end if
 
   // Meme Selection
 
   if  L
      if  m n e w
        Select m * according to:
           P m * = m = 1 ( L 1 ) θ if   m = m n e w θ if   m L { m n e w }
        where θ < 1 L 1 to ensure P m * = m n e w > 0
      else
        Select m * uniformly from L
       end if
 
       // Meme Application to Top 15%
 
        E t top   0.15 N   individuals   of   P t   ranked   by   f
       for each  X E t  do
          κ l o c a l 0 , X c u r x
         for  s = 1 , , η  do
             X n e w , F n e w ) m * ( X c u r , F , D
            if  F n e w < F ( X c u r )
                X c u r X n e w , κ l o c a l 0
            else
                κ l o c a l κ l o c a l + 1
            end if
            if  κ l o c a l ρ break end if
         end for
         if  F ( X c u r ) < F ( X )  then
             Replace X in P t with X c u r
             Update F * , X * if improved
         end if
      end for
   end if
    m n e w // reset for next iteration
end for
return  X * , L
Generated operators that fail syntactic validation or produce a runtime error upon execution are discarded immediately and do not enter the library. In practice, code generation errors are rare with current state-of-the-art large language models; however, error rates vary across models of different capability and architecture, and a systematic characterization of code validity as a function of model choice represents a separate line of investigation beyond the scope of the present work. To prevent redundant LLM calls when both the periodic and stagnation-based triggers coincide, a cooldown parameter δ  is introduced. Following any LLM call at iteration t l a s t , the next call is suppressed until t t l a s t δ , regardless of which trigger condition is satisfied. Setting δ = Δ  recovers the most conservative behavior, in which the stagnation trigger can only accelerate a scheduled call but cannot introduce additional ones. Smaller values of δ < Δ  allow faster response to stagnation while still preventing back-to-back calls. Prior to the first successful meme admission, the library remains empty and the algorithm operates as a standard global search method without local refinement.
The detailed description of the proposed framework is presented below and in Figure 1.

6. Experimental Study

6.1. Experimental Setup

In order to evaluate the performance of the proposed framework we use a set of high-dimensional benchmark functions from CEC 2017, which was designed single objective real-parameter numerical optimization problems [39]. The present study focuses on the 30-dimensional benchmark setting ( n = 30 ) in order to establish and validate the proposed framework. Investigation of scalability properties for higher-dimensional optimization problems will constitute an important next stage of research following the formalization of the proposed paradigm. All algorithms were implemented using Python programming language, while computational experiments were launched on MacBook Pro with 2.3 GHz Quad-Core Intel Core i7 and 32 GB RAM.
Particle Swarm Optimization and the Whale Optimization Algorithm were selected as the underlying global search components for several reasons. Both represent well-established population-based metaheuristics with qualitatively distinct search mechanisms: PSO relies on velocity-based social learning, while WOA employs encircling and spiral-based prey-chasing behaviors. This diversity ensures that conclusions regarding the proposed memetic framework are not tied to a single search paradigm but reflect its behavior across qualitatively different global optimizer dynamics.
Both algorithms have been extensively benchmarked on CEC test suites, providing reliable baseline references against which the contribution of the memetic component can be cleanly isolated. Eight algorithmic variants were evaluated: PSO, WOA, PSO-NM, WOA-NM, MMPSO, MMWOA, AIM-PSO and AIM-WOA. The baseline algorithms (PSO and WOA) use standard metaheuristic search without local refinement. The single-meme variants (PSO + NM and WOA + NM) augment the global search with Nelder–Mead local search [40], representing a first-generation memetic configuration. The multi-memetic variants (MMPSO and MMWOA) randomly select between two local search operators: Nelder–Mead and Random Walk Local Search [41]. The AI-memetic variants (AIM-PSO and AIM-WOA) constitute the proposed fourth-generation framework, employing LLM-generated operators selected according to the state-dependent policy described in Section 5, with the meme library growing dynamically over the course of each optimization run.
All algorithms used a maximum of T m a x = 2000 iterations with population size N = 30 . PSO parameters were set to inertia weight ω = 0.7298 and cognitive and social coefficients c 1 = c 2 = 1.49618 following the canonical parameterization of [19]. For WOA, default parameter settings from the original formulation [20] were retained. In all memetic variants, local search operators were applied every 50 iterations to the top 15 % of individuals ranked by current fitness, with each application performing up to λ m a x = 30 local iterations subject to the early termination criterion described in Section 5. All experiments employed 30 independent runs per function.
For the meme generation the GPT-5.3 model was used. The model temperature was set to 0.7 to balance operator diversity with coherence; full determinism via seed was not enforced as stochastic variation across meme proposals is a deliberate feature of the framework rather than a confound. All other parameters were left at API defaults. Each run issued at maximum 40 LLM calls (on average 37 calls when the stagnation criteria is not met) with individual inference latency averaging 2.4 s, contributing approximately 100 s of LLM overhead per run. This represents roughly 14% of total wall-clock time, indicating that inference cost does not constitute a prohibitive bottleneck.
All results obtained are presented in Table 1 and in Figure 1.

6.2. Statistical Analysis of the Results

The performance of all optimization strategies was evaluated across 29 benchmark functions, with statistical significance assessed using the Friedman test and pairwise Wilcoxon post hoc comparisons. The Friedman test confirmed highly significant differences among the algorithms ( χ 2 = 121.98 ,   p = 2.96 × 10 23 ), justifying a detailed pairwise analysis.
The mean Friedman ranks revealed a clear performance hierarchy, with AIM-PSO achieving the best average rank of 1.86, closely followed by AIM-WOA at 2.34. Both LLM-based algorithms substantially outperformed all other competitors, reflecting the advantage of the proposed framework. Notably, the pairwise Wilcoxon test found no statistically significant difference between these two methods ( p = 0.686 ), suggesting that, at this level of algorithmic sophistication, the choice of the underlying global search engine, PSO or WOA, has a negligible impact on overall performance. The slightly higher win count of AIM-WOA over AIM-PSO is tentatively attributed to WOA’s more diverse exploration pattern, which provides the adaptive meme selection mechanism with a broader variety of candidate solutions to refine; a thorough investigation of this interaction is left for future work.
Multi-memetic PSO ranked third (3.24) performing significantly worse than both LLM-based variants ( p < 0.05 ) but significantly better than all multistart and Nelder–Mead-based strategies. This indicates that while a fixed multi-meme strategy still benefits from diversity in local search, it cannot fully match the flexibility offered by adaptive meme selection. Multi-memetic WOA, however, ranked considerably lower (5.34), suggesting that the WOA global search mechanism interacts less favorably with a fixed multi-meme framework compared to PSO. The Nelder–Mead hybridizations (PSO-NM and WOA-NM) ranked fourth and sixth respectively, performing poorly despite incorporating a classical local search procedure. This can be attributed to the rigidity of the Nelder–Mead simplex method, which, while effective in smooth unimodal landscapes, struggles to provide consistent improvement across the diverse and multimodal benchmark functions used in this study.
The two simple multi-start strategies (PSO and WOA) ranked last overall, with average ranks of 6.41 and 6.51 respectively. Their high mean and maximum objective values, alongside large standard deviations, confirm that repeated independent restarts without any local refinement are insufficient to compete with hybridized approaches. Interestingly, no statistically significant difference was found between multi-start WOA and WOA-NM ( p = 0.565 ), suggesting that a single fixed local search operator offers little practical benefit over pure multi-start in this setting.
In terms of pairwise win counts, AIM-WOA achieved the strongest overall performance with 15 wins (51.7% of benchmark functions), followed by AIM-PSO with 8 wins (27.6%), collectively accounting for the majority of pairwise comparisons. These results confirm that the advantage of the proposed framework is not contingent on the choice of the underlying global search algorithm: both PSO-based and WOA-based instantiations of the fourth-generation memetic framework outperform their respective non-LLM counterparts including pure baselines, single-meme variants, and randomly selecting multi-memetic variants. This demonstrates that LLM-driven adaptive meme generation constitutes a genuine and transferable improvement over simpler memetic strategies.
To assess practical significance, Kendall’s W was computed for the global Friedman test, confirming a large overall effect of algorithm choice across the 29 benchmark functions. Pairwise effect sizes were quantified using rank-biserial correlation r . The largest effects were observed between the AIM algorithms and the multistart baselines ( r = 0.581 0.665 ), as well as against the Nelder–Mead hybrids ( r = 0.567 0.636 ), all qualifying as large effects. In contrast, the comparison between AIM-PSO and AIM-WOA yielded a negligible effect (r = 0.069), confirming that their statistically equivalent ranking reflects a genuinely marginal practical difference. The multistart strategies were mutually negligible (r ≤ 0.030), further confirming their practical equivalence.
Overall algorithm performance comparison is also presented in Figure 2. It shows that the proposed LLM-based memetic approaches produce substantially lower variance and more stable optimization performance compared to the multi-start and classical hybrid methods. In particular, AIM-PSO and AIM_WOA exhibit narrower interquartile ranges and fewer extreme outliers, indicating consistent convergence behavior across all runs and all benchmark functions. In contrast PSO and WOA demonstrate very high variability, with fitness values spanning several orders of magnitude and numerous extreme outliers, reflecting unstable convergence and strong sensitivity to initialization. Overall, the proposed framework significantly improves robustness and reduce performance dispersion.

6.3. Meme Similarity Analysis

Analyzing the structure and diversity of LLM-generated memes provides insight into the behavior of the generation mechanism and the degree of novelty achieved by the framework. For this purpose, memes were collected across three representative benchmark functions spanning different landscape classes: F 7 (multimodal), F 15 (hybrid) and F 25 (composite). For each function, the first and last memes admitted to the library were retained for analysis, yielding six memes in total: m 1 7 , m 3 7 , m 1 15 , m 3 15 , m 1 25 , m 3 25 .
Pairwise similarity was assessed according to the method described in Section 5.2. The results are presented in Figure 3. Several observations emerge from the analysis. For F 25 , the first and last generated memes exhibit near-identical similarity scores, despite having been produced in independent runs. This finding highlights the stability of the LLM conditioned on the s t : when the search dynamics encoded in the state vector are similar across runs, the model consistently produces structurally comparable operators. For F 7 and F 15 , by contrast, the first and last memes are noticeably dissimilar, reflecting the evolution of the search state over the course of the optimization and confirming that the generation mechanism responds meaningfully to changes in s t . Cross-function comparisons reveal occasional structural overlap between memes generated for different benchmark classes, notably between m 3 15 and m 3 25 , and between m 1 7 and m 1 15 . This suggests that certain search regimes, as characterized by similar state vectors, produce comparable operator responses from the LLM regardless of the underlying function class.
Nevertheless, the majority of cross-function meme pairs exhibit low similarity scores on both semantic and implementation metrics, indicating that the generation process is broadly sensitive to landscape-specific search dynamics. To establish an external reference point, all six memes were additionally compared against the Nelder–Mead method. The resulting similarity scores are uniformly low across both metrics, confirming that the LLM-generated operators are structurally distinct from this classical local search baseline.
Additonally, in Figure 4 we present the distribution of the pairwise comparison of generated memes based on 30 randomly selected memes but no more than 2 for every benchmark function. The obtained distirbution supports the central hypothesis of the framework; specifically, large language models, conditioned on a structured representation of the current search state, are capable of generating genuinely novel local search strategies that extend beyond the repertoire of established optimization heuristics.

6.4. Meme Efficiency Analysis

In order to further evaluate the quality of the generated memes a set of experiments was carried out with a use of 30 randomly selected previously generated memes. To make the comparison fair, we use a different set of benchmark functions and not the ones some of those memes were generated for. For this study five 30-dimensional traditional benchmarks were used: Sphere, Rastrigin, Ackley, Griewank, and Levy [42] with 20 independent runs from random initial points. Each meme was launched for the exact number of 30 iterations like in the earlier experiments. The memes were ranked by a composite score weighting mean fitness (0.7) and standard deviation (0.3).
The results (Table 2) reveal substantial performance variation across generated operators, with composite scores ranging from 0.044 to 0.841. The top-ranked operators ( m 1 13 , m 1 1 , m 3 15 ) achieve average values close to the known optima while maintaining low mean errors across multimodal benchmarks. By contrast, the lowest-ranked operators ( m 3 25 , m 3 9   m 3 17 ) consistently fail to reach acceptable solutions on any benchmark with high variance. A notable pattern is the systematic advantage of memes generated in the beginning of the optimization (lower index 1) over ones generated during the later stages (lower index 3). This is expected as those memes were generated for a broader exploration while those generated later were for fine tuning the small neighborhood.
The observed variation in meme performance is expected: operators are generated for specific stages of the optimization process and tailored to particular search contexts, not designed for universal benchmark efficiency. This context-specificity is a fundamental feature of the proposed framework, distinguishing it from approaches where operators are selected based on historical performance.

7. Discussions

7.1. State Vector Design and Its Influence on Meme Quality

The state vector s t constitutes the sole source of contextual information available to the LLM at the point of meme generation, and its design is therefore a critical determinant of the quality and diversity of generated operators. The eight metrics adopted in the present framework capture complementary aspects of search dynamics: convergence rate, stagnation, spatial spread, movement velocity, fitness heterogeneity, and landscape multimodality. They were selected to provide a compact yet informative description of the current optimization regime. Nevertheless, the choice of these particular metrics represents one of many possible state encodings, and the question of optimal state vector design remains open. Several alternative or complementary representations merit investigation. Problem-specific features, such as estimated gradient information, condition number approximations, or basin-of-attraction indicators could provide the LLM with richer landscape context. Dimensionality-aware metrics may also be important: the current formulation does not explicitly encode the problem dimension n , which influences the expected behavior of perturbation-based local search operators. Furthermore, the metrics are computed at the population level and may obscure individual-level variation; sub-population statistics or clustering-based descriptors could capture finer-grained search structure. A systematic ablation study varying the composition of s t would clarify which components most strongly influence the quality of generated memes, and whether a reduced or augmented state representation could improve overall framework performance.

7.2. The Role of the LLM and Prompt Sensitivity

The framework treats the LLM as a black-box code generator conditioned on s t and its behavior is therefore subject to all known properties and limitations of large language models. Several dimensions of LLM influence warrant explicit discussion. First, the choice of model is likely to affect both the diversity and the quality of generated operators. More capable models may produce structurally more varied and mathematically sophisticated local search strategies, while smaller or quantized models may default to simpler patterns. The present framework imposes no constraints on the underlying model, but empirical evaluation across models of varying capability would clarify the sensitivity of framework performance to this choice. Second, prompt design introduces a source of variability that is difficult to control systematically. The fixed prompt template adopted here encodes the state vector as a structured list of named scalar values. Alternative representations, like natural language descriptions of the current search regime or few-shot examples of effective operators observed in similar states, may elicit qualitatively different and potentially more effective responses. Prompt engineering for optimization contexts remains an underexplored area with significant practical implications. Third, LLMs are inherently stochastic: repeated calls with identical prompts may produce different operators. This non-determinism can be advantageous from a diversity standpoint but complicates reproducibility. Strategies for controlling or exploiting LLM stochasticity within the optimization loop for example, through temperature scheduling or ensemble generation followed by internal selection, represent a natural extension of the present framework.

7.3. Meme Library Dynamics and Selection Policy

The meme library L accumulates operators that have passed novelty filtering, but its growth dynamics and the quality of its contents depend on several interacting design choices. The library capacity L m a x controls the total number of LLM calls and determines the point at which the algorithm transitions to a purely library-based mode. A small L m a x limits adaptability; a large L m a x increases inference cost and may dilute the library with redundant operators if the novelty threshold τ is insufficiently strict. The current selection policy treats all library memes as equally probable alternatives to the newly accepted operator, assigning each a fixed probability θ . This formulation ignores the accumulated performance history of individual memes. A natural extension would weight library memes by their empirical success rate, for example, the fraction of applications that yielded fitness improvement, introducing a form of online reinforcement that gradually concentrates selection probability on the most effective operators.

7.4. Generalization, Transferability, and the Scope of the Meme Library

A deliberate design decision in the present framework is that the meme library does not persist across optimization runs. This choice is motivated by the observation that different problem instances induce qualitatively different search dynamics reflected in their respective state vectors and that operators effective for one landscape may be ineffective or counterproductive for another. Each run therefore constructs its own problem-specific portfolio of local search strategies from scratch. At the same time, this decision forecloses the possibility of cross-instance knowledge transfer, which represents a potentially valuable capability aligned with the fourth-generation memetic paradigm. If state vectors from previous runs were used as retrieval keys in a persistent memory structure, previously validated memes could be reused when a new run enters a similar search regime. Whether such transfer would be beneficial in practice depends on the degree to which s t captures transferable landscape features rather than instance-specific dynamics. This question connects the present framework to the broader literature on algorithm selection, meta-learning and transfer optimization, and represents a promising direction for future work.

7.5. Limitations and Open Questions

Several limitations of the present framework should be acknowledged explicitly. The similarity filtering mechanism relies on fixed thresholds τ = 0.9 for both GraphCodeBERT cosine similarity and Jaccard coefficient. These values were selected empirically and may not generalize across problem classes or LLMs: a threshold appropriate for one model’s output distribution may be too permissive or too strict for another. Adaptive or data-driven thresholds represent a natural improvement direction. However, it should be noted that in the experiments conducted, no acute sensitivity to this parameter was observed across the tested range, providing pragmatic support for the chosen value.
The framework applies the selected meme to the top 15% of the population by fitness, focusing local refinement on the most promising region of the search cloud. While this is consistent with standard memetic practice, it may reduce population diversity over time, particularly in multimodal landscapes where lower-fitness individuals occupy distinct basins of attraction. Alternative application strategies such as fitness-proportional selection, diversity-aware targeting, or periodic application to randomly selected individuals could mitigate this effect and warrant empirical investigation.
Finally, the computational overhead of LLM inference introduces a practical constraint absent in classical memetic algorithms. Although the framework mitigates this through library reuse with LLM calls ceasing once | L | = L m a x ∣, the latency of individual inference calls may be prohibitive in time-sensitive or resource-constrained applications. Lightweight surrogate models trained on accumulated prompt-response pairs, or retrieval-augmented generation over the existing library, could reduce this overhead while preserving operator diversity. These considerations highlight that the transition to fourth-generation memetic algorithms involves not only algorithmic novelty but also practical engineering challenges that require dedicated investigation.

8. Conclusions

This paper presented a fourth-generation memetic framework in which large language models are embedded directly into the optimization loop as adaptive generators of local search operators. The framework addresses a fundamental limitation of existing memetic algorithms: the reliance on manually designed, problem-specific meme pools fixed prior to execution. It enables on-demand meme synthesis conditioned on a structured representation of the current search state. The proposed state vector encodes eight complementary metrics capturing convergence dynamics, population geometry, fitness heterogeneity, and landscape multimodality, providing the LLM with sufficient context to generate operators adapted to the evolving search regime. A two-stage novelty filtering mechanism combining semantic and implementation-level similarity assessment ensures that the dynamically growing meme library accumulates genuinely diverse operators, while a cooldown-regulated triggering policy coordinates periodic and stagnation-based generation without redundant LLM calls.
Computational experiments on the CEC 2017 benchmark suite demonstrate that the proposed LLM-based variants AIM-PSO and AIM-WOAs consistently outperform their non-LLM counterparts across all baseline categories, including pure global search, single-meme and randomly selecting multi-memetic variants. AIM-WOA achieved the strongest overall performance with 15 pairwise wins (51.7%), followed by AIM-PSO with 8 wins (27.6%), confirming that the advantage of the framework is robust to the choice of the underlying global optimizer. Meme similarity analysis further validates the generation mechanism: LLM-generated operators are structurally distinct from classical local search baselines such as Nelder–Mead, exhibit meaningful variation across different search regimes, and display stable behavior when conditioned on similar state vectors. This collectively supports the hypothesis that LLMs are capable of generating genuinely novel and contextually appropriate optimization strategies.
The present work constitutes a concrete algorithmic realization of the fourth-generation memetic paradigm, which has previously remained largely conceptual. By formalizing the co-evolutionary relationship between candidate solutions and meme operators, and by grounding meme generation in empirical optimization feedback rather than offline design, the framework advances memetic computation toward a fully adaptive and general-purpose optimization methodology.
Several directions for future work follow naturally from the present framework. The fixed trigger interval Δ could be replaced by an adaptive policy that initiates LLM calls in response to detected regime changes in E t , while meme mutation, in other words prompting the LLM to refine an existing library operator based on its accumulated performance statistics, would introduce a Lamarckian element into library evolution. A hierarchical library organization indexed by search regime at the time of meme generation would enable more targeted operator retrieval. Finally, systematic evaluation across LLM architectures and model scales would allow the influence of model capability on meme quality, operator diversity, and code validity to be characterized, bridging the optimization and artificial intelligence perspectives on the proposed framework.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

All experimental results are available from the corresponding author upon request.

Acknowledgments

The author would like to thank the anonymous referee for their valuable comments and suggestions regarding the content of the paper.

Conflicts of Interest

The author declares no conflicts of interest.

References

  1. Danilchenko, V.I.; Danilchenko, Y.V.; Kureichik, V.M. Bio-inspired approach to microwave circuit design. In Proceedings of the 2020 IEEE East-West Design & Test Symposium (EWDTS), Varna, Bulgaria, 4–7 September 2020; pp. 362–366. [Google Scholar]
  2. Lu, X.; Sun, T.; Tang, K. Evolutionary optimization with hierarchical surrogates. Swarm Evol. Comput. 2019, 47, 21–32. [Google Scholar] [CrossRef]
  3. Sakharov, M.; Koledina, K.; Gubaydullin, I. Optimal Control of Hydrocarbons’ Hydrogenation with Catalysts. Mathematics 2024, 12, 3570. [Google Scholar] [CrossRef]
  4. Škvorc, U.; Eftimov, T.; Korošec, P. Understanding the problem space in single-objective numerical optimization using exploratory landscape analysis. Appl. Soft Comput. 2020, 90, 106138. [Google Scholar] [CrossRef]
  5. Höschel, K.; Vasudevan, L. Genetic algorithms for lens design: A review. J. Opt. 2019, 48, 134–144. [Google Scholar] [CrossRef]
  6. Sakharov, M.; Houllier, T.; Lépine, T. Optimal Design of Conventional and Freeform Optical Systems with Memetic Mind Evolutionary Computation Algorithm. In Proceedings of the Sixth International Scientific Conference “Intelligent Information Technologies for Industry”; Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2023; Volume 566. [Google Scholar] [CrossRef]
  7. Karpenko, A.; Agasiev, T.; Sakharov, M. Intellectualization Methods of Population Algorithms of Global Optimization. In Cyber-Physical Systems: Advances in Design & Modelling; Kravets, A., Bolshakov, A., Shcherbakov, M., Eds.; Studies in Systems, Decision and Control; Springer: Cham, Switzerland, 2020; Volume 259, pp. 137–151. [Google Scholar] [CrossRef]
  8. Xia, X.; Gui, L.; He, G.; Xie, C.; Wei, B.; Xing, Y.; Wu, R.; Tang, Y. A hybrid optimizer based on firefly algorithm and particle swarm optimization algorithm. J. Comput. Sci. 2018, 26, 488–500. [Google Scholar] [CrossRef]
  9. Narinder, S.; Kaur, J. Hybridizing sine–cosine algorithm with harmony search strategy for optimization design problems. Soft Comput. 2021, 25, 11053–11075. [Google Scholar] [CrossRef]
  10. Al-Betar, M.A.; Awadallah, M.A.; Zitar, R.A.; Assaleh, K. Economic load dispatch using memetic sine cosine algorithm. J. Ambient. Intell. Humaniz. Comput. 2023, 14, 11685–11713. [Google Scholar] [CrossRef]
  11. Altay ELİ, F.; Altay, O.; Özçevik, Y. A comparative study of metaheuristic optimization algorithms for solving real-world engineering design problems. CMES-Comput. Model. Eng. Sci. 2024, 139, 1039-1094. [Google Scholar] [CrossRef]
  12. Moscato, P.; Cotta, C. A modern introduction to memetic algorithms. In Handbook of Metaheuristics; Gendreau, M., Potvin, J.-Y., Eds.; Springer: Berlin/Heidelberg, Germany, 2010; pp. 141–183. [Google Scholar] [CrossRef]
  13. Dawkins, R. The Selfish Gene; Oxford University Press: Oxford, UK, 1976. [Google Scholar]
  14. Zhang, W.; Lan, Y.; Lin, A.; Xiao, M. An adaptive clustering routing protocol for wireless sensor networks based on a novel memetic algorithm. IEEE Sens. J. 2025, 25, 8929–8941. [Google Scholar] [CrossRef]
  15. Sakharov, M.; Koledina, K.; Gubaydullin, I.; Karpenko, A. Parallel memetic algorithm for optimal control of multi-stage catalytic reactions. Optim. Lett. 2023, 17, 981–1003. [Google Scholar] [CrossRef]
  16. Chang, X.; Jia, X.; Ren, J. A reinforcement learning enhanced memetic algorithm for multi-objective flexible job shop scheduling toward Industry 5.0. Int. J. Prod. Res. 2025, 63, 119–147. [Google Scholar] [CrossRef]
  17. Neri, F.; Cotta, C.; Moscato, P. Handbook of Memetic Algorithms; Springer: Berlin/Heidelberg, Germany, 2011; p. 368. [Google Scholar] [CrossRef]
  18. Sakharov, M. Introducing the Fourth Generation of Memetic Algorithms. In Proceedings of the Eighth International Scientific Conference “Intelligent Information Technologies for Industry” (IITI’24), Volume 2, IITI 2024; Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2024; Volume 1210. [Google Scholar] [CrossRef]
  19. Kennedy, J.; Eberhart, R. Particle swarm optimization. In Proceedings of the IEEE International Conference on Neural Networks (ICNN ‘95), Perth, WA, Australia, 27 November–1 December 1995; Volume 4, pp. 1942–1948. [Google Scholar]
  20. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67. [Google Scholar] [CrossRef]
  21. Mirjalili, S. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowl.-Based Syst. 2015, 89, 228–249. [Google Scholar] [CrossRef]
  22. Carlos, C.; Mathieson, L.; Moscato, P. Memetic algorithms. In Handbook of Heuristics; Springer Nature: Cham, Switzerland, 2025; pp. 809–850. [Google Scholar]
  23. Ong, Y.S.; Lim, M.H.; Meuth, R. A proposition on memes and meta-memes in computing for higher-order learning. Memetic Comput. 2009, 1, 85–100. [Google Scholar]
  24. Klimenko, A.; Elmekeev, M. Lamarckian Evolution Based Algorithm for Multi-Robot Path Planning Problem. In International Conference on Interactive Collaborative Robotics; Springer Nature: Cham, Switzerland, 2025. [Google Scholar]
  25. Liu, F.; Yao, Y.; Guo, P.; Yang, Z.; Lin, X.; Zhao, Z.; Tong, X.; Mao, K.; Lu, Z.; Wang, Z.; et al. A Systematic Survey on Large Language Models for Algorithm Design. ACM Comput. Surv. 2026, 58, 218. [Google Scholar] [CrossRef]
  26. Yepes, A.J.; Barnard, P. Evolutionary thoughts: Integration of large language models and evolutionary algorithms. arXiv 2025, arXiv:2505.05756. [Google Scholar] [CrossRef]
  27. Qi, F.; Wang, T.; Zheng, R.; Li, M. A Memetic and Reflective Evolution Framework for Automatic Heuristic Design Using Large Language Models. Appl. Sci. 2025, 15, 8735. [Google Scholar] [CrossRef]
  28. Li, R.; Wang, L.; Sang, H.; Yao, L.; Pan, L. LLM-Assisted Automatic Memetic Algorithm for Lot-Streaming Hybrid Job Shop Scheduling with Variable Sublots. IEEE Trans. Evol. Comput. 2026, 58, 1–32. [Google Scholar] [CrossRef]
  29. Liu, S.; Chen, C.; Qu, X.; Tang, K.; Ong, Y.S. Large Language Models as Evolutionary Optimizers. In Proceedings of the 2024 IEEE Congress on Evolutionary Computation (CEC), Yokohama, Japan, 30 June–5 July 2024; pp. 1–8. [Google Scholar] [CrossRef]
  30. Tian, H.; Han, X.; Wu, G.; Guo, A.; Zhou, Y.; Zhang, J.; Li, S.; Wei, J.; Zhang, T. An LLM-Empowered Adaptive Evolutionary Algorithm for Multi-Component Deep Learning Systems. Proc. AAAI Conf. Artif. Intell. 2025, 39, 20894–20902. [Google Scholar] [CrossRef]
  31. Van Stein, N.; Bäck, T. Llamea: A Large Language Model Evolutionary Algorithm for Automatically Generating Metaheuristics. IEEE Trans. Evol. Comput. 2024, 29, 331–345. [Google Scholar] [CrossRef]
  32. Pluhacek, M.; Kovac, J.; Viktorin, A.; Janku, P.; Kadavy, T.; Senkerik, R. Using LLM for Automatic Evolvement of Metaheuristics from Swarm Algorithm SOMA. In Proceedings of the Genetic and Evolutionary Computation Conference Companion (GECCO ‘24 Companion); Association for Computing Machinery: New York, NY, USA, 2024; pp. 2018–2022. [Google Scholar] [CrossRef]
  33. Wang, C.; Zhao, J.; Jiao, L.; Li, L.; Liu, F.; Yang, S. When Large Language Models Meet Evolutionary Algorithms: Potential Enhancements and Challenges. Research 2025, 8, 0646. [Google Scholar] [CrossRef] [PubMed]
  34. Yu, H.; Liu, J. Deep insights into automated optimization with large language models and evolutionary algorithms. arXiv 2024, arXiv:2410.20848. [Google Scholar] [CrossRef]
  35. Chinnasamy, S.; Ramachandran, M.; Amudha, M.; Ramu, K. A review on hill climbing optimization methodology. Recent Trends Manag. Commer. 2022, 3, 1–7. [Google Scholar]
  36. Sakharov, M. A New Assessment Framework for LLM-Generated Optimization Methods. In Proceedings of the Ninth International Scientific Conference “Intelligent Information Technologies for Industry”, Volume 2; IITI 2025; Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2026; Volume 1763. [Google Scholar] [CrossRef]
  37. Guo, D.; Ren, S.; Lu, S.; Feng, Z.; Tang, D.; Liu, S.; Zhou, L.; Duan, N.; Svyatkovskiy, A.; Fu, S.; et al. Graphcodebert: Pre-training code representations with data flow. arXiv 2020, arXiv:2009.08366. [Google Scholar]
  38. Bag, S.; Kumar, S.K.; Tiwari, M.K. An efficient recommendation generation using relevant Jaccard similarity. Inf. Sci. 2019, 483, 53–64. [Google Scholar] [CrossRef]
  39. Wu, G.; Mallipeddi, R.; Suganthan, P.N. Problem Definitions and Evaluation Criteria for the CEC 2017 Competition on Constrained Real-Parameter Optimization; Technical Report; National University of Defense Technology: Changsha, China; Kyungpook National University: Daegu, Republic of Korea; Nanyang Technological University: Singapore, 2017. [Google Scholar]
  40. Nelder, J.A.; Mead, R. A simplex method for function minimization. Comput. J. 1965, 7, 308–313. [Google Scholar] [CrossRef]
  41. Nassef, A.M.; Abdelkareem, M.A.; Maghrabie, H.M.; Baroutaji, A. The role of random walk-based techniques in enhancing metaheuristic optimization algorithms—A systematic and comprehensive review. IEEE Access 2024, 12, 139573–139608. [Google Scholar] [CrossRef]
  42. Piotrowski, A.P.; Napiorkowski, J.J.; Piotrowska, A.E. Choice of benchmark optimization problems does matter. Swarm Evol. Comput. 2023, 83, 101378. [Google Scholar] [CrossRef]
Figure 1. The block diagram of the proposed framework.
Figure 1. The block diagram of the proposed framework.
Biomimetics 11 00383 g001
Figure 2. Distribution of objective function values for all algorithms across all benchmark functions and all runs.
Figure 2. Distribution of objective function values for all algorithms across all benchmark functions and all runs.
Biomimetics 11 00383 g002
Figure 3. Similarity analysis of the results for the first and the last memes generated for functions F7, F15, F25 and their comparison with the Nelder–Mead method.
Figure 3. Similarity analysis of the results for the first and the last memes generated for functions F7, F15, F25 and their comparison with the Nelder–Mead method.
Biomimetics 11 00383 g003
Figure 4. Distribution of pairwise comparisons for 30 randomly selected generated memes.
Figure 4. Distribution of pairwise comparisons for 30 randomly selected generated memes.
Biomimetics 11 00383 g004
Table 1. Experimental results for the considered optimization algorithms.
Table 1. Experimental results for the considered optimization algorithms.
Benchmark FunctionPSOWOAPSO-NMWOA-NMMMPSOMMWOAAIM-PSOAIM-WOA
F 1 F ¯ = 8.1 × 10 9 F ¯ = 9.4 × 10 6 F ¯ = 1.1 × 10 4 F ¯ = 1.5 × 10 7 F ¯ = 1.1 × 10 4 F ¯ = 1.7 × 10 6 F ¯ = 6.9 × 10 3 F ¯ = 1.8 × 10 3
σ = 5.5 × 10 9 σ = 4.1 × 10 6 σ = 7.9 × 10 3 σ = 2.2 × 10 7 σ = 7.4 × 10 3 σ = 8.7 × 10 5 σ = 7.8 × 10 3 σ = 3.1 × 10 3
F m i n = 1.8 × 10 4 F m i n = 4.8 × 10 6 F m i n = 5.6 × 10 2 F m i n = 2.0 × 10 6 F m i n = 1.7 × 10 2 F m i n = 6.7 × 10 5 F m i n = 1.0 × 10 2 F m i n = 1.0 × 10 2
F 2 F ¯ = 3.1 × 10 3 F ¯ = 5.4 × 10 2 F ¯ = 2.0 × 10 2 F ¯ = 1.2 × 10 3 F ¯ = 2.0 × 10 2 F ¯ = 2.1 × 10 2 F ¯ = 2.0 × 10 2 F ¯ = 2.0 × 10 2
σ = 2.8 × 10 3 σ = 3.8 × 10 2 σ = 2.7 × 10 14 σ = 1.1 × 10 3 σ = 1.6 × 10 14 σ = 2.9 × 10 0 σ = 2.1 × 10 14 σ = 6.1 × 10 8
F m i n = 2.0 × 10 2 F m i n = 3.1 × 10 2 F m i n = 2.0 × 10 2 F m i n = 3.1 × 10 2 F m i n = 2.0 × 10 2 F m i n = 2.0 × 10 2 F m i n = 2.0 × 10 2 F m i n = 2.0 × 10 2
F 3 F ¯ = 8.8 × 10 2 F ¯ = 4.4 × 10 2 F ¯ = 3.4 × 10 2 F ¯ = 4.6 × 10 2 F ¯ = 3.5 × 10 2 F ¯ = 3.8 × 10 2 F ¯ = 3.3 × 10 2 F ¯ = 3.1 × 10 2
σ = 5.1 × 10 2 σ = 8.3 × 10 1 σ = 1.9 × 10 1 σ = 5.6 × 10 1 σ = 2.8 × 10 1 σ = 3.9 × 10 1 σ = 9.3 × 10 0 σ = 1.5 × 10 1
F m i n = 3.3 × 10 2 F m i n = 3.4 × 10 2 F m i n = 3.3 × 10 2 F m i n = 3.3 × 10 2 F m i n = 3.3 × 10 2 F m i n = 3.3 × 10 2 F m i n = 3.0 × 10 2 F m i n = 3.0 × 10 2
F 4 F ¯ = 1.1 × 10 4 F ¯ = 1.6 × 10 3 F ¯ = 5.6 × 10 2 F ¯ = 1.5 × 10 3 F ¯ = 5.5 × 10 2 F ¯ = 8.6 × 10 2 F ¯ = 4.8 × 10 2 F ¯ = 4.3 × 10 2
σ = 5.5 × 10 3 σ = 3.4 × 10 2 σ = 6.4 × 10 1 σ = 3.1 × 10 2 σ = 4.1 × 10 1 σ = 1.4 × 10 2 σ = 2.1 × 10 1 σ = 6.5 × 10 0
F m i n = 2.6 × 10 3 F m i n = 1.1 × 10 3 F m i n = 4.7 × 10 2 F m i n = 1.1 × 10 3 F m i n = 4.9 × 10 2 F m i n = 6.7 × 10 2 F m i n = 4.4 × 10 2 F m i n = 4.2 × 10 2
F 5 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2 F ¯ = 5.0 × 10 2
σ = 3.1 × 10 4 σ = 6.0 × 10 3 σ = 6.4 × 10 4 σ = 1.1 × 10 3 σ = 7.4 × 10 4 σ = 3.4 × 10 3 σ = 1.9 × 10 4 σ = 7.5 × 10 4
F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2 F m i n = 5.0 × 10 2
F 6 F ¯ = 3.7 × 10 5 F ¯ = 3.1 × 10 3 F ¯ = 7.5 × 10 2 F ¯ = 2.3 × 10 3 F ¯ = 7.6 × 10 2 F ¯ = 1.1 × 10 3 F ¯ = 7.1 × 10 2 F ¯ = 6.7 × 10 2
σ = 2.1 × 10 5 σ = 7.1 × 10 2 σ = 4.2 × 10 1 σ = 1.6 × 10 3 σ = 4.0 × 10 1 σ = 1.8 × 10 2 σ = 2.4 × 10 1 σ = 1.4 × 10 1
F m i n = 4.8 × 10 4 F m i n = 2.1 × 10 3 F m i n = 6.8 × 10 2 F m i n = 9.5 × 10 2 F m i n = 7.0 × 10 2 F m i n = 9.3 × 10 2 F m i n = 6.7 × 10 2 F m i n = 6.3 × 10 2
F 7 F ¯ = 9.5 × 10 2 F ¯ = 1.3 × 10 3 F ¯ = 9.7 × 10 2 F ¯ = 1.2 × 10 3 F ¯ = 9.7 × 10 2 F ¯ = 1.2 × 10 3 F ¯ = 9.9 × 10 2 F ¯ = 1.2 × 10 3
σ = 7.7 × 10 1 σ = 1.1 × 10 2 σ = 6.5 × 10 1 σ = 1.5 × 10 2 σ = 8.3 × 10 1 σ = 1.2 × 10 2 σ = 5.1 × 10 1 σ = 1.2 × 10 2
F m i n = 8.7 × 10 2 F m i n = 1.1 × 10 3 F m i n = 8.7 × 10 2 F m i n = 9.4 × 10 2 F m i n = 8.5 × 10 2 F m i n = 1.0 × 10 3 F m i n = 8.8 × 10 2 F m i n = 9.5 × 10 2
F 8 F ¯ = 8.1 × 10 2 F ¯ = 8.1 × 10 2 F ¯ = 8.1 × 10 2 F ¯ = 8.2 × 10 2 F ¯ = 8.1 × 10 2 F ¯ = 8.2 × 10 2 F ¯ = 8.1 × 10 2 F ¯ = 8.2 × 10 2
σ = 3.8 × 10 0 σ = 7.1 × 10 0 σ = 5.1 × 10 0 σ = 1.1 × 10 1 σ = 5.9 × 10 0 σ = 6.9 × 10 0 σ = 6.2 × 10 0 σ = 7.4 × 10 0
F m i n = 8.1 × 10 2 F m i n = 8.1 × 10 2 F m i n = 8.1 × 10 2 F m i n = 8.0 × 10 2 F m i n = 8.0 × 10 2 F m i n = 8.1 × 10 2 F m i n = 8.0 × 10 2 F m i n = 8.1 × 10 2
F 9 F ¯ = 4.3 × 10 3 F ¯ = 5.8 × 10 3 F ¯ = 4.5 × 10 3 F ¯ = 5.6 × 10 3 F ¯ = 4.3 × 10 3 F ¯ = 5.6 × 10 3 F ¯ = 3.7 × 10 3 F ¯ = 4.5 × 10 3
σ = 6.2 × 10 2 σ = 6.8 × 10 2 σ = 8.3 × 10 2 σ = 9.2 × 10 2 σ = 4.3 × 10 2 σ = 6.9 × 10 2 σ = 3.8 × 10 2 σ = 7.7 × 10 2
F m i n = 2.9 × 10 3 F m i n = 4.2 × 10 3 F m i n = 3.2 × 10 3 F m i n = 4.2 × 10 3 F m i n = 3.5 × 10 3 F m i n = 4.8 × 10 3 F m i n = 2.9 × 10 3 F m i n = 3.6 × 10 3
F 10 F ¯ = 5.9 × 10 5 F ¯ = 1.3 × 10 5 F ¯ = 1.7 × 10 4 F ¯ = 8.6 × 10 4 F ¯ = 8.5 × 10 3 F ¯ = 9.5 × 10 4 F ¯ = 6.4 × 10 3 F ¯ = 6.7 × 10 3
σ = 1.1 × 10 6 σ = 3.7 × 10 4 σ = 2.7 × 10 4 σ = 2.7 × 10 4 σ = 1.3 × 10 4 σ = 2.2 × 10 4 σ = 6.6 × 10 3 σ = 6.1 × 10 3
F m i n = 9.1 × 10 4 F m i n = 4.6 × 10 4 F m i n = 1.3 × 10 3 F m i n = 5.7 × 10 4 F m i n = 1.2 × 10 3 F m i n = 5.7 × 10 4 F m i n = 1.1 × 10 3 F m i n = 1.1 × 10 3
F 11 F ¯ = 5.4 × 10 8 F ¯ = 3.7 × 10 7 F ¯ = 2.8 × 10 6 F ¯ = 9.7 × 10 6 F ¯ = 2.2 × 10 6 F ¯ = 1.3 × 10 7 F ¯ = 1.1 × 10 4 F ¯ = 1.2 × 10 4
σ = 8.4 × 10 8 σ = 2.6 × 10 7 σ = 2.4 × 10 6 σ = 1.1 × 10 7 σ = 3.2 × 10 6 σ = 1.1 × 10 7 σ = 3.6 × 10 3 σ = 5.4 × 10 3
F m i n = 1.2 × 10 5 F m i n = 5.5 × 10 6 F m i n = 6.7 × 10 4 F m i n = 7.5 × 10 5 F m i n = 2.2 × 10 4 F m i n = 2.7 × 10 6 F m i n = 3.8 × 10 3 F m i n = 4.4 × 10 3
F 12 F ¯ = 2.2 × 10 8 F ¯ = 3.6 × 10 6 F ¯ = 8.6 × 10 4 F ¯ = 7.7 × 10 5 F ¯ = 7.0 × 10 4 F ¯ = 6.8 × 10 5 F ¯ = 1.2 × 10 4 F ¯ = 1.6 × 10 3
σ = 4.5 × 10 8 σ = 1.4 × 10 6 σ = 2.6 × 10 4 σ = 1.3 × 10 6 σ = 2.5 × 10 4 σ = 9.8 × 10 5 σ = 6.0 × 10 3 σ = 6.7 × 10 2
F m i n = 1.1 × 10 5 F m i n = 1.6 × 10 6 F m i n = 4.8 × 10 4 F m i n = 6.0 × 10 4 F m i n = 2.6 × 10 4 F m i n = 7.7 × 10 4 F m i n = 1.5 × 10 3 F m i n = 1.2 × 10 3
F 13 F ¯ = 1.1 × 10 5 F ¯ = 1.4 × 10 6 F ¯ = 1.9 × 10 5 F ¯ = 2.2 × 10 5 F ¯ = 1.4 × 10 5 F ¯ = 6.6 × 10 5 F ¯ = 6.7 × 10 4 F ¯ = 2.9 × 10 5
σ = 7.2 × 10 4 σ = 1.3 × 10 6 σ = 1.1 × 10 5 σ = 3.4 × 10 6 σ = 9.7 × 10 4 σ = 6.5 × 10 5 σ = 6.1 × 10 4 σ = 2.9 × 10 5
F m i n = 3.6 × 10 4 F m i n = 2.7 × 10 5 F m i n = 4.8 × 10 4 F m i n = 2.4 × 10 5 F m i n = 1.4 × 10 4 F m i n = 1.8 × 10 5 F m i n = 2.3 × 10 3 F m i n = 2.5 × 10 3
F 14 F ¯ = 2.9 × 10 7 F ¯ = 8.1 × 10 5 F ¯ = 2.3 × 10 7 F ¯ = 1.2 × 10 5 F ¯ = 1.1 × 10 5 F ¯ = 1.7 × 10 5 F ¯ = 1.3 × 10 4 F ¯ = 1.6 × 10 4
σ = 7.3 × 10 7 σ = 5.1 × 10 5 σ = 6.8 × 10 7 σ = 4.9 × 10 4 σ = 5.7 × 10 4 σ = 1.4 × 10 5 σ = 8.3 × 10 3 σ = 9.7 × 10 3
F m i n = 9.5 × 10 4 F m i n = 1.9 × 10 5 F m i n = 3.3 × 10 4 F m i n = 5.5 × 10 4 F m i n = 2.4 × 10 4 F m i n = 6.9 × 10 4 F m i n = 2.8 × 10 3 F m i n = 2.3 × 10 3
F 15 F ¯ = 9.1 × 10 6 F ¯ = 4.8 × 10 5 F ¯ = 3.5 × 10 3 F ¯ = 6.9 × 10 5 F ¯ = 2.9 × 10 3 F ¯ = 2.4 × 10 4 F ¯ = 2.1 × 10 3 F ¯ = 1.9 × 10 3
σ = 2.8 × 10 7 σ = 6.1 × 10 5 σ = 1.2 × 10 3 σ = 6.3 × 10 5 σ = 1.7 × 10 3 σ = 1.2 × 10 4 σ = 4.2 × 10 2 σ = 3.9 × 10 2
F m i n = 6.1 × 10 3 F m i n = 2.3 × 10 4 F m i n = 1.9 × 10 3 F m i n = 4.4 × 10 3 F m i n = 1.5 × 10 3 F m i n = 4.6 × 10 3 F m i n = 1.5 × 10 3 F m i n = 1.5 × 10 3
F 16 F ¯ = 2.5 × 10 5 F ¯ = 5.9 × 10 4 F ¯ = 2.5 × 10 3 F ¯ = 2.9 × 10 4 F ¯ = 2.3 × 10 3 F ¯ = 1.6 × 10 4 F ¯ = 2.1 × 10 3 F ¯ = 1.8 × 10 3
σ = 6.9 × 10 5 σ = 3.5 × 10 4 σ = 4.6 × 10 2 σ = 1.2 × 10 4 σ = 5.3 × 10 2 σ = 8.7 × 10 3 σ = 4.2 × 10 3 σ = 3.7 × 10 3
F m i n = 1.2 × 10 4 F m i n = 9.3 × 10 3 F m i n = 1.6 × 10 3 F m i n = 9.2 × 10 3 F m i n = 1.6 × 10 3 F m i n = 4.3 × 10 3 F m i n = 1.6 × 10 3 F m i n = 1.6 × 10 3
F 17 F ¯ = 9.8 × 10 4 F ¯ = 7.1 × 10 5 F ¯ = 8.3 × 10 4 F ¯ = 3.8 × 10 5 F ¯ = 8.9 × 10 4 F ¯ = 3.1 × 10 5 F ¯ = 3.7 × 10 4 F ¯ = 1.6 × 10 5
σ = 7.6 × 10 4 σ = 6.4 × 10 5 σ = 6.2 × 10 4 σ = 2.5 × 10 5 σ = 6.5 × 10 4 σ = 2.1 × 10 5 σ = 2.0 × 10 4 σ = 1.1 × 10 5
F m i n = 1.2 × 10 4 F m i n = 1.1 × 10 5 F m i n = 1.7 × 10 4 F m i n = 1.0 × 10 5 F m i n = 1.7 × 10 4 F m i n = 5.2 × 10 4 F m i n = 9.9 × 10 3 F m i n = 2.9 × 10 4
F 18 F ¯ = 2.9 × 10 9 F ¯ = 1.3 × 10 5 F ¯ = 1.1 × 10 7 F ¯ = 7.4 × 10 4 F ¯ = 8.4 × 10 4 F ¯ = 7.6 × 10 4 F ¯ = 3.4 × 10 4 F ¯ = 8.8 × 10 3
σ = 5.9 × 10 9 σ = 6.3 × 10 4 σ = 3.5 × 10 7 σ = 3.2 × 10 4 σ = 4.9 × 10 4 σ = 4.1 × 10 4 σ = 2.7 × 10 4 σ = 9.8 × 10 3
F m i n = 3.4 × 10 4 F m i n = 3.4 × 10 4 F m i n = 1.6 × 10 4 F m i n = 2.8 × 10 4 F m i n = 1.4 × 10 4 F m i n = 2.1 × 10 4 F m i n = 2.4 × 10 3 F m i n = 1.8 × 10 3
F 19 F ¯ = 2.4 × 10 3 F ¯ = 4.1 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 3.8 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 3.8 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 2.1 × 10 3
σ = 3.9 × 10 2 σ = 9.3 × 10 2 σ = 3.4 × 10 2 σ = 5.5 × 10 2 σ = 3.4 × 10 2 σ = 7.2 × 10 2 σ = 3.0 × 10 2 σ = 2.4 × 10 2
F m i n = 1.9 × 10 3 F m i n = 3.1 × 10 3 F m i n = 1.9 × 10 3 F m i n = 3.0 × 10 3 F m i n = 1.9 × 10 3 F m i n = 2.6 × 10 3 F m i n = 1.9 × 10 3 F m i n = 1.9 × 10 3
F 20 F ¯ = 6.9 × 10 3 F ¯ = 2.9 × 10 3 F ¯ = 2.3 × 10 3 F ¯ = 2.9 × 10 3 F ¯ = 2.3 × 10 3 F ¯ = 2.5 × 10 3 F ¯ = 2.2 × 10 3 F ¯ = 2.2 × 10 3
σ = 3.5 × 10 3 σ = 7.7 × 10 2 σ = 1.2 × 10 2 σ = 6.0 × 10 2 σ = 1.3 × 10 2 σ = 4.2 × 10 2 σ = 1.0 × 10 2 σ = 7.9 × 10 1
F m i n = 3.4 × 10 3 F m i n = 2.1 × 10 3 F m i n = 2.1 × 10 3 F m i n = 2.2 × 10 3 F m i n = 2.1 × 10 3 F m i n = 2.1 × 10 3 F m i n = 2.1 × 10 3 F m i n = 2.1 × 10 3
F 21 F ¯ = 2.5 × 10 3 F ¯ = 2.6 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 2.8 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 2.7 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 2.7 × 10 3
σ = 9.1 × 10 1 σ = 1.9 × 10 2 σ = 6.1 × 10 1 σ = 5.1 × 10 2 σ = 4.4 × 10 1 σ = 2.5 × 10 2 σ = 3.1 × 10 1 σ = 2.7 × 10 12
F m i n = 2.4 × 10 3 F m i n = 2.4 × 10 3 F m i n = 2.3 × 10 3 F m i n = 2.4 × 10 3 F m i n = 2.3 × 10 3 F m i n = 2.4 × 10 3 F m i n = 2.3 × 10 3 F m i n = 2.4 × 10 3
F 22 F ¯ = 1.7 × 10 4 F ¯ = 2.9 × 10 3 F ¯ = 2.4 × 10 3 F ¯ = 2.8 × 10 3 F ¯ = 2.5 × 10 3 F ¯ = 2.6 × 10 3 F ¯ = 2.5 × 10 3 F ¯ = 2.5 × 10 3
σ = 4.1 × 10 3 σ = 4.9 × 10 2 σ = 7.8 × 10 1 σ = 7.0 × 10 2 σ = 2.2 × 10 2 σ = 2.1 × 10 2 σ = 1.1 × 10 2 σ = 7.1 × 10 1
F m i n = 7.8 × 10 3 F m i n = 2.6 × 10 3 F m i n = 2.3 × 10 3 F m i n = 2.5 × 10 3 F m i n = 2.3 × 10 3 F m i n = 2.5 × 10 3 F m i n = 2.4 × 10 3 F m i n = 2.4 × 10 3
F 23 F ¯ = 1.2 × 10 4 F ¯ = 2.7 × 10 3 F ¯ = 2.5 × 10 3 F ¯ = 2.8 × 10 3 F ¯ = 2.5 × 10 3 F ¯ = 2.6 × 10 3 F ¯ = 2.5 × 10 3 F ¯ = 2.5 × 10 3
σ = 2.5 × 10 3 σ = 2.6 × 10 2 σ = 8.6 × 10 1 σ = 3.9 × 10 2 σ = 1.1 × 10 2 σ = 1.3 × 10 2 σ = 4.2 × 10 1 σ = 3.6 × 10 1
F m i n = 6.8 × 10 3 F m i n = 2.6 × 10 3 F m i n = 2.5 × 10 3 F m i n = 2.6 × 10 3 F m i n = 2.5 × 10 3 F m i n = 2.6 × 10 3 F m i n = 2.5 × 10 3 F m i n = 2.5 × 10 3
F 24 F ¯ = 3.6 × 10 3 F ¯ = 2.9 × 10 3 F ¯ = 2.8 × 10 3 F ¯ = 2.9 × 10 3 F ¯ = 2.8 × 10 3 F ¯ = 2.9 × 10 3 F ¯ = 2.8 × 10 3 F ¯ = 2.8 × 10 3
σ = 2.1 × 10 2 σ = 5.9 × 10 1 σ = 3.0 × 10 1 σ = 4.0 × 10 1 σ = 5.3 × 10 0 σ = 4.6 × 10 1 σ = 1.9 × 10 1 σ = 1.7 × 10 1
F m i n = 2.8 × 10 3 F m i n = 2.8 × 10 3 F m i n = 2.8 × 10 3 F m i n = 3.1 × 10 3 F m i n = 2.8 × 10 3 F m i n = 2.8 × 10 3 F m i n = 2.8 × 10 3 F m i n = 2.8 × 10 3
F 25 F ¯ = 3.6 × 10 3 F ¯ = 3.3 × 10 3 F ¯ = 3.6 × 10 3 F ¯ = 3.4 × 10 3 F ¯ = 3.4 × 10 3 F ¯ = 3.4 × 10 3 F ¯ = 3.3 × 10 3 F ¯ = 3.3 × 10 3
σ = 1.2 × 10 2 σ = 9.9 × 10 0 σ = 3.3 × 10 2 σ = 1.6 × 10 1 σ = 8.4 × 10 1 σ = 2.9 × 10 1 σ = 3.1 × 10 2 σ = 1.5 × 10 1
F m i n = 3.4 × 10 3 F m i n = 3.3 × 10 3 F m i n = 3.3 × 10 3 F m i n = 3.3 × 10 3 F m i n = 3.3 × 10 3 F m i n = 3.3 × 10 3 F m i n = 3.3 × 10 3 F m i n = 3.2 × 10 3
F 26 F ¯ = 3.2 × 10 3 F ¯ = 3.3 × 10 3 F ¯ = 3.2 × 10 3 F ¯ = 3.3 × 10 3 F ¯ = 3.2 × 10 3 F ¯ = 3.3 × 10 3 F ¯ = 3.1 × 10 3 F ¯ = 3.2 × 10 3
σ = 5.9 × 10 1 σ = 1.2 × 10 2 σ = 4.4 × 10 1 σ = 8.3 × 10 1 σ = 3.1 × 10 1 σ = 1.5 × 10 2 σ = 2.9 × 10 1 σ = 3.9 × 10 1
F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3 F m i n = 3.1 × 10 3
F 27 F ¯ = 3.7 × 10 3 F ¯ = 3.1 × 10 3 F ¯ = 2.7 × 10 3 F ¯ = 3.2 × 10 3 F ¯ = 2.7 × 10 3 F ¯ = 3.1 × 10 3 F ¯ = 2.7 × 10 3 F ¯ = 2.7 × 10 3
σ = 4.1 × 10 2 σ = 7.3 × 10 1 σ = 1.1 × 10 1 σ = 4.0 × 10 1 σ = 1.1 × 10 1 σ = 1.9 × 10 2 σ = 1.2 × 10 1 σ = 3.9 × 10 4
F m i n = 2.9 × 10 3 F m i n = 2.9 × 10 3 F m i n = 2.7 × 10 3 F m i n = 3.1 × 10 3 F m i n = 2.7 × 10 3 F m i n = 2.7 × 10 3 F m i n = 2.7 × 10 3 F m i n = 2.7 × 10 3
F 28 F ¯ = 2.2 × 10 8 F ¯ = 7.2 × 10 5 F ¯ = 3.4 × 10 6 F ¯ = 7.6 × 10 6 F ¯ = 1.1 × 10 4 F ¯ = 7.4 × 10 4 F ¯ = 7.9 × 10 3 F ¯ = 6.6 × 10 3
σ = 3.1 × 10 8 σ = 2.1 × 10 6 σ = 1.5 × 10 8 σ = 2.9 × 10 7 σ = 1.2 × 10 4 σ = 5.5 × 10 4 σ = 2.0 × 10 3 σ = 1.6 × 10 3
F m i n = 3.7 × 10 4 F m i n = 4.3 × 10 4 F m i n = 4.9 × 10 3 F m i n = 5.0 × 10 4 F m i n = 5.6 × 10 3 F m i n = 2.5 × 10 4 F m i n = 4.4 × 10 3 F m i n = 4.7 × 10 3
F 29 F ¯ = 1.2 × 10 9 F ¯ = 9.3 × 10 6 F ¯ = 1.5 × 10 8 F ¯ = 1.7 × 10 7 F ¯ = 2.9 × 10 5 F ¯ = 4.9 × 10 6 F ¯ = 4.4 × 10 4 F ¯ = 2.3 × 10 4
σ = 2.6 × 10 9 σ = 1.4 × 10 7 σ = 3.3 × 10 8 σ = 2.8 × 10 7 σ = 7.6 × 10 5 σ = 5.4 × 10 6 σ = 2.3 × 10 4 σ = 2.1 × 10 4
F m i n = 8.9 × 10 4 F m i n = 3.7 × 10 5 F m i n = 4.8 × 10 4 F m i n = 1.5 × 10 5 F m i n = 2.4 × 10 4 F m i n = 1.4 × 10 5 F m i n = 6.3 × 10 3 F m i n = 5.2 × 10 3
Rank 6.41 6.51 3.96 6.31 3.24 5.34 1.86 2.34
Bold denotes the best result per function.
Table 2. Composite efficiency scores of LLM-generated meme operators averaged across five benchmark functions; lower values indicate better performance.
Table 2. Composite efficiency scores of LLM-generated meme operators averaged across five benchmark functions; lower values indicate better performance.
Top 5 Generated MemesComposite
Score
Last 5 Generated MemesComposite
Score
m 1 13 0.0445 m 3 11 0.7685
m 1 1 0.0761 m 1 17 0.7723
m 3 15 0.1032 m 3 17 0.7789
m 1 9 0.1106 m 3 9 0.8108
m 1 29 0.1165 m 3 25 0.8413
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Sakharov, M. A Framework for Integrating Large Language Models into Memetic Algorithms. Biomimetics 2026, 11, 383. https://doi.org/10.3390/biomimetics11060383

AMA Style

Sakharov M. A Framework for Integrating Large Language Models into Memetic Algorithms. Biomimetics. 2026; 11(6):383. https://doi.org/10.3390/biomimetics11060383

Chicago/Turabian Style

Sakharov, Maxim. 2026. "A Framework for Integrating Large Language Models into Memetic Algorithms" Biomimetics 11, no. 6: 383. https://doi.org/10.3390/biomimetics11060383

APA Style

Sakharov, M. (2026). A Framework for Integrating Large Language Models into Memetic Algorithms. Biomimetics, 11(6), 383. https://doi.org/10.3390/biomimetics11060383

Article Metrics

Back to TopTop