Article

High-Value Patents Recognition with Random Forest and Enhanced Fire Hawk Optimization Algorithm

1 Key Laboratory of Ecological Safety and Sustainable Development in Arid Lands, Northwest Institute of Eco-Environment and Resources, Chinese Academy of Sciences, Lanzhou 730000, China
2 Key Laboratory of Knowledge Computing and Intelligent Decision, Lanzhou 730000, China
* Author to whom correspondence should be addressed.
Biomimetics 2025, 10(9), 561; https://doi.org/10.3390/biomimetics10090561
Submission received: 18 July 2025 / Revised: 18 August 2025 / Accepted: 21 August 2025 / Published: 23 August 2025
(This article belongs to the Special Issue Biomimicry for Optimization, Control, and Automation: 3rd Edition)

Abstract

High-value patents are a key indicator of new product development, the emergence of innovative technology, and a source of innovation incentives. Multiple studies have shown that patent value exhibits a significantly skewed distribution, with only about 10% of patents having high value. Identifying high-value patents in advance from a large volume of patent data has become a crucial problem that needs to be addressed urgently. However, current machine learning methods often rely on manual hyperparameter tuning, which is time-consuming and prone to suboptimal results. Existing optimization algorithms also suffer from slow convergence and local optima issues, limiting their effectiveness on complex patent datasets. In this paper, machine learning and intelligent optimization algorithms are combined to process and analyze patent data. The Fire Hawk Optimization Algorithm (FHO) is a novel intelligence algorithm proposed in recent years, inspired by the process in nature where Fire Hawks capture prey by setting fires. This paper first proposes the Enhanced Fire Hawk Optimizer (EFHO), which combines four strategies, namely adaptive tent chaotic mapping, hunting prey, inertia weight, and an enhanced flee strategy, to address the exploitation weaknesses of FHO. Benchmark tests demonstrate EFHO’s superior convergence speed, accuracy, and robustness across standard optimization benchmarks. As a representative real-world application, EFHO is employed to optimize Random Forest hyperparameters for high-value patent recognition. While other intelligent optimizers could be applied, EFHO effectively overcomes common issues like slow convergence and local optima trapping. Compared to other classification methods, the EFHO-optimized Random Forest achieves superior accuracy and classification stability. This study fills a research gap in effective hyperparameter tuning for patent recognition and demonstrates EFHO’s practical value on real-world patent datasets.

1. Introduction

High-value patent recognition, also called patent evaluation or patent value assessment, refers to the process of identifying patents that hold high technical, market, economic, legal, or strategic value [1]. High-value patents are a key indicator of new product development, the emergence of innovative technology, and a source of innovation incentives [2]. These patents carry important innovations and technological breakthroughs, and have a significant influence on the development of business and society [3]. From an enterprise perspective, the identification and acquisition of high-value patents not only guide technological development and optimize resource allocation but also prevent redundant research efforts and decrease risks associated with innovation. Moreover, such patents enhance technological barriers and protect core competitive advantages, thereby promoting sustainable growth and long-term profitability. At the national level, and for science and technology management departments, accurately identifying high-value patents supports the efficient allocation of resources and informs the development of effective policy strategies [4]. However, low-value patents consume a lot of innovation resources, which may cause firms to innovate more slowly if those patents are not converted or commercialized. This would impede a nation’s industrial development, technological advancement, productivity growth, and economic prosperity [5,6]. Therefore, recognizing high-value patents among a huge number of patents is crucial for the cultivation, application, protection, and management of high-value patents in government departments, scientific research institutions, and enterprises, and it is also a hot topic of academic research [7,8].
With the progress of complex networks and machine learning methods, it has become possible to identify high-value patents from large-scale data [9]. Machine learning can effectively deal with large-scale, high-dimensional, and structured patent data, extract deep-seated features of patents, and improve the accuracy of patent value identification [10,11,12]. Hido et al. [13] introduced a machine learning and text mining-based tool that assesses patent application quality by computing a score that predicts the likelihood of approval, outperforming conventional methods by utilizing a large dataset and a new statistical prediction model. Wu et al. [14] established a patent quality classification model using support vector machine, self-organizing map, and kernel principal component analysis, developed a patent quality analysis and classification system based on it, and demonstrated the advantages of the method experimentally. Based on multiple relevant metrics that can be accessed immediately after patent issuance, Lee et al. [15] provide a machine learning method for early detection of emergent inventions. Their method can help with responsive technology forecasting and planning, as demonstrated by a pharmaceutical technology case. A deep learning analytical approach was also used by Trappey et al. [16] to automate the evaluation of patent value in the Internet of Things domain. Using the provided dataset, they first identified important patent value indicators using principal component analysis. After a total of 11 value indicators were chosen, a Deep Neural Network (DNN) was utilized to forecast values. The outcomes demonstrated that the DNN model outperformed the conventional Back-Propagation Neural Network in terms of accuracy. Kwon et al. [17] address the identification of promising inventions by using patent-based machine learning techniques that incorporate the quality of knowledge accumulation as an input variable, finding it to be the most important predictor when compared to other traditional indicators. Hu et al. [1] experimented with five machine learning algorithms based on multidimensional value patent portfolios for high-value patent identification; the Random Forest approach performed best overall. Liu et al. [18] proposed a multi-task learning framework that unifies the identification of high-value patents and standard-essential patents, leveraging the mutual reinforcement of both tasks. Their model, which uses both structured and embedded textual features of patents, significantly outperforms single-task learning models with regard to accuracy, recall, precision, and F1 measure across both balanced and imbalanced datasets.
However, a critical challenge remains in these machine learning approaches: their performance heavily depends on hyperparameter tuning, which is often performed manually or through conventional search strategies such as grid search, both of which are time-consuming and prone to suboptimal solutions. Existing intelligent optimization algorithms used for this purpose, such as the Genetic Algorithm (GA) and Particle Swarm Optimization (PSO), still face issues like slow convergence and getting trapped in local optima when applied to real-world patent datasets.
In patent evaluation, patent lifetime has also been used as a proxy for the commercial potential of a patent. Choi et al. [19] suggest a method to assess the commercial potential of individual patents by applying a feed-forward neural network model to forecast the probability that a patent will remain valid until its maximum expiry date, and ultimately develop a patent business-potential evaluation system. Kumar et al. [20] developed a hybrid model that combines binomial regression and multi-class classification to accurately predict the renewal life of Indian patents, addressing the challenge of an unusual renewal-life distribution and achieving 90% accuracy.
Optimization algorithms are used to increase efficiency in machine learning by comparing multiple solutions until an optimal or satisfactory one is found that produces a higher accuracy score than the previous one [21]. Machine learning algorithms often require manual hyperparameter selection to achieve optimal performance. This brings difficulties, including added complexity and the requirement for expert knowledge to tune parameters effectively [22,23]. Intelligent optimization algorithms such as the Genetic Algorithm and Particle Swarm Optimization can be used to find optimal hyperparameters, improving the efficiency and performance of neural networks without exhaustive manual tuning [24,25,26].
The Fire Hawk Optimizer (FHO) is a novel intelligence algorithm proposed in recent years, inspired by the process in nature where Fire Hawks capture prey by setting fires [27]. FHO demonstrates superior performance and exceptional results in addressing structural engineering design problems [28,29]. This paper combines the FHO algorithm with Random Forest and carries out a high-value patent recognition experiment based on patent data from the Patyee database. The primary goal of this study is to propose an improved optimization algorithm and validate its effectiveness through the representative real-world application of high-value patent recognition. Nevertheless, the canonical FHO algorithm suffers from poor convergence speed and vulnerability to local optima, limiting its effectiveness for complex hyperparameter tuning in machine learning tasks, and there is a lack of research focusing on enhancing FHO to overcome these shortcomings in the context of patent data analysis. To address FHO’s poor convergence speed and vulnerability to local optima [30], an enhanced Fire Hawk optimization algorithm called EFHO is proposed. EFHO improves the canonical FHO by integrating four strategies, namely adaptive tent chaotic mapping, hunting prey, inertia weight, and an enhanced flee strategy, to address FHO’s exploitation weaknesses. Extensive benchmark experiments on 23 well-known test functions demonstrate that EFHO achieves faster convergence, higher optimization accuracy, and better stability compared to FHO, PSO, GWO, and WOA. Further tests on large-scale problems (up to 1000 dimensions) confirm EFHO’s robustness and dimension insensitivity. While other intelligent optimization algorithms could potentially be applied for hyperparameter tuning, many face challenges like slow convergence or getting trapped in local optima, which EFHO effectively addresses. Applying EFHO to optimize Random Forest hyperparameters for high-value patent recognition on a real-world patent dataset, the EFHO-RF model outperforms other classifiers in recognition accuracy and classification stability, demonstrating EFHO’s practical effectiveness and broad applicability in this domain. The remainder of this paper is structured in the following way. Section 2 introduces the original Fire Hawk Optimization Algorithm. The enhanced Fire Hawk Optimization Algorithm is presented in Section 3. Simulation experiments are described in Section 4. Section 5 explains the application of identifying high-value patents with Random Forest. Finally, the conclusion is presented in Section 6.

2. Fire Hawk Optimization Algorithm

FHO is a novel intelligence optimization algorithm proposed in recent years. The Fire Hawks in the algorithm refer to birds such as the brown falcon, black kite, and whistling kite, which are called Fire Hawks because they find and capture prey by spreading fire while hunting in the wild. In the FHO algorithm, the candidate solutions in the search space are mapped to three categories: the main fire, the prey, and the Fire Hawks. The concrete steps of the optimization algorithm are as follows:
(1)
Compute the fitness value of all solution candidates. The best solution candidate in the global search space is taken as the main fire, the next n best solution candidates are considered the Fire Hawks, and the remaining solutions are referred to as prey.
(2)
Calculate the distance between each prey and each Fire Hawk, and assign each prey to the nearest Fire Hawk, thus establishing a territory for each Fire Hawk. The distance is determined by the following equation:
$D_{ij} = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}, \quad i = 1, 2, \ldots, n; \; j = 1, 2, \ldots, m$ (1)
where $D_{ij}$ represents the distance between the $i$th Fire Hawk and the $j$th prey, $m$ represents the number of prey, $n$ represents the number of Fire Hawks, $(x_1, y_1)$ represents the coordinate of the Fire Hawk, and $(x_2, y_2)$ represents the coordinate of the prey.
(3)
From the main fire, the Fire Hawks gather burning sticks and set fires in their specific territories to force the prey to flee hastily, then move to a new location. Meanwhile, some Fire Hawks may also pick up burning sticks from other Fire Hawks’ territories. The location update is as follows:
$FH_i^{new} = FH_i + (r_1 \times MF - r_2 \times FH_{Near}), \quad i = 1, 2, \ldots, n$ (2)
where $FH_i^{new}$ represents the updated location vector of the $i$th Fire Hawk ($FH_i$), $MF$ represents the globally optimal location in the search space, regarded as the main fire, $FH_{Near}$ represents another Fire Hawk in the search space, and $r_1$ and $r_2$ are uniformly distributed random numbers between 0 and 1 that determine how the Fire Hawks migrate toward the main fire or other occupied territories.
(4)
Once a Fire Hawk sets fire, the prey in its territory start to run away; they may flee, hide, or mistakenly run toward the Fire Hawk. During the location update process, these actions can be expressed by the following equation:
$PR_q^{new} = PR_q + (r_3 \times FH_l - r_4 \times SL_l), \quad l = 1, 2, \ldots, n; \; q = 1, 2, \ldots, r$ (3)
where $PR_q^{new}$ represents the updated location vector of the $q$th prey ($PR_q$) that the $l$th Fire Hawk ($FH_l$) is encircling, $SL_l$ represents a safe location in the territory of $FH_l$, and $r_3$ and $r_4$ are uniformly distributed random numbers between 0 and 1 that determine how the prey move toward the safe location and the Fire Hawk. $SL_l$ is formulated as follows:
$SL_l = \frac{\sum_{q=1}^{r} PR_q}{r}, \quad q = 1, 2, \ldots, r; \; l = 1, 2, \ldots, n$ (4)
where $PR_q$ is the $q$th prey that $FH_l$ is encircling.
(5)
Additionally, the prey may run out of the current Fire Hawk’s territory, potentially to another Fire Hawk’s territory, or to a safer location. The position update equation takes these actions into consideration.
$PR_q^{new} = PR_q + (r_5 \times FH_{Other} - r_6 \times SL), \quad l = 1, 2, \ldots, n; \; q = 1, 2, \ldots, r$ (5)
In this equation, $PR_q^{new}$ represents the updated location vector of the $q$th prey ($PR_q$) that the $l$th Fire Hawk ($FH_l$) is encircling, $FH_{Other}$ represents another Fire Hawk in the search space, $SL$ represents a safe location beyond the territory of $FH_l$, and $r_5$ and $r_6$ are uniformly distributed random numbers between 0 and 1 that determine how the prey approach the safe location beyond the territory or the other Fire Hawks. The equation of $SL$ is as follows:
$SL = \frac{\sum_{j=1}^{m} PR_j}{m}$ (6)
where $PR_j$ is the $j$th prey in the search space.
(6)
Return to step (1) and loop until the termination condition is met; the global best solution is then returned and the algorithm ends. A minimal code sketch of these update rules is given below.
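To make the update rules above concrete, the following is a minimal Python sketch of a single FHO iteration, under stated assumptions: names such as fho_step and n_hawks are illustrative rather than from the original (MATLAB) implementation, the random other-hawk choice may pick the hawk itself, and the selection step that re-ranks and truncates the enlarged population back to its original size between iterations is omitted for brevity.

```python
# Minimal sketch of one FHO iteration for a generic minimization problem.
import numpy as np

def fho_step(pop, fitness_fn, n_hawks, lb, ub, rng):
    """Rank the population, then apply Eqs. (2)-(6) to hawks and prey."""
    fit = np.array([fitness_fn(x) for x in pop])
    pop = pop[np.argsort(fit)]                    # best candidates first
    main_fire = pop[0].copy()                     # best candidate = main fire
    hawks, prey = pop[:n_hawks], pop[n_hawks:]

    # Territories: each prey belongs to its nearest hawk (Eq. (1)).
    dist = np.linalg.norm(prey[:, None, :] - hawks[None, :, :], axis=2)
    owner = dist.argmin(axis=1)

    new_pop = []
    for l, hawk in enumerate(hawks):
        near = hawks[rng.integers(len(hawks))]    # another hawk's position
        r1, r2 = rng.random(2)
        new_pop.append(hawk + r1 * main_fire - r2 * near)          # Eq. (2)
        territory = prey[owner == l]
        if len(territory) == 0:
            continue
        sl_local = territory.mean(axis=0)         # safe location inside, Eq. (4)
        sl_global = prey.mean(axis=0)             # safe location outside, Eq. (6)
        for pr in territory:
            r3, r4, r5, r6 = rng.random(4)
            new_pop.append(pr + r3 * hawk - r4 * sl_local)         # Eq. (3)
            new_pop.append(pr + r5 * near - r6 * sl_global)        # Eq. (5)
    return np.clip(np.array(new_pop), lb, ub)
```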
A growing body of research has applied FHO to diverse real-world and engineering problems, demonstrating its versatility and effectiveness. In construction project management, Shishehgarkhaneh et al. [28] utilized FHO within a Building Information Modeling framework to balance multiple resource trade-offs such as time, cost, quality, risk, and environmental impact. Their results highlighted FHO’s capability to yield competitive and exceptional solutions in multi-objective scheduling problems. Similarly, Hosseinzadeh et al. [29] applied FHO to enhance security and energy efficiency in wireless sensor networks through a trust-based routing protocol, achieving superior performance in network reliability and energy consumption.
Algorithmic improvements of FHO itself have also been explored. Ashraf et al. [30] introduced novel swarm initialization techniques leveraging quasi-random sequences to boost convergence rates and diversity, significantly outperforming the standard FHO. Baweja et al. [31] proposed the Levy Flight-based Fire Hawk Optimizer, enhancing exploration and reducing premature convergence, with experimental validations confirming improved optimization outcomes on standard benchmarks.
In the domain of human-computer interaction and activity recognition, Alonazi et al. [32] integrated FHO with deep learning for hyperparameter tuning, resulting in enhanced classification accuracy and robustness for human activity detection. In renewable energy modeling, Said et al. [33] employed a modified version of FHO to precisely extract photovoltaic parameters in solar cell models, outperforming other metaheuristic algorithms and demonstrating superior prediction accuracy. In energy system modeling, Khajuria et al. [34] applied a modified FHO to identify unknown parameters in solid oxide fuel cell models, achieving highly accurate parameter estimation across varying temperatures and pressures.

3. Enhanced Fire Hawk Optimization Algorithm

Although recent FHO variants have introduced improvements such as quasi-random sequence initialization [30], Levy flight strategies [31], and domain-specific hybridization [32,33], these methods generally focus on enhancing either the exploration phase or the exploitation phase in isolation. They often fail to simultaneously maintain population diversity and balance global–local search throughout the optimization process, particularly in high-dimensional or complex search spaces.
The EFHO improves the canonical FHO by integrating four strategies, namely hunting prey, adaptive tent chaos mapping, inertia weight, and an enhanced flee strategy, to address FHO’s exploitation weaknesses. These four strategies are designed to work in a complementary manner, jointly strengthening both exploration and exploitation while preserving population diversity. This holistic enhancement aims to overcome premature convergence and improve robustness across a wider range of problem types.

3.1. Adaptive Tent Chaos Mapping Strategy

In the original FHO, the initial population is randomly distributed throughout the entire search space, which introduces high randomness and uneven coverage, leading to a lack of population diversity. Introducing an adaptive tent chaotic mapping strategy provides a more uniform distribution and is beneficial for obtaining high-quality initial populations. The following equation gives the adaptive tent chaos mapping, assuming that the sequence’s starting value is a random number between 0 and 1 [35]:
$s_{i+1} = \begin{cases} 1 - rand(0,1), & s_i = 0 \\ 2 s_i, & 0 < s_i < 0.5 \\ 2(1 - s_i), & 0.5 \le s_i < 1 \end{cases}$ (7)
In this equation, $s_{i+1}$ represents the $(i+1)$th mapping value and $s_i$ represents the $i$th mapping value. The modified initialization equation is:
$X_j^i(initial) = X_j^{min} + s_i \times (X_j^{max} - X_j^{min}), \quad i = 1, 2, \ldots, N; \; j = 1, 2, \ldots, d$ (8)
In this equation, $X_j^i(initial)$ represents the $j$th dimension value of the $i$th individual in the initial population, $X_j^{min}$ represents the minimum value on the $j$th dimension, and $X_j^{max}$ represents the maximum value on the $j$th dimension.
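As a hedged sketch of Equations (7) and (8), the snippet below generates a chaotic initial population; the population size, dimension, and bounds in the usage line are illustrative assumptions.

```python
# Tent-chaotic initialization of an N x dim population (Eqs. (7)-(8)).
import numpy as np

def tent_chaotic_init(N, dim, lb, ub, rng):
    s = rng.random()                      # random starting value in (0, 1)
    pop = np.empty((N, dim))
    for i in range(N):
        for j in range(dim):
            if s == 0.0:                  # escape the fixed point (first branch of Eq. (7))
                s = 1.0 - rng.random()
            elif s < 0.5:
                s = 2.0 * s
            else:
                s = 2.0 * (1.0 - s)
            pop[i, j] = lb[j] + s * (ub[j] - lb[j])   # Eq. (8)
    return pop

# Illustrative usage: 30 individuals in 10 dimensions on [-100, 100]^10.
pop0 = tent_chaotic_init(30, 10, np.full(10, -100.0), np.full(10, 100.0),
                         np.random.default_rng(0))
```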

3.2. Hunting Prey

The original FHO algorithm lacks a hunting phase for the Fire Hawks. To broaden the scope of the search and further strengthen local exploitation, a hunting-prey phase is added. After the prey flee, the fitness value is calculated based on the position after fleeing, and the prey with the best fitness value is selected for hunting. The hunting method follows the spiral attack strategy of the whale optimization algorithm [36], with the following equation:
$FH_l^{new} = D_l \times e^{bk} \times \cos(2\pi k) + PR_l^{best}$ (9)
where $PR_l^{best}$ represents the prey with the best position after fleeing, $D_l$ represents the distance between the prey ($PR_l^{best}$) and the Fire Hawk ($FH_l$), the parameter $b$ represents the shape parameter of the logarithmic spiral (set to 1 in this paper), and the parameter $k$ represents a random number between −1 and 1.
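The following is an illustrative sketch of the spiral update of Equation (9); taking the distance term element-wise, as in WOA’s spiral move, is an assumption here, since the paper describes $D_l$ only as a length.

```python
# Spiral hunting update (Eq. (9)), modeled on WOA's spiral attack.
import numpy as np

def spiral_hunt(fh, pr_best, rng, b=1.0):
    d = np.abs(pr_best - fh)              # element-wise distance hawk <-> best prey
    k = rng.uniform(-1.0, 1.0)            # random spiral coefficient in [-1, 1]
    return d * np.exp(b * k) * np.cos(2.0 * np.pi * k) + pr_best
```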

3.3. Adding the Inertia Weight

This paper adds an inertia weight to the Fire Hawks’ fire-setting update of Equation (2). The changed equation is as follows:
$FH_l^{new} = \omega \times FH_l + (r_1 \times GB - r_2 \times FH_{Near})$ (10)
where $GB$ denotes the globally best location in the search space (the main fire).
The inertia weight plays an important role in enhancing the search accuracy and accelerating the convergence speed of solutions [37,38]. In the early stage, larger inertia weight facilitates a more robust global search capability, whereas in the later stage, smaller inertia weight has stronger exploitation capability [39]. Consequently, it is imperative to adjust the inertia weight in a non-linear decreasing fashion. The following is the equation of inertia weight:
$\omega = \frac{1 + \sin\left(e^{t/T} + a\right)}{b}$ (11)
where T represents the total iteration number, t represents the current iteration number, and a and b are constants (i.e., a = 0.5 and b = 20).
To visually illustrate the features of the inertia weight function suggested in this paper, Figure 1 shows the function curve of the inertia weight, with the iteration number on the x-axis up to a maximum of 50; the inertia weight gradually decreases from about 0.1 toward 0 as the iteration number increases.
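Below is a quick numerical check of Equation (11) with $a = 0.5$ and $b = 20$; note that reconstructing the formula as $(1 + \sin(e^{t/T} + a))/b$ is an assumption based on the surrounding description.

```python
# Evaluate the nonlinear inertia-weight schedule of Eq. (11).
import numpy as np

T, a, b = 50, 0.5, 20.0
t = np.arange(1, T + 1)
omega = (1.0 + np.sin(np.exp(t / T) + a)) / b
print(round(omega[0], 4), round(omega[-1], 4))   # ~0.1 at the start, decaying toward 0
```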

3.4. Enhanced Flee Strategy

There are two flee strategies in the original algorithm, and all prey implement both flee strategies in sequence. In order to increase population diversity, in EFHO some prey implement the first flee strategy while the others implement the second. Equations (3) and (5) are combined as follows:
$PR_q^{new} = \begin{cases} PR_q + (r_3 \times FH_l - r_4 \times SL_l), & \tau \le 0.5 \\ PR_q + (r_5 \times FH_{Other} - r_6 \times SL), & \tau > 0.5 \end{cases}$ (12)
where $\tau$ is a uniformly distributed random number between 0 and 1.
In addition, this paper introduces the Levy flight [40,41] for the first flee strategy and the t-distribution perturbation [42,43] strategy for the second flee strategy. The equation of the Levy flight for the first flee strategy is as follows:
$PR_{flee1}^{L} = PR_{flee1}^{best} \times levy(D)$ (13)
where $PR_{flee1}^{best}$ is the best solution candidate among the prey that implemented the first flee strategy, $levy(D)$ represents the Levy flight function, and $D$ is the dimension. After the Levy flight, a greedy selection is made between the old and new positions, choosing the one with the better fitness value for the next iteration. The following is the greedy selection equation:
$PR_{flee1}^{best} = \begin{cases} PR_{flee1}^{best}, & f(PR_{flee1}^{best}) < f(PR_{flee1}^{L}) \\ PR_{flee1}^{L}, & f(PR_{flee1}^{best}) \ge f(PR_{flee1}^{L}) \end{cases}$ (14)
Levy flight, which combines random movement over long and short distances, is a technique that imitates the random flight of animals. During the initial phase of the algorithm, long jumps expand the search range, which is beneficial for enhancing population diversity and reducing the risk of local optima. In the later stage of the algorithm, when the region of the global optimal solution is basically determined, short jumps improve the accuracy of the solutions, enabling the algorithm to converge to the global best.
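A common way to realize $levy(D)$ in Equation (13) is Mantegna’s algorithm; the sketch below assumes this choice and a stability index of $\beta = 1.5$, neither of which is specified in the paper.

```python
# Levy-flight step generator via Mantegna's algorithm.
import math
import numpy as np

def levy(dim, beta=1.5, rng=None):
    rng = rng or np.random.default_rng()
    num = math.gamma(1 + beta) * math.sin(math.pi * beta / 2)
    den = math.gamma((1 + beta) / 2) * beta * 2 ** ((beta - 1) / 2)
    sigma = (num / den) ** (1 / beta)     # scale of the numerator Gaussian
    u = rng.normal(0.0, sigma, dim)
    v = rng.normal(0.0, 1.0, dim)
    return u / np.abs(v) ** (1 / beta)    # heavy-tailed step in each dimension
```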
The equation of t-distribution perturbation for the second flee strategy is as follows:
$PR_{flee2}^{T} = PR_{flee2}^{best} \times (1 + t\_distribution(t))$ (15)
where $PR_{flee2}^{best}$ is the best solution candidate among the prey that implemented the second flee strategy. The t-distribution is symmetric about the y-axis, and its two limiting distributions are the standard Cauchy and Gaussian distributions; the function usually contains only one parameter, called the degrees of freedom. In this paper, the degrees of freedom equal the iteration number $t$, and the following is the equation of the probability density function:
$f(x) = \frac{\Gamma\left(\frac{t+1}{2}\right)}{\sqrt{\pi t}\, \Gamma\left(\frac{t}{2}\right)} \left(1 + \frac{x^2}{t}\right)^{-\frac{t+1}{2}}, \quad -\infty < x < +\infty$ (16)
In the early stage, when the iteration number is small, the t-distribution approaches the Cauchy distribution, which gives strong global perturbation ability. In the middle of the iterations, the t-distribution falls between the Gaussian and Cauchy distributions, balancing the convergence and population variety of the algorithm. In the later stage, as the iteration number grows large, the t-distribution approaches the Gaussian distribution, which facilitates local exploitation. After the perturbation, the greedy selection model is also used:
$PR_{flee2}^{best} = \begin{cases} PR_{flee2}^{best}, & f(PR_{flee2}^{best}) < f(PR_{flee2}^{T}) \\ PR_{flee2}^{T}, & f(PR_{flee2}^{best}) \ge f(PR_{flee2}^{T}) \end{cases}$ (17)
When the prey flee, the algorithm performs local exploitation around the Fire Hawk. As the iteration number increases, most of the prey may cluster around the Fire Hawk, causing the algorithm to stagnate and fail to approach the global optimal solution. During the flee stage, applying the t-distribution perturbation strategy to the current location prevents prey from clustering around the Fire Hawk and reduces the likelihood of algorithmic stagnation, further improving the convergence speed and the capacity to escape local optima.
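A minimal sketch of Equations (15) and (17) follows, assuming NumPy’s standard_t sampler is an acceptable stand-in for the t-distribution term and that fitness_fn is a minimization objective supplied by the caller:

```python
# t-distribution perturbation of the best fled prey with greedy selection.
import numpy as np

def t_perturb(pr_best, fitness_fn, t, rng):
    """Perturb pr_best (Eq. (15)) and keep the better position (Eq. (17))."""
    candidate = pr_best * (1.0 + rng.standard_t(df=t, size=pr_best.shape))
    return candidate if fitness_fn(candidate) < fitness_fn(pr_best) else pr_best
```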
The following is a description of the EFHO algorithm steps:
Step 1: Initialization.
The dimension Dim, the maximum iteration number T, and the population size pop are set. The population is initialized by Equation (8), and the current iteration number t is set to 1.
Step 2: Evaluation.
Compute the fitness value of all individuals; the individual with the best fitness value is taken as the main fire, the next n best individuals are selected as the Fire Hawks, and the remaining individuals become the prey.
Step 3: Determining Territory.
For each Fire Hawk, compute the distance to each prey by Equation (1), and determine the territory to which each prey belongs.
Step 4: Setting Fire.
The Fire Hawks set fire and adjust the position according to Equation (10).
Step 5: Prey flees.
Some prey flee using the first strategy of Equation (12), and the fitness of each prey is calculated; the prey with the best fitness value after fleeing then undergoes the Levy flight according to Equations (13) and (14). The remaining prey flee using the second strategy, and the fitness of each prey is calculated; the prey with the best fitness value after fleeing then undergoes the t-distribution perturbation according to Equations (15) and (17).
Step 6: Hunting prey.
The Fire Hawks hunt prey and adjust their position according to Equation (9).
Step 7: Termination.
When the termination condition is reached, the optimal individual position and fitness value are outputted, and the algorithm terminates; if not, proceed to Step 2.

4. Simulation Experiment

Two experiments are conducted in this section. The first experiment tested the capability of the EFHO algorithm by comparing it with several closely related algorithms, and the second experiment tested the scalability of the enhanced algorithm for large-scale problems.

4.1. Experiment Setup

To evaluate the proposed algorithm’s convergence rate and optimization precision, 23 well-known test functions from CEC2005 [44] are used for comparison experiments. Among these functions, F1 to F7 are unimodal and commonly used as benchmark functions for evaluating search accuracy; optimization algorithms often struggle to converge precisely to the global optimum on these functions. F8 to F13 are multi-modal functions with multiple local extreme points, used to assess global search performance. F14 to F23 are fixed-dimension multi-modal benchmark functions. To obtain unbiased results, the maximum iteration number was set to 500 and the population size to 30 for all functions in all comparisons. For each function, all algorithms were independently run 30 times.
All experiments were carried out on a personal computer with Intel(R) Core(TM) i9-12900H CPU, 32.00 GB RAM and Windows 11 operating system, and MATLAB R2023b was used to implement all algorithms.

4.2. Performance Comparison

In this paper, FHO, PSO (Particle Swarm Optimization), GWO (Grey Wolf Optimization) and WOA (Whale Optimization Algorithm) are selected for comparison with EFHO to evaluate the performance of EFHO. Since some of the 23 functions do not have an optimal value of 0, this paper calculates fitness by taking the absolute value of the discrepancy between the computed result and the actual optimal value of the function. This approach converts all optimal values to 0. This paper uses the best value (Best), mean value (Mean), and standard deviation (Std) to evaluate the performance of all algorithms. Table 1 displays the test results; the best result for each function is boldfaced.
According to Table 1, for F1, F2, F3, F4, F9 and F11, EFHO can achieve the ideal value in theory. For F5, F6, F7, F8 and F13, the outcomes of EFHO outperform the other four algorithms. For F10, the best values obtained by the EFHO algorithm are the same as those of the FHO algorithm, superior to the other three algorithms. The above results demonstrate the remarkable accuracy and stability of EFHO.
To assess the statistical significance of the performance differences between EFHO and the baseline algorithms, the Wilcoxon signed-rank test was conducted for each of the 23 benchmark functions. Table 2 summarizes the number and proportion of functions where EFHO achieved a significantly better result (p < 0.05) compared to each baseline. The detailed test results are provided in Appendix A.
As shown in Table 2, EFHO achieved statistically significant improvements (p < 0.05) over PSO, GWO, and WOA in more than half of the benchmark functions, and outperformed FHO in 39.13% of the cases. While the proportion of significant wins varies across comparisons, these results still indicate that EFHO consistently delivers competitive or superior performance against the baseline algorithms across a wide range of test functions.
Figure 2 displays the five algorithms’ convergence curves for each test function, which better illustrate the benefit of EFHO. Among the five algorithms, EFHO converges comparatively quickly, and its convergence curves are smooth and decline rapidly for the majority of functions.
In conclusion, the EFHO algorithm demonstrated superior convergence, stability, and accuracy compared to the original FHO and the other four algorithms, which adequately validates the feasibility of the suggested method for FHO improvement in this paper.

4.3. EFHO’s Scalability Test for Large-Scale Problems

This paper further examines whether EFHO is scalable for solving large-scale problems, as real-world engineering applications often involve large-scale optimization challenges. Because the dimensions of F14 to F23 are fixed, the paper selects test functions F1 to F13 and sets the problem dimensionality to 500 and 1000, respectively. Table 3 displays the high-dimensional experiment results.
Table 3 shows that even for functions with 500 and 1000 dimensions, EFHO still achieves good accuracy. In particular, for F1, F2, F3, F4, F9 and F11, EFHO can achieve the theoretical ideal value of 0. For the other functions, the results at 500 and 1000 dimensions are consistent with each other, suggesting that EFHO is not dimension-sensitive during problem-solving. The above analysis shows that EFHO performs well when handling large-scale problems, since it maintains high accuracy and is not significantly impacted by a substantial increase in dimension.

5. High-Value Patents Recognition with Random Forest

Leo Breiman and Adele Cutler first presented the Random Forest ensemble learning method in 2001 [45]. It addresses classification and regression problems by constructing multiple decision trees and then improving prediction accuracy, generalization ability, and resistance to overfitting through averaging (for regression problems) or majority voting (for classification problems) [46]. Random Forest’s basic idea is to construct many decision trees, where each tree is independent and the features within each tree are randomly selected, thereby reducing the model’s variance [47]. To get the final prediction result, Random Forest averages or votes on each tree’s prediction results. Random Forest works well for a variety of regression and classification problems, especially for complex, high-dimensional datasets and problems that require handling a large number of features. Due to its excellent generalization ability and resistance to overfitting, Random Forest performs well in many practical applications [48].
The Random Forest model has a number of hyperparameters that can be tuned by the user to optimize its performance, such as the number of trees, the minimum number of samples required in a node, and the number of features to be considered for each split. These hyperparameters, also called tuning parameters, control the construction and training process of the model. Because optimal tuning parameter values depend on the dataset being used, they must be carefully chosen [49]. Many researchers employ intelligent optimization algorithms to tune the hyperparameters of Random Forests, seeking the best hyperparameters for the current dataset [50,51,52]. This paper employs the EFHO algorithm to optimize the hyperparameters of the Random Forest and conducts application research on high-value patent identification based on it.

5.1. Dataset

The patent dataset used in this paper is collected from the Patyee database. Patyee is a comprehensive database providing extensive information on patents, including over 180 million in-depth processed patent records from more than 171 countries, regions, and organizations worldwide. This paper retrieves Chinese invention patents that have received awards, such as the China Patent Award, have been published for over ten years, and are still valid, using them as positive samples for the high-value patent dataset, totaling 10,638 patents. The choice of awarded patents as high-value patents is based on the award attribute provided by the Patyee database, which compiles official records from the China National Intellectual Property Administration and other authoritative sources, and is supported by prior research [5,18]. Given that such awards are granted through rigorous expert review, the accuracy of this labeling is expected to be very high.
Additionally, to construct the negative sample set, this study retrieves Chinese invention patents with a patent value of 100, published for over ten years, and still valid, totaling 18,030 patents. After removing the awarded patents, the remaining 17,734 patents are used as negative samples. The choice of value 100 is mainly driven by two considerations. First, the number of non-awarded patents is extremely large, reaching several million, making it necessary to narrow the search with specific criteria. Selecting those with a value of 100 results in a set size close to that of the positive samples, which facilitates a relatively balanced dataset. Second, the value score is an internal evaluation index calculated by the Patyee database using a proprietary method that is not publicly disclosed. A score of 100 is simply one of the discrete values in this system and is used here as a filtering attribute, not as a classification threshold. While such database-calculated values do not necessarily indicate truly high-value patents, awarded patents are determined through expert review and can be considered genuinely high-value. In this study, patents with a value of 100 but without awards are treated as negative samples, not to imply that they have low value, but to establish a clear and consistent labeling criterion in which only awarded patents are considered positive. This approach minimizes ambiguity in the definition of high-value patents and avoids dependence on database-specific scoring methods, ensuring consistency in the training labels. Additionally, this paper selects both positive and negative samples from patents that have been published for over ten years, in order to exclude the influence of the publication time.
Numerous studies on patent value indexes have been conducted, taking into account various scales and embracing a wide range of perspectives [53]. In [18], researchers used the title and abstract of patents as textual features, which were converted into vectors using the BERT model and then concatenated with other structural features to serve as the model’s input for training. The final accuracy was only a little over sixty percent, which is lower than the accuracy obtained by using only structural features for training [1]. This indicates that textual features contribute little to the identification of high-value patents and might even act as noise. Therefore, this paper does not use textual features but selects the most commonly used patent value indicators as the features of the dataset. Table 4 lists the 14 structural features present in the dataset.

5.2. Data Preprocessing

An essential stage in machine learning is data preprocessing. It involves cleaning, converting, and standardizing data to enhance data quality, eliminate noise and outliers, and guarantee consistency and completeness.
Missing data can lead to biased results, reduce the statistical power of analyses, and affect the overall performance of machine learning models. There are two approaches to handling missing data: deletion or imputation. There are many methods for imputation, the most common being replacement of missing values with the mean or median. This paper first performed a missing-data check on the dataset and found that every row and column contains data, so no missing-value treatment was necessary.
The dataset used in this paper was split 80:20, with 80% used as the training set and the remaining 20% as the testing set. Prior to the split, the dataset was randomly shuffled to ensure a balanced distribution of the data. Before training, this paper also normalized the data, transforming each feature to a value between 0 and 1. The equation of normalization is as follows:
$v_{norm} = \frac{v - v_{min}}{v_{max} - v_{min}}$ (18)
where $v$ represents the raw value, $v_{min}$ represents the minimum value, $v_{max}$ represents the maximum value, and $v_{norm}$ represents the transformed value after normalization, which will be within the range of 0 to 1.
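A compact sketch of this preprocessing pipeline is shown below; fitting the min-max statistics on the training split only is an added precaution not stated in the paper, and the random seed is illustrative.

```python
# Shuffle, 80:20 split, and min-max normalization (Eq. (18)).
import numpy as np

def preprocess(X, y, rng=None):
    rng = rng or np.random.default_rng(42)
    idx = rng.permutation(len(X))                 # random shuffle before splitting
    cut = int(0.8 * len(X))
    tr, te = idx[:cut], idx[cut:]
    lo, hi = X[tr].min(axis=0), X[tr].max(axis=0)
    scale = np.where(hi > lo, hi - lo, 1.0)       # guard against constant columns
    norm = lambda A: (A - lo) / scale             # v_norm = (v - v_min) / (v_max - v_min)
    return norm(X[tr]), y[tr], norm(X[te]), y[te]
```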

5.3. Experiments and Results

1. Experiment Setup
This paper optimized the Random Forest using the EFHO algorithm and conducted experimental verification based on the patent dataset. Naive Bayes (NB), Back Propagation Neural Network (BP), Support Vector Machine (SVM), and Logistic Regression (LR) were also selected for comparison. All experiments were carried out in MATLAB, with each model implemented using built-in MATLAB functions: the NB model uses fitcnb, the LR model fitclinear, the SVM model fitcsvm, the RF model TreeBagger, and the BP neural network feedforwardnet, with the hidden layer’s node number set to 8.
In this experiment, the objective function is the mean square error of the classification result, and the formula for the mean square error is as follows:
$M = \frac{1}{n} \sum_{i=1}^{n} (x_i - \hat{x}_i)^2$ (19)
where $n$ represents the amount of data in the training or testing set, $x_i$ represents the true value, and $\hat{x}_i$ represents the predicted value of the model. The algorithm terminates when either the maximum iteration number is exceeded or the objective function falls below the specified threshold.
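To illustrate the tuning loop, the following is a hedged sketch of the objective that EFHO minimizes; scikit-learn’s RandomForestClassifier stands in for MATLAB’s TreeBagger, labels are assumed to be encoded as 0/1, and the decoded parameter ranges are illustrative assumptions rather than the paper’s settings.

```python
# Decode a candidate vector into Random Forest hyperparameters and score it
# by the cross-validated classification MSE of Eq. (19).
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_predict

def rf_objective(theta, X, y):
    """theta in [0,1]^3 -> (number of trees, min leaf size, features per split)."""
    model = RandomForestClassifier(
        n_estimators=int(10 + theta[0] * 490),          # 10..500 trees
        min_samples_leaf=int(1 + theta[1] * 19),        # 1..20 samples per leaf
        max_features=max(1, int(theta[2] * X.shape[1])),
        random_state=0,
    )
    y_hat = cross_val_predict(model, X, y, cv=5)        # five-fold predictions
    return np.mean((y - y_hat) ** 2)                    # MSE objective of Eq. (19)

# EFHO would call rf_objective on each candidate theta and keep the minimizer.
```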
2. Results
This paper separately applies the LR, NB, RF, SVM, and BP models, as well as the RF model optimized using EFHO (EFHO-RF), to high-value patent recognition. To compare the optimization effects, this paper also optimized the BP model using the EFHO algorithm (EFHO-BP). Table 5 displays the results of the experiment. Five-fold cross-validation was used to evaluate each model on indicators including accuracy, recall, precision, F1 measure, and AUC, where AUC denotes the Area Under the Receiver Operating Characteristics (ROC) curve.
According to the results of the experiment, the EFHO-RF model performs best, with all indicators except precision at the highest level, and all indicators superior to the pre-optimization RF model. The accuracy improved from 95.8% before optimization to 96.6%, the recall improved from 91.8% to 96.3%, the precision increased from 96.7% to 97%, the F1 measure increased from 94.2% to 96.3%, and the AUC improved from 98.8% to 99%. The EFHO-BP also performed well, with accuracy, recall, and F1 measure surpassing all models except EFHO-RF. However, its precision and AUC were lower than those of the RF model before optimization, and its precision was even lower than that of the BP model before optimization. The experimental results indicate that EFHO can find better hyperparameters for the Random Forest, providing a desirable solution for the problem. In this classification task, EFHO improved the Random Forest’s use of the 14 patent-level indicators (Table 4) by identifying hyperparameter settings that better balance feature relevance and model complexity. The selected features are widely recognized in patent analytics as being strongly associated with patent value, based on both the literature and our long-term practical experience; a feature importance analysis is presented below.
In order to assess the discriminative ability of the model, this paper also examined the ROC curve. The AUC for the optimized Random Forest was 99.0%, demonstrating a high level of performance. The ROC curve shows a marked increase in true positive rates while keeping low false positive rates, further validating the robustness of our optimization approach. These results highlight the potential of combining metaheuristic algorithms like FHO with machine learning methods to enhance classification tasks, providing a more reliable and efficient model for practical applications. The ROC curves before and after optimization are shown in Figure 3. This suggests that the Random Forest model’s classification performance is excellent, and the EFHO algorithm can help improve the classification performance.
To make the identification results more transparent, this paper further conducted a feature importance analysis using the out-of-bag (OOB) permutation method. The results are shown in Figure 4. As displayed, the claims number (CLMSN), the countries number belongs to the same family (NCSE), and the assignments number (RAQ) ranked as the top three most influential features. This highlights the central role of patent scope, international family coverage, and technology transfer activities in distinguishing high-value patents. By contrast, features such as the reexamination number (NR) and the applicant’s number (PAN) showed limited or negligible importance in classification. These findings indicate that the EFHO-RF model identifies high-value patents by leveraging a meaningful combination of legal, citation, and family-related attributes. Presenting feature importance thus makes the identified results more visible and interpretable, complementing the performance evaluation and providing additional insights into the determinants of patent value.

6. Conclusions

This paper introduces a novel enhanced Fire Hawk optimization algorithm called EFHO. The EFHO improves upon the canonical FHO by incorporating four key strategies: adaptive tent chaotic mapping, hunting prey, the addition of inertial weight, and an enhanced flee strategy. These modifications address the shortcomings of the original FHO, such as weak convergence and limited exploration capabilities. By enhancing these aspects, EFHO provides a more robust and efficient optimization method.
While EFHO is a general intelligent optimization algorithm, this study uses high-value patent recognition primarily as a representative application scenario to validate its effectiveness. The challenges inherent in such real-world applications demand optimization algorithms with faster convergence and stronger global search ability than traditional methods. Existing hyperparameter tuning techniques often suffer from inefficiency and suboptimal solutions in such scenarios.
This paper used the EFHO algorithm to optimize the Random Forest model’s hyperparameters in the chosen application scenario. The resulting EFHO-RF classification model leverages the strengths of both EFHO and Random Forest, leading to superior performance. EFHO effectively tuned key Random Forest hyperparameters such as the number of trees, the minimum number of samples required in a node, and the number of features to be considered for each split, leading to better utilization of informative features and reduced overfitting. The optimization process improved the model’s balance between feature relevance and complexity, resulting in more stable and accurate classification outcomes. Experimental results demonstrate that this model achieves a high level of accuracy on the selected patent dataset, significantly outperforming traditional methods. As awarded patents are determined through expert review, the identified positives inherently represent truly valuable patents, aligning with the ultimate goal of high-value patent identification. Thus, this research not only advances optimization algorithm methodology but also demonstrates its applicability through a representative real-world task, highlighting the method’s relevance and effectiveness in applied settings.
Beyond the empirical results, this study offers broader scientific and practical value. The proposed EFHO-RF framework provides a replicable approach for addressing other machine learning tasks. Its successful validation in the chosen application scenario offers methodological insights for similar applied analytics and decision-support contexts.
However, certain limitations should be acknowledged. The dataset is limited to Chinese invention patents, and the positive samples are defined solely by award status, which, although highly accurate, may not fully capture all possible interpretations of patent value. In addition, the EFHO-RF model has yet to be tested on datasets from other jurisdictions or with different feature spaces, which may affect its generalizability. It should also be noted that high-value patent recognition is essentially a complex classification problem. By leveraging EFHO’s strong capability in hyperparameter optimization and Random Forest’s robust classification ability, the EFHO-RF framework provides a suitable and effective solution for this type of task. Future work will include cross-domain validation, incorporation of alternative indicators of patent value, and application of EFHO to diverse and heterogeneous datasets.

Author Contributions

All authors contributed to the study conception and design. X.Y. proposed the overall research framework and supervised the project. Patent data collection and analysis were performed by H.L. The first draft of the manuscript was written by S.W., and all authors commented on previous versions of the manuscript. X.Y. also revised and finalized the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Social Science Foundation of China under Grant 19CTQ007 and in part by the Gansu Provincial Natural Science Foundation under Grant 23JRRA581.

Data Availability Statement

The datasets generated or analyzed during this study are available from the corresponding author on reasonable request.

Acknowledgments

The authors thank the anonymous reviewers for their insightful comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1. Wilcoxon signed-rank test results.
Function | EFHO vs. FHO (p) | EFHO vs. PSO (p) | EFHO vs. GWO (p) | EFHO vs. WOA (p)
F1 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7
F2 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7
F3 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7
F4 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7
F5 | 2.7455 × 10^−3 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7
F6 | 2.4836 × 10^−6 | 9.1269 × 10^−7 | 1.2377 × 10^−6 | 9.1269 × 10^−7
F7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 1.0106 × 10^−6
F8 | 6.4886 × 10^−6 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 2.7389 × 10^−6
F9 | 1.0000 × 10^0 | 9.1269 × 10^−7 | 8.9229 × 10^−7 | 5.0000 × 10^−1
F10 | 1.0000 × 10^0 | 9.1269 × 10^−7 | 8.6624 × 10^−7 | 3.8176 × 10^−6
F11 | 1.0000 × 10^0 | 9.1269 × 10^−7 | 9.7656 × 10^−4 | 5.0000 × 10^−1
F12 | 1.0000 × 10^0 | 1.0621 × 10^−2 | 7.1267 × 10^−6 | 9.9493 × 10^−1
F13 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7 | 9.1269 × 10^−7
F14 | 1.0000 × 10^0 | 1.0000 × 10^0 | 9.9550 × 10^−1 | 1.0000 × 10^0
F15 | 8.2267 × 10^−1 | 8.7948 × 10^−1 | 6.5960 × 10^−1 | 9.8112 × 10^−1
F16 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0
F17 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0
F18 | 9.9996 × 10^−1 | 1.0000 × 10^0 | 9.9999 × 10^−1 | 1.0000 × 10^0
F19 | 9.9837 × 10^−1 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0
F20 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0 | 1.0000 × 10^0
F21 | 9.4338 × 10^−1 | 3.0623 × 10^−2 | 8.1727 × 10^−1 | 7.2097 × 10^−2
F23 | 9.5215 × 10^−1 | 8.1727 × 10^−1 | 1.0000 × 10^0 | 1.0621 × 10^−2

References

  1. Hu, Z.; Zhou, X.; Lin, A. Evaluation and identification of potential high-value patents in the field of integrated circuits using a multidimensional patent indicators pre-screening strategy and machine learning approaches. J. Informetr. 2023, 17, 101406.
  2. He, C.; Shi, F.; Tan, R. Evaluation and cultivation method of high-tech value patents for mechanical products. PLoS ONE 2024, 19, e0298144.
  3. Artz, K.W.; Norman, P.M.; Hatfield, D.E.; Cardinal, L.B. A longitudinal study of the impact of R&D, patents, and product innovation on firm performance. J. Prod. Innov. Manag. 2010, 27, 725–740.
  4. Deng, N.; Zhang, J. HMFM: A method for identifying high-value patents by fusing multiple features. Comput. Mater. Contin. 2024, 82, 1–10.
  5. Wang, S.; Zhou, H.; Zhao, T. Configuration paths to high-value patents: Evidence from patents winning the China Patent Awards. Scientometrics 2024, 129, 2633–2658.
  6. Rizzo, U.; Sterzi, V. Original but of low value: The paradox of science-industry collaborative patents. Rev. d’Economie Ind. 2023, 183, 113–142.
  7. Liu, L.-J.; Cao, C.; Song, M. China’s agricultural patents: How has their value changed amid recent patent boom? Technol. Forecast. Soc. Change 2014, 88, 106–121.
  8. Huang, K.G.-L.; Huang, C.; Shen, H.; Mao, H. Assessing the value of China’s patented inventions. Technol. Forecast. Soc. Change 2021, 170, 120868.
  9. Zhou, G.; Tong, Y.; Wang, H. Analysis of key features of high-value patent based on LASSO-logit. Financ. Eng. Risk Manag. 2023, 6, 87–95.
  10. Miric, M.; Jia, N.; Huang, K.G. Using supervised machine learning for large-scale classification in management research: The case for identifying artificial intelligence patents. Strateg. Manag. J. 2023, 44, 491–519.
  11. Zhou, Y.; Dong, F.; Liu, Y.; Ran, L. A deep learning framework to early identify emerging technologies in large-scale outlier patents: An empirical study of CNC machine tool. Scientometrics 2021, 126, 969–994.
  12. Hu, Y.; Yang, S.; Shi, A. Research on transferable patent recognition based on machine learning. In Proceedings of the 2021 4th International Conference on Data Science and Information Technology, Shanghai, China, 23–25 July 2021; pp. 273–278.
  13. Hido, S.; Suzuki, S.; Nishiyama, R.; Imamichi, T.; Takahashi, R.; Nasukawa, T.; Idé, T.; Kanehira, Y.; Yohda, R.; Ueno, T.; et al. Modeling patent quality: A system for large-scale patentability analysis using text mining. Inf. Media Technol. 2012, 7, 1180–1191.
  14. Wu, J.-L.; Chang, P.-C.; Tsao, C.-C.; Fan, C.-Y. A patent quality analysis and classification system using self-organizing maps with support vector machine. Appl. Soft Comput. 2016, 41, 305–316.
  15. Lee, C.; Kwon, O.; Kim, M.; Kwon, D. Early identification of emerging technologies: A machine learning approach using multiple patent indicators. Technol. Forecast. Soc. Change 2018, 127, 291–303.
  16. Trappey, A.J.; Trappey, C.V.; Govindarajan, U.H.; Sun, J.J. Patent value analysis using deep learning models—The case of IoT technology mining for the manufacturing industry. IEEE Trans. Eng. Manag. 2019, 68, 1334–1346.
  17. Kwon, U.; Geum, Y. Identification of promising inventions considering the quality of knowledge accumulation: A machine learning approach. Scientometrics 2020, 125, 1877–1897.
  18. Liu, W.; Li, S.; Cao, Y.; Wang, Y. Multi-task learning based high-value patent and standard-essential patent identification model. Inf. Process. Manag. 2023, 60, 103327.
  19. Choi, J.; Jeong, B.; Yoon, J.; Coh, B.-Y.; Lee, J.-M. A novel approach to evaluating the business potential of intellectual properties: A machine learning-based predictive analysis of patent lifetime. Comput. Ind. Eng. 2020, 145, 106544.
  20. Kumar, A.; Ranjan, P.; Koley, A.; Danish, S. A new hybrid machine learning model for predicting the renewal life of patents. PLoS ONE 2024, 19, e0306186.
  21. Mohapatra, N.; Shreya, K.; Chinmay, A. Optimization of the random forest algorithm. In Advances in Data Science and Management: Proceedings of ICDSM 2019, Hunan, China, 22–23 February 2019; Springer: Berlin/Heidelberg, Germany, 2020; pp. 201–208.
  22. Yang, L.; Shami, A. On hyperparameter optimization of machine learning algorithms: Theory and practice. Neurocomputing 2020, 415, 295–316.
  23. Luo, G. A review of automatic selection methods for machine learning algorithms and hyper-parameter values. Netw. Model. Anal. Health Inform. Bioinform. 2016, 5, 18.
  24. Abdolrasol, M.G.; Hussain, S.S.; Ustun, T.S.; Sarker, M.R.; Hannan, M.A.; Mohamed, R.; Ali, J.A.; Mekhilef, S.; Milad, A. Artificial neural networks based optimization techniques: A review. Electronics 2021, 10, 2689.
  25. Sarmah, D.K. A survey on the latest development of machine learning in genetic algorithm and particle swarm optimization. In Optimization in Machine Learning and Applications; Springer: Berlin/Heidelberg, Germany, 2020; pp. 91–112.
  26. Abd Elaziz, M.; Dahou, A.; Abualigah, L.; Yu, L.; Alshinwan, M.; Khasawneh, A.M.; Lu, S. Advanced metaheuristic optimization techniques in applications of deep neural networks: A review. Neural Comput. Appl. 2021, 33, 14079–14099.
  27. Azizi, M.; Talatahari, S.; Gandomi, A.H. Fire Hawk Optimizer: A novel metaheuristic algorithm. Artif. Intell. Rev. 2023, 56, 287–363.
  28. Shishehgarkhaneh, M.B.; Azizi, M.; Basiri, M.; Moehler, R.C. BIM-based resource tradeoff in project scheduling using Fire Hawk Optimizer (FHO). Buildings 2022, 12, 1472.
  29. Hosseinzadeh, M.; Yoo, J.; Ali, S.; Lansky, J.; Mildeova, S.; Yousefpoor, M.S.; Ahmed, O.H.; Rahmani, A.M.; Tightiz, L. A cluster-based trusted routing method using Fire Hawk Optimizer (FHO) in wireless sensor networks (WSNs). Sci. Rep. 2023, 13, 13046.
  30. Ashraf, A.; Anwaar, A.; Haider Bangyal, W.; Shakir, R.; Ur Rehman, N.; Qingjie, Z. An improved fire hawks optimizer for function optimization. In Advances in Swarm Intelligence, Proceedings of the 14th International Conference on Swarm Intelligence 2023, Shenzhen, China, 14–18 July 2023; Springer: Berlin/Heidelberg, Germany, 2023; pp. 68–79.
  31. Baweja, D.; Jain, A.; Bhandari, A.; Bohat, V.K. Levy flight based fire hawk optimizer. In Proceedings of the 2024 IEEE Region 10 Symposium (TENSYMP), New Delhi, India, 27–29 September 2024; pp. 1–6.
  32. Alonazi, M.; Alnfiai, M.M. Fire Hawk Optimizer with deep learning enabled human activity recognition. Comput. Syst. Sci. Eng. 2023, 45, 3135–3150.
  33. Said, M.; Ismaeel, A.A.; El-Rifaie, A.M.; Hashim, F.A.; Bouaouda, A.; Hassan, A.Y.; Abdelaziz, A.Y.; Houssein, E.H. Evaluation of modified fire hawk optimizer for new modification in double diode solar cell model. Sci. Rep. 2024, 14, 30079.
  34. Khajuria, R.; Bukya, M.; Lamba, R.; Kumar, R. Optimal parameter identification of solid oxide fuel cell using modified fire hawk algorithm. Sci. Rep. 2024, 14, 22469.
  35. Ma, J.; Hao, Z.; Sun, W. Enhancing sparrow search algorithm via multi-strategies for continuous optimization problems. Inf. Process. Manag. 2022, 59, 102854.
  36. Mirjalili, S.; Lewis, A. The whale optimization algorithm. Adv. Eng. Softw. 2016, 95, 51–67.
  37. Chauhan, P.; Deep, K.; Pant, M. Novel inertia weight strategies for particle swarm optimization. Memetic Comput. 2013, 5, 229–251.
  38. Taherkhani, M.; Safabakhsh, R. A novel stability-based adaptive inertia weight for particle swarm optimization. Appl. Soft Comput. 2016, 38, 281–295.
  39. Gu, Y.; Lu, H.; Xiang, L.; Shen, W. Adaptive simplified chicken swarm optimization based on inverted S-shaped inertia weight. Chin. J. Electron. 2022, 31, 367–386.
  40. Li, J.; An, Q.; Lei, H.; Deng, Q.; Wang, G.-G. Survey of Lévy flight-based metaheuristics for optimization. Mathematics 2022, 10, 2785.
  41. Luo, W.; Wu, H.; Peng, J. Improvement of electric fish optimization algorithm for standstill label combined with Levy flight strategy. Biomimetics 2024, 9, 677.
  42. Xiong, J.; Liang, W.; Liang, X.; Yao, J. Intelligent quantification of natural gas pipeline defects using improved sparrow search algorithm and deep extreme learning machine. Chem. Eng. Res. Des. 2022, 183, 567–579.
  43. Osorio, F.; Galea, M.; Henríquez, C.; Arellano-Valle, R. Addressing non-normality in multivariate analysis using the t-distribution. AStA Adv. Stat. Anal. 2023, 107, 785–813.
  44. Suganthan, P.N.; Hansen, N.; Liang, J.J.; Deb, K.; Chen, Y.-P.; Auger, A.; Tiwari, S. Problem definitions and evaluation criteria for the CEC 2005 special session on real-parameter optimization. KanGAL Rep. 2005, 2005005.
  45. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32.
  46. Pang, H.; Lin, A.; Holford, M.; Enerson, B.E.; Lu, B.; Lawton, M.P.; Floyd, E.; Zhao, H. Pathway analysis using random forests classification and regression. Bioinformatics 2006, 22, 2028–2036. [Google Scholar] [CrossRef]
  47. Salman, H.A.; Kalakech, A.; Steiti, A. Random forest algorithm overview. Babylon. J. Mach. Learn. 2024, 2024, 69–79. [Google Scholar] [CrossRef]
  48. Naghibi, S.A.; Ahmadi, K.; Daneshi, A. Application of support vector machine, random forest, and genetic algorithm optimized random forest models in groundwater potential mapping. Water Resour. Manag. 2017, 31, 2761–2775. [Google Scholar] [CrossRef]
  49. Probst, P.; Wright, M.N.; Boulesteix, A.-L. Hyperparameters and tuning strategies for random forest. Wiley Interdiscip. Rev. Data Min. Knowl. Discov. 2019, 9, 1301. [Google Scholar] [CrossRef]
  50. Zumbado-Corrales, M.; Esquivel-Rodríguez, J. Evoseg: Automated electron microscopy segmentation through random forests and evolutionary optimization. Biomimetics 2021, 6, 37. [Google Scholar] [CrossRef] [PubMed]
  51. Zhou, J.; Huang, S.; Qiu, Y. Optimization of random forest through the use of MVO, GWO and MFO in evaluating the stability of underground entry-type excavations. Tunn. Undergr. Space Technol. 2022, 124, 104494. [Google Scholar] [CrossRef]
  52. Wang, M.; Zhao, G.; Wang, S. Hybrid random forest models optimized by sparrow search algorithm (SSA) and harris hawk optimization algorithm (HHO) for slope stability prediction. Transp. Geotech. 2024, 48, 101305. [Google Scholar] [CrossRef]
  53. Grimaldi, M.; Cricelli, L. Indexes of patent value: A systematic literature review and classification. Knowl. Manag. Res. Pract. 2020, 18, 214–233. [Google Scholar] [CrossRef]
Figure 1. The function curve for inertia weight.
Figure 2. The convergence diagram of the five algorithms.
Figure 3. The ROC curves before and after optimization by EFHO; the dashed line represents the random classifier baseline (AUC = 0.5).
Figure 4. Feature importance of patent indicators in the EFHO-RF model.
Table 1. The performance comparison results.

| Function | Metric | PSO | FHO | GWO | WOA | EFHO |
|---|---|---|---|---|---|---|
| F1 | Best | 5.86 × 10^−1 | 1.4875 × 10^−83 | 6.5908 × 10^−29 | 1.0343 × 10^−84 | 0 |
| | Mean | 2.17 × 10^0 | 1.3582 × 10^−69 | 1.1309 × 10^−27 | 7.9071 × 10^−73 | 0 |
| | Std | 8.76 × 10^−1 | 7.0382 × 10^−69 | 1.5143 × 10^−27 | 4.1528 × 10^−72 | 0 |
| F2 | Best | 2.69 × 10^0 | 3.0352 × 10^−21 | 1.5109 × 10^−17 | 5.31 × 10^−58 | 0 |
| | Mean | 4.52 × 10^0 | 2.4074 × 10^−19 | 8.9556 × 10^−17 | 1.0498 × 10^−50 | 0 |
| | Std | 1.24 × 10^0 | 3.3718 × 10^−19 | 5.7898 × 10^−17 | 4.4941 × 10^−50 | 0 |
| F3 | Best | 6.92 × 10^1 | 5.4193 × 10^−82 | 9.7792 × 10^−9 | 1.59 × 10^4 | 0 |
| | Mean | 1.83 × 10^2 | 1.6619 × 10^−69 | 4.6808 × 10^−6 | 4.38 × 10^4 | 0 |
| | Std | 5.94 × 10^1 | 5.1581 × 10^−69 | 7.6958 × 10^−5 | 1.40 × 10^4 | 0 |
| F4 | Best | 1.48 × 10^0 | 3.2607 × 10^−35 | 1.2102 × 10^−7 | 1.78 × 10^0 | 0 |
| | Mean | 2.06 × 10^0 | 1.0124 × 10^−30 | 7.4167 × 10^−7 | 5.33 × 10^1 | 0 |
| | Std | 2.71 × 10^−1 | 3.5189 × 10^−30 | 6.7046 × 10^−7 | 2.79 × 10^1 | 0 |
| F5 | Best | 2.79 × 10^2 | 3.52 × 10^−2 | 2.61 × 10^1 | 2.71 × 10^1 | 2.11 × 10^−3 |
| | Mean | 1.10 × 10^3 | 2.80 × 10^−1 | 2.71 × 10^1 | 2.80 × 10^1 | 9.58 × 10^−2 |
| | Std | 7.26 × 10^2 | 2.09 × 10^−1 | 7.51 × 10^−1 | 4.92 × 10^−1 | 9.21 × 10^−2 |
| F6 | Best | 9.05 × 10^−1 | 7.85 × 10^−3 | 2.50 × 10^−1 | 6.70 × 10^−2 | 5.21 × 10^−3 |
| | Mean | 2.26 × 10^0 | 8.94 × 10^−1 | 8.35 × 10^−1 | 3.75 × 10^−1 | 1.48 × 10^−2 |
| | Std | 8.27 × 10^−1 | 1.66 × 10^0 | 3.20 × 10^−1 | 2.09 × 10^−1 | 8.00 × 10^−3 |
| F7 | Best | 2.83 × 10^0 | 2.30 × 10^−4 | 2.43 × 10^−4 | 6.7084 × 10^−5 | 5.5551 × 10^−7 |
| | Mean | 1.67 × 10^1 | 9.06 × 10^−4 | 1.70 × 10^−3 | 2.14 × 10^−3 | 3.6111 × 10^−5 |
| | Std | 1.37 × 10^1 | 4.29 × 10^−4 | 9.03 × 10^−4 | 1.99 × 10^−3 | 3.6888 × 10^−5 |
| F8 | Best | 4.31 × 10^3 | 3.61 × 10^0 | 4.89 × 10^3 | 6.76 × 10^−1 | 1.34 × 10^−2 |
| | Mean | 6.40 × 10^3 | 3.93 × 10^2 | 6.54 × 10^3 | 2.32 × 10^3 | 2.61 × 10^1 |
| | Std | 1.46 × 10^3 | 7.02 × 10^2 | 6.88 × 10^2 | 1.73 × 10^3 | 3.40 × 10^1 |
| F9 | Best | 9.46 × 10^1 | 0 | 5.6843 × 10^−14 | 0 | 0 |
| | Mean | 1.60 × 10^2 | 0 | 1.93 × 10^0 | 0 | 0 |
| | Std | 3.98 × 10^1 | 0 | 3.31 × 10^0 | 0 | 0 |
| F10 | Best | 2.06 × 10^0 | 4.4409 × 10^−16 | 7.1498 × 10^−14 | 4.4409 × 10^−16 | 4.4409 × 10^−16 |
| | Mean | 2.78 × 10^0 | 4.4409 × 10^−16 | 9.6959 × 10^−14 | 3.4047 × 10^−15 | 4.4409 × 10^−16 |
| | Std | 3.75 × 10^−1 | 0 × 10^0 | 1.4564 × 10^−14 | 2.2625 × 10^−15 | 0 |
| F11 | Best | 3.07 × 10^−2 | 0 | 0 | 0 | 0 |
| | Mean | 1.48 × 10^−1 | 0 | 5.12 × 10^−3 | 0 | 0 |
| | Std | 4.71 × 10^−2 | 0 | 8.16 × 10^−3 | 0 | 0 |
| F12 | Best | 5.78 × 10^−3 | 4.95 × 10^−4 | 6.43 × 10^−3 | 5.25 × 10^−3 | 3.53 × 10^−3 |
| | Mean | 3.31 × 10^−2 | 1.47 × 10^−3 | 4.08 × 10^−2 | 2.31 × 10^−2 | 2.35 × 10^−2 |
| | Std | 2.57 × 10^−2 | 6.48 × 10^−4 | 2.23 × 10^−2 | 1.13 × 10^−2 | 6.51 × 10^−3 |
| F13 | Best | 2.26 × 10^−1 | 3.96 × 10^−3 | 4.08 × 10^−1 | 1.19 × 10^−1 | 7.9238 × 10^−9 |
| | Mean | 5.11 × 10^−1 | 1.08 × 10^−2 | 6.87 × 10^−1 | 5.22 × 10^−1 | 5.9291 × 10^−7 |
| | Std | 2.13 × 10^−1 | 4.94 × 10^−3 | 2.38 × 10^−1 | 2.44 × 10^−1 | 6.9933 × 10^−7 |
| F14 | Best | 0 × 10^0 | 1.7555 × 10^−8 | 1.8985 × 10^−11 | 1.0154 × 10^−11 | 1.98 × 10^0 |
| | Mean | 2.00 × 10^0 | 5.71 × 10^−1 | 3.46 × 10^0 | 1.83 × 10^0 | 6.12 × 10^0 |
| | Std | 2.70 × 10^0 | 6.20 × 10^−1 | 4.35 × 10^0 | 2.96 × 10^0 | 1.31 × 10^0 |
| F15 | Best | 1.34 × 10^−4 | 1.3838 × 10^−5 | 6.009 × 10^−10 | 1.667 × 10^−6 | 1.06 × 10^−4 |
| | Mean | 5.91 × 10^−4 | 9.51 × 10^−4 | 2.71 × 10^−3 | 3.87 × 10^−4 | 9.25 × 10^−4 |
| | Std | 1.48 × 10^−4 | 1.94 × 10^−3 | 6.80 × 10^−3 | 3.76 × 10^−4 | 6.42 × 10^−4 |
| F16 | Best | 2.2204 × 10^−15 | 2.1008 × 10^−7 | 3.8243 × 10^−10 | 6.9722 × 10^−14 | 7.1983 × 10^−6 |
| | Mean | 2.4351 × 10^−15 | 9.9372 × 10^−6 | 2.3649 × 10^−8 | 1.1901 × 10^−9 | 3.53 × 10^−3 |
| | Std | 3.9858 × 10^−17 | 1.0288 × 10^−5 | 2.6884 × 10^−8 | 4.2983 × 10^−9 | 3.65 × 10^−3 |
| F17 | Best | 1.6653 × 10^−16 | 2.4405 × 10^−5 | 7.4074 × 10^−8 | 1.7071 × 10^−11 | 4.7274 × 10^−5 |
| | Mean | 1.6653 × 10^−16 | 3.77 × 10^−4 | 2.1562 × 10^−6 | 5.8586 × 10^−6 | 6.94 × 10^−3 |
| | Std | 0 × 10^0 | 3.50 × 10^−4 | 2.0217 × 10^−6 | 1.1906 × 10^−5 | 7.36 × 10^−3 |
| F18 | Best | 2.2204 × 10^−15 | 1.2711 × 10^−5 | 1.4177 × 10^−7 | 7.6656 × 10^−8 | 3.0105 × 10^−6 |
| | Mean | 1.0214 × 10^−14 | 1.23 × 10^−3 | 3.0221 × 10^−5 | 5.6182 × 10^−5 | 1.03 × 10^−1 |
| | Std | 4.704 × 10^−15 | 1.08 × 10^−3 | 4.3207 × 10^−5 | 1.01 × 10^−4 | 3.54 × 10^−1 |
| F19 | Best | 3.9968 × 10^−15 | 2.34 × 10^−4 | 1.9841 × 10^−6 | 3.8805 × 10^−7 | 6.17 × 10^−3 |
| | Mean | 5.1514 × 10^−15 | 1.59 × 10^−2 | 1.09 × 10^−3 | 5.90 × 10^−3 | 1.23 × 10^−1 |
| | Std | 1.907 × 10^−15 | 5.06 × 10^−2 | 2.22 × 10^−3 | 7.07 × 10^−3 | 1.32 × 10^−1 |
| F20 | Best | 7.0895 × 10^−10 | 1.96 × 10^−2 | 1.582 × 10^−6 | 5.8291 × 10^−5 | 3.13 × 10^−1 |
| | Mean | 3.96 × 10^−2 | 1.25 × 10^−1 | 7.13 × 10^−2 | 8.22 × 10^−2 | 7.69 × 10^−1 |
| | Std | 5.60 × 10^−2 | 1.11 × 10^−1 | 8.25 × 10^−2 | 9.90 × 10^−2 | 3.33 × 10^−1 |
| F21 | Best | 1.7764 × 10^−15 | 1.07 × 10^−1 | 4.06 × 10^−4 | 4.41 × 10^−4 | 1.77 × 10^−4 |
| | Mean | 3.34 × 10^0 | 3.78 × 10^−1 | 1.10 × 10^0 | 2.30 × 10^0 | 8.31 × 10^−1 |
| | Std | 3.44 × 10^0 | 1.93 × 10^−1 | 2.22 × 10^0 | 2.86 × 10^0 | 7.97 × 10^−1 |
| F22 | Best | 0 × 10^0 | 7.39 × 10^−2 | 4.29 × 10^−4 | 3.47 × 10^−4 | 1.98 × 10^−2 |
| | Mean | 7.87 × 10^−1 | 4.85 × 10^−1 | 1.73 × 10^−3 | 2.09 × 10^0 | 8.65 × 10^−1 |
| | Std | 2.03 × 10^0 | 2.23 × 10^−1 | 7.32 × 10^−4 | 2.74 × 10^0 | 7.89 × 10^−1 |
| F23 | Best | 8.3489 × 10^−14 | 1.13 × 10^−1 | 9.8691 × 10^−5 | 1.20 × 10^−3 | 1.08 × 10^−2 |
| | Mean | 1.24 × 10^0 | 5.42 × 10^−1 | 1.82 × 10^−1 | 2.81 × 10^0 | 1.12 × 10^0 |
| | Std | 2.51 × 10^0 | 2.95 × 10^−1 | 9.70 × 10^−1 | 3.25 × 10^0 | 9.19 × 10^−1 |
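Table 1 reports the best, mean, and standard deviation of the final objective values over repeated independent runs of each algorithm. A minimal sketch of how such statistics are typically collected is given below; the 30-run count, the sphere function standing in for F1, and the `random_search` stub (a placeholder for any of the five optimizers) are illustrative assumptions, not the authors' experimental code.

```python
import numpy as np

def sphere(x):
    # Classical benchmark F1: f(x) = sum_i x_i^2, global minimum 0 at the origin.
    return float(np.sum(x ** 2))

def random_search(f, dim=30, bounds=(-100.0, 100.0), evals=5000, seed=0):
    # Placeholder optimizer; PSO/FHO/GWO/WOA/EFHO would be substituted here.
    rng = np.random.default_rng(seed)
    best = np.inf
    for _ in range(evals):
        x = rng.uniform(bounds[0], bounds[1], dim)
        best = min(best, f(x))
    return best

# 30 independent runs per algorithm/function pair, as is customary.
finals = np.array([random_search(sphere, seed=run) for run in range(30)])
print(f"Best {finals.min():.4e}  Mean {finals.mean():.4e}  Std {finals.std():.4e}")
```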
Table 2. Summary of Wilcoxon signed-rank test results.

| Comparison | Functions with p < 0.05 (n/23) | Proportion (%) |
|---|---|---|
| EFHO vs. FHO | 9/23 | 39.13 |
| EFHO vs. PSO | 14/23 | 60.87 |
| EFHO vs. GWO | 13/23 | 56.52 |
| EFHO vs. WOA | 12/23 | 52.17 |
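Table 2 counts, for each pairwise comparison, how many of the 23 benchmark functions show a statistically significant difference at the 5% level. A hedged sketch of the per-function test using SciPy's paired Wilcoxon signed-rank test follows; the run-by-run result arrays are randomly generated stand-ins for the actual experimental data.

```python
import numpy as np
from scipy.stats import wilcoxon

rng = np.random.default_rng(0)
# Stand-in data: final objective values of two optimizers over
# 23 functions x 30 runs (real values would come from the experiments).
efho_runs = rng.lognormal(mean=-2.0, sigma=1.0, size=(23, 30))
fho_runs = rng.lognormal(mean=-1.5, sigma=1.0, size=(23, 30))

significant = 0
for i in range(23):
    # Two-sided paired test on the per-run results of the two algorithms.
    _, p = wilcoxon(efho_runs[i], fho_runs[i])
    if p < 0.05:
        significant += 1

print(f"EFHO vs. FHO: {significant}/23 functions with p < 0.05 "
      f"({100 * significant / 23:.2f}%)")
```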
Table 3. The EFHO test results of high-dimensional problems.

| Function | Mean (500 D) | Std (500 D) | Mean (1000 D) | Std (1000 D) |
|---|---|---|---|---|
| F1 | 0 | 0 | 0 | 0 |
| F2 | 0 | 0 | 0 | 0 |
| F3 | 0 | 0 | 0 | 0 |
| F4 | 0 | 0 | 0 | 0 |
| F5 | 7.76 × 10^0 | 4.09 × 10^1 | 9.48 × 10^−2 | 1.28 × 10^−1 |
| F6 | 6.61 × 10^−3 | 6.66 × 10^−3 | 1.72 × 10^−2 | 9.96 × 10^−3 |
| F7 | 3.3296 × 10^−5 | 4.4944 × 10^−5 | 2.9348 × 10^−5 | 2.603 × 10^−5 |
| F8 | 1.65 × 10^1 | 1.18 × 10^1 | 2.55 × 10^1 | 3.47 × 10^1 |
| F9 | 0 | 0 | 0 | 0 |
| F10 | 4.4409 × 10^−16 | 0 | 4.4409 × 10^−16 | 0 |
| F11 | 0 | 0 | 0 | 0 |
| F12 | 1.62 × 10^−2 | 4.78 × 10^−3 | 2.64 × 10^−2 | 5.58 × 10^−3 |
| F13 | 5.5754 × 10^−6 | 1.2471 × 10^−5 | 2.8439 × 10^−7 | 4.072 × 10^−7 |
Table 4. The selected patent value indicators.

| No. | Abbr. | Description |
|---|---|---|
| 1 | BCN | Number of backward citations |
| 2 | FCN | Number of forward citations |
| 3 | PAN | Number of applicants |
| 4 | INN | Number of inventors |
| 5 | ASGN | Number of patent holders |
| 6 | CLMSN | Number of claims |
| 7 | FN | Number of patent family members |
| 8 | NCSE | Number of countries covered by the same patent family |
| 9 | NR | Number of reexaminations |
| 10 | IT | Number of invalidations |
| 11 | LIQ | Licensing frequency |
| 12 | RAQ | Number of assignments |
| 13 | PLQ | Pledging frequency |
| 14 | LTN | Number of lawsuits |
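The fourteen indicators in Table 4 constitute the feature vector supplied to the classifier. A minimal sketch of assembling such a feature matrix with pandas is shown below; the two example patents, their values, and the `high_value` label column are hypothetical.

```python
import pandas as pd

# Column names follow the abbreviations in Table 4.
FEATURES = ["BCN", "FCN", "PAN", "INN", "ASGN", "CLMSN", "FN",
            "NCSE", "NR", "IT", "LIQ", "RAQ", "PLQ", "LTN"]

# Two invented example patents, one row each.
df = pd.DataFrame([
    {"BCN": 12, "FCN": 35, "PAN": 1, "INN": 4, "ASGN": 1, "CLMSN": 18,
     "FN": 9, "NCSE": 6, "NR": 0, "IT": 1, "LIQ": 2, "RAQ": 1, "PLQ": 0,
     "LTN": 0, "high_value": 1},
    {"BCN": 3, "FCN": 0, "PAN": 1, "INN": 1, "ASGN": 1, "CLMSN": 6,
     "FN": 1, "NCSE": 1, "NR": 0, "IT": 0, "LIQ": 0, "RAQ": 0, "PLQ": 0,
     "LTN": 0, "high_value": 0},
])
X, y = df[FEATURES], df["high_value"]  # feature matrix and binary label
```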
Table 5. Test performance comparison.

| Algorithm | Accuracy | Recall | Precision | F1 Measure | AUC |
|---|---|---|---|---|---|
| NB | 0.918 | 0.910 | 0.874 | 0.891 | 0.950 |
| LR | 0.950 | 0.898 | 0.953 | 0.925 | 0.966 |
| RF | 0.958 | 0.918 | 0.967 | 0.942 | 0.988 |
| SVM | 0.943 | 0.880 | 0.964 | 0.920 | 0.977 |
| BP | 0.947 | 0.883 | 0.971 | 0.925 | 0.966 |
| EFHO-BP | 0.959 | 0.950 | 0.963 | 0.960 | 0.985 |
| EFHO-RF | 0.966 | 0.958 | 0.970 | 0.963 | 0.990 |
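The columns of Table 5 are the standard binary-classification metrics. A hedged sketch of computing them with scikit-learn for a Random Forest follows; the synthetic dataset, the train/test split, and the hyperparameter values (placeholders for whatever the optimizer's search would return) are all illustrative assumptions.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import (accuracy_score, f1_score, precision_score,
                             recall_score, roc_auc_score)
from sklearn.model_selection import train_test_split

# Synthetic stand-in for the patent data: 14 indicator features, skewed labels.
X, y = make_classification(n_samples=2000, n_features=14,
                           weights=[0.9, 0.1], random_state=0)
X_tr, X_te, y_tr, y_te = train_test_split(X, y, test_size=0.3,
                                          stratify=y, random_state=0)

# Hyperparameters here are placeholders for the optimizer-selected values.
rf = RandomForestClassifier(n_estimators=300, max_depth=12,
                            min_samples_leaf=2, random_state=0)
rf.fit(X_tr, y_tr)

pred = rf.predict(X_te)
proba = rf.predict_proba(X_te)[:, 1]
print(f"Accuracy  {accuracy_score(y_te, pred):.3f}")
print(f"Recall    {recall_score(y_te, pred):.3f}")
print(f"Precision {precision_score(y_te, pred):.3f}")
print(f"F1        {f1_score(y_te, pred):.3f}")
print(f"AUC       {roc_auc_score(y_te, proba):.3f}")
```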
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
