Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model

Zhang, Shixuan; Ma, Jie

doi:10.3390/aerospace12100913

Open AccessArticle

Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model

by

Shixuan Zhang

¹

and

Jie Ma

^1,2,*

¹

Control and Simulation Center, Harbin Institute of Technology, Harbin 150090, China

²

National Key Laboratory of Complex System Control and Intelligent Agent Cooperation, Harbin 150090, China

^*

Author to whom correspondence should be addressed.

Aerospace 2025, 12(10), 913; https://doi.org/10.3390/aerospace12100913

Submission received: 1 September 2025 / Revised: 24 September 2025 / Accepted: 7 October 2025 / Published: 10 October 2025

(This article belongs to the Section Aeronautics)

Download

Browse Figures

Versions Notes

Abstract

Experimental optimization with surrogate models has received much attention for its efficiency recently in predicting the responses of the experimental optimum. However, with the development of multi-fidelity experiments with surrogate models such as Kriging, the traditional expected improvement (EI) in efficient global optimization (EGO) has suffered from limitations due to low efficiency. Only high-fidelity samples to be used in optimizing Kriging surrogate models are infilled, misleading the sequential sampling method in low-fidelity data sets. This recent theory based on multi-fidelity sequential infill sampling methods has gained much attention for balancing the selection of high- or low-fidelity data sets, but ignores the efficiency of sampling in experiments. This article proposes an Adaptive Sequential Infill Sampling (ASIS) method based on Bayesian inference for a multi-fidelity Hamilton Kriging model in the use of experimental optimization, aiming to address the efficiency of sequential sampling. The proposed method is demonstrated by two numerical simulations and one practical aero-engineering problem. The results verify the efficiency of the proposed method over other popular EGO methods in surrogate models, and ASIS can be useful for any other reliability engineering problems due to its efficiency.

Keywords:

kriging surrogate model; experimental optimization; sequential sampling method; multi-fidelity model; airfoil design

1. Introduction

Surrogate modeling is being increasingly applied in experimental science, offering valuable opportunities to optimize experiments and enhance the prediction of complex engineering systems [1,2]. In engineering experiments, the objectives of experimental design are divided into two main categories: development and identification. A well-structured experiment should exhibit repeatability, randomness, and controllability [3]. To achieve the objective of these engineering experiments, field tests are often required. These tests typically necessitate a controlled test environment, a significant number of personnel, and intricate monitoring [4]. A commonly used approach is to employ a surrogate model for approximation. Initially, sampling is conducted using space-filling experimental designs, such as uniform design [5] and Latin hypercube design [6]. By constructing a surrogate model, it becomes possible to predict responses within the sample space, significantly reducing experimental costs. Experimental design utilizing surrogate models can effectively predict responses in sparse sample experiments [7], achieve accurate predictions despite noise [8], and conduct sensitivity analysis in models with multiple inputs [9].

The initial surrogate model exclusively utilized the input and output data from the experiment. This approach facilitated the development of a black box model, a type of modeling technique where the internal workings are not visible or easily interpretable. Several methodologies were employed in this modeling process, including the response surface model (RSM) [10], polynomial chaotic expansion model (PCE) [11], Kriging model [12], radial basis function model (RBF) [13], and support vector machine model (SVM) [14]. Among them, the Kriging surrogate model is commonly used in aerospace design [15] because it effectively provides nonlinear fitting and predicts both the variance of the prediction point and process variance. The Kriging surrogate model, developed by South African mining scientist Danie Krige in 1959, was initially utilized in geostatistics for the exploration of mineral resources. In recent years, the increasing demand for aerodynamic design optimization has prompted the integration of gradient information into Kriging surrogate modeling, leading to the emergence of the gradient-enhanced Kriging (GEK) model [16]. The GEK model utilizes a first-order Taylor expansion, which transforms the partial derivatives at sample points into weighted sums of additional sample function values. However, in instances where gradients cannot be computed at specific sample points, the GEK model reverts to the standard Kriging approach. When experimental data is limited, the model’s response and gradient may not achieve the required accuracy for the design. Surrogate modeling, which depends on a single data set at a single level of fidelity, has encountered limitations. To address this issue, multiple experiments at varying fidelity levels are being applied to experimental design using surrogate models.

With the advancement of simulation technology and complex computing, it has become increasingly common to utilize data from multiple sources concurrently for developmental testing. For example, in the aerodynamic analysis of aero-load predictions [17], wind tunnel testing is recognized as a high-fidelity, costly, and accurate method. Conversely, computational fluid dynamics (CFD) analysis employing large eddy simulation with fine meshes is categorized as a lower-fidelity approach, while CFD analysis based on Reynolds-Averaged Navier–Stokes equations using coarse meshes is considered an even lower-fidelity test. In the development of multi-fidelity modeling using Kriging, Kennedy [18] introduced a framework for multi-fidelity Kriging. This framework employs Bayesian approximation, where the Kriging surrogate model built with low-fidelity data serves as the prior for the high-fidelity Kriging model. In this approach, data from both fidelity levels are utilized for surrogate modeling. Forrester [19] enhances the CoKriging surrogate modeling methodology by applying it to wing optimization and introducing a novel variance estimator that accounts for varying degrees of uncertainty. Han [20] introduced the Hierarchical Kriging model, which effectively integrates low-fidelity models to approximate the global trend. This model employs data of varying fidelity layered accordingly and streamlines the cross-covariance calculations associated with CoKriging. Han [21] analyzed the connections between multiple fidelity levels and further combined gradient-enhanced Kriging with generalized hybrid bridge functions in the field of variable-fidelity modeling (VFM), proposing a multi-fidelity GEK and applying it to the construction of aerodynamic coefficients. Zhang [22] focused on calculating the weight coefficient between gradient weights and fidelity levels in multi-fidelity modeling and proposed a multi-fidelity Kriging model based on Hamilton Monte Carlo (MHK), which improves the efficiency of weight coefficient calculation. Enhancements derived from multi-fidelity modeling strategies represent only a portion of the overall approach to surrogate optimization. In instances where the accuracy requirements are inadequate, it is also imperative to supplement the data set with additional samples.

A crucial component of surrogate-based optimization involves the implementation of infilling sample experiments, which are referred to as sequential experiments in the context of experimental optimization and as acquisition functions [23] in global optimization. For the purposes of this discussion, we shall refer to these methodologies collectively as the infill sampling strategy. Strategies for optimization can be categorized into three main objectives: improvement-based strategies, confidence-based strategies, and entropy-based strategies. Improvement-based strategies, such as Probability Improvement (PI) [24] and expected improvement (EI) [25], focus on selecting evaluation points that maximize the potential enhancement in the current surrogate model’s target extreme value. Confidence-based strategies, also referred to as the least confident bound (LCB) method in Kriging modeling, utilize a lower confidence limit to guide the selection of optimal points. Hertz [26] combines the estimated standard deviation and the estimated response of the surrogate model with a weighting factor to identify points that enhance the lower bound. Entropy-based strategies, also known as the maximum entropy criterion, were initially introduced by Currin [27] to measure the information gained from a single experiment, aiming to find the global optimum by reducing uncertainty. When dealing with multi-fidelity sample data, selecting the appropriate sample set becomes crucial. Zhang [28] developed a variable-fidelity expected improvement (VFEI) method that effectively employs the Hierarchical Kriging model. This method selectively samples from various fidelity data sets, optimizing the sampling process for both accuracy and efficiency while minimizing resource use. Hao [29] proposed an adaptive multi-fidelity expected improvement (AMEI) method that takes into account both prediction accuracy and optimization potential in the context of the multi-fidelity gradient-enhanced Kriging model. Achieving an optimal balance between exploitation and exploration is crucial for effective experimental optimization. Dong [30] proposed a multi-point infill criterion named the Multi-surrogate-based Global Optimization using a Score-based Infill Criterion (MGOSIC) to identify cheap points with scores for selection. Zhang [31] proposed a neighborhood-based Kriging optimization method for sequential experiments, addressing the issue of screening additional samples when the neighborhood domain is small. There remains a gap in effectively utilizing the statistical information from additional samples and balancing the exploration weights across different fidelity levels in multi-fidelity Kriging modeling.

This paper proposes an Adaptive Sequential Infill Sampling (ASIS) method based on the balancing issues mentioned. The method is utilized for multi-fidelity Hamiltonian Kriging modeling when the accuracy of the model cannot be refined. The neighborhood-based Kriging has been expanded to multi-fidelity modeling with the benefit of MHK, and a Probabilistic Nearest Neighborhood (PNN) strategy has been employed to balance the exploration between multi-fidelity models.

The sections of this paper are organized as follows: Section 2 introduces the concept of experimental optimization utilizing the multi-fidelity Hamiltonian Kriging surrogate model. This section encompasses initial experiments, surrogate modeling, sequential design, and the establishment of performance criteria. Section 3 presents the formulation of an Adaptive Sequential Infill Sampling strategy for MHK. It includes a succinct review of both ordinary Multi-fidelity Experimental Information (MFEI) and the Probabilistic Nearest Neighborhood method, as well as a discussion of the adaptive ASIS framework for MHK. Section 4 employs two numerical simulations alongside a case study from aerospace engineering to illustrate the efficacy of the proposed strategy. Lastly, Section 5 offers a comprehensive summary of the ASIS strategy discussed in this work and outlines its potential for future applications.

2. Experimental Optimization Based on Multi-Fidelity Hamiltonian Kriging

This article examines optimization problems in the design of developmental experiments, with a focus on identifying the extreme value of a target response through experimentation. To illustrate the importance of multi-fidelity surrogate models in experimental optimization, this section emphasizes two levels of data fidelity. The use of experimental data with varying levels of fidelity can be beneficial for future extensions.

2.1. Framework of Multi-Fidelity Surrogate-Based Experiment Optimization

The conventional process for optimizing experimental design comprises three critical phases: initial design of experiments (DoE), practice experiments and subsequent optimization, and sequential experimentation. This process is designed to assess the influence of various experimental factors on the outcomes by systematically analyzing the experimental variables through controlled trials. Ultimately, the goal is to determine the optimal combination of these factors that will yield the desired results. The framework illustrating this experimental process is presented in Figure 1.

In instances where experimental costs are excessively high, the establishment of experimental environments is challenging, or labor expenditures associated with experimentation are substantial, simulation experiments grounded in computational analysis represent a feasible alternative. These simulation experiments, referred to as multi-fidelity experiments, are conducted across various scales. Optimization processes relying on multi-fidelity experiments can be effectively implemented through multi-fidelity surrogate models. The overall procedure is primarily divided into four essential phases: initial DoE, implementation of multi-fidelity experiments, surrogate-based optimization, and sequential DoE. This experimental framework is depicted in Figure 2. In the inner loop of the surrogate optimization algorithm, several key components play a crucial role in enhancing the efficiency of surrogate-based experimental optimization. These components include the selection and allocation of initial samples, the development of the surrogate model, and the formulation of criteria for optimization. Each of these elements plays a significant role in enhancing the overall performance of the optimization process.

2.2. Initial Sample Experiments

In the context of both classical and surrogate-based experimental design optimization, the initial sampling phase is of paramount importance. Unlike gradient-based optimization, where the initial sample primarily serves as a starting point, it is essential to ensure that this sample also contributes to effective space filling.

Assume that the true model of a system in region

x \in ℝ

is

y = f (x)

(1)

where

x = (x_{1}, x_{2}, \dots, x_{s})

is the factor and

y

is the response. It is commonly recognized that the experimental sample space is represented as a hypercube, designated as

[a_{1}, b_{1}] \times [a_{2}, b_{2}] \times \dots \times [a_{s}, b_{s}]

. For the purpose of consistency and simplicity, this space is typically defined as the unit cube, referred to as

C^{s} = {[0, 1]}^{s} = \underset{s}{\underset{︸}{[0, 1] \times [0, 1] \times \dots \times [0, 1]}}

. Let

Ρ = {x_{1}, x_{2}, \dots x_{n}}

represent the set of

n

design points on

C^{s}

. The objective of the experimental design is to develop a surrogate model through the implementation of design

Ρ

.

\hat{y} = \hat{f} (x)

(2)

To estimate the true model presented in Equation (1), it is customary to derive an estimate of parameter

E (y)

by utilizing the sample mean.

E (y) = \int_{C^{s}} f (x) d x

(3)

\bar{y} (Ρ) = \frac{1}{n} \sum_{i = 1}^{n} y_{i}

(4)

The Monte Carlo method is recognized as one of the most straightforward random sampling techniques. In this methodology,

Ρ

signifies

n

samples taken from a uniform distribution

U (C^{s})

within a defined set

C^{s}

. The estimated variance is referred to as

V a r (f (x)) / n

. In accordance with the central limit theorem, it is possible to compute the 95% confidence interval for the variance, which assesses the relationship between the sample mean and the overall mean.

|\bar{y} (Ρ) - E (y)| \leq 1.96 \sqrt{V a r (f (x)) / n}

(5)

In numerous instances, the variance in estimation caused by random sampling can be excessively large. Latin hypercube sampling (LHS), introduced by McKay [32], is a popular method used to minimize this estimation variance. This technique involves dividing the test area

C^{s}

into

n

layers based on variable

x_{k}

, ensuring that each layer maintains the same marginal probability

1 / n

. After this division, a sample is taken from each layer.

When a prior distribution is available, Latin hypercube sampling (LHS) may be conducted according to the specified form of the prior, as elaborated in Appendix A. Figure 3 illustrates an LHS design that utilizes a two-dimensional normal distribution

N [5, {0.05}^{2}] \times N [8, {0.05}^{2}]

as its prior.

2.3. Multi-Fidelity Hamilton Kriging Model

The primary distinction of the Kriging model, in contrast to other models, is its foundation as a non-parametric regression model. Originating from the field of geo-statistics, Kriging posits that a correlation exists between any two exploration points within a specific area, with this correlation being solely dependent on the distance separating them. In the context of a two-dimensional plane, it is assumed that this correlation is pervasive and follows a random stationary distribution. This stationary distribution is termed the correlation function. The strength of the correlation between two known samples is quantified by the correlation coefficient. When a sufficient number of samples are available, it becomes feasible to establish a regression model based on the correlation function, wherein the parameters of the correlation function can be estimated using the maximum likelihood method. For additional details, please refer to Appendix B.

Considering the m-dimensional problem, a small amount of expensive but accurate data (high-fidelity, HF) is established as

y_{h} : ℝ^{m} \to ℝ

, and a large amount of cheap but low-precision data (low-fidelity, LF) is established as

y_{l} : ℝ^{m} \to ℝ

.

S_{h} = {[x^{(1)}, \dots, x^{(n_{h})}]}^{T} \in ℝ^{n_{h} \times m}

(6)

S_{l} = {[{x_{l}}^{(1)}, \dots, {x_{l}}^{(n_{h})}]}^{T} \in ℝ^{n_{l} \times m}

(7)

Y_{h} = {[{y_{h}}^{(1)}, \dots, {y_{h}}^{(n_{h})}]}^{T} = {[y ({x_{h}}^{(1)}), \dots, y ({x_{h}}^{(n_{h})})]}^{T} \in ℝ^{n_{h}}

(8)

Y_{l} = {[{y_{l}}^{(1)}, \dots, {y_{l}}^{(n_{l})}]}^{T} = {[y ({x_{l}}^{(1)}), \dots, y ({x_{l}}^{(n_{l})})]}^{T} \in ℝ^{n_{l}}

(9)

where

S_{h}

is the HF input set and

y_{h}

is the HF response value set. Similarly,

(S_{l}, y_{l})

is considered the sampled set of LF data.

The multi-fidelity Hamiltonian Kriging (MHK) can be constructed as follows:

\hat{y} = \sum_{i}^{n} ω_{i} (ρ_{i} {\hat{y}}_{i} + (1 - ρ_{i}) \nabla {\hat{y}}_{i})

(10)

where

ρ_{i}

is an adaptive parameter of gradient Kriging and

ω_{i}

is a scale parameter. The value of

\nabla \hat{y} (x)

can be obtained as

\nabla \hat{y} (x) = d f^{T} (x) \hat{β} + d r^{T} (x) Ψ^{- 1} (Y - F \hat{β})

(11)

where the ith column of

d f^{T} (x)

is defined as the column that has an equal value in all rows at

\nabla f_{i} (x), i = 1, \dots, p

, and

d r^{T} (x)

is defined as the column that has an equal value in

ψ^{'} ψ {x - x^{'}}, i = 1, \dots, n

.

The selection of MHK is predicated on the capability of the HMC process to effectively simulate the multi-fidelity Kriging target, thereby facilitating faster evaluations. Furthermore, the HMC method’s proficiency in traversing and remaining within the typical distribution permits sampling from the low-fidelity data set

Y_{l}

, while also enabling adaptive access to the high-fidelity data set

Y

for the acquisition of new samples.

2.4. Sequential Infill Sampling Strategy

Following the development of a surrogate model utilizing existing sample points, it may be employed to predict the response for any sample point within the designated sample space. However, should the surrogate model fail to meet the requisite accuracy standards, it is essential to manually introduce additional sample points and modify the parameters to enhance accuracy. In general, the optimization of the surrogate model’s accuracy primarily emphasizes global optimization strategies.

x^{*} = \underset{x \in X}{\arg \max} (|\hat{f} (x) - f_{\min} (x)| - κ s (x))

(12)

The parameter

κ

serves as a tuning variable that establishes a balance between absolute error and prediction variance. When

κ \to 0

denotes the new sample, it corresponds to the widely recognized strategies of EI or PI, as detailed in Appendix C. In contrast, when

κ \to \infty

signifies the new sample, it aligns with the approach aimed at maximizing the squared predicted error.

2.5. Performance Criteria

Quantifying the error in surrogate models can be classified into two primary methods: (1) methods that require additional data, such as test data sets; (2) methods that rely on existing data. The first method is frequently utilized in machine learning, while the second is more prevalent in statistical science. The root-mean-square error (RMSE) serves as an effective measure of global error within the design domain.

R M S E = \sqrt{\frac{1}{N} \sum_{i = 1}^{N} {(y_{i} - {\hat{y}}_{i})}^{2}}

(13)

where

y_{i}

is the real response of the function and

{\hat{y}}_{i}

is the predicted response of the surrogate model at

x_{i}

. However, the test data set can be costly, so the Cross-Validation (CV) method provides a solution for this circumstance.

R M S E_{C V} = \sqrt{\frac{1}{k} \sum_{i = 1}^{k} {({\hat{y}}_{i} - {\hat{y}}_{i}^{- i})}^{2}}

(14)

where

k

is the number of omitted sample points at the kth iteration of surrogate modeling. As the leave-one-out validation (LOO), the k-fold cross-validation [33] method is an extended version of the CV method, and a Predictive Estimation of Model Fidelity (PEMF) is an even more extended of k-fold CV, which can be found in [34].

3. Adaptive Sequential Infill Sampling Strategy for MHK

3.1. Definition of Multi-Fidelity Infill Sampling Strategy

In order to propose an efficient infill sampling strategy for the multi-fidelity surrogate model, we first formulate a definition of the regular multi-fidelity infill sampling strategy.

For a surrogate model as proposed by Equation (10), the prediction of a multi-fidelity model can be written as

{\hat{y}}_{M F} = ω_{1} {\hat{y}}_{1} + ω_{2} {\hat{y}}_{2} + \dots + ω_{n} {\hat{y}}_{n}

(15)

where

ω_{i}

is a weight parameter that can be solved by the Lagrange multiplier approach from an information economic function

J

from MHK via Cholesky decomposition.

The objective of the infill sampling strategy is to choose an appropriate sample point in the design space to promote the accuracy of the surrogate model sequentially. In the process of MHK modeling, a series prediction

\hat{y} (x, i)

can be solved and we get

\hat{y} (x, i) \sim N [{\hat{y}}_{i} (x), s_{i}^{2} (x, i)]

(16)

The multi-fidelity model prediction can be formulated as

\hat{\bar{Y}} (x) \sim N [{\hat{y}}_{M F} (x), {\bar{s}}^{2}]

(17)

Inspired by the EI strategy, Zheng [28] introduced a VFEI strategy specifically designed for multi-fidelity infill sampling. The choice to incorporate LF samples or HF samples is determined by the sequential order of the pairs

E I_{V F} (x, i)

.

I_{V F} (x, i) = \max (Y_{M F \min} - \hat{y} (x, i), 0)

(18)

E I_{V F} (x, i) = \{\begin{matrix} (Y_{MFmin} - \hat{y} (x)) Φ (\frac{Y_{MFmin} - \hat{y} (x)}{s_{i} (x, i)}) + s (x, l) ϕ (\frac{Y_{MFmin} - \hat{y} (x)}{s_{i} (x, i)}), & i f s (x, i) > 0 \\ 0, & i f s (x, i) = 0 \end{matrix}

(19)

3.2. Probabilistic Nearest Neighborhood

Probabilistic Nearest Neighborhood (PNN) was initially introduced by Holmes as a method for pattern classification within the field of statistical computation. As an advancement to K-Nearest Neighbor (KNN) classification, PNN offers a probabilistic framework that effectively addresses the uncertainty associated with interactions between neighborhoods. This methodology yields a continuous marginal probability prediction distribution within the interval (0,1). The primary advantage of PNN is its ability to provide a predictive distribution for new observations, resulting in a smoother representation of neighborhoods that extend into regions characterized by sparse data. To enhance the effective utilization of the expected improvement information derived from newly added sample points, it is advisable to employ PNN to deliver the predictive distribution for pre-added samples across various fidelity models. This approach will facilitate informed decision making concerning the addition of multi-fidelity samples.

Consider a set of responses of

ℚ = {(y_{1}, X_{1}), (y_{2}, X_{2})}

with

n = n_{1} + n_{2}

observations.

(y_{1}, X_{1}) = {(y_{11}, x_{11}), (y_{12}, x_{12}), \dots, (y_{1 n_{1}}, x_{1 n_{1}})}

(20)

(y_{2}, X_{2}) = {(y_{21}, x_{21}), (y_{22}, x_{22}), \dots, (y_{2 n_{2}}, x_{2 n_{2}})}

(21)

where

X_{1} = (x_{11}, x_{12}, \dots, x_{1 n_{1}})

is the training set and

X_{2} = (x_{21}, x_{22}, \dots, x_{2 n_{2}})

is the test set. Both of them are matrices of

n \times p

,

y_{1}

is an

n_{1} \times 1

vector of known class labels for the training data set, and

y_{2}

is an

n_{2} \times 1

vector of unknown class labels that needs to be recognized.

p (y_{n} | x_{n}, β, k) = \prod_{i = 1}^{n} \frac{\exp {β (1 / k) \sum_{j \overset{k}{~} i} δ_{y_{1 i} y_{1 j}}}}{\sum_{q = 1}^{Q} \exp {β (1 / k) \sum_{j \overset{k}{~} i} δ_{q y_{1 j}}}}

(22)

where

δ_{a b}

is the Dirac function,

β

is an interaction parameter between the neighborhoods of

y_{n i}

and

y_{n j}

, k is the neighborhood size of a KNN classifier of Q classes, and

(1 / k) \sum_{j \overset{k}{~} i} δ_{q y_{1 j}}

calculates the proportion of training points in the k nearest neighborhoods of

x_{1 i}

of class q. And a new point predictive distribution can be calculated as

p (y_{n + 1} | x_{n + 1}, β, k, Y) = \frac{\exp {β (1 / k) \sum_{j \overset{k}{~} n + 1} δ_{y_{n + 1} y_{1 j}}}}{\sum_{q = 1}^{Q} \exp {β (1 / k) \sum_{j \overset{k}{~} n + 1} δ_{q y_{1 j}}}}

(23)

An example illustrating the contrast between PNN, KNN, and Discriminant Adaptive Nearest Neighbor (DANN) is shown in Figure 4 and Figure 5 using a set of sine wave-shaped decision boundaries.

3.3. Adaptive Sequential Infill Sampling Strategy

Inspired by the principles of PNN, this paper presents a novel framework for the multi-fidelity addition criterion utilized in MHK. We employ Bayesian inference to determine the optimal number of samples to be incorporated at each fidelity level. Furthermore, we apply PNN to evaluate the neighborhood of the newly added points. The neighborhood points for each fidelity level are ranked according to their mean values, with the highest-scoring points being designated as infill samples for subsequent trials.

Theorem 1.

Bayes formula

Let $B_{1}, B_{2}, \dots, B_{n}$ represent a partition of the sample space $Ω$ . For any event A, it follows that

$P (B_{i} | A) = \frac{p (B_{i}) p (A | B_{i})}{\sum_{j = 1}^{n} p (B_{j}) p (A | B_{j})}$

(24)

Theorem 2.

Bayesian Prediction

Assume $x \sim p (x | θ)$ for $X = (x_{1}, x_{2}, \dots, x_{n})$ , and $θ \sim p (θ | α)$ ; then, the prediction of the $\hat{x}$ posterior predictive distribution can be calculated as

$p (\hat{x} | X, α) = \int p (\hat{x} | θ) p (θ | X, α) d θ$

(25)

With the prediction of MHK as (10), we can calculate the predictive distribution of any fidelity:

$p (y_{i} | {\hat{y}}_{M F}, x) = \int p (y_{i} | ω_{i}, ρ_{i}) p (ω_{i}, ρ_{i} | {\hat{y}}_{M F}, x) d (ω_{i}, ρ_{i})$

(26)

And with the help of (24),

$p (\hat{y}) = \sum_{i = 1}^{n} p (y_{i}) p (\hat{y} | y_{i})$

(27)

The predictive distribution of any infill sample can be expressed as

$p ({x_{n + 1}}^{i}) = \frac{\exp {β (1 / k) \sum_{j \overset{k}{~} n + 1} δ_{x_{n + 1} x_{1 j}}}}{\sum_{q = 1}^{Q} \exp {β (1 / k) \sum_{j \overset{k}{~} n + 1} δ_{q x_{1 j}}}}$

(28)

By substituting Equations (27) and (28) into Equation (18), it is possible to obtain a similar structural representation:

$I_{A S I S} [{x_{n + 1}}^{i}, i] = \max (p (y_{n + 1}^{i}) - p ({\hat{y}}_{\min}), 0)$

(29)

Note: The posterior probability density function (PDF) of the sequential samples at the ith fidelity level is subtracted from the current minimum multi-fidelity response. This approach serves two primary objectives: it not only considers the posterior response associated with the newly added sample but also ensures that the responses of the “pre-added” samples reflect the mean response within the vicinity, in accordance with the minimum neighborhood parameter estimate of the PNN.

A multi-fidelity infill decision-making framework is established as follows:

E I_{A S I S} [{x_{n + 1}}^{i}, i] = \{\begin{matrix} (y_{i}^{n + 1} - y_{\min}) Φ (\frac{y_{i}^{n + 1} - y_{\min}}{s_{i}}) + s_{i} ϕ (\frac{y_{i}^{n + 1} - y_{\min}}{s_{i}}), \begin{matrix} i f & s_{i} > 0 \end{matrix} \\ 0, \begin{matrix} i f & s_{i} = 0 \end{matrix} \end{matrix}

(30)

It is noted that ASIS contributes to the improvement in the MF model. In addition,

E I_{A S I S} [{x_{n + 1}}^{i}, i]

means that to calculate any potential improvement for the MF model, after calculation of all infill samples for the MF model, a Bayesian predictor

p ({x_{n + 1}}^{i})

for

I_{A S I S} [{x_{n + 1}}^{i}, i]

is added, and then

x^{*} = \underset{x \in X_{i}, i = 1, 2, \dots}{\arg \max E I_{A S I S} ({x_{n + 1}}^{i}, i)}

(31)

Figure 6 presents the optimization structure delineated in this paper, which is primarily categorized into three components: the initial design of experiments (DoE), the multi-fidelity surrogate model, and infill sample optimization. The Initial DoE involves sampling experiments at varying fidelity levels, utilizing Latin hypercube sampling (LHS). The multi-fidelity surrogate model incorporates the MHK model and employs Predictive Estimation of Model Fidelity (PEMF) for root-mean-square error (RMSE) cross-validation, thereby enhancing the accuracy of the model. Additionally, the Adaptive Sequential Infill Sampling (ASIS) strategy proposed in this paper is utilized for effective discrimination among multi-fidelity points.

4. Results and Discussion

In this section, two numerical simulations and one engineering example are used to demonstrate the proposed method.

4.1. Forrestal Function

In this one-dimensional test, Forrestal functions are employed to demonstrate the proposed ASIS strategy. The constants A, B, and C can be adjusted to enhance the accuracy of the low-fidelity function.

\begin{array}{l} y_{h} = {(6 x - 2)}^{2} \sin (12 x - 4) \\ y_{l} = A {(6 x - 2)}^{2} \sin (12 x - 4) + B (x - 0.5) + C \end{array}

(32)

where

x \in [0, 1]

,

A ~ N (0.5, 0.035), B ~ N (10, 0.02), C ~ N (- 5, 0.01)

,

y_{h}

is a high-fidelity model, and

y_{l}

is a low-fidelity model. And there is an optimal solution at

x = 0.7573

with the value of

y_{h} = - 6.0207

.

In this test simulation, we first use 10 low-fi samples at

x_{l} = {0; 0 . 1; 0 . 2; 0 . 3; 0 . 4; 0 . 5; 0 . 6; 0 . 7; 0 . 8; 0 . 9; 1}

and 4 hi-fi samples at

x_{h} = {0; 0 . 4; 0 . 6; 1}

to build the initial surrogate model by MHK, which can be found in Figure 7.

The infill process of the MHK surrogate model using the ASIS method is illustrated in Figure 8, Figure 9, Figure 10 and Figure 11. Each figure consists of two components: the surrogate functions and the ASIS criterion. During each iteration, the ASIS compares the optimal neighborhood improvement achieved with high-fidelity and low-fidelity samples. It then selects the median of the PNN to add a new sample. The accuracy of each model is assessed using the RMSE of the PEMF. If the accuracy does not meet the set requirement (defined here as 0.01), additional samples are incorporated to update the surrogate model. Throughout the process of four sample additions, two high-fidelity samples (

x_{h 1}^{*} = 0.5226

,

x_{h 2}^{*} = 0.7751

) and two low-fidelity samples (

x_{l 1}^{*} = 0.7726

,

x_{l 2}^{*} = 0.7420

) are added.

To further demonstrate the efficacy of the proposed method, the trial iteration limit is established at 10, with a corresponding RMSE convergence value of 0.01. For the purpose of comparison, we employ several methods, including VF-EI, augmented EI, and AMEI, utilizing the surrogate model that was originally proposed with these methods. Additionally, to control for variables, we incorporate the MHK surrogate model utilized in this paper for a comprehensive comparative analysis. The results of contemporary surrogate models employing various infill sampling strategies are delineated in Table 1. The median error predictive distribution of RMSE via PEMF is shown in Figure 12. While the optimal value is denoted as

y_{h} = - 6.0207

, it is observed that both the original EI and the augmented EI methodologies predominantly focus on the incorporation of high-fidelity samples during the sampling process. Conversely, the VF-EI and the AMEI minimize the inclusion of a limited number of high-fidelity samples; however, these approaches still necessitate multiple cycles of sample addition, leading to a significantly high total computational cost. The strategy introduced in this paper, referred to as MHK+ASIS, successfully reduces both the quantity of high-fidelity samples added and the overall number of samples incorporated, while maintaining comparable accuracy.

4.2. Rosebrock Function

A widely recognized 2D optimization function serves as the basis for demonstrating the proposed ASIS. The definitions of the multi-fidelity functions are presented as follows:

f_{h} (x_{1}, x_{2}) = 100 {(x_{2} - {x_{1}}^{2})}^{2} + {(1 - x_{1})}^{2}

(33)

f_{l, τ_{i}} (x) = 100 {(τ_{i} x_{2} - x_{1}^{2})}^{2} - 2 ({(x_{2} - τ_{i})}^{2} + {(x_{1} - τ_{i})}^{2})

(34)

where

x_{1}, x_{2} \in {[- 5, 10]}^{2}

,

f_{h} (x_{1}, x_{2})

is the high-fidelity version, and the low-fidelity versions are parameterized by

τ_{i}

[35]. The cost ratio (CR) for each fidelity by

τ_{i}

is settled as Table 2. The four types fidelity of Kriging surrogate models optimization via GA are shown in Figure 13.

Initially, each surrogate model was formulated based on a single fidelity function. The sampling for each model was conducted in accordance with the CR, utilizing 40, 40,000, 400, and 4000 initial sample points ranging from high to low fidelity to establish the initial Kriging model. The Genetic Algorithm (GA) was employed for optimization, with a specified convergence accuracy of 0.01. Table 3 outlines the number of iterations and the maximum absolute error (MAE) associated with the convergence of each model.

A high-fidelity surrogate model demonstrates the ability to achieve a satisfactory level of accuracy with a minimal number of iterations. In contrast, a low-fidelity surrogate model, constructed from a substantial number of samples, may attain commendable accuracy; however, it remains inferior to the high-fidelity counterpart. Furthermore, the implementation of a surrogate model that leverages a large sample size can effectively decrease the number of convergence iterations required for optimization using GA. To enhance our analysis, we developed a multi-fidelity MHK surrogate model utilizing the available test samples. Employing the sequential optimization of ASIS, we initially conducted an experiment focused on sample augmentation. Following the attainment of a surrogate model with a relatively high level of accuracy (PEMF = 0.01), we proceeded to execute further optimization, yielding the subsequent results:

The surrogate model based on the MHK was enhanced through the optimization framework outlined in the article, allowing for the incorporation of samples with varying levels of fidelity, shown in Figure 14. The red boxes indicate the new high-fidelity samples (i = 1) introduced by the ASIS method, while the blue boxes denote the collection of low-fidelity samples (i = 2, 3, 4) added. Following the addition of a limited number of high-fidelity samples (i = 14) and low-fidelity samples (i = 95), the PEMF of MHK achieved the stipulated requirement of 0.01. Optimization history of HF and MF surrogate model is shown in Figure 15.

An analysis is conducted to compare the performance of modern surrogate models that utilize various infilling sampling strategies. Results of the comparison are detailed in Table 4. The standard Rosebrock function features an extreme point, denoted as

{f_{h}}^{*} (x_{1}, x_{2}) = 0

, located at (1, 1). However, as the low-fidelity CR scale varies, this extreme point also shifts accordingly. We established the maximum number of infilling iterations at 30 and repeated the experiment 10 times for each strategy to ensure the robustness of the findings. The average error for each experiment is expressed as the root-mean-square error (RMSE). The data indicate that the results for the HK+AMEI, MHK+AMEI, and MHK+ASIS strategies exhibit relative stability and accuracy. Significantly, the MHK+ASIS method proposed in this study minimizes the requirement for additional high-fidelity samples while concurrently reducing the mean error, thereby providing an efficient solution for experimental optimization.

4.3. Naca 0012 Airfoil Validation

The initial application within the field of aerospace engineering pertains to the optimization of a 2D NACA 0012 wing, specifically aimed at determining its optimal lift-to-drag ratio. For the purpose of multi-fidelity validation, a mesh grid family has been supplied by the Langley Research Center [36], as illustrated in Figure 16. Both mesh families share the same leading edge, and the analysis of the airfoil is conducted under conditions characterized by a Mach number of 0.15 and a Reynolds number of 6 million, facilitating free transition. For the validation of experimental results, wind tunnel test data from Ladson is utilized as high-fidelity validation data.

First, the simulation is performed on Xflow, 4 angles of attack (AoAs) are sampled from the Family 2 grid for HF data, and 16 AoAs are sampled from the Family 1 grid for LF data. With the initial LHS data sets, an MHK is built first as shown in Figure 17.

The objective of this multi-fidelity surrogate model is to identify the maximum lift ratio of the NACA 0012 airfoil. Historically, conducting wind tunnel experiments has been a costly endeavor, often succeeded by high-precision large eddy simulations (CFDs). This study utilizes the infill strategy detailed in this paper to incorporate additional samples, comparing its effectiveness with the widely used VFEI and augmented EI strategies. In summary, the proposed ASIS, which integrates three samples (two lf samples and one hf sample), demonstrates a performance that is comparable to the VFEI approach, which adds three samples (one lf sample and two hf samples). Conversely, the augmented EI strategy enhances accuracy by incorporating three hf samples.

The MHK infill sample optimizations via ASIS, VFEI, and augmented EI are shown in Figure 18. A thorough analysis indicates that while the augmented EI method improves overall accuracy and aligns closely with the high-fidelity model, it does neglect the trend information available from low-fidelity samples. The VFEI approach, leveraging a comparative strategy for each added sample, tends to favor high-fidelity samples in the pursuit of improved accuracy. In contrast, the strategy presented in this paper considers the potential probabilistic neighborhood of the added samples, effectively balancing the immediate accuracy enhancements provided by high-fidelity samples with the trend (gradient) improvements that low-fidelity sample points contribute.

Furthermore, the proposed ASIS enables PEMF error prediction to achieve an RMSE accuracy of less than 0.005 after the addition of only eight sample points, which is shown in Figure 19. This finding suggests that, prior to conducting actual tests, the integration of a limited number of samples through ASIS facilitates a progressive prediction of PEMF. Consequently, this method allows for the determination of the optimal number of samples required to attain the desired accuracy, thus minimizing the costs associated with trial and error.

5. Conclusions

In this article, the Adaptive Sequential Infill Sampling (ASIS) strategy was introduced, which significantly enhances the accuracy and efficiency of sample addition experiments in multi-fidelity surrogate modeling, particularly for Multi-fidelity Hamilton Kriging. The ASIS method was validated using two numerical simulations and was applied to a flow optimization analysis problem involving a NACA 0012 airfoil. The conclusions drawn from this study are as follows.

(1): The Adaptive Sequential Infill Sampling strategy is an advancement of the Best Neighborhood-based Kriging infill method, specifically designed for the optimization of multi-fidelity experiments. This strategy effectively balances accuracy and efficiency in sequential experimental optimization by employing a Probabilistic Neural Network (PNN)-enhanced expected improvement (EI) methodology. Both numerical analyses and practical applications demonstrate the effectiveness of the proposed approach.
(2): The Adaptive Sequential Infill Sampling strategy is an infill strategy that is used for experimental optimization error prediction. In order to balance the exploration between multi-fidelity models, a Probability Nearest Neighborhood method is used not only for error distribution prediction, but also for criteria optimization. Consequently, the ASIS framework delivers a robust estimate of errors throughout the sequential optimization process.
(3): The Adaptive Sequential Infill Sampling strategy demonstrates greater utility and cost-effectiveness for multi-fidelity sequential infill sampling compared to certain advanced infill strategies. This is primarily due to its capability to compute posterior predictive distributions, which enhances the estimation of sampling errors in neighboring samples. Moreover, the application of PNN for predictions based on specified error sampling enables the utilization of fewer high-fidelity data points while still meeting the required root-mean-square error (RMSE) criteria.

In addition to the airfoil optimization engineering problem discussed earlier, the ASIS framework is fully capable of addressing a wide range of sequential optimization challenges that necessitate the incorporation of new samples. In future applications, ASIS has the potential to enhance its predictive capabilities in sequential experimental design by leveraging additional Bayesian frameworks.

Author Contributions

The methodology of Adaptive Sequential Infill Sampling strategy was proposed by S.Z.; S.Z. wrote the majority of the manuscript text and conducted the numerical and engineering validation; J.M. reviewed and edited the manuscript. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data will be made available on reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Let the experimental area be represented as

χ = {[0, 1]}^{s}

, while

u

signifies the uniform design

U (n, n^{s})

, in which each column of

U (n, n^{s})

is a permutation of

1 / 2 n, 3 / 2 n, \dots, 2 n - 1 / 2 n

. The Latin hypercube design entails conducting random sampling within

u

.

Let

Ρ = {x_{1}, x_{2}, \dots x_{n}}

denote a Latin hypercube design, wherein each subcube, represented as

x_{j}

, possesses a side length of

1 / n

and is centered at

x_{j}

, referred to as

C_{x_{j}}

. Additionally, let

y_{j}

be a random sample sourced from

C_{x_{j}}

. The resulting set, designated as

{y_{1}, y_{2}, \dots, y_{n}}

, is recognized as Latin hypercube sampling.

Appendix B

The Kriging model is defined as a multivariate polynomial, namely

f (x) = \sum_{i = 1}^{p} β_{i} b_{i} (x)

(A1)

where for

i = 1 \dots p

, we can set

b_{i} (x)

as a basis function and

β_{i}

as the coefficient. Then, a Kriging model based on the sampled data set

(S, y)

can be built:

y (x) = β f^{T} (x) + Z (x)

(A2)

After building Kriging, the Kriging predictor for any unsampled site can be obtained as

\hat{y} (x^{*}) = f^{T} (x^{*}) \hat{β} + r^{T} (x^{*}) Ψ^{- 1} (Y - F \hat{β})

(A3)

where for the regression function, we have an

n \times p

matrix F as

F = [\begin{matrix} b_{1} (x^{1}) & \dots & b_{p} (x^{1}) \\ ⋮ & ⋱ & ⋮ \\ b_{1} (x^{n}) & \dots & b_{p} (x^{n}) \end{matrix}]

(A4)

And the correlation matrix

Ψ

is defined by n × n matrices

ψ (\cdot, \cdot)

:

Ψ = [\begin{matrix} ψ (x^{1}, x^{1}) & \dots & ψ (x^{1}, x^{n}) \\ ⋮ & ⋱ & ⋮ \\ ψ (x^{n}, x^{1}) & \dots & ψ (x^{n}, x^{n}) \end{matrix}]

(A5)

where

ψ (\cdot, \cdot)

is built up with a set of

θ

, which can be identified in Appendix A. After the fitting process, the prediction variance at any untried sample can be computed as

s^{2} (x^{*}) = σ^{2} (1 - r (x^{*}) Ψ^{- 1} r {(x^{*})}^{T} + \frac{1 - F^{T} Ψ^{- 1} r {(x^{*})}^{T}}{F^{'} Ψ^{- 1} F})

(A6)

where for the unsampled point

x^{*}

, we have

x^{*} = (\begin{matrix} b_{1} (x) & b_{2} (x) & \dots & b_{p} (x) \end{matrix})

as a model prediction matrix,

β = {(F^{T} Ψ F)}^{- 1} F^{T} Ψ^{- 1} y

as a coefficient vector,

r (x^{*}) = (\begin{matrix} ψ (x^{*}, x^{1}) & \dots & ψ (x^{*}, x^{n}) \end{matrix})

as a coefficient matrix between

x^{*}

and

x

, and

σ^{2} = \frac{1}{n} {(y - F β)}^{T} Ψ^{- 1} (y - F β)

as a process variance vector.

Appendix C

The Expectation Improvement (EI) algorithm is often called an efficient global optimization algorithm (EGO). Here, it is necessary to assume that the current optimal objective function value is

y_{\min}

, and the predicted response from Kriging follows

N [\hat{y} (x), s^{2} (x)]

.

\hat{Y} (x) \in N [\hat{y} (x), s^{2} (x)]

(A7)

which can be explored as

P (\hat{Y} (x)) = \frac{1}{\sqrt{2 π} s (x)} \exp (- \frac{1}{2} {(\frac{Y (x) - \hat{y} (x)}{s (x)})}^{2})

(A8)

Then, the improvement of the objective function is set as

I (x) = \max (y_{\min} - \hat{Y} (x), 0)

(A9)

The EI function is derived by weighting potential improvements with probability densities and expressed as

E [I (x)] = \{\begin{matrix} (y_{\min} - \hat{y}) Φ (\frac{y_{\min} - \hat{y}}{s}) + s ϕ (\frac{y_{\min} - \hat{y}}{s}) & s > 0 \\ 0 & s = 0 \end{matrix}

(A10)

where

ϕ

is the normal density and

Φ

is the distribution function, and new points can be added as the EI function reaches its highest value.

The Probability Improvement (PI) algorithm is similar to the Expectation Improvement algorithm, and new samples can be added when the probability of the target function reaches its maximum. The Probability Improvement function is written as

P (x) = Φ (\frac{T - \hat{y} (x)}{s (x)})

(A11)

where it is assumed that the random variable

y (x)

follows a normal distribution when setting

T < {\hat{y}}_{\min}

.

With the help of a reparameterization trick and (A7), the improvement function (A9) can be rewritten as

I (x) = \max (y_{\min} - \hat{Y} (x), 0) = \max (y_{\min} - \hat{y} (x) - s (x) z, 0) z ~ N (0, 1)

(A12)

Therefore,

\begin{array}{l} P (x) = \Pr (I (x) > 0) \\ = 1 - Φ (z) \\ = Φ (- z) \\ = Φ (\frac{\hat{y} (x) - y_{\min}}{s (x)}) \end{array}

(A13)

where

Φ (z) \equiv C D F (z)

and

z = \frac{y_{\min} - \hat{y} (x)}{s (x)}

.

References

Lei, B.; Kirk, T.Q.; Bhattacharya, A.; Pati, D.; Qian, X.; Arroyave, R.; Mallick, B.K. Bayesian optimization with adaptive surrogate models for automated experimental design. npj Comput. Mater. 2021, 7, 194. [Google Scholar] [CrossRef]
Samadian, D.; Muhit, I.B.; Dawood, N. Application of Data-Driven Surrogate Models in Structural Engineering: A Literature Review. Arch. Comput. Methods Eng. 2025, 32, 735–784. [Google Scholar] [CrossRef]
Zanobini, A.; Sereni, B.; Catelani, M.; Ciani, L. Repeatability and Reproducibility techniques for the analysis of measurement systems. Measurement 2016, 86, 125–132. [Google Scholar] [CrossRef]
Kuhn, D.R.; Reilly, M.J. An investigation of the applicability of design of experiments to software testing. In Proceedings of the 27th Annual NASA Goddard/IEEE Software Engineering Workshop, Greenbelt, MD, USA, 5–6 December 2002; pp. 91–95. [Google Scholar]
Fang, K.-T.; Lin, D.K.J.; Winker, P.; Zhang, Y. Uniform Design: Theory and Application. Technometrics 2000, 42, 237–248. [Google Scholar] [CrossRef]
Park, J.-S. Optimal Latin-hypercube designs for computer experiments. J. Stat. Plan. Inference 1994, 39, 95–111. [Google Scholar] [CrossRef]
Davis, S.E.; Cremaschi, S.; Eden, M.R. Efficient Surrogate Model Development: Impact of Sample Size and Underlying Model Dimensions. In Computer Aided Chemical Engineering; Eden, M.R., Ierapetritou, M.G., Towler, G.P., Eds.; Elsevier: Amsterdam, The Netherlands, 2018; pp. 979–984. [Google Scholar]
Fernandez-Godino, M.G.; Haftka, R.T.; Balachandar, S.; Gogu, C.; Bartoli, N.; Dubreuil, S. Noise Filtering and Uncertainty Quantification in Surrogate based Optimization. In Proceedings of the 2018 AIAA Non-Deterministic Approaches Conference, Kissimmee, FL, USA, 8–12 January 2018; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2018. [Google Scholar] [CrossRef]
Cheng, K.; Lu, Z.; Ling, C.; Zhou, S. Surrogate-assisted global sensitivity analysis: An overview. Struct. Multidiscip. Optim. 2020, 61, 1187–1213. [Google Scholar] [CrossRef]
Myers, R.H.; Montgomery, D.C. Response Surface Methodology. IIE Trans. 1996, 28, 1031–1032. [Google Scholar] [CrossRef]
Xiu, D.; Karniadakis, G.E. Modeling uncertainty in flow simulations via generalized polynomial chaos. J. Comput. Phys. 2003, 187, 137–167. [Google Scholar] [CrossRef]
Krige, D.G. A statistical approach to some basic mine valuation problems on the Witwatersrand. J. S. Afr. Inst. Min. Metall. 1951, 52, 119–139. [Google Scholar]
Regis, R.G.; Shoemaker, C.A. Constrained Global Optimization of Expensive Black Box Functions Using Radial Basis Functions. J. Glob. Optim. 2005, 31, 153–171. [Google Scholar] [CrossRef]
Mangasarian, O.L.; Musicant, D.R. Robust linear and support vector regression. IEEE Trans. Pattern Anal. Mach. Intell. 2000, 22, 950–955. [Google Scholar] [CrossRef]
Weinmeister, J.; Gao, X.; Roy, S. Analysis of a Polynomial Chaos-Kriging Metamodel for Uncertainty Quantification in Aerodynamics. AIAA J. 2019, 57, 2280–2296. [Google Scholar] [CrossRef]
Dwight, R.; Han, Z.-H. Efficient Uncertainty Quantification Using Gradient-Enhanced Kriging. In Proceedings of the 50th AIAA/ASME/ASCE/AHS/ASC Structures, Structural Dynamics, and Materials Conference, Palm Springs, CA, USA, 4–7 May 2009; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2009. [Google Scholar] [CrossRef]
Han, Z.; Zimmermann, R.; Goertz, S. On improving Efficiency and Accuracy of Variable-Fidelity Surrogate Modeling in Aero-data for Loads Context. In Proceedings of the European Air and Space Conference, Manchester, UK, 26–29 October 2009; Royal Aeronautical Society: London, UK, 2009. [Google Scholar]
Kennedy, M.C.; O’Hagan, A. Predicting the output from a complex computer code when fast approximations are available. Biometrika 2000, 87, 1–13. [Google Scholar] [CrossRef]
Forrester, A.I.J.; Sóbester, A.; Keane, A.J. Multi-fidelity optimization via surrogate modelling. Proc. R. Soc. A Math. Phys. Eng. Sci. 2007, 463, 3251–3269. [Google Scholar] [CrossRef]
Han, Z.-H.; Görtz, S. Hierarchical Kriging Model for Variable-Fidelity Surrogate Modeling. AIAA J. 2012, 50, 1885–1896. [Google Scholar] [CrossRef]
Han, Z.-H.; Görtz, S.; Zimmermann, R. Improving variable-fidelity surrogate modeling via gradient-enhanced kriging and a generalized hybrid bridge function. Aerosp. Sci. Technol. 2013, 25, 177–189. [Google Scholar] [CrossRef]
Zhang, S.; Ma, J. Applied Hamiltonian Monte Carlo for multi-fidelity kriging modelling in experiment optimization. Eng. Optim. 2025, 1–33. [Google Scholar] [CrossRef]
Wilson, J.; Hutter, F.; Deisenroth, M. Maximizing acquisition functions for Bayesian optimization. In Proceedings of the Thirty-Second Annual Conference on Neural Information Processing Systems (NIPS), San Diego, CA, USA, 2–8 December 2018. [Google Scholar]
Kushner, H.J. A New Method of Locating the Maximum Point of an Arbitrary Multipeak Curve in the Presence of Noise. J. Basic Eng. 1964, 86, 97–106. [Google Scholar] [CrossRef]
Ghoreyshi, M.; Badcock, K.J.; Woodgate, M.A. Accelerating the Numerical Generation of Aerodynamic Models for Flight Simulation. J. Aircr. 2009, 46, 972–980. [Google Scholar] [CrossRef]
Hertz-Picciotto, I.; Rockhill, B. Validity and Efficiency of Approximation Methods for Tied Survival Times in Cox Regression. Biometrics 1997, 53, 1151–1156. [Google Scholar] [CrossRef]
Currin, C.; Mitchell, T.; Morris, M.; Ylvisaker, D. Bayesian Prediction of Deterministic Functions, with Applications to the Design and Analysis of Computer Experiments. J. Am. Stat. Assoc. 1991, 86, 953–963. [Google Scholar] [CrossRef]
Zhang, Y.; Han, Z.-H.; Zhang, K.-S. Variable-fidelity expected improvement method for efficient global optimization of expensive functions. Struct. Multidiscip. Optim. 2018, 58, 1431–1451. [Google Scholar] [CrossRef]
Hao, P.; Feng, S.; Li, Y.; Wang, B.; Chen, H. Adaptive infill sampling criterion for multi-fidelity gradient-enhanced kriging model. Struct. Multidiscip. Optim. 2020, 62, 353–373. [Google Scholar] [CrossRef]
Dong, H.; Sun, S.; Song, B.; Wang, P. Multi-surrogate-based global optimization using a score-based infill criterion. Struct. Multidiscip. Optim. 2019, 59, 485–506. [Google Scholar] [CrossRef]
Zhang, S.; Ma, J. Kriging-based design of sequential experiment via best neighborhoods for small sample optimization. In Proceedings of the 43rd Chinese Control Conference (CCC), Kunming, China, 28–31 July 2024; pp. 7006–7011. [Google Scholar]
McKay, M.D.; Bolstad, J.W.; Whiteman, D.E. Application of Statistical Techniques to the Analysis of Reactor Safety Codes; Los Alamos National Laboratory (LANL): Los Alamos, NM, USA, 1978; p. 31. [Google Scholar]
Fushiki, T. Estimation of prediction error by using K-fold cross-validation. Stat. Comput. 2011, 21, 137–146. [Google Scholar] [CrossRef]
Mehmani, A.; Chowdhury, S.; Messac, A. Predictive quantification of surrogate model fidelity based on modal variations with sample density. Struct. Multidiscip. Optim. 2015, 52, 353–373. [Google Scholar] [CrossRef]
Olivanti, R.; Gallard, F.; Brézillon, J.; Gourdain, N. Comparison of Generic Multi-Fidelity Approaches for Bound-Constrained Nonlinear Optimization Applied to Adjoint-Based CFD Applications. In Proceedings of the AIAA Aviation 2019 Forum, Dallas, TX, USA, 17–21 June 2019; American Institute of Aeronautics and Astronautics: Reston, VA, USA, 2019. [Google Scholar] [CrossRef]
Ladson, C.L. Effects of Independent Variation of Mach and Reynolds Numbers on the Low-Speed Aerodynamic Characteristics of the NACA 0012 Airfoil Section; National Aeronautics and Space Administration: Washington, DC, USA, 1988.

Figure 1. Framework of ordinary experiment optimization.

Figure 2. Framework of multi-fidelity surrogate-based experiment optimization.

Figure 3. Latin hypercube sampling of normal distribution.

Figure 4. Pattern classification of sine wave-shaped boundary data sets via KNN, DANN, and PNMN.

Figure 5. PNN probability distribution of sine wave-shaped boundary data sets.

Figure 6. Framework of ASIS strategy.

Figure 7. Initial surrogate model by MHK.

Figure 8. Optimization iteration 1 of Forrestal function via MHK and ASIS.

Figure 9. Optimization iteration 2 of Forrestal function via MHK and ASIS.

Figure 10. Optimization iteration 3 of Forrestal function via MHK and ASIS.

Figure 11. Optimization iteration 4 of Forrestal function via MHK and ASIS.

Figure 12. Median error predictive distribution of RMSE via PEMF.

Figure 13. Surrogate model with four fidelity Rosebrock optimizations via GA.

Figure 14. Rosebrock MHK model via ASIS.

Figure 15. Optimization iterations of HF and MF surrogate model.

Figure 16. Multi-fidelity grid for NACA 0012 wing optimization.

Figure 17. MHK model with initial NACA Xflow data.

Figure 18. MHK infill sample optimizations via ASIS, VFEI, and augmented EI.

Figure 19. NACA 0012 median error predictive distribution of RMSE via PEMF.

Table 1. Results of Forrestal function via modern surrogate models and infill strategies.

Type	Optimal	Iteration	HF Number	Error	Time(s)
Kriging+EI	−6.1020	10	10	0.82	15
CoKriging+augmented EI	−6.0900	10	10	0.52	18
HK+augmented EI	−6.0190	8	8	0.50	10
HK+VF-EI	−6.0195	8	6	0.38	28
HK+AMEI	−6.0198	5	3	0.09	24
MHK+ augmented EI	−6.0191	8	8	0.25	22
MHK+ VF-EI	−6.0197	8	5	0.22	33
MHK+ AM-EI	−6.0200	6	4	0.12	40
MHK+ ASIS	−6.0203	4	2	0.01	20

Table 2. Parameters of multi-fidelity Rosebrock function.

Type i	1	2	3	4
$τ$	1	0.4	0.65	0.9
CR	1	1000	10	100

Table 3. Optimization results of different fidelity surrogate models.

TYPE	1	2	3	4
Iteration number	150	220	210	320
MAE	0.0065	0.0052	0.0103	0.093
Sample number	40	40,000	400	4000

Table 4. Results of Rosebrock function via modern surrogate models and infill strategies.

Type	Optimal	Iteration	HF Number	Error	Time(s)
CoKriging + augmented EI	0.0110	30	30	0.170	55
HK+augmented EI	0.0101	30	30	0.103	45
HK + VF-EI	−0.0082	26	12	0.045	28
HK + AMEI	0.0072	27	10	0.060	32
MHK + augmented EI	0.0100	30	30	0.120	40
MHK + VF-EI	−0.0090	27	11	0.055	29
MHK + AM-EI	0.0080	24	10	0.062	30
MHK + ASIS	0.0053	22	9	0.030	29

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Zhang, S.; Ma, J. Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model. Aerospace 2025, 12, 913. https://doi.org/10.3390/aerospace12100913

AMA Style

Zhang S, Ma J. Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model. Aerospace. 2025; 12(10):913. https://doi.org/10.3390/aerospace12100913

Chicago/Turabian Style

Zhang, Shixuan, and Jie Ma. 2025. "Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model" Aerospace 12, no. 10: 913. https://doi.org/10.3390/aerospace12100913

APA Style

Zhang, S., & Ma, J. (2025). Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model. Aerospace, 12(10), 913. https://doi.org/10.3390/aerospace12100913

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Adaptive Sequential Infill Sampling Method for Experimental Optimization with Multi-Fidelity Hamilton Kriging Model

Abstract

1. Introduction

2. Experimental Optimization Based on Multi-Fidelity Hamiltonian Kriging

2.1. Framework of Multi-Fidelity Surrogate-Based Experiment Optimization

2.2. Initial Sample Experiments

2.3. Multi-Fidelity Hamilton Kriging Model

2.4. Sequential Infill Sampling Strategy

2.5. Performance Criteria

3. Adaptive Sequential Infill Sampling Strategy for MHK

3.1. Definition of Multi-Fidelity Infill Sampling Strategy

3.2. Probabilistic Nearest Neighborhood

3.3. Adaptive Sequential Infill Sampling Strategy

4. Results and Discussion

4.1. Forrestal Function

4.2. Rosebrock Function

4.3. Naca 0012 Airfoil Validation

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

Appendix C

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI