You are currently viewing a new version of our website. To view the old version click .
Mathematics
  • Article
  • Open Access

28 December 2024

Strongly Coupled Designs for Computer Experiments with Both Qualitative and Quantitative Factors

,
and
1
School of Science, Minzu University of China, Beijing 100081, China
2
NITFID, LPMC & KLMDASR, School of Statistics and Data Science, Nankai University, Tianjin 300071, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
This article belongs to the Special Issue Advances in Statistical AI and Causal Inference

Abstract

Computer experiments often involve both qualitative and quantitative factors, posing challenges for efficient experimental designs. Strongly coupled designs (SCDs) are proposed in this paper to balance flexibility in run size and stratification properties between qualitative and quantitative factor columns. The existence and construction of SCDs are investigated. When s 2 is a prime or a prime power, the constructed SCDs of λ s 3 runs can accommodate 2 s 1 qualitative factors and a substantial number of quantitative factors. Furthermore, a series of SCDs with s u rows and ( u 3 ) s 3 columns of quantitative factors are constructed, where u 4 , with certain columns of quantitative factors achieving stratification in two or higher dimensions. The proposed SCDs achieve stratification between any two qualitative factors and all quantitative factors, which is superior to MCDs. With the number of levels of the qualitative factors given as s 2 , DCDs have λ s 4 rows, while SCDs have only λ s 3 rows, offering more flexibility. Furthermore, in the designs constructed in this paper with fewer than 100 rows, in 11 out of 17 cases, SCDs have a larger number and higher levels of qualitative factors than DCDs.

1. Introduction

With the rapid development of computer science, an increasing number of scientists and engineers are using computer experiments to study complex physical systems. Space-filling designs are feasible for these types of experiments as they spread the design points in the design region as uniformly as possible (Santner, Williams and Notz, 2003; Fang, Li and Sudjianto, 2006)  [1,2]. Based on the effect sparsity principle (Wu and Hamada, 2021) [3], only a few factors are expected to be active, which makes it reasonable to focus on uniformity properties in low-dimensional projections. Latin hypercube designs (LHDs), which are among the most popular space-filling designs, satisfy one-dimensional uniformity (McKay, Beckman and Conover, 1979) [4]. Owen (1992) and Tang (1993) [5,6] proposed two methods to generate LHDs based on orthogonal arrays (OAs), which inherit t-dimensional projection properties when an OA of strength t is used. He and Tang (2013) [7] proposed a new class of arrays called strong orthogonal arrays (SOAs). To achieve better stratification properties, the strength of SOAs should be 3 or higher (He and Tang, 2014) [8], though this can lead to large run sizes. SOAs of strengths of 2 + , introduced by He, Cheng and Tang (2018) [9], are more economical than SOAs of strength 3 while retaining the same two-dimensional stratification. Zhou and Tang (2019) [10] proposed SOAs of strength 3 , which maintain the same three-dimensional stratification as SOAs of strength 3, as well as the same factor levels as SOAs of strength 2. Recent studies on the construction of designs achieving stratification in low dimensions also include those of Liu et al. (2024), and Wang, Lin and Liu (2024) [11,12].
In some cases, computer experiments involve both quantitative and qualitative factors. Qian, Wu and Wu (2008) [13] proposed a Gaussian process model framework to address the integration of qualitative and quantitative factors in computer modeling. This framework achieves integration by constructing and validating correlation functions that incorporate both types of factors. Han et al. (2009) [14] proposed a Bayesian methodology for predicting computer experiments with both qualitative and quantitative inputs, modeling outputs for different qualitative levels as having similar functional behavior in the quantitative inputs. Huang et al. (2016) [15] introduced SLHDs with good uniformity for each slice and an adaptive analysis strategy to improve prediction precision and assess similarities among qualitative variable levels in computer experiments.
Qian (2012) [16] suggested sliced LHDs (SLHDs) for this type of experiment. Recently, Kumar et al. (2024) [17] investigated the construction methods for sliced orthogonal LHDs with unequal batch sizes. Guo et al. (2023) [18] proposed methods for constructing orthogonal and nearly orthogonal general SLHDs; these designs ensure orthogonality not only in the overall design but also within each layer before and after collapsing. Wang, Wang and Xue (2023) [19] proposed a sequential optimal LHD method to improve the space-filling properties of two-layer computer simulators. In SLHDs, stratification is achieved between all qualitative and quantitative factors; however, this can lead to a significant increase in the run sizes as the number and levels of qualitative factors expand. To provide more economical run sizes, Deng, Hung and Lin (2015) [20] proposed marginally coupled designs (MCDs). Subsequently, a large number of researchers have contributed to the study of MCDs with better properties, such as orthogonality or low-dimensional stratification, including He, Lin and Sun (2017); He et al. (2017); He, Lin and Sun (2019); and Zhou, Yang and Liu (2021) [21,22,23,24]. To study the interaction of the qualitative and quantitative factors, Yang et al. (2023) [25] proposed doubly coupled designs (DCDs) and studied their constructions. DCDs employ better stratification properties between qualitative and quantitative factors compared to MCDs, though this comes at the cost of reducing the number of qualitative factors. Zhou, Huang and Li (2024) [26] investigated the construction of group DCDs.
In this paper, a new type of design called strongly coupled designs (SCDs) is proposed for computer experiments with both quantitative and qualitative factors. Similar to DCDs, SCDs achieve stratification between any two qualitative factors and all quantitative factors to a certain extent. Compared to MCDs, SCDs are more feasible for studying interactions between any two qualitative factors and all quantitative factors. The stratification requirements among qualitative factors are relaxed which makes the run sizes of SCDs more flexible than those of DCDs. For example, when the levels of the qualitative factors are s 2 , DCDs divide the design corresponding to the quantitative factors into s 4 small LHDs, with each corresponding to a specific level combination of two qualitative factors, while SCDs divide it into s 3 small LHDs.
The characteristics and construction methods of SCDs are also studied. The necessary and sufficient conditions for the existence of SCDs are provided from three different perspectives, and SCDs with λ s 3 runs, where s 2 is a prime or a prime power, and λ is a positive integer, are constructed. For a given run size, the constructed SCDs can accommodate a greater number of qualitative factors than DCDs in many cases. Additionally, the construction of SCDs with desirable space-filling properties for quantitative factors is investigated. A series of designs with s u runs and ( u 3 ) s 3 columns of quantitative factors, where u 4 , are constructed. In these designs, certain quantitative factor columns achieve stratification in two or more dimensions.
The remainder of this paper is organized as follows. In Section 2, some basic concepts are introduced, SCDs are defined, and their existence is studied. In Section 3, the construction of SCDs is explored, including those with desirable space-filling properties for quantitative factors. Examples are provided to illustrate the construction process and the properties of the resulting designs. The paper is concluded with a discussion in Section 4. Some theorem proofs are deferred to the Appendix A.

2. Definitions and Characterizations

In this section, key concepts relevant to our work are reviewed, and the definition of SCDs is proposed. Additionally, three characterizations of SCDs are presented, providing heuristic guidance for the construction methods discussed in the next section.
An n × m array, where the j-th column has s j levels, { 0 , 1 , , s j 1 } , is called an OA of strength t, denoted by OA ( n , m , s 1 × × s m , t ) , if every n × t subarray contains each t-tuple with equal frequency. When s 1 = = s m = s , the array is symmetric and is denoted as OA ( n , m , s , t ) for simplicity. An OA ( n , m , s , t ) is said to be completely resolvable, denoted by CROA ( n , m , s , t ) , if its rows can be partitioned into n / s subarrays, each of which is an OA ( s , m , s , 1 ) .
An r × e array D ( r , e , s ) is called a difference scheme based on G F ( s ) if it satisfies the following property: for all i and j such that 1 i , j e and i j , the vector difference between the i-th and j-th columns contains each element of G F ( s ) with equal frequency. Difference schemes are fundamental in the construction of CROAs.
An SOA of strength t, denoted by SOA ( n , m , s t , t ) , is an n × m matrix with entries from { 0 , 1 , , s t 1 } . For any subarray consisting of g columns ( 1 g t ), it can be collapsed into an OA ( n , g , s u 1 × × s u g , g ) , where u 1 , , u g are positive integers satisfying u 1 + + u g = t . The s t levels of a factor are collapsed into s u j levels using the mapping a / s t u j , where a = 0 , 1 , , s t 1 and 1 j g . Here, b denotes the greatest integer less than or equal to b. An SOA of strength 2 + , denoted by SOA ( n , m , s 2 , 2 + ) , is an SOA ( n , m , s 2 , 2 ) with the additional property that any two distinct columns can be collapsed into both an OA ( n , 2 , s 2 × s , 2 ) and an OA ( n , 2 , s × s 2 , 2 ) . Similarly, an SOA of strength 3 , denoted by SOA ( n , m , s 2 , 3 ) , is an SOA ( n , m , s 2 , 2 + ) with the property that any three distinct columns can be collapsed into an OA ( n , 3 , s , 3 ) .
For two matrices E = ( e i j ) m × n and F = ( f i j ) u × v with entries from G F ( s ) , their Kronecker sum ⊕ is defined as
E F = e 11 + ˙ F e 1 n + ˙ F e m 1 + ˙ F e m n + ˙ F ,
where + ˙ denotes addition in G F ( s ) . 1 s is used to denote a vector of length s consisting entirely of ones. For any vector w = ( w 1 , , w m ) T , w 1 s is defined as ( w 1 , , w 1 , w 2 , , w m ) T and 1 s w is defined as ( w 1 , , w m , , w 1 , , w m ) T .
Let D = ( D 1 , D 2 ) be a design with q qualitative factors and p quantitative factors, where D 1 and D 2 are sub-designs for qualitative and quantitative factors, respectively. A design D is called an MCD if D 2 is an LHD and the rows in D 2 corresponding to each level of any factor in D 1 form a small LHD. A DCD is an MCD with the additional property that the rows in D 2 corresponding to each level combination of any two factors in D 1 form an LHD.
In this paper, we introduce a new class of designs for computer experiments with both quantitative and qualitative factors. Suppose the s 2 levels of any qualitative factor are divided into s groups based on a specified grouping method, with each group labeled by a group number: 0, 1, ..., s 1 . The definition of SCDs is presented below.
Definition 1.
An n-run design D = ( D 1 , D 2 ) , with q s 2 -level qualitative factors and p quantitative factors, is called a strongly coupled design, denoted by SCD ( n , ( s 2 ) q , p ) , if it satisfies the following conditions:
(i)  
D 2 is an LHD;
(ii) 
The rows in D 2 corresponding to each group number of any column in D 1 form an LHD;
(iii)
The rows in D 2 corresponding to each combination of group number and level from any two columns in D 1 form an LHD.
For D 1 = ( z 1 , , z q ) , to simplify the following discussion, the s 2 levels of qualitative factors are encoded as 0 , 1 , , s 2 1 . The group number is defined as z ¯ i = z i s , which is obtained by collapsing the s 2 levels into s levels, with levels sharing the same group number classified into the same group. In this paper, D 1 is set as an SOA ( n , q , s 2 , 2 + ) . For D 2 = ( d 1 , , d p ) , define
D ¯ 2 = D 2 s = ( d ¯ 1 , , d ¯ p )   and   D ˜ 2 = D ¯ 2 s 2 = ( d ˜ 1 , , d ˜ p ) ,
where a represents the largest integer not exceeding a.
Next, we present the necessary and sufficient conditions for the existence of SCDs. Theorem 1 specifies the conditions that must be satisfied between columns from D 1 and D 2 .
Theorem 1.
Suppose D 1 = ( z 1 , , z q ) is an SOA ( n , q , s 2 , 2 + ) , and D 2 = ( d 1 , , d p ) is an LHD ( n , p ) . The design D = ( D 1 , D 2 ) is an SCD ( n , ( s 2 ) q , p ) if and only if:
(i) 
( z ¯ i , d ¯ k ) is an OA ( n , 2 , s ( n / s ) , 2 ) for any 1 i q , 1 k p ;
(ii)
( z i , z ¯ j , d ˜ k ) is an OA ( n , 3 , s 2 s ( n / s 3 ) , 3 ) for any 1 i j q , 1 k p .
Assume that D 1 is generated by the formula s A + B . Theorem 2 provides the necessary and sufficient conditions that the columns of A and B must satisfy for the existence of SCDs.
Theorem 2.
Suppose that D 1 = ( z 1 , , z q ) is an SOA ( n , q , s 2 , 2 + ) generated by D 1 = s A + B , where A = ( z ¯ 1 , , z ¯ q ) and B = ( b 1 , , b q ) . The design D = ( D 1 , D 2 ) is an SCD ( n , ( s 2 ) q , p ) if and only if:
(i) 
A can be divided as ( A ( 1 ) T , , A ( n / s 3 ) T ) T , where each A ( l ) is a CROA ( s 3 , q , s , 2 ) , for 1 l n / s 3 ;
(ii)
For any two columns z ¯ i ( l ) and z ¯ j ( l ) in A ( l ) , and the corresponding rows b i ( l ) in b i , the three tuples ( z ¯ i ( l ) , z ¯ j ( l ) , b i ( l ) ) form an OA ( n , 3 , s 3 , 3 ) , for any 1 i j q , 1 l n / s 3 .
Assume that D 1 = s A + B and D ¯ 2 = s 2 C + s E + F . Theorem 3 provides the necessary and sufficient conditions that the columns of A, B, C, E, and F must satisfy.
Theorem 3.
Suppose D 1 is an SOA ( n , q , s 2 , 2 + ) and D 2 is an LHD ( n , p ) , where D 1 and D ¯ 2 are generated by D 1 = s A + B and D ¯ 2 = s 2 C + s E + F , respectively. The design D = ( D 1 , D 2 ) is an SCD ( n , ( s 2 ) q , p ) if and only if there exist five arrays: A = ( z ¯ 1 , , z ¯ q ) = O A ( n , q , s , 2 ) , B = ( b 1 , , b q ) = OA ( n , q , s , 1 ) , C = ( d ˜ 1 , , d ˜ p ) = OA ( n , p , n / s 3 , 1 ) , E = ( e 1 , , e p ) = OA ( n , p , s , 1 ) , and F = ( f 1 , , f q ) = OA ( n , q , s , 1 ) , such that for any 1 i j q and 1 k p , both ( z ¯ i , f k , e k , d ˜ k ) and ( z ¯ i , b i , z ¯ j , d ˜ k ) are OA ( n , 4 , s 3 ( n / s 3 ) , 4 ) .

3. Construction Methods

In this section, methods for constructing SCDs, as well as the ones with desirable space-filling properties for quantitative factors, are proposed. We focus on cases where s 2 is a prime or a prime power. The SCDs constructed in Section 3.1 can accommodate 2 s 1 qualitative factors and a large number of quantitative factors. For the SCDs constructed in Section 3.2, when the run size n = s u , the number of columns for quantitative factors is ( u 3 ) s 3 , and certain columns of quantitative factors achieve stratification in two or more higher dimensions.

3.1. Construction of SCDs

SCDs with run sizes n = λ s 3 can be obtained using Algorithm 1, where s 2 is a prime or a prime power, and λ is a positive integer.    
Algorithm 1: Construction of SCDs
Step 1.
Let H = ( h 1 , , h s ) be a D ( s , s , s ) . Define O = ( 0 , , s 1 ) T H = ( o 1 , , o s ) , an OA obtained by developing the difference schemes H.
Step 2.
Define a 1 = h 1 o 1 , and A i = ( h 1 , h i ) o i for i = 2 , , s . Let A ( 1 ) = = A ( λ ) = ( a 1 , A 2 , , A s ) , and A = ( A ( 1 ) T , , A ( λ ) T ) T .
Step 3.
For 1 j i , m , k i , m s , where 1 i s , 1 m 2 , j i , m k i , m and j i , m 1 , define: b 1 = h j 1 , 1 o k 1 , 1 with j 1 , 1 1 ; b i 1 = h j i , 1 o k i , 1 with k i , 1 1 and j i , 1 i , or k i , 1 = 1 and j i , 1 = i ; b i 2 = h j i , 2 o k i , 2 with j i , 2 = i . Let B i = ( b i 1 , b i 2 ) , and B ( 1 ) = , B ( λ ) = ( b 1 , B 2 , , B s ) . Then obtain B = ( B ( 1 ) T , , B ( λ ) T ) T .
Step 4.
Define c i = ω i 1 s 3 , where ω i is a permutation of ( 0 , , λ 1 ) for 1 i λ ! . For 1 l λ and 1 k s 2 ! , define e k ( l ) = ( ( 1 s β 1 , k ( l ) ) T , , ( 1 s β s , k ( l ) ) T ) T , where ( β 1 , k ( l ) T , , β s , k ( l ) T ) T is a permutation of ( 0 , , s 2 1 ) T . For 1 δ ( s 2 ! ) λ , let e δ = ( e k 1 ( 1 ) T , , e k λ ( λ ) T ) T , where 1 k l s 2 ! . Then, obtain E = ( e 1 , , e ( s 2 ! ) λ ) .
Step 5.
Let D 1 = s A + B and D ¯ 2 = ( s 2 c 1 + E , , s 2 c λ ! + E ) . For each column in D ¯ 2 , replace the s positions of level i by all possible random permutation of { i s , i s + 1 , , i s + ( s 1 ) } , i = 0 , 1 , , n / s 1 , columns juxtapose these matrices to obtain D 2 , then obtain D = ( D 1 , D 2 ) .
    To make it easier for readers to understand the algorithm, we provide the flow chart in Figure 1 to explain the algorithm.
Figure 1. Flow chart of Algorithm 1.
The properties of the constructed designs are summarized in the following theorem.
Theorem 4.
The design D constructed in Algorithm 1 is an SCD ( n , ( s 2 ) q , p ) with n = λ s 3 , q = 2 s 1 and p = λ ! ( s 2 ! ) λ ( s ! ) λ s 2 , where λ is a positive integer.
An example is now given to illustrate the construction procedure and the properties of the constructed designs.
Example 1.
For s = λ = 2 , let
H = ( h 1 , h 2 ) = 0 0 0 1 ,
then,
O = ( 0 , 1 ) T H = 0 0 1 1 0 1 1 0 T = ( o 1 , o 2 ) .
Define a 1 = h 1 o 1 = ( 0 , 0 , 1 , 1 , 0 , 0 , 1 , 1 ) T ,
A 2 = ( h 1 , h 2 ) o 2 = 0 1 1 0 0 1 1 0 0 1 1 0 1 0 0 1 T ,
then,
A = A ( 1 ) A ( 2 ) = a 1 A 2 a 1 A 2 = 0 0 1 1 0 0 1 1 0 1 1 0 0 1 1 0 0 1 1 0 1 0 0 1 0 0 1 1 0 0 1 1 0 1 1 0 0 1 1 0 0 1 1 0 1 0 0 1 T .
Define b 1 = b 21 = b 22 = h 2 o 1 = ( 0 , 0 , 1 , 1 , 1 , 1 , 0 , 0 ) T , then,
B = B ( 1 ) B ( 2 ) = b 1 B 2 b 1 B 2 = 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 0 0 1 1 1 1 0 0 T .
As λ = 2 , let w 1 = ( 0 , 1 ) and w 2 = ( 1 , 0 ) . Correspondingly, obtain
c 1 = w 1 1 8 = ( 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 , 1 , 1 , 1 , 1 , 1 , 1 , 1 , 1 ) T ,
c 2 = w 2 1 8 = ( 1 , 1 , 1 , 1 , 1 , 1 , 1 , 1 , 0 , 0 , 0 , 0 , 0 , 0 , 0 , 0 ) T .
For 1 k 24 , both ( β 1 , k ( 1 ) T , β 2 , k ( 1 ) T ) T and ( β 1 , k ( 2 ) T , β 2 , k ( 2 ) T ) T are a permutation of ( 0 , 1 , 2 , 3 ) T . Let
e δ = e k 1 ( 1 ) e k 2 ( 2 ) = ( 1 2 β 1 , k 1 ( 1 ) ) T , ( 1 2 β 2 , k 1 ( 1 ) ) T ( 1 2 β 1 , k 2 ( 2 ) ) T , ( 1 2 β 2 , k 2 ( 2 ) ) T T ,
for 1 δ 24 2 , 1 k 1 , k 2 24 . Define E = ( e 1 , , e 24 2 ) , then, obtain D ¯ 2 = ( 4 c 1 + E , 4 c 2 + E ) . The first 24 columns of D ¯ 2 are listed in Table 1. The two positions of level 0 , 1 , , 7 are replaced by all possible random permutations of { 0 , 1 } , { 2 , 3 } , , { 14 , 15 } , respectively. The columns of these matrices are then juxtaposed to form D 2 .
Table 1. The SCD generated in Example 1.
The resulting design D = ( D 1 , D 2 ) is an SCD ( 16 , 4 3 , 24 2 2 9 ) . When collapsing D 1 into 2 levels, the rows in D 2 corresponding to each level of any factor in D 1 form an LHD ( 8 , 24 2 2 9 ) . When collapsing any two columns of D 1 into 4 × 2 or 2 × 4 levels, the rows in D 2 corresponding to each level combination of any two factors in D 1 form an LHD ( 2 , 24 2 2 9 ) . Furthermore, the first two columns of D 2 achieve stratification on 2 × 2  grids.
Table 2 lists various SCDs generated for different positive integer values of λ , demonstrating the flexibility of SCDs in accommodating different run sizes. Each row in the table shows the design’s ability to accommodate qualitative and quantitative factors. As the experiment size increases, the number of quantitative factors that the design can accommodate increases significantly.
Table 2. Some generated SCDs where λ is a positive integer.
Table 3 presents the comparison of SCDs, DCDs, and MCDs with the same run sizes. All three kinds of designs can accommodate a large number of qualitative factors, with MCDs supporting significantly more qualitative and quantitative factors than SCDs and DCDs. However, SCDs and DCDs outperform MCDs in terms of the stratification property between qualitative and quantitative factors. For SCDs and DCDs, in 11 out of 17 cases, SCDs have a larger number and higher levels of qualitative factors than DCDs. In the remaining cases, SCDs accommodate only one fewer qualitative factor than DCDs.
Table 3. Comparison of SCDs, DCDs, and MCDs.

3.2. Construction of SCDs with Desirable Space-Filling Properties

This section focuses on constructing SCDs in which D 2 has desirable space-filling properties. A series of designs with s u rows and ( u 3 ) s 3 columns of quantitative factors are constructed for u 4 . These designs ensure that certain columns of quantitative factors achieve stratification in two or higher dimensions.
Let ξ 1 , , ξ u represent u independent columns with entries from G F ( s ) , each of length n = s u . A regular saturated OA can be constructed by first listing these independent columns and then adding all possible interactions. This design is denoted as S = { u 1 ξ 1 + + u u ξ u u i G F ( s ) , ( u 1 , , u u ) ( 0 , , 0 ) , and the first nonzero entry is 1 } . Subsequently, select columns from S to construct SCDs in which D 2 possesses desirable space-filling properties. For when s 3 is a prime or a prime power, we propose our construction method in Algorithm 2.
Algorithm 2: Construction of SCDs with desirable space-filling properties
Step 1.
Define A 1 = { ξ 1 + ( s 1 ) ξ 2 + w ξ 3 1 w s 2 } and A 2 = { ξ 1 + w ξ 2 + ( s 1 ) ξ 3
1 w s 2 } . Construct
A = ( ξ 1 + ( s 1 ) ξ 2 , ξ 2 + ( s 1 ) ξ 3 , A 1 , A 2 , ξ 1 + ( s 1 ) ξ 2 + ( s 1 ) ξ 3 ) .
Step 2.
Repeat ξ 2 and ξ 3 s 2 times to form B 1 = ( ξ 2 , , ξ 2 ) and B 2 = ( ξ 3 , , ξ 3 ) ,
respectively. Let B = ( ξ 1 , ξ 2 , B 1 , B 2 , ξ 1 ) . Construct D 1 = s A + B .
Step 3.
Define E = ( ξ 1 + ( s 1 ) ξ 2 , , ξ 1 + ( s 1 ) ξ 2 ) and F = ( ξ 1 , , ξ 1 ) , where ξ 1 + ( s
1 ) ξ 2 and ξ 1 are each repeated ( u 3 ) s 3 times.
Step 4.
Let R v = { ξ 1 + u 2 ξ 2 + u 3 ξ 3 + u v + 3 ξ v + 3 | u 2 , u 3 G F ( s ) , u v + 3 G F ( s ) { 0 } } { ξ 2 +
u 3 ξ 3 + u v + 3 ξ v + 3 | u 3 G F ( s ) , u v + 3 G F ( s ) { 0 } } { ξ 3 + u v + 3 ξ v + 3 | u v + 3 G F ( s )
{ 0 } } { ξ v + 3 } = ( r v , 1 , , r v , s 3 ) , where 1 v u 3 . Define
T = s u 4 1 s u 6 s u 5 s u 5 s u 4 s u 7 s u 6 s s 2 s u 4 1 1 s s u 5 s u 4 = ( t 1 , , t u 3 ) .
Let C = ( C 1 , , C s 3 ) , where
C f = ( r 1 , f , , r u 3 , f ) T , 1 f s 3 .
Step 5.
Construct D ¯ 2 = s 2 C + s E + F . For each column of D ¯ 2 , replace the s positions of
level i with a random permutation of { i s , i s + 1 , , i s + ( s 1 ) } to obtain D 2 , where
i = 0 , 1 , , s 2 1 . Finally, let D = ( D 1 , D 2 ) .
A flow chart is provided in Figure 2 to explain the algorithm.
Figure 2. Flow chart of Algorithm 2.
For s = 2 , SCDs with desirable space-filling properties can be constructed by modifying Algorithm 2 as follows: set A = ( ξ 1 , ξ 1 + ξ 2 ) , B = ( ξ 1 + ξ 2 + ξ 3 , ξ 1 + ξ 3 ) , E = ( ξ 2 , , ξ 2 ) , and F = ( ξ 1 + ξ 3 , , ξ 1 + ξ 3 ) , where ξ 2 and ξ 1 + ξ 3 are repeated ( u 3 ) s 3 times. The properties of the resulting designs for s 2 , where s is a prime or a prime power, are summarized in Theorem 5.
Theorem 5.
For s 2 , where s is a prime or a prime power, and u 4 , the design D constructed above is an SCD ( s u , ( s 2 ) q , ( u 3 ) s 3 ) with the following properties:
(i)
q = 2 , w h e n   s = 2 , 2 s 1 , w h e n   s 3   i s   a   p r i m e   o r   a   p r i m e   p o w e r ;
(ii) 
For any m columns d ˜ i 1 , , d ˜ i m with ( i 1 1 ) / ( u 3 ) = = ( i m 1 ) / ( u 3 ) , where m u 3 , the columns d ˜ i 1 , , d ˜ i m achieve stratification on s × × s grids;
(iii)
When ( i 1 ) / ( u 3 ) ( i 1 ) / ( u 3 ) , the columns d ˜ i and d ˜ i achieve stratification on s u 3 × s and s × s u 3 grids.
An example is now provided to illustrate the construction procedure and the properties of the resulting designs.
Example 2.
For s = 3 and u = 4 , let ξ 1 , ξ 2 , ξ 3 , ξ 4 be four independent columns of length 3 4 . Define A 1 = ξ 1 + 2 ξ 2 + ξ 3 and A 2 = ξ 1 + ξ 2 + 2 ξ 3 . Construct
A = ( ξ 1 , ξ 2 , ξ 3 ) 1 0 1 1 1 0 1 2 1 2 2 2 1 2 2 .
Let B 1 = ξ 2 and B 2 = ξ 3 . Then,
B = ( ξ 1 , ξ 2 , ξ 3 ) 1 0 0 0 1 0 1 1 0 0 0 0 0 1 0 .
Construct D 1 = s A + B ; the resulting D 1 is presented in Table 4.
Table 4. The generated SCD in Example 2.
Next, define E = ( ξ 1 + 2 ξ 2 , , ξ 1 + 2 ξ 2 ) and F = ( ξ 1 , , ξ 1 ) , where both ξ 1 + 2 ξ 2 and ξ 1 are repeated nine times. Let R 1 = ( ξ 1 , ξ 2 , ξ 3 , ξ 4 )
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 0 0 0 0 0 0 0 0 0 1 2 0 1 2 0 1 2 0 1 2 0 1 2 0 1 2 1 1 1 1 1 1 0 0 0 0 0 0 1 1 1 2 2 2 0 0 0 1 1 1 2 2 2 0 1 2 0 1 2 1 1 0 1 1 1 1 1 1 1 1 1 2 2 2 2 2 2 2 2 2 1 1 1 2 2 2 1 2 1 .
With T = 1 , we have C = R 1 . Construct D ¯ 2 = s 2 C + s E + F , which is also presented in Table 4. Finally, obtain D 2 by replacing the three positions of each level 0 , 1 , , 26 with random permutations of { 0 , 1 , 2 } , { 3 , 4 , 5 } , , { 78 , 79 , 80 } , respectively.
The constructed design D is an SCD ( 81 , ( 3 2 ) 5 , 27 ) , where D 1 is an SOA ( 81 , 5 , 3 2 , 2 + ) and D 2 is an LHD ( 81 , 27 ) . When D 1 is collapsed into three levels, the rows in D 2 corresponding to each level of any factor in the collapsed D 1 form an LHD ( 27 , 27 ) . Furthermore, when D 1 is collapsed into 9 × 3 or 3 × 9 levels, the rows in D 2 corresponding to each level combination of any two factors in the collapsed D 1 form an LHD ( 3 , 27 ) . Moreover, d ˜ 1 and d ˜ 2 achieve stratification on 3 × 3 grids.
Table 5 presents several SCDs constructed using the method in Algorithm 2. The designs ensure good stratification for quantitative factors while accommodating varying numbers of qualitative and quantitative factors. As the run size of the experimental design increases, the number of columns for qualitative and quantitative factors also increases. This highlights the SCD’s effectiveness in achieving space-filling and its capacity to handle multi-factor experiments efficiently.
Table 5. Some SCDs generated by Algorithm 2.

4. Concluding Remarks

In this paper, a new class of designs for computer experiments with both quantitative and qualitative factors, referred to as SCDs, is introduced. These designs can be seen as a generalization of MCDs proposed by Deng, Hung and Lin (2015) [20] and DCDs introduced by Yang et al. (2023) [25]. Similar to DCDs, certain stratification can be achieved between any two qualitative factors and all quantitative factors in SCDs. The stratification requirements among qualitative factor columns are relaxed, making the run sizes of SCDs more flexible than those of DCDs.
The characteristics and construction methods of SCDs are investigated. Necessary and sufficient conditions for the existence of SCDs are derived from three perspectives, providing valuable insights for their construction. Additionally, two methods for constructing SCDs are proposed. When s 2 is a prime or a prime power, the constructed SCDs can accommodate 2 s 1 qualitative factors and a substantial number of quantitative factors. Furthermore, a series of SCDs with s u rows and ( u 3 ) s 3 columns of quantitative factors, where u 4 , is constructed, with certain columns of quantitative factors achieving stratification in two or higher dimensions. In the future, designs for qualitative factors could be extended to achieve higher-dimensional stratification properties, such as SOAs with strength t 3 or SOAs with strength 3 .

Author Contributions

M.-M.L.: conceptualization, methodology, formal analysis, writing—original draft preparation; M.-Q.L.: methodology, supervision, writing—review and editing, funding acquisition; J.-Y.Y.: methodology, formal analysis, writing—review and editing, supervision. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant Nos. 12131001 and 12371260).

Data Availability Statement

The original contributions presented in the study are included in the article; further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Proof of Theorem 1.
According to part (ii) of Definition 1, the n / s rows in D 2 corresponding to each level of any factor in D ¯ 1 form an LHD if and only if any two columns in D ¯ 1 and D ¯ 2 together form an OA of strength 2. In other words, for any 1 i q , 1 k p , the pair ( z ¯ i , d ¯ k ) forms an OA ( n , 2 , s ( n / s ) , 2 ) . Similarly, when any two columns of D 1 are collapsed into s 2 × s or s × s 2 levels, the n / s 3 rows in D 2 corresponding to each level combination of any two factors in the collapsed D 1 form an LHD if and only if ( z i , z ¯ j , d ˜ k ) form an OA of strength 3, where 1 i j q and 1 k p . This completes the proof. □
Proof of Theorem 2.
The necessity is first shown. By Theorem 1, an SCD ( n , s q , p ) exists if and only if both conditions ( i ) and ( i i ) of Theorem 1 are satisfied. The fact that ( z i = s z ¯ i + b i , z ¯ j , d ˜ k ) forms an OA ( n , 3 , s 2 s ( n / s 3 ) , 3 ) implies that ( z ¯ i , b i , z ¯ j , d ˜ k ) forms an OA ( n , 3 , s 3 ( n / s 3 ) , 4 ) . Consequently, ( z ¯ i , b i , z ¯ j ) can be divided into n / s 3 full factorials, denoted by ( z ¯ i ( l ) , b i ( l ) , z ¯ j ( l ) ) , for l = 1 , , n / s 3 , with each corresponding to one level in d ˜ k . Since ( z ¯ i , d ¯ k ) is an OA ( n , 2 , s ( n / s ) , 2 ) , we have d ˜ k = d ¯ k / s 2 , and thus each A ( l ) = ( z ¯ 1 ( l ) , , z ¯ q ( l ) ) is a CROA ( s 3 , q , s , 2 ) .
Now, we prove the sufficiency by providing a method to construct D 2 when D 1 = s A + B , with A and B satisfying the two conditions in Theorem 2. According to Theorem 2, A = ( A ( 1 ) T , , A ( n / s 3 ) T ) T , where A ( l ) = ( A 1 ( l ) , , A s 2 ( l ) ) , is a CROA ( s 3 , q , s , 2 ) for 1 l n / s 3 . Define D ¯ 2 = ( s 2 c 1 + E , , s 2 c ( n / s 3 ) ! + E ) , where c i = ω i 1 s 3 , with ω i being a permutation of ( 0 , , n / s 3 1 ) , for 1 i ( n / s 3 ) ! . E = ( e 1 , , e ( s 2 ! ) n / s 3 ) , where e k = ( e k ( 1 ) T , , e k ( n / s 3 ) T ) T , and e k ( l ) = ( ( β 1 , k ( l ) 1 s ) T , , ( β s , k ( l ) 1 s ) T ) T , with ( β 1 , k ( l ) T , , β s , k ( l ) T ) T being a permutation of ( 0 , , s 2 1 ) T , for 1 k ( s 2 ) ! n / s 3 . Clearly, ( z ¯ i , d ¯ k ) forms an OA of strength t = 2 . Since d ˜ k = ( c 1 , , c 1 , , c ( n / s 3 ) ! ) , ( z ¯ i ( l ) , z ¯ j ( l ) , b i ( l ) ) forms an OA ( n , 3 , s 3 , 3 ) , with each corresponding to one level in d ˜ k . Therefore, ( z ¯ i , z ¯ j , d ˜ k ) forms an OA of strength 3. By Theorem 1, an SCD exists. The proof is completed. □
Proof of Theorem 3.
By Theorem 1, an SCD ( n , s q , p ) exists if and only if both conditions ( i ) and ( i i ) in Theorem 1 are satisfied. Since D ¯ 2 = s 2 C + s E + F , for any 1 i q and 1 k p , ( z ¯ i , d ¯ k ) forms an OA ( n , 2 , s ( n / s ) , 2 ) if and only if ( z ¯ i , f k , e k , d ˜ k ) forms an OA ( n , 4 , s 3 ( n / s 3 ) , 4 ) . Similarly, for D 1 = s A + B , ( z i , z ¯ j , d ˜ k ) forms an OA ( n , 3 , s 2 s ( n / s 3 ) , 3 ) if and only if ( z ¯ i , b i , z ¯ j , d ˜ k ) forms an OA ( n , 4 , s 3 ( n / s 3 ) , 4 ) . The proof is completed. □
Proof of Theorem 4.
Let O = ( 0 , , s 1 ) T H = ( o 1 , , o s ) be a CROA ( n , s , s , 2 ) , where H = ( h 1 , , h s ) is a D ( s , s , s ) . Define F i = H o i for i = 1 , , s , and let F = ( F 1 , , F s ) . The following two lemmas are first presented.
Lemma A1.
If O is a CROA ( n , q , s , 2 ) and H is a D ( s , s , s ) , then O H is a CROA ( n s , q s , s , 2 ) .
Lemma A2.
Define a 1 as the first column of F 1 , and A i as the first and i-th columns of F i for i = 1 , , s . Then, A = ( a 1 , A 1 , , A s ) = ( a 1 , , a 2 s 1 ) . For 1 j i , m , k i , m s , where 1 i s , 1 m 2 , j i , m k i , m , and j i , m 1 , define b 1 = h j 1 , 1 o k 1 , 1 with j 1 , 1 1 , b i 1 = h j i , 1 o k i , 1 with k i , 1 1 and j i , 1 i , or k i , 1 = 1 and j i , 1 = i , and b i 2 = h j i , 2 o k i , 2 with j i , 2 = i . Let B i = ( b i 1 , b i 2 ) and B = ( b 1 , B 2 , , B s ) = ( b 1 , , b 2 s 1 ) . Then, ( a i , a j , b i ) forms an OA ( n , 3 , s , 3 ) for any 1 i j 2 s 1 .
Based on the proof of Theorem 2, combined with Lemma A1 and Lemma A2, the design D = ( D 1 , D 2 ) constructed in Algorithm 1 is an SCD. □
Proof of Theorem 5.
We first present Lemma A3, which is a straightforward generalization of Theorem 3.29 from Hedayat, Sloane and Stufken (1999) [27].
Lemma A3.
For A, B, C, E, F, and R 1 , , R u 3 in Algorithm 2, we have that
(i)   
( z ¯ i , f k , c k , r 1 , f , , r u 3 , f ) is an OA ( s u , u , s , u ) , for any i k , 1 f s 3 ;
(ii)  
( z ¯ i , e i , z ¯ j , r 1 , f , , r u 3 , f ) is an OA ( s u , u , s , u ) , for any i j , 1 f s 3 ;
(iii) 
( r 1 , f , , r u 3 , f ) is an OA ( s u , u 3 , s , u 3 ) ;
(iv) 
( r 1 , f , , r u 3 , f , r v , l ) is an OA ( s u , u 2 , s , u 2 ) for any 1 v u 3 , 1 f l s 3 .
From the proof of Theorem 4 in He, Cheng and Tang (2018), D 1 is an SOA. From Theorem 3 and conditions (i) and (ii) of Lemma A3, we know that the constructed design D is an SCD. The number of quantitative factors that the SCD can accommodate is equal to the number of columns in B, so p = ( u 3 ) s 3 . The number of qualitative factors is equal to the number of columns in S 1 minus one, which leads to the conclusion of (i) in Theorem 5. For ( i 1 1 ) / ( u 3 ) = = ( i m 1 ) / ( u 3 ) , where m u 3 , the columns d ˜ i 1 , , d ˜ i m come from the same B f , with 1 f s 3 . From (iii) of Lemma A3, condition (ii) of Theorem 5 is established. When ( i 1 ) / ( u 3 ) ( i 1 ) / ( u 3 ) , d ˜ i and d ˜ i are columns from B u and B v , respectively, with u v . From (iv) of Lemma A3, condition (iii) of Theorem 5 is established. □

References

  1. Santner, T.J.; Williams, B.J.; Notz, W.I. The Design and Analysis of Computer Experiments; Springer: New York, NY, USA, 2003. [Google Scholar]
  2. Fang, K.T.; Li, R.; Sudjianto, A. Design and Modeling for Computer Experiments; Chapman & Hall/CRC: New York, NY, USA, 2006. [Google Scholar]
  3. Wu, C.F.J.; Hamada, M.S. Experiments: Planning, Analysis and Optimization, 3rd ed.; John Wiley & Sons: New York, NY, USA, 2021. [Google Scholar]
  4. McKay, M.D.; Beckman, R.J.; Conover, W.J. A comparison of three methods for selecting values of input variables in the analysis of output from a computer code. Technometrics 1979, 21, 239–245. [Google Scholar]
  5. Owen, A.B. Orthogonal arrays for computer experiments, integration, and visualization. Stat. Sin. 1992, 2, 439–452. [Google Scholar]
  6. Tang, B. Orthogonal array-based Latin hypercubes. J. Am. Stat. Assoc. 1993, 88, 1392–1397. [Google Scholar] [CrossRef]
  7. He, Y.; Tang, B. Strong orthogonal arrays and associated Latin hypercubes for computer experiments. Biometrika 2013, 100, 254–260. [Google Scholar] [CrossRef]
  8. He, Y.; Tang, B. A characterization of strong orthogonal arrays of strength three. Ann. Stat. 2014, 42, 1347–1360. [Google Scholar] [CrossRef]
  9. He, Y.; Cheng, C.S.; Tang, B. Strong orthogonal arrays of strength two plus. Ann. Stat. 2018, 46, 457–468. [Google Scholar] [CrossRef]
  10. Zhou, Y.; Tang, B. Column-orthogonal strong orthogonal arrays of strength two plus and three minus. Biometrika 2019, 106, 997–1004. [Google Scholar] [CrossRef]
  11. Liu, H.; Sun, F.; Lin, D.K.J.; Liu, M.Q. On construction of mappable nearly orthogonal arrays with column orthogonality. Commun. Math. Stat. 2024; in press. [Google Scholar] [CrossRef]
  12. Wang, C.; Lin, D.K.J.; Liu, M.Q. A new class of orthogonal designs with good low dimensional space-filling properties. Stat. Sin. 2024; in press. [Google Scholar] [CrossRef]
  13. Qian, P.Z.G.; Wu, H.; Wu, C.F.J. Gaussian process models for computer experiments with qualitative and quantitative factors. Technometrics 2008, 50, 383–396. [Google Scholar] [CrossRef]
  14. Han, G.; Santner, T.J.; Notz, W.I.; Bartel, D. Prediction for computer experiments having quantitative and qualitative input variables. Technometrics 2009, 51, 278–288. [Google Scholar] [CrossRef]
  15. Huang, H.Z.; Lin, D.K.J.; Liu, M.Q.; Yang, J.F. Computer experiments with both qualitative and quantitative variables. Technometrics 2016, 58, 495–507. [Google Scholar] [CrossRef]
  16. Qian, P.Z.G. Sliced Latin hypercube designs. J. Am. Stat. Assoc. 2012, 107, 393–399. [Google Scholar] [CrossRef]
  17. Kumar, A.A.; Mandal, B.N.; Parsad, R.; Dash, S.; Kumar, M. On construction of sliced orthogonal Latin hypercube designs. J. Stat. Theory Pract. 2024; in press. [Google Scholar] [CrossRef]
  18. Guo, B.; Li, X.; Liu, M.Q.; Yang, X. Construction of orthogonal general sliced Latin hypercube designs. Stat Pap. 2023, 64, 987–1014. [Google Scholar] [CrossRef]
  19. Wang, Y.; Wang, D.; Yue, X. Sequential Latin hypercube design for two-layer computer simulators. J. Qual. Technol. 2023, 56, 71–85. [Google Scholar] [CrossRef]
  20. Deng, X.; Hung, Y.; Lin, C.D. Design for computer experiments with qualitative and quantitative factors. Stat. Sin. 2015, 25, 1567–1581. [Google Scholar] [CrossRef]
  21. He, Y.; Lin, C.D.; Sun, F.; Lv, B.J. Marginally coupled designs for two-level qualitative factors. J. Stat. Plan. Inference 2017, 187, 103–108. [Google Scholar] [CrossRef]
  22. He, Y.; Lin, C.D.; Sun, F. On the construction of marginally coupled designs. Stat. Sin. 2017, 27, 665–683. [Google Scholar] [CrossRef]
  23. He, Y.; Lin, C.D.; Sun, F. Construction of marginally coupled designs by subspace theory. Bernoulli 2019, 25, 2163–2182. [Google Scholar] [CrossRef]
  24. Zhou, W.; Yang, J.; Liu, M.Q. Construction of orthogonal marginally coupled designs. Stat. Pap. 2021, 62, 1795–1820. [Google Scholar] [CrossRef]
  25. Yang, F.; Lin, C.D.; Zhou, Y.; He, Y. Doubly coupled designs for computer experiments with both qualitative and quantitative factors. Stat. Sin. 2023, 33, 1923–1942. [Google Scholar] [CrossRef]
  26. Zhou, W.; Huang, S.; Li, M. Group doubly coupled designs. Mathematics 2024, 12, 1352. [Google Scholar] [CrossRef]
  27. Hedayat, A.S.; Sloane, N.J.A.; Stufken, J. Orthogonal Arrays: Theory and Applications; Springer: New York, NY, USA, 1999. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Article Metrics

Citations

Article Access Statistics

Multiple requests from the same IP address are counted as one view.