1. Introduction
The application of experimental design in information theory is primarily reflected in optimizing experimental schemes to maximize information acquisition efficiency while minimizing uncertainty. The core concepts of information theory, such as entropy, provide quantitative tools for experimental design. For example, in the field of communications, experimental design can optimize channel coding schemes, thereby improving transmission reliability. Additionally, in machine learning feature selection, experimental design based on information entropy can identify the most discriminative feature subsets, reducing model complexity and enhancing classification accuracy. The integration of experimental design and information theory enables a systematic approach to solving key challenges in information acquisition, processing, and optimization, offering theoretical guidance for engineering and scientific experiments. Bose explored the interplay between statistical experimental design and information-theoretic concepts, discussing how to construct designs that maximize informational yield from data [
1]. His work established parallels between statistical efficiency (particularly variance minimization in parameter estimates) and information-theoretic principles.
Factorial designs have a wide application in various fields. The baseline parameterization (BP) model and the orthogonal parameterization (OP) model are two of the most used models in the analysis of experimental data. The BP model is a linear model based on baseline constraints while the OP model is based on zero-sum constraints. There are numerous research findings under the OP model. For details, please refer to [
2,
3] and the references therein. BP is quite a natural option for modeling the experiments with each factor having a null state or baseline level. For example, in toxicology experiments [
4], each binary factor represents the presence or absence of a particular toxin. Scientists consider the absence of all toxins to be the natural reference for the possible presence of some toxins. In such experiments, the status of the absence of a toxin can be naturally regarded as a baseline level. Another practical example is introduced by Glonek and Solomon [
5] in a leukemic mice experiment. The authors presented two binary components that stand for sample and time, respectively. A natural baseline level is the condition of the non-leukemic line for one factor and time zero for another factor. For a broader interpretation of BP, one is referred to [
6,
7].
In recent years, choosing efficient designs under BP has raised considerable attention from researchers. The
D-optimality and
A-optimality are two commonly used efficiency criteria for selecting optimal experimental designs under the main effect model. From an information perspective, they aim to maximize the information content of the experimental data or minimize the variance of parameter estimates. Mukerjee and Tang [
7] highlighted the
A-optimality of two-level orthogonal arrays for BP when the interaction effects are all absent. Mukerjee and Huda [
8] applied approximate theory together with discretization procedures to find designs that have high
A-efficiencies and are robust to model misspecification. Liu et al. [
9] employed the
D-optimality criterion to find designs under BP that are both efficient and robust.
For situations where the interaction effects are present, the experimenter usually concerns the bias caused by interaction effects in estimating the main effects. Karunanayaka and Tang [
10] proposed to add runs to one-factor-at-a-time designs for generating compromise designs that are competitive under both the efficiency criterion and the bias criterion. Mukerjee and Tang [
7] proposed an optimality criterion, the
K-aberration criterion, which quantifies the bias for two-level designs. Under the
K-aberration criterion, a few works on designs for BP emerged. Li et al. [
11] proposed an efficient incomplete search algorithm for finding nearly optimal designs and tabulated some 20-run (nearly) optimal two-level designs. Miller and Tang [
12] focused on identifying efficient two-level regular designs through bridging the
K-values and word length pattern. Mukerjee and Tang [
13] developed certain rank conditions which, in conjunction with the idea of the minimum moment aberration and recursive set, help alleviate the burden of finding optimal two-level regular designs. Lin and Yang [
14] considered finding multistratum designs for BP by using the coordinate-exchange algorithm. Li et al. [
15] further proposed a theoretical construction method of compromise designs. Chen et al. [
16] considered the situations where some two-factor interactions are also of interest in addition to the main effects and proposed an algorithm for searching optimal designs under their proposed minimum aberration criterion. Sun and Tang [
17] investigated the linear relationship between the effects under BP and those under OP and explored its applications to design construction under BP in terms of estimability, optimality, and robustness. Yan and Zhao [
18,
19] put forward a minimum aberration criterion for three-level designs for BP and found some optimal designs using their proposed construction algorithm.
This paper focuses on two-level regular designs under the
K-aberration. Though the two-level nonregular designs may outperform the regular ones in some cases, there is a compelling reason for considering two-level regular designs as has been justified in [
13]: the results on two-level regular designs serve as a benchmark for evaluating further work on nonregular ones that have to be compared. Given the importance of the two-level regular designs for BP, we endeavor to make further progress on bridging the
K-values and word length pattern and analytically calculate the quantities for two-level regular designs with a higher resolution based on the results in [
13]. Such new progress has the following advantages: (i) it can help screen out the candidate designs that are not likely to be
K-aberration optimal and thus can yield a further simplification of the search algorithm in [
13]; (ii) it is capable of finding two-level regular designs that have better or even optimal
K-aberration characteristics than those identified by the results in [
13]. Illustrative examples are given to demonstrate these points.
The rest of this paper is organized as follows.
Section 2 introduces some necessary notation and elementary knowledge of BP and the word length pattern.
Section 3 presents the main results of this paper. Applications of the theoretical results are included in
Section 4. The concluding remarks are given in
Section 5.
2. Preliminaries
Consider an experiment with
n factors each at two levels. A full design includes
runs that correspond to
level combinations of the
n factors. Suppose only a
fraction of the full design can be carried out for economic reasons. This paper considers the regular fraction of the full design. Let
denote a two-level regular fractional factorial design
D with
runs and
n columns, with each column at two levels 0 and 1; such a design can be obtained as follows: Let
. Define the
matrix
with columns arranged in Yates order, where
are
q independent columns, and the other columns are obtained by taking the component-wise sum (modulo 2) of the independent ones, and say
. A regular
design
D can be obtained by selecting
n columns of
such that
q are independent and the other
columns are component-wise sums (modulo 2) of the
q independent ones. Clearly, the design
D is an
submatrix of
.
For a regular
design
D, we regard it as a set with elements being the
n columns of
D and denote still the set as
D without causing confusion. Let
denote a subset of
s columns of
D and
denote the collection of all the possible
, where
. For ease of presentation,
is sometimes used instead of
without causing confusion. Then for any
,
is also an
submatrix of
D. Note that each row of
is an
s-tuple with entries of 0 or 1; we call it a binary
s-tuple. Let
be the number of times that the
s-tuple
occurs in the submatrix
. For a regular
design
D, let
denote the total bias to the main effects estimations caused by all the
s-factor interaction effects. Mukerjee and Tang [
7] proved that
where
and
denotes the collection of all the possible
for given
. Formula (
2) applies to any two-level designs not just the regular ones. For more details of the
K-aberration, one is referred to [
7]. A two-level orthogonal array is
K-aberration optimal if it sequentially minimizes the following sequence:
among all the two-level designs, given the run size and the number of columns.
The word length pattern (WLP) is a concept proposed for the designs under OP. With a slight modification, such a concept can also be applied to the two-level regular designs under BP as follows: For the original definition of WLP under OP, one is referred to [
20]. For any regular
baseline design
D, denote
as the sum of the components of the vector
(modulo 2), where
. Define
If the
k columns in
satisfy
(modulo 2)
or
, then we have
or 0, which leads to
and the
k columns correspond to a defining word, where
and
denote the
N-vectors with all components being 1 or 0, respectively. If the
k columns in
do not correspond to a defining word, then these columns must be independent of each other. This means that all of the binary
k-tuples appear equally often as rows in
, which leads to
. Let
, then
is the number of defining words of length
k of the regular design
D. For a given two-level regular design, we call the corresponding sequence
as its word length pattern and
t as its resolution, if the first nonzero element in Sequence (
5) is
, where
.
With the primary knowledge above, the next section builds connections between Sequences (
4) and (
5).
3. Main Results
Note that both Sequences (
4) and (
5) are closely related to the collection
of a regular
design. Recalling the definition of
in (
3), when an
s-column collection
contains no defining word, all of the binary
s-tuples appear exactly
times in
, which means that such an
contributes
to
. When an
s-column collection
contains some defining words, i.e., the columns in
form some defining words, the analyses for the contribution to
caused by such an
become complex. Similar analyses are also required when considering
. Therefore, it is necessary to investigate the possible cases of a given collection
containing defining words, so as to calculate
. Suppose a regular
design has a resolution of
t. Lemma 1 presents the maximum number of defining words in each of its
-column collections
.
Lemma 1. Suppose D is a regular design of resolution t and . Then
- (i)
contains at most two independent defining words for ;
- (ii)
contains at most one defining word for ,
where .
Proof. (i) For , denote . If contains three independent defining words , , and , then , , and generate another four defining words , , , and . These seven defining words contain at least 21 letters (columns) since each defining word contains at least letters. Note that a letter appears at most four times among the seven defining words. Then the seven defining words contain no more than 20 letters since there are only five columns in . This contradiction shows the validity of (i) for . For , the proof is similar.
(ii) If contains two defining words and with the length and , respectively, then and have at least common letters. Since and , the length of the defining word is at most , which contradicts . This completes the proof of (ii). □
Before proceeding to the main results of this section, we first introduce a lemma that is a refinement of the results from [
12,
21].
Lemma 2. Denote as j columns from a regular design D of resolution t. Suppose, among these j columns, only the first i ones correspond a defining word, i.e., or . Then, we have the following:
- (i)
The rows in the i-column matrix must consist of copies of a half replicate of the full factorial design;
- (ii)
Furthermore, in matrix , for the copies of each distinct row of matrix , all the distinct rows of matrix appear equally times,
where .
In (i) of Lemma 2, the half replicate of the full factorial design that the i-column matrix contains depends on whether or (modulo 2). This is addressed in detail in Remark 1.
Remark 1. In Lemma 2, if (modulo 2), then all the possible i-tuples that contain an even number of ones appear times in the i-column matrix . If (modulo 2), then all the possible i-tuples that contain an odd number of ones appear times in the i-column matrix .
For a defining word W, let denote the vector generated by taking component-wise sums (modulo 2) of the columns in the defining word. Then, or . Denote and as the numbers of length i defining words with and , respectively, where . Clearly, Denote as the number of pairs of length four defining words that have two common columns and , and as the number of pairs of length four defining words that have two common columns and . Theorem 1 builds the bridge between and the WLP for .
Theorem 1. Suppose D is a regular design with resolution , then Proof. We calculate and , separately. For , suppose . Since D has resolution 4, contains at most one defining word. There are five scenarios as follows:
- (a1)
contains one length-four defining word W with ;
- (a2)
contains one length-four defining word W with ;
- (a3)
contains one length-five defining word W with ;
- (a4)
contains one length-five defining word W with ;
- (a5)
The five columns in are independent of each other.
For (a1), suppose the defining word is (modulo 2) without loss of generality. According to Lemma 2 (i) and Remark 1, the rows that consist of four ones appear times in the four-column matrix . Therefore, the columns of entire ones appear times in the five-column matrix , i.e., .
For (a2), suppose the defining word is (modulo 2) without loss of generality. According to Lemma 2 (i) and Remark 1, the four-column matrix contains only rows that consist of an odd number of ones. This implies that none of the rows in matrix contains entire ones, i.e., .
With similar arguments to (a1) and (a2), we can obtain
and
for (a3) and (a4), respectively. It is obvious that
for (a5). The number of
s belonging to (a1)–(a5) are
,
,
,
, and
, respectively. With the analysis above, it yields that
Now, we consider calculating . According to Lemma 1 (i) for , there are nine possibilities for :
- (b1)
contains two independent defining words and , which generate the third defining word . Each of the three defining words has a length of four, one has and the other two have ;
- (b2)
contains two independent defining words and , which generate the third defining word . Each of the three defining words has a length of four and ;
- (b3)
contains only one defining word with a length of four and ;
- (b4)
contains only one defining word with a length of four and ;
- (b5)
contains only one defining word with a length of five and ;
- (b6)
contains only one defining word with a length of five and ;
- (b7)
contains only one defining word with a length of six and ;
- (b8)
contains only one defining word with a length of six and ;
- (b9)
The six columns in are independent of each other.
For (b1)–(b9), denote as any five-column subset of , i.e., . Now we proceed to investigate the values of and the number of s in each of the cases for (b1)–(b9).
For (b1), since there is a length-four defining word, say , with in , none of the rows in the matrix consisting of the columns involved in contains entire ones and thus . With careful checking, each must contain only one defining word, say W, which has a length of four with either or . For the s of the former case, we have and there are two such s in . For the s of the latter case, we have and there are four such s in . Note that the in (b1) contains a pair of length-four defining words that have two columns in common and their . Recalling the meaning of , we conclude that the number of s belonging to (b1) is .
For (b2), since contains three length-four defining words and each of which has , we have according to Lemma 2 (ii). Note that each must contain one of these three defining words. From Lemma 2 (ii), we have for each and there are six such s in . Note that there are three pairs of length-four defining words in and each pair has two columns in common. Thus, it has totally s in (b2).
For (b3), it is easy to obtain that . Each contains either a length-four defining word with or five independent columns. For the s of the former case, we have and there are two such s in . For the s of the latter case, we have and there are four such s in . Now we investigate the number of s that belong to (b3). The four columns of each of the defining words jointed with any two of the remaining columns of D induce an . Notably, the three defining words in each of case (b2) induce exactly the itself. Similarly, the in case (b1) can be induced by the length-four defining word with in it. Therefore, the number of s in case (b3) is .
For (b4), we have as there is a length-four defining word with . With a similar analysis to (b3), among the six s in , two of them have and four of them have , depending on whether the contains a length-four defining word with or not. The four columns of each of the defining words jointed with any two of the remaining columns of D induce an in . One thing to note is that the two length-four defining words with in each of case (b1) induce exactly the itself. The number of s belonging to (b4) is .
For cases (b5)–(b8), the results on the values of
s, the number of
s belonging to each case, the values of
s, and the number of
s in each
, which have the same values of
s, are straightforward. These results are summarized in
Table 1 along with those for cases (b1)–(b4), where the notation
represents the number of
s of each value. Note that each
in (b9) has
and
for each
; this results in
. Therefore, there is no need to consider case (b9) when calculating
.
With
Table 1, it is obtained that
With Equations (
7) and (
8), we obtain
This completes the proof. □
Theorem 2 below builds the relationship between and the WLP for .
Theorem 2. Suppose D is a regular design with resolution , thenfor an odd , andfor an even . Proof. To calculate
in (
2) for
, consider five possibilities for
:
- (c1)
contains only one defining word with length t and ;
- (c2)
contains only one defining word with length t and ;
- (c3)
contains only one defining word with length and ;
- (c4)
contains only one defining word with length and ;
- (c5)
consists of columns that are independent of each other.
With similar analyses to Theorem 1, we have
Table 2 and
Table 3 for calculating
for an odd and even
, respectively. With
Table 2 and
Table 3, we obtain that
for an odd
, and
for an even
.
Considering , there are seven possibilities for :
- (d1)
contains only one defining word and its length is t with ;
- (d2)
contains only one defining word and its length is t with ;
- (d3)
contains only one defining word and its length is with ;
- (d4)
contains only one defining word and its length is with ;
- (d5)
contains only one defining word and its length is with ;
- (d6)
contains only one defining word and its length is with ;
- (d7)
consists of columns that are independent of each other.
With similar analyses to Theorem 1, we have
Table 4 and
Table 5 for calculating
for an odd and even
, respectively. Note that each
in (d7) has
and
for each
; this results in
. Therefore, there is no need to consider case (d7) when calculating
.
With
Table 4 and
Table 5, we obtain that
for both an odd and even
.
With Equations (
9)–(
11), we have
for an odd
, and
for an even
. This completes the proof. □
Remark 2. Theorems 1 and 2 establish relationships between the
K-aberration and WLP that are further developments based on the work in [
12]. Theorems 1 and 2 help narrow down the choice of finding optimal regular
designs. Moreover, for some situations, Theorems 1 and 2 are capable of identifying the optimal ones. This point will be demonstrated in
Section 4.
4. Applications
It is worth noting that the concept of isomorphism for the designs under BP is different from that under OP. Under OP, two designs are called isomorphic if one can be obtained from the other by column-permuting, row-permuting, or symbol-switching. However, the symbols of the two-level designs are not interchangeable under BP. Hence, two designs are called isomorphic under BP if one can be obtained from the other by column-permuting or row-permuting. Hereafter, we use the terms OP regular
designs versus BP regular
designs as discriminations. Clearly, switching symbols of some columns of OP regular
designs may result in nonisomorphic BP designs. In the following, we illustrate how to find BP regular
designs that have desirable
K-aberration characteristics by using the catalogs of nonisomorphic OP regular
designs displayed in [
22].
Consider finding desirable BP regular
designs under
K-aberration. By checking the catalogs of nonisomorphic OP designs displayed in [
22], all the OP regular
designs have a resolution of either
or 4. According to Theorem 1 in [
12], any OP regular
design with a resolution of
has a smaller
than those with a resolution of
, noting that
for
and
for
. Among the OP regular
designs of resolution
, the designs with the minimum
have a smaller
according to Theorem 1 (b) in [
12]. According to [
22], the unique OP regular
design with the minimum
, denoted as
, is determined by the following ten independent defining words:
,
,
,
,
,
,
,
,
, and
, where
are the 1st, 2nd,
th,
th columns of the matrix
in (
1) with
. The
s of these ten defining words equaling to
or
determines
BP regular designs that may have different
K-aberration performances. According to Theorem 2 of [
12], among the
BP designs, those with the minimum
have a smaller
. For example, the following two BP regular
designs
and
have the minimum
, which results in the minimum
:
Although
and
have the same value of
, they can be discriminated with respect to
by applying Theorem 1. Compared to
,
has the same
and
but a smaller
than
. This means that
has a smaller
than
according to Theorem 1. As a confirmation, we calculate the
values of the previously stated
BP regular
designs, and it transpires that
is one of the
K-aberration optimal BP regular
designs.
Here is an example of the application of Theorem 2. Consider finding BP regular
designs that have desirable
K-aberration characteristics. By checking the catalogs of nonisomorphic OP regular
designs provided in [
22], we only need to consider the OP regular
design, denoted as
, determined by the independent defining words
,
, and
, since it has
and the minimum
among all the nonisomorphic regular
designs, where
is the
th column of the matrix
in (
1) with
,
. There are
BP regular
designs associated with
depending on whether the previously mentioned three defining words are equal to
or
. Among these regular BP
designs, those with the minimum
have the minimum
according to Theorem 2 of [
12]. For example, the regular BP
design, denoted as
, which is determined by the defining words
,
, and
, has the minimum
and then the minimum
. At the same time,
has the minimum
, which indicates that
has the minimum
according to Theorem 2. As a confirmation, we calculate the
values of all the
BP regular designs and find that
is one of the
K-aberration optimal BP regular
designs.
The two examples above show that Theorems 1 and 2 can help to filter or select designs. Take Theorem 1 as an example. When the experimenters need to compare designs with resolution 4, they can firstly calculate , , , , , , and according to the defining words of the designs and then according to Theorem 1. Clearly, this is time-saving compared with calculating among the designs.
The results of Theorems 1 and 2 establish a relation between the K-values and word length pattern of a design. To facilitate practitioners in other fields applying the methods, the following algorithm is provided based on Theorem 1. The algorithm (Algorithm 1) can be directly extended if Theorem 2 is required.
Algorithm 1: For a given n and m, consider a design D with resolution 4. |
- Step 1.
List all the defining words of D. - Step 2.
Calculate , , , , , , and according to the defining words. - Step 3.
Calculate using ( 6) directively.
|
Here, we would like to point out that, to calculate the defining words of
D, one should refer to [
20].