Article

A New Orthogonal Least Squares Identification Method for a Class of Fractional Hammerstein Models

1 School of Internet of Things Engineering, Jiangnan University, Wuxi 214122, China
2 Key Laboratory of Advanced Process Control for Light Industry (Ministry of Education), Jiangnan University, Wuxi 214122, China
* Author to whom correspondence should be addressed.
Algorithms 2025, 18(4), 201; https://doi.org/10.3390/a18040201
Submission received: 12 March 2025 / Revised: 27 March 2025 / Accepted: 1 April 2025 / Published: 3 April 2025
(This article belongs to the Section Algorithms for Multidisciplinary Applications)

Abstract

It is known that fractional-order models can effectively represent complex high-order systems with fewer parameters. This paper focuses on the identification of a class of multiple-input single-output fractional Hammerstein models. When the commensurate order is assumed to be known, a greedy orthogonal least squares method is proposed to simultaneously identify the parameters and system orders, combined with a stopping rule based on the Bayesian information criterion. Subsequently, the commensurate order is determined by minimizing the normalized output error. The proposed method is validated by applying it to identify a CD-player arm system.

1. Introduction

The majority of real processes demonstrate intricate nonlinear dynamic behavior, which can be approximated by using nonlinear models. Due to the inherent diversity and complexity of these models, their development and identification continue to be a central focus of ongoing research efforts. Block-oriented Hammerstein nonlinear systems exhibit remarkable versatility and utility, making them highly effective for modeling a wide range of practical nonlinear dynamic systems, such as pH neutralization processes [1], heat exchangers [2], fuel cell systems [3], and wind turbines [4]. Thus, Hammerstein systems have remained a prominent and actively researched topic over the past decades.
On the other hand, numerous physical nonlinear processes exhibit fractional dynamics, characterized by hereditary properties and infinite-dimensional structures. Fractional operators are inherently capable of capturing memory effects and provide additional degrees of freedom for describing both existing and potential physical attributes of systems. This has generated significant interest among researchers across various disciplines in leveraging fractional derivatives for system modeling [5]. Fractional calculus, a natural extension of traditional integer-order calculus, has emerged as a powerful tool for constructing complex nonlinear models. Compared to integer-order systems, fractional calculus offers a more concise and accurate representation for systems with intricate dynamics [6,7]. Research on the identification of fractional systems has been particularly active, with a predominance of studies focusing on linear cases [8,9,10]. However, most real-world systems inherently exhibit nonlinear behavior to varying degrees, and their modeling remains an open research challenge due to the diversity of their structures. This paper focuses on the identification of fractional Hammerstein systems, particularly those in which the linear component exhibits fractional characteristics.
A wide range of methodologies have been proposed in the literature for the identification of classical integer-order Hammerstein systems [11,12,13,14,15]. For the fractional case, an adapted version of the simplified refined instrumental variable method has been introduced to estimate the parameters of fractional Hammerstein models under the assumption that all differentiation orders are known, with Monte Carlo simulations employed to evaluate the performance of the proposed approaches in [16]. Jin et al. derived an adaptive immune algorithm based on a global search strategy, which was employed to determine initial values for the fractional Hammerstein model’s coefficients and order, which were then refined using an auxiliary model recursive least squares method [17]. Rahmani et al. developed an iterative linear optimization algorithm integrated with Lyapunov stability theory, which was applied to determine the fractional order and parameters of a neuro-fractional Hammerstein model [18]. Meanwhile, ref. [19] utilizes principal component analysis within a subspace identification framework to identify coefficient matrices of fractional systems and employs singular value decomposition to directly estimate the parameters of the nonlinear part. A Levenberg–Marquardt algorithm is applied to determine system parameters, where the linear part is represented by a fractional transfer function [20]. However, these studies either involve matrix inversion operations, which typically result in high computational complexity, or focus solely on parameter identification while neglecting the estimation of the fractional order.
In this paper, a Householder transformation-based greedy orthogonal least squares (H-GOLS) algorithm is proposed for the identification of multi-input single-output (MISO) fractional Hammerstein models. The proposed method first reformulates the system into a pseudo-linear regression model via over-parameterization, introducing upper bounds on the nonlinear order and the regression length. The parameters are then estimated using orthogonal least squares. In each iteration, a Householder transformation is applied to the information matrix, converting it into an upper triangular matrix. This approach eliminates the need for matrix inversion, thereby significantly reducing the computational complexity and mitigating the risk of ill-conditioned solutions for high-dimensional matrices. The selection of columns in each iteration is guided by a greedy criterion. Additionally, we adopt a modified stopping criterion based on the Bayesian information criterion (BIC) to determine the sparsity level and model order. Finally, an output error criterion function is introduced to estimate the approximate fractional order.
The remainder of this paper is organized as follows. Section 2 discusses the mathematical background on fractional differentiation, the structure of the fractional Hammerstein system with multiple inputs, and its identification problem formulation. Section 3 derives an algorithm for identifying the MISO fractional Hammerstein systems. Section 4 applies two examples to support the proposed algorithm. Finally, some conclusions are given in Section 5.

2. Model and Problem Formulation

2.1. Mathematical Background of Fractional Differentiation

Generally, there are three widely used definitions of fractional calculus: the Grünwald–Letnikov (GL) definition [21], the Riemann–Liouville (RL) definition [22], and the Caputo definition [23]. Among these, the GL definition is the most frequently used due to its ease of implementation and programming. Therefore, this paper adopts the GL definition as the basis for investigation. The concept of fractional calculus can be expressed as
$$\Delta^{\alpha} f(t) = \lim_{h \to 0} \frac{1}{h^{\alpha}} \sum_{j=0}^{\lfloor (t-t_0)/h \rfloor} (-1)^{j} \binom{\alpha}{j} f(t-jh), \quad (1)$$
where $\lfloor \cdot \rfloor$ denotes the floor operator, $\Delta$ represents the differentiation operator $\Delta = \frac{d}{dt}$, $\alpha > 0$ is the fractional order, and the binomial term $\binom{\alpha}{j}$ is defined as
$$\binom{\alpha}{j} = \begin{cases} 1, & j = 0, \\ \dfrac{\alpha(\alpha-1)\cdots(\alpha-j+1)}{j!}, & j > 0. \end{cases} \quad (2)$$
Define $w_j^{\alpha} = (-1)^{j} \binom{\alpha}{j}$ and let $t_0 = 0$. To evaluate the fractional derivative numerically, the parameter $h$ in Equation (1) is replaced by the sampling period, and the limit operation is dropped:
$$\Delta^{\alpha} f(t) = \frac{1}{h^{\alpha}} \sum_{j=0}^{\lfloor t/h \rfloor} w_j^{\alpha} f(t-jh) + O(h). \quad (3)$$
In the above equation, the error term $O(h)$ is directly related to the length of the sampling interval. Therefore, to keep the approximation error negligible, the sampling interval should be sufficiently short.
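The truncated GL sum of Equation (3) is straightforward to implement on uniformly sampled data. The sketch below (Python, illustrative only) computes the weights $w_j^{\alpha}$ with the recurrence $w_0 = 1$, $w_j = w_{j-1}\,(1 - (\alpha+1)/j)$, which follows directly from the binomial definition in Equation (2):

```python
import numpy as np

def gl_weights(alpha, n):
    """Grunwald-Letnikov weights w_j = (-1)^j * binom(alpha, j), j = 0..n,
    computed with the recurrence w_j = w_{j-1} * (1 - (alpha + 1)/j)."""
    w = np.empty(n + 1)
    w[0] = 1.0
    for j in range(1, n + 1):
        w[j] = w[j - 1] * (1.0 - (alpha + 1.0) / j)
    return w

def gl_derivative(f, alpha, h):
    """Approximate Delta^alpha f on a uniform grid f = [f(0), f(h), ...]
    by the truncated GL sum (1/h^alpha) * sum_j w_j * f(t - jh)."""
    n = len(f)
    w = gl_weights(alpha, n - 1)
    out = np.empty(n)
    for i in range(n):
        out[i] = np.dot(w[: i + 1], f[i::-1]) / h ** alpha
    return out

h = 0.01
t = np.arange(0.0, 1.0, h)
d = gl_derivative(t, 1.0, h)  # alpha = 1 recovers the first derivative of f(t) = t
```

For $\alpha = 1$ the weights reduce to $[1, -1, 0, \ldots]$ and the sum collapses to the usual backward difference; for non-integer $\alpha$ every past sample contributes, which is exactly the memory effect discussed above.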
Under the condition that $f(t)$ is relaxed at $t = 0$ ($f(t) = 0$ for all $t < 0$), the Laplace transform of $\Delta^{\alpha}$ is given by $\mathcal{L}\{\Delta^{\alpha} f(t)\} = s^{\alpha} F(s)$ [24]. Therefore, for the following fractional mathematical model,
$$y(t) + a_1 \Delta^{\gamma_1} y(t) + \cdots + a_L \Delta^{\gamma_L} y(t) = b_0 \Delta^{\beta_0} u(t) + b_1 \Delta^{\beta_1} u(t) + \cdots + b_K \Delta^{\beta_K} u(t), \quad (4)$$
a symbolic representation of the above system can be established using the transfer function
$$G(s) = \frac{B(s)}{A(s)} = \frac{\sum_{i=0}^{K} b_i s^{\beta_i}}{1 + \sum_{j=1}^{L} a_j s^{\gamma_j}}. \quad (5)$$
Additionally, if $G(s)$ is commensurate of order $\alpha$, it can be rewritten as
$$G(s) = \frac{\sum_{i=0}^{k} \tilde{b}_i s^{i\alpha}}{1 + \sum_{j=1}^{l} \tilde{a}_j s^{j\alpha}}, \quad (6)$$
where $k = \beta_K / \alpha$ and $l = \gamma_L / \alpha$ are integers; for $i \in \{0, 1, \ldots, k\}$ and $j \in \{1, \ldots, l\}$, we have [16]
$$\tilde{b}_i = \begin{cases} b_{i'} & \text{if } \exists\, i' \in \{0, 1, \ldots, K\} \text{ such that } i\alpha = \beta_{i'}, \\ 0 & \text{otherwise}, \end{cases} \qquad \tilde{a}_j = \begin{cases} a_{j'} & \text{if } \exists\, j' \in \{1, \ldots, L\} \text{ such that } j\alpha = \gamma_{j'}, \\ 0 & \text{otherwise}. \end{cases} \quad (7)$$

2.2. MISO Fractional Hammerstein System

Consider a MISO fractional Hammerstein CAR model described by
$$\bar{y}(t) = \sum_{i=1}^{r} \frac{B_i(s)}{A(s)} \bar{u}_i(t), \qquad y(t) = \bar{y}(t) + \frac{1}{A(s)} v(t), \quad (8)$$
where $\{y(t)\}$ is the measured data of the unobserved output $\{\bar{y}(t)\}$, $\{v(t)\}$ is a noise with zero mean and variance $\sigma^2$, $A(s) = 1 + \sum_{i=1}^{n_a} a_i s^{i\alpha}$, and $B_i(s) = \sum_{d=1}^{n_{b_i}} b_{i,d} s^{d\alpha}$. $\{\bar{u}_i(t)\}$ represents the output of the memoryless nonlinear function with known basis, which serves as the input to the linear part. It can be expressed as
$$\bar{u}_i(t) = f_i(u_i(t)) = \sum_{q=1}^{m_i} c_{i,q} f_{i,q}(u_i(t)), \quad (9)$$
where $\{u_i(t)\}$ denotes the $r$ inputs at the $t$-th sampling instant, $m_i$ is the nonlinear order, $c_{i,q}$ is a parameter to be identified, and $f_{i,q}(\cdot)$ is a known basis function. According to Equation (3), Model (8) can be rewritten as
$$y(t) = \frac{1}{1 + \sum_{i=1}^{n_a} a_i / h^{i\alpha}} \left[ \sum_{i=1}^{r} \sum_{d=1}^{n_{b_i}} \sum_{q=1}^{m_i} \frac{b_{i,d}}{h^{d\alpha}} c_{i,q} \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{d\alpha} f_{i,q}(u_i(t-jh)) - \sum_{i=1}^{n_a} \frac{a_i}{h^{i\alpha}} \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{i\alpha} y(t-jh) + v(t) \right]. \quad (10)$$
The above system can be expressed as the following pseudo-linear regression model:
$$y(t) = \varphi^{T}(t)\, \theta + \bar{v}(t), \quad (11)$$
where $\bar{v}(t) = v(t) / (1 + \sum_{i=1}^{n_a} a_i / h^{i\alpha})$ and $\varphi(t)$ represents the information vector containing the input–output data, which is expressed as
$$\varphi(t) = [\psi_a^{T}(t), \psi_{1,1}^{T}(t), \ldots, \psi_{1,p}^{T}(t), \ldots, \psi_{r,1}^{T}(t), \ldots, \psi_{r,p}^{T}(t)]^{T} \in \mathbb{R}^{n}, \qquad n = n_a + lpr,$$
$$\psi_a(t) = -\left[ \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{\alpha} y(t-jh),\ \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{2\alpha} y(t-jh),\ \ldots,\ \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{n_a \alpha} y(t-jh) \right]^{T} \in \mathbb{R}^{n_a},$$
$$\psi_{i,q}(t) = \left[ \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{\alpha} f_{i,q}(u_i(t-jh)),\ \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{2\alpha} f_{i,q}(u_i(t-jh)),\ \ldots,\ \sum_{j=1}^{\lfloor t/h \rfloor} w_j^{l\alpha} f_{i,q}(u_i(t-jh)) \right]^{T} \in \mathbb{R}^{l}, \quad (12)$$
where $p \ge \max(m_i)$ is the bound on the nonlinear order and $l \ge \max(n_{b_i})$ is the data regression length. From Equations (8) and (10), the parameter vector $\theta$ is given by
$$\theta = [\theta_a^{T}, \vartheta_1^{T}, \vartheta_2^{T}, \ldots, \vartheta_r^{T}]^{T} \in \mathbb{R}^{n}, \qquad \theta_a = [Q_1, Q_2, \ldots, Q_{n_a}]^{T} \in \mathbb{R}^{n_a}, \quad (13)$$
$$\vartheta_i = [\, c_{i,1} R_i^{T}, \underbrace{0, \ldots, 0}_{l - n_{b_i}}, \ldots, c_{i,m_i} R_i^{T}, \underbrace{0, \ldots, 0}_{l - n_{b_i}}, \underbrace{0, \ldots, 0}_{(p - m_i) l} \,]^{T} \in \mathbb{R}^{pl}, \qquad R_i = [R_{i,1}, R_{i,2}, \ldots, R_{i,n_{b_i}}]^{T} \in \mathbb{R}^{n_{b_i}}, \quad (14)$$
with
$$Q_i = \frac{a_i / h^{i\alpha}}{1 + \sum_{j=1}^{n_a} a_j / h^{j\alpha}}, \qquad R_{i,d} = \frac{b_{i,d} / h^{d\alpha}}{1 + \sum_{j=1}^{n_a} a_j / h^{j\alpha}}. \quad (15)$$
Assumption 1. 
Only $n_a$ is known, while the parameters $a_1, \ldots, a_{n_a}$, $b_{i,1}, \ldots, b_{i,n_{b_i}}$, $c_{i,2}, \ldots, c_{i,m_i}$, as well as the orders $m_i$ and $n_{b_i}$, are to be identified.
Assumption 2. 
From Equations (8)–(14), it is observed that, for any nonzero constant $\zeta$, the system would yield an identical output with $\{c_{i,q}/\zeta,\ \zeta R_i^{T}\}$. To ensure parameter uniqueness, it is assumed that $c_{i,1} = 1$ [12].
Let $Y = [y(1), y(2), \ldots, y(m)]^{T} \in \mathbb{R}^{m}$ and $\Phi = [\varphi(1), \varphi(2), \ldots, \varphi(m)]^{T} \in \mathbb{R}^{m \times n}$. The goal is to estimate the sparse parameter vector $\theta$ and then to extract the system parameters $a_i$, $b_{i,d}$ and $c_{i,q}$ from the sparse structure of $\theta$. Although minimizing the least squares criterion $J(\theta) = \|Y - \Phi\theta\|^2$ leads to the solution $\hat{\theta}_{\mathrm{LS}} = (\Phi^{T}\Phi)^{-1}\Phi^{T} Y$, it requires a large amount of sampled data, since the dimension of $\theta$ is high. Furthermore, the least squares method does not inherently produce sparse solutions, which poses another challenge in the identification process.

3. Identification Algorithm

The identification problem of the sparse vector $\theta$ can be formulated as
$$\hat{\theta} = \arg\min \|\theta\|_0, \quad \text{s.t.} \quad \|Y - \Phi\theta\|_2 \le \varepsilon, \quad (16)$$
where $\|\theta\|_0$ denotes the $\ell_0$ norm of the vector $\theta$, and $\varepsilon$ is the tolerance.
The Orthogonal Matching Pursuit (OMP) algorithm [25] is a typical greedy method used to solve this type of optimization problem. However, it requires computing the inverse of the information matrix in each iteration, which leads to a high computational cost, particularly when the matrix dimension is large. Additionally, using this method may result in an ill-conditioned solution. In this paper, we aim to address these problems by employing the Householder transformation strategy.

3.1. Orthogonal Least Squares Algorithm Based on Householder Transformation

We introduce a permutation matrix $P_{\Xi_k}$, which is constructed by performing $k$ column swaps on the $n \times n$ identity matrix $I_n$. The first $k$ columns of $P_{\Xi_k}$ can be expressed as follows [26]:
$$P_{\Xi_k}(:, 1{:}k) = [\eta_{\xi_1}, \eta_{\xi_2}, \ldots, \eta_{\xi_k}] \in \mathbb{R}^{n \times k}, \quad (17)$$
where $\Xi_k = \{\xi_1, \xi_2, \ldots, \xi_k\}$, $\xi_k$ denotes the index of the column chosen at the $k$-th iteration, and $\eta_{\xi_i} = [0_{\xi_i - 1}, 1, 0_{n - \xi_i}]^{T}$. Using the permutation matrix $P_{\Xi_k}$, the least squares criterion can be formulated as
$$J(\theta) = \|Y - (\Phi P_{\Xi_k})(P_{\Xi_k}^{T} \theta)\|^2 = \|Y - \Phi_{\Xi_k} \theta_{\Xi_k}\|^2, \quad (18)$$
where $\Phi_{\Xi_k} \in \mathbb{R}^{m \times k}$ comprises the columns of $\Phi$ that correspond to the $k$ non-zero parameters, and $\theta_{\Xi_k} \in \mathbb{R}^{k}$ is a vector containing the $k$ non-zero elements of $\theta$. Since the sub-information matrix $\Phi_{\Xi_k}$ has full column rank, there must exist an orthogonal matrix $L_k \in \mathbb{R}^{m \times m}$ and an upper triangular matrix $T_k \in \mathbb{R}^{k \times k}$ such that
$$L_k \Phi_{\Xi_k} = \begin{bmatrix} T_k \\ 0 \end{bmatrix}, \quad (19)$$
where $0$ is a zero matrix. Define
$$L_k Y = \begin{bmatrix} g_k \\ h_k \end{bmatrix}, \quad (20)$$
with $g_k \in \mathbb{R}^{k}$ and $h_k \in \mathbb{R}^{m-k}$; then, the criterion function (18) can be rewritten as
$$J(\theta_{\Xi_k}) = \left\| \begin{bmatrix} g_k \\ h_k \end{bmatrix} - \begin{bmatrix} T_k \\ 0 \end{bmatrix} \theta_{\Xi_k} \right\|^2 = \|g_k - T_k \theta_{\Xi_k}\|^2 + \|h_k\|^2. \quad (21)$$
By minimizing (21), we obtain $T_k \hat{\theta}_{\Xi_k} = g_k$ and $J(\hat{\theta}_{\Xi_k}) = \|h_k\|^2$. The solution $\hat{\theta}_{\Xi_k}$ can subsequently be derived using the following back-substitution method:
$$\hat{\theta}_k = \frac{g_k}{T_{kk}}, \qquad \hat{\theta}_i = \frac{1}{T_{ii}} \Big( g_i - \sum_{j=i+1}^{k} T_{ij}\, \hat{\theta}_j \Big), \quad i = k-1, \ldots, 1, \quad (22)$$
where $\hat{\theta}_i$ denotes the $i$-th element of $\hat{\theta}_{\Xi_k}$, and $T_{ij}$ and $g_i$ denote the $(i,j)$-th element of $T_k$ and the $i$-th element of $g_k$, respectively.
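The back-substitution step of Equation (22) is a standard triangular solve; a minimal sketch (Python, 0-indexed, illustrative only):

```python
import numpy as np

def back_substitute(T, g):
    """Solve the upper-triangular system T @ theta = g by back-substitution,
    as in Equation (22); no matrix inversion is required."""
    k = len(g)
    theta = np.zeros(k)
    for i in range(k - 1, -1, -1):
        # theta_i = (g_i - sum_{j>i} T_ij * theta_j) / T_ii
        theta[i] = (g[i] - T[i, i + 1:] @ theta[i + 1:]) / T[i, i]
    return theta

T = np.array([[2.0, 1.0],
              [0.0, 3.0]])
g = np.array([4.0, 6.0])
theta = back_substitute(T, g)
```

Because $T_k$ is triangular, the solve costs $O(k^2)$ operations, versus $O(k^3)$ for forming and inverting $\Phi_{\Xi_k}^{T}\Phi_{\Xi_k}$.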
Using the permutation matrix $P_{\Xi_k}$, we can recover the parameter estimate by
$$\hat{\theta} = P_{\Xi_k} \begin{bmatrix} \hat{\theta}_{\Xi_k} \\ 0 \end{bmatrix}. \quad (23)$$
The Householder transformation [27,28] is adopted for the QR decomposition in Equation (19). Let $\chi_{\xi_j}$ denote the column of $\Phi$ indexed by $\xi_j$, and let $I_j$ represent the identity matrix of dimension $j$. Define the Householder matrix as
$$H_j = \begin{bmatrix} I_{j-1} & 0 \\ 0 & \tilde{H}_j \end{bmatrix}, \quad j = 1, 2, \ldots, K, \quad (24)$$
with
$$\tilde{H}_j = I_{m-j+1} - \frac{2\, \omega_j \omega_j^{T}}{\omega_j^{T} \omega_j}, \quad (25)$$
$$\omega_j = \chi_{\xi_j}(j{:}m) + \|\chi_{\xi_j}(j{:}m)\|\, e_{m-j+1}, \quad (26)$$
where $e_{m-j+1}$ is an $(m-j+1)$-dimensional vector with only the first element being 1 and the rest being 0. Then, we have $\tilde{H}_j \chi_{\xi_j}(j{:}m) = -\|\chi_{\xi_j}(j{:}m)\|\, e_{m-j+1}$. Let $L_0 = I_m$; the orthogonal matrix $L_k$ can then be constructed as
$$L_k = H_k H_{k-1} \cdots H_1. \quad (27)$$
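A single Householder update of Equations (24)–(26) can be sketched as follows (Python, 0-indexed; an illustrative dense implementation, not the authors' code). With $\omega_j = x + \|x\| e$, the reflection maps $x$ to $-\|x\| e$, so successive updates triangularize the selected columns:

```python
import numpy as np

def householder_step(L, col, j):
    """One Householder update: reflect entries j..m-1 of the chosen column
    of L @ Phi onto the first coordinate axis, and fold the reflection into L."""
    m = len(col)
    x = (L @ col)[j:]                      # part of the column still to annihilate
    e = np.zeros(m - j)
    e[0] = 1.0
    # For brevity the + sign is used; production code would use sign(x[0])
    # to avoid cancellation when x is nearly aligned with -e.
    w = x + np.linalg.norm(x) * e          # omega_j = x + ||x|| e
    H = np.eye(m)
    H[j:, j:] -= 2.0 * np.outer(w, w) / (w @ w)
    return H @ L                           # L_k = H_k L_{k-1}
```

Applying one step with $j = 0$ zeroes every entry of the column except the first, whose magnitude becomes the column norm.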
Therefore, the critical factor is determining the permutation matrix $P_{\Xi_k}$. The following subsection introduces a method to obtain $P_{\Xi_k}$ using a greedy criterion combined with the Householder transformation.

3.2. Construction of the Permutation Matrix

Assuming that the sub-information matrix $\Phi_{\Xi_{k-1}}$ has already been constructed based on the active set $\Xi_{k-1}$, the next step involves selecting a candidate column from the remaining columns of $\Phi$. Let $\chi_{\xi_k}$ represent the candidate column selected at the $k$-th iteration, and define
$$\kappa_{\xi_k} = (L_{k-1} \chi_{\xi_k})(1{:}k{-}1) \in \mathbb{R}^{k-1}, \qquad \nu_{\xi_k} = (L_{k-1} \chi_{\xi_k})(k{:}m) \in \mathbb{R}^{m-k+1}. \quad (28)$$
According to Equations (24)–(26), we have
$$e_{m-k+1}^{T} \tilde{H}_k = -\frac{\nu_{\xi_k}^{T}}{\|\nu_{\xi_k}\|}, \quad (29)$$
$$L_k Y = H_k L_{k-1} Y = \begin{bmatrix} I_{k-1} & 0 \\ 0 & \tilde{H}_k \end{bmatrix} \begin{bmatrix} g_{k-1} \\ h_{k-1} \end{bmatrix} = \begin{bmatrix} g_{k-1} \\ \tilde{H}_k h_{k-1} \end{bmatrix} = \begin{bmatrix} g_{k-1} \\ \tilde{h}_1 \\ h_k \end{bmatrix}, \quad (30)$$
where $g_{k-1} = (L_{k-1} Y)(1{:}k{-}1)$, $h_{k-1} = (L_{k-1} Y)(k{:}m)$, $\tilde{h}_1$ denotes the first element of $\tilde{H}_k h_{k-1}$, and $h_k$ collects its remaining elements. From Equations (21) and (29), we have
$$J(\hat{\theta}_{\Xi_k}) = \|h_k\|^2 = \|Y\|^2 - \|g_{k-1}\|^2 - \tilde{h}_1^2, \quad (31)$$
$$\tilde{h}_1 = e_{m-k+1}^{T} \tilde{H}_k h_{k-1} = -\frac{\nu_{\xi_k}^{T} h_{k-1}}{\|\nu_{\xi_k}\|}. \quad (32)$$
By substituting Equation (32) into (31), we obtain
$$J(\hat{\theta}_{\Xi_k}) = \|Y\|^2 - \|g_{k-1}\|^2 - \frac{(\nu_{\xi_k}^{T} h_{k-1})^2}{\|\nu_{\xi_k}\|^2}, \quad (33)$$
and then we can obtain
$$\xi_k = \arg\max_{i \in \Omega \setminus \Xi_{k-1}} \frac{|\nu_i^{T} h_{k-1}|}{\|\nu_i\|}, \qquad \nu_i = (L_{k-1} \chi_i)(k{:}m), \quad (34)$$
where $\Omega = \{1, 2, \ldots, n\}$. We employ the indices $\xi_k$, $k = 1, 2, \ldots, K$, where $K = n_a + \sum_{i=1}^{r} n_{b_i} m_i$ represents the level of sparsity. The permutation matrix $P_{\Xi_K}$ can then be formed using Equation (17).
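One greedy pass combining the selection rule of Equation (34) with the Householder triangularization can be sketched as below (Python; a dense, illustrative implementation that recomputes reflections explicitly, not the paper's in-place updates):

```python
import numpy as np

def h_gols_select(Phi, Y, K):
    """Greedy orthogonal least squares sketch: at step k, pick the column
    whose transformed tail nu_i = (L_{k-1} chi_i)(k:m) best explains the
    residual tail h_{k-1} (Eq. 34), then fold in a Householder reflection."""
    m, n = Phi.shape
    L = np.eye(m)
    chosen = []
    for k in range(K):
        h = (L @ Y)[k:]                       # residual tail h_{k-1}
        best, best_score = -1, -np.inf
        for i in range(n):
            if i in chosen:
                continue
            nu = (L @ Phi[:, i])[k:]
            score = abs(nu @ h) / np.linalg.norm(nu)
            if score > best_score:
                best, best_score = i, score
        chosen.append(best)
        # Householder reflection zeroing entries k+1..m of the chosen column
        x = (L @ Phi[:, best])[k:]
        e = np.zeros(m - k)
        e[0] = 1.0
        w = x + np.linalg.norm(x) * e
        H = np.eye(m)
        H[k:, k:] -= 2.0 * np.outer(w, w) / (w @ w)
        L = H @ L
    return chosen, L
```

On a noiseless toy problem whose output is a combination of two columns, the two true columns are selected and the transformed residual tail $h_K$ shrinks to numerical zero.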

3.3. Determination of Sparsity Level

To determine the sparsity level, various information-theoretic criteria (ITCs) can be employed, including the Akaike information criterion (AIC) [29], the Bayesian information criterion (BIC) [29], and the Mallows $C_p$ criterion [30]. In this paper, we focus on the BIC, which is typically expressed as
$$\mathrm{BIC}(k) = m \ln\!\left( \frac{\|r_k\|^2}{m} \right) + k \ln(m), \quad (35)$$
where $r_k := Y - \Phi_{\Xi_k} \hat{\theta}_{\Xi_k}$ denotes the residual vector. The sparsity level can then be determined as
$$\hat{K} = \arg\min_{k = 1, \ldots, \tilde{K}} \mathrm{BIC}(k), \quad (36)$$
where $\tilde{K}$ denotes the maximum allowed level of sparsity. However, the upper bound $\tilde{K}$ is hard to determine. In this paper, we propose a new stopping rule based on the BIC criterion,
$$R(k) = \frac{\mathrm{BIC}(k) - \mathrm{BIC}(k-1)}{\mathrm{BIC}(k-1) - \mathrm{BIC}(k-2)}. \quad (37)$$
Given a small threshold $\varepsilon_0$ [31], the iteration stops when $R(k) < \varepsilon_0$. Then, we have
$$\hat{K} = k - 1. \quad (38)$$
By adopting this stopping criterion, the computational complexity is reduced, enabling the model to converge rapidly to an accurate sparse structure. Once the sparsity level $\hat{K}$ has been established, the following subsection explores how to determine the model orders and separate the mixed parameters from the obtained estimate $\hat{\theta}$.
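The BIC-based stopping rule of Equations (35)–(38) can be sketched as follows (Python, illustrative only; `residuals[k]` denotes the squared residual norm after $k$ selected columns, with `residuals[0]` equal to $\|Y\|^2$):

```python
import numpy as np

def bic(residual_sq_norm, m, k):
    """Bayesian information criterion of Equation (35)."""
    return m * np.log(residual_sq_norm / m) + k * np.log(m)

def stop_index(residuals, m, eps0=0.05):
    """Stopping rule of Equations (37)-(38): halt once successive BIC
    improvements flatten, i.e. R(k) < eps0, and return K_hat = k - 1.
    Assumes the BIC differences in the denominator are non-zero."""
    b = [bic(r, m, k) for k, r in enumerate(residuals)]
    for k in range(2, len(b)):
        R = (b[k] - b[k - 1]) / (b[k - 1] - b[k - 2])
        if R < eps0:
            return k - 1
    return len(b) - 1  # no flattening detected within the sweep
```

With residuals that drop sharply for the first two columns and then stagnate, the rule stops immediately after the stagnation and returns the last genuinely useful sparsity level.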

3.4. Identification of Orders and Separation of Parameters

Given the block-sparse nature of $\theta$, it is possible to obtain an efficient parameter estimate $\hat{\theta}$ that preserves the identical sparse structure. Within this sparse structure, $\hat{\theta}$ comprises $\sum_{i=1}^{r} \hat{m}_i + 1$ zero blocks. Let $Z_{\varrho}$ represent the number of zeros in each zero block, where $\varrho$ ranges from 1 to $\sum_{i=1}^{r} \hat{m}_i + 1$. The size of each $Z_{\varrho}$ can be detected sequentially. Upon detecting that $Z_{\varrho} > l$, the nonlinear order for each input channel can be estimated as
$$\hat{m}_i = \varrho - \sum_{j=0}^{i-1} \hat{m}_j - 1, \qquad \hat{m}_0 = 0, \quad i = 1, 2, \ldots, r. \quad (39)$$
Based on the structure of $\hat{\theta}$, the number of elements within each non-zero block, from the $(\sum_{j=1}^{i} \hat{m}_{j-1} + 2)$-th to the $(\sum_{j=1}^{i} \hat{m}_j + 1)$-th non-zero block, is constant. The linear orders $\hat{n}_{b_i}$ can be determined by counting the number of elements in any of these non-zero blocks.
According to Equations (13)–(15), all non-zero blocks in $\hat{\theta}$ contain mixed parameters, with the exception of the first block. We can obtain $\hat{a}$ and $\hat{b}_i$ from the structure of $\hat{\theta}$. Since $c_{i,1} = 1$, the $(\sum_{j=1}^{i} \hat{m}_{j-1} + 2)$-th non-zero block corresponds to $\hat{R}_i^{T}$. The estimates $\hat{c}_{i,q}$, $q = 2, \ldots, \hat{m}_i$, can be derived using the parameters from the $(\sum_{j=1}^{i} \hat{m}_{j-1} + 2)$-th to the $(\sum_{j=1}^{i} \hat{m}_j + 1)$-th non-zero blocks together with $\hat{R}_i$. Notably, $\hat{c}_{i,q}$ can be computed from any one of the ratios $\widehat{c_{i,q} R_{i,d}} / \hat{R}_{i,d}$; the average of these values is therefore taken as the estimate of $\hat{c}_{i,q}$, i.e.,
$$\hat{c}_{i,q} = \frac{1}{\hat{n}_{b_i}} \sum_{d=1}^{\hat{n}_{b_i}} \frac{\widehat{c_{i,q} R_{i,d}}}{\hat{R}_{i,d}}, \quad i = 1, \ldots, r. \quad (40)$$
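The averaging step of Equation (40) reduces to an element-wise ratio followed by a mean; a minimal illustrative sketch:

```python
import numpy as np

def separate_c(mixed_block, R_hat):
    """Recover c_{i,q} from a mixed non-zero block whose entries estimate the
    products c_{i,q} * R_{i,d}, by averaging the element-wise ratios against
    the reference block R_hat (Equation (40))."""
    return float(np.mean(np.asarray(mixed_block) / np.asarray(R_hat)))

R_hat = np.array([0.20, 0.40, 0.60])   # first non-zero block, where c_{i,1} = 1
mixed = 0.50 * R_hat                    # block estimating c_{i,2} * R_i
c_hat = separate_c(mixed, R_hat)
```

Averaging over all $\hat{n}_{b_i}$ ratios suppresses the estimation noise in any single entry of the block.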
In order to obtain the fractional order, we introduce an output error criterion function as [16]
$$J(\alpha) = 10 \log \frac{\|\epsilon(\hat{\theta}_{\alpha})\|^2}{\|Y\|^2}, \quad (41)$$
where $\epsilon(\hat{\theta}_{\alpha}) = Y - \Phi \hat{\theta}_{\alpha}$, and $\hat{\theta}_{\alpha}$ is the estimated parameter vector for the fractional order $\alpha$. By minimizing this criterion function over a range of candidate fractional orders, the approximate fractional order $\hat{\alpha}$ can be determined.
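The minimization over candidate orders amounts to a one-dimensional grid search: rebuild the regression matrix for each $\alpha$, re-estimate the parameters, and keep the $\alpha$ with the smallest criterion (41). A sketch (Python; `build_regressors` and `estimate` are caller-supplied hooks, an assumed interface rather than anything prescribed by the paper):

```python
import numpy as np

def order_search(build_regressors, estimate, Y, alphas):
    """Grid search for the commensurate order: for each candidate alpha,
    rebuild Phi, re-estimate theta, and score the fit with
    J(alpha) = 10*log10(||Y - Phi theta||^2 / ||Y||^2)  (Equation (41))."""
    best_alpha, best_J = None, np.inf
    for a in alphas:
        Phi = build_regressors(a)
        theta = estimate(Phi, Y)
        J = 10.0 * np.log10(np.sum((Y - Phi @ theta) ** 2) / np.sum(Y ** 2))
        if J < best_J:
            best_alpha, best_J = a, J
    return best_alpha, best_J

# Toy check: two power-law regressors whose exponents depend on alpha.
t = np.linspace(0.1, 1.0, 50)
build = lambda a: np.column_stack([t ** a, t ** (2 * a)])
lstsq = lambda Phi, Y: np.linalg.lstsq(Phi, Y, rcond=None)[0]
rng = np.random.default_rng(0)
Y = build(0.3) @ np.array([1.0, 2.0]) + 1e-6 * rng.standard_normal(t.size)
alpha_hat, J_min = order_search(build, lstsq, Y, [0.1, 0.2, 0.3, 0.4])
```

Because the regressors must be rebuilt and the parameters re-estimated for every candidate, this search dominates the overall cost, which motivates the joint estimation discussed in the conclusions.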
The H-GOLS algorithm is summarized in Algorithm 1, which details the complete implementation procedure.
Algorithm 1 H-GOLS algorithm for the MISO fractional Hammerstein system
Input: $\{u_i(t), y(t)\}_{t=1}^{m}$ ($i = 1, 2, \ldots, r$), $p$, $l$ and $\varepsilon_0$.
Output: $\hat{a}_1, \ldots, \hat{a}_{n_a}$, $\hat{b}_{i,1}, \ldots, \hat{b}_{i,n_{b_i}}$, $\hat{c}_{i,2}, \ldots, \hat{c}_{i,m_i}$, $\hat{n}_{b_i}$ and $\hat{m}_i$.
1: Form $\varphi(t) = [\psi_a^{T}(t), \psi_{1,1}^{T}(t), \ldots, \psi_{r,p}^{T}(t)]^{T} \in \mathbb{R}^{n}$ and $Y := [y(1), y(2), \ldots, y(m)]^{T} \in \mathbb{R}^{m}$
2: Form $\Phi := [\varphi(1), \varphi(2), \ldots, \varphi(m)]^{T} \in \mathbb{R}^{m \times n}$
3: Initialization: $\Xi_0 = \emptyset$, $h_0 = Y$, $P_{\Xi_0} = [\ ]$, $L_0 = I_m$
4: for $k = 1, 2, \ldots, n$ do
5:   Select $\xi_k$ via Equation (34)
6:   Update $\Xi_k = \Xi_{k-1} \cup \{\xi_k\}$
7:   Update $P_{\Xi_k} = [P_{\Xi_{k-1}}, \eta_{\xi_k}]$
8:   Construct $H_k$ via Equations (24)–(26)
9:   Update $L_k = H_k L_{k-1}$
10:  Compute $\nu_{\xi_k}$ via Equation (28)
11:  Compute $\|h_k\|^2$ via Equations (31) and (32)
12:  Compute $\mathrm{BIC}(k)$ via Equation (35)
13:  Compute $R(k)$ via Equation (37)
14:  if $R(k) < \varepsilon_0$, break
15: end for
16: Determine $\hat{K}$ via Equation (38)
17: Compute $\hat{\theta}_{\Xi_{\hat{K}}}$ via Equation (22) and recover $\hat{\theta}$ via Equation (23)
18: Read the estimates of $\hat{a}$, $\hat{b}_i$ and $\hat{n}_{b_i}$ from $\hat{\theta}$
19: Compute the estimates of $\hat{m}_i$ and $\hat{c}_{i,q}$ via Equations (39) and (40)
20: Determine the approximate $\hat{\alpha}$ via criterion function (41)

4. Experimental Results and Discussions

Example 1. 
Consider the following MISO fractional Hammerstein model:
$$y(t) = \sum_{i=1}^{2} \frac{B_i(s)}{A(s)} \bar{u}_i(t) + \frac{1}{A(s)} v(t), \qquad A(s) = 1 + 0.40 s^{0.3} + 0.80 s^{0.6},$$
$$B_1(s) = 0.85 s^{0.3} + 0.65 s^{0.6} + 1.25 s^{0.9}, \qquad B_2(s) = 1.20 s^{0.3} + 0.60 s^{0.6} + 0.75 s^{0.9},$$
$$\bar{u}_1(t) = u_1(t) + 0.50 u_1^2(t) + 0.40 u_1^3(t), \qquad \bar{u}_2(t) = u_2(t) + 0.70 u_2^2(t) + 0.60 u_2^3(t).$$
Set $l = 30$ and $p = 10$. The true parameter vector is
$$\vartheta = [a^{T}, b_1^{T}, b_2^{T}, c_1^{T}(2{:}3), c_2^{T}(2{:}3)]^{T} = [0.40, 0.80, 0.85, 0.65, 1.25, 1.20, 0.60, 0.75, 0.50, 0.40, 0.70, 0.60]^{T}.$$
The inputs $\{u_i(t), i = 1, 2\}$ are white noise sequences with zero mean and unit variance $\sigma_u^2 = 1.00^2$, while $\{v(t)\}$ is a noise sequence with zero mean and constant variance $\sigma^2 = 0.10^2$. This example serves as a standard numerical simulation case to demonstrate the effectiveness of the algorithm; the parameters are not fixed and can be modified as needed. For a data length of $m = 200$, the proposed H-GOLS algorithm was employed along with stopping rule (37) and a threshold of $\varepsilon_0 = 0.05$. The resulting BIC curve is illustrated in Figure 1. As indicated, the algorithm converged after 21 iterations; consequently, the sparsity level is determined to be $\hat{K} = 20$. The estimated non-sparse parameter vector is
$$\hat{\vartheta} = [0.4007, 0.7998, 0.8493, 0.6256, 1.2661, 1.2017, 0.5983, 0.7680, 0.4984, 0.4071, 0.6937, 0.5963]^{T}.$$
Defining the parameter estimation error as $\delta = \frac{\|\vartheta - \hat{\vartheta}\|}{\|\vartheta\|} \times 100\%$, we obtain $\delta = 5.91\%$.
Based on Equations (39) and (40), the following estimates are obtained:
$$\hat{m}_1 = 3, \quad \hat{m}_2 = 3, \quad \hat{n}_{b_1} = 3, \quad \hat{n}_{b_2} = 3.$$
The curve of the output error criterion $J(\alpha)$ for different values of $\alpha$ is shown in Figure 2, allowing the estimated fractional order to be determined as $\hat{\alpha} = 0.30$.
From the above results, it is clear that the proposed H-GOLS algorithm effectively estimates the system’s structure with limited data. Moreover, by implementing stopping rule (37), the algorithm reduces the computational burden while enhancing parameter estimation accuracy.
The H-GOLS, OMP, basis pursuit denoising (BPDN) [32], and least absolute shrinkage and selection operator (LASSO) [33] algorithms are employed to identify the system; Figure 3 illustrates the parameter estimation errors for varying data lengths with the noise variance $\sigma^2 = 0.10^2$. It is observed that the H-GOLS, OMP, and LASSO algorithms effectively estimate the system parameters even with restricted data lengths, with the H-GOLS algorithm exhibiting a higher degree of precision than the other two. When the data length is sufficient, all four methods can accurately estimate the system parameters; however, the H-GOLS algorithm maintains the highest estimation accuracy.
With a data length fixed at m = 500 , the system was characterized using the H-GOLS, BPDN, OMP, and LASSO algorithms. Figure 4 displays the parameter estimation errors across various noise variance levels. At lower noise variances, all four algorithms demonstrate effective parameter estimation capabilities, with the H-GOLS algorithm exhibiting the highest parameter estimation accuracy. Nevertheless, as the noise variance rises, both the BPDN and OMP algorithms exhibit growing parameter estimation errors, indicating that the H-GOLS algorithm not only achieves superior estimation accuracy but also exhibits better robustness against interference.
Computational complexity serves as a critical performance metric for algorithm evaluation, quantifying the computational resources required for execution. It is conventionally characterized by the number of floating-point operations (FLOPs), primarily encompassing multiplication and addition operations. Let m = 1000 ; the computational requirements for the H-GOLS, BPDN, OMP, and LASSO algorithms are listed in Table 1, where K B = 50 and K L = 100 are the given iterations for the BPDN algorithm and the LASSO algorithm, respectively. Since K is significantly smaller than n (i.e., K n ), the H-GOLS algorithm apparently achieves the lowest computational complexity among the four methods.
Example 2. 
Consider the CD-player arm system depicted in Figure 5; we employed measurement data sourced from DaISy, a dedicated database for system identification [34]. A total of 400 samples were collected, with the first 200 samples used to fit the fractional Hammerstein model and the remaining 200 samples reserved for validation. The inputs $\{u_1(t)\}$ and $\{u_2(t)\}$ represent the forces exerted by the mechanical actuators, characterized by high autocorrelation, while the output $\{y(t)\}$ reflects the tracking accuracy of the arm. Standardized input and output data samples for this system are shown in Figure 6.
Setting $n_a = 2$, $l = 3$, $p = 5$, and $\varepsilon_0 = 0.05$, the H-GOLS algorithm is utilized to identify the following fractional Hammerstein model:
$$y(t) = \sum_{i=1}^{2} \frac{B_i(s)}{A(s)} \bar{u}_i(t) + \frac{1}{A(s)} v(t), \qquad A(s) = 1 + a_1 s^{\alpha} + a_2 s^{2\alpha}, \qquad B_i(s) = b_{i,1} s^{\alpha} + b_{i,2} s^{2\alpha} + b_{i,3} s^{3\alpha},$$
$$\bar{u}_i(t) = c_{i,1} u_i(t) + c_{i,2} u_i^2(t) + c_{i,3} u_i^3(t) + c_{i,4} u_i^4(t) + c_{i,5} u_i^5(t),$$
where $\{v(t)\}$ is a noise sequence with zero mean and constant variance $\sigma^2 = 0.10^2$. Using the stopping criterion (37), the process stops after completing five iterations; therefore, the sparsity level is $\hat{K} = 4$. The estimated parameter vector is
$$\hat{\theta} = [0.2761, 0.1522, 0.1135, 0_{14}, 0.8091, 0_{14}]^{T} \in \mathbb{R}^{P},$$
where $P = n_a + lpr = 32$. Figure 7 displays the output fittings for both the training and validation data.
Figure 8 depicts the curve of the output error criterion $J(\alpha)$ as a function of $\alpha$, yielding an estimated fractional order of $\hat{\alpha} = 0.10$.
Then, the CD-player arm system can be expressed as
$$y(t) = \frac{0.1135 s^{0.1}}{1 + 0.2761 s^{0.1} + 0.1522 s^{0.2}}\, u_1(t) - \frac{0.8091 s^{0.1}}{1 + 0.2761 s^{0.1} + 0.1522 s^{0.2}}\, u_2(t) + \frac{1}{1 + 0.2761 s^{0.1} + 0.1522 s^{0.2}}\, v(t).$$
Table 2 presents the performance evaluation results for the fractional Hammerstein model using the H-GOLS, BPDN, OMP, and LASSO algorithms, along with results for the integer-order Hammerstein model using the H-GOLS algorithm. The results demonstrate that the fractional Hammerstein model offers a significantly more accurate representation of the CD-player arm system compared to its integer-order counterpart. Furthermore, the H-GOLS algorithm demonstrates superior performance compared to BPDN, OMP, and LASSO algorithms.

5. Conclusions

For MISO fractional Hammerstein systems, the H-GOLS algorithm is proposed for jointly identifying the model orders and parameters. This algorithm leverages the Householder transformation to triangularize the sub-information matrix, effectively addressing issues related to ill-conditioned matrices and significantly reducing computational complexity. Subsequently, the sparse parameter vector is obtained using a back-substitution approach. Compared to conventional sparse identification algorithms such as BPDN, OMP, and LASSO, the proposed H-GOLS algorithm demonstrates significant advantages in parameter estimation accuracy, noise robustness, and computational efficiency. Notably, even under limited data availability, H-GOLS maintains superior estimation precision. Furthermore, in comparison to integer-order models, fractional models offer a more accurate description of systems with complex nonlinear characteristics, such as the CD-player mechanical arm presented in this paper. Both numerical simulations and the modeling of a CD-player arm system validate the effectiveness of the proposed H-GOLS algorithm.
In this study, the fractional order $\alpha$ is estimated by first obtaining parameter estimates and then computing the system's output error across various values of $\alpha$, which is computationally intensive. In future studies, we aim to investigate methods for estimating the parameters and the fractional order $\alpha$ simultaneously during the identification process.

Author Contributions

Conceptualization, X.Y. and Y.L.; methodology, X.Y. and Y.L.; software, X.Y.; validation, X.Y. and Y.L.; formal analysis, X.Y.; investigation, Y.L.; resources, X.Y. and Y.L.; data curation, X.Y. and Y.L.; writing—original draft preparation, X.Y.; writing—review and editing, Y.L.; visualization, X.Y. and Y.L.; supervision, X.Y. and Y.L.; project administration, X.Y. and Y.L.; funding acquisition, Y.L. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China (Grant No. 61973137), the Natural Science Foundation of Jiangsu Province (Grant No. BK20201339), the China Postdoctoral Science Foundation (Grant No. 2022M711361), and the 111 project (B23008).

Data Availability Statement

The datasets generated and analyzed during the current study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

1. Huang, K.; Tang, Y.; Liu, X.; Wu, D.; Yang, C.; Gui, W. Knowledge-informed neural network for nonlinear model predictive control with industrial applications. IEEE Trans. Syst. Man Cybern. Syst. 2023, 4, 2241–2253.
2. Dragan, P.; Novak, N.; Vojislav, F.; Ljubisa, D. Multilinear model of heat exchanger with Hammerstein structure. J. Control Sci. Eng. 2016, 1, 1–7.
3. Wu, W.; Jhao, D. Control of a direct internal reforming molten carbonate fuel cell system using wavelet network-based Hammerstein models. J. Process Control 2012, 22, 653–658.
4. Van der Veen, G.; Wingerden, J.W.; Verhaegen, M. Global identification of wind turbines using a Hammerstein identification method. IEEE Trans. Control Syst. Technol. 2013, 4, 1471–1478.
5. Guha, D.; Roy, P.K.; Banerjee, S. Adaptive fractional-order sliding-mode disturbance observer-based robust theoretical frequency controller applied to hybrid wind-diesel power system. ISA Trans. 2023, 133, 160–183.
6. Qureshi, S.; Yusuf, A.; Shaikh, A.A.; Inc, M.; Baleanu, D. Fractional modeling of blood ethanol concentration system with real data application. Chaos 2019, 29, 013143.
7. Jajarmi, A.; Baleanu, D. On the fractional optimal control problems with a general derivative operator. Asian J. Control 2021, 23, 1062–1071.
8. Dai, Y.; Wei, Y.; Hu, Y.; Wang, Y. Modulating function based identification for fractional order systems. Neurocomputing 2016, 173, 1959–1966.
9. Cui, R.; Wei, Y.; Chen, Y.; Chen, S.; Wang, Y. An innovative parameter estimation for fractional-order systems in the presence of outliers. Nonlinear Dyn. 2017, 89, 453–463.
10. Djamah, T.; Bettayeb, M.; Djennoune, S. Identification of multivariable fractional order systems. Asian J. Control 2013, 15, 741–750.
11. Ding, F.; Liu, X.; Lin, G. Identification methods for Hammerstein nonlinear systems. Digit. Signal Process. 2011, 21, 215–238.
12. Ding, F.; Liu, X.; Chu, J. Gradient-based and least-squares-based iterative algorithms for Hammerstein systems using the hierarchical identification principle. IET Control Theory Appl. 2013, 7, 176–184.
13. Piao, H.; Cheng, D.; Chen, C.; Wang, Y.; Wang, P.; Pan, X. A high-accuracy CO2 carbon isotope sensing system using subspace identification of Hammerstein model for geochemical application. IEEE Trans. Instrum. Meas. 2021, 71, 1–9.
14. Bai, E.; Fu, M. A blind approach to Hammerstein model identification. IEEE Trans. Signal Process. 2002, 50, 1610–1619.
15. Chen, X.; Chai, Y.; Liu, Q.; Huang, P.; Fan, L. Identification of MISO Hammerstein system using sparse multiple kernel-based hierarchical mixture prior and variational Bayesian inference. ISA Trans. 2023, 137, 323–338.
16. Victor, S.; Malti, R.; Garnier, H.; Oustaloup, A. Parameter and differentiation order estimation in fractional models. Automatica 2013, 49, 926–935.
17. Jin, Q.; Wang, B.; Wang, Z. Recursive identification for MIMO fractional-order Hammerstein model based on AIAGS. Mathematics 2022, 10, 212.
18. Rahmani, M.R.; Farrokhi, M. Identification of neuro-fractional Hammerstein systems: A hybrid frequency-/time-domain approach. Soft Comput. 2018, 22, 8097–8106.
19. Liao, Z.; Zhu, Z.; Liang, S.; Peng, C.; Wang, Y. Subspace identification for fractional order Hammerstein systems based on instrumental variables. Int. J. Control Autom. Syst. 2012, 10, 947–953.
20. Aoun, M.; Malti, R.; Cois, O.; Oustaloup, A. System identification using fractional Hammerstein models. IFAC Proc. Vol. 2002, 35, 265–269.
21. Mohammad, M.J.; Hamed, M.; Mohammad, T. Recursive identification of multiple-input single-output fractional-order Hammerstein model with time delay. Appl. Soft Comput. 2018, 70, 486–500.
22. Qi, Z.; Sun, Q.; Ge, W.; He, Y. Nonlinear modeling of PEMFC based on fractional order subspace identification. Asian J. Control 2020, 22, 1892–1900.
23. Gao, Z.; Lin, X.; Zheng, Y. System identification with measurement noise compensation based on polynomial modulating function for fractional-order systems with a known time-delay. ISA Trans. 2018, 79, 62–72.
24. Wu, X.; Li, J.; Chen, G. Chaos in the fractional order unified system and its synchronization. J. Frankl. Inst. 2008, 345, 392–401.
25. Wang, D.; Li, L.; Ji, Y.; Yan, Y. Model recovery for Hammerstein systems using the auxiliary model based orthogonal matching pursuit method. Appl. Math. Model. 2018, 54, 537–550.
26. Liu, X.; Liu, Y.; Zhu, Q.; Ding, F. Joint parameter and time-delay estimation for a class of Wiener models based on a new orthogonal least squares algorithm. Nonlinear Dyn. 2024, 112, 12159–12170.
27. Gnanasekaran, A.; Darve, E. Hierarchical orthogonal factorization: Sparse least squares problems. J. Sci. Comput. 2022, 91, 50.
28. Kim, Y.H. QR factorization-based sampling set selection for bandlimited graph signals. Signal Process. 2021, 179, 107847.
29. Burnham, K.P.; Anderson, D.R. Multimodel inference: Understanding AIC and BIC in model selection. Sociol. Methods Res. 2004, 33, 261–304.
30. Efron, B.; Hastie, T.; Johnstone, I.; Tibshirani, R. Least angle regression. Ann. Stat. 2004, 32, 407–499.
31. Brunton, S.L.; Proctor, J.L.; Kutz, J.N. Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proc. Natl. Acad. Sci. USA 2016, 113, 3932–3937.
32. Chen, Y.; Liu, Y.; Chen, J.; Ma, J. A novel identification method for a class of closed-loop systems based on basis pursuit de-noising. IEEE Access 2020, 8, 99648–99654.
33. Li, Y.; Ling, B.; Xie, L.; Dai, Q. Using LASSO for formulating constraint of least-squares programming for solving one-norm equality constrained problem. Signal Image Video Process. 2017, 11, 179–786.
34. De Moor, B.L.R. DaISy: Database for the Identification of Systems; Department of Electrical Engineering ESAT/STADIUS, KU Leuven: Leuven, Belgium, 2024.
Figure 1. The BIC curves versus k.
Figure 2. Output error criterion J(α) versus the commensurate differentiation order α.
Figure 3. The parameter estimation errors δ of different algorithms under different data lengths m.
Figure 4. The parameter estimation errors δ of different algorithms under different noise variances σ².
Figure 5. The CD-player arm system.
Figure 6. The inputs and output data of the CD-player arm system.
Figure 7. Output fitting of the training data and validation data.
Figure 8. Output error criterion J(α) versus the commensurate differentiation order α for the CD-player arm system.
Table 1. Comparison of computational complexity among different algorithms.

Method   Addition operations                          Multiplication operations
H-GOLS   Σ_{k=1}^{K} (k²/2 + 2mk - 3K/2 + 3m + 1)     Σ_{k=1}^{K} (k²/2 + 2mk + K/2 + 4m + 3)
         total: Σ_{k=1}^{K} (k² + 4mk - K + 7m + 4)  [9.83 × 10⁵]
BPDN     K_B(n³/3 + n²) + mn² - n² + mn - n           K_B(n³/3 + n²) + mn² + mn
         total: 2K_B(n³/3 + n²) + 2mn² - n² + 2mn - n  [8.09 × 10⁹]
OMP      Σ_{k=1}^{K} (k³ + mk² + k² + 2mk)            Σ_{k=1}^{K} (k³ + mk² - k² + 2mk - 2k)
         total: Σ_{k=1}^{K} (2k³ + 2mk² + 4mk - 2k)  [6.68 × 10⁶]
LASSO    2mnK_L + 4nK_L                               2mnK_L + 2nK_L
         total: 4mnK_L + 6nK_L  [2.41 × 10⁸]
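As a quick consistency check on Table 1, the addition and multiplication counts of a row should sum to its stated total. The sketch below verifies this for the OMP row; K (iterations) and m (data length) are hypothetical example values, not the paper's experimental settings.

```python
# Consistency check on the OMP operation counts in Table 1:
# additions  = sum_{k=1}^{K} (k^3 + m k^2 + k^2 + 2 m k)
# multiplies = sum_{k=1}^{K} (k^3 + m k^2 - k^2 + 2 m k - 2 k)
# total      = sum_{k=1}^{K} (2 k^3 + 2 m k^2 + 4 m k - 2 k)

def omp_total(K, m):
    adds = sum(k**3 + m*k**2 + k**2 + 2*m*k for k in range(1, K + 1))
    mults = sum(k**3 + m*k**2 - k**2 + 2*m*k - 2*k for k in range(1, K + 1))
    total = sum(2*k**3 + 2*m*k**2 + 4*m*k - 2*k for k in range(1, K + 1))
    assert adds + mults == total  # totals row is term-by-term consistent
    return total

print(omp_total(20, 1000))  # -> 6667780
```

With K = 20 and m = 1000 the count is on the order of 10⁶, the same order of magnitude as the bracketed value in the OMP row.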
Table 2. Comparison of the accuracy of the models established by different algorithms.

Method              RMSE                  MAE                   R²
                    Training  Validation  Training  Validation  Training  Validation
H-GOLS-Fractional   0.1505    0.1494      0.1193    0.1169      0.9383    0.9542
BPDN                0.1701    0.1798      0.1343    0.1411      0.9202    0.9317
OMP                 0.2539    0.2432      0.1873    0.1998      0.8489    0.8074
LASSO               0.1625    0.1600      0.1271    0.1286      0.9272    0.9459
H-GOLS-Integral     0.1560    0.1614      0.1255    0.1289      0.9234    0.9477
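The accuracy measures in Table 2 follow their standard definitions; the minimal sketch below computes RMSE, MAE, and R² on a small hypothetical example (y and y_hat are illustrative values, not the CD-player measurements).

```python
import math

def rmse(y, y_hat):
    """Root-mean-square error between measured y and model output y_hat."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y, y_hat)) / len(y))

def mae(y, y_hat):
    """Mean absolute error."""
    return sum(abs(a - b) for a, b in zip(y, y_hat)) / len(y)

def r2(y, y_hat):
    """Coefficient of determination: 1 - SS_res / SS_tot."""
    mean_y = sum(y) / len(y)
    ss_res = sum((a - b) ** 2 for a, b in zip(y, y_hat))
    ss_tot = sum((a - mean_y) ** 2 for a in y)
    return 1 - ss_res / ss_tot

y = [1.0, 2.0, 3.0, 4.0]        # illustrative measured outputs
y_hat = [1.1, 1.9, 3.2, 3.8]    # illustrative model predictions
print(rmse(y, y_hat), mae(y, y_hat), r2(y, y_hat))
```

Lower RMSE and MAE and higher R² indicate a better fit, which is how the H-GOLS-Fractional row in Table 2 compares against the other algorithms.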
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Yin, X.; Liu, Y. A New Orthogonal Least Squares Identification Method for a Class of Fractional Hammerstein Models. Algorithms 2025, 18, 201. https://doi.org/10.3390/a18040201
