Abstract
In this paper, we develop momentum-based adaptive update laws for parameter identification and control to improve parameter estimation error convergence and control system performance for uncertain dynamical systems. Specifically, we introduce three novel continuous-time, momentum-based adaptive estimation and control algorithms and evaluate their effectiveness via several numerical examples. Our proposed adaptive architectures show faster parameter convergence rates as compared to the classical gradient descent and model reference adaptive control methods.
1. Introduction
One of the fundamental problems in feedback control design is addressing discrepancies between system models and the real-world systems they represent. To this end, adaptive control has been developed to address the problem of system uncertainty in control-system design [1,2,3,4]. In particular, adaptive control is based on constant, linearly parameterized system uncertainty models of a known structure but unknown variation and, in the case of indirect adaptive control, combines an online parameter estimation algorithm with a control law to improve performance in the face of system uncertainties. More specifically, indirect adaptive controllers utilize parameter update laws to identify unknown system parameters and adjust feedback gains to account for system uncertainty.
The parameter estimation algorithms that have typically been used in adaptive control are predicated on two gradient descent-type methods that update the parameter estimates in the direction that minimizes a prediction error for a specified cost function. Namely, for a gradient descent flow algorithm, the cost function involves an instantaneous prediction error between the estimated system output and the actual system output based on the most recent data, whereas for an integral gradient descent algorithm the cost function captures an aggregate of the past prediction errors in an integral form while placing more emphasis on recent measurements [2].
Additionally, the recursive least squares (RLS) algorithm [2] has also been used for parameter estimation and has been shown to give superior convergence rates as compared to the classical gradient descent algorithms. RLS algorithms can be viewed as gradient descent methods with a time-varying learning rate [5]. For static (i.e., time-invariant) cost functions, momentum-based methods, also known as higher-order tuners, such as the Nesterov method, have been shown to achieve faster parameter error convergence as compared to traditional gradient descent algorithms [6]. Since faster convergence can lead to improved transient system performance, there has been significant interest in extending momentum-based methods to time-varying cost functions, which naturally appear in adaptive control formulations [7,8,9].
Specifically, in [7] momentum-based architectures were merged with the standard gradient descent method using a variational approach to guarantee boundedness for the parameter estimation and model reference adaptive control problems involving time-varying regressors. Higher-order tuners were also shown to provide an additional degree of freedom in the design of adaptive control laws [10,11]. Given that integral gradient and recursive least squares algorithms provide superior noise rejection and improved system performance as compared to standard gradient descent methods, there has been recent interest in integrating momentum-based architectures into adaptive laws for parameter identification and control [5,7].
In this paper, we develop new continuous-time, momentum-based adaptive laws for identification and control by augmenting higher-order tuners into the integral gradient, recursive least squares, and composite gradient algorithms. Specifically, in Section 2, we first review the existing gradient-based and momentum-based update laws and introduce three new higher-order tuner architectures. Next, in Section 3, we show how these update laws can be applied to parameter identification, and then, in Section 4, we introduce a momentum-based recursive least squares (MRLS) algorithm for model reference adaptive control. Finally, Section 5 provides several numerical examples that highlight the improved parameter error convergence rate of the proposed momentum-based update laws.
The notation used in this paper is standard. Specifically, we write $\mathbb{R}^n$ to denote the set of $n$-dimensional real column vectors, $\mathbb{R}^{n\times m}$ to denote the set of $n\times m$ real matrices, and $(\cdot)^{\mathrm{T}}$ to denote transpose. The equi-induced 2-norm of a matrix $A\in\mathbb{R}^{n\times m}$ is denoted by $\|A\|_2 \triangleq \sqrt{\lambda_{\max}(A^{\mathrm{T}}A)} = \sigma_{\max}(A)$, where $\lambda_{\max}(\cdot)$ denotes the maximum eigenvalue and $\sigma_{\max}(\cdot)$ denotes the maximum singular value. We write $\lambda_{\min}(\cdot)$ (resp., $\sigma_{\min}(\cdot)$) for the minimum eigenvalue (resp., singular value), $\nabla_x V(x)$ for the gradient of a scalar-valued function $V$ with respect to a vector-valued variable $x$, and $\hat{f}(s)$ for the Laplace transform of $f(t)$, that is, $\hat{f}(s) = \mathcal{L}\{f(t)\}$. Finally, we write $\mathcal{R}_{\mathrm{p}}(s)$ to denote the set of proper rational functions with coefficients in $\mathbb{R}$ (i.e., SISO proper rational transfer functions), $\mathbb{R}[s]$ to denote the set of polynomials with real coefficients, $\mathcal{L}_2$ to denote the space of square-integrable Lebesgue measurable functions on $[0,\infty)$, and $\mathcal{L}_\infty$ to denote the space of bounded Lebesgue measurable functions on $[0,\infty)$.
2. From First-Order to Higher-Order Tuners for Parameter Estimation
Consider the system given by
where, for , is the system output, is a time-varying regressor vector, and is an unknown system parameter vector. Here we assume that is bounded, and hence, can be normalized via the scaling factor so that, with a minor abuse of notation, . Since is unknown, we design an estimator of the form , where , is an estimate of , so that the prediction error between the estimated output , and the actual system output , is given by
where is the parameter estimation error.
For the error system (2), consider the quadratic loss cost function
and note that the gradient of L with respect to the parameter estimate is given by . In this case, the gradient flow algorithm is expressed as
where is a weighting gain parameter. As shown in [7], for all .
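For concreteness, the following minimal sketch simulates a gradient-flow estimator of this form, assuming the standard update in which the parameter estimate moves opposite the gradient of the instantaneous loss; the regressor, "true" parameters, gain, and step size are illustrative choices rather than values used in the paper.

```python
import numpy as np

# Minimal sketch of the gradient flow estimator, assuming the standard form suggested by
# the gradient expression above:
#   e(t) = theta_hat(t)^T phi(t) - y(t),   d(theta_hat)/dt = -gamma * phi(t) * e(t).
# The regressor, "true" parameters, gain, and step size below are illustrative only.
gamma = 10.0
theta_star = np.array([1.0, -2.0])            # unknown parameters (known here only to simulate y)
dt, T = 1e-3, 20.0
theta_hat = np.zeros(2)

for k in range(int(T / dt)):
    t = k * dt
    phi = np.array([np.sin(t), np.cos(2.0 * t)])
    phi = phi / max(1.0, np.linalg.norm(phi))  # normalized regressor, |phi| <= 1
    e = theta_hat @ phi - theta_star @ phi     # prediction error (2)
    theta_hat += dt * (-gamma * phi * e)       # forward-Euler step of the assumed form of (4)

print(np.linalg.norm(theta_hat - theta_star))  # parameter estimation error norm
```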
It is well known that the rate of convergence for the gradient flow algorithm (4) can be slow [6]. However, if the regressor vector satisfies a persistency of excitation condition, then converges to exponentially with a convergence rate proportional to . Although in this case the convergence rate can improve significantly, large values of can result in a stiff initial value problem for (4), necessitating progressively smaller sampling times for implementing (4) on a digital computer. Furthermore, small sampling times can increase the sensitivity of the adaptive law to measurement noise and modeling errors [2]. To address these limitations, momentum-based accelerated algorithms predicated on higher-order tuner laws are introduced in [7] to expedite parameter estimation error convergence.
In [12], an array of accelerated methods for continuous-time systems are derived using a Lagrangian variational formulation. Building on this work, [7] extends the variational framework of [12] to address a time-varying instantaneous quadratic loss function of the form given by (3). Specifically, a Lagrangian of the form
is introduced, where , is a friction coefficient, and , and the functional
where is a finite-time interval, is considered. A necessary condition for minimizing (6) is shown in [7] to satisfy
Note that (7) can be rewritten as
where is an auxiliary parameter and . The stability properties of (8) and (9) are given in [7]. Taking the limiting case as , the gradient flow algorithm (4) can be recovered as a special case of (8) and (9).
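As an illustration of the momentum structure, the following minimal sketch implements a two-state higher-order tuner of the kind described above. The specific form used here, an auxiliary state driven by the gradient and a friction-like filter producing the estimate, is a common one in the higher-order tuner literature and may differ in detail (e.g., in normalization) from (8) and (9); all numerical values are illustrative.

```python
import numpy as np

# Minimal sketch of a two-state, momentum-based (higher-order) tuner, assuming the commonly
# used form in which an auxiliary parameter integrates the gradient and the estimate is a
# friction-filtered version of it:
#   d(vartheta)/dt = -gamma * phi * e,    d(theta_hat)/dt = -beta * (theta_hat - vartheta).
# Normalization terms appearing in (8) and (9) are omitted; as beta grows large, theta_hat
# tracks vartheta and the gradient flow (4) is recovered. Values below are illustrative.
gamma, beta = 10.0, 5.0
theta_star = np.array([1.0, -2.0])
dt, T = 1e-3, 20.0
theta_hat = np.zeros(2)
vartheta = np.zeros(2)                        # auxiliary (momentum) parameter

for k in range(int(T / dt)):
    t = k * dt
    phi = np.array([np.sin(t), np.cos(2.0 * t)])
    phi = phi / max(1.0, np.linalg.norm(phi))
    e = theta_hat @ phi - theta_star @ phi    # prediction error based on the current estimate
    vartheta += dt * (-gamma * phi * e)
    theta_hat += dt * (-beta * (theta_hat - vartheta))

print(np.linalg.norm(theta_hat - theta_star))
```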
Rather than relying solely on the instantaneous system measurement , the loss cost function (3) can be modified to incorporate a weighted sum of past system measurements. Specifically, we can consider loss cost functions in the form of (see [2])
where is a forgetting factor that prevents the degeneration of system information updates in some direction by placing more emphasis on the more recent measurements. In this case, the gradient of the loss cost function (10) is given by
where
Now, setting
we obtain the integral gradient algorithm ([2])
The stability properties of (15)–(17) are addressed in [2] ([Thm. 3.6.7]).
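The integral gradient update can be realized with two auxiliary states that accumulate discounted regressor and output information. The sketch below assumes the standard integral-cost construction of [2], with an information matrix Q and a correlation vector r generated by first-order filters; the exact normalization used in (12), (13), and (15)–(17) may differ, and all numerical values are illustrative.

```python
import numpy as np

# Minimal sketch of an integral gradient estimator, assuming the standard integral-cost
# construction of [2]:
#   dQ/dt = -lam*Q + phi*phi^T,  dr/dt = -lam*r - y*phi,  d(theta_hat)/dt = -gamma*(Q@theta_hat + r),
# so that Q @ theta_hat + r plays the role of the gradient of the discounted cost (10).
gamma, lam = 5.0, 1.0                          # adaptation gain and forgetting factor (illustrative)
theta_star = np.array([1.0, -2.0])
dt, T = 1e-3, 20.0
theta_hat = np.zeros(2)
Q = np.zeros((2, 2))                           # accumulated regressor information
r = np.zeros(2)                                # accumulated output correlation

for k in range(int(T / dt)):
    t = k * dt
    phi = np.array([np.sin(t), np.cos(2.0 * t)])
    phi = phi / max(1.0, np.linalg.norm(phi))
    y = theta_star @ phi
    Q += dt * (-lam * Q + np.outer(phi, phi))
    r += dt * (-lam * r - y * phi)
    theta_hat += dt * (-gamma * (Q @ theta_hat + r))

print(np.linalg.norm(theta_hat - theta_star))
```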
Next, motivated by the variational formulation of [7], we introduce a momentum-based integral gradient algorithm. Specifically, defining the Lagrangian
where , and using the Euler–Lagrange equation, a necessary condition for minimizing (6) yields
where, for , and are given by (12) and (13). Note that (19) can be rewritten as
where is an auxiliary parameter and . Furthermore, (20) and (21) can be rewritten in terms of the parameter estimation error , and the auxiliary parameter estimation error as
The following definition and lemmas are needed for the main results of this section.
Definition 1.
A vector signal is persistently excited (PE) if there exist a positive constant ρ and a finite time such that
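For reference, one standard way of writing this condition (cf. [2]) is given below, where the window length and the excitation level play the roles of the finite time and the positive constant ρ in Definition 1; the exact form of (26) may differ in normalization.

```latex
% One standard statement of persistency of excitation (cf. [2]); T_0 > 0 is the window
% length and \rho > 0 the excitation level referred to in Definition 1.
\int_{t}^{t+T_0} \phi(\tau)\,\phi^{\mathrm{T}}(\tau)\,\mathrm{d}\tau \;\geq\; \rho\, T_0\, I_n
\qquad \text{for all } t \geq 0 .
```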
Lemma 1.
Consider the system (22). Then, the following statements hold:
If is persistently excited, then
Proof.
Lemma 2
([3]). Let . If and , then .
Theorem 1.
Let , and consider the momentum-based integral gradient algorithm (20)–(23). Then, the following statements hold:
, , and .
If is persistently excited, then converges to exponentially as with a convergence rate of , where .
Proof.
To show , consider the positive-definite function and note that
which, since , implies . Thus, and .
Remark 1.
Note that unlike gradient flow and recursive least squares algorithms, the momentum-based integral gradient algorithm (20)–(23) does not require the assumption that in order to guarantee that [7].
Remark 2.
Note that a time-varying forgetting factor can also be employed in (22) and (23). In this case, the PE condition can be replaced with a less restrictive excitation condition where (26) holds for a fixed time t and not for all [13].
Next, we consider the cost function
where, for all , is a nonnegative-definite matrix function called the general forgetting matrix and is a positive-definite matrix. It can be shown that a necessary condition for minimizing (33) gives the recursive least squares algorithm ([14])
Note that setting and , where is positive-definite for all (see [2]), we recover, respectively, the pure recursive least squares and the recursive least squares with exponential forgetting algorithms discussed in [2].
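The following minimal sketch simulates the exponential-forgetting special case, assuming the standard continuous-time RLS equations of [2] in which the parameter update is driven by the prediction error through a time-varying gain matrix governed by a Riccati-type differential equation; all numerical values are illustrative, and practical implementations typically bound or reset the gain matrix.

```python
import numpy as np

# Minimal sketch of continuous-time recursive least squares with exponential forgetting,
# assuming the standard form from [2]:
#   d(theta_hat)/dt = -P * phi * e,    dP/dt = beta_f * P - P * phi * phi^T * P,
# where beta_f >= 0 is the forgetting factor and P acts as a time-varying learning rate.
beta_f = 0.5
theta_star = np.array([1.0, -2.0])
dt, T = 1e-3, 20.0
theta_hat = np.zeros(2)
P = 10.0 * np.eye(2)                                   # initial gain (covariance) matrix

for k in range(int(T / dt)):
    t = k * dt
    phi = np.array([np.sin(t), np.cos(2.0 * t)])
    phi = phi / max(1.0, np.linalg.norm(phi))
    e = theta_hat @ phi - theta_star @ phi             # prediction error
    theta_hat += dt * (-(P @ phi) * e)                 # RLS parameter update
    P += dt * (beta_f * P - P @ np.outer(phi, phi) @ P)  # gain update (Riccati-type)

print(np.linalg.norm(theta_hat - theta_star))
```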
Note that the RLS algorithms (34) and (35) do not involve a gradient flow architecture, and the variational formulation cannot be used to generate a momentum-based recursive least squares (MRLS) algorithm. Despite this fact, higher-order tuner laws can still be incorporated into the RLS architecture to give the MRLS architecture
where and , . Note that (36) and (37) can be rewritten as
Noting that and using the fact that
it follows that
where .
Lemma 3.
Consider the system (42) with . Then, the following statements hold:
If is persistently excited, then
where .
Proof.
Next, to show (ii), note that if is PE, then
and
Thus,
This proves the lemma. □
Theorem 2.
Let , and consider the momentum-based recursive least squares algorithm (36)–(38). Then, given by (2) satisfies . Furthermore, the following statements hold:
If there exists such that (44) holds, then , . If, in addition, , then and .
If and is persistently excited, then the zero solution to (39) and (40) is exponentially stable.
Proof.
Consider the function and note that
Now, by the Cauchy–Schwarz inequality,
and hence,
Next, integrating over yields
and hence, .
To show (i), note that (49) and (44) imply that and . In addition, (37) implies that , and for , it follows from (2) and Lemma 2 that and .
Finally, to show , note that if is persistently excited, then (44) holds by Lemma 1. Now, using the fact that
where is a positive-definite matrix with eigenvalues , it follows from the Schur decomposition that
and hence,
Remark 3.
The constraints on the parameter in Theorems 1 and 2 limit the amount of momentum that can be applied, thereby limiting the potential advantage gained by using a momentum-based algorithm. In the context of static optimization, Lyapunov-like functions are typically constructed as the sum of two terms: one term representing the norm of the parameter estimation error and the other corresponding to the loss cost function [12]. This generalized structure allows for adjustments to the parameter estimate , that may not decrease or but can reduce the loss cost function, as shown in Figure 1.
Figure 1.
Visualization of momentum-based learning for an ill-conditioned problem; is updated in a direction that increases while still decreasing the loss cost function.
In order to incorporate (10) in our Lyapunov function, we introduce the momentum-based composite gradient algorithm
where and are given by (12) and (13). Note that this composite update law includes an integral gradient term as well as a recursive least squares term in (55).
Theorem 3.
Let , and consider the momentum-based composite gradient algorithm (55)–(57). Then, the following statements hold:
If there exists such that (44) holds, then , , and .
If and is persistently excited, then the zero solution to (55)–(57) is exponentially stable.
3. System Parameter Identification
In this section, we address the problem of parameter identification using the first- and higher-order tuner algorithms developed in Section 2. First, the following definition is needed.
Definition 2
([2]). A signal is stationary and sufficiently rich of order n if the following statements hold:
The limit
exists uniformly in .
The support of the spectral measure , of u contains at least n points.
Consider the stable single-input, single-output (SISO) plant, with output and input being the only available signals for measurement, given by
where , , and . Since there are infinitely many triples () yielding the same input–output measurements, we cannot uniquely define the coefficients in (). However, we can determine the parameters corresponding to the coefficients of the stable and strictly proper rational transfer function given by
where, for , is the system output, is the system input, and the coefficients , , and , , where , are unknown system parameters. The goal of the system parameter identification problem is to identify the system parameter vector
containing the unknown coefficients of the plant transfer function that satisfies
where and .
Note that (67) has the form of (1) in the frequency domain; however, in most applications the only signals that are available for measurement are the input , and the output , and not their derivatives. To address this, we filter (67) through a stable filter , where is an arbitrary monic Hurwitz polynomial of degree n, to obtain, for ,
where and . The time signals , and , can be generated by the state equations ([2])
where, for , , , , and
and where .
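As an illustration of this construction, the following minimal sketch generates the filtered regressor and the filtered output for a hypothetical stable third-order plant, assuming the standard controllable-canonical realization of the filter described in [2], and numerically verifies the resulting linear parametric relation; the plant coefficients, the filter polynomial, and the input signal are illustrative and are not those used in Section 5.

```python
import numpy as np

# Minimal sketch of the filtered parametric model of this section, assuming the standard
# construction of [2]: filtering the plant equation through 1/Lambda(s), with Lambda(s) a
# monic Hurwitz polynomial of degree n, gives z = theta*^T phi with
#   phi = [alpha(s)/Lambda(s) u ; -alpha(s)/Lambda(s) y],  z = (s^n/Lambda(s)) y,
# where alpha(s) = [1, s, ..., s^(n-1)]^T. The plant and all values below are hypothetical.
a_low = np.array([6.0, 11.0, 6.0])     # [a0, a1, a2]: denominator s^3 + 6 s^2 + 11 s + 6
b_low = np.array([1.0, 0.0, 1.0])      # [b0, b1, b2]: numerator s^2 + 1 (no pole-zero cancellation)
c_low = np.array([8.0, 12.0, 6.0])     # [c0, c1, c2]: Lambda(s) = (s + 2)^3
theta_star = np.concatenate([b_low, a_low])   # ideal parameter vector for z = theta*^T phi

def companion(coeffs_low):
    # controllable-canonical A matrix for s^m + c_{m-1} s^{m-1} + ... + c_0
    m = len(coeffs_low)
    A = np.zeros((m, m))
    A[:-1, 1:] = np.eye(m - 1)
    A[-1, :] = -coeffs_low
    return A

A_p, B_p = companion(a_low), np.array([0.0, 0.0, 1.0])   # plant (output y = b_low @ x)
A_f, B_f = companion(c_low), np.array([0.0, 0.0, 1.0])   # filter 1/Lambda(s)

dt, T = 1e-3, 30.0
x = np.zeros(3)        # plant state
w_u = np.zeros(3)      # alpha(s)/Lambda(s) u = [u/Lambda, s u/Lambda, s^2 u/Lambda]
w_y = np.zeros(3)      # alpha(s)/Lambda(s) y
for k in range(int(T / dt)):
    t = k * dt
    u = np.sin(t) + np.sin(2.3 * t) + np.sin(4.1 * t)    # sufficiently rich input (illustrative)
    y = b_low @ x
    x += dt * (A_p @ x + B_p * u)
    w_u += dt * (A_f @ w_u + B_f * u)
    w_y += dt * (A_f @ w_y + B_f * y)

y = b_low @ x
phi = np.concatenate([w_u, -w_y])      # regressor of the linear parametric model
z = y - c_low @ w_y                    # z = (s^3/Lambda(s)) y, since s^3/Lambda = 1 - (c2 s^2 + c1 s + c0)/Lambda
print(float(z), float(theta_star @ phi))   # after the filter transients, z ≈ theta*^T phi
```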
Theorem 4.
Proof.
The proof follows from Theorems 1, 2, and 3 by noting that if is stationary and sufficiently rich of order and G has no pole-zero cancellations, then is PE [2] ([Thm. 5.2.4]). □
Remark 4.
Note that if , then (68) would involve an additional bias error term. As shown in [2], this term converges to the origin exponentially fast, and hence, Theorem 4 remains valid.
4. Momentum-Based Recursive Least Squares and Model Reference Adaptive Control
Recursive least squares type algorithms were first introduced in adaptive control in [16] and extended in [2,17,18]. In this section, we show how the MRLS algorithm proposed in Section 2 can be extended to the problem of model reference adaptive control for systems with relative degree one.
Consider the linear SISO system given by
where , are unknown polynomials, is an unknown constant, is the control system input, and is the system output. As in [18], we make the following assumptions.
is minimum phase and has relative degree one.
The degree of is n.
The sign of is known.
The control objective is to design an appropriate control law , such that all the signals of the closed-loop system are bounded and , tracks the output , of a reference model given by
where , and are known constants, and is the Laplace transform of a bounded piecewise continuous signal .
To address this problem, consider the filter system ([2])
where, for , , ,
, and , where is an arbitrary monic Hurwitz polynomial of degree . Here, we use , and , to form the regressor vector
Note that the existence of a constant parameter vector such that the transfer function of the SISO system (76) with where , , matches the reference model transfer function is guaranteed by the choice of and Assumptions –.
Next, consider the control law
where , , and , and note that the tracking error satisfies
where . Note that (83) has a nonminimal state-space realization given by
where , , , and . Furthermore, note that
where , is a strictly positive real transfer function. In this case, it follows from the Meyer–Kalman–Yakubovich lemma [2] ([Lem. 3.5.4]) that there exist positive-definite matrices and such that
Since for every nonsingular matrix and constant , the realizations and , , ) are equivalent, choosing and we can ensure that is a realization that satisfies (87) and (88) with and is normalized so that .
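To make the closed-loop architecture described above concrete, the following minimal sketch assembles the filters, the regressor, and the control law around a hypothetical minimum-phase, relative-degree-one plant. For simplicity, the controller parameters are adjusted here with the classical gradient update law driven by the tracking error (with the sign of the high-frequency gain assumed known) rather than with the least-squares-based laws (89)–(93) developed below; the regressor ordering, plant, reference model, and all numerical values are illustrative assumptions.

```python
import numpy as np

# Minimal sketch of relative-degree-one MRAC with the standard filter/regressor structure
# and the classical gradient adaptive law theta_dot = -Gamma * e1 * omega * sgn(kp).
# Plant, reference model, filter, gains, and regressor ordering are illustrative.
kp, b0, a1, a0 = 2.0, 3.0, 4.0, 5.0          # plant: y = kp (s + b0) / (s^2 + a1 s + a0) u
am, km = 2.0, 2.0                            # reference model: ym = km / (s + am) r (SPR)
lam = 3.0                                    # filter polynomial Lambda(s) = s + lam
Gamma = 5.0 * np.eye(4)                      # adaptation gain

dt, T = 1e-3, 60.0
xp = np.zeros(2)                             # plant state (controllable canonical form)
ym, w1, w2 = 0.0, 0.0, 0.0                   # reference model output and filter states
theta = np.zeros(4)                          # controller parameter estimate

for k in range(int(T / dt)):
    t = k * dt
    r = 1.0 if (t % 20.0) < 10.0 else -1.0   # square-wave reference command
    yp = kp * (b0 * xp[0] + xp[1])           # plant output
    omega = np.array([w1, w2, yp, r])        # regressor from filter states, output, and command
    u = theta @ omega                        # control law u = theta^T omega
    e1 = yp - ym                             # tracking error

    # adaptation (gradient law, sgn(kp) = +1 here) and closed-loop integration (forward Euler)
    theta += dt * (-Gamma @ omega * e1 * np.sign(kp))
    xp += dt * np.array([xp[1], -a0 * xp[0] - a1 * xp[1] + u])
    ym += dt * (-am * ym + km * r)
    w1 += dt * (-lam * w1 + u)
    w2 += dt * (-lam * w2 + yp)

print(abs(e1))                               # tracking error magnitude at the end of the run
```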
Next, consider the recursive least-squares algorithm given by
where and , , and . Note that for , (89) and (90) recover the recursive least-squares algorithm of [19].
Here, we modify the recursive least-squares algorithm (89) and (90) to construct our MRLS algorithm as
where , , , ,
and . Note that (91) and (92) can be rewritten in terms of the error parameters , and as
Theorem 5.
Proof.
Next, using the Cauchy–Schwarz and Young inequalities we obtain
Now, using (98) along with the triangle inequality , , it follows from (97) that
which shows that . Hence, , , . Next, integrating over yields
and hence, . Similarly, and, since , .
Finally, if , then satisfies
or
Hence, , and thus, . Now, if , then (85) implies that and, by Lemma 2, . □
Remark 5.
Note that unlike many MRAC schemes (e.g., [20]), (91)–(93) does not necessitate a projection operator to guarantee boundedness of the tracking error. However, unlike [19], wherein only a lower bound is necessary for , we require knowledge of a lower and upper bound for .
Remark 6.
It is important to note that there exists an alternative MRAC framework in the literature known as the normalized adaptive laws [2], whose design is not based on the tracking error but rather on a normalized estimation error of a particular parametrization of the plant. For this framework, the momentum-based integral gradient algorithm (20)–(23), the momentum-based recursive least squares algorithm (36)–(38), and the momentum-based composite gradient algorithm (55)–(57) can be used directly for strictly proper plants without a relative degree restriction. In this case, the parametrization of the ideal controller is given by , where satisfies , and .
5. Illustrative Numerical Examples
5.1. System Parameter Identification
Consider the third-order transfer function representing a servo control system for the pitch control of an aircraft given by
where is the pitch angle, is the system input, and , , , , , and are the unknown system plant parameters. Note that (68) gives with and .
Let , , , , , , , and choose . Note that is a stationary and sufficiently rich signal of order five, and hence, Theorem 4 guarantees that the estimated system parameters will converge exponentially to as using both the momentum-based integral gradient algorithm (20)–(23) and the MRLS algorithm (36)–(38).
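As a side illustration of Definition 2 (and not the specific input used in this example, whose values are listed above), a sum of three sinusoids at distinct nonzero frequencies has spectral support at six points, two per frequency, and is therefore stationary and sufficiently rich of order five; a sketch of such a signal, with illustrative amplitudes and frequencies, is given below.

```python
import numpy as np

# Illustrative construction of a stationary input that is sufficiently rich of order five:
# three sinusoids at distinct nonzero frequencies give six spectral points, which exceeds
# the required order. Amplitudes and frequencies are illustrative, not the paper's values.
def u(t):
    return 1.0 * np.sin(0.5 * t) + 0.7 * np.sin(1.7 * t) + 0.5 * np.sin(3.9 * t)

t_grid = np.linspace(0.0, 100.0, 10001)
u_samples = u(t_grid)                     # input trajectory that can drive the identifiers above
print(u_samples.min(), u_samples.max())
```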
First, we compare the performance of the RLS algorithm (34) and (35) with the MRLS algorithm (36)–(38). For this comparison, we set , , , , , , and . Figure 2 shows the system parameter estimate versus time. It can be seen that for both the RLS and the MRLS algorithms the parameter estimate converges to as expected by Theorem 2. Moreover, as seen in Figure 3, the MRLS algorithm provides faster convergence of the system parameters to the true values as compared to the standard RLS algorithm.
Figure 2.
Parameter estimate values versus time using the RLS and MRLS algorithms. In both cases the parameter estimate values converge to the true values with the MRLS algorithm converging faster.
Figure 3.
Norm of the parameter estimate error versus time using the RLS and MRLS algorithms.
Next, we compare the momentum-based composite gradient algorithm (55)–(57) with the composite gradient algorithm given by
Let , , , , , , and . Figure 4 shows the system parameter estimates versus time. It can be seen that for both the composite gradient and the momentum-based composite gradient algorithms the parameter estimate converges to as expected by Theorem 3. However, as seen in Figure 5, our proposed algorithm provides faster convergence of the system parameters to their true values. In particular, the momentum-based composite gradient algorithm settles around a value of at s as compared to 20 s for the composite gradient algorithm. Note that the improvement in the convergence rate due to the addition of momentum is more pronounced in the case of the composite algorithm as compared to the RLS algorithm.
Figure 4.
Parameter estimate values versus time using the composite gradient and the momentum-based composite gradient algorithms. In both cases the parameter estimate values converge to the true values with the momentum-based composite gradient algorithm converging slightly faster.
Figure 5.
Norm of the parameter estimate error versus time using the composite and the momentum-based composite algorithms.
5.2. Momentum-Based Recursive Least Squares and Model Reference Adaptive Control
Here, we consider the short-term dynamics of an aircraft as an example, as shown in Figure 6, where is the angle of attack, is the flight path angle, and is the equilibrium linear axial velocity. The transfer function describing the short-term dynamics of the reduced-order longitudinal state equation of the aircraft with angle of attack output and elevator deflection input is given by [21]
where , , , and , and , , and are the concise partial derivatives (see [21]) of the pitching moment with respect to the elevator angle , the normal velocity w, and the pitch rate q. Moreover, and denote the concise partial derivatives of the normal force on the aircraft with respect to the elevator angle and the normal velocity w. Here, we set , , , and [21].
Figure 6.
Schematic of an aircraft.
The desired system performance is to track the reference output
where is a reference command signal. Next, we use the filter system (78) and (79) with and , and the control law (82) whose state space realization is given by
where . For our simulations, we select the reference signal to be a square wave with a frequency and amplitude 1. Furthermore, the system initial conditions are set to , , , and .
Next, we compare the RLS algorithm (89) and (90) with our proposed MRLS algorithm (91)–(93). For this comparison, we set , , , , , , , and . Note that all the conditions of Theorem 5 are satisfied. Figure 7 shows the system parameters versus time for the MRLS and the RLS algorithms. It can be seen that in both cases the system parameters converge to the true values with the MRLS algorithm providing faster convergence as compared to the standard RLS update law. Figure 8 shows the moving average (MA) of the absolute value of the tracking error with a sliding window of period . Note that the MRLS algorithm provides slightly better tracking accuracy as compared to the RLS algorithm after the first 10 s. Finally, we note that the difference in runtime complexity between the different algorithms addressed in this paper is negligible.
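For completeness, the moving-average error metric shown in Figure 8 can be computed from a logged tracking-error trajectory as follows; the window length, sampling step, and the synthetic error signal in this sketch are illustrative.

```python
import numpy as np

# Sketch of the moving-average tracking-error metric of Figure 8: the mean of |e1| over a
# sliding window of length Tw. The error trajectory below is synthetic and illustrative.
def moving_average_abs_error(e, dt, Tw):
    w = max(1, int(Tw / dt))                         # samples per window
    c = np.concatenate(([0.0], np.cumsum(np.abs(e))))
    return (c[w:] - c[:-w]) / w                      # running mean of |e| over the window

dt = 1e-3
t = np.arange(0.0, 60.0, dt)
e1 = np.sin(2.0 * t) * np.exp(-0.05 * t)             # stand-in for a logged tracking error
ma = moving_average_abs_error(e1, dt, Tw=5.0)
print(ma[0], ma[-1])                                 # the windowed average error decays over time
```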
Figure 7.
Parameter estimate values versus time using the RLS and MRLS algorithms. In both cases the parameter estimate values converge to the true values with the MRLS algorithm converging faster.
Figure 8.
Moving average of the absolute value of the error for the RLS and MRLS algorithms with a sliding window of . The tracking error for the MRAC scheme predicated on the MRLS algorithm decreases faster than that predicated on the RLS algorithm.
6. Conclusions
In this paper, we developed three new momentum-based update laws for online parameter identification and model reference adaptive control. Specifically, we augmented higher-order tuner architectures into the integral gradient, recursive least squares, and composite gradient algorithms to achieve faster error convergence of the system parameters. Several numerical examples were provided to show the efficacy of the proposed approach. Future work will focus on developing new adaptive update laws for identification and control that guarantee finite time and fixed time convergence of the system parameters.
Author Contributions
L.S.: Conceptualization, Formal analysis, Software, Visualization, Writing-original draft. W.M.H.: Conceptualization, Formal analysis, Writing-review and editing, Supervision, Funding acquisition. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported in part by the Air Force Office of Scientific Research under Grant FA9550-20-1-0038.
Data Availability Statement
Data are contained within the article.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Åström, K.J.; Wittenmark, B. Adaptive Control; Dover Publications: Mineola, NY, USA, 2008.
- Ioannou, P.; Sun, J. Robust Adaptive Control; Dover Publications: Garden City, NY, USA, 2012.
- Narendra, K.; Annaswamy, A.M. Stable Adaptive Systems; Prentice-Hall: Englewood Cliffs, NJ, USA, 1989.
- Krstic, M.; Kanellakopoulos, I.; Kokotovic, P. Nonlinear and Adaptive Control Design; Wiley: New York, NY, USA, 1995.
- Cui, Y.; Annaswamy, A.M. Discrete-Time High Order Tuner with A Time-Varying Learning Rate. In Proceedings of the 2023 American Control Conference, San Diego, CA, USA, 31 May–2 June 2023; pp. 2993–2998.
- Nesterov, Y. Introductory Lectures on Convex Optimization; Springer: New York, NY, USA, 2004.
- Gaudio, J.E.; Gibson, T.E.; Annaswamy, A.M.; Bolender, M.A. Provably Correct Learning Algorithms in the Presence of Time-Varying Features Using a Variational Perspective. arXiv 2019, arXiv:1903.04666.
- Boffi, N.M.; Slotine, J.J.E. Implicit Regularization and Momentum Algorithms in Nonlinearly Parameterized Adaptive Control and Prediction. Neural Comput. 2021, 33, 590–673.
- Online accelerated data-driven learning for optimal feedback control of discrete-time partially uncertain systems. Int. J. Adapt. Control Signal Process. 2023, 38, 848–876.
- Costa, R.R. Model-reference adaptive control with high-order parameter tuners. In Proceedings of the 2022 American Control Conference, Atlanta, GA, USA, 8–10 June 2022; pp. 3370–3375.
- Costa, R.R. Least-squares model-reference adaptive control with high-order parameter tuners. Automatica 2024, 163, 111544.
- Wibisono, A.; Wilson, A.C.; Jordan, M.I. A variational perspective on accelerated methods in optimization. Proc. Natl. Acad. Sci. USA 2016, 113, 7351–7358.
- Cho, N.; Shin, H.S.; Kim, Y.; Tsourdos, A. Composite Model Reference Adaptive Control with Parameter Convergence Under Finite Excitation. IEEE Trans. Autom. Control 2018, 63, 811–818.
- Shaferman, V.; Schwegel, M.; Glück, T.; Kugi, A. Continuous-time least-squares forgetting algorithms for indirect adaptive control. Eur. J. Control 2021, 62, 105–112.
- Haddad, W.M.; Chellaboina, V. Nonlinear Dynamical Systems and Control: A Lyapunov-Based Approach; Princeton University Press: Princeton, NJ, USA, 2008.
- Goodwin, G.; Mayne, D. A parameter estimation perspective of continuous time model reference adaptive control. Automatica 1987, 23, 57–70.
- Gaudio, J.E.; Annaswamy, A.M.; Lavretsky, E.; Bolender, M.A. Parameter Estimation in Adaptive Control of Time-Varying Systems Under a Range of Excitation Conditions. IEEE Trans. Autom. Control 2022, 67, 5440–5447.
- Costa, R.R. Lyapunov design of least-squares model-reference adaptive control. IFAC-PapersOnLine 2020, 53, 3797–3802.
- Costa-Gomes, M.A.; Crawford, V.P.; Iriberri, N. Comparing models of strategic thinking in Van Huyck, Battalio, and Beil’s coordination games. J. Eur. Econ. Assoc. 2009, 7, 365–376.
- Naik, S.; Kumar, P.; Ydstie, B. Robust continuous-time adaptive control by parameter projection. IEEE Trans. Autom. Control 1992, 37, 182–197.
- Cook, M. Flight Dynamics Principles; Elsevier: Oxford, UK, 2007.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).