Article

Gauss–Newton–Secant Method for Solving Nonlinear Least Squares Problems under Generalized Lipschitz Conditions

by Ioannis K. Argyros, Stepan Shakhno, Roman Iakymchuk, Halyna Yarmola and Michael I. Argyros

1 Department of Mathematical Sciences, Cameron University, Lawton, OK 73505, USA
2 Department of Theory of Optimal Processes, Ivan Franko National University of Lviv, Universytetska Str. 1, 79000 Lviv, Ukraine
3 PEQUAN, LIP6, Sorbonne Université, 4 Place Jussieu, 75252 Paris, France
4 Fraunhofer ITWM, Fraunhofer-Platz 1, 67663 Kaiserslautern, Germany
5 Department of Computational Mathematics, Ivan Franko National University of Lviv, Universytetska Str. 1, 79000 Lviv, Ukraine
6 Department of Computer Science, University of Oklahoma, Norman, OK 73071, USA
* Author to whom correspondence should be addressed.
Axioms 2021, 10(3), 158; https://doi.org/10.3390/axioms10030158
Submission received: 22 June 2021 / Revised: 16 July 2021 / Accepted: 19 July 2021 / Published: 21 July 2021
(This article belongs to the Special Issue Numerical Analysis and Computational Mathematics)

Abstract

We develop a local convergence analysis of an iterative method for solving nonlinear least squares problems with a decomposition of the operator, under classical and generalized Lipschitz conditions. We consider both the zero- and nonzero-residual cases and determine the corresponding convergence orders. Two types of Lipschitz conditions (center and restricted-region conditions) are used to study the convergence of the method. Moreover, we obtain a larger radius of convergence and tighter error estimates than in previous works. Hence, we extend the applicability of this method under the same computational effort.

1. Introduction

Nonlinear least squares problems often arise when solving overdetermined systems of nonlinear equations, estimating parameters of physical processes from measurement data, constructing nonlinear regression models for engineering problems, etc. The most widely used method for solving nonlinear least squares problems is the Gauss–Newton method [1]. When the derivative cannot be calculated, difference methods are used [2,3].
Some nonlinear functions have a differentiable part and a nondifferentiable part. In this case, a good idea is to use the sum of the derivative of the differentiable part of the operator and a divided difference of the nondifferentiable part instead of the Jacobian [4,5,6]. Numerical studies show that such methods converge faster than Gauss–Newton-type methods or difference methods.
In this paper, we study the local convergence of the Gauss–Newton–Secant method under classical and generalized Lipschitz conditions for the first-order Fréchet derivative and the divided differences.
Let us consider the nonlinear least squares problem:
$$\min_{x \in \mathbb{R}^p} \frac{1}{2}\,(F(x) + G(x))^T (F(x) + G(x)),$$
where the residual function $F + G : \mathbb{R}^p \to \mathbb{R}^m$ ($m \ge p$) is nonlinear in $x$, $F$ is a continuously differentiable function, and $G$ is a continuous function whose differentiability, in general, is not required.
We propose the following modification of the Gauss–Newton method combined with the Secant-type method [4,6] for finding the solution to problem (1):
$$x_{n+1} = x_n - (A_n^T A_n)^{-1} A_n^T (F(x_n) + G(x_n)), \quad n = 0, 1, \ldots,$$
where $A_n = F'(x_n) + G(x_n, x_{n-1})$, $F'(x_n)$ is the Fréchet derivative of $F(x)$, $G(x_n, x_{n-1})$ is a first-order divided difference of the function $G(x)$ [7] at the points $x_n$, $x_{n-1}$, and $x_0$, $x_{-1}$ are given.
Setting $A_n = F'(x_n)$ for solving problem (1), from (2) we obtain an iterative Gauss–Newton-type method:
$$x_{n+1} = x_n - \big(F'(x_n)^T F'(x_n)\big)^{-1} F'(x_n)^T (F(x_n) + G(x_n)), \quad n = 0, 1, \ldots.$$
For $m = p$, problem (1) turns into a system of nonlinear equations:
$$F(x) + G(x) = 0.$$
In this case, method (2) is transformed into the combined Newton–Secant method [8,9,10]:
$$x_{n+1} = x_n - \big(F'(x_n) + G(x_n, x_{n-1})\big)^{-1}(F(x_n) + G(x_n)), \quad n = 0, 1, \ldots,$$
and method (3) into the Newton-type method for solving nonlinear equations [11]:
$$x_{n+1} = x_n - \big(F'(x_n)\big)^{-1}(F(x_n) + G(x_n)), \quad n = 0, 1, \ldots.$$
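To make the iteration concrete, the following minimal Python sketch implements method (2). The function and variable names are ours, and the coordinate-wise (Ulm-type) divided difference used here is only one admissible choice; any matrix satisfying $G(x, y)(x - y) = G(x) - G(y)$ together with the Lipschitz conditions studied below could be substituted.

```python
import numpy as np

def divided_difference(G, x, y):
    """First-order divided difference G(x, y): an m-by-p matrix J such that
    J @ (x - y) = G(x) - G(y).  Coordinate-wise (Ulm-type) construction;
    other admissible choices exist."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    p, m = x.size, np.atleast_1d(G(x)).size
    J = np.zeros((m, p))
    for j in range(p):
        z_hi = np.concatenate((x[:j + 1], y[j + 1:]))   # x in the first j+1 slots
        z_lo = np.concatenate((x[:j], y[j:]))           # x in the first j slots
        h = x[j] - y[j]
        if abs(h) < 1e-14:           # coincident components: fall back to a small step
            h = 1e-8
            z_hi = z_lo.copy()
            z_hi[j] += h
        J[:, j] = (np.atleast_1d(G(z_hi)) - np.atleast_1d(G(z_lo))) / h
    return J

def gauss_newton_secant(F, dF, G, x0, x_minus1, tol=1e-8, max_iter=100):
    """Method (2): x_{n+1} = x_n - (A_n^T A_n)^{-1} A_n^T (F(x_n) + G(x_n)),
    where A_n = F'(x_n) + G(x_n, x_{n-1})."""
    x, x_prev = np.asarray(x0, float), np.asarray(x_minus1, float)
    for _ in range(max_iter):
        A = dF(x) + divided_difference(G, x, x_prev)
        step = np.linalg.solve(A.T @ A, A.T @ (F(x) + G(x)))
        x, x_prev = x - step, x
        if np.linalg.norm(step) <= tol:
            break
    return x
```

The normal-equation solve above is the simplest realization of the step; a QR-based least squares solve of $A_n s = F(x_n) + G(x_n)$ would be a numerically more robust alternative.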
The convergence domain of these methods is, in general, small, and the error estimates are pessimistic; these problems restrict their applicability. The novelty of our work lies in the claim that these problems can be addressed without additional hypotheses. In particular, our idea is to use center and restricted-radius Lipschitz conditions. This approach to studying the convergence of the methods allows us to extend the convergence ball of the method and to improve the error estimates.
The remainder of the paper is organized as follows: Section 2 deals with the local convergence analysis. The numerical experiments appear in Section 3. Section 4 contains the concluding remarks and ideas about future works.

2. Local Convergence Analysis

Let us first consider some auxiliary lemmas needed to obtain the main results. Let $D$ be an open subset of $\mathbb{R}^p$.
Lemma 1
([4]). Let $e(t) = \int_0^t E(u)\,du$, where $E$ is an integrable and positive nondecreasing function on $[0, T]$. Then, $e(t)$ is monotonically increasing with respect to $t$ on $[0, T]$.
Lemma 2
([1,12]). Let $h(t) = \frac{1}{t}\int_0^t H(u)\,du$, where $H$ is an integrable and positive nondecreasing function on $[0, T]$. Then, $h(t)$ is nondecreasing with respect to $t$ on $(0, T]$.
Additionally, $h(t)$ at $t = 0$ is defined as $h(0) = \lim_{t \to 0} \frac{1}{t}\int_0^t H(u)\,du$.
Lemma 3
([13]). Let $s(t) = \frac{1}{t^2}\int_0^t S(u)\,u\,du$, where $S$ is an integrable and positive nondecreasing function on $[0, T]$. Then, $s(t)$ is nondecreasing with respect to $t$ on $(0, T]$.
Definition 1.
The Fréchet derivative $F'$ satisfies the center Lipschitz condition on $D$ with $L_0$ average if
$$\|F'(x) - F'(x^*)\| \le \int_0^{\rho(x)} L_0(u)\,du \quad \text{for each } x \in D \subset \mathbb{R}^p,$$
where $\rho(x) = \|x - x^*\|$, $x^* \in D$ is a solution of problem (1), and $L_0$ is an integrable, positive, and nondecreasing function on $[0, T]$.
The functions $M_0$, $L$, $M$, $L_1$, and $M_1$ introduced next are, like the function $L_0$, integrable, positive, and nondecreasing functions defined on $[0, 2R]$.
Definition 2.
The first-order divided difference $G(x, y)$ satisfies the center Lipschitz condition on $D \times D$ with $M_0$ average if
$$\|G(x, y) - G(x^*, x^*)\| \le \int_0^{\rho(x) + \rho(y)} M_0(u)\,du \quad \text{for each } x, y \in D.$$
Let $B > 0$ and $\alpha > 0$. We define the function $\varphi$ on $[0, +\infty)$ by
$$\varphi(t) = B\left(2\alpha + \int_0^{t} L_0(u)\,du + \int_0^{2t} M_0(u)\,du\right)\left(\int_0^{t} L_0(u)\,du + \int_0^{2t} M_0(u)\,du\right).$$
Suppose that the equation
$$\varphi(t) = 1$$
has at least one positive solution. Denote by $\gamma$ the minimal such solution. Then, we can define $\Omega_0 = D \cap \Omega(x^*, \gamma)$, where $\Omega(x^*, \gamma) = \{x : \|x - x^*\| < \gamma\}$.
Definition 3.
The Fréchet derivative $F'$ satisfies the restricted radius Lipschitz condition on $\Omega_0$ with $L$ average if
$$\|F'(x) - F'(x^\tau)\| \le \int_{\tau\rho(x)}^{\rho(x)} L(u)\,du, \qquad x^\tau = x^* + \tau(x - x^*), \quad 0 \le \tau \le 1, \quad \text{for each } x \in \Omega_0.$$
Definition 4.
The first-order divided difference $G(x, y)$ satisfies the restricted radius Lipschitz condition on $\Omega_0$ with $M$ average if
$$\|G(x, y) - G(u, v)\| \le \int_0^{\|x - u\| + \|y - v\|} M(t)\,dt \quad \text{for each } x, y, u, v \in \Omega_0.$$
Definition 5.
The Fréchet derivative $F'$ satisfies the radius Lipschitz condition on $D$ with $L_1$ average if
$$\|F'(x) - F'(x^\tau)\| \le \int_{\tau\rho(x)}^{\rho(x)} L_1(u)\,du, \qquad x^\tau = x^* + \tau(x - x^*), \quad 0 \le \tau \le 1, \quad \text{for each } x \in D.$$
Definition 6.
The first-order divided difference $G(x, y)$ satisfies the radius Lipschitz condition on $D$ with $M_1$ average if
$$\|G(x, y) - G(u, v)\| \le \int_0^{\|x - u\| + \|y - v\|} M_1(t)\,dt \quad \text{for each } x, y, u, v \in D.$$
Remark 1.
It follows from the preceding definitions that $L = L(L_0, M_0)$, $M = M(L_0, M_0)$, and, for each $t \in [0, \gamma]$,
$$L_0(t) \le L_1(t),$$
$$L(t) \le L_1(t),$$
$$M(t) \le M_1(t),$$
since $\Omega_0 \subseteq D$. By $L(L_0, M_0)$, we mean that $L$ (or $M$) depends on $L_0$ and $M_0$ through the definition of $\Omega_0$. In case any of (15)–(17) are strict inequalities, the following benefits are obtained over the work in [4], which uses $L_1$, $M_1$ instead of the new functions:
(a1) 
An at least as large convergence region leading to at least as many initial choices;
(a2) 
At least as tight upper bounds on the distances $\|x_n - x^*\|$, so at least as few iterations are needed to obtain a desired error tolerance.
These benefits are obtained under the same computational effort as in [4], since the new functions $L_0$, $M_0$, $L$, and $M$ are special cases of the functions $L_1$ and $M_1$. This technique of using the center Lipschitz condition in combination with a restricted convergence region has been used by us for Newton's method, the Secant method, and Newton-like methods [14,15], and can be applied to other methods as well, with the same benefits.
The proof of the next result follows along the lines of the corresponding one in [4], but with crucial differences: we use $(L_0, L)$ instead of $L_1$ and $(M_0, M)$ instead of $M_1$, which were used in [4].
We use the Euclidean norm. Note that, for the Euclidean norm, the equality $\|A - B\| = \|A^T - B^T\|$ holds for $A, B \in \mathbb{R}^{m \times p}$.
Theorem 1.
Let $F + G : \mathbb{R}^p \to \mathbb{R}^m$ be continuous on an open convex subset $D \subset \mathbb{R}^p$, let $F$ be continuously differentiable, and let $G$ be a continuous function. Suppose that problem (1) has a solution $x^* \in D$; that the inverse operator
$$(A^{*T}A^*)^{-1} = \big[(F'(x^*) + G(x^*, x^*))^T (F'(x^*) + G(x^*, x^*))\big]^{-1}$$
exists, with $\|(A^{*T}A^*)^{-1}\| \le B$; that conditions (7), (8), (11), and (12) hold; and that $\gamma$ given in (10) exists.
Furthermore, suppose that
$$\|F(x^*) + G(x^*)\| \le \eta, \qquad \|F'(x^*) + G(x^*, x^*)\| \le \alpha;$$
$$\frac{B}{R}\left(\int_0^{R} L_0(u)\,du + \int_0^{2R} M_0(u)\,du\right)\eta < 1;$$
and $\Omega = \Omega(x^*, r^*) \subseteq D$, where $r^*$ is the unique positive zero of the function $q$ given by
$$q(r) = B\Bigg[\left(\alpha + \int_0^{r} L_0(u)\,du + \int_0^{2r} M_0(u)\,du\right)\left(\frac{1}{r}\int_0^{r} L(u)\,u\,du + \int_0^{r} M(u)\,du\right) + \left(2\alpha + \int_0^{r} L_0(u)\,du + \int_0^{2r} M_0(u)\,du\right)\left(\int_0^{r} L_0(u)\,du + \int_0^{2r} M_0(u)\,du\right) + \left(\frac{1}{r}\int_0^{r} L_0(u)\,du + \frac{1}{r}\int_0^{2r} M_0(u)\,du\right)\eta\Bigg] - 1.$$
Then, for $x_0, x_{-1} \in \Omega$, the iterative sequence $\{x_n\}$, $n = 0, 1, \ldots$, generated by (2), is well defined, remains in $\Omega$, and converges to $x^*$. Moreover, the following error estimate holds for each $n = 0, 1, 2, \ldots$:
$$\|x_{n+1} - x^*\| \le C_1\,\|x_{n-1} - x^*\| + C_2\,\|x_n - x^*\| + C_3\,\|x_{n-1} - x^*\|\,\|x_n - x^*\| + C_4\,\|x_n - x^*\|^2,$$
where
$$g(r^*) = \frac{B}{1 - \varphi(r^*)}; \qquad C_1 = g(r^*)\,\frac{1}{2r^*}\int_0^{2r^*} M_0(u)\,du\;\eta;$$
$$C_2 = g(r^*)\left(\frac{1}{r^*}\int_0^{r^*} L_0(u)\,du + \frac{1}{2r^*}\int_0^{2r^*} M_0(u)\,du\right)\eta;$$
$$C_3 = g(r^*)\left(\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\right)\frac{1}{r^*}\int_0^{r^*} M(u)\,du;$$
$$C_4 = g(r^*)\left(\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\right)\frac{1}{r^{*2}}\int_0^{r^*} L(u)\,u\,du.$$
Proof. 
We obtain
$$\lim_{r \to 0^+}\frac{1}{r}\int_0^{r} L_0(u)\,du \le \lim_{r \to 0^+}\frac{L_0(r)\,r}{r} = L_0(0),$$
$$\lim_{r \to 0^+}\frac{1}{r}\int_0^{2r} M_0(u)\,du \le \lim_{r \to 0^+}\frac{M_0(2r)\,2r}{r} = 2M_0(0),$$
since $L_0$ and $M_0$ are positive and nondecreasing functions on $[0, R]$ and $[0, 2R]$, respectively. Taking into account Lemma 1, for a sufficiently small $\eta$ we have $q(0) = B(L_0(0) + 2M_0(0))\eta - 1 < 0$. For a sufficiently large $R$, the inequality $q(R) > 0$ holds. By the intermediate value theorem, the function $q$ has a positive zero on $(0, R)$, denoted by $r^*$. Moreover, this zero is the only one on $(0, R)$. Indeed, according to Lemma 2, the function $\left(\frac{1}{r}\int_0^{r} L_0(u)\,du + \frac{1}{r}\int_0^{2r} M_0(u)\,du\right)\eta$ is nondecreasing with respect to $r$ on $(0, R]$. By Lemma 1, the functions $\int_0^{r} L_0(u)\,du$, $\int_0^{r} M(u)\,du$, and $\int_0^{2r} M_0(u)\,du$ are monotonically increasing on $[0, R]$. Furthermore, by Lemma 3, the function $\int_0^{r} L(u)\,u\,du = r^2 \cdot \frac{1}{r^2}\int_0^{r} L(u)\,u\,du$ is monotonically increasing with respect to $r$ on $(0, R]$. Therefore, $q(r)$ is monotonically increasing on $(0, R]$. Thus, the graph of the function $q(r)$ crosses the positive $r$-axis only once on $(0, R)$. Finally, from the monotonicity of $q$ and since $q(\gamma) > 0$, we obtain $r^* < \gamma$, so $\Omega(x^*, r^*) \subset \Omega_0$.
We denote $A_n = F'(x_n) + G(x_n, x_{n-1})$. Let $n = 0$. By the assumption $x_0, x_{-1} \in \Omega$, we obtain the following estimate:
$$\|I - (A^{*T}A^*)^{-1}A_0^T A_0\| = \|(A^{*T}A^*)^{-1}(A^{*T}A^* - A_0^T A_0)\| = \big\|(A^{*T}A^*)^{-1}\big[A^{*T}(A^* - A_0) + (A^{*T} - A_0^T)(A_0 - A^*) + (A^{*T} - A_0^T)A^*\big]\big\| \le \|(A^{*T}A^*)^{-1}\|\,\big[\|A^{*T}\|\,\|A^* - A_0\| + \|A^{*T} - A_0^T\|\,\|A_0 - A^*\| + \|A^{*T} - A_0^T\|\,\|A^*\|\big] \le B\big[\alpha\,\|A^* - A_0\| + \|A^{*T} - A_0^T\|\,\|A_0 - A^*\| + \alpha\,\|A^{*T} - A_0^T\|\big].$$
Using conditions (11) and (12), we obtain
$$\|A_0 - A^*\| = \|(F'(x_0) + G(x_0, x_{-1})) - (F'(x^*) + G(x^*, x^*))\| \le \|F'(x_0) - F'(x^*)\| + \|G(x_0, x_{-1}) - G(x^*, x^*)\| \le \int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0 + \rho_{-1}} M_0(u)\,du,$$
where $\rho_k = \rho(x_k)$. Then, from inequality (29), the equation $q(r^*) = 0$, and (10), we obtain
$$\|I - (A^{*T}A^*)^{-1}A_0^T A_0\| \le B\left(2\alpha + \int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0 + \rho_{-1}} M_0(u)\,du\right)\left(\int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0 + \rho_{-1}} M_0(u)\,du\right) \le B\left(2\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\right)\left(\int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\right) < 1.$$
Next, from (29)–(31) and the Banach lemma [16], it follows that $(A_0^T A_0)^{-1}$ exists, and
$$\|(A_0^T A_0)^{-1}\| \le g_0 = B\left[1 - B\left(2\alpha + \int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0+\rho_{-1}} M_0(u)\,du\right)\left(\int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0+\rho_{-1}} M_0(u)\,du\right)\right]^{-1} \le g(r^*) = B\left[1 - B\left(2\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\right)\left(\int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\right)\right]^{-1}.$$
Hence, $x_1$ is correctly defined. Next, we show that $x_1 \in \Omega(x^*, r^*)$.
Using the fact that
$$A^{*T}(F(x^*) + G(x^*)) = (F'(x^*) + G(x^*, x^*))^T (F(x^*) + G(x^*)) = 0,$$
$x_0, x_{-1} \in \Omega(x^*, r^*)$, and the choice of $r^*$, we obtain the estimate
$$\|x_1 - x^*\| = \big\|x_0 - x^* - (A_0^T A_0)^{-1}\big[A_0^T(F(x_0) + G(x_0)) - A^{*T}(F(x^*) + G(x^*))\big]\big\| \le \|(A_0^T A_0)^{-1}\|\,\Big[\|A_0^T\|\,\Big\|A_0 - \int_0^1 F'(x^* + t(x_0 - x^*))\,dt - G(x_0, x^*)\Big\|\,\|x_0 - x^*\| + \|A_0^T - A^{*T}\|\,\|F(x^*) + G(x^*)\|\Big].$$
So, considering the inequalities
$$\Big\|A_0 - \int_0^1 F'(x^* + t(x_0 - x^*))\,dt - G(x_0, x^*)\Big\| = \Big\|F'(x_0) - \int_0^1 F'(x^* + t(x_0 - x^*))\,dt + G(x_0, x_{-1}) - G(x_0, x^*)\Big\| \le \int_0^1 \big\|F'(x_0) - F'(x_0^t)\big\|\,dt + \|G(x_0, x_{-1}) - G(x_0, x^*)\| \le \int_0^1\!\!\int_{t\rho_0}^{\rho_0} L(u)\,du\,dt + \int_0^{\rho_{-1}} M(u)\,du = \frac{1}{\rho_0}\int_0^{\rho_0} L(u)\,u\,du + \int_0^{\rho_{-1}} M(u)\,du \le \frac{1}{r^{*2}}\int_0^{r^*} L(u)\,u\,du\;\rho_0 + \frac{1}{r^*}\int_0^{r^*} M(u)\,du\;\rho_{-1},$$
$$\|A_0\| \le \|A^*\| + \|A_0 - A^*\| \le \alpha + \int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0+\rho_{-1}} M_0(u)\,du,$$
we obtain
$$\|x_1 - x^*\| \le g_0\Big\{\Big(\alpha + \int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0+\rho_{-1}} M_0(u)\,du\Big)\Big(\frac{1}{\rho_0}\int_0^{\rho_0} L(u)\,u\,du + \int_0^{\rho_{-1}} M(u)\,du\Big)\|x_0 - x^*\| + \eta\Big(\int_0^{\rho_0} L_0(u)\,du + \int_0^{\rho_0+\rho_{-1}} M_0(u)\,du\Big)\Big\} \le g(r^*)\Big\{\Big(\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\Big)\Big(\frac{1}{r^{*2}}\int_0^{r^*} L(u)\,u\,du\;\rho_0^2 + \frac{1}{r^*}\int_0^{r^*} M(u)\,du\;\rho_{-1}\,\|x_0 - x^*\|\Big) + \eta\Big(\frac{1}{r^*}\int_0^{r^*} L_0(u)\,du\;\rho_0 + \frac{1}{2r^*}\int_0^{2r^*} M_0(u)\,du\,(\rho_0 + \rho_{-1})\Big)\Big\} < g(r^*)\Big\{\Big(\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\Big)\Big(\frac{1}{r^*}\int_0^{r^*} L(u)\,u\,du + \int_0^{r^*} M(u)\,du\Big) + \frac{1}{r^*}\Big(\int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\Big)\eta\Big\}\,r^* = p(r^*)\,r^* = r^*,$$
where
$$p(r) = g(r)\Big\{\Big(\alpha + \int_0^{r} L_0(u)\,du + \int_0^{2r} M_0(u)\,du\Big)\Big(\frac{1}{r}\int_0^{r} L(u)\,u\,du + \int_0^{r} M(u)\,du\Big) + \frac{1}{r}\Big(\int_0^{r} L_0(u)\,du + \int_0^{2r} M_0(u)\,du\Big)\eta\Big\}.$$
Therefore, $x_1 \in \Omega(x^*, r^*)$, and estimate (22) holds for $n = 0$.
Let us assume that $x_n \in \Omega(x^*, r^*)$ for $n = 0, 1, \ldots, k$ and that estimate (22) holds for $n = 0, 1, \ldots, k-1$, where $k \ge 1$ is an integer. We shall show that $x_{k+1} \in \Omega$ and that estimate (22) holds for $n = k$.
We can write
$$\|I - (A^{*T}A^*)^{-1}A_k^T A_k\| = \|(A^{*T}A^*)^{-1}(A^{*T}A^* - A_k^T A_k)\| = \big\|(A^{*T}A^*)^{-1}\big[A^{*T}(A^* - A_k) + (A^{*T} - A_k^T)(A_k - A^*) + (A^{*T} - A_k^T)A^*\big]\big\| \le B\big[\alpha\,\|A^* - A_k\| + \|A^{*T} - A_k^T\|\,\|A_k - A^*\| + \alpha\,\|A^{*T} - A_k^T\|\big] \le B\Big(2\alpha + \int_0^{\rho_k} L_0(u)\,du + \int_0^{\rho_k+\rho_{k-1}} M_0(u)\,du\Big)\Big(\int_0^{\rho_k} L_0(u)\,du + \int_0^{\rho_k+\rho_{k-1}} M_0(u)\,du\Big) \le B\Big(2\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\Big)\Big(\int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\Big) < 1.$$
Consequently, $(A_k^T A_k)^{-1}$ exists, and
$$\|(A_k^T A_k)^{-1}\| \le g_k = B\Big[1 - B\Big(2\alpha + \int_0^{\rho_k} L_0(u)\,du + \int_0^{\rho_k+\rho_{k-1}} M_0(u)\,du\Big)\Big(\int_0^{\rho_k} L_0(u)\,du + \int_0^{\rho_k+\rho_{k-1}} M_0(u)\,du\Big)\Big]^{-1} \le g(r^*).$$
Therefore, $x_{k+1}$ is correctly defined, and the following estimate holds:
$$\|x_{k+1} - x^*\| = \big\|x_k - x^* - (A_k^T A_k)^{-1}\big[A_k^T(F(x_k) + G(x_k)) - A^{*T}(F(x^*) + G(x^*))\big]\big\| \le \|(A_k^T A_k)^{-1}\|\,\Big[\|A_k^T\|\,\Big\|A_k - \int_0^1 F'(x^* + t(x_k - x^*))\,dt - G(x_k, x^*)\Big\|\,\|x_k - x^*\| + \|A_k^T - A^{*T}\|\,\|F(x^*) + G(x^*)\|\Big] \le g_k\Big\{\Big(\alpha + \int_0^{\rho_k} L_0(u)\,du + \int_0^{\rho_k+\rho_{k-1}} M_0(u)\,du\Big)\Big(\frac{1}{\rho_k}\int_0^{\rho_k} L(u)\,u\,du + \int_0^{\rho_{k-1}} M(u)\,du\Big)\|x_k - x^*\| + \eta\Big(\int_0^{\rho_k} L_0(u)\,du + \int_0^{\rho_k+\rho_{k-1}} M_0(u)\,du\Big)\Big\} \le g(r^*)\Big\{\Big(\alpha + \int_0^{r^*} L_0(u)\,du + \int_0^{2r^*} M_0(u)\,du\Big)\Big(\frac{1}{r^{*2}}\int_0^{r^*} L(u)\,u\,du\;\rho_k^2 + \frac{1}{r^*}\int_0^{r^*} M(u)\,du\;\rho_{k-1}\,\|x_k - x^*\|\Big) + \eta\Big(\frac{1}{r^*}\int_0^{r^*} L_0(u)\,du\;\rho_k + \frac{1}{2r^*}\int_0^{2r^*} M_0(u)\,du\,(\rho_k + \rho_{k-1})\Big)\Big\} < p(r^*)\,r^* = r^*.$$
This proves that $x_{k+1} \in \Omega(x^*, r^*)$ and that estimate (22) holds for $n = k$.
Thus, by induction, method (2) is correctly defined, $x_n \in \Omega(x^*, r^*)$, and estimate (22) holds for each $n = 0, 1, 2, \ldots$.
It remains to prove that $x_n \to x^*$ as $n \to \infty$.
Let us define the functions $a$ and $b$ on $[0, r^*]$ as
$$a(r) = g(r)\Big\{\Big(\alpha + \int_0^{r} L_0(u)\,du + \int_0^{2r} M_0(u)\,du\Big)\Big(\frac{1}{r}\int_0^{r} L(u)\,u\,du + \int_0^{r} M(u)\,du\Big) + \Big(\frac{1}{r}\int_0^{r} L_0(u)\,du + \frac{1}{2r}\int_0^{2r} M_0(u)\,du\Big)\eta\Big\};$$
$$b(r) = g(r)\,\frac{1}{2r}\int_0^{2r} M_0(u)\,du\;\eta.$$
According to the choice of $r^*$, we obtain
$$a(r^*) \ge 0, \qquad b(r^*) \ge 0, \qquad a(r^*) + b(r^*) = 1.$$
Using estimate (22), the definitions of the functions $a$, $b$ and of the constants $C_i$ ($i = 1, 2, 3, 4$), we have
$$\|x_{n+1} - x^*\| \le C_1\,\|x_{n-1} - x^*\| + (C_2 + C_3 r^* + C_4 r^*)\,\|x_n - x^*\| = a(r^*)\,\|x_n - x^*\| + b(r^*)\,\|x_{n-1} - x^*\|.$$
According to the proof in [17], under conditions (42)–(45), the sequence $\{x_n\}$ converges to $x^*$ as $n \to \infty$. □
Corollary 1
([4]). The convergence order of method (2) for problem (1) with zero residual is equal to $\frac{1+\sqrt{5}}{2}$.
If $\eta = 0$, we have a nonlinear least squares problem with zero residual. Then, the constants $C_1 = 0$ and $C_2 = 0$, and estimate (22) takes the form
$$\|x_{n+1} - x^*\| \le C_3\,\|x_{n-1} - x^*\|\,\|x_n - x^*\| + C_4\,\|x_n - x^*\|^2.$$
Since $\|x_n - x^*\| \le \|x_{n-1} - x^*\|$ for all sufficiently large $n$, this inequality can be written as
$$\|x_{n+1} - x^*\| \le (C_3 + C_4)\,\|x_{n-1} - x^*\|\,\|x_n - x^*\|.$$
Then, we can write the equation for determining the convergence order as follows:
$$t^2 - t - 1 = 0.$$
Therefore, the positive root $t^* = \frac{1+\sqrt{5}}{2}$ of the latter equation is the order of convergence of method (2).
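The passage from the error recurrence to this characteristic equation is the standard one; sketched briefly for completeness (our own elaboration): writing $e_n = \|x_n - x^*\|$ and assuming $e_{n+1} \approx c\,e_n^{t}$ for some order $t > 1$ and a constant $c > 0$,
$$c\,e_n^{\,t} \;\lesssim\; (C_3 + C_4)\,e_{n-1}\,e_n \quad\text{with}\quad e_n \approx c\,e_{n-1}^{\,t} \;\;\Longrightarrow\;\; e_{n-1}^{\,t^2} \;\sim\; e_{n-1}^{\,t+1},$$
so equating the exponents yields $t^2 = t + 1$, i.e., $t^2 - t - 1 = 0$, as stated above.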
In case G ( x ) 0 in (1), we obtain the following consequences.
Corollary 2
([4]). The convergence order of method (2) for problem (1) with zero residual is quadratic.
Indeed, if $G(x) \equiv 0$, then $C_3 = 0$, and estimate (22) takes the form
$$\|x_{n+1} - x^*\| \le C_4\,\|x_n - x^*\|^2,$$
which indicates the quadratic convergence rate of method (2).
Remark 2.
If $L_0 = L = L_1$ and $M_0 = M = M_1$, our results specialize to the corresponding ones in [4]. Otherwise, they constitute an improvement, as already noted in Remark 1. As an example, let $q_1$, $g_1$, $C_1^1$, $C_2^1$, $C_3^1$, $C_4^1$, $r_1^*$ denote the functions and parameters obtained when $L_0$, $L$, $M_0$, $M$ are replaced by $L_1$, $L_1$, $M_1$, $M_1$, respectively. Then, in view of (15)–(17), we have
$$q(r) \le q_1(r),$$
$$g(r) \le g_1(r),$$
$$C_1 \le C_1^1,$$
$$C_2 \le C_2^1,$$
$$C_3 \le C_3^1,$$
and
$$C_4 \le C_4^1.$$
Hence, we have
$$r_1^* \le r^*,$$
the new error bounds (22) being tighter than the corresponding bounds (6) in [4], and the rest of the advantages (already mentioned in Remark 1) holding true.
Next, we study the convergence of method (2) if L 0 , L , M 0 , M are constants, as a consequence of Theorem 1.
Corollary 3.
Let $F + G : \mathbb{R}^p \to \mathbb{R}^m$ be continuous on an open convex subset $D \subset \mathbb{R}^p$, let $F$ be continuously differentiable, and let $G$ be a continuous function on $D$. Suppose that problem (1) has a solution $x^* \in D$ and that the inverse operator
$$(A^{*T}A^*)^{-1} = \big[(F'(x^*) + G(x^*, x^*))^T (F'(x^*) + G(x^*, x^*))\big]^{-1}$$
exists, with $\|(A^{*T}A^*)^{-1}\| \le B$.
Suppose that the Fréchet derivative $F'$ satisfies the classical Lipschitz conditions
$$\|F'(x) - F'(x^*)\| \le L_0\,\|x - x^*\| \quad \text{for each } x \in D,$$
$$\|F'(x) - F'(y)\| \le L\,\|x - y\| \quad \text{for each } x, y \in \Omega_0,$$
and that the function $G$ has a first-order divided difference $G(x, y)$ satisfying
$$\|G(x, y) - G(x^*, x^*)\| \le M_0\,(\|x - x^*\| + \|y - x^*\|) \quad \text{for each } x, y \in D,$$
$$\|G(x, y) - G(u, v)\| \le M\,(\|x - u\| + \|y - v\|) \quad \text{for each } x, y, u, v \in \Omega_0,$$
where $\Omega_0 = D \cap \Omega\!\left(x^*, \dfrac{\sqrt{B^2\alpha^2 + B} - B\alpha}{B(L_0 + 2M_0)}\right)$.
Furthermore, suppose that
$$\|F(x^*) + G(x^*)\| \le \eta, \qquad \|F'(x^*) + G(x^*, x^*)\| \le \alpha, \qquad B(L_0 + 2M_0)\eta < 1,$$
and $\Omega = \Omega(x^*, r^*) \subseteq D$, where
$$r^* = \frac{4(1 - BT_0\eta)}{B\alpha(4T_0 + T) + \sqrt{B^2\alpha^2(4T_0 + T)^2 + 8BT_0(2T_0 + T)(1 - BT_0\eta)}},$$
$T_0 = L_0 + 2M_0$, $T = L + 2M$. Then, for each $x_0, x_{-1} \in \Omega$, the iterative sequence $\{x_n\}$, $n = 0, 1, \ldots$, generated by (2) is well defined, remains in $\Omega$, and converges to $x^*$, and the following error estimate holds for each $n = 0, 1, 2, \ldots$:
$$\|x_{n+1} - x^*\| \le C_1\,\|x_{n-1} - x^*\| + C_2\,\|x_n - x^*\| + C_3\,\|x_{n-1} - x^*\|\,\|x_n - x^*\| + C_4\,\|x_n - x^*\|^2,$$
where
$$g(r^*) = B\big[1 - B(2\alpha + (L_0 + 2M_0)r^*)(L_0 + 2M_0)r^*\big]^{-1};$$
$$C_1 = g(r^*)\,M_0\,\eta; \qquad C_2 = g(r^*)\,(L_0 + M_0)\,\eta;$$
$$C_3 = g(r^*)\,(\alpha + (L_0 + 2M_0)r^*)\,M;$$
$$C_4 = g(r^*)\,(\alpha + (L_0 + 2M_0)r^*)\,\frac{L}{2}.$$
The proof of Corollary 3 is analogous to the proof of Theorem 1.
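To see where the expression for $r^*$ comes from, here is a brief sketch of our own (not taken verbatim from [4]): with constant $L_0$, $L$, $M_0$, $M$, the equation $q(r) = 0$ of Theorem 1 reduces to a quadratic in $r$,
$$B\Big[(\alpha + T_0 r)\Big(\frac{L}{2} + M\Big)r + (2\alpha + T_0 r)\,T_0 r + T_0\,\eta\Big] - 1 = 0 \quad\Longleftrightarrow\quad B T_0 (2T_0 + T)\,r^2 + B\alpha (4T_0 + T)\,r - 2(1 - B T_0 \eta) = 0,$$
and rationalizing the positive root of this quadratic yields the closed-form expression for $r^*$ stated in Corollary 3.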

3. Numerical Examples

In this section, we give examples to show the applicability of method (2) and to confirm Remark 2. We use the norm $\|x\| = \sqrt{\sum_{i=1}^{p} x_i^2}$ for $x \in \mathbb{R}^p$.
Example 1.
Let the function $F + G : \mathbb{R}^2 \to \mathbb{R}^3$ be defined by
$$F(x) + G(x) = \begin{pmatrix} 3u^2 v + v^2 - 1 + |u^2 - 1| \\ u^4 + u v^3 - 1 + |v| \\ v - 0.3 + |u - 1| \end{pmatrix},$$
$$F(x) = \begin{pmatrix} 3u^2 v + v^2 - 1 \\ u^4 + u v^3 - 1 \\ v - 0.3 \end{pmatrix}, \qquad G(x) = \begin{pmatrix} |u^2 - 1| \\ |v| \\ |u - 1| \end{pmatrix},$$
where $x = (u, v)$. The solution of this problem is $x^* \approx (0.917889, 0.288314)$, and $\eta \approx 0.079411$.
Let us give the number of iterations needed to obtain an approximate solution of this problem. We test method (2) for different initial points $x_0 = \delta\,(1.1, 0.5)^T$, where $\delta \in \mathbb{R}$, and use the stopping criterion $\|x_{n+1} - x_n\| \le \varepsilon$. The additional point is $x_{-1} = x_0 + 10^{-4}$. The numerical results are shown in Table 1.
In Table 2, we give the values of $x_{n+1}$, $\|x_{n+1} - x_n\|$, and the norm of the residual $\|F(x_{n+1}) + G(x_{n+1})\|$ at each iteration.
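As an illustration, the setup of Example 1 could be coded as follows. This is a sketch that reuses the hypothetical gauss_newton_secant routine from the sketch in the Introduction; the Jacobian dF is written out by hand.

```python
import numpy as np

def F(x):                       # differentiable part
    u, v = x
    return np.array([3*u**2*v + v**2 - 1, u**4 + u*v**3 - 1, v - 0.3])

def dF(x):                      # Frechet derivative (Jacobian) of F
    u, v = x
    return np.array([[6*u*v,         3*u**2 + 2*v],
                     [4*u**3 + v**3, 3*u*v**2    ],
                     [0.0,           1.0         ]])

def G(x):                       # nondifferentiable part
    u, v = x
    return np.array([abs(u**2 - 1), abs(v), abs(u - 1)])

x0 = np.array([0.8, 0.2])       # starting point used for Table 2
x_minus1 = x0 + 1e-4            # additional point x_{-1}
x_star = gauss_newton_secant(F, dF, G, x0, x_minus1, tol=1e-6)
# x_star should be close to (0.917889, 0.288314)
```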
Example 2.
Let the function $F + G : D \subset \mathbb{R} \to \mathbb{R}^3$ be defined by [5]:
$$F(x) + G(x) = \begin{pmatrix} x + \mu \\ \lambda x^3 + x - \mu \\ \lambda |x^2 - 1| - \lambda \end{pmatrix},$$
$$F(x) = \begin{pmatrix} x + \mu \\ \lambda x^3 + x - \mu \\ 0 \end{pmatrix}, \qquad G(x) = \begin{pmatrix} 0 \\ 0 \\ \lambda |x^2 - 1| - \lambda \end{pmatrix},$$
where $\lambda, \mu \in \mathbb{R}$ are two parameters. Here, $x^* = 0$ and $\eta = \sqrt{2}\,|\mu|$. Thus, if $\mu = 0$, then we have a problem with zero residual.
Let us consider Example 2 and show that $r_1^* \le r^*$ and that the new error estimates (64) are tighter than the corresponding ones in [4]. We consider the case of the classical Lipschitz conditions (Corollary 3). The error estimates from [4] are as follows:
$$\|x_{n+1} - x^*\| \le C_1^1\,\|x_{n-1} - x^*\| + C_2^1\,\|x_n - x^*\| + C_3^1\,\|x_{n-1} - x^*\|\,\|x_n - x^*\| + C_4^1\,\|x_n - x^*\|^2,$$
where
$$g_1(r) = B\big[1 - B(2\alpha + (L_1 + 2M_1)r)(L_1 + 2M_1)r\big]^{-1};$$
$$C_1^1 = g_1(r_1^*)\,M_1\,\eta; \qquad C_2^1 = g_1(r_1^*)\,(L_1 + M_1)\,\eta;$$
$$C_3^1 = g_1(r_1^*)\,(\alpha + (L_1 + 2M_1)r_1^*)\,M_1;$$
$$C_4^1 = g_1(r_1^*)\,(\alpha + (L_1 + 2M_1)r_1^*)\,\frac{L_1}{2}.$$
They can be obtained from (64) by replacing $r^*$, $L_0$, $L$, $M_0$, $M$ in $g(r^*)$, $C_1$, $C_2$, $C_3$, $C_4$ with $r_1^*$, $L_1$, $L_1$, $M_1$, $M_1$, respectively. Similarly,
$$r_1^* = \frac{4(1 - BT_1\eta)}{5B\alpha T_1 + \sqrt{25B^2\alpha^2 T_1^2 + 24BT_1^2(1 - BT_1\eta)}}, \qquad T_1 = L_1 + 2M_1.$$
Let us choose $D = (-0.5, 0.5)$. Thus, we have $B = 0.5$, $\eta = \sqrt{2}\,|\mu|$, $\alpha = \sqrt{2}$, $L_0 = \max_{x \in D} 3|\lambda|\,|x|$, $L = \max_{x, y \in \Omega_0} 3|\lambda|\,|x + y|$, $L_1 = \max_{x, y \in D} 3|\lambda|\,|x + y|$, and $M_0 = M = M_1 = |\lambda|$. The radii are given in Table 3.
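The radii in Table 3 can be checked with a short script. The following sketch is our own; it uses the constants listed above and the closed-form radii of Corollary 3 and of [4] as reconstructed in this paper, so it should reproduce the first row of Table 3 up to rounding.

```python
import numpy as np

# Parameters of Example 2 on D = (-0.5, 0.5); first row of Table 3
lam, mu = 0.4, 0.0
B, alpha, eta = 0.5, np.sqrt(2), np.sqrt(2) * abs(mu)
L0, M0 = 3 * abs(lam) * 0.5, abs(lam)        # center constants on D
L1, M1 = 3 * abs(lam) * 1.0, abs(lam)        # constants of [4] on D

# Radius gamma defining Omega_0, then the restricted constant L on Omega_0
gamma = (np.sqrt(B**2 * alpha**2 + B) - B * alpha) / (B * (L0 + 2 * M0))
rad0 = min(gamma, 0.5)                       # Omega_0 = D intersected with the gamma-ball
L, M = 3 * abs(lam) * 2 * rad0, abs(lam)

# Convergence radii of Corollary 3 (new) and of [4] (old)
T0, T, T1 = L0 + 2 * M0, L + 2 * M, L1 + 2 * M1
r_new = 4 * (1 - B * T0 * eta) / (
    B * alpha * (4 * T0 + T)
    + np.sqrt(B**2 * alpha**2 * (4 * T0 + T)**2
              + 8 * B * T0 * (2 * T0 + T) * (1 - B * T0 * eta)))
r_old = 4 * (1 - B * T1 * eta) / (
    5 * B * alpha * T1
    + np.sqrt(25 * B**2 * alpha**2 * T1**2
              + 24 * B * T1**2 * (1 - B * T1 * eta)))
print(r_new, r_old)   # expected: about 0.319259 and 0.235702
```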
Table 4 and Table 5 report the left- and right-hand sides of the error estimates (64) and (73). We obtained these results for $\varepsilon = 10^{-8}$ and the starting approximations $x_{-1} = 0.2001$, $x_0 = 0.2$. We see that the new error bounds (64) are tighter than the corresponding bounds (73) from [4].

4. Conclusions

We developed an improved local convergence analysis of the Gauss–Newton–Secant method for solving nonlinear least squares problems with a nondifferentiable operator. We used center and restricted-radius Lipschitz conditions to study the method. As a consequence, we obtain a larger radius of convergence and tighter error estimates under the same computational effort as in earlier papers. This idea can be used to extend the applicability of other methods involving inverses, such as Newton-type, Secant-type, single-step, or multi-step methods, to mention a few; this will be the subject of future work. Finally, it is worth mentioning that, apart from the methods used in this paper, some representative computational intelligence algorithms can also be applied to such problems, such as monarch butterfly optimization (MBO) [18], the earthworm optimization algorithm (EWA) [19], elephant herding optimization (EHO) [20], the moth search (MS) algorithm [21], the slime mould algorithm (SMA), and Harris hawks optimization (HHO) [22].

Author Contributions

Editing, I.K.A.; Conceptualization S.S.; Investigation I.K.A., S.S., R.I., H.Y. and M.I.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Li, C.; Zhang, W.; Jin, X. Convergence and uniqueness properties of Gauss-Newton’s method. Comput. Math. Appl. 2004, 47, 1057–1067. [Google Scholar] [CrossRef] [Green Version]
  2. Argyros, I.K.; Ren, H. A derivative free iterative method for solving least squares problems. Numer. Algorithms 2011, 58, 555–571. [Google Scholar]
  3. Shakhno, S.M.; Gnatyshyn, O.P. On an iterative algorithm of order 1.839... for solving the nonlinear least squares problems. Appl. Math. Comput. 2005, 161, 253–264. [Google Scholar] [CrossRef]
  4. Shakhno, S.M.; Iakymchuk, R.P.; Yarmola, H.P. An iterative method for solving nonlinear least squares problems with nondifferentiable operator. Mat. Stud. 2017, 48, 97–107. [Google Scholar] [CrossRef] [Green Version]
  5. Shakhno, S.M.; Iakymchuk, R.P.; Yarmola, H.P. Convergence analysis of a two-step method for the nonlinear least squares problem with decomposition of operator. J. Numer. Appl. Math. 2018, 128, 82–95. [Google Scholar]
  6. Shakhno, S.; Shunkin, Y. One combined method for solving nonlinear least squares problems. Visnyk Lviv Univ. Ser. Appl. Math. Comp. Sci. 2017, 25, 38–48. (In Ukrainian) [Google Scholar]
  7. Ulm, S. On generalized divided differences. Izv. ESSR Ser. Phys. Math. 1967, 16, 13–26. (In Russian) [Google Scholar]
  8. Cătinaş, E. On some iterative methods for solving nonlinear equations. Rev. Anal. Numér. Théor. Approx. 1994, 23, 47–53. [Google Scholar]
  9. Shakhno, S.M.; Mel’nyk, I.V.; Yarmola, H.P. Convergence analysis of combined method for solving nonlinear equations. J. Math. Sci. 2016, 212, 16–26. [Google Scholar] [CrossRef]
  10. Shakhno, S.M. Convergence of combined Newton-Secant method and uniqueness of the solution of nonlinear equations. Sci. J. Tntu 2013, 1, 243–252. (In Ukrainian) [Google Scholar]
  11. Zabrejko, P.P.; Nguen, D.F. The majorant method in the theory of Newton-Kantorovich approximations and the Pták error estimates. Numer. Funct. Anal. Optim. 1987, 9, 671–686. [Google Scholar] [CrossRef]
  12. Wang, X.; Li, C. Convergence of Newton’s method and uniqueness of the solution of equations in Banach space II. Acta Math. Sin. 2003, 19, 405–412. [Google Scholar] [CrossRef]
  13. Wang, X. Convergence of Newton’s method and uniqueness of the solution of equations in Banach space. IMA J. Numer. Anal. 2000, 20, 123–134. [Google Scholar] [CrossRef] [Green Version]
  14. Argyros, I.K.; Hilout, S. On an improved convergence analysis of Newton’s method. Appl. Math. Comput. 2013, 225, 372–386. [Google Scholar] [CrossRef]
  15. Argyros, I.K.; Magreñán, A.A. Iterative Methods and Their Dynamics with Applications: A Contemporary Study; CRC Press: Boca Raton, FL, USA, 2017. [Google Scholar]
  16. Dennis, J.E.; Schnabel, R.B. Numerical Methods for Unconstrained Optimization and Nonlinear Equations; SIAM: Philadelphia, PA, USA, 1996. [Google Scholar]
  17. Ren, H.; Argyros, I.K. Local convergence of a secant type method for solving least squares problems. Appl. Math. Comput. 2010, 217, 3816–3824. [Google Scholar] [CrossRef]
  18. Wang, G.G.; Deb, S.; Cui, Z. Monarch butterfly optimization. Neural Comput. Appl. 2019, 31, 1995–2014. [Google Scholar] [CrossRef] [Green Version]
  19. Wang, G.G.; Deb, S.; Dos, L.; Coelho, L.D.S. Earthworm optimization algorithm: A bio-inspired metaheuristic algorithm for global optimization problems. Int. J. Bio-Inspired Comput. 2018, 12, 1–22. [Google Scholar] [CrossRef]
  20. Wang, G.G.; Deb, S.; Coelho, L.D.S. Elephant Herding Optimization. In Proceedings of the 3rd International Symposium on Computational and Business Intelligence (ISCBI 2015), Bali, Indonesia, 7–9 December 2015; pp. 1–5. [Google Scholar]
  21. Mirjalili, S. Moth-flame optimization algorithm: A novel nature-inspired heuristic paradigm. Knowl.-Based Syst. 2015, 89, 228–249. [Google Scholar] [CrossRef]
  22. Zhao, J.; Gao, Z.-M. The hybridized Harris hawk optimization and slime mould algorithm. J. Phys. Conf. Ser. 2020, 1682, 012029. [Google Scholar] [CrossRef]
Table 1. Results for Example 1, $\varepsilon = 10^{-8}$.

δ                      0.1   1    5    10   100
Number of iterations   12    8    15   17   25
Table 2. Iterative sequence, step norm, and residual for Example 1, $x_0 = (0.8, 0.2)^T$, $\varepsilon = 10^{-6}$.

n   $x_{n+1}$              $\|x_{n+1} - x_n\|$   $\|F(x_{n+1}) + G(x_{n+1})\|$
0   (0.937901, 0.312602)   0.178033              0.143759
1   (0.918455, 0.290216)   2.965298 × 10⁻²       7.973496 × 10⁻²
2   (0.917850, 0.288333)   1.977741 × 10⁻³       7.941104 × 10⁻²
3   (0.917888, 0.288313)   4.346993 × 10⁻⁵       7.941092 × 10⁻²
4   (0.917889, 0.288314)   7.873833 × 10⁻⁷       7.941092 × 10⁻²
Table 3. Radii of convergence domains.

λ     μ     $L_0$   $L$        $L_1$   $M$    $r^*$      $r_1^*$
0.4   0     0.6     1.004205   1.2     0.4    0.319259   0.235702
0.1   0.2   0.15    0.3        0.3     0.1    1.192633   0.885163
Table 4. Results for $\lambda = 0.4$, $\mu = 0$.

n   $|x_{n+1} - x^*|$    Right-hand side of (64)   Right-hand side of (73)
0   4.364164 × 10⁻³      0.125318                  0.169740
1   1.425535 × 10⁻⁵      1.245455 × 10⁻³           1.529729 × 10⁻³
2   2.179258 × 10⁻¹¹     8.675961 × 10⁻⁸           1.060957 × 10⁻⁷
3   3.542853 × 10⁻²²     4.314684 × 10⁻¹⁶          5.272102 × 10⁻¹⁶
Table 5. Results for $\lambda = 0.1$, $\mu = 0.2$.

n   $|x_{n+1} - x^*|$    Right-hand side of (64)   Right-hand side of (73)
0   2.063103 × 10⁻³      5.909333 × 10⁻²           8.484100 × 10⁻²
1   5.453349 × 10⁻⁷      9.113893 × 10⁻³           1.080560 × 10⁻²
2   2.054057 × 10⁻¹⁴     9.051468 × 10⁻⁵           1.057648 × 10⁻⁴
3   1.447579 × 10⁻¹⁸     2.390964 × 10⁻⁸           2.792694 × 10⁻⁸
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
