Article

Q-Curve and Area Rules for Choosing Heuristic Parameter in Tikhonov Regularization

Institute of Mathematics and Statistics, University of Tartu, 51009 Tartu, Estonia
* Author to whom correspondence should be addressed.
Mathematics 2020, 8(7), 1166; https://doi.org/10.3390/math8071166
Submission received: 8 June 2020 / Revised: 8 July 2020 / Accepted: 12 July 2020 / Published: 16 July 2020
(This article belongs to the Special Issue Inverse and Ill-Posed Problems)

Abstract: We consider the choice of the regularization parameter in the Tikhonov method when the noise level of the data is unknown. One of the best rules for the heuristic parameter choice is the quasi-optimality criterion, where the parameter is chosen as the global minimizer of the quasi-optimality function. In some problems this rule fails. We prove that one of the local minimizers of the quasi-optimality function is always a good regularization parameter. For the choice of the proper local minimizer we propose to construct the Q-curve, which is the analogue of the L-curve, but on the x-axis we use the modified discrepancy instead of the discrepancy and on the y-axis the quasi-optimality function instead of the norm of the approximate solution. In the area rule we choose for the regularization parameter the local minimizer of the quasi-optimality function for which the area of the polygon, connecting this minimum point with certain maximum points on the Q-curve, is maximal. We also provide a posteriori error estimates of the approximate solution, which allow one to check the reliability of a heuristically chosen parameter. Numerical experiments on an extensive set of test problems confirm that the proposed rules give much better results than previous heuristic rules. The results of the proposed rules are comparable with the results of the discrepancy principle and the monotone error rule, if the latter two rules use the exact noise level.

1. Introduction

Let $A \in \mathcal{L}(H, F)$ be a linear bounded operator between real Hilbert spaces $H$, $F$. We are interested in finding the minimum norm solution $u_*$ of the equation
$$A u = f_*, \qquad f_* \in \mathcal{R}(A),$$
where noisy data $f \in F$ are given instead of the exact data $f_*$. The range $\mathcal{R}(A)$ may be non-closed and the kernel $\mathcal{N}(A)$ may be non-trivial, so in general this problem is ill-posed. We consider the solution of the problem $A u = f$ by the Tikhonov method (see [1,2]), where the regularized solutions in the cases of exact and inexact data have the corresponding forms
$$u_\alpha^+ = (\alpha I + A^*A)^{-1} A^* f_*, \qquad u_\alpha = (\alpha I + A^*A)^{-1} A^* f,$$
and $\alpha > 0$ is the regularization parameter. Using the well-known estimate $\|u_\alpha - u_\alpha^+\| \le \frac{1}{2} \alpha^{-1/2} \|f - f_*\|$ (see [1,2]) and the notations
$$e(\alpha) := \|u_\alpha - u_*\|, \qquad e_1(\alpha) := \|u_\alpha^+ - u_*\| + \|u_\alpha - u_\alpha^+\|,$$
$$e_2(\alpha, \|f - f_*\|) := \|u_\alpha^+ - u_*\| + \frac{1}{2\sqrt{\alpha}} \|f - f_*\|,$$
we have the error estimates
$$e(\alpha) \le e_1(\alpha) \le e_2(\alpha, \|f - f_*\|).$$
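For a discretized problem these formulas can be evaluated directly. The following sketch (our illustration, not code from the paper; the test matrix, names and noise level are hypothetical) computes the Tikhonov approximation via the SVD of $A$, which also makes the later evaluation of the parameter choice functionals on a whole grid of $\alpha$-values cheap.

```python
import numpy as np

def tikhonov_svd(A, f, alphas):
    """Tikhonov approximations u_alpha = (alpha I + A^T A)^{-1} A^T f
    for every alpha in `alphas`, computed from one SVD of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    beta = U.T @ f  # coefficients of f in the left singular basis
    # filter factors s_i / (alpha + s_i^2), applied componentwise
    return [Vt.T @ (s * beta / (alpha + s**2)) for alpha in alphas]

# hypothetical example: an ill-conditioned Hilbert-type matrix
n = 50
A = 1.0 / (np.arange(n)[:, None] + np.arange(n)[None, :] + 1.0)
u_star = np.ones(n)
f = A @ u_star + 1e-4 * np.random.default_rng(0).standard_normal(n)
u_alpha = tikhonov_svd(A, f, [1e-6])[0]
print(np.linalg.norm(u_alpha - u_star))
```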
We consider the choice of the regularization parameter when the noise level for $\|f - f_*\|$ is unknown. Parameter choice rules which do not use the noise level information are called heuristic rules. Well-known heuristic rules are the quasi-optimality criterion [3,4,5,6,7,8,9], the L-curve rule [10,11], the GCV (generalized cross-validation) rule [12], the Hanke–Raus rule [13], Reginska's rule [14] and its modifications [15]; for other rules see [16,17,18]. Heuristic rules are numerically compared in [4,17,18,19]. Heuristic rules give good results in many problems, but it is not possible to construct a heuristic rule guaranteeing the convergence $\|u_\alpha - u_*\| \to 0$ as the noise level goes to zero (see [20]). All heuristic rules may fail in some problems, and without additional information about the solution it is difficult to decide whether the obtained parameter is reliable or not.
In the quasi-optimality criterion the parameter $\alpha$ is chosen as the global minimizer of the function $\psi_Q(\alpha) = \alpha \left\| \frac{d u_\alpha}{d \alpha} \right\|$ on a certain interval. We propose to choose the parameter from the set $L_{\min}$ of local minimizers of this function on a certain set $\Omega$ of parameters.
We call the parameter $\alpha_R$ of an arbitrary rule R pseudooptimal if
$$\|u_{\alpha_R} - u_*\| \le c \min_{\alpha > 0} e_1(\alpha)$$
with a relatively small constant $c$, and we show that at least one parameter from the set $L_{\min}$ has this property. For the choice of the proper parameter from the set $L_{\min}$ some algorithms were proposed in [21]; in the current work we propose other algorithms. We propose to construct the Q-curve, which is the analogue of the L-curve [10], but on the x-axis we use the modified discrepancy instead of the discrepancy and on the y-axis the function $\psi_Q(\alpha)$ instead of $\|u_\alpha\|$. For finding the proper local minimizer of the function $\psi_Q(\alpha)$ we propose area rules on the Q-curve. The idea of the proposed rules is that we form, for every minimizer of the function $\psi_Q(\alpha)$, a certain function which approximates the error of the approximate solution and has one minimizer; we choose for the regularization parameter the local minimizer of $\psi_Q(\alpha)$ for which the area of the polygon, connecting this minimum point with certain maximum points, is maximal.
The plan of this paper is as follows. In Section 2 we consider known rules for the choice of the regularization parameter, both in the case of known and of unknown noise level. In Section 3 we prove that the set $L_{\min}$ contains at least one pseudooptimal parameter. In Section 4, information about the used test problems (mainly from [11,22], but also from [23,24,25,26]) and the numerical experiments is given. In Section 5 we consider the Q-curve and the area rule, and in Section 6 further developments of the area rule. These algorithms are also illustrated by the results of numerical experiments.

2. Rules for the Choice of the Regularization Parameter

2.1. Parameter Choice in the Case of Known Noise Level

In the case of a known noise level $\delta$, $\|f - f_*\| \le \delta$, we use one of the so-called $\delta$-rules, where a certain functional $d(\alpha)$ and a constant $b \ge b_0$ ($b_0$ depends on $d(\alpha)$) are chosen and the regularization parameter $\alpha(\delta)$ is chosen as a solution of the equation $d(\alpha) = b\delta$.
(1) Discrepancy principle (DP) [2,27]:
$$d_D(\alpha) := \|A u_\alpha - f\| = b\delta, \qquad b \ge 1.$$
(2) Modified discrepancy principle (Raus–Gfrerer rule) [28,29]:
$$d_{MD}(\alpha) := \|B_\alpha (A u_\alpha - f)\| = b\delta, \qquad B_\alpha := \alpha^{1/2} (\alpha I + A A^*)^{-1/2}, \quad b \ge 1.$$
(3) Monotone error rule (ME-rule) [30,31]:
$$d_{ME}(\alpha) := \frac{\|B_\alpha (A u_\alpha - f)\|^2}{\|B_\alpha^2 (A u_\alpha - f)\|} = \delta.$$
The name of this rule is justified by the fact that the chosen parameter $\alpha_{ME}$ satisfies
$$\frac{d}{d\alpha} \|u_\alpha - u_*\| > 0 \qquad \forall \alpha > \alpha_{ME}.$$
Therefore $\alpha_{ME} \ge \alpha_{opt} := \operatorname{argmin} \|u_\alpha - u_*\|$.
(4) Monotone error rule with post-estimation (MEe-rule) [4,18,32,33,34,35]. The inequality $\alpha_{ME} \ge \alpha_{opt}$ suggests using a somewhat smaller parameter than $\alpha_{ME}$. Extensive numerical experiments suggest computing $\alpha_{ME}$ and using the post-estimated parameter $\alpha_{MEe} := 0.4\,\alpha_{ME}$. Then typically $\|u_{\alpha_{MEe}} - u_*\| / \|u_{\alpha_{ME}} - u_*\| \in (0.7, 0.9)$. If the exact noise level is known, this MEe-rule typically gives the best results of all $\delta$-rules.
(5) Rule R1 [36]. Let $b > \frac{2}{3\sqrt{3}}$. Choose $\alpha(\delta)$ as the smallest solution of the equation
$$d_{R1}(\alpha(\delta)) := \alpha^{-1/2} \|A^* B_\alpha^2 (A u_\alpha - f)\| = b\delta.$$
Note that this equation can be rewritten using the 2-iterated Tikhonov approximation $u_{2,\alpha}$:
$$\|B_\alpha^2 (A u_\alpha - f)\| = \|A u_{2,\alpha} - f\|, \qquad u_{2,\alpha} = (\alpha I + A^*A)^{-1} (\alpha u_\alpha + A^* f).$$
The last four rules are weakly quasioptimal rules (see [37]) for the Tikhonov method: if $\|f - f_*\| \le \delta$, then $\|u_{\alpha(\delta)} - u_*\| \le C(b) \inf_{\alpha > 0} e_2(\alpha, \delta)$ (see (3)). Rules for the parameter choice in the case of an approximately given noise level are proposed and analyzed in [18,32,33,34].
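The $\delta$-rule functionals are easy to evaluate in the SVD basis, where $B_\alpha$ acts as $\sqrt{\alpha/(\alpha + s_i^2)}$ on the left singular vectors and as the identity on their orthogonal complement. A minimal sketch (our illustration; the function and variable names are ours):

```python
import numpy as np

def delta_rule_functionals(A, f, alpha):
    """d_D, d_MD and d_ME at one alpha, computed via the SVD of A."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    beta = U.T @ f
    u_alpha = Vt.T @ (s * beta / (alpha + s**2))
    r = A @ u_alpha - f                   # residual A u_alpha - f
    rc = U.T @ r                          # components of r in span(U)
    r_perp = r - U @ rc                   # B_alpha acts as identity here
    w = np.sqrt(alpha / (alpha + s**2))   # spectral weights of B_alpha
    Br = U @ (w * rc) + r_perp
    B2r = U @ (w**2 * rc) + r_perp
    return (np.linalg.norm(r),                            # d_D
            np.linalg.norm(Br),                           # d_MD
            np.linalg.norm(Br)**2 / np.linalg.norm(B2r))  # d_ME
```

A $\delta$-rule then scans a geometric grid of $\alpha$-values from $\alpha_0$ downwards and stops at the first grid point with $d(\alpha) \le b\delta$; for the MEe-rule one additionally sets $\alpha_{MEe} = 0.4\,\alpha_{ME}$.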

2.2. Parameter Choice in the Case of Unknown Noise Level

A classical heuristic rule is the quasi-optimality criterion. In the Tikhonov method it chooses $\alpha = \alpha_Q$ or $\alpha = \alpha_{QD}$ as the global minimizer of the corresponding function
$$\psi_Q(\alpha) = \alpha \left\| \frac{d u_\alpha}{d \alpha} \right\| = \alpha^{-1} \|A^* B_\alpha^2 (A u_\alpha - f)\| = \alpha \|A^* (\alpha I + A A^*)^{-2} f\| = \|u_{2,\alpha} - u_\alpha\|,$$
$$\psi_{QD}(\alpha) = (1 - q)^{-1} \|u_\alpha - u_{q\alpha}\|, \qquad 0 < q < 1.$$
The Hanke–Raus rule finds the parameter $\alpha = \alpha_{HR}$ as the global minimizer of the function
$$\psi_{HR}(\alpha) = \alpha^{-1/2} \|B_\alpha (A u_\alpha - f)\|.$$
In practice the L-curve is often used. The L-curve is the log-log plot of $\|u_\alpha\|$ versus $\|A u_\alpha - f\|$. The points $(\|A u_\alpha - f\|, \|u_\alpha\|)$ often have a shape similar to the letter L, and the parameter $\alpha_L$ which corresponds to the corner point is often a good parameter. In the literature several concrete rules for the choice of the corner point have been proposed. In [14], the parameter is chosen as the global minimizer of the function
$$\psi_{RE}(\alpha) = \|A u_\alpha - f\| \, \|u_\alpha\|^\tau, \qquad \tau \ge 1$$
(below we use this rule with $\tau = 1$). Another rule for the choice of the corner point is the maximum curvature method ([38,39]), where the parameter $\alpha$ is chosen for which the curvature of the L-curve
$$\psi_{MC}(\alpha) = \frac{2 \left( \hat\rho'\, \hat\xi'' - \hat\rho''\, \hat\xi' \right)}{\left( (\hat\rho')^2 + (\hat\xi')^2 \right)^{3/2}}$$
is maximal. Here $\hat\rho'$, $\hat\xi'$, $\hat\rho''$, $\hat\xi''$ are the first and second order derivatives of the functions $\log d_D(\alpha)$ and $\log \|u_\alpha\|$.
We also propose a new heuristic rule, where the global minimizer of the function
$$\psi_{WQ}(\alpha) = d_{MD}(\alpha)\, \psi_Q(\alpha)$$
is chosen for the parameter. We call this rule the weighted quasioptimality criterion.
In the following we will find the regularization parameter from the set of parameters
$$\Omega = \{\alpha_j : \alpha_j = q\, \alpha_{j-1},\ j = 1, 2, \dots, N\}, \qquad 0 < q < 1,$$
where $\alpha_0$, $q$, $\alpha_N$ are given. If in the discretized problem the minimal eigenvalue $\lambda_{\min}$ of the matrix $A^T A$ is larger than $\alpha_N$, the heuristic rules above often choose the parameter $\alpha_N$, which is generally not a good parameter. The works [6,7,8] propose to search for the global minimum of the function $\psi_Q(\alpha)$ on the interval $[\max(\alpha_N, \lambda_{\min}), \alpha_0]$.
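A minimal sketch of the grid $\Omega$ and of the evaluation of $\psi_Q$ on it (our illustration; in the SVD basis $\psi_Q(\alpha) = \| \alpha s_i \beta_i / (\alpha + s_i^2)^2 \|$ with $\beta = U^T f$, and the function name and defaults are ours):

```python
import numpy as np

def psi_Q_on_grid(A, f, q=0.95, alpha_min=1e-18, alpha0=None):
    """The set Omega = {alpha_j = q * alpha_{j-1}} and the quasi-optimality
    function psi_Q(alpha_j) = ||u_{2,alpha_j} - u_{alpha_j}|| on it."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    beta = U.T @ f
    if alpha0 is None:
        alpha0 = s[0]**2          # alpha_0 proportional to ||A^T A|| (cf. Remark 1)
    N = int(np.ceil(np.log(alpha_min / alpha0) / np.log(q)))
    alphas = alpha0 * q ** np.arange(N + 1)
    # psi_Q(alpha) = alpha * ||(alpha I + A^T A)^{-2} A^T f||
    psi = np.array([np.linalg.norm(a * s * beta / (a + s**2)**2)
                    for a in alphas])
    return alphas, psi
```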
Definition 1.
We say that the discretized problem $A u = f$ does not need regularization if
$$e_1(\lambda_{\min}) \le 2 \min_{\alpha \in \Omega,\ \alpha \ge \lambda_{\min}} e_1(\alpha).$$
If the discretized problem does not need regularization, then $\alpha = 0$ or $\alpha = \alpha_N$ is a proper parameter, since for $\alpha \le \lambda_{\min}$ we have
$$\|u_\alpha - u_*\| \le e_1(\alpha) = \|u_\alpha^+ - u_*\| + \|(\alpha I + A^*A)^{-1} A^* (f - f_*)\| \le \|u_{\lambda_{\min}}^+ - u_*\| + 2 \|(\lambda_{\min} I + A^*A)^{-1} A^* (f - f_*)\| \le 2 e_1(\lambda_{\min}) \le 4 \min_{\alpha \in \Omega,\ \alpha \ge \lambda_{\min}} e_1(\alpha).$$
Searching for the parameter in the interval $[\max(\alpha_N, \lambda_{\min}), \alpha_0]$ amounts to the a priori assumption that the discretized problem needs regularization. Note that if $\lambda_{\min} > \alpha_N$, then in general it is not possible to decide (without additional information about the solution or about the noise of the data) whether the discretized problem needs regularization or not.

3. Local Minimum Points of the Function $\psi_Q(\alpha)$

In the following we investigate the function $\psi_Q(\alpha)$ in (5) and show that at least one local minimizer of this function is a pseudooptimal parameter. We need some preliminary results.
Lemma 1.
The functions $\psi_Q(\alpha)$, $\psi_{QD}(\alpha)$ satisfy for each $\alpha > 0$ the estimates
$$\psi_Q(\alpha) \le e_1(\alpha),$$
$$\psi_{QD}(\alpha) \le q^{-1} e_1(\alpha),$$
$$\psi_Q(\alpha) \le \psi_{QD}(\alpha) \le q^{-1} \psi_Q(q\alpha).$$
Proof. 
Using the relations $f = A u_* + (f - f_*)$,
$$u_\alpha - u_{q\alpha} = -(1 - q)\, \alpha\, (\alpha I + A^*A)^{-1} (q\alpha I + A^*A)^{-1} A^* f,$$
$$\|A^*A (\alpha I + A^*A)^{-1}\| \le 1, \qquad \|\alpha (\alpha I + A^*A)^{-1}\| \le 1,$$
we have
$$\psi_Q(\alpha) = \alpha \|A^* (\alpha I + A A^*)^{-2} f\| = \alpha \|(\alpha I + A^*A)^{-2} A^* f\| \le \alpha \|A^*A (\alpha I + A^*A)^{-2} u_*\| + \alpha \|(\alpha I + A^*A)^{-2} A^* (f - f_*)\| \le \alpha \|(\alpha I + A^*A)^{-1} u_*\| + \|(\alpha I + A^*A)^{-1} A^* (f - f_*)\| = e_1(\alpha),$$
$$\psi_{QD}(\alpha) \le \alpha \|A^*A (q\alpha I + A^*A)^{-1} (\alpha I + A^*A)^{-1} u_*\| + \alpha \|(q\alpha I + A^*A)^{-1} (\alpha I + A^*A)^{-1} A^* (f - f_*)\| \le q^{-1} e_1(\alpha),$$
$$\psi_Q(\alpha) = \alpha \|(\alpha I + A^*A)^{-2} A^* f\| \le \alpha \|(\alpha I + A^*A)^{-1} (q\alpha I + A^*A)^{-1} A^* f\| = \psi_{QD}(\alpha) \le \alpha \|(q\alpha I + A^*A)^{-2} A^* f\| = q^{-1} \psi_Q(q\alpha).$$
 ☐
Remark 1.
Note that $\lim_{\alpha \to \infty} \psi_Q(\alpha) = 0$, but $\lim_{\alpha \to \infty} e_1(\alpha) = \|u_*\|$. Therefore, in the case of a too large $\alpha_0$ this $\alpha_0$ may be a global (or local) minimizer of the function $\psi_Q(\alpha)$. A scaling argument suggests: multiplying the equation $A^*A u = A^* f$ by some constant requires multiplying $\alpha$ by the same constant in the Tikhonov method. Therefore the parameter $\alpha_0$ should be proportional to $\|A^*A\|$. We recommend taking $\alpha_0 = c \|A^*A\|$, $c \ge 1$, or minimizing the function $\bar\psi_Q(\alpha) := (1 + \alpha/\|A^*A\|)\, \psi_Q(\alpha)$ instead of $\psi_Q(\alpha)$. Due to the equality $\lim_{\alpha \to 0} (1 + \alpha/\|A^*A\|) = 1$, the function $\bar\psi_Q(\alpha)$ approximately satisfies (9) for small $\alpha$.
In the following we define the local minimum points of the function $\psi_Q(\alpha)$ on the set $\Omega$ (see (8)).
Definition 2.
We say that the parameter $\alpha_k$, $0 \le k \le N-1$, is a local minimum point of the sequence $\psi_Q(\alpha_k)$ if $\psi_Q(\alpha_k) < \psi_Q(\alpha_{k+1})$ and, in the case $k > 0$, there exists an index $j \ge 1$ such that $\psi_Q(\alpha_k) = \psi_Q(\alpha_{k-1}) = \dots = \psi_Q(\alpha_{k-j+1}) < \psi_Q(\alpha_{k-j})$. The parameter $\alpha_N$ is a local minimum point if there exists an index $j \ge 1$ such that
$$\psi_Q(\alpha_N) = \psi_Q(\alpha_{N-1}) = \dots = \psi_Q(\alpha_{N-j+1}) < \psi_Q(\alpha_{N-j}).$$
Denote the local minimum points by $m_k$, $k = 1, \dots, K$ ($K$ is the number of minimum points), and the corresponding set by $L_{\min} = \{m_k : m_1 > m_2 > \dots > m_K\}$.
Definition 3.
The parameter $\alpha_k$, $0 < k < N$, is a local maximum point of the sequence $\psi_Q(\alpha_k)$ if $\psi_Q(\alpha_k) > \psi_Q(\alpha_{k+1})$ and there exists an index $j \ge 1$ such that
$$\psi_Q(\alpha_k) = \psi_Q(\alpha_{k-1}) = \dots = \psi_Q(\alpha_{k-j+1}) > \psi_Q(\alpha_{k-j}).$$
We denote by $M_k$ the local maximum point between the local minimum points $m_{k+1}$ and $m_k$, $1 \le k \le K-1$. Denote $M_0 = \alpha_0$, $M_K = \alpha_N$. Then by construction
$$M_K \le m_K < M_{K-1} < \dots < m_2 < M_1 < m_1 \le M_0.$$
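The following sketch extracts these index sets from the sequence $\psi_Q(\alpha_k)$ (our illustration of Definitions 2 and 3; the plateau handling follows the definitions by attributing an extremum to the last point of its plateau):

```python
import numpy as np

def local_extrema(psi):
    """Local minimizers (Definition 2) and maximizers (Definition 3) of the
    sequence psi[0..N] on the grid Omega; a plateau's extremum is attributed
    to its last point, i.e. the one with the smallest alpha."""
    psi = np.asarray(psi, dtype=float)
    N = len(psi) - 1
    mins, maxs = [], []
    for k in range(N + 1):
        j = k                          # j: first index of the plateau containing k
        while j > 0 and psi[j - 1] == psi[k]:
            j -= 1
        prev_bigger = j == 0 or psi[j - 1] > psi[k]
        prev_smaller = j > 0 and psi[j - 1] < psi[k]
        if k < N and psi[k] == psi[k + 1]:
            continue                   # not the last point of its plateau
        if k < N and psi[k] < psi[k + 1] and prev_bigger:
            mins.append(k)
        elif k == N and prev_bigger:
            mins.append(k)
        elif 0 < k < N and psi[k] > psi[k + 1] and prev_smaller:
            maxs.append(k)
    return mins, maxs

# plateau demo: minima at indices 1, 4, 6; maxima at 2 and 5
print(local_extrema([3.0, 1.0, 2.0, 0.5, 0.5, 4.0, 2.0]))  # ([1, 4, 6], [2, 5])
```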
Theorem 1.
The following estimates hold for the local minimizers of the function $\psi_Q(\alpha)$.
1. For arbitrary $\alpha_0$, $\alpha_N$ we have
$$\min_{\alpha \in L_{\min}} \|u_\alpha - u_*\| \le q^{-1} C \min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha),$$
$$C := 1 + \max_{1 \le k \le K}\ \max_{\alpha_j \in \Omega,\ M_k \le \alpha_j \le M_{k-1}} T(m_k, \alpha_j) \le 1 + c_q \ln\frac{\alpha_0}{\alpha_N}, \qquad T(\alpha, \beta) := \frac{\|u_\alpha - u_\beta\|}{\psi_Q(\beta)}.$$
2. If $\alpha_0 = \|A^*A\|$, $\alpha_N = \frac{\alpha_0 \|f - f_*\|^2}{(2\|u_*\|)^2}$, then
$$\min_{\alpha \in L_{\min}} \|u_\alpha - u_*\| \le q^{-1} \left( 1 + 2 \max\left\{ 1,\ c_q \left| \ln\frac{\|f - f_*\|}{2\|A u_*\|} \right| \right\} \right) \min_{\alpha > 0} e_2(\alpha, \|f - f_*\|),$$
where $c_q := (q^{-1} - 1)/\ln q^{-1}$ ($c_q \approx 1$ if $q \approx 1$).
Moreover, if $u_* = |A|^p v$, $\|v\| \le \rho$, $p > 0$, where $|A| := (A^*A)^{1/2}$, then
$$\min_{\alpha \in L_{\min}} \|u_\alpha - u_*\| \le c_{p,q}\, \rho^{\frac{1}{p+1}} \left| \ln \|f - f_*\| \right| \|f - f_*\|^{\frac{p}{p+1}}, \qquad 0 < p \le 2.$$
Proof. 
For arbitrary parameters $\alpha \ge 0$, $\beta \ge 0$ the inequalities
$$\|u_\alpha - u_*\| \le \|u_\alpha - u_\beta\| + \|u_\beta - u_*\| \le T(\alpha, \beta)\, \psi_Q(\beta) + e_1(\beta)$$
and (9) lead to the estimate
$$\|u_\alpha - u_*\| \le (1 + T(\alpha, \beta))\, e_1(\beta).$$
It is easy to see that
$$\min_{\alpha_j \in \Omega} e_1(\alpha_j) \le q^{-1} \min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha),$$
since in the case $q\alpha \le \alpha' \le \alpha$ we have $e_1(\alpha') \le q^{-1} e_1(\alpha)$.
Let $\alpha_{j_*} = \alpha_0 q^{j_*}$ be the global minimizer of the function $e_1(\alpha)$ on the set of parameters $\Omega$. Then $\alpha_{j_*} \in [M_k, M_{k-1}]$ for some $k$, $1 \le k \le K$, and this $k$ defines an index $m$ with $m_k = \alpha_m$. From (14) we get the estimate
$$\|u_{m_k} - u_*\| \le (1 + T(m_k, \alpha_{j_*}))\, e_1(\alpha_{j_*}) \le \Big( 1 + \max_{\alpha_j \in \Omega,\ M_k \le \alpha_j \le M_{k-1}} T(m_k, \alpha_j) \Big) \min_{\alpha_j \in \Omega} e_1(\alpha_j),$$
which together with (15) also gives the estimate (11).
Now we show that $C \le 1 + c_q \ln\frac{\alpha_0}{\alpha_N}$. If $m_k \le \alpha_j \le M_{k-1}$, then using Lemma 1 and the equality (6) we get
$$\|u_{\alpha_m} - u_{\alpha_j}\| \le \sum_{i=j}^{m-1} \|u_{\alpha_i} - u_{\alpha_{i+1}}\| \le q^{-1} (1 - q) \sum_{i=j}^{m-1} \psi_Q(\alpha_{i+1}).$$
Due to the inequalities $\psi_Q(\alpha_{i+1}) \le \psi_Q(\alpha_j)$ for all $i$, $j \le i \le m-1$, we have
$$T(m_k, \alpha_j) = \frac{\|u_{\alpha_m} - u_{\alpha_j}\|}{\psi_Q(\alpha_j)} \le \frac{q^{-1} (1 - q) \sum_{i=j}^{m-1} \psi_Q(\alpha_{i+1})}{\psi_Q(\alpha_j)} \le (q^{-1} - 1)(m - j) \le (q^{-1} - 1) N = \frac{q^{-1} - 1}{\ln q^{-1}} \ln\frac{\alpha_0}{\alpha_N} = c_q \ln\frac{\alpha_0}{\alpha_N}.$$
If $M_k \le \alpha_j \le m_k$, then an analogous estimation of $T(m_k, \alpha_j)$ gives the same result.
Now we prove the estimate (12). For the global minimum point $\alpha_*$ of the function $e_2(\alpha, \|f - f_*\|)$ the inequality $\alpha_* \ge \alpha_N$ holds, since for $\alpha < \alpha_N$ we have
$$e_2(\alpha_*) \le \|u_*\| = \frac{\|f - f_*\|}{2\sqrt{\alpha_N}} \le \frac{\|f - f_*\|}{2\sqrt{\alpha}} < e_2(\alpha).$$
In the case $\alpha_* \le \alpha_0$ we get, similarly as in the proof of the estimate (11), that
$$\min_{\alpha \in L_{\min}} \|u_\alpha - u_*\| \le q^{-1} \Big( 1 + c_q \ln\frac{\alpha_0}{\alpha_N} \Big) \min_{\alpha > 0} e_2(\alpha, \|f - f_*\|);$$
due to $\ln\frac{\alpha_0}{\alpha_N} = 2 \ln\frac{2\|u_*\|}{\|f - f_*\|}$, the estimate (12) holds. Consider the case $\alpha_* > \alpha_0$. Then
$$e_2(\alpha_*, \|f - f_*\|) \ge \|u_{\alpha_0}^+ - u_*\| \ge \frac{\alpha_0}{\alpha_0 + \|A^*A\|} \|u_*\| = \frac{\|u_*\|}{2}$$
and for each local minimum point $m_k$, $\alpha_N \le m_k \le \alpha_0$, the inequalities
$$\|u_{m_k} - u_*\| \le e_2(m_k, \|f - f_*\|) \le \|u_{\alpha_0}^+ - u_*\| + 0.5\, \alpha_N^{-1/2} \|f - f_*\| = \|u_{\alpha_0}^+ - u_*\| + \|u_*\| \le 3 \|u_{\alpha_0}^+ - u_*\| \le 3\, e_2(\alpha_*, \|f - f_*\|)$$
hold. Therefore the inequality (12) holds also in this case.
For a source-like solution $u_* = |A|^p v$, $\|v\| \le \rho$, $p > 0$, the error estimate
$$\min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha) \le c_p\, \rho^{1/(p+1)} \|f - f_*\|^{p/(p+1)}, \qquad 0 < p \le 2,$$
is well known (see [1,2]), and the estimate (13) follows immediately from (12). ☐
Remark 2.
Theorem 1 holds also in the case where the equation $A u = f_*$ has only a quasisolution, i.e., in the case $f_* \notin \mathcal{R}(A)$, $Q f_* \in \mathcal{R}(A)$, where $Q$ is the orthoprojector $F \to \overline{\mathcal{R}(A)}$.
Remark 3.
The inequality (11) holds also in the case where the noise of the data is not finite but $\min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha)$ is finite (this holds if $\|A^*(f - f_*)\|$ is finite).
Remark 4.
Use of the inequality (10) enables one to prove the analogue of Theorem 1 for the set $L_{\min}$ of local minimizers of the function $\psi_{QD}(\alpha)$: then the inequality (11) holds with $T(\alpha, \beta) = q^{-1} \frac{\|u_\alpha - u_\beta\|}{\psi_{QD}(\beta)}$.
In the choice of the regularization parameter we may exclude some local minimizers from consideration. It is natural to assume that $\alpha_N$ is so small that
$$d_{MD}(\alpha_N) \le (1 + \epsilon) \|f - f_*\|$$
with a small $\epsilon > 0$. Then the following theorem holds.
Theorem 2.
Let (16) hold. Let $m_{k_0}$ be some local minimizer in $L_{\min}$. Then
$$\min_{\alpha \in L_{\min},\ \alpha \ge m_{k_0}} \|u_\alpha - u_*\| \le \max\Big\{ q^{-1} C_1 \min_{\alpha \ge 0} e_1(\alpha),\ C_2(b, \epsilon) \min_{\alpha \ge 0} e_2(\alpha, \|f - f_*\|) \Big\},$$
where $b = d_{MD}(m_{k_0})/d_{MD}(\alpha_N) \ge 1$, $C_2(b, \epsilon) := b(1 + \epsilon) + 2$ and
$$C_1 := 1 + \max_{1 \le k \le k_0}\ \max_{\alpha_j \in \Omega,\ M_k \le \alpha_j \le M_{k-1}} T(m_k, \alpha_j) \le 1 + c_q \ln\frac{\alpha_0}{m_{k_0}}.$$
Proof. 
Let $\alpha_i^*$, $i = 1, 2$, be the global minimizers of the functions $e_1(\alpha)$ and $e_2(\alpha, \|f - f_*\|)$, respectively. We consider three cases separately. If $m_{k_0} \le \alpha_1^*$, we get, similarly to the proof of Theorem 1, the estimate
$$\min_{\alpha \in L_{\min},\ \alpha \ge m_{k_0}} \|u_\alpha - u_*\| \le q^{-1} C_1 \min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha).$$
If $\alpha_1^* \le m_{k_0} < \alpha_2^*$, we estimate
$$\|u_{m_{k_0}} - u_*\| \le \|u_{\alpha_2^*}^+ - u_*\| + \frac{\|f - f_*\|}{2\sqrt{\alpha_1^*}} \le \min_{\alpha \ge 0} e_2(\alpha, \|f - f_*\|) + \min_{\alpha \ge 0} e_1(\alpha).$$
If $\alpha_1^* \le m_{k_0}$ and $\alpha_2^* \le m_{k_0}$, we have
$$d_{MD}(m_{k_0}) = \|B_{m_{k_0}} (A u_{m_{k_0}} - f)\| = b\, d_{MD}(\alpha_N) \le b (1 + \epsilon) \|f - f_*\|,$$
and now we can prove, analogously to the proof of the weak quasioptimality of the modified discrepancy principle ([37]), that under the assumption $\alpha_2^* \le m_{k_0}$ the error estimate
$$\|u_{m_{k_0}} - u_*\| \le C_2(b, \epsilon) \min_{\alpha \ge 0} e_2(\alpha, \|f - f_*\|)$$
holds. The assertion of Theorem 2 now follows from the inequalities (17)–(19). ☐

4. On Test Problems and Numerical Experiments

We performed numerical experiments for the local minimizers of the function $\psi_Q(\alpha)$ using three sets of test problems. The first set contains 10 well-known test problems from the Regularization Toolbox [11] and the following 6 Fredholm integral equations of the first kind (discretized by the midpoint quadrature formula)
$$\int_a^b K(t, s)\, u(s)\, ds = f(t), \qquad c \le t \le d.$$
  • Groetsch1 [24]: $K(t, s) = \frac{t \exp(-t^2/(4s))}{2\sqrt{\pi}\, s^{3/2}}$, $0 \le s, t \le 100$, $u(s) = 40 + 5\cos((100 - s)/5) + 2.5\cos(2(100 - s)/2.5) + 1.25\cos(4(100 - s)/2)$;
  • Groetsch2 [24]: $K(t, s) = \sum_{k=1}^{100} \frac{\sin(kt)\sin(ks)}{k}$, $0 \le s, t \le \pi$, $u(s) = s(\pi - s)$;
  • Indram [25]: $K(t, s) = e^{-st}$, $0 \le s, t \le 1$, $u(s) = s$, $f(t) = \frac{1 - (t + 1) e^{-t}}{t^2}$;
  • Ursell [11]: $K(t, s) = \frac{1}{1 + s + t}$, $0 \le s, t \le 1$, $u(s) = s(1 - s)$, $f(t) = \frac{3 + 2t}{2} + (2 + 3t + t^2) \log\frac{1 + t}{2 + t}$;
  • Waswaz [26]: $K(t, s) = \cos(t - s)$, $0 \le s, t \le \pi$, $u(s) = \cos(s)$, $f(t) = \frac{\pi}{2} \cos(t)$;
  • Baker [23]: $K(t, s) = e^{st}$, $0 \le s, t \le 1$, $u(s) = e^s$, $f(t) = \frac{e^{t+1} - 1}{t + 1}$.
The second set of test problems consists of well-known problems from [22]: Gauss, Hilbert, Lotkin, Moler, Pascal, Prolate. As in [22], we combined these six $n \times n$ matrices with 6 solution vectors: $x_i = 1$; $x_i = i/n$; $x_i = ((i - [n/2])/[n/2])^2$; $x_i = \sin(2\pi(i-1)/n)$; $x_i = i/n + \frac{1}{4}\sin(2\pi(i-1)/n)$; $x_i = 0$ if $i \le [n/2]$ and $x_i = 1$ if $i > [n/2]$. To get the third set of test problems we combined the matrices of the first set of test problems with the 6 solutions of the second set.
Numerical experiments showed that the performance of the different rules depends essentially on the eigenvalues of the matrix $A^T A$. We characterize these eigenvalues via three indicators: the minimal eigenvalue $\lambda_{\min}$; the value $N_1$, showing the number of eigenvalues less than $\alpha_N$; and the value $\Lambda$, characterizing the density of the eigenvalues on the interval $[\max(\alpha_N, \lambda_{\min}), 1]$. More precisely, let the eigenvalues of the matrix $A^T A$ be $\lambda_1 \ge \lambda_2 \ge \dots \ge \lambda_n = \lambda_{\min}$. Then the value of $\Lambda$ is found by the formula $\Lambda = \max_{\lambda_k > \max(\alpha_N, \lambda_n)} \lambda_k/\lambda_{k+1}$. We characterize the smoothness of the solution by the value
$$p_1 = \frac{\log \min_\alpha e_2(\alpha, \|f - f_*\|) - \log \|u_*\|}{\log \|f - f_*\| - \log \|f_*\|},$$
where $\|f - f_*\| = 10^{-6}$. Table 1 contains these characteristics of the matrix $A^T A$ in the case $n = 100$, $\alpha_N = 10^{-18}$.
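A sketch of how these indicators can be computed (our illustration; the function name and defaults are ours):

```python
import numpy as np

def eig_indicators(A, alpha_N=1e-18):
    """Indicators of Table 1: lambda_min, N1 (number of eigenvalues of
    A^T A below alpha_N) and Lambda (largest ratio of consecutive
    eigenvalues above max(alpha_N, lambda_min))."""
    # eigenvalues of A^T A in decreasing order
    lam = np.sort(np.linalg.svd(A, compute_uv=False)**2)[::-1]
    lam_min = lam[-1]
    N1 = int(np.sum(lam < alpha_N))
    thresh = max(alpha_N, lam_min)
    ratios = [lam[k] / lam[k + 1] for k in range(len(lam) - 1)
              if lam[k] > thresh]
    return lam_min, N1, max(ratios) if ratios else 1.0
```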
In all tests the discretization parameters $n \in \{60, 80, 100, 120, 140, 160, 180\}$ were used. We present the results of the numerical experiments in tables for $n = 100$. Since the performance of the rules generally depends on the smoothness $p$ of the exact solution of (1), we complemented the standard solutions $u_*$ of the (now discrete) test problems with smoothed solutions $|A|^p u_*$, $p = 2$, computing the right-hand side as $A(|A|^p u_*)$. Results for $p = 2$ are given in Table 2; in all other tables and figures $p = 0$. After discretization, all problems were scaled (normalized) in such a way that the norms of the operator and of the right-hand side were 1. All norms here and in the text below are Euclidean norms. On the basis of the exact data $f_*$ we formed the noisy data $f$, where $\|f - f_*\|$ takes the values $10^{-1}, 10^{-2}, \dots, 10^{-6}$, the noise $f - f_*$ has normal distribution and the components of the noise are uncorrelated. We generated 20 noise vectors and used these vectors in all problems. We search for the regularization parameter in the set $\Omega$, where $\alpha_0 = 1$, $q = 0.95$ and $N$ is chosen so that $\alpha_N \ge 10^{-18} > \alpha_{N+1}$. To guarantee that computation errors do not essentially influence the numerical results, the calculations were performed on the geometric sequence of decreasing $\alpha$-s and were stopped at the largest $\alpha$ with $d_{MD}(q\alpha) > d_{MD}(\alpha)$, since theoretically the function $d_{MD}(\alpha)$ is monotonically increasing. Actually, this precautionary measure was needed only in the problem Groetsch2; only in this problem were the calculations stopped at some $\alpha > \alpha_N$. Since in model equations the exact solution is known, it is possible to find the regularization parameter $\alpha_*$ which gives the smallest error on the set $\Omega$. For every rule R the error ratio
$$E = \frac{\|u_{\alpha_R} - u_*\|}{\|u_{\alpha_*} - u_*\|} = \frac{\|u_{\alpha_R} - u_*\|}{\min_{\alpha \in \Omega} \|u_\alpha - u_*\|}$$
describes the performance of the rule R on this particular problem. To compare the rules and to present their properties, the following tables show averages and maximums of these error ratios over various parameters of the data set (problems, noise levels $\delta$). We say that a heuristic rule fails if the error ratio $E > 100$. In addition to the error ratio $E$, in some cases we also present the error ratios
$$E_1 = \frac{\|u_{\alpha_R} - u_*\|}{\min_{\alpha \in \Omega} e_1(\alpha)}, \qquad E_2 = \frac{\|u_{\alpha_R} - u_*\|}{\min_{\alpha \in \Omega} e_2(\alpha)}.$$
The results of the numerical experiments for the local minimizers $\alpha \in L_{\min}$ of the function $\psi_Q(\alpha)$ are given in Table 3. For comparison, the results of the $\delta$-rules with $\delta = \|f - f_*\|$ are presented in columns 2–4. Columns 5 and 6 contain, respectively, the averages and maximums of the error ratios $E$ for the best local minimizer $\alpha \in L_{\min}$. The results show that for many problems the Tikhonov approximation with the best local minimizer $\alpha \in L_{\min}$ is even more accurate than with the $\delta$-rule parameters $\alpha_{ME}$, $\alpha_{MEe}$ or $\alpha_{DP}$. Table 1 and Table 3 also show that for the rules ME and MEe the average error ratio $E$ may be relatively large for problems where $\Lambda$ is large and most of the eigenvalues are smaller than $\alpha_N$; in this case $\min_{\alpha \in \Omega} e(\alpha)$ may be essentially smaller than $\min_{\alpha \in \Omega} e_2(\alpha, \|f_* - f\|)$. In these problems the discrepancy principle gives a better parameter than the ME and MEe rules. The average error of the ME rule was largest in the problem Waswaz2, but the error ratio $E_2$ there is still under 1. This is due to the fact that for problems where $\Lambda$ (defined above) is large, the minimal error may be significantly smaller than $\min e_2(\alpha)$.
Columns 7 and 8 contain the averages and maximums of the cardinalities $|L_{\min}|$ of the sets $L_{\min}$ (the number of elements of these sets). Note that the number of local minimizers depends on the parameter $q$ (for smaller $q$ the number of local minimizers is smaller) and on the length of the minimization interval determined by the parameters $\alpha_N$, $\alpha_0$. The number of local minimizers is also smaller for larger noise levels. Columns 9 and 10 contain the averages and maximums of the values of the constant $C$ in the a posteriori error estimate (11). The value of $C$ and the error estimate (11) allow us to assert that in our test problems the choice of $\alpha$ as the best local minimizer in $L_{\min}$ guarantees that the error of the Tikhonov approximation has the same order as $\min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha)$. Note that over all test problems the maximums of the error ratio $E_1$ for the best local minimizer in $L_{\min}$ and for the discrepancy principle were 1.93 and 9.90, respectively. This confirms the result of Theorem 1 that at least one minimizer of the function $\psi_Q(\alpha)$ is a good regularization parameter.

5. Q-Curve and Triangle Area Rule for Choosing Heuristic Regularization Parameter

We showed in the previous section that at least one local minimizer of the function $\psi_Q(\alpha)$ is a pseudooptimal parameter and that we may omit small local minimizers $\alpha$ for which $d_{MD}(\alpha)$ is only slightly larger than $d_{MD}(\alpha_N)$. For the parameter choice we propose to construct the Q-curve. The Q-curve figure uses the log-log scale with the functions $d_{MD}(\alpha)$ and $\psi_Q(\alpha)$ on the x-axis and y-axis, respectively. The Q-curve can be considered as the analogue of the L-curve, where the functions $\|A u_\alpha - f\|$ and $\|u_\alpha\| = \alpha^{-1} \|A^*(A u_\alpha - f)\|$ are replaced by the functions $\|B_\alpha (A u_\alpha - f)\|$ and $\alpha^{-1} \|A^* B_\alpha^2 (A u_\alpha - f)\|$ (see (4)), respectively. We denote $\tilde d_{MD}(\alpha) := \log_{10} d_{MD}(\alpha)$, $\tilde\psi_Q(\alpha) := \log_{10} \psi_Q(\alpha)$. For many problems the curve $(\tilde d_{MD}(\alpha), \tilde\psi_Q(\alpha))$ (or a part of it) has the form of the letter L or V, and we choose the minimizer at the "corner" point of the L or V. We use the common logarithm instead of the natural logarithm because the Q-curve then allows easier estimation of the supposed value of the noise level. In Figures 1–8, $n = 100$ is used; in Figures 9 and 10, $n = 60$. In Figures 1–4 the L-curves and Q-curves are compared for two problems; the global minimizer $\alpha_{opt}$ of the function $e_1(\alpha)$ is also shown. Note that in the problem Baart $\lambda_{\min} < \alpha_N$ and in the problem Deriv2 $\lambda_{\min} > \alpha_N$.
In most cases one can see on the Q-curve only one clear corner area with one local minimizer, and then we take this local minimizer as the corner point. If the corner area contains several local minimizers, we recommend choosing the local minimizer for which the sum of the coordinates of the corresponding point on the Q-curve is minimal. If the Q-curve has several corner areas, we recommend using the rightmost of them. In practice it is useful to present in the parameter choice, besides the figures, for every local minimizer $m_k$ of the function $\psi_Q(\alpha)$ also the coordinates of the point $(\tilde d_{MD}(m_k), \tilde\psi_Q(m_k))$ and the sums of the coordinates.
For finding the proper local minimizer of the function $\psi_Q(\alpha)$ we now present a rule which works well for all test problems of set 1. The idea of the rule is to search for the proper local minimizer $m_k$ by constructing certain triangles on the Q-curve and finding which of them has the maximal area. To every parameter $\alpha$ corresponds a point $P(\alpha)$ on the Q-curve with the coordinates $(\tilde d_{MD}(\alpha), \tilde\psi_Q(\alpha))$. To every local minimizer $m_k$ of the function $\psi_Q(\alpha)$ corresponds a triangle $T(k, r(k), l(k))$ with the vertices $P(m_k)$, $P(M_{r(k)})$ and $P(M_{l(k)})$ on the Q-curve, where the indices $r(k)$ and $l(k)$ correspond to the largest local maxima of the function $\psi_Q(\alpha)$ on the two sides of the local minimum $m_k$:
$$\psi_Q(M_{r(k)}) = \max_{j < k} \psi_Q(M_j), \qquad \psi_Q(M_{l(k)}) = \max_{j \ge k} \psi_Q(M_j).$$
Triangle area rule (TA-rule). We choose for the regularization parameter the local minimizer $m_k$ of the function $\psi_Q(\alpha)$ for which the area of the triangle $T(k, r(k), l(k))$ is the largest.
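A minimal sketch of the TA-rule (our illustration, not the authors' code; it assumes the log10 values of $d_{MD}$ and $\psi_Q$ on the grid and the index sets from Definitions 2 and 3 are already available):

```python
def ta_rule(d_tilde, psi_tilde, min_idx, max_idx):
    """Triangle area rule on the Q-curve.  d_tilde, psi_tilde: log10 of
    d_MD and psi_Q on the grid; min_idx: grid indices of m_1..m_K (in
    order of decreasing alpha); max_idx: grid indices of M_1..M_{K-1}.
    The end points M_0 (= alpha_0) and M_K (= alpha_N) are added here."""
    N = len(d_tilde) - 1
    M = [0] + list(max_idx) + [N]               # M_0, M_1, ..., M_K

    def P(i):
        return (d_tilde[i], psi_tilde[i])

    def tri_area(a, b, c):                      # shoelace formula
        return 0.5 * abs((b[0] - a[0]) * (c[1] - a[1])
                         - (c[0] - a[0]) * (b[1] - a[1]))

    best, best_area = None, -1.0
    for k, m in enumerate(min_idx, start=1):
        right = max(M[:k], key=lambda i: psi_tilde[i])   # largest max, j < k
        left = max(M[k:], key=lambda i: psi_tilde[i])    # largest max, j >= k
        area = tri_area(P(m), P(right), P(left))
        if area > best_area:
            best, best_area = m, area
    return best                                  # grid index of the chosen m_k
```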
Figures 5–7 show examples of Q-curves and the triangle $T(k, r(k), l(k))$ with the largest area; the TA-rule chooses the corresponding minimizer for the regularization parameter. In some problems the function $\psi_Q(\alpha)$ may be monotonically increasing, as for the problem Groetsch2 (Figure 8); then the function $\psi_Q(\alpha)$ has only one local minimizer $\alpha_N$. In this case the vertices $P(M_{l(k)})$ and $P(m_k)$ coincide and the area of the corresponding triangle is zero; since this is the only triangle, the TA-rule chooses the regularization parameter $\alpha_N$. The results of the numerical experiments on test set 1 ($n = 100$) for the TA-rule and some other rules (see Section 2.2) are given in Table 4 and Table 5. These results show that the TA-rule works well in all these test problems; its accuracy is comparable with the $\delta$-rules (see Table 3), while the previous heuristic rules fail in some problems. Note that the average of the error ratio increases with decreasing noise level. For example, for $\|f - f_*\| \in \{10^{-1}, 10^{-2}, 10^{-5}, 10^{-6}\}$ the corresponding error ratios $E$ were 1.47, 1.49, 1.78 and 2.08, respectively.
Let us comment on the other heuristic rules. The accuracy of the quasi-optimality criterion is for many problems the same as for the TA-rule, but this rule fails in the problem Heat. A characteristic feature of the problem Heat is that the eigenvalues are located sparsely in the interval $[\alpha_N, 1]$ and only a few eigenvalues are smaller than $\alpha_N$ (see Table 1). The weighted quasioptimality criterion behaves similarly to the quasioptimality criterion, but is more accurate in problems where $\lambda_{\min} \le \alpha_N$; if $\lambda_{\min} > \alpha_N$, the quasioptimality criterion is more accurate. The Hanke–Raus rule may fail in test problems with large $\Lambda$, and in most other problems the error of the approximate solution is approximately two times larger than for the parameter chosen by the quasi-optimality criterion. The problem with this rule is that it chooses too large a parameter compared with the optimal parameter. However, the HR-rule is stable in the sense that the largest error ratio $E_2$ is relatively small in all considered test problems. Reginska's rule may fail in many problems, but it has the advantage that it works better than the other previous rules if the noise level is large. Reginska's rule did not fail in the case $\|f - f_*\| \ge 10^{-3}$ and has the averages of the error ratios over all problems $E = 2.24$ and $E = 2.80$ in the cases $\|f - f_*\| = 10^{-1}$ and $\|f - f_*\| = 10^{-2}$, respectively. The advantage of the maximum curvature rule is its small percentage of failures compared with the other previous rules.
The distribution of the error ratios $E$ in Table 5 also shows that on test set 1 the TA-rule is the most accurate of all the considered rules.
Note that the figure of the Q-curve enables one to estimate the reliability of the chosen parameter. If the Q-curve has only one corner, then the chosen parameter is quasioptimal with a small constant $C$ if $\lambda_{\min} < \alpha_N$; in the case $\lambda_{\min} \ge \alpha_N$ it is quasioptimal under the assumption that the problem needs regularization.

6. Further Developments of the Area Rule

The TA-rule may fail for problems which do not need regularization if the function $\psi_Q(\alpha)$ is not monotonically increasing. In this case the TA-rule selects a parameter $\alpha \ge \lambda_{\min}$, but a parameter $\alpha < \lambda_{\min}$ would be better. For example, the TA-rule fails for the matrix Moler in some cases. Let us now consider the question in which cases the regularization parameter $\alpha_N$ is good. If the function $\psi_Q(\alpha)$ is monotonically increasing, then it has only one local minimizer $m_1 = M_1 = \alpha_N$, and for the parameter $\alpha_N$ we have the error estimate
$$\|u_{\alpha_N} - u_*\| \le q^{-1} (1 + T(\alpha_N)) \min_{\alpha_N \le \alpha \le \alpha_0} e_1(\alpha),$$
where the value of $T(\alpha_N) = \max_{\alpha_j \in \Omega,\ \alpha_N \le \alpha_j \le \alpha_0} T(\alpha_N, \alpha_j) \le c_q \ln(\alpha_0/\alpha_N)$ (see Theorem 1) can be computed a posteriori; this value is the smaller, the faster the function $\psi_Q(\alpha)$ increases. We can also take $\alpha_N$ for the regularization parameter in the case where the condition
$$\psi_Q(\alpha') \le c_0\, \psi_Q(\alpha) \qquad \forall \alpha', \alpha \in \Omega, \quad \alpha_N \le \alpha' < \alpha \le \alpha_0$$
holds, since one can show, similarly to the proof of Theorem 1, that then $T(\alpha_N) \le c_0\, c_q \ln(\alpha_0/\alpha_N)$ and the error of the regularized solution is small. For problems which do not need regularization we can improve the performance of the TA-rule by searching for the proper local minimizer among those smaller than or equal to $\alpha_{HQ} := \max\{\alpha_{HR}, \alpha_Q\}$, where $\alpha_{HR}$, $\alpha_Q$ are the global minimizers of the functions $\psi_{HR}(\alpha)$ and $\psi_Q(\alpha)$, respectively, on the interval $[\max(\alpha_N, \lambda_{\min}), \alpha_0]$.
These ideas enable to formulate the following upgraded version of the TA-rule.
Triangle area rule 2 (TA-2-rule). We fix a constant $c_0$, $1 \le c_0 \le 2$. If condition (20) holds, we choose the parameter $\alpha_N$. Otherwise we choose for the regularization parameter the local minimizer $m_k \le \alpha_{HQ}$ of the function $\psi_Q(\alpha)$ for which the area of the triangle $T(k, r(k), l(k))$ is largest.
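Condition (20) is a discrete almost-monotonicity test and is easy to check with a running maximum. A sketch (our illustration; recall that the grid index grows as $\alpha$ decreases):

```python
import numpy as np

def condition_20_holds(psi, c0=2.0):
    """Condition (20): psi_Q(alpha') <= c0 * psi_Q(alpha) whenever
    alpha' < alpha on the grid.  Since the grid index grows as alpha
    decreases, this means no later value may exceed c0 times any
    earlier value."""
    psi = np.asarray(psi, dtype=float)
    suffix_max = np.maximum.accumulate(psi[::-1])[::-1]  # max over indices >= j
    return bool(np.all(suffix_max[1:] <= c0 * psi[:-1]))
```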
The results of the numerical experiments for the rule TA-2 with the discretization parameter $n = 100$ and problem sets 1–3 are given in Table 6 and Table 7 (columns 2 and 3). The results show that the rule TA-2 works well on all considered test sets 1–3. However, the rule TA-2 may fail in some other problems which do not need regularization. Such an example is the problem with the matrix Moler and the solution $x_i = \sin(12\pi(i-1)/n)$, where the rule TA-2 fails if the noise level is below $10^{-4}$; but in this case all other considered heuristic rules fail too.
The rules TA and TA-2 fail in the problem Heat in some cases for the discretization parameter $n = 60$. Figure 9 and Figure 10 show the form of the Q-curve in the problem Heat with $n = 60$. The function $\psi_Q(\alpha)$ has two local minimizers with the corresponding points $P(m_1)$ and $P(m_2)$ on the Q-curve and 3 local maximum points $P(M_k)$, $k = 0, 1, 2$. In Figure 9 the rule TA-2 chooses the local minimizer corresponding to the point $P(m_2)$, but then the error ratios are large: $E = 20.3$, $E_2 = 18.34$.
In the following we consider methods which work well also in this problem. Let $g_{[\alpha_1, \alpha_2]}(\alpha)$, $\alpha \in [\alpha_1, \alpha_2]$, be the parametric representation of the straight line segment connecting the points $P(\alpha_1)$ and $P(\alpha_2)$, thus
$$g_{[\alpha_1, \alpha_2]}(\alpha) = \tilde\psi_Q(\alpha_1) + \beta \left( \tilde d_{MD}(\alpha) - \tilde d_{MD}(\alpha_1) \right), \qquad \beta = \frac{\tilde\psi_Q(\alpha_2) - \tilde\psi_Q(\alpha_1)}{\tilde d_{MD}(\alpha_2) - \tilde d_{MD}(\alpha_1)}.$$
Let $g_{[\alpha_1, \alpha_2, \dots, \alpha_k]}(\alpha)$, $k > 2$, be the parametric representation of the broken line connecting the points $P(\alpha_1), P(\alpha_2), \dots, P(\alpha_k)$, thus
$$g_{[\alpha_1, \alpha_2, \dots, \alpha_k]}(\alpha) = g_{[\alpha_j, \alpha_{j+1}]}(\alpha), \qquad \alpha_j \le \alpha \le \alpha_{j+1}, \quad 1 \le j \le k-1.$$
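The broken lines and the polygon areas used below can be evaluated numerically; a minimal sketch (our illustration, assuming the vertex coordinates on the Q-curve are ordered by increasing $\tilde d_{MD}$):

```python
import numpy as np

def polyline(xs, ys):
    """Piecewise-linear function through the points (xs, ys), xs increasing."""
    xs, ys = np.asarray(xs, float), np.asarray(ys, float)
    return lambda x: np.interp(x, xs, ys)

def area_between(xs, ys):
    """Area S_2 between the broken line t_2 through the Q-curve points
    (xs, ys) and its upper envelope T_2 = max(t_2, chord between the two
    end points), integrated with the trapezoidal rule."""
    t2 = polyline(xs, ys)
    chord = polyline([xs[0], xs[-1]], [ys[0], ys[-1]])
    x = np.linspace(xs[0], xs[-1], 2001)
    gap = np.maximum(t2(x), chord(x)) - t2(x)   # T_2 - t_2 >= 0
    return float(np.sum(0.5 * (gap[:-1] + gap[1:]) * np.diff(x)))
```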
In the triangle rule, the points $P(M_{l(k)})$, $P(m_k)$, $P(M_{r(k)})$ are connected by the broken line $t_1(\alpha) = g_{[M_{l(k)}, m_k, M_{r(k)}]}(\alpha)$, which approximates the error function $\tilde e_1(\alpha) = \log_{10}(e_1(\alpha))$ well if $m_k$ is the "right" local minimizer. In the construction of the function $t_1(\alpha)$ we use only 3 points of the Q-curve. We get a more stable rule if the form of the Q-curve has more influence on the construction of the approximation to the error function $\tilde e_1(\alpha)$. Let $\{i(1), i(2), \dots, i(n_1)\}$ and $\{j(1), j(2), \dots, j(n_2)\}$ be the largest sets of indices satisfying the inequalities
$$k \le i(1) < i(2) < \dots < i(n_1) \le K, \qquad \psi_Q(M_{i(1)}) \le \psi_Q(M_{i(2)}) \le \dots \le \psi_Q(M_{i(n_1)}),$$
$$k > j(1) > j(2) > \dots > j(n_2) \ge 0, \qquad \psi_Q(M_{j(1)}) \le \psi_Q(M_{j(2)}) \le \dots \le \psi_Q(M_{j(n_2)}).$$
It is easy to see that $i(n_1) = l(k)$ and $j(n_2) = r(k)$. For approximating the error function $\tilde e_1(\alpha)$ we propose to connect the points $P(M_{i(n_1)}), \dots, P(M_{i(1)}), P(m_k), P(M_{j(1)}), \dots, P(M_{j(n_2)})$ by the broken line $t_2(\alpha) = g_{[M_{i(n_1)}, \dots, M_{i(1)}, m_k, M_{j(1)}, \dots, M_{j(n_2)}]}(\alpha)$ and to find for every $m_k$ the area $S_2(k)$ of the polygon bounded by the lines $T_2(\alpha) = \max\{t_2(\alpha), g_{[M_{i(n_1)}, M_{j(n_2)}]}(\alpha)\}$ and $t_2(\alpha)$. The second possibility is to approximate the error function $\tilde e_1(\alpha)$ by the curve $t_3(\alpha) = \max\{t_2(\alpha), \tilde\psi_Q(\alpha)\}$ and to find $S_3(k)$ as the area of the polygon bounded by the curves $t_3(\alpha)$ and $T_3(\alpha) = \max\{T_2(\alpha), \tilde\psi_Q(\alpha)\}$. Note that the functions $t_i(\alpha)$, $i = 1, 2, 3$, are monotonically increasing for $\alpha > m_k$ and monotonically decreasing for $\alpha < m_k$.
Area rules 2 and 3. We fix a constant $c_0$, $1 \le c_0 \le 2$. First we choose the local minimizer $m_k \le \alpha_{HQ}$ for which the area $S_i(k)$, $i \in \{2, 3\}$, is largest. We take for the regularization parameter the smallest $m_{k_0} \le m_k$ satisfying the condition (compare with (20))
$$\psi_Q(\alpha') \le c_0\, \psi_Q(\alpha) \qquad \forall \alpha', \alpha \in \Omega, \quad m_{k_0} \le \alpha' < \alpha \le m_k.$$
Let us consider Figure 9. The reason for the failure of the triangle rule is that for the local minimizer $m_2$ the broken line $g_{[M_0, m_2, M_2]}(\alpha)$ does not approximate the function $\tilde e_1(\alpha)$ well, since the point $M_1$ is located above the segment $[m_2, M_0]$. Here the function $\tilde e_1(\alpha)$ is better approximated by the broken line $g_{[M_0, M_1, m_2, M_2]}(\alpha)$, see Figure 10. For the local minimizer $m_1$ we approximate the function $\tilde e_1(\alpha)$ by the broken line $g_{[M_0, m_1, M_1, M_2]}(\alpha)$, and due to the inequality $S_2(1) > S_2(2)$ the area rule 2 chooses $m_1$ for the regularization parameter; then $E = 1.05$.
The area rules 2 and 3 work well in the problem Heat for every $n$ and all $\alpha_N = 10^{-k}$, $12 \le k \le 24$, but in some other problems the accuracy of the area rules 2 and 3 (see columns 4–7 of Table 6 and Table 7) is slightly worse than for the rule TA-2. The advantage of the area rule 3, as compared to the area rule 2, shows up in the problem Heat if all the noise of the right-hand side is placed on one eigenelement (then we use the condition $m_k \le \alpha_{HQ}$ only in the case $\lambda_{\min} > \alpha_N$). Then the area rule 3 did not fail if $n \ge 80$ and $\alpha_N \ge 10^{-20}$. So we can say: the more precisely we take into account the form of the Q-curve in the construction of the approximation to the error function $\tilde e_1(\alpha)$, the more stable the rule is.
Based on the above rules it is possible to formulate a combined rule, which chooses the parameter according to the rule TA-2 or the area rule 3 depending on a certain condition.
Area rule 4 (combined area rule). Fix constants $c_0$, $1 \le c_0 \le 2$, and $b \ge 0$. Let the local minimizer $m_k$ be chosen by the rule TA-2. If
$$\max_{m_k \le \alpha \le M_{r(k)}} \left| \tilde\psi_Q(\alpha) - g_{[m_k, M_{r(k)}]}(\alpha) \right| \ge b,$$
we take $m_k$ for the regularization parameter; otherwise we choose the regularization parameter by the area rule 3.
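A sketch of this test (our illustration; under our reading of the condition above, the Q-curve must deviate from the chord between $P(m_k)$ and $P(M_{r(k)})$ by at least $b$ log10 units for the TA-2 choice to be accepted):

```python
import numpy as np

def corner_is_clear(d_tilde, psi_tilde, m, Mr, b=1.0):
    """Combined rule test on the grid-index range between the chosen
    minimizer m and the maximum point Mr to its right on the Q-curve:
    compare the curve against the straight chord between the two points."""
    idx = np.arange(min(m, Mr), max(m, Mr) + 1)
    xs, ys = np.asarray(d_tilde)[idx], np.asarray(psi_tilde)[idx]
    chord = ys[0] + (ys[-1] - ys[0]) * (xs - xs[0]) / (xs[-1] - xs[0])
    return bool(np.max(np.abs(ys - chord)) >= b)
```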
Note that the combined rule coincides with the rule TA-2 if $b = 0$ and with the area rule 3 if $b = \infty$. Experiments with the combined rule with $c_0 = 2$, $b = 1$ (columns 8 and 9 in Table 6 and Table 7) show that the accuracy of this rule is almost the same as for the triangle rule, but unlike the TA-2 rule it works well also in the problem Heat for all $n$ and $\alpha_N$. Although in some cases in test set 3 the error ratio $E > 100$ for rule 4, the high qualification of the rule is characterized by the fact that over all problem sets 1–3 the largest error ratio $E_1$ was 16.91 (5.06 for set 1) and the largest error ratio $E_2$ was 4.67 (2.62 for set 1). Numerical experiments show that it is reasonable to use the parameter $b \in (0.8, 1.2]$. We studied the behavior of the area rules for different $\alpha_N = 10^{-k}$, $12 \le k \le 24$. The results were similar to those in Table 6 and Table 7, but for smaller $\alpha_N$ the error ratios were 2–3% smaller than for $\alpha_N = 10^{-18}$, and for larger $\alpha_N$ the error ratios were about 5% larger than in Table 6 and Table 7.
Table 2 gives the results of the numerical experiments in the case of a smooth solution, $p = 2$. We see that the combined rule worked well also in this case, with no failures.
Remark 5.
It is possible to modify the Q-curve. We may use the function $\psi_{QD}(\alpha)$ instead of the function $\psi_Q(\alpha)$ and find the proper local minimizer of the function $\psi_{QD}(\alpha)$. Unlike for the quasi-optimality criterion, the use of the function $\psi_{QD}(\alpha)$ in the Q-curve and in the area rule does not increase the amount of computation, since the approximation $u_{2,\alpha}$ is needed in the computation of $d_{MD}(\alpha)$ anyway. We can use in these rules the function $d_{ME}(\alpha)$ instead of $d_{MD}(\alpha)$; this increases the accuracy in some problems, but the average accuracy of the rules is almost the same. In the case of nonsmooth solutions we can modify the Q-curve method and the area rule by using the function $d_D(\alpha)$ instead of $d_{MD}(\alpha)$. In this case we get even better results for $p = 0$, but for $p = 2$ the error ratio $E$ is on average 2 times higher.
Note that if the solution is smooth, then the L-curve rule and Reginska's rule often fail, but replacing in these rules the function $\|A u_\alpha - f\|$ by the function $\|B_\alpha (A u_\alpha - f)\|$ often gives better results.
Remark 6.
If the solution is smooth, then, using the $\alpha$-s from (8), a much better approximate solution than the single Tikhonov approximation may be obtained by using linear combinations of Tikhonov approximations, see [40].
In the case of a heuristic parameter choice it is also possible to use a posteriori estimates of the error of the approximate solution, which in many problems allow one to confirm the reliability of the parameter choice. Let $\alpha_H$ be the regularization parameter from some heuristic rule and $\alpha_*$ be the minimizer of the function $e_1(\alpha)$ on the set $\Omega$. Then in the case $\alpha_* \ge \alpha_H$ the error estimate
$$\|u_{\alpha_H} - u_*\| \le (1 + T(\alpha_H, \alpha_*))\, e_1(\alpha_*) \le q^{-1} (1 + T_1(\alpha_H)) \min_\alpha e_1(\alpha)$$
holds, where $T_1(\alpha_H) = \max_{\alpha \ge \alpha_H,\ \alpha \in \Omega} T(\alpha_H, \alpha)$. Using the last estimate, we can prove similarly to Theorem 2 that if $\alpha_N$ is so small that $d_{MD}(\alpha_N) \le (1 + \epsilon) \|f - f_*\|$, then
$$\|u_{\alpha_H} - u_*\| \le \max\Big\{ q^{-1} (1 + T_1(\alpha_H)) \min_{\alpha \ge 0} e_1(\alpha),\ C_2(b, \epsilon) \min_{\alpha \ge 0} e_2(\alpha, \|f - f_*\|) \Big\},$$
where $b = d_{MD}(\alpha_H)/d_{MD}(\alpha_N)$. If the values $T_1(\alpha_H)$ and $b$, which we find a posteriori, are small (for example $b \le 2$ and $T_1(\alpha_H) \le 9$), then this estimate allows us to argue that the error of the approximate solution for this parameter is not much larger than the minimal error. The conditions $b \le 2$, $T_1(\alpha_H) \le 9$ were satisfied in set 1 of the test problems for the combined rule in 73% of the cases, and the inequalities $b \le 2$, $T_1(\alpha_H) \le 4$ in 61% of the cases. The typical reason for the failure of a heuristic rule is that the chosen parameter is too small. To check this, we can use the error estimate (21). If $T_1(\alpha_H)$ is relatively small (for example $T_1(\alpha_H) \le 9$), then the estimate (21) allows us to argue that the regularization parameter is not chosen too small. In set 1 of the test problems the conditions $T_1(\alpha_H) \le 9$ and $T_1(\alpha_H) \le 4$ were satisfied in 97% and in 82% of the cases, respectively.
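These a posteriori indicators are cheap to evaluate once the SVD is available. A sketch (our illustration; all names are ours):

```python
import numpy as np

def posterior_check(A, f, alphas, alpha_H):
    """A posteriori reliability indicators for a heuristically chosen grid
    point alpha_H: T_1(alpha_H) = max over grid points alpha >= alpha_H of
    ||u_{alpha_H} - u_alpha|| / psi_Q(alpha), and b = d_MD(alpha_H)/d_MD(alpha_N)."""
    U, s, Vt = np.linalg.svd(A, full_matrices=False)
    beta = U.T @ f
    r_perp = np.linalg.norm(f - U @ beta)     # residual part outside span(U)

    def coef(a):                              # SVD coefficients of u_a
        return s * beta / (a + s**2)

    def d_MD(a):
        w = np.sqrt(a / (a + s**2))           # spectral weights of B_a
        rc = -a * beta / (a + s**2)           # U^T (A u_a - f)
        return np.hypot(np.linalg.norm(w * rc), r_perp)

    cH = coef(alpha_H)
    T1 = max(np.linalg.norm(cH - coef(a))
             / np.linalg.norm(a * s * beta / (a + s**2)**2)
             for a in alphas if a >= alpha_H)
    return T1, d_MD(alpha_H) / d_MD(min(alphas))
```

Small returned values (for example $b \le 2$ and $T_1(\alpha_H) \le 9$, as above) support the reliability of the chosen parameter.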

7. Conclusions

We finish the paper with the following conclusions. For the heuristic choice of the regularization parameter we recommend choosing the parameter from the set of local minimizers of the function $\psi_Q(\alpha)$ or of the function $\psi_{QD}(\alpha)$. For the choice of the parameter from among the local minimizers we proposed the Q-curve method and different area rules. The proposed rules gave much better results than previous heuristic rules on an extensive set of test problems. The area rules fail in very few cases in comparison with previous rules, and the accuracy of these rules is comparable even with the $\delta$-rules that use the exact noise level. In addition, we also provided a posteriori error estimates of the approximate solution, which allow one to check the reliability of a heuristically chosen parameter.

Author Contributions

Writing—original draft preparation, T.R.; writing—review and editing, U.H. Both authors have read and agreed to the published version of the manuscript.

Funding

The authors are supported by “Personal research funding: Team grant” project PRG864 of the Estonian Ministry of Education and Research.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Engl, H.W.; Hanke, M.; Neubauer, A. Regularization of Inverse Problems; Mathematics and Its Applications, Volume 375; Kluwer: Dordrecht, The Netherlands, 1996.
2. Vainikko, G.M.; Veretennikov, A.Y. Iteration Procedures in Ill-Posed Problems; Nauka: Moscow, Russia, 1986. (In Russian)
3. Bauer, F.; Kindermann, S. The quasi-optimality criterion for classical inverse problems. Inverse Probl. 2008, 24, 035002.
4. Hämarik, U.; Palm, R.; Raus, T. On minimization strategies for choice of the regularization parameter in ill-posed problems. Numer. Funct. Anal. Optim. 2009, 30, 924–950.
5. Kindermann, S. Convergence analysis of minimization-based noise level-free parameter choice rules for linear ill-posed problems. Electron. Trans. Numer. Anal. 2011, 38, 233–257.
6. Kindermann, S. Discretization independent convergence rates for noise level-free parameter choice rules for the regularization of ill-conditioned problems. Electron. Trans. Numer. Anal. 2013, 40, 58–81.
7. Kindermann, S.; Neubauer, A. On the convergence of the quasioptimality criterion for (iterated) Tikhonov regularization. Inverse Probl. Imaging 2008, 2, 291–299.
8. Neubauer, A. The convergence of a new heuristic parameter selection criterion for general regularization methods. Inverse Probl. 2008, 24, 055005.
9. Tikhonov, A.N.; Glasko, V.B.; Kriksin, Y. On the question of quasioptimal choice of a regularized approximation. Sov. Math. Dokl. 1979, 20, 1036–1040.
10. Hansen, P.C. Analysis of discrete ill-posed problems by means of the L-curve. SIAM Rev. 1992, 34, 561–580.
11. Hansen, P.C. Regularization tools: A Matlab package for analysis and solution of discrete ill-posed problems. Numer. Algorithms 1994, 6, 1–35.
12. Golub, G.H.; Heath, M.; Wahba, G. Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 1979, 21, 215–223.
13. Hanke, M.; Raus, T. A general heuristic for choosing the regularization parameter in ill-posed problems. SIAM J. Sci. Comput. 1996, 17, 956–972.
14. Reginska, T. A regularization parameter in discrete ill-posed problems. SIAM J. Sci. Comput. 1996, 17, 740–749.
15. Gockenbach, M.S.; Gorgin, E. On the convergence of a heuristic parameter choice rule for Tikhonov regularization. SIAM J. Sci. Comput. 2018, 40, A2694–A2719.
16. Hämarik, U.; Kangro, U.; Kindermann, S.; Raik, K. Semi-heuristic parameter choice rules for Tikhonov regularisation with operator perturbations. J. Inverse Ill Posed Probl. 2019, 27, 117–131.
17. Hochstenbach, M.E.; Reichel, L.; Rodriguez, G. Regularization parameter determination for discrete ill-posed problems. J. Comput. Appl. Math. 2015, 273, 132–149.
18. Palm, R. Numerical Comparison of Regularization Algorithms for Solving Ill-Posed Problems. Ph.D. Thesis, University of Tartu, Tartu, Estonia, 2010.
19. Bauer, F.; Lukas, M.A. Comparing parameter choice methods for regularization of ill-posed problems. Math. Comput. Simul. 2011, 81, 1795–1841.
20. Bakushinskii, A.B. Remarks on choosing a regularization parameter using the quasi-optimality and ratio criterion. Comput. Math. Math. Phys. 1984, 24, 181–182.
21. Raus, T.; Hämarik, U. Heuristic parameter choice in Tikhonov method from minimizers of the quasi-optimality function. In New Trends in Parameter Identification for Mathematical Models; Hofmann, B., Leitão, A., Zubelli, J., Eds.; Birkhäuser: Basel, Switzerland, 2018; pp. 227–244.
22. Brezinski, C.; Rodriguez, G.; Seatzu, S. Error estimates for linear systems with applications to regularization. Numer. Algorithms 2008, 49, 85–104.
23. Baker, C.T.H.; Fox, L.; Mayers, D.F.; Wright, K. Numerical solution of Fredholm integral equations of the first kind. Comput. J. 1964, 7, 141–148.
24. Groetsch, C.W. Integral equations of the first kind, inverse problems and regularization: A crash course. J. Phys. Conf. Ser. 2007, 73, 012001.
25. Indratno, S.W.; Ramm, A.G. An iterative method for solving Fredholm integral equations of the first kind. Int. J. Comput. Sci. Math. 2009, 2, 354–379.
26. Wazwaz, A.-M. The regularization method for Fredholm integral equations of the first kind. Comput. Math. Appl. 2011, 61, 2981–2986.
27. Morozov, V.A. On the solution of functional equations by the method of regularization. Sov. Math. Dokl. 1966, 7, 414–417.
28. Gfrerer, H. An a posteriori parameter choice for ordinary and iterated Tikhonov regularization of ill-posed problems leading to optimal convergence rates. Math. Comput. 1987, 49, 507–522.
29. Raus, T. On the discrepancy principle for solution of ill-posed problems with non-selfadjoint operators. Acta Comment. Univ. Tartu. 1985, 715, 12–20. (In Russian)
30. Hämarik, U.; Kangro, U.; Palm, R.; Raus, T.; Tautenhahn, U. Monotonicity of error of regularized solution and its use for parameter choice. Inverse Probl. Sci. Eng. 2014, 22, 10–30.
31. Tautenhahn, U.; Hämarik, U. The use of monotonicity for choosing the regularization parameter in ill-posed problems. Inverse Probl. 1999, 15, 1487–1505.
32. Hämarik, U.; Palm, R.; Raus, T. Comparison of parameter choices in regularization algorithms in case of different information about noise level. Calcolo 2011, 48, 47–59.
33. Hämarik, U.; Palm, R.; Raus, T. A family of rules for parameter choice in Tikhonov regularization of ill-posed problems with inexact noise level. J. Comput. Appl. Math. 2012, 36, 221–233.
34. Raus, T.; Hämarik, U. New rule for choice of the regularization parameter in (iterated) Tikhonov method. Math. Model. Anal. 2009, 14, 187–198.
35. Raus, T.; Hämarik, U. On numerical realization of quasioptimal parameter choices in (iterated) Tikhonov and Lavrentiev regularization. Math. Model. Anal. 2009, 14, 99–108.
36. Raus, T. About regularization parameter choice in case of approximately given error bounds of data. Acta Comment. Univ. Tartu. 1992, 937, 77–89.
37. Raus, T.; Hämarik, U. On the quasioptimal regularization parameter choices for solving ill-posed problems. J. Inverse Ill Posed Probl. 2007, 15, 419–439.
38. Calvetti, D.; Reichel, L.; Shuibi, A. L-curve and curvature bounds for Tikhonov regularization. Numer. Algorithms 2004, 35, 301–314.
39. Hansen, P.C. Rank-Deficient and Discrete Ill-Posed Problems; SIAM: Philadelphia, PA, USA, 1998.
40. Hämarik, U.; Palm, R.; Raus, T. Extrapolation of Tikhonov regularization method. Math. Model. Anal. 2010, 15, 55–68.
Figure 1. L-curve for Baart.
Figure 2. Q-curve for Baart.
Figure 3. L-curve for Deriv2.
Figure 4. Q-curve for Deriv2.
Figure 5. Q-curve in Heat.
Figure 6. Q-curve in Spikes.
Figure 7. Q-curve in Foxgood.
Figure 8. Q-curve in Groetsch2.
Figure 9. Q-curve, problem Heat, n = 60.
Figure 10. Q-curve, problem Heat, n = 60.
Table 1. Characteristics of the matrix $A^T A$ and the solution $u_*$.

| Problem | λ_min | N1 | Λ | p1 | Problem | λ_min | N1 | Λ | p1 |
|---|---|---|---|---|---|---|---|---|---|
| Baart | 5.2 × 10^{−35} | 92 | 1665.7 | 0.197 | Spikes | 1.3 × 10^{−33} | 89 | 1529.3 | 0.005 |
| Deriv2 | 6.7 × 10^{−9} | 0 | 16.0 | 0.286 | Wing | 2.9 × 10^{−37} | 94 | 9219.1 | 0.057 |
| Foxgood | 9.0 × 10^{−33} | 85 | 210.1 | 0.426 | Baker | 1.0 × 10^{−33} | 94 | 9153.1 | 0.498 |
| Gravity | 1.6 × 10^{−33} | 68 | 4.1 | 0.403 | Ursell | 6.9 × 10^{−34} | 94 | 3090.2 | 0.143 |
| Heat | 5.5 × 10^{−33} | 3 | 2.4 × 10^{20} | 0.341 | Indramm | 2.7 × 10^{−33} | 94 | 9154.6 | 0.395 |
| Ilaplace | 3.8 × 10^{−33} | 79 | 16.1 | 0.211 | Waswaz2 | 2.0 × 10^{−34} | 98 | 1.7 × 10^{30} | 0.654 |
| Phillips | 1.4 × 10^{−13} | 0 | 9.4 | 0.471 | Groetsch1 | 5.8 × 10^{−33} | 78 | 11.2 | 0.176 |
| Shaw | 2.3 × 10^{−34} | 85 | 289.7 | 0.244 | Groetsch2 | 1.0 × 10^{−4} | 0 | 4.0 | 0.652 |
Table 2. Results of the numerical experiments, p = 2.

| Problem | ME: Aver E | MEe: Aver E | DP: Aver E | Best of L_min: Aver E | card(L_min): Aver | Combined rule: Aver E | Combined rule: Max E |
|---|---|---|---|---|---|---|---|
| Baart | 1.86 | 1.19 | 2.93 | 1.18 | 4.74 | 1.60 | 14.57 |
| Deriv2 | 1.09 | 1.19 | 3.65 | 1.03 | 2.00 | 1.04 | 1.17 |
| Foxgood | 1.56 | 1.13 | 3.58 | 1.14 | 2.08 | 1.22 | 3.58 |
| Gravity | 1.33 | 1.05 | 2.65 | 1.09 | 1.72 | 1.14 | 3.18 |
| Heat | 1.13 | 1.12 | 2.55 | 1.05 | 2.10 | 1.05 | 1.14 |
| Ilaplace | 1.47 | 1.06 | 2.78 | 1.11 | 2.73 | 1.13 | 3.51 |
| Phillips | 1.26 | 1.06 | 3.35 | 1.04 | 2.10 | 1.04 | 1.20 |
| Shaw | 1.37 | 1.06 | 2.58 | 1.11 | 3.72 | 1.29 | 8.96 |
| Spikes | 1.85 | 1.12 | 2.10 | 1.19 | 4.78 | 1.31 | 5.75 |
| Wing | 1.67 | 1.14 | 2.47 | 1.22 | 4.53 | 1.75 | 6.63 |
| Baker | 2.11 | 1.29 | 2.96 | 1.21 | 4.38 | 1.77 | 11.33 |
| Ursell | 1.86 | 1.19 | 4.10 | 1.16 | 4.82 | 1.67 | 18.08 |
| Indramm | 1.69 | 1.14 | 2.87 | 1.28 | 4.53 | 1.91 | 6.42 |
| Waswaz2 | 127.2 | 49.8 | 1.20 | 2.44 | 1.00 | 2.43 | 9.01 |
| Groetsch1 | 1.40 | 1.06 | 2.36 | 1.11 | 2.14 | 1.14 | 4.56 |
| Groetsch2 | 1.02 | 1.23 | 1.71 | 1.14 | 1.67 | 1.55 | 3.97 |
| Set 1 | 9.37 | 4.18 | 2.74 | 1.22 | 3.06 | 1.44 | 18.08 |
| Set 2 | 2.10 | 1.26 | 2.91 | 1.19 | 2.83 | 1.37 | 29.03 |
| Set 3 | 6.86 | 3.21 | 2.68 | 1.18 | 3.12 | 1.42 | 52.98 |
Table 3. Results for the set $L_{\min}$.

| Problem | ME: Aver E | MEe: Aver E | DP: Aver E | Best of L_min: Aver E | Best of L_min: Max E | card(L_min): Aver | card(L_min): Max | Apost. C: Aver | Apost. C: Max |
|---|---|---|---|---|---|---|---|---|---|
| Baart | 1.43 | 1.32 | 1.37 | 1.23 | 2.51 | 6.91 | 8 | 3.19 | 3.72 |
| Deriv2 | 1.29 | 1.07 | 1.21 | 1.08 | 1.34 | 1.71 | 2 | 3.54 | 4.49 |
| Foxgood | 1.98 | 1.42 | 1.34 | 1.47 | 6.19 | 3.63 | 6 | 3.72 | 4.16 |
| Gravity | 1.40 | 1.13 | 1.16 | 1.13 | 1.83 | 1.64 | 3 | 3.71 | 4.15 |
| Heat | 1.19 | 1.03 | 1.05 | 1.12 | 2.36 | 3.19 | 5 | 3.92 | 4.50 |
| Ilaplace | 1.33 | 1.21 | 1.26 | 1.20 | 2.56 | 2.64 | 5 | 4.84 | 6.60 |
| Phillips | 1.27 | 1.02 | 1.02 | 1.06 | 1.72 | 2.14 | 3 | 3.99 | 4.66 |
| Shaw | 1.37 | 1.24 | 1.28 | 1.19 | 2.15 | 4.68 | 7 | 3.48 | 4.43 |
| Spikes | 1.01 | 1.00 | 1.01 | 1.00 | 1.02 | 8.83 | 10 | 3.27 | 3.70 |
| Wing | 1.16 | 1.13 | 1.15 | 1.09 | 1.38 | 5.20 | 6 | 3.07 | 3.72 |
| Baker | 3.91 | 2.38 | 2.09 | 2.31 | 16.17 | 5.38 | 6 | 3.14 | 3.72 |
| Ursell | 2.14 | 1.97 | 2.03 | 1.69 | 4.44 | 5.53 | 6 | 3.07 | 3.43 |
| Indramm | 5.20 | 3.26 | 3.37 | 3.38 | 25.67 | 5.64 | 6 | 3.08 | 3.71 |
| Waswaz2 | 127.2 | 49.9 | 1.20 | 2.44 | 9.03 | 1.00 | 1 | 2.00 | 2.00 |
| Groetsch1 | 1.12 | 1.07 | 1.08 | 1.06 | 1.51 | 3.99 | 7 | 4.23 | 5.20 |
| Groetsch2 | 1.02 | 1.22 | 1.67 | 1.13 | 1.69 | 1.67 | 2 | 5.62 | 13.72 |
| Set 1 | 9.62 | 4.46 | 1.46 | 1.48 | 25.67 | 3.99 | 10 | 3.67 | 13.72 |
| Set 2 | 1.57 | 1.32 | 1.36 | 1.20 | 5.33 | 4.40 | 10 | 3.50 | 5.43 |
| Set 3 | 7.19 | 3.45 | 1.47 | 1.48 | 61.02 | 3.64 | 10 | 3.73 | 9.12 |
Table 4. Averages of error ratios E and failure % (in parentheses) for heuristic rules.

| Problem | TA rule: Mean E | TA rule: Max E | Quasiopt.: Mean E | WQ: Mean E | HR: Mean E | Reginska: Mean E | MCurv: Mean E |
|---|---|---|---|---|---|---|---|
| Baart | 1.51 | 14.57 | 1.54 | 1.43 | 2.58 | 1.32 | 4.75 |
| Deriv2 | 1.18 | 1.27 | 2.01 | 2.26 | 2.28 | 3.67 | (9.2%) |
| Foxgood | 1.56 | 3.39 | 1.57 | 1.57 | 8.36 | (10.8%) | 5.95 |
| Gravity | 1.14 | 2.27 | 1.13 | 1.13 | 2.66 | (0.8%) | 2.04 |
| Heat | 1.26 | 1.34 | (65.8%) | (66.8%) | 1.64 | (4.2%) | 4.11 |
| Ilaplace | 1.24 | 2.34 | 1.24 | 1.22 | 1.94 | 1.66 | 2.99 |
| Phillips | 1.07 | 1.20 | 1.09 | (3.3%) | 2.27 | (44.2%) | 1.34 |
| Shaw | 1.42 | 8.96 | 1.43 | 1.41 | 2.34 | 1.80 | 4.64 |
| Spikes | 1.01 | 5.75 | 1.01 | 1.01 | 1.03 | 1.01 | 1.05 |
| Wing | 1.39 | 6.63 | 1.40 | 1.30 | 1.51 | 1.18 | 1.57 |
| Baker | 3.30 | 11.33 | 3.30 | 3.30 | (0.8%) | (21.7%) | 7.78 |
| Ursell | 2.87 | 31.06 | 3.54 | 2.35 | 4.71 | 1.86 | 7.54 |
| Indramm | 3.74 | 9.07 | 4.43 | 4.16 | (2.5%) | (9.2%) | (15.8%) |
| Waswaz2 | 2.43 | 9.01 | 2.43 | 2.43 | (65.8%) | 2.33 | (3.3%) |
| Groetsch1 | 1.14 | 4.56 | 1.14 | 1.12 | 1.61 | 1.26 | 1.52 |
| Groetsch2 | 1.13 | 1.74 | 1.27 | 2.73 | 1.66 | 5.49 | 1.81 |
| Total | 1.71 | 31.06 | >100 | >100 | 50.5 | 43.8 | 8.17 |
| Failure % | 0% | | 4.11% | 4.38% | 4.32% | 5.68% | 1.77% |
| Max E | 22.61 | | >100 | >100 | 2.63 | >100 | 24.5 |
Table 5. Distribution of error ratios E in different rules.

| Decile | TA rule | Quasiopt. | WQ | HR | Reginska | MCurv | ME | MEe | DP |
|---|---|---|---|---|---|---|---|---|---|
| 10 | 1.00 | 1.00 | 1.00 | 1.08 | 1.00 | 1.06 | 1.01 | 1.00 | 1.00 |
| 20 | 1.01 | 1.01 | 1.01 | 1.36 | 1.04 | 1.27 | 1.03 | 1.00 | 1.01 |
| 30 | 1.02 | 1.03 | 1.03 | 1.56 | 1.12 | 1.48 | 1.09 | 1.01 | 1.02 |
| 40 | 1.04 | 1.06 | 1.06 | 1.82 | 1.27 | 1.83 | 1.16 | 1.03 | 1.04 |
| 50 | 1.09 | 1.13 | 1.12 | 2.12 | 1.66 | 2.31 | 1.22 | 1.08 | 1.08 |
| 60 | 1.18 | 1.29 | 1.29 | 2.43 | 2.42 | 3.05 | 1.33 | 1.16 | 1.16 |
| 70 | 1.35 | 1.57 | 1.59 | 3.19 | 4.19 | 4.51 | 1.52 | 1.29 | 1.30 |
| 80 | 1.71 | 2.17 | 2.29 | 5.94 | 9.93 | 7.03 | 2.02 | 1.50 | 1.52 |
| 90 | 2.27 | 6.45 | 6.18 | 19.35 | 43.91 | 12.95 | 4.45 | 2.88 | 2.11 |
Table 6. Averages and maximums of error ratios E in case of area rules, problem set 1.

| Problem | TA-2 rule: Aver E | TA-2 rule: Max E | Area rule 2: Aver E | Area rule 2: Max E | Area rule 3: Aver E | Area rule 3: Max E | Combined rule: Aver E | Combined rule: Max E |
|---|---|---|---|---|---|---|---|---|
| Baart | 1.51 | 5.18 | 1.58 | 2.91 | 1.59 | 2.91 | 1.53 | 5.18 |
| Deriv2 | 1.12 | 1.42 | 1.12 | 1.42 | 1.12 | 1.42 | 1.12 | 1.42 |
| Foxgood | 1.57 | 6.69 | 1.53 | 6.19 | 1.53 | 6.19 | 1.57 | 6.69 |
| Gravity | 1.17 | 4.12 | 1.21 | 6.10 | 1.21 | 6.10 | 1.17 | 4.12 |
| Heat | 1.12 | 2.36 | 1.12 | 2.36 | 1.12 | 2.36 | 1.12 | 2.36 |
| Ilaplace | 1.24 | 2.68 | 1.22 | 2.68 | 1.22 | 2.68 | 1.24 | 2.68 |
| Phillips | 1.07 | 1.72 | 1.06 | 1.72 | 1.06 | 1.72 | 1.07 | 1.72 |
| Shaw | 1.42 | 3.72 | 1.47 | 3.64 | 1.47 | 3.64 | 1.42 | 3.72 |
| Spikes | 1.01 | 1.05 | 1.01 | 1.02 | 1.01 | 1.02 | 1.01 | 1.05 |
| Wing | 1.39 | 1.86 | 1.44 | 1.86 | 1.44 | 1.86 | 1.39 | 1.86 |
| Baker | 3.30 | 45.29 | 2.67 | 22.67 | 2.67 | 22.67 | 2.91 | 33.12 |
| Ursell | 2.87 | 16.78 | 4.55 | 27.92 | 4.55 | 27.92 | 3.12 | 16.78 |
| Indramm | 3.74 | 25.67 | 9.50 | 83.20 | 10.76 | 83.20 | 3.87 | 25.67 |
| Waswaz2 | 2.43 | 9.01 | 2.43 | 9.01 | 2.43 | 9.01 | 2.43 | 9.01 |
| Groetsch1 | 1.14 | 2.12 | 1.15 | 2.12 | 1.15 | 2.12 | 1.14 | 2.12 |
| Groetsch2 | 1.52 | 3.84 | 1.52 | 3.84 | 1.52 | 3.84 | 1.52 | 3.84 |
| Total | 1.73 | 45.29 | 2.16 | 83.20 | 2.22 | 83.20 | 1.73 | 33.12 |
Table 7. Averages and maximums of error ratios E in proposed rules, problem sets 2 and 3.

| Problem | TA-2 rule: Aver E | TA-2 rule: Max E | Area rule 2: Aver E | Area rule 2: Max E | Area rule 3: Aver E | Area rule 3: Max E | Combined rule: Aver E | Combined rule: Max E |
|---|---|---|---|---|---|---|---|---|
| Gauss | 1.24 | 5.05 | 1.26 | 6.56 | 1.26 | 6.56 | 1.24 | 5.05 |
| Hilbert | 1.46 | 7.25 | 1.83 | 21.22 | 1.81 | 21.22 | 1.46 | 7.25 |
| Lotkin | 1.47 | 11.17 | 1.91 | 18.66 | 1.88 | 11.17 | 1.47 | 11.17 |
| Moler | 1.51 | 7.35 | 1.43 | 7.35 | 1.43 | 7.35 | 1.51 | 7.35 |
| Prolate | 1.57 | 15.96 | 1.82 | 20.64 | 1.77 | 15.96 | 1.58 | 15.96 |
| Pascal | 1.04 | 1.13 | 1.06 | 1.18 | 1.06 | 1.18 | 1.05 | 1.18 |
| Set 2 | 1.38 | 15.96 | 1.55 | 21.22 | 1.53 | 21.22 | 1.39 | 15.96 |
| Set 3 | 1.85 | 136.6 | 2.81 | 188.1 | 2.77 | 188.1 | 2.02 | 153.5 |
