1. Introduction
Graphics, pictures, computer images, computer vision and digital patterns are often distorted by linear or nonlinear transformations. The algorithms that correctly convert images and patterns under these transformations and restore their original images and patterns are essential for applications in image processing, pattern recognition and artificial intelligence (AI). We solicited numerical algorithms for image geometric transformations and proposed basic algorithms in a book [
1] in 1989, which were developed from [
2,
3,
4,
5]. Since then, we have focused on refining existing numerical algorithms, with significant advancements reported in [
1,
6,
7]. Geometric transformations heavily depend on shape boundaries, which are particularly important in applications like face fusion [
8]. This paper extends the splitting algorithms of [
9], with a focus on rigorous error analysis and broader applications. AI relies on three elements: data, models and algorithms. The new trend in AI is to significantly improve algorithmic efficiency, (see DeepSeek [
10]); such a trend has guided our study for many years. A systematic summary of new numerical algorithms for image geometric transformations is provided in the new book [
11].
Let us introduce the integral model of images and its basic algorithms (the model and algorithms as AI’s terminology). The digital image in 2D is denoted by a matrix with non-negative entries representing the pixel greyness. Each pixel greyness can also be regarded as the mean of a greyness function over a small pixel region, leading to a 2D integral. Hence, the image transformation can be reduced to numerical integration. Based on this idea, we develop the splitting-integrating method (SIM) in [
1], which is, indeed, the composite centroid rule in numerical analysis (see Atkinson [
12] and Davis and Rabinowitz [
13]). The advantage of SIM is the simplicity of algorithms, in particular for the restoration of images because nonlinear solutions are not required. Although the smooth features of the piecewise integrand in the integral are different in different subregions, the same centroid rule is always chosen. In applications, we choose the simple piecewise constant interpolations as
and the piecewise linear interpolations as
. Our goal is to achieve the optimal convergence rates
of sequential image greyness errors, where
N is the division number of a pixel split into
.
We may apply SIM for the forward transformation
T, combine the SIM for
, and develop a combination CIIM for a cycle transformation
. A drawback of SIM for
T is that nonlinear solutions may be involved. When the nonlinear transformations are not complicated, the Newton iteration method may be used to carry out the nonlinear solutions. In this case, the combination CIIM is studied in ([
11], Chapters 4 and 5) to provide the optimal sequential convergence rates
,
. The CIIM can also be applied to
n-dimensional image transformation reported in ([
11], Chapter 2). However, for complicated nonlinear transformations
T, solving the associated nonlinear equations becomes challenging, limiting the practicality of SIM and CIIM. For image conversion under a geometric transformation
T, the splitting-shooting method (SSM) does not need nonlinear solutions. Hence, we may combine SSM for T and SIM for
to develop a combination CSIM for
. A strict analysis of the CSIM was given in Li and Bai [
7], to reveal that only the low convergence rates
of image greyness can be obtained for both
and
. Also, by probability analysis, a higher convergence rate as
in probability can be reached in [
7]. For 256 greyness-level images under a nonlinear transformation with a double enlargement, the division number
N should be chosen as large as
or 64 to provide satisfactory pictures (see [
7]). For 256 real greyscale images, CSIM becomes impractical due to the prohibitively large number of pixels. In general, the original CSIM is suited well to a few (≤16) greyness level images. A sophisticated partition technique is also invented for CSIM, leading to the advanced combination
and
in ([
11], Chapter 7), to reduce the division number
N significantly.
Now, we recast CIIM and study new techniques embedded into the algorithm SIM for
T to reduce and even bypass completely the nonlinear solutions. This is important to the wide application of CIIM. An interpolation technique is given in [
6] to reduce the number of solutions to nonlinear equations from
down to
, where
is the total number of image pixels. The piecewise bilinear interpolations are chosen in [
6] based on the exact values of
and
at the semi-points
. In this paper, we also solicit the interpolation to reduce the iteration number but based on the exact values of
and
at the pixel points
. The algorithms in [
6] are denoted as
for
T and
for
.
Moreover, since the values of
and
are unnecessary to be exact, we may also use the interpolation to obtain the approximate values of
and
to provide the improved algorithms
for T and
for
without any nonlinear solutions. The improved algorithms are proposed in [
9], but no error analysis exists so far. This is due to the improved techniques where the combination
can be applied to the images under complicated transformations, e.g., those governed by partial differential equations (PDEs). In this paper, we also discuss the transformations of the Poisson and blending models by using the simple finite difference method (FDM) but apply the
to image transformations undergone with these PDE models.
From the analysis in
Section 4, the optimal convergence rates of greyness errors as
can be maintained for continuous pictures by combination
as well, where
is the small image resolution. Another aspect of the analysis in this paper is to deal with image discontinuity. Real image greyness often has discontinuity. In this paper, we solicit the Sobolev norms and give an error analysis of the boundary discontinuity of image greyness. Using Sobolev norms, we derive the absolute errors
,
. This study is an important remedy for error analysis made for continuous images [
7]. Geometric image transformations play a vital role in image processing, computer vision, computer graphics and AI. For pattern recognition and facial identification, finding and removing the geometric transformations involved are often a critical step. Hence, the improved algorithms with error analysis can be applied to enhance their performance in these areas.
The organization of this paper is as follows. In
Section 2, the SIM is introduced for
and
T, and in
Section 3, the interpolation techniques are proposed to reduce and even to bypass the nonlinear solutions. In
Section 4, error analysis is made for both continuity and discontinuity of image greyness, and error bounds are derived. In
Section 5, the PDE models of complicated transformations are approximated by the FDM, and then applied to image transformations by the new combination
. In
Section 6, some graphical and numerical experiments are provided to verify the theoretical analysis made, and to demonstrate the effectiveness of the proposed algorithms. Concluding remarks are given in
Section 7, and a list of symbols used is provided in
Appendix A.
3. Improved Algorithms of SIM for Images Under T
The nonlinear solutions for seeking
from
in (
16) by the Newton iterations (
17) may encounter some difficulties, e.g., choices of good initial values and multiple solutions. Below, we develop improved techniques to reduce and even to bypass the nonlinear solutions completely. Such improved techniques may also be incorporated with the PDE solutions in the harmonic, Poisson and blending models provided in
Section 5.
Technique I: Reduction of nonlinear solutions.
In Li [
6], the reduction technique was first used for evaluating
and
only at the center of gravity of
in
, i.e., at the mid-points
. It is better to carry out (
16) only at the pixel points
. The number of solutions to (
16) is only
as in Li [
6], where
is the total pixel number of the image. The approximations of
and
in the quadrilateral
in
may be obtained by the piecewise bilinear approximations (see
Figure 2), where the gravity center
,
and
. Then, the values of
at
and
can be evaluated by the bilinear approximation:
Moreover, the following linear approximations may be used (see
Figure 3). If
, we have
If
,
The formula for
is similar. The advantage of (
19) in Li [
6] is that the greyness errors in the computation are smaller. The algorithms in [
6] are denoted by
for
. Note that when
, the
using Technique I is just the original CIIM given in
Section 2.3.
Technique II: Bypassing nonlinear solutions.
The values
in (
19)–(
21) used in Technique I do not need to be exact, either, if their approximate values
have small allowable errors. The nonlinear transformation
T may also be approximated by a piecewise linear transformation
on triangles
, where
in
Figure 3. Hence, the values for
can be evaluated approximately by those for
. Note that the inverse transformation of
is just a linear transformation locally. This avoids the nonlinear solutions in Algorithm SIM for
T. The values
and
can be found using the following four steps.
- Step 1.
Compute all and by and .
- Step 2.
Find all potential
in XOY, which can be determined by
and
, where
where
is the largest integer
. The definitions of
and
are the same.
- Step 3.
Split
into two triangles
and
as
, where
are also triangles, and find all possible
such that
The area coordinates
,
and
are obtained from the following algebraic system:
where
are the vertices of triangles
in
Figure 3. The sufficient and necessary conditions for (
22) are all nonnegative area coordinates
.
- Step 4.
Obtain the approximate values of
related to
in (
22) by
where
,
and
are the vertices of
shown in
Figure 3.
Note that Steps 1–4 above are easy to carry out. The computation complexity is also only
, where
is the total pixel number. More importantly, the computation for values,
does not involve any nonlinear solutions. Based on the known
, the bilinear approximations
are formulated by
The improved SIM using Techniques I and II is given by
where
given in (
14). The computation complexity of (
25) and (
26) is
, where
. Hence, the CPU time by Techniques I and II is insignificant if comparing
to
. Note that no nonlinear solutions are needed in (
26) either. To distinguish the algorithms using Techniques I and II from SIM and CIIM, we denote (
26) by
and the combination of (
12) and (
26) by
. Technique I was proposed in [
6] with a preliminary error analysis, but Technique II was proposed in [
9] without error analysis. The error analysis is important and challenging for improved
for
T via both Techniques I and II.
Below, we will derive error bounds in the next section by Sobolev norms, which can be applied to discontinuous images.
4. Error Analysis
Algorithm analysis (such as error analysis) is essential to efficiency for numerical partial differential equations (PDEs). In this section, the new error analysis of image greyness by Techniques I and II is twofold.
I. Compared to error analysis in the original CIIM in Li [
6], the inverse coordinates for
and
are not exact. First, the piecewise linear (or bilinear) interpolations
and
are used based on the exact values
and
in Technique I. We give a strict analysis (ref. [
6], Section 6). Next, in Technique II, the inverse values
and
are approximated by
and
, also by using the piecewise linear interpolations. The linear (or bilinear) interpolations
and
on
yield the same order
of the errors, and the approximations
and
on
also yield the same error
for continuous images. The improvements in
Section 3 will maintain the optimal convergence rates
for
as in [
11] (Chapter 4). Therefore, Algorithms
and
circumvent the troublesome nonlinear solutions in the original CIIM in
Section 2. For the rather complicated harmonic and blending models, the PDE solutions in
Section 5 may be incorporated easily into the discrete algorithms proposed. Hence, the improved algorithms
and
may be applied to a wide scale of applications in digital images and patterns.
II. To estimate the errors of discontinuous images, we have to face the image discontinuity; this is a challenging task because discontinuity of image greyness always exists, and because the severe discontinuity may become useless for the continuous analysis of image transformations. In
Section 4.2, we do not need the assumption (
9) but still solicit the Sobolev norms for error analysis all the way. For the improved techniques in
Section 3, the greyness errors,
, are given in Corollary 1, where
and
. This is a significant development in error analysis for image transformations, noting that a high smooth greyness of images was often required in our previous papers [
6,
7]. New splitting techniques in this paper can also be applied to image processing, pattern recognition and AI under geometric transformation in [
14,
15,
16,
17].
4.1. Error Analysis for
Since the detailed analysis of the combinations for images under
is given in Li [
6], we only derive the distinct analysis for
for
T as
. Let the division number
, and then, we define the global greyness errors:
where
are given in (
26), and the sequential errors
and the pixel number
. The Sobolev norms over
are defined by
We consider the forward transformations
T, and construct a cubic spline function passing over
such that
and
. Define
where
. The
and
denote the piecewise constant and bilinear functions, respectively. The transformation
T is said to be regular if
,
and the Jacobian determinant
J satisfies
, where
and
are bounded constants. In this subsection, we assume that the image greyness is continuous to satisfy
and
as in (
9), where
is the Sobolev space. Such relaxed assumptions in the Sobolev space will be extended to the discontinuous greyness functions
and
in the sequential subsection. We have the following lemma.
Lemma 1. Let T be regular, hold, and the interpolation functions with be used. There exists a bounded constant C independent of μ and h such thatwhere , and Case B denotes the whose inverse transformed shapes by fall across the boundary of in . For steps 2 and 3 in Figure 1, we havewhere and . Proof. The bounds in (
30) are obtained similarly from [
11] (Chapter 4), by replacing
and
with
b and
, respectively. Next, from (
30), we have
where
and
. This completes the proof of Lemma 1. □
For simplicity, we consider only the case of
and achieve the optimal convergence rates
. For
, the analysis may be carried out leading to
and
in probability in Li and Bai [
7]. The difference in analysis between
of
Section 3 and SIM of
Section 2 lies in the evaluation on the approximate errors resulting from
,
,
and
in (
19) and (
25), instead of the exact values,
and
. Now, we give the grayness errors from Technique I from Strang and Fix [
18].
Lemma 2. Let and in (19) be the piecewise bilinear interpolations of and on , based on the exact values of and satisfying the two equations and exactly. Suppose that and . There exists a bounded constant C independent of H such that In Technique II, we obtain the approximations
in (
25) from the piecewise linear interpolations
and
(see
Figure 3). Below we prove a lemma.
Lemma 3. Let and be the piecewise linear interpolation based on the approximations: and , where and are the piecewise linear interpolations on in Lemma 2. Then, there exist the error bounds: Proof. Take the linear approximation (
20) as an example. We have
where
,
, and
. By using the affine transformation
and
, the triangle
is transformed to a reference triangle
. Then, we have
where the integrals are evaluated by calculus
Then, it follows that
Since all norms in finite dimensions are equivalent to each other, by noting
, we obtain
where
C is a bounded constant independent of
H. Therefore, we obtain from (
35) and (
36)
This is the desired result (
32), and Equation (33) holds similarly. □
Lemma 3 implies that the error order resulting from Technique II is no worse than that from Technique I. Now let us consider
using Techniques I and II, and the absolute errors are defined by
where
and
are defined in (
15) and (
26), respectively. Denote
We have
To provide the error bounds for (
39), we first give two lemmas.
Lemma 4. Suppose that and . There exist the error bounds Proof. We have from
and the Schwarz inequality
This is the first bound (
40).
Next, we have from (
43) and the Schwarz inequality
Third, from Lemma 3 we have similarly
This completes the proof of Lemma 4. □
The Jacobian determinant
is given from (
18). The nonsingular Jacobian determinant satisfies
. Denote by
the minimal singular value of Jacobian matrix
. Then, we have
, and give the following lemma.
Lemma 5. Let the transformation T be regular. Then, under the inverse transformation , there exist the boundswhere is the maximal value of the Jacobian determinant, and is the minimal singular value of matrix in (18). Proof. We have
This is (
45) for
.
Next, since
we have
From (
47), we have
and
. Hence, we obtain
Consequently, we have from (
47) and (
48)
and
These are (
45) for
, and completes the proof of Lemma 5. □
Now we prove a main theorem, which is not only important for error analysis of the improved algorithms in this paper, but also effective for error analysis of discontinuous greyness of images in
Section 4.2.
Theorem 1. Let and , and the conditions in Lemma 5 hold. Then, there exists the error bound for , Proof. From Lemmas 1 and 5 for
,
where
and
are the original domains of
and
. Since
can be regarded as piecewise bi-cubic spline functions, the following bounds hold due to finite-dimensional functions.
It follows that
Also we have from Lemmas 4 and 5
Moreover, we have
from Lemma 5. Then, we obtain from Lemmas 4 and 2,
Based on (
39) we have from (
52)–(
54),
to give the desired results (
49). This completes the proof of Theorem 1. □
Remark 1. For the regular transformation T, we have , and . Then, Equation (49) leads toThe error bounds (56) are the optimal estimates of the SIM via transformation in [11] (Chapters 3 and 4). The optimal convergence rates (56) remain for the improved via T without nonlinear solutions. Hence, the is beneficial for wide applications as in Section 5. Moreover, the error bounds (49) in Sobolev norms can be extended to discontinuous images below. 4.2. Error Analysis for Image Discontinuity
In our past study for image geometric transformations, the greyness functions
were assumed to be continuous
or even
. However, since
, we may study the discontinuity of digital images and patterns. In this subsection, we introduce the discontinuity degree. First, let us consider the binary images. If the entire image is either black or white, the discontinuity degree is zero. On the other hand, if the pixel is black when
is even, and white when
is odd, the discontinuity degree is one. Let
be split into a
partition of
and denoted by
. Denote by
both the blacks and the whites existing at nine-pixel points (see
Figure 4). Let the number
. Hence, the discontinuity degree is defined by
There are four cases of an image in 2D of the image discontinuity with the discontinuity degree from (
57).
- (1)
There are only a few isolated pixels,
- (2)
Greyness discontinuity exists only along the interior and exterior image boundary,
- (3)
Greyness discontinuity is minor,
- (4)
All
are
,
The analysis below in this paper is well suited to Cases (1)–(3). For Case (4), the large, useless errors are obtained. Let us prove a new lemma.
Lemma 6. For nonuniform binary images in , the error bounds of piecewise cubic-spline interpolant functions passing all are given by Proof. By the affine transformation:
,
the region
is transformed to a unit square
in
Figure 4, where
. Then, we have
. Note that the same binary images at the nine-pixel points on
as in
. Let
be the bi-quadratic Lagrange polynomial passing through all pixel points:
. Based on the equivalence of finite dimensional norms, we can prove
. Hence,
where
is also a bi-quadratic Lagrange interpolant polynomial in
.
For the binary images, the bi-quadratic interpolant gives
and
. It is easy to see
The first desired result (
58) follows from (
61) and (
62). Moreover
This complete the proof of Lemma 6. □
For the
q-greyness levels, we may define the greyness jumps of images as
The analysis of binary images can be extended to that of other multiple level images. From Theorem 1 and Lemma 6, we have the following theorem.
Theorem 2. Let transformation T be regular and all conditions in Lemma 6 hold. When , there exist the bounds: Proof. From Theorem 1 and Lemma 6, we have
The regular transformation
T implies
,
,
. We have
This is the desired result (
64) and completed the proof of Theorem 2. □
From Theorem 2, we have the following corollary.
Corollary 1. Let all conditions in Theorem 1 hold. For the finite greyness jumps , the absolute errors areWhen the images have with greyness jumps of interior and exterior boundaries ,When with , denote . We have Remark 2. When the majority greyness of images is continuous, we have with . The errors are small due to small from (65). This is an important development from [6,7], where images are often assumed to be continuous. Note that the sequential errors as still hold. However for Case (4): the full discontinuous images, we have from (65) with and , which is meaningless. Another error analysis for greyness discontinuity is reported for other kinds of splitting algorithms in [11] (Chapter 8). 4.3. Summary of Six Combinations
for the Cycle Transformation
Here let us give a summary of various combinations for the cycle transformation
. There are six combinations with
and
in 2D images under
: (1)
, (2)
and
, (3)
, (4)
, (5)
, and (6)
in this paper. We list their characteristics, accuracy and convergence in
Table 1 with
only. The details of algorithms and rigor analysis are provided in a recent book [
11]. Furthermore, a list of symbols used is listed in
Appendix A.
In summary, combination
in [
1,
7] is the simplest, but suffers from low sequential errors
and
. Combination
in [
11] (Chapter 4) gains the better convergence rates,
. When
using the piecewise bilinear interpolations
and
, the sequential convergence rates
are optimal. For
using the piecewise constant interpolations
and
, the performance of
is still satisfactory. However,
suffers from nonlinear solutions during the forward transformation
T. Consequently, for the inverse transformation
,
is recommended if the nonlinear solutions are easy using iteration methods. The original
is suited well to low numbers of greyness-level images; but
is suited to both low and high numbers of greyness level (≥256) images. The improved
in [
9] may completely bypass the nonlinear solutions, and its rigor analysis is explored in this paper. The improved
are particularly beneficial for harmonic models with the finite element method, finite difference method and finite volume method because the approximate transformation
obtained is already piecewise linear (see [
8,
9]).
To promote accurate images with no need for nonlinear solutions,
for
and
for
in [
11] (Chapter 7) have been developed to have the optimal convergence rates
. The sophisticated combinations
in [
11] (Chapter 8) may be easily carried out without sequential errors when
. Moreover, an analysis in [
11] (Chapter 8) reveals the better absolute error
for all kinds of image discontinuity.
7. Concluding Remarks
Compared to our previous study of digital images under geometric transformations (see [
1,
6,
7,
9]), we address several novelties in this paper.
1. For the images under a nonlinear transformation
T, the original SIM requires the nonlinear solutions, although the number of nonlinear equations may be reduced to
in Li [
6]. In Li, et al. [
9], we proposed interpolation techniques that entirely bypass nonlinear solutions for the geometric images. The new numerical algorithms are called the improved
for
T and Combination
under
. This paper provides error analysis for the improved
by using Techniques I and II.
2. The error bounds are given in Theorem 1 based on the Sobolev norms. The optimal convergence rates are obtained for the piecewise bilinear interpolations () and smooth images, where is the mesh resolution of an optical scanner, and N is the division number of a pixel split into sub-pixels.
3. For the images in which the portion of discontinuity is minor, the error bounds are given in Theorem 2. For general cases of discontinuous images, the error bounds are given in Corollary 1 as , when .
4. The combination
can be applied to the harmonic, Poisson and blending models in
Section 5 and their approximate solutions
and
may be obtained by the FDM using the SOR. The programming of the FDM is much easier than that of the FEM and the finite volume method (FVM), and the CPU time greatly reduced.
5. Numerical and graphical experiments are carried out in
Section 6. The real images of
(and
) pixels of 256 greyness-levels are carried out to show the importance of the improved algorithms. The numerical experiments in
Section 6 also support the error analysis made.
6. New numerical algorithms for image geometric transformations in this paper are beneficial to computer vision, image processing and pattern recognition. Applications of numerical algorithms particularly to deep learning in AI are also given in Remark 4. Modern AI systems (e.g., ChatGPT, DeepSeek, Manus, Sora) rely on three core elements: data, models and algorithms. As emphasized in [
8,
10,
11], efficient algorithms are critical for optimizing computational resources. Six combinations of numerical algorithms for the images under
are summarized in
Table 1.