Article

Deep-Learning Estimators for the Hurst Exponent of Two-Dimensional Fractional Brownian Motion

by Yen-Ching Chang 1,2
1 Department of Medical Informatics, Chung Shan Medical University, Taichung 40201, Taiwan
2 Department of Medical Imaging, Chung Shan Medical University Hospital, Taichung 40201, Taiwan
Fractal Fract. 2024, 8(1), 50; https://doi.org/10.3390/fractalfract8010050
Submission received: 20 December 2023 / Revised: 31 December 2023 / Accepted: 10 January 2024 / Published: 12 January 2024
(This article belongs to the Section Numerical and Computational Methods)

Abstract:
The fractal dimension (D) is a very useful indicator for recognizing images. The fractal dimension increases as the pattern of an image becomes rougher. Therefore, images are frequently described by certain models of fractal geometry. Among these models, two-dimensional fractional Brownian motion (2D FBM) is commonly used because it has a specific physical meaning and contains only one finite-valued parameter, the Hurst exponent (H), a real value between 0 and 1. More usefully, H and D are related by D = 3 − H. The accuracy of the maximum likelihood estimator (MLE) is the best among estimators, but its efficiency is appreciably low. Recently, an efficient MLE for the Hurst exponent was proposed to greatly improve efficiency, but it still incurs considerable computational costs. Therefore, in the paper, we put forward a deep-learning estimator based on classification models. The trained deep-learning models for images of 2D FBM not only incur smaller computational costs but also provide smaller mean-squared errors than the efficient MLE, except for size 32 × 32 × 1. In particular, the computational times of the efficient MLE are up to 129, 3090, and 156,248 times those of our proposed simple model for sizes 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1.

1. Introduction

When describing particular images (or surfaces) and signals, the fractal structure often plays a very effective role. For example, Huang and Lee [1] used fractal analysis to classify pathological prostate images; Lin et al. [2] also adopted fractal analysis to classify solitary pulmonary nodules; He and Liu [3] screened dry coal online through fractal analysis and image processing; Yakovlev et al. [4] also used fractal analysis to evaluate changes in a modified cement composite; Guo et al. [5] characterized and classified tumor lesions of digital mammograms through fractal texture analysis; Crescenzo et al. [6] adopted FBM to predict temperature fluctuation; Paun et al. [7] used fractal analysis to evaluate micro fractographies; and Hu et al. [8] combined FBM and particle swarm optimization to propose a stock prediction model.
It is common for signals and images to use the fractal dimension (FD) to explain and recognize irregular structures; the smaller the FD, the smoother the signal/image. Accordingly, the intensity patterns of normal and pathological tissues in medical images are usually viewed as fractal images, and the FD is therefore a very suitable indicator for differentiating between these two kinds of tissues.
Among the models for describing irregular images, two-dimensional fractional Brownian motion (2D FBM) [9,10] is an excellent choice because it can explain the characteristics of images in a physically meaningful sense. Its corresponding two-dimensional fractional Gaussian noise (2D FGN) is another excellent choice, in that it can describe much rougher patterns. These two models are not only suitable for describing many medical images; they are also appropriate for characterizing many natural scenes [11,12,13]. For traditional estimators, these two models must be considered and applied separately. For deep-learning estimators, they can be trained together and further explained. Considering images of 2D FBM and 2D FGN for deep-learning models is thus a promising avenue.
Both 2D FBM and 2D FGN contain only the Hurst exponent (H)—a real value between 0 and 1—which can easily distinguish between images generated from various realizations; H and the FD (D) are related by D = 3 − H [10]. Since the values of the Hurst exponent lie between 0 and 1, it is very likely to act as an important feature. In particular, its range is well suited to serve as the intensity domain of images. When all sub-images of an original image are transformed into estimates of the Hurst exponent, these estimates (0–1) can be saved as a feature/characteristic image or map to serve as another input source image, as illustrated in the sketch below. When machine-learning or deep-learning models are run on certain images, we can supply this characteristic image as another input to improve the overall classification rate through two inputs (the original source images and their corresponding transformed characteristic images) and/or two model streams.
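As a concrete illustration of this idea, the following minimal Python sketch slides a window over an image and fills a characteristic map with Hurst estimates. The window size, stride, and the estimator passed as estimate_hurst (for example, the efficient MLE of [24] or a trained deep-learning model) are placeholders of ours, not values prescribed by the paper.

```python
import numpy as np

def hurst_feature_map(image, estimate_hurst, window=32, stride=16):
    """Slide a window over `image`, estimate H on each sub-image, and
    return the estimates as a 2D characteristic map with values in [0, 1]."""
    rows = range(0, image.shape[0] - window + 1, stride)
    cols = range(0, image.shape[1] - window + 1, stride)
    h_map = np.zeros((len(rows), len(cols)))
    for a, i in enumerate(rows):
        for b, j in enumerate(cols):
            # estimate_hurst is any Hurst-exponent estimator for a 2D patch.
            h_map[a, b] = estimate_hurst(image[i:i + window, j:j + window])
    return h_map
```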
Typically, the effectiveness of the Hurst exponent is related to the accuracy of estimation; that is, the higher the accuracy of estimation, the better the classification rate. Therefore, the better the estimator, the higher the benefit of the Hurst exponent. Over the past decades, some estimators have been computed via the fractal model [14], and others through the fractal feature [15], the fractal approach [16], the box-counting method [17,18,19,20,21,22], and the blanket method [23]. No matter which method is chosen, we often need to balance the accuracy and efficiency of estimators. Nevertheless, optimal accuracy is always welcomed by users, except when its computational cost is unbearable.
As we know, the accuracy of the maximum likelihood estimator (MLE) is the best among estimators, but its efficiency is appreciably low and even prohibitive. In order to improve its efficiency, Chang [24] recently proposed two versions of the MLE for 2D FBM and 2D FGN: one is called an iterative MLE, and the other is called an efficient MLE. The efficient MLE is much more efficient than the iterative MLE, and the iterative MLE is much more efficient than the standard MLE (or simply the MLE). Currently, it is—without a doubt—our best choice to use the efficient MLE to estimate the Hurst exponent.
Before the era of machine learning, we had often used these estimates to distinguish between various tissues or images; generally, this approach was effective for significant differences between images. At the moment, we—in the era of machine learning—can also classify images using these estimates as another feature; this machine-learning approach is typically better than the conventional approach, especially for small differences between images.
As deep learning becomes more and more advanced and mature in algorithms and technologies, in the upcoming future, we are going to use the corresponding characteristic images of all the original images as extra input and then design two inputs and/or two model streams to distinguish between different images. Compared to a single image input, two or more image inputs will intuitively improve the overall classification rate through deep-learning models. It can be expected that this deep-learning approach will be much better than the machine-learning approach, especially for tiny differences between images.
In the future, we will develop multiple inputs and/or diverse model streams to distinguish between different images. Their inputs will contain other characteristic images or maps, such as entropy or spectrum images.
In the past, when images were modeled as 2D FBM, we estimated their Hurst exponents via estimators and then viewed these estimates as an indicator. It is well known that any estimator will contain errors. In addition, the weakness of the best estimator is usually its higher computational cost, even when the error has been much reduced by an efficient MLE [24].
Over the past decade, deep-learning models have flourished. As rich and diverse pre-trained models are developed, a feasible and even effective idea is that estimates may be computed indirectly from deep-learning models, not directly from estimators. This approach—estimating the Hurst exponent through deep-learning models—can be regarded as an indirect estimator, whereas a traditional estimator, such as the MLE, can be regarded as a direct estimator. For convenience, we designate these approaches using deep-learning models as deep-learning estimators. It turns out that deep-learning estimators outperform traditional estimators in terms of mean-squared errors and computational costs.
Since a rich resource of pre-trained deep-learning models is available, trained on more than a million images from the ImageNet database [25] to classify 1000 object categories, in the paper, we adopt deep-learning models for classification—not regression—to classify the Hurst exponent and then convert these classes to the corresponding values of the Hurst exponent. In the future, we will further modify these pre-trained models for classification into qualified pre-trained models for regression.
In a previous pilot study [26], we found that a simple deep-learning model and two pre-trained models—AlexNet and GoogleNet—with and without augmentation outperformed the MLE in terms of mean-squared errors (MSEs) in fractional Brownian images (FBIs—gray-level images saved from 2D FBM) with a size 32 × 32 × 1 under two equally spaced resolutions (11 and 21 classes). For each Hurst exponent, we generated 1000 FBIs. In order to fairly compare the MSEs between the MLE and three deep-learning models, five-fold cross-validation was adopted with the stochastic gradient descent with momentum (sgdm) solver.
At that time, the MSEs for deep-learning models were measured by converting the classified classes to the corresponding MSEs. Therefore, if the classified class is correct, its MSE is zero. This measurement seems reasonable, but it is not practical for real-world images, in that we do not know how to precisely segment the Hurst exponent in advance. Therefore, in the paper, we additionally generate a finer resolution of 32 classes to train the models, use these models trained on 32 classes to classify 11 classes of images, and then convert the classified classes to their corresponding MSEs. In this case, there are no absolutely correct classes, and hence, any class will result in an error. Based on this principle, a relatively fair comparison can be conducted.
On the surface, the finer the resolution, the higher the advantage of the deep-learning models. However, the finer resolution will give rise to complexity in training deep-learning models. Therefore, this approach can be used to test the power of deep-learning models.
For a fair comparison among deep-learning models, in the previous experiments, we generated 1000 FBIs through the same seeds (1–1000) from a pseudo-random-number generator. The set from seed 1 to 1000 is called Set 1. Using five-fold cross-validation, this approach seems to be reasonable and reliable because none of the validation images was involved in the training set. In order to understand whether deep-learning models will learn and memorize the hidden realizations from a pseudo-random-number generator, we will additionally generate another set of images from other seeds (1001–2000). The set from seed 1001 to 2000 is called Set 2.
In our study, three main issues will be discussed. The first issue is whether the deep-learning models trained on 32 classes from Set 1 are better in the classification rates and MSEs than those trained on 11 and 21 classes from the same seeds (Set 1) and whether the deep-learning models trained on 21 classes from Set 1 are also better in the classification rates and MSEs than those trained on 11 classes from the same seeds (Set 1). If so, these trained deep-learning models are qualified for estimating the Hurst exponent of FBIs. The evaluation approach of the experiment is called within-set evaluation; that is, deep-learning models trained on a large part of one set are evaluated on a small part of the same set through cross-validation. For five-fold cross-validation, the training set is 80% of one set, and the validation set is 20% of the same set.
For the second issue, we will perform two experiments: one involves Set 2 with 11 classes being evaluated by deep-learning models trained on Set 1 with 11 classes, and the other involves Set 1 with 11 classes being evaluated through five-fold cross-validation by deep-learning models trained on Set 1 with 11 classes. The evaluation approach of the former is called between-set evaluation; that is, deep-learning models trained on one set are evaluated on another set. The latter approach, as before, is within-set evaluation. If the two evaluation approaches give similar results, the deep-learning models for FBIs will possess an extremely high generalization ability. These models are robust against various realizations, and hence, they are extremely suitable for indirectly estimating the Hurst exponent or its corresponding fractal dimension. Otherwise, deep-learning models may learn the hidden realizations through different appearances from different Hurst exponents. This phenomenon would suggest that the current deep-learning models possess high learning power.
As we know, any estimation must contain an error, but the MSEs of correctly classified classes were assigned zeros in the previous experiments; this operation is therefore not fair for a comparison with the MLE. Addressing the third issue, we also want to know whether Set 2 with 11 classes, evaluated by deep-learning models trained on Set 1 with 32 classes, performs better in terms of MSEs and computational costs than the efficient MLE. If true, these models for FBIs are suitable for indirectly estimating the Hurst exponent. In this experiment, we use between-set evaluation, and no so-called correct classes occur; hence, the comparison is very fair.
Therefore, in the paper, three resolutions—11, 21, and 32 equally spaced Hurst exponents—will be taken into consideration. For each resolution, we first estimate the Hurst exponents of fractional Brownian surfaces (FBSs—the numerical data of 2D FBM) and FBIs via the efficient MLE for comparison. Based on the same FBIs, we propose a simple deep-learning model and choose five pre-trained deep-learning models to classify and then convert the classified classes to the corresponding Hurst exponents. Likewise, their classification results and MSEs will be provided for further comparison.
Section 2 of the paper introduces some related materials and methods. Section 3 describes our experiments and provides extensive results. A detailed discussion is presented in Section 4. Section 5 provides some possible applications in the future. Finally, Section 6 summarizes the results of the paper and presents some future works.

2. Materials and Methods

In brief, we will describe some materials and deep-learning models in this section. The primary materials include the non-stationary process of 2D FBM, its corresponding stationary increment process of 2D FGN, and an efficient MLE for the Hurst exponent: a traditional MLE run in an efficient way. In the end, the deep-learning models consist of our proposed model and five pre-trained models: GoogleNet, Xception, ResNet18, MobileNetV2, and SqueezeNet.

2.1. Two-Dimensional Fractional Brownian Motion

Falconer [10] used an index-H Brownian function to name the non-stationary process of 2D FBM, whereas Hoefer et al. [27] used an isotropic 2D FBM to name the process and an isotropic two-dimensional discrete FBM to name its corresponding discrete version. However, Balghonaim and Keller [28] used a two-variable FBM to name the process and a two-variable discrete FBS to name its corresponding discrete version. In the paper, we simply use 2D FBM to name the process.
Assume that an image of 2D FBM (numerical data, not gray-level values or real numbers derived from gray-level values), with size M × N, is denoted as follows:
$$
I_B = \begin{bmatrix}
B(0,0) & B(0,1) & \cdots & B(0,N-1) \\
B(1,0) & B(1,1) & \cdots & B(1,N-1) \\
\vdots & \vdots & \ddots & \vdots \\
B(M-1,0) & B(M-1,1) & \cdots & B(M-1,N-1)
\end{bmatrix}
$$
Taking two points (coordinates) $B(x_1, y_1)$ and $B(x_2, y_2)$ from the image of 2D FBM, we obtain their autocorrelation function (ACF) as follows:
$$
r_B\big((x_1, y_1), (x_2, y_2)\big) = E\big[B(x_1, y_1)\,B(x_2, y_2)\big]
= \frac{\sigma^2}{2}\Big( \big\|(x_1, y_1)\big\|^{2H} + \big\|(x_2, y_2)\big\|^{2H} - \big\|(x_1 - x_2,\, y_1 - y_2)\big\|^{2H} \Big)
$$
where $E$ stands for the expectation operator, $\sigma^2$ is the variance of the process, $\|\cdot\|$ is the Euclidean norm, and $H$ is the only parameter of concern, called the Hurst exponent. It is obvious that the ACF is directly calculated from the distance between the two coordinates and their two distances from the origin.
To estimate the Hurst exponent, the ACFs must be stationary—not non-stationary. There exist two feasible approaches for stationary ACFs. Hoefer et al. [27] put forward the first approach, simply called the first 2D FGN, by using the second increment process of 2D FBM. They also called the resultant process two-dimensional Gaussian noise (FGN2) [29]. Balghonaim and Keller [28] put forward the second approach, simply called the second 2D FGN, by using the first increment process on each row or column of 2D FBM.
Chang [24] has given a detailed description of these two 2D FGNs. When his proposed efficient MLE was run on the second 2D FGN, the mean-squared errors (MSEs) were smaller than those on the first 2D FGN. Compared to the second 2D FGN, however, the first 2D FGN possesses more physical meaning because real images usually resemble the appearance of the first 2D FGN and seldom the patterns of the second 2D FGN.
Therefore, we will adopt the first 2D FGN—not the second 2D FGN—to generate its probability density function (PDF) and then estimate the Hurst exponent through the efficient MLE. For the first 2D FGN, we first represent the data of the 2D FBM image $I_B$ as a column vector $\mathbf{B}$ of size $MN \times 1$, as follows:
$$
\mathbf{B}^T = \begin{bmatrix} \mathbf{B}_0^T & \mathbf{B}_1^T & \cdots & \mathbf{B}_{M-1}^T \end{bmatrix}
$$
where
$$
\mathbf{B}_k^T = \begin{bmatrix} B(k,0) & B(k,1) & \cdots & B(k,N-1) \end{bmatrix}, \quad k = 0, 1, \ldots, M-1.
$$
Based on the column vector, we can easily obtain its non-stationary covariance matrix from the following equation:
$$
R_{BB} = E\big[\mathbf{B}\mathbf{B}^T\big].
$$
Next, we calculate the first 2D FGN from the second increment process [27] of 2D FBM, as follows:
$$
X(i,j) = B(i,j) - B(i,j-1) - B(i-1,j) + B(i-1,j-1), \quad i = 1, 2, \ldots, M-1;\; j = 1, 2, \ldots, N-1.
$$
Similar to the notation for the image of 2D FBM, we also represent this image of 2D FGN as a column vector $\mathbf{X}$ of size $N_1 \times 1$, where $N_1 = (M-1)(N-1)$:
$$
\mathbf{X}^T = \begin{bmatrix} \mathbf{X}_1^T & \mathbf{X}_2^T & \cdots & \mathbf{X}_{M-1}^T \end{bmatrix}
$$
where
$$
\mathbf{X}_k^T = \begin{bmatrix} X(k,1) & X(k,2) & \cdots & X(k,N-1) \end{bmatrix}, \quad k = 1, 2, \ldots, M-1.
$$
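In code, the second increment process and its row-stacked vector can be obtained in a few lines. The following NumPy sketch is only illustrative, and the function name is ours.

```python
import numpy as np

def first_2dfgn(B):
    """Compute the first 2D FGN (second increment process) of an FBM surface B
    of shape (M, N); return it both as an (M-1) x (N-1) array and as the
    row-stacked vector X of length N1 = (M-1)*(N-1)."""
    X = B[1:, 1:] - B[1:, :-1] - B[:-1, 1:] + B[:-1, :-1]
    return X, X.ravel()
```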
The vector $\mathbf{X}$ can be regarded as a vector time series. Accordingly, its covariance matrix has a special structure, called a Toeplitz-block Toeplitz matrix, or a symmetric-block symmetric matrix, as follows:
$$
R = R(H, \sigma^2) = E\big[\mathbf{X}\mathbf{X}^T\big] = \begin{bmatrix}
R_0 & R_1 & \cdots & R_{M-2} \\
R_1 & R_0 & \cdots & R_{M-3} \\
\vdots & \vdots & \ddots & \vdots \\
R_{M-2} & R_{M-3} & \cdots & R_0
\end{bmatrix},
$$
where
$$
R_k = R_{i,j} = E\big[\mathbf{X}_i \mathbf{X}_j^T\big] = \begin{bmatrix}
F(k,0) & F(k,1) & \cdots & F(k,N-2) \\
F(k,1) & F(k,0) & \cdots & F(k,N-3) \\
\vdots & \vdots & \ddots & \vdots \\
F(k,N-2) & F(k,N-3) & \cdots & F(k,0)
\end{bmatrix},
\quad k = |i - j|, \quad i, j = 1, 2, \ldots, M-1,
$$
where
$$
F(k,l) = E\big[X(i_1, j_1)\,X(i_2, j_2)\big] = \frac{\sigma^2}{2} f(k,l),
\quad k = |i_1 - i_2|, \; l = |j_1 - j_2|; \quad i_1, i_2 = 1, 2, \ldots, M-1; \; j_1, j_2 = 1, 2, \ldots, N-1,
$$
where
$$
\begin{aligned}
f(k,l) = {} & 2\big[(k-1)^2 + l^2\big]^{H} + 2\big[k^2 + (l-1)^2\big]^{H} + 2\big[k^2 + (l+1)^2\big]^{H} + 2\big[(k+1)^2 + l^2\big]^{H} \\
& - \big[(k-1)^2 + (l-1)^2\big]^{H} - \big[(k-1)^2 + (l+1)^2\big]^{H} - \big[(k+1)^2 + (l-1)^2\big]^{H} \\
& - \big[(k+1)^2 + (l+1)^2\big]^{H} - 4\big(k^2 + l^2\big)^{H}.
\end{aligned}
$$
Obviously, all ACFs are directly connected with the relative distances—not the absolute distances—between two coordinates; hence, the first 2D FGN is stationary, or the vector $\mathbf{X}$ is a stationary vector time series. Accordingly, we can estimate the Hurst exponent through the MLE based on the PDF of this 2D FGN.
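To make the structure concrete, the following Python sketch builds the normalized covariance matrix $\bar{R}$ (with $R = \sigma^2 \bar{R}$, so each entry equals $f(k,l)/2$) directly from the definitions above. It is a plain, unoptimized illustration; the function and variable names are ours, not the paper's.

```python
import numpy as np

def f_kl(k, l, H):
    """ACF kernel f(k, l) of the first 2D FGN, as defined above."""
    s = lambda a, b: (a * a + b * b) ** H
    return (2 * s(k - 1, l) + 2 * s(k, l - 1) + 2 * s(k, l + 1) + 2 * s(k + 1, l)
            - s(k - 1, l - 1) - s(k - 1, l + 1) - s(k + 1, l - 1) - s(k + 1, l + 1)
            - 4 * s(k, l))

def build_Rbar(M, N, H):
    """Normalized covariance matrix R_bar (R = sigma^2 * R_bar) of the first
    2D FGN derived from an M x N FBM image; its size is N1 x N1,
    with N1 = (M-1)*(N-1)."""
    m, n = M - 1, N - 1
    Rbar = np.empty((m * n, m * n))
    for i in range(m):
        for j in range(m):
            k = abs(i - j)
            block = np.empty((n, n))
            for a in range(n):
                for b in range(n):
                    block[a, b] = 0.5 * f_kl(k, abs(a - b), H)
            Rbar[i * n:(i + 1) * n, j * n:(j + 1) * n] = block
    return Rbar
```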

2.2. The Maximum Likelihood Estimator for 2D FBM

Through this 2D FGN, its corresponding PDF will be generated. Based on the PDF, two cases will be considered: one with a known variance, the other with an unknown variance. Since no extra variance needs to be estimated, the known-variance case is theoretically better in terms of accuracy than the unknown-variance case. In the paper, we only consider the unknown-variance case, in that it is the most common case in the real world. Therefore, the PDF can be described by the following equation:
$$
p(\mathbf{X}; H, \sigma^2)
= \frac{1}{(2\pi)^{N_1/2}\,|R|^{1/2}} \exp\!\left(-\frac{1}{2}\mathbf{X}^T R^{-1} \mathbf{X}\right)
= \frac{1}{(2\pi)^{N_1/2}\,\big|\sigma^2 \bar{R}\big|^{1/2}} \exp\!\left(-\frac{1}{2\sigma^2}\mathbf{X}^T \bar{R}^{-1} \mathbf{X}\right),
$$
where
$$
R = \sigma^2 \bar{R}.
$$
The above PDF contains two parameters to be estimated by the efficient MLE: the unknown variance (an explicit parameter) and the Hurst exponent (an implicit parameter). To estimate the explicit unknown variance, we first take the logarithm of the PDF, then maximize the log-likelihood function $\log p(\mathbf{X}; H, \sigma^2)$ with respect to $\sigma^2$, and finally obtain
$$
\log p(\mathbf{X}; H, \hat{\sigma}^2) = -\frac{N_1}{2}\log 2\pi - \frac{N_1}{2}\log \hat{\sigma}^2 - \frac{1}{2}\log\big|\bar{R}\big| - \frac{N_1}{2},
$$
where
$$
\hat{\sigma}^2 = \frac{1}{N_1}\mathbf{X}^T \bar{R}^{-1} \mathbf{X}.
$$
Without affecting further estimation of the implicit parameter (the Hurst exponent), the two constant terms and the common coefficient 0.5 can be omitted, and we then obtain the compact form in the following equation:
$$
\max_H \log p(\mathbf{X}; H, \hat{\sigma}^2) = \max_H \left\{ -N_1 \log\!\left(\frac{1}{N_1}\mathbf{X}^T \bar{R}^{-1} \mathbf{X}\right) - \log\big|\bar{R}\big| \right\}.
$$
From the elements of $\bar{R}$, we know that the log-likelihood function only includes the parameter of the Hurst exponent, but in an implicit and inseparable way. Accordingly, it is not possible to directly maximize the log-likelihood function with respect to the implicit Hurst exponent. Instead, in the paper, we adopt the golden section search [30,31] to search for the best estimate.
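As an illustration of this search, the following Python sketch minimizes the negated compact log-likelihood over $H$ with a golden section search, reusing the build_Rbar helper from the previous sketch. It is a plain MLE intended for small images, not the iterative or efficient MLE of [24]; the search bounds and tolerance are arbitrary choices of ours.

```python
import numpy as np

def neg_log_likelihood(H, X, M, N):
    """Negated compact log-likelihood for a given H; X is the vectorized
    first 2D FGN of length N1 = (M-1)*(N-1)."""
    Rbar = build_Rbar(M, N, H)   # defined in the covariance sketch above
    N1 = X.size
    _, logdet = np.linalg.slogdet(Rbar)
    sigma2_hat = X @ np.linalg.solve(Rbar, X) / N1
    return N1 * np.log(sigma2_hat) + logdet

def mle_hurst(X, M, N, lo=0.01, hi=0.99, tol=1e-4):
    """Golden section search for the H minimizing the negated log-likelihood."""
    gr = (np.sqrt(5) - 1) / 2
    a, b = lo, hi
    c, d = b - gr * (b - a), a + gr * (b - a)
    fc, fd = neg_log_likelihood(c, X, M, N), neg_log_likelihood(d, X, M, N)
    while b - a > tol:
        if fc < fd:                      # minimum lies in [a, d]
            b, d, fd = d, c, fc
            c = b - gr * (b - a)
            fc = neg_log_likelihood(c, X, M, N)
        else:                            # minimum lies in [c, b]
            a, c, fc = c, d, fd
            d = a + gr * (b - a)
            fd = neg_log_likelihood(d, X, M, N)
    return (a + b) / 2
```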
Different from the traditional or original MLE for estimating the Hurst exponent, Chang [24] further proposed two quicker versions of the MLE: an iterative MLE and an efficient MLE. Both quicker MLEs have the same accuracies as the original MLE but different degrees of efficiency from the original MLE. The efficient MLE is faster than the iterative MLE, and hence, in the paper, we adopt the efficient MLE for comparison.

2.3. Deep-Learning Models

Szymak et al. [32] used pre-trained deep-learning models to classify objects in underwater videos. Maeda-Gutiérrez et al. [33] fine-tuned pre-trained deep-learning models to recognize diseases of tomato plants. In the paper, we choose five pre-trained models—namely GoogleNet, Xception, ResNet18, MobileNetV2, and SqueezeNet—to classify images of 2D FBM.
GoogleNet [34] is a convolutional neural network, which is 22 layers deep [35], with 144 layers in total. It is also a pre-trained model, trained on more than a million images from the ImageNet database or Places365 [36]. The model trained on ImageNet [25] can classify images into 1000 object categories. Similar to the network trained on ImageNet, the model trained on Places365 can classify images into 365 different place categories. The model has an image input size of 224 × 224.
Xception [37] is a 71-layer-deep convolutional neural network [35], with 170 layers in total. It is a pre-trained model, trained on more than a million images from the ImageNet database. As with GoogleNet, the pre-trained model can also classify images into 1000 object categories. Therefore, the model learns rich feature structures. The model has an image input size of 299 × 299.
ResNet18 [38] is an 18-layer-deep convolutional neural network [35], with 71 layers in total. As with Xception, it is also a pre-trained model. Likewise, the pre-trained model can recognize 1000 object categories. The model has an image input size of 224 × 224, same as GoogleNet.
MobileNetV2 [39] is a 53-layer-deep convolutional neural network [35], with 154 layers in total. As with ResNet-18, it is also a pre-trained model. The pre-trained model can similarly classify images into 1000 object categories. The model has an image input size of 224 × 224, same as GoogleNet and ResNet18.
SqueezeNet [40] is an 18-layer-deep convolutional neural network [35], with 68 layers in total. As with MobileNetV2, it is also a pre-trained model, trained on more than a million images from the ImageNet database. The pre-trained model can also recognize 1000 object categories. The model has an image input size of 227 × 227.
Among these five pre-trained models, Xception (71 layers deep, 170 layers in total) has the greatest layer depth and the highest total number of layers. The second deepest, with the second highest number of layers, is MobileNetV2 (53 layers deep, 154 layers in total). Based on layer depth and the number of layers, Xception has the most complex structure, followed by MobileNetV2 and GoogleNet (22 layers deep, 144 layers in total). The layer depth and number of layers of ResNet18 (18 layers deep, 71 layers in total) are similar to those of SqueezeNet (18 layers deep, 68 layers in total).
The total number of layers of MobileNetV2 is only 10 larger than that of GoogleNet, but the layer depth of MobileNetV2 is 31 greater than that of GoogleNet. As a result, the structure of MobileNetV2 is lanky (tall and thin), whereas that of GoogleNet is squat (short and wide).
In addition to the five pre-trained deep-learning models, in the paper, we propose a simple deep-learning model with 25 layers in order to understand whether a simple model has the ability to classify images of 2D FBM. For effective classification, machine-learning models require users to manually select features ahead of time, whereas deep-learning models do not. Compared to machine-learning models, one of the main advantages of deep-learning models is that they can automatically capture as many patterns as possible, depending on their structures and powers, and then integrate them through intricate algorithms or mechanisms into significant features.

3. Experimental Results

In the paper, we will evaluate the performance of the efficient MLE and deep-learning models on two resolutions: 0.0909 (1/11) and 0.0476 (1/21). For the efficient MLE, we will estimate two kinds of data: fractional Brownian surfaces (FBSs) (true data) and fractional Brownian images (FBIs), which are obtained from FBSs saved as gray-level images, thereby losing some finer details, especially for larger image sizes. In addition, we will also calculate the corresponding MSEs for comparison. For deep-learning models—including our proposed deep-learning model with 25 layers, GoogleNet, Xception, ResNet18, MobileNetV2, and SqueezeNet—we will first classify these FBIs and then compute the corresponding Hurst exponents according to the formula of Hurst exponents versus classes and finally compute their MSEs.

3.1. Experimental Settings

To investigate the effectiveness, two kinds of classes—11 Hurst exponents and 21 Hurst exponents—will be considered. For 11 classes, the Hurst exponents are H = 1/22, 3/22, …, and 21/22; for 21 classes, the Hurst exponents are H = 1/42, 3/42, …, and 41/42. Hence, the resolution of 11 classes is 0.0909 (1/11), and the resolution of 21 classes is 0.0476 (1/21).
Similar to Hoefer et al. [27], we generated 1000 realizations (seeds 1–1000, called Set 1) or observations of 2D FBM for each Hurst exponent or class as our dataset, and each realization had five sizes: 8 × 8 × 1, 16 × 16 × 1, 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1. They were saved as images (called FBIs) as well as numerical data (called FBSs, for the efficient MLE) for comparison. In addition, we also generated another 1000 realizations (seeds 1001–2000, called Set 2) as our comparison set. The realizations and appearances/images from Set 2 will not be seen at all by deep-learning models trained on Set 1. Although the appearances or images from Set 1 will not be completely seen through five-fold cross-validation by deep-learning models trained on Set 1, the realizations from Set 1 will be partially seen through five-fold cross-validation by deep-learning models trained on Set 1.
When generating each 2D FBM, we followed this procedure. First, we calculated the covariance matrix of 2D FBM according to Equation (4). Second, we decomposed the covariance matrix using Cholesky factorization to obtain its lower triangular matrix. Third, we produced a realization of standard white Gaussian noise. Finally, we multiplied the lower triangular matrix by the white noise to generate a realization of 2D FBM. Hence, for each size with 1000 observations, we produced 11,000 images and surfaces in total for 11 Hurst exponents and 21,000 images and surfaces in total for 21 Hurst exponents. It is worth mentioning again that FBIs are close to FBSs, with some information or finer details being lost.
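A minimal Python sketch of this generation procedure is given below for small sizes. It evaluates the FBM ACF on a grid taken at coordinates (1..M, 1..N) so that the covariance matrix is positive definite, and it adds a tiny diagonal jitter for numerical stability; both choices are implementation assumptions of ours rather than details stated in the paper.

```python
import numpy as np

def generate_fbm_surface(M, N, H, sigma2=1.0, seed=1):
    """Generate one 2D FBM realization on an M x N grid via Cholesky
    factorization of the FBM covariance (the ACF r_B given in Section 2.1)."""
    coords = np.array([(i, j) for i in range(1, M + 1) for j in range(1, N + 1)],
                      dtype=float)
    d0 = np.linalg.norm(coords, axis=1)                    # distances from the origin
    diff = np.linalg.norm(coords[:, None, :] - coords[None, :, :], axis=2)
    R = 0.5 * sigma2 * (d0[:, None] ** (2 * H) + d0[None, :] ** (2 * H)
                        - diff ** (2 * H))                 # FBM covariance matrix
    L = np.linalg.cholesky(R + 1e-10 * np.eye(M * N))      # small jitter for stability
    w = np.random.default_rng(seed).standard_normal(M * N) # standard white Gaussian noise
    return (L @ w).reshape(M, N)
```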
For a fairer performance comparison of the MLE and deep-learning models, another 32 equally spaced classes or Hurst exponents were generated from Set 1 to train deep-learning models as our comparison models; that is, FBIs of 11 classes from Set 2 were evaluated on three models (among six models) trained on 32 classes from Set 1.
This approach has two purposes. As we know, any estimator will essentially cause errors to occur. The first purpose is to avoid the possibility of zero error: if we indirectly obtain the corresponding MSEs according to the formula of MSEs versus classes, then all correctly classified classes will yield zero errors, which is not fair for a comparison with the MLE.
When FBIs of 11 classes are evaluated on the trained models of 32 classes, no class will result in zero error. For example, the first class of 11 Hurst exponents is 1/22; if the trained models are sufficiently good, then such an image will most likely be classified into one of the first few classes of 32 Hurst exponents: 1/64 (squared error of 8.8980 × 10−04), 3/64 (squared error of 2.0177 × 10−06), or 5/64 (squared error of 0.0011). The best estimate lies in the second class; its squared error is 2.0177 × 10−06, not zero.
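The arithmetic behind this example can be checked with a few lines of Python; the helper below simply maps class indices to the equally spaced class centers $H = (2c-1)/(2K)$ used throughout the paper.

```python
import numpy as np

def class_centers(K):
    """Equally spaced Hurst class centers: H = 1/(2K), 3/(2K), ..., (2K-1)/(2K)."""
    return (2 * np.arange(1, K + 1) - 1) / (2 * K)

H_true = class_centers(11)[0]     # first of the 11 Hurst exponents: 1/22
H_32 = class_centers(32)          # 32-class grid: 1/64, 3/64, ..., 63/64

for c in range(3):                # squared errors for the first three 32-class bins
    print(f"class {c + 1}: H = {H_32[c]:.6f}, "
          f"squared error = {(H_32[c] - H_true) ** 2:.4e}")
# The second class (H = 3/64) gives the smallest squared error, about 2.0177e-06.
```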
The second purpose is that we would like to know whether the hidden realizations can be learned through five-fold cross-validation by deep-learning models. As we know, under five-fold cross-validation, the deep-learning models will not see these validated or tested images, and hence, the operation is theoretically considerably fair. However, during training, the models have partially seen their hidden realizations through five-fold cross-validation.
To illustrate some possible images or appearances of different Hurst exponents, Figure 1 shows two FBIs of H = 1/64 and H = 9/64 with size 128 × 128 × 1; Figure 2 shows two FBIs of H = 17/64 and H = 25/64 with size 128 × 128 × 1; Figure 3 shows two FBIs of H = 33/64 and H = 41/64 with size 128 × 128 × 1; Figure 4 shows two FBIs of H = 49/64 and H = 57/64 with size 128 × 128 × 1. All eight FBIs were generated from the same realization or seed.
In Figure 1, Figure 2, Figure 3 and Figure 4, it is obvious that we cannot easily discriminate the images of two neighboring Hurst exponents with the naked eye, especially for higher Hurst exponents.
For a fair comparison, the six deep-learning models were run in the following operating environment: (1) a computer (Intel® Xeon(R) W-2235 CPU) with a GPU (NVIDIA RTX A4000) for running the models; (2) MATLAB R2022a for programming the models; (3) three solvers or optimizers [35,41] for training the models: sgdm, adaptive moment estimation (adam), and root mean square propagation (rmsprop); an initial learning rate of 0.001; a mini-batch size of 128; a piecewise learning rate schedule; a learning rate drop period of 20; a learning rate drop factor of 0.1; shuffling every epoch; 30 epochs; and a validation frequency of 30. For the efficient MLE, the comparison was performed in the same computing environment as outlined in points (1)–(3).
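The paper trains its models in MATLAB; for readers who prefer an open-source setting, a roughly equivalent training setup in Keras might look like the sketch below. The momentum value of 0.9 (MATLAB's sgdm default) and the placeholder model and data arguments are assumptions on our part, not settings stated in the paper.

```python
import tensorflow as tf

def piecewise_lr(epoch, lr):
    """Piecewise schedule: initial rate 0.001, dropped by a factor of 0.1 every 20 epochs."""
    return 0.001 * (0.1 ** (epoch // 20))

def compile_and_train(model, x_train, y_train, x_val, y_val):
    # sgdm solver; labels are assumed to be one-hot encoded.
    model.compile(optimizer=tf.keras.optimizers.SGD(learning_rate=0.001, momentum=0.9),
                  loss="categorical_crossentropy", metrics=["accuracy"])
    model.fit(x_train, y_train, validation_data=(x_val, y_val),
              epochs=30, batch_size=128, shuffle=True,
              callbacks=[tf.keras.callbacks.LearningRateScheduler(piecewise_lr)])
    return model
```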

3.2. Results of the Maximum Likelihood Estimator

For comparison, two types of data were considered for the Hurst exponents: fractional Brownian surfaces (FBSs) and fractional Brownian images (FBIs), which are obtained from FBSs saved as gray-level images, thereby losing some finer details. Then, the efficient MLE for FBSs and FBIs was run under 11 Hurst exponents over five image sizes and 1000 observations. Since the efficient MLE searches for the optimal Hurst exponent between 0 and 1 through the golden section search, its computational time is extremely high for size 128 × 128 × 1, and hence, only the first 10 out of 1000 observations were estimated, not the complete 1000 observations.
Table 1 shows the MSEs for Set 1 and Set 2 over four image sizes (8 × 8, 16 × 16, 32 × 32, 64 × 64) under 1000 observations and over image size 128 × 128 × 1 under the first 10 out of 1000 observations. For example, under the same computing environment, our current computer takes about 646 s for the estimation of each 128 × 128 × 1 observation, and hence, it would take at least 82 days to complete all 11,000 estimations (11 classes with 1000 observations each). This is another reason why we need the help of deep-learning models.
On the whole, the MSEs in Set 1 are similar to those in Set 2 for FBSs (true image data). As expected, the MSE decreases as the image size increases. However, this is not the case for FBIs because, for image sizes of 64 × 64 and larger, the higher Hurst exponents lose many finer details, leading to worse estimates from the efficient MLE. Obviously, the larger the size, the higher the loss.
It can be expected that the MSEs of FBSs will be smaller than those of FBIs because FBIs will lose some finer details, which are important for estimation by the efficient MLE. Moreover, the gaps between the MSEs of FBSs and FBIs will become larger as the image size increases, especially at 128 × 128. This is because more details are lost as the image size increases; our adopted estimator—the efficient MLE—cannot capture the finer structures of FBM. Although the performance with FBSs is far superior to that with FBIs, the data in the real world are mostly image data, not two-dimensional numerical data.

3.3. Results of Deep-Learning Models

The MLE is the best estimator for 2D FBM; it has the lowest MSE and is an unbiased estimator. The efficient MLE for 2D FBM is the fastest among the MLEs. Nevertheless, the computational costs are still extremely high, especially for larger image sizes. As the hardware for deep-learning models becomes quicker, and as deep-learning models become advanced and reliable, we will naturally pay more attention to this field and think of the ways in which the models can help us reduce the problem of computational costs.
In a previous pilot study [26] with only size 32 × 32 × 1 and three deep-learning models (one simple 29-layer model and two pre-trained models: AlexNet and GoogleNet) with solver or optimizer sgdm, we experimentally showed that deep-learning models are indeed feasible.
In the paper, we will try to design one 25-layer deep-learning model—which is simpler than previously designed models—and we will choose five pre-trained deep-learning models for a more comprehensive MSE comparison between the efficient MLE and deep-learning models, including five image sizes and three solvers (sgdm, adam, and rmsprop). In addition, we also want to know whether deep-learning models can learn the hidden realizations, i.e., whether deep-learning models do not see the validated data or observations (from different Hurst exponents) under five-fold cross-validation but possibly see partial realizations (i.e., partial white noise with the same seed was seen during training).
First, we develop a simple network model—only 25 layers—consisting of four groups. The first group has one input layer; the second group is composed of four layers (a convolutional layer, a batch normalization layer, a ReLU layer, and a maximum pooling layer); the third group is composed of four layers (the same as the second group but without a maximum pooling layer); and finally, the fourth group is composed of three layers (a fully connected layer, a softmax layer, and a classification layer).
Based on the special structures of FBIs, we designed a simple model consisting of regularly configured filters with increasing sizes. The input sizes of our proposed model include five image sizes (8 × 8 × 1, 16 × 16 × 1, 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1). From the first to the sixth convolutional layer, the filter numbers are all 128, with sizes of 3 × 3, 5 × 5, 7 × 7, 9 × 9, 11 × 11, and 13 × 13, respectively. In the future, we will try other combinations of filters and sizes, add other layers—such as dropout layers—and fine-tune their hyperparameters.
The size and stride of all maximum pooling layers are 2 × 2 and 2. According to the number of classes, the output size is 11 or 21. For more clarity, Table 2 shows the detailed architecture of our proposed model.
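Since Table 2 is not reproduced here, the following Keras sketch is only one plausible reading of the description above: six convolutional blocks with 128 filters and kernel sizes 3 to 13, batch normalization and ReLU after each convolution, 2 × 2/stride-2 max pooling after the first three blocks (an assumed placement), and a fully connected softmax classifier.

```python
import tensorflow as tf
from tensorflow.keras import layers, models

def build_simple_model(input_size=32, num_classes=11):
    """Sketch of the proposed 25-layer model (layer placement partly assumed;
    see Table 2 in the paper for the exact architecture)."""
    inputs = layers.Input(shape=(input_size, input_size, 1))
    x = inputs
    for i, k in enumerate([3, 5, 7, 9, 11, 13]):
        x = layers.Conv2D(128, k, padding="same")(x)
        x = layers.BatchNormalization()(x)
        x = layers.ReLU()(x)
        if i < 3:                                   # assumed pooling placement
            x = layers.MaxPooling2D(pool_size=2, strides=2)(x)
    x = layers.Flatten()(x)
    outputs = layers.Dense(num_classes, activation="softmax")(x)
    return models.Model(inputs, outputs)
```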
Based on our proposed 25-layer model, we run five-fold cross-validation on FBIs for five types of image sizes, each Hurst exponent with 1000 realizations from seed 1 to 1000 (Set 1). Under five-fold cross-validation, Table 3 and Table 4 show the classification rates of five folds and their mean and standard deviation for 11 classes and 21 classes, and Table 5 and Table 6 provide their corresponding MSEs of five folds and their mean and standard deviation for 11 classes and 21 classes. For MSEs, we first convert the classified classes to their corresponding Hurst exponents and then calculate the MSEs between the estimated Hurst exponents and the true Hurst exponents.
It is clear in Table 3 and Table 4 that the best accuracies of each solver all occur with size 16 × 16 × 1; that is, the proposed model is more suitable for size 16 × 16 × 1. Obviously, the simple design is not well suited for larger image sizes, such as 64 × 64 × 1 and 128 × 128 × 1, especially when the resolution is finer (21 Hurst exponents). Like all pre-trained models, no model can fit in all databases. Therefore, we can design different architectures of layers for other image sizes in terms of accuracies, MSEs, or other metrics.
Since the values of the Hurst exponent are continuous from 0 to 1, the classes are ordinal, not cardinal; the neighboring classes of the Hurst exponent are closer to each other than distant classes. Under ordinal classes, the misclassified classes of a good model should be closer to or neighboring the correct class. For ordinal classes, the classification rates are only one important indicator of reference, but the most practical metrics should be MSEs. Therefore, the corresponding MSEs for 11 classes and 21 classes—via the formula of MSEs versus classes—are listed for comparison in Table 5 and Table 6.
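Concretely, converting ordinal class predictions into MSEs only requires mapping class indices back to the class centers $H = (2c-1)/(2K)$ and averaging the squared deviations from the true Hurst exponents, as in the short sketch below (class indices are assumed to run from 1 to K).

```python
import numpy as np

def class_centers(K):
    """Equally spaced Hurst class centers: H = 1/(2K), 3/(2K), ..., (2K-1)/(2K)."""
    return (2 * np.arange(1, K + 1) - 1) / (2 * K)

def mse_from_classes(pred_classes, true_H, K):
    """Map predicted class indices (1..K) to Hurst estimates and return the MSE
    against the true Hurst exponents."""
    H_hat = class_centers(K)[np.asarray(pred_classes) - 1]
    return np.mean((H_hat - np.asarray(true_H)) ** 2)
```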
For a concise comparison, our proposed model was further summarized. Table 7 shows the summary of average accuracies for 11 and 21 classes over three solvers (sgdm, adam, and rmsprop), and Table 8 shows the summary of average MSEs (AMSEs, simply called MSEs) for 11 and 21 classes over three solvers (sgdm, adam, and rmsprop).
It is clear in Table 7 and Table 8 that the lowest MSE corresponds to the highest accuracy. For ordinal classes, the consistency between accuracies and MSEs is often used as an effective evaluation indicator for deep-learning models. The more consistent the relationship between accuracies and MSEs, the better designed the model. Obviously, the proposed model is, on the whole, well designed and performs well.
On the other hand, the best MSEs occur at 16 × 16 × 1. This indicates that our proposed simple deep-learning model is more suited to smaller sizes. For other sizes, we need other more complex designs or the adoption of some ready-made or pre-trained models.
In addition, although the accuracies of 11 classes are all higher than those of 21 classes, the MSEs of 21 classes are all lower than those of 11 classes. For a well-designed model, this is very reasonable because the misclassified classes only occur at neighboring classes—not at remote classes. The spacing between two neighboring classes under 21 Hurst exponents is finer than that under 11 Hurst exponents. Accordingly, the MSEs will decrease. It can be reasonably expected that the MSEs for reliable deep-learning models will decrease as the resolution increases (or the spacing between two neighboring classes decreases), but for unqualified deep-learning models, the MSEs will possibly increase because these models prematurely converge or cannot converge during training. They will be implemented and discussed in future work.
Compared with MSEs of the MLE, our proposed simple 25-layer deep-learning model is much better than the MLE. This phenomenon may be due to two possibilities: one is that the correctly classified classes are zero errors; the other is that our proposed model can learn the hidden realizations because our model can see the partial realizations of seeds through five-fold cross-validation during training. Therefore, further evaluation is necessary for a fair comparison.
If the reason falls under the second possibility, it can be said experimentally that even a simple deep-learning model has the super power to learn the hidden realizations, not to mention the advanced pre-trained models.
After a simple deep-learning model was successfully proposed, we further experimented with three pre-trained models, which could be directly—without augmentation—run from size 8 × 8 × 1 to 128 × 128 × 1, and two pre-trained models, which could be directly run from size 32 × 32 × 1 to 128 × 128 × 1. The first group of pre-trained deep-learning models included Xception (71 layers deep, 170 layers in total), ResNet18 (18 layers deep, 71 layers in total), and MobileNetV2 (53 layers deep, 154 layers in total); the second group included GoogleNet (22 layers deep, 144 layers in total) and SqueezeNet (18 layers deep, 68 layers in total).
Since GoogleNet and SqueezeNet contain pooling layers—making the feature maps smaller and smaller—they are not suitable for sizes 8 × 8 × 1 and 16 × 16 × 1. Therefore, in the following four tables (Table 9, Table 10, Table 11 and Table 12), we use x to indicate that no experiment was performed on the corresponding cell. In total, five pre-trained models were evaluated. All five pre-trained models were executed under five-fold cross-validation on the same FBIs. The image sizes for Xception, ResNet18, and MobileNetV2 were 8 × 8 × 1, 16 × 16 × 1, 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1. The image sizes for GoogleNet and SqueezeNet were 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1. For a clear comparison, Table 9, Table 10, Table 11 and Table 12 contain six deep-learning models, including our proposed deep-learning model plus the five pre-trained models.
All five pre-trained models were run under 11 classes and 21 classes. Their mean classification rates plus those of our proposed model are shown in Table 9 for 11 classes and in Table 10 for 21 classes. Their corresponding MSEs plus those of our proposed model are shown in Table 11 for 11 classes and in Table 12 for 21 classes. The corresponding detailed five-fold data of all five pre-trained models are arranged in Table A1, Table A2, Table A3, Table A4, Table A5, Table A6, Table A7, Table A8, Table A9, Table A10, Table A11, Table A12, Table A13, Table A14, Table A15, Table A16, Table A17, Table A18, Table A19 and Table A20 in Appendix A.
In the case of 11 classes, as mentioned previously, our proposed simple deep-learning model—only 25 layers—is more appropriate for smaller sizes, such as 8 × 8 × 1 and 16 × 16 × 1, as well as 32 × 32 × 1 only for the sgdm solver. MobileNetV2 (53 layers deep, 154 layers in total) exhibits an increasing classification rate as the size increases, and Xception (71 layers deep, 170 layers in total) exhibits a similar increasing trend, except with sizes 8 × 8 × 1 (9.10%) and 16 × 16 × 1 (9.09%) with the sgdm solver. This implies that models with deeper layers are beneficial for larger image sizes but detrimental for smaller image sizes. In particular, the sgdm solver is extremely unsuitable for Xception, as performed on our images.
ResNet18 (18 layers deep, 71 layers in total) also exhibits an increasing classification rate as the size increases when the solver is sgdm. When the solver adopted is adam, the best classification rate (99.01%) occurs at 64 × 64 × 1; when the solver adopted is rmsprop, the best classification rate (98.29%) actually occurs at 32 × 32 × 1.
GoogleNet (22 layers deep, 144 layers in total) and SqueezeNet (18 layers deep, 68 layers in total)—only run with data sizes 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1— also exhibit an increasing classification rate as the size increases. Their corresponding MSEs also reflect the trend. It is worth mentioning that an unstable result of mean 49.75% and standard deviation 20.50% occurs in SqueezeNet with the rmsprop solver at 32 × 32 × 1. Its five-fold classification rates are 18.09%, 77.50%, 55.50%, 60.95%, and 36.73% (see Table A5 in Appendix A).
In the case of 21 classes, their performance and trends are similar to those of 11 classes. ResNet18 also exhibits an increasing classification rate as the size increases when the solver is sgdm. When adam and rmsprop are chosen as solvers, their best classification rates of 96.87% and 96.29% both occur at 64 × 64 × 1.
GoogleNet exhibits an increasing classification rate as the size increases; the best classification rate is 95.67%. However, the best classification rate (97.28%) for the adam solver occurs at 64 × 64 × 1, while the best classification rate (91.90%) for the rmsprop solver occurs at 32 × 32 × 1.
In addition, an unstable result of mean 28.64% and standard deviation 23.20% occurs in SqueezeNet with the rmsprop solver at 32 × 32 × 1. Its five-fold classification rates are 4.76%, 57.14%, 4.76%, 21.64%, and 54.88% (see Table A10 in Appendix A).
Xception (71 layers deep, 170 layers in total) also exhibits an increasing trend as image sizes increase, except with sizes 8 × 8 × 1 (4.77%) and 16 × 16 × 1 (4.75%) with the sgdm solver. This also implies that models with deeper layers are beneficial for larger image sizes but detrimental for smaller image sizes. In particular, the sgdm solver is extremely unsuitable for Xception, as performed on our images. Relatively, the performance of MobileNetV2 (53 layers deep, 154 layers in total) steadily increases as the image size increases.
Upon careful observation of the case of Xception with the rmsprop solver and size 64 × 64 × 1 in Table 10 and Table 12, we find that it has the highest classification rate at 98.13%, but its corresponding MSE is 1.63 × 10−03—the largest among the six models and much higher than the others. This indicates that Xception with the rmsprop solver and size 64 × 64 × 1 is not reliable—not even qualified—for the ordinal classes in our image dataset. Figure 5 shows the confusion matrix in question from Fold 5. Blue cells stand for the numbers (diagonal) or percentages (vertical or horizontal) of correct classification and non-blue cells for the numbers or percentages of incorrect classification. The darker the color, the higher the value. Obviously, the sharp increase in MSE is mainly due to 12 class-21 images (H = 0.9762) being classified as class 1 (H = 0.0238). This is a substantial mistake for our ordinal classes. It also indicates that a complex model under normal settings might not be suitable for our image dataset. For further applications, these kinds of complex models should be modified by fine-tuning their hyperparameters in order to achieve higher generalization ability.
As with Xception, some solvers perform better, while others perform worse. To avoid the potential problem of sensitivity or stability with the solvers of deep-learning models, we consider the average accuracies and average MSEs (AMSEs, simply called MSEs) over the three solvers (sgdm, adam, and rmsprop). Table 13 and Table 14 summarize the average accuracies of the six models for 11 and 21 classes, respectively, while Table 15 and Table 16 summarize the corresponding average MSEs for 11 and 21 classes, respectively.
In Table 13 and Table 15, we find that each highest classification rate in Table 13 corresponds exactly to each smallest MSE in Table 15. Likewise, in Table 14 and Table 16, each highest classification rate in Table 14 corresponds almost exactly to each smallest MSE in Table 16, except for image size 32 × 32 × 1, where the highest classification rate is achieved in ResNet18, the second highest in GoogleNet, but the smallest MSE is achieved in GoogleNet, the second smallest in our proposed model, while ResNet18 only achieves the fifth smallest MSE.
If deep-learning models are stable for our image dataset, the MSEs of more classes (21 classes) should be smaller than those of fewer classes (11 classes). However, we find in Table 15 and Table 16 that there are five unstable cases: Xception at 64 × 64 × 1, ResNet18 at 32 × 32 × 1, and SqueezeNet at 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1. Stability is a very important indicator of which model we should adopt.
Compared to Table 1, except for some unstable cases—Xception at 8 × 8 × 1 and 16 × 16 × 1, as well as SqueezeNet at 32 × 32 × 1—our chosen five pre-trained models are almost always superior to the true and best estimator (the MLE) in terms of MSEs. The results seem promising for the future. However, the calculation of MSEs via the formula of MSEs versus classes presents a potential fairness problem, in that the correctly classified classes in our situations are evaluated as zero errors, whereas those in real-world applications all contain certain errors. Accordingly, a further comparison is necessary.
In order to ascertain whether deep-learning models can learn the hidden realizations through different appearances or images (different Hurst exponents) with the same realizations, we evaluated the accuracies of Set 2 (seeds 1001–2000) on models trained on Set 1 (seeds 1–1000) with 11 classes. Table 17 shows the results, and Table 18 shows the corresponding differences between Table 17 (between-set evaluation) and Table 13 (within-set evaluation). For example, the first elements of the first row in Table 17 and Table 13 are 37.83% and 92.70%, respectively, and hence, their difference is approximately −54.86%.
In Table 18, it is obvious that the accuracies of between-set evaluation are almost all lower than those of within-set evaluation, except for our proposed model with size 128 × 128. Table 17 and Table 18 reveal that deep-learning models for smaller sizes can learn more hidden realizations from appearances or images, and hence, they can obtain higher accuracies even when they did not see the appearances in advance but did see some realizations through five-fold cross-validation during training. On the contrary, deep-learning models have a higher generalization ability for larger sizes because they can learn more detailed structures from their appearances, even when they did not see these realizations.
Now that deep-learning models have the ability to learn hidden realizations, for a fair and reasonable comparison, we additionally train three promising models (our proposed model, ResNet18, and MobileNetV2), each with five folds, on Set 1 (seed 1–1000) with 32 classes—hence five trained models for each deep-learning model—and then use Set 2 (seed 1001–2000) as our test set; that is, the trained models definitely did not see the appearances and their corresponding hidden realizations of the test or validation set (Set 2).
As mentioned before, the more classes of a qualified model there are, the lower its MSEs. Table 19 lists the average MSEs of the efficient MLE over 11,000 observations (except for size 128 × 128) and three deep-learning models over 165,000 (3 × 5 × 11,000) observations (three solvers and 11,000 observations per trained model from each fold); Table 20 lists the average computational times per observation of the efficient MLE and three deep-learning models. Since the efficient MLE for size 128 × 128 takes much longer (6.47 × 10+02 s per observation; at least 82 days for all 11,000 observations) to estimate, only the first 10 observations of each class were estimated for comparison.
In Table 19, we can observe that the MSEs of deep-learning models are almost all lower than those of the efficient MLE for FBIs, except for size 32 × 32. In Table 1, we also find that the MSEs of these three deep-learning models for size 8 × 8 are also lower than those of the efficient MLE for FBSs (true image data). For size 16 × 16, our proposed model also results in a lower MSE than the efficient MLE for FBSs, with MobileNetV2 presenting a slightly lower value.
In addition, the MSE of the MLE for FBIs increases as the size increases beyond 32 × 32 because FBIs lose finer details when FBSs are stored as FBIs. Although deep-learning models cannot outperform the MLE for FBSs of larger sizes, real-world data are generally images, not two-dimensional numerical data. The MSE of our proposed simple model decreases almost monotonically as the size increases, except for size 128 × 128, because our model is relatively simple (this may be overcome in the future by modifying the simple model structure). Nevertheless, it is still much better than the MLE for FBIs. For size 128 × 128, MobileNetV2 is the best choice in terms of MSE.
As mentioned before, our main purpose is to find an alternative to the Hurst exponent estimated by the efficient MLE because the latter takes a lot of time for larger sizes. Accordingly, time performance is a very important indicator in our study. It is obvious in Table 20 that the computational time of the efficient MLE increases as the size increases, while that of the three models is approximately the same for all sizes. Therefore, the computational times of the models are almost always less than those of the efficient MLE, except for the smaller sizes of 8 × 8 and 16 × 16. Our proposed model takes the least time among the three models.
In particular, the average computational time of each observation for size 128 × 128 takes 6.46 × 10+02 s, but that of our proposed model only takes 4.14 × 10−03 s. The time ratio of the efficient MLE to our proposed model is approximately 156,248—terribly high. Even for size 64 × 64, the time ratio of the efficient MLE to our proposed model is approximately 3090—still very high.
In terms of the computational time and MSE, our proposed simple deep-learning model should be recommended for indirectly estimating the Hurst exponent. In future, we may design some slightly complex models for sizes larger than 64 × 64 or choose some pre-trained models in order to achieve better MSEs within an acceptable time frame.

4. Discussion

The experimental results from Section 3 present us with an interesting phenomenon, namely that our proposed model can outperform the MLE even on FBSs with smaller sizes: 8 × 8 and 16 × 16. Therefore, in the future, we would also like to know whether we can use the original data (FBSs) to train some deep-learning models to obtain smaller MSEs than the MLE.
When necessary, we typically use traditional estimators (also called direct estimators) to compute the fractal dimensions of images. For further classification, we use these estimates to classify their classes or types, such as benign and malignant. In the past, we often adopted the box-counting method for estimations because it is simple and efficient; however, it has considerably low accuracy, and therefore, it can only be applied in a domain with significant differences of fractal dimension. When applied to small differences, the box-counting method is often unreliable because of its considerably low accuracy, leading to a low recognition effect.
Lately, an efficient MLE for the Hurst exponent of 2D FBM was produced by Chang [24]. In theory, it is the optimal estimator, and to date, the fastest among the MLEs. However, its computational costs are still prohibitive, especially for larger image sizes, and hence, it is not applicable to fast analysis and evaluation. For example, in the current computing environment, it takes about 13.3 s for an image with size 64 × 64 and 646 s for an image with size 128 × 128. If the number of images needed to be estimated is not high, one can perhaps wait; however, the number is generally considerably high in real-world applications.
Sometimes, we would like to add an extra feature as our second input source in order to adopt two-stream models to enhance the classification effects. The extra feature may come from a fractal-dimension characteristic map of an original image. Therefore, it is extremely necessary to replace the traditional estimators with an effective and efficient approach, especially for larger images.
In the paper, we proposed one simple deep-learning model—only 25 layers—and selected five pre-trained models (GoogleNet, Xception, MobileNetV2, ResNet18, and SqueezeNet) for comparison, as well as a double check as to whether deep-learning models are appropriate as indirect estimators.
In general, when the number of classes is small, it is sufficient to analyze the differences between images by computing the estimates with a well-performing traditional estimator and then calculating their means and standard deviations. In this simple case, the classification step through machine learning can be omitted. However, when there are many classes, it is better to classify these estimates using machine learning.
Machine-learning models depend on manually chosen and extracted features, such as the fractal dimension, whereas deep-learning models can automatically capture as many features as possible. Therefore, if deep-learning models perform well, well-trained models can replace the efficient MLE as an effective and efficient indirect estimator and, further, serve as a characteristic transformer.
Recently, Chang and Jeng [26] designed a 29-layer sequential deep-learning model and selected three pre-trained networks (AlexNet, GoogleNet, and Xception) to show experimentally that deep-learning models with the sgdm solver can work well for FBIs of size 32 × 32 × 1; in addition, the augmented GoogleNet was superior to the adjusted AlexNet and the augmented Xception in terms of classification rate and computational cost. In their experiment, Chang and Jeng only used five-fold cross-validation within the same set to compare the performance of the efficient MLE and the three models.
In this paper, we further designed a shallower model (25 layers, as opposed to the 29 implemented previously) and additionally selected three pre-trained models (ResNet18, MobileNetV2, and SqueezeNet) for comparison. The sizes of the FBIs were also expanded to five in total: 8 × 8 × 1, 16 × 16 × 1, 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1. Moreover, three widely used solvers (sgdm, adam, and rmsprop) were considered. Since AlexNet without augmentation cannot be run directly at these five original sizes, it was excluded from the comparison.
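For concreteness, the sketch below shows a small sequential CNN classifier in the same spirit, written in Python/Keras; the layer count, filter widths, and learning rates are illustrative assumptions and not the paper's exact 25-layer architecture, and the solver names sgdm, adam, and rmsprop are simply mapped onto their usual optimizer counterparts.

```python
from tensorflow.keras import layers, models, optimizers

def build_small_classifier(input_size=32, n_classes=11):
    """A small sequential CNN that classifies single-channel FBIs into
    Hurst-exponent classes (illustrative only; not the paper's 25-layer model)."""
    return models.Sequential([
        layers.Input(shape=(input_size, input_size, 1)),
        layers.Conv2D(16, 3, padding="same", activation="relu"),
        layers.BatchNormalization(),
        layers.MaxPooling2D(),
        layers.Conv2D(32, 3, padding="same", activation="relu"),
        layers.BatchNormalization(),
        layers.MaxPooling2D(),
        layers.Conv2D(64, 3, padding="same", activation="relu"),
        layers.BatchNormalization(),
        layers.GlobalAveragePooling2D(),
        layers.Dense(n_classes, activation="softmax"),
    ])

# The three solvers map naturally onto these optimizers.
solvers = {
    "sgdm": optimizers.SGD(learning_rate=1e-2, momentum=0.9),
    "adam": optimizers.Adam(learning_rate=1e-3),
    "rmsprop": optimizers.RMSprop(learning_rate=1e-3),
}

model = build_small_classifier(input_size=32, n_classes=11)
model.compile(optimizer=solvers["sgdm"],
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```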
As is well known, these pre-trained models were trained on 1000 common object categories. They had never seen the hidden structures of FBIs, so one might expect them to perform poorly. Interestingly, they not only surpass the efficient MLE in computational efficiency (except for some pre-trained models at the smaller sizes), but they also outperform the MLE in effectiveness (based on MSEs), except for size 32 × 32, especially for the larger sizes.
In the experiment conducted in this paper, we used two groupings of 11 and 21 Hurst-exponent classes, each class consisting of 1000 realizations with seeds 1–1000 (Set 1), and we evaluated the models through five-fold cross-validation. For distinction, this evaluation method is called within-set evaluation; that is, one part of Set 1 (the validation set) is evaluated on models trained on the remaining part of Set 1 (the training set). Within-set evaluation shows that the best classification rates of the six deep-learning models with three solvers under 11 classes are 93.82% for 8 × 8 × 1, 97.35% for 16 × 16 × 1, 98.40% for 32 × 32 × 1, 99.41% for 64 × 64 × 1, and 99.88% for 128 × 128 × 1; those under 21 classes are 91.75%, 95.30%, 97.59%, 98.28%, and 99.03%, respectively. For a fair comparison with ordinal classes or continuous values, the mean-squared error (MSE) was also chosen as a metric. The MSEs of the deep-learning models largely mirror their corresponding classification rates, except for Xception with the rmsprop solver, and they are all much better than those of the MLE.
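The conversion from classified classes to MSEs can be sketched as follows; the equally spaced class centers on (0, 1) are an assumed convention that is consistent with the 32-class values H = 1/64, 9/64, … appearing in Figures 1–4, while the exact 11- and 21-class grids are not restated here.

```python
import numpy as np

def class_centers(n_classes):
    """Equally spaced Hurst-exponent class centers on (0, 1); for 32 classes
    this gives 1/64, 3/64, ..., 63/64 (an assumed convention)."""
    return (2 * np.arange(n_classes) + 1) / (2 * n_classes)

def classes_to_mse(predicted_classes, true_H, n_classes):
    """Convert predicted class indices to Hurst estimates and compute the MSE."""
    H_hat = class_centers(n_classes)[np.asarray(predicted_classes)]
    return float(np.mean((H_hat - np.asarray(true_H)) ** 2))

# Example with 11 classes: all-correct predictions give zero error, while a
# single one-class mistake contributes an error of (1/11)^2 to the average.
centers = class_centers(11)
true_H = centers[[0, 5, 10]]
print(classes_to_mse([0, 5, 10], true_H, 11))   # 0.0
print(classes_to_mse([1, 5, 10], true_H, 11))   # (1/11)**2 / 3, about 2.75e-03
```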
Converting classes to MSEs for comparison with the MLE involves some unfairness. For example, correctly classified classes are counted as zero error, whereas in practice any estimate must contain some error. In addition, the realizations of every class come from the same seeds, and we do not know whether deep-learning models can learn the hidden realizations from different appearances (different Hurst exponents) that share the same underlying realizations.
To establish whether deep-learning models learn the structure of the realizations, a second set of realizations with seeds 1001–2000 (Set 2) was used to generate another 11 classes of the Hurst exponent. This evaluation method is called between-set evaluation, in that the realizations from Set 2 were evaluated on deep-learning models trained on the realizations from Set 1; that is, one set (Set 2) is evaluated on models trained on another set (Set 1). The experimental results show that the average accuracies over the three solvers under between-set evaluation are much lower than those under within-set evaluation, especially for smaller image sizes. The gaps between within-set and between-set evaluation confirm that deep-learning models do learn the structure of the realizations, even though they only saw them indirectly through the images of 2D FBM rather than seeing the realizations themselves.
The images of 2D FBM, unlike other data, are generated from white Gaussian noise through pseudo-random generators, so the same seed always produces the same realization. During training, randomly grouping the images for five-fold cross-validation therefore places some of the realizations underlying the validation set in the training set as well. On the surface, deep-learning models with cross-validation never see their validation data; in fact, they have indirectly seen part of the hidden realizations during training.
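The following sketch illustrates this point with a hypothetical index table of (class, seed) pairs: random five-fold cross-validation within Set 1 leaves almost every validation seed, and hence its underlying realization, present in the training folds as well, whereas between-set evaluation on Set 2 shares no seeds with Set 1.

```python
from sklearn.model_selection import KFold

# Hypothetical index tables: each image is identified by (class, seed).
# Set 1 uses seeds 1-1000 and Set 2 uses seeds 1001-2000, as described above.
classes = range(11)
set1 = [(c, s) for c in classes for s in range(1, 1001)]
set2 = [(c, s) for c in classes for s in range(1001, 2001)]

# Within-set evaluation: random five-fold cross-validation over Set 1.
kfold = KFold(n_splits=5, shuffle=True, random_state=0)
train_idx, val_idx = next(kfold.split(set1))
train_seeds = {set1[i][1] for i in train_idx}
val_seeds = {set1[i][1] for i in val_idx}
# Nearly every validation seed also appears in the training folds,
# so the underlying realizations are indirectly seen during training.
print(len(val_seeds & train_seeds))          # close to 1000

# Between-set evaluation: train on Set 1, evaluate on Set 2 (disjoint seeds).
print(len({s for _, s in set1} & {s for _, s in set2}))   # 0
```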
For a fair comparison, we further evaluated the realizations from Set 2 on deep-learning models trained on the realizations from Set 1 with 32 classes; under 32 classes, data from the 11 classes can never fall exactly on a correct class, and hence no zero error occurs. The experimental results show that the MSEs of the deep-learning models are almost always lower than those of the MLE, except for images of size 32 × 32 × 1. Moreover, as the image size increases beyond 32 × 32 × 1, the MSE increases rather than decreases, because finer details are lost when FBSs are stored as FBIs.
In terms of computational costs, deep-learning models are much cheaper than the efficient MLE, except for some pre-trained models at the smaller image sizes, and our proposed model is always more efficient than the efficient MLE, not to mention the original MLE. The computational time of a deep-learning model is nearly constant for any image size, whereas that of the efficient MLE increases with image size; accordingly, the computational time of our proposed simple model is overwhelmingly lower than that of the efficient MLE, by a factor of up to 156,248 for size 128 × 128 × 1. Such a gap cannot be ignored.
As we learned, when deep-learning models were evaluated on the same realizations (within-set evaluation), they obtained much higher classification rates and much lower MSEs, even though the images had different appearances; when they were evaluated on different realizations (between-set evaluation), their classification rates dropped and their MSEs rose considerably. The results show that deep-learning models can learn the hidden realizations in an indirect way. For estimation, this is a disadvantage, whereas for data hiding it is an advantage. In data-hiding applications, information can be hidden indirectly in the classes of the Hurst exponent: upon receiving an image of a certain class, a receiver identifies the class and then translates it into the hidden information. The higher the classification rate, the more correctly the information is interpreted. When an attacker intercepts the image data, he/she cannot know which realizations the models were trained on, and hence his/her interpretation will easily be erroneous. How to design deep-learning models in the future is therefore a dilemma; this poses an enormous challenge but also offers a substantial opportunity.
Given the good performance of the deep-learning models, in the future we will establish more model streams for one input image size to combine their respective advantages and enhance the overall performance. For example, our proposed model is more suitable for smaller image sizes, while the pre-trained models are appropriate for middle-to-large image sizes. When well integrated, multiple model streams can be expected to outperform a single model stream. Moreover, we will consider more input sources, such as two image sizes for one model stream or multiple model streams, in order to explore more potential features.
Based on their low MSEs and low computational costs, our simple deep-learning model and the selected pre-trained models can be widely applied to indirect estimation of the Hurst exponent. Unlike traditional estimators such as the MLE, our approach can therefore be viewed as a deep-learning estimator or, more precisely, as an indirect deep-learning estimator that maps classes to Hurst exponents. Furthermore, different models learn their own features according to their respective logic; hence, when indirectly estimating the Hurst exponents of practical images, which are often modeled as 2D FBM but seldom follow the model exactly, they will produce diverse characteristic maps that help us explain more hidden details.
More importantly, the concept of deep-learning models as transformers, converting original images into the corresponding characteristic maps of the Hurst exponent, can be extended to other characteristic maps, such as the spectrum. Finally, with multiple input sources, combining the original images with their characteristic maps is expected to substantially increase the overall recognition rate for medical images. This idea will be explored in the future.
In general, our proposed simple deep-learning model is more suitable for smaller image sizes in terms of classification rate and MSE. This is reasonable because the model contains only 25 layers, with no extra refinements or fine-tuning. It can be expected that the model can be improved further through other operations, such as adding dropout layers and tuning the related hyperparameters.
The excellent performance of the deep-learning models tells us that FBIs can still be learned from curves and/or lines, capturing hidden and subtle patterns from the low to the high layers. That deep-learning models can learn the structure of these random processes may seem unbelievable, but it is a fact.
Although we proposed a simple deep-learning model and selected five pre-trained models for classification, the corresponding MSEs also show that they are better than the MLE, except for size 32 × 32 × 1. At the moment, classification models rather than regression models are chosen because, on the one hand, most deep-learning models are well trained and designed for common object classification, so pre-trained classification models can be adopted directly without modification; intuitively, using classification models and then converting the classes into Hurst estimates should also give smaller MSEs than using regression models. On the other hand, classifying the Hurst exponent offers potential applications in data hiding: the more classes there are, the more data can be hidden, and this kind of data hiding is more difficult for attackers to break.
In the future, on the one hand, we will modify these pre-trained classification models into regression models in order to determine whether regression models are better than classification models; after all, our data type is ordinal, not cardinal. On the other hand, for data hiding, we will further study how to raise the classification rate while increasing the number of Hurst-exponent classes.

5. Potential Applications

As we know, fractal characteristics often exist in the diseased areas of medical images, especially in developing tumors. Tumors typically grow from small volumes into large ones, and hence we must detect and treat them early. To discover such tiny changes, we need to differentiate the small differences between tissues. In the past, we could use traditional or direct estimators, such as the MLE or the box-counting method, to obtain fractal dimensions and then make decisions based on the distribution of these estimates, such as their means and standard deviations.
Since the MLE is the best estimator and the efficient MLE is the most efficient among the MLEs, in the past we would choose the efficient MLE to estimate the Hurst exponent accurately. However, its computation time often hinders, or even prevents, its wide use.
As deep-learning models become more advanced and mature, we have shown experimentally that they can outperform the MLE for FBIs in terms of MSE, except for size 32 × 32 × 1. In addition, the computational cost of deep-learning models remains almost constant as the image size increases, whereas that of the efficient MLE grows. For size 128 × 128 × 1, the computational time of the efficient MLE is about 156,248 times that of our proposed model. For these reasons, in the future, we will replace the efficient MLE for FBIs with deep-learning models, especially our proposed simple model.
When medical images are modeled as FBIs, we can indirectly estimate the Hurst exponent effectively and efficiently by suitably selecting deep-learning models. Although, for FBIs, the fractal dimension (with values between 2 and 3) is more meaningful than the corresponding Hurst exponent (with values between 0 and 1), the Hurst exponent is more suitable as an element of a feature map. For example, we selected a database of chest X-rays from Kaggle [42] as potential subjects. To exclude unimportant content from the images, we first took the smallest size, 127 × 384, as our input size and then clipped the central part of the other images to this size. For illustration, we selected four clipped images as subjects for transforming the original image into a feature map; Figure 6a presents two images from the normal group, and Figure 6b presents two images from the pneumonia group.
In addition to the efficient MLE, we selected three deep-learning models trained on 32 classes (our proposed 25-layer model, ResNet18, and MobileNetV2) as our transformers, which convert an original image into its characteristic map of the Hurst exponent. For the transformation, we first chose 8 × 8 × 1 as the size of each sub-image for estimating the Hurst exponent and then chose 1 × 1 as the stride, or shift, in the horizontal and vertical directions. Sliding from the top-left corner of the original image to the bottom-right corner, we estimated the Hurst exponent of every sub-image and saved these estimates as the elements of the characteristic map.
In the first step, we directly estimated the Hurst exponents of all sub-images using the efficient MLE. In the second step, we indirectly estimated the Hurst exponents of all sub-images using the three deep-learning models mentioned above and then converted their classes to the corresponding Hurst exponents.
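A minimal sketch of this sliding-window transformation is given below; `estimate_hurst` is a placeholder for either the efficient MLE or a trained classification model followed by the class-to-Hurst conversion, and the window size and stride follow the values stated above.

```python
import numpy as np

def hurst_feature_map(image, estimate_hurst, win=8, stride=1):
    """Slide a win x win window over the image with the given stride and apply
    an estimator to every sub-image; the estimates form the characteristic map."""
    rows, cols = image.shape
    out_rows = (rows - win) // stride + 1
    out_cols = (cols - win) // stride + 1
    fmap = np.empty((out_rows, out_cols), dtype=float)
    for i in range(out_rows):
        for j in range(out_cols):
            sub = image[i * stride:i * stride + win, j * stride:j * stride + win]
            fmap[i, j] = estimate_hurst(sub)   # efficient MLE or trained classifier
    return fmap

# For a 127 x 384 crop, an 8 x 8 window, and stride 1, the map is 120 x 377,
# matching the sizes reported for Figures 7-10. The lambda below is only a
# placeholder estimator, not a real Hurst estimator.
crop = np.random.rand(127, 384)
fmap = hurst_feature_map(crop, lambda sub: float(sub.std()))
print(fmap.shape)   # (120, 377)
```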
Figure 7, Figure 8, Figure 9 and Figure 10 show the feature/characteristic maps (120 × 377) generated using the efficient MLE and the three models. Lighter points correspond to larger Hurst exponents, and darker points correspond to smaller ones.
Figure 7, Figure 8, Figure 9 and Figure 10 clearly show that these four sets of characteristic maps are not similar to one another. This is reasonable because practical images generally do not follow the structure of 2D FBM exactly, so the estimated Hurst exponents of the sub-images differ between approaches. As we know, different deep-learning models learn and capture different features from their respective viewpoints. For images with the exact characteristics of 2D FBM, deep-learning models can eventually merge all the learned parameters into the correct classes, as the tables show, and thus different models arrive at the same classes or Hurst exponents. However, when the sub-images are not true 2D FBM images, the results usually differ from one another, unless the structures learned by the various models happen to be similar.
On the surface, the four approaches (one traditional estimator, our proposed simple deep-learning model, and two pre-trained deep-learning models) seem to give different explanations of the same image, but each follows its own logic to classify, or indirectly estimate, the Hurst exponent. Therefore, their interpretations all play meaningful and important roles.
Although practical images described by a certain model, such as 2D FBM, typically do not fit the claimed model perfectly, the different characteristic appearances obtained seem to be a welcome phenomenon. From these various perspectives, different models allow us to extract diverse hidden characteristics. When cleverly integrated, these characteristic maps provide much wider and richer resources for better analysis.
It is clear from Figure 6, Figure 7, Figure 8, Figure 9 and Figure 10 that the characteristic maps, whether estimated by the efficient MLE or produced by the different trained learning models, are in essence simpler than their original images. This is the most important quality of feature engineering [43]: expressing a problem in an easier, simpler way. For effective application, feature engineering requires a deep grasp of the nature of the problem [43]. For most medical images and some natural scenes, the Hurst exponent of 2D FBM is a very appropriate indicator; therefore, the trained deep-learning models are effective as a feature-engineering tool.
For applications, two factors are important: effectiveness and efficiency. Naturally, we would also like to know how efficient the models are for feature engineering. As shown in Table 20, the computational time of our proposed 25-layer model is clearly the lowest among the four approaches. For size 8 × 8 × 1, only our proposed model is faster than the efficient MLE, but for larger sizes, the deep-learning models are almost always more efficient than the efficient MLE, except for MobileNetV2 at size 16 × 16 × 1. For size 128 × 128 × 1, the efficient MLE needs about 156,248 times as much time as our proposed model, 69,018 times as much as ResNet18, and 19,211 times as much as MobileNetV2.
In the future, we will search for other deep-learning models for classification as well as regression and use the promising ones to capture various features. These diverse features can then be integrated with the original images to establish multiple-stream or multiple-input deep-learning models for classifying more complex images, with higher performance than a single-input model.

6. Conclusions

An efficient MLE for 2D FBM was recently proposed for direct estimation of the Hurst exponent, replacing the traditional MLE for higher efficiency. The MLE for 2D FBM is the optimal estimator in terms of accuracy, but its computational costs are still high enough to hinder adoption. Therefore, the purpose of this paper was to explore whether deep-learning models can overcome the problem of high computing costs within an acceptable time frame and with satisfactory accuracy.
In this paper, we proposed deep-learning estimators for the Hurst exponent of 2D FBM. First, we generated 2D FBM data, called fractional Brownian surfaces (FBSs), and stored them as images, simply called fractional Brownian images (FBIs). Since most pre-trained models were trained on 1000 object categories, we used deep-learning models for classification, not regression, to indirectly estimate the Hurst exponent by converting the classified classes to their corresponding Hurst exponents. Accordingly, we segmented the Hurst exponent into two groups of classes, 11 and 21 classes, each with equal spacing. For a fair comparison with the efficient maximum likelihood estimator (MLE) in terms of mean-squared errors (MSEs) and computational time, we also used one group of 32 equally spaced classes.
In the experiment, we used two sets of images. In the first set, called Set 1, each class contained 1000 FBIs from 1000 realizations generated by a pseudo-random generator with seeds 1–1000. For further analysis and a fair comparison, we also generated a second set from seeds 1001–2000, called Set 2. Five image sizes were considered: 8 × 8 × 1, 16 × 16 × 1, 32 × 32 × 1, 64 × 64 × 1, and 128 × 128 × 1. We then designed one simple 25-layer deep-learning model, selected five pre-trained models (Xception, ResNet18, MobileNetV2, GoogleNet, and SqueezeNet), and trained them on these images under three common solvers: sgdm, adam, and rmsprop.
In the first experiment, we used five-fold cross-validation to evaluate the six models on Set 1; this approach is called within-set evaluation. The results showed that the deep-learning models had very high classification rates and that their corresponding MSEs were much lower than those of the efficient MLE. To ascertain whether deep-learning models learn the hidden realizations, in the second experiment we used FBIs from Set 2 with 11 classes to evaluate the six models trained on Set 1; this approach is called between-set evaluation. Each deep-learning model comprised five trained models, one per fold. Compared with within-set evaluation, between-set evaluation showed that the classification rates of the deep-learning models decreased considerably, especially for smaller image sizes. The large gaps between within-set and between-set evaluation confirmed that deep-learning models can learn the hidden realizations from different appearances based on different Hurst exponents, especially for smaller image sizes. This indicates that deep-learning models have the power to learn hard-to-detect features even from a small part of the background realizations.
In the previous two experiments, correctly classified classes were counted as zero error; however, any estimator, even the MLE, contains estimation errors. To avoid an unfair comparison with the MLE, in the third experiment we used images from Set 2 to evaluate three of the six models trained on 32 classes. Accordingly, no so-called correct class could appear, making the comparison fair. The results showed that the MSEs of the three models were almost always lower than those of the MLE, except for size 32 × 32 × 1. In addition, the computational efficiency of the three models was almost always higher than that of the efficient MLE, except for two pre-trained models at smaller image sizes. Our proposed simple model is quite suitable for our case study of FBIs. For size 128 × 128 × 1, the computational time of the efficient MLE was 156,248, 69,018, and 19,211 times that of our proposed model, ResNet18, and MobileNetV2, respectively, which is an extremely large gap. Therefore, deep-learning models are very suitable replacements for the efficient MLE for FBIs; for FBSs, the subject merits another case study.
In terms of computational efficiency and accuracy, deep-learning models are indeed a good choice for indirectly estimating the Hurst exponent. In particular, they have practical value when applied to larger datasets as transformers that convert the original images into characteristic maps, where time is money. The experimental results showed that lightweight models are more suitable for smaller image sizes; for example, the middle-sized Xception under the rmsprop solver has high classification rates, but its MSEs are higher than those of the lighter models. In the future, we will continue to consider larger image sizes to explore whether they require larger models.

Funding

This work was supported by the National Science and Technology Council, Taiwan, Republic of China, under Grant NSTC 112-2221-E-040-008.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A

Table A1. The five-fold classification rates for five types of image sizes and three solvers under 11 Hurst exponents: Xception.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 9.05% | 9.14% | 9.14% | 9.09% | 9.09% | 9.10% | 0.03%
sgdm | 16 × 16 × 1 | 9.09% | 9.09% | 9.05% | 9.09% | 9.14% | 9.09% | 0.03%
sgdm | 32 × 32 × 1 | 77.91% | 80.09% | 82.32% | 79.27% | 80.27% | 79.97% | 1.44%
sgdm | 64 × 64 × 1 | 87.14% | 87.68% | 87.64% | 85.27% | 86.73% | 86.89% | 0.88%
sgdm | 128 × 128 × 1 | 95.82% | 95.05% | 96.41% | 94.95% | 95.68% | 95.58% | 0.54%
adam | 8 × 8 × 1 | 84.05% | 80.86% | 77.27% | 82.86% | 83.09% | 81.63% | 2.41%
adam | 16 × 16 × 1 | 95.32% | 96.73% | 95.55% | 96.45% | 96.59% | 96.13% | 0.58%
adam | 32 × 32 × 1 | 98.23% | 98.68% | 98.41% | 99.00% | 97.68% | 98.40% | 0.44%
adam | 64 × 64 × 1 | 99.09% | 99.32% | 99.73% | 99.14% | 99.77% | 99.41% | 0.29%
adam | 128 × 128 × 1 | 99.86% | 99.95% | 99.77% | 99.82% | 100.00% | 99.88% | 0.08%
rmsprop | 8 × 8 × 1 | 76.64% | 77.23% | 77.14% | 82.41% | 79.59% | 78.60% | 2.16%
rmsprop | 16 × 16 × 1 | 94.18% | 95.68% | 95.32% | 95.00% | 96.23% | 95.28% | 0.68%
rmsprop | 32 × 32 × 1 | 98.59% | 98.86% | 97.18% | 98.45% | 97.64% | 98.15% | 0.63%
rmsprop | 64 × 64 × 1 | 99.36% | 99.36% | 99.41% | 99.27% | 99.41% | 99.36% | 0.05%
rmsprop | 128 × 128 × 1 | 99.82% | 99.86% | 99.64% | 99.86% | 99.77% | 99.79% | 0.08%
Table A2. The five-fold classification rates for five types of image sizes and three solvers under 11 Hurst exponents: ResNet18.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 58.68% | 59.45% | 59.68% | 59.82% | 58.86% | 59.30% | 0.45%
sgdm | 16 × 16 × 1 | 81.27% | 82.36% | 82.27% | 83.73% | 82.41% | 82.41% | 0.78%
sgdm | 32 × 32 × 1 | 91.95% | 91.68% | 92.64% | 92.32% | 90.64% | 91.85% | 0.69%
sgdm | 64 × 64 × 1 | 95.14% | 94.14% | 95.32% | 95.27% | 94.18% | 94.81% | 0.53%
sgdm | 128 × 128 × 1 | 97.23% | 97.50% | 97.86% | 97.55% | 97.36% | 97.50% | 0.21%
adam | 8 × 8 × 1 | 73.91% | 80.27% | 77.23% | 50.77% | 73.05% | 71.05% | 10.46%
adam | 16 × 16 × 1 | 92.73% | 88.64% | 81.82% | 92.91% | 93.27% | 89.87% | 4.37%
adam | 32 × 32 × 1 | 98.09% | 97.68% | 97.91% | 98.09% | 98.64% | 98.08% | 0.32%
adam | 64 × 64 × 1 | 98.41% | 99.09% | 98.95% | 98.77% | 99.82% | 99.01% | 0.46%
adam | 128 × 128 × 1 | 98.05% | 99.27% | 98.14% | 97.95% | 98.27% | 98.34% | 0.48%
rmsprop | 8 × 8 × 1 | 65.77% | 63.32% | 56.09% | 47.09% | 61.86% | 58.83% | 6.68%
rmsprop | 16 × 16 × 1 | 91.36% | 93.00% | 92.41% | 92.41% | 92.36% | 92.31% | 0.53%
rmsprop | 32 × 32 × 1 | 99.00% | 98.05% | 97.86% | 98.27% | 98.27% | 98.29% | 0.39%
rmsprop | 64 × 64 × 1 | 98.86% | 97.59% | 96.55% | 98.59% | 99.14% | 98.15% | 0.96%
rmsprop | 128 × 128 × 1 | 97.18% | 97.82% | 97.59% | 99.18% | 96.91% | 97.74% | 0.79%
Table A3. The five-fold classification rates for five types of image sizes and three solvers under 11 Hurst exponents: MobileNetV2.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 35.68% | 36.95% | 35.23% | 33.18% | 35.45% | 35.30% | 1.22%
sgdm | 16 × 16 × 1 | 68.95% | 63.91% | 68.09% | 70.18% | 68.59% | 67.95% | 2.13%
sgdm | 32 × 32 × 1 | 88.91% | 87.73% | 89.68% | 85.64% | 88.00% | 87.99% | 1.36%
sgdm | 64 × 64 × 1 | 94.86% | 94.55% | 95.27% | 95.36% | 94.55% | 94.92% | 0.35%
sgdm | 128 × 128 × 1 | 99.05% | 98.45% | 98.59% | 98.64% | 98.55% | 98.65% | 0.20%
adam | 8 × 8 × 1 | 32.55% | 35.27% | 32.41% | 32.23% | 38.95% | 34.28% | 2.59%
adam | 16 × 16 × 1 | 65.73% | 78.50% | 64.73% | 54.32% | 52.77% | 63.21% | 9.28%
adam | 32 × 32 × 1 | 96.45% | 96.18% | 94.23% | 95.55% | 95.64% | 95.61% | 0.77%
adam | 64 × 64 × 1 | 98.73% | 98.64% | 99.14% | 96.73% | 99.23% | 98.49% | 0.91%
adam | 128 × 128 × 1 | 99.73% | 99.77% | 99.23% | 99.45% | 99.73% | 99.58% | 0.21%
rmsprop | 8 × 8 × 1 | 48.55% | 58.77% | 45.18% | 55.55% | 45.32% | 50.67% | 5.53%
rmsprop | 16 × 16 × 1 | 89.36% | 88.64% | 84.18% | 90.09% | 91.64% | 88.78% | 2.51%
rmsprop | 32 × 32 × 1 | 97.18% | 97.05% | 97.36% | 97.68% | 97.05% | 97.26% | 0.24%
rmsprop | 64 × 64 × 1 | 98.50% | 99.55% | 98.50% | 98.36% | 98.77% | 98.74% | 0.43%
rmsprop | 128 × 128 × 1 | 99.45% | 99.50% | 99.64% | 99.50% | 99.50% | 99.52% | 0.06%
Table A4. The five-fold classification rates for five types of image sizes and three solvers under 11 Hurst exponents: GoogleNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 92.41% | 92.68% | 92.95% | 91.50% | 93.45% | 92.60% | 0.65%
sgdm | 64 × 64 × 1 | 96.50% | 96.32% | 97.91% | 96.82% | 97.50% | 97.01% | 0.60%
sgdm | 128 × 128 × 1 | 99.09% | 98.73% | 99.00% | 98.64% | 98.82% | 98.85% | 0.17%
adam | 32 × 32 × 1 | 95.64% | 96.27% | 97.27% | 96.45% | 95.82% | 96.29% | 0.57%
adam | 64 × 64 × 1 | 98.09% | 98.18% | 98.59% | 97.82% | 99.09% | 98.35% | 0.44%
adam | 128 × 128 × 1 | 99.32% | 99.05% | 99.18% | 99.05% | 98.00% | 98.92% | 0.47%
rmsprop | 32 × 32 × 1 | 91.86% | 91.59% | 92.68% | 90.41% | 91.41% | 91.59% | 0.73%
rmsprop | 64 × 64 × 1 | 94.91% | 94.41% | 95.05% | 92.59% | 97.05% | 94.80% | 1.42%
rmsprop | 128 × 128 × 1 | 97.64% | 95.32% | 96.91% | 96.59% | 96.05% | 96.50% | 0.78%
Table A5. The five-fold classification rates for five types of image sizes and three solvers under 11 Hurst exponents: SqueezeNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 84.05% | 85.50% | 85.00% | 86.86% | 86.32% | 85.55% | 0.99%
sgdm | 64 × 64 × 1 | 97.23% | 96.45% | 97.64% | 96.73% | 97.82% | 97.17% | 0.52%
sgdm | 128 × 128 × 1 | 98.50% | 96.14% | 98.95% | 98.73% | 98.00% | 98.06% | 1.01%
adam | 32 × 32 × 1 | 94.23% | 94.95% | 87.95% | 93.59% | 94.05% | 92.95% | 2.54%
adam | 64 × 64 × 1 | 98.59% | 98.91% | 98.55% | 98.32% | 98.95% | 98.66% | 0.24%
adam | 128 × 128 × 1 | 99.05% | 99.27% | 99.64% | 99.41% | 99.36% | 99.35% | 0.19%
rmsprop | 32 × 32 × 1 | 18.09% | 77.50% | 55.50% | 60.95% | 36.73% | 49.75% | 20.50%
rmsprop | 64 × 64 × 1 | 96.23% | 95.64% | 96.18% | 95.82% | 97.32% | 96.24% | 0.58%
rmsprop | 128 × 128 × 1 | 98.32% | 97.86% | 99.14% | 97.77% | 98.32% | 98.28% | 0.48%
Table A6. The five-fold classification rates for five types of image sizes and three solvers under 21 Hurst exponents: Xception.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 4.79% | 4.79% | 4.76% | 4.76% | 4.76% | 4.77% | 0.01%
sgdm | 16 × 16 × 1 | 4.76% | 4.74% | 4.74% | 4.76% | 4.76% | 4.75% | 0.01%
sgdm | 32 × 32 × 1 | 82.60% | 81.64% | 81.33% | 83.81% | 82.36% | 82.35% | 0.86%
sgdm | 64 × 64 × 1 | 87.71% | 88.55% | 87.86% | 88.64% | 84.40% | 87.43% | 1.56%
sgdm | 128 × 128 × 1 | 93.98% | 94.17% | 94.24% | 94.36% | 93.98% | 94.14% | 0.15%
adam | 8 × 8 × 1 | 76.48% | 78.55% | 77.24% | 80.14% | 78.26% | 78.13% | 1.25%
adam | 16 × 16 × 1 | 95.26% | 94.62% | 93.76% | 94.43% | 95.71% | 94.76% | 0.68%
adam | 32 × 32 × 1 | 97.21% | 96.64% | 98.36% | 97.48% | 98.26% | 97.59% | 0.65%
adam | 64 × 64 × 1 | 98.88% | 98.21% | 97.33% | 98.38% | 98.60% | 98.28% | 0.52%
adam | 128 × 128 × 1 | 99.10% | 99.14% | 98.90% | 98.95% | 99.07% | 99.03% | 0.09%
rmsprop | 8 × 8 × 1 | 78.88% | 78.07% | 79.36% | 77.62% | 76.55% | 78.10% | 0.98%
rmsprop | 16 × 16 × 1 | 95.24% | 93.81% | 95.05% | 93.36% | 95.57% | 94.60% | 0.86%
rmsprop | 32 × 32 × 1 | 97.76% | 94.86% | 97.48% | 96.36% | 97.31% | 96.75% | 1.06%
rmsprop | 64 × 64 × 1 | 98.50% | 98.02% | 97.81% | 97.29% | 99.05% | 98.13% | 0.60%
rmsprop | 128 × 128 × 1 | 99.05% | 98.90% | 99.14% | 97.38% | 99.19% | 98.73% | 0.68%
Table A7. The five-fold classification rates for five types of image sizes and three solvers under 21 Hurst exponents: ResNet18.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 60.50% | 63.33% | 64.45% | 57.81% | 63.60% | 61.94% | 2.45%
sgdm | 16 × 16 × 1 | 82.31% | 83.88% | 84.21% | 82.86% | 81.31% | 82.91% | 1.06%
sgdm | 32 × 32 × 1 | 92.24% | 91.05% | 91.40% | 91.60% | 90.57% | 91.37% | 0.56%
sgdm | 64 × 64 × 1 | 95.00% | 94.62% | 94.90% | 94.36% | 93.62% | 94.50% | 0.49%
sgdm | 128 × 128 × 1 | 95.29% | 96.24% | 95.98% | 96.17% | 96.24% | 95.98% | 0.36%
adam | 8 × 8 × 1 | 60.62% | 79.48% | 74.81% | 71.86% | 64.12% | 70.18% | 6.91%
adam | 16 × 16 × 1 | 74.76% | 84.64% | 86.43% | 82.76% | 88.79% | 83.48% | 4.79%
adam | 32 × 32 × 1 | 96.93% | 97.38% | 96.67% | 95.00% | 95.57% | 96.31% | 0.89%
adam | 64 × 64 × 1 | 98.64% | 96.26% | 96.31% | 95.36% | 97.76% | 96.87% | 1.18%
adam | 128 × 128 × 1 | 95.83% | 94.64% | 96.64% | 91.29% | 94.48% | 94.58% | 1.83%
rmsprop | 8 × 8 × 1 | 70.43% | 72.64% | 71.17% | 78.05% | 75.33% | 73.52% | 2.82%
rmsprop | 16 × 16 × 1 | 88.33% | 80.29% | 86.38% | 84.67% | 84.52% | 84.84% | 2.66%
rmsprop | 32 × 32 × 1 | 97.55% | 93.31% | 96.07% | 95.33% | 95.19% | 95.49% | 1.37%
rmsprop | 64 × 64 × 1 | 98.21% | 91.93% | 96.05% | 97.43% | 97.81% | 96.29% | 2.30%
rmsprop | 128 × 128 × 1 | 95.83% | 95.57% | 90.12% | 96.10% | 90.00% | 93.52% | 2.83%
Table A8. The five-fold classification rates for five types of image sizes and three solvers under 21 Hurst exponents: MobileNetV2.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 31.14% | 32.29% | 34.02% | 34.29% | 32.40% | 32.83% | 1.17%
sgdm | 16 × 16 × 1 | 68.21% | 66.38% | 67.21% | 67.14% | 67.02% | 67.20% | 0.59%
sgdm | 32 × 32 × 1 | 86.60% | 82.79% | 88.26% | 86.88% | 87.95% | 86.50% | 1.96%
sgdm | 64 × 64 × 1 | 93.12% | 92.93% | 93.12% | 92.88% | 93.64% | 93.14% | 0.27%
sgdm | 128 × 128 × 1 | 95.48% | 95.07% | 96.17% | 96.21% | 95.48% | 95.68% | 0.44%
adam | 8 × 8 × 1 | 21.21% | 17.95% | 21.81% | 40.57% | 29.40% | 26.19% | 8.11%
adam | 16 × 16 × 1 | 47.45% | 57.74% | 64.38% | 74.02% | 52.95% | 59.31% | 9.23%
adam | 32 × 32 × 1 | 91.36% | 82.95% | 87.19% | 91.45% | 93.36% | 89.26% | 3.74%
adam | 64 × 64 × 1 | 95.43% | 95.88% | 97.60% | 97.24% | 97.24% | 96.68% | 0.86%
adam | 128 × 128 × 1 | 98.67% | 98.05% | 98.64% | 99.07% | 98.05% | 98.50% | 0.40%
rmsprop | 8 × 8 × 1 | 62.52% | 57.12% | 63.50% | 49.57% | 63.31% | 59.20% | 5.36%
rmsprop | 16 × 16 × 1 | 83.05% | 79.29% | 87.14% | 84.93% | 86.43% | 84.17% | 2.82%
rmsprop | 32 × 32 × 1 | 90.88% | 94.48% | 96.62% | 94.00% | 94.29% | 94.05% | 1.84%
rmsprop | 64 × 64 × 1 | 95.57% | 96.71% | 97.64% | 96.05% | 96.62% | 96.52% | 0.70%
rmsprop | 128 × 128 × 1 | 98.40% | 98.00% | 98.55% | 97.69% | 97.60% | 98.05% | 0.38%
Table A9. The five-fold classification rates for five types of image sizes and three solvers under 21 Hurst exponents: GoogleNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 92.60% | 91.40% | 93.24% | 92.76% | 92.17% | 92.43% | 0.62%
sgdm | 64 × 64 × 1 | 95.26% | 95.05% | 94.88% | 94.83% | 95.21% | 95.05% | 0.17%
sgdm | 128 × 128 × 1 | 95.98% | 95.60% | 95.48% | 95.36% | 95.93% | 95.67% | 0.25%
adam | 32 × 32 × 1 | 95.71% | 95.67% | 96.50% | 96.38% | 96.19% | 96.09% | 0.34%
adam | 64 × 64 × 1 | 97.40% | 97.17% | 97.05% | 97.21% | 97.57% | 97.28% | 0.19%
adam | 128 × 128 × 1 | 97.10% | 97.10% | 96.29% | 96.45% | 96.95% | 96.78% | 0.34%
rmsprop | 32 × 32 × 1 | 91.90% | 91.21% | 93.45% | 91.38% | 91.57% | 91.90% | 0.81%
rmsprop | 64 × 64 × 1 | 89.43% | 87.14% | 87.93% | 89.10% | 89.90% | 88.70% | 1.02%
rmsprop | 128 × 128 × 1 | 90.52% | 91.69% | 90.38% | 90.24% | 91.69% | 90.90% | 0.65%
Table A10. The five-fold classification rates for five types of image sizes and three solvers under 21 Hurst exponents: SqueezeNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 72.83% | 68.07% | 74.14% | 81.10% | 77.67% | 74.76% | 4.41%
sgdm | 64 × 64 × 1 | 88.76% | 89.29% | 88.50% | 88.69% | 86.69% | 88.39% | 0.89%
sgdm | 128 × 128 × 1 | 94.31% | 92.62% | 93.52% | 93.48% | 93.45% | 93.48% | 0.54%
adam | 32 × 32 × 1 | 84.10% | 74.05% | 64.74% | 74.21% | 85.36% | 76.49% | 7.56%
adam | 64 × 64 × 1 | 96.24% | 96.29% | 96.74% | 96.05% | 95.55% | 96.17% | 0.39%
adam | 128 × 128 × 1 | 97.60% | 96.24% | 97.29% | 97.52% | 97.43% | 97.21% | 0.50%
rmsprop | 32 × 32 × 1 | 4.76% | 57.14% | 4.76% | 21.64% | 54.88% | 28.64% | 23.20%
rmsprop | 64 × 64 × 1 | 84.31% | 83.33% | 85.43% | 83.79% | 83.67% | 84.10% | 0.73%
rmsprop | 128 × 128 × 1 | 91.17% | 91.69% | 92.10% | 92.67% | 92.90% | 92.10% | 0.63%
Table A11. The five-fold MSEs for five types of image sizes and three solvers under 11 Hurst exponents: Xception.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 1.57 × 10−01 | 2.89 × 10−01 | 2.15 × 10−01 | 9.10 × 10−02 | 8.28 × 10−02 | 1.67 × 10−01 | 7.76 × 10−02
sgdm | 16 × 16 × 1 | 1.57 × 10−01 | 2.89 × 10−01 | 2.15 × 10−01 | 9.17 × 10−02 | 8.27 × 10−02 | 1.67 × 10−01 | 7.76 × 10−02
sgdm | 32 × 32 × 1 | 2.52 × 10−03 | 2.07 × 10−03 | 1.75 × 10−03 | 2.13 × 10−03 | 2.02 × 10−03 | 2.10 × 10−03 | 2.50 × 10−04
sgdm | 64 × 64 × 1 | 1.07 × 10−03 | 1.04 × 10−03 | 1.03 × 10−03 | 1.25 × 10−03 | 1.11 × 10−03 | 1.10 × 10−03 | 7.94 × 10−05
sgdm | 128 × 128 × 1 | 3.46 × 10−04 | 4.09 × 10−04 | 2.97 × 10−04 | 4.17 × 10−04 | 3.57 × 10−04 | 3.65 × 10−04 | 4.42 × 10−05
adam | 8 × 8 × 1 | 1.65 × 10−03 | 2.07 × 10−03 | 2.46 × 10−03 | 2.62 × 10−03 | 2.03 × 10−03 | 2.17 × 10−03 | 3.40 × 10−04
adam | 16 × 16 × 1 | 4.09 × 10−04 | 2.70 × 10−04 | 3.68 × 10−04 | 3.27 × 10−04 | 3.16 × 10−04 | 3.38 × 10−04 | 4.73 × 10−05
adam | 32 × 32 × 1 | 1.47 × 10−04 | 1.09 × 10−04 | 1.31 × 10−04 | 8.26 × 10−05 | 1.92 × 10−04 | 1.32 × 10−04 | 3.67 × 10−05
adam | 64 × 64 × 1 | 7.51 × 10−05 | 5.63 × 10−05 | 2.25 × 10−05 | 7.14 × 10−05 | 1.88 × 10−05 | 4.88 × 10−05 | 2.39 × 10−05
adam | 128 × 128 × 1 | 1.13 × 10−05 | 3.76 × 10−06 | 1.88 × 10−05 | 1.50 × 10−05 | 0.00 × 10+00 | 9.77 × 10−06 | 6.97 × 10−06
rmsprop | 8 × 8 × 1 | 2.47 × 10−03 | 2.47 × 10−03 | 2.58 × 10−03 | 1.83 × 10−03 | 2.34 × 10−03 | 2.34 × 10−03 | 2.66 × 10−04
rmsprop | 16 × 16 × 1 | 5.03 × 10−04 | 7.51 × 10−04 | 5.67 × 10−04 | 8.08 × 10−04 | 3.57 × 10−04 | 5.97 × 10−04 | 1.65 × 10−04
rmsprop | 32 × 32 × 1 | 1.16 × 10−04 | 9.39 × 10−05 | 2.33 × 10−04 | 1.28 × 10−04 | 2.07 × 10−04 | 1.56 × 10−04 | 5.42 × 10−05
rmsprop | 64 × 64 × 1 | 5.26 × 10−05 | 5.26 × 10−05 | 4.88 × 10−05 | 6.01 × 10−05 | 4.88 × 10−05 | 5.26 × 10−05 | 4.12 × 10−06
rmsprop | 128 × 128 × 1 | 1.50 × 10−05 | 1.13 × 10−05 | 3.01 × 10−05 | 1.13 × 10−05 | 1.88 × 10−05 | 1.73 × 10−05 | 6.97 × 10−06
Table A12. The five-fold MSEs for five types of image sizes and three solvers under 11 Hurst exponents: ResNet18.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 5.66 × 10−03 | 5.52 × 10−03 | 5.62 × 10−03 | 5.54 × 10−03 | 5.60 × 10−03 | 5.59 × 10−03 | 5.06 × 10−05
sgdm | 16 × 16 × 1 | 1.98 × 10−03 | 1.68 × 10−03 | 1.59 × 10−03 | 1.54 × 10−03 | 1.75 × 10−03 | 1.71 × 10−03 | 1.56 × 10−04
sgdm | 32 × 32 × 1 | 6.65 × 10−04 | 7.33 × 10−04 | 6.54 × 10−04 | 6.46 × 10−04 | 7.96 × 10−04 | 6.99 × 10−04 | 5.77 × 10−05
sgdm | 64 × 64 × 1 | 4.13 × 10−04 | 4.85 × 10−04 | 3.87 × 10−04 | 3.91 × 10−04 | 4.81 × 10−04 | 4.31 × 10−04 | 4.30 × 10−05
sgdm | 128 × 128 × 1 | 2.29 × 10−04 | 2.07 × 10−04 | 1.77 × 10−04 | 2.03 × 10−04 | 2.18 × 10−04 | 2.07 × 10−04 | 1.76 × 10−05
adam | 8 × 8 × 1 | 2.76 × 10−03 | 2.09 × 10−03 | 2.73 × 10−03 | 7.01 × 10−03 | 3.02 × 10−03 | 3.52 × 10−03 | 1.77 × 10−03
adam | 16 × 16 × 1 | 6.80 × 10−04 | 9.84 × 10−04 | 1.54 × 10−03 | 6.20 × 10−04 | 6.35 × 10−04 | 8.91 × 10−04 | 3.49 × 10−04
adam | 32 × 32 × 1 | 1.58 × 10−04 | 1.92 × 10−04 | 1.84 × 10−04 | 1.58 × 10−04 | 1.13 × 10−04 | 1.61 × 10−04 | 2.76 × 10−05
adam | 64 × 64 × 1 | 1.31 × 10−04 | 7.51 × 10−05 | 8.64 × 10−05 | 1.01 × 10−04 | 1.50 × 10−05 | 8.19 × 10−05 | 3.84 × 10−05
adam | 128 × 128 × 1 | 1.62 × 10−04 | 6.01 × 10−05 | 1.54 × 10−04 | 1.69 × 10−04 | 1.43 × 10−04 | 1.37 × 10−04 | 3.97 × 10−05
rmsprop | 8 × 8 × 1 | 3.99 × 10−03 | 5.03 × 10−03 | 6.12 × 10−03 | 8.19 × 10−03 | 5.02 × 10−03 | 5.67 × 10−03 | 1.43 × 10−03
rmsprop | 16 × 16 × 1 | 8.38 × 10−04 | 6.35 × 10−04 | 6.50 × 10−04 | 6.50 × 10−04 | 6.76 × 10−04 | 6.90 × 10−04 | 7.52 × 10−05
rmsprop | 32 × 32 × 1 | 8.26 × 10−05 | 1.62 × 10−04 | 1.77 × 10−04 | 1.43 × 10−04 | 1.43 × 10−04 | 1.41 × 10−04 | 3.19 × 10−05
rmsprop | 64 × 64 × 1 | 9.39 × 10−05 | 1.99 × 10−04 | 2.85 × 10−04 | 1.16 × 10−04 | 7.14 × 10−05 | 1.53 × 10−04 | 7.90 × 10−05
rmsprop | 128 × 128 × 1 | 2.33 × 10−04 | 1.80 × 10−04 | 1.99 × 10−04 | 6.76 × 10−05 | 2.55 × 10−04 | 1.87 × 10−04 | 6.52 × 10−05
Table A13. The five-fold MSEs for five types of image sizes and three solvers under 11 Hurst exponents: MobileNetV2.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 1.24 × 10−02 | 1.08 × 10−02 | 1.23 × 10−02 | 1.43 × 10−02 | 1.23 × 10−02 | 1.24 × 10−02 | 1.11 × 10−03
sgdm | 16 × 16 × 1 | 2.93 × 10−03 | 3.50 × 10−03 | 3.11 × 10−03 | 2.86 × 10−03 | 3.04 × 10−03 | 3.09 × 10−03 | 2.23 × 10−04
sgdm | 32 × 32 × 1 | 9.50 × 10−04 | 1.01 × 10−03 | 8.53 × 10−04 | 1.19 × 10−03 | 9.92 × 10−04 | 9.99 × 10−04 | 1.09 × 10−04
sgdm | 64 × 64 × 1 | 4.24 × 10−04 | 4.51 × 10−04 | 3.91 × 10−04 | 3.83 × 10−04 | 4.51 × 10−04 | 4.20 × 10−04 | 2.87 × 10−05
sgdm | 128 × 128 × 1 | 7.89 × 10−05 | 1.28 × 10−04 | 1.16 × 10−04 | 1.13 × 10−04 | 1.20 × 10−04 | 1.11 × 10−04 | 1.69 × 10−05
adam | 8 × 8 × 1 | 1.76 × 10−02 | 1.60 × 10−02 | 1.90 × 10−02 | 2.04 × 10−02 | 1.25 × 10−02 | 1.71 × 10−02 | 2.74 × 10−03
adam | 16 × 16 × 1 | 3.43 × 10−03 | 1.98 × 10−03 | 3.66 × 10−03 | 5.23 × 10−03 | 5.50 × 10−03 | 3.96 × 10−03 | 1.29 × 10−03
adam | 32 × 32 × 1 | 3.04 × 10−04 | 3.27 × 10−04 | 4.88 × 10−04 | 3.68 × 10−04 | 3.61 × 10−04 | 3.70 × 10−04 | 6.37 × 10−05
adam | 64 × 64 × 1 | 1.05 × 10−04 | 1.13 × 10−04 | 7.14 × 10−05 | 2.70 × 10−04 | 6.39 × 10−05 | 1.25 × 10−04 | 7.53 × 10−05
adam | 128 × 128 × 1 | 2.25 × 10−05 | 1.88 × 10−05 | 6.39 × 10−05 | 4.51 × 10−05 | 2.25 × 10−05 | 3.46 × 10−05 | 1.74 × 10−05
rmsprop | 8 × 8 × 1 | 7.07 × 10−03 | 5.29 × 10−03 | 9.17 × 10−03 | 6.44 × 10−03 | 8.56 × 10−03 | 7.30 × 10−03 | 1.41 × 10−03
rmsprop | 16 × 16 × 1 | 9.58 × 10−04 | 1.02 × 10−03 | 1.41 × 10−03 | 8.41 × 10−04 | 7.14 × 10−04 | 9.88 × 10−04 | 2.35 × 10−04
rmsprop | 32 × 32 × 1 | 2.33 × 10−04 | 2.44 × 10−04 | 2.18 × 10−04 | 1.92 × 10−04 | 2.44 × 10−04 | 2.26 × 10−04 | 1.98 × 10−05
rmsprop | 64 × 64 × 1 | 1.24 × 10−04 | 3.76 × 10−05 | 1.24 × 10−04 | 1.35 × 10−04 | 1.01 × 10−04 | 1.04 × 10−04 | 3.52 × 10−05
rmsprop | 128 × 128 × 1 | 4.51 × 10−05 | 4.13 × 10−05 | 3.01 × 10−05 | 4.13 × 10−05 | 4.13 × 10−05 | 3.98 × 10−05 | 5.10 × 10−06
Table A14. The five-fold MSEs for five types of image sizes and three solvers under 11 Hurst exponents: GoogleNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 6.39 × 10−04 | 6.05 × 10−04 | 5.82 × 10−04 | 7.02 × 10−04 | 5.41 × 10−04 | 6.14 × 10−04 | 5.45 × 10−05
sgdm | 64 × 64 × 1 | 2.89 × 10−04 | 3.04 × 10−04 | 1.73 × 10−04 | 2.63 × 10−04 | 2.07 × 10−04 | 2.47 × 10−04 | 4.99 × 10−05
sgdm | 128 × 128 × 1 | 7.51 × 10−05 | 1.05 × 10−04 | 8.26 × 10−05 | 1.13 × 10−04 | 9.77 × 10−05 | 9.47 × 10−05 | 1.39 × 10−05
adam | 32 × 32 × 1 | 3.61 × 10−04 | 3.08 × 10−04 | 2.25 × 10−04 | 2.93 × 10−04 | 3.46 × 10−04 | 3.07 × 10−04 | 4.74 × 10−05
adam | 64 × 64 × 1 | 1.58 × 10−04 | 1.50 × 10−04 | 1.16 × 10−04 | 1.80 × 10−04 | 7.51 × 10−05 | 1.36 × 10−04 | 3.67 × 10−05
adam | 128 × 128 × 1 | 5.63 × 10−05 | 7.89 × 10−05 | 6.76 × 10−05 | 7.89 × 10−05 | 1.65 × 10−04 | 8.94 × 10−05 | 3.89 × 10−05
rmsprop | 32 × 32 × 1 | 6.84 × 10−04 | 7.29 × 10−04 | 6.16 × 10−04 | 8.04 × 10−04 | 7.21 × 10−04 | 7.11 × 10−04 | 6.13 × 10−05
rmsprop | 64 × 64 × 1 | 4.21 × 10−04 | 4.62 × 10−04 | 4.09 × 10−04 | 6.12 × 10−04 | 2.44 × 10−04 | 4.30 × 10−04 | 1.18 × 10−04
rmsprop | 128 × 128 × 1 | 1.95 × 10−04 | 3.87 × 10−04 | 2.55 × 10−04 | 2.82 × 10−04 | 3.27 × 10−04 | 2.89 × 10−04 | 6.48 × 10−05
Table A15. The five-fold MSEs for five types of image sizes and three solvers under 11 Hurst exponents: SqueezeNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 1.33 × 10−03 | 1.21 × 10−03 | 1.24 × 10−03 | 1.09 × 10−03 | 1.13 × 10−03 | 1.20 × 10−03 | 8.53 × 10−05
sgdm | 64 × 64 × 1 | 2.29 × 10−04 | 2.93 × 10−04 | 1.95 × 10−04 | 2.70 × 10−04 | 1.80 × 10−04 | 2.34 × 10−04 | 4.29 × 10−05
sgdm | 128 × 128 × 1 | 1.24 × 10−04 | 3.19 × 10−04 | 8.64 × 10−05 | 1.05 × 10−04 | 1.65 × 10−04 | 1.60 × 10−04 | 8.38 × 10−05
adam | 32 × 32 × 1 | 4.77 × 10−04 | 4.17 × 10−04 | 5.40 × 10−02 | 5.30 × 10−04 | 4.92 × 10−04 | 1.12 × 10−02 | 2.14 × 10−02
adam | 64 × 64 × 1 | 1.16 × 10−04 | 9.02 × 10−05 | 1.20 × 10−04 | 1.39 × 10−04 | 8.64 × 10−05 | 1.10 × 10−04 | 1.97 × 10−05
adam | 128 × 128 × 1 | 7.89 × 10−05 | 6.01 × 10−05 | 3.01 × 10−05 | 4.88 × 10−05 | 5.26 × 10−05 | 5.41 × 10−05 | 1.59 × 10−05
rmsprop | 32 × 32 × 1 | 1.31 × 10−01 | 2.24 × 10−02 | 8.96 × 10−02 | 5.54 × 10−02 | 6.56 × 10−02 | 7.28 × 10−02 | 3.62 × 10−02
rmsprop | 64 × 64 × 1 | 3.12 × 10−04 | 3.61 × 10−04 | 3.16 × 10−04 | 3.46 × 10−04 | 2.22 × 10−04 | 3.11 × 10−04 | 4.83 × 10−05
rmsprop | 128 × 128 × 1 | 1.39 × 10−04 | 1.77 × 10−04 | 7.14 × 10−05 | 1.84 × 10−04 | 1.39 × 10−04 | 1.42 × 10−04 | 3.99 × 10−05
Table A16. The five-fold MSEs for five types of image sizes and three solvers under 21 Hurst exponents: Xception.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 9.27 × 10−02 | 8.57 × 10−02 | 1.04 × 10−01 | 1.20 × 10−01 | 1.04 × 10−01 | 1.01 × 10−01 | 1.15 × 10−02
sgdm | 16 × 16 × 1 | 9.23 × 10−02 | 8.59 × 10−02 | 1.04 × 10−01 | 1.20 × 10−01 | 1.04 × 10−01 | 1.01 × 10−01 | 1.15 × 10−02
sgdm | 32 × 32 × 1 | 9.94 × 10−04 | 6.90 × 10−04 | 1.09 × 10−03 | 5.48 × 10−04 | 6.33 × 10−04 | 7.92 × 10−04 | 2.13 × 10−04
sgdm | 64 × 64 × 1 | 2.96 × 10−04 | 3.20 × 10−04 | 3.01 × 10−04 | 2.72 × 10−04 | 3.86 × 10−04 | 3.15 × 10−04 | 3.86 × 10−05
sgdm | 128 × 128 × 1 | 1.37 × 10−04 | 1.36 × 10−04 | 1.32 × 10−04 | 1.28 × 10−04 | 1.38 × 10−04 | 1.34 × 10−04 | 3.64 × 10−06
adam | 8 × 8 × 1 | 1.23 × 10−03 | 7.76 × 10−04 | 8.41 × 10−04 | 7.30 × 10−04 | 8.33 × 10−04 | 8.81 × 10−04 | 1.77 × 10−04
adam | 16 × 16 × 1 | 1.28 × 10−04 | 1.68 × 10−04 | 1.88 × 10−04 | 1.61 × 10−04 | 1.25 × 10−04 | 1.54 × 10−04 | 2.44 × 10−05
adam | 32 × 32 × 1 | 6.80 × 10−05 | 8.58 × 10−05 | 4.21 × 10−05 | 6.69 × 10−05 | 4.70 × 10−05 | 6.20 × 10−05 | 1.58 × 10−05
adam | 64 × 64 × 1 | 2.54 × 10−05 | 4.05 × 10−05 | 6.05 × 10−05 | 3.67 × 10−05 | 3.19 × 10−05 | 3.90 × 10−05 | 1.19 × 10−05
adam | 128 × 128 × 1 | 2.05 × 10−05 | 1.94 × 10−05 | 2.48 × 10−05 | 2.38 × 10−05 | 2.11 × 10−05 | 2.19 × 10−05 | 2.04 × 10−06
rmsprop | 8 × 8 × 1 | 7.60 × 10−04 | 8.29 × 10−04 | 9.17 × 10−04 | 1.49 × 10−03 | 8.71 × 10−04 | 9.73 × 10−04 | 2.63 × 10−04
rmsprop | 16 × 16 × 1 | 1.24 × 10−04 | 1.39 × 10−03 | 1.46 × 10−04 | 3.03 × 10−03 | 1.49 × 10−04 | 9.70 × 10−04 | 1.14 × 10−03
rmsprop | 32 × 32 × 1 | 5.56 × 10−05 | 3.47 × 10−04 | 6.05 × 10−05 | 9.34 × 10−05 | 7.02 × 10−05 | 1.25 × 10−04 | 1.11 × 10−04
rmsprop | 64 × 64 × 1 | 3.40 × 10−05 | 4.81 × 10−05 | 5.19 × 10−03 | 2.86 × 10−03 | 2.16 × 10−05 | 1.63 × 10−03 | 2.09 × 10−03
rmsprop | 128 × 128 × 1 | 2.16 × 10−05 | 2.48 × 10−05 | 1.94 × 10−05 | 5.94 × 10−05 | 1.84 × 10−05 | 2.87 × 10−05 | 1.55 × 10−05
Table A17. The five-fold MSEs for five types of image sizes and three solvers under 21 Hurst exponents: ResNet18.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 1.86 × 10−03 | 2.02 × 10−03 | 1.74 × 10−03 | 1.78 × 10−03 | 1.77 × 10−03 | 1.84 × 10−03 | 1.02 × 10−04
sgdm | 16 × 16 × 1 | 5.16 × 10−04 | 4.88 × 10−04 | 4.82 × 10−04 | 5.36 × 10−04 | 5.92 × 10−04 | 5.23 × 10−04 | 3.99 × 10−05
sgdm | 32 × 32 × 1 | 2.03 × 10−04 | 2.46 × 10−04 | 2.27 × 10−04 | 2.30 × 10−04 | 2.48 × 10−04 | 2.31 × 10−04 | 1.62 × 10−05
sgdm | 64 × 64 × 1 | 1.20 × 10−04 | 1.32 × 10−04 | 1.20 × 10−04 | 1.39 × 10−04 | 1.58 × 10−04 | 1.34 × 10−04 | 1.39 × 10−05
sgdm | 128 × 128 × 1 | 1.10 × 10−04 | 8.69 × 10−05 | 9.29 × 10−05 | 8.85 × 10−05 | 8.69 × 10−05 | 9.31 × 10−05 | 8.80 × 10−06
adam | 8 × 8 × 1 | 1.23 × 10−03 | 1.01 × 10−03 | 1.06 × 10−03 | 1.23 × 10−03 | 1.49 × 10−03 | 1.20 × 10−03 | 1.68 × 10−04
adam | 16 × 16 × 1 | 7.01 × 10−04 | 4.17 × 10−04 | 4.10 × 10−04 | 6.97 × 10−04 | 2.85 × 10−04 | 5.02 × 10−04 | 1.68 × 10−04
adam | 32 × 32 × 1 | 8.58 × 10−05 | 6.53 × 10−05 | 7.56 × 10−05 | 1.21 × 10−04 | 1.30 × 10−04 | 9.56 × 10−05 | 2.54 × 10−05
adam | 64 × 64 × 1 | 3.24 × 10−05 | 8.80 × 10−05 | 8.37 × 10−05 | 1.05 × 10−04 | 5.08 × 10−05 | 7.20 × 10−05 | 2.65 × 10−05
adam | 128 × 128 × 1 | 9.77 × 10−05 | 1.33 × 10−04 | 7.61 × 10−05 | 1.99 × 10−04 | 1.25 × 10−04 | 1.26 × 10−04 | 4.17 × 10−05
rmsprop | 8 × 8 × 1 | 1.22 × 10−03 | 1.16 × 10−03 | 1.19 × 10−03 | 8.95 × 10−04 | 1.09 × 10−03 | 1.11 × 10−03 | 1.17 × 10−04
rmsprop | 16 × 16 × 1 | 3.01 × 10−04 | 5.30 × 10−04 | 4.41 × 10−04 | 4.83 × 10−04 | 4.17 × 10−04 | 4.34 × 10−04 | 7.69 × 10−05
rmsprop | 32 × 32 × 1 | 1.57 × 10−03 | 1.59 × 10−04 | 9.39 × 10−05 | 1.19 × 10−04 | 2.49 × 10−03 | 8.85 × 10−04 | 9.77 × 10−04
rmsprop | 64 × 64 × 1 | 4.37 × 10−05 | 1.88 × 10−04 | 9.12 × 10−05 | 6.32 × 10−05 | 5.72 × 10−05 | 8.87 × 10−05 | 5.20 × 10−05
rmsprop | 128 × 128 × 1 | 9.61 × 10−05 | 1.02 × 10−04 | 2.37 × 10−04 | 9.02 × 10−05 | 2.38 × 10−04 | 1.53 × 10−04 | 6.94 × 10−05
Table A18. The five-fold MSEs for five types of image sizes and three solvers under 21 Hurst exponents: MobileNetV2.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 4.24 × 10−03 | 4.08 × 10−03 | 3.90 × 10−03 | 4.36 × 10−03 | 4.22 × 10−03 | 4.16 × 10−03 | 1.59 × 10−04
sgdm | 16 × 16 × 1 | 1.03 × 10−03 | 1.03 × 10−03 | 1.05 × 10−03 | 1.07 × 10−03 | 9.98 × 10−04 | 1.04 × 10−03 | 2.44 × 10−05
sgdm | 32 × 32 × 1 | 3.41 × 10−04 | 4.89 × 10−04 | 2.98 × 10−04 | 3.30 × 10−04 | 3.19 × 10−04 | 3.55 × 10−04 | 6.81 × 10−05
sgdm | 64 × 64 × 1 | 1.59 × 10−04 | 1.70 × 10−04 | 1.61 × 10−04 | 1.63 × 10−04 | 1.54 × 10−04 | 1.61 × 10−04 | 5.28 × 10−06
sgdm | 128 × 128 × 1 | 1.03 × 10−04 | 1.12 × 10−04 | 8.69 × 10−05 | 8.58 × 10−05 | 1.04 × 10−04 | 9.83 × 10−05 | 1.02 × 10−05
adam | 8 × 8 × 1 | 1.12 × 10−02 | 1.89 × 10−02 | 1.46 × 10−02 | 4.48 × 10−03 | 7.43 × 10−03 | 1.13 × 10−02 | 5.11 × 10−03
adam | 16 × 16 × 1 | 1.95 × 10−03 | 1.63 × 10−03 | 1.21 × 10−03 | 7.67 × 10−04 | 1.82 × 10−03 | 1.47 × 10−03 | 4.33 × 10−04
adam | 32 × 32 × 1 | 2.13 × 10−04 | 4.40 × 10−04 | 3.21 × 10−04 | 2.16 × 10−04 | 1.77 × 10−04 | 2.74 × 10−04 | 9.61 × 10−05
adam | 64 × 64 × 1 | 1.10 × 10−04 | 9.50 × 10−05 | 5.78 × 10−05 | 6.26 × 10−05 | 6.42 × 10−05 | 7.80 × 10−05 | 2.08 × 10−05
adam | 128 × 128 × 1 | 3.02 × 10−05 | 4.43 × 10−05 | 3.08 × 10−05 | 2.11 × 10−05 | 4.43 × 10−05 | 3.41 × 10−05 | 8.98 × 10−06
rmsprop | 8 × 8 × 1 | 1.40 × 10−03 | 1.86 × 10−03 | 1.51 × 10−03 | 2.24 × 10−03 | 1.73 × 10−03 | 1.75 × 10−03 | 2.94 × 10−04
rmsprop | 16 × 16 × 1 | 4.59 × 10−04 | 5.81 × 10−04 | 3.93 × 10−04 | 4.52 × 10−04 | 3.89 × 10−04 | 4.55 × 10−04 | 6.95 × 10−05
rmsprop | 32 × 32 × 1 | 2.12 × 10−04 | 1.44 × 10−04 | 7.99 × 10−05 | 1.46 × 10−04 | 1.44 × 10−04 | 1.45 × 10−04 | 4.17 × 10−05
rmsprop | 64 × 64 × 1 | 1.05 × 10−04 | 7.94 × 10−05 | 5.34 × 10−05 | 8.96 × 10−05 | 7.99 × 10−05 | 8.15 × 10−05 | 1.69 × 10−05
rmsprop | 128 × 128 × 1 | 3.62 × 10−05 | 4.54 × 10−05 | 3.29 × 10−05 | 5.24 × 10−05 | 5.45 × 10−05 | 4.43 × 10−05 | 8.56 × 10−06
Table A19. The five-fold MSEs for five types of image sizes and three solvers under 21 Hurst exponents: GoogleNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 1.86 × 10−04 | 2.16 × 10−04 | 1.78 × 10−04 | 1.99 × 10−04 | 1.92 × 10−04 | 1.94 × 10−04 | 1.30 × 10−05
sgdm | 64 × 64 × 1 | 1.14 × 10−04 | 1.20 × 10−04 | 1.23 × 10−04 | 1.20 × 10−04 | 1.18 × 10−04 | 1.19 × 10−04 | 2.93 × 10−06
sgdm | 128 × 128 × 1 | 9.12 × 10−05 | 1.02 × 10−04 | 1.06 × 10−04 | 1.05 × 10−04 | 9.23 × 10−05 | 9.92 × 10−05 | 6.27 × 10−06
adam | 32 × 32 × 1 | 1.08 × 10−04 | 1.19 × 10−04 | 9.23 × 10−05 | 9.66 × 10−05 | 9.45 × 10−05 | 1.02 × 10−04 | 1.01 × 10−05
adam | 64 × 64 × 1 | 6.05 × 10−05 | 6.59 × 10−05 | 6.69 × 10−05 | 6.48 × 10−05 | 5.99 × 10−05 | 6.36 × 10−05 | 2.87 × 10−06
adam | 128 × 128 × 1 | 7.07 × 10−05 | 6.75 × 10−05 | 8.42 × 10−05 | 8.04 × 10−05 | 6.91 × 10−05 | 7.44 × 10−05 | 6.67 × 10−06
rmsprop | 32 × 32 × 1 | 2.04 × 10−04 | 2.47 × 10−04 | 1.86 × 10−04 | 2.44 × 10−04 | 2.13 × 10−04 | 2.19 × 10−04 | 2.34 × 10−05
rmsprop | 64 × 64 × 1 | 2.41 × 10−04 | 3.11 × 10−04 | 3.00 × 10−04 | 2.55 × 10−04 | 2.42 × 10−04 | 2.70 × 10−04 | 2.96 × 10−05
rmsprop | 128 × 128 × 1 | 2.15 × 10−04 | 1.90 × 10−04 | 2.26 × 10−04 | 2.26 × 10−04 | 1.90 × 10−04 | 2.09 × 10−04 | 1.64 × 10−05
Table A20. The five-fold MSEs for five types of image sizes and three solvers under 21 Hurst exponents: SqueezeNet.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 32 × 32 × 1 | 6.75 × 10−04 | 8.27 × 10−04 | 6.66 × 10−04 | 4.64 × 10−04 | 5.48 × 10−04 | 6.36 × 10−04 | 1.23 × 10−04
sgdm | 64 × 64 × 1 | 2.61 × 10−04 | 2.43 × 10−04 | 2.67 × 10−04 | 2.56 × 10−04 | 3.05 × 10−04 | 2.67 × 10−04 | 2.08 × 10−05
sgdm | 128 × 128 × 1 | 1.29 × 10−04 | 1.69 × 10−04 | 1.47 × 10−04 | 1.48 × 10−04 | 1.52 × 10−04 | 1.49 × 10−04 | 1.27 × 10−05
adam | 32 × 32 × 1 | 1.48 × 10−02 | 6.36 × 10−02 | 6.44 × 10−02 | 3.05 × 10−02 | 2.95 × 10−02 | 4.06 × 10−02 | 1.99 × 10−02
adam | 64 × 64 × 1 | 8.53 × 10−05 | 8.42 × 10−05 | 7.56 × 10−05 | 8.96 × 10−05 | 1.01 × 10−04 | 8.71 × 10−05 | 8.28 × 10−06
adam | 128 × 128 × 1 | 5.61 × 10−05 | 8.53 × 10−05 | 6.32 × 10−05 | 5.61 × 10−05 | 5.83 × 10−05 | 6.38 × 10−05 | 1.10 × 10−05
rmsprop | 32 × 32 × 1 | 3.10 × 10−01 | 2.44 × 10−02 | 3.10 × 10−01 | 2.10 × 10−01 | 3.52 × 10−02 | 1.78 × 10−01 | 1.26 × 10−01
rmsprop | 64 × 64 × 1 | 3.62 × 10−04 | 3.88 × 10−04 | 3.43 × 10−04 | 3.73 × 10−04 | 3.78 × 10−04 | 3.69 × 10−04 | 1.52 × 10−05
rmsprop | 128 × 128 × 1 | 2.04 × 10−04 | 1.93 × 10−04 | 1.86 × 10−04 | 1.70 × 10−04 | 1.61 × 10−04 | 1.83 × 10−04 | 1.55 × 10−05

References

  1. Huang, P.-W.; Lee, C.-H. Automatic classification for pathological prostate images based on fractal analysis. IEEE Trans. Med. Imaging 2009, 28, 1037–1050. [Google Scholar] [CrossRef]
  2. Lin, P.-L.; Huang, P.-W.; Lee, C.-H.; Wu, M.-T. Automatic classification for solitary pulmonary nodule in CT image by fractal analysis based on fractional Brownian motion model. Pattern Recognit. 2013, 46, 3279–3287. [Google Scholar] [CrossRef]
  3. He, D.; Liu, C. An online detection method for coal dry screening based on image processing and fractal analysis. Appl. Sci. 2022, 12, 6463. [Google Scholar] [CrossRef]
  4. Yakovlev, G.; Polyanskikh, I.; Belykh, V.; Stepanov, V.; Smirnova, O. Evaluation of changes in structure of modified cement composite using fractal analysis. Appl. Sci. 2021, 11, 4139. [Google Scholar] [CrossRef]
  5. Guo, Q.; Shao, J.; Ruiz, V.F. Characterization and classification of tumor lesions using computerized fractal-based texture analysis and support vector machines in digital mammograms. Int. J. Comput. Assist. Radiol. Surg. 2009, 4, 11–25. [Google Scholar] [CrossRef]
  6. Di Crescenzo, A.; Martinucci, B.; Mustaro, V. A model based on fractional Brownian motion for temperature fluctuation in the Campi Flegrei Caldera. Fractal Fract. 2022, 6, 421. [Google Scholar] [CrossRef]
  7. Paun, M.-A.; Paun, V.-A.; Paun, V.-P. Fractal analysis and time series application in ZY-4 SEM micro fractographies evaluation. Fractal Fract. 2022, 6, 458. [Google Scholar] [CrossRef]
  8. Hu, H.; Zhao, C.; Li, J.; Huang, Y. Stock prediction model based on mixed fractional Brownian motion and improved fractional-order particle swarm optimization algorithm. Fractal Fract. 2022, 6, 560. [Google Scholar] [CrossRef]
  9. Barnsley, M.F.; Devaney, R.L.; Mandelbrot, B.B.; Peitgen, H.-O.; Saupe, D.; Voss, R.F. The Science of Fractal Images; Springer: New York, NY, USA, 1988. [Google Scholar]
  10. Falconer, K. Fractal Geometry: Mathematical Foundations and Applications; John Wiley & Sons: New York, NY, USA, 1990. [Google Scholar]
  11. Gonçalves, W.N.; Bruno, O.M. Combining fractal and deterministic walkers for texture analysis and classification. Pattern Recognit. 2013, 46, 2953–2968. [Google Scholar] [CrossRef]
  12. Mandelbrot, B.B. The Fractal Geometry of Nature; W. H. Freeman and Company: New York, NY, USA, 1983. [Google Scholar]
  13. Zuñiga, A.G.; Florindo, J.B.; Bruno, O.M. Gabor wavelets combined with volumetric fractal dimension applied to texture analysis. Pattern Recognit. Lett. 2014, 36, 135–143. [Google Scholar] [CrossRef]
  14. Pentland, A.P. Fractal-based description of natural scenes. IEEE Trans. Pattern Anal. Mach. Intell. 1984, 6, 661–674. [Google Scholar] [CrossRef] [PubMed]
  15. Chen, C.-C.; Daponte, J.S.; Fox, M.D. Fractal feature analysis and classification in medical imaging. IEEE Trans. Med. Imaging 1989, 8, 133–142. [Google Scholar] [CrossRef] [PubMed]
  16. Gagnepain, J.J.; Roques-Carmes, C. Fractal approach to two-dimensional and three dimensional surface roughness. Wear 1986, 109, 119–126. [Google Scholar] [CrossRef]
  17. Sarkar, N.; Chaudhuri, B.B. An efficient differential box-counting approach to compute fractal dimension of image. IEEE Trans. Syst. Man Cybern. 1994, 24, 115–120. [Google Scholar] [CrossRef]
  18. Sarkar, N.; Chaudhuri, B.B. An efficient approach to estimate fractal dimension of textural images. Pattern Recognit. 1992, 25, 1035–1041. [Google Scholar] [CrossRef]
  19. Chen, S.S.; Keller, J.M.; Crownover, R.M. On the calculation of fractal features from images. IEEE Trans. Pattern Anal. Mach. Intell. 1993, 15, 1087–1090. [Google Scholar] [CrossRef]
  20. Jin, X.C.; Ong, S.H.; Jayasooriah. A practical method for estimating fractal dimension. Pattern Recognit. Lett. 1995, 16, 457–464. [Google Scholar] [CrossRef]
  21. Bruce, E.N. Biomedical Signal Processing and Signal Modeling; John Wiley & Sons: New York, NY, USA, 2001. [Google Scholar]
  22. Li, J.; Du, Q.; Sun, C. An improved box-counting method for image fractal dimension estimation. Pattern Recognit. 2009, 42, 2460–2469.
  23. Peleg, S.; Naor, J.; Hartley, R.; Avnir, D. Multiple resolution texture analysis and classification. IEEE Trans. Pattern Anal. Mach. Intell. 1984, 6, 518–523.
  24. Chang, Y.-C. An efficient maximum likelihood estimator for two-dimensional fractional Brownian motion. Fractals 2021, 29, 2150025.
  25. ImageNet. Available online: http://www.image-net.org (accessed on 11 March 2021).
  26. Chang, Y.-C.; Jeng, J.-T. Classifying Images of Two-Dimensional Fractional Brownian Motion through Deep Learning and Its Applications. Appl. Sci. 2023, 13, 803.
  27. Hoefer, S.; Hannachi, H.; Pandit, M.; Kumaresan, R. Isotropic two-dimensional Fractional Brownian Motion and its application in Ultrasonic analysis. In Proceedings of the Engineering in Medicine and Biology Society, 14th Annual International Conference of the IEEE, Paris, France, 29 October–1 November 1992; pp. 1267–1269.
  28. Balghonaim, A.S.; Keller, J.M. A maximum likelihood estimate for two-variable fractal surface. IEEE Trans. Image Process. 1998, 7, 1746–1753.
  29. McGaughey, D.R.; Aitken, G.J.M. Generating two-dimensional fractional Brownian motion using the fractional Gaussian process (FGp) algorithm. Phys. A 2002, 311, 369–380.
  30. Schilling, R.J.; Harris, S.L. Applied Numerical Methods for Engineers: Using MATLAB and C; Brooks/Cole: New York, NY, USA, 2000.
  31. Chang, Y.-C. N-Dimension Golden Section Search: Its Variants and Limitations. In Proceedings of the 2nd International Conference on BioMedical Engineering and Informatics (BMEI2009), Tianjin, China, 17–19 October 2009.
  32. Szymak, P.; Piskur, P.; Naus, K. The effectiveness of using a pretrained deep learning neural networks for object classification in underwater video. Remote Sens. 2020, 12, 3020.
  33. Maeda-Gutiérrez, V.; Galván-Tejada, C.E.; Zanella-Calzada, L.A.; Celaya-Padilla, J.M.; Galván-Tejada, J.I.; Gamboa-Rosales, H.; Luna-García, H.; Magallanes-Quintanar, R.; Méndez, C.A.G.; Olvera-Olvera, C.A. Comparison of convolutional neural network architectures for classification of tomato plant diseases. Appl. Sci. 2020, 10, 1245.
  34. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9.
  35. Beale, M.H.; Hagan, M.T.; Demuth, H.B. Deep Learning Toolbox: User’s Guide; MathWorks: Natick, MA, USA, 2022.
  36. Zhou, B.; Khosla, A.; Lapedriza, A.; Torralba, A.; Oliva, A. Places: An image database for deep scene understanding. arXiv 2016, arXiv:1610.02055.
  37. Chollet, F. Xception: Deep learning with depthwise separable convolutions. In Proceedings of the 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, HI, USA, 21–26 July 2017; pp. 1800–1807.
  38. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778.
  39. Sandler, M.; Howard, A.; Zhu, M.; Zhmoginov, A.; Chen, L.C. MobileNetV2: Inverted Residuals and Linear Bottlenecks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 4510–4520.
  40. Iandola, F.N.; Han, S.; Moskewicz, M.W.; Ashraf, K.; Dally, W.J.; Keutzer, K. SqueezeNet: AlexNet-Level Accuracy with 50× Fewer Parameters and <0.5 MB Model Size. arXiv 2016, arXiv:1602.07360.
  41. Goodfellow, I.; Bengio, Y.; Courville, A. Deep Learning; The MIT Press: Cambridge, MA, USA, 2016.
  42. Chest X-ray Images (Pneumonia). Available online: https://www.kaggle.com/paultimothymooney/chest-xray-pneumonia (accessed on 24 September 2021).
  43. Chollet, F. Deep Learning with Python; Manning: New York, NY, USA, 2018.
Figure 1. Two FBIs of H = 1/64 (a) and H = 9/64 (b).
Figure 2. Two FBIs of H = 17/64 (a) and H = 25/64 (b).
Figure 3. Two FBIs of H = 33/64 (a) and H = 41/64 (b).
Figure 4. Two FBIs of H = 49/64 (a) and H = 57/64 (b).
Figure 5. The confusion matrix in question from Fold 5.
Figure 6. Four samples of clipped chest X-ray images (127 × 384): Two samples (a) from the normal group and two samples (b) from the pneumonia group.
Figure 7. Four characteristic maps (120 × 377) using the efficient MLE: Two characteristic maps (a) from the normal group and two characteristic maps (b) from the pneumonia group.
Figure 8. Four characteristic maps (120 × 377) using our proposed model: Two characteristic maps (a) from the normal group and two characteristic maps (b) from the pneumonia group.
Figure 9. Four characteristic maps (120 × 377) using ResNet18: Two characteristic maps (a) from the normal group and two characteristic maps (b) from the pneumonia group.
Figure 10. Four characteristic maps (120 × 377) using MobileNetV2: Two characteristic maps (a) from the normal group and two characteristic maps (b) from the pneumonia group.
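The 120 × 377 size of the characteristic maps in Figures 7–10 is consistent with sliding an 8 × 8 estimation window over the 127 × 384 clipped images, since 127 − 8 + 1 = 120 and 384 − 8 + 1 = 377. A minimal Python sketch of this sliding-window reading is given below; estimate_hurst is a placeholder for whichever per-window estimator is applied (the efficient MLE, our proposed model, ResNet18, or MobileNetV2), not code from the paper.

```python
import numpy as np

def hurst_map(image, estimate_hurst, win=8):
    """Slide a win x win window over `image` and estimate H for each patch.

    `estimate_hurst` is a hypothetical callable returning a scalar Hurst
    estimate for a single patch (e.g., the efficient MLE or a trained model).
    """
    rows, cols = image.shape                              # e.g., 127 x 384
    out = np.empty((rows - win + 1, cols - win + 1))      # e.g., 120 x 377
    for r in range(out.shape[0]):
        for c in range(out.shape[1]):
            out[r, c] = estimate_hurst(image[r:r + win, c:c + win])
    return out
```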
Table 1. MSEs of the efficient MLE under 11 Hurst exponents over five image sizes and 1000 observations per class.
Sizes | Sets | FBSs | FBIs
8 × 8 | 1 | 3.08 × 10−02 | 3.09 × 10−02
8 × 8 | 2 | 3.38 × 10−02 | 3.38 × 10−02
16 × 16 | 1 | 5.00 × 10−03 | 5.12 × 10−03
16 × 16 | 2 | 5.24 × 10−03 | 5.36 × 10−03
32 × 32 | 1 | 1.09 × 10−03 | 1.40 × 10−03
32 × 32 | 2 | 1.02 × 10−03 | 1.33 × 10−03
64 × 64 | 1 | 2.23 × 10−04 | 2.19 × 10−03
64 × 64 | 2 | 2.57 × 10−04 | 2.31 × 10−03
128 × 128 | 1 | 8.78 × 10−05 1 | 8.28 × 10−03 1
128 × 128 | 2 | 7.96 × 10−05 1 | 6.40 × 10−03 1
1 10 observations, not 1000 observations.
Table 2. The architecture of our proposed model.
Groups | Layer Numbers | Group Numbers | Layer Numbers | Filter Sizes or Output Numbers (Convolutional Layer Sizes)
The first group | 1 | 1 | 1 | One of 8 × 8 × 1, 16 × 16 × 1, 32 × 32 × 1, 64 × 64 × 1, 128 × 128 × 1
The second group | 4 | 3 | 12 | 128 (3 × 3) + 128 (5 × 5) + 128 (7 × 7) + 128 (9 × 9) + 128 (11 × 11)
The third group | 3 | 3 | 9 | 128 (13 × 13)
The fourth group | 3 | 1 | 3 | 11 or 21 (Output number)
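Table 2 does not spell out the ordering of the layers inside each group, so any code version is an interpretation. As one hedged sketch (not the author's implementation), the filter sizes suggest a multi-kernel block whose five parallel 128-filter convolutions (3 × 3 up to 11 × 11) are concatenated, followed by a 128-filter 13 × 13 convolution and a fully connected head with 11 or 21 outputs; a minimal PyTorch version of that reading is shown below.

```python
import torch
import torch.nn as nn

class MultiKernelCNN(nn.Module):
    """A hypothetical multi-kernel CNN classifier loosely following Table 2.

    Five parallel 128-filter convolutions (kernel sizes 3-11) are concatenated,
    followed by a 128-filter 13 x 13 convolution and a fully connected head
    with 11 or 21 outputs. The layer ordering per group is an assumption.
    """

    def __init__(self, num_classes=11):
        super().__init__()
        self.branches = nn.ModuleList([
            nn.Sequential(
                nn.Conv2d(1, 128, kernel_size=k, padding=k // 2),
                nn.BatchNorm2d(128),
                nn.ReLU(),
            )
            for k in (3, 5, 7, 9, 11)
        ])
        self.merge = nn.Sequential(
            nn.Conv2d(5 * 128, 128, kernel_size=13, padding=6),
            nn.BatchNorm2d(128),
            nn.ReLU(),
        )
        self.head = nn.Sequential(
            nn.AdaptiveAvgPool2d(1),   # accepts any input size from 8 to 128
            nn.Flatten(),
            nn.Linear(128, num_classes),
        )

    def forward(self, x):              # x: (N, 1, H, W) grayscale FBIs
        x = torch.cat([b(x) for b in self.branches], dim=1)
        x = self.merge(x)
        return self.head(x)
```

For example, MultiKernelCNN(num_classes=21)(torch.randn(4, 1, 32, 32)) returns a 4 × 21 tensor of class scores; the repetition counts in the Group Numbers column could be realized by stacking such blocks.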
Table 3. The five-fold classification rates for five types of image sizes and three solvers under 11 Hurst exponents: Our proposed model.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 92.09% | 92.14% | 91.95% | 90.59% | 92.68% | 91.89% | 0.70%
sgdm | 16 × 16 × 1 | 95.32% | 95.09% | 96.59% | 95.73% | 96.09% | 95.76% | 0.54%
sgdm | 32 × 32 × 1 | 95.95% | 95.32% | 96.00% | 96.00% | 95.23% | 95.70% | 0.35%
sgdm | 64 × 64 × 1 | 95.32% | 94.50% | 95.59% | 93.18% | 94.86% | 94.69% | 0.84%
sgdm | 128 × 128 × 1 | 95.32% | 95.32% | 95.09% | 94.50% | 94.41% | 94.93% | 0.40%
adam | 8 × 8 × 1 | 93.05% | 94.27% | 94.23% | 93.27% | 94.27% | 93.82% | 0.54%
adam | 16 × 16 × 1 | 97.00% | 96.68% | 98.27% | 97.59% | 97.18% | 97.35% | 0.55%
adam | 32 × 32 × 1 | 94.50% | 96.00% | 96.23% | 95.77% | 96.00% | 95.70% | 0.62%
adam | 64 × 64 × 1 | 95.64% | 95.91% | 96.18% | 93.91% | 97.36% | 95.80% | 1.11%
adam | 128 × 128 × 1 | 85.14% | 93.95% | 94.41% | 94.59% | 90.68% | 91.75% | 3.60%
rmsprop | 8 × 8 × 1 | 92.50% | 92.18% | 92.77% | 91.77% | 92.68% | 92.38% | 0.37%
rmsprop | 16 × 16 × 1 | 96.32% | 96.68% | 97.59% | 97.55% | 96.50% | 96.93% | 0.54%
rmsprop | 32 × 32 × 1 | 91.32% | 94.95% | 95.64% | 91.95% | 93.86% | 93.55% | 1.67%
rmsprop | 64 × 64 × 1 | 94.41% | 95.50% | 95.05% | 94.00% | 96.45% | 95.08% | 0.86%
rmsprop | 128 × 128 × 1 | 94.82% | 94.45% | 95.09% | 95.50% | 90.86% | 94.15% | 1.68%
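Tables 3 and 4 summarize five-fold cross-validation for three solvers (sgdm, adam, and rmsprop). A minimal sketch of such an experiment loop is given below; build_model and train_and_evaluate are hypothetical stand-ins for the actual model construction and training code, not functions from the paper.

```python
import numpy as np
from sklearn.model_selection import StratifiedKFold

def cross_validate(images, labels, build_model, train_and_evaluate,
                   solvers=("sgdm", "adam", "rmsprop"), n_folds=5):
    """Five-fold cross-validation over several solvers.

    `build_model(solver)` and `train_and_evaluate(...)` are hypothetical
    callables; the function returns the mean and standard deviation of the
    fold accuracies for each solver, as reported in Tables 3 and 4.
    """
    results = {}
    folds = StratifiedKFold(n_splits=n_folds, shuffle=True, random_state=0)
    for solver in solvers:
        accuracies = []
        for train_idx, test_idx in folds.split(images, labels):
            model = build_model(solver)
            accuracies.append(train_and_evaluate(
                model, images[train_idx], labels[train_idx],
                images[test_idx], labels[test_idx]))
        results[solver] = (np.mean(accuracies), np.std(accuracies))
    return results
```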
Table 4. The five-fold classification rates for five types of image sizes and three solvers under 21 Hurst exponents: Our proposed model.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 90.64% | 90.40% | 89.10% | 90.90% | 90.52% | 90.31% | 0.63%
sgdm | 16 × 16 × 1 | 94.79% | 94.14% | 95.21% | 93.31% | 94.50% | 94.39% | 0.64%
sgdm | 32 × 32 × 1 | 93.93% | 92.52% | 93.57% | 92.50% | 93.83% | 93.27% | 0.63%
sgdm | 64 × 64 × 1 | 85.19% | 84.36% | 83.67% | 85.17% | 82.74% | 84.22% | 0.93%
sgdm | 128 × 128 × 1 | 85.40% | 85.38% | 82.00% | 85.98% | 85.81% | 84.91% | 1.48%
adam | 8 × 8 × 1 | 91.69% | 91.36% | 90.10% | 91.12% | 92.02% | 91.26% | 0.66%
adam | 16 × 16 × 1 | 95.55% | 94.76% | 96.43% | 94.38% | 93.71% | 94.97% | 0.94%
adam | 32 × 32 × 1 | 91.17% | 91.29% | 94.62% | 95.55% | 93.55% | 93.23% | 1.76%
adam | 64 × 64 × 1 | 88.67% | 84.10% | 86.55% | 85.83% | 85.26% | 86.08% | 1.52%
adam | 128 × 128 × 1 | 87.38% | 84.31% | 82.86% | 85.71% | 86.43% | 85.34% | 1.59%
rmsprop | 8 × 8 × 1 | 90.71% | 92.10% | 91.50% | 92.14% | 92.29% | 91.75% | 0.58%
rmsprop | 16 × 16 × 1 | 96.38% | 94.95% | 94.98% | 95.38% | 94.79% | 95.30% | 0.58%
rmsprop | 32 × 32 × 1 | 92.67% | 92.26% | 92.55% | 90.93% | 91.86% | 92.05% | 0.63%
rmsprop | 64 × 64 × 1 | 84.71% | 83.69% | 84.74% | 84.00% | 82.02% | 83.83% | 0.99%
rmsprop | 128 × 128 × 1 | 84.83% | 80.57% | 87.60% | 86.36% | 84.93% | 84.86% | 2.37%
Table 5. The five-fold MSEs for five types of image sizes and three solvers under 11 Hurst exponents: Our proposed model.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 7.74 × 10−04 | 7.93 × 10−04 | 7.93 × 10−04 | 9.13 × 10−04 | 7.59 × 10−04 | 8.06 × 10−04 | 5.48 × 10−05
sgdm | 16 × 16 × 1 | 4.32 × 10−04 | 4.51 × 10−04 | 2.82 × 10−04 | 3.76 × 10−04 | 3.57 × 10−04 | 3.79 × 10−04 | 5.99 × 10−05
sgdm | 32 × 32 × 1 | 3.34 × 10−04 | 3.87 × 10−04 | 3.31 × 10−04 | 3.31 × 10−04 | 3.94 × 10−04 | 3.55 × 10−04 | 2.90 × 10−05
sgdm | 64 × 64 × 1 | 3.87 × 10−04 | 4.55 × 10−04 | 3.64 × 10−04 | 5.63 × 10−04 | 4.24 × 10−04 | 4.39 × 10−04 | 6.96 × 10−05
sgdm | 128 × 128 × 1 | 3.87 × 10−04 | 3.87 × 10−04 | 4.06 × 10−04 | 4.55 × 10−04 | 4.62 × 10−04 | 4.19 × 10−04 | 3.27 × 10−05
adam | 8 × 8 × 1 | 6.42 × 10−04 | 5.86 × 10−04 | 5.48 × 10−04 | 6.54 × 10−04 | 6.27 × 10−04 | 6.12 × 10−04 | 3.90 × 10−05
adam | 16 × 16 × 1 | 2.93 × 10−04 | 2.74 × 10−04 | 1.43 × 10−04 | 1.99 × 10−04 | 2.33 × 10−04 | 2.28 × 10−04 | 5.38 × 10−05
adam | 32 × 32 × 1 | 4.55 × 10−04 | 3.31 × 10−04 | 3.12 × 10−04 | 3.49 × 10−04 | 3.31 × 10−04 | 3.55 × 10−04 | 5.10 × 10−05
adam | 64 × 64 × 1 | 3.61 × 10−04 | 3.38 × 10−04 | 3.16 × 10−04 | 5.03 × 10−04 | 2.18 × 10−04 | 3.47 × 10−04 | 9.21 × 10−05
adam | 128 × 128 × 1 | 1.25 × 10−03 | 5.00 × 10−04 | 4.62 × 10−04 | 4.47 × 10−04 | 7.81 × 10−04 | 6.88 × 10−04 | 3.07 × 10−04
rmsprop | 8 × 8 × 1 | 7.10 × 10−04 | 7.44 × 10−04 | 7.25 × 10−04 | 8.26 × 10−04 | 7.66 × 10−04 | 7.54 × 10−04 | 4.07 × 10−05
rmsprop | 16 × 16 × 1 | 3.38 × 10−04 | 2.85 × 10−04 | 1.99 × 10−04 | 2.25 × 10−04 | 3.01 × 10−04 | 2.70 × 10−04 | 5.06 × 10−05
rmsprop | 32 × 32 × 1 | 7.18 × 10−04 | 4.17 × 10−04 | 3.61 × 10−04 | 6.65 × 10−04 | 5.07 × 10−04 | 5.33 × 10−04 | 1.38 × 10−04
rmsprop | 64 × 64 × 1 | 4.62 × 10−04 | 3.72 × 10−04 | 4.09 × 10−04 | 4.96 × 10−04 | 2.93 × 10−04 | 4.06 × 10−04 | 7.09 × 10−05
rmsprop | 128 × 128 × 1 | 4.28 × 10−04 | 4.58 × 10−04 | 4.17 × 10−04 | 3.72 × 10−04 | 7.55 × 10−04 | 4.86 × 10−04 | 1.37 × 10−04
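Because the estimator is a classifier, the MSEs in Tables 5 and 6 can be read as the squared error between the nominal Hurst value of the predicted class and that of the true class, averaged over the test images. The sketch below illustrates that computation; class_to_H is an assumed array holding the grid of 11 (or 21) Hurst values used as class labels, which is defined in the main text.

```python
import numpy as np

def classification_mse(pred_classes, true_classes, class_to_H):
    """MSE of a classification-based Hurst estimator.

    `class_to_H` maps each class index to its nominal Hurst value (its exact
    grid of 11 or 21 values is an assumption taken from the main text).
    """
    H_hat = class_to_H[np.asarray(pred_classes)]
    H_true = class_to_H[np.asarray(true_classes)]
    return float(np.mean((H_hat - H_true) ** 2))
```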
Table 6. The five-fold MSEs for five types of image sizes and three solvers under 21 Hurst exponents: Our proposed model.
Solvers | Sizes | Fold 1 | Fold 2 | Fold 3 | Fold 4 | Fold 5 | Mean | Std.
sgdm | 8 × 8 × 1 | 3.05 × 10−04 | 3.10 × 10−04 | 3.26 × 10−04 | 3.16 × 10−04 | 4.09 × 10−04 | 3.33 × 10−04 | 3.85 × 10−05
sgdm | 16 × 16 × 1 | 1.30 × 10−04 | 1.89 × 10−04 | 1.55 × 10−04 | 1.94 × 10−04 | 1.60 × 10−04 | 1.66 × 10−04 | 2.37 × 10−05
sgdm | 32 × 32 × 1 | 1.58 × 10−04 | 1.88 × 10−04 | 1.59 × 10−04 | 1.88 × 10−04 | 1.54 × 10−04 | 1.70 × 10−04 | 1.53 × 10−05
sgdm | 64 × 64 × 1 | 3.63 × 10−04 | 3.92 × 10−04 | 4.03 × 10−04 | 3.93 × 10−04 | 4.61 × 10−04 | 4.02 × 10−04 | 3.19 × 10−05
sgdm | 128 × 128 × 1 | 3.76 × 10−04 | 4.18 × 10−04 | 4.75 × 10−04 | 3.60 × 10−04 | 3.56 × 10−04 | 3.97 × 10−04 | 4.46 × 10−05
adam | 8 × 8 × 1 | 2.54 × 10−04 | 3.02 × 10−04 | 3.00 × 10−04 | 2.74 × 10−04 | 3.34 × 10−04 | 2.93 × 10−04 | 2.72 × 10−05
adam | 16 × 16 × 1 | 1.09 × 10−04 | 1.43 × 10−04 | 1.11 × 10−04 | 1.67 × 10−04 | 1.70 × 10−04 | 1.40 × 10−04 | 2.61 × 10−05
adam | 32 × 32 × 1 | 2.19 × 10−04 | 2.20 × 10−04 | 1.27 × 10−04 | 1.06 × 10−04 | 1.56 × 10−04 | 1.66 × 10−04 | 4.70 × 10−05
adam | 64 × 64 × 1 | 2.67 × 10−04 | 3.87 × 10−04 | 3.20 × 10−04 | 3.26 × 10−04 | 3.63 × 10−04 | 3.32 × 10−04 | 4.09 × 10−05
adam | 128 × 128 × 1 | 3.37 × 10−04 | 4.50 × 10−04 | 4.44 × 10−04 | 4.00 × 10−04 | 3.78 × 10−04 | 4.02 × 10−04 | 4.20 × 10−05
rmsprop | 8 × 8 × 1 | 2.66 × 10−04 | 2.85 × 10−04 | 2.56 × 10−04 | 2.60 × 10−04 | 3.15 × 10−04 | 2.77 × 10−04 | 2.18 × 10−05
rmsprop | 16 × 16 × 1 | 9.02 × 10−05 | 1.43 × 10−04 | 1.54 × 10−04 | 1.36 × 10−04 | 1.55 × 10−04 | 1.36 × 10−04 | 2.38 × 10−05
rmsprop | 32 × 32 × 1 | 1.79 × 10−04 | 2.00 × 10−04 | 1.80 × 10−04 | 2.24 × 10−04 | 2.04 × 10−04 | 1.97 × 10−04 | 1.66 × 10−05
rmsprop | 64 × 64 × 1 | 3.77 × 10−04 | 3.86 × 10−04 | 3.72 × 10−04 | 3.76 × 10−04 | 4.45 × 10−04 | 3.91 × 10−04 | 2.72 × 10−05
rmsprop | 128 × 128 × 1 | 4.55 × 10−04 | 4.99 × 10−04 | 3.09 × 10−04 | 3.50 × 10−04 | 3.71 × 10−04 | 3.97 × 10−04 | 6.98 × 10−05
Table 7. Average accuracies for 11 and 21 classes over three solvers: Our proposed model.
Sizes\Classes | 11 | 21
8 × 8 | 92.70% | 91.11%
16 × 16 | 96.68% | 94.88%
32 × 32 | 94.98% | 92.85%
64 × 64 | 95.19% | 84.71%
128 × 128 | 93.61% | 85.04%
Table 8. Average MSEs for 11 and 21 classes over three solvers: Our proposed model.
Sizes\Classes | 11 | 21
8 × 8 | 7.24 × 10−04 | 3.01 × 10−04
16 × 16 | 2.93 × 10−04 | 1.47 × 10−04
32 × 32 | 4.15 × 10−04 | 1.77 × 10−04
64 × 64 | 3.97 × 10−04 | 3.75 × 10−04
128 × 128 | 5.31 × 10−04 | 3.99 × 10−04
Table 9. The five-fold classification rates with three solvers under 11 Hurst exponents: Six deep-learning models.
Solvers | Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
sgdm | 8 × 8 × 1 | 91.89% | 9.10% | 59.30% | 35.30% | x | x
sgdm | 16 × 16 × 1 | 95.76% | 9.09% | 82.41% | 67.95% | x | x
sgdm | 32 × 32 × 1 | 95.70% | 79.97% | 91.85% | 87.99% | 92.60% | 85.55%
sgdm | 64 × 64 × 1 | 94.69% | 86.89% | 94.81% | 94.92% | 97.01% | 97.17%
sgdm | 128 × 128 × 1 | 94.93% | 95.58% | 97.50% | 98.65% | 98.85% | 98.06%
adam | 8 × 8 × 1 | 93.82% | 81.63% | 71.05% | 34.28% | x | x
adam | 16 × 16 × 1 | 97.35% | 96.13% | 89.87% | 63.21% | x | x
adam | 32 × 32 × 1 | 95.70% | 98.40% | 98.08% | 95.61% | 96.29% | 92.95%
adam | 64 × 64 × 1 | 95.80% | 99.41% | 99.01% | 98.49% | 98.35% | 98.66%
adam | 128 × 128 × 1 | 91.75% | 99.88% | 98.34% | 99.58% | 98.92% | 99.35%
rmsprop | 8 × 8 × 1 | 92.38% | 78.60% | 58.83% | 50.67% | x | x
rmsprop | 16 × 16 × 1 | 96.93% | 95.28% | 92.31% | 88.78% | x | x
rmsprop | 32 × 32 × 1 | 93.55% | 98.15% | 98.29% | 97.26% | 91.59% | 49.75%
rmsprop | 64 × 64 × 1 | 95.08% | 99.36% | 98.15% | 98.74% | 94.80% | 96.24%
rmsprop | 128 × 128 × 1 | 94.15% | 99.79% | 97.74% | 99.52% | 96.50% | 98.28%
Table 10. The five-fold classification rates with three solvers under 21 Hurst exponents: Six deep-learning models.
Solvers | Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
sgdm | 8 × 8 × 1 | 90.31% | 4.77% | 61.94% | 32.83% | x | x
sgdm | 16 × 16 × 1 | 94.39% | 4.75% | 82.91% | 67.20% | x | x
sgdm | 32 × 32 × 1 | 93.27% | 82.35% | 91.37% | 86.50% | 92.43% | 74.76%
sgdm | 64 × 64 × 1 | 84.22% | 87.43% | 94.50% | 93.14% | 95.05% | 88.39%
sgdm | 128 × 128 × 1 | 84.91% | 94.14% | 95.98% | 95.68% | 95.67% | 93.48%
adam | 8 × 8 × 1 | 91.26% | 78.13% | 70.18% | 26.19% | x | x
adam | 16 × 16 × 1 | 94.97% | 94.76% | 83.48% | 59.31% | x | x
adam | 32 × 32 × 1 | 93.23% | 97.59% | 96.31% | 89.26% | 96.09% | 76.49%
adam | 64 × 64 × 1 | 86.08% | 98.28% | 96.87% | 96.68% | 97.28% | 96.17%
adam | 128 × 128 × 1 | 85.34% | 99.03% | 94.58% | 98.50% | 96.78% | 97.21%
rmsprop | 8 × 8 × 1 | 91.75% | 78.10% | 73.52% | 59.20% | x | x
rmsprop | 16 × 16 × 1 | 95.30% | 94.60% | 84.84% | 84.17% | x | x
rmsprop | 32 × 32 × 1 | 92.05% | 96.75% | 95.49% | 94.05% | 91.90% | 28.64%
rmsprop | 64 × 64 × 1 | 83.83% | 98.13% | 96.29% | 96.52% | 88.70% | 84.10%
rmsprop | 128 × 128 × 1 | 84.86% | 98.73% | 93.52% | 98.05% | 90.90% | 92.10%
Table 11. The five-fold MSEs with three solvers under 11 Hurst exponents: Six deep-learning models.
Solvers | Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
sgdm | 8 × 8 × 1 | 8.06 × 10−04 | 1.67 × 10−01 | 5.59 × 10−03 | 1.24 × 10−02 | x | x
sgdm | 16 × 16 × 1 | 3.79 × 10−04 | 1.67 × 10−01 | 1.71 × 10−03 | 3.09 × 10−03 | x | x
sgdm | 32 × 32 × 1 | 3.55 × 10−04 | 2.10 × 10−03 | 6.99 × 10−04 | 9.99 × 10−04 | 6.14 × 10−04 | 1.20 × 10−03
sgdm | 64 × 64 × 1 | 4.39 × 10−04 | 1.10 × 10−03 | 4.31 × 10−04 | 4.20 × 10−04 | 2.47 × 10−04 | 2.34 × 10−04
sgdm | 128 × 128 × 1 | 4.19 × 10−04 | 3.65 × 10−04 | 2.07 × 10−04 | 1.11 × 10−04 | 9.47 × 10−05 | 1.60 × 10−04
adam | 8 × 8 × 1 | 6.12 × 10−04 | 2.17 × 10−03 | 3.52 × 10−03 | 1.71 × 10−02 | x | x
adam | 16 × 16 × 1 | 2.28 × 10−04 | 3.38 × 10−04 | 8.91 × 10−04 | 3.96 × 10−03 | x | x
adam | 32 × 32 × 1 | 3.55 × 10−04 | 1.32 × 10−04 | 1.61 × 10−04 | 3.70 × 10−04 | 3.07 × 10−04 | 1.12 × 10−02
adam | 64 × 64 × 1 | 3.47 × 10−04 | 4.88 × 10−05 | 8.19 × 10−05 | 1.25 × 10−04 | 1.36 × 10−04 | 1.10 × 10−04
adam | 128 × 128 × 1 | 6.88 × 10−04 | 9.77 × 10−06 | 1.37 × 10−04 | 3.46 × 10−05 | 8.94 × 10−05 | 5.41 × 10−05
rmsprop | 8 × 8 × 1 | 7.54 × 10−04 | 2.34 × 10−03 | 5.67 × 10−03 | 7.30 × 10−03 | x | x
rmsprop | 16 × 16 × 1 | 2.70 × 10−04 | 5.97 × 10−04 | 6.90 × 10−04 | 9.88 × 10−04 | x | x
rmsprop | 32 × 32 × 1 | 5.33 × 10−04 | 1.56 × 10−04 | 1.41 × 10−04 | 2.26 × 10−04 | 7.11 × 10−04 | 7.28 × 10−02
rmsprop | 64 × 64 × 1 | 4.06 × 10−04 | 5.26 × 10−05 | 1.53 × 10−04 | 1.04 × 10−04 | 4.30 × 10−04 | 3.11 × 10−04
rmsprop | 128 × 128 × 1 | 4.86 × 10−04 | 1.73 × 10−05 | 1.87 × 10−04 | 3.98 × 10−05 | 2.89 × 10−04 | 1.42 × 10−04
Table 12. The five-fold MSEs with three solvers under 21 Hurst exponents: Six deep-learning models.
Solvers | Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
sgdm | 8 × 8 × 1 | 3.33 × 10−04 | 1.01 × 10−01 | 1.84 × 10−03 | 4.16 × 10−03 | x | x
sgdm | 16 × 16 × 1 | 1.66 × 10−04 | 1.01 × 10−01 | 5.23 × 10−04 | 1.04 × 10−03 | x | x
sgdm | 32 × 32 × 1 | 1.70 × 10−04 | 7.92 × 10−04 | 2.31 × 10−04 | 3.55 × 10−04 | 1.94 × 10−04 | 6.36 × 10−04
sgdm | 64 × 64 × 1 | 4.02 × 10−04 | 3.15 × 10−04 | 1.34 × 10−04 | 1.61 × 10−04 | 1.19 × 10−04 | 2.67 × 10−04
sgdm | 128 × 128 × 1 | 3.97 × 10−04 | 1.34 × 10−04 | 9.31 × 10−05 | 9.83 × 10−05 | 9.92 × 10−05 | 1.49 × 10−04
adam | 8 × 8 × 1 | 2.93 × 10−04 | 8.81 × 10−04 | 1.20 × 10−03 | 1.13 × 10−02 | x | x
adam | 16 × 16 × 1 | 1.40 × 10−04 | 1.54 × 10−04 | 5.02 × 10−04 | 1.47 × 10−03 | x | x
adam | 32 × 32 × 1 | 1.66 × 10−04 | 6.20 × 10−05 | 9.56 × 10−05 | 2.74 × 10−04 | 1.02 × 10−04 | 4.06 × 10−02
adam | 64 × 64 × 1 | 3.32 × 10−04 | 3.90 × 10−05 | 7.20 × 10−05 | 7.80 × 10−05 | 6.36 × 10−05 | 8.71 × 10−05
adam | 128 × 128 × 1 | 4.02 × 10−04 | 2.19 × 10−05 | 1.26 × 10−04 | 3.41 × 10−05 | 7.44 × 10−05 | 6.38 × 10−05
rmsprop | 8 × 8 × 1 | 2.77 × 10−04 | 9.73 × 10−04 | 1.11 × 10−03 | 1.75 × 10−03 | x | x
rmsprop | 16 × 16 × 1 | 1.36 × 10−04 | 9.70 × 10−04 | 4.34 × 10−04 | 4.55 × 10−04 | x | x
rmsprop | 32 × 32 × 1 | 1.97 × 10−04 | 1.25 × 10−04 | 8.85 × 10−04 | 1.45 × 10−04 | 2.19 × 10−04 | 1.78 × 10−01
rmsprop | 64 × 64 × 1 | 3.91 × 10−04 | 1.63 × 10−03 | 8.87 × 10−05 | 8.15 × 10−05 | 2.70 × 10−04 | 3.69 × 10−04
rmsprop | 128 × 128 × 1 | 3.97 × 10−04 | 2.87 × 10−05 | 1.53 × 10−04 | 4.43 × 10−05 | 2.09 × 10−04 | 1.83 × 10−04
Table 13. Average accuracies for 11 classes over three solvers: Six deep-learning models.
Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
8 × 8 × 1 | 92.70% | 56.44% | 63.06% | 40.08% | x | x
16 × 16 × 1 | 96.68% | 66.83% | 88.20% | 73.31% | x | x
32 × 32 × 1 | 94.98% | 92.17% | 96.07% | 93.62% | 93.49% | 76.08%
64 × 64 × 1 | 95.19% | 95.22% | 97.32% | 97.38% | 96.72% | 97.36%
128 × 128 × 1 | 93.61% | 98.42% | 97.86% | 99.25% | 98.09% | 98.56%
Table 14. Average accuracies for 21 classes over three solvers: Six deep-learning models.
Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
8 × 8 × 1 | 91.11% | 53.67% | 68.55% | 39.41% | x | x
16 × 16 × 1 | 94.88% | 64.70% | 83.74% | 70.22% | x | x
32 × 32 × 1 | 92.85% | 92.23% | 94.39% | 89.94% | 93.48% | 59.96%
64 × 64 × 1 | 84.71% | 94.62% | 95.88% | 95.44% | 93.68% | 89.55%
128 × 128 × 1 | 85.04% | 97.30% | 94.69% | 97.41% | 94.45% | 94.27%
Table 15. Average MSEs for 11 classes over three solvers: Six deep-learning models.
Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
8 × 8 × 1 | 7.24 × 10−04 | 5.71 × 10−02 | 4.93 × 10−03 | 1.23 × 10−02 | x | x
16 × 16 × 1 | 2.93 × 10−04 | 5.60 × 10−02 | 1.10 × 10−03 | 2.68 × 10−03 | x | x
32 × 32 × 1 | 4.15 × 10−04 | 7.96 × 10−04 | 3.34 × 10−04 | 5.32 × 10−04 | 5.44 × 10−04 | 2.84 × 10−02
64 × 64 × 1 | 3.97 × 10−04 | 4.01 × 10−04 | 2.22 × 10−04 | 2.16 × 10−04 | 2.71 × 10−04 | 2.18 × 10−04
128 × 128 × 1 | 5.31 × 10−04 | 1.31 × 10−04 | 1.77 × 10−04 | 6.19 × 10−05 | 1.58 × 10−04 | 1.19 × 10−04
Table 16. Average MSEs for 21 classes over three solvers: Six deep-learning models.
Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
8 × 8 × 1 | 3.01 × 10−04 | 3.43 × 10−02 | 1.38 × 10−03 | 5.75 × 10−03 | x | x
16 × 16 × 1 | 1.47 × 10−04 | 3.41 × 10−02 | 4.86 × 10−04 | 9.89 × 10−04 | x | x
32 × 32 × 1 | 1.77 × 10−04 | 3.26 × 10−04 | 4.04 × 10−04 | 2.58 × 10−04 | 1.72 × 10−04 | 7.30 × 10−02
64 × 64 × 1 | 3.75 × 10−04 | 6.61 × 10−04 | 9.81 × 10−05 | 1.07 × 10−04 | 1.51 × 10−04 | 2.41 × 10−04
128 × 128 × 1 | 3.99 × 10−04 | 6.16 × 10−05 | 1.24 × 10−04 | 5.89 × 10−05 | 1.28 × 10−04 | 1.32 × 10−04
Table 17. The average accuracies of Set 2 in six models trained on Set 1 for 11 classes over three solvers.
Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
8 × 8 | 37.83% | 22.68% | 33.51% | 31.96% | x | x
16 × 16 | 56.50% | 32.89% | 51.88% | 51.46% | x | x
32 × 32 | 79.97% | 69.16% | 73.28% | 74.33% | 75.80% | 68.83%
64 × 64 | 93.50% | 86.44% | 89.49% | 92.95% | 92.27% | 94.87%
128 × 128 | 94.07% | 96.20% | 96.83% | 98.69% | 97.39% | 98.53%
Table 18. The corresponding errors between between-set evaluation and within-set evaluation.
Sizes | Proposed | Xception | ResNet18 | MobileNetV2 | GoogleNet | SqueezeNet
8 × 8 | −54.86% | −33.76% | −29.55% | −8.13% | x | x
16 × 16 | −40.18% | −33.95% | −36.32% | −21.85% | x | x
32 × 32 | −15.02% | −23.02% | −22.79% | −19.29% | −17.70% | −7.25%
64 × 64 | −1.69% | −8.78% | −7.83% | −4.43% | −4.45% | −2.49%
128 × 128 | 0.46% | −2.21% | −1.03% | −0.56% | −0.70% | −0.03%
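Each entry in Table 18 is the between-set accuracy in Table 17 minus the corresponding within-set accuracy for 11 classes in Table 13; for example, for the proposed model at 16 × 16,
−40.18% = 56.50% − 96.68%,
and the remaining entries follow in the same way, up to rounding of the averaged accuracies.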
Table 19. Average MSEs of the efficient MLE and three deep-learning models.
Sizes | MLE | Proposed | ResNet18 | MobileNetV2
8 × 8 | 3.38 × 10−02 | 1.21 × 10−02 | 1.63 × 10−02 | 1.70 × 10−02
16 × 16 | 5.36 × 10−03 | 4.21 × 10−03 | 5.30 × 10−03 | 5.24 × 10−03
32 × 32 | 1.33 × 10−03 | 1.51 × 10−03 | 2.10 × 10−03 | 1.77 × 10−03
64 × 64 | 2.31 × 10−03 | 5.43 × 10−04 | 8.55 × 10−04 | 6.14 × 10−04
128 × 128 | 6.40 × 10−03 1 | 5.61 × 10−04 | 4.42 × 10−04 | 3.00 × 10−04
1 10 observations, not 1000 observations.
Table 20. Average computational times per observation of the efficient MLE and three deep-learning models.
Sizes | MLE | Proposed | ResNet18 | MobileNetV2 | Ratio1 2 | Ratio2 3 | Ratio3 4
8 × 8 | 6.49 × 10−03 | 3.92 × 10−03 | 8.72 × 10−03 | 1.06 × 10−01 | 1.66 × 10+00 | 7.45 × 10−01 | 6.14 × 10−02
16 × 16 | 3.79 × 10−02 | 3.95 × 10−03 | 8.73 × 10−03 | 1.09 × 10−01 | 9.59 × 10+00 | 4.33 × 10+00 | 3.46 × 10−01
32 × 32 | 5.02 × 10−01 | 3.87 × 10−03 | 8.96 × 10−03 | 1.23 × 10−01 | 1.30 × 10+02 | 5.60 × 10+01 | 4.08 × 10+00
64 × 64 | 1.33 × 10+01 | 4.31 × 10−03 | 8.70 × 10−03 | 8.39 × 10−02 | 3.09 × 10+03 | 1.53 × 10+03 | 1.59 × 10+02
128 × 128 | 6.47 × 10+02 1 | 4.14 × 10−03 | 9.37 × 10−03 | 3.37 × 10−02 | 1.56 × 10+05 | 6.90 × 10+04 | 1.92 × 10+04
1 10 observations, not 1000 observations. 2 The time ratio of the efficient MLE versus our proposed model. 3 The time ratio of the efficient MLE versus ResNet18. 4 The time ratio of the efficient MLE versus MobileNetV2.
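The ratio columns follow directly from the time columns. For example, at 128 × 128 the efficient MLE requires 6.47 × 10+02 per observation against 4.14 × 10−03 for our proposed model, so
Ratio1 = (6.47 × 10+02)/(4.14 × 10−03) ≈ 1.56 × 10+05,
matching the entry in the last row of Table 20.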
