Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs

Ibrahim, Rabha W.; Darus, Maslina

doi:10.3390/e20100722

Open AccessArticle

Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs

by

Rabha W. Ibrahim

^1,*,† and

Maslina Darus

^2,†

¹

Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia

²

School of Mathematical Sciences, Faculty of Sciences and Technology, Universiti Kebangsaan Malaysia, Bangi 43600, Selangor, Malaysia

^*

Author to whom correspondence should be addressed.

^†

Both authors contributed equally to this work.

Entropy 2018, 20(10), 722; https://doi.org/10.3390/e20100722

Submission received: 29 July 2018 / Revised: 5 September 2018 / Accepted: 10 September 2018 / Published: 20 September 2018

(This article belongs to the Special Issue Nonadditive Entropies and Complex Systems)

Download

Browse Figures

Versions Notes

Abstract

:

In this paper, we study Tsallis’ fractional entropy (TFE) in a complex domain by applying the definition of the complex probability functions. We study the upper and lower bounds of TFE based on some special functions. Moreover, applications in complex neural networks (CNNs) are illustrated to recognize the accuracy of CNNs.

Keywords:

fractional calculus; fractional operator; fractional entropy; CNNs; analytic function; unit disk

1. Introduction

A strategic amount in information theory is entropy. Entropy measures the amount of uncertainty appearing in the assessment of a random variable or the outcome of a random process. In 1988, Tsallis [1] presented the nonadditive entropy, aiming at a generalization of Boltzmann–Gibbs (BG) statistical mechanics. The purpose of this generalization is to study complex systems. Its applications appeared in many fields, such as thermodynamics, chaos, artificial neural networks, image processing, complex systems, information theory, etc. (see [2,3,4,5,6,7,8,9,10,11,12,13,14,15]).

The scheme of the axioms of probability theory placed in 1933 by Kolmogorov can be extended to include the imaginary set of numbers and this by accumulation to his original five axioms. Later, an additional three axioms were given in [16]. Consequently, the complex probability domain is defined by the sum of the real set

S_{R}

with its corresponding real probability and the imaginary

S_{M}

with its corresponding imaginary probability. In general, the advantages of complex probability theory are that it is considered a supplementary dimension (imaginary part) to the event appearing in the real dimension laboratory (real part). It represents physical quantities of complex networks in terms of currents, complex potentials and impedance. Moreover, it fulfills luck and chance in

S_{R}

substituted by total determinism in a complex domain. Finally, it extends many well-known concepts of the traditional probability theory, such as expectation and variance, to the complex probability theory with more accuracy in applications. One of the important applications of complex probability theory is in realistic quantum mechanics [17]; for example, the two slit experiment where a source releases a single particle, which moves to a wall with two slits and is spotted at position

χ

on a shelter placed behind the wall. The typical argument that an interference design on the shelter infers that the particle did not either drive through one slit or the other is ultimately an argument in probability theory such that

P (χ) = P {(χ)}_{1} + P {(χ)}_{2},

where

P {(χ)}_{1}

and

P {(χ)}_{2}

are the probability via the first and second slit, respectively, which is a critical process. This process leads to the use of complex probability theory.

Recently, Abou Jaoude [18] extended Shannon’s information theory by using the complex probability. The author calculated the magnitude of the chaotic factor, the channel capacities in the probability and the degree of knowledge. In general, complex probability leads to better information for all processes compared to the classical probability [19,20]. Figure 1 shows the relation between complex analysis and information theory.

Our investigation is based on the concept of complex probability to extend the idea of Tsallis’ fractional entropy (TFE). The study of the technique delivered by using the approximation theory of special functions of complex variables was useful in information theory. We introduce the upper and lower bound of TFE. Sharpness is discussed as well in the sequel.

2. Results

Let A be an event in a complex domain

S_{C}

. The real and imaginary terms of the complex probability function (CPF):

P_{c} (z) = P_{r} (x, y) + P_{m} (x, y)

where the argument

z = x + i y,

and

P_{r}

and

P_{m}

are the real probability and the imaginary probability in the real set

S_{R}

and imaginary set

S_{M}

, respectively. Following Axiom 7 in [16], we have:

P_{c} (z) = P_{r} (x, y) + i (1 - P_{r} (x, y)),

(1)

such that

z = x + i y

with

{| z |}^{2} = P_{r}^{2} + {(P_{m} / i)}^{2}

and

P_{m} = i (1 - P_{r});

hence,

P_{c}

is always equal to one. Abou Jaoude et al. [19] inferred that

z \in U = {z \in C : | z | < 1}

(the open unit disk).

Tsallis presented an entropic formalization characterized by an index

γ

, which implies a non-extensive statistics. TFE (

T_{γ}

) is the basis of the so-known non-extensive statistical mechanics, which modifies the Boltzmann–Gibbs theory. Tsallis statistics has been used in various fields such as applied mathematics, physics, biology, chemistry, computer science, information theory, engineering, medicine, economics, business, geophysics, etc. Since we study the analytic properties of TFE, therefore, we focus on the continuous formula. The general continuous form of this entropy is given by:

T_{γ} [P] = \frac{1}{γ - 1} (1 - \int_{x} {(P (x))}^{γ} d x), γ \neq 1 .

By applying the concept of CPF in Equation (1), we extend TFE into complex values as follows (CTFE):

T_{γ} [P_{c}] = \frac{1}{γ - 1} (1 - \int_{S_{C}} {(P_{c} (z))}^{γ} d z) .

(2)

For a special domain

S_{C} = U,

we have:

T_{γ} {[P_{c}]}_{U} = \frac{1}{γ - 1} (1 - \int_{U} {(P_{c} (z))}^{γ} d z) .

(3)

For the analytic study, we shall use the definition:

T_{γ} (z) : = (γ - 1) T_{γ} {[P_{c}]}_{U} = 1 - \int_{0}^{z} {(P_{c} (w))}^{γ} d w, z \in U,

(4)

where

P_{c}

is analytic in U, having the form:

P_{c} (z) = \sum_{n = 0}^{\infty} p_{n} z^{n}, z \in U .

It is clear that

T_{γ} (0) = 1

and

ℜ (T_{γ}) > 0 .

TFE has been maximized by using different techniques depending on its parameter

γ

. This problem was discussed in [1,2] for real power index

γ

and in [21] for the complex power index. The authors showed that the Tsallis distribution reserves its fractional power formula, decorating with some specific log-periodic oscillations (convergence dynamics of z-logistic maps). As a result, the authors introduced a complex measure of the thermal bath heat capacity

C = 1 / (γ - 1) .

Thus, in general, the heat capacity becomes complex as well. In this work, CTFE approximates some special functions in a complex domain. These functions are popular in various applications.

Next, we approximate Equation (4) for some special functions. The advantageous of the approximation are: First, for recognizing target functions, the approximation technique studies how certain known functions (for example, special functions) can be approximated by a definite class of functions (for example, polynomials or rational functions) that often have desirable properties (inexpensive computation, continuity, integral and limit values, etc.). Second, the target function, call it

Ψ

, may be unknown; instead of a clear formula, only a set of points of the form (

x, Ψ (x)

) is delivered. Depending on the organization of the domain and codomain of

Ψ

, several methods for approximating

g

may be applicable. For example, if

Ψ

is an operation on the complex numbers, the techniques of geometric function theory can be used.

2.1. Bernoulli Function ${[z / (e^{z} - 1)]}^{γ}$

Mocanu [22] showed that the function

z / (e^{z} - 1)

is convex in U (see Figure 2).

The function is not convex when

γ \geq 2

(see Figure 3).

Series expansions at

z = 0, γ = 2, \dots

are given as follows:

\begin{matrix} T_{2} (z) = 1 - z + (5 z^{2}) / 12 - z^{3} / 12 + z^{4} / 240 + O (z^{5}) \\ T_{3} (z) = 1 - (3 z) / 2 + z^{2} - (3 z^{3}) / 8 + (19 z^{4}) / 240 + O (z^{5}) \\ T_{4} (z) = 1 - 2 z + (11 z^{2}) / 6 - z^{3} + (251 z^{4}) / 720 + O (z^{5}) \\ ⋮ \end{matrix}

(5)

Moreover, when

0 < γ < 1,

we have:

T_{0.5} (z) = 1 - \frac{z}{4} + \frac{z^{2}}{96} + \frac{z^{3}}{384} - \frac{z^{4}}{10240} + \dots

For

φ (z) = \sum φ_{n} z^{n}

and

υ (z) = \sum υ_{n} z^{n}, υ_{n} \geq 0

for all

n \geq 0

, we have

φ ≪ υ

if and only if

| φ_{n} | \leq υ_{n} .

Note that this concept is called majorization coefficients.

We have the following properties (upper bounds):

Proposition 1.

For CTFE approximated by Bernoulli function,

T_{γ} (z) ≪ {(\frac{1 + z}{1 - z})}^{γ}, γ > 0, γ \neq 1 .

Proof.

Let:

ψ (z, γ) = {(\frac{1 + z}{1 - z})}^{γ}, z \in U, γ \neq 1 .

Then, we obtain:

\begin{matrix} ψ (z, 2) = 1 + \sum_{n = 1} (4 n) z^{n} = 1 + 4 z + 8 z^{2} + 12 z^{3} + 16 z^{4} + 20 z^{5} + \dots \\ ψ (z, 3) = 1 + \sum_{n = 1} (2 + 4 n^{2}) z^{n} = 1 + 6 z + 18 z^{2} + 38 z^{3} + \dots . \\ ψ (z, 4) = 1 + \sum_{n = 1} \frac{1}{3} (8 n (2 + n^{2})) z^{n} = 1 + 8 z + 16 z^{2} + 24 z^{3} + \dots . \\ ⋮ \end{matrix}

(6)

Furthermore, for

0 < γ < 1,

we have:

ψ (z, 0.5) = 1 + z + \frac{z^{2}}{2} + \frac{z^{3}}{2} + \frac{3 z^{4}}{8} + \frac{3 z^{5}}{8} + \dots .

Comparing Equation (5) and Equation (6), we conclude that

T_{γ} (z)

is majorized by the function

{(\frac{1 + z}{1 - z})}^{γ}

for all

γ \neq 1

. ☐

Proposition 2.

For CTFE approximated by Bernoulli function, there is a probability measure μ on

{(\partial U)}^{2},

for all

γ > 1 .

Proof.

Let

t, τ \in \partial U

; then, we have:

\begin{matrix} {(\frac{1 + t z}{1 + τ z})}^{γ} & = \frac{{(1 + t z)}^{γ}}{1 + τ z} . \frac{1}{{(1 + τ z)}^{γ - 1}} \\ ≪ \frac{{(1 + z)}^{γ}}{1 - z} . \frac{1}{{(1 - z)}^{γ - 1}} \\ = {(\frac{1 + z}{1 - z})}^{γ}, γ > 1 . \end{matrix}

(7)

In view of Theorem 1.11 in [23], the

{(\frac{1 + t z}{1 + τ z})}^{γ}

admits a probability measure

μ

in

{(\partial U)}^{2}

satisfying:

f (z) = \int_{{(\partial U)}^{2}} {(\frac{1 + t z}{1 + τ z})}^{γ} d μ (t, τ), z \in U .

Then, by virtue of Proposition 1, there is a constant

λ

(diffusion constant) such that:

\int_{{(\partial U)}^{2}} {(\frac{1 + t z}{1 + τ z})}^{γ} d μ (t, τ) = λ \int_{{(\partial U)}^{2}} {(\frac{t z}{e^{τ z} - 1})}^{γ} d μ (t, τ), z \in U .

This completes the proof. ☐

2.2. Gaussian Function $Φ (a, c; z)$

The function

Φ (a, c; z)

is defined by the series:

Φ (a, c; z) = \frac{Γ (c)}{Γ (a)} \sum_{n = 0}^{\infty} \frac{Γ (a + n)}{Γ (c + n)} \frac{z^{n}}{n!} .

A special case of this function is

Φ (a, a; z) = e^{z} .

We consider CTFE approximated by

e^{- γ z} .

Clearly, we have the following results:

Proposition 3.

For CTFE approximated by

e^{- γ z}

:

T_{γ} (z) ≪ Φ (a, c; z),

(γ > 0, γ \neq 1, ℜ a > 1, ℜ c > 1) .

Proposition 4.

For CTFE approximated by

e^{- γ z}

, there is a probability measure μ on [0, 1].

Proof.

In view of Equation (1.2-8) in [24], there is a probability measure on [0, 1] such that:

Φ (a, c; z) = \frac{Γ (c)}{Γ (a) Γ (c - a)} \int_{0}^{1} τ^{a - 1} {(1 - τ)}^{c - a - 1} e^{τ z} d t = \int_{0}^{1} e^{τ z} d μ

d μ (τ) = \frac{Γ (c)}{Γ (a) Γ (c - a)} \frac{τ^{a - 1}}{{(1 - τ)}^{c - a - 1}} d t .

By Proposition 3, we have the desired assertion. ☐

2.3. Fractional Sigmoid Function FSF

CTFE can be approximated by FSF. In our investigation, we focus on the type of function, which is analytic in U. We suggest the function (see Figure 4):

T_{γ} (z) = \frac{2}{1 + e^{- γ z}}, γ \neq 1, z \in U .

(8)

The expansion CTFE are given as follows:

\begin{matrix} T_{2} (z) = 1 + z - \frac{z^{3}}{3} + \frac{2 z^{5}}{15} - \frac{17 z^{7}}{315} + O (z^{9}) \\ T_{3} (z) = 1 + \frac{3 z}{2} - \frac{9 z^{3}}{8} + \frac{81 z^{5}}{80} - \frac{1413 z^{7}}{4480} + O (z^{9}) \\ T_{4} (z) = 1 + 2 z - \frac{8}{3} z^{3} + \frac{64}{15} z^{5} - \frac{217}{315} z^{7} + O (z^{9}) \\ ⋮ \end{matrix}

(9)

For sufficient values of a and

c,

CTFE approximated by FSF can be majorized by

Φ (a, c; z) .

3. Complex-Valued Neural Networks

CNNs are a necessary extension of the analysis of real-valued neural networks. CNNs are networks that utilize complex-valued variables and parameters, effectively distributing in this style with complex-valued information. They are very well matched with wave phenomena, and they are suitable for the procedures connected with complex altitude [25]. The have been used for a long list of applications, essentially in learning tasks, loss function, cost function, utility function and combinatorial optimization.

In CNNs, the neurons in each layer are systematized as a three-dimensional array rather than as a vector in ANNs (artificial neural networks). The first two dimensions are titled spatial, and the third is a partition to networks. The CNN system charts three ideologies characteristic of natural systems: locality, sharing and pooling.

The locality behavior is the information that neurons depend only on their neighbors, rather than on far away neurons. Sharing is the limitation that various pi neurons should undergo the same processing. It is challenging that an affine layer follows locality, and sharing results in a convolution layer. Pooling is used to indicate invariance to small translations. A pooling layer does so by splitting each input channel into patches and replacing each patch with a single representative assessment in the output layer.

Suppose the CNN is delivered by n fully connected in a Hopfield-like net. The output is given by a complex number for each neuron:

Z = {z_{1}, \dots, z_{n}} \subset U .

Thus, the network state (information of the net)

I_{γ} (z_{k}), k = 1, \dots, n

is a complex vector. In this work, we shall use the total information, which is given by the relation:

I_{γ} (z) = \sum_{k = 1}^{n} \frac{T_{γ} (z_{k})}{γ - 1}, γ \neq 1, z \in Z,

(10)

where

T_{γ}

is approximated by Equation (9). Therefore, a large amount of information can be realized from both theoretical study and numerical computations from

T_{γ} (z) .

The stability of Equation (10) is given by the energy equation:

E_{γ} = \frac{I_{γ} (z) {\bar{I}}_{γ} (z)}{n},

(11)

where

{\bar{I}}_{γ} (z)

is the conjugate of

I_{γ} (z) .

The energy provides a tool for studying the dynamics of CNNs. Figure 5 shows the steps of finding the energy. The minimum energy is bounded by the value

ρ

, which is suggested during the training of CNN.

It has been shown by experiences, for a CNN of four neurons, that the minimum energy is satisfying Equation (11) for the output on

\partial U

as follows:

Z = {i, - i, 1, - 1} .

The energy

E_{γ}

is equal to one for all values

γ > 2

; while the energy is increasing for outcomes inside the unit disk

U .

For example, the output set:

Z = {\frac{1 + i}{2}, \frac{1 - i}{2}, \frac{i}{2}, - \frac{i}{2}}

has energy

E_{γ} > 1

, for different values of

γ .

Numerical Examples

Let

Z = {i, - i, 1, - 1}

be the outcome set of CNN. To apply our algorithm, we pursue the following steps:

Step 1. Calculate

T_{γ}, γ > 2

from Equation (8) as follows: for

γ = 3,

we have:

\begin{matrix} T_{3} (i) = \frac{2}{1 + e^{- 3 i}} = 1 + 14.1 i, T_{3} (- i) = \frac{2}{1 + e^{3 i}} = 1 - 14.1 i, \\ T_{3} (1) = \frac{2}{1 + e^{- 3}} = 1.9, T_{3} (- 1) = \frac{2}{1 + e^{3}} = 0.094; \end{matrix}

Step 2. Compute the total information by using Equation (10):

I_{3} (z) = \sum_{k = 1}^{4} \frac{T_{3} (z_{k})}{3 - 1} \approx 2 .

Step 3. Estimate the energy of CNN by applying Equation (11):

E_{3} = \frac{I_{3} (z) {\bar{I}}_{3} (z)}{4} = 1 .

Remark 1.

One can show that for all

γ > 2,

the estimate energy for the set

Z = {i, - i, 1, - 1}

is equal to one. The algorithm will stop at the value

ρ,

which was given previously. In our example, we consider

ρ = 1

for all

z \in \bar{U} .

Moreover, to estimate the energy of the outcomes set

Z = {\frac{1 + i}{2}, \frac{1 - i}{2}, \frac{i}{2}, - \frac{i}{2}},

we follow the above steps:

T_{3} (z) = 5.6, I_{3} (z) = 2.8, E_{3} = 1.96 .

Comparing with

ρ = 1,

the CNN needs more training.

Remark 2.

Comparing with the complex Shannon entropy [18], we obtain the following values for the set

Z = {i, - i, 1, - 1}

:

H (i) \approx 1.0010005 - 0.999999499 i, H (- i) \approx 1.0010005 + 0.999999499 i, H (1) = 0, H (- 1) = 1 .

This implies total information

I (z) = 3 .

Consequently, we have

E = 9 / 4 = 2.25 > 1 .

4. Discussion

Equation (10) refers to the amount of information in the complex system, which is given in the CNN. The advantage is that CNN does not depend on the number of neurons to get full training of the system (see [11,12,13,14,15,26]). Furthermore, the complex value of the output converges to the stability state faster than the real value. All the complex value outputs are given in the open unit disk $U,$ where $| z | < 1$ (see [16]). In this case, we may use the properties of geometry function theory (GFT). For example, the sigmoid function of the complex value is studied widely in view of GFT. The convexity and other geometric representations of this function have been studied by many authors (see [27]).
The parameter $γ$ from $I_{γ}$ is: the simplest non-trivial perturbation of any unperturbed complex system; the complex system (CNN) in which obvious necessary and sufficient conditions are recognized for a small divisor problem is stable.
The output may cause a complex-valued function incited by the set $Z .$ In this situation, the stability comes from the first derivative of $I (z)$ with respect to z. This type of stability is called Lyapunov stability. At a fixed point $z_{0}$ :

$I_{γ}^{'} (z_{0}) = \frac{d}{d z} I_{γ} (z_{0}) = 2 z_{0} .$

At a periodic point $z_{0}$ of period ℘, the first derivative of a function:

${(I_{γ}^{℘})}^{'} (z_{0}) = \frac{d}{d z} I_{γ}^{℘} (z_{0}) = \prod_{i = 0}^{℘ - 1} I_{γ}^{'} (z_{i}) = 2^{℘} \prod_{i = 0}^{℘ - 1} z_{i} = λ$

is usually given by $λ$ and represented by the multiplier or the Lyapunov characteristic number. It applies to checking the stability of periodic points, as well as fixed points ( $λ = 0$ ).
At a non-periodic point, the derivative, $z_{n}^{'},$ can be iterated by:

$z_{0}^{'} = 1; z_{n}^{'} = 2 \times z_{n - 1} \times z_{n - 1}^{'} .$
The above derivative can be replaced by any derivative for a complex variable $z \in C$ such as the Schwarzian derivative. We may suggest this as a future work.
Derivative with respect to $γ$ (parametric derivative): This type of derivative is called the distance estimation method. In this case, CNN has one output in the set $Z$ , and it is fixed. Therefore, we suggest to use the parameter plane collecting information. This occurs as follows: On the parameter plane: $γ$ is a variable, and $z_{0} = 0$ is constant. The first derivative of $I_{γ}^{n} (z_{0})$ with respect to $γ$ is given by the relation:

$z_{n}^{'} = \frac{d}{d γ} I_{γ}^{n} (z_{0}) .$

This derivative can be defined by the following iteration:

$z_{0}^{'} = \frac{d}{d γ} I_{γ}^{0} (z_{0}) = 1$

and then replacing at every consecutive step:

$z_{n + 1}^{'} = \frac{d}{d γ} I_{γ}^{n + 1} (z_{0}) = 2 \cdot I_{γ}^{n} (z) \cdot \frac{d}{d γ} I_{γ}^{n} (z_{0}) + 1 = 2 \cdot z_{n} \cdot z_{n}^{'} + 1 .$

5. Conclusions and Future Research

In the present paper, we have been applying the model of complex probability to Tsallis’ entropy. Henceforth, we established a fitted connection between the new model and the classical FTE. Therefore, we developed the theory of information. As an application, we made a generalization of CNNs; its result implied minimization of the energy in this complex system. The aid of extending FTE leads to very stimulating and successful consequences and outcomes illustrated in this work. Therefore, we are calling this original and beneficial new study in applied mathematics and analytics: “the theory of complex information”.

It is intended that additional development of this original study will be done in subsequent work such as convergence, convexity and concavity. It is proposed that in future research studies, the novel planned analytic method will be elaborated more, and the complex probability model, as well as extensive and various sets of stochastic processes will be applied.

Author Contributions

Conceptualization, R.W.I. and M.D.; methodology, R.W.I.; software, R.W.I.; validation, R.W.I. and M.D.; formal analysis, R.W.I. and M.D.; investigation, R.W.I. and M.D.; writing—original draft preparation, R.W.I.; writing—review and editing, M.D.; funding acquisition, M.D.

Funding

This research was funded by Universiti Kebangsaan Malaysia grant number GUP-2017-064.

Acknowledgments

The authors would like to express their thanks to the reviewers for their important and useful comments to improve the paper. The work here is partially supported by the Universiti Kebangsaan Malaysia grant: GUP (Geran Universiti Penyelidikan)-2017-064.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

$γ$	parameter
$λ$	diffusion constant
z	complex number
$P_{c}$	complex probability
$S_{R}$	real set of events
$S_{M}$	imaginary set of events
$P_{r}$	probability in the real set
$P_{m}$	probability in the imaginary set
U	the open unit disk
${\| z \|}^{2}$	the degree of our knowledge of the random experiment; it is the square of the norm of z
$T_{γ} [P_{c}]$	CFTE
$ℜ (T_{γ})$	the real part of CFTE
$Φ (a, c; z)$	Gaussian function
$Γ (.)$	gamma function
$I_{γ}$	total information
$E_{γ}$	the energy
$ρ$	the upper bound of energy

References

Tsallis, C. Possible generalization of Boltzmann-Gibbs statistics. J. Stat. Phys. 1988, 52, 479–487. [Google Scholar] [CrossRef]
Tsallis, C. The nonadditive entropy Sq and its applications in physics and elsewhere: Some remarks. Entropy 2011, 13, 1765–1804. [Google Scholar] [CrossRef]
Ibrahim, R.W.; Jalab, H.A. Existence of entropy solutions for nonsymmetric fractional systems. Entropy 2014, 16, 4911–4922. [Google Scholar] [CrossRef]
Ibrahim, R.W.; Jalab, H.A. Existence of Ulam stability for iterative fractional differential equations based on fractional entropy. Entropy 2015, 17, 3172–3181. [Google Scholar] [CrossRef]
Ibrahim, R.W.; Jalab, H.A.; Gani, A. Cloud entropy management system involving a fractional power. Entropy 2015, 18, 14. [Google Scholar] [CrossRef]
Ibrahim, R.W.; Jalab, H.A.; Gani, A. Perturbation of fractional multi-agent systems in cloud entropy computing. Entropy 2016, 18, 31. [Google Scholar] [CrossRef]
Jalab, H.A.; Ibrahim, R.W.; Amr, A. Image denoising algorithm based on the convolution of fractional Tsallis entropy with the Riesz fractional derivative. Neural Comput. Appl. 2017, 28, 217–223. [Google Scholar] [CrossRef]
Ibrahim, R.W. The maximum principle of Tsallis entropy in a complex domain. Ital. J. Pure Appl. Math. 2017, 601–606. [Google Scholar]
Ibrahim, R.W. On new classes of analytic functions imposed via the fractional entropy integral operator. Facta Univ. Ser. Math. Inform. 2017, 32, 293–302. [Google Scholar] [CrossRef]
Al-Shamasneh, A.A.R.; Jalab, H.A.; Palaiahnakote, S.; Obaidellah, U.H.; Ibrahim, R.W.; El-Melegy, M.T. A new local fractional entropy-based model for kidney MRI image enhancement. Entropy 2018, 20, 344. [Google Scholar] [CrossRef]
Rubio, J.D.J.; Lughofer, E.; Plamen, A.; Novoa, J.F.; Meda-Campaña, J.A. A novel algorithm for the modeling of complex processes. Kybernetika 2018, 54, 79–95. [Google Scholar] [CrossRef]
Meda, C.; Jesus, A. On the estimation and control of nonlinear systems with parametric uncertainties and noisy outputs. IEEE Access 2018, 6, 31968–31973. [Google Scholar] [CrossRef]
Rubio, J. Error convergence analysis of the SUFIN and CSUFIN. Appl. Soft Comput. 2018, in press. [Google Scholar]
Meda, C.; Jesus, A. Estimation of complex systems with parametric uncertainties using a JSSF heuristically adjusted. IEEE Lat. Am. Trans. 2018, 16, 350–357. [Google Scholar] [CrossRef]
De Jesús Rubio, J.; Lughofer, E.; Meda-Campaña, J.A.; Páramo, L.A.; Novoa, J.F.; Pacheco, J. Neural network updating via argument Kalman filter for modeling of Takagi-Sugeno fuzzy models. J. Intell. Fuzzy Syst. 2018, 35, 2585–2596. [Google Scholar] [CrossRef]
Abou Jaoude, A. The paradigm of complex probability and Chebyshev’s inequality. Syst. Sci. Control Eng. 2016, 4, 99–137. [Google Scholar] [CrossRef]
Youssef, S. Quantum mechanics as Bayesian complex probability theory. Mod. Phys. Lett. A 1994, 9, 2571–2586. [Google Scholar] [CrossRef]
Abou Jaoude, A. The paradigm of complex probability and Claude Shannon’s information theory. Syst. Sci. Control Eng. 2017, 5, 380–425. [Google Scholar] [CrossRef] [Green Version]
Abou Jaoude, A.; El-Tawil, K.; Seifedine, K. Prediction in complex dimension using Kolmogorov’s set of axioms. J. Math. Stat. 2010, 6, 116–124. [Google Scholar] [CrossRef]
Abou Jaoude, A. The complex probability paradigm and analytic linear prognostic for vehicle suspension systems. Am. J. Eng. Appl. Sci. 2015, 8, 147. [Google Scholar] [CrossRef]
Wilk, G.; Włodarczyk, Z. Tsallis distribution with complex nonextensivity parameter q. Phys. A Stat. Mech. Its Appl. 2014, 413, 53–58. [Google Scholar] [CrossRef]
Mocanu, P.T. Convexity of some particular functions. Studia Univ. Babes-Bolyai Math. 1984, 29, 70–73. [Google Scholar]
Ruscheweyh, S. Convolutions in Geometric Function Theory; Presses de l’Université de Montréal: Montréal, QC, Canada, 1982. [Google Scholar]
Miller, S.S.; Mocanu, P.T. Differential Subordinations: Theory and Applications; CRC Press: Boca Raton, FL, USA, 2000. [Google Scholar]
Kaslik, E.; Ileana, R.R. Dynamics of complex-valued fractional-order neural networks. Neural Netw. 2017, 89, 39–49. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Ibrahim, R.W. The fractional differential polynomial neural network for approximation of functions. Entropy 2013, 15, 4188–4198. [Google Scholar] [CrossRef]
Ezeafulukwe, U.A.; Darus, M.; Olubunmi, A. On analytic properties of a sigmoid function. Int. J. Math. Comput. Sci. 2018, 13, 171–178. [Google Scholar]

Figure 1. The connection of the main objectives of this research.

Figure 2. Bernoulli function

z / (e^{z} - 1)

.

Figure 2. Bernoulli function

z / (e^{z} - 1)

.

Figure 3. Bernoulli function

{[z / (e^{z} - 1)]}^{2}

.

Figure 3. Bernoulli function

{[z / (e^{z} - 1)]}^{2}

.

Figure 4. Sigmoid function

\frac{2}{1 + e^{- γ z}}, γ = 2

.

Figure 4. Sigmoid function

\frac{2}{1 + e^{- γ z}}, γ = 2

.

Figure 5. The algorithm of using CTFE in CNNs.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ibrahim, R.W.; Darus, M. Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs. Entropy 2018, 20, 722. https://doi.org/10.3390/e20100722

AMA Style

Ibrahim RW, Darus M. Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs. Entropy. 2018; 20(10):722. https://doi.org/10.3390/e20100722

Chicago/Turabian Style

Ibrahim, Rabha W., and Maslina Darus. 2018. "Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs" Entropy 20, no. 10: 722. https://doi.org/10.3390/e20100722

APA Style

Ibrahim, R. W., & Darus, M. (2018). Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs. Entropy, 20(10), 722. https://doi.org/10.3390/e20100722

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Analytic Study of Complex Fractional Tsallis’ Entropy with Applications in CNNs

Abstract

1. Introduction