Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors

Ye, Meili; Wang, Nianliang; Yu, Xianfeng; Wang, Xiao; Liu, Wuniu

doi:10.3390/axioms14020126

Open AccessArticle

Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors

by

Meili Ye

¹,

Nianliang Wang

¹,

Xianfeng Yu

¹,

Xiao Wang

¹ and

Wuniu Liu

^2,*

¹

School of Mathematics and Computer Application, Shangluo University, Shangluo 726000, China

²

School of Mathematics and Statistics, Shaanxi Normal University, Xi’an 710062, China

^*

Author to whom correspondence should be addressed.

Axioms 2025, 14(2), 126; https://doi.org/10.3390/axioms14020126

Submission received: 25 December 2024 / Revised: 6 February 2025 / Accepted: 7 February 2025 / Published: 9 February 2025

Download

Browse Figures

Versions Notes

Abstract

:

Fuzzy matrices play a crucial role in fuzzy logic and fuzzy systems. This paper investigates the problem of supervised learning fuzzy matrices through sample pairs of input–output fuzzy vectors, where the fuzzy matrix inference mechanism is based on the max–min composition method. We propose an optimization approach based on stochastic gradient descent (SGD), which defines an objective function by using the mean squared error and incorporates constraints on the matrix elements (ensuring they take values within the interval [0, 1]). To address the non-smoothness of the max–min composition rule, a modified smoothing function for max–min is employed, ensuring stability during optimization. The experimental results demonstrate that the proposed method achieves high learning accuracy and convergence across multiple randomly generated input–output vector samples.

Keywords:

fuzzy set; fuzzy matrix; supervised learning; stochastic gradient descent; decision making

MSC:

15B15

1. Introduction

Fuzzy set theory, introduced by Zadeh in 1965 [1,2], has become a useful mathematical framework for handling uncertainty and imprecision in various scientific and engineering fields. It extends classical set theory by allowing the membership of elements in a set to be represented by degrees between 0 and 1, rather than just binary membership (0 or 1). This makes fuzzy set theory a powerful tool for modeling and reasoning under vagueness that is used in many real-world problems, including artificial intelligence [3,4], decision making [5,6], and data mining [7,8]. Within these fields, fuzzy matrices serve as a fundamental tool in fuzzy logic systems. They are typically employed to represent the relationships between various elements in a fuzzy system, where each element corresponds to a degree of membership or a fuzzy value. Fuzzy matrices can be used for tasks such as fuzzy inference, decision making, and optimization. The traditional approach to constructing fuzzy matrices has been largely dependent on expert knowledge, human intuition, and heuristic rules that reflect the domain-specific characteristics of the problem. For example, in applications like fuzzy model checking [9,10,11,12,13], the fuzzy matrix is often crafted manually, based on insights from experts in the respective field. However, this manual construction process is not without challenges: It is time-consuming, highly subjective, and tends toward inaccuracies, particularly in complex or unfamiliar domains. Moreover, expert knowledge may be scarce or unavailable, further complicating the construction of robust fuzzy systems.

As the field of data science has evolved, there has been a growing interest in using data-driven techniques to overcome the limitations of manual construction. The use of machine learning algorithms to automatically learn fuzzy matrices from data offers an alternative, allowing for the creation of more efficient and accurate fuzzy systems. This paradigm shift, from expert-driven to data-driven approaches, has the potential to significantly enhance the performance and applicability of fuzzy reasoning systems.

1.1. Motivations

The learning of fuzzy matrices has become a hot research topic, with applications emerging in various fields. For example, self-learning fuzzy discrete event systems based on external sensor variables [14,15] and self-learning modeling in possibility theory-based model checking [16] have been explored. These methods rely on supervised learning techniques, where input–output datasets are used to adjust the elements of the fuzzy matrix. One of the challenges in learning fuzzy matrices is dealing with non-smooth operations. In fuzzy reasoning systems, the max–min and max–product composition rules are the most commonly used inference mechanisms, involving taking the maximum of the minimum values between corresponding elements of the input and the matrix. This operation is non-differentiable, posing difficulties for various optimization methods. Previous work on learning fuzzy matrices has been based on the max–product composition method, which only requires smoothing the max operation. A substantial body of research has focused on methods for smoothing the max function [17,18]. However, the learning of fuzzy matrices for the max–min composition method has not been studied.

To fill this gap, we propose a supervised learning algorithm based on stochastic gradient descent (SGD) for learning fuzzy matrices in the max–min composition method. This algorithm optimizes a loss function that reflects the error between the predicted output and the target output based on input–output fuzzy vector pairs, allowing the fuzzy matrix to quickly converge to its true value. The simulation results show that the proposed learning algorithm exhibits good convergence performance toward the true value of the fuzzy matrix. The data-driven, automated process for learning fuzzy matrices will lead to more accurate, consistent, and scalable fuzzy systems, particularly in applications where expert knowledge is unavailable or difficult to obtain.

1.2. Structure of This Paper

The structure of this paper is as follows: Section 2 reviews the related theoretical foundations of the fuzzy matrix. Section 3 describes the proposed fuzzy matrix learning algorithm in detail. Section 4 presents the experimental setup. Section 5 shows the results and analysis. Section 6 concludes the paper and discusses potential future research directions.

2. Fuzzy Reasoning Model

Fuzzy reasoning models are widely used in situations where the relationships between variables are uncertain, imprecise, or vague. In many real-world applications, the exact values of input and output variables may not be known with certainty, and traditional binary logic is inadequate to handle the gradations of truth that exist in such systems. Fuzzy logic provides a more flexible framework, allowing for reasoning with degrees of truth rather than just true or false values. One of the key components of a fuzzy logic system is the fuzzy reasoning model, which helps infer outputs based on fuzzy inputs and predefined relationships. The model operates on fuzzy matrices, which represent the fuzzy relationships between input and output dimensions. By utilizing fuzzy inference rules, it can process uncertain or imprecise information and generate outputs that reflect the inherent vagueness of the system.

A fuzzy reasoning model is a

n \times n

fuzzy matrix P:

P = [\begin{matrix} p_{11} & \dots & p_{1 n} \\ ⋮ & ⋱ & ⋮ \\ p_{n 1} & \dots & p_{n n} \end{matrix}],

(1)

where each element

p_{i j}

represents the fuzzy relationship between the i-th input dimension and the j-th output dimension.

Given an input fuzzy vector s =

[s_{1}, s_{2}, \dots, s_{n}]

, the output fuzzy vector

t = [t_{1}, t_{2}, \dots, t_{n}]

is computed using the max–min composition rule (denoted by a symbol “∘”) as follows:

t_{j} = max_{k = 1, 2, \dots, n} [min (s_{k}, p_{k j})], \forall j = 1, 2, \dots, n .

(2)

This formula indicates that for each output dimension j, the output

t_{j}

is the maximum of the minimum values between the elements of the input vector

s

and the corresponding elements of the fuzzy matrix

p_{k j}

. This method is a classic fuzzy reasoning method, widely used to make inferences under uncertainty and vagueness.

For example, let

s

be an input fuzzy vector and P be a fuzzy matrix:

s = [0.8, 0.5], P = [\begin{matrix} 0.7 & 0.4 \\ 0.5 & 0.6 \end{matrix}] .

(3)

The

t = [t_{1}, t_{2}] = s \circ P

is calculated as follows:

t_{1} = max [min (0.8, 0.7), min (0.5, 0.5)] = 0.7,

(4)

t_{2} = max [min (0.8, 0.4), min (0.5, 0.6)] = 0.5 .

(5)

Thus, the resulting output vector

t = [0.7, 0.5]

.

This example illustrates how the max–min composition rule works: For each output dimension, the output value is determined by the maximum of the minimum values computed between the input vector and the corresponding column in the fuzzy matrix. This method is one of the fundamental techniques used in fuzzy reasoning and is particularly useful in situations where inputs are uncertain or imprecise, and we need to make decisions based on fuzzy relationships.

3. Learning Algorithm for Fuzzy Matrix

In real-world applications,

P_{true}

is an unknown fuzzy inference model, and we only have input–output sample pairs to learn the unknown

P_{true}

. The learned

P_{true}

is then applied in the inference model: Given an input vector and

P_{true}

, we obtain the output vector for decision making and control. In the simulation testing of this paper, to generalize our algorithm, we do not use actual datasets to obtain

P_{true}

. Instead, we randomly generate a

P_{true}

as the true

P_{true}

, which is used to generate input–output vector sample pairs. These sample pairs are then used to supervise the learning of

P_{true}

. The learned

P_{true}

is compared with the randomly generated

P_{true}

to verify the performance of the algorithm.

Suppose R sample pairs

{(s_{i}, t_{i})}_{i = 1}^{R}

are available. We want to use them to learn all elements

p_{i j}

of the fuzzy matrix P based on max–min rules. Figure 1 shows the summary of learning algorithm for fuzzy matrix. To measure the discrepancy between the predicted value

\hat{t}

and the target value

t

, we adopt the mean squared error (MSE) as the objective function. The objective function is optimized by calculating the average of the squared differences between the predicted and target values. Specifically, the loss function is defined as follows:

L : = \frac{1}{2} {∥ \hat{t} - t ∥}^{2} = \frac{1}{2} \sum_{k = 1}^{n} {({\hat{t}}_{k} - t_{k})}^{2},

(6)

where

\hat{t}

is the predicted value obtained using the fuzzy matrix P and input fuzzy vector

s

, and

t

is the target value of the sample. By minimizing this loss function, we can learn the optimal fuzzy matrix P. Since P is a

n \times n

fuzzy matrix,

n^{2}

parameters need to be learned by using objective function

L

.

Gradient Calculation and Optimization

Due to the non-smoothness of the fuzzy max–min composition operation, calculating the gradient of the elements of fuzzy matrix is more complex. To address this, we used a revised exponential penalty function to approximate the fuzzy max–min function in Equation (2) for the purpose of accurate learning, i.e.,

t_{j} = ⋁_{k = 1}^{n} (s_{k} \land p_{k j}) \approx τ ln \sum_{k = 1}^{n} {[exp (\frac{- s_{k}}{τ}) + exp (\frac{- p_{k j}}{τ})]}^{- 1},

(7)

where constant

τ > 0

is a hyperparameter determining the accuracy of approximation [19,20].

Figure 2 shows the approximation curves for

τ = 0.01

,

τ = 0.1

, and

n = 15

. In this case, the matrix P and the vector s are both randomly generated. As observed from the figure, the smaller the value of

τ

, the more accurate the approximation.

This observation highlights the trade-off between computational complexity and approximation accuracy. A smaller

τ

often results in better precision but may require more computational effort to achieve convergence, as the optimization process becomes more sensitive to small changes in the parameters.

On the basis of the principle of gradient decent learning,

p_{i j}^{n e w} = p_{i j} - η \frac{\partial L}{\partial p_{i j}},

(8)

where

η

is the learning rate, controlling the step size of each update. The superscript “new” is used to indicate the new value of the parameter

p_{i j}

after it has been updated in one iteration of learning.

To ensure that all elements of the matrix remain within the valid range for a fuzzy matrix, the fuzzy matrix P is projected onto the interval

[0, 1]

after each update; then, we have a new learning equation:

p_{i j}^{n e w} = max [0, min (1, p_{i j} - η \frac{\partial L}{\partial p_{i j}})] .

(9)

In this way, the fuzzy matrix P is progressively optimized to predict the output more accurately.

Based on Equation (9), we have

\frac{\partial L}{\partial p_{i j}} = \sum_{k = 1}^{n} \frac{\partial L}{\partial t_{k}} \frac{\partial t_{k}}{\partial p_{i j}} = \frac{\partial L}{\partial t_{j}} \frac{\partial t_{j}}{\partial p_{i j}}

(10)

and

\frac{\partial L}{\partial t_{j}} = {\hat{t}}_{j} - t_{j} .

(11)

To compute the partial derivative of the function

t_{j} = τ ln \sum_{k = 1}^{n} {[exp (\frac{- s_{k}}{τ}) + exp (\frac{- p_{k j}}{τ})]}^{- 1},

(12)

with respect to

p_{i j}

. Rewriting the function, we define

g_{k} = exp (\frac{- s_{k}}{τ}) + exp (\frac{- p_{k j}}{τ});

(13)

then, the function becomes

t_{j} = τ ln \sum_{k = 1}^{n} g_{k}^{- 1} .

(14)

Using the chain rule,

\frac{\partial t_{j}}{\partial p_{i j}} = τ \cdot \frac{1}{\sum_{k = 1}^{n} g_{k}^{- 1}} \cdot \frac{\partial \sum_{k = 1}^{n} g_{k}^{- 1}}{\partial p_{i j}} .

(15)

First, compute the derivative of

g_{k}^{- 1}

:

\frac{\partial g_{k}^{- 1}}{\partial p_{i j}} = - g_{k}^{- 2} \cdot \frac{\partial g_{k}}{\partial p_{i j}} .

(16)

For

g_{k}

, only the term where

k = i

depends on

p_{i j}

is used. Thus,

\frac{\partial g_{k}}{\partial p_{i j}} = \{\begin{matrix} - \frac{1}{τ} exp (\frac{- p_{i j}}{τ}), & if k = i, \\ 0, & if k \neq i . \end{matrix}

(17)

When

k = i

, the contribution to

\frac{\partial \sum_{k = 1}^{n} g_{k}^{- 1}}{\partial p_{i j}}

is

\frac{\partial \sum_{k = 1}^{n} g_{k}^{- 1}}{\partial p_{i j}} = g_{i}^{- 2} \cdot \frac{1}{τ} exp (\frac{- p_{i j}}{τ}),

(18)

where

g_{i} = exp (\frac{- s_{i}}{τ}) + exp (\frac{- p_{i j}}{τ}) .

Substitute this into the chain rule:

\begin{matrix} \frac{\partial t_{j}}{\partial p_{i j}} & = τ \cdot \frac{1}{\sum_{k = 1}^{n} g_{k}^{- 1}} \cdot (g_{i}^{- 2} \cdot \frac{1}{τ} exp (\frac{- p_{i j}}{τ})) \\ = \frac{1}{\sum_{k = 1}^{n} g_{k}^{- 1}} \cdot \frac{1}{g_{i}^{2}} \cdot exp (\frac{- p_{i j}}{τ}) \end{matrix}

(19)

Simplify the following expression:

\frac{\partial L}{\partial p_{i j}} = \frac{({\hat{t}}_{j} - t_{j}) \cdot exp (\frac{- p_{i j}}{τ})}{{[exp (\frac{- s_{i}}{τ}) + exp (\frac{- p_{i j}}{τ})]}^{2} \cdot \sum_{k = 1}^{n} {[exp (\frac{- s_{k}}{τ}) + exp (\frac{- p_{k j}}{τ})]}^{- 1}} .

(20)

The complete learning algorithm workflow is as follows: Initialize the fuzzy matrix P with random values or 0.5. For each sample pair

(s_{i}, t_{i})

, compute the predicted value

{\hat{t}}_{i}

using the max–min composition rule. Compute the loss function and its gradient. Update P, and project it into the range

[0, 1]

. Repeat steps until the convergence condition is met.

4. Simulation Settings

The goal of the experimental section is to evaluate the performance of the proposed fuzzy matrix learning algorithm. In this section, we describe the experimental setup, including data generation, evaluation metrics, baseline methods, and results. We programmed a program in MATLAB (version 2021b) to achieve the learning algorithm and to generate the sample pairs required to evaluate the learning performance. We have made the MATLAB codes open source for this paper.

4.1. Data Generation

For the experiments, we generate R random sample pairs

(s_{i}, t_{i})

, where each input sample

s_{i}

and its corresponding output

t_{i}

are fuzzy vectors of size n, and their elements are drawn from the interval

[0, 1]

. The inputs

s_{i}

are randomly generated as n-dimensional vectors, with each component

s_{i} [k]

sampled independently from an uniform distribution in

[0, 1]

. The output vectors

t_{i}

are generated by applying a true fuzzy matrix

P_{true}

to each input vector

s_{i}

through the max–min composition rule described earlier:

t_{i} = s_{i} \circ P_{t r u e}

, i.e.,

t_{i} [j] = max_{k} min (s_{i} [k], P_{true} [k, j]), \forall j = 1, \dots, n,

(21)

where

P_{true}

is a randomly generated fuzzy matrix, and the resulting output vectors

t_{i}

represent the true output corresponding to each input vector

s_{i}

. The number of samples R is varied to assess the impact of dataset size on the learning process.

4.2. Evaluation Metrics

To assess the performance of the proposed learning algorithm, we use two primary evaluation metrics:

Mean Squared Error (MSE): The MSE is calculated between the predicted outputs ${\hat{t}}_{i}$ and the true outputs $t_{i}$ over all samples. This metric quantifies the average squared difference between the predicted and target values and serves as a measure of the accuracy of the learned fuzzy matrix.

$MSE = \frac{1}{n R} \sum_{i = 1}^{R} {∥ {\hat{t}}_{i} - t_{i} ∥}^{2} .$

(22)
Matrix Reconstruction Error (MRE): This metric evaluates how well the learned fuzzy matrix $P_{learned}$ approximates the true fuzzy matrix $P_{true}$ . It is calculated as the Frobenius norm of the difference between the true matrix and the learned matrix:

$MRE = ∥ P_{learned} - P_{true} ∥_{F},$

(23)

where ${∥ \cdot ∥}_{F}$ represents the Frobenius norm, which sums the squared differences of all corresponding elements of the two matrices. Concretely, ${∥ A ∥}_{F} = \sqrt{\sum_{i = 1}^{m} \sum_{j = 1}^{n} {| a_{i j} |}^{2}}$ , where $a_{i j}$ is the element in row i and column j of the matrix A. The smaller the value, the closer the learned matrix P is to the true matrix $P_{t r u e}$ .

These metrics allow for a comprehensive assessment of both the accuracy of the output prediction and the quality of the learned fuzzy matrix.

5. Experimental Results and Analysis

The learning algorithms derived from the mentioned theories seem feasible but require computer simulations to evaluate their performance. Similar to other iterative optimization problems, the learning efficiency of the fuzzy matrix depends on various factors, including initial conditions, learning rates, sample sizes, and the number of epochs. The simulations conducted in this study are inherently abstract and not associated with any specific fuzzy system, yet they remain suitable for evaluating learning performance. Since this represents the inaugural proposal of a learning algorithm within the context of possibilistic model checking, conducting a comparative analysis is neither feasible nor supported. The results show the convergence behavior of the proposed algorithm. This indicates that the algorithm converges to a stable solution after a relatively small number of iterations, demonstrating the efficiency of the SGD approach.

5.1. Learning Performance Evaluation

We first choose to test the learning of a

3 \times 3

fuzzy matrix. Therefore, there are a total of nine parameters to be learned. The hyperparameter settings are as follows: number of samples

R = 50

, learning rate

η = 0.1

, and gradient approximation

τ = 0.01

. A trial contains 50 training epochs.

Figure 3 illustrates the learning process of each element in the fuzzy matrix. The red dashed lines represent the true values, while the blue lines indicate the learning curves. It can be observed that regardless of the true values of the matrix, all elements converge to their true values within 20 epochs. Note that this performance is achieved using only 50 samples to learn nine parameters. In fact, the number of samples can be further reduced, but we did not explore this aspect in greater detail.

At the end of learning, the learned fuzzy matrix

P_{learned}

is

P_{learned} = [\begin{matrix} 0.6078 & 0.0329 & 0.7300 \\ 0.6969 & 0.9555 & 0.8951 \\ 0.2306 & 0.0445 & 0.5323 \end{matrix}],

(24)

and

P_{learned} - P_{true} = 1 \times 10^{- 3} \times [\begin{matrix} - 0.0000 & 0.4183 & - 0.0000 \\ - 0.0000 & - 0.2355 & - 0.0000 \\ 0.0000 & 0.0001 & 0.0000 \end{matrix}],

(25)

Figure 4 shows the curve of the MSE as learning epochs. It can be observed that as the number of epochs increases, the MSE decreases rapidly. At the end of this particular trial, the MSE reached

4.72464 \times 10^{- 6}

. The trial was completed in 0.67 s.

Figure 5 shows the curve of the matrix reconstruction error as learning epochs. It can be observed that as the number of epochs increases, the matrix reconstruction error decreases rapidly. At the end of this particular trial, the MSE reached

4.8001 \times 10^{- 4}

.

We further explored the learning performance of the

5 \times 5

fuzzy matrix with the following parameter settings: learning

epochs = 100

, learning rate

η = 0.1

, and sample size

R = 50

. At the end of the learning process, the mean squared error (MSE) reached

9.25065 \times 10^{- 6}

, and the MRE was 0.507827. The learning curves of the matrix are shown in Figure 6.

From Figure 6, it can be observed that with the sample size fixed and the number of parameters to learn increasing to 25, the majority of parameters (20/25) quickly approach their true values. However, a few parameters fail to reach their true values, even when the MSE is sufficiently small and converged. This phenomenon is noteworthy and can be attributed to the fact that, under the max–min composition operation, the fuzzy matrix is not a one-to-one function but a one-to-many function. Consequently, a single set of samples may correspond to multiple optimal fuzzy matrices.

To address this issue, increasing the sample size can help reduce the parameter ambiguity. Subsequently, we further investigated the impact of different sample sizes on the reconstruction error of fuzzy matrices across varying dimensions. We plotted the 3D surfaces showing the variation of the mean squared error and reconstruction error with respect to the matrix dimension n (

n = 3, 5, 10, 15,

and 20) and sample size R (

R = 10, 50, 100, 200,

and 500), as shown in Figure 7 and Figure 8. Note that each data point represents a test set. The plane between data points is color-filled to better observe the trends between matrix dimensions, sample size, and MSE/MRE. The color blocks represent the predicted relationship plane, while the intersection lines between color blocks indicate the predicted relationship curves.

From Figure 7, it can be observed that for fuzzy matrices of any dimension, the MSE decreases significantly as the sample size R increases. However, for the same sample size, higher-dimensional matrices (larger n) generally result in higher MSE values. Figure 8 illustrates the variation of the reconstruction error (Frobenius norm) with the matrix dimension n and sample size R. For lower-dimensional matrices, increasing the sample size R can substantially reduce the reconstruction error. However, for higher-dimensional matrices, significantly more samples are required to achieve a low reconstruction error; otherwise, the algorithm may experience a saturation effect.

We did not further investigate the specific number of samples required to noticeably reduce the reconstruction error when learning high-dimensional fuzzy matrices. as this goes beyond the scope of this study and is a time-consuming process.

In fact, Figure 7 and Figure 8 reflect the strong stability of the proposed learning algorithm. Figure 7 and Figure 8 illustrate the relationship between the number of samples, matrix dimensions, and MSE/MRE, respectively. It can be observed that for these tests, the proposed algorithm consistently exhibits a low MSE and MRE.

5.2. Robustness Test

To test the robustness of the proposed algorithm, we introduce varying levels of noise into the input data. Noise is added to the input vectors

s_{i}

by randomly perturbing their elements with values drawn from a uniform distribution in the interval

[0, δ]

, where

δ

represents the noise level and

δ = 0, 0.02, 0.05, 0.08,

and

0.1

. The algorithm is then trained on this noisy data, and its performance is evaluated using the same metrics (MSE and MRE).

As shown in Figure 9 and Figure 10, even with a noise level of 0.1 (allowing an error of 0.1), the reconstruction error is only 0.035 at the end of the training. The MSE increases as the noise level increases. It is noteworthy that, at a noise level of 0.1, the final MSE is only 0.02. Therefore, the proposed algorithm exhibits strong stability and generalization capabilities under different noise levels. Even as the noise level increases, the algorithm maintains a low MSE and matrix reconstruction error, demonstrating its robustness to noisy data.

We did not pursue potentially better results, as the current results sufficiently validate the theoretical framework and demonstrate the learning performance of the proposed algorithms. The algorithm exhibits good convergence properties, robustness to noise, and strong generalization capabilities. It is worth noting that the learned model parameters are locally optimal rather than globally optimal. In fact, achieving optimal values (true values) for all parameters is highly challenging. Due to the nature of the max–min composition rule in fuzzy matrices, which is essentially a one-to-many function, the output is not unique. Therefore, under the max–min composition rule, multiple optimal fuzzy matrices can exist for the same set of samples. As a result, even if some parameters do not exactly reach their true values, we still consider the solution to be optimal.

The MATLAB program ran on a PC that was equipped with an Intel Xeon(R) E5-2680 v4 2.40 GHz CPU, 128 GB RAM, and the 64-bit Windows 10 operating system.

6. Conclusions

This paper proposes a stochastic gradient descent-based fuzzy matrix learning method using input–output vectors, filling the gap in fuzzy matrix learning under the max–min fuzzy composition rule. First, we introduce a smoothing function to approximate the max–min function. Then, based on the stochastic gradient descent optimization method, we derive the learning algorithm. Finally, we conduct a performance analysis of the proposed algorithm. The simulation results demonstrate that the learning algorithm achieves high accuracy and rapidly converges to its true values. Moreover, the algorithm exhibits strong robustness, as it can still quickly converge to values near the true ones under noise interference.

Future work will focus on three aspects: first, validation and application on real-world datasets, for example, applications in the control and decision making of fuzzy discrete event systems based on external variables [14,15], as well as self-learning modeling in possibilistic model checking [16]. Second, an important research direction is how input–output vectors can be obtained through external variables, for instance, connecting input-output vectors to external sensor variables using Gaussian membership functions, deriving the corresponding learning algorithm, and analyzing its learning performance. Additionally, extending this method to other fuzzy reasoning mechanisms is also an important research direction.

Author Contributions

Conceptualization, M.Y. and W.L.; methodology, M.Y., N.W., X.Y. and X.W.; formal analysis, M.Y. and W.L.; writing—original draft preparation, M.Y.; writing—review and editing, M.Y. and W.L.; visualization, M.Y., X.Y. and W.L.; project administration, M.Y.; funding acquisition, M.Y., N.W., X.Y. and X.W. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Shangluo University Key Disciplines Project under the discipline of Mathematics. The authors also acknowledge the financial support from the Shaanxi Provincial Natural Science Basic Research Program (No. 2024JC-YBMS-062). Additional funding was provided by the Shangluo University Foundation (Grant No. 20SKY021) and Shangluo University Foundation (Nos. 22SKY111, 23KYPY08).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The matlab codes of the study are openly available in https://www.alipan.com/s/qYveBYfAsQ6 (accessed on 25 December 2024).

Acknowledgments

The authors wish to express their gratitude to the anonymous referees for their valuable contributions in refining the presented ideas in this paper and enhancing the clarity of the presentation.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

SGD	Stochastic gradient descent
MSE	Mean squared error
MRE	Matrix reconstruction error

References

Zadeh, L.A. Fuzzy sets. Inf. Control. 1965, 8, 338–353. [Google Scholar] [CrossRef]
Zadeh, L.A. Fuzzy sets as a basis for a theory of possibility. Fuzzy Sets Syst. 1978, 1, 3–28. [Google Scholar] [CrossRef]
Dubois, D.; Prade, H. Fuzzy set and possibility theory-based methods in artificial intelligence. Artif. Intell. 2003, 148, 1–9. [Google Scholar] [CrossRef]
Pedrycz, W. An introduction to computing with fuzzy sets-analysis design and applications. IEEE ASSP Mag. 2021, 190, 79–93. [Google Scholar]
Zimmermann, H.J. Fuzzy Sets, Decision Making, and Expert Systems; Springer Science & Business Media: Berlin/Heidelberg, Germany, 1987; Volume 10. [Google Scholar]
Ferreira, M.A.D.d.O.; Ribeiro, L.C.; Schuffner, H.S.; Libório, M.P.; Ekel, P.I. Fuzzy-Set-Based Multi-Attribute Decision-Making, Its Computing Implementation, and Applications. Axioms 2024, 13, 142. [Google Scholar] [CrossRef]
Hüllermeier, E. Fuzzy sets in machine learning and data mining. Appl. Soft Comput. 2011, 11, 1493–1505. [Google Scholar] [CrossRef]
Miloudi, S.; Wang, Y.; Ding, W. An improved similarity-based clustering algorithm for multi-database mining. Entropy 2021, 23, 553. [Google Scholar] [CrossRef]
Li, Y.; Liu, W.; Wang, J.; Yu, X.; Li, C. Model checking of possibilistic linear-time properties based on generalized possibilistic decision processes. IEEE Trans. Fuzzy Syst. 2023, 31, 3495–3506. [Google Scholar] [CrossRef]
Liu, W.; Li, Y. Optimal strategy model checking in possibilistic decision processes. IEEE Trans. Syst. Man Cybern. Syst. 2023, 53, 6620–6632. [Google Scholar] [CrossRef]
Liu, W.; Wang, J.; He, Q.; Li, Y. Model checking computation tree logic over multi-valued decision processes and its reduction techniques. Chin. J. Electron. 2024, 33, 1399–1411. [Google Scholar] [CrossRef]
Ma, Z.; Li, Z.; Li, W.; Gao, Y.; Li, X. Model checking fuzzy computation tree logic based on fuzzy decision processes with cost. Entropy 2022, 24, 1183. [Google Scholar] [CrossRef] [PubMed]
Yu, X.; Li, Y.; Geng, S.; Li, H. Fuzzy Computation Tree Temporal Logic with Quality Constraints and Its Model Checking. Axioms 2024, 13, 832. [Google Scholar] [CrossRef]
Ying, H.; Lin, F. Self-learning fuzzy automaton with input and output fuzzy sets for system modelling. IEEE Trans. Emerg. Top. Comput. Intell. 2022, 7, 500–512. [Google Scholar] [CrossRef]
Ying, H.; Lin, F. Discrete-Time Finite Fuzzy Markov Chains Realized through Supervised Learning Stochastic Fuzzy Discrete Event Systems. IEEE Trans. Fuzzy Syst. 2024, 32, 6088–6100. [Google Scholar] [CrossRef]
Liu, W.; He, Q.; Li, Z.; Li, Y. Self-learning modeling in possibilistic model checking. IEEE Trans. Emerg. Top. Comput. Intell. 2024, 8, 264–278. [Google Scholar] [CrossRef]
Bertsekas, D. Minimax methods based on approximation. In Proceedings 1976 John Hopkins Conference on Information Sciences and Systems; Johns Hopkins University: Baltimore, MD, USA, 1976. [Google Scholar]
Xu, S. Smoothing method for minimax problems. Comput. Optim. Appl. 2001, 20, 267–279. [Google Scholar] [CrossRef]
Tsoukalas, A.; Parpas, P.; Rustem, B. A smoothing algorithm for finite min–max–min problems. Optim. Lett. 2009, 3, 49–62. [Google Scholar] [CrossRef]
Li, L.; Qiao, Z.; Liu, Y.; Chen, Y. A convergent smoothing algorithm for training max–min fuzzy neural networks. Neurocomputing 2017, 260, 404–410. [Google Scholar] [CrossRef]

Figure 1. Configuration of fuzzy matrix learning model.

Figure 2. Max–min fuzzy function vs smoothed approximation (

τ = 0.01, 0.1

).

Figure 2. Max–min fuzzy function vs smoothed approximation (

τ = 0.01, 0.1

).

Figure 3. Learning progress of

3 \times 3

fuzzy matrix.

Figure 3. Learning progress of

3 \times 3

fuzzy matrix.

Figure 4. Progressive decrease in the mean squared error.

Figure 5. Progressive decrease in the matrix reconstruction error.

Figure 6. Learning progress of

5 \times 5

fuzzy matrix.

Figure 6. Learning progress of

5 \times 5

fuzzy matrix.

Figure 7. The MSE varies with matrix dimension n and sample size R.

Figure 8. The reconstruction error varies with matrix dimension n and sample size R.

Figure 9. MRE over epochs for different noise levels.

Figure 10. MSE over epochs for different noise levels.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Ye, M.; Wang, N.; Yu, X.; Wang, X.; Liu, W. Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors. Axioms 2025, 14, 126. https://doi.org/10.3390/axioms14020126

AMA Style

Ye M, Wang N, Yu X, Wang X, Liu W. Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors. Axioms. 2025; 14(2):126. https://doi.org/10.3390/axioms14020126

Chicago/Turabian Style

Ye, Meili, Nianliang Wang, Xianfeng Yu, Xiao Wang, and Wuniu Liu. 2025. "Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors" Axioms 14, no. 2: 126. https://doi.org/10.3390/axioms14020126

APA Style

Ye, M., Wang, N., Yu, X., Wang, X., & Liu, W. (2025). Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors. Axioms, 14(2), 126. https://doi.org/10.3390/axioms14020126

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Supervised Learning Fuzzy Matrix Based on Input–Output Fuzzy Vectors

Abstract

1. Introduction

1.1. Motivations

1.2. Structure of This Paper

2. Fuzzy Reasoning Model

3. Learning Algorithm for Fuzzy Matrix

Gradient Calculation and Optimization

4. Simulation Settings

4.1. Data Generation

4.2. Evaluation Metrics

5. Experimental Results and Analysis

5.1. Learning Performance Evaluation

5.2. Robustness Test

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI