Next Article in Journal
Computing the Matrix G of Multi-Dimensional Markov Chains of M/G/1 Type
Next Article in Special Issue
Decision Uncertainty from Strict Preferences in Sequential Search Scenarios with Multiple Criteria
Previous Article in Journal
Correction: Alanazi et al. Photovoltaic Models’ Parameter Extraction Using New Artificial Parameterless Optimization Algorithm. Mathematics 2022, 10, 4617
Previous Article in Special Issue
Accelerating Consensus Reaching Through Top Persuaders: A Social Persuasion Model in Social Network Group Decision Making
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Forecasting the Number of Passengers on Hungarian Railway Routes Using a Similarity and Fuzzy Arithmetic-Based Inference Method

Faculty of Economics, Eötvös Loránd University, 1053 Budapest, Hungary
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2025, 13(8), 1221; https://doi.org/10.3390/math13081221
Submission received: 8 March 2025 / Revised: 3 April 2025 / Accepted: 7 April 2025 / Published: 8 April 2025

Abstract

:
In this study, we present a similarity and fuzzy arithmetic-based fuzzy inference method and show how effectively it can be used to forecast the number of passengers on a railway route. We introduce a novel fuzzy similarity measure that is derived from the so-called epsilon function, which may be viewed as an alternative to the exponential function. After demonstrating the most important properties of the new similarity measure, we construct a fuzzy inference method that is founded on arithmetic operations over triangular fuzzy numbers. This inference method utilizes the proposed similarity measure to derive weight values for the above-mentioned arithmetic operations. The motivation behind the proposed method is twofold. On the one hand, we aim to construct a method that is simple and easy to implement. On the other hand, we intend to ensure that this method meets the practical requirements for rail passenger forecasts. Using a real-life case study, we demonstrate how well our method can predict the expected number of passengers on a new railway route based on characteristics of this relation. With respect to the studied case, we may conclude that although the similarity and fuzzy arithmetic-based fuzzy inference system has only two adjustable parameters, it may be regarded as a viable alternative to Sugeno-type fuzzy inference systems with a much greater number of adjustable parameters tuned by various optimization techniques.

1. Introduction

Nowadays, when mobility plays a major role in the economy, in business and in private life, understanding the magnitude of the demand for a transport service has become an important factor for service providers. This is also the case for railway operators, for whom forecasting the expected number of passengers on a railway route that does not yet exist is of great importance. Accordingly, several methods have been developed to forecast rail passenger traffic. Some of them are mentioned here, although the list is not exhaustive.
There are some examples of fuzzy models used for short-term passenger demand prediction. A fuzzy model has been developed in a study to predict peak-time demand on the Beijing–Shanghai high-speed railway, which predicts peak cross sections during holiday seasons, helping rail dispatchers to minimize costs and the number of passengers who miss free capacity (see [1]). The application of this method has been validated in practice, showing that fuzzy forecasting has led to significant improvements compared to the previously used methods. There are also some examples of looking for passenger demand trends in a longer-term historical passenger flow database, and learning the characteristics of historical passenger flows can be used to predict future demands using regression techniques (see [2,3]). In other studies, models for air transport demand forecasting were developed by combining neural and fuzzy logic models, with a strong emphasis on the seasonality of transport demands (see [4,5]). A model based on fuzzy logic has also been developed in a study on the prioritization of transport investments, with the aim of prioritizing governmental investments that can have the greatest impact on the shift to public and sustainable transport modes (see [6]). There are also examples of research topics that deal with the human perception of public transport systems, with the aim of demonstrating the potential to increase public transportation ridership (see [7]). Many research articles deal with the implementation of artificial neural networks (ANNs) in transportation sciences and transportation forecasting. The findings and the results comparisons show that these applications have a better prediction ability than the standard methods; however, in most cases it is pointed out that there is more to come in the development and use of artificial neural networks for demand forecasting (see [8,9,10]). Recent studies on traffic forecasting suggest that combinations of existing travel demand forecasting methods can produce more accurate results than using different methods separately. Forecasting techniques ranging from regression models to artificial neural networks and support vector machines (SVMs) incorporating soft computing methods are discussed and compared in [11]. There are many examples where fuzzy forecasting models are combined with neural networks, combinations that can yield more accurate predictions, but the background to the functioning of neural networks might cause difficulties in some cases (see [12,13]). There are also examples of comparing different types of transport demand forecasting models and identifying factors that influence rail transport demand. Examples of such explanatory variables include average rail passenger travel distance, car ownership index, number of buses working in interurban routes per year, or a competition variable, expressed as the ratio of unit cost by bus to the unit cost by rail (see [14]). In some cases, Grey Relational Analysis is also adopted in transportation demand forecasting as well as other consumer demand models (see [15,16]). In other studies, the characteristics of total passenger demand, rail passenger demand and personal car demand are also discussed, with the possible modeling of the modal split (see [17,18]). Passenger demands can also be examined through capacity allocation and timetable planning models. There are examples for the development of timetable optimizations based on the characteristics of railway infrastructure and vehicles (see [19,20]).
The difficulties of standardizing different demand forecasting models is also highlighted, indicating the difficulties of choosing the right type of modeling methods that could be used in different phases of a transportation planning period. An article from recent years dealt with creating a standardized transport demand forecasting knowledge base from which the optimal forecasting method can be easily selected during each planning phase (see [21]).
In this study, we introduce a similarity and fuzzy arithmetic-based fuzzy inference method and demonstrate its effectiveness in predicting the expected number of future passengers on a railway connection. We present a new fuzzy similarity measure derived from the so-called epsilon function, which serves as an alternative to the exponential function (see [22,23]). After highlighting the key properties of this new similarity measure, we develop a fuzzy inference technique that employs arithmetic operations on triangular fuzzy numbers. This method uses the proposed similarity measure to determine weight values for the aforementioned operations. Our motivation for this approach is twofold: firstly, we aim to create a method that is straightforward and easy to apply; secondly, we aspire to ensure that this method fulfills the practical requirements for rail passenger forecasting. Through a real-world case study that uses data from the Hungarian State Railways (MÁV), we illustrate how effectively our method can estimate the average number of passengers for a new railway connection based on the specific characteristics of that connection. It is worth noting that while our similarity and fuzzy arithmetic-based inference system has only two adjustable parameters, it can serve as a practical alternative to Sugeno-type fuzzy inference systems, which typically have a much larger number of adjustable parameters optimized through various techniques such as the Adaptive Neuro Fuzzy Inference System (ANFIS) (see, e.g., [24,25]), genetic algorithm (GA) (see, e.g., [26,27,28]), particle swarm optimization (PSO) (see, e.g., [29]) and pattern-search (PS) method (see, e.g., [30]). Our model and the Sugeno-type fuzzy inference systems with the above-mentioned parameter optimization techniques were implemented in the MATLAB (R2024b) software package [31]. The respective MATLAB source files ‘SFAFIS_example.m’ and ‘FIS_similarity_and_fuzzy_arithmetic_class.m’, which are available at https://jonast.web.elte.hu/, include the example of our case study. With these two files, the results of our case study can be reproduced.
The remaining part of this paper is structured as follows. In Section 2, we provide an overview of the basic concepts we employ in our study. Here, we present the arithmetic operations that will be later utilized over triangular fuzzy numbers and describe a function that will be used to construct a similarity measure. In Section 3, we propose a new fuzzy similarity measure and demonstrate its main properties. A novel fuzzy inference method called the similarity and fuzzy arithmetic-based fuzzy inference method is then presented in  Section 4. In Section 5, by the means of a real-life case study, we demonstrate how this new method can be used to predict the number of passengers on a railway route. Here, we also compare our method with some much more sophisticated fuzzy and neuro-fuzzy methods. Lastly, we make some concluding remarks and outline possible directions for future research.

2. Preliminaries

Here, we will provide a brief overview of the concepts and notations that we will make use of later on. We will use the common notation R for the real line.
Definition 1
(cf. [32]). Let X be a non-empty set. We say that the function μ A : X [ 0 , 1 ] is the membership function of the fuzzy set A on the universe X. For any x X , μ A ( x ) [ 0 , 1 ] is the membership value of x in the fuzzy set A.
Definition 2
(cf. [33]). Let X be a non-empty set and let μ A : X [ 0 , 1 ] be the membership function of the fuzzy set A on the universe X. For any α ( 0 , 1 ] , we say that the ordinary set
A α = { x X : μ A ( x ) α }
is the α-cut of the fuzzy set A.

2.1. Some Arithmetic Operations on Triangular Fuzzy Numbers

We will construct a fuzzy arithmetic-based inference method in which we will utilize triangular fuzzy numbers. We will use the following definition of triangular fuzzy numbers.
Definition 3
(cf. [34]). We say that a fuzzy set A on the real line is a triangular fuzzy number with the parameters a , b , c R , a < b < c , if its membership function μ A ( · ; a , b , c ) : R [ 0 , 1 ] is given by
μ A ( x ; a , b , c ) = 0 , if x a x a b a , if a < x b c x c b , if b < x c 0 , if x > c .
From now on, we will use the notation T ( a , b , c ) for a triangular fuzzy number with the real-valued parameters ab and c, where a < b < c . That is, A = T ( a , b , c ) means that A is a triangular fuzzy number with the parameters ab and c, and so the membership function of A is the function μ A ( · ; a , b , c ) : R [ 0 , 1 ] given in Definition 3.
Remark 1.
It should be added that fuzzy numbers are a class of fuzzy sets (see, e.g., [34,35]). Namely, a fuzzy set A with the membership function μ A : R [ 0 , 1 ] is said to be a fuzzy number if
(a) 
A is normal, i.e., there exists a x 0 R , such that μ A ( x 0 ) = 1 .
(b) 
A is fuzzy convex, i.e., for any t [ 0 , 1 ] and x , y R ,
μ A ( t x + ( 1 t ) y ) min { μ A ( x ) , μ A ( y ) } .
(c) 
μ A is upper semi-continuous on R .
(d) 
The support of A, i.e., the set { x R : μ A ( x ) > 0 } , is bounded.
It can be verified that a fuzzy set with the membership function μ A ( · ; a , b , c ) : R [ 0 , 1 ] given in Equation (2) satisfies the criteria for a fuzzy number. This means that triangular fuzzy numbers are a class of fuzzy numbers.
It is well-known (see, e.g., [33,34]) that using the concept of α -cut and interval arithmetic operations, the addition, subtraction, multiplication by scalar, multiplication and division operations over triangular fuzzy numbers can be defined so that any set of triangular fuzzy numbers is closed under these operations. Noting the definition of the α -cut of a fuzzy set given in Definition 2, after direct calculation, we find that for any α ( 0 , 1 ] , the  α -cut of the triangular fuzzy number T ( a , b , c ) , T ( a , b , c ) α , is the interval
T ( a , b , c ) α = ( 1 α ) a + α b , α b + ( 1 α ) c ,
where a , b , c R and a < b < c . Notice that in Definition 2, α ( 0 , 1 ] ; however, for  α = 0 , the  α -cut of T ( a , b , c ) is the interval [ a , c ] . Hence, for triangular fuzzy numbers, we interpret the α -cut for any α [ 0 , 1 ] as the interval given by Equation (3).
Now, we will show that the weighted arithmetic mean of triangular fuzzy numbers is a triangular fuzzy number whose parameters are the weighted arithmetic means of the corresponding parameters of the triangular fuzzy numbers in question. We will utilize the following operations over intervals.
Definition 4.
For an interval [ x , y ] and a scalar t R , t > 0 , the operation t I [ x , y ] is defined as
t I [ x , y ] = [ t x , t y ] .
Definition 5.
For the intervals [ x 1 , y 1 ] , [ x 2 , y 2 ] , , [ x n , y n ] , n N , n 2 the operation n I i = 1 x i , y i is defined as
n I i = 1 x i , y i = i = 1 n x i , i = 1 n y i .
Using the scalar multiplication operation given in Definition 4 and the summation operator given in Definition 5, we can state the following theorem.
Theorem 1.
Let T ( a 1 , b 1 , c 1 ) , T ( a 2 , b 2 , c 2 ) , , T ( a n , b n , c n ) be triangular fuzzy numbers, where n N , n 2 , and  a i < b i < c i for all i { 1 , 2 , , n } . Furthermore, let w 1 , w 2 , , w n [ 0 , 1 ] such that i = 1 n w i = 1 . Then, for any α [ 0 , 1 ] ,
n I i = 1 w i I T a i , b i , c i α = T i = 1 n w i a i , i = 1 n w i b i , i = 1 n w i c i α ,
where T ( p , q , r ) α denotes the α-cut of a triangular fuzzy number with the parameters p , q , r R , p < q < r .
Proof. 
Let α [ 0 , 1 ] . Based on Equation (3), the  α -cut of T ( a i , b i , c i ) is the interval
T ( a i , b i , c i ) α = ( 1 α ) a i + α b i , α b i + ( 1 α ) c i ,
where i { 1 , 2 , , n } . Now, using Equation (7), and the scalar multiplication and addition operations over intervals given by Equation (4) and Equation (5), respectively, we find that
n I i = 1 w i I T a i , b i , c i α = n I i = 1 w i I ( 1 α ) a i + α b i , α b i + ( 1 α ) c i = n I i = 1 w i ( 1 α ) a i + α b i , w i α b i + ( 1 α ) c i = n I i = 1 ( 1 α ) w i a i + α w i b i , α w i b i + ( 1 α ) w i c i = i = 1 n ( 1 α ) w i a i + α w i b i , i = 1 n α w i b i + ( 1 α ) w i c i = ( 1 α ) i = 1 n w i a i + α i = 1 n w i b i , α i = 1 n w i b i + ( 1 α ) i = 1 n w i c i .
Next, noting Equation (7) again, we see that
( 1 α ) i = 1 n w i a i + α i = 1 n w i b i , α i = 1 n w i b i + ( 1 α ) i = 1 n w i c i = T i = 1 n w i a i , i = 1 n w i b i , i = 1 n w i c i α .
Hence, with Equations (8) and (9), we obtain
n I i = 1 w i I T a i , b i , c i α = T i = 1 n w i a i , i = 1 n w i b i , i = 1 n w i c i α .
   □
We will make use of the following scalar multiplication and summation operations over triangular fuzzy numbers.
Definition 6.
For a triangular fuzzy number T ( a , b , c ) and a t R , t > 0 , the scalar multiplication operation t T ( a , b , c ) is defined as
t T ( a , b , c ) = T ( t a , t b , t c ) .
Definition 7.
For the triangular fuzzy numbers T ( a 1 , b 1 , c 1 ) , T ( a 2 , b 2 , c 2 ) , , T ( a n , b n , c n ) , the summation operation n i = 1 T ( a i , b i , c i ) is defined as
n i = 1 T ( a i , b i , c i ) = T i = 1 n a i , i = 1 n b i , i = 1 n c i .
Exploiting Theorem 1 and using Definitions 6 and 7, we can state the following corollary.
Corollary 1.
Let T ( a 1 , b 1 , c 1 ) , T ( a 2 , b 2 , c 2 ) , , T ( a n , b n , c n ) be triangular fuzzy numbers, where n N , n 2 , and  a i < b i < c i for all i { 1 , 2 , , n } . Furthermore, let w 1 , w 2 , , w n [ 0 , 1 ] such that i = 1 n w i = 1 . Then, the following hold:
(a) 
The weighted arithmetic mean of T ( a 1 , b 1 , c 1 ) , T ( a 2 , b 2 , c 2 ) , , T ( a n , b n , c n ) with respect to the weights w 1 , w 2 , , w n , i.e.,  n i = 1 w i T ( a i , b i , c i ) , is
n i = 1 w i T ( a i , b i , c i ) = T i = 1 n w i a i , i = 1 n w i b i , i = 1 n w i c i .
(b) 
For any α [ 0 , 1 ] , the α-cut of the weighted arithmetic mean n i = 1 w i T ( a i , b i , c i ) , i.e.,  n i = 1 w i T ( a i , b i , c i ) α , is equal to the weighted arithmetic mean of the α-cuts T ( a 1 , b 1 , c 1 ) α , T ( a 2 , b 2 , c 2 ) α , , T ( a n , b n , c n ) α with respect to the weights w 1 , w 2 , , w n . That is,
n i = 1 w i T ( a i , b i , c i ) α = n I i = 1 w i I T a i , b i , c i α .
Proof. 
(a).
Using Equations (10) and (11), we immediately get
n i = 1 w i T ( a i , b i , c i ) = n i = 1 T ( w i a i , w i b i , w i c i ) = T i = 1 n w i a i , i = 1 n w i b i , i = 1 n w i c i .
(b).
Noting Equations (6) and (12) and Theorem 1, we can write
n i = 1 w i T ( a i , b i , c i ) α = T i = 1 n w i a i , i = 1 n w i b i , i = 1 n w i c i α = n I i = 1 w i I T a i , b i , c i α .
   □
Note that the operators I and I are defined over intervals, while ⊙ and ⊕ are defined over triangular fuzzy numbers. Corollary 1 (b) tells us that any α -cut interval of the weighted arithmetic mean of some triangular fuzzy numbers is none other than the weighted arithmetic mean of the α -cut intervals of the respective triangular fuzzy numbers. We will utilize this property of triangular fuzzy numbers in our fuzzy arithmetic-based inference method.

2.2. The Epsilon Function

We will use the so-called epsilon function, which was introduced by Dombi et al. in [22], to construct a fuzzy similarity measure. The epsilon function is defined as follows.
Definition 8
(cf. [22]). Let λ R , λ 0 and Δ R , Δ > 0 . We say that the function ε λ , Δ : ( Δ , + Δ ) R + , which is given by
ε λ , Δ ( x ) = Δ + x Δ x λ Δ 2 ,
is an epsilon function with the parameters λ and Δ.
For more details on the epsilon function and its generalization, see [23]. With direct calculation, we find that
ε λ , Δ ( x ) | x = 0 = 1
and
d ε λ , Δ ( x ) d x | x = 0 = λ Δ 2 Δ 2 x 2 Δ + x Δ x λ Δ 2 | x = 0 = λ Δ 2 Δ 2 x 2 ε λ , Δ ( x ) | x = 0 = λ .
Since
e λ x | x = 0 = 1
and
d e λ x d x | x = 0 = λ ,
from Equations (15)–(18) we see that ε λ , Δ ( x ) and e λ x are identical to first order at x = 0 . Moreover, we have the following result.
Theorem 2
(cf. [22]). For any x ( Δ , + Δ ) ,
lim Δ ε λ , Δ ( x ) = e λ x .
Based on Theorem 2 and the fact that ε λ , Δ ( x ) and e λ x are identical to first order at x = 0 , the epsilon function can be regarded as a good approximation of the exponential function. Exploiting this property of the epsilon function, we will use it to construct a similarity measure.

3. A Fuzzy Similarity Measure Derived from the Epsilon Function

It is well known that if d : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] is a normalized distance measure in the vector space [ 0 , 1 ] n and λ > 0 , then for any vectors x , y [ 0 , 1 ] n ,
s λ * ( x , y ) = e λ d ( x , y )
may be viewed as a measure of similarity between the vectors x and y . Noting Equation (19), we see that the similarity measure s λ * : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] has the following elementary properties:
(a)
For any x , y [ 0 , 1 ] n , if  d ( x , y ) = 0 , then s λ * ( x , y ) = 1 (maximal similarity).
(b)
For any x , y [ 0 , 1 ] n , if  d ( x , y ) = 1 , then s λ * ( x , y ) = e λ (minimal similarity).
(c)
For any x , y , x , y [ 0 , 1 ] n , d ( x , y ) > d ( x , y ) implies s λ * ( x , y ) < s λ * ( x , y ) (monotonocity).
Notice that if λ is large, i.e.,  λ 0 , then e λ 0 . That is, if  λ 0 and d ( x , y ) = 1 , then s λ * ( x , y ) 0 .
It is well-known that for any λ > 0 and x [ 0 , 1 ] , the function g λ ( x ) = ( 1 x ) λ is a good approximation of e λ x . Furthermore, g λ ( 0 ) = 1 , g λ ( 1 ) = 0 and  g λ ( x ) and e λ x are identical to first order at x = 0 .
Now, let λ > 0 and define h λ : [ 0 , 1 ] [ 0 , 1 ] as
h λ ( x ) : = ε λ , 1 ( x ) , x [ 0 , 1 ] ,
i.e.,  h λ ( · ) is an epsilon function on [ 0 , 1 ] with the parameters λ and Δ = 1 . Hence, noting Definition 8, h λ ( x ) is given by
h λ ( x ) = 1 + x 1 x λ 2 = 1 x 1 + x λ 2 , x [ 0 , 1 ] .
Taking into account the properties of the epsilon function, we readily find that for any λ > 0 , h λ ( 0 ) = 1 , h λ ( 1 ) = 0 and h λ ( x ) and e λ x are identical to first order at x = 0 . Hence, we see that g λ ( · ) and h λ ( · ) have similar properties on [ 0 , 1 ] . The following theorem states that for any λ > 0 and x [ 0 , 1 ] , h λ ( x ) is even a better approximation of e λ x than g λ ( x ) .
Theorem 3.
For any λ > 0 and x [ 0 , 1 ] , it holds that
( 1 x ) λ 1 x 1 + x λ 2 e λ x .
Proof. 
It is sufficient to show that
( 1 x ) 2 1 x 1 + x e 2 x
holds because noting that λ > 0 and raising both members of Equation (22) to λ 2 , we get Equation (21).
First, we will prove the left hand side inequality in Equation (22). Since x [ 0 , 1 ] , we have 1 x 2 1 , and as 1 x 2 = ( 1 x ) ( 1 + x ) , we find that
( 1 x ) ( 1 + x ) 1 ,
from which, taking into account the fact that 1 x 0 and 1 + x > 0 ,
( 1 x ) 2 1 x 1 + x
follows.
Now, we will prove the right hand side inequality in Equation (22). With the Maclaurin series of e x , we have
e x = i = 0 x i i ! ,
and since x [ 0 , 1 ] , we see that
e x 1 + x + x 2 2 ,
from which
e 2 x 1 1 + x + x 2 2 2
follows. As  x [ 0 , 1 ] , we also have
1 1 + x + x 2 2 2 1 1 + x 2 ,
and so based on Equations (23) and (24), we find that
1 1 + x 2 e 2 x
holds for any x [ 0 , 1 ] , and the equality in Equation (25) holds only if x = 0 . Now, define f ( x ) as
f ( x ) = e 2 x 1 x 1 + x , x [ 0 , 1 ] .
The first derivative of f is
d f ( x ) d x = 2 1 x + 1 2 e 2 x ,
and taking into account the inequality given in Equation (25), we see that for any x [ 0 , 1 ] ,
d f ( x ) d x 0 .
Since the equality in Equation (25) holds only if x = 0 , we find that for any x ( 0 , 1 ] ,
d f ( x ) d x > 0 ,
which means that f ( · ) is a strictly increasing function on ( 0 , 1 ] . As  f ( 0 ) = 0 and f ( · ) is a strictly increasing function on ( 0 , 1 ] , we see that for any x [ 0 , 1 ] , f ( x ) 0 . Therefore, for any x [ 0 , 1 ] , we have
1 x 1 + x λ 2 e λ x .
   □
Remark 2.
We should add that an immediate practical consequence of Theorem 3 is that if λ 0 , then h λ ( x ) e λ x on [ 0 , 1 ] .
Figure 1 shows some example plots of e λ x and its approximation by g λ ( x ) and the epsilon function h λ ( x ) = ε λ , 1 ( x ) on [ 0 , 1 ] .
Based on the above considerations, we define the epsilon-function-based normalized fuzzy similarity measure as follows.
Definition 9.
Let d : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] be a normalized distance measure in the vector space [ 0 , 1 ] n . We say that the function s λ : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] is an epsilon-function-based normalized fuzzy similarity measure with a parameter λ > 0 in the vector space [ 0 , 1 ] n , if for any x , y [ 0 , 1 ] n , s λ ( x , y ) is given by
s λ ( x , y ) = 1 d ( x , y ) 1 + d ( x , y ) λ 2 .
Notice that s λ ( x , y ) = h λ ( d ( x , y ) ) , where h λ ( · ) is the epsilon function on [ 0 , 1 ] given by Equation (20). The following lemma summarizes the main properties of the epsilon-function-based normalized fuzzy similarity measure given in Definition 9.
Lemma 1.
Let d : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] be a normalized distance measure in the vector space [ 0 , 1 ] n and let s λ : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] be the corresponding epsilon-function-based normalized fuzzy similarity measure with a parameter λ > 0 given by Equation (26). s λ ( · ) has the following properties:
(a) 
For any x , y [ 0 , 1 ] n , if  d ( x , y ) = 0 , then s λ ( x , y ) = 1 (maximal similarity).
(b) 
For any x , y [ 0 , 1 ] n , if  d ( x , y ) = 1 , then s λ ( x , y ) = 0 (minimal similarity).
(c) 
For any x , y , x , y [ 0 , 1 ] n , d ( x , y ) > d ( x , y ) implies s λ ( x , y ) < s λ ( x , y ) (monotonocity).
(d) 
If λ 0 , then for any x , y [ 0 , 1 ] n , s λ ( x , y ) s λ * ( x , y ) , where s λ * ( x , y ) is given by Equation (19).
(e) 
If d ( x , y ) ( 0 , 1 ) , then
lim λ s λ ( x , y ) = 0 .
Proof. 
Properties (a), (b) and (c) readily follow from the definition of s λ ( · ) given in Definition 9, while property (b) is a consequence of Theorem 3.    □

4. A Similarity and Fuzzy Arithmetic-Based Fuzzy Inference Method

In our model, we assume that the value of a dependent variable y R is a function of an n-dimensional normalized feature vector x [ 0 , 1 ] n as follows:
y = f ( x ) + ε ,
where n N , n 1 , f is an unknown mapping from [ 0 , 1 ] n to R and  ε is a normally distributed random variable with mean of 0. Our goal is to build a fuzzy inference method to approximate function f.

4.1. Forming a Feature Vector–Linguistic Value of the Number of Passenger Pairs

We will construct a Similarity and Fuzzy Arithmetic-based Fuzzy Inference System (SFAFIS) to approximate the mapping f given in Equation (28). For this purpose, we will use a set of training data given by the ordered pairs
( x 1 , y 1 ) , ( x 2 , y 2 ) , , ( x m , y m ) ,
where m N , m 2 ,
x i = x i , 1 , x i , 2 , , x i , n [ 0 , 1 ] n ,
i = 1 , 2 , , m , n N , n 1 and  y i R , y i > 0 . Here, the vector x i and the scalar y i represent the normalized feature vector (explanatory or input vector) and the corresponding output in the ith observation, respectively, in the sample given by Equation (29).
Let
q 0 = min i = 1 , 2 , , m ( y i ) and q 1 = max i = 1 , 2 , , m ( y i ) ,
and suppose that y [ q 0 , q 1 ] . Now, let L 1 , L 2 , , L be the linguistic values like ‘very small’, ‘small’, ‘large’, ‘very large’, etc., of y where N , 3 . This means that L 1 , L 2 , , L are fuzzy sets on the interval [ q 0 , q 1 ] . Furthermore, let q k 1 denote the k 1 -quantile of the y 1 , y 2 , , y m values, where k = 1 , 2 , , 2 .
In our approach, the kth linguistic level L k of y is a triangular fuzzy number with the parameters a k , b k , c k R , a k < b k < c k . That is,
L k = T ( a k , b k , c k ) ,
and the membership function μ L k ( · ; a k , b k , c k ) : R [ 0 , 1 ] of L k is given by
μ L k ( y ; a k , b k , c k ) = 0 , if y a k y a k b k a k , if a k < y b k c k y c k b k , if b k < y c k 0 , if y > c k
such that
if k = 1 , then
a k = q 0 q 1 1 q 0 = 2 q 0 q 1 1 , b k = q 0 , c k = q 1 1 ,
if k { 2 , , 1 } , then
a k = q k 2 1 , b k = q k 1 1 , c k = q k 1 ,
if k = , then
a k = q 2 1 , b k = q 1 , c k = q 1 + q 1 q 2 1 = 2 q 1 q 2 1 .
Figure 2 shows an example of the plots of membership functions of the triangular fuzzy numbers (linguistic values) L 1 , L 2 , , L .
Next, for all i = 1 , 2 , , m , we transform the training data pair ( x i , y i ) into the pair
x i , L k ( i ) = x i , T a k ( i ) , b k ( i ) , c k ( i )
such that k ( i ) { 1 , 2 , , } is the smallest index for which the membership value μ L k ( i ) ( y i ; a k ( i ) , b k ( i ) , c k ( i ) ) ) equals the maximum of the membership values
μ L 1 ( y i ; a 1 , b 1 , c 1 ) , μ L 2 ( y i ; a 2 , b 2 , c 2 ) , , μ L ( y i ; a , b , c ) .
That is, k ( i ) is given as
k ( i ) = min r : r { 1 , 2 , , l } , μ L r ( y i ; a r , b r , c r ) = max t = 1 , , ( μ L t ( y i ; a t , b t , c t ) ) .
In other words, k ( i ) { 1 , 2 , , } is the smallest index for which, among the linguistic values L 1 , L 2 , , L , the linguistic value L k ( i ) best represents the y i crisp value.
Remark 3.
Alternatively, instead of the k 1 -quantile, q k 1 can be defined as
q k 1 = k 1 ,
where k = 1 , 2 , , 2 . In this case, we apply a grid partitioning, i.e., with 1 equidistant points, we divide the range [ q 0 , q 1 ] into ℓ consecutive intervals with the same Lebesgue measure (i.e., same length).

4.2. The Inference Mechanism

Suppose that x [ 0 , 1 ] n is an input feature vector for which we aim to predict the value of y. Furthermore, let s λ : [ 0 , 1 ] n × [ 0 , 1 ] n [ 0 , 1 ] be an epsilon-function-based normalized fuzzy similarity measure with a parameter λ > 0 given in Definition 9. That is, for any x , x [ 0 , 1 ] n , s λ ( x , x ) characterizes how much vector x is similar to the vector x . Now, for all i = 1 , 2 , , m , let w i be given by
w i = s λ ( x , x i ) j = 1 m s λ ( x , x j ) .
Notice that w i [ 0 , 1 ] and i = 1 n w i = 1 , and the weight w i may be regarded as the relative measure of how well the input feature vector x fits to the feature vector x i of the ith training data pair ( x i , y i ) , where i { 1 , 2 , , m } .
Since
L k ( i ) = T a k ( i ) , b k ( i ) , c k ( i ) ,
for all i = 1 , 2 , , m , based on Corollary 1 (a), the weighted arithmetic mean of the triangular fuzzy numbers L k ( 1 ) , L k ( 2 ) , , L k ( m ) is the triangular fuzzy number T ( a , b , c ) , which is given by
T ( a , b , c ) = m i = 1 w i L k ( i ) = m i = 1 w i T a k ( i ) , b ( k ( i ) ) , c k ( i ) = T i = 1 m w i a k ( i ) , i = 1 m w i b k ( i ) , i = 1 m w i c k ( i ) ,
where ⊙ and ⊕ are the scalar multiplication and summation operators over triangular fuzzy numbers given in Definition 6 and Definition 7, respectively. Here, we treat the triangular fuzzy number Y = T ( a , b , c ) as the fuzzy output of our similarity and fuzzy arithmetic-based fuzzy inference method, where
a = i = 1 m w i a k ( i ) , b = i = 1 m w i b k ( i ) , c = i = 1 m w i c k ( i ) .
With this approach, our method takes the linguistic value of y i (i.e., the triangular fuzzy number L k ( i ) ) into account in the aggregate fuzzy output Y as much as the input feature vector x fits to the ith feature vector x i . Hence, the defuzzified value of Y may be treated as a y ^ prediction of y for the feature vector x [ 0 , 1 ] n . Using the center of gravity defuzzyfication method, the y ^ prediction of y is
y ^ = a + b + c 3 .
Figure 3 shows the schematic diagram of the similarity and fuzzy arithmetic-based fuzzy inference system (SFAFIS) that we have described so far.

4.3. Tuning the Hyperparameters

The similarity and fuzzy arithmetic-based fuzzy inference system presented above may be treated as a two-parameter function f λ , : [ 0 , 1 ] n R that models the relationship between x i and y i , i.e.,
y i f λ , ( x i ) ,
where ( x i , y i ) is the ith training data pair, i = 1 , 2 , , m . Function f λ , has two parameters, a  λ > 0 and an N , 3 , which may be viewed as the hyperparameters of our SFAFIS. Recall that λ is the parameter of the epsilon-function-based normalized fuzzy similarity measure and is the number of linguistic levels utilized in the SFAFIS. The optimal values of λ and , denoted by λ opt and opt , respectively, can be determined by solving the minimization problem
i = 1 m y i f λ , ( x i ) 2 min .
That is, λ opt and opt are parameter values for which
i = 1 m y i f λ opt , opt ( x i ) 2 = min λ ( 0 , ) N , 3 i = 1 m y i f λ , ( x i ) 2
holds.
The quasi-optimal values of λ and can be found using various numerical methods such as particle swarm optimization or genetic algorithms. In a case study, which we will present in the next section, we are going to demonstrate that, since there are just two tunable parameters in our SFAFIS, even a simple grid search method is an effective way for finding the quasi-optimal values of these parameters.
We should emphasize that although our inference system has only two tunable parameters, in practice, this may serve as a viable alternative to much more sophisticated multi-parametric modeling methods.

5. Case Study

In this case study, we demonstrate how our similarity and fuzzy arithmetic-based model can be applied in practice. The aim of this study was to predict the number of passengers on railway relations using real-life data. For this purpose, historical passenger traffic data of the Hungarian State Railways (MÁV) were collected.
In order to utilize the SFAFIS method, we had to identify those characteristics of the examined relations that mostly affect the number of passengers on these relations. That is, we had to identify the components of an input vector x . Based on the findings of previous research articles (see, e.g., [14,36,37,38,39,40,41,42,43]) and preliminary studies of Hungarian State Railways, the following characteristics of railway routes were taken in to account as explanatory variables:
  • p 1 : Population of the the first city.
  • p 2 : Population of the the second city.
  • l: Distance between the cities (km).
  • t r : Fastest possible travel time between the cities by railway (min).
  • t b : Fastest possible travel time between the cities by bus (min).
  • t c : Fastest possible travel time between the cities by car (min).
  • f I C : Number of average InterCity train departures between the cities.
  • f b : Number of average bus departures between the cities.
  • f r : Number of average train departures between the cities.
  • c r : Unit cost of transport by rail (per passenger-km) (HUF).
  • c b : Unit cost of transport by bus (per passenger-km) (HUF).
  • c c : Unit cost of transport by car (per passenger-km) (HUF).
  • I c o : Car ownership index of the region.
  • d r : Distance of the railway station from the city centers (km).
  • d b : Distance of the bus station from the city centers (km).
Since the above fifteen input characteristics were taken into consideration, an input vector x [ 0 , 1 ] 15 contained the normalized values of the above characteristics. The normalization of the inputs was performed using the well-known min–max normalization method on the data given in Appendix A.
For the sake of simplicity, we stipulated that the railway connections are examined independently, i.e., the number of passengers on different sections of the same route are not added up. Using the company’s sales database for the year 2022, we calculated the rounded value of the average monthly number of passengers for each examined connection. These are shown in column y in Appendix A. Here, we utilized data from 60 railway connections in Hungary. However, it turned out, that even with these real data, the data range is too large; therefore, more data would be required to accurately test the proposed model. Unfortunately, because of the size and usual passenger flows in Hungary, more real-world data could not be gathered; therefore, we selected the densest part of our database (i.e., connections with an average monthly passenger number between 800 and 3500) and generated more records based on the original data. First, we generated new records from the original ones such that each new record was derived as a weighted average of two original records with a randomly selected average ratio. Next, using the same approach, we generated records using the previously generated ones. In Appendix A, we summarize our database, which includes both original and generated data and has 81 records in total. The original records are marked by letters from A to Z, while generated data are marked by a combination of letters corresponding to the records that they were generated from. For example, record AB was generated from records A and B and record ABBC was generated from the records AB and BC.
We evaluated the performance of the SFAFIS method and compared it with some well-known methods using randomly selected training and test data sets from the sample S given in Appendix A. We performed the method evaluations repeatedly so that, in every iteration, the training data set S tr and the test data set S te were randomly selected from the sample S as
S tr = ( x i 1 ( tr ) , y i 1 ( tr ) ) , ( x i 2 ( tr ) , y i 2 ( tr ) ) , , ( x i n tr ( tr ) , y i n tr ( tr ) ) ,
S te = ( x j 1 ( te ) , y j 1 ( te ) ) , ( x j 2 ( te ) , y j 2 ( te ) ) , , ( x j n te ( te ) , y j n te ( te ) ) ,
such that S tr S te = and S tr S te = S , where
  • n tr and n te are the number of training and test data pairs, respectively;
  • x i u ( tr ) and x j v ( te ) are the n-dimensional normalized feature vectors of the i u th and j v th railway connections in the training and test data sets, respectively (i.e., x i u ( tr ) , x j c ( te ) [ 0 , 1 ] n );
  • y i u ( tr ) and y j v ( te ) are the average number of passengers of the i u th and j v th railway relations in the training and test data sets, respectively;
  • y ^ i u ( tr ) and y ^ j v ( te ) are the estimated (computed) values of y i u ( tr ) and y j v ( te ) , respectively;
  • u = 1 , 2 , , n tr and v = 1 , 2 , , n te .
To characterize the goodness of the investigated methods, the following Mean Absolute Percentage Error (MAPE) and Root Mean Square Error (RMSE) metrics were taken into account:
MAPE tr = 100 n tr u = 1 n tr y i u ( tr ) y ^ i u ( tr ) y i u ( tr ) ,
MAPE te = 100 n te v = 1 n te y j v ( te ) y ^ j v ( te ) y j v ( te ) ,
MAPE tr , te = 100 n tr + n te u = 1 n tr y i u ( tr ) y ^ i u ( tr ) y i u ( tr ) + v = 1 n te y j v ( te ) y ^ j v ( te ) y j v ( te ) ,
RMSE tr = 1 n tr u = 1 n tr y i u ( tr ) y ^ i u ( tr ) 2 ,
RMSE te = 1 n te v = 1 n te y j v ( te ) y ^ j v ( te ) 2 ,
RMSE tr , te = 1 n tr + n te u = 1 n tr y i u ( tr ) y ^ i u ( tr ) 2 + v = 1 n te y j v ( te ) y ^ j v ( te ) 2 .
Here, MAPE tr and MAPE te are the MAPE metrics for the training and test data sets, respectively, while MAPE tr , te is the MAPE value that takes into account both the training and test data sets. Similarly, RMSE tr and RMSE te are the RMSE metrics for the training and test data sets, respectively, while RMSE tr , te is the RMSE value that takes into consideration both the training and test data sets.
Notice that using the Equations (33), (34), (36) and (37), the  MAPE tr , te and RMSE tr , te given in Equation (35) and Equation (38), respectively, can also be calculated as
MAPE tr , te = n tr MAPE tr + n te MAPE te n tr + n te
RMSE tr , te = n tr RMSE tr 2 + n te RMSE te 2 n tr + n te .
In every iteration, the number of randomly selected training data points was n tr = 66 , while the remaining n te = 15 data points were utilized as test data.
For every randomly selected training data set, the proposed SFAFIS method was applied with λ = 30 , λ = 60 and λ = 90 parameter values. For each value of the λ parameter, the quasi-optimal value of the number of linguistic variables, i.e.,  opt , was determined by minimizing the RMSE tr 2 quantity as described in Section 4.3, such that opt was searched for in the set { 5 , 6 , , 200 } . That is, for a given value of λ , opt is the value of parameter for which
u = 1 n tr y i u ( tr ) f λ , opt ( x i u ( tr ) ) 2 = min { 5 , 6 , , 200 } u = 1 n tr y i u ( tr ) f λ , ( x i u ( tr ) ) 2 ,
where x i u ( tr ) , y i u ( tr ) is the i u th data pair in the training data set and  f λ , ( x i u ( tr ) ) is the output of the SFAFIS method for input vector x i u ( tr ) .
With the genfis method of the MATLAB (R2024b) software package [31], in every iteration, utilizing the training data, a Sugeno-type fuzzy inference system (FIS) was generated using the subtractive clustering (SC) algorithm. This Sugeno-type FIS had 15 inputs ( x [ 0 , 1 ] 15 ) with 8 Gaussian membership functions for each input, i.e., the number of parameters on the input side was 15 × 8 × 2 = 240 . The number of rules in the rule base, i.e., the number of clusters formed, was eight, and so the output side of the FIS consisted of eight linear functions of the inputs. That is, each of these linear functions had 16 parameters (number of dimensions in the input vectors plus one for the constant term in the linear function). Hence, the total number of tunable parameters in such a Sugeno-type FIS was 15 × 8 × 2 + 8 × 16 = 368 . These parameters were tuned (optimized) using the following techniques, along with a k = 5 -fold cross validation:
  • Adaptive Neuro Fuzzy Inference System (ANFIS) (see, e.g., [24,25]);
  • Genetic Algorithm (GA) (see, e.g., [26,27,28]);
  • Particle Swarm Optimization (PSO) (see, e.g., [29]);
  • Pattern-search (PS) method (see, e.g., [30]).
The parameter tuning (optimization) of the Sugeno-type FIS was performed using the tunefis method of the MATLAB (R2024b) software package. The modeling goodness of the SFAFIS method and that of the Sugeno-type FIS tuned by the above-listed methods were expressed in terms of MAPE and RMSE metrics. These goodness results are shown in Table 1, Table 2, Table 3 and Table 4. The MATLAB source codes that were used to generate the results in these tables are available at https://jonast.web.elte.hu/. The ‘SFAFIS_example.m’ file contains the example of our case study, and the ‘FIS_similarity_and_fuzzy_arithmetic_class.m’ file is the class file in which the method itself is implemented. Using these two files, the results of our case study can be reproduced. We should note that in the MATLAB implementation of SFAFIS, instead of the similarity measure s λ ( x , y ) given in Equation (26), the following modified variant was employed:
s λ ( x , y ) = 1 d ( x , y ) 1 + d ( x , y ) λ ,
i.e., s λ ( x , y ) = s 2 λ ( x , y ) .
Using the goodness results shown in Table 1, Table 2, Table 3 and Table 4, Mann–Whitney median tests were conducted to compare the modeling performance of the SFAFIS method with that of the Sugeno-type FIS tuned using the ANFIS, GA, PSO and PS methods. From the three SFAFIS metric values (for λ = 30 , 60 and 90 parameter values) corresponding to each iteration in Table 1, Table 2, Table 3 and Table 4, the best one always was taken into account in the hypothesis tests. The results of the Mann–Whitney median tests are shown in Table 5. In this table, the notation m M stands for the median of the corresponding metric for method M.
Based on the results of the Mann–Whitney median tests shown in Table 5, with respect to the studied case, the following conclusions can be drawn:
  • On the training data, the Sugeno-type fuzzy inference systems tuned by the ANFIS, GA, PSO and PS methods perform significantly better than the proposed SFAFIS method both in terms of MAPE and RMSE (see the Mann–Whitney test results for MAPE tr and RMSE tr in Table 5).
  • On the test data, the proposed SFAFIS method performs significantly better than the Sugeno-type fuzzy inference systems tuned by the ANFIS, GA, PSO and PS methods both in terms of MAPE and RMSE (see the Mann–Whitney test results for MAPE te and RMSE te in Table 5).
  • Taking into account the training and test data together, the proposed SFAFIS method performs significantly better than the Sugeno-type fuzzy inference systems tuned by the ANFIS, GA, PSO and PS methods both in terms of MAPE and RMSE (see the Mann–Whitney test results for MAPE te , tr and RMSE te , tr in Table 5).
Noting the modeling results shown in Table 1, Table 2, Table 3 and Table 4, in most of the cases, the MAPE te metric for the SFAFIS method is less than 5. This means that using the above-considered characteristics of a new railway connection, the SFAFIS method can predict the average number of passengers on this connection with an absolute relative error of less than 5%. This accuracy exceeds the practical expectations.
Table 6 shows the average training and inference times of the investigated methods examined on the same computer under the same conditions. In this table, the average inference time represents the mean value of the time needed to predict both the training and test output data for the iterations shown in Table 1, Table 2, Table 3 and Table 4. In every iteration, from the three SFAFIS methods (for λ = 30 , 60 and 90 parameter values), the one with the best RMSE tr , te value was taken into account in computing the average inference time of the SFAFIS method.
Based on the average training and inference times shown in Table 6, we see that, for the data utilized in our case study, the average training time for the SFAFIS method was similar to that of the ANFIS method, and the average training times of these two methods (2.288 and 3.695 s) were considerably shorter than the average training times of the other three methods. At the same time, the average inference time for the SFAFIS method turned out to be longer than that of the other investigated methods. Here, we should emphasize that since we aim to predict the number of passengers on railway connections and do not need to do this real-time, neither the training time nor the inference time of the method used is critical.

Possibilities for Applying the SFAFIS Method to Other Railway Systems

As mentioned before, in our case study, fifteen characteristics of railway connections were taken into consideration, and so the input vector x [ 0 , 1 ] 15 contained the normalized values of these characteristics. Here, we should note that these characteristics were identified based on previous research articles (see, e.g., [14,36,37,38,39,40,41,42,43]) and preliminary studies of Hungarian State Railways (MÁV). This means that we identified those characteristics of the examined railway connections that mostly affected the number of passengers on these connections. Hence, these characteristics are specific to the Hungarian railway system in the sense that these inputs are the most explanatory in the given system. At the same time, it is worth noting that the SFAFIS method presented in Section 4 can be applied in the same way to any another railway system, even if the input characteristics in that system are different. That is, once the input features of railway routes, and hence the explanatory variables, in a railway system are identified, and the historical number of passengers associated with each input vector is given, the SFAFIS method can be applied in the same way as described. The number of passengers can be determined from ticket sales data.

6. Conclusions and Plans for Future Research

In this study, we developed a new fuzzy inference method that utilizes a novel similarity measure and fuzzy arithmetic operations to model the relation between inputs and outputs. Using known input–output pairs, for a given input vector, we determine its similarity to all the known input vectors, and from these similarities, we derive weights. Using these weights, the output of our system is computed as the defuzzified value of the weighted average of triangular fuzzy numbers that represent the linguistic values of the outputs. Based on the results of a real-life case study, which we presented in Section 5, we may conclude that although the proposed method has only two adjustable parameters, its forecasting capability is comparable with that of Sugeno-type fuzzy inference systems, which have a much larger number of adjustable parameters.
In Section 3, we introduced the epsilon-function-based normalized fuzzy similarity measure and demonstrated its main properties. Later, we showed how this similarity measure can be used in the presented SFAFIS method. However, we did not study how the SFAFIS method would perform using other similarity measures. Since comparing the proposed similarity measure with other similarity measures is an interesting topic, we plan to study this in our future research.
We proposed the use of a simple grid search method to find the quasi-optimal values of the λ and parameters of an SFAFIS system. However, as part of our future research, we plan to investigate how more effective optimization methods can be utilized for this purpose. In the proposed inference system, we perform arithmetic operations over triangular fuzzy numbers. We would also like to examine how our method can be adapted to other types of fuzzy numbers.
In the presented case study, we utilized passenger traffic data from the Hungarian State Railways (MÁV). This naturally raises the question of how the SFAFIS method can be applied to other transportation systems, such as high-speed rail networks, urban metro systems or air routes. We would therefore like to further explore this question in our future research.
In the wider field of transportation planning, in our future research, we would also like to investigate how the choice of transport mode can be modeled using fuzzy methods. Moreover, we are also motivated to answer the question of how and by what evaluation methods limited transport capacities can be allocated on a socioeconomic basis.

Author Contributions

Conceptualization, M.F. and T.J.; methodology, M.F. and T.J.; software, M.F. and T.J.; validation, M.F. and T.J.; formal analysis, M.F. and T.J.; data curation, M.F. and T.J.; writing—original draft preparation, M.F. and T.J.; writing—review and editing, M.F. and T.J.; visualization, M.F. and T.J.; supervision, T.J. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. The Data Set Used in the Case Study

City 1City 2 p 1 p 2 l t r t b t c f IC f b f r c r c b c c I co d r d b y
AB55,91054,428869918010089242520252043560.46024.51.3824
AC55,91078,19098103230100817102700279041400.44964.61.61759
DE55,16466,061876818590161961860261039960.42412.921772
FG199,858353067521168081021980168030600.38882.51.52459
HI8389116,28245391106091811230139519440.40552.91.72040
JK18,03923,742647311765810131490149030960.51171.71.8885
LM23,116158,7979176153801719101860218041040.40753.51.12758
NO7190147,533483713060923181770180027720.39872.51.11390
PO15,504147,5333628655092332149093015480.39873.91.23149
QR60,33465,8301379525011078123630390557960.44221.61.31521
ST1,658,34224,37114516520014311183249341075960.40442.70.52634
SU1,658,34229,88913012211590110253480283056160.4458113325
FV199,858127,59929926968021819036240919013,6080.43141.921208
FM199,858158,79718119230819001884398395081000.38942.71.81725
FW199,85849,182102160150850892200252047520.4112.71.7839
VA127,59955,910681171058008101490149029160.454.11.81970
VE127,59966,06120018540314020024380582082440.43262.31.51194
XE108,12066,06147498065171481120130021240.44062.41.43327
OW147,53349,18238759565811201490130029880.39953.71.41466
OV147,533127,59923924266519017014698769011,8800.41992.91.7838
OT147,53324,371212735300138755055010080.37834.81.21904
OY147,53331,02239659060013231120112021600.35033.31.61042
IE116,28266,061145109290150181842399472065520.41823.21.81731
ZV139,330127,5991862262701908255160420087840.432821.81160
ZC139,33078,1901792613002100954620420086400.43252.51.61308
A1C95,04578,1901361452251208863330388058320.46272.81.2959
A1A95,04555,9104042554016213384084016920.45064.81.23092
CB78,19054,42847557555021181330112020160.47232.51.32013
AABC55,91070,56794102214100814142642270342090.4534.61.51459
ADCE55,55172,345938620895121882295270340710.43733.81.81765
DFEG102,00745,817816316387131651899230936930.41272.81.81994
FHGI140,29638,60560481147481221747159127130.3942.61.62329
HJIK11,95382,10652521136291551326143023690.44472.51.71613
JLKM20,63692,8257875135731315111679184336120.45842.61.41843
LNMO17,430154,7757662145731420131828204436280.40443.11.12269
NPOO9038147,533453511658923211708160725000.39872.81.11781
PQOR48,25787,8401107720094812173054310446520.43052.21.31960
QSRT122013935,74014314621413431093353354671020.41482.40.72329
SSTU1,658,34228,825133130131100110223435294259980.43781.30.93192
SFUV1,223,44859,02418016628312867184303472679990.44151.31.32694
FFVM199,858156,81418819733219211784515428384500.3922.61.81692
FFMW199,858143,99517018828717601784101375776480.39232.71.81605
FVWA160,74552,824841371268208101816196237580.43213.51.81451
VVAE127,59961,92914615728211612353204405860750.439731.61510
VXEE109,26366,06156579969171381311156524830.44012.41.43202
XOEW124,50659,044436086651313131274130024830.42352.91.42553
OOWV147,533122,28722523162618216124481725711,2780.418531.7881
OOVT147,533115,007212216588170152114192681910,5540.41483.11.6968
OOTY147,53330,24037618456013311053105320240.35363.51.61143
OIYE143,06336,034547111973314201303163527880.363.31.61141
IZEV117,75269,987148116289153171742575468766940.41923.11.81695
ZZVC139,33084,5831802562962071854690420086590.43252.41.61289
ZA1CC101,63578,1901421622361337863522392862500.45822.81.31011
A1A1CA95,04576,4431281372121149983135364255070.461731.21126
A1CAB80,52854,63446537253221201262108119710.46932.81.32163
AAADBCCE55,73171,4539394211981016112469270341400.44524.21.61611
ADDFCEEG91,60151,759836817389131751988239737780.418231.81943
DFFHEGGI121,91442,068705513780111431820193631830.4032.71.72168
FHHJGIIK49,98469,21554501136591441451147824710.42972.51.71825
HJJLIKKM17,76389,278696712869111591562170632010.45392.61.51767
JLLNKMMO17,959144,5447664143731419131803201136260.41333.11.22199
LNNPMOOO14,177151,9686452133671221161781187531910.402231.12080
NPPQOOOR45,31492,3191057419491813182953299144900.42812.31.31946
PQQSORRT499,57067,775123103205109611143169327455950.42442.31.12102
QSSSRTTU1,240,16035,424142145210132310103357351870520.41582.30.72368
SSSFTUUV1,402,05946,62216115122111748203947399371770.441.31.12898
SFFFUVVM301,763147,07818819432718521694494432784050.3972.51.81792
FFFFVMMW199,858150,02017919230818311784296400480250.39222.71.81646
FFFVMWWA165,58464,10594143146940992098218542390.42723.41.81470
FVVVWAAE142,11157,9431191482131017572596314050610.43643.21.71484
VVVXAEEE111,29165,604666811975171271521184128800.442.51.43015
VXXOEEEW123,06359,708446087651413121277132524830.42512.91.42615
XOOOEWWV138,37897,14215316341213515673206488977810.42052.91.61546
OOOOWVVT147,533118,69321922360817616174338704110,9200.416731.7924
OOOOVTTY147,53358,030941112499459242082294348210.37373.41.61086
OOOITYYE143,36035,648537111672214211286159627370.35963.31.61141
OIIZYEEV132,99549,5399189186105815141809284943420.38363.21.71361
IZZZEVVC138,30683,8911782502962052954590422385650.43192.51.61308
ZZZA1VCCC131,02883,1751722362831912854433414081280.43822.51.51228
ZA1A1A1CCCA100,43277,8711401582321307863451387561140.45882.81.21032
A1A1A1CCAAB89,20067,6619510315689614132381261140830.46482.91.21544

References

  1. Dou, F.; Wang, L.; Jia, L. A train dispatching model based on fuzzy passenger demand forecasting during holidays. J. Ind. Eng. Manag. 2013, 6, 320–335. [Google Scholar] [CrossRef]
  2. Fowkes, A.S.; Nash, C.A.; Whiteing, A.E. Understanding trends in inter-city rail traffic in Great Britain. Transp. Plan. Technol. 1985, 10, 65–80. [Google Scholar] [CrossRef]
  3. Butkevičius, J.; Mazūra, M.; Ivankovas, V.; Mazūra, S. Analysis and forecast of the dynamics of passenger transportation by public land transport. Transport 2004, 19, 3–8. [Google Scholar] [CrossRef]
  4. Xiao, Y.; Liu, J.J.; Hu, Y.; Wang, Y.; Lai, K.K.; Wang, S. A neuro-fuzzy combination model based on singular spectrum analysis for air transport demand forecasting. J. Air Transp. Manag. 2014, 39, 1–11. [Google Scholar]
  5. Srisaeng, P.; Baxter, G.; Wild, G. An adaptive neuro-fuzzy inference system for forecasting Australia’s domestic low cost carrier passenger demand. Aviation 2015, 19, 150–163. [Google Scholar] [CrossRef]
  6. Pamucar, D.; Deveci, M.; Canıtez, F.; Božanić, D. A fuzzy Full Consistency Method-Dombi-Bonferroni model for prioritizing transportation demand management measures. Appl. Soft Comput. 2020, 87, 105952. [Google Scholar] [CrossRef]
  7. Dorsey, B. Mass transit trends and the role of unlimited access in transportation demand management. J. Transp. Geogr. 2005, 13, 235–246. [Google Scholar] [CrossRef]
  8. Sharif Azadeh, S.; Marcotte, P.; Savard, G. Railway demand forecasting in revenue management using neural networks. Int. J. Revenue Manag. 2013, 7, 18–36. [Google Scholar] [CrossRef]
  9. Milosavljević, N.; Milenkovic, M.; Bojovic, N.; Svadlenka, L.; Avramovic, Z. A hybrid model for forecasting the volume of passenger flows on Serbian railways. Oper. Res. 2015, 16, 271–285. [Google Scholar] [CrossRef]
  10. Zhang, J.; Guo, W. Research on Railway Passenger Flow Prediction Method Based on GA Improved BP Neural Network. J. Comput. Commun. 2019, 7, 283–292. [Google Scholar] [CrossRef]
  11. Ghalehkhondabi, I.; Ardjmand, E.; Young, W.A.; Weckman, G.R. A review of demand forecasting models and methodological developments within tourism and passenger transportation industry. J. Tour. Futur. 2019, 5, 75–93. [Google Scholar] [CrossRef]
  12. Noersasongko, E.; Julfia, F.; Syukur, A.; Purwanto, P.; Pramunendar, R.; Supriyanto, C. A Tourism Arrival Forecasting using Genetic Algorithm based Neural Network. Indian J. Sci. Technol. 2016, 9, 1–5. [Google Scholar] [CrossRef]
  13. Huarng, K.H.; Yu, T.; Moutinho, L.; Wang, Y.C. Forecasting tourism demand by fuzzy time series models. Int. J. Cult. 2012, 6, 377–388. [Google Scholar] [CrossRef]
  14. Profillidis, V.; Botzoris, G. A Comparative Analysis of Performances of Econometric, Fuzzy and Time-series Models for the Forecast of Transport Demand. In Proceedings of the 2007 IEEE International Fuzzy Systems Conference, London, UK, 23–26 July 2007; pp. 1–6. [Google Scholar] [CrossRef]
  15. Rong, W.; Liu, D.; He, X. Prediction of High Speed Railway Passenger Demand Volume Based on Grey Relational Analysis. In Proceedings of the ICTE 2015, Hong Kong, China, 2–4 July 2015; pp. 173–179. [Google Scholar] [CrossRef]
  16. Tsekeris, T.; Tsekeris, C. Demand Forecasting in Transport: Overview and Modeling Advances. Econ. Res.-Ekon. Istraživanja 2011, 24, 82–94. [Google Scholar] [CrossRef]
  17. Profillidis, V.A.; Botzoris, G.N. Econometric models for the forecast of passenger demand in Greece. J. Stat. Manag. Syst. 2006, 9, 37–54. [Google Scholar] [CrossRef]
  18. Nijkamp, P.; Reggiani, A.; Tritapepe, T. Modelling Inter-Urban Transport Flows in Italy: A Comparison between Neural Network Analysis and Logit Analysis. Transp. Res. Part C 1996, 4, 323–338. [Google Scholar]
  19. Yang, L.; Li, K.; Gao, Z. Train Timetable Problem on a Single-Line Railway With Fuzzy Passenger Demand. IEEE Trans. Fuzzy Syst. 2009, 17, 617–629. [Google Scholar] [CrossRef]
  20. Ghoseiri, K.; Szidarovszky, F.; Asgharpour, M.J. A multi-objective train scheduling model and solution. Transp. Res. Part B Methodol. 2004, 38, 927–952. [Google Scholar] [CrossRef]
  21. Banerjee, N.; Morton, A.; Akartunalı, K. Passenger demand forecasting in scheduled transportation. Eur. J. Oper. Res. 2020, 286, 797–810. [Google Scholar]
  22. Dombi, J.; Jónás, T.; Tóth, Z.E. The epsilon probability distribution and its application in reliability theory. Acta Polytech. Hung. 2018, 15, 197–216. [Google Scholar] [CrossRef]
  23. Jónás, T. The Generalized Epsilon Function: An Alternative to the Exponential Function. Acta Cybern. 2022, 25, 703–716. [Google Scholar] [CrossRef]
  24. Jang, J.S. ANFIS: Adaptive-network-based fuzzy inference system. IEEE Trans. Syst. Man Cybern. 1993, 23, 665–685. [Google Scholar] [CrossRef]
  25. Abraham, A. Adaptation of Fuzzy Inference System Using Neural Learning. In Fuzzy Systems Engineering: Theory and Practice; Nedjah, N., Macedo Mourelle, L.d., Eds.; Springer: Berlin/Heidelberg, Germany, 2005; pp. 53–83. [Google Scholar] [CrossRef]
  26. Banzhaf, W.; Nordin, P.; Keller, R.E.; Francone, F.D. Genetic Programming: An Introduction: On the Automatic Evolution of Computer Programs and Its Applications; Morgan Kaufmann Publishers Inc.: San Francisco, CA, USA, 1998. [Google Scholar]
  27. Vose, M.D. The Simple Genetic Algorithm: Foundations and Theory; MIT Press: Cambridge, CA, USA, 1999. [Google Scholar]
  28. Alhijawi, B.; Awajan, A. Genetic algorithms: Theory, genetic operators, solutions, and applications. Evol. Intell. 2024, 17, 1245–1256. [Google Scholar] [CrossRef]
  29. Bonyadi, M.R.; Michalewicz, Z. Particle Swarm Optimization for Single Objective Continuous Space Problems: A Review. Evol. Comput. 2017, 25, 1–54. [Google Scholar] [CrossRef]
  30. Dolan, E.D.; Lewis, R.M.; Torczon, V. On the Local Convergence of Pattern Search. SIAM J. Optim. 2003, 14, 567–583. [Google Scholar] [CrossRef]
  31. The MathWorks Inc. Fuzzy Logic Toolbox Version: 24.2 (R2024a). Available online: https://www.mathworks.com (accessed on 1 November 2024).
  32. Zadeh, L. Fuzzy sets. Inf. Control 1965, 8, 338–353. [Google Scholar] [CrossRef]
  33. Dubois, D.; Prade, H. Fuzzy Sets and Systems: Theory and Applications; Academic Press: Cambridge, MA, USA, 1980. [Google Scholar]
  34. Cheng, C.B. Group opinion aggregationbased on a grading process: A method for constructing triangular fuzzy numbers. Comput. Math. Appl. 2004, 48, 1619–1632. [Google Scholar] [CrossRef]
  35. Bede, B. Fuzzy Numbers. In Mathematics of Fuzzy Sets and Fuzzy Logic; Springer: Berlin/Heidelberg, Germany, 2013; pp. 51–64. [Google Scholar] [CrossRef]
  36. Nag, A.; Sarkar, S. Integrating choice freedom, economic health, and transportation infrastructure to forecast tourism demand: A case study of Bishnupur and its alignment with sustainable development goals. Transp. Policy 2024, 147, 198–214. [Google Scholar] [CrossRef]
  37. Kuby, M.; Barranda, A.; Upchurch, C. Factors influencing light-rail station boardings in the United States. Transp. Res. Part A Policy Pract. 2004, 38, 223–247. [Google Scholar] [CrossRef]
  38. Chiou, Y.C.; Jou, R.C.; Yang, C.H. Factors affecting public transportation usage rate: Geographically weighted regression. Transp. Res. Part A Policy Pract. 2015, 78, 161–177. [Google Scholar] [CrossRef]
  39. Iseki, H.; Liu, C.; Knaap, G. The determinants of travel demand between rail stations: A direct transit demand model using multilevel analysis for the Washington D.C. Metrorail system. Transp. Res. Part A Policy Pract. 2018, 116, 635–649. [Google Scholar] [CrossRef]
  40. Liu, X.; Wu, J.; Huang, J.; Zhang, J.; Chen, B.Y.; Chen, A. Spatial-interaction network analysis of built environmental influence on daily public transport demand. J. Transp. Geogr. 2021, 92, 102991. [Google Scholar] [CrossRef]
  41. Ma, J.; Liu, H.; Chen, L. Demand-Responsive Transportation Vehicle Routing Optimization Based on Two-Stage Method. Comput. Mater. Contin. 2024, 81, 443–469. [Google Scholar] [CrossRef]
  42. Toro-González, D.; Cantillo, V.; Cantillo-García, V. Factors influencing demand for public transport in Colombia. Res. Transp. Bus. Manag. 2020, 36, 100514. [Google Scholar] [CrossRef]
  43. Konečný, V.; Brídziková, M.; Marienka, P. Research of bus transport demand and its factors using multicriteria regression analysis. Transp. Res. Procedia 2021, 55, 180–187. [Google Scholar] [CrossRef]
Figure 1. Example plots of e λ x and its approximation by g λ ( x ) = ( 1 x ) λ and the epsilon function h λ ( x ) = 1 x 1 + x λ 2 on [ 0 , 1 ] .
Figure 1. Example plots of e λ x and its approximation by g λ ( x ) = ( 1 x ) λ and the epsilon function h λ ( x ) = 1 x 1 + x λ 2 on [ 0 , 1 ] .
Mathematics 13 01221 g001
Figure 2. Example plots of the triangular fuzzy numbers (linguistic values) L 1 , L 2 , , L .
Figure 2. Example plots of the triangular fuzzy numbers (linguistic values) L 1 , L 2 , , L .
Mathematics 13 01221 g002
Figure 3. Schematic diagram of the similarity and fuzzy arithmetic-based fuzzy inference system (SFAFIS).
Figure 3. Schematic diagram of the similarity and fuzzy arithmetic-based fuzzy inference system (SFAFIS).
Mathematics 13 01221 g003
Table 1. Modeling results of SFAFIS and the Sugeno-type FIS tuned using ANFIS.
Table 1. Modeling results of SFAFIS and the Sugeno-type FIS tuned using ANFIS.
IterationMethod MAPE tr MAPE te MAPE tr , te RMSE tr RMSE te RMSE tr , te opt λ
1SFAFIS0.22024.41830.997615.0363135.8958.65412330
0.153774.72551.00044.213139.6960.23312860
0.153244.83941.0214.2055141.6861.08712890
ANFIS0.001463712.0942.24070.04118302.94130.37--
2SFAFIS0.224535.82651.26195.0409126.0654.43811930
0.187515.1861.11324.4291130.8656.45411960
0.186024.96951.07194.4058133.7257.68211990
ANFIS0.00152019.71511.80030.08549197.5785.02--
3SFAFIS0.329083.64510.943169.239172.86432.44610130
0.13942.81350.63463.824365.98628.60511660
0.137032.51590.577553.80663.05927.35311690
ANFIS0.00376846.36761.18230.097153144.3862.132--
4SFAFIS0.229892.07130.570885.474269.62330.36612930
0.164681.45030.402763.83352.77922.97412960
0.162291.32630.377853.825745.73819.98312990
ANFIS0.00192794.57120.848080.11433120.4251.821--
5SFAFIS0.327830.726390.401649.570215.55810.92912530
0.16170.722370.265534.124819.6219.228212560
0.161720.768680.274124.137120.6939.656212590
ANFIS0.00166932.64550.491270.04509868.99129.689--
6SFAFIS0.251645.50061.22375.878798.65142.78311830
0.15564.27950.919283.903279.83234.53511860
0.152553.83940.83533.855971.45130.94411890
ANFIS0.00133555.96631.1060.03225690.3438.876--
7SFAFIS0.290383.08330.807598.4814116.9350.89711230
0.170552.99780.694114.4023114.1149.26713260
0.164413.15990.719134.2877115.7349.95113290
ANFIS0.00496984.45720.829450.5193896.28241.436--
8SFAFIS0.30942.97240.802548.984106.3346.4711730
0.15671.42120.390873.603447.16220.55412960
0.157410.989540.311513.613941.7218.24812990
ANFIS0.00211858.90881.65150.066603236.31101.69--
9SFAFIS0.304473.78470.948958.939665.48929.31511330
0.168891.43840.403984.076741.318.1511760
0.158151.46360.399893.959943.41119.0211790
ANFIS0.00135978.30171.53850.054377146.2162.918--
10SFAFIS0.198314.99071.08585.0846129.5955.95512530
0.162014.59090.982184.291138.0159.51812560
0.160434.67780.996974.3038139.3660.09512590
ANFIS0.00147986.92321.28330.047309140.5660.487--
Table 2. Modeling results of SFAFIS and the Sugeno-type FIS tuned using genetic algorithm (GA).
Table 2. Modeling results of SFAFIS and the Sugeno-type FIS tuned using genetic algorithm (GA).
IterationMethod MAPE tr MAPE te MAPE tr , te RMSE tr RMSE te RMSE tr , te opt λ
1SFAFIS0.22024.41830.997615.0363135.8958.65412330
0.153774.72551.00044.213139.6960.23312860
0.153244.83941.0214.2055141.6861.08712890
GA5.939 × 10 13 54.01110.0021.2304 × 10 11 1612.8694.02--
2SFAFIS0.312854.90461.16328.9279124.9754.37911830
0.169853.58690.802634.569382.41635.70512660
0.158623.59160.794364.457487.14537.71712690
GA8.2149 × 10 13 30.9795.73681.7463 × 10 11 584.05251.34--
3SFAFIS0.312482.16820.656138.922660.61527.311930
0.177351.73830.466424.427832.42114.51311960
0.179311.80020.479464.492334.05815.20711990
GA3.9052 × 10 13 37.4246.93047.9338 × 10 12 933.91401.89--
4SFAFIS0.2663.16350.802588.455986.52938.0111930
0.140443.28050.721933.9928108.9747.03212860
0.134743.59540.775613.9833119.3751.49512890
GA1.0777 × 10 12 107.7419.9532.7237 × 10 11 2169.8933.72--
5SFAFIS0.213373.7960.876815.080117776.30511830
0.150854.11580.88514.0728178.6176.94911860
0.148644.25970.909964.0587188.5881.23611890
GA2.3839 × 10 13 23.9424.43374.6876 × 10 12 507.75218.5--
6SFAFIS0.207164.88871.07415.0419120.9952.26412930
0.153654.93711.03954.305131.8856.88412860
0.151824.91141.03324.2387135.3758.37812890
GA4.1628 × 10 13 33.7136.24317.9228 × 10 12 748.46322.09--
7SFAFIS0.280092.94980.774488.27755.69425.10411730
0.155271.86330.471574.017637.82416.67611060
0.152371.72950.444443.975132.95914.6313290
GA1.0958 × 10 13 48.428.96672.4363 × 10 12 1099.8473.27--
8SFAFIS0.208076.70261.41084.7065427.77184.1312330
0.160017.77271.56983.8988473.47203.7812360
0.158157.881.58813.8891477.16205.3712390
GA1.3733 × 10 13 20.043.71123.2496 × 10 12 797.99343.4--
9SFAFIS0.296916.45881.4388.9759129.4256.28210930
0.16283.78290.833194.235673.22531.74213260
0.155573.15360.710774.118965.90428.60313290
GA1.9898 × 10 13 16.1642.99325.7774 × 10 12 476.88205.22--
10SFAFIS0.287081.26860.468848.172239.50218.5311830
0.156610.993230.311543.676740.17717.60512960
0.161781.02920.322423.704341.45718.15112990
GA1.8214 × 10 13 14.0272.59764.2808 × 10 12 413.89178.11--
Table 3. Modeling results of SFAFIS and the Sugeno-type FIS tuned using particle swarm optimization (PSO).
Table 3. Modeling results of SFAFIS and the Sugeno-type FIS tuned using particle swarm optimization (PSO).
IterationMethod MAPE tr MAPE te MAPE tr , te RMSE tr RMSE te RMSE tr , te opt λ
1SFAFIS0.22024.41830.997615.0363135.8958.65412330
0.153774.72551.00044.213139.6960.23312860
0.153244.83941.0214.2055141.6861.08712890
PSO5.939  × 10 13 54.01110.0021.2304  × 10 11 1612.8694.02--
2SFAFIS0.306612.68710.747449.3238104.9545.94311530
0.162991.44580.400544.365649.90121.83213260
0.15741.1730.345484.26846.29120.28913290
PSO4.1799  × 10 13 32.7446.06371.0934  × 10 11 967.97416.55--
3SFAFIS0.187474.59461.00364.7889110.2647.64512130
0.180074.5110.98214.3273122.3452.79210560
0.179214.29450.941314.3083129.2355.7510590
PSO1.0973  × 10 13 8.61571.59552.0855  × 10 12 183.5778.995--
4SFAFIS0.218299.79791.99234.9939440.71189.7111030
0.1612610.5542.08584.1652488.16210.113260
0.1581910.752.11964.1102492.26211.8713290
PSO6.0409  × 10 13 38.0557.04731.2908  × 10 11 842.16362.41--
5SFAFIS0.1963911.8092.34684.5202453.9195.3711630
0.145611.9982.34043.5729495.23213.1411660
0.143412.0382.34613.5365498.18214.4111690
PSO2.7137  × 10 13 48.3398.95175.7643  × 10 12 1072.2461.42--
6SFAFIS0.330543.56670.929829.809170.69331.68410830
0.165863.24940.736884.193974.44932.26112660
0.154363.69570.810164.07688436.33412690
PSO1.7145  × 10 13 18.2893.38693.4361  × 10 12 293.82126.44--
7SFAFIS0.288999.95872.07979.5004431.5185.8912630
0.13979.5931.89033.8897475.58204.6912660
0.133349.47871.8643.8658479.45206.3512690
PSO1.5571  × 10 13 23.1234.2823.8223  × 10 12 486.1209.18--
8SFAFIS0.195828.98881.82414.5347441.01189.8210930
0.159878.51281.70674.129484.7208.6213260
0.157868.58721.71884.1076489.37210.6213290
PSO3.6489  × 10 13 31.3225.80036.6528  × 10 12 1599.4688.29--
9SFAFIS0.251636.89971.48277.7123388.25167.2212430
0.141935.88371.20523.3971461.81198.7512460
0.142546.02151.23123.4167475.42204.6112490
PSO4.8984  × 10 13 58.24810.7878.6798  × 10 12 1805.7777.05--
10SFAFIS0.193797.96281.63254.5137394.07169.6313230
0.147398.19861.63844.0815470.95202.711660
0.145568.51111.69474.0623486.14209.2311690
PSO3.1618  × 10 13 31.8425.89678.6715  × 10 12 1804776.3--
Table 4. Modeling results of SFAFIS and the Sugeno-type FIS tuned using pattern-search (PS) optimization.
Table 4. Modeling results of SFAFIS and the Sugeno-type FIS tuned using pattern-search (PS) optimization.
IterationMethod MAPE tr MAPE te MAPE tr , te RMSE tr RMSE te RMSE tr , te opt λ
1SFAFIS0.22024.41830.997615.0363135.8958.65412330
0.153774.72551.00044.213139.6960.23312860
0.153244.83941.0214.2055141.6861.08712890
PS5.939  × 10 13 54.01110.0021.2304  × 10 11 1612.8694.02--
2SFAFIS0.2360511.8462.38615.8128159.0968.66310830
0.156366.83861.39383.8409121.4652.38411960
0.186024.96951.07194.4058133.7257.68211990
PS1.5614  × 10 13 26.5824.92273.8216  × 10 12 482.6207.68--
3SFAFIS0.283648.5121.80748.396434.57187.1611230
0.14767.85131.57423.7915474.26204.1212660
0.137032.51590.577553.80663.05927.35311690
PS1.5834  × 10 13 41.4387.67383.239  × 10 12 782.9336.91--
4SFAFIS0.283018.10451.73148.5418406.78175.2210930
0.151656.46161.32023.6838464.64199.9713260
0.162291.32630.377853.825745.73819.98312990
PS3.797  × 10 13 73.30613.5757.2282  × 10 12 1413.5608.27--
5SFAFIS0.293532.23790.653618.698.40643.05311930
0.152021.42590.387934.65861.98827.00512460
0.161720.768680.274124.137120.6939.656212590
PS2.494  × 10 13 52.2749.68045.0776  × 10 12 1196.8515.02--
6SFAFIS0.209464.54731.01284.9107127.2954.95413230
0.169734.89161.04414.3344140.2660.48413260
0.152553.83940.83533.855971.45130.94411890
PS3.2409  × 10 13 24.4754.53245.7138  × 10 12 926.2398.57--
7SFAFIS0.279932.91450.767818.015293.39540.83712930
0.139063.20110.70613.6172111.4148.05412860
0.164413.15990.719134.2877115.7349.95113290
PS3.7447  × 10 13 21.9394.06287.1633  × 10 12 542.97233.66--
8SFAFIS0.301544.60141.09788.8431420.07180.9512430
0.170525.04511.07323.8546459.5197.7712360
0.157410.989540.311513.613941.7218.24812990
PS1.8569  × 10 13 11.6272.15313.8067  × 10 12 265.96114.45--
9SFAFIS0.286644.89991.14098.684117.9651.36311730
0.131642.25610.525073.299354.86923.79912860
0.158151.46360.399893.959943.41119.0211790
PS2.6935  × 10 13 31.1855.77496.8908  × 10 12 1237.4532.5--
10SFAFIS0.216335.54821.20375.1291132.8657.3613230
0.167464.31790.936064.2094133.9257.75413260
0.160434.67780.996974.3038139.3660.09512590
PS5.4371  × 10 13 19.4653.60461.1614  × 10 11 402.97173.41--
Table 5. Results of Mann–Whitney median tests for the modeling results of SFAFIS and the Sugeno-type FIS tuned using the ANFIS, GA, PSO and PS methods and the samples in Table 1, Table 2, Table 3 and Table 4.
Table 5. Results of Mann–Whitney median tests for the modeling results of SFAFIS and the Sugeno-type FIS tuned using the ANFIS, GA, PSO and PS methods and the samples in Table 1, Table 2, Table 3 and Table 4.
MetricNull and Alternative Hypotheses and p-Values
MAPE tr H 0 : m SFAFIS = m ANFIS m SFAFIS = m GA m SFAFIS = m PSO m SFAFIS = m PS
H 1 : m SFAFIS > m ANFIS m SFAFIS > m GA m SFAFIS > m PSO m SFAFIS > m PS
p-value:0.0000.0000.0000.000
MAPE te H 0 : m SFAFIS = m ANFIS m SFAFIS = m GA m SFAFIS = m PSO m SFAFIS = m PS
H 1 : m SFAFIS < m ANFIS m SFAFIS < m GA m SFAFIS < m PSO m SFAFIS < m PS
p-value:0.0010.0000.0000.000
MAPE tr , te H 0 : m SFAFIS = m ANFIS m SFAFIS = m GA m SFAFIS = m PSO m SFAFIS = m PS
H 1 : m SFAFIS < m ANFIS m SFAFIS < m GA m SFAFIS < m PSO m SFAFIS < m PS
p-value:0.0030.0000.0000.000
RMSE tr H 0 : m SFAFIS = m ANFIS m SFAFIS = m GA m SFAFIS = m PSO m SFAFIS = m PS
H 1 : m SFAFIS > m ANFIS m SFAFIS > m GA m SFAFIS > m PSO m SFAFIS > m PS
p-value:0.0000.0000.0000.000
RMSE te H 0 : m SFAFIS = m ANFIS m SFAFIS = m GA m SFAFIS = m PSO m SFAFIS = m PS
H 1 : m SFAFIS < m ANFIS m SFAFIS < m GA m SFAFIS < m PSO m SFAFIS < m PS
p-value:0.0060.0000.0020.000
RMSE tr , te H 0 : m SFAFIS = m ANFIS m SFAFIS = m GA m SFAFIS = m PSO m SFAFIS = m PS
H 1 : m SFAFIS < m ANFIS m SFAFIS < m GA m SFAFIS < m PSO m SFAFIS < m PS
p-value:0.0060.0000.0030.000
Table 6. Training and inference times vs. method.
Table 6. Training and inference times vs. method.
MethodAverage Training Time (s)Average Inference Time (s)
SFAFIS2.2880.018157
Sugeno-type FIS tuned by ANFIS3.6950.003318
Sugeno-type FIS tuned by GA545.9850.005004
Sugeno-type FIS tuned by PSO310.3190.003858
Sugeno-type FIS tuned by PS1185.9360.005838
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Fetter, M.; Jónás, T. Forecasting the Number of Passengers on Hungarian Railway Routes Using a Similarity and Fuzzy Arithmetic-Based Inference Method. Mathematics 2025, 13, 1221. https://doi.org/10.3390/math13081221

AMA Style

Fetter M, Jónás T. Forecasting the Number of Passengers on Hungarian Railway Routes Using a Similarity and Fuzzy Arithmetic-Based Inference Method. Mathematics. 2025; 13(8):1221. https://doi.org/10.3390/math13081221

Chicago/Turabian Style

Fetter, Marcell, and Tamás Jónás. 2025. "Forecasting the Number of Passengers on Hungarian Railway Routes Using a Similarity and Fuzzy Arithmetic-Based Inference Method" Mathematics 13, no. 8: 1221. https://doi.org/10.3390/math13081221

APA Style

Fetter, M., & Jónás, T. (2025). Forecasting the Number of Passengers on Hungarian Railway Routes Using a Similarity and Fuzzy Arithmetic-Based Inference Method. Mathematics, 13(8), 1221. https://doi.org/10.3390/math13081221

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop