A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network

Miletic, Slobodan; Pokrajac, Ivan; Pena-Pena, Karelia; Arce, Gonzalo R.; Mladenovic, Vladimir

doi:10.3390/e24091294

Open AccessArticle

A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network

by

Slobodan Miletic

¹,

Ivan Pokrajac

¹,

Karelia Pena-Pena

²

,

Gonzalo R. Arce

²

and

Vladimir Mladenovic

^3,*

¹

Electronic Systems Department, Military Technical Institute, 11000 Belgrade, Serbia

²

Department of Electrical and Computer Engineering, University of Delaware, Newark, DE 19716, USA

³

Faculty of Technical Sciences Cacak, University of Kragujevac, 34000 Kragujevac, Serbia

^*

Author to whom correspondence should be addressed.

Entropy 2022, 24(9), 1294; https://doi.org/10.3390/e24091294

Submission received: 2 June 2022 / Revised: 29 August 2022 / Accepted: 5 September 2022 / Published: 14 September 2022

(This article belongs to the Special Issue Symbolic Entropy Analysis and Its Applications III)

Download

Browse Figures

Review Reports Versions Notes

Abstract

We presented a method based on multigraphs to mathematically define a distribution function in time for the generation of data exchange in a special-purpose communication network. This is needed for the modeling and design of communication networks (CNs) consisting of integrated telecommunications and computer networks (ITCN). Simulation models require a precise definition of network traffic communication. An additional problem for describing the network traffic in simulation models is the mathematical model of data distribution, according to which the generation and exchange of certain types and quantities of data are realized. The application of multigraphs enabled the time and quantity of the data distribution to be displayed as operational procedures for a special-purpose communication unit. A multigraph was formed for each data-exchange time and allowed its associated adjacency matrix to be defined. Using the matrix estimation method allowed the mathematical definition of the distribution function values. The application of the described method for the use of multigraphs enabled a more accurate mathematical description of real traffic in communication networks.

Keywords:

communication network; multigraphs; adjacency matrix; network simulation; network traffic; distribution function

1. Introduction

The design of communication networks as a spatially distributed integrated telecommunication and computer network (ITCN) has been improved by the application of computer simulations. Defining a simulation model of an ITCN is realized by using advanced simulation software with integrated tools. These tools allow an analysis of the network elements’ parameters. The application of this methodology of ITCN simulation model design requires a precise definition of network traffic in addition to a definition of the active and passive elements of the architecture and network topology [1]. Network traffic is the process of time events generating a certain type and amount of data at a source, and their distribution between sources and destinations connected in a communication network. There may be multiple data sources in a network that generate the same or different types and amounts of data at the same or different times, and destinations may simultaneously receive data from one or more sources. The problem in designing a simulation model is generating an accurate description of this network traffic. Depending on the purpose of the communication network, the network traffic process can be a deterministic or stochastic event.

In accordance with this definition, network traffic requires the application of appropriate models and distribution functions of communication data over time. An additional problem of describing network traffic in the simulation model is defining the mathematical model according to which the generation of and change in the amount of data are realized. The model requires an appropriate distribution function that, in the simulation model, temporally describes the generation and distribution of the amount of data between network elements. We used the sampling matrix associated with multigraphs [2] to derive the time distribution function of the communication events of network traffic. A method of applying multigraphs for defining the distribution functions of the generation time of data between network elements of the ITCN is presented in this study. The contributions of this study are as follows:

A new method of defining network traffic was proposed. The distribution function for creating a simulation model of a communication network was developed, based on the description of communication events and the values of the parameters they determined. The application of this method enabled us to solve the problem of describing the time of data generation and distribution in the communication networks.
The application of multigraphs for the mathematical derivation of a more precise distribution function of data was proposed and compared with other methods in which the distribution function of data was approximated by the type of network traffic and by the time variation of the data.
The application of multigraphs and their related matrices enabled multiple descriptions of network traffic in terms of events and communication parameters, which enabled their change in time to be mathematically represented as a function of the schedule. The new approach enabled a more accurate description of the network traffic in the design of a simulation model of the communication network and time-accurate results in the simulation.

The paper is organized as follows. Section 2 provides an overview of the different methods used for defining and statistically describing network traffic. Definitions of all the starting elements needed to describe network communications are presented in Section 3. In Section 4, the basic concept, and details of the proposed method of applying multigraphs for describing the time of network traffic and data distribution are presented. Section 5 presents an application of the mathematical derivation, and a graphical representation of the time distribution functions in the proposed method. Section 6 concludes the study and gives directions for further research in the application of the method.

2. Related Work

In earlier works, different methods of defining network traffic were proposed. Network traffic is a complex time-stochastic or -deterministic process of network structure. Earlier methods consisted of complex procedures for describing and defining the network traffic. The methods are complex, especially for describing network traffic with the distribution of multiple data formats. The basic method of defining network traffic was realized by measuring and recording traffic in test networks, as in [3,4,5,6,7], with the theoretical derivation of statistical mathematical descriptions for further analysis. Measurements of the generated and distributed network traffic enabled statistical descriptions and parametric descriptions in [4,5,8]. In recent work [3,6,7,9,10,11,12,13], statistical typification of network traffic with known distribution functions and parameter variation was defined or the traffic was described by using self-similarity related to heavy-tail distributions [14,15]. Further definitions of network traffic were limited to the recognition network traffic type (voice internet, HTTP, VoIP, multimedia, etc.) and descriptions of the intended distribution function. An overview and comparison of the methods used in previous studies to define network traffic are given in Table 1.

These methods are integrated into simulation tools such as OPNET and other advanced software simulation packages. The application of the methods described in previous research may lead to incorrect selection or description of the statistical parameters of the distribution function. The consequence is that one may obtain incorrect simulation results and derive erroneous conclusions and decisions about the design of the network structure. To increase the accuracy of parameters when describing and defining network traffic in [2], we performed an analytical method where we used multigraphs to describe communication interactions as events between network elements. This method was executed and tested on the example of deterministic arranged communication in the network.

The network traffic matrix model has a significant role in network design, network traffic design and analysis of the results, as in the method given in [18,19]. In the method proposed in this study, we introduced a new approach to defining the basis required to obtain a mathematical model from the network traffic matrix. By applying the mathematical models given in [20,21], the time dimension of the multigraph was added and the estimated distribution function that describes the network traffic as a statistical time event can be obtained.

3. Data Exchange in the Communication Network

Central to definition of network traffic models is the matrix of network traffic between the source and the destination of communications in the network. To achieve functional relations between the participants in communication, the type of necessary communication is defined for which information flows are determined. The realization and establishment of information flows in an ITCN require the application of appropriate network application services, marked as (S1, S2, Sn). The data exchange and information are the basis for defining the moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) when the participants in communication establish their communication interaction, achieve mutual communication, and, at the same time, exchange certain types and amounts of data.

The moments of time in which communication should take place, the duration of the communication, and the types of information provided for the data exchanged in communication are the particular operational procedures of communication. In Figure 1, the operational procedures determine the moments of change in the amount of data to be distributed between elements of the organizational structure. The type of data and the information (voice, message, symbol, text, table, image, video, etc.) to be exchanged between participants in the communication process are defined by the operational procedures for the command operational function. Starting from the defined communication relationships between the elements of the organization, the matrix of network traffic can be obtained.

In accordance with the concepts given in [2], communication data exchanges are defined. The time distribution of the data is mapped to the distribution function for the OPNET simulation model, which is shown by the logical flow in Figure 1.

3.1. The Data of Network Distribution over Time

Designing a simulation model required us to define the variation in the amount of data generated at the source sent to the destination. The value of the amount of data (Adt) in kbps or Mbps was determined for each type of information (voice, message, symbol, text, table, database, image, video, etc.) exchanged between the elements.

The distribution of the amount of data occurs at the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m). Additionally, depending on the function in the communication process, the minimum and maximum amount of data generated by the network element, Ei, for distribution in the network is calculated. The transfer of information to the ITCN requires encrypting of the communication channels. The amount of data for distribution in the network can be increased by the amount of digital code required for protecting information (reconstruction, encryption, error detection). The increase in the amount of data is realized in relation to the header size of the individual layers of the OSI network model. The choice of the access technique, the technology and transmission medium, the communication protocols, and the data packet size (MTU) affect the amount of data to be transferred by the telecommunication links in the ITCN. The total data payload for distribution by the network from the source to destination is determined by the steps in the procedure shown in Figure 2.

The distribution function of communication interactions between network elements according to the method given in [2] represents the law of data generation over time. Data generation is realized by the application services.

3.2. Distribution Function for Variations in the Amount of Data

The accuracy of the network traffic simulation results in the OPNET simulation model is conditioned by the choice of the distribution function. The distribution function should describe the generation of data and the variation in the amount of data over time. The selection of the distribution function requires one to define its parameters. Moreover, the distribution function is based on the statistical study of communication in the network traffic record, as in [3,4,5,6,7,11,12,13], or is based on an approximation of the type of network traffic (audio, messages, text, IP, VoIP, video, HTTP, web, ATM, etc.) with existing known distribution functions (exponential, Poisson, Normal (Gaussian), uniform, Weibull etc.), as in [8,10,16,17]. Data traffic modeling is based on self-similarity with the Pareto distributions and the α-stable distributions, as in [14,15]. Two basic parameters describe the event of data generation at the source in the simulation model. The data generation time is the first parameter. The time of data generation should be adjusted by the time of establishing communication between the network elements. The variation in the amount of data over the duration of the communication is the second parameter. If one chooses an inappropriate distribution function, or by incorrectly defining the value of the variation in the amount of data, or by incorrectly defining the time, the network traffic will be described incorrectly. Simulations of incorrectly described network traffic will not match the predicted network traffic in an ITCN. In that case, the simulation results are not accurate for analyzing and optimizing the communication network. The error in describing the amount of data generated over time is reduced when the amount of data generated is defined based on the ITCN network traffic matrix and the corresponding distribution function.

4. Description of the ITCN Network Distribution Using Multigraphs

The network data distribution in an ITCN is realized based on the operating procedures and according to the methodology specified in [1]. The information flows of the distribution of the predicted types of information between the network elements are described as well. The multigraphs are defined based on this description. The application of multigraphs allows the time relationships of data distribution to be displayed based on the operational communication procedures. For each moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) when data are exchanged, a multigraph is formed. The formed multigraph is joined with the similarity matrix. The corresponding value of the distribution function for each moment of time is calculated by mathematical estimation of the similarity matrix associated with the multigraph. The use of all the calculated values for all moments of time in the communication interval ΔT = [t₀, t_m] enables the definition of the appropriate distribution function.

4.1. Data Distribution Time Scheme between ITCN Network Elements

The generation and distribution of data between network elements in the ITCN are realized through the network application services Srv, rv = (1, 2, …). Each application service in the ITCN enables the establishment of communication and network distribution of the appropriate type of data. The Srv application service on the network element Ei is activated by establishing a communication interaction between the network elements Ei and Ej (i ≠ j) at the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m). The moments of time are set at the beginning of the time interval in which the application service is active between the network elements. The generation of communication information is enabled and transformed into the appropriate amount of digital data for distribution to the network element Ej is performed. The time scheme of communication interactions (Figure 3), as in [1,2], shows the flow of these activities from the operational procedures.

The individual timeline of the individual service Srv now of time t = (t₀, t₁, …, t_N−1, t_N, t_m) of activation are separated from the given time scheme. The amount of data Adt = (Adt₀, Adt₁, …, Adt_N−1, Adt_N, Adt_m) generated at the moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) in the application service Srv is also defined. Examples of the separate individual timing schemes for Services S1 and S2 are displayed in Figure 4.

Other ways of representing the communication activation of application services S1 to S4 between network elements E1 to E8 are shown in Figure 5. This representation is used to define the amount of data generated Adt at the moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m). For example, the service S1 in the network element E1 with a data quantity of Adt₀ = 15 kbps for distribution to the network element E2 at time t₀ is denoted as E1E2S1_Adt₀.

4.2. Multigraphs of Data Distribution in ITCN Network Traffic

The data exchanged by the applicable service Srv at each moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) are shown by presenting the network traffic as a multigraph (Figure 6a). The single-service multigraph (labeled SSMG_Srv_Adt) shows the amount of data Adt (kbps or Mbps) exchanged between the network elements Ei and Ej (i ≠ j) by the application service Srv at time t. A single edge between the nodes (simple graphs) Ei and Ej (i ≠ j) represents the communication interaction between these network elements, where the amount of data Adt are distributed through the application service Srv at time t. The creation of all the single-service multigraphs between the network elements Ei and Ej (i ≠ j) through the application service Srv for each moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) enables the presentation of data exchanged during the communication time interval ΔT = [t₀, t_m]. The total data exchanged between the nodes Ei and Ej (i ≠ j) through all application services, are Srv = (S1, S2, …, Sn) with time t representing the unification of all the single-service multigraphs formed previously into one multi-service multigraph (labeled MSMG_S1Sn_Adt), as shown in Figure 6b. The multi-service multigraph enables the definition of network traffic among the ITCN’s network elements at the observed moments of time t = (t₀, t₁...t_N−1, t_N, t_m) and enables the application of graph sampling theory to perform predictions, as in [22].

The creation of multi-service multigraphs for each moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) enables the presentation of the exchange of all data through all application services during the communication time interval ΔT = [t₀, t_m]. A set of multi-service multigraphs allows one to define the total network traffic among the ITCN’s network elements during the communication time interval ΔT = [t₀, t_m].

4.3. Matrix Associated with the ITCN Network Traffic Distribution Multigraph

The multigraph data distribution in network traffic of the ITCN is mathematically represented by the symmetric matrix T_{SSMG_Srv_Adt} in Equation (1) with integer terms and a diagonal of zero, where n is the number of network elements Ei. The associated symmetric matrix is formed by using a timeline or a time plane of the communication interactions (Figure 5) or by using a single-service multigraph (Figure 6a), such that

T_{SSMG_Srv_Adt} = [\begin{matrix} 0 & A d_{12} t & A d_{13} t & A d_{14} t & A d_{15} t & . & A d_{1 n} t \\ A d_{21} t & 0 & A d_{23} t & A d_{24} t & A d_{25} t & . & A d_{2 n} t \\ A d_{31} t & A d_{32} t & 0 & A d_{34} t & A d_{35} t & . & A d_{3 n} t \\ A d_{41} t & A d_{42} t & A d_{43} t & 0 & A d_{45} t & . & A d_{4 n} t \\ A d_{51} t & A d_{52} t & A d_{53} t & A d_{54} t & 0 & . & A d_{5 n} t \\ A d_{61} t & A d_{62} t & A d_{63} t & A d_{64} t & A d_{65} t & . & A d_{6 n} t \\ . & . & . & . & . & 0 & . \\ A d_{n 1} t & A d_{n 2} t & A d_{n 3} t & A d_{n 4} t & A d_{n 5} t & . & 0 \end{matrix}]

(1)

where Ad_ijt is the amount of data distributed in the communication interactions between the nodes Ei and Ej (i ≠ j) with the application service Srv = (S1, S2, Sn) at the moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m). Figure 7 shows the single-service multigraph for data exchanged among the network elements E1 to E8 with the application service S1 at the moment of time t₀, and its associated symmetric 8 × 8 matrix.

For all single-service multigraphs, the set of associated matrices T_{SSMG_Srv_Adt} defines the matrix at each moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m), as in Equations (2) and (3):

T_{SSMG_S1_{Adt}_{0}} = [\begin{matrix} 0 & A d_{12} t_{0} & A d_{13} t_{0} & A d_{14} t_{0} & A d_{15} t_{0} & . & A d_{1 n} t_{0} \\ A d_{21} t_{0} & 0 & A d_{23} t_{0} & A d_{24} t_{0} & A d_{25} t_{0} & . & A d_{2 n} t_{0} \\ A d_{31} t_{0} & A d_{32} t_{0} & 0 & A d_{34} t_{0} & A d_{35} t_{0} & . & A d_{3 n} t_{0} \\ A d_{41} t_{0} & A d_{42} t_{0} & A d_{43} t_{0} & 0 & A d_{45} t_{0} & . & A d_{4 n} t_{0} \\ A d_{51} t_{0} & A d_{52} t_{0} & A d_{53} t_{0} & A d_{54} t_{0} & 0 & . & A d_{5 n} t_{0} \\ A d_{61} t_{0} & A d_{62} t_{0} & A d_{63} t_{0} & A d_{64} t_{0} & A d_{65} t_{0} & . & A d_{6 n} t_{0} \\ . & . & . & . & . & 0 & . \\ A d_{i 1} t_{0} & A d_{i 2} t_{0} & A d_{i 3} t_{0} & A d_{i 4} t_{0} & A d_{i 5} t_{0} & . & 0 \end{matrix}]

(2)

T_{SSMG_S1_{Adt}_{m}} = [\begin{matrix} 0 & A d_{12} t_{m} & A d_{13} t_{m} & A d_{14} t_{m} & A d_{15} t_{m} & . & A d_{1 n} t_{m} \\ A d_{21} t_{m} & 0 & A d_{23} t_{m} & A d_{24} t_{m} & A d_{25} t_{m} & . & A d_{2 n} t_{m} \\ A d_{31} t_{m} & A d_{32} t_{m} & 0 & A d_{34} t_{m} & A d_{35} t_{m} & . & A d_{3 n} t_{m} \\ A d_{41} t_{m} & A d_{42} t_{m} & A d_{43} t_{m} & 0 & A d_{45} t_{m} & . & A d_{4 m} t_{0} \\ A d_{51} t_{m} & A d_{52} t_{m} & A d_{53} t_{m} & A d_{54} t_{m} & 0 & . & A d_{5 n} t_{m} \\ A d_{61} t_{m} & A d_{62} t_{m} & A d_{63} t_{m} & A d_{64} t_{m} & A d_{65} t_{m} & . & A d_{6 n} t_{m} \\ . & . & . & . & . & 0 & . \\ A d_{i 1} t & A d_{i 2} t & A d_{i 3} t & A d_{i 4} t & A d_{i 5} t & . & 0 \end{matrix}]

(3)

The set of associated matrices T_{SSMG_Srv_Adt} enables one to define the function for the distribution of data in the network through the service Srv = (S1, S2, …, Sn) in the communication time interval ΔT = [t₀, t_m].

The variation in the value of the amount of data VarAdt distributed by the application service Srv = (S1, S2, …, Sn) during the communication time interval ΔT = [t₀, t_m] is defined by the minimum and maximum values of the amount of data distributed among the network elements of the ITCN:

V a r A d t = [A d_{m i n}, A d_{m a x}]

(4)

A d_{m i n} = m i n {A d_{i j} t_{0}, \dots, A d_{i j} t_{m}}, i = [1, n] j = [1, n] i \neq j

(5)

A d_{m a x} = m a x {A d_{i j} t_{0}, \dots, A d_{i j} t_{m}}, i = [1, n] j = [1, n] i \neq j .

(6)

For the multi-service multigraph, the associated symmetric n × n matrix T_{MSMG_S1Sn_Adt} of data distribution shown in Equation (7) is formed. The value of the distribution function of the total amount of data distributed through all application services Srv at the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m) is defined by the associated symmetric matrix T_{MSMG_S1Sn_Adt}.

T_{MSMG_S1Sn_Adt} = [\begin{matrix} 0 & s A d_{12} t & s A d_{13} t & s A d_{14} t & . & s A d_{1 n} t \\ s A d_{21} t & 0 & s A d_{23} t & s A d_{24} t & . & s A d_{2 n} t \\ s A d_{31} t & s A d_{32} t & 0 & s A d_{34} t & . & s A d_{3 n} t \\ s A d_{41} t & s A d_{42} t & s A d_{43} t & 0 & . & s A d_{4 n} t \\ s A d_{51} t & s A d_{52} t & s A d_{53} t & s A d_{54} t & . & s A d_{5 n} t \\ s A d_{61} t & s A d_{62} t & s A d_{63} t & s A d_{64} t & . & s A d_{6 n} t \\ . & . & . & . & 0 & . \\ s s A d_{n 1} t & s A d_{n 2} t & s A d_{n 3} t & s A d_{n 4} t & . & 0 \end{matrix}]

(7)

where sAd_ijt is the total amount of data distributed in communication interactions between nodes Ei and Ej (i ≠ j) with all the activated application services Srv = (S1, S2, …, Sn) at the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m), where:

s A d_{i j} t = \sum_{S 1}^{S n} \sum_{i = 1}^{n} \sum_{j = 1}^{n} A d_{i j} t, i \neq j, t = (t_{0,} t_{1 \dots} t_{N - 1,} t_{N,} t_{m})

(8)

The set of associated matrices T_{MSMG_S1Sn_Adt} at each moment of time t = (t₀, t₁, …, t_N−1, t_N, t_m) defines a set of multi-service multigraphs. The set of associated symmetric matrices T_{MSMG_S1Sn_Adt} enables the definition of the value of the distribution function of the total amount of data distributed through all the application services Srv during the communication time interval ΔT = [t₀, t_m], such that

T_{MSMG_S1Sn_{Adt}_{0}} = [\begin{matrix} 0 & s A d_{12} t_{0} & s A d_{13} t_{0} & s A d_{14} t_{0} & . & s A d_{1 n} t_{0} \\ s A d_{21} t_{0} & 0 & s A d_{23} t_{0} & s A d_{24} t_{0} & . & s A d_{2 n} t_{0} \\ s A d_{31} t_{0} & s A d_{32} t_{0} & 0 & s A d_{34} t_{0} & . & s A d_{3 n} t_{0} \\ s A d_{41} t_{0} & s A d_{42} t_{0} & s A d_{43} t_{0} & 0 & . & s A d_{4 n} t_{0} \\ s A d_{51} t_{0} & s A d_{52} t_{0} & s A d_{53} t_{0} & s A d_{54} t_{0} & . & s A d_{5 n} t_{0} \\ s A d_{61} t_{0} & s A d_{62} t_{0} & s A d_{63} t_{0} & s A d_{64} t_{0} & . & s A d_{6 n} t_{0} \\ . & . & . & . & 0 & . \\ s A d_{n 1} t_{0} & s A d_{n 2} t_{0} & s A d_{n 3} t_{0} & s A d_{n 4} t_{0} & . & 0 \end{matrix}]

(9)

T_{MSMG_S1Sn_{Adt}_{m}} = [\begin{matrix} 0 & s A d_{12} t_{m} & s A d_{13} t_{m} & s A d_{14} t_{m} & . & s A d_{1 n} t_{m} \\ s A d_{21} t_{m} & 0 & s A d_{23} t_{m} & s A d_{24} t_{m} & . & s A d_{2 n} t_{m} \\ s A d_{31} t_{m} & s A d_{32} t_{m} & 0 & s A d_{34} t_{m} & . & s A d_{3 n} t_{m} \\ s A d_{41} t_{m} & s A d_{42} t_{m} & s A d_{43} t_{m} & 0 & . & s A d_{4 n} t_{m} \\ s A d_{51} t_{m} & s A d_{52} t_{m} & s A d_{53} t_{m} & s A d_{54} t_{m} & . & s A d_{5 n} t_{m} \\ s A d_{61} t_{m} & s A d_{62} t_{m} & s A d_{63} t_{m} & s A d_{64} t_{m} & . & s A d_{6 n} t_{m} \\ . & . & . & . & 0 & . \\ s A d_{n 1} t_{m} & s A d_{n 2} t_{m} & s A d_{n 3} t_{m} & s A d_{n 4} t_{m} & . & 0 \end{matrix}]

(10)

The elements for defining the data distribution function are realized by forming all the sets of associated single-service matrices T_{SSMG_Srv_Adt} and all the sets of the associated multi-service matrices T_{MSMG_S1Sn_Adt}.

5. Generating the Data Distribution Function in the ITCN by Sampling Multigraphs

The function distribution of the amount of data in the time interval (pdF(ΔT)) for implementation in the OPNET simulation model is defined by sampling the single-service and multi-service multigraphs of data distribution. Sampling the multigraphs is equivalent to sampling the associated symmetric matrices given in [23]. The associated symmetric matrix is sampled using the sequential importance sampling (SIS) method for sampling multigraphs given in [20]. The value of the estimated distribution function represents the content of the multigraph and is determined by applying the asymptotic approximation given in [20,21].

Additionally, the distribution function for the approximation of multigraphs is defined by using graphs and weighting coefficients and applying the methods given in [20,24].

The matrix T_SSMG(t) = T_{SSMG_Srv_Adt} belongs to the set of associated symmetric matrices related to the distribution of data between the network elements Ei and Ej (i ≠ j) with the application service Srv at the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m), where ΣT is the number of matrices in the set. The distribution function q(T_SSMG(t)) > 0 for the matrix T_SSMG(t) defines the amount of data for distribution between the network elements Ei and Ej (i ≠ j) through the application service Srv at the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m). The estimated value of the distribution function is:

E_{q} [\frac{1}{q (T_{S S M G} (t))}] = \sum_{T} \frac{1}{q (T_{S S M G} (t))} q (T_{S S M G} (t)) = | \sum^{} T |

(11)

| \sum^{} T | = \frac{1}{n} \sum_{i = 1}^{n} \frac{1}{q (T_{S S M G} (t))}_{.}

(12)

The distribution function q(T_SSMG(t)) is determined with the test distribution function q(·) by sampling the T_SSMG(t) matrix column by column (c₁, c₂, …, c_n), using the method and procedure in [20] and [21]. Here, q(T_SSMG(t)) is represented by:

q (T_{S S M G} (t) = (c_{1}, c_{2}, \dots c_{n})) = q (c_{1}) q (c_{1} | c_{2}) \dots q (c_{n} | c_{n - 1} \dots, c_{1}) t_{1}, \dots, t_{N - 1}

(13)

The sum of the row margin (d_i) of the matrix (n × n), denoted d⁽²⁾, d⁽³⁾, …, d⁽ⁱ⁾, and the updated row margins of the (n−1) × (n−1) submatrix are determined for the matrix T_Ei(t).

The procedure of sampling and removing the matrix columns in T_SSMG(t) is repeated until all the columns (c₁, c₂, …, c_n) have been sampled. The value of each margin of the row (d_i) and the total margin of the matrix (M) is calculated according to the following:

d_{i} = \sum_{j = 1}^{n} α_{i j}, i = [1, n]

(14)

d = (d_{1}, d_{2}, d_{3} \dots, d_{n})

(15)

d^{(2)} = (d_{2} - α_{21}, d_{4} - α_{42}, \dots, d_{n} - α_{n 2})

(16)

d^{(i)} = (d_{i} - α_{i, i - 1}, d_{i + 1} - α_{i + 1, i - 1}, \dots, d_{n} - α_{n, i - 1}), i = [2, n]

(17)

M = \sum_{i = 1}^{n} d_{i}

(18)

For the T_SSMG(t) matrix (t), the number of multigraphs |Σd| is calculated. Submatrices are formed by removing columns. For forming the submatrices, the number of multigraphs |Σd⁽ⁱ⁾| is calculated, which corresponds to the associated submatrix. Based on the asymptotic approximation given in [20] and [21], the expression for |Σd| and for |Σd⁽ⁱ⁾| is performed.

| \sum^{} d | \sim Δ_{d} \equiv \frac{f (M)}{\prod_{i = 1}^{n} d_{i}!} e^{a (d)}

(19)

f (M) = M! / {[(\frac{M}{2})! 2^{\frac{M}{2}}]}_{}

(20)

a (d) = {(\sum_{i} (\begin{matrix} d_{i} \\ 2 \end{matrix}) / M)}^{2} - \sum_{i} (\begin{matrix} d_{i} \\ 2 \end{matrix}) / M_{}

(21)

| \sum^{} d^{(2)} | \sim Δ_{d^{(2)}} \equiv \frac{f (M - 2 d_{1})}{\prod_{i = 1}^{n} (d_{i} - α_{i 1})!} e^{a (d^{(2)})}

(22)

The expression obtained for each column (c₁, c₂, …, c_n) of the T_SSMG(t) matrix determines the marginal distribution function of each column p(c_i) ∼ q(c_i). The marginal distribution function represents the derived distribution function q(T_SSMG(t)).

p (c_{1} = (0, α_{21}, \dots, α_{n 1})) = \frac{| \sum^{} d^{(2)} |}{| \sum^{} d |}

(23)

p (c_{2}) = \frac{| \sum^{} d^{(3)} |}{| \sum^{} d |}

(24)

p (c_{n - 1}) = \frac{| \sum^{} d^{(n)} |}{| \sum^{} d |}

(25)

Combining the Expressions in (19) and (22) derives an expression for q(c₁):

q (c_{1} = (0, α_{21}, \dots, α_{n 1})) = \frac{1}{\prod_{i = 1}^{n} (d_{i} - α_{i 1})!} e^{a (d^{(2)})}

(26)

The expressions for q(c₁|c₂), …, q(c_n|c_n−1, …, c₁) are derived in the same way. The value of q(T_SSMG(t)) is calculated from the obtained values. The procedure given in [19] evaluates the sampling efficiency of the matrix and the accuracy of the derived distribution function q(T_SSMG(t)) in relation to the marginal distribution p(T_SSMG(t)). The value of the standard estimation error μ and the difference between the obtained values of cv² is used to calculate the following expression:

\hat{μ} = \frac{\sum_{i = 1}^{n} f (T_{SSMGi} (t)) \frac{p (T_{SSMG} (t))}{q (T_{SSMG} (t))}}{\sum_{i = 1}^{n} \frac{p (T_{SSMG} (t))}{q (T_{SSMG} (t))}} = \frac{\sum_{i = 1}^{n} f (T_{SSMG} (t)) \frac{\frac{1}{| q (T_{SSMG} (t)) |}}{q (T_{SSMG} (t))}}{\sum_{i = 1}^{N} \frac{\frac{1}{| q (T_{SSMG} (t)) |}}{q (T_{SSMG} (t))}} = = \frac{\sum_{i = 1}^{N} f (T_{SSMG} (t)) \frac{1}{q (T_{SSMG} (t))}}{\sum_{i = 1}^{N} \frac{1}{q (T_{SSMG} (t))}}

(27)

The use of weight coefficients (weights) ω_i calculated by the procedure given in [20] and [22] realizes the correction and adjustment of values between the derived distribution function q(T_SSMG(t)) and the marginal distribution p(T_SSMG(t)).

The application of the previous procedure to all single-service multigraphs and their associated matrices defines the change in the amount of data for distribution in the network. The change in the amount of distributed data is realized between network elements Ei and Ej (i ≠ j) through the application service Srv in the communication time interval ΔT = [t₀, t_m]. The calculated values of the distribution function form a set of values of the distribution function q(T_SSMG(t)). These values enable one to define the data distribution function pdF(Srv(t)) of the application service Srv in the communication time interval ΔT = [t₀, tm]:

{q (T_{S S M G} (t))} = > p d F (q (T_{S S M G} (t))), t = (t_{0}, t_{1}, \dots t_{N - 1}, t_{N}, t_{m})

(28)

p d F (S r v (Δ T)) = p d F (q (T_{S S M G} (t))), S r v = (S 1, \dots S n) .

(29)

The same procedure applies to multi-service multigraphs. The calculated values of the distribution function pdF(q(T_MSMG(t))) define the data distribution function pdF(S1Sn(ΔT)) of all application services Srv in the communication time interval ΔT = [t₀, t_m]. The calculated values form a set of values {q(T_SSMG(t))}. The use of values from the set of values {q(T_SSMG(t))} thus formed enables the creation of graphs of the distribution function pdF(Srv(ΔT)). Graphically, the values are connected in the order of the moments of time t = (t₀, t₁, …, t_N−1, t_N, t_m) to which the values refer.

The graph of the data distribution function pdF (Figure 8) shows the regularity of the time of the change in the amount of data for distribution among the ITCN’s network elements through the application service Srv in the communication time interval ΔT = [t₀, t_m].

The same procedure is used to define the graph of the data distribution function of all the application services pdF(S1Sn(ΔT)) in the communication time interval ΔT = [t₀, t_m]. Determining the similarity of the graphs of the function pdF(Srv(t)) to the graphs of the known distribution functions (exponential, Poisson, Normal (Gaussian), uniform, Weibull, etc.) given in [8,10,16,17] allows one to identify the derived distribution function pdF(Srv(t)). The identification of pdF(Srv(t)) as a known distribution function enables the selection of the existing distribution function in the OPNET simulation model and the application of the parameter values from the graphs (Figure 8). If no similarity is found, the use of software tools integrated into the OPNET software allows one to import graphics of the derived distribution function pdF(Srv(t)). This way, the distribution function can be used as a newly defined distribution for realization of the ITCN simulation model.

6. Conclusions and Further Research

The application of multigraphs for describing data distribution in an ITCN was described in this study. It enabled a more accurate definition of communication events in the network and a mathematical description of the network traffic. The described method of applying multigraphs is primarily intended for the development of a simulation model of networks with deterministically defined and controlled communication. The method of applying multigraphs is also possible in networks with stochastically generated network traffic, which requires a definition of the variations in the amount of data to be generated between the source and destination. Achieving more accurate results of simulating the predicted communication in an ITCN enables the integration of all the derived distribution functions for a description of the network traffic into the OPNET simulation model. Based on the analysis of the results of realized discrete OPNET simulations of an ITCN, we propose the application of single-service multigraphs, and the derivation and use of the distribution functions of pdF(Srv(t)) of data distribution for each application service. For realizing the simulation of network traffic flows, we propose the use of multi-service multigraphs, and the creation and use of data distribution functions pdF(S1Sn(ΔT)) of all services at the same time. In future research, we will analyze the correlation of the data distribution functions with functions that describe individual network parameters (connectivity, capacity, etc.). This research will enable predictions in the design and optimization of an ITCN using a simulation model.

Author Contributions

Methodology and algorithm development: S.M. and V.M.; simulation, data analysis, testing and validation: S.M. and I.P.; Multigraph analysis: K.P.-P. and G.R.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

Miletic, S.; Milosevic, M.; Mladenovic, V. A New Methodology for Designing of Tactical Integrated Telecommunication and Computer Networks for OPNET Simulation. In Proceedings of the 9th International Scientific Conference on Defensive Technologies, OTEH 2020, Belgrade, Serbia, 15–16 October 2020; Technical Review. Volume 70, pp. 35–40. [Google Scholar]
Miletic, S.; Mladenovic, V.; Pokrajac, I. Application of multigraph sampling method in network traffic design of simulation model of Integrated Telecommunication and Computer Network. E3S Web Conf. 2021, 279, 02011. [Google Scholar] [CrossRef]
Tatarnikova, T.; Sikarev, I.; Karetnikov, V.; Butsanets, A. Statistical research and modeling network traffic. E3S Web Conf. 2021, 244, 07002. [Google Scholar] [CrossRef]
Antoniuo, I.; Ivanov, V.V.; Zrelov, P.V. Statistical model of network traffic. Phys. Part. Nucl. 2004, 35, 530. [Google Scholar] [CrossRef]
Chen, T.M. Chapter in The Handbook of Computer Networks. In Network Traffic Modeling; Hossein, B., Ed.; Wiley: Hoboken, NJ, USA, 2007. [Google Scholar]
Dymora, P.; Mazurek, M.; Strzalka, D. Computer network traffic analysis with the use of statistical self-similarity factor. Ann. UMCS Inform. AI XIII 2013, 2, 69–81. [Google Scholar] [CrossRef][Green Version]
Alsamar, M.; Parisis, G.; Clegg, R.; Zakhleniuk, N. On the distribution of traffic volumes in the Internet and its implications. arXiv 2019, arXiv:1902.03853v1 [cs.NI]. [Google Scholar]
Leemis, L. Input Modeling Techniques for Discrete-Event Simulations; Department of Mathematics, The College of William & Mary: Williamsburg, VA, USA, 2021; pp. 23187–28795. [Google Scholar]
Sanchez, P.J. Fundamentals of simulation modeling. In Proceedings of the Winter Simulation Conference, Washington, DC, USA, 9–12 December 2007. [Google Scholar]
Chandrasekaran, B. Survy of Network Traffic Models. Available online: https://www.cse.wustl.edu/~jain/cse567-06/ftp/traffic_models3/index.html (accessed on 1 May 2021).
Markelov, O.; Duc, V.N.; Bogachev, M. Statistical Modeling of the Internet Traffic Dynamics: To Which Extent Do We Need Long-Term Correlations; Elsevier: Amsterdam, The Netherlands, 2017; Volume 485, pp. 48–60. [Google Scholar] [CrossRef]
Malyeyeva, O.; Davydovskyi, Y.; Kosenko, V. Statistical Analysis of Data on the Traffic Intensity of Internet Networks for the Different Periods of Time. In Proceedings of the Second International Workshop on Computer Modeling and Intelligent Systems (CMIS-2019), Zaporizhzhia, Ukraine, 15–19 April 2019; pp. 897–910. [Google Scholar]
Davydovskyi, Y.; Reva, O.; Artiukh, O.; Kosenko, V. Simulation of Computer Network Load Parameters over a Given Period of Time. In Innovative Technologies and Scientific Solutions for Industries; Quarterly Scientific Journal: Kharkiv, Ukraine, 2019; ISSN 2524-2296. [Google Scholar] [CrossRef]
Barner, K.; Gonzalo, R.A. Processing Theory, Methods and Applications, CRC Press LLC, 2000 N.W.; Corporate Blvd.: Boca Raton, FL, USA, 2004; ISBN 0-8493-1427-5. [Google Scholar]
Arce, G.R. Nonlinear Signal Processing: A Statistical Approach; Wiley and Sons: Hoboken, NJ, USA, 2004. [Google Scholar]
Schmidt, R.; De, O.; Sadre, R.; Pras, A. Gaussian traffic revisted. In 2013 IFIP Networking Conference; IEEE: Piscataway, NJ, USA, 2013; ISBN 978-3-901882-55-5. [Google Scholar]
Manaseer, S.; Al-Nahar, O.M.; Hyassat, A.S. Network traffic modeling. Int. J. Recent Technol. (IJRTE) 2019, 7. [Google Scholar]
Gongx, Y.; Wang, X.; Malboubi, M. Towards Accurate Online Traffic Matrix Estimation in Software-Defined Networks. In Proceedings of the 1st ACM SIGCOMM Symposium on Software Defined Network Research, Santa Clara, CA, USA, 17–18 June 2015; pp. 1–7. [Google Scholar]
Mukhin, V.; Romanenkov, Y.; Bilokin, J.; Rohovyi, A.; Kharazii, A.; Kosenko, V.; Kosenko, N.; Su, J. The Method of Variant Synthesis of Information and Communication Network Structures on the Basis of the Graph and Set-Theoretical Models. Int. J. Intell. Syst. Appl. 2017, 11, 42–51. [Google Scholar] [CrossRef]
Eisinger, R.D.; Chen, Y. Sampling strategies for conditional inference on multigraphs. Stat. Its Interface 2018, 11, 649–656. [Google Scholar] [CrossRef]
Chen, Y.; Diaconis, P.; Holmes, S.P.; Liu, J.S. Sequential Monte Carlo methods for statistical analysis of tables. J. Am. Stat. Assoc. 2005, 100, 109–120. [Google Scholar] [CrossRef]
Sardellitti, S.; Barbarossa, S.; Di Lorenzo, P. Enabling Prediction via Multi-Layer Graph Inference and Sampling, Auckland University of Technology; IEEE: Piscataway, NJ, USA, 2020; pp. 1–4. [Google Scholar]
Lau, D.L.; Arce, G.R.; Parada-Mayorga, A.; Dapena, D.; Pena-Pena, K. Blue-Noise Sampling of Graph and Multigraph Signals: Dithering on Non-Euclidean Domains. IEEE Signal Processing Mag. 2020, 37, 31–42. [Google Scholar] [CrossRef]
Barrat, A.; Barthelemy, M.; Satorras, R.P.; Vespignani, A. The architecture of complex weighted networks. Proc. Natl. Acad. Sci. USA 2004, 101, 3747. [Google Scholar] [CrossRef] [PubMed]

Figure 1. The basic concept of the mapping timeline of the network elements’ distribution of data in the OPNET simulation model.

Figure 2. The procedure for determining the amount of data to send to the network elements in the ITCN.

Figure 3. Timeline of activation and repetition of the network elements’ communication interactions and the network application services (Srv).

Figure 4. Timeline of activation and repetition of network application services S1 and S2 with amounts of generated data Adt.

Figure 5. Activation and repetition of the application services S1 to S4 and the communication interactions among network elements E1 to E8: (a) timeline; (b) time plane.

Figure 6. Data exchange multigraph among network elements E1 to E8 with the application services S1 (blue line) to S2 (red line) at time t₀: (a) single-service multigraph; (b) multi-service multigraph.

Figure 7. Data exchange single-service multigraph and its symmetric 8 × 8 matrix.

Figure 8. The graph of the data distribution function pdF through the application service Srv in the communication time interval ΔT = [t₀, t_m].

Table 1. Review of the network traffic defining methods.

Reference	Methods	Measurement Source	Statistical Description	Traffic	Illustrating	Application	Country	Year
[3]	Traffic self-similarity, the approximation function of traffic	The average daily traffic recorded	Pareto distribution	2G, voice, HSDPA,	Function distribution graph	Simulating real network traffic	Russia	2021
[4]	Nonlinear analysis of traffic measurements	A medium-sized LAN with 200 to 250 interconnected computers	Kolmogorov’s scheme for describing network traffic, log-normal distribution, Gaussian distribution	NetBEUI, TCP/IP	Function distribution graph	Realistic dynamical models of network traffic	Russia	2004
[5]	Mathematical approximation	Traffic volume recorded by routers, ethernet traffic traces	Poisson’s probability distribution	Ethernet, MPEG4, TCP/IP, web, email, multimedia	Function distribution graph	Traffic modeling	USA, Texas	2007
[6]	Self-similarity statistical analysis of network traffic measurements	Computer network in small company	Gaussian or power-law probability distributions	Web, HTTP, internet, email, SSL, IPv6	Function distribution graph	Computer network traffic analysis	Poland	2021
[7]	Statistical analysis	Academic, commercial and residential networks; data centers	Log-normal distribution, Gaussian distribution, Weibull distribution	Internet IPv4	Function distribution graph	Predicting the proportion of time traffic, statistically predicted outcomes for the network	USA, Chicago	2019
[8]	Introductory techniques for input modeling; graphical and statistical methods; mathematics	Sample statistics, the Kolmogorov–Smirnov test statistic, the discrete-event simulation, hypothetical arrival process, stochastic processes	Binomial, degenerate Normal, exponential, Bezier curve, independent binomial, bivariate exponential, Markov chain, Poisson process, nonhomogeneous Poisson process, Markov process	Discrete, continuous modeling arrivals	Histogram, function distribution graph	Input models available to simulation analysts	USA	2001
[9]	Simulation modeling process	Describing the behaviors and interactions	Classical statistics right-triangular distribution, cumulative distribution function, uniform distribution	Discrete event systems		Simulating and modeling operations, distribution modeling	USA	2007
[16]	Traffic modeling	Core router of a university, backbone links trans-Pacific backbone link	Gaussian distribution	Gaussian traffic model	Q–Q plots, timescales	Network modeling	Netherlands, Denmark	2013
[10]	Traffic analysis	Counting process, inter-arrival time process, discrete-time traffic,	Poisson, Pareto, Weibull, Markov, Markov chain, on–off model, interrupted Poisson	The traffic on the network	Mathematically, graphs	Traffic modeling, capacity planning the design of networks and services
[17]	Traffic analysis	The University of Jordan’s network	Poisson traffic model, long-tail traffic models	Internet traffic	Daily traffic flow graph	Traffic model QoS	Jordan	2019
[11]	Traffic analysis, mathematics	1998 FIFA World Cup website	Poisson traffic model, Gaussian distribution	Internet traffic	Function distribution graph	Simulation model	Russia	2017
[12]	Traffic analysis	Hubs of cities in Europe and America	Normal probabilistic	Internet traffic, web traffic	Traffic flow graph, probabilistic distribution	Simulation model	Ukraine	2019
[13]	Traffic analysis	Computer network traffic		Multimedia, VoIP	Average daily computer network traffic	Network modeling	Ukraine	2019

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Miletic, S.; Pokrajac, I.; Pena-Pena, K.; Arce, G.R.; Mladenovic, V. A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network. Entropy 2022, 24, 1294. https://doi.org/10.3390/e24091294

AMA Style

Miletic S, Pokrajac I, Pena-Pena K, Arce GR, Mladenovic V. A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network. Entropy. 2022; 24(9):1294. https://doi.org/10.3390/e24091294

Chicago/Turabian Style

Miletic, Slobodan, Ivan Pokrajac, Karelia Pena-Pena, Gonzalo R. Arce, and Vladimir Mladenovic. 2022. "A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network" Entropy 24, no. 9: 1294. https://doi.org/10.3390/e24091294

APA Style

Miletic, S., Pokrajac, I., Pena-Pena, K., Arce, G. R., & Mladenovic, V. (2022). A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network. Entropy, 24(9), 1294. https://doi.org/10.3390/e24091294

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Multigraph-Defined Distribution Function in a Simulation Model of a Communication Network

Abstract

1. Introduction

2. Related Work

3. Data Exchange in the Communication Network

3.1. The Data of Network Distribution over Time

3.2. Distribution Function for Variations in the Amount of Data

4. Description of the ITCN Network Distribution Using Multigraphs

4.1. Data Distribution Time Scheme between ITCN Network Elements

4.2. Multigraphs of Data Distribution in ITCN Network Traffic

4.3. Matrix Associated with the ITCN Network Traffic Distribution Multigraph

5. Generating the Data Distribution Function in the ITCN by Sampling Multigraphs

6. Conclusions and Further Research

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI