1. Introduction
1.1. Review
WMNs provide low-cost and effective connectivity solutions. They have the term “mesh” as a foundation, which allows for communication between nodes without an access point (AP). Within WMNs, all network nodes can simultaneously be repeaters and clients, so as to cover broader regions. The architecture of a WMNN is shown in
Figure 1. The simplicity of adding the new routers makes WMNs the preferred technology for the Internet of Things (IoT), intrusion detection systems, remote video surveillance, smart grids, and environmental monitoring. In many applications, WMNs are expected to support internet services to heterogeneous clients [
1].
The nodes within a WMN perform a primary function, supporting several radios. This is also known as multi-radio. Nevertheless, this feature is not visible with network clients (smartphones, laptops) nor fundamental to creating a WMN. However, it is important to remark that multiradio functionality notably improves the WMN’s performance [
1].
HWPM is the default routing protocol in the IEEE 802.11s [
3] draft. It works in layer 2 and has three clear-cut objectives to create a WMN. The first is concerned with node discovery and mesh establishment, the second comprises the metric calculation and route maintenance, and the last one focuses on security and synchronization [
3]. Furthermore, the same draft proposes the metric calculation for
ALM as follows:
where:
O, channel access overhead, standard IEEE 802.11s;
Bt, number of bits in the test frame;
R, current bit rate in use;
ef, packet error rate at current bit rate.
Routing metrics predict the cost of the route calculated by the routing protocols. They provide quantifiable values that can be used to judge the cost or efficiency of a route [
4].
To carry out this work, two computer packages were used. The first is ns-3; this is a discrete-event network simulator for Internet systems, targeted primarily for research and educational use. ns-3 is a free, open-source software, licensed under the GNU GPLv2 license and maintained by a worldwide community [
5].
In ns-3, the implementation of different networking route protocols is developed within multiple file types, with .h files (the libraries) being the most important ones and .cc, which refer to language routines C++. Thus, the application of the IEEE 802.11s standard is characterized by many of these files, among others.
The second computer package is RStudio; this is a language for statistical computing and graphics. It is a GNU project which is similar to the S language and environment, which was developed at Bell Laboratories (formerly AT&T, now Lucent Technologies) by John Chambers and colleagues. R can be considered as a different implementation of S. There are some important differences, but much code written for S runs unaltered under R [
6].
R provides a wide variety of statistical (linear and nonlinear modeling, classical statistical tests, time-series analysis, classification, clustering, etc.) and graphical techniques, and is highly extensible. The S language is often the vehicle of choice for research in statistical methodology, and R provides an open-source route to participation in that activity [
7].
RStudio is a powerful and easy way to interact with R programming, considered an integrated development environment (IDE) that provides a one-stop solution for all statistical computing and graphics. The RStudio is a more advanced version of R that comes with a multi-pane window setup that provides access to all primary things on a single screen (such as source, console, environment and history, files, photos, graphs, etc.) [
8].
1.2. About the Paper
This work seeks to introduce a new methodology for metric calculation for the wireless routing protocol HWMP, and it is divided as follows.
Section 2 comprises the design proposal criteria employing two programs: ns-3 and RStudio. In addition, this section presents the software tools, scripts, and databases employed for this research work.
Section 3 describes the statistical tests for validating the proposal.
These tests encompass the management of tools such as lm, glm, ANOVA, ANCOVA, SARGAZER, and STEPWIZE, belonging to software RStudio 2022.07.1 (Posit Software, Boston, MA, USA).
Finally,
Section 4 encompasses the main conclusions of this project and some plausible alternatives for its improvement.
1.3. Contributions
This research presents a new methodology for routing metric calculation in WMNs, particularly for the implementation of HWMP in ns-3.
A lot of research has gone into understanding and evaluating the performance of WMNs with different scenarios, and [
9] presents a new metric for the HWMP protocol based on two criteria: the diversity of the two-hop channel and the hop delay. This new metric, called NMH, has been tested, simulated, and compared with the WCETT metric using ns-2, resulting in a decrease in the end-to-end delay of the network and increment of the throughput.
The information in [
10] proposes a metric calculation based on the following parameters: hop count, energy cost, and network traffic, which employ a linear combination as follows:
Simulation analysis shows that the approach outperforms single-metric routing protocols while supporting flexible service criteria, including load balancing at access points.
In [
11], an improvement in the metric of the HWMP protocol is presented. This new metric takes the traffic flow into account and allocates network resources efficiently. In addition, this model manages the link quality historically, as well as in real-time, so it becomes more sensitive to variations in link quality.
The model in [
12] uses EXT (expected transmission count) as a metric and compares it with the HWMP metric based on hops. It then utilizes ns-2, which results in a decrease in network latency. Conversely, the overhead is increased.
The model in [
13] presents a metric called ETX-3 hop because it comprises three nodes for metric calculation. It includes a more precise method to obtain the link quality and it also shows improvements in the network throughput regarding ETX.
In [
14], an analytic hierarchy process (AHP) is applied to calculate the two new metric calculations within four parameters of the QoS: delay, bandwidth, security, and packet loss probability. The results of this paper are focused on demonstrating the effects of metric prioritization on the routing decisions.
In [
15], presented herein is an improved routing metrics algorithm for WMNs that use the page rank (PR) theory and linear regression. To solve the problems mentioned above, the PR theory and multiple linear regression (MLR) methods are introduced in the metric calculation to generate metrics closer to real values by conditionally referring sibling metrics to a common parent or candidate.
After a thorough revision of the diverse approaches within the metric calculation for WMNs, statistical treatment for the employed models was found to be one of the main opportunities of this research. Even though in works such as [
10,
14,
15] new linear models are presented for the calculus of variations, none of these have included the statistical software RStudio for its treatment. With the latter being the pivot of this work, we subsequently took the new linear model back to ns-3 where we performed the WMN performance tests.
The RStudio allows for the management of multivariable models, including statistical tests such as an ANOVA, ANCOVA, STARGAZER, and STEPWIZE, which have not been considered in previous research.
2. Methodology
The methodology is divided into three parts: in the first one, the ns-3 network simulator comprises the variable data from the IEEE 802.11s standard with other additional variables that will be addressed subsequently.
The second part processes the gathered data in the RStudio software. In the last, statistical functions are employed to include a priori tools before returning to the ns-3 simulator to validate the suggested linear models.
Finally, the ns-3 simulator is back to compare the resulting models in the RStudio with the original one, IEEE 802.11s.
Figure 2 displays a detailed diagram of this last process.
Furthermore, the TXX tool allows for knowing the network performance without use of the ns-3 simulator. By inserting the other variables, this tool predicts the network performance. This is possible with the analysis of variance (ANCOVA) technique.
Figure 3 depicts a detailed model of the last process.
The next section describes the suggested methodology in detail; this requires changes in the calculation code of the ALM metric as part of the HWMP application in ns-3. This process incorporates other network characteristics such as the layer 4 protocol, packet numbers, mesh size, etc. These resulting data are stored in a compatible file and then transported to an RStudio for a statistical evaluation of the initial model, and therefore a proposal for new linear models using lm and glm, and finally for the calculation of the ALM metric.
2.1. Software & Hardware
An ns-3 version 3.30 was used for the simulation, which runs on the Linux Ubuntu 20.04.5 LTS system with an Intel Core i5-5200 CPU @ 2.2 GHz × 4, 8 GB de RAM, Architecture x64.
The mathematical modeling utilized RStudio 2022.07.1 +554, which runs on Windows 11 PRO version 21H2 with an Intel(R) Core (TM) i7-8565U CPU @ 1.80 GHz-2.00 GHz, 8 GB RAM, Architecture x64.
2.2. Data Collection in ns-3
To begin, the network simulator ns-3 was employed to write and/or modify four scripts. The first one is a modification of airtime-metric.cc, which, in its original version, allows for obtaining the ALM metric, and when modified, results in additional parameters for the protocol performance.
The script mesh-ping.cc was developed based on the original mesh.cc script. The objective was to create a metric report and the parameters for its calculation every 100 ms to evaluate the network performance frequently. This script works by default with UDP in layer 4.
Similarly, the script mesh-tcp.cc, which shares identical characteristics to mesh-ping.cc, was developed with the TCP in layer 4.
Finally, the script wifi-phy-header.cc was modified to work with network packets with an extension up to 8192 bytes, since the original script only allows 2048 bytes.
To present the network performance information, particularly unicasts and broadcast in transmission and reception, the following scripts were modified: peer-management-protocol.cc, peer-management-protocol_mac.cc, peer-link.cc, mesh-point-device.cc, mesh-wifi-interface-mac.cc, hwmp-protocol.cc, mesh-wifi-interface.cc, mesh-helper.cc, and hwmp-protocol-mac.cc.
Once the changes took place, it was possible to set our WMN simulator with the parameters shown below.
Figure 4 presents the general design for the mesh network used in ns-3 for the simulations.
The simulation was run for 2.5 s with all the possible interactions among the simulation parameters, as shown in
Table 1; these simulations produced the test database that was used to work with RStudio.
The generated database was:
See the variable descriptions below:
- -
FAILAVG: ef, defined in standard IEEE 802.11s
- -
TXDURATION: relation between Bt/r. defined in standard IEEE 802.11s
- -
METRIC: ALM metric, defined in standard IEEE 802.11s
The previous three variables were taken from the IEEE 802.11s standard draft and it was useful to obtain data from the original model.
The next six variables were obtained in the mesh network setting, as well as from the IEEE 802.11b y 802.11s standards.
- -
MXN: mesh size
- -
L4: protocol in layer 4 (they could be UDP or TCP)
- -
M_STEP: distance between nodes
- -
M_PACKET: packet size
- -
INTERFACES: number of channels, for IEEE 802.11b 3 channels available: 1, 6 y 11
- -
COMP: inverse complement to 1 of FAILAVG
- -
PDF: The packet delivery function can be defined as the ratio of the number of data packets delivered to the total number of packets sent. This illustrates the efficiency of the data delivered to the destination.
A greater value of the packet delivery ratio indicates a better performance of the protocol [
3].
- -
TXX: Network performance
This last variable is qualitative since it shows the network performance based on the criterion about the PDF which should be higher than 60%. This part is critical because at the end of the process, it will be used as a reference to analyze the results.
2.3. ALM Metric Calculation in RStudio
As stated before, RStudio was used for the statistical analysis of the resulting data base in ns-3.
From here onward, the simulator variables presented in
Table 2 will be called linear model (LM) regressors.
The database used for this initial analysis was mesh.csv. According to the IEEE 802.11s standard, the regressor that determines the ALM value is FAILAV, and the remaining equation terms lack importance since they are constant.
The LM for the original model IEEE 802.11s is:
Next, the original model was modified. This is doable with an LM, where all database regressors, the square, and the cube of FAILAVG are included:
Then, the tests ANOVA and STEPWIZE determined the regressors which were significant for the model. At this point, the regressors can be discarded.
This final section presents an LM using the regressor COMP as follows:
2.4. New Suggested Model
The researchers present new fitting generalized linear models (GLM) to effectively predict the METRIC considering that one of the results working with the LM (as seen in the previous section) is that the data distribution is not Gaussian.
From this point, the database
mesh_udp_2.csv is used to obtain a new independent model of the protocol in layer 4, since one of the tests in the previous section discarded the LM parameter. This model is:
This GLM used a POISSON and LOG distribution as a link function.
2.5. Networking Performance
To model TXX network performance, researchers also used GLMs for the reasons previously explained and because it is a categorical variable. In this model, TXX is the dependent variable, METRIC is the covariable, and the remaining regressors are independent variables.
A BINOMIAL and LOGIT distributions worked as a link function. Then, ANCOVA analysis was conducted, consisting of three additional tests which are called statistical assumptions.
Homogeneity of variance: The null-hypothesis shows that the variance is equal. The test LEVENE was used.
Data independence: the null-hypothesis shows the data independence. The test ANOVA was used.
Homogeneity of regression slopes: The null-hypothesis shows the homogeneity of the regression slopes. The test ANOVA was used.
All the scripts and databases can be seen here [
16].
4. Conclusions
This work reveals a new methodology for the ALM metric calculation in the 802.11s standard, specifically for the HWMP protocol, starting from ns-3 simulations for a later database processing in RStudio.
Second, statistical techniques such as the lm, glm, ANOVA, ANCOVA, and STEWIZE are employed to select and validate the regressors models presenting a priori tool to define, delete, or maximize network variables for the metric calculation in WMS protocol.
This paper also includes an analysis based on the PDF criteria, which illustrates that in all the cases, the number of delivered packets grows. Thus, our model presents a superior performance when compared to the original model. Nevertheless, it is important to remark that as of yet, more testing is missing.
Similarly, this new model based on GLM is presented to predict the WMN’s performance as a previous process to replicate this new model to the simulators and then to real scenarios.
It is worth mentioning that in addition to the predictive models produced by RStudio, the predictive model for network performance (TXX) was added based on the GLM. This is a priori model which does not require ns-3 to corroborate if the sent packets within the network reach their final destination with 60% of effectivity.
5. Future Research
This work continues and the researchers are carrying out new tests in the ns-3 simulators, end-to-end delay, throughput, etc. The results will be revealed in a future paper that will complement this research.
The computational cost is a critical and concerning issue that this work reveals. The inclusion of new regressors and new calculation ways interfere with the data processing on mobile devices. This problem should be dealt with in the Taylor series to simplify the metric calculation and reduce the processing tasks on network devices to compare the computational cost to traditional calculation options.
Future works can focus on the incorporation of new metrics in the proposed models and in the metrics explained in the following works [
4,
15,
17].
It is important to carry on with the process of taking the suggested models to the ns-3 simulator for a first evaluation and, if required, its application can be extended to real scenarios.
As it is warned in one of the earlier sections, it is feasible to try reducing the sample numbers near values of 1000 to avoid anomalies in Big Data.