Abstract
We examine a general wireless sensor network (WSN) model which incorporates a large number of sensors distributed over a large and complex geographical area. The study proposes solutions for a flexible deployment, low cost and high reliability in a wireless sensor network. To achieve these aims, we propose the application of an unmanned aerial vehicle (UAV) as a flying relay to receive and forward signals that employ nonorthogonal multiple access (NOMA) for a high spectral sharing efficiency. To obtain an optimal number of subclusters and optimal UAV positioning, we apply a sensor clustering method based on K-means unsupervised machine learning in combination with the gap statistic method. The study proposes an algorithm to optimize the trajectory of the UAV, i.e., the centroid-to-next-nearest-centroid (CNNC) path. Because a subcluster containing multiple sensors produces cochannel interference which affects the signal decoding performance at the UAV, we propose a diagonal matrix as a phase-shift framework at the UAV to separate and decode the messages received from the sensors. The study examines the outage probability performance of an individual WSN and provides results based on Monte Carlo simulations and analyses. The investigated results verified the benefits of the K-means algorithm in deploying the WSN.
1. Introduction
Deployments of wireless sensor networks (WSNs) are increasing because of their beneficial applications. For example, WSNs can be deployed to monitor or collect environmental data (meteorological information such as precipitation, wind speed and direction, air pressure, humidity, temperature, etc.) in remote or difficult terrain [1,2,3,4,5,6]. A major challenge in deploying a WSN is distributing a large number of wireless sensors over a large and complex geographical area. Wireless sensors are generally low cost, have low power consumption and are highly flexible in their application. However, transmitting a signal from a wireless sensor directly to a control centre presents an important challenge [7], especially if a large number of wireless sensors are deployed to directly collect data. Using terrestrial infrastructure for the purpose of collecting data from wireless sensors is impractical because of high deployment costs and a low flexibility. A potential solution to this problem is using an environmental monitoring system which dispatches an unmanned aerial vehicle (UAV) to a geographical area to retrieve the collected sensor data.
To deploy a WSN, Heinzelman et al. [8] proposed a low-energy adaptive clustering hierarchy (LEACH), a clustering method now considered the most well-known clustering protocol for WSNs. In a hierarchical topology, clusters contain two types of node: cluster members and cluster heads. Member nodes are grouped into different clusters, and in each cluster, a single node is designated a cluster head. The cluster head has the most important role in the cluster, tasked with receiving signals from cluster members and forwarding those signals to other cluster heads [9] or the base station [10].
UAVs have gained increasing consideration as aerial relays which deliver mobility and on-demand wireless connections in areas with complex topography and no network coverage. A UAV’s online time, however, is limited by its own on-board energy limitations. The evolution of UAV-assisted WSNs is compelling the scientific community to search for new ways of performing energy harvesting (EH) from external power sources to prolong the online time of UAVs. A variety of effective solutions have been proposed, grouped according to two main types of technique, namely, simultaneous wireless information and power transfer (SWIPT) and techniques for determining the optimal positions for the UAV.
Radio frequency EH shows promise as a potential solution for UAV-assisted WSNs. Initial studies on radio frequency EH were used in a technology termed wireless power transfer (WPT) to recharge the wireless sensors in the WSN. A new radiofrequency EH technique, termed SWIPT, introduced significant benefits to WPT [11]. Many authors have studied SWIPT over the last two decades [12,13,14,15], investigating the performance difference between time switching and power splitting in SWIPT protocols [16]. The current study applied a time-switching protocol because it supports a phase for EH.
Applied to all beyond 5G/6G wireless communications, nonorthogonal multiple access (NOMA) provides massive connections, low latency and high reliability [14,17,18,19,20,21]. In the current study, we took advantage of NOMA’s benefits, applying NOMA at the UAV to superimpose coding of the data received from wireless sensors and to forward this superimposed signal to a mobile data centre.
1.1. Motivation
The use of machine learning in practical applications is escalating. The authors in [6] applied an artificial neural network (ANN) for sensor clustering. By contrast, some wireless sensors in the study in [6] were clustered as separate single-member clusters. In [22], the authors proposed a distance- and energy-constrained K-means clustering scheme (DEKCS) for cluster head selection to prolong the lifetime of underwater WSNs. With this new clustering algorithm, a prospective cluster head was selected according to its position in the cluster and its residual battery level. The authors dynamically updated residual energy thresholds set for prospective cluster heads to ensure that the network fully depleted its energy before disconnection. In this manner, cluster heads could be drained of energy and become inactive/dead sensors. The current study applied the K-means algorithm and the gap statistic method first introduced in [23] to obtain an optimal number of subclusters. To our best knowledge, the gap statistic method has not been applied for WSN clustering in any previous study.
In [24], the authors examined a UAV-assisted data collection WSN. The UAV’s trajectory was optimized by applying the travelling salesman problem. Note that in [1], the UAV in the proposed network visited every wireless sensor, while in [24], the optimal serving order for sensors was determined according to a standard travelling salesman problem algorithm, which can be optimally solved with the efficient cutting-plane method (i.e., the shortest path from the start point to the end point). The authors also proposed an algorithm which used the pattern search method to solve the problem of optimizing the UAV position and sensor uploading power. In [1], the UAV could be exhausted as a consequence of long flight distances. The authors in [24] used a UAV that navigated the shortest path from the start point to end point, but it consequently ignored/missed some wireless sensors. In another study, the authors addressed the UAV’s trajectory problem by jointly optimizing the UAV’s velocity, hovering positions and visiting sequence [25]. The scientific community is very interested in studying UAVs’ trajectories for the significant potential gains in aerial network performance. Researchers have applied several types of trajectory, for example, straight trajectory [26,27], circular trajectory [10,28,29] and spiral trajectory [30,31,32]. The authors in [25] introduced an interesting UAV trajectory scheme (Figure 4), where the UAV visited all N monitoring areas and then found suitable positions to transmit the collected data. In [25], the UAV collected data in four stages: (i) UAV data collection flight, (ii) UAV data collection processing, (iii) UAV data transmission flight and (iv) UAV data transmission processing. That UAV’s operating schedule is illustrated in Figure 5 [25].
The current study proposes the use of a UAV for its high mobility, quick implementation and low cost. The main drawback to a small-sized, lightweight aircraft such as a UAV, however, is its limited on-board energy. UAVs are therefore not suitable for flying close to each sensor to collect data, as proposed in [25]. The current study therefore investigated the application of a K-means algorithm to cluster wireless sensors into multiple, optimized subclusters.
1.2. Contribution
Inspired by the studies mentioned in the previous section, we employed a UAV as an aerial relay to provide a sustainable, functional solution for a WSN. The main contributions of the current study are:
- The use of three-dimensional Cartesian coordinates for a WSN which contains a random number of randomly distributed wireless sensors.
- The decomposition of the UAV trajectory optimization into two subproblems: (i) the global WSN cluster is divided into multiple subclusters whose number is optimized with unsupervised machine learning which applies K-means clustering in combination with the gap statistic method; (ii) a centroid-to-next-nearest-centroid algorithm is then applied to find the shortest path for travel through every subcluster.
- An analysis of the system performance of the WSN over Rayleigh distributions and a presentation of the derived closed-form expressions for the outage probability at the UAV and mobile base station.
- Outage probability results for the UAV and mobile base station derived from Monte Carlo simulations and verified with an analysis.
The remainder of the paper is organized as follows: Section 2 introduces the WSN model, wireless sensor clustering algorithm, joint UAV trajectory, free-space channel modelling, and joint UAV operating schedule; Section 3 provides an analysis of the WSN’s performance based on outage probability and presents the closed form expressions for outage probability at the UAV and mobile base station; Section 4 examines and plots the investigated results; Section 5 discusses conclusions.
For clarity, Table 1 presents the notation used in the paper.
Table 1.
List of important notations.
2. WSN Model
The current study examines a general WSN with a randomly distributed number of wireless sensors. Figure 1 depicts a WSN with a random number of sensors positioned at the Cartesian coordinate in three dimensions. Let us assume that a mobile base station B is positioned at and each wireless sensor for is positioned randomly at coordinate , where and as shown in Table 2. For simplicity, we assume that the wireless sensors and mobile base station are positioned relative to a flat earth.
Figure 1.
Random positioning of sensor nodes, where .
Table 2.
Wireless sensor positions are distributed randomly.
Definition 1.
We denote the global set as containing all wireless sensors. returns N, the total number of wireless sensor nodes (i.e., ). Let us assume that the data observations (i.e., wireless sensor positioning) are clustered into K subclusters, i.e., and , where .
Figure 1 illustrates a random distribution of wireless sensors. Each wireless sensor is allocated a given index by the subscript n, where and a lower index n has a higher priority. For clarity, sensor has a higher priority than sensor (e.g., sensor has a higher priority than sensor ).
2.1. WSN Clustering
Remark 1.
Because the optimization problem is complex, we propose breaking it down into several subproblems and observing the random distribution of wireless sensors over a large geographical area, as illustrated in Figure 1. The global wireless sensor cluster can be divided into multiple subclusters. The number of subclusters can be optimized by applying the gap statistic method, and the wireless sensors can be assigned to a subcluster using the K-means algorithm. To solve these problems, we propose a solution in Proposition 1.
Proposition 1.
The optimal number of subclusters is yielded as follows:
- Observing the latitudes (x-axis) and longitudes (y-axis) of the wireless sensors, we determine the optimal number of subclusters . The gap statistic method is applied to the number of subclusters k to compute the corresponding total within the intracluster variation , i.e., the sum of squares function, given bywhere , returns the number of wireless sensor nodes in cluster and is the squared Euclidean distance of all pairwise sensor nodes in the cluster for and . It is important to note that we assume and , where is the number of observations (or the number of sensors within the global cluster ). Let us briefly consider factors and . We define to prevent a uniform distribution of wireless sensors positions throughout the area; the gap statistic method thus returns the optimal number of clusters . For example, Figure 2a,b in [23] plots the distribution of sensors spread throughout a region and the corresponding optimal number of clusters at , respectively. However, we also define to prevent each wireless sensor owning a private cluster, a problem that would lead to a UAV visiting every wireless sensor to collect data.
- Reference data sets Ω with a random uniform distribution are generated. Each reference data set ω of these reference data sets Ω is clustered with a variable number of clusters . The corresponding total is computed within the intracluster variation given in the dispersion metrics for and .
- The estimated gap statistic is computed as the deviation of the observed value from its expected value under the null hypothesis . Let . The standard deviation (sd) of the statistics is then computed, given by .
- Using the gap statistic method, the smallest value of κ is selected as the optimal number of clusters, the gap statistic being within one standard deviation of the gap statistic at , given and , where .
For example, Figure 2 indicates the optimal number of clusters at , determined by the gap statistic algorithm according to the randomly positioned sensor nodes shown in Figure 1.
Figure 2.
Optimized number of subclusters using the gap statistic method, the optimal number of clusters at satisfying the first maximum standard error.
The K-means algorithm was used to calculate the position for each centroid, with an optimal number of clusters . The computed centroids of four subclusters () are listed in Table 3. Figure 3 illustrates all wireless sensor nodes after clustering to K subclusters. After clustering, each sensor node is grouped into a subcluster; for example, indicates that sensor is a member of subcluster (Figure 3).
Table 3.
Centroids after clustering.
Figure 3.
Sensor clustering using the K-means algorithm, with optimal number of clusters .
2.2. Joint UAV Trajectory
Definition 2.
We introduce a novel joint UAV trajectory algorithm to compute the centroid to next nearest centroid.
Problem 1.
Operating as a flying relay, the UAV has the advantage of a high mobility and is able to fly close to wireless sensors to receive and forward a superimposed signal. This, however, leads to a long flight path, and the other wireless sensors must wait to be served. We minimized the flight time/path of the UAV according to the cluster centroid positions shown in Table 3. How to obtain the shortest flight path is outlined in Proposition 2.
Proposition 2.
The centroid-to-next-nearest-centroid trajectory was computed as follows:
- Step 1: To determine the nearest centroid from the mobile base station B, we calculate the smallest pairwise Cartesian distance from the mobile base station to each subcluster centroid.
- Step 2: The UAV selects the next nearest cluster centroid. In this case, the UAV considers candidate centroids without regard to any of the previously selected cluster centroids in . It is important that the centroids contained in the visited set be removed from the candidate list to prevent the UAV returning to the previous subcluster . The UAV repeats Step 2 (i.e., ) until the list of candidate subclusters is empty (i.e., ).
Without loss of generality, we examined a single round trip of the UAV. Table 4 lists the next nearest subcluster centroids determined from the above selection strategy. The results in Figure 3 indicate that subcluster was the nearest to the mobile base station B compared to the other subclusters. Subcluster was therefore selected at block period time . The UAV visited subcluster first to collect data from all sensor members in subcluster . The visited set was then updated. After all data from the sensor members in subcluster were collected, the UAV selected subcluster because it contained the next nearest subcluster centroid. The UAV then visited subcluster at global time period to collect data from all sensor members in the subcluster. The visited set was again updated. The UAV continued to follow this procedure, selecting the next nearest centroid and updating the visited set, until all data have been collected from each subcluster. In this manner, the UAV followed the shortest possible flight path, as shown in Figure 4. After travelling through all K subclusters () and collecting all data from wireless sensor members in each subcluster, the UAV’s task was complete and it returned to the mobile base station. For the real-time application of a UAV-assisted WSN, the UAV would repeat the round trips summarized in Table 5.
Table 4.
Pairwise centroid-to-centroid distance based on Cartesian distances.
Figure 4.
Joint UAV trajectory and the shortest path based on the centroid-to-next-nearest-centroid distance given by Algorithm 1 (i.e., ).
Table 5.
Joint trajectory schedule for global transmission time period T and optimal number of clusters K, where the UAV period .
Observing Table 4, notice that numbers with inclined lines (e.g., ) and numbers with bold (e.g., 0.5005) mean visited clusters and next nearest clusters. For clarity, when UAV visited cluster (row with ), the UAV selects the next-nearest cluster (i.e., ) and ignores cluster . Next, the UAV visited cluster (row with ), the UAV selects the next-nearest cluster (i.e., ) and ignores clusters and . The remaining rows in Table 4 have the same meaning.
| Algorithm 1K-means clustering for the optimal number of subclusters and shortest path determined from a centroid to the next nearest centroid |
|
2.3. Channel Modelling for a UAV-Assisted WSN
In our previous work [33], we considered free space (i.e., air-to-ground (A2G), ground-to-air (G2A) and air-to-air (A2A)) and first introduced the flat-earth distance based on real latitudes and longitudes. The proposed solutions were effective in determining and tracking the optimal positions for the UAV. A separate study [33] examined the problems related to channel modelling in a WSN which contained multiple subclusters. In the current study, we address the uplinks, i.e., the channels from the wireless sensors to the UAV () and the channels from the UAV to the mobile base station (). The precoding channel matrices and are expressed by
where and are the number of antennae on the wireless sensor and UAV U, respectively; the channel coefficient is formulated according to , where g is the Rayleigh fading channel, is the path-loss exponent, and is the G2A distance from the sensor node to UAV U. Note that the free-space distance based on latitude and longitude is given by expression ([33], Equation (2)). For simplicity and without loss of generality, all wireless sensors are allocated with Cartesian coordinates in a three-dimensional space. The G2A distance from wireless sensor to UAV U is therefore given by , where the x, y and z axes represent latitude, longitude and altitude, respectively, on a flat earth.
Similarly, the precoding channel matrix is expressed as
where is the number of antennae at the mobile base station, and the channel coefficient is formulated according to , where is the A2G distance from the UAV U to the mobile base station B and given by = .
2.4. UAV Joint Schedule
This study introduces a novel scheduling protocol for a UAV-assisted WSN. The coefficient t is the time required to complete a transmission cycle of three phases, i.e., , and , where is the first phase during which the UAV receives signals from the sensor nodes in a cluster, is the second phase during which the UAV receives radiofrequency energy from the mobile base station, and is the third phase during which the UAV transmits the superimposed signals to the mobile base station for data analysis. Figure 5 depicts an electronic control unit (ECU) which performs a task corresponding to a predefined operation in a common UAV schedule.
Figure 5.
Joint schedule.
The ECU implements an electronic switch which applies three successive modes during the same transmission block t:
- In phase , the interface of the receiving signal circuit is active while the other interfaces are inactive. The UAV receives signals from the sensor nodes in the currently visited subcluster, given by (5).
- In phase , the interface of the EH circuit is active while the other interfaces are inactive. The UAV receives radiofrequency energy from the mobile base station B, given by (10), while the ECU decodes the messages from the signals received from the wireless sensors.
- In phase , the interface of the transmitting signal circuit is active while the other interfaces are inactive. The UAV encodes the messages received in the first phase and forwards the superimposed signals to the mobile base station B, given by (11).
2.4.1. Phase 1: Uplinks between Wireless Sensors and the UAV
In the first phase, having selected the next nearest subcluster centroid, the UAV visits and hovers at the selected centroid and receives signals wirelessly from the subcluster’s sensors. Figure 6 illustrates the procedure of receiving signals and processing data at the UAV during the first phase.
Figure 6.
Procedure of processing data at the UAV.
Based on the number of subclusters K and the global period T, we calculate the UAV period t as follows:
where the function refers to the modulo value between T and K (e.g., for and , ). At each UAV period t, the UAV selects the next nearest centroid using the centroid-to-next-nearest-centroid algorithm. According to the trajectory mapped in Table 5, for , and , the UAV selects the subcluster and serves wireless sensors , , , , , , , , , , , and . The signals received from the wireless sensors in subcluster over UAV period t are given by
where t is obtained from (4), k is mapped as in Table 5, and is the transmit power of sensor . For simplicity, we assume that .
Let us denote , which is the mathematical description of a diagonal matrix, as follows:
where the diagonal matrix has the size for transmission period T and all nondiagonal elements are zero, as indicated in Figure 6. The predecoded matrix obtained at the UAV is derived by multiplying the received signal matrix (5) with the diagonal matrix (7); thus, . The UAV selects each element in the predecoded matrix to obtain the SINR at the point when the UAV decodes message from sensor , as follows:
where the signal-to-noise ratio (SNR) . For simplicity, we assume that .
We then obtain the instantaneous bit rate at the point when the UAV decodes message from sensor , as follows:
2.4.2. Phase 2: Prolong the UAV’s Online Time with EH
The most challenging aspect of deploying a UAV is managing its power limitations as a small, lightweight aircraft. We propose applying SWIPT techniques to prolong the UAV’s online time. In a previous study [33] (Figure 4), we adopted a power splitting protocol. In the current study, we applied a time-switching technique for its advantages in a WSN (Figure 5); the technique is different from the proposed time-switching models in [33] (Figure 4). In phase , the UAV harvests radiofrequency energy from the mobile base station according to
where is the power domain at the mobile base station, and is the expected channel gain between the mobile base station and the UAV at its current UAV position. It is important to note that is the collected energy factor and that we assume for simplicity.
2.4.3. Phase 3: Transmitting Signals
In [10], the authors applied the amplify-and-forward protocol at the UAV to receive and forward signals to a single device. In the current study, we implemented a decode-and-forward protocol at the UAV to ensure that the UAV received, decoded and encoded messages successfully before forwarding the superimposed signals to the mobile base station. To improve latency, we applied the emerging NOMA technique for its high spectral efficiency. The UAV U encoded the messages from the sensors in the current subcluster and superimposed them into the signal by sharing the power domain and using different power allocation factor . From the precoding matrix , as given by (3), only the best channel was selected for signal transmission.
In the third phase of transmission block t, the mobile base station B received radiofrequency signals as follows:
where is the AWGN (i.e., with zero mean and variance ) at the mobile base station B, is the power domain at UAV U, and is the power allocation factor for message of wireless sensor . The NOMA technique applies superimposed coding by sharing the power domain and therefore, the power allocation strategy strongly affects the success or failure of decoding a message. Previous studies [17,21,34] have also applied power allocation strategies; the current study, however, addresses a WSN divided into multiple subclusters and therefore proposes the novel power allocation strategy described below.
Proposition 3.
The power allocation strategy for transmitting messages from wireless sensors over UAV transmission period t while the UAV visits subcluster is expressed as follows:
where a sensor with a higher priority is allocated a larger power allocation factor; for example, sensor , which has the highest priority and is the first member in subcluster , is allocated the largest power allocation factor , whereas sensor , which has the lowest priority and is the last member in the subcluster , is allocated the smallest power allocation factor . For clarity, we applied the power allocation factors presented in Table 6. From Equation (12), the power allocation strategy in the subcluster is constrained such that and .
Table 6.
Power allocation factors at wireless sensors for transmitting messages, arranged according to subclusters.
The SINR at the mobile base station B when B decodes message treats other messages , where , and AWGN as interference by applying SIC:
where in (13) and in (14).
The maximum instantaneous bit-rate threshold attained if the mobile base station decodes message in the best-received signal, given by (11), is expressed as:
where , and .
3. System Performance Analysis
In this section, we derive the novel closed-form expressions for the independent outage probability at the UAV and the dependent outage probability at the mobile base station.
3.1. Outage Probability Performance at the UAV
Theorem 1.
The independent outage probability at the UAV U relates to the UAV’s unsuccessful decoding of the message in the received signal, given by (5). In other words, the maximum instantaneous bit-rate threshold, given by (9), cannot reach the predefined bit-rate threshold . The independent outage probability at the UAV U in transmission block t is therefore expressed as
Based on Equation (16), we propose Algorithm 2 to calculate the Monte Carlo simulations for the outage probability at the UAV U.
| Algorithm 2 Calculate the outage probability at the UAV U from (16) for transmission block t |
|
Remark 2.
From expression (16), we obtain the outage probability at the UAV over Rayleigh distributions:
where the SINR threshold is given by . It is important to note that Equation (17) obtains the independent outage probabilities at the UAV. Generally, the outage probability at the UAV is calculated from .
See Appendix C for the proof.
3.2. Outage Probability at the Mobile Base Station
Theorem 2.
The dependent outage event at the mobile base station occurs when the flying relay (FR)-UAV either cannot decode at least the message or the mobile base station B cannot decode at least the message from the best-received signal , given by (11). The outage probability at the mobile base station with an underlying U-assisted multi-input multioutput (MIMO)-NOMA network is therefore expressed as
Based on Equation (18), we propose Algorithm 3 to calculate the Monte Carlo simulations for outage probability at the mobile base station for transmission block t over Rayleigh distributions.
| Algorithm 3 Calculate the outage probability at the mobile base station from (18) for transmission block t over Rayleigh distributions |
|
Remark 3.
The outage probability at the mobile base station in transmission block t is given by (18) from Theorem 1 and expressed in novel closed-form as follows:
where SINR threshold .
See Appendix D for the proof.
4. Numerical Results and Discussion
In this section, we examine the individual WSN and discuss the results of the study. For the purposes of the analysis and the Monte Carlo simulations, a random number of wireless sensors N was generated and randomly distributed according to the positions illustrated in Figure 1. Unless specified otherwise, we assumed that the mobile base station’s position was at coordinate and that the UAV’s position determined by K-means clustering had a fixed altitude at . The number of antennae equipped at the wireless sensors, UAV and mobile base station was . The K-means algorithm determined the optimal number of subclusters as . The path-loss exponent factor was . The list of pairwise distances from each subcluster centroid to the mobile base station was , , and . The nearest subcluster to the mobile base station was therefore . The UAV selected subcluster for the global period . At global period , the UAV then selected the next nearest subcluster to its current subcluster , i.e., subcluster , because distances . At global period , the UAV again selected the next nearest subcluster to subcluster , i.e., subcluster , since distances . At global period , the UAV selected the next nearest subcluster to subcluster , i.e., subcluster , because the final distances . The UAV thus selected the shortest trajectory . Without loss of generality and for simplicity, we assumed that for a single round trip of the UAV and global period .
4.1. Numerical Results
Figure 7a–d plot the outage probabilities at the UAV at the point when it decoded the received signals from wireless sensors in subclusters , , and , respectively. The bit-rate threshold for all wireless sensors was bps/Hz. The outage probabilities at the majority of wireless sensors were very similar as SNR ; however, the graphs in Figure 7a indicate that the outage probability of sensor was worse than the outage probability at the other sensors of the same subcluster . Figure 3 indicates that wireless sensor was the farthest from the subcluster centroid (). The results verified the efficiency of the K-means algorithm. It is important that the bit-rate threshold for the wireless sensors was set to bps/Hz. However, the UAV successfully decoded most of the messages from the wireless sensors, achieving a high outage probability performance (Figure 7). We conclude that the outage probability performance of the majority of wireless sensors in each subcluster was equal since they were evenly distributed around the subcluster’s centroid. The Monte Carlo simulations given by (16) were also verified by the analysis results given by (17).
Figure 7.
Outage probability at the UAV for the UAV’s subcluster trajectory sequence (a) , (b) , (c) and (d) .
Next, we examined the results for the mobile base station and obtained its outage probability performance at the points when the UAV visited subclusters , , and and forwarded the superimposed signals, given by (11), to the base station (Figure 8a–d). The outage probability performance of the mobile base station was poorer than the outage probability performance at the UAV (Figure 7a–d), even though the bit-rate threshold was set to bps/Hz. This may have been because the UAV was deployed with NOMA and therefore, the sensors were forced to share the power domain to transmit the messages in the superimposed signal. This means that the last member in the subcluster was allocated a very small power allocation factor, given by (12). These power allocation factors are presented in Table 6. Subcluster contained wireless sensors, and the last wireless sensor in ( ) was allocated the lowest power allocation factor (). Therefore, despite a global optimization of the subclusters and the positions of the cluster centroids by the K-means algorithm, the large number of wireless sensors in the subcluster unfortunately led to unsatisfactory results.

Figure 8.
Outage probability at the mobile base station for the UAV’s subcluster trajectory sequence (a) , (b) , (c) and (d) .
4.2. Discussion
The outage probability performance at the mobile base station was strongly affected by several factors, such as the UAV’s transmit power , the distance of the UAV from the mobile base station and the number of messages transmitted in the superimposed signals. The UAV was not able to increase the transmit power , however, because of its power limitations. A large number of antennae at both the wireless sensors and the UAV could not be equipped since the constraints for a small size, light weight and low cost did not permit it. It was also not possible to reduce the distance from the UAV to the mobile base station because of obstructions in the terrain. To address these conditions, we equipped a larger number of antennae at the mobile base station, as the mobile base station incorporated a generator and energy was not a significant problem. Therefore, equipping antennae at the mobile base station instead of the same number at the UAV (Figure 8a–d) yielded the results in Figure 9a–d. It is clear that the outage probability improved significantly at the mobile base station for while . It is also clear that the outage probability performance at the mobile base station when the UAV visited subcluster improved greatly since this subcluster was closer than the other subclusters. The outage probabilities at the other subclusters also improved as the SNR .

Figure 9.
Improved outage probability at the mobile base station equipped with antennae for the UAV trajectory subcluster sequence (a) , (b) , (c) and (d) .
5. Conclusions
This study presented a general WSN containing a randomly distributed number of wireless sensors with three-dimensional Cartesian coordinates. To improve the WSN’s performance, we applied a K-means algorithm and gap statistic method to optimize sensor clustering into a number of subclusters K. The UAV’s trajectory was calculated with an algorithm which determined the shortest path between the subcluster centroids. The aims of the study were achieved (i.e., flexible deployment, low cost and high reliability) through the effective proposed solutions, and the results were verified with both Monte Carlo simulations and theoretical analysis. Although the study provided some benefits from the application of the K-means algorithm for wireless sensor clustering, some problems still persisted that can be studied in future work. Future studies can investigate the problems with (1) fragmented power resources created by an imbalance in the number of subcluster sensors and (2) some clusters covering a larger geographic area than others as a result of sparsely distributed sensors. As a potential solution, we propose dividing the network into larger clusters when the number of sensors reaches a certain threshold.
Author Contributions
T.-N.T. and T.-L.N. proposed ideas and formulation of overarching research aims, designed of WSN model, applied mathematical to analyze the proposed model, wrote the initial draft. V.T.H. contributed to preparation and presentation of the published work by those from the original research group, specifically critical review, commentary and revision—including pre- and post-publication stages. M.V. contributed to evolution of overarching research goals, management activities to annotate, oversight and leadership responsibility for the research activity planning and execution, including mentorship external to the core team, acquisition of the financial support for the project leading to this publication. All authors have read and agreed to the published version of the manuscript.
Funding
This research received funding from the Ministry of Education, Youth and Sports under grant Reg. No. SP2021/25 and partially under the Large Infrastructures for Research, Experimental Development and Innovations project Reg. No. LM2018140.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
The data used in this study were randomly generated.
Conflicts of Interest
The authors declare no conflict of interest. The funders had no role in the design of the study, the collection, analysis and interpretation of data, writing of the manuscript, or the decision to publish the results.
Abbreviations
| A2A | air-to-air |
| A2G | air-to-ground |
| AWGN | additive white Gaussian noise |
| CDF | cumulative distribution function |
| CSI | channel state information |
| EH | energy harvesting |
| FR | flying relay |
| G2A | ground-to-air |
| MIMO | multi-input multioutput |
| NOMA | nonorthogonal multiple access |
| probability density function | |
| SIC | successive interference cancellation |
| SINR | signal-to-interference-plus-noise ratio |
| SNR | signal-to-noise ratio |
| SWIPT | simultaneous wireless information and power transfer |
| UAV | unmanned aerial vehicle |
| WPT | wireless power transfer |
| WSN | wireless sensor network |
Appendix A
The probability density function (PDF) and cumulative distribution function (CDF) of the Rayleigh distribution are expressed, respectively, as:
and
where are random independent variables, i.e., x in (A1) and (A2). In addition, is the expected channel gain, where between the source (src) and destination (des).
Appendix B
We studied an individual WSN (Figure 1) which contained an optimal number of subclusters (Figure 2) and wireless sensors allocated according to those subclusters (Figure 3). A UAV travelled the shortest path from each subcluster centroid through all subclusters (Figure 4). The results for the outage probability at the UAV indicated a high performance (Figure 7a–d); however, the outage probability at the mobile base station was comparatively poorer (Figure 8a–d). We hypothesized that the MIMO technique could improve the OP performance and therefore equipped a larger number of antennas at the mobile base station (); the results showed a significant improvement in outage probability at the mobile base station (Figure 8a,c,d), except when the mobile base station decoded the message of the last sensor of subcluster , which demonstrated a worse performance (Figure 8b), probably because the subcluster contained sensors. Another individually generated WSN (Figure A1a) contained an optimal number of subclusters (Figure A1b) and wireless sensors allocated according to those subclusters (Figure A1c). This configuration also contained an optimal UAV trajectory (Figure A1d). It is important to note that the subclusters depicted in Figure A1c contained fewer sensors than the subclusters in Figure 3. The wireless sensors depicted in Figure A1c were consequently better served than the wireless sensors in Figure 3. However, fewer sensors in each subcluster, and thus a greater number of subclusters, led to a longer UAV flight path. For example, the results in Figure 4 indicate that the UAV’s period to visit all subclusters was ; the results in Figure A1d, however, indicate a UAV period of . We conclude that fewer sensors in each subcluster deliver better results, but at the expense of the UAV consuming more energy to travel longer distances.
Figure A1.
The randomly distributed WSN (a), determined optimal number of subclusters K (b), division into subclusters (c) and centroid-to-next-nearest-centroid trajectory (d).
Appendix C
By substituting the SINR given by (8) into (9) and then substituting (9) into Theorem 1, as given (16), we thus obtain the independent outage probability at the UAV:
From the precoding matrix given (2) and the PDF given by (A1), we obtain
Note that Equation (A4) evaluates the independent outage probability at the UAV at the point when the UAV decodes the received signal from wireless sensor in subcluster unsuccessfully. The outage probability at the UAV is generally expressed as .
Appendix D
Expression (18) is rewritten as follows:
Let . By substituting the SINR given by (8) into (9) and then substituting (9) into , we obtain
Similarly, let . By substituting the SINR given by (13) or (14) into (15) and then substituting (15) into , we obtain
From expression (A5), we conclude that the outage probability at the mobile base station is independent since it belongs to the outage probability at UAV. The study verified that and therefore, the outage probability at the mobile base station given by (A5) refers to . Observing the individual WSN model depicted in Figure 4, a possible explanation is that the UAV was close to the wireless sensors, which thus strongly owned the channel state information (CSI) , even though they simply transmitted their own messages under their own power domain . However, the UAV travelled a long distance since the subcluster was far from the mobile base station; therefore, it forwarded messages in the superimposed signal by sharing the power domain , and thus . To improve , both and must be improved. To improve , the number of antennae at the wireless sensors or their transmit power must be increased. However, wireless sensors have certain constraints, such as having a low cost and low power. These solutions are therefore not practical or even obtainable. To address these conditions, we attempted to improve by increasing the number of antennae at the mobile base station (). The outage probabilities at the mobile base station improved (Figure 9a–d) over the previous results (Figure 8a–d).
References
- Gong, J.; Chang, T.H.; Shen, C.; Chen, X. Flight Time Minimization of UAV for Data Collection over Wireless Sensor Networks. IEEE J. Sel. Areas Commun. 2018, 36, 1942–1954. [Google Scholar] [CrossRef]
- Li, J.; Zhao, H.; Wang, H.; Gu, F.; Wei, J.; Yin, H.; Ren, B. Joint Optimization on Trajectory, Altitude, Velocity, and Link Scheduling for Minimum Mission Time in UAV-Aided Data Collection. IEEE Int. Things J. 2020, 7, 1464–1475. [Google Scholar] [CrossRef]
- Zhan, C.; Zeng, Y.; Zhang, R. Energy-Efficient Data Collection in UAV Enabled Wireless Sensor Network. IEEE Wirel. Commun. Lett. 2018, 7, 328–331. [Google Scholar] [CrossRef]
- Zhan, C.; Zeng, Y. Completion Time Minimization for Multi-UAV-Enabled Data Collection. IEEE Trans. Wirel. Commun. 2019, 18, 4859–4872. [Google Scholar] [CrossRef]
- Wang, Z.; Liu, R.; Liu, Q.; Thompson, J.S.; Kadoch, M. Energy-Efficient Data Collection and Device Positioning in UAV-Assisted IoT. IEEE Int. Things J. 2020, 7, 1122–1139. [Google Scholar] [CrossRef]
- Kong, P.Y. Distributed Sensor Clustering Using Artificial Neural Network with Local Information. IEEE Int. Things J. 2022, 9, 21851–21861. [Google Scholar] [CrossRef]
- Ur Rahman, S.; Kim, G.H.; Cho, Y.Z.; Khan, A. Positioning of UAVs for throughput maximization in software-defined disaster area UAV communication networks. J. Commun. Netw. 2018, 20, 452–463. [Google Scholar] [CrossRef]
- Heinzelman, W.; Chandrakasan, A.; Balakrishnan, H. An application-specific protocol architecture for wireless microsensor networks. IEEE Trans. Wirel. Commun. 2002, 1, 660–670. [Google Scholar] [CrossRef]
- Dargie, W.; Wen, J. A Simple Clustering Strategy for Wireless Sensor Networks. IEEE Sens. Lett. 2020, 4, 1–4. [Google Scholar] [CrossRef]
- Zhang, S.; Zhang, H.; He, Q.; Bian, K.; Song, L. Joint Trajectory and Power Optimization for UAV Relay Networks. IEEE Commun. Lett. 2018, 22, 161–164. [Google Scholar] [CrossRef]
- Jayakody, D.N.K.; Thompson, J.; Chatzinotas, S.; Durrani, S. Wireless Information and Power Transfer: A New Paradigm for Green Communications; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Zhang, R.; Ho, C.K. MIMO Broadcasting for Simultaneous Wireless Information and Power Transfer. IEEE Trans. Wirel. Commun. 2013, 12, 1989–2001. [Google Scholar] [CrossRef]
- Zhou, X.; Zhang, R.; Ho, C.K. Wireless Information and Power Transfer: Architecture Design and Rate-Energy Tradeoff. IEEE Trans. Commun. 2013, 61, 4754–4767. [Google Scholar] [CrossRef]
- Tran, T.N.; Voznak, M.; Fazio, P.; Ho, V.C. Emerging cooperative MIMO-NOMA networks combining TAS and SWIPT protocols assisted by an AF-VG relaying protocol with instantaneous amplifying factor maximization. AEU-Int. J. Electron. Commun. 2021, 135, 153695. [Google Scholar] [CrossRef]
- Tran, T.N.; Vo, T.P.; Fazio, P.; Voznak, M. SWIPT model adopting a PS framework to aid IoT networks inspired by the emerging cooperative NOMA technique. IEEE Access 2021, 9, 61489–61512. [Google Scholar] [CrossRef]
- Perera, T.D.P.; Jayakody, D.N.K. Analysis of time-switching and power-splitting protocols in wireless-powered cooperative communication system. Phys. Commun. 2018, 31, 141–151. [Google Scholar] [CrossRef]
- Ding, Z.; Yang, Z.; Fan, P.; Poor, H.V. On the performance of non-orthogonal multiple access in 5G systems with randomly deployed users. IEEE Signal Process. Lett. 2014, 21, 1501–1505. [Google Scholar] [CrossRef]
- Timotheou, S.; Krikidis, I. Fairness for non-orthogonal multiple access in 5G systems. IEEE Signal Process. Lett. 2015, 22, 1647–1651. [Google Scholar] [CrossRef]
- Xiao, Y.; Hao, L.; Ma, Z.; Ding, Z.; Zhang, Z.; Fan, P. Forwarding strategy selection in dual-hop NOMA relaying systems. IEEE Commun. Lett. 2018, 22, 1644–1647. [Google Scholar] [CrossRef]
- Tang, X.; An, K.; Guo, K.; Wang, S.; Wang, X.; Li, J.; Zhou, F. On the performance of two-way multiple relay non-orthogonal multiple access-based networks with hardware impairments. IEEE Access 2019, 7, 128896–128909. [Google Scholar] [CrossRef]
- Tran, T.N.; Voznak, M. Adaptive multiple access assists multiple users over multiple-input-multiple-output non-orthogonal multiple access wireless networks. Int. J. Commun. Syst. 2021, 34, e4803. [Google Scholar] [CrossRef]
- Omeke, K.G.; Mollel, M.S.; Ozturk, M.; Ansari, S.; Zhang, L.; Abbasi, Q.H.; Imran, M.A. DEKCS: A Dynamic Clustering Protocol to Prolong Underwater Sensor Networks. IEEE Sens. J. 2021, 21, 9457–9464. [Google Scholar] [CrossRef]
- Tibshirani, R.; Walther, G.; Hastie, T. Estimating the number of clusters in a data set via the gap statistic. J. R. Stat. Soc. Ser. B 2001, 63, 411–423. [Google Scholar] [CrossRef]
- Wang, Y.; Chen, M.; Pan, C.; Wang, K.; Pan, Y. Joint Optimization of UAV Trajectory and Sensor Uploading Powers for UAV-Assisted Data Collection in Wireless Sensor Networks. IEEE Int. Things J. 2022, 9, 11214–11226. [Google Scholar] [CrossRef]
- Liu, K.; Zheng, J. UAV Trajectory Optimization for Time-Constrained Data Collection in UAV-Enabled Environmental Monitoring Systems. IEEE Int. Things J. 2022, 9, 24300–24314. [Google Scholar] [CrossRef]
- Ma, Y.; Tang, Y.; Tao, J.; Zhang, D.; Tao, S.; Li, W. Energy-Efficient Transmit Power And Straight Trajectory Optimization In Uav-Aided Wireless Sensor Networks. In Proceedings of the 2020 IEEE 91st Vehicular Technology Conference (VTC2020-Spring), Antwerp, Belgium, 25–28 May 2020; pp. 1–7. [Google Scholar] [CrossRef]
- Yuan, X.; Yang, T.; Hu, Y.; Xu, J.; Schmeink, A. Trajectory Design for UAV-Enabled Multiuser Wireless Power Transfer with Nonlinear Energy Harvesting. IEEE Trans. Wirel. Commun. 2021, 20, 1105–1121. [Google Scholar] [CrossRef]
- Li, B.; Qi, X.; Yu, B.; Liu, L. Trajectory Planning for UAV Based on Improved ACO Algorithm. IEEE Access 2020, 8, 2995–3006. [Google Scholar] [CrossRef]
- Wu, Q.; Zeng, Y.; Zhang, R. Joint Trajectory and Communication Design for Multi-UAV Enabled Wireless Networks. IEEE Trans. Wirel. Commun. 2018, 17, 2109–2121. [Google Scholar] [CrossRef]
- Ji, J.; Zhu, K.; Niyato, D.; Wang, R. Probabilistic Cache Placement in UAV-Assisted Networks with D2D Connections: Performance Analysis and Trajectory Optimization. IEEE Trans. Commun. 2020, 68, 6331–6345. [Google Scholar] [CrossRef]
- Jafari, B.; Saeedi, H.; Enayati, S.; Pishro-Nik, H. Energy-Optimized Path Planning for Moving Aerial Base Stations: A Non User-Oriented Framework. IEEE Commun. Lett. 2022, 26, 672–676. [Google Scholar] [CrossRef]
- Vanegas, G.; Armesto, L.; Girbés-Juan, V.; Pérez, J. Smooth Three-Dimensional Route Planning for Fixed-Wing Unmanned Aerial Vehicles with Double Continuous Curvature. IEEE Access 2022, 10, 94262–94272. [Google Scholar] [CrossRef]
- Tran, T.N.; Nguyen, T.L.; Voznak, M. Approaching K-Means for Multiantenna UAV Positioning in Combination with a Max-SIC-Min-Rate Framework to Enable Aerial IoT Networks. IEEE Access 2022, 10, 115157–115178. [Google Scholar] [CrossRef]
- Tran, T.N.; Voznak, M. On secure system performance over SISO, MISO and MIMO-NOMA wireless networks equipped a multiple antenna based on TAS protocol. EURASIP J. Wirel. Commun. Netw. 2020, 2020, 11. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).