1. Introduction
Every year, forest fires burn millions of hectares of natural landscapes worldwide, causing severe losses of biodiversity, releasing large quantities of carbon into the atmosphere, and inflicting major economic damage. In 2023, Canada experienced its most destructive fire season on record, with the affected area ranging from approximately 15 to over 18 million hectares [
1]. The economic cost of that season exceeded CAD 1 billion in direct firefighting expenditure alone, with insured losses and ecosystem service damage estimated at several times that figure [
1]. Globally, the area burned by wildfires has increased by more than 25% over the past two decades, with Southern Europe, Australia, and sub-Saharan Africa experiencing the most severe escalation. Climate change—through prolonged dry periods and increasingly frequent extreme weather events—is considered the primary driver of this escalating trend [
2]. In conflict-affected regions, wildfire risks are further compounded: according to satellite data from 2024, approximately 8700–9000 fires were recorded in Ukraine alone, affecting approximately 965,000 hectares—fires frequently ignited by ammunition debris and occurred in areas where mine contamination makes emergency access hazardous or impossible [
3].
Traditional approaches to wildfire monitoring—satellite observation, ground-based sensor networks, and manned aerial surveillance—each carry structural limitations that prevent timely, coordinated response. Satellite systems provide broad spatial coverage but are constrained by revisit cycles measured in hours, resulting in critical detection latency during rapidly developing fires. Ground sensor networks deliver continuous environmental data but lack spatial mobility and cannot independently confirm or localise detected anomalies. Manned aerial surveillance enables direct visual inspection but requires human operators and is therefore unsuitable for mine-contaminated terrain or other hazardous access conditions. Critically, none of these approaches, used in isolation, support the closed-loop cycle of detection, verification, and coordinated response that effective wildfire management requires.
The complementary integration of unmanned aerial vehicles (UAVs), unmanned ground vehicles (UGVs), and stationary sensor networks (SNs) addresses these limitations in a principled way. UAVs provide rapid aerial inspection and thermal imaging over hard-to-reach terrain, enabling the autonomous confirmation of sensor-detected anomalies within minutes. UGVs enable ground-level intervention in areas dangerous for human responders, including mine-contaminated zones. The operational relevance of autonomous robotic systems in such environments is supported by research on robotic-biological platforms for explosive ordnance detection [
4] and deep learning models enabling real-time ordnance identification by autonomous ground vehicles [
5]. SNs deliver continuous monitoring of temperature, humidity, CO
2, and smoke concentration as a persistent environmental baseline. Together, the three subsystems form an integrated detection–verification–response loop that no single platform can achieve alone. This integration is especially significant for smart regions—territories with data-driven governance infrastructure—where wildfire monitoring must feed into digital twins, early warning dispatchers, and public safety services as part of a coordinated environmental management system.
Despite growing research interest, most existing solutions address only one platform type, lack formalised infrastructure placement, or omit validation protocols suitable for operational deployment. A comprehensive, deployment-oriented framework that integrates all three subsystems with explicit engineering specifications has not yet been established. This study addresses that gap. It extends prior work [
6], which introduced the UAV–UGV–SN concept and demonstrated a neural network approach for risk-map generation, by providing the engineering framework required for practical deployment: (i) the hub-and-spoke infrastructure placement formalised as a mixed-integer programming problem with platform endurance, terrain, and budget constraints; (ii) an end-to-end data pre-processing pipeline comprising confidence scoring, resampling, Kalman and median filtering, and strategy fusion; (iii) six scenario-based operational modes with defined triggers, action sequences, and measurable KPIs; and (iv) a validation protocol based on synthetic experiments, precision/recall/F1 metrics, and hardware-in-the-loop testing.
Figure 1 presents the overall structure of the study, connecting its objectives, methodological steps, and expected outputs. To address the identified gap, the following research questions are formulated:
To address this gap, the following research questions are formulated:
RQ1: How can an integrated UAV-UGV-SN architecture be designed to ensure continuous, reliable, and scalable wildfire monitoring across diverse terrain and operational conditions?
RQ2: How can hub-and-spoke infrastructure deployment be formalised as an optimisation problem to minimise response time while satisfying platform endurance, budget, and logistical constraints?
RQ3: How can heterogeneous multimodal data from UAVs, UGVs, and sensor networks be pre-processed and fused to support accurate and timely wildfire risk assessment?
RQ4: What scenario-based operational modes and validation approaches can be used to evaluate integrated UAV-UGV-SN systems in the absence of full-scale field implementation?
RQ5: How does the deployment of an integrated UAV-UGV-SN system contribute to the environmental, social, and economic sustainability of smart regions?
The remainder of the paper is structured as follows.
Section 2 presents a structured literature review across the relevant domains.
Section 3 (Materials and Methods) describes the integrated system architecture, the hub-and-spoke deployment model, and data pre-processing pipeline, addressing RQ1, RQ2, and RQ3.
Section 4 (Results) presents the six scenario-based operational modes, addressing RQ4.
Section 5 (Discussion) analyses contributions, limitations, and sustainability impact, addressing RQ5.
Section 6 presents conclusions with explicit answers to each RQ.
3. Materials and Methods
This study follows design science research: it constructs and evaluates an artefact—an integrated UAV-UGV-SN architecture for wildfire monitoring—against a set of defined requirements rather than testing a hypothesis through empirical experimentation. The contribution is a deployment-focused engineering framework supported by formal mathematical modelling (mixed-integer programming), structured data pipeline design, and scenario-based functional evaluation. Empirical field validation is explicitly identified as a direction for future work.
The study proceeds in four methodological steps. First, system architecture is specified by defining the roles, interaction protocols, and communication topology of the three subsystems (
Section 3.1,
Section 3.2 and
Section 3.3). Second, infrastructure placement is formalised as a mixed-integer programming problem that maps platform parameters and terrain geometry to binary hub, spoke, and coverage assignment variables (
Section 3.2). Third, a data pre-processing and fusion pipeline is designed to handle heterogeneous, multi-rate telemetry from UAVs, UGVs, and SNs (
Section 3.4). Fourth, scenario-based operational modes are derived from the architecture to define the system’s expected behaviour under each operational condition, serving as the primary basis for functional evaluation (
Section 4).
This section specifies the engineering architecture of the integrated system, covering subsystem roles and interaction protocols, infrastructure placement, data pre-processing, and communication design. The overarching objective is to achieve sub-threshold ignition detection, structured operational data collection, and coordinated multi-platform response through spatially optimised and synchronised deployment of all system elements.
3.1. Integrated UAV-UGV-SN System Architecture
Three functionally distinct subsystems constitute the integrated architecture. UAV swarms fulfil the aerial layer: they execute rapid area surveys, carry thermal and optical sensors, maintain communication relay links, and can deliver initial suppression payloads; cooperative search and escort capabilities of such platforms are documented in [
51,
55,
57]. UGV swarms form the ground layer: their primary mission is precise hot-spot localisation, the establishment of firebreaks, and intervention in terrain that is dangerous or physically inaccessible to human responders. The stationary sensor network (SN) provides the persistent environmental baseline: nodes deployed across high-risk zones continuously log temperature, relative humidity, smoke density, and gas concentrations and forward aggregated readings to the central station for processing and analysis. Combining the spatial omnipresence of fixed sensing with the reactive mobility of autonomous platforms yields detection and response capabilities that neither component class could achieve independently.
Figure 2 shows a generalised diagram of this architecture.
All operational intelligence is consolidated at the central station, which aggregates incoming data streams, generates risk maps, and issues mission directives. Three subsystems execute in parallel under its coordination: the UAV swarm handles aerial imaging and inter-node communication relay; the UGV swarm localises ignition points and conducts suppression operations; and the SN subsystem maintains uninterrupted acquisition of temperature, humidity, smoke, and gas readings.
3.2. Hub-and-Spoke Deployment Model
The location of ground bases for UAVs and UGVs involves a well-thought-out combination of large strategic hubs, compact forward points, and mobile logistics nodes, while accounting for operational, landscape, and communication constraints. Strategic hubs serve as secure technical support and logistics centres: they house repair shops, warehouses for spare platforms and consumables, powerful charging stations and generators, as well as channels for high-speed data transmission and for coordinating operations. Forward points (spokes) are located closer to the operational area and are designed for quick preparation, the charging or replacement of batteries, and the rapid departure/departure and quick return of vehicles. They can be stationary with autonomous power supply or mobile (containers, vehicle complexes) for flexible coverage of “white spots.” Mobile logistics nodes complement the network, allowing for the rapid movement of resources and closing gaps in coverage during dynamic events.
When choosing and placing sites, it is necessary to systematically take into account the operational radius of specific platforms, i.e., the real flight/movement time, taking into account the time of the on-site task and the safe return reserve, the topography of the area, the availability and accessibility of logistics corridors, as well as meteorological conditions, in particular permanent wind corridors, which affect the safety of take-off and landing. Planning should be based on multidimensional criteria: time to point, energy costs, likelihood of communication loss, risk of exposure to external threats (such as combat or natural factors), and the possibility of infrastructure masking.
The hierarchical hub-and-spoke model combines the advantages of centralisation (savings on maintenance, concentration of specialists and supplies) with the resilience of a distributed network of forward points that ensure low response times. The practical implementation of this model requires a formalised approach: the use of coverage, facility-location, and p-median tasks to select the optimal set of hubs and spokes under given constraints (budget, maximum reach time, hub capacity). For dynamic scenarios, adaptive methods are helpful, allowing the placement of mobile nodes to be adjusted in real time. Hub a large base with battery/fuel reserves, spare equipment, powerful communication channels, and repair equipment.
A spoke/forward point is a light platform closer to the area of operations, designed for charging/replacing batteries, quick take-offs/landings, and operational departures.
To increase reliability, several technical requirements should be established: autonomous power supply for forward points (solar panels, batteries with a charge-control system), standardised quick-change battery kits, compact sites with structures for camouflaging and protecting equipment, and unified interfaces for quick platform changes between hubs and spokes.
Deployment planning should be supported by simulations and field pilots: simulation tests should take into account terrain, wind, and typical fire scenarios, hardware-in-the-loop testing to work out communications and logistics, as well as controlled pilot deployments to fine-tune maintenance procedures and evaluate KPIs (time from detection to verification, false positive frequency, time from verification to start of extinguishing, resource efficiency).
In general, the optimal distribution of bases combines the advantages of a hierarchical hub-and-spoke model with distributed autonomous sites, formalised through coverage and facility-location models, verified through simulations and pilots, and supplemented by practical requirements for energy, communications, security, and maintenance—which together provide a reliable, flexible, and efficient infrastructure to support UAV/UGV operations in real-world conditions.
Figure 3 shows example of a hub-and-spoke layout.
We consider the problem of two-step coverage in the two-dimensional space S⊂R with a set of points of interest I = {1,…,n}, the coordinates “
” of which there are sets of candidate places for strategic hubs J = {1,…,m} with coordinates “
” and for forward points (spoke) K = {1,…,p} with coordinates “
”. Platforms can be of different types of r ∈ R (eg UAV type A, UAV type B, UGV), each type is set to speed “
” (km/h), autonomy “
” (hours), mission lead time “
” (hours) and backup time “
” (hours). The distances between the points are denoted by d(a,b), travel/flight time (1).
Each hub or spoke opening has a cost of “” or “”, and the hub can have a capacity of “” (maximum load/number of spokes).
The idea of a two-step model, such field operations are performed with spoke departure/exit from spoke to point I, the performance of work and return to the same spoke must be invested in the autonomy of the platform; logistics support of spoke (carrying batteries, repairing) is carried out through hub, so each open spoke must be attached to one hub. For practical implementation, preliminary data processing is performed first. For each triangle (j, k, i) and platform type r, it is checked whether the mission from spoke k to point I is feasible, taking into account the round-trip time, on-site time, and reserve. For example, for UAV, the feasibility condition has the form (2).
If the inequality holds, then we denote “”, otherwise “”. Similarly, the suitability of the connection between the hub and the spoke, based on replenishment time or other logistical constraints, can be checked.
Formalisation of the problem as integer optimisation uses binary variables “
” means opening hub j, “
” opening spoke k, “
” pinning spoke k by hub j, and “
” covering point I with spoke k. The requirement of full coverage is written as (3).
For a point to be assigned only to an open spoke, the constraint applies “”.
Each open spoke must be anchored to exactly one hub, that is (4), and binding to a hub is only possible if the hub is open (“
”).
The capacity of the hub can be given as (5).
Ensuring mission feasibility is taken into account through pre-computed (6): the point can be fixed to spoke k only if spoke k is connected to such a hub j, which makes the mission executable.
If there is a budget, then a limit is imposed (7).
Typical optimisation goals include minimising the combined opening cost and average response time, minimising the maximum response time (k-centre approach), or minimising the combined risk of undercoverage. To avoid nonlinear products of variables, it is helpful during data pre-processing to compute numerical and Boolean parameters (e.g., matrices and time matrices) that preserve the linearity of the MIP model.
For illustration, we will give a small numerical example. Suppose one type of UAV with parameters: speed “”, autonomy “” h (30 min), mission time “” and reserve “”. The available time for going back and forth is “” (20 min). Multiplying by the speed, we get a total permissible round-trip distance of 60⋅0.333 = 20 km, so the maximum one-way distance from spoke to the point “” km. If, for example, a point has coordinates (10.9) and spoke k—(8.8), then “”, distance “”. The round-trip distance is approximately 4.4729 km, which is significantly less than the permissible 20 km. Time for round trip 4.472 min; total time with mission time and reserve 4.47 + 5 + 5 ≈14.47 min, which is autonomous for 30 min, that is, such a mission is feasible. This example demonstrates how to immediately obtain a Boolean feasibility value (1 or 0) from the parameters of the platform and the geometry of the point to fill the matrix “” after which the optimisation problem is solved as an MIP.
From the point of view of infrastructure planning, it is also advisable to lay out escalation scenarios: in the event of extensive fires or difficult access, increase the number of temporary forward points and, if necessary, mobile nodes to involve neighbouring strategic hubs. This approach allows for maintaining the system’s flexibility and adaptability without a significant initial investment in the full number of stationary sites.
The hierarchical hub-and-spoke scheme, together with distributed autonomous platforms, provides a balanced approach: centralised logistics from hubs provides stable technical support, and spokes reduce response time and expand the operational radius of UAV/UGV operations. Platform parameters (speed, mission time, reserve), terrain, supply routes and quality of communication should be taken into account in planning; duplication and retransmission capability should be provided at critical points. The final solution is based on formalised optimisation and verification through simulations and pilot deployments, ensuring reliability in real-world conditions.
3.3. Stationary Sensor Network Placement
The stationary sensor network is designed as an adaptive grid with variable pitch: node density increases in high-risk areas, and additional sensors are located around the perimeter and in natural wind corridors for early warning and quick determination of flame propagation directions. The architecture combines energy-efficient peripheral nodes in a mesh topology with reliable backbone nodes that have better communication channels and power supplies to aggregate and transmit data to a central station. Sensor classes include thermal, humidity, and smoke and gas sensors; low-speed cameras or acoustic sensors are used as needed. Power is mainly from solar panels and batteries, with consideration of alternatives in areas with limited solar resources. For the periphery, it is recommended to use LoRa/LoRaWAN as an energy-efficient channel, while backbone nodes provide transmission via LTE/5G or satellite channels; repeaters at height or supports are installed at critical points.
When planning the placement of sensors and backbone nodes, it is important to formalise coverage and connectivity problems to move from qualitative descriptions to specific design and validation solutions. Let the set notation and formalisation principles be as follows: the set of candidate positions for placing sensors is denoted by I (index i), and the set of points of interest (discretisation of space) by J (index j). The binary variable “” takes the value 1 if the sensor is set in position I; “”—1 if the backbone/hub is placed in position I. The parameter shows whether position I covers point j. The cost of installation and equipment for position I is indicated by “”, and the total budget by B.
Moving to the calculation of the fundamental problem of coverage, the main goal is to minimise the number of installed sensors (or the total cost) under the condition of covering points of interest. To do this, we will use the Formula (8) and the restricted Formula (9).
If full coverage is not possible due to the budget, tasks are formulated as maximising important coverage. Then the importance “
” (risk) of point j is introduced, and the binary variable “
” to indicate that the point is covered, and then the main goal is to maximise (10) with a fixed B.
For increased reliability, it is necessary to accept the requirement of at least “
” sensors at critical points “
”. This can then be calculated using the Formula (11).
It is also necessary to pay attention to refusals. If the probability of sensor failure I is denoted by “
”, the problem can be posed as ensuring that the probability of uncritical non-overlap of point j is less than the permissible threshold. This comes down to redundancy requirements (k-coverage). Also, to minimise the detection time, we can determine the expected detection time “
” for each point j, which depends on the location of the sensors, the interrogation frequency and transmission delays. Furthermore, it is calculated according to the Formula (12).
Let us go to an example. Let us discretise the area by representing it as the set J of |J| = 100 points. A set of candidate positions for sensors I with |I| = 50. Suppose that the nominal coverage radius for each peripheral node is R = 150 m (the radius within which the model provides a sufficient signal/detection probability), and the budget allows no more than B = 15 sensors. Our problem statement is to choose up to 15 sensors to maximise the sum of the weights of the covered points “” where “” if the point is covered when selected “”.
Then we first form the coating matrices. To do this, we calculate the Euclidean distance (Euclidean distance) “” for each candidate position I and each point j. If “”, then “”, otherwise 0. Now, let us determine the weights (risks). These will be determined based on historical fire data and landscape characteristics. However, for example, let us take the following points:
30 points is a high risk and their weight “”;
50 points is the average risk and their weight “”;
20 points is a low risk and their weight “”.
Then the total maximum potential weight is “”.
To demonstrate calculations, we will apply an iterative approach. At each step, we calculate the increase in the total covered weight if each candidate position is added. Then we select the position with the most significant increase, mark the covered points, and repeat until the budget is exhausted. Take three iterations:
Iteration 1: Found position A that covers 20 points: 10 high (“”), 8 medium (“”), 2 low (“”) then the weight gain is 44.
Iteration 2: Position B gives 18 new points: 6 high (18), 9 medium (13.5), 3 low (3) then the weight gain is 34.5.
Iteration 3: Position C gives 12 new points: 4 high (12), 6 medium (9), 2 low (2) then the weight gain is 23.
Thus, repeating similar steps at 15 positions, we achieved a total coverage of 28 out of 30 high-risk points, 45 out of 50 medium-risk points, and 18 out of 20 low-risk points. Then the total weight of the coating is equal to “”. Now we can calculate the relative coverage by weight “”, that is, 91.6% of the maximum possible important coverage.
3.4. System Components
Each hardware layer contributes a distinct capability to the monitoring pipeline. The SN layer comprises thermal and infrared cameras, electrochemical smoke and gas analysers, meteorological stations, acoustic detectors, and multispectral nodes distributed across fire-prone zones. Continuous acquisition of temperature, humidity, smoke density, and gas concentration enables anomaly detection prior to visible ignition. The UAV layer carries high-resolution optical, infrared, and thermal imaging payloads. Aerial platforms reach alert locations within minutes, supply georeferenced imagery for alarm verification, and serve as airborne relay nodes that extend communication range over complex terrain. The UGV layer operates at ground level: platforms are fitted to transport fire-suppression agents and deploy physical firebreak structures, enabling precise localisation of hot spots and initial containment in areas inaccessible or unsafe for human crews.
The central control station consolidates all incoming data, performs multi-level processing using analytical algorithms and machine learning models, and generates up-to-date risk maps and short- and long-term forecasts of fire spread. Based on this data, the system also dynamically prioritises resource deployment and suggests optimal launch positions from strategic hubs to closer forward points and mobile logistics nodes, taking into account the operational radius of the platforms, terrain, and communication quality. Real-time information is transmitted to subsystems (SN, UAV, UGV, and communication relays), enabling rapid deployment of drones and ground platforms, including the use of repeaters or UAV relays in areas with poor coverage, and providing redundancy for critical functions to increase resilience. This coordinated effort significantly reduces response times and minimises economic and environmental damage.
3.5. Interaction and Reliability of Subsystems
Operationally, the system executes a closed-loop sequence of perception, inference, and action. The SN layer provides a continuous stream of environmental telemetry—temperature, humidity, smoke concentration, and gas levels—from instrumented high-risk zones. On anomaly detection, readings are forwarded via encrypted links to the central station, where they are fused with historical baselines and contextual data to refine risk estimates and trigger the appropriate response tier.
To formalise the system’s reliability, it is appropriate to consider three key subsystems: the sensor network (SN), mobile platforms (UAV and UGV), and the communication infrastructure (central station and communication channels). In the simplest case, the system functions when all these subsystems are operating, and the overall probability of failure-free operation is given by the product of their reliabilities (13).
However, the sensor subsystem usually has a higher risk of failure due to environmental factors, limited power supply, and possible physical damage, so it is advisable to duplicate it by creating independent sensor clusters and backup sensors. The combined reliability of two independent sensor clusters with the same reliability “
” is equal to (14).
The redundancy of communications, implemented through LTE, 5G, satellite channels, and radio relay lines, is modelled as parallel channels and calculated using Formula (15).
For example, suppose that the basic reliability of the sensor network is “ = 0.70” (lower due to the increased risk of failures), the reliability of the UAV, “”, the reliability of the UGV “”, and in the absence of communication reserves “”. Then the basic overall reliability of the system without duplication of sensors and without channel redundancy is “”. If you introduce duplication of sensor clusters (two independent clusters) such that “”, and add backup communication channels (LTE = 0.80, 5G = 0.70, SAT = 0.90), the combined reliability of communications is “”, and then the overall reliability of the system increases to about “”. Such a transition from 0.495 to 0.799 demonstrates that duplicating the vulnerable sensor subsystem, combined with the multi-channel communication, provides a significant increase in reliability.
Following risk assessment, the central station issues prioritise deployment orders. UAVs proceed to the flagged zone, transmitting real-time thermal video and telemetry that either confirm or dismiss the alert and allow for incremental refinement of the risk map. Upon confirmation, UGVs are dispatched for ground-level investigation, firebreak establishment, and initial containment. Throughout this sequence, the SN continues uninterrupted data transmission, supplying the situation picture that synchronises the actions of all mobile assets.
When planning and executing operations, the central station considers the ground infrastructure, including the locations of strategic hubs, forward points, and mobile logistics nodes. Decisions as to which site to initiate a mission from are made taking into account the operational radius of specific platforms (taking into account speed, available mission time and return reserve), terrain and availability of logistics corridors; in case of complicated communication coverage, options using repeaters or UAV relays are automatically offered, as well as moving mobile nodes closer to priority areas.
Communication integrity across all nodes is maintained through a layered redundancy strategy: UAV and UGV platforms double as airborne and ground-level relay nodes, sustaining data flow in topographically complex or signal-degraded environments. Primary wireless links are backed by parallel channels—LTE/5G, satellite uplinks, and short-range radio relay—so that no single link failure interrupts the operational pipeline. The modular system architecture further accommodates the incremental addition of sensors and platforms without service interruption.
Autonomy is provided by built-in machine learning models that can make local decisions in the event of a loss of communication with the centre: analyse local indicators, predict the spread of flames, and initiate basic response measures. Equipment self-diagnosis mechanisms are also implemented, including monitoring node states, detecting failures, and automatically switching to backup modes to minimise the risk of data or functionality loss.
The central station is operated by trained emergency management personnel—typically regional fire dispatch officers or civil protection coordinators—who retain final decision authority over all response actions. The system is designed to support, not replace, human judgement. Alert levels are communicated through a structured interface: a yellow alert (composite risk score 0.50–0.74) displays the triggered sensor readings, the affected cluster on the risk map, and a recommended UAV sortie with the proposed launch site, allowing the operator to confirm or override the automated suggestion. A red alert (score ≥ 0.75, or UAV-confirmed ignition) presents a prioritised action list—UGV deployment routes, fire-agent quantities, and inter-spoke coordination options—and initiates autonomous pre-positioning of platforms, while requiring explicit operator confirmation before irreversible actions such as extinguishing agent release. All alert decisions, operator overrides, and system recommendations are logged with timestamps to support post-event review and model calibration.
The resulting architecture—persistent SN-based monitoring, hub-and-spoke-aware resource allocation, multi-channel communication redundancy, and on-board autonomous inference—shortens the interval from anomaly detection to effective response, thereby limiting fire spread and reducing the associated environmental and economic damage.
3.6. Architecture of a Neural Network System with Clustering
For validation of the fire and smoke detection component, domain-specific datasets are used: the FLAME dataset [
23], which contains UAV-captured aerial footage of fires in forest environments, and the FireNet benchmark [
8], comprising real fire and non-fire images suitable for embedded deployment validation. These datasets are directly relevant to wildfire detection and enable credible evaluation of the proposed neural network architecture under realistic operational conditions.
The neural network processing subsystem ingests heterogeneous streams from stationary sensors, UAV onboard sensors, UGV telemetry, and auxiliary peripherals. Its purpose extends beyond real-time data aggregation: by applying clustering and predictive modelling to multi-source fused features, the subsystem converts raw measurements into calibrated risk scores and early-warning signals that drive proactive platform dispatch.
Figure 4 presents the architecture of the data processing and clustering subsystem.
It should be noted that
Figure 4,
Figure 5,
Figure 6 and
Figure 7 are reproduced from the companion study [
6], which introduced the conceptual UAV–UGV–SN framework and the underlying neural network architecture for risk-map generation. They are retained here to provide the reader with a complete picture of the data processing pipeline on which the present work builds. The original contributions of this article do not concern these conceptual diagrams. They lie in the formal MIP-based hub-and-spoke deployment model (
Section 3.2); the engineering-level data pre-processing pipeline with confidence scoring, Kalman/median filtering, and fusion strategy selection (
Section 3.7); the six scenario-based operational modes with defined triggers, outputs, and KPIs (
Section 4); and the validation protocol and sustainability KPI framework (
Section 4 and
Section 5). These four components are entirely new and are not present in [
6].
The architecture consists of several interconnected blocks:
Data acquisition layer. Environmental readings and platform telemetry are ingested from the distributed SN nodes, UAV onboard sensors, UGV modules, and hub service records. Spatial and temporal metadata (UTC timestamps, GPS coordinates) are attached to each packet at source to support downstream georeferenced analysis.
Pre-processing layer. Raw streams arrive in heterogeneous formats and may contain sensor noise, transmission gaps, or sampling-rate mismatches. This stage performs format unification, resampling to a common time step, Kalman and median filtering for noise suppression, short-gap interpolation, longer-gap imputation, feature normalisation, and confidence-score assignment based on sensor health and link quality.
Neural network processing and clustering layer. Cleaned, normalised feature vectors are passed to machine learning models that extract latent correlations including cross-modal relationships among humidity, temperature, and gas levels, and automatically forms clusters of territories based on parameter similarities. Cluster analysis allows for grouping areas by risk level and identifying priority areas for preventive or operational actions.
Dynamic risk-map generation. Cluster assignments and composite risk scores are projected onto a georeferenced territorial grid to produce an interactive threat-level visualisation. The map refreshes continuously as new sensor readings arrive and serves as the primary decision-support interface from selecting the optimal hub or spoke for the next sortie to assigning specific mission profiles to individual UAV and UGV units.
Fire prediction subsystem. Prediction modules based on historical data and current indicators analyse indicator development over time and assess the likelihood of fire in specific clusters in the near future. This allows preventive measures to be initiated in advance.
Automatic learning and adaptation. The system constantly replenishes its database with new data and regularly updates clustering and forecasting models, thereby improving estimate accuracy and adapting to seasonal/environmental changes. In addition, validation and training quality control mechanisms are implemented to prevent model degradation over time.
Taken together, the subsystem implements a closed inference loop: raw multi-source data are ingested, cleaned, clustered, and scored, and the resulting risk map directly feeds mission-planning logic. Embedding hub, spoke, and mobile-node location data into this loop grounds predictive outputs in actual platform reach and logistics constraints, enabling risk forecasts to be translated directly into executable sortie plans and substantially reducing the probability of uncontrolled fire escalation.
3.7. Detailed Technology of Neural Network Cluster Monitoring
The technology of neural network-based cluster monitoring of forest fire risks is conceived as an engineering-representative pipeline that combines the collection of heterogeneous data, their preliminary processing, algorithmic clustering, and integration of results with the operational management of UAV/UGV. Data sources include stationary ground sensors (temperature, relative humidity, CO2/CO gas concentrations), sensors on support poles and forward points, UAV onboard thermal and optical cameras, UGV data, and historical climate databases; each data packet has a timestamp (UTC) and georeferencing (lat/lon). At the data pre-processing stage, all flows are resampled to a single time step, undergo noise filtering (median or Kalman filters for time series), interpolation of short gaps, imputation of longer gaps, and normalisation of features to unify input vectors and ensure algorithm stability. To increase data confidence, each packet is assigned “”, depending on the state of the sensor and the quality of the communication. This value is used in data aggregation and in the calculation of composite risk.
Trait formation combines short- and long-term statistics, including hourly temperature changes, moving averages, drought indices, and temperature-to-humidity ratios, and provides image extraction of traits using pre-trained CNNs. Data fusion is implemented in two modes: early fusion, which merges normalised sensor and meteorofits into a single vector before clustering; and late fusion, which uses separate modules for images and time series, followed by merging results at the feature or solution level. Composite “” for each territorial cell is calculated as the weighted sum of normalised components, for example “” denotes Ĥ normalised values, and w—weights; “”, “” (inverse component since low humidity increases the risk), “” and “” is chosen as a working example. For operational logic, “ = 0.5” (yellow) and “” (red) are proposed, which can be adapted to local conditions if necessary after calibration on historical data.
Clustering can be performed by different methods depending on the requirements: for an interpreted and fast approach, k-means is used (practical k = 3 for division into low/medium/high risk), for the detection of irregular local clusters—DBSCAN (eps parameters, “”), for uncertainty in belonging—a Gaussian Mixture, and for fuzzy features, it is advisable to use an autoencoder with further clustering in the latent space. The choice of the number of clusters and parameter settings is justified by clustering quality metrics (silhouette score and Davies–Bouldin) and domain requirements (e.g., the need to display results on the map in the format “green/yellow/red”). Operational integration assumes that, when exceeded, the system automatically creates a task to check the UAV, selecting the nearest hub/spoke based on the minimum departure time criterion. When an image or additional sensor data confirms the ignition, a task is generated to deploy UGVs and ground crews. In a work project, target SLAs can be as follows: time from detection to departure of the UAV ≤ 15 min, and time from confirmation to deployment of the UGV ≤ 30–60 min, depending on the distance.
Model training is proposed to take place offline on historical data using time-series cross-validation. The primary metrics are ROC-AUC for early ignition detection, precision/recall/F1 for ignition events, and silhouette and Davies–Bouldin for clusters. To control drift, monitoring of changes in the distributions of figs (PSI, KL) and the medium “”. When significant drift is detected, model overproduction is triggered. The system is designed to support online learning or incremental updates, enabling adaptation to seasonal and environmental changes, but at the same time it is necessary to implement quality control of learning (validation on hold-out, testing on retrospective events) to prevent degradation.
Fault tolerance is achieved by duplicating channels at critical points, such as key locations, using several independent sensors. Majority voting or weighting by e will also be used in case of discrepancies. Moreover, packets are proposed to be locally stored on hubs in case of communication loss and to provide backup transmission channels.
3.8. Algorithm for Using Neural Networks
The monitoring and forecasting algorithm operates across two complementary temporal scales. A static baseline layer is established from multi-year climate records for each territorial cell, capturing long-run patterns of temperature, humidity, drought index, and fire history. This baseline defines the normal operating envelope and informs the initial placement of strategic hubs and forward spokes by revealing zones with persistently elevated ignition probability.
Concurrently, a real-time acquisition layer streams current readings from the distributed sensor network. Nodes continuously log temperature, humidity, barometric pressure, and ancillary environmental indicators, providing the operational deviation signal against which the baseline is compared. Current images and measurements are visualised on a separate map of the current state (
Figure 5).
Fusing historical baselines with live sensor streams gives the neural network models their primary analytical advantage: the dual-timescale input allows the system to distinguish genuine pre-ignition anomalies from seasonal noise, improving both sensitivity and specificity. Cluster analysis partitions the territory into risk-homogeneous zones and assigns each an ignition-probability estimate; a cell exhibiting historically moderate temperature that suddenly records a sharp rise alongside a rapid humidity drop transitions to the high-risk category.
The outputs of this analysis are rendered as a continuously refreshed georeferenced risk map coded on a three-level scale: green denotes stable conditions within historical norms; yellow signals elevated risk requiring heightened monitoring; red indicates probable ignition onset and triggers immediate response actions.
Elevation of any territorial cell to yellow status automatically triggers a UAV inspection sortie. The mission-planning module selects the departure site hub or forward spoke by minimising estimated flight time to the flagged cell subject to platform endurance constraints and current terrain accessibility. During flight, the UAV streams georeferenced thermal and optical imagery in real time; the central station and operator jointly assess the footage against the SN signals to accept or reject the alarm. A confirmed ignition automatically generates downstream task assignments for UGV deployment and updates the risk map to reflect the verified threat boundary.
If the drone inspection confirms that a fire has already started or is in its early stages, the system immediately switches to extinguishing mode. The neural network instantly updates the area’s status on the map, marking it in red to signal the need for urgent intervention. At the same time, aviation and ground resources are deployed: UAVs conduct aerial monitoring and apply initial extinguishing measures, while UGVs and ground units are dispatched to conduct local operations to eliminate the fire’s source (
Figure 7).
Black arrows indicate data exchange channels between the UAV, UGV, node (which collects information from the SN and transmits it to the hub and also serves as a forwarding point for drone data), and the hub.
The algorithm thus implements a complete sense-assess-act cycle: territorial state is evaluated by fusing long-term climate data with live sensor readings; the resulting risk map flags anomalies for UAV verification; confirmed events escalate to coordinated aerial and ground suppression. Zone risk scores feed back directly into launch-site selection, closing the loop between situational awareness and infrastructure utilisation. The net effect is a reduction in mean detection-to-response latency and a corresponding limitation of fire spread extent.
3.9. Advantages of Using Clustering
Clustering provides the analytical backbone through which raw heterogeneous telemetry is converted into actionable territorial intelligence. By partitioning the monitored area into coherent risk groups, cluster analysis yields three operational advantages. First, zone-level resolution: aggregating sensors with similar readings identifies the highest-risk micro-regions more reliably than individual node thresholds. Second, resource targeting: mission planners can concentrate UAV sorties, pre-positioned UGVs, and suppression reserves in high-risk clusters rather than distributing them uniformly across the entire territory. Third, infrastructure rationalisation: cluster boundaries and centroid locations inform optimal hub and spoke placement, shortening average sortie distance within each risk tier and strengthening overall system responsiveness.
To illustrate the practical value of this approach, consider a regional fire management authority operating with a constrained seasonal budget. Without clustering, resources must be distributed across the entire monitored territory based on broad administrative boundaries. With the proposed system, the neural network partitions the territory into three risk clusters—low, medium, and high—updated continuously from SN and UAV data. A regional manager consulting the risk dashboard can immediately identify that, for example, 15% of the monitored area falls into the high-risk cluster and concentrate preventive UAV patrols, pre-positioned UGVs, and firefighting chemical reserves exclusively in those zones. The remaining 85% of the territory, classified as low- or medium-risk, requires only routine SN monitoring and periodic aerial checks. This targeted allocation can reduce preventive deployment costs by a substantial margin while simultaneously increasing coverage density and therefore detection probability in the zones that matter most.
5. Discussion
5.1. Analysis of Architectural and Methodological Contributions
The concept of an integrated UAV–UGV–SN system for early incident detection, rapid verification, and response coordination represents a substantive advance over isolated platform approaches. The results demonstrate that the six operational scenarios collectively span the full wildfire management cycle—from AI-driven early warning and alarm verification through autonomous extinguishing, post-fire recovery, sensor-gap compensation, and inter-hub reinforcement. Technical approaches including combined telemetry processing, multi-sensor fusion, and HIL validation of algorithms enable a practical balance between alert accuracy and response speed, which is critical for smart region operations.
The hub-and-spoke deployment model formalised as a mixed-integer programming problem provides a rigorous, reproducible basis for infrastructure planning. The pre-computed feasibility matrices preserve the linearity of the MIP formulation while accounting for platform-specific endurance and terrain constraints—a practical improvement over heuristic placement approaches found in the literature.
5.2. Limitations, Practical Risks, and Architectural Considerations
The proposed framework shows how UAVs, UGVs, and distributed sensor networks can be integrated into a unified wildfire monitoring and response system, but it should not be treated as a fully validated operational solution. This should be taken into account when interpreting the results and defining the scope of this work.
Several limitations must therefore be considered. The effectiveness of the system depends on the quality, consistency, and availability of heterogeneous data obtained from ground sensors, UAV imagery, UGV telemetry, and environmental databases, as well as on the reliability of the communication infrastructure. In real deployment conditions, unstable wireless coverage, delays in transmission, missing data, and uneven spatial coverage may reduce both the accuracy of risk assessment and the timeliness of operational decisions.
The neural network and clustering modules are included as configurable elements of the processing pipeline rather than as fixed predictive models with a single predefined implementation. Their exact architecture, hyperparameters, and training settings may vary depending on the available datasets, sensor configuration, and operational conditions. For this reason, the present study does not aim to define a benchmark-ready machine-learning model, but to describe the system-level workflow in which such models operate.
In the same context, the risk maps presented in
Figure 5 and
Figure 6 should be interpreted as operational outputs of the pipeline “data collection–pre-processing–clustering–risk estimation–decision support”. They are intended to illustrate how heterogeneous data can be transformed into actionable situational awareness for mission planning and resource allocation, rather than to serve as quantitatively validated predictive maps. A rigorous evaluation of prediction accuracy, clustering quality, or map reliability would require large-scale ground-truth datasets obtained from real integrated UAV–UGV–sensor deployments, which are currently not available.
The proposed hub-and-spoke deployment model and scenario-based operational logic provide a scalable basis for implementation. However, their practical efficiency must still be confirmed through pilot deployments, hardware-in-the-loop testing, and field experiments. Real operating conditions such as terrain constraints, adverse weather, hardware failures, battery limitations, and human-in-the-loop decision processes may significantly influence system behaviour.
In addition to these architectural limitations, several practical risks must be considered. The quality and density of sensor network data may vary significantly across deployment regions, affecting model calibration and the reliability of risk scores. Unreliable communication channels, insufficient energy infrastructure, and uneven spatial coverage may reduce system availability at critical moments, particularly in remote or conflict-affected areas. Classification errors and false positives also require careful threshold tuning and human-in-the-loop verification to avoid unnecessary resource mobilisation. At the same time, social, legal, and ethical aspects remain relevant: continuous aerial and ground data collection may raise privacy concerns, while deployment of sensor infrastructure may face community resistance without transparent engagement. In addition, the environmental footprint of the infrastructure itself—including battery production and disposal, as well as the energy consumption of hubs and charging stations—should be taken into account in long-term life-cycle planning.
Compared with approaches based only on satellite monitoring, stationary sensor networks, or standalone UAV missions, the proposed architecture offers greater integration and operational flexibility but also introduces higher system complexity and stronger infrastructure dependence. Therefore, the results of this work should be interpreted primarily as a methodological and architectural contribution that establishes the basis for further simulation studies, quantitative validation, and real-world pilot implementations.
5.3. Impact on Sustainable Development of Smart Regions
The integrated UAV-UGV-SN system contributes across three pillars of regional sustainable development. Environmentally, early detection and rapid localisation of ignition events reduce the burned area, limit CO2 and particulate emissions, preserve biodiversity habitats, and mitigate soil erosion. Socially, shorter detection-to-verification and verification-to-response times reduce risk to human responders and lower public exposure to smoke pollution, while the system’s integration into regional early-warning infrastructures supports broader community resilience. Economically, reducing the scale of fires decreases direct losses to forestry and property, and enables more efficient allocation of fire-fighting resources.
Practical impact assessment is supported by a three-tier KPI framework: environmental KPIs (reduction in burned area, biodiversity indices, soil degradation metrics); social KPIs (detection and response times, number of protected persons and assets, community trust indicators) and economic KPIs (reduction in direct damage costs, cost–benefit ratio). Technical performance metrics—precision, recall, F1, system availability, false alarm frequency—are integrated as operational sub-indicators within this framework. A phased implementation approach is recommended: baseline measurement, pilot deployment with iterative KPI collection, scalable rollout with cost–benefit analysis, and annual public reporting aligned with regional digital twin dashboards.
A structured assessment of the environmental footprint of the UAV-UGV-SN infrastructure requires consideration of three life-cycle phases. During the production phase, the dominant impact arises from lithium-ion battery manufacturing a process associated with significant CO
2 emissions, water consumption, and the extraction of critical minerals (lithium, cobalt, nickel); published LCA studies of commercial UAV platforms estimate production-phase carbon footprints in the range of 20–80 kg CO
2-equivalent per unit, depending on battery capacity and platform size. During the operational phase, energy consumption is the primary driver: hub charging stations and SN backbone nodes require continuous power supply, which, if grid-sourced, contributes to the system’s carbon footprint. This is mitigated in the proposed architecture through solar panels and battery storage at forward points, as well as energy-efficient LoRaWAN communication for peripheral SN nodes. During the end-of-life phase, lithium battery packs and drone electronics require dedicated recycling pathways; unmanaged disposal poses risks of heavy-metal leaching and loss of recoverable critical materials. Mitigation measures built into the proposed system design include renewable energy supply at hubs and spokes, modular platform architectures that extend hardware service life through component replacement, minimally invasive hub structures, and explicit component disposal plans as a contractual requirement in the KPI framework. A full quantitative LCA integrating empirical energy consumption data from field-deployed platforms and regional electricity grid emission factors is identified as a priority direction for future work (
Section 5.4).
5.4. Directions for Further Research
From a technical perspective, future work should focus on improving system resilience through distributed edge computing, adaptive energy management, and federated learning to preserve privacy while improving models in heterogeneous environments. Development of open interoperability standards for UAV-UGV-SN component interaction and automated resource coordination workflows will facilitate integration with regional information systems and digital twins. A dedicated life-cycle assessment (LCA) of the UAV-UGV-SN infrastructure covering lithium battery production emissions, operational energy consumption disaggregated by subsystem, and end-of-life recycling pathways for drone electronics and battery packs is recommended as a follow-on quantitative study, building on the qualitative framework outlined in
Section 5.3.
A further avenue for investigation concerns the communication layer of the sensor network itself. Current LoRaWAN and LDSE-based protocols optimise energy efficiency and connectivity under static network topologies, but do not adapt routing decisions to real-time fire risk maps or platform mobility patterns. Machine-learning-enhanced cluster routing protocols such as EMO-PEGASIS, which applies evolutionary multi-objective optimisation to the PEGASIS chain-based aggregation scheme offer a promising direction for UAV relay networks, where optimal cluster-head selection and chain formation must be updated dynamically as UAVs move, battery reserves deplete, and fire fronts evolve. Evaluating the applicability and performance gain of such protocols within the proposed UAV-UGV-SN architecture represents a concrete and tractable direction for future protocol-level research.
Further research directions relevant to sustainability and the smart region framework include: dynamic digital forest twin models integrating real-time sensor data with predictive fire spread simulations, citizen-science platforms and public alert applications that engage local communities in monitoring and reporting, predictive energy models that optimise infrastructure self-sufficiency and minimise carbon footprint and community response pattern modelling to refine alert dissemination strategies and foster data-driven collective resilience.
In conclusion, the proposed concept demonstrates high applied value as a tool for improving the sustainability of smart regions, but its long-term success depends on a comprehensive approach that combines technical solutions, environmental practices, regulatory mechanisms, field validation, and active stakeholder engagement.
6. Conclusions
This article developed and described a deployment-oriented engineering framework for wildfire monitoring and suppression, founded on the synchronised operation of three autonomous subsystem classes: aerial platforms, ground vehicles, and distributed stationary sensor nodes. The following conclusions are drawn in direct response to the research questions formulated in the Introduction.
A three-layer architecture was proposed, in which UAV swarms provide rapid aerial inspection and thermal imaging coverage, UGV swarms enable ground-level intervention in hazardous or inaccessible areas, and a stationary SN performs continuous environmental monitoring of temperature, humidity, CO2, and smoke indicators. Integration is achieved through a central processing station coordinating all subsystems via a hierarchical hub-and-spoke topology. Communication redundancy (LoRa + LTE/5G + satellite relay through UAVs), autonomous power supply at forward points, and edge processing capability ensure operational continuity across diverse terrain and adverse conditions, including conflict-affected areas.
Infrastructure placement was formalised as a mixed-integer programming (MIP) problem with binary decision variables for hub and spoke opening, coverage constraints ensuring all points of interest are served, feasibility matrices encoding platform endurance and geometric reachability, spoke-to-hub assignment constraints, hub capacity limits, and an optional budget constraint. The objective function minimises combined infrastructure cost and average response time. Pre-computing Boolean feasibility values from platform parameters and terrain geometry preserves linearity and enables practical solution using standard MIP solvers. A numerical example demonstrated that the approach can rapidly evaluate mission feasibility and populate the A_ijk matrix, supporting automated deployment planning.
A structured data pre-processing pipeline was proposed, comprising: packet-level confidence scoring based on sensor health and communication quality, resampling of heterogeneous data streams to a unified time step, noise reduction via Kalman filters for time series and median filters for spatial data, interpolation of short gaps and imputation of longer gaps, feature normalisation for algorithm stability and data fusion in either early mode (merging normalised vectors before clustering) or late mode (separate processing of image and time-series streams, merged at decision level). A composite risk score (Equation (8)) weighted across temperature, humidity, CO concentration, and historical risk components enables real-time territorial risk assessment. Clustering methods (k-means, DBSCAN, Gaussian Mixture Models) applied to fused feature vectors support dynamic risk map generation with interpretable three-level outputs.
Six scenario-based operational modes were proposed and specified: Sc1 (AI-driven forecasting and early warning, triggered by composite risk score exceeding threshold_warn), Sc2 (early detection and alarm verification through autonomous UAV inspection), Sc3 (autonomous local extinguishing coordinating UAV aerial suppression and UGV ground response), Sc4 (post-fire recovery mapping for re-ignition risk and erosion assessment), Sc5 (temporary sensor-gap compensation using UAVs as mobile sensor relays) and Sc6 (inter-hub reinforcement and resource reallocation during large-scale events). Validation is planned to combine synthetic dataset experiments with precision/recall/F1 metrics for detection quality, ROC-AUC for early-warning reliability, silhouette and Davies–Bouldin indices for clustering quality, and hardware-in-the-loop (HIL) testing for real-time system integration and communications.
Deployment of the proposed system is expected to contribute across all three pillars of sustainable development. Environmentally, early ignition detection and rapid response have the potential to reduce burned area, limit CO2 and particulate emissions, and support the preservation of biodiversity habitats and watershed integrity. Socially, shorter detection-to-verification and verification-to-response times are designed to reduce the risk to emergency responders and public exposure to smoke, while integration with regional early-warning infrastructure is intended to strengthen community resilience. Economically, containing fires at an earlier stage reduces direct damage costs and improves the cost-efficiency of fire-fighting resource deployment. A three-tier KPI framework—covering environmental, social, and economic indicators, with technical performance metrics as operational sub-indicators—provides a measurable basis for ongoing impact assessment and integration with regional digital twin platforms.
In summary, the proposed framework bridges the gap between conceptual UAV-UGV-SN research and practical deployment by combining a formalised MIP placement model, an engineering-grade data pre-processing pipeline, six described and structured operational scenarios, and a KPI-driven sustainability assessment protocol. Future work should prioritise pilot field deployments to validate the proposed architecture under real conditions, refine the optimisation model with empirically measured platform parameters, and expand the system toward full digital twin integration at the smart region scale.