1. Introduction
In recent years, IoT technologies have been increasingly used in various sectors, including environmental monitoring, intelligent transportation, industrial automation, and urban infrastructure management, resulting in a rapid growth in the number of low-power wireless devices operating in large-scale networks [
1]. The rapid increase in the number of connected devices, expected to exceed 40.6 billion by 2034 from 19.8 billion in 2025, further amplifies the need for scalable, energy-efficient, and cost-effective IoT solutions [
2]. In such applications, sensor data is of practical value when its location is known [
3,
4]. Global navigation satellite systems (GNSS), such as GPS, remain the dominant solution for outdoor positioning and can provide meter-level accuracy in open-air conditions [
5]. However, integrating GNSS receivers into large-scale IoT deployments is often impractical due to their relatively high energy consumption, additional hardware cost, and the degradation of GNSS positioning performance in dense urban environments, where satellite signals are affected by blockage, non-line-of-sight reception, and multipath propagation [
6]. Due to these limitations, there is research into alternative localization methods that utilize radio measurements in wireless communication infrastructures, thereby eliminating the need for specialized positioning equipment [
7]. LoRaWAN has attracted considerable attention due to its long communication range, low power consumption, and the availability of public network infrastructure for large-scale urban IoT systems [
8,
9]. LoRaWAN allows battery-powered devices to operate for years, making this technology an energy-efficient solution for monitoring and location applications in smart cities [
10]. Existing approaches to localization in LoRaWAN networks can generally be divided into time-based and RSSI-based methods. Time difference of arrival (TDoA) methods use precise timestamps at multiple gateways and can achieve localization errors on the order of 10–100 m under favorable conditions [
11]. However, TDoA-based approaches rely on specialized infrastructure, including nanosecond-level synchronization between gateways, and are highly sensitive to multipath propagation, clock instability, and NLoS conditions. These practical constraints significantly limit their scalability and real-world applicability in large-scale public LoRaWAN deployments [
12].
In contrast, RSSI-based localization does not require additional hardware or time synchronization in gateways and is widely supported in LoRaWAN infrastructures [
13]. Traditional RSSI-based methods typically rely on path loss modeling or geometric triangulation. However, in complex urban environments, RSSI measurements are highly susceptible to shadowing, building density, multipath fading, and radio channel temporal variability, leading to large errors during localization using range-based models [
14]. Unlike traditional methods based on direct processing of RSSI or TDoA, ML algorithms are able to identify complex nonlinear dependencies in radio data. Fingerprint-based localization, combined with ML methods, has recently been widely studied [
15]. Various ML algorithms, including k-nearest neighbors (kNN), which estimates location by identifying nearby signal dependencies in a fingerprint database, support vector regression (SVR), which models nonlinear dependencies between RSSI features and spatial coordinates, have been widely applied in RSSI-based localization. Furthermore, artificial neural networks (ANNs), which are capable of learning complex propagation patterns using multi-layer nonlinear representations, have demonstrated superior performance in urban NLoS environments where signal behavior is highly irregular [
16]. Despite the progress achieved, ML-based LoRaWAN localization methods remain limited in large-scale public deployments. Analyses conducted on real urban datasets show that RSSI observations are inherently sparse, heterogeneous, and subject to dynamically varying gateway visibility, resulting in incomplete and non-uniform feature representations that degrade model reliability and generalization capability [
17]. Furthermore, many traditional ML models perform direct regression of geographic coordinates from raw RSSI data without incorporating physically interpretable priors. In dense urban environments, where signal propagation is strongly influenced by NLoS conditions and nonlinear attenuation patterns, such purely data-driven formulations often demonstrate instability and limited transferability across spatial regions [
18].
To summarize, considering the limitations described above, this study aims to improve RSSI-based localization in public LoRaWAN networks by introducing a hybrid framework that integrates WCL with residual learning. The proposed approach combines a physically interpretable localization prior with data-driven residual regression. This ensures improved robustness and generalization capability under sparse, heterogeneous, and predominantly NLoS urban conditions. Within this framework, multiple regression models are evaluated at the residual stage, including a MLP, kNN, Extreme Gradient Boosting (XGBoost), and Light Gradient Boosting Machine (LightGBM), enabling a systematic comparison of ML methods and neural networks under identical physical prior distributions and feature representations.
Main contributions:
Proposed a physically grounded hybrid localization framework, integrating WCL with residual learning to improve RSSI-based positioning in public LoRaWAN networks.
Conducted a systematic evaluation of multiple residual regression models, including MLP, kNN, XGBoost, and LightGBM, allowing a controlled comparison of ML methods and neural networks under identical physical priors and feature representations.
Introduced a spatial data splitting method based on grid partitioning to ensure geographic separation between training and testing regions, providing a realistic assessment of model performance in previously unseen urban areas.
2. Related Works
This paper explores the localization capabilities implemented using the intrinsic characteristics of LoRa technology, which allows for coordinate determination without the use of additional equipment or specialized infrastructure.
Table 1 provides a brief overview of outdoor LoRaWAN localization methods reported in the literature, all of which were evaluated using data collected in the city of Antwerp [
19].
The table presents the approaches proposed and evaluated by various researchers, along with the corresponding localization accuracy metrics, including the mean and median localization error and the R
2 score, where available. In [
19], a kNN-based localization method was evaluated, where the optimal value of the parameter k was determined through hyperparameter tuning, resulting in a mean localization error of 398.40 m and a median error of 273.03 m. While the kNN-based fingerprinting approach demonstrates moderate localization accuracy, it relies solely on similarity matching in the RSSI space and does not incorporate physically interpretable priors or mechanisms for spatial generalization, which may limit its robustness in heterogeneous and geographically disjoint urban environments. Authors in [
20] proposed an RSSI-based fingerprint localization method for LoRaWAN networks, achieving a mean localization error of 291.51 m using a branched Convolutional Neural Network (CNN) architecture enhanced with Squeeze-and-Excitation (SE) blocks. While deep convolutional models improve nonlinear feature extraction, they still rely on direct coordinate regression from RSSI fingerprints and do not explicitly incorporate physically interpretable priors, which may limit their robustness across spatially disjoint regions. Authors in [
16] investigated outdoor localization using LoRa technology with the application of several ML models, such as kNN, CNN, SVR, ANN, XGBoost, and LightGBM. The authors proposed a hybrid architecture that combines convolutional feature extraction with gradient-based regression, achieving the best performance with a mean localization error of 244.51 m for the hybrid model, compared to 248.72 m for XGBoost and 249.57 m for LightGBM. Despite these improvements, the approach remains fully data-driven and does not explicitly decompose the problem into coarse physical estimation and structured residual correction, leaving open questions regarding interpretability and stability under varying gateway density.
In [
21], an ensemble learning-based approach was proposed that integrates RSSI measurements with nanosecond-level timestamp information using a kNN combined with a Random Forest Regressor (RFR), achieving a mean localization error of 332.63 m and a median error of 193.63 m. TDoA-based localization relies on gateway-side infrastructure, specifically on multiple gateways providing precise and synchronized reception timestamps. In contrast, the present study focuses on localization from readily available radio features without assuming timing-based infrastructure as the main design basis.
Table 1.
Comparison of outdoor LoRaWAN localization methods.
Table 1.
Comparison of outdoor LoRaWAN localization methods.
| Paper | Method | Mean (m) | Median (m) | R2 Score | Input Features | Preprocessing | Dataset Size | Year |
|---|
| [19] | kNN | 398.4 | 273.0 | N/A | RSSI | Missing gateway receptions filled with −200 dBm | 123,529 | 2018 |
| [18] | Random Forest | 340 | N/A | 0.91 | RSSI | Missing gateway RSSI set to −200 dBm; RSS transformed into normalized/exponential/powed forms; StandardScaler; PCA with 95% retained variance, reducing 72 features to 40 components, only messages with ≥3 gateways | 55,259 | 2020 |
| Range-based | 700 | N/A |
| kNN weighted | 343 | 0.90 |
| SVR | 1155 | 0.55 |
| Linear SGD | 784 | 0.72 |
| [22] | K-means + Weighted Kernel Regression | 346.03 | 158.41 | N/A | RSSI | Keep only messages received by ≥3 gateways; discard 75,556 messages with fewer than 3 gateways; remove gateways with <1% visibility; represent missing reception as −200 dBm | 54,873 | 2022 |
| [23] | RF | 351 | N/A | N/A | RSSI | RSS transformation into normalized/exponential/powed forms; StandardScaler; PCA with 95% retained variance, reducing 72 features to 40 components; −200 dBm mapped to 0 in positive representation | 130,430 | 2023 |
| Range-based | 735.37 |
| [20] | CNN + SE | 291.51 | 147.55 | 0.93 | RSSI, SF | Remove 28 inactive gateways; RSSI representations: Positive/Normalized/Exponential/Powed; StandardScaler + MinMaxScaler | 130,430 | 2024 |
| [17] | kNN | 313.30 | 217.98 | N/A | N/A | N/A | N/A | 2025 |
Neural Network | 277.61 | 163.49 |
| [16] | k-NN | 284.58 | 160.01 | 0.9216 | RSSI, SNR, SF, Estimated Signal Power (ESP) | Each message converted to one sample; missing RSSI filled with −200 dBm, then replaced by −128 dBm; logarithmic transform + min-max normalization for RSSI; min-max normalization for non-RSSI features and targets | 130,430 | 2025 |
| CNN | 319.57 | 219.96 |
| SVR | 320.90 | 194.38 |
| ANN | 279.66 | 171.70 |
| XGBoost | 248.72 | 145.54 |
| LightGBM | 249.57 | 146.72 |
| Hybrid Model | 244.51 | 130.39 |
| Our work | WCL + MLP | 160.47 | 73.78 | 0.968 | RSSI, SNR, SF, gateway observation mask, statistical features | Removed messages without valid GPS or without reception by at least one gateway; removed gateways with activity below 1%; removed messages received by fewer than 3 gateways; built sparse RSSI/SNR matrices, binary observability matrix, and aggregated statistics | 54,874 | 2026 |
In study [
22], a hierarchical clustering-based approach for urban LoRaWAN localization was proposed, combining K-means clustering, kernel density estimation, Kullback-Leibler divergence, and weighted kernel regression. The method achieved a median localization error of 158.41 m and a mean error of 346.03 m. Although the median error is competitive, the relatively high mean error indicates the presence of large localization deviations in certain regions, suggesting sensitivity to outliers and spatially irregular propagation conditions. In study [
18], RSSI fingerprint-based localization methods were evaluated, achieving a mean localization error of 340 m, while path-loss-based ranging methods using propagation models resulted in a mean error of 700 m. In [
23], a Random Forest model was applied for distance estimation, followed by modified trilateration, using RSSI, Signal-to-noise ratio (SNR), and Spreading Factor (SF) as input features. The proposed approach achieved a mean localization error of 735.37 m, whereas fingerprint-based localization attained a mean error of 351 m. These studies demonstrate that range-based localization methods, whether based on analytical path-loss models or ML-assisted distance estimation, remain highly sensitive to NLoS conditions and urban channel variability, leading to significantly higher localization errors and limited robustness in practical LoRaWAN deployments. Authors in [
17] proposed ML approaches, including the kNN algorithm and neural networks, which were evaluated, achieving mean localization errors of 313.30 m and 277.61 m, respectively. While optimized hyperparameter tuning yields noticeable improvements, the study primarily focuses on model-level optimization rather than architectural reformulation of the localization pipeline.
Overall, although RSSI-based LoRaWAN localization has progressed significantly, most existing methods rely either on purely data-driven regression or analytical ranging models, without combining a physically grounded coarse prior with residual learning. This limitation motivates the development of hybrid frameworks that enhance robustness and spatial generalization.
4. Results and Discussion
This section presents a comprehensive evaluation of the proposed hybrid framework under both spatial and random splitting strategies. The analysis examines localization accuracy using mean error, median error, and R2 metrics, as well as the spatial distribution of prediction errors across the deployment area. A spatial split is employed to evaluate the model’s generalization capability to previously unseen geographic regions, while a random split is used as a baseline reference scenario.
Table 4 presents a comparative evaluation of baseline ML models and their WCL-enhanced counterparts for outdoor LoRaWAN localization. The results demonstrate that the integration of WCL as a coarse localization prior leads to a substantial improvement in positioning accuracy for all considered models. In particular, the proposed WCL + MLP approach achieves the lowest mean and median localization errors of 160.47 m and 73.78 m, respectively, along with the highest R
2 score of 0.968. Compared to the baseline MLP model, which yields a mean localization error of 226.45 m, the incorporation of WCL reduces the mean error by approximately 29%, confirming the effectiveness of the proposed hybrid approach. This significant reduction highlights the advantage of combining physics-based coarse localization with data-driven learning, resulting in improved robustness and accuracy under heterogeneous and sparse LoRaWAN measurement conditions.
Table 4 also reports the standard deviation of localization error (Std), the maximum localization error (Max), and the huge-error rate for errors above 1000 m, which provide a more explicit characterization of error dispersion and severe failure cases. A limited number of very large localization errors is still present for all compared methods, which explains the relatively high Std value. The best-performing configuration of MLP was obtained with a learning rate of 0.0012, dropout of 0.2, Huber delta of 20, and a batch size of 256.
Table 5 presents the localization performance of the ML models and their versions with WCL under spatial splitting. The best results are achieved by the WCL + MLP model, with a mean error of 157.95 m, a median error of 55.59 m, and the highest R
2 value (0.962). Compared to the pure MLP model (218.90 m), the addition of WCL reduces the mean error by approximately 28%, confirming the effectiveness of the hybrid approach. Overall, integrating WCL improves both accuracy and robustness, with WCL + MLP demonstrating the most reliable performance among all evaluated methods.
Table 5 further reports the standard deviation of localization error (Std), the maximum localization error (Max), and the huge-error rate for errors above 1000 m, providing a more detailed view of error dispersion and extreme failure behavior under geographically unseen conditions. Although a small number of very large localization errors remain, which leads to relatively high Std values across all models, the proposed WCL + MLP model still shows a lower spread of errors and a lower frequency of severe failures than the compared methods.
While
Table 4 and
Table 5 were introduced to characterize uncertainty and severe failure cases, they also allow an empirical interpretation of the residual learning effect. The reduction in mean and median localization error suggests improved correction of systematic error, whereas the lower standard deviation and lower huge-error rate indicate reduced dispersion and fewer severe failures. Thus, the residual formulation appears to improve both bias-related and variance-related components of localization error.
To further confirm the claim of reduced effective training complexity, loss curves were plotted for the baseline MLP and the proposed WCL + MLP model (
Figure 6).
The results show that WCL + MLP converges more favorably and achieves a lower loss function. This confirms that the introduction of a physically based WCL prior makes the second-stage problem more structured and robust to learning.
Figure 7 shows the localization error distribution function for the machine learning models and the WCL model under random splitting. The WCL models generally produce left-shifted curves, indicating smaller localization errors over most of the distribution.
This trend is particularly noticeable near 90%, where the best WCL models remain significantly lower below error thresholds than their corresponding baseline models. Specifically, the 90% errors are below 409.93 m for WCL + MLP, 464.94 m for WCL + LightGBM, and 445.81 m for WCL + XGBoost, compared to 558.84 m, 595.99 m, and 594.70 m for the corresponding baseline models. The improvement for kNN is smaller, but the WCL variant still reduces the 90% error threshold from 661.09 m to 619.83 m. These results demonstrate that the WCL model improves not only the central portion of the error distribution but also its performance in more complex cases. As a result, the hybrid variants demonstrate greater robustness by reducing the frequency of large localization errors.
Figure 8 shows the localization error distribution function for the WCL-based machine learning models and their hybrid variants under spatial separation. This setting is more demanding because the test data comes from geographically unknown regions, making the results more revealing. Under these conditions, the WCL-based models perform better, confirming the choice of physically based priors before the residual correction step. The most pronounced improvement is observed for WCL + MLP, whose curve remains consistently shifted to the left relative to the baseline MLP model. Specifically, 90% of errors are less than 404.84 m for WCL + MLP, compared to 538.83 m for MLP. A similarly strong effect is observed for LightGBM, where 90% of errors decrease from 729.15 m to 535.52 m after incorporating WCL. For XGBoost, the gain is smaller but still present, with 90% of errors being less than 530.52 m for WCL + XGBoost compared to 532.79 m for the baseline model. These results indicate that the WCL model is particularly useful for reducing the frequency of large localization biases in previously unobserved urban areas.
A different behavior is observed for kNN. While the baseline kNN model remains competitive in terms of distribution, the WCL model does not demonstrate the same consistent advantage as the neural and boosted models. In fact, near the 90% threshold, the threshold changes only slightly: from 681.93 m for kNN to 664.64 m for WCL + kNN. This suggests that the advantage of the WCL prior depends not only on the quality of the rough estimate but also on how effectively the regressor can exploit this prior. Overall, under spatial partitioning, WCL models remain the most effective for MLP and LightGBM, while WCL + MLP provides the most stable and robust performance among all the compared models.
CDF analysis and the detailed percent-of-error values for the random and spatial splitting configurations demonstrate that the integration of WCL shifts the entire error distribution toward lower values in both evaluation scenarios. In geographically overlapping and geographically disjoint configurations, the WCL-enhanced models exhibit a higher proportion of localization results within smaller error thresholds, reduced variability, and improved robustness.
To visualize the spatial behavior of localization errors and identify potential outliers,
Figure 9b presents the geographic distribution of positioning errors for the best-performing model, WCL + MLP. The map illustrates the spatial distribution of errors under the random split configuration across the urban area. The results show that in most parts of the city, the localization error predominantly falls within the 100–300 m range, indicating stable performance in both central and peripheral regions. A significant number of points exhibit errors below 100 m, demonstrating the model’s ability to achieve high positioning accuracy under favorable signal propagation conditions and sufficient gateway visibility. These low-error estimates are spatially distributed throughout the study area rather than concentrated within a single cluster. Only a limited number of points correspond to large-error cases exceeding 1000 m, and such extreme errors do not form pronounced spatial clusters, indicating that the hybrid framework effectively mitigates severe outliers.
In contrast,
Figure 9a presents the spatial error distribution for the MLP model. The map reveals a noticeably larger proportion of errors in the 300–500 m and 500–1000 m ranges, particularly within dense urban areas. Compared to the hybrid approach, the pure MLP model exhibits a higher concentration of medium-to-large deviations and a more heterogeneous spatial error pattern. Large-error regions appear more widespread, suggesting reduced robustness to complex propagation conditions.
Figure 10a illustrates the spatial distribution of localization errors for the MLP model. Compared to the random splitting scenario, the spatial error pattern appears somewhat more structured, with a moderate reduction in regions characterized by medium errors in the 300–500 m range.
However, noticeable clusters of medium and large deviations persist, particularly in dense urban areas and peripheral zones, indicating sensitivity to heterogeneous propagation conditions in the absence of WCL.
Figure 10b presents the results for the WCL + MLP model. Under spatial splitting, the hybrid framework demonstrates an even more consistent error pattern than in the random split scenario. The majority of localization errors are concentrated within the 100–300 m range, while areas with errors below 100 m remain widely distributed across the deployment region. Importantly, large-error cases exceeding 1000 m are rare and do not form concentrated clusters in specific areas.
The results obtained under random and spatial data splitting strategies indicate a reduction in medium and large localization errors and a more geographically consistent distribution of prediction errors. While random splitting reflects statistical prediction accuracy, spatial splitting provides a more rigorous evaluation of generalization capability to previously unseen regions. Together, both protocols confirm the reliability and stability of the proposed WCL-based approach.
To further support the comparative analysis of localization performance, a non-parametric statistical evaluation was conducted using the Friedman test. The corresponding results are summarized in
Table 6 and
Table 7.
Table 6 reports the overall significance of performance differences among the compared models for the random and spatial splitting scenarios, while
Table 7 presents the mean ranks derived from the Friedman procedure, which provide an interpretable measure of the relative performance of each method across the test samples.
Table 6 shows that the Friedman test identifies statistically significant overall differences among the compared models in both evaluation scenarios (
). This result indicates that the observed variation in localization error is not random and confirms that the models do not perform equivalently under either random or spatial splitting.
Table 7 presents the mean ranks of the compared methods, where lower values correspond to better overall performance. In both scenarios, WCL + MLP achieves the lowest mean rank, indicating the most consistent localization accuracy among all evaluated models. Under random splitting, WCL + XGBoost and KNN also demonstrate relatively strong performance, whereas the LightGBM and XGBoost models obtain the highest mean ranks. Under spatial splitting, WCL + MLP remains the best-performing method, while XGBoost becomes the strongest baseline model. In contrast, WCL + KNN yields the highest mean rank in the spatial setting, suggesting the weakest overall performance in that scenario.
Overall, the statistical analysis confirms that the compared models differ significantly in their localization behavior, and the mean-rank results further indicate that WCL + MLP is the most robust and consistently accurate method across both evaluation settings. This finding supports the effectiveness of combining WCL with residual learning based on MLP for outdoor LoRaWAN localization.
To assess the robustness of the WCL stage, a parametric analysis was performed using three key parameters: τ, top_strong, and min_gw. The results, presented in
Figure 11, demonstrate that the accuracy depends on the choice of parameters but remains quite robust over the studied range. For min_gw = 300, the best result was obtained with τ = 10 and top_strong = 0.05, with a median error of 74.04 m, while for min_gw = 500, the minimum was observed at τ = 8 and top_strong = 0.15, with a median error of 73.78 m.
Overall, the analysis showed that a more stringent min_gw threshold, a moderate τ value, and a wider proportion of strong RSSI observations provide the most robust initial coordinate estimate.
Table 8 presents the results of an additional ablation analysis in terms of mean localization error for the full model and its simplified variants. As can be seen, the full WCL + MLP residual model achieves the lowest mean localization error of 160.47 m. When the residual correction is replaced by direct coordinate refinement in the WCL + MLP direct refine variant, the mean error increases to 230.73 m, indicating that the improvement is not due only to the introduction of the WCL prior, but specifically to the residual learning formulation. Removing statistical features, SNR, and the gateway observation mask further degrades the performance to 230.98 m, 239.13 m, and 261.75 m, respectively. Using WCL only results in a much larger mean localization error of 564.45 m, confirming that WCL in the proposed approach serves as a coarse, physically interpretable initial estimate rather than a sufficiently accurate standalone localization method. Overall, the results show that the best performance is achieved only when the WCL prior, residual correction, and complete feature representation are used jointly.
Figure 12 presents an analysis of the importance of group permutations for the trained hybrid model. The gateway observation mask has the greatest impact on performance, increasing the median localization error by 306.82 m after permutation. RSSI and SNR are the next most important feature groups, while aggregated statistical features make an additional, but smaller, contribution. SF has the least impact in the current configuration. These results indicate that the explicit gateway observation mask is a key factor determining the performance of the proposed method.
In terms of computational efficiency, the WCL + MLP hybrid model exhibits only a moderate increase in cost compared to the original MLP. Specifically, training time increases from 276.94 s to 312.49 s, and inference time from 1.5522 s to 1.5806 s. Similarly, the latency per sample increases insignificantly, from 0.1414 s to 0.1440 s. The model size also increases from 1.94 MB to 2.48 MB. For the fastest model, XGBoost, the latency is 8.61 ms, compared to 8.65 ms for WCL + XGBoost. This demonstrates that including WCL can improve localization accuracy while only slightly increasing computation time. Thus, the addition of WCL-based coordinate priors does not significantly degrade computational performance and can be considered an acceptable tradeoff for the potential improvement in localization accuracy. These results are important because both execution time and memory footprint are critical practical factors for real-world localization systems, especially when models are expected to operate with limited computing resources or in near-real-time settings. Therefore, the observed increases in execution time and memory footprint can be considered small and acceptable, given the corresponding improvement in localization accuracy. All experiments were conducted on a system with an Intel Core i7–12700H (12th Gen, 2.30 GHz) processor and 16 GB RAM (4800 MT/s).
A limitation of the present evaluation is that the use of a single dataset does not fully establish the generalizability of the proposed method to other urban environments. Nevertheless, large open datasets for urban outdoor LoRaWAN localization remain very limited, which is why the Antwerp dataset continues to serve as one of the main public references in this area. In addition, the benchmark itself contains a source of irreducible uncertainty, since the geographic labels were collected from moving vehicles and may not perfectly match the true transmitter position at the exact reception moment. Therefore, part of the observed localization error should be interpreted as benchmark label uncertainty rather than purely algorithmic error.
In addition to the quantitative comparison, the obtained results should also be interpreted from the perspective of practical applicability. The proposed method is aimed at large-scale low-power IoT applications, including asset logistics, environmental monitoring, and smart city systems, which represent typical LoRaWAN deployment scenarios. In such systems, the use of GNSS is technically possible; however, when thousands of battery-powered devices are involved, it often proves to be economically and energetically inefficient.
From this perspective, the achieved improvement has practical significance. In dense urban environments, localization errors on the order of several hundred meters are often too coarse for practical use, since they may assign a device to the wrong facility, administrative zone, or road segment. A reduction in the mean error and, especially, a reduction in the median error to below 100 m increases the usefulness of the system by enabling more reliable localization at the zone or object level. This is particularly important for distributed air-quality monitoring, municipal infrastructure supervision, and low-cost asset tracking, where the goal is not navigation-grade accuracy, but correct assignment of measurements or assets to the corresponding operational zone. Thus, the proposed method narrows the gap between costly GNSS-based positioning and traditional coarse localization based on LPWAN radio measurements.