Figure 1.
Proposed framework for virtual sensor development integrating preprocessing, statistical virtual sensor generation (KDE, Copula, IDW, and Ridge Regression), KDE-based validation, physical constraint filtering, KL-divergence-based data fusion, deep generative augmentation (VAE and CTGAN), and time-series modeling using BiLSTM/BiGRU for accurate prediction and reconstruction of sensor data.
Figure 1.
Proposed framework for virtual sensor development integrating preprocessing, statistical virtual sensor generation (KDE, Copula, IDW, and Ridge Regression), KDE-based validation, physical constraint filtering, KL-divergence-based data fusion, deep generative augmentation (VAE and CTGAN), and time-series modeling using BiLSTM/BiGRU for accurate prediction and reconstruction of sensor data.
Figure 2.
Ridge Regression flow for virtual sensor simulation.
Figure 2.
Ridge Regression flow for virtual sensor simulation.
Figure 3.
Inverse Distance Weighting workflow for virtual sensor estimation.
Figure 3.
Inverse Distance Weighting workflow for virtual sensor estimation.
Figure 4.
Density comparison between real sensor data and IDW-generated virtual sensor data for temperature, humidity, wind direction, and wind speed.
Figure 4.
Density comparison between real sensor data and IDW-generated virtual sensor data for temperature, humidity, wind direction, and wind speed.
Figure 5.
Density comparison between real sensor data and KDE-generated virtual sensor data for temperature, humidity, wind direction, and wind speed.
Figure 5.
Density comparison between real sensor data and KDE-generated virtual sensor data for temperature, humidity, wind direction, and wind speed.
Figure 6.
Density comparison between real sensor data and Copula-generated virtual sensor data.
Figure 6.
Density comparison between real sensor data and Copula-generated virtual sensor data.
Figure 7.
KL-divergence-based fusion of virtual sensor data generated from different statistical methods. The fused data are validated within the defined physical ranges before augmentation and downstream model training.
Figure 7.
KL-divergence-based fusion of virtual sensor data generated from different statistical methods. The fused data are validated within the defined physical ranges before augmentation and downstream model training.
Figure 8.
Density comparison of the fused IDW–Ridge virtual sensor data.
Figure 8.
Density comparison of the fused IDW–Ridge virtual sensor data.
Figure 9.
Density comparison of the fused IDW–KDE virtual sensor data.
Figure 9.
Density comparison of the fused IDW–KDE virtual sensor data.
Figure 10.
Density comparison of the fused Copula–KDE virtual sensor data.
Figure 10.
Density comparison of the fused Copula–KDE virtual sensor data.
Figure 11.
Augmentation of the initial data generated using statistical models, followed by deep generative modeling (VAE and CTGAN) to produce high-fidelity synthetic sensor data for downstream learning.
Figure 11.
Augmentation of the initial data generated using statistical models, followed by deep generative modeling (VAE and CTGAN) to produce high-fidelity synthetic sensor data for downstream learning.
Figure 12.
Architecture of the Variational Autoencoder (VAE) model, illustrating the encoder, latent space sampling, and decoder used for generating augmented sensor data.
Figure 12.
Architecture of the Variational Autoencoder (VAE) model, illustrating the encoder, latent space sampling, and decoder used for generating augmented sensor data.
Figure 13.
Architecture of the CTGAN model used for generating augmented sensor data.
Figure 13.
Architecture of the CTGAN model used for generating augmented sensor data.
Figure 14.
Fused Copula & KDE weighted virtual sensor data augmentation analysis using PCA visualization: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 14.
Fused Copula & KDE weighted virtual sensor data augmentation analysis using PCA visualization: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 15.
KDE-based distribution plots of feature-wise similarity for fused Copula & KDE weighted virtual sensor data: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 15.
KDE-based distribution plots of feature-wise similarity for fused Copula & KDE weighted virtual sensor data: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 16.
Fused KL-based IDW & KDE virtual sensor data augmentation analysis using PCA visualization: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 16.
Fused KL-based IDW & KDE virtual sensor data augmentation analysis using PCA visualization: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 17.
Density distribution analysis of fused KL-based IDW & KDE virtual sensor data augmentation: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 17.
Density distribution analysis of fused KL-based IDW & KDE virtual sensor data augmentation: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 18.
Fused KL-based Ridge & IDW virtual sensor data augmentation analysis using PCA visualization: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 18.
Fused KL-based Ridge & IDW virtual sensor data augmentation analysis using PCA visualization: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 19.
Density distribution analysis of fused KL-based Ridge & IDW virtual sensor data augmentation: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 19.
Density distribution analysis of fused KL-based Ridge & IDW virtual sensor data augmentation: (a) VAE-augmented data and (b) CTGAN-augmented data.
Figure 20.
MAE comparison of BiLSTM and BiGRU models for temperature prediction across physical and virtual sensor data variants.
Figure 20.
MAE comparison of BiLSTM and BiGRU models for temperature prediction across physical and virtual sensor data variants.
Figure 21.
MAE comparison of BiLSTM and BiGRU models for humidity prediction across physical and virtual sensor data variants.
Figure 21.
MAE comparison of BiLSTM and BiGRU models for humidity prediction across physical and virtual sensor data variants.
Figure 22.
MAE comparison of BiLSTM and BiGRU models for wind direction prediction across physical and virtual sensor data variants.
Figure 22.
MAE comparison of BiLSTM and BiGRU models for wind direction prediction across physical and virtual sensor data variants.
Figure 23.
MAE comparison of BiLSTM and BiGRU models for wind speed prediction across physical and virtual sensor data variants.
Figure 23.
MAE comparison of BiLSTM and BiGRU models for wind speed prediction across physical and virtual sensor data variants.
Table 1.
Comparison of recent sensor, temporal prediction, and temporal generative studies with the proposed framework.
Table 1.
Comparison of recent sensor, temporal prediction, and temporal generative studies with the proposed framework.
| Study | Main Focus | Key Strength | Limitation Compared with This Work |
|---|
| Miranda et al. [20] | Virtual sensor modeling | Estimates missing or unreliable measurements | Limited fusion, augmentation, and temporal prediction |
| Jutras et al. [27] | Copula dependency modeling | Captures multivariate dependencies | No complete augmentation–prediction pipeline |
| Liu et al. [28] | IDW-based spatial estimation | Effective for spatial sensor estimation | Limited modeling of nonlinear temporal dynamics |
| Dong et al. [37] | ML-assisted physical sensing | High recognition accuracy | Focuses on physical sensor classification |
| Ahmed et al. [43] | Synthetic tabular data | Improves data availability | Limited physical constraints and KL-based fusion |
| Zhou et al. [44] | Transformer forecasting | Captures long-range temporal dependency | Prediction-oriented; no virtual sensor generation |
| Lim et al. [38] | TFT forecasting | Interpretable multi-horizon prediction | Forecasting-focused, not virtual sensing |
| Jin et al. [39] | Spatio-temporal GNNs | Models graph-based sensor dependencies | Requires explicit sensor graph topology |
| Yoon et al. [41] | Temporal data generation | Preserves temporal dynamics | Adversarial training; no physical/KL fusion |
| Lin et al. [42] | Diffusion-based time-series generation | Strong iterative generative modeling | Computationally expensive denoising process |
| Proposed framework | Virtual sensor generation and prediction | Unified physical, statistical, generative, and temporal pipeline | Needs tuning and validation for other domains |
Table 2.
Description of physical sensor variables used in the study, including data types and corresponding environmental measurements collected from the weather station.
Table 2.
Description of physical sensor variables used in the study, including data types and corresponding environmental measurements collected from the weather station.
| # | Physical Sensors | Data Type | Description |
|---|
| 1 | Timestamp | Temporal (datetime) | The date and time when the data was recorded. |
| 2 | Temperature | Continuous (float) | Temperature in Celsius (°C) measured at the weather station. |
| 3 | Humidity | Continuous (float) | Relative humidity in percentage (%), indicating the amount of moisture in the air. |
| 4 | Wind Direction Angle | Continuous (float) | Average wind direction angle in degrees (0–360°). Since this is a circular variable, it is encoded using sine and cosine components for model training. |
| 5 | Wind Speed | Continuous (float) | Wind speed in meters per second (m/s), indicating the magnitude of air movement. |
| 6 | Longitude | Continuous (float) | Longitude coordinate of the sensor location (in degrees). |
| 7 | Latitude | Continuous (float) | Latitude coordinate of the sensor location (in degrees). |
Table 3.
Physical constraints used for validating sensor variables.
Table 3.
Physical constraints used for validating sensor variables.
| Variable | Physical Constraint Used in Validation | Explanation |
|---|
| Temperature | | According to the region temperature. |
| Humidity | | Relative humidity percentage. |
| Wind speed | | Wind speed cannot be negative. |
| Wind direction | | Circular angular variable. |
| Longitude | Valid GPS longitude range/station metadata | Sensor location coordinate. |
| Latitude | Valid GPS latitude range/station metadata | Sensor location coordinate. |
Table 4.
Experimental system configuration used for model training and evaluation.
Table 4.
Experimental system configuration used for model training and evaluation.
| Component | Specification |
|---|
| Operating system | Windows 10 for PC Server |
| Main memory | 96 GB RAM |
| Processor | 12th Gen Intel(R) Core(TM) i9-12900K, 3.20 GHz |
| Programming language | Python 3 |
| IDE | PyCharm Professional 2024.3.5 |
| Data storage/format | CSV and MS Excel files |
| Core libraries | Pandas 2.3.3, Scikit-learn 1.9.0, Keras 3.14.1, TensorFlow 2.21.0, Seaborn 0.13.2, Matplotlib 3.10.9, CTGAN 0.12.1, and PyTorch 2.12.0 |
Table 5.
Key hyperparameters of the VAE and CTGAN models used for data augmentation.
Table 5.
Key hyperparameters of the VAE and CTGAN models used for data augmentation.
| Parameter | VAE | CTGAN |
|---|
| Data split | 80% training, 20% validation | 80% training, 20% validation |
| Latent/noise dimension | 40 | Default CTGAN noise dimension |
| Network structure | Encoder: Dense(128)–Dense(64); Decoder: Dense(64)–Dense(128)–Dense(input dim.) | Generator dimensions: (128, 128); default discriminator |
| Activation function | ReLU in hidden layers | Default CTGAN activations |
| Loss function | Reconstruction MSE + KL-divergence | Adversarial training loss |
| KL weighting | Linear warm-up: | Not applicable |
| Optimizer | Adam | Adam-based CTGAN optimizer |
| Learning rate | 0.001 | Default CTGAN learning rate |
| Batch size | 16 | 32 |
| Epochs | 50 | 50 |
| Output validation | Inverse scaling and physical validity filtering | Inverse scaling and physical validity filtering |
Table 6.
Key network structure and training parameters of the BiLSTM and BiGRU models.
Table 6.
Key network structure and training parameters of the BiLSTM and BiGRU models.
| Parameter | BiLSTM | BiGRU |
|---|
| Input sequence length | 3 time steps | 3 time steps |
| Input shape | for scalar features; for WindDirX/WindDirY | for scalar features; for WindDirX/WindDirY |
| Recurrent layers | BiLSTM(64, ReLU, return sequences) → BiLSTM(32, ReLU) | BiGRU(64, ReLU, return sequences) → BiGRU(32, ReLU) |
| Dropout | 0.3 after first recurrent layer; 0.2 after second recurrent layer | 0.3 after first recurrent layer; 0.2 after second recurrent layer |
| Dense layer | Dense(32, ReLU) | Dense(32, ReLU) |
| Output layer | Dense(1) for scalar features; Dense(2) for WindDirX/WindDirY | Dense(1) for scalar features; Dense(2) for WindDirX/WindDirY |
| Optimizer | Adam | Adam |
| Learning rate | 0.001 | 0.001 |
| Loss function | MSE | MSE |
| Epochs | 50 | 50 |
| Batch size | 32 | 32 |
| Train-test split | 80% training, 20% testing | 80% training, 20% testing |
| Wind direction reconstruction | atan2(WindDirY, WindDirX), mapped to | atan2(WindDirY, WindDirX), mapped to |
Table 7.
Computational characteristics and scalability considerations of the proposed framework.
Table 7.
Computational characteristics and scalability considerations of the proposed framework.
| Stage | Main Operations | Computational Cost | Scalability Consideration |
|---|
| Preprocessing | Missing-value handling, physical constraints, scaling, wind-direction encoding | Low; approximately linear with data size | Easily scalable using batch or streaming preprocessing |
| Virtual sensor generation | IDW, KDE, Ridge-based estimation, Copula-based modeling | Moderate; depends on records, variables, and estimation method | Can be parallelized across variables or sensor locations |
| KL-divergence fusion | Distributional comparison and best-source selection | Moderate; depends on number of candidate virtual sensors and features | Scalable through feature-wise or location-wise computation |
| VAE/CTGAN augmentation | Iterative deep generative model training and sample validation | High; requires repeated neural network optimization | Can use mini-batch training, GPU acceleration, and parallel runs |
| BiLSTM/BiGRU prediction | Sequence construction and recurrent neural network training | Moderate to high; depends on sequence length, epochs, and batch size | Scalable using shorter windows, mini-batches, and model compression |
| Evaluation | MAE, MSE, RMSE, and wind-direction reconstruction | Low | Easily scalable and can be automated |
Table 8.
Quantitative computational cost and scalability analysis of the main augmentation and temporal prediction components.
Table 8.
Quantitative computational cost and scalability analysis of the main augmentation and temporal prediction components.
| Component | Dataset/Case | Training Time (s) | Inference Time (s) | Single Inference Time (s) | Memory Increase (MB) | Time Complexity |
|---|
| CTGAN augmentation | CTGAN data | 351.79 | 0.054 | – | 150.43 | |
| VAE augmentation | VAE data | 1077.27 | 0.003 | – | 12.52 | |
| BiLSTM prediction | Original data | 1928.40 | 61.11 | 0.1154 | 150.21 | / |
| BiGRU prediction | Original data | 1343.79 | 37.58 | 0.0709 | 72.28 | / |
| BiLSTM prediction | KL-Ridge-IDW data | 115.72 | 72.48 | 0.1481 | 64.02 | / |
| BiGRU prediction | KL-Ridge-IDW data | 66.33 | 34.53 | 0.0719 | 73.96 | / |
| BiLSTM prediction | Copula-KDE + VAE data | 42.45 | 30.32 | 0.0643 | 59.27 | / |
| BiGRU prediction | Copula-KDE + VAE data | 74.68 | 55.15 | 0.1125 | 68.00 | / |
Table 9.
Experimental Results of BiLSTM and BiGRU Models on Physical Sensors Data.
Table 9.
Experimental Results of BiLSTM and BiGRU Models on Physical Sensors Data.
| Feature | BiLSTM | BiGRU |
|---|
| | MAE | MSE | RMSE | MAE | MSE | RMSE |
|---|
| temperature | 1.448813 | 4.143780 | 2.035628 | 1.630625 | 4.904090 | 2.214518 |
| humidity | 5.279900 | 70.627216 | 8.404000 | 5.370100 | 73.673039 | 8.583300 |
| winddirangleavg | 0.543000 | 0.431800 | 0.657100 | 0.535280 | 0.430390 | 0.656040 |
| windspeedmax | 0.744917 | 0.923897 | 0.961196 | 0.770717 | 0.957817 | 0.978682 |
Table 10.
Experimental Results of BiLSTM and BiGRU Models on Fused Copula-KDE Weighted Virtual Sensor Data Variants.
Table 10.
Experimental Results of BiLSTM and BiGRU Models on Fused Copula-KDE Weighted Virtual Sensor Data Variants.
| Sensors | BiLSTM | BiGRU |
|---|
| | MAE | MSE | RMSE | MAE | MSE | RMSE |
|---|
| Experimental Results of a Learning Model with Fused Copula KDE Data |
| temperature | 2.705277 | 11.718570 | 3.423240 | 2.710848 | 11.843070 | 3.441376 |
| humidity | 13.161027 | 249.928821 | 15.809137 | 13.170420 | 249.856166 | 15.806839 |
| winddirangleavg | 0.279480 | 0.118100 | 0.343780 | 0.279350 | 0.119400 | 0.345650 |
| windspeedmax | 1.090956 | 1.691932 | 1.300743 | 1.089263 | 1.698534 | 1.303278 |
| Experimental Results of a Learning Model on Fused-CTGAN Data |
| temperature | 3.094070 | 14.901010 | 3.860182 | 3.078674 | 14.814790 | 3.848999 |
| humidity | 13.732957 | 272.891302 | 16.519422 | 13.773016 | 275.639974 | 16.602409 |
| winddirangleavg | 0.292930 | 0.146370 | 0.382580 | 0.295500 | 0.144030 | 0.379500 |
| windspeedmax | 1.059140 | 1.746934 | 1.321716 | 1.090574 | 1.792604 | 1.338881 |
| Experimental Results of a Learning Model on Fused-VAE Data |
| temperature | 2.269028 | 7.944535 | 2.818605 | 2.269672 | 7.936787 | 2.817230 |
| humidity | 10.913837 | 183.663820 | 13.552263 | 10.666777 | 177.631780 | 13.327857 |
| winddirangleavg | 0.162810 | 0.046050 | 0.214590 | 0.159390 | 0.045280 | 0.212800 |
| windspeedmax | 0.882243 | 1.048907 | 1.024162 | 0.883929 | 1.051384 | 1.025370 |
Table 11.
Experimental Results of a Learning Model for Virtual Sensors based on KL-IDW-KDE, CTGAN, and VAE Data.
Table 11.
Experimental Results of a Learning Model for Virtual Sensors based on KL-IDW-KDE, CTGAN, and VAE Data.
| Sensors | BiLSTM | BiGRU |
|---|
| | MAE | MSE | RMSE | MAE | MSE | RMSE |
|---|
| Experimental Results on KL-IDW-KDE Fused Virtual Sensor Data |
| temperature | 3.026100 | 13.894600 | 3.727500 | 3.036400 | 14.062800 | 3.750000 |
| humidity | 18.334388 | 462.720927 | 21.510949 | 18.310038 | 462.794497 | 21.512659 |
| winddirangleavg | 0.475960 | 0.361058 | 0.600800 | 0.445960 | 0.370340 | 0.608500 |
| windspeedmax | 1.019100 | 1.492500 | 1.221700 | 1.020200 | 1.489300 | 1.220400 |
| Experimental Results on KL-IDW-KDE with CTGAN Data |
| temperature | 3.238600 | 16.077100 | 4.009600 | 3.247900 | 16.128900 | 4.016100 |
| humidity | 16.502866 | 425.539386 | 20.628606 | 16.349276 | 414.968922 | 20.370786 |
| winddirangleavg | 0.484890 | 0.364200 | 0.603490 | 0.434080 | 0.384550 | 0.620100 |
| windspeedmax | 1.195900 | 1.818700 | 1.348600 | 1.201300 | 1.828800 | 1.352300 |
| Experimental Results on KL-IDW-KDE with VAE Data |
| temperature | 2.458100 | 9.835900 | 3.136200 | 2.463700 | 9.804000 | 3.131100 |
| humidity | 14.986074 | 334.733370 | 18.295720 | 14.966607 | 334.624520 | 18.292745 |
| winddirangleavg | 0.321710 | 0.189690 | 0.435500 | 0.329310 | 0.184870 | 0.429960 |
| windspeedmax | 0.773200 | 0.898700 | 0.948000 | 0.768800 | 0.909900 | 0.953900 |
Table 12.
Experimental Results of a Learning Model for Virtual Sensors based on KL-Ridge-IDW, CTGAN, and VAE Data.
Table 12.
Experimental Results of a Learning Model for Virtual Sensors based on KL-Ridge-IDW, CTGAN, and VAE Data.
| Sensors | BiLSTM | BiGRU |
|---|
| | MAE | MSE | RMSE | MAE | MSE | RMSE |
|---|
| Experimental Results on KL-Ridge-IDW Fused Virtual Sensor Data |
| temperature | 2.650400 | 10.764100 | 3.280900 | 2.654300 | 10.777700 | 3.282900 |
| humidity | 11.891702 | 223.552805 | 14.951682 | 11.864079 | 223.465279 | 14.948755 |
| winddirangleavg | 0.238300 | 0.100500 | 0.317040 | 0.228580 | 0.104800 | 0.323770 |
| windspeedmax | 0.641200 | 0.656500 | 0.810300 | 0.639300 | 0.665800 | 0.816000 |
| Experimental Results on KL-Ridge-IDW with CTGAN Data |
| temperature | 2.684300 | 11.258600 | 3.355400 | 2.688100 | 11.377300 | 3.373000 |
| humidity | 14.544929 | 309.811054 | 17.601450 | 14.574742 | 312.976560 | 17.691144 |
| winddirangleavg | 0.340970 | 0.181320 | 0.425810 | 0.339930 | 0.179650 | 0.423860 |
| windspeedmax | 0.647700 | 0.635700 | 0.797300 | 0.645800 | 0.630200 | 0.793900 |
| Experimental Results on KL-Ridge-IDW with VAE Data |
| temperature | 2.531300 | 10.184400 | 3.191300 | 2.535000 | 10.237600 | 3.199600 |
| humidity | 10.248988 | 166.336333 | 12.897144 | 10.628155 | 176.619255 | 13.289818 |
| winddirangleavg | 0.166470 | 0.048700 | 0.220750 | 0.173850 | 0.050348 | 0.224380 |
| windspeedmax | 0.589600 | 0.495200 | 0.703700 | 0.591100 | 0.496000 | 0.704300 |
Table 13.
Experimental Results of a Learning Model for Virtual Sensors based on KDE, KDE-CTGAN, and KDE-VAE Data.
Table 13.
Experimental Results of a Learning Model for Virtual Sensors based on KDE, KDE-CTGAN, and KDE-VAE Data.
| Sensors | BiLSTM | BiGRU |
|---|
| | MAE | MSE | RMSE | MAE | MSE | RMSE |
|---|
| Experimental Results on KDE Virtual Sensor Data |
| temperature | 3.863000 | 23.701600 | 4.868400 | 3.883900 | 24.005900 | 4.899600 |
| humidity | 18.812860 | 490.288933 | 22.142469 | 18.898043 | 497.731995 | 22.309908 |
| winddirangleavg | 0.506560 | 0.417130 | 0.645860 | 0.505970 | 0.416330 | 0.645236 |
| windspeedmax | 1.135000 | 1.831800 | 1.353400 | 1.136000 | 1.828100 | 1.352100 |
| Experimental Results on KDE with CTGAN Data |
| temperature | 3.839900 | 23.665100 | 4.864700 | 3.861900 | 23.962200 | 4.895100 |
| humidity | 22.217714 | 699.933905 | 26.456264 | 22.335684 | 704.135255 | 26.535547 |
| winddirangleavg | 0.428180 | 0.299130 | 0.546930 | 0.451960 | 0.307280 | 0.554330 |
| windspeedmax | 1.214100 | 2.062900 | 1.436300 | 1.220600 | 2.112100 | 1.453300 |
| Experimental Results on KDE with VAE Data |
| temperature | 3.662600 | 19.831200 | 4.453200 | 3.676400 | 20.068100 | 4.479700 |
| humidity | 13.663877 | 270.809119 | 16.456279 | 13.707969 | 272.862230 | 16.518542 |
| winddirangleavg | 0.384350 | 0.228350 | 0.477860 | 0.387490 | 0.230820 | 0.480440 |
| windspeedmax | 0.865700 | 1.010500 | 1.005200 | 0.850200 | 0.996100 | 0.998100 |
Table 14.
Comparison with temporal generative baselines using BiGRU as the downstream prediction model.
Table 14.
Comparison with temporal generative baselines using BiGRU as the downstream prediction model.
| Method | Temperature MAE | Humidity MAE | Wind Direction MAE | Wind Speed MAE |
|---|
| TimeGAN | 4.7075 | 16.9984 | 1.0545 | 2.2265 |
| Diffusion | 2.4285 | 3.9679 | 0.6157 | 0.8915 |
| Proposed | 2.2697 | 10.6668 | 0.1594 | 0.8839 |
Table 15.
Statistical significance analysis using the Wilcoxon signed-rank test.
Table 15.
Statistical significance analysis using the Wilcoxon signed-rank test.
| Comparison | Variable | p-Value | Significance |
|---|
| Proposed vs. TimeGAN | Temperature | | Significant |
| Proposed vs. TimeGAN | Humidity | | Significant |
| Proposed vs. TimeGAN | Wind direction | | Significant |
| Proposed vs. TimeGAN | Wind speed | | Significant |
| Proposed vs. Diffusion | Temperature | | Significant |
| Proposed vs. Diffusion | Humidity | | Significant |
| Proposed vs. Diffusion | Wind direction | | Significant |
| Proposed vs. Diffusion | Wind speed | | Significant |
Table 16.
External validation using the public NOAA LCD weather-station dataset with BiGRU as the downstream prediction model.
Table 16.
External validation using the public NOAA LCD weather-station dataset with BiGRU as the downstream prediction model.
| Dataset/Method | Temperature MAE | Humidity MAE | Wind Direction MAE | Wind Speed MAE |
|---|
| Original NOAA Public Data | 0.8730 | 3.2324 | 11.7888 | 0.3593 |
| KL-Ridge–IDW Fused NOAA Data | 0.7746 | 3.3330 | 11.8946 | 0.3627 |
| KL-Ridge–IDW + VAE NOAA Data | 5.2698 | 11.6837 | 61.5136 | 0.7058 |