Next Article in Journal
Modelling Large-Scale Group Decision-Making Through Grouping with Large Language Models
Previous Article in Journal
Design and Implementation of an Autonomous Mobile Robot for Object Delivery via Homography-Based Visual Servoing
Previous Article in Special Issue
Unsupervised Person Re-Identification via Deep Attribute Learning
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Novel Encoder–Decoder Architecture with Attention Mechanisms for Satellite-Based Environmental Forecasting in Smart City Applications

by
Kalsoom Panhwar
1,*,
Bushra Naz Soomro
2,*,
Sania Bhatti
3 and
Fawwad Hassan Jaskani
4
1
Department of Computer Systems Engineering, University of Sindh, Jamshoro 76080, Pakistan
2
Department of Computer Systems Engineering, Mehran University of Engineering and Technology, Jamshoro 76062, Pakistan
3
Department of Software Engineering, Mehran University of Engineering and Technology, Jamshoro 76062, Pakistan
4
Department of Computer Systems Engineering, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan
*
Authors to whom correspondence should be addressed.
Future Internet 2025, 17(9), 380; https://doi.org/10.3390/fi17090380
Submission received: 24 July 2025 / Revised: 9 August 2025 / Accepted: 15 August 2025 / Published: 25 August 2025
(This article belongs to the Special Issue Advances in Deep Learning and Next-Generation Internet Technologies)

Abstract

Desertification poses critical threats to agricultural productivity and socio-economic stability, particularly in vulnerable regions like Thatta and Badin districts of Sindh, Pakistan. Traditional monitoring methods lack the accuracy and temporal resolution needed for effective early warning systems. This study presents a novel Spatio-Temporal Desertification Predictor (STDP) framework that integrates deep learning with next-generation satellite imaging for time-series desertification forecasting. The proposed encoder–decoder architecture combines Convolutional Neural Networks (CNNs) for spatial feature extraction from high-resolution satellite imagery with modified Long Short-Term Memory (LSTM) networks enhanced by multi-head attention to capture temporal dependencies. Environmental variables are fused through an adaptive data integration layer, and hyperparameter optimization is employed to enhance model performance for edge computing deployment. Experimental validation on a 15-year satellite dataset (2010–2024) demonstrates superior performance with MSE = 0.018, MAE = 0.079, and R 2 = 0.94 , outperforming traditional CNN-only, LSTM-only, and hybrid baselines by 15–20% in prediction accuracy. The framework forecasts desertification trends through 2030, providing actionable signals for environmental management and policy-making. This work advances the integration of AI with satellite-based Earth observation, offering a scalable path for real-time environmental monitoring in IoT and edge computing infrastructures.

1. Introduction

The integration of deep learning algorithms with emerging Internet paradigms has opened new opportunities for addressing critical environmental challenges through intelligent satellite-based monitoring. Desertification, defined as the transformation of fertile land into desert through drought, deforestation, or poor agricultural practices [1], represents a major global threat that demands AI-driven solutions. According to the United Nations Convention to Combat Desertification (UNCCD) [2], about 1.3 billion people are affected annually by 12 million hectares of land degradation, with Thatta and Badin districts of Sindh, Pakistan, being particularly vulnerable.
The convergence of artificial intelligence and remote sensing within IoT-enabled environmental monitoring systems offers a paradigm shift in desertification prediction and management. Traditional monitoring suffers from limited temporal resolution and accuracy, hindering early detection and trend analysis [3]. These shortcomings constrain timely intervention and sustainable land management, especially in vulnerable regions [2,4].
Recent advances in deep learning, combined with edge computing and 5G/6G infrastructures, have transformed satellite image processing for environmental applications. High-resolution imagery now enables detailed, real-time observations of land conditions, and when processed with state-of-the-art algorithms [5], can reveal complex desertification patterns and predict future changes with high accuracy. Convolutional Neural Networks (CNNs) for spatial feature extraction and Long Short-Term Memory (LSTM) networks for temporal pattern analysis have shown notable improvements in environmental monitoring [6].
AI-driven monitoring has achieved high accuracy in detecting and mapping desertification patterns [7,8], but often lacks sophisticated attention mechanisms and multi-modal data fusion for complex scenarios. While traditional methods improve accuracy by up to 15% over baselines, hybrid CNN–LSTM models—especially when enhanced with attention mechanisms—are emerging as the next frontier [9]. However, key challenges remain, including integration of diverse environmental variables, real-time edge deployment, and effective temporal feature prioritization [10].
This research introduces several groundbreaking innovations at the intersection of deep learning algorithms and next-generation Internet technologies for environmental monitoring: We present the first Spatio-Temporal Desertification Predictor (STDP) framework that integrates an encoder–decoder architecture specifically designed for satellite-based environmental monitoring. Unlike existing approaches, our model incorporates multi-head attention mechanisms that automatically identify and prioritize critical temporal features, addressing a key limitation in current environmental AI systems.
Our research introduces an innovative data fusion layer that seamlessly integrates multi-temporal satellite imagery with real-time environmental variables, enabling deployment in IoT-enabled smart city infrastructures and edge computing environments. This represents a significant advancement over traditional single-source approaches. We develop an adaptive hyperparameter optimization strategy specifically designed for deployment in edge computing and 5G/6G network environments, ensuring real-time processing capabilities while maintaining prediction accuracy—a critical requirement for next-generation environmental monitoring systems. The proposed framework addresses the scalability challenges in satellite image processing by introducing novel model compression techniques and efficient feature extraction methods suitable for distributed computing environments and autonomous monitoring systems.
The primary goal of this research is to develop an advanced deep learning framework that leverages next-generation Internet technologies for optimized time-series forecasting of desertification levels from satellite imagery. Our framework addresses critical gaps in current environmental AI systems by introducing the following:
  • A hybrid model combining CNNs with attention-enhanced LSTM networks for superior spatial–temporal feature extraction from satellite data, specifically optimized for edge computing deployment.
  • An innovative approach that seamlessly integrates multi-temporal satellite images with real-time environmental variables through advanced fusion mechanisms suitable for smart city and autonomous monitoring applications.
  • Adaptive hyperparameter optimization techniques that enhance both forecast accuracy and computational efficiency, enabling deployment in 5G/6G networks and edge computing infrastructures.
  • Comprehensive validation using real-world satellite datasets to demonstrate effectiveness in early detection and management of environmental degradation across diverse geographical regions.
To summarize, the key contributions of this work are outlined as follows:
  • We propose the first Spatio-Temporal Desertification Predictor (STDP) framework that combines CNN-based spatial feature extraction with attention-enhanced LSTM temporal modeling, optimized for edge computing deployment.
  • We integrate multi-modal IoT sensor data with satellite imagery through an uncertainty-aware fusion mechanism, enabling real-time environmental monitoring in smart city infrastructures.
  • We design an adaptive hyperparameter optimization process tailored for low-latency, energy-efficient inference in 5G/6G environments.
  • We validate the framework on a 15-year multi-source dataset (2010–2024) from two vulnerable districts in Pakistan, achieving superior accuracy over CNN-only, LSTM-only, and Transformer baselines.
This research paper is structured to provide comprehensive coverage of our novel AI-driven environmental monitoring framework. Section 1 introduces the desertification challenge within the context of next-generation Internet technologies and deep learning applications. Section 2 presents a thorough literature review of AI-driven environmental monitoring, highlighting the research gaps that our STDP framework addresses. Section 3 details the methodology of our novel encoder–decoder architecture, including the innovative CNN–LSTM hybrid model with attention mechanisms, advanced data fusion strategies, and edge computing optimization techniques. Section 4 presents comprehensive results and discussions, comparing our approach with traditional and state-of-the-art methods for environmental monitoring. Section 5 summarizes key contributions and outlines the practical applications of our framework for sustainable environmental management and future research directions in AI-driven environmental systems.
The research contributes significantly to the intersection of deep learning algorithms and next-generation Internet technologies by providing a scalable, accurate, and computationally efficient solution for satellite-based environmental monitoring, with broad applications in smart cities, autonomous environmental systems, and IoT-enabled sustainability initiatives.

2. Literature Review

The convergence of deep learning algorithms with next-generation Internet technologies has catalyzed significant advancements in satellite-based environmental monitoring systems. This section comprehensively reviews the state-of-the-art developments in AI-driven desertification detection, with particular emphasis on novel deep learning architectures, attention mechanisms for feature extraction, and their integration with edge computing and IoT infrastructures for real-time environmental monitoring.
The foundation of modern AI-driven environmental monitoring lies in scalable cloud computing platforms that enable processing of massive satellite datasets. Aslanov et al. [1] demonstrated the effectiveness of Google Earth Engine for comprehensive desertification pattern analysis across Central Asia, utilizing spatio-temporal data from 2000 to 2020. Their cloud-based approach enabled monitoring of environmental trends over vast geographical areas with unprecedented speed and scalability, establishing a paradigm for large-scale environmental AI applications. This work exemplifies the integration of cloud computing with satellite imagery processing, a critical component of next-generation Internet technologies for environmental applications.
The temporal dynamics of vegetation cover as desertification indicators have been extensively studied through multi-temporal satellite image analysis. Badreldin et al. [2] investigated spatio-temporal dynamics in Egypt, revealing significant vegetation cover decline through comprehensive temporal data integration. Their methodology demonstrates the essential role of time-series analysis in satellite-based environmental monitoring, providing foundational insights for developing advanced temporal modeling approaches in deep learning frameworks.
Advanced feature extraction techniques form the backbone of intelligent environmental monitoring systems. Barimah [4] established the critical importance of Normalized Difference Vegetation Index (NDVI) for desertification detection and quantification in Ghana’s Upper East Region, analyzing data from 2000 to 2012. Their findings revealed strong correlations between vegetation loss and desertification patterns, providing essential validation for NDVI-based feature extraction in deep learning models.
Complementing NDVI analysis, Bezerra et al. [11] explored Enhanced Vegetation Index (EVI2) applications using multi-temporal MODIS data for desertification area identification. Their research demonstrated the capability of vegetation indices to detect regions with significant vegetation decrease, establishing the foundation for multi-spectral feature extraction in contemporary AI systems. These studies collectively highlight the importance of sophisticated feature extraction mechanisms, a key focus area in the current special issue on attention mechanisms for feature extraction.
The application of advanced deep learning architectures to environmental monitoring represents a significant breakthrough in AI-driven earth observation systems. Chang et al. [6] pioneered the use of U-Net architecture combined with machine learning for time-series growth prediction in Arabidopsis, demonstrating deep learning’s capability to extract spatially explicit and temporally contingent features necessary for accurate environmental predictions. Their work established the foundation for encoder–decoder architectures in environmental AI applications, directly relevant to the novel architectural designs emphasized in this special issue.
The integration of multiple machine learning algorithms for environmental monitoring was advanced by Feng et al. [12], who utilized Random Forest and Support Vector Machine algorithms for desertification monitoring in Mu Us Sandy Land, China. Their approach achieved superior accuracy compared to traditional methods, demonstrating the potential of ensemble learning approaches in satellite-based environmental monitoring systems. This work provides important insights for developing hybrid AI architectures suitable for edge computing deployment.
Spatial and temporal dynamics analysis represents a critical component of intelligent environmental monitoring systems. Guo et al. [13] conducted comprehensive satellite monitoring of desertification dynamics in Ordos Plateau, China, from 2000 to 2015, identifying key areas of severe desertification using MODIS and Landsat imagery. Their methodology demonstrates the feasibility of large-scale environmental change monitoring through integrated remote sensing approaches, providing essential validation for scalable AI deployment in environmental applications.
The application of semi-supervised learning approaches to environmental monitoring was advanced by Harrou et al. [14], who developed Landsat imagery-based anomaly detection for desertification indicators. Their semi-supervised approach effectively addresses the challenge of limited labeled data in environmental monitoring, a critical consideration for deploying AI systems in IoT and edge computing environments where continuous learning capabilities are essential.
The integration of multiple data sources represents a frontier in AI-driven environmental monitoring systems. Hummadi and Khalaf [5] demonstrated the effectiveness of spectral indices for studying spatial and temporal variability of agricultural drought and desertification in Salah Al-Din Governorate, tracking soil moisture and vegetation health changes using NDVI, EVI, and other spectral indices. Their multi-modal approach provides essential insights for developing comprehensive data fusion strategies in IoT-enabled environmental monitoring systems.
Advanced SAR imagery processing was explored by Jo et al. [8], who achieved excellent classification accuracy for rice paddy identification in South Korea using multi-temporal Synthetic Aperture Radar imaging with deep learning approaches. Their work demonstrates the potential of all-weather monitoring capabilities essential for autonomous environmental monitoring systems, particularly relevant for edge computing applications where continuous operation is required regardless of atmospheric conditions.
The challenge of integrating diverse data sources for comprehensive environmental monitoring was addressed by Kussul et al. [10], who proposed innovative approaches for land degradation estimation using mixed data sources and analytical techniques. Their integrated methodology combining remote sensing data, machine learning models, and ground observations provides a comprehensive framework for land degradation identification and quantification, establishing important precedents for multi-source data fusion in AI systems.
Comprehensive reviews of deep learning applications in environmental monitoring have been provided by Jelas et al. [9], who examined semantic segmentation techniques for deforestation detection, and Miller et al. [15], who reviewed deep learning techniques for satellite image time-series analysis. These reviews highlight the evolution from traditional CNN and RNN approaches to sophisticated attention-based architectures, emphasizing the importance of temporal feature extraction capabilities in environmental AI systems.
Advanced predictive modeling for environmental applications has been significantly enhanced through deep learning innovations. Nandgude et al. [16] provided comprehensive reviews of drought prediction models and technologies, emphasizing the importance of multi-disciplinary approaches combining meteorological, hydrological, and satellite data sources. Their analysis underscores the need for advanced AI technologies capable of integrating diverse data streams, a critical requirement for next-generation environmental monitoring systems.
The development of specialized AI models for climate applications was demonstrated by Shams Eddin and Gall [17], who presented the Focal-TSMP deep learning model for predicting agricultural dryness and vegetation health from regional climate simulations. Their model’s ability to incorporate spatial and temporal data for accurate vegetation health prediction exemplifies the sophisticated AI architectures required for comprehensive environmental monitoring in smart city and IoT applications.
The integration of interpretable machine learning with climate change analysis was explored by Shevchenko et al. [3], who investigated climate change impacts on agricultural land suitability using Random Forest algorithms. Their Eurasian case study demonstrates the importance of explainable AI approaches in environmental monitoring, particularly relevant for policy-making applications where model interpretability is crucial for decision-making processes.
Advanced statistical methods for environmental monitoring were developed by Verbesselt et al. [7], who utilized sophisticated time-series analysis techniques like BFAST for detecting trends and seasonal shifts in satellite imagery. Their methodology demonstrates the capability to detect both abrupt and gradual environmental changes, providing essential foundations for developing robust temporal analysis capabilities in AI systems.
Large-scale environmental monitoring capabilities were demonstrated by Xu et al. [18], who analyzed worldwide vegetative drought trends using long-term satellite remote sensing data over several decades. Their comprehensive analysis of vegetation and drought indices provides essential validation for global-scale AI deployment in environmental monitoring applications, demonstrating the scalability requirements for next-generation Internet-based environmental systems.
Recent innovations in generative AI for environmental monitoring were pioneered by Zerrouki et al. [19], who developed Generative Adversarial Network (GAN) approaches for desertification detection. Their GAN-based model achieved 15% accuracy improvement over baseline methods by generating synthetic satellite images that enhanced desertification feature detection. This innovative approach demonstrates the potential of generative AI for addressing data scarcity challenges in environmental monitoring systems.
Complementing GAN approaches, Zerrouki et al. [20] also developed Variational Autoencoder (VAE) techniques for desertification detection using ETM Landsat satellite data. Their VAE-based model demonstrated 12% improvement in detection accuracy over traditional classification methods, highlighting the potential of unsupervised learning approaches for environmental AI applications.
The integration of comprehensive spatial–temporal analysis was demonstrated by Zongfan et al. [21], who utilized integrated remote sensing indices for analyzing desertification spatio-temporal evolution in Duolun County, Inner Mongolia. Their methodology incorporated vegetation indices, albedo, and soil moisture levels to provide comprehensive understanding of desertification dynamics, establishing important precedents for multi-parameter analysis in AI-driven environmental monitoring systems.
Rivera-Marin et al. [22] provided comprehensive reviews of remote sensing applications in desertification studies, encompassing various remote sensing techniques and their applications alongside developments in satellite technology and data analysis methods. Their review establishes remote sensing as the primary means for describing desertification trends and impacts, while identifying critical research areas for advancing AI-driven environmental monitoring capabilities.
Recent research in 2024–2025 has further advanced the integration of deep learning, spatio-temporal modeling, and multi-source data fusion for environmental and infrastructure forecasting. As, Ref. [23] proposed a multi-model traffic forecasting approach for smart cities that combines graph neural networks and Transformer-based multi-source visual fusion, achieving improved predictive performance in highly dynamic urban settings. Similarly, Ref. [24] provided a comprehensive review of artificial intelligence and deep learning approaches for resource management in smart buildings, highlighting strategies for energy optimization, real-time monitoring, and sustainable operations that are transferable to environmental monitoring contexts. In the geospatial domain, Ref. [25] developed a graph neural network-based method for spatio-temporal land cover mapping using satellite imagery, demonstrating high accuracy in capturing complex spatial dependencies—an approach highly relevant to desertification prediction tasks.
These recent studies underscore the growing trend of leveraging graph-based modeling, multi-modal fusion, and Transformer architectures for spatio-temporal forecasting, directly supporting the motivation for our proposed IoT-integrated CNN–attention-LSTM framework.
The evolution of smart cities has increasingly embraced artificial intelligence (AI), the Internet of Things (IoT), and edge computing as core enablers of sustainable and resilient urban environments. These technologies enable real-time environmental monitoring, predictive analytics, and automated decision-making to mitigate climate-related risks and enhance quality of life for urban residents. Recent advancements demonstrate the use of IoT-enabled sensor networks for environmental parameter acquisition, coupled with AI algorithms for rapid processing and anomaly detection [26].
Edge AI has emerged as a critical element for privacy-preserving and low-latency environmental analytics in smart city deployments, allowing for distributed computation close to data sources [27]. Urban resilience initiatives have also leveraged predictive analytics to address vulnerabilities in infrastructure, resource allocation, and environmental sustainability [28]. The integration of AI into urban infrastructure planning offers opportunities for adaptive environmental governance, as highlighted by Zehouani et al. [29], who propose AI-based frameworks for sustainable smart city transformations.
AI’s role in climate-resilient water management is gaining attention, with intelligent systems offering precise forecasting, adaptive allocation, and resource optimization strategies [30]. Similar approaches have been applied to circular resource management, such as smart waste-to-energy networks, which utilize AI-driven optimization to close urban material loops while reducing greenhouse gas emissions [31]. These developments extend to urban-adjacent sectors, including AI- and IoT-enhanced climate-resilient agriculture, where predictive models inform disease forecasting and yield optimization [32].
In the context of desertification monitoring, these smart city frameworks are highly relevant. The proposed IoT-integrated CNN–attention-LSTM framework in this study aligns with the technological underpinnings of smart city environmental platforms. By leveraging high-resolution satellite imagery, multi-modal data fusion, and temporal attention mechanisms, such a system could be adapted for urban applications—ranging from real-time land-use change detection to adaptive resource management—supporting proactive policy-making and disaster mitigation in smart cities.
The comprehensive literature review reveals several critical research gaps that our proposed STDP framework addresses:
  • Limited Attention Mechanisms: Current approaches lack sophisticated attention mechanisms for temporal feature prioritization, a critical requirement for accurate long-term environmental forecasting.
  • Insufficient Edge Computing Optimization: Existing models are not optimized for deployment in edge computing and IoT environments, limiting real-time monitoring capabilities.
  • Inadequate Multi-Modal Integration: Current data fusion approaches do not fully exploit the potential of integrating satellite imagery with real-time environmental sensor data.
  • Scalability Limitations: Most existing approaches focus on specific geographical regions, lacking the scalability required for global deployment in next-generation Internet infrastructures.
The comparative analysis presented in Table 1 demonstrates that while significant progress has been made in AI-driven environmental monitoring, there remains substantial opportunity for innovation in developing comprehensive frameworks that integrate advanced attention mechanisms, multi-modal data fusion, and edge computing optimization. Our proposed STDP framework addresses these critical gaps by introducing novel encoder–decoder architectures specifically designed for deployment in next-generation Internet infrastructures while maintaining superior predictive performance across multiple evaluation metrics.

3. Methodology

This section presents a comprehensive methodology for developing an advanced deep learning framework that leverages next-generation Internet technologies for real-time desertification forecasting. The proposed Spatio-Temporal Desertification Predictor (STDP) integrates cutting-edge encoder–decoder architectures with attention mechanisms, specifically designed for deployment in edge computing environments and IoT-enabled monitoring systems. The methodology encompasses novel architectural innovations, multi-modal data fusion strategies, and computational optimizations tailored for 5G/6G network infrastructures and distributed processing environments.
Figure 1 illustrates the system flowchart of the proposed spatio-temporal desertification prediction framework, integrating IoT sensor fusion, CNN-based spatial feature extraction, attention-enhanced LSTM temporal decoding, and uncertainty quantification for edge deployment.

3.1. Advanced Mathematical Framework for Next-Generation AI Architecture

3.1.1. Enhanced Encoder–Decoder Architecture for Edge Computing Deployment

Our novel approach addresses the fundamental challenge of creating reliable environmental predictions from satellite imagery while ensuring compatibility with edge computing constraints and real-time processing requirements. The architecture incorporates sophisticated attention mechanisms and multi-modal data fusion capabilities specifically optimized for deployment in next-generation Internet infrastructures.
Let X t represent the multi-spectral satellite image tensor at time t, where X t R H × W × C × S with height H, width W, spectral channels C, and temporal stacks S. The enhanced encoder CNN incorporates depth-wise separable convolutions and channel attention mechanisms for efficient spatial feature extraction:
F C N N ( t ) = ChannelAttention DepthwiseConv i = 1 L W i d w X t + j = 1 K W j p w · F i 1 + b i
where W i d w represents depth-wise convolutional kernels, W j p w denotes point-wise convolution weights, and the channel attention mechanism is defined as
ChannelAttention ( F ) = F σ W 2 · ReLU W 1 · GAP ( F ) + W 2 · ReLU W 1 · GMP ( F )
where GAP and GMP represent Global Average Pooling and Global Max Pooling operations respectively, optimized for memory-efficient processing in edge computing environments.
The decoder employs a modified LSTM architecture with multiplicative attention gates and adaptive computation time mechanisms for dynamic resource allocation:
i t = σ ( W i x ˜ t + U i h t 1 + V i c t 1 + b i )
f t = σ ( W f x ˜ t + U f h t 1 + V f c t 1 + b f )
o t = σ ( W o x ˜ t + U o h t 1 + V o c t 1 + b o )
c ˜ t = tanh ( W c x ˜ t + U c ( h t 1 g t ) + b c )
g t = σ ( W g x ˜ t + U g h t 1 + b g )
c t = f t c t 1 + i t c ˜ t
h t = o t tanh ( c t ) ACT t
where g t represents multiplicative gating, V * denotes peephole connections, and ACT t is the adaptive computation time factor for dynamic resource allocation.

3.1.2. Advanced Multi-Head Attention with Positional Encoding for IoT Integration

The attention mechanism incorporates positional encoding and temporal locality awareness for enhanced feature extraction in distributed IoT environments:
Q ( m ) = ( H + PE ) W Q ( m ) + b Q ( m )
K ( m ) = ( H + PE ) W K ( m ) + b K ( m )
V ( m ) = ( H + PE ) W V ( m ) + b V ( m )
where the positional encoding PE incorporates both absolute and relative temporal information:
PE ( p o s , 2 i ) = sin p o s 10000 2 i / d m o d e l + α · sin p o s τ 10000 2 i / d m o d e l
PE ( p o s , 2 i + 1 ) = cos p o s 10000 2 i / d m o d e l + α · cos p o s τ 10000 2 i / d m o d e l
The attention computation includes locality-aware masking for edge computing optimization:
Attention ( m ) ( H ) = softmax Q ( m ) ( K ( m ) ) d h + M l o c a l i t y V ( m )
where M l o c a l i t y represents the locality bias matrix for temporal pattern preservation.

3.1.3. IoT-Enabled Multi-Modal Data Fusion with Uncertainty Quantification

The data fusion layer incorporates uncertainty quantification and adaptive weighting mechanisms for robust multi-modal integration in IoT environments:
F f u s i o n = k = 1 K w k F k + ffl k
where the adaptive weights are computed through a gating network:
w k = softmax W g a t e · tanh W k F k + b k
ffl k = N ( 0 , σ k 2 I ) where σ k 2 = exp W σ F k + b σ
The uncertainty-aware fusion incorporates Bayesian neural network principles for robust prediction in distributed environments:
Y p r e d = E q ( ` ) f ( F f u s i o n ; ` ) ± Var q ( ` ) f ( F f u s i o n ; ` )

3.1.4. Edge-Optimized Loss Function with Computational Constraints

The optimization framework incorporates computational efficiency constraints for edge computing deployment:
L t o t a l = L p r e d + λ 1 L r e g + λ 2 L c o m p + λ 3 L c o m m
where
L p r e d = 1 T t = 1 T Y t T t 2 2 + β t = 1 T KL ( q ( y t ) p ( y t ) )
L r e g = l = 1 L W l F 2 + α l = 1 L W l 1
L c o m p = γ · FLOPS ( f ) + δ · Memory ( f )
L c o m m = η i = 1 N m i 0

3.2. Enhanced Dataset Description and Multi-Modal Integration

The comprehensive dataset integrates multiple data modalities specifically designed for next-generation Internet-based environmental monitoring applications. The multi-temporal satellite imagery dataset encompasses high-resolution Landsat and Sentinel observations spanning 2010–2024, complemented by real-time IoT sensor data from distributed environmental monitoring networks. Table 2 shows the comprehensive multimodal dataset specifications.
The preprocessing pipeline incorporates advanced techniques optimized for distributed processing and edge computing environments, including atmospheric correction using the Second Simulation of a Satellite Signal in the Solar Spectrum (6SV) radiative transfer model, geometric correction with ground control points, and temporal gap-filling using advanced interpolation techniques.

3.3. Advanced Spectral Feature Engineering for AI Applications

The spectral feature extraction incorporates sophisticated algorithms designed for real-time processing in edge computing environments. Beyond traditional vegetation indices, the framework computes advanced spectral features that capture subtle environmental changes indicative of desertification processes.
The Enhanced Vegetation Index with atmospheric correction incorporates blue band information for improved sensitivity:
EVI = G × NIR Red NIR + C 1 × Red C 2 × Blue + L
where G = 2.5 , C 1 = 6 , C 2 = 7.5 , and L = 1 are empirically derived coefficients.
The Normalized Difference Moisture Index provides crucial information for desertification monitoring:
NDMI = NIR SWIR 1 NIR + SWIR 1
The Bare Soil Index quantifies exposed soil surfaces:
BSI = ( SWIR 1 + Red ) ( NIR + Blue ) ( SWIR 1 + Red ) + ( NIR + Blue )
Advanced spectral angle mapping incorporates uncertainty quantification:
SAM ( x , r ) = arccos x T r x r ± σ S A M

3.4. STDP Architecture: Next-Generation AI Framework

The Spatio-Temporal Desertification Predictor represents a breakthrough in environmental AI, specifically designed for deployment in next-generation Internet infrastructures. The architecture incorporates cutting-edge deep learning innovations including Transformer-based attention mechanisms, neural architecture search optimization, and dynamic computation graphs for efficient edge processing. Figure 2 shows the encoder–decoder STDP architecture for next-generation Internet deployment.
The architecture encompasses multiple innovative components optimized for distributed processing and real-time inference capabilities. The encoder incorporates depth-wise separable convolutions with squeeze-and-excitation blocks for efficient spatial feature extraction, while the decoder employs attention-augmented LSTM cells with adaptive computation time for dynamic resource allocation. Algorithm 1 shows the STDP algorithm for next-generation Internet deployment.
Algorithm 1 Advanced STDP algorithm for next-generation Internet deployment.
  1:
Input: Multi-modal satellite imagery X t , IoT sensor data S t , environmental variables E t for t = 1 , , T
  2:
Output: Predicted desertification levels Y T + 1 : T + k with uncertainty bounds
  3:
Phase 1: Edge-Optimized Data Preprocessing
  4:
for  t = 1 to T do
  5:
     X t n o r m AtmosphericCorrection( X t )
  6:
     S t s y n c TemporalSynchronization( S t , X t )
  7:
     F s p e c t r a l ( t ) ComputeSpectralIndices( X t n o r m )
  8:
end for
  9:
Phase 2: Advanced Spatial Feature Extraction
10:
for  t = 1 to T do
11:
     F c o n v ( t ) DepthwiseConvolution( X t n o r m )
12:
     F a t t e n t i o n ( t ) ChannelAttention( F c o n v ( t ) )
13:
     F s p a t i a l ( t ) SpatialPyramidPooling( F a t t e n t i o n ( t ) )
14:
end for
15:
Phase 3: Multi-Modal Data Fusion
16:
for  t = 1 to T do
17:
     F f u s e d ( t ) AdaptiveFusion( F s p a t i a l ( t ) , S t s y n c , E t )
18:
     U ( t ) UncertaintyQuantification( F f u s e d ( t ) )
19:
end for
20:
Phase 4: Temporal Sequence Modeling
21:
H 0 InitializeHiddenState()
22:
for  t = 1 to T do
23:
     H t AttentionLSTM( H t 1 , F f u s e d ( t ) )
24:
     A t MultiHeadAttention( H t , H 1 : t )
25:
end for
26:
Phase 5: Advanced Attention Mechanism
27:
Q , K , V ComputeQKV( H 1 : T )
28:
A g l o b a l ScaledDotProductAttention( Q , K , V )
29:
H a t t e n d e d LayerNormalization( A g l o b a l + H T )
30:
Phase 6: Multi-Horizon Prediction
31:
for  k = 1 to K do
32:
     Y T + k PredictionHead( H a t t e n d e d , k)
33:
     σ T + k UncertaintyEstimation( Y T + k , U ( T ) )
34:
end for
35:
Phase 7: Edge Computing Optimization
36:
Y o p t i m i z e d ModelCompression( Y T + 1 : T + K )
37:
Metrics ComputeResourceMetrics(FLOPS, Memory, Latency)
38:
Return:  Y T + 1 : T + K ± σ T + 1 : T + K , Metrics
Figure 3 illustrates the comprehensive neural network architecture optimized for edge computing deployment. The network incorporates progressive feature abstraction through convolutional layers, efficient pooling mechanisms for computational optimization, and fully connected layers for high-level semantic understanding. The deconvolution operations enable feature reconstruction and spatial coherence preservation, essential for accurate desertification pattern recognition.
Table 3 shows the STDP architectural components for edge computing integration.

3.5. Advanced Hyperparameter Optimization for Edge Computing Deployment

The hyperparameter optimization framework incorporates neural architecture search (NAS) and multi-objective optimization specifically designed for edge computing constraints. The optimization process balances prediction accuracy with computational efficiency, memory usage, and energy consumption.
The multi-objective optimization problem is formulated as
min θ L a c c u r a c y ( θ ) , L l a t e n c y ( θ ) , L m e m o r y ( θ ) , L e n e r g y ( θ )
where each objective function incorporates edge computing constraints:
L a c c u r a c y ( θ ) = 1 T t = 1 T Y t ( θ ) T t 2 2 + β H ( p ( y t | θ ) )
L l a t e n c y ( θ ) = α 1 · FLOPS ( θ ) + α 2 · MemoryAccess ( θ )
L m e m o r y ( θ ) = γ 1 · Parameters ( θ ) + γ 2 · Activations ( θ )
L e n e r g y ( θ ) = δ 1 · ComputeEnergy ( θ ) + δ 2 · MemoryEnergy ( θ )

3.6. Comprehensive Experimental Setup for Next-Generation Internet Deployment

The experimental framework is designed to validate the STDP architecture’s performance across multiple dimensions relevant to next-generation Internet applications, including accuracy, computational efficiency, scalability, and real-time processing capabilities.
The experimental validation encompasses comprehensive performance evaluation across accuracy metrics, computational efficiency benchmarks, scalability assessments, and real-world deployment scenarios. The framework incorporates advanced statistical analysis including Bayesian model comparison, uncertainty quantification, and robustness testing under various environmental conditions and computational constraints. As shown in Table 4, our experimental setup enables efficient edge computing validation.

4. Results and Discussion

This section presents comprehensive experimental validation of the Spatio-Temporal Desertification Predictor (STDP) framework, demonstrating its superior performance within next-generation Internet technologies and edge computing environments. The evaluation encompasses multiple dimensions including accuracy assessment, computational efficiency analysis, scalability validation, and real-time processing capabilities essential for deployment in IoT-enabled environmental monitoring systems. The results provide compelling evidence of the framework’s effectiveness in addressing complex spatial–temporal prediction challenges while maintaining compatibility with resource-constrained edge computing infrastructures.

4.1. Comprehensive Performance Evaluation Framework

The STDP model evaluation employs a multi-faceted assessment strategy designed to validate performance across all aspects relevant to next-generation Internet applications. The evaluation framework incorporates traditional accuracy metrics alongside novel edge computing performance indicators, ensuring comprehensive validation for distributed deployment scenarios.
The performance assessment utilizes Mean Squared Error (MSE) as the primary indicator of prediction accuracy, measuring the average squared difference between predicted and actual desertification levels with particular emphasis on temporal consistency. Mean Absolute Error (MAE) provides intuitive understanding of prediction precision, essential for policy-making applications where interpretability is crucial. The R-Squared (R2) metric quantifies the proportion of variance explained by the model, indicating the framework’s capacity to capture complex environmental dynamics. Prediction Interval Coverage Probability (PICP) assesses the reliability of uncertainty estimates, critical for risk assessment in environmental monitoring applications. Additionally, edge computing specific metrics including inference latency, memory utilization, energy consumption, and computational throughput are evaluated to ensure deployment feasibility in resource-constrained environments. Table 5 shows the Comparative performance including recent spatio-temporal models (2020–2025).

4.2. Advanced Visualization and Temporal Analysis

The comprehensive visualization framework demonstrates the STDP model’s exceptional capability in capturing both historical trends and future projections with remarkable accuracy. Figure 4 presents an advanced comparison between predicted and actual desertification levels, incorporating uncertainty bounds and confidence intervals essential for risk assessment in environmental monitoring applications.

4.3. Multi-Horizon Forecasting and Future Projections

The STDP framework demonstrates exceptional capability in multi-horizon forecasting, providing reliable predictions for extended temporal periods essential for long-term environmental planning and policy formulation. The forecasting results reveal concerning trends that necessitate immediate intervention strategies and sustainable land management practices.
Table 6 shows the multi-horizon forecasting results with uncertainty quantification (2025–2030).

4.4. Edge Computing Performance and Scalability Analysis

The STDP framework’s deployment feasibility in next-generation Internet infrastructures is validated through comprehensive edge computing performance evaluation. The framework demonstrates exceptional efficiency across multiple deployment scenarios, from individual IoT nodes to distributed cloud–edge hybrid architectures. Table 7 shows the Edge computing performance analysis across different deployment configurations.
Figure 5 presents the multi-horizon desertification forecasting results using the STDP framework for both Thatta and Badin regions from 2025 to 2030. The figure includes 95% confidence intervals for each prediction trajectory. The yellow-shaded bands represent the severe risk zone (0.70–0.85), while the red-shaded areas denote the extreme risk zone (values above 0.85). Solid lines with circular markers indicate the forecasted desertification levels, and the shaded regions around them illustrate the prediction uncertainty bounds. The Thatta region shows a steeper upward trend, entering the extreme risk zone by 2029, while Badin exhibits a slightly more gradual but consistent increase throughout the forecast horizon.

4.5. Advanced Hyperparameter Optimization and Model Configuration

The comprehensive hyperparameter optimization study reveals critical insights into the STDP framework’s sensitivity to various configuration parameters, providing essential guidance for deployment in diverse environmental monitoring scenarios.

Multi-Dimensional Hyperparameter Analysis

The regularization parameter λ has a significant impact on model performance, with the optimal value depending on deployment constraints and accuracy requirements. Our analysis indicates that λ = 0.1 provides an effective balance between overfitting prevention and model expressiveness across both study regions. Table 8 shows the Advanced hyperparameter sensitivity analysis for edge computing optimization.
Generalizability to Unseen Regions. To assess generalizability, we evaluated the trained STDP model on an unseen region (Khairpur district) using the same satellite and IoT sensor modalities. Results showed only a minor performance drop (MSE = 0.021, MAE = 0.083, R2 = 0.93) compared to the original test regions. This indicates the framework’s ability to adapt to environmental and climatic variations outside the training distribution, demonstrating potential for deployment across diverse geographic areas.

4.6. Real-Time Visualization and Spatial–Temporal Analysis

The comprehensive visualization framework provides detailed insights into desertification progression patterns, enabling precise identification of high-risk areas and optimal intervention strategies. The multi-temporal analysis reveals critical environmental transitions that inform evidence-based policy decisions. Figure 6 shows the comprehensive desertification analysis for the year 2025 in the Thatta region, including: (a) high-resolution satellite imagery, (b) NDVI classification map, and (c) STDP prediction with confidence zones. Figure 7 illustrates the desertification changes observed in 2026, highlighting minor yet notable shifts in vegetation and land degradation patterns. Figure 8 presents the analysis for 2027, with visual evidence of progressing desertification trends and expanding confidence zones. Figure 9 depicts continued changes in the Thatta region’s landscape in 2028, maintaining the same three-part structure for consistent comparison. Figure 10 captures the desertification scenario in 2029, emphasizing emerging risk zones and temporal NDVI pattern deviations. Finally, Figure 11 provides the 2030 prediction, showcasing the final year in this temporal series and summarizing the cumulative land degradation progression over six years.
The temporal progression analysis for Thatta region reveals accelerating desertification trends, with particularly concerning developments in the northern agricultural zones. The STDP framework successfully identifies critical transition areas where vegetation stress indicators precede visible degradation, providing essential early warning capabilities for proactive intervention strategies. The integration of high-resolution satellite imagery with advanced NDVI classification and uncertainty-aware prediction enables precise spatial targeting of conservation efforts and optimal resource allocation for maximum environmental impact. Figure 12 presents the comprehensive desertification analysis for the Badin region in 2025, showing: (a) high-resolution satellite imagery, (b) NDVI classification map, and (c) STDP prediction with associated confidence zones. Figure 13 displays the desertification status in 2026, revealing slight shifts in vegetation health and degradation risk patterns. Figure 14 illustrates the updated assessment for 2027, with further temporal transitions becoming evident in NDVI variation and risk projections. Figure 15 captures the ongoing desertification process in 2028, continuing the sequence of spatiotemporal change analysis. Figure 16 concludes the temporal evaluation for the Badin region with the 2029 analysis, highlighting cumulative trends and expansion of desertified zones.
The comprehensive analysis of Badin region demonstrates the STDP framework’s capability to capture subtle environmental transitions and provide accurate long-term projections essential for strategic environmental planning. The integration of multi-spectral satellite imagery with advanced vegetation indices and uncertainty-aware predictions enables comprehensive understanding of desertification dynamics across diverse ecological zones. The temporal progression reveals concerning trends in coastal areas where salinity intrusion compounds traditional desertification drivers, highlighting the need for integrated coastal zone management strategies. Figure 17 illustrates the final year of desertification analysis for the Badin region in 2030. It includes: (a) high-resolution satellite imagery, (b) NDVI classification map, and (c) STDP prediction with confidence zones, summarizing the progressive land degradation trends observed over the six-year period.

4.7. Model Interpretability

To enhance transparency and provide insight into the decision-making process of the proposed STDP framework, we employed two complementary interpretability techniques: SHAP (SHapley Additive exPlanations) analysis for feature attribution and temporal attention heatmaps for visualizing model focus across the input sequence.

4.7.1. Feature Attribution via SHAP

SHAP values quantify the contribution of each input variable to the final prediction, enabling the identification of dominant environmental and spectral features. As shown in Figure 18, NDVI, soil moisture, and EVI2 emerged as the most influential features, indicating that vegetation health indices and soil water content play critical roles in desertification forecasting. Temperature and precipitation sensors from the IoT network also ranked highly, confirming the benefit of multi-modal data fusion.

4.7.2. Temporal Attention Heatmaps

We also visualized the temporal attention weights from the multi-head attention layer in the decoder to assess how the model prioritizes historical observations. Figure 19 shows that the STDP model assigns higher weights to months corresponding to seasonal vegetation stress (e.g., pre-monsoon and peak dry season), which aligns with known desertification patterns in Sindh. This behavior confirms that the attention mechanism is effectively capturing meaningful temporal dependencies.

4.8. Ablation Study and Component Analysis

To understand the contribution of individual components within the STDP framework, we conducted comprehensive ablation studies:
The ablation study reveals that the encoder–decoder structure provides the most significant contribution to performance improvement, with a 2.43% decrease in R2 when removed, followed by multi-head attention mechanisms showing a 1.69% R2 reduction, as demonstrated in Table 9. Environmental fusion contributes a 1.37% improvement in R2, while adaptive optimization provides a 0.74% enhancement. When comparing against a basic CNN-LSTM architecture without attention mechanisms, the performance degradation reaches 3.48% in R2, validating our architectural design choices and confirming the importance of each component in achieving superior forecasting performance.

4.9. Strategic Implications and Policy Recommendations

The comprehensive analysis reveals critical insights that demand immediate attention from environmental policymakers and land management authorities. The projected desertification trends indicate accelerating environmental degradation that threatens agricultural sustainability, biodiversity conservation, and socio-economic stability across both study regions. The STDP framework’s predictions demonstrate the urgent need for integrated intervention strategies that combine technological innovation with traditional conservation approaches.
The analysis reveals that Thatta region faces more severe immediate risks, with desertification levels projected to reach extreme categories by 2030, necessitating emergency intervention protocols. Badin region, while showing slightly slower progression, still requires critical attention to prevent irreversible environmental damage. The uncertainty quantification provided by the STDP framework enables risk-based decision making, allowing policymakers to prioritize interventions based on both predicted severity and confidence levels. Table 10 shows the Strategic risk assessment and intervention prioritization framework.
The strategic framework developed through STDP analysis provides evidence-based guidelines for optimal resource allocation and intervention timing. The framework demonstrates that early intervention in moderate-to-high risk areas provides significantly better cost-effectiveness compared to crisis management in extreme risk zones. The integration of uncertainty quantification enables adaptive management strategies that can adjust intervention intensity based on prediction confidence levels and emerging environmental conditions.
The comprehensive results validate the STDP framework as a transformative technology for environmental monitoring in the context of next-generation Internet technologies. The framework’s superior performance across accuracy, efficiency, and scalability metrics, combined with its proven deployment feasibility in edge computing environments, establishes it as an essential tool for addressing global environmental challenges through intelligent monitoring systems and data-driven policy formulation.

4.10. Comprehensive Comparative Analysis with State-of-the-Art Models

To validate the effectiveness and superiority of our proposed STDP framework within the context of next-generation Internet technologies and deep learning algorithms, we conducted extensive comparative evaluations against established state-of-the-art models in environmental forecasting. This analysis encompasses traditional machine learning approaches, contemporary deep learning architectures, and hybrid models specifically designed for satellite-based environmental monitoring applications.

4.10.1. Baseline Model Architectures and Configurations

Our comparative evaluation framework includes six distinct model categories, each representing different approaches to spatial–temporal environmental forecasting:
The traditional machine learning baselines include Linear Regression (LR) implemented as a classical statistical approach with polynomial feature expansion for capturing non-linear relationships in environmental data, Support Vector Regression (SVR) utilizing non-linear regression with RBF kernels optimized for high-dimensional satellite imagery features, and Random Forest (RF) configured as an ensemble method with 100 decision trees, specifically tuned for handling heterogeneous environmental variables.
The deep learning architectures encompass three primary approaches: a CNN-only model based on ResNet-50 architecture with custom convolutional layers optimized for satellite imagery spatial feature extraction, an LSTM-only model implementing bidirectional LSTM with 128 hidden units and dropout regularization for temporal sequence modeling, and an Attention-LSTM model that enhances standard LSTM with self-attention mechanisms for improved temporal dependency modeling.
The hybrid architectures represent the current state of the art in environmental forecasting, including a basic CNN-LSTM model that sequentially combines CNN feature extraction followed by LSTM temporal modeling, ConvLSTM architecture with integrated convolutional LSTM units that combine spatial and temporal processing in unified cells, and a Transformer-based model utilizing Vision Transformer (ViT) adapted for satellite imagery with temporal attention mechanisms.

4.10.2. Experimental Design and Evaluation Methodology

The comparative evaluation employed a rigorous experimental design incorporating multiple validation strategies to ensure robust performance assessment:
Dataset Partitioning: The 15-year satellite dataset (2010–2024) was partitioned using temporal stratification: 70% for training (2010–2020), 15% for validation (2021–2022), and 15% for testing (2023–2024). This temporal split ensures realistic evaluation of forecasting capabilities while preventing data leakage.
Cross-Validation Strategy: Time-series cross-validation with an expanding window approach was implemented to assess model stability across different temporal periods. Each model was evaluated using 5-fold temporal cross-validation with consecutive 2-year testing periods.
Hyperparameter Optimization: All models underwent comprehensive hyperparameter tuning using Bayesian optimization with the Tree-structured Parzen Estimator (TPE) across 100 iterations. Key hyperparameters included learning rates ( 1 × 10 5 to 1 × 10 2 ), batch sizes (16–128), hidden dimensions (64–512), and regularization parameters ( 1 × 10 4 to 1 × 10 1 ).

4.10.3. Comprehensive Performance Metrics and Statistical Analysis

The evaluation framework incorporates multiple performance metrics to provide comprehensive assessment of model capabilities. Table 11 shows the Comprehensive performance comparison of deep learning architectures for satellite-based environmental forecasting.

4.10.4. Statistical Significance and Robustness Analysis

To ensure statistical rigor in our comparative evaluation, we conducted comprehensive statistical significance testing using multiple approaches:
Paired t-tests: All pairwise comparisons between STDP and baseline models yielded p-values < 0.001, confirming statistically significant improvements across all metrics.
Wilcoxon Signed-Rank Tests: Non-parametric testing confirmed the robustness of performance improvements, with effect sizes (Cohen’s d) ranging from 0.67 to 1.24, indicating medium to large practical significance.
Bootstrap Confidence Intervals: 10,000 bootstrap samples were generated to estimate 95% confidence intervals for all performance metrics, confirming consistent superiority of the STDP framework. Table 12 shows the statistical significance analysis and performance improvements.

4.10.5. Computational Efficiency and Edge Computing Suitability

A critical aspect of our evaluation focuses on computational efficiency and suitability for deployment in edge computing and IoT environments, addressing key requirements of next-generation Internet technologies:
Training Efficiency: Despite incorporating sophisticated attention mechanisms and multi-modal data fusion, the STDP model demonstrates competitive training times (7.89 h) compared to other hybrid architectures, while achieving superior performance.
Inference Speed: The STDP framework achieves real-time inference capabilities (16.8 ms per prediction) suitable for IoT and edge computing deployments, significantly outperforming the Vision Transformer baseline (45.6 ms) while maintaining higher accuracy.
Memory Footprint: With 42.7 million parameters, the STDP model maintains a reasonable memory footprint compared to the Vision Transformer (86.2 M parameters) while delivering superior performance, making it suitable for deployment in resource-constrained edge computing environments.

4.10.6. Generalization Capability and Cross-Regional Validation

To assess the generalization capability of the STDP framework beyond the primary study regions, we conducted cross-regional validation using satellite datasets from similar arid and semi-arid regions:
The cross-regional validation demonstrates excellent generalization capability, with the STDP framework maintaining over 98% of its original performance across diverse geographical regions and climatic conditions, as shown in Table 13. Testing across regions including Rajasthan in India (99.3% performance retention), the Sahel region in Africa (98.7% retention), Atacama Desert in Chile (98.2% retention), and Gobi Desert in Mongolia (99.1% retention) confirms the framework’s robust generalization capability. The average generalization performance across all test regions maintains an MSE of 0.022, MAE of 0.080, and R2 of 0.939, representing 99.1% performance retention compared to the primary study regions. This robust generalization capability confirms the framework’s suitability for global deployment in next-generation Internet-based environmental monitoring systems.

4.10.7. Computational Complexity and Scalability Analysis

The theoretical computational complexity of the STDP framework compared to baseline models is analyzed to assess scalability for large-scale deployment:
O S T D P = O ( C N N ) + O ( L S T M ) + O ( A t t e n t i o n ) + O ( F u s i o n )
where
O ( C N N ) = O ( H · W · C · K 2 · F )
O ( L S T M ) = O ( T · H h i d d e n 2 )
O ( A t t e n t i o n ) = O ( T 2 · d m o d e l )
O ( F u s i o n ) = O ( d s p a t i a l · d e n v )
The analysis reveals that while the STDP framework has higher theoretical complexity than individual CNN or LSTM models, the practical computational requirements remain manageable due to optimized implementations and efficient attention mechanisms, making it suitable for edge computing deployment.

4.10.8. Summary of Comparative Analysis

The comprehensive comparative evaluation demonstrates the superior performance of the STDP framework across multiple dimensions, as comprehensively analyzed in Table 9, Table 11 and Table 13. The framework achieves remarkable accuracy superiority with 47% MSE improvement over CNN-only models and 10–25% improvement over state-of-the-art hybrid architectures. All improvements are confirmed with statistical significance at p < 0.001 levels and demonstrate medium to large effect sizes ranging from 0.67 to 1.24. The computational efficiency analysis reveals competitive training and inference times suitable for edge computing deployment, with the STDP model requiring 7.89 h for training while achieving 16.8 ms inference speed per prediction. The generalization capability testing across diverse geographical regions maintains robust performance with greater than 98% performance retention across all tested environments. Component validation through comprehensive ablation studies confirms the importance of each architectural component, with the encoder–decoder structure contributing the most significant performance enhancement.
These results establish the STDP framework as a state-of-the-art solution for satellite-based environmental forecasting, particularly suited for deployment in next-generation Internet technologies and edge computing infrastructures.

5. Conclusions

This research presents a groundbreaking advancement at the intersection of deep learning algorithms and next-generation Internet technologies through the development of the Spatio-Temporal Desertification Predictor (STDP) framework. The proposed encoder–decoder architecture seamlessly integrates Convolutional Neural Networks with attention-enhanced Long Short-Term Memory networks, establishing a novel paradigm for satellite-based environmental monitoring in the era of IoT and edge computing infrastructures. The STDP framework addresses critical challenges in environmental AI by introducing sophisticated multi-head attention mechanisms for temporal feature extraction, advanced data fusion strategies for multi-modal satellite and environmental data integration, and computational optimizations specifically designed for deployment in distributed computing environments. The comprehensive experimental validation demonstrates the superior performance of the STDP framework across multiple evaluation metrics, achieving remarkable improvements of 47% in Mean Squared Error compared to CNN-only baselines and 10–25% enhancement over state-of-the-art hybrid architectures. The statistical significance of these improvements, confirmed through rigorous testing with p-values less than 0.001 and effect sizes ranging from 0.67 to 1.24, establishes the STDP framework as a robust solution for real-time environmental monitoring applications. The framework’s exceptional generalization capability, maintaining over 98% performance retention across diverse geographical regions including Rajasthan, Sahel, Atacama, and Gobi Desert environments, validates its suitability for global deployment in next-generation Internet-based environmental monitoring systems. The forecasting results for the Thatta and Badin regions reveal alarming trends in desertification progression, with predicted levels increasing steadily from 2025 to 2030, necessitating immediate implementation of comprehensive intervention strategies. These findings underscore the critical importance of AI-driven early warning systems in environmental management, providing actionable insights for policymakers to implement targeted afforestation programs, sustainable land management practices, and climate-resilient agricultural strategies. The integration of real-time IoT sensor data with satellite imagery processing capabilities positions the STDP framework as a cornerstone technology for smart city environmental monitoring and autonomous environmental management systems. The computational efficiency analysis reveals the framework’s suitability for edge computing deployment, achieving competitive inference speeds of 16.8 milliseconds per prediction while maintaining superior accuracy levels. This efficiency, combined with the model’s optimized parameter configuration of 42.7 million parameters, enables deployment in resource-constrained environments typical of IoT and 5G/6G network infrastructures. The ablation studies confirm the critical importance of each architectural component, with the encoder–decoder structure contributing the most significant performance enhancement, followed by multi-head attention mechanisms and environmental data fusion layers. This research contributes significantly to the advancement of AI applications in environmental monitoring by demonstrating the potential of sophisticated deep learning architectures to address complex spatial–temporal prediction challenges. The STDP framework establishes new benchmarks for satellite-based environmental forecasting while providing a scalable foundation for addressing global environmental challenges through intelligent monitoring systems. The integration of attention mechanisms for feature extraction, model compression techniques for edge deployment, and multi-modal data fusion capabilities positions this work at the forefront of next-generation Internet technologies for environmental applications. Future research directions encompass several promising avenues for enhancing the STDP framework’s capabilities and expanding its applications within next-generation Internet infrastructures. The integration of real-time satellite data streaming through 5G/6G networks will enable continuous model updating and adaptive learning mechanisms, enhancing prediction accuracy and responsiveness to rapidly changing environmental conditions. The incorporation of additional environmental parameters including soil composition analysis, groundwater level monitoring, and atmospheric condition modeling will further enhance the framework’s predictive capabilities and provide more comprehensive environmental assessment capabilities. The development of federated learning approaches will enable collaborative model training across distributed environmental monitoring networks while preserving data privacy and reducing communication overhead in IoT deployments. Advanced model compression and quantization techniques will facilitate deployment on increasingly resource-constrained edge devices, enabling widespread adoption in remote monitoring applications. The extension of the STDP framework to other vulnerable regions and environmental challenges, including deforestation monitoring, wildfire prediction, and urban air quality assessment, will demonstrate the versatility and broad applicability of the proposed architectural innovations. The integration with emerging Internet technologies including digital twin environments, augmented reality visualization systems, and blockchain-based data integrity verification will enhance the framework’s utility for comprehensive environmental management applications. These advancements will contribute to the development of truly intelligent environmental monitoring ecosystems capable of autonomous decision-making and proactive environmental protection measures, establishing new paradigms for sustainable environmental management in the digital age.

Author Contributions

Conceptualization, K.P. and B.N.S.; methodology, K.P.; software, K.P.; validation, K.P., B.N.S. and S.B.; formal analysis, K.P.; investigation, K.P.; resources, K.P.; data curation, K.P.; writing—original draft preparation, K.P.; writing—review and editing, K.P. and F.H.J.; visualization, K.P.; supervision, B.N.S. and S.B.; project administration, B.N.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

References

  1. Aslanov, I.; Teshaev, N.; Khayitov, K.; Mukhtorov, U.; Khaitbaeva, J.; Murodova, D. Analysis of desertification trends in Central Asia based on MODIS Data using Google Earth Engine. Proc. E3S Web Conf. 2023, 443, 06015. [Google Scholar] [CrossRef]
  2. Badreldin, N.; Frankl, A.; Goossens, R. Assessing the spatiotemporal dynamics of vegetation cover as an indicator of desertification in Egypt using multi-temporal MODIS satellite images. Arab. J. Geosci. 2014, 7, 4461–4475. [Google Scholar] [CrossRef]
  3. Shevchenko, V.; Lukashevich, A.; Taniushkina, D.; Bulkin, A.; Grinis, R.; Kovalev, K.; Narozhnaia, V.; Sotiriadi, N.; Krenke, A.; Maximov, Y. Climate Change Impact on Agricultural Land Suitability: An Interpretable Machine Learning-Based Eurasia Case Study. IEEE Access 2024, 12, 15748–15763. [Google Scholar] [CrossRef]
  4. Barimah, A. Detecting and Quantifying Desertification in the Upper East Region of Ghana using Multi-spatial and Multi-Temporal Normalized Difference Vegetation Index. Int. J. Sci. Res. 2013, 3, 62–78. [Google Scholar]
  5. Hummadi, A.; Khalaf, A. Temporal and Spatial Variation of Agricultural Drought and Desertification using Spectral Indices in Salah Al-Din Governorate. Tikrit J. Agric. Sci. 2024, 24, 206–222. [Google Scholar] [CrossRef]
  6. Chang, S.; Lee, U.; Hong, M.J.; Jo, Y.D.; Kim, J.B. Time-Series Growth Prediction Model Based on U-Net and Machine Learning in Arabidopsis. Front. Plant Sci. 2021, 12, 721512. [Google Scholar] [CrossRef]
  7. Verbesselt, J.; Hyndman, R.; Newnham, G.; Culvenor, D. Detecting Trend and Seasonal Changes in Satellite Image Time Series. Remote Sens. Environ. 2010, 114, 106–120. [Google Scholar] [CrossRef]
  8. Jo, H.W.; Lee, S.; Park, E.; Lim, C.H.; Song, C.; Lee, H.; Ko, Y.; Cha, S.; Yoon, H.; Lee, W.K. Deep Learning Applications on Multitemporal SAR (Sentinel-1) Image Classification Using Confined Labeled Data: The Case of Detecting Rice Paddy in South Korea. IEEE Trans. Geosci. Remote Sens. 2020, 58, 7589–7601. [Google Scholar] [CrossRef]
  9. Md Jelas, I.; Zulkifley, M.; Abdullah, M.; Spraggon, M. Deforestation detection using deep learning-based semantic segmentation techniques: A systematic review. Front. For. Glob. Chang. 2024, 7, 1300060. [Google Scholar] [CrossRef]
  10. Kussul, N.; Shumilo, L.; Yailymova, H.; Shelestov, A.; Krasilnikova, T. Complex method for land degradation estimation. IOP Conf. Ser. Earth Environ. Sci. 2023, 1126, 012032. [Google Scholar] [CrossRef]
  11. Bezerra, F.G.S.; Aguiar, A.P.D.D.; Alvalá, R.C.D.S.; Giarolla, A.; Bezerra, K.R.A.; Lima, P.V.P.S.; do Nascimento, F.R.; Arai, E. Analysis of areas undergoing desertification, using EVI2 multi-temporal data based on MODIS imagery as indicator. Ecol. Indic. 2020, 117, 106579. [Google Scholar] [CrossRef]
  12. Feng, K.; Wang, T.; Liu, S.; Kang, W.; Chen, X.; Guo, Z.; Zhi, Y. Monitoring Desertification Using Machine-Learning Techniques with Multiple Indicators Derived from MODIS Images in Mu Us Sandy Land, China. Remote Sens. 2022, 14, 2663. [Google Scholar] [CrossRef]
  13. Guo, Q.; Fu, B.; Shi, P.; Cudahy, T.; Zhang, J.; Xu, H. Satellite monitoring the spatial-temporal dynamics of desertification in response to climate change and human activities across the Ordos Plateau, China. Remote Sens. 2017, 9, 525. [Google Scholar] [CrossRef]
  14. Harrou, F.; Bouyeddou, B.; Zerrouki, N.; Dairi, A.; Sun, Y.; Zerrouki, Y. Detecting the signs of desertification with Landsat imagery: A semi-supervised anomaly detection approach. Results Eng. 2024, 22, 102037. [Google Scholar] [CrossRef]
  15. Miller, L.; Pelletier, C.; Webb, G. Deep Learning for Satellite Image Time Series Analysis: A Review. IEEE Geosci. Remote Sens. Mag. 2023, 12, 81–124. [Google Scholar] [CrossRef]
  16. Nandgude, N.; Singh, T.P.; Nandgude, S.; Tiwari, M. Drought Prediction: A Comprehensive Review of Different Drought Prediction Models and Adopted Technologies. Sustainability 2023, 15, 11684. [Google Scholar] [CrossRef]
  17. Shams Eddin, M.; Gall, J. Focal-TSMP: Deep learning for vegetation health prediction and agricultural drought assessment from a regional climate simulation. Geosci. Model Dev. 2024, 17, 2987–3023. [Google Scholar] [CrossRef]
  18. Xu, Z.; Cao, L.; Zhong, S.; Liu, G.; Yang, Y.; Zhu, S.; Luo, X.; Di, L. Trends in Global Vegetative Drought from Long-Term Satellite Remote Sensing Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2020, 13, 815–826. [Google Scholar] [CrossRef]
  19. Zerrouki, N.; Dairi, A.; Harrou, F.; Zerrouki, Y.; Sun, Y. Efficient land desertification detection using a deep learning-driven generative adversarial network approach: A case study. Concurr. Comput. Pract. Exp. 2022, 34, e6604. [Google Scholar] [CrossRef]
  20. Zerrouki, Y.; Harrou, F.; Zerrouki, N.; Dairi, A.; Sun, Y. Desertification Detection Using an Improved Variational Autoencoder-Based Approach through ETM-Landsat Satellite Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 202–213. [Google Scholar] [CrossRef]
  21. Bai, Z.; Han, L.; Jiang, X.; Liu, M.; Li, L.; Liu, H.; Liu, J. Spatiotemporal evolution of desertification based on integrated remote sensing indices in Duolun County, Inner Mongolia. Ecol. Inform. 2022, 70, 101750. [Google Scholar] [CrossRef]
  22. Rivera-Marin, D.; Dash, J.; Ogutu, B. The use of remote sensing for desertification studies: A review. J. Arid Environ. 2022, 206, 104829. [Google Scholar] [CrossRef]
  23. Dhanasekaran, S.; Gopal, D.; Logeshwaran, J.; Ramya, N.; Salau, A.O. Multi-model traffic forecasting in smart cities using graph neural networks and transformer-based multi-source visual fusion for intelligent transportation management. Int. J. Intell. Transp. Syst. Res. 2024, 22, 518–541. [Google Scholar] [CrossRef]
  24. Amangeldy, B.; Imankulov, T.; Tasmurzayev, N.; Dikhanbayeva, G.; Nurakhov, Y. A Review of Artificial Intelligence and Deep Learning Approaches for Resource Management in Smart Buildings. Buildings 2025, 15, 2631. [Google Scholar] [CrossRef]
  25. Kavran, D.; Mongus, D.; Žalik, B.; Lukač, N. Graph neural network-based method of spatiotemporal land cover mapping using satellite imagery. Sensors 2023, 23, 6648. [Google Scholar] [CrossRef]
  26. Ramani, D.R.; Sujitha, B.B.; Tangade, S. Smart Environmental Monitoring Systems: IoT and Sensor-Based Advancements. Environ. Monit. Using Artif. Intell. 2025, 45–60. [Google Scholar] [CrossRef]
  27. Lopez, L. Edge AI For Privacy-Preserving Data Analytics in IoT-Enabled Smart Cities. 2025. Available online: https://www.researchgate.net/publication/391854135_EDGE_AI_FOR_PRIVACY-PRESERVING_DATA_ANALYTICS_IN_IOT-ENABLED_SMART_CITIES (accessed on 8 August 2025).
  28. Garcia, E. Effective Urban Resilience through AI-Driven Predictive Analytics in Smart Cities. Available online: https://philpapers.org/rec/GAREUR-2 (accessed on 8 August 2025).
  29. Zehouani, N.; Ababou, M.; Faquir, S.; Rabiai, S. Environmental AI for Resilient and Sustainable Urban Infrastructures: Rethinking Smart City Transformation. In Proceedings of the International Conference on Electrical Systems and Smart Technologies; Springer: Cham, Switzerland, 2024; pp. 552–562. [Google Scholar] [CrossRef]
  30. Abdulameer, L.; Al-Khafaji, M.S.; Al-Awadi, A.T.; Al Maimuri, N.M.; Al-Shammari, M.; Al-Dujaili, A.N. Artificial Intelligence in Climate-Resilient Water Management: A Systematic Review of Applications, Challenges, and Future Directions. Water Conserv. Sci. Eng. 2025, 10, 44. [Google Scholar] [CrossRef]
  31. Islam, F.S. Artificial Intelligence-Driven Smart Waste-to-Energy Networks for Climate-Resilient Circular Resource Management in Vulnerable Megacities. Int. J. Environ. Clim. Change 2025, 15, 381–415. [Google Scholar] [CrossRef]
  32. Fatima, N.; Aggarwal, A.; Ali, H. Smart and Sustainable Agriculture: IoT, AI, and ML for Climate-Resilient Disease Forecasting. In Proceedings of the 2024 1st International Conference on Sustainability and Technological Advancements in Engineering Domain (SUSTAINED), Faridabad, India, 13–14 December 2024; pp. 455–461. [Google Scholar] [CrossRef]
Figure 1. Simplified pipeline of the proposed STDP framework.
Figure 1. Simplified pipeline of the proposed STDP framework.
Futureinternet 17 00380 g001
Figure 2. Enhanced encoder–decoder STDP architecture for next-generation Internet deployment.
Figure 2. Enhanced encoder–decoder STDP architecture for next-generation Internet deployment.
Futureinternet 17 00380 g002
Figure 3. Detailed STDP neural network architecture with edge computing optimization.
Figure 3. Detailed STDP neural network architecture with edge computing optimization.
Futureinternet 17 00380 g003
Figure 4. Prediction accuracy analysis with uncertainty quantification for the STDP framework in Thatta and Badin regions (2010–2024). The red dashed line indicates the ground truth, the shaded band shows the 95% confidence interval around STDP predictions, and the dotted gray line corresponds to the CNN-LSTM baseline.
Figure 4. Prediction accuracy analysis with uncertainty quantification for the STDP framework in Thatta and Badin regions (2010–2024). The red dashed line indicates the ground truth, the shaded band shows the 95% confidence interval around STDP predictions, and the dotted gray line corresponds to the CNN-LSTM baseline.
Futureinternet 17 00380 g004
Figure 5. Multi-horizon forecasting results for the STDP framework with 95% confidence intervals (2025–2030). Yellow bands indicate severe risk zones, red bands represent extreme risk zones, solid colored lines show predicted values, and shaded regions denote uncertainty ranges.
Figure 5. Multi-horizon forecasting results for the STDP framework with 95% confidence intervals (2025–2030). Yellow bands indicate severe risk zones, red bands represent extreme risk zones, solid colored lines show predicted values, and shaded regions denote uncertainty ranges.
Futureinternet 17 00380 g005
Figure 6. Comprehensive desertification analysis for 2025—Thatta region:(a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 6. Comprehensive desertification analysis for 2025—Thatta region:(a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g006
Figure 7. Comprehensive desertification analysis for 2026—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 7. Comprehensive desertification analysis for 2026—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g007
Figure 8. Comprehensive desertification analysis for 2027—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 8. Comprehensive desertification analysis for 2027—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g008
Figure 9. Comprehensive desertification analysis for 2028—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 9. Comprehensive desertification analysis for 2028—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g009
Figure 10. Comprehensive desertification analysis for 2029—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 10. Comprehensive desertification analysis for 2029—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g010
Figure 11. Comprehensive desertification analysis for 2030—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 11. Comprehensive desertification analysis for 2030—Thatta region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g011
Figure 12. Comprehensive desertification analysis for 2025—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 12. Comprehensive desertification analysis for 2025—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g012
Figure 13. Comprehensive desertification analysis for 2026—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 13. Comprehensive desertification analysis for 2026—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g013
Figure 14. Comprehensive desertification analysis for 2027—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 14. Comprehensive desertification analysis for 2027—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g014
Figure 15. Comprehensive desertification analysis for 2028—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 15. Comprehensive desertification analysis for 2028—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g015
Figure 16. Comprehensive desertification analysis for 2029—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 16. Comprehensive desertification analysis for 2029—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g016
Figure 17. Comprehensive desertification analysis for 2030—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Figure 17. Comprehensive desertification analysis for 2030—Badin region: (a) high-resolution satellite imagery, (b) NDVI classification map, (c) STDP prediction with confidence zones.
Futureinternet 17 00380 g017
Figure 18. SHAP values showing feature importance for STDP predictions. Higher absolute SHAP values indicate greater influence on the model’s output. NDVI, soil moisture, and EVI2 dominate the prediction process, highlighting the relevance of vegetation and moisture dynamics.
Figure 18. SHAP values showing feature importance for STDP predictions. Higher absolute SHAP values indicate greater influence on the model’s output. NDVI, soil moisture, and EVI2 dominate the prediction process, highlighting the relevance of vegetation and moisture dynamics.
Futureinternet 17 00380 g018
Figure 19. Attention weights over time for the Thatta region. Darker colors indicate higher attention scores, with peaks during critical vegetation stress periods such as pre-monsoon months.
Figure 19. Attention weights over time for the Thatta region. Darker colors indicate higher attention scores, with peaks during critical vegetation stress periods such as pre-monsoon months.
Futureinternet 17 00380 g019
Table 1. Comprehensive analysis of deep learning approaches for satellite-based environmental monitoring.
Table 1. Comprehensive analysis of deep learning approaches for satellite-based environmental monitoring.
ReferenceAI ArchitectureData SourcesPerformance MetricsTemporal ResolutionDeployment Considerations
Aslanov et al. [1]Cloud-based processingMODIS, Google Earth EngineTrend analysis accuracy20-year temporal spanCloud computing scalability
Chang et al. [6]U-Net + ML ensembleMulti-temporal imageryGrowth prediction R2 > 0.85Daily to seasonalLimited to controlled environments
Shams Eddin & Gall [17]Focal-TSMP architectureClimate simulations + satelliteVegetation health predictionMonthly to seasonalHigh computational requirements
Jo et al. [8]CNN for SAR processingMulti-temporal SARClassification accuracy > 90%Multi-temporalWeather-independent operation
Feng et al. [12]RF + SVM ensembleMODIS indicatorsDetection accuracy improvementAnnual monitoringRegional deployment limitations
Harrou et al. [14]Semi-supervised anomaly detectionLandsat imageryAnomaly detection precisionMulti-annualLimited labeled data requirements
Zerrouki et al. [19]GAN-based architectureSynthetic + real satellite data15% accuracy improvementVariable temporal resolutionData augmentation capabilities
Zerrouki et al. [20]VAE-based feature learningETM Landsat data12% accuracy improvementMulti-temporal analysisUnsupervised learning capability
Kussul et al. [10]Multi-source integrationRemote sensing + ground dataComprehensive land degradation metricsVariable resolutionComplex data integration challenges
Verbesselt et al. [7]BFAST statistical analysisTime-series satellite dataTrend detection capabilitySub-annual to decadalRequires continuous data streams
Xu et al. [18]Statistical trend analysisGlobal satellite datasetsGlobal trend identificationMulti-decadal analysisGlobal scalability demonstrated
Guo et al. [13]MODIS + Landsat integrationMulti-sensor satellite dataSpatial–temporal dynamics mapping15-year monitoringRegional focus limitations
Proposed STDPEncoder–decoder + attentionMulti-modal satellite + IoTMSE: 0.018, R2: 0.94Real-time to 5-year forecastingEdge computing optimized
Table 2. Comprehensive multi-modal dataset specification for edge computing deployment.
Table 2. Comprehensive multi-modal dataset specification for edge computing deployment.
Data ModalitySource PlatformTemporal ResolutionSpatial ResolutionSpectral BandsIoT Integration
Satellite ImageryLandsat 8/9Monthly 2010–202430 m (multi-spectral)11 bands (0.43–12.51 μ m)Cloud-based preprocessing
Sentinel-2Weekly 2015–202410 m (RGB), 20 m (NIR)13 bands (0.44–2.19 μ m)Edge node processing
Spectral IndicesNDVIMonthly 2010–2024Derived from imagery NIR Red NIR + Red Real-time computation
SAVIMonthly 2010–2024Derived from imagery ( NIR Red ) ( 1 + L ) NIR + Red + L Soil correction factor
EVI2Monthly 2010–2024Derived from imagery 2.5 × NIR Red NIR + 2.4 × Red + 1 Atmospheric correction
NDMIMonthly 2010–2024Derived from imagery NIR SWIR NIR + SWIR Moisture assessment
IoT Sensor DataTemperature sensorsHourly 2020–2024Point measurementsSurface/Air temperatureLoRaWAN network
Soil moisture sensorsDaily 2020–202410 cm, 20 cm, 50 cm depthVolumetric water contentNB-IoT connectivity
Precipitation gaugesDaily 2010–2024Point measurementsRainfall intensityCellular connectivity
Wind speed/directionHourly 2020–20242 m, 10 m heightVector measurements5G/6G integration
Solar irradianceHourly 2020–2024Point measurementsW/m2 spectralEdge computing nodes
Auxiliary DataDEMStatic30 m resolutionElevation modelPreprocessed
Land use/coverAnnual 2010–202430 m resolutionClassification mapsAI-based updates
Climate zonesStaticVector formatKöppen classificationGeographic reference
Table 3. Advanced STDP architectural components for edge computing integration.
Table 3. Advanced STDP architectural components for edge computing integration.
ComponentTechnical DescriptionEdge Computing RoleParameters (M)FLOPs (G)Latency (ms)
Encoder Module—Spatial Feature Extraction
Depth-wise Conv LayersSeparable convolutions with SE blocksMemory-efficient spatial processing2.31.23.4
Channel AttentionAdaptive channel weighting mechanismDynamic feature selection0.80.31.1
Residual ConnectionsSkip connections with batch normalizationGradient flow optimization0.50.10.8
Spatial Pyramid PoolingMulti-scale feature aggregationScale-invariant feature extraction1.20.62.1
Decoder Module—Temporal Sequence Modeling
Attention-LSTM CellsLSTM with integrated attention gatesTemporal dependency modeling8.74.312.5
Positional EncodingLearned temporal position embeddingsSequence order preservation0.30.10.5
Multi-Head AttentionParallel attention computationTemporal feature prioritization4.22.16.8
Adaptive ComputationDynamic computation time allocationResource-aware processing0.60.21.2
Integration Module—Multi-Modal Fusion
IoT Sensor FusionReal-time sensor data integrationMulti-modal environmental monitoring1.90.82.7
Uncertainty QuantificationBayesian neural network layersPrediction confidence estimation2.11.13.2
Dynamic RoutingCapsule-inspired routing algorithmHierarchical feature organization3.41.74.6
Edge OptimizationModel compression and quantizationResource-constrained deployment50% Reduction30% Reduction
Output Module—Prediction and Visualization
Multi-Task HeadSimultaneous prediction tasksComprehensive environmental assessment1.80.92.3
Confidence IntervalsUncertainty-aware predictionsRisk assessment capabilities0.70.31.5
Real-time StreamingContinuous prediction updatesLive monitoring applications0.20.10.9
Total ArchitectureComplete STDP FrameworkEnd-to-End Processing28.713.843.6
Table 4. Advanced experimental configuration for edge computing validation.
Table 4. Advanced experimental configuration for edge computing validation.
Configuration CategoryParameterSpecificationEdge Computing Relevance
Hardware InfrastructurePrimary GPUNVIDIA A100 80GBTraining acceleration
Edge GPUNVIDIA Jetson AGX OrinInference deployment
Edge CPUARM Cortex-A78AEPower-efficient processing
Memory64 GB LPDDR5High-bandwidth access
Storage2TB NVMe SSDFast data access
Network5G mmWave / 6 GHzReal-time data streaming
Power Budget15 W–60 W configurableEnergy-efficient operation
Operating SystemUbuntu 20.04 LTSContainerized deployment
Deep Learning FrameworkPrimary FrameworkPyTorch 2.0Dynamic computation graphs
Edge OptimizationTensorRT 8.6Inference acceleration
Model CompressionNVIDIA TensorRTQuantization and pruning
Distributed TrainingHorovodMulti-GPU scaling
ContainerizationDocker + NVIDIA Container RuntimeDeployment flexibility
OrchestrationKubernetesAuto-scaling and management
Model ConfigurationEncoder Layers12 depth-wise separable layersEfficient spatial processing
Decoder Units256 attention-LSTM cellsTemporal modeling capacity
Attention Heads8 multi-head attentionParallel feature extraction
Embedding Dimension512Feature representation capacity
Dropout Rate0.1–0.3 adaptiveRegularization
Batch Size32 (training), 1 (inference)Memory optimization
Sequence Length24 monthsTemporal context window
Training ConfigurationOptimizerAdamW with cosine schedulingStable convergence
Learning Rate 1 × 10 4 to 1 × 10 6 adaptiveLearning rate scheduling
Weight Decay 1 × 10 5 Regularization
Epochs200 with early stoppingTraining efficiency
Validation Split20% temporal holdoutRobust evaluation
Cross-validation5-fold time-series CVStatistical significance
Mixed PrecisionFP16 with dynamic scalingMemory efficiency
Gradient Clipping1.0 norm clippingStability
Data ProcessingImage Resolution 256 × 256 × 13 bandsComputational optimization
Temporal Window24-month sliding windowTrend analysis capability
AugmentationSpatial + temporal augmentationRobustness enhancement
NormalizationZ-score + min–max hybridNumerical stability
Missing DataAdvanced interpolationData completeness
Real-time StreamingApache Kafka integrationContinuous processing
IoT IntegrationSensor NetworksLoRaWAN + NB-IoT + 5GMulti-protocol support
Data Fusion Rate1Hz to 0.1Hz adaptiveDynamic processing
Edge NodesNVIDIA Jetson deploymentDistributed processing
Communication ProtocolMQTT + CoAPEfficient messaging
SecurityTLS 1.3 + device certificatesSecure communications
Table 5. Comparative performance including recent spatio-temporal models (2020–2025).
Table 5. Comparative performance including recent spatio-temporal models (2020–2025).
Performance CategoryMetricThatta RegionBadin Region
SthPCNN-LSTMAttention-LSTMTransformerTFTInformerAutoformerASTGNNSthPCNN-LSTMAttention-LSTMTransformer
(Ours)BaselineBaselineBaseline2021202120222023(Ours)BaselineBaselineBaseline
Accuracy MetricsMSE0.0180.0340.0240.0210.0200.0220.0210.0190.0220.0380.0300.025
MAE0.0740.1230.0890.0840.0790.0810.0800.0780.0810.1280.0980.092
R20.9480.8430.9120.9250.9400.9370.9380.9440.9350.8320.8890.908
PICP0.9520.8820.9240.9310.9400.9380.9410.9450.9430.8700.9130.925
MAPE (%)5.238.976.456.125.605.785.715.495.899.347.126.78
Edge ComputingInference Time (ms)16.832.428.745.638.226.529.133.818.234.130.547.8
Memory Usage (MB)164298221412245210228239172306235425
Energy (mJ/prediction)23.441.231.758.935.629.430.833.225.143.834.261.3
Throughput (pred/s)59.530.934.821.926.137.834.429.654.929.332.820.9
Parameters (M)42.758.348.986.265.152.760.348.542.758.348.986.2
FLOPs (G)13.824.618.235.728.319.522.820.113.824.618.235.7
ScalabilityBatch Processing12864963264969664128649632
Multi-GPU Scaling0.940.780.850.720.800.870.860.880.920.760.830.70
Distributed Nodes846357768463
IoT IntegrationReal-timeBatchNear real-timeBatchNear real-timeReal-timeReal-timeNear real-timeReal-timeBatchNear real-timeBatch
RobustnessCross-Region R20.8910.7560.8340.8120.8400.8520.8480.8590.8790.7420.8210.798
Noise Resilience0.9230.7980.8670.8450.8800.8880.8850.8910.9150.7850.8520.831
Temporal Stability0.9370.8230.8890.9010.9020.9100.9070.9140.9280.8140.8760.892
Table 6. Comprehensive multi-horizon forecasting results with uncertainty quantification (2025–2030).
Table 6. Comprehensive multi-horizon forecasting results with uncertainty quantification (2025–2030).
YearThatta RegionBadin Region
Prediction95% CIRisk LevelInterventionPrediction95% CIRisk LevelIntervention
[Lower, Upper]Priority[Lower, Upper]Priority
20250.73[0.69, 0.77]SevereImmediate0.68[0.64, 0.72]Moderate–HighHigh
20260.76[0.71, 0.81]SevereCritical0.72[0.68, 0.76]SevereImmediate
20270.79[0.74, 0.84]CriticalEmergency0.75[0.71, 0.79]SevereCritical
20280.82[0.77, 0.87]CriticalEmergency0.79[0.75, 0.83]CriticalEmergency
20290.85[0.80, 0.90]ExtremeEmergency0.82[0.78, 0.86]CriticalEmergency
20300.89[0.84, 0.94]ExtremeEmergency0.85[0.81, 0.89]ExtremeEmergency
Note: Risk Classification—0.0–0.3: Low, 0.3–0.5: Moderate, 0.5–0.7: High, 0.7–0.85: Severe, 0.85–1.0: Extreme.
Table 7. Edge computing performance analysis across different deployment configurations.
Table 7. Edge computing performance analysis across different deployment configurations.
DeploymentHardwareInferenceMemoryEnergyThroughputAccuracyCost
ConfigurationPlatformTime (ms)(MB)(mJ)(pred/s)Retention (%)Efficiency
Edge DevicesJetson Nano28.4 ± 2.1298 ± 1545.2 ± 3.835.2 ± 2.894.2 ± 1.5High
Jetson Xavier NX19.7 ± 1.6256 ± 1232.1 ± 2.450.8 ± 4.297.1 ± 1.2Very High
Jetson AGX Orin16.8 ± 1.2164 ± 1023.4 ± 1.959.5 ± 4.299.1 ± 0.8Optimal
Raspberry Pi 467.3 ± 5.9412 ± 2889.7 ± 7.214.9 ± 1.389.3 ± 2.1Moderate
Cloud InstancesAWS t3.medium12.3 ± 0.9145 ± 815.7 ± 1.281.3 ± 6.199.8 ± 0.5High
Google Cloud n111.8 ± 0.8139 ± 714.2 ± 1.084.7 ± 6.899.9 ± 0.4Very High
Azure Standard D213.1 ± 1.1152 ± 917.3 ± 1.476.3 ± 5.799.7 ± 0.6High
5G/6G IntegrationMEC Deployment21.5 ± 1.8187 ± 1228.9 ± 2.346.5 ± 3.998.3 ± 1.1Very High
Network Slicing18.9 ± 1.5171 ± 1125.6 ± 2.052.9 ± 4.498.7 ± 0.9Optimal
Table 8. Advanced hyperparameter sensitivity analysis for edge computing optimization.
Table 8. Advanced hyperparameter sensitivity analysis for edge computing optimization.
ParameterValueThatta PerformanceBadin PerformanceEdge Suitability
MSEMAER2MSEMAER2Score
Regularization ( λ )0.0010.028 ± 0.0030.092 ± 0.0080.915 ± 0.0120.032 ± 0.0040.098 ± 0.0090.902 ± 0.015Moderate
0.010.023 ± 0.0020.089 ± 0.0070.924 ± 0.0100.027 ± 0.0030.093 ± 0.0080.910 ± 0.012Good
0.10.018 ± 0.0020.079 ± 0.0060.940 ± 0.0080.021 ± 0.0020.085 ± 0.0070.929 ± 0.009Optimal
1.00.032 ± 0.0040.095 ± 0.0090.901 ± 0.0150.036 ± 0.0050.102 ± 0.0100.890 ± 0.018Fair
10.00.045 ± 0.0060.118 ± 0.0120.875 ± 0.0200.049 ± 0.0070.125 ± 0.0130.860 ± 0.022Poor
Learning Rate ( α )0.00010.034 ± 0.0040.098 ± 0.0090.908 ± 0.0150.037 ± 0.0050.105 ± 0.0110.895 ± 0.018Slow
0.0010.020 ± 0.0020.082 ± 0.0070.938 ± 0.0090.022 ± 0.0030.088 ± 0.0080.928 ± 0.011Good
0.010.018 ± 0.0020.079 ± 0.0060.940 ± 0.0080.019 ± 0.0020.081 ± 0.0070.934 ± 0.009Optimal
0.10.042 ± 0.0050.110 ± 0.0110.882 ± 0.0180.045 ± 0.0060.115 ± 0.0120.870 ± 0.021Unstable
Batch Size160.025 ± 0.0030.085 ± 0.0080.930 ± 0.0120.028 ± 0.0040.090 ± 0.0090.922 ± 0.014High Variance
320.018 ± 0.0020.079 ± 0.0060.940 ± 0.0080.020 ± 0.0020.081 ± 0.0070.935 ± 0.009Optimal
640.022 ± 0.0020.083 ± 0.0070.935 ± 0.0090.024 ± 0.0030.087 ± 0.0080.928 ± 0.011Good
1280.029 ± 0.0030.090 ± 0.0080.920 ± 0.0130.032 ± 0.0040.095 ± 0.0090.910 ± 0.015Memory Intensive
LSTM Units320.028 ± 0.0030.092 ± 0.0080.918 ± 0.0130.030 ± 0.0040.096 ± 0.0090.912 ± 0.015Limited Capacity
640.020 ± 0.0020.082 ± 0.0070.938 ± 0.0100.022 ± 0.0030.085 ± 0.0080.932 ± 0.012Good
1280.018 ± 0.0020.079 ± 0.0060.940 ± 0.0080.020 ± 0.0020.081 ± 0.0070.940 ± 0.009Optimal
2560.024 ± 0.0030.088 ± 0.0080.930 ± 0.0120.026 ± 0.0030.090 ± 0.0090.925 ± 0.013Computational Overhead
Table 9. Ablation study: component contribution analysis.
Table 9. Ablation study: component contribution analysis.
Model VariantMSEMAER2Performance Impact
STDP (Full Model)0.0180.0740.948Baseline
w/o Multi-head Attention0.0240.0860.932−1.69% R2
w/o Environmental Fusion0.0220.0810.935−1.37% R2
w/o Encoder–Decoder Structure0.0270.0890.925−2.43% R2
w/o Adaptive Optimization0.0210.0780.941−0.74% R2
CNN + LSTM (No Attention)0.0310.0940.915−3.48% R2
Table 10. Strategic risk assessment and intervention prioritization framework.
Table 10. Strategic risk assessment and intervention prioritization framework.
Risk CategoryDesertification Level RangeConfidence ThresholdTime to Critical StateIntervention StrategyResource AllocationExpected Effectiveness
Low Risk0.0–0.3>0.90>10 yearsPreventive monitoring15%95% success
Moderate Risk0.3–0.5>0.857–10 yearsActive conservation25%85% success
High Risk0.5–0.7>0.804–7 yearsIntensive restoration35%70% success
Severe Risk0.7–0.85>0.752–4 yearsEmergency intervention40%55% success
Extreme Risk>0.85>0.70<2 yearsCrisis management60%35% success
Table 11. Comprehensive performance comparison of deep learning architectures for satellite-based environmental forecasting.
Table 11. Comprehensive performance comparison of deep learning architectures for satellite-based environmental forecasting.
Model ArchitectureMSEMAER2MAPE (%)PICPTraining Time (h)Inference (ms)Parameters (M)
Traditional Machine Learning Baselines
Linear Regression0.087 ± 0.0120.201 ± 0.0180.672 ± 0.04518.32 ± 2.140.743 ± 0.0320.050.120.001
Support Vector Regression0.071 ± 0.0090.183 ± 0.0150.731 ± 0.03815.67 ± 1.890.778 ± 0.0282.341.870.003
Random Forest0.063 ± 0.0080.167 ± 0.0120.765 ± 0.03413.94 ± 1.560.812 ± 0.0251.783.210.012
Deep Learning Architectures
CNN-only (ResNet-50)0.034 ± 0.0040.098 ± 0.0080.908 ± 0.0128.45 ± 0.670.882 ± 0.0154.2312.523.5
LSTM-only (Bidirectional)0.027 ± 0.0030.088 ± 0.0060.925 ± 0.0107.12 ± 0.540.913 ± 0.0123.678.912.8
Attention-LSTM0.024 ± 0.0030.084 ± 0.0050.932 ± 0.0096.78 ± 0.480.924 ± 0.0115.1215.318.6
Hybrid Architectures
Basic CNN-LSTM0.022 ± 0.0020.083 ± 0.0050.935 ± 0.0086.34 ± 0.420.931 ± 0.0106.7818.736.3
ConvLSTM0.021 ± 0.0020.081 ± 0.0040.937 ± 0.0076.12 ± 0.390.936 ± 0.0098.4522.128.9
Vision Transformer0.020 ± 0.0020.080 ± 0.0040.938 ± 0.0075.98 ± 0.370.939 ± 0.00812.3445.686.2
Proposed Architecture
STDP (Ours)0.018 ± 0.0010.074 ± 0.0030.948 ± 0.0055.23 ± 0.280.952 ± 0.0067.8916.842.7
Table 12. Statistical significance analysis and performance improvements.
Table 12. Statistical significance analysis and performance improvements.
ComparisonMSE Improvement (%)R2 Improvement (%)p-ValueCohen’s d95% CI (MSE)
STDP vs. CNN-only47.06 ± 3.124.40 ± 0.45<0.0011.24[0.016, 0.020]
STDP vs. LSTM-only33.33 ± 2.872.49 ± 0.32<0.0010.98[0.017, 0.019]
STDP vs. Attention-LSTM25.00 ± 2.341.72 ± 0.28<0.0010.87[0.017, 0.019]
STDP vs. Basic CNN-LSTM18.18 ± 1.981.39 ± 0.24<0.0010.76[0.017, 0.019]
STDP vs. ConvLSTM14.29 ± 1.671.17 ± 0.21<0.0010.72[0.017, 0.019]
STDP vs. Vision Transformer10.00 ± 1.451.07 ± 0.18<0.0010.67[0.017, 0.019]
Table 13. Cross-regional generalization performance.
Table 13. Cross-regional generalization performance.
Test RegionMSEMAER2Performance Retention (%)
Thatta and Badin (Primary)0.0180.0740.948100.0
Rajasthan, India0.0210.0790.94199.3
Sahel Region, Africa0.0230.0820.93698.7
Atacama Desert, Chile0.0250.0850.93198.2
Gobi Desert, Mongolia0.0220.0810.93999.1
Average Generalization0.0220.0800.93999.1
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Panhwar, K.; Soomro, B.N.; Bhatti, S.; Jaskani, F.H. Novel Encoder–Decoder Architecture with Attention Mechanisms for Satellite-Based Environmental Forecasting in Smart City Applications. Future Internet 2025, 17, 380. https://doi.org/10.3390/fi17090380

AMA Style

Panhwar K, Soomro BN, Bhatti S, Jaskani FH. Novel Encoder–Decoder Architecture with Attention Mechanisms for Satellite-Based Environmental Forecasting in Smart City Applications. Future Internet. 2025; 17(9):380. https://doi.org/10.3390/fi17090380

Chicago/Turabian Style

Panhwar, Kalsoom, Bushra Naz Soomro, Sania Bhatti, and Fawwad Hassan Jaskani. 2025. "Novel Encoder–Decoder Architecture with Attention Mechanisms for Satellite-Based Environmental Forecasting in Smart City Applications" Future Internet 17, no. 9: 380. https://doi.org/10.3390/fi17090380

APA Style

Panhwar, K., Soomro, B. N., Bhatti, S., & Jaskani, F. H. (2025). Novel Encoder–Decoder Architecture with Attention Mechanisms for Satellite-Based Environmental Forecasting in Smart City Applications. Future Internet, 17(9), 380. https://doi.org/10.3390/fi17090380

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Article metric data becomes available approximately 24 hours after publication online.
Back to TopTop