Next Article in Journal
Impact of Regional Agricultural Product Branding on Income Inequality: Evidence from Guangdong Province, China
Previous Article in Journal
Construction and Application of Soil–Water Characteristic Curve Model Considering Water Mineralization Degree
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Chilo suppressalis Population Dynamics Forecasting by Exponential Smoothing Decomposition and Multi-Stream Network

by
Chao He
1,2,
Ziang Peng
1,2,
Longhuang Peng
1,2,
Yi Liu
3,
Chengyuan Zhang
4,
Lei Zhu
1,2,
Siqiao Tan
1,2,* and
Ling Zou
1,2,*
1
College of Information and Intelligence, Hunan Agricultural University, Changsha 410125, China
2
National Digital Planting Industry (Tea) Innovation Sub-Center, Changsha 410125, China
3
Xiang Jiang Elite Engineers School, Hunan Agricultural University, Changsha 410125, China
4
College of Computer Science and Electronic Engineering, Hunan University, Changsha 410082, China
*
Authors to whom correspondence should be addressed.
Agriculture 2025, 15(23), 2474; https://doi.org/10.3390/agriculture15232474 (registering DOI)
Submission received: 16 October 2025 / Revised: 23 November 2025 / Accepted: 25 November 2025 / Published: 28 November 2025
(This article belongs to the Section Artificial Intelligence and Digital Agriculture)

Abstract

Rice plays a pivotal role in global food security, particularly for Asian populations. However, its production is significantly threatened by insect pests, with Chilo suppressalis being a major pest in Asian rice-growing regions. Therefore, developing accurate predictive models for C. suppressalis outbreaks is essential. This study presents a novel time series forecasting model (named ESD-TripleStream) for C. suppressalis population dynamics based on a multi-stream structure, which addresses the limitations of existing approaches, which often omit the further decomposability of and the timestamp information in the time series. This model integrates Exponential Smoothing Decomposition (ESD) to separate the trend and seasonal components of time series data, along with a temporal feature stream to form a three-stream network to capture multi-scale periodic patterns and temporal dependencies. For our evaluation, we collected and constructed a novel dataset, referred to as HNRP-6R, which includes rice pest monitoring data from the past two decades (2000–2022) alongside 13 meteorological factors across six key rice producing regions in Hunan Province, southern China. ESD-TripleStream was evaluated across short-term and medium-term C. suppressalis population prediction scales using HNRP-6R, demonstrating state-of-the-art performance. Specifically, in short-term prediction, ESD-TripleStream achieved a 31.8% reduction in Mean Squared Error (MSE) and 26.55% reduction in Mean Absolute Error (MAE) compared to the PatchMLP model, while outperforming the transformer-based TimeXer by 14.43% in MSE and 9.8% in MAE. For medium-term prediction, ESD-TripleStream has both MSE and MAE significantly lower than those of baseline models such as P-sLSTM and xPatch. Furthermore, generalization tests on Nilaparvata lugens (N. lugens) population prediction demonstrated the model’s adaptability to diverse pest dynamics.

1. Introduction

Rice represents one of the most essential staple food crops worldwide. For more than half of the global population, rice remains fundamental to their survival and livelihoods [1,2]. Insect pest infestations, however, significantly impact rice production. These infestations cause annual declines in global rice yields ranging from 20% to 40% [3]. The Rice Stem Borer (RSB), Chilo suppressalis, represents a major lepidopteran pest threatening rice cultivation across Asia [4]. As a stem-boring insect, it damages rice plants throughout their developmental stages, from vegetative growth to reproductive maturation [5]. The C. suppressalis life cycle comprises four sequential developmental stages: egg (3~7 day), larva (15~25 day), pupa (7~10 day), and adult (5~7 day) [6]. During the egg stage, C. suppressalis eggs are primarily on the undersides of leaves, near the leaf sheaths. In the larval stage, the larvae penetrate rice stems and rhizomes to feed on plant sap, making this the most damaging stage for rice. When temperatures reach 11 °C or higher, larvae pupate either within stems or between leaf sheaths and stems. Adult C. suppressalis emerge when temperatures exceed 15 °C; generally speaking, the male-to-female ratio is approximately 1:1, and a female lays an average of 100~300 eggs. They exhibit nocturnal activity, possess a certain flight capacity (which can affect their local abundance), and remain concealed at the base of rice plants during the day [7]. This developmental pattern makes the comprehensive visual monitoring of C. suppressalis’s destructive activities virtually impossible. Although there are some natural enemies, such as Trichogrammatidae and Apanteles ruficrus, which can significantly reduce species abundance, these natural enemies still need to be artificially released. Therefore, developing accurate predictive models for C. suppressalis population dynamics becomes critically important for guiding integrated pest management strategies [8]. These models focus on (1) enabling early intervention through infestation forecasting, and (2) facilitating precise insecticide application to minimize pest resistance development and environmental contamination [9,10,11].
Artificial intelligence technology has been widely applied, and smart agriculture has also achieved rapid development in recent years [12]. This progress has been accompanied by an important trend: the application of time series analysis and machine learning techniques for accurate pest infestation prediction [13,14,15,16]. Traditional time series prediction models have been widely implemented in agricultural research. The Autoregressive Integrated Moving Average (ARIMA) model, for example, has proven effective in rice yield prediction and pest infestation forecasting [17]. However, traditional models demonstrate notable limitations in agricultural pest prediction, particularly regarding medium-term forecasting and real-world scenario adaptation. These limitations stem from inherent constraints: (1) sole dependence on univariate time series data, (2) insufficient handling of nonlinear and non-stationary datasets, and (3) inability to incorporate multi-factorial interactions and biological characteristics. Addressing these challenges, researchers increasingly combine traditional time series prediction models with deep neural networks [18,19]. Some researchers have utilized ARIMA and Artificial Neural Networks (ANNs) to predict Helicoverpa armigera (Hübner), enhancing prediction accuracy [20]. The integration of ARIMA and grey models has been applied to cocoa yield prediction, showing promising results even with limited data [21]. Additionally, researchers have exploited the advantages of two models for the Yellow Stem Borer (YSB)—a major rice pest [22]. Convolutional Neural Networks (CNNs) excel at local feature extraction, while Long Short-Term Memory (LSTM) is well-suited for time series data. They combined these two models to develop prediction and classification frameworks.
While these advancements have improved model predictive capabilities, they still inadequately address long-term dependencies and nonlinear relationships inherent in time series data [23,24]. In practical agricultural scenarios, pest outbreaks demonstrate strong correlation with weather and climatic factors [25,26,27]. Research indicates that stem-boring pests are particularly vulnerable to climate change in China [28,29]. Building on these insights, researchers increasingly incorporate meteorological factors into predictive frameworks, conducting multivariate time series predictions and interpretability analyses [30]. For instance, bidirectional Recurrent Neural Networks (RNNs) and LSTM have been used to forecast cotton pest outbreaks [31]. Researchers incorporated climatic variables into the models, which achieved an AUC of 0.95. This result demonstrates the importance of climatic parameters for pest dynamics. Similarly, the BCNN–LSTM model has been applied to tobacco disease prediction [32]. It incorporates bidirectional convolution to capture interactions between meteorological factors. This design substantially improves the model’s ability to characterize nonlinear and dynamic features.
The previously mentioned models have demonstrated effectiveness in predicting agricultural pests and diseases. However, they process time series as uniform inputs, overlooking a fundamental structural property: time series comprise trend components and seasonal components, with potential interference between them. This limitation becomes particularly evident when analyzing agricultural pest data with pronounced seasonality or complex trends, potentially compromising prediction accuracy. Time series forecasting employs established decomposition frameworks to separate and model these components independently [33,34,35]. For example, the Dlinear model obtains trend components using a moving average kernel. Then, it derives seasonal components by calculating the difference between the original time series and the obtained trend components. Finally, each component is processed through a single linear layer, after which they are weighted [36,37]. However, after decomposing the time series, the DLinear model only processes both of the decomposed components through a single simple linear layer. This approach overlooks the high-frequency periodicity inherent in the seasonal component and the weak linearity existing in the trend component. When faced with the respective characteristics of these two decomposed branches, the model fails to conduct targeted modeling for them. Additionally, traditional time series prediction models tend to neglect the inherent cycles of different features during encoding and overlook the significance of timestamps in time series prediction.
To address the aforementioned issues, we propose a novel deep learning-based multiple-stream framework for C. suppressalis population forecasting. Unlike traditional decomposition methods (e.g., the moving average method) that struggle to handle data with dynamic evolution patterns, Exponential Smoothing Decomposition (ESD) uses exponentially decaying weights to prioritize recent observations, thereby better adapting to the dynamic change characteristics of the data [38]. This enables ESD to adapt more rapidly to trend shifts and eliminates the need for data padding. The proposed framework begins by decomposing the input time series into trend and seasonal components using ESD. Additionally, a temporal feature stream is also incorporated to process the original C. suppressalis series along with highly correlated meteorological factors, thereby effectively capturing multi-scale temporal patterns. Within this stream, distinct path lengths are utilized for different temporal features, such as months and seasons [39,40]. The overall architecture thus consists of three streams: a seasonal stream, a trend stream, and a temporal feature stream. We refer to the proposed framework as ESD-TripleStream; to the best of our knowledge, this constitutes the first application of a triple-stream network to time series forecasting:
  • We propose a novel deep learning-based framework for C. suppressalis population forecasting. The proposed framework employs a triple-stream network to effectively capture distinct multi-scale patterns, thereby improving prediction accuracy.
  • We construct a novel real-world dataset, HNRP-6R. This dataset comprises over two decades (2000–2022) of rice pest monitoring data alongside 13 meteorological factors across six key rice-producing regions in Hunan Province, southern China.
  • We conduct extensive experiments on the newly created HNRP-6R dataset, evaluating our framework against several baselines and state-of-the-art methods. The results demonstrate the significant superiority of our proposed framework, achieving state-of-the-art performance in C. suppressalis population forecasting.

2. Materials and Methods

2.1. Dataset and Preprocessing

2.1.1. Study Area and Data Collection

Study Area. As shown in Figure 1, this research focuses on Hunan Province, China—a major rice-producing region with a subtropical monsoon climate [41], which makes it highly susceptible to seasonal pest outbreaks. Geographically, Hunan spans longitudes 108 47 E to 114 15 E and latitudes 24 38 N to 30 08 N , with an average annual temperature ranging from 16 C to 18 C . The region experiences hot, humid summers and distinct monsoon seasons, which, combined with its high crop and continuous rice planting cycles, create optimal conditions for pest population surges.
HNRP-6R Dataset. For developing a robust dataset to forecast rice pest populations, we gathered longitudinal pest monitoring data from 93 regions across Hunan Province, sourced via the Hunan Crop Monitoring and Early Warning Information System (https://hn.pestiot.com/#/index/province (accessed on 27 May 2025)). Trained plant protection specialists performed daily data collection using light-trapping devices, with pest counts manually logged at 8:00 AM. Spanning 2000 to 2022, the dataset encompasses 10 key rice pest species (Table 1), including Chilo suppressalis, Rice planthoppers, Cnaphalocrocis medinalis, Sesamia inferens, Tryporyza incertulas, Naranga aenescens Moore, Echinocnemus squameus Billberg, Lissorhoptrus oryzophilus, Orseolia oryzae, and Mythimna separata. Note that the quality of the data is limited due to both subjective and objective factors. Subjectively, it stems from reporting errors made by data collectors. Objectively, these factors include equipment malfunctions of light traps, as well as the light traps’ reliance on a single ultraviolet light source, fixed collection time windows, and the impact of adverse weather conditions (such as heavy rain and strong winds). To guarantee reliability, we filtered out regions with over 40% missing data, ultimately selecting six representative areas—Dong’an, Guiyang, Hongjiang, Shuangfeng, Xiangyin, and Yuanjiang—for our final pest time series dataset. The typical population dynamics of C. suppressalis in these regions are illustrated in Figure 2.
Given the significant influence of climatic conditions on rice pest outbreaks, integrating comprehensive meteorological data is vital for enhancing the accuracy of pest population forecasting models. To achieve this, we acquired meteorological records from the China National Meteorological Science Data Center (https://data.cma.cn/), spanning the period from 2000 to 2022. The dataset encompasses 13 key meteorological variables known to influence pest dynamics, including mean daily temperature (°C), mean daily relative humidity (%), mean daily wind speed (m/s), daily sunshine duration (h), daily precipitation (mm), daily evaporation (mm), etc. A detailed list of the selected meteorological factors along with their corresponding units is presented in Table 2.
By integrating pest records with corresponding meteorological data, we develop a novel time series dataset for forecasting rice pest population dynamics, named HNRP-6R (HuNan Rice Pest time series from 6 Regions). In this study, given the severe threat of the C. suppressalis to rice production and the critical need for accurate forecasting, we focus on developing accurate predictive models for C. suppressalis outbreaks based on the C. suppressalis relative data in this dataset. Table 3 provides the C. suppressalis statistical overview of the dataset.

2.1.2. Data Preprocessing

Linear Interpolation. Despite carefully selecting six regions with relatively high-quality data, missing values persist in both pest population and meteorological time series. These gaps arise from inconsistencies in observations, sensor malfunctions, and manual reporting errors. To address data sparsity and maintain temporal continuity, we apply linear interpolation—a widely adopted technique for estimating missing values while preserving the data’s underlying trends. For a missing data point at position x, its interpolated value y is computed using the formula y = y 1 + ( x x 1 ) × y 2 y 1 x 2 x 1 , where x 1 and x 2 are the nearest known data points immediately preceding and following x, and y 1 and y 2 are their respective y-values. This approach assumes a locally linear relationship between adjacent observations, ensuring seamless transitions in the reconstructed time series.
Outlier Handling. During the collection of C. suppressalis population data, factors such as extreme weather and equipment failures introduce outliers that significantly deviate from the true population dynamics. The Savitzky–Golay Filter (SG filter) is used for preprocessing C. suppressalis data because it effectively smooths data and suppresses noise while preserving high-order statistical features of the signal—making it particularly suitable for time series of biological population dynamics with both periodicity and abrupt change characteristics. The SG filter formula is y i = j = m m a j x i + j , where m represents the window half-width, so the window length is set to 2 m + 1 = 5 (meaning 2 data points before and after the target point, plus the target point itself, form a fitting window of 5 points in total); the polynomial order is d = 2 , and the weighting coefficients a j are determined via polynomial fitting within the window to achieve outlier removal and minimize signal distortion.
Data Normalization. Time series data often exhibit non-stationary characteristics due to temporal shifts in distribution, which can degrade model performance. To address this challenge, we utilize Reversible Instance Normalization (RevIN) [42], a methodology engineered to boost model adaptability while retaining the original data scale. The normalization workflow comprises two reversible phases. First, each time series instance is standardized through mean subtraction and division by its standard deviation, a process that alleviates the impact of non-stationary fluctuations. Second, post-model inference, predicted outputs undergo rescaling: they are multiplied by the original standard deviation and then the mean is re-added, ensuring that final predictions remain interpretable within the original feature domain. In contrast to traditional normalization approaches, RevIN dynamically adapts to instance-specific distributions, rendering it highly effective for time series forecasting tasks where data distributions evolve over time.
Covariates Selection. Given that the outbreak of C. suppressalis is significantly influenced by meteorological factors, the accurate prediction of C. suppressalis largely depends on the selected covariates. To address this, the study employs a systematic dual-filtering approach that integrates computational statistical analysis with ecological knowledge validation. The selection process begins with calculating the Spearman correlation coefficient (Figure 3), a non-parametric measure specifically chosen for its ability to capture nonlinear dependencies between the C. suppressalis time series and potential covariates. This statistical screening is subsequently complemented by biological rationalization [43], where temperature parameters (TEMP, Tmax, Tmin) are prioritized based on their direct regulatory effects on C. suppressalis developmental rates and reproductive capacity, while evaporation (EVP) is retained as a humidity proxy influencing larval survival. Through this synergistic combination of data-driven analysis and domain expertise, the study identifies five key covariates: average temperature (TEMP), maximum temperature (Tmax), minimum temperature (Tmin), sunshine duration (SD), and evaporation (EVP).

2.2. Methodology

2.2.1. Problem Formulation

Given an C. suppressalis population time series x 1 : τ = ( x 1 , , x τ ) , c covariate time series Y 1 : τ = ( y 1 : τ ( 1 ) , , y 1 : τ ( c ) ) and w time-related feature encoding values F 1 : τ = ( f 1 : τ ( 1 ) , , f 1 : τ ( w ) ) , our research goal is to construct a time series forecasting framework M capable of precisely forecasting the subsequent s-step dynamics of C. suppressalis populations:
x ^ τ + 1 : τ + s = M x 1 : τ , Y 1 : τ , f 1 : τ .
Here, y 1 : τ ( i ) and f 1 : τ ( i ) denote the i-th associated covariates and time-related feature values, respectively. Before being input into the model, x 1 : τ and Y 1 : τ are concatenated into a matrix X R τ × ( c + 1 ) , where τ is the historical time steps. Time-related features refer to the temporal information recorded alongside C. suppressalis population counts, typically including components such as year, season, month, and day of the week. For example, the timestamp 1 September 2024 can be represented by the categorical features (Autumn, September, Sunday). In practice, these features are numerically encoded as discrete values (e.g., Autumn = 2, September = 9) and then normalized to a common range, such as [−0.5, 0.5]. For instance, the encoding and normalization formula for the “Day Of Month” time-related feature is expressed as ( day 1 ) / 30.0 0.5 . In the following sections, we let F R τ × w denote the matrix of time-related features associated with the historical data. Here, w signifies the number of selected temporal features employed in the analysis.

2.2.2. Overview of ESD-TripleStream

The overall architecture of our ESD-TripleStream model, as illustrated in Figure 4, comprises five core modules: an Exponential Smoothing Decomposition (ESD) module, a triple-stream network (including Seasonal Stream, Trend Stream, and Temporal Feature Stream), and a Predictor. The ESD module separates the input C. suppressalis population time series and its covariates into long-term trend components ( X T ) and high-frequency seasonal components ( X S ) using exponentially decaying weights, which adaptively emphasize recent data to capture dynamic patterns without padding, addressing the interference between trend and seasonal features in raw time series. The triple-stream network processes these decomposed components and temporal features separately: the Seasonal Stream handles the nonlinear, multi-periodic X S with a Multi-Scale Uniform Patch Embedding (MUP) module to capture multi-frequency periodic patterns via patches of varying lengths, integrating Intra-Series MLP (modeling within-C. suppressalis temporal dependencies) and Inter-Variable MLP (mining C. suppressalis-covariate interactions); the Trend Stream processes the weakly linear X T through a Single-Scale Dynamic Patch Embedding (SDP) module that adaptively adjusts patch boundaries based on local trends, using Intra-Patch MLP (short-term trends), Inter-Patch MLP (long-term trends), and Inter-Variable MLP (cross-variable interactions) to model hierarchical dependencies; the Temporal Feature Stream incorporates time-related features (e.g., season, month) via a Multi-Scale Periodicity-Aware Patch Embedding (MPPE) module, which designs exclusive patch lengths for different temporal attributes to capture periodic patterns, with an Intra-Series MLP enhancing temporal feature representation. The Predictor adopts a direct multi-step strategy, integrating outputs from the three streams via weighted summation and using linear projections to generate future C. suppressalis population predictions, optimized end-to-end to capture both short-term fluctuations and long-term trends, collectively addressing nonlinearity, multi-scale periodicity, and temporal dependency in C. suppressalis forecasting.

2.2.3. Trend-Seasonal Decomposition

The C. suppressalis population is influenced by multiple factors such as temperature, precipitation, and crops, and its time series often contains both significant seasonal cycles and long-term trends. The trend component reveals the long-term change direction of C. suppressalis populations, often exhibiting weakly non-linear characteristics, while the seasonal component focuses on capturing periodic biological rhythms and tends to show non-linear features. Due to the non-linear and non-stationary nature of C. suppressalis population data, traditional single models struggle to simultaneously characterize the smooth evolution of trends and the high-frequency characteristics of seasonal fluctuations. Decomposing the C. suppressalis and its covariate sequences enables the model to design targeted learning mechanisms for the characteristics of these two components, thereby more accurately modeling the dynamics of the C. suppressalis population and improving the accuracy of predicting future outbreak periods and population peaks.
Inspired by xPatch [38], we adopt the Exponential Smoothing Decomposition (ESD) module, a time-series processing technique that leverages exponentially decaying weights to separate trend and seasonality components. Unlike traditional moving-average methods [36], ESD assigns higher weights to more recent data points, enabling faster adaptation to trend changes without requiring padding operations.
For the C. suppressalis and its covariate sequence X, the ESD process is formally defined as
s 1 = x 1 , s t = α x t + ( 1 α ) s t 1 , X T = ESD ( X ) , X S = X X T .
where α (0 < α < 1) is the smoothing factor controlling the weight decay. By progressing from t = 1 to t = τ , ESD enhances the control over trend and seasonality components, making it more suitable for processing C. suppressalis data with dynamic and evolving patterns, and ensuring more precise feature extraction. The resulting X T R τ × ( c + 1 ) represents the long-term trend, while X S R τ × ( c + 1 ) represents the high-frequency seasonal fluctuations. Figure 5 shows an example of ESD decomposition.

2.2.4. Seasonal Stream

The decomposed seasonal component X S exhibits nonlinear and multi-periodic characteristics; therefore, our MUP module captures multi-frequency periodic features through patches of different lengths, while Intra-Series MLP models nonlinear dependencies within the C. suppressalis series and Inter-Variable MLP mines cross-variable interactions between C. suppressalis and covariables. The combination of these components enables the precise capture of complex multi-scale temporal relationships and variable associations in seasonal components, enhancing the modeling capability for nonlinear patterns and prediction generalization.
Multi-Scale Uniform Patch Embedding. A single-scale patch fails to effectively identify periodic features at different frequencies, leading to inaccurate or incomplete local feature capture by the model. Inspired by PatchMLP [44], we adopted the MUP (Multi-Scale Uniform Patch Embedding) module. By using patches of different lengths, it can not only capture local high-frequency fluctuations with shorter patches but also explore medium-term seasonal and periodic trends with longer patches. This enables the model to flexibly learn representative features across different time spans, thereby more accurately capturing the complex multi-scale temporal relationships in the seasonal component and enhancing the predictive accuracy and the model’s generalization ability to seasonal patterns.
Specifically, we design a set of different scales P = { p 1 , p 2 , , p K } (such as [ 60 , 45 , 30 , 15 ] ) to perform non-overlapping segmentation on the input sequence X S R ( c + 1 ) × τ . Through the unfold operation, the sequence is divided into N i = ( τ p i ) / p i + 1 patches according to each scale p i P , forming X p i R ( c + 1 ) × N i × p i . Then, a single linear layer Linear i is used to map it to low-dimensional features E i = Linear i ( X p i ) R ( c + 1 ) × N i × d i (where d i = d model / K , K is the number of scales of the patch set P to achieve dimension alignment, and d model is the final embedding dimension). Finally, the outputs of all scales are flattened and concatenated to obtain the embedding representation X S emb R ( c + 1 ) × d model that fuses multi-scale information.
Intra-Series MLP. For the feature matrix X S emb R ( c + 1 ) × d model , we design an Intra-Series MLP module to specifically extract the temporal dependencies of the target C. suppressalis variable. By isolating and focusing on the C. suppressalis variable, this module avoids interference from irrelevant covariates, significantly enhancing the model’s predictive capability for the target variable.
Specifically, we first isolate the C. suppressalis variable from the embedding matrix: X S emb = X S C . suppressalis X Scov , where X S C . suppressalis R 1 × d model , X Scov R c × d model . Subsequently, the Intra-Series MLP performs nonlinear transformations on the C. suppressalis variable, capturing complex temporal patterns through an MLP architecture with residual connections and layer normalization:
X ¯ S C . suppressalis = LN Dropout GELU Linear X S C . suppressalis + X S C . suppressalis ) .
where Linear is a fully connected layer, GELU is an activation function, and LayerNorm maintains feature distribution stability. The processed C. suppressalis feature X ¯ S C . suppressalis is re-concatenated with the unprocessed covariate feature X cov to form an augmented embedding matrix: X ¯ S emb = X ¯ S C . suppressalis X Scov R ( c + 1 ) × d model , where X ¯ S emb denotes the enhanced embedding matrix.
Inter-Variable MLP. The Inter-Variable MLP is designed to model the interdependencies between the C. suppressalis variable and covariates. Specifically, the input matrix X ¯ S emb first undergoes a dimension transposition operation to swap the variable and feature dimensions. The transposed matrix is processed through a fully connected network, then transposed back to the original dimension. Residual connections with the original input are applied to preserve the initial information, and layer normalization is used to stabilize the feature distribution. The overall process can be expressed as
X ^ S emb = Per Dropout GELU Linear Per X ¯ S emb , X S emb = LayerNorm X ^ S emb + X S emb .
The Inter-Variable MLP effectively captures the interactive influences between the C. suppressalis variable and covariates through shared linear transformations in the feature dimension. Residual connections ensure that the original temporal patterns are not disrupted, while layer normalization enhances training stability and model generalization capability.

2.2.5. Trend Stream

The decomposed trend component X T exhibits relatively smooth weakly linear characteristics, with a focus on long-term dependencies and local continuity. Therefore, we employ the Single-Scale Dynamic Patch Embedding (SDP) module, which adaptively and dynamically adjusts patch sizes according to the local features of the trend component. This approach preserves the inherent weak linear structure while precisely capturing the gradual information between adjacent patches. Additionally, Intra-Patch MLP and Inter-Patch MLP are used to model the short-term and long-term trends in C. suppressalis, respectively, while Inter-Variable MLP mines the cross-variable interactions between C. suppressalis trend components and covariates.
Single-Scale Dynamic Patch Embedding. Fixed single-scale patch embedding operations [45] fail to adapt to the scale variations of the trend component across different time periods, prone to causing a loss of patch boundary information. In contrast, Multi-Scale Patch Embedding operations increase computational complexity and may introduce redundancy, as the weakly linear nature of the trend component lacks multi-frequency periodicity. Inspired by HDMixer [46], we propose the SDP (Single-Scale Dynamic Patch Embedding) module, which adaptively adjusts patch lengths according to the local features of the trend component. This approach not only preserves complete trend boundary information but also mitigates semantic incoherence caused by fixed patches through dynamic patch length extension, avoiding fragmentation of key trend features.
Given the C. suppressalis trend component X T R τ × ( c + 1 ) , the Single-Scale Dynamic Patching strategy adaptively segments X T into N patches with flexible lengths, preserving critical temporal patterns while ensuring computational efficiency. For each patch, a fixed number of time points is sampled via bilinear interpolation, followed by learnable linear projection to generate patch-level embeddings X T emb R ( c + 1 ) × N × d model .
Specifically, for the i-th patch, its central position c i is dynamically adjusted by a learnable offset Δ c i , enabling alignment with key temporal features. The left and right boundaries of the patch are adaptively extended as
l i = ( c i + Δ c i ) λ 1 2 + Δ l i , r i = ( c i + Δ c i ) + λ 1 2 + Δ r i ,
where λ denotes the default number of sampled time steps per patch, and Δ l i , Δ r i represent learnable expansion factors. To maintain a fixed number of values per patch, the expanded region is resampled using bilinear interpolation:
x i = j max ( 0 , 1 | i j | ) · x j ,
where x i denotes the sampled value at fractional time step i, and x j is the original value at step j. This ensures smooth transitions and preserves differentiability.
To optimize the parameters Δ c i , Δ l i , Δ r i based on input characteristics, a patch diversity loss L patch is introduced. This loss encourages informative and diverse patch representations by minimizing redundancy. Given X T , let X T emb denote its dynamic patches and X T emb denote fixed-length patches. The loss is defined as
L patch = 1 N k = 1 N log Sim k ( X T emb | γ ) Sim k ( X T emb | γ ) ,
where Sim k ( X T emb | γ ) = 1 N r I ( d ( X T emb ( r ) , X T emb ( k ) ) γ ) measures patch similarity, γ is a threshold, and d ( · , · ) computes the maximum absolute difference between patches. Minimizing this loss ensures that the model learns representations tailored to the C. suppressalis trend component’s structure.
Intra-Patch MLP. Intra-Patch MLP is designed to capture the short-term trends of the C. suppressalis trend component within each patch. For the embedding matrix X T emb R ( c + 1 ) × N × d model , the patch-level embedding of the C. suppressalis variable, X T C . suppressalis R 1 × N × d model , is first isolated from it. Subsequently, we adopt a two-layer fully connected network combined with the GELU activation function, dropout regularization, and residual connections to model the trend features within each patch of X T C . suppressalis :
X ¯ T C . suppressalis = W r a 2 · Dropout GELU W r a 1 · X T C . suppressalis + X T C . suppressalis ,
where W r a 1 , W r a 2 R d model × d model are weight matrices.
Inter-Patch MLP. This module models the long-term trends of the C. suppressalis trend component across different patches. The output from the Intra-Patch MLP, X ¯ T C . suppressalis R 1 × N × d model , is first transposed to yield a new tensor, whose dimension is R 1 × d model × N , facilitating the subsequent modeling of long-term dependencies in the C. suppressalis trend component. Next, the following operations are applied to X ¯ T C . suppressalis :
X ^ T C . suppressalis = Per W e r 2 · GELU W e r 1 · LN Per X ¯ T C . suppressalis + X T C . suppressalis ,
where W e r 1 , W e r 2 R N × N are weight matrices, and LN denotes layer normalization. The two Per denote the permutation operation and the operation of transposing back to the original dimensions, respectively, i.e., X ^ T C . suppressalis R 1 × N × d model . This module, while capturing cross-patch long-term dependencies, complements the intra-patch MLP’s ability to capture short-term trends, thereby achieving the joint modeling of long-and-short-term trends.
Inter-Variable MLP. This module aims to model the interaction between the C. suppressalis trend component and its covariates. First, the output of the Inter-Patch MLP module, X ^ T C . suppressalis , is reintegrated into the original embedding matrix X T emb to obtain a new matrix X ˜ T emb . After performing a transpose operation on its dimensions, we obtain a new tensor, whose dimension is R d model × N × ( c + 1 ) , which facilitates cross-variable interaction modeling across the variable dimension. Subsequently, the following operations are performed:
X ¯ Temb = Per W v 2 · GELU W v 1 · LN Per X ˜ Temb + X ˜ Temb ,
where W v 1 , W v 2 R ( c + 1 ) × ( c + 1 ) are weight matrices for cross-variable interaction, and the two Per denote the permutation operation and the operation of transposing back to the original dimensions, respectively, i.e., X ¯ Temb R ( c + 1 ) × N × d model . Finally, we merge the N and d model dimensions of the X ¯ Temb matrix and remap it back to d model dimensions through a linear layer, resulting in the final output of this module: X Temb R ( c + 1 ) × d model .

2.2.6. Temporal Feature Stream

As an agricultural pest, the population dynamics of the C. suppressalis exhibit strong temporal dependencies. Time-related features F R τ × w serve as critical carriers for characterizing these dependencies. Therefore, we can employ learnable models to capture the relationship between C. suppressalis’s time-related features and observed population densities. For a specific time point i, its time-related features are denoted as F i R w (such as season, month, day of the week, etc., totaling w types of features), and the corresponding C. suppressalis observation is x i . It is assumed that x i follows a conditional distribution p ϕ ( x i | F i ) , i.e., C. suppressalis observations depend on time-related features. For example, the C. suppressalis population count x i on 1 September is correlated with the two time-related features F i of “September” and “1st”. We employ a learnable generative model q θ to approximate this distribution:
X U q θ ( X U | F ) ,
where the input time-related feature matrix is F R τ × w , and, since we only need to predict the C. suppressalis, the output observation matrix is X U R τ × 1 , representing future predictions based on C. suppressalis historical time-related observations. Due to the strong linear characteristics exhibited by time-related features after encoding, q θ consists of the Multi-Scale Periodicity-Aware Patch Embedding (MPPE) and the Intra-Series MLP. These two modules are introduced in detail next.
Multi-Scale Periodicity-Aware Patch Embedding. The core innovation of this module lies in designing exclusive patch lengths for the periodic characteristics of different time-related features, enabling the more precise capture of periodic patterns at different time scales. This design allows the model to better understand and utilize the implicit information in time-related features, providing richer and more accurate feature representations for C. suppressalis prediction. The MPPE module can be represented as
F embed = MPPE ( F ) .
The time feature matrix is F R τ × w , where τ denotes the number of time steps and w represents the count of time features. Additionally, F embed R w × d model stands for a multi-scale periodicity-aware feature-embedding matrix. Specifically, for the F , we transpose it to F R w × τ . This transposition ensures that each row corresponds to an independent time feature, facilitating the separate processing of each time feature. For each time feature f 1 : τ ( i ) R τ ( i = 1 , 2 , , w ), a dedicated patch length P i is designed according to its inherent periodicity. For instance, the patch length P i for “Day Of Month” is set to 30. Then, f 1 : τ ( i ) is divided into n i = ( τ p i ) / p i + 1 non-overlapping patches with a step size of P i . The j-th patch is denoted as f j ( i ) = f ( j 1 ) p i + 1 : j p i ( i ) , j = 1 , 2 , , n i . This ensures that each patch captures a complete periodic unit and avoids information mixing across cycles. For each patch f j ( i ) , it is embedded into the model’s feature space through a two-stage linear transformation. First, the patch of length P i is projected into a subspace via a linear layer, changing its dimension to d model n i :
Emb j ( i ) = Linear 1 ( i ) f j ( i ) , Linear 1 ( i ) R d model n i × P i .
This step captures the temporal dependencies within the patch through dimensional transformation, enhancing the feature representation ability. For example, when P i = 30 (Day Of Month), d model = 512 , and n i = 4 , each patch is expanded from 30 dimensions to 128 dimensions. Subsequently, the embedded patches of the same feature are concatenated and then passed through a second-layer linear layer to restore the full model dimension d model :
Emb ( i ) = Linear 2 ( i ) Concat Emb 1 ( i ) , Emb n i ( i ) ,
Here, Linear 2 ( i ) R d model × d model . This process integrates all patch information of the feature while preserving the temporal order. Finally, the embeddings of all w time features are concatenated along the feature dimension to form a unified representation F embed R w × d model : F embed = Emb ( 1 ) , Emb ( 2 ) , , Emb ( w ) .
Intra-Series MLP. The core objective of this module is to perform non-linear transformations on the embedding vectors of each time feature through a Multi-Layer Perceptron (MLP) to capture the complex temporal dependencies within individual time features, thereby enhancing the semantic representation capability of each time feature. For the input feature matrix F embed R w × d model , the following operations are performed:
F ¯ embed = LN Linear 2 GELU Linear 1 F embed ,
where Linear 1 R d model × d model 2 and Linear 2 R d model 2 × d model are fully connected layers. LN stands for layer normalization. Subsequently, a dimension transposition operation is performed on F ¯ embed to facilitate linear combination in the feature dimension. This process is represented as mapping the features to the target prediction dimension through a linear layer:
F embed = LN Linear 3 Per F ¯ embed ,
where F embed R d model × 1 , Per denotes the dimension permutation operation, Linear 3 R w × 1 , and LN stands for layer normalization.

2.2.7. Predictor

We adopt direct multi-step forecasting, where the model outputs results for multiple future time points at once. This approach avoids the error accumulation issues caused by single-step forecasting or iterative multi-step forecasting [36]. First, we add the output X Semb R ( c + 1 ) × d model of the seasonal Stream module and the output X Temb R ( c + 1 ) × d model of the trend Stream module to obtain X . Then, following common practice, we employ two separate linear predictors that take X and the temporal feature Stream output F as inputs, respectively generating the predicted sequences x ¯ τ + 1 : τ + s and x ˜ τ + 1 : τ + s , and, finally, perform a weighted summation to generate the final prediction result:
x ¯ τ + 1 : τ + s = W 1 X , x ˜ τ + 1 : τ + s = W 2 F ,
x ^ τ + 1 : τ + s = β x ¯ τ + 1 : τ + s + ( 1 β ) x ˜ τ + 1 : τ + s ,
where x ^ τ + 1 : τ + s denotes the predicted value of C. suppressalis, and β ( 0 , 1 ) serves as a tunable parameter that calibrates the contributions from the respective modules.
To quantify the deviation between the predicted values and the ground truth, we adopt the mean squared error (MSE) loss function:
L pred = i = 1 s x τ + i x ^ τ + i 2 2 .
Ultimately, the aggregate loss function L total is formulated as the combination of the prediction loss L pred and the patch reconstruction loss L patch :
L total = L pred + L patch .

3. Results and Discussion

3.1. Experimental Settings

Datasets. We conducted comprehensive experiments on the self-constructed HNRP-6R dataset. For training and evaluation purposes, the dataset was divided into training, validation, and test sets at ratios of 70 % , 10 % , and 20 % respectively. To ensure experiment reproducibility, the data was randomly split using a fixed random seed.
Implementation Details. To ensure equitable comparisons, we uniformly configure hyperparameters across all models. The patch length and stride are both fixed at 30; the embedding dimension is set to D = 512 and FFN dimension is set to 1024. The encoder consists of L = 2 layers, and a dropout rate of 0.1 is applied to prevent overfitting. Each model undergoes training for a maximum of 30 epochs, incorporating early stopping to balance performance and computational efficiency. Specifically, training halts if the validation loss shows no improvement for five consecutive epochs (patience = 5). Moreover, the learning rate is η = 1 × 10 4 and batch size is 128. All experimental setups are strictly consistent to ensure reproducibility, with identical configurations maintained across all model runs.
Experimental Environment. All algorithms are implemented with PyTorch 2.0.1 [47] on a workstation with Intel(R) Core(TM) i7-8750H CPU, 8·GB memory, and 1 NVIDIA GeForce GTX 1050 Ti GPU.
Baseline Methods. We compare our method with eight state-of-the-art baseline methods, including two transformer-based methods—TimeXer [48], iTransformer [49]; one LSTM-based method—P-sLSTM [50]; two linear-based models—Timelinear [39], DLinear [36]; two MLP-based models—PatchMLP [44], HDMixer [46]; and one MLP-based and CNN-based model—xPatch [38]. The brief introduction of them is listed here:
  • TimeXer (NeurIPS’24) innovatively uses a hybrid patch- and variable-level representation. It captures endogenous variables’ temporal dependencies via self-attention and exogenous variables’ global trends through cross-attention.
  • iTransformer (ICLR’24) features series-wise representations. It treats each variable as a “variate token” and captures variate correlations via the attention mechanism.
  • P-sLSTM (AAAI’25) enhances sLSTM with patching and channel independence, addressing traditional LSTM’s short memory limitations to improve medium-term time series forecasting performance.
  • TimeLinear (arXiv’24) introduces the Time Stamp Forecaster (TimeSter) module to encode time-related features (e.g., seasons, hours, weekdays) and integrates them with a linear model for enhanced medium-term forecasting.
  • Dlinear (AAAI’23) splits the time series into a trend series and a remainder series. Subsequently, it utilizes two single-layer linear networks to separately model these two sequences.
  • PatchMLP (AAAI’25) innovatively employs Multi-Scale Patch Embedding to capture multiscale relationships in sequences, decomposes latent vectors into trend and seasonal components, processes them separately, and enhances variable interactions to improve forecasting accuracy.
  • HDMixer (AAAI’24) devises a length-extensible patch module that module serves to enrich the boundary information of patches. Moreover, it makes use of the Hierarchical Dependency Explore module to effectively model hierarchical interactions.
  • xPatch (AAAI’25) employs EMA for seasonal-trend decomposition, separates time-series components via an MLP–CNN dual-flow architecture with patching and channel independence, and introduces arctangent loss and sigmoid learning rate schemes to optimize training and boost performance.
Evaluation Protocols. To quantify prediction errors, we use Mean Squared Error (MSE) and Mean Absolute Error (MAE). MSE penalizes large errors more due to squaring, making it sensitive to outliers, while MAE sums errors linearly and treats all deviations equally. Both metrics are widely used in regression tasks for their interpretability and effectiveness in measuring absolute errors, with formulas provided below:
MSE = 1 n i = 1 n ( y i y ^ i ) 2 ,
MAE = 1 n i = 1 n y i y ^ i
where y ^ i represents the predicted values, y i denotes the ground truth values, and n is the number of samples. A lower MSE or MAE indicates better predictive accuracy. MSE amplifies large errors due to squaring, making it sensitive to outliers, while MAE provides a linear measure of average error magnitude.

3.2. Results and Analysis

3.2.1. Performance Comparison

We compare and evaluate our proposed method against baseline approaches in both short-term and medium-term forecasting tasks, as these two tasks are critical for predicting and managing C. suppressalis outbreaks. Short-term forecasting enables timely pest control measures to reduce potential yield losses, while medium-term forecasting helps growers to understand medium-term trends in pest populations to facilitate proactive strategy development. An ideal forecasting model should perform well in both quick response and medium-term planning, so evaluating performance across these two tasks provides a comprehensive assessment of the proposed method’s overall effectiveness. The comparative results are presented in Table 4 and Table 5, where we report evaluation metrics such as MSE and MAE. The best-performing results are bolded, and the second-best results are underlined to facilitate a clear comparison of different methods’ performances.
Short-Term Forecasting.Table 4 illustrates a detailed comparison of 30-day short-term forecasting performance across six regions. Our model demonstrates top-tier performance, achieving 48 first-place and 10 second-place rankings across 60 evaluated scenarios. The MSE and MAE of our model decreased by 31.8% and 26.55% compared to PatchMLP, and by 31.17% and 25.66% compared to xPatch. The main reasons are as follows: while both PatchMLP and xPatch perform seasonal-trend decomposition, PatchMLP fails to handle seasonal and trend components differently, instead using uniform multi-scale embedding and MLP operations. This approach prevents it from accurately capturing the true patterns in the two data types. xPatch, although distinguishing between trend and seasonal components, does not account for cross-variable correlations. Additionally, both models overlook modeling temporal information in their predictions, which our model explicitly incorporates through customized temporal encoding and integration, leading to significant performance improvements. In addition, compared with the transformer-based model TimeXer, our model achieves a 14.43% decrease in MSE and 9.8% decrease in MAE. This is primarily because TimeXer does not separate trend and seasonal components, causing the two to interfere with each other in unified embedding, which makes it difficult for the model to accurately distinguish between trend directions and fluctuation details. Additionally, due to the lack of any explicit encoding of temporal positional features such as dates and weeks, TimeXer fails to leverage temporal information to enhance prediction performance. Notably, compared with TimeLinear, our model achieves a 20.60% decrease in MSE and 14.85% decrease in MAE. The main reasons lie in the fact that, although TimeLinear encodes time-related features, it fails to design dedicated embedding scales for the inherent periods of different temporal attributes. Instead, it employs generic convolution and linear projection operations, leading to the incomplete fitting of periodic patterns in temporal information and reduced sensitivity to temporal positions. Additionally, TimeLinear uses a simple linear model that cannot accurately capture the complex nonlinear relationships in the data.
Medium-Term Forecasting. Table 5 shows that our model demonstrates superior performance in medium-term forecasting (45-, 60-, 75-, and 90-day horizons), achieving 33 first-place and 14 second-place rankings across 48 experimental settings. Compared with the most powerful transformer-based model TimeXer and the LSTM-based model P-sLSTM, our model reduces MSE by 8.83% and 17.5%, and MAE by 7.68% and 24.59%. In contrast, versus PatchMLP, our model achieves significant performance improvements, with MSE and MAE decreasing by 31.18% and 26.87%, respectively. Notably, as the forecasting length increases, the prediction performance of our model on most datasets improves, primarily attributed to the time-encoding prediction component that provides reliable time-reference information for medium-sequence forecasting.

3.2.2. Qualitative Results

Figure 6 shows the prediction results for six regions, where the model uses 120 days of historical data as input to forecast the next 30 days. We compare our method with three state-of-the-art baseline models: TimeXer, PatchMLP, and TimeLinear. The results indicate that PatchMLP’s predictions exhibit significant fluctuations and instability, primarily because the trend and seasonal components in C. suppressalis data inherently possess distinct characteristics, yet PatchMLP applies identical modeling to both. This prevents the model from capturing true long-term trends and precise seasonal variations, leading to large-scale, irregular fluctuations in predictions. Secondly, although TimeLinear utilizes time-related features for forecasting, it relies on a simple linear model that fails to capture the complex seasonal and trend relationships in C. suppressalis data and does not decompose the data. This results in TimeLinear’s inability to accurately identify seasonal fluctuation patterns or characterize trend changes, leading to poor prediction performance. Additionally, the trend and seasonal components in C. suppressalis data have different temporal scale characteristics. TimeXer divides continuous C. suppressalis sequences into fixed-length patches of a single scale, which may disrupt the natural continuity of trends and seasons at patch boundaries, causing incoherent semantic information and making it difficult for the model to capture complete periodic patterns. Notably, the attention mechanism in TimeXer, which calculates correlations between different sequence positions to capture key information, struggles to adapt to both the long-distance dependencies required for trend modeling and the short-distance periodic repetitions needed for seasonal modeling due to fixed patch lengths. Directly applying attention to patches mixing both features confuses the importance of different characteristics, leaving TimeXer unable to accurately model the medium-term evolution of trends or replicate seasonal cycles, resulting in deviations from real data—especially at critical points of trend shifts or significant seasonal fluctuations.
In contrast, our model demonstrates superior visualization performance by designing differentiated patch embedding and MLP strategies tailored to the distinct properties of C. suppressalis data’s trend, seasonal, and time-related features. This enables precise modeling of linear trends, nonlinear seasonal fluctuations, and temporal periodic patterns in C. suppressalis population dynamics, fundamentally avoiding the flaws of other models in feature separation, scale adaptation, and nonlinear processing. As a result, our model’s prediction curves closely match the complex variations in real data, showcasing excellent visualization effectiveness.

3.2.3. Ablation Analysis

To comprehensively and methodically assess the impact of each proposed component, we carry out ablation studies across six distinct regions within a short-term (30-day) forecasting context. In order to gauge the models’ resilience and adaptability across various temporal scales, we test them under a range of input sequence lengths, spanning from 120 to 600. More precisely, we formulate six different ablation model variants. Through these variants, we aim to systematically dissect and understand the specific role that each individual component plays in the overall model performance. This approach allows us to gain in-depth insights into the contribution of every part of the proposed model structure:
  • w/o TFS: Remove the Temporal Feature module means that the model does not consider time-related information at all during the prediction process.
  • w/o SS: Remove the Seasonal Stream module and replace it with the Trend Stream module.
  • w/o TS: Remove the Trend Stream module and replace it with the Seasonal Stream module.
  • w/o ESD-S: Remove the ESD module, i.e., do not decompose the original sequence into trend and seasonal components, and use the Seasonal Stream module to process the original sequence.
  • w/o ESD-T: Remove the ESD module, i.e., do not decompose the original sequence into trend and seasonal components, and use the Trend Stream module to process the original sequence.
  • w/o TFS-Mod: Replace the modules in the temporal feature stream with two nonlinear hidden layers, a 1D convolutional layer, and a linear projection layer from the TimeLinear module.
Table 6 displays the outcomes of our ablation experiments, which fully confirm the subsequent conclusions.
Removing the temporal feature module (w/o TFS) led to an average increase of 13.07% in MSE and 11.53% in MAE across six regional datasets. This highlights the critical role of time information encoding in capturing periodic patterns of C. suppressalis populations. The absence of this module prevents the model from identifying temporal dimensional features (e.g., day/month positions), causing periodic prediction deviations.
Replacing the Seasonal Stream module (w/o SS) increased MSE/MAE by 4.25%/4.02%, confirming that Multi-Scale Uniform Patch Embedding for seasonal components is essential for extracting nonlinear short-term fluctuations and multi-frequency features. Uniform single-scale processing for trends fails to capture complex seasonal patterns, amplifying prediction errors.
Replacing the Trend Stream module (w/o TS) caused MSE/MAE to rise by 6.86%/4.85%, validating the necessity of dynamic patch splitting embedding for trend components to capture long-term linear dependencies. Multi-scale embedding fragments trends into local fluctuations, misleading the model’s global trend judgment and introducing redundant local analysis errors.
Removing the seasonal decomposition module (w/o ESD-S) and directly applying multi-scale embedding to raw sequences increased MSE/MAE by 8.78%/7.02%. This indicates that separating trend and seasonal components via EMA decomposition is prerequisite for targeted modeling. Mixed processing misinterprets linear trends as nonlinear oscillations, disrupting feature independence between the two components.
Skipping trend decomposition (w/o ESD-T) and using uniform dynamic patch splitting for raw sequences led to MSE/MAE increases of 3.62%/2.79%. The lack of decomposition weakens global trend consistency and local seasonal multi-frequency feature extraction, reducing the model’s ability to distinguish between the two components.
Replacing the temporal feature module structure (w/o TFS-Mod) caused MSE/MAE to rise by 1.93%/2.03%. This demonstrates that designing dedicated embedding scales for temporal attributes outperforms generic convolutional operations in capturing fine-grained temporal position features, avoiding incomplete periodic pattern fitting and reduced temporal sensitivity.

3.2.4. Generalization Analysis

To evaluate the generalization ability of our model, we perform an extra experiment focusing on predicting the population dynamics of N. lugens, a major agricultural pest that poses a significant threat to rice production. N. lugens inflicts crop damage through multiple pathways, such as sap-sucking, viral transmission, and oviposition, which can result in severe plant lodging and even complete crop failure. Although both N. lugens and C. suppressalis cause substantial yield losses, their ecological behaviors and population dynamics exhibit notable differences. N. lugens displays migratory patterns influenced by seasonal climatic conditions, with population surges primarily driven by external environmental factors. In contrast, C. suppressalis follows a relatively stable life cycle that relies more heavily on crop growth stages. Given these fundamental differences, evaluating the reverse forecasting ability of our model serves as a rigorous test of its capacity to generalize across pest species with diverse temporal dynamics.
For a thorough evaluation, this experiment utilizes datasets from four regions—HJ, YJ, DA, and GY—which cover diverse geographical environments. The covariates employed in N. lugens forecasting (TEMP, Tmax, Tmin, SD, and EVP) are set with the same sequence length as those in the C. suppressalis forecasting task, ensuring a consistent experimental framework. Model performance will be measured using MSE and MAE metrics to determine whether it maintains high predictive accuracy despite the different behavioral patterns of N. lugens. We conduct both quantitative and qualitative analyses of the results, providing critical insights into the model’s applicability in real-world scenarios that demand accurate forecasting of multiple pest species with varying ecological.
Quantitative Results.Table 7 displays the 30-day short-term forecasting results of our model when compared against eight advanced baseline models. The input sequence length ranges between 120 and 600 time steps, adhering to the same experimental configurations as previous setups. Experimental outcomes indicate that our model achieves the top performance in 30 out of 40 evaluation cases, while ranking second in the remaining 8 instances. These findings strongly highlight the model’s exceptional forecasting proficiency across diverse input scales. Compared with the state-of-the-art transformer-based model TimeXer, MLP-based model PatchMLP, and LSTM-based model P-sLSTM, the MSE of our model decreased by 2.4%, 42%, and 22.56%, respectively, and the MAE decreased by 4.07%, 33.43%, and 33.39%, respectively. Our model not only maintains high predictive accuracy when applied to pest species with unique ecological behaviors but also outperforms specialized models designed for sequential data. These results underscore its applicability in multi-species forecasting scenarios that demand robust generalization capabilities, demonstrating its ability to excel across diverse pest dynamics and modeling paradigms.
Qualitative Results. To provide an intuitive assessment of the prediction performance, we present visualizations of the N. lugens population dynamics in GY in 2022 and HJ in 2021. Figure 7 shows the prediction curves of our model, TimeXer, PatchMLP, and TimeLinear, as well as the actual data. The left subplot corresponds to GY, and the right subplot represents HJ. The results indicate that, compared with other models, our model more accurately captures the cyclical and seasonal trends of the N. lugens population. These findings further confirm the universality of our model in predicting pest population dynamics under different ecological conditions.

3.2.5. Model Limitations

Although the proposed ESD-TripleStream model has achieved state-of-the-art performance in forecasting the populations of C. suppressalis and N. lugens, it still has inherent limitations regarding climate sensitivity and variability that warrant further discussion.
Limitations in Climate Sensitivity. The model’s reliance on historical meteorological data (e.g., average temperature, precipitation) results in insufficient sensitivity to abnormal events of an extreme climate nature. For instance, abnormal weather conditions such as consecutive days with temperatures exceeding 35 °C and short-duration heavy rainfall directly disrupt the developmental cycle of C. suppressalis (e.g., larval survival rate, pupation success rate). However, such events are not given special weight consideration in the current model framework.
Limitations in Climate Variability. The HNRP-6R dataset is derived from the subtropical monsoon climate region of Hunan Province. Although data from five regions were used in the experiments, the regional scope is too narrow on a global scale. This means that the model is more optimized for the climate dynamics specific to this region. When the model is extended to areas with vastly different climate types (e.g., tropical rainforests, arid regions), the interaction patterns between climate and pests in these areas differ significantly from those in Hunan, which may lead to a decline in the model’s performance.
These limitations indicate that future research needs to improve the model’s adaptability to complex climate conditions. Examples of such improvements include integrating indicators of extreme climate events, incorporating climate change projection data, and designing region-adaptive modules to address climate variability across different geographical environments.

4. Conclusions

This study proposes a novel multi-stream framework for C. suppressalis population dynamics forecasting in agriculture, which addresses the limitations in existing time series prediction approaches, often omitting the further decomposability and the timestamp information of the time series. During our evaluation, the proposed framework outperformed several baselines and reported excellent performance in C. suppressalis population dynamics forecasting. The proposed forecasting model enables farmers to carry out precise control before the pests cause irreversible damage, thereby minimizing yield loss directly. It is also helpful in effectively preventing arbitrary pesticide use, reducing agricultural non-point source pollution at its source, and is a fundamental step towards developing green agriculture. A novel dataset, which includes rice pest monitoring data from the past two decades (2000–2022) alongside 13 meteorological factors across six key rice-producing regions in Hunan Province, southern China, is constructed for our experimental evaluation. This dataset will serve as a valuable asset for advancing research in predicting rice pest outbreaks.
In our future work, we plan to integrate more fine-grained environmental features (e.g., soil conditions, pesticide application records) and extend the proposed framework to model multi-pest interactions. Furthermore, we also consider integrating real-time monitoring data and developing interpretable prediction mechanisms. These enhancements are expected to increase the model’s practicality, thereby contributing to precision agriculture and sustainable crop protection strategies.

Author Contributions

C.H.: Conceptualization, methodology, software, validation, formal analysis, writing—original draft preparation, writing—review and editing; Z.P.: Conceptualization, validation, visualization; L.P.: Resources; Y.L.: Validation, investigation, data curation; C.Z.: Resources; L.Z. (Lei Zhu): Writing—review and editing, supervision, funding acquisition; S.T.: Funding acquisition; L.Z. (Ling Zou): Writing—review and editing, project administration, funding acquisition. All authors have read and agreed to the published version of the manuscript.

Funding

This research was supported in part by the Hunan Provincial Key Research and Development Program Project under Grant 2023NK2011, the Scientific Research Fund of Hunan Provincial Education Department under Grant 23A0168, the National Natural Science Foundation of China under Grant 62202163, Grant 62472161 and Grant 62372150, the Natural Science Foundation of Hunan Province under Grant 2023JJ30169 and 2022JJ30231, the Teaching Reform Research Program of Regular Higher Education Institutions of Hunan Province under Grant HNJG-20230396, and the Yuelu Mountain Laboratory Seed Industry Special Project under Grant YLS-2025-ZY04016.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Data will be made available on request.

Conflicts of Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

References

  1. Gnanamanickam, S.S. Rice and its importance to human life. In Biological Control of Rice Diseases; Springer: Dordrecht, The Netherlands, 2009; pp. 1–11. [Google Scholar]
  2. Liu, Q.; Wang, X.; Tzin, V.; Romeis, J.; Peng, Y.; Li, Y. Combined transcriptome and metabolome analyses to understand the dynamic responses of rice plants to attack by the rice stem borer Chilo suppressalis (Lepidoptera: Crambidae). BMC Plant Biol. 2016, 16, 259. [Google Scholar] [CrossRef]
  3. Douglas, A.E. Strategies for enhanced crop resistance to insect pests. Annu. Rev. Plant Biol. 2018, 69, 637–660. [Google Scholar] [CrossRef]
  4. Cohen, M.B.; Chen, M.; Bentur, J.S.; Heong, K.L.; Ye, G. Bt rice in Asia: Potential benefits, impact, and sustainability. In Integration of Insect-Resistant Genetically Modified Crops Within IPM Programs; Springer: Dordrecht, The Netherlands, 2008; pp. 223–248. [Google Scholar]
  5. Lin, K.; Hou, M.; Han, L.; Liu, Y. Research progress in host selection and underlying mechanisms, and factors affecting population dynamics of Chilo suppressalis. Plant Prot. 2008, 34, 22. [Google Scholar]
  6. Zhong, H.Y.; Zhang, J.F.; Li, F.; Yu, K.L.; Chen, J.M. The morphology and an improvement of breeding efficiency of larval Chilo suppressalis (Lepidoptera: Crambidae). Entomol. News 2025, 132, 253–265. [Google Scholar] [CrossRef]
  7. Xiang, X.; Liu, S.H.; Li, H.J.; Danso Ofori, A.; Yi, X.Q.; Zheng, A.P. Defense strategies of rice in response to the attack of the herbivorous insect, Chilo suppressalis. Int. J. Mol. Sci. 2023, 24, 14361. [Google Scholar] [CrossRef] [PubMed]
  8. Pandit, P.; Krishnamurthy, K.N.; Bakshi, B. Prediction of crop yield and pest-disease infestation. In AI, Edge and IoT-Based Smart Agriculture; Elsevier: Amsterdam, The Netherlands, 2022; pp. 375–393. [Google Scholar]
  9. Kopton, J.; de Bruin, S.; Schulz, D.; Luedeling, E. Combining spatio-temporal pest risk prediction and decision theory to improve pest management in smallholder agriculture. Comput. Electron. Agric. 2025, 236, 110426. [Google Scholar] [CrossRef]
  10. Zheng, Y.Q.; Zhang, X.Y.; Liu, X.; Qin, N.N.; Xu, K.F.; Zeng, R.S.; Liu, J.; Song, Y.Y. Nitrogen supply alters rice defense against the striped stem borer Chilo suppressalis. Front. Plant Sci. 2021, 12, 691292. [Google Scholar] [CrossRef] [PubMed]
  11. Wang, S.; Liu, C.; Qiao, S.T.; Guo, F.R.; Xie, Y.; Sun, H.; Liu, Y.; Zhao, S.Q.; Zhou, L.Q.; He, L.F.; et al. The evolution and mechanisms of multiple-insecticide resistance in rice stem borer, Chilo suppressalis Walker (Lepidoptera: Crambidae). J. Agric. Food Chem. 2024, 72, 26475–26490. [Google Scholar] [CrossRef]
  12. Zhu, L.; Yuan, Z.C.; Yi, Z.Q.; Zhang, C.Y.; Wu, L.; Zhang, Y.; Boussaid, F.; Bennamoun, M.; Zhang, S.C. Information bottleneck-guided KNN contrastive hashing for unsupervised cross-modal retrieval. Knowl.-Based Syst. 2025, 329, 114347. [Google Scholar] [CrossRef]
  13. Jeong, W.J.; Kim, K.H. Determining the minimum data size for the development of artificial neural network-based prediction models for rice pests in Korea. Comput. Electron. Agric. 2024, 220, 108865. [Google Scholar] [CrossRef]
  14. Gao, J.; Gu, C.G.; Yang, H.J.; Weng, T.F. Prediction of spatial distribution of invasive alien pests in two-dimensional systems based on a discrete time model. Ecol. Eng. 2020, 143, 105673. [Google Scholar] [CrossRef]
  15. Ibrahim, E.A.; Salifu, D.; Mwalili, S.; Dubois, T.; Collins, R.; Tonnang, H.E.Z. An expert system for insect pest population dynamics prediction. Comput. Electron. Agric. 2022, 198, 107124. [Google Scholar] [CrossRef]
  16. Johnson, B.J.; Gomez, M.M.; Munch, S.B. Empirical dynamic modeling for prediction and control of pest populations. Ecol. Model. 2025, 504, 111081. [Google Scholar] [CrossRef]
  17. Khushboo, S.S.; Gupta, V.; Pandit, D. Forewarning of stripe rust (Puccinia striiformis) of wheat in Jammu plains. Indian Phytopathol. 2023, 76, 767–776. [Google Scholar] [CrossRef]
  18. Zhu, L.; Wu, R.B.; Liu, D.Y.; Zhang, C.Y.; Wu, L.; Zhang, Y.; Zhang, S.C. Textual semantics enhancement adversarial hashing for cross-modal retrieval. Knowl.-Based Syst. 2025, 317, 113303. [Google Scholar] [CrossRef]
  19. Wang, M.H.; Li, T. Pest and disease prediction and management for sugarcane using a hybrid autoregressive integrated moving average—a long short-term memory model. Agriculture 2025, 15, 500. [Google Scholar] [CrossRef]
  20. Narava, R.; DV, S.R.K.; Jaba, J.; P, A.K.; GV, R.R.; V, S.R.; Mishra, S.P.; Kukanur, V. Development of temporal model for forecasting of Helicoverpa armigera (Noctuidae: Lepidopetra) using Arima and Artificial Neural Networks. J. Insect Sci. 2022, 22, 2. [Google Scholar] [CrossRef] [PubMed]
  21. Quartey-Papafio, T.K.; Javed, S.A.; Liu, S.F. Forecasting cocoa production of six major producers through ARIMA and grey models. Grey Syst. Theory Appl. 2021, 11, 434–462. [Google Scholar] [CrossRef]
  22. Jain, S.; Ramesh, D. AI based hybrid CNN-LSTM model for crop disease prediction: An ML advent for rice crop. In Proceedings of the 2021 12th International Conference on Computing Communication and Networking Technologies (ICCCNT), Kharagpur, India, 6–8 July 2021; IEEE: Piscataway, NJ, USA, 2021; pp. 1–7. [Google Scholar]
  23. Guo, Q.C.; He, Z.F.; Wang, Z.S. Monthly climate prediction using deep convolutional neural network and long short-term memory. Sci. Rep. 2024, 14, 17748. [Google Scholar] [CrossRef]
  24. Guo, Q.C.; He, Z.F.; Wang, Z.S. Prediction of monthly average and extreme atmospheric temperatures in Zhengzhou based on artificial neural network and deep learning models. Front. For. Glob. Change 2023, 6, 1249300. [Google Scholar] [CrossRef]
  25. Kakarla, S.K.; Schall, E.; Dettweiler, A.; Stohl, J.; Glaser, E.; Adam, H.; Teubler, F.; Ingwersen, J.; Sauer, T.; Piepho, H.P.; et al. Dependence of the abundance of reed glass-winged cicadas (Pentastiridius leporinus (Linnaeus, 1761)) on weather and climate in the upper Rhine Valley, southwest Germany. Agriculture 2025, 15, 1323. [Google Scholar] [CrossRef]
  26. Pulighe, G.; Lupia, F.; Manente, V. Climate-driven invasion risks of Japanese beetle (Popillia japonica Newman) in Europe predicted through species distribution modelling. Agriculture 2025, 15, 684. [Google Scholar] [CrossRef]
  27. Neta, A.; Levi, Y.; Morin, E.; Morin, S. Seasonal forecasting of pest population dynamics based on downscaled SEAS5 forecasts. Ecol. Model. 2023, 480, 110326. [Google Scholar] [CrossRef]
  28. He, Z.P.; Wei, X.J.; Li, Y.P.; Deng, X.Q.; Zhuo, Z.H. Dynamics of Aromia bungii (Faldermann, 1835) (Coleoptera, Cerambycidae) distribution in China amidst climate change: Dual insights from MaxEnt and meta-analysis. Agriculture 2025, 15, 1224. [Google Scholar] [CrossRef]
  29. Tian, X.Y.; Gao, Y.; Ali, M.Y.; Li, X.H.; Hu, Y.L.; Li, W.B.; Wang, Z.J.; Shi, S.S.; Zhang, J.P. Impact of temperature on age–stage, two-sex life table analysis of a Chinese population of bean bug, Riptortus pedestris (Hemiptera: Alydidae). Agriculture 2022, 12, 1505. [Google Scholar] [CrossRef]
  30. Wang, J.L.; Zhang, D. Intelligent pest forecasting with meteorological data: An explainable deep learning approach. Expert Syst. Appl. 2024, 252, 124137. [Google Scholar] [CrossRef]
  31. Chen, P.; Xiao, Q.X.; Zhang, J.; Xie, C.J.; Wang, B. Occurrence prediction of cotton pests and diseases by bidirectional long short-term memory networks with climate and atmosphere circulation. Comput. Electron. Agric. 2020, 176, 105612. [Google Scholar] [CrossRef]
  32. Chen, Y.; Li, C.C.; Wang, C.; Xiao, Y.S.; Liu, T.B.; Li, J.Y.; Teng, K.; Cai, H.L.; Xiao, Z.P.; Zhou, H.; et al. The application of integrated deep learning models with the assistance of meteorological factors in forecasting major tobacco diseases. Comput. Electron. Agric. 2025, 236, 110429. [Google Scholar] [CrossRef]
  33. Wu, H.X.; Xu, J.H.; Wang, J.M.; Long, M.S. Autoformer: Decomposition transformers with auto-correlation for long-term series forecasting. Adv. Neural Inf. Process. Syst. 2021, 34, 22419–22430. [Google Scholar]
  34. Zhou, T.; Ma, Z.Q.; Wen, Q.S.; Wang, X.; Sun, L.; Jin, R. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. In Proceedings of the International Conference on Machine Learning, PMLR, Baltimore, MD, USA, 17–23 July 2022; pp. 27268–27286. [Google Scholar]
  35. Zhang, X.Y.; Jin, X.Y.; Gopalswamy, K.; Gupta, G.; Park, Y.; Shi, X.J.; Wang, H.; Maddix, D.C.; Wang, Y.Y. First de-trend then attend: Rethinking attention for time-series forecasting. arXiv 2022, arXiv:2212.08151. [Google Scholar]
  36. Zeng, A.L.; Chen, M.X.; Zhang, L.; Xu, Q. Are transformers effective for time series forecasting? In Proceedings of the AAAI Conference on Artificial Intelligence, Washington, DC, USA, 7–14 February 2023; Volume 37, pp. 11121–11128. [Google Scholar]
  37. Zhu, L.; Wu, R.B.; Zhu, X.H.; Zhang, C.Y.; Wu, L.; Zhang, S.C.; Li, X.L. Bi-direction label-guided semantic enhancement for cross-modal hashing. IEEE Trans. Circuits Syst. Video Technol. 2024, 35, 3983–3999. [Google Scholar] [CrossRef]
  38. Stitsyuk, A.; Choi, J.S. xPatch: Dual-stream time series forecasting with exponential seasonal-trend decomposition. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 27 February–2 March 2025; Volume 39, pp. 20601–20609. [Google Scholar]
  39. Zeng, C.L.; Tian, Y.; Zheng, G.J.; Gao, Y.J. How much can time-related features enhance time series forecasting? arXiv 2024, arXiv:2412.01557. [Google Scholar] [CrossRef]
  40. Zhu, L.; Yu, W.R.; Zhu, X.H.; Zhang, C.Y.; Li, Y.D.; Zhang, S.C. MvHAAN: Multi-view hierarchical attention adversarial network for person re-identification. World Wide Web 2024, 27, 59. [Google Scholar] [CrossRef]
  41. Yu, W.J.; Wang, Q.N.; Zhu, J.; Cao, J.Z.; Liu, T.; Yi, S.X.; Sun, Q.F.; Cui, J.H.; Li, J.W.; Song, Y.L.; et al. Quantifying climate change impacts on rice chalkiness in southern China: Future trends and spatiotemporal patterns. Comput. Electron. Agric. 2025, 231, 109988. [Google Scholar] [CrossRef]
  42. Kim, T.S.; Kim, J.H.; Tae, Y.W.; Park, C.B.; Choi, J.H.; Choo, J.G. Reversible instance normalization for accurate time-series forecasting against distribution shift. In Proceedings of the International Conference on Learning Representations, Virtual, 3–7 May 2021. [Google Scholar]
  43. Mo, W.J.; Li, Q.; Lu, Z.X.; Ullah, F.; Guo, J.W.; Xu, H.X.; Lu, Y.H. Dynamic monitoring of Chilo suppressalis resistance to insecticides and the potential influencing factors. Plants 2025, 14, 724. [Google Scholar] [CrossRef]
  44. Tang, P.W.; Zhang, W.T. Unlocking the power of patch: Patch-based MLP for long-term time series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 27 February–2 March 2025; Volume 39, pp. 12640–12648. [Google Scholar]
  45. Nie, Y.Q.; Nguyen, N.H.; Sinthong, P.; Kalagnanam, J. A time series is worth 64 words: Long-term forecasting with transformers. arXiv 2022, arXiv:2211.14730. [Google Scholar]
  46. Huang, Q.H.; Shen, L.; Zhang, R.X.; Cheng, J.H.; Ding, S.H.; Zhou, Z.Y.; Wang, Y. Hdmixer: Hierarchical dependency with extendable patch for multivariate time series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Vancouver, BC, Canada, 26–27 February 2024; Volume 38, pp. 12608–12616. [Google Scholar]
  47. Paszke, A.; Gross, S.; Massa, F.; Lerer, A.; Bradbury, J.; Chanan, G.; Killeen, T.; Lin, Z.M.; Gimelshein, N.; Antiga, L.; et al. Pytorch: An imperative style, high-performance deep learning library. Adv. Neural Inf. Process. Syst. 2019, 32, 8026–8037. [Google Scholar]
  48. Wang, Y.X.; Wu, H.X.; Dong, J.X.; Qin, G.; Zhang, H.R.; Liu, Y.; Qiu, Y.Z.; Wang, J.M.; Long, M.S. Timexer: Empowering transformers for time series forecasting with exogenous variables. Adv. Neural Inf. Process. Syst. 2024, 37, 469–498. [Google Scholar]
  49. Liu, Y.; Hu, T.G.; Zhang, H.R.; Wu, H.X.; Wang, S.Y.; Ma, L.T.; Long, M.S. itransformer: Inverted transformers are effective for time series forecasting. arXiv 2023, arXiv:2310.06625. [Google Scholar]
  50. Kong, Y.X.; Wang, Z.P.; Nie, Y.Q.; Zhou, T.; Zohren, S.; Liang, Y.X.; Sun, P.; Wen, Q.S. Unlocking the power of lstm for long term time series forecasting. In Proceedings of the AAAI Conference on Artificial Intelligence, Philadelphia, PA, USA, 27 February–2 March 2025; Volume 39, pp. 11968–11976. [Google Scholar]
Figure 1. Locations of study areas: Hunan Province and the six selected regions.
Figure 1. Locations of study areas: Hunan Province and the six selected regions.
Agriculture 15 02474 g001
Figure 2. Population dynamics of C. suppressalis across five research areas in the HNRP-6R dataset, visualized over a two-year period to improve trend analysis and readability. The x-axis represents time, and the y-axis denotes the population size of C. suppressalis (the same below).
Figure 2. Population dynamics of C. suppressalis across five research areas in the HNRP-6R dataset, visualized over a two-year period to improve trend analysis and readability. The x-axis represents time, and the y-axis denotes the population size of C. suppressalis (the same below).
Agriculture 15 02474 g002
Figure 3. Spearman correlation heatmap between C. suppressalis population dynamics and meteorological factors. Correlation coefficients range from −1 to 1, with higher absolute values indicating stronger associations.
Figure 3. Spearman correlation heatmap between C. suppressalis population dynamics and meteorological factors. Correlation coefficients range from −1 to 1, with higher absolute values indicating stronger associations.
Agriculture 15 02474 g003
Figure 4. Framework of the ESD-TripleStream model: It integrates the Exponential Smoothing Decomposition (ESD) module (separating trend and seasonal components), a triple-stream network (Seasonal Stream, Trend Stream, and Temporal Feature Stream) and a Predictor module (integrating streams to forecast C. suppressalis population).
Figure 4. Framework of the ESD-TripleStream model: It integrates the Exponential Smoothing Decomposition (ESD) module (separating trend and seasonal components), a triple-stream network (Seasonal Stream, Trend Stream, and Temporal Feature Stream) and a Predictor module (integrating streams to forecast C. suppressalis population).
Agriculture 15 02474 g004
Figure 5. Example of ESD decomposition with α = 0.3, 0.5, 0.7 on a 120-length sample from the GY dataset.
Figure 5. Example of ESD decomposition with α = 0.3, 0.5, 0.7 on a 120-length sample from the GY dataset.
Agriculture 15 02474 g005
Figure 6. Visualization results for six research regions. Specifically, Panel (a) shows the experimental results of the HJ region; Panel (b) shows those of the GY region; Panel (c) shows those of the DA region; Panel (d) shows those of the YJ region; Panel (e) shows those of the XY region; and Panel (f) shows those of the SF region. The input length is set to τ = 120 , and the prediction horizon is s = 30 . The vertical axis represents the observed C. suppressalis population.
Figure 6. Visualization results for six research regions. Specifically, Panel (a) shows the experimental results of the HJ region; Panel (b) shows those of the GY region; Panel (c) shows those of the DA region; Panel (d) shows those of the YJ region; Panel (e) shows those of the XY region; and Panel (f) shows those of the SF region. The input length is set to τ = 120 , and the prediction horizon is s = 30 . The vertical axis represents the observed C. suppressalis population.
Agriculture 15 02474 g006
Figure 7. Visualization of prediction results for GY (a) and HJ (b). GY predicts the next month based on data from mid-April to mid-August 2022, while HJ predicts the next month based on data from mid-April to mid-August 2021. The vertical axis represents the observed N. lugens population.
Figure 7. Visualization of prediction results for GY (a) and HJ (b). GY predicts the next month based on data from mid-April to mid-August 2022, while HJ predicts the next month based on data from mid-April to mid-August 2021. The vertical axis represents the observed N. lugens population.
Agriculture 15 02474 g007
Table 1. Scientific names of ten types of rice pests included in the HNRP-6R dataset and their total statistical quantities.
Table 1. Scientific names of ten types of rice pests included in the HNRP-6R dataset and their total statistical quantities.
Name of PestAbbreviationNumber of Records
Chilo suppressalisC. suppressalis3,443,619
Nilaparvata lugensN. lugens3,825,898
Cnaphalocrocis medinalisC. medinalis329,841
Sesamia inferensS. inferens90,015
Tryporyza incertulasT. incertulas22,747
Naranga aenescens MooreN. aenescens408,126
Echinocnemus squameus BillbergE. squameus15,242
Lissorhoptrus oryzophilusL. oryzophilus9415
Orseoia oryzaeO. oryzophilus10,834
Mythimna seperataM. seperata26,630
Table 2. The 13 types of meteorological factors included in the HNRP-6R dataset and their corresponding units.
Table 2. The 13 types of meteorological factors included in the HNRP-6R dataset and their corresponding units.
Meteorological FactorAbbreviationUnit
Mean daily temperatureTEMP°C
Daily maximum temperatureTmax°C
Daily minimum temperatureTmin°C
Mean daily relative humidityRH%
Daily minimum relative humidityRHmin%
Mean daily wind speedWSm/s
Daily maximum wind speedWSmaxm/s
Daily extreme wind speedWSexm/s
Daily maximum wind directionWDmaxDeg
Daily extreme wind directionWDexDeg
Daily sunshine durationSDh
Daily precipitationPRCPmm
Daily evaporationEVPmm
Table 3. Statistics on the population size of C. suppressalis in six research areas within the HNRP-6R dataset. These include the time span of data collection and the total population size of C. suppressalis for each research area.
Table 3. Statistics on the population size of C. suppressalis in six research areas within the HNRP-6R dataset. These include the time span of data collection and the total population size of C. suppressalis for each research area.
LocationAbbreviationTime SpanNumber of C. suppressalis
HongjiangHJ2007–202263,862
Dong’anDA2006–202256,013
YuanjiangYJ2000–202247,341
XiangyinXY2000–201945,082
ShuangfengSF2010–202141,761
GuiyangGY2000–202214,720
Table 4. Short-term forecasting results. The input sequence length configured as τ { 120 , 240 , 360 , 480 , 600 } and the forecasting horizon s = 30 . The best-performing results are highlighted in bold, and the second-best results are underlined (the same below).
Table 4. Short-term forecasting results. The input sequence length configured as τ { 120 , 240 , 360 , 480 , 600 } and the forecasting horizon s = 30 . The best-performing results are highlighted in bold, and the second-best results are underlined (the same below).
ModelOursTimeXeriTransformerPatchMLPHDMixerxPatchP-sLSTMTimeLinearDlinear
MetricMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAE
HJ 1200.2120.2280.2210.2480.2160.2440.2580.2310.2180.2490.3670.3220.3240.4060.2160.2320.2880.410
HJ 2400.2230.2500.2620.2920.2160.3090.2840.33310.2550.2960.3930.3550.5110.520.2650.2960.2830.386
HJ 3600.2300.2790.2870.2600.2990.3410.3640.3990.2680.3250.4710.4030.4890.5060.2740.3150.2910.395
HJ 4800.2300.2780.2870.3270.3210.3680.3520.4010.2830.3440.4920.4430.5590.5520.2880.3370.3150.421
HJ 6000.2160.2670.2680.3190.3000.3540.3320.3860.2990.3750.4790.4230.4530.4910.2890.3260.2990.406
GY 1200.2100.3120.2180.3190.2220.3260.4440.3990.2470.3250.5780.4640.3460.3930.2550.3250.2880.359
GY 2400.3040.3790.3280.3920.4550.4260.9030.5620.4330.4250.9910.6270.3800.4180.4990.4510.4790.445
GY 3600.3580.4230.3380.4180.4970.4830.8620.6060.4220.4450.9780.6120.3650.4250.3700.4140.3920.427
GY 4800.3370.4210.4400.4480.6320.5300.9440.6540.6000.5380.9670.5980.3940.4270.8220.6060.4470.461
GY 6000.3240.4280.4060.4420.6490.5581.0330.7260.6890.5960.9970.5640.3130.4400.5620.5320.4880.505
DN 1201.4920.6741.7370.7521.9590.8111.9700.8471.6410.7131.7810.8561.4840.6071.6210.6931.5860.680
DN 2401.2930.6311.6630.7681.7660.7951.8560.8721.4100.6771.6180.7811.3700.6411.6970.7941.4220.658
DN 3601.3470.6601.4810.6881.7490.8311.7750.8701.3900.6851.6980.8381.3120.6361.6540.7931.4180.684
DN 4801.3380.6561.8050.9091.8860.9052.0520.9641.6320.7611.3610.7151.3690.6712.1001.0151.5660.775
DN 6001.3830.7212.0050.9802.0660.9862.1681.0911.6850.8071.6320.7971.3930.7512.3901.1011.7320.850
YJ 1200.9180.4411.0140.4801.2340.5821.2930.6050.9320.4491.1620.5751.1400.5440.9290.4460.9380.457
YJ 2400.8450.4320.9740.4671.0930.5371.1540.5680.8350.4111.1260.5441.0710.5330.8970.4500.9690.478
YJ 3600.9170.4370.9650.4671.0810.5151.0560.5240.9320.4441.1960.5861.0470.5420.9210.4340.9510.482
YJ 4800.9720.4441.0440.4691.0260.4881.1040.5360.9760.4541.1990.5891.0310.5431.0120.4610.9790.452
YJ 6000.9160.4460.9910.4751.2120.7111.0390.5120.9630.4720.9900.4761.0110.5541.0230.4791.0910.534
XY 1200.5920.4160.5950.4230.6090.4300.7650.5330.6030.4220.7540.5640.6470.5120.5990.4280.6040.479
XY 2400.5810.4370.6130.4610.6780.4980.7280.5420.6190.4480.7760.5820.7960.6110.6120.4630.5930.485
XY 3600.5780.4680.6670.4870.7560.5450.8420.6270.7410.5260.8950.6540.8280.6230.7130.5420.7140.563
XY 4800.5930.4800.6990.5070.7650.5520.8620.6450.8070.5580.8840.6590.7990.6000.7660.5740.8020.610
XY 6000.6350.5300.9990.6291.2110.7091.1490.7781.4430.7251.1960.8310.8430.6161.2290.6971.1540.733
SF 1200.5060.3410.5140.3470.4450.3190.5230.3690.4350.3040.6170.4190.5490.4630.5010.3330.4600.354
SF 2400.4990.3520.5020.3540.5040.3750.5670.4180.5070.3860.6540.4630.5840.5060.5060.3740.5110.374
SF 3600.5090.3470.5190.3670.5570.4100.6280.4680.5330.3980.6630.4690.6400.5400.5320.4000.5270.401
SF 4800.5090.3560.5150.3720.5710.4370.6810.5070.5410.4030.7450.5580.6820.5500.5410.4070.5220.420
SF 6000.4960.3510.5060.4180.5560.4610.6950.5390.5240.3760.7620.5650.7600.5880.5470.4240.6200.453
Table 5. Medium-term forecasting results. The input length τ of all baselines is set to 240, and the forecasting horizons s { 45 , 60 , 75 , 90 } are set respectively.
Table 5. Medium-term forecasting results. The input length τ of all baselines is set to 240, and the forecasting horizons s { 45 , 60 , 75 , 90 } are set respectively.
ModelOursTimeXeriTransformerPatchMLPHDMixerxPatchP-sLSTMTimeLinearDlinear
MetricMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAE
HJ 450.2250.2570.2640.2960.2720.3020.3070.3550.2730.3110.3210.3740.5690.5620.2510.2850.2830.394
HJ 600.2190.2530.2710.2920.2750.3050.3120.3590.2570.2930.3360.3970.5510.5540.2550.2870.2830.395
HJ 750.2250.2530.2560.2830.2780.3090.3140.3600.2630.2990.3310.3860.5180.5230.2650.2960.2870.399
HJ 900.2210.2510.2700.2980.2880.3170.3160.3610.2930.3240.3490.4120.5250.5340.2700.3050.2870.398
GY 450.3290.3890.3700.4150.4400.4260.8750.5600.4300.4250.9120.5870.3460.3980.4170.4140.4170.430
GY 600.3000.3800.3440.4020.4460.4371.0020.5960.5310.4651.1330.6480.3120.4040.3610.3940.4530.447
GY 750.2880.3810.3530.4070.5300.4671.0030.5980.9700.8341.1240.6540.2730.3990.3670.3950.4370.438
GY 900.2940.3790.3530.3960.4200.4241.0530.6090.9610.831.1970.6840.3080.4060.3790.3980.4310.437
DN 451.3570.6181.6780.7451.7090.7571.8540.8531.3670.6321.4870.7531.4010.6611.9791.8501.4640.678
DN 601.3280.5831.5130.6771.6420.7271.8260.8391.3800.6371.4570.6911.3380.6011.8190.7841.5060.691
DN 751.3490.5861.4760.6681.5500.6901.8230.8351.3590.6101.4320.6661.3400.5891.6890.7471.5550.692
DN 901.3360.5621.3740.5741.5170.6631.8130.8261.3400.6131.3780.6671.4000.6021.5520.6831.4390.639
YJ 450.9160.4491.0040.4701.0770.5271.1690.5670.8760.4341.0720.8981.0270.5320.8710.4281.0060.491
YJ 600.9360.4500.9970.4661.0650.5151.1570.5740.9290.4451.0500.8881.0200.5290.9460.4661.0110.491
YJ 750.9520.4500.9920.4811.0740.5131.1590.5631.0280.4781.0570.8871.0250.5260.9780.4671.0150.489
YJ 900.9600.4491.0010.4711.0750.5081.1490.5541.0360.4831.0740.8941.0440.5340.9740.4641.0140.487
XY 450.5480.4180.6310.4580.6550.4910.7200.5420.6450.4850.7730.5750.7250.5820.5590.4620.5810.489
XY 600.5490.4240.5950.4520.6400.4860.7090.5390.6140.4730.7640.5610.7470.5870.6020.4670.5910.499
XY 750.5340.4200.5480.4310.6280.4800.7050.5380.5950.4660.7940.6340.9030.6660.6020.4680.5860.497
XY 900.5400.4210.5750.4470.6220.4750.7010.5350.6080.4690.7610.5580.7110.5740.6050.4690.5710.491
SF 450.4980.3380.4900.3400.4990.3570.5980.4300.4710.3450.5730.4380.6550.5490.4980.3490.5060.377
SF 600.5020.3350.5070.3310.4940.3470.5780.4240.5120.3510.5510.4610.5940.4950.5090.3520.5120.353
SF 750.4390.3190.4570.3380.4450.3310.5220.4120.4320.3110.5130.5420.5940.5270.4500.3390.4450.357
SF 900.3960.3050.3990.3370.3980.3200.4820.4030.4020.3290.5420.5540.5500.5090.4160.3350.4050.357
Table 6. The forecasting results of ablation studies. The input sequence length is configured as τ { 120 , 240 , 360 , 480 , 600 } and the forecasting horizon s = 30 .
Table 6. The forecasting results of ablation studies. The input sequence length is configured as τ { 120 , 240 , 360 , 480 , 600 } and the forecasting horizon s = 30 .
ModelFull Versionw/o TSFw/o SSw/o TSw/o ESD-Sw/o ESD-Tw/o TFS-Mod
MetricMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAE
HJ 1200.2120.2280.2150.2430.2150.2310.2230.2530.2200.2470.2150.2340.2160.237
HJ 2400.2230.2500.2540.2940.2360.2680.2430.2740.2610.2920.2270.2570.2290.255
HJ 3600.2300.2790.2730.3220.2520.2940.2560.2950.2690.3110.2330.2830.2380.286
HJ 4800.2300.2780.2590.3150.2440.2930.2570.3020.2770.3200.2310.2780.2290.278
HJ 6000.2160.2670.2530.3200.2210.2780.2500.3000.2620.3190.2410.2950.2240.274
GY 1200.2100.3120.2720.3320.2540.3220.2410.3220.2550.3290.2520.3230.2190.319
GY 2400.3040.3790.5080.4520.3400.3890.3540.3950.3900.4100.3610.4000.3090.386
GY 3600.3580.4230.6710.5410.3720.4310.3580.4200.4150.4520.4210.4460.3610.419
GY 4800.3370.4210.5690.5310.4790.5050.3870.4380.4800.4970.3620.4280.3300.420
GY 6000.3240.4280.6050.5660.4280.4720.4100.4710.4480.4930.5640.5430.3380.432
DN 1201.4920.6741.6710.7151.6370.6821.5810.7001.6540.7101.5020.6781.5010.671
DN 2401.2930.6311.3810.6651.3260.6491.3360.6411.4070.6621.2990.6281.2960.634
DN 3601.3470.6601.4990.7441.3840.6861.4020.6731.3770.6681.3380.6551.4240.679
DN 4801.3380.6561.5550.7951.3920.6901.4250.7171.4680.7091.3520.6671.3560.682
DN 6001.3830.7211.8570.9381.4400.7951.5620.7901.6000.7721.3640.7481.4490.789
YJ 1200.9180.4410.9420.4580.9390.4350.9420.4460.9480.4550.9640.4480.9240.448
YJ 2400.8450.4320.8810.4590.8470.4280.9420.4570.8930.4430.9020.4340.8960.442
YJ 3600.9170.4370.9450.4640.9270.4390.9240.4380.9290.4460.9290.4390.9410.438
YJ 4800.9720.4440.9810.4890.9530.4421.0240.4610.9500.4400.9560.4330.9480.444
YJ 6000.9160.4460.9360.4590.9210.4491.0500.4660.9520.4520.9670.4620.9590.458
XY 1200.5920.4160.6180.4370.5640.4200.6060.4200.6100.4250.5940.4140.5970.412
XY 2400.5810.4370.6400.4830.5910.4570.5960.4500.6190.4680.5770.4450.5800.439
XY 3600.5780.4680.6880.5440.6410.4910.6200.4890.6360.5000.6360.4950.5820.473
XY 4800.5930.4800.6850.5430.5920.4770.6340.5060.6600.5250.5960.4830.5970.483
XY 6000.6350.5300.7500.5850.6940.5450.7300.5730.7700.5840.6690.5420.6480.543
SF 1200.5060.3410.4970.3420.4890.3290.5200.3380.5090.3430.4990.3350.5210.375
SF 2400.4990.3520.5070.3690.4880.3570.5150.3630.5370.3770.4920.3540.5030.356
SF 3600.5090.3470.5220.3780.5040.3640.5490.3750.5510.3910.5060.3590.4970.456
SF 4800.5090.3560.5240.3760.5110.3800.5410.3680.5520.3920.5170.3640.5150.359
SF 6000.6350.5300.7500.5850.6940.5450.7300.5730.7700.5840.6690.5420.6480.543
Table 7. Prediction outcomes for the N. lugens population are presented, with the input sequence length configured as τ { 120 , 240 , 360 , 480 , 600 } and the forecasting horizon set to s = 30 .
Table 7. Prediction outcomes for the N. lugens population are presented, with the input sequence length configured as τ { 120 , 240 , 360 , 480 , 600 } and the forecasting horizon set to s = 30 .
ModelOursTimeXeriTransformerPatchMLPHDMixerxPatchP-sLSTMTimeLinearDlinear
MetricMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAEMSEMAE
HJ 1200.3690.1480.3920.1590.3750.1710.4670.2300.3710.1520.5520.3210.3730.2270.3670.1490.4790.322
HJ 2400.3760.1770.3930.1890.4080.2050.5490.3000.4050.1960.5690.3340.4200.2740.3990.1920.4750.289
HJ 3600.3790.1840.3970.2210.4260.2340.5600.3340.4560.2590.5970.3870.4450.3130.4490.2530.4690.297
HJ 4800.3840.2010.4050.2290.4470.2700.6100.3910.4400.2580.5480.3150.4880.3620.4370.2610.5270.355
HJ 6000.3910.2420.4340.2740.4730.3060.6170.4230.4260.2580.5790.3530.5080.3800.4560.2860.5200.357
GY 1200.0140.0580.0150.0590.0150.0570.0190.0750.0160.0590.0330.1440.0330.1480.0160.0670.0490.194
GY 2400.0140.0580.0150.0590.0160.0600.0180.0720.0160.0600.0310.1430.0430.1530.0160.0600.0290.141
GY 3600.0140.0570.0150.0580.0160.0600.0190.0780.0160.0590.0370.1680.0630.1910.0160.0590.0360.164
GY 4800.0130.0550.0140.0580.0150.0590.0180.0770.0160.0640.0380.1710.0600.1880.0150.0570.0390.174
GY 6000.0130.0550.0140.0560.0150.0580.0200.0820.0150.0600.0470.1920.0590.1940.0140.0570.0450.193
DN 1200.0340.0780.0330.0820.0340.0810.0430.1090.0320.0830.3210.5230.1010.2740.0340.0790.2510.477
DN 2400.0300.0680.0320.0750.0320.0830.0410.1100.0310.0800.2910.4960.0690.2020.0350.0810.2230.448
DN 3600.0300.0690.0320.0710.0320.0830.0390.1060.0310.0760.2780.4750.0670.1810.0370.0850.2900.520
DN 4800.0310.0710.0300.0780.0320.0910.0380.1080.0320.0880.2890.4830.0790.1990.0390.0920.2610.487
DN 6000.0310.0740.0340.0780.0320.0910.0360.0770.0360.0970.2530.4320.0730.1910.0380.0860.2440.471
YJ 1200.5890.5450.5780.5450.5610.5230.9990.7150.7520.6140.9760.7020.5820.5570.7480.6190.7610.613
YJ 2400.5850.5470.5740.5370.6580.5711.0300.7530.6680.5701.1120.8140.5990.5770.7480.6180.8510.664
YJ 3600.5890.5620.5810.5670.6670.5931.1750.8220.7680.6621.1650.8340.6750.5881.0930.8130.7640.632
YJ 4800.5720.5510.5850.5690.6720.5901.1570.8280.7160.6421.1770.8670.7660.6250.8890.7180.8330.667
YJ 6000.5840.5610.597 0.5850.6630.5871.2460.8660.6610.5911.3060.9121.0110.7300.8780.7010.8480.672
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

He, C.; Peng, Z.; Peng, L.; Liu, Y.; Zhang, C.; Zhu, L.; Tan, S.; Zou, L. Chilo suppressalis Population Dynamics Forecasting by Exponential Smoothing Decomposition and Multi-Stream Network. Agriculture 2025, 15, 2474. https://doi.org/10.3390/agriculture15232474

AMA Style

He C, Peng Z, Peng L, Liu Y, Zhang C, Zhu L, Tan S, Zou L. Chilo suppressalis Population Dynamics Forecasting by Exponential Smoothing Decomposition and Multi-Stream Network. Agriculture. 2025; 15(23):2474. https://doi.org/10.3390/agriculture15232474

Chicago/Turabian Style

He, Chao, Ziang Peng, Longhuang Peng, Yi Liu, Chengyuan Zhang, Lei Zhu, Siqiao Tan, and Ling Zou. 2025. "Chilo suppressalis Population Dynamics Forecasting by Exponential Smoothing Decomposition and Multi-Stream Network" Agriculture 15, no. 23: 2474. https://doi.org/10.3390/agriculture15232474

APA Style

He, C., Peng, Z., Peng, L., Liu, Y., Zhang, C., Zhu, L., Tan, S., & Zou, L. (2025). Chilo suppressalis Population Dynamics Forecasting by Exponential Smoothing Decomposition and Multi-Stream Network. Agriculture, 15(23), 2474. https://doi.org/10.3390/agriculture15232474

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop