Previous Article in Journal
Fault-Tolerant Designs of Graphs with Gallai’s Property in Euclidean Space Tilings
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Partial-Information Node-Level Forecasting in Directed Logistics Networks via Topology-Perturbation Encoding

College of Electrical Engineering, North China University of Science and Technology, Tangshan 063210, China
*
Author to whom correspondence should be addressed.
Math. Comput. Appl. 2026, 31(3), 107; https://doi.org/10.3390/mca31030107 (registering DOI)
Submission received: 11 May 2026 / Revised: 6 June 2026 / Accepted: 11 June 2026 / Published: 13 June 2026

Abstract

Node-level cargo-volume forecasting in logistics sorting networks requires modeling temporal dynamics together with directed inter-node dependencies and planned topology perturbations. This study addresses 1-h-ahead forecasting under a partial-information boundary, where historical node volumes, the pre-change network structure, and planned route-topology changes are available before prediction, whereas continuous post-change dynamic edge weights and realized post-change graph states are unavailable. We propose a perturbation-aware framework that represents the sorting system as a directed network and integrates temporal features, pre-change structural descriptors, topology-change encodings, perturbation-response proxies, and similarity-assisted support for data-scarce nodes within a unified forecasting protocol. A shared random forest backbone is used only to assess the incremental value of these representations. Experiments on 57 sorting centers show that temporal dynamics dominate under stable-network conditions. Under topology perturbation, topology-change signals reduce test weighted absolute percentage error (WAPE) from 18.10% to 17.11%, and perturbation-response proxies further reduce it to 16.91%. For data-scarce nodes, similarity support reduces test WAPE from 29.43% to 26.68%, with consistent gains under 10%, 20%, and 30% sample-retention settings. These results suggest that the framework provides an interpretable and information-admissible representation strategy for node-level forecasting in directed networked systems.

1. Introduction

Short-term node-level cargo-volume forecasting in logistics sorting networks can be formulated as an applied mathematical and computational problem on a directed network. In such systems, sorting centers are represented as nodes, and route connections define directed dependencies through which upstream volume variation, downstream transfer capacity, diversion patterns, and network topology may influence local cargo volumes. The forecasting target is therefore not an isolated local time series, but a node-level response embedded in a route-connected system [1,2]. Although such forecasts can support downstream applications, the central modeling problem is how temporal signals and network-coupled structural information can be organized into admissible computational inputs for node-level prediction.
This problem is particularly challenging under a partial-information boundary. Before prediction, historical node volumes, the pre-change network structure, and planned route-topology adjustments may be available, whereas continuous post-change dynamic edge weights, realized cargo transfers, and updated graph states are typically unavailable in disruption-management, dynamic-routing, and logistics-network resilience settings [3,4,5,6,7,8,9,10]. This differs from full-information dynamic-graph forecasting, where evolving node–edge states, adaptive graph weights, or continuously updated graph structures can be observed during model construction [11,12,13,14,15,16,17]. Under the information boundary considered here, forecasting inputs must be constructed without target-period node volumes, realized post-change edge flows, or updated dynamic graph states. The key computational question is therefore how to encode planned topology changes and potential node-level perturbation responses in an interpretable and information-admissible manner. Figure 1 illustrates the adopted forecasting setting by linking directed network coupling, planned topology perturbation, the 1-h-ahead node-level forecasting target, and computational validation outputs.
Several research streams provide useful foundations for this problem, although each addresses only part of the present setting.
First, short-term logistics demand and cargo-volume forecasting studies have developed statistical, machine-learning, neural-network, and ensemble methods for operational, freight, regional, distribution-center, and supply-chain demand prediction [18,19,20,21,22,23,24,25,26,27,28]. These studies show that lagged observations, temporal regularity, nonlinear learning, and ensemble modeling can support logistics-related forecasting. However, many formulate the target as an aggregate demand series, regional indicator, or operational planning unit, rather than as node-level cargo-volume forecasting in a directed logistics network. This study therefore reformulates cargo-volume forecasting as a node-level prediction problem under directed route dependence.
Second, graph-based and spatiotemporal forecasting studies show that network relations can be used to model inter-node dependence and temporal evolution. Representative models, including diffusion graph forecasting, spatiotemporal graph convolution, adaptive graph learning, and dynamic spatial–temporal graph neural networks, demonstrate the value of graph structures in transportation and networked forecasting [11,12,13,14,15,16,17]. They also show that node states should not be treated as independent time series when network relations exist. However, many such models require observed dynamic node–edge states, adaptive graph weights, or updated graph structures during model construction. In the present setting, these post-change graph states are unavailable before prediction, creating a different problem: how to transform pre-forecast and planned topology information into admissible node-level predictors.
Third, studies on topology perturbation, disruption propagation, dynamic routing, reliable logistics-network design, and supply-chain resilience explain how topology changes, route disruptions, node failures, rerouting pressure, and inter-tier connections influence network performance [3,4,5,6,7,8,9,10,29]. These studies are important because they show that network structure and disruption propagation can affect local load and system-level performance. Nevertheless, their main focus is usually disruption explanation, resilience assessment, dynamic routing, or network-design optimization. Few studies convert planned route-topology changes into computable node-level variables for 1-h-ahead cargo-volume forecasting under a strict partial-information boundary. This motivates the perturbation-response proxy representation proposed in this paper.
Fourth, data-scarce, few-shot, transfer-learning, and neighborhood-support forecasting studies show that information from related regions, nodes, or systems can support prediction when local observations are limited [30,31,32,33]. These studies provide a methodological basis for transferable or similarity-based support in low-data settings. In this study, however, data scarcity is not treated as a separate forecasting task with a different target or evaluation logic. Similarity-assisted support is instead introduced under the same forecasting target, sample-construction rule, forecasting backbone, and chronological evaluation protocol, allowing its incremental value to be evaluated without changing the prediction problem.
The remaining gap is therefore not merely the absence of a more accurate forecasting algorithm but how to represent partial pre-forecast information in a directed logistics network. Planned topology changes must be transformed into interpretable and information-admissible node-level variables, while similarity support must be incorporated without changing the forecasting target or evaluation protocol. Table 1 summarizes these four research streams, their limitations under the partial-information boundary, and how this study addresses each computational gap.
To address this gap, this study develops a perturbation-aware computational framework for 1-h-ahead node-level cargo-volume forecasting in directed logistics sorting networks. The framework represents the logistics system as a directed graph and constructs supervised node-hour samples using temporal features, pre-change structural descriptors, topology-change encodings, perturbation-response proxies, and similarity-assisted support features. A shared random forest model is used as a controlled forecasting backbone for mixed temporal, structural, perturbation-related, and similarity-derived inputs [34]. The novelty lies not in the random forest algorithm itself but in the information-admissible representation of planned topology perturbations and limited-history support.
This study makes four main contributions. First, it formulates 1-h-ahead node-level cargo-volume forecasting as a partial-information problem on a directed logistics network. Second, it introduces topology-change encodings and perturbation-response proxies to capture neighborhood reconfiguration, damage to the pre-change connectivity basis, and rerouting pressure without reconstructing post-change edge-level flows. Third, it extends the same forecasting protocol to data-scarce nodes using similarity-assisted support from topological structure, cargo-volume patterns, and periodic regularity. Fourth, it provides a unified validation design for stable-network, topology-perturbation, and data-scarce-node settings, covering ablation analysis, sensitivity analysis, node-level gain coverage, statistical testing, and high-load warning evaluation.
The principal findings are that temporal signals dominate stable-network forecasting, topology-change and perturbation-response inputs provide incremental predictive value under route perturbations, and similarity-assisted support improves transitional forecasting for data-scarce nodes. Specifically, topology-change signals reduce test WAPE from 18.10% to 17.11%, perturbation-response proxies further reduce it to 16.91%, and similarity support reduces test WAPE from 29.43% to 26.68% under the data-scarce-node setting. These results indicate that topology perturbation and similarity information can improve node-level forecasting within the adopted partial-information boundary, without relying on target-period node volumes or realized post-change dynamic edge weights.
The remainder of this paper is organized as follows. Section 2 describes the data, directed network representation, feature construction, perturbation-response proxies, similarity support, and evaluation protocol. Section 3 reports the experimental results under stable-network, topology-perturbation, and data-scarce-node settings. Section 4 discusses methodological implications, limitations, and generalizability. Section 5 concludes the paper.

2. Materials and Methods

2.1. Research Scope and Boundary Conditions

This study examines 57 sorting centers in a logistics sorting network and addresses node-level short-term cargo-volume forecasting under network dependence, topology perturbation, and uneven data availability. The logistics system is modeled as a directed graph rather than as a set of independent nodes [1,2,11,12,13,14,15,16,17]. Each sorting center is treated as a forecasting node, and directed route connections define the structural relations through which local volume dynamics may be affected by upstream and downstream network states.
The empirical node set is defined by the dataset rather than selected by the authors. The benchmark data contain 57 anonymized sorting centers, all of which are retained after node-index unification. No center is included or excluded according to operational importance, cargo scale, geographic representativeness, or forecasting performance. Accordingly, the experiments should be interpreted as covering the complete anonymized node set available in the benchmark data under the adopted partial-information boundary, rather than as a statistically representative sample of all logistics sorting networks.
Three task settings are examined within the same computational framework. The Base task evaluates routine node-level forecasting under stable-network conditions, following short-term logistics demand and cargo-volume forecasting studies [18,19,20,21,22,23,24,25,26,27,28]. The Core task evaluates forecasting under topology perturbation, where planned route-topology changes are known before prediction but continuous post-change edge weights are unavailable, consistent with disruption-management, dynamic-routing, and logistics-network resilience studies [3,4,5,6,7,8,9,10,29]. The Extension task applies the same forecasting protocol to a controlled limited-history setting, in which selected nodes retain limited but nonzero local observations, following data-scarce, few-shot, transfer-learning, and neighborhood-support forecasting studies [30,31,32,33].

Partial-Information Boundary and Computational Workflow

The partial-information boundary is implemented as a methodological constraint. Feature construction uses only information available before the forecasting time, including historical node-level cargo volumes, the pre-change network structure, and planned route-topology changes. Realized post-change edge flows, target-period node volumes, updated graph states, and continuous post-change dynamic edge weights are excluded. This restriction distinguishes the proposed setting from full-information dynamic-graph forecasting and is consistent with studies that distinguish planned or observable pre-forecast information from realized dynamic network states [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,29].
The data-scarce-node setting is designed as a controlled limited-history simulation rather than as a naturally missing-data scenario or a complete cold-start problem. It evaluates whether transferable network, temporal, and similarity-based information can support short-term forecasting when local training observations are intentionally restricted but remain nonzero [30,31,32,33].
Figure 2 summarizes the partial-information forecasting workflow and task-specific input organization.

2.2. Data Description and Availability

The empirical data are derived from the anonymized logistics sorting-network benchmark dataset described in the Data Availability Statement. Four data components are integrated under a unified node-index mapping: daily node-volume records, hourly node-volume records, pre-change route-volume records, and planned topology-adjustment records. The selected periods correspond to the complete and internally consistent data windows available for each component. Daily records from 1 August to 30 November 2023 provide medium-term trends and day-level periodic information, while hourly records from 1 to 30 November 2023 provide high-frequency observations for lag-feature construction, 24 h historical windows, and 1-h-ahead forecasting. The pre-change route-volume records summarize historical route volumes over the preceding 90 days and are used to construct the weighted directed network before topology adjustment. The planned topology-adjustment records describe pre-announced route connectivity and are used only for planned topology-change encoding; they do not contain realized post-change cargo flows, target-period node volumes, or continuous post-change dynamic edge weights.
All four data components were derived from the same anonymized benchmark dataset, and all 57 available sorting centers were retained after node-index unification. Table 2 summarizes the data scope, preprocessing statistics, chronological split, supervised-sample construction, and information-exclusion rules.
Based on the summary in Table 2, Figure 3 illustrates how the raw records are transformed into model-ready task datasets through node-index unification, temporal alignment, missing-value imputation, network-matrix construction, topology-change extraction, supervised-sample generation, chronological splitting, and leakage prevention. Across all tasks, only information available up to forecast origin t is used to predict the node-level cargo volume at t + 1 .
Supervised samples are constructed with a 24 h historical window and a unified 1-h-ahead node-level target. To avoid cross-split historical leakage, the window is constructed within each chronological split. Excluding the first 24 h of each split yields 36,936 valid supervised samples, corresponding to 57 × 648 node-hour forecast origins.
Each supervised sample is indexed by node i and forecast origin time t , uses only information available up to t , and predicts the node-level cargo volume at t + 1 . The Base, Core, and Extension tasks share the same target, split, and leakage-prevention protocol, but activate different input modules. Appendix A.2 Table A2 provides a compact sample-structure example and one anonymized model-ready sample row. Numerical values are masked, anonymized, or rounded only to illustrate the supervised node-hour sample structure without disclosing raw operational records.
In the controlled limited-history experiment, the 57 nodes are divided into 40 data-rich nodes and 17 data-scarce nodes. For data-scarce nodes, only the earliest 20% of training samples are retained, while validation and test observations remain unchanged under the same chronological protocol. This setting approximates newly activated, recently reconfigured, or low-history nodes with limited but nonzero local records. It is not a naturally missing-data scenario because the history reduction is imposed for experimental control, nor a complete cold-start setting because each data-scarce node retains partial local history.

2.3. Logistics Network Representation

The logistics system is represented as a directed network
G = ( V , E ) ,
where V = 1 , , N is the set of sorting centers, E is the set of directed transport connections, and N = 57 . This representation is consistent with graph-based forecasting studies that model inter-node dependence through network structure [11,12,14,15].
Because cargo transfer is directional, the edges are directed. Let i , j { 1 , , N } . A transport route from node i to node j is denoted by e i j . In general, the existence or strength of e i j does not imply the same existence or strength of e j i .
Under regular conditions, the pre-change network dataset is used to construct the original weighted directed adjacency matrix:
A b e f o r e = a i j b e f o r e ,
where a i j b e f o r e denotes the historical route strength from node i to node j , measured by the average cargo volume on that route over the previous 90 days. If no directed route exists from node i to node j , then a i j b e f o r e = 0 .
For topology-level comparison, the pre-change binary topology matrix is defined as
A ~ b e f o r e = a ~ i j b e f o r e ,
where
a ~ i j b e f o r e = 1 , if   a   directed   connection   exists   from   node   i   t o   n o d e   j , 0 , otherwise .
Under route-change conditions, the planned post-adjustment topology is represented by the post-change topology matrix:
A ~ a f t e r = a ~ i j a f t e r ,
where
a ~ i j a f t e r = 1 , if   a   directed   connection   exists   after   route   adjustment , 0 , otherwise .
It should be emphasized that A ~ a f t e r is a planned topology matrix rather than a realized post-change weighted-flow matrix. It records whether a route connection is planned to exist after adjustment, but it does not contain realized post-change cargo volumes or continuous dynamic edge weights.
The topology-change matrix is then defined as
A = A ~ a f t e r A ~ b e f o r e ,
For each directed pair i   j , a i j = 1 indicates an added connection, a i j = 1 indicates a removed connection, and a i j = 0 indicates no topology change. This representation converts planned route adjustments into a computable perturbation matrix while remaining within the partial-information boundary.
Node in-degree, out-degree, weighted in-degree, and weighted out-degree are also computed as basic structural indicators. They are not forecasting targets, but they support structural feature construction, perturbation characterization, and node-similarity analysis.

2.4. Multi-Source Feature Construction

A unified multi-source feature system is built from the network representation. Inputs are grouped into temporal, structural, perturbation-response, and similarity-support features, corresponding to short-term node dynamics, network position, local topology-perturbation response, and transferable support for data-scarce nodes.
For node i at time t , the general multi-source feature vector can be written as
x i , t = x i , t t i m e , g i , z i p e r t , s i s i m ,
where x i , t t i m e denotes temporal features, g i denotes pre-change structural features, z i p e r t denotes perturbation-response proxy features, and s i s i m denotes similarity-support information. The specific activated feature groups differ across the Base, Core, and Extension tasks, as defined in Section 2.6.

2.4.1. Temporal Features

Temporal features are the main source of short-term predictive information. Using the daily and hourly datasets, this study extracts lag features and moving averages to represent recent fluctuations and local trends [22,23,25,26]. Calendar and periodic encodings are further used to capture intraday and recurrent operating rhythms [18,19,20,21,24]. Missingness indicators are retained to preserve information contained in observation status.

2.4.2. Structural Features

Structural features describe each node’s position and relational strength in the pre-change network. They are derived from the pre-change weighted adjacency matrix A b e f o r e and the binary topology matrix A ~ b e f o r e , consistent with studies that model inter-node dependency through network topology and graph representations [1,2,11,12,14,15]. These features include in-degree, out-degree, neighbor scale, weighted degree, and upstream and downstream connection strength. They provide routine relational context in the Base task, the pre-change connectivity basis in the Core task, and a structural basis for comparing data-scarce and data-rich nodes in the Extension task.

2.4.3. Perturbation-Related Features

Perturbation-response features describe local structural reconfiguration and load-bearing pressure under route changes, following studies on logistics-network topology, disruption propagation, and resilience [1,2,3,4,5,6,7,8,9,10,29]. They are derived from the planned topology-change matrix Δ A , the pre-change weighted adjacency matrix A b e f o r e , and local neighborhood indicators. Their detailed proxy definitions are provided in Section 2.5.
These features should be interpreted as partial-information proxy variables. They do not reconstruct true post-change edge-level cargo flows, but they provide admissible node-level inputs for representing potential local perturbation response before realized post-change edge weights become available.

2.4.4. Similarity-Support Features

Similarity-support features are constructed for the data-scarce-node scenario. They identify the most similar reference node from three dimensions: topological structure, cargo-volume pattern, and periodic regularity, following transfer-support and neighborhood-information ideas in data-scarce prediction settings [30,31,32,33]. The resulting component scores and composite similarity score provide auxiliary transferable information for nodes with limited local history.
These feature groups serve as task-activated input modules. The next section develops the perturbation-response proxy representation for route-change scenarios.

2.5. Perturbation-Response Proxy Representation

In logistics sorting networks, route changes affect node load indirectly by altering local connectivity, triggering possible cargo-flow redistribution, and producing short-term load fluctuations [1,2,3,4,5,6,7,8,9,10,29]. This study therefore represents route changes as node-level perturbation states rather than simple event labels. Because continuous post-change edge weights are unavailable, these states are described by proxy variables derived from observable structural change [3,5,6,7,8,9,10].
Based on Figure 4, route changes are translated into the following computable and interpretable perturbation-response proxies.

2.5.1. Neighborhood Change Rate

The neighborhood change rate measures the relative extent of local connectivity reconfiguration. Let n i a d d and n i d e l denote the numbers of added and removed connections associated with node i , respectively, and let d i p r e denote its pre-change neighborhood size. The neighborhood change rate is defined as
C i = n i a d d + n i d e l d i p r e + ε ,
where ε is a small constant used to avoid division by zero. This variable uses only pre-change topology, planned post-change topology, and their difference. A larger C i indicates stronger local reconfiguration and a higher likelihood of subsequent redistribution pressure under the partial-information boundary.

2.5.2. Perturbation Intensity

Connection counts alone cannot reflect the severity of a perturbation, because removing high-weight routes is usually more consequential than modifying low-weight routes. Perturbation intensity therefore measures damage to the original strong-connectivity basis. Let w i l o s s denote the total lost pre-change weight associated with removed connections, w i a d d denote the compensatory proxy contribution associated with newly added connections, and b i p r e denote the original weighted connectivity basis. The perturbation intensity is defined as
I i = m a x 0 , w i l o s s λ p w i a d d b i p r e + ε ,
where w i a d d does not represent observed post-change flow. Instead, it is a topology-based compensatory proxy constructed from planned added connections and pre-change node-level connectivity strength. The parameter λ p controls how strongly this proxy compensation offsets the lost pre-change connectivity basis. In the main experiments, λ p = 0.50 is used because it yields the lowest validation WAPE among the representative settings examined in Appendix B.1 and Table A3. This parameter should therefore be interpreted as a validation-selected proxy-construction weight, not as a realized post-change flow-redistribution parameter. The resulting I i is a partial-information proxy, with larger values indicating greater damage to the pre-change operating basis.

2.5.3. Rerouting Pressure Index

Node-level deviations after route changes depend not only on connection loss but also on temporary diversion pressure. The rerouting pressure index captures this local load-bearing risk:
P i = I i C i ,
This index combines connectivity-basis damage and neighborhood reconfiguration. Compared with a simple topology-change indicator, it better reflects whether a node may become a local pressure-bearing point during cargo-flow reorganization. It remains a proxy variable under partial information and should not be interpreted as an observed rerouted cargo volume.

2.5.4. Node-Level Perturbation State Representation

The perturbation state of node i is represented as:
z i p e r t = [ I i c h g , C i , I i , P i ] ,
where I i c h g = 1 if any added or removed connection exists in the neighborhood of node i , and 0 otherwise.
This state vector does not attempt to recover the true post-change edge-level propagation process. Instead, it converts route changes into structured, computable inputs that reflect local structural reconfiguration, damage to the connectivity basis, and potential load-bearing pressure under the adopted information boundary. These variables form the perturbation-response module activated in the Core task and, as complementary information, in the Extension task, as summarized in Table 3.

2.5.5. Verification Strategy for Perturbation-Response Proxy Design

The perturbation variables are evaluated as a proxy set rather than as direct reconstructions of edge-level flow propagation. Their validity is assessed through two checks. First, Appendix B.1 and Table A3 report the sensitivity analysis for the perturbation-intensity balancing parameter λ p . The test WAPE remains within a narrow range across representative λ p values, indicating that the Core-task conclusion is not driven by a single parameter setting. Second, node-level statistical tests, gain coverage, heterogeneity analysis, and high-load warning results examine whether the proxy set provides stable incremental value within the unified framework.

2.6. Unified Forecasting Framework as a Testable Research Design

Random forest is used as a shared forecasting backbone to evaluate the proposed representations under a controlled setting. It is suitable because it handles nonlinear tabular inputs, mixed temporal and graph-derived features, and interactions among heterogeneous feature groups [34]. Unlike dynamic graph models, it does not require continuous post-change dynamic edge weights, updated graph states, or long sequence inputs, which are unavailable under the adopted partial-information boundary.
Using a common random forest backbone isolates the incremental value of topology-change encodings, perturbation-response proxies, and similarity-support features under the same forecasting target, sample-construction rules, chronological split, and evaluation metrics. Thus, the methodological contribution lies in the information-admissible representation of planned topology perturbations and limited-history support, rather than in the random forest algorithm itself.
Under the same forecasting target, sample-construction rules, temporal split, and evaluation metrics, the Base, Core, and Extension tasks evaluate the computational value of task-specific information modules. Table 3 summarizes the shared design and task-specific modules.

2.6.1. Framework Overview

Let the input sequence of node i over a historical window of length L at time t be:
X i ( t L + 1 : t ) = x i t L + 1 , x i t L + 2 , , x i t ,
where x i t is the multi-source feature vector of node i at time t . Depending on the task, this vector may contain temporal, structural, perturbation-response, and similarity-support features.
The unified forecasting target is
y ^ i t + 1 = f u i t ,
where u i t is the task-specific input vector at time t . The Base task uses temporal and structural features. The Core task adds perturbation-response inputs. The Extension task further adds similarity-support information. The target y i t + 1 , sample-construction logic, chronological split, and evaluation metrics remain identical across tasks.

2.6.2. Multi-Source Input Organization

Candidate inputs are drawn from four sources: temporally unfolded historical features, pre-change structural features, perturbation-response proxy features, and similarity-support information for data-scarce nodes. Let φ ( ) denote temporal unfolding within the historical window, and let X i ( t L + 1 : t ) denote the temporal subsequence of node i . Temporal features capture short-term evolution and periodicity, structural features describe regular network position, perturbation features represent local reconfiguration and temporary load-bearing pressure, and similarity features provide transferable support when history is insufficient.

2.6.3. Unified Predictor

Random forest is used as the common predictor [34]. It consists of multiple regression trees trained with bootstrap sampling and random feature selection at each split. For any input sample,
y ^ i , t + 1 = 1 M m = 1 M T m u i t ,
where T m ( ) is the output of the m-th regression tree. Random forest is used as a controlled forecasting backbone for representation evaluation rather than as the methodological novelty of this study. The input vector consists of heterogeneous tabular variables, including lagged temporal features, static structural descriptors, topology-change indicators, perturbation-response proxies, and similarity-support features. RF is suitable because it can model nonlinear interactions among mixed feature groups without requiring continuous post-change dynamic edge weights, updated graph states, or long sequence inputs.
The same RF model class and leakage-prevention protocol are therefore used across the Base, Core, and Extension tasks. Hyperparameters are selected only from predefined validation-set search spaces within each task, and the test set is not used for parameter selection. This controlled design keeps the model family fixed while allowing validation-based tuning, thereby isolating the contribution of topology-change encoding, perturbation-response proxies, and similarity-support features under the same forecasting target, sample-construction rule, and chronological evaluation protocol.

2.6.4. Task-Specific Input Forms

Under the shared design, the three tasks differ only in the activated input modules. The Base-task input is:
u i , t b a s e = [ φ ( X i , t i m e ( t L + 1 : t ) ) , g i ] ,
where g i denotes the structural feature vector of node i .
The Core-task input is:
u i , t c o r e = [ φ ( X i , t i m e ( t L + 1 : t ) ) , g i , z i p e r t ] ,
where z i p e r t denotes the perturbation-response state vector defined in Section 2.5. For the Core-task ablation variants, the perturbation-related module is controlled by either removing it, as in RF-Core-Plain, or replacing it with basic topology-change indicators, as in RF-Core-TopoOnly. Thus, the Core-task variants differ only in the level of perturbation-information representation, while the forecasting target, sample-construction rule, and evaluation protocol remain unchanged.
The Extension-task input is:
u i , t e x t = [ φ ( X i , t i m e ( t L + 1 : t ) ) , g i , z i p e r t , s i s i m ] ,
where g i denotes structural features and s i s i m similarity-support information.

2.7. Similarity-Assisted Extension for Data-Scarce Nodes

To evaluate similarity-assisted support under controlled limited-history conditions, this study constructs the Extension task by intentionally restricting the available training history of selected nodes while retaining their validation and test observations for chronological evaluation. This design simulates limited but nonzero local history rather than naturally missing data or a complete cold-start setting [30,31,32,33].
Node similarity is defined along three dimensions: topology, cargo-volume pattern, and periodic regularity. The composite similarity between target node i and candidate reference node j is:
S i j = λ 1 S i j t o p o + λ 2 S i j p a t t e r n + λ 3 S i j p e r i o d ,
where S i j t o p o , S i j p a t t e r n , and S i j p e r i o d denote topological similarity, cargo-volume pattern similarity, and periodic similarity, respectively. The weights λ 1 , λ 2 , and λ 3 are validation-selected combination weights rather than causal parameters.
Topological similarity is computed from standardized structural indicators, including in-degree, out-degree, weighted degree, neighborhood size, and relative structural role. Cargo-volume pattern similarity uses historical volume level, fluctuation features, and short-term variation patterns. Periodic similarity captures consistency in intraday rhythms and recurring operating patterns. All three scores are normalized before weighted summation.
All similarity scores are computed using only training-period or pre-forecast historical observations. Test-period cargo volumes are not used in reference-node selection or similarity-weight estimation. The final weight combination is λ 1 = 0.25 , λ 2 = 0.25 , and λ 3 = 0.50 . This setting gives greater weight to short-term periodic regularity and is chosen by validation comparison.
For each data-scarce node, the most similar data-rich reference node is
j * ( i ) = a r g   m a x j R S i j ,
where R is the set of data-rich nodes. The selected reference-node information is then added as auxiliary input together with the target node’s temporal, structural, and perturbation-related features. This design provides similarity-assisted support for limited-history nodes while keeping the forecasting target, sample-construction rule, forecasting backbone, and evaluation protocol unchanged.

2.8. Baselines, Evaluation Metrics, and Experimental Setting

All comparative experiments use the same sample-construction procedure, chronological split, and evaluation metrics. The aim is to test whether the proposed partial-information representations provide incremental forecasting value under a consistent modeling setting, rather than to present a universal ranking of forecasting algorithms.

2.8.1. Baselines and Comparison Levels

The comparison includes statistical and time-series baselines, conventional machine-learning models, recurrent deep-learning baselines, feature-fusion deep-learning references, and the proposed random forest comparison series [27,28,34,35]. These models provide reference levels ranging from simple historical extrapolation to nonlinear tabular learning, sequence modeling, and feature-fusion forecasting under the same information boundary.
The random forest series forms the main comparison chain. RF-Plain (Base) and RF-Base are used to test the marginal value of structural information under stable-network conditions. In the Base-task comparison, RF-Plain (Base) denotes the temporal-feature RF baseline, whereas RF-Base denotes the structure-augmented RF model using temporal features and pre-change structural features.
Within the Core task, three RF variants are constructed to separate the contribution of different perturbation-related representations. RF-Core-Plain uses temporal features and pre-change structural features under the route-change evaluation period but does not include explicit topology-change or perturbation-response variables. RF-Core-TopoOnly further adds observable topology-change indicators, such as added-edge counts, removed-edge counts, and change flags. RF-Core-Proposed adds the perturbation-response proxy state, including neighborhood change rate, perturbation intensity, and rerouting pressure index, together with the validation-selected feature subset. RF-LowData-Plain and RF-LowData-Similarity test the value of similarity support under the data-scarce-node scenario. A supplementary temporal–graph reference model, Temporal Encoder + Graph Residual + MLP (TE-GR-MLP), is also included because it combines node history with the pre-change weighted directed network without requiring continuous post-change dynamic edge weights.
The model groups, available input information, and comparison status are summarized in Table 4.

2.8.2. Evaluation Metrics

MAE, RMSE, MAPE, and WAPE are reported throughout the experiments. MAE measures average absolute error, RMSE gives greater weight to large deviations, and MAPE provides a relative-error reference but may be unstable for low-volume samples. WAPE is therefore used as the primary metric because it better captures aggregate forecasting error across heterogeneous cargo-volume scales.
High-load warning performance is evaluated to examine whether forecasting improvements enhance the identification of high-volume node-time periods. For each node i , the high-load threshold is defined as the q -th empirical quantile of its raw hourly cargo volume observed before the corresponding test period:
θ i ( q ) = Q q ( { y i , t : t < T i , 0 t e s t } ) ,
where T i , 0 t e s t is the first test timestamp of node i . The main warning analysis uses q = 0.90 , and the threshold-sensitivity analysis uses q { 0.80,0.85,0.90,0.95 } .
The observed high-load indicator is defined as
h i , t = I ( y i , t θ i ( q ) ) ,
Predicted high-load events are obtained by applying the same node-specific threshold to the predicted cargo volumes. Based on TP, FP, FN, and TN computed on the test samples, the warning metrics are defined as follows:
R e c a l l = T P T P + F N ,
M i s s R a t e = F N T P + F N ,
F a l s e W a r n i n g R a t i o = F P T P + F P ,
Here, F a l s e W a r n i n g R a t i o denotes the proportion of predicted warning events that are false alarms. In addition, a continuous asymmetric weighted loss is reported:
W e i g h t e d L o s s = 1 N ( i , t ) 2 m a x ( y i , t y ^ i , t , 0 ) + m a x ( y ^ i , t y i , t , 0 ) ,
where under-prediction is penalized twice as heavily as over-prediction. The warning metrics and the continuous weighted loss are used only for evaluation, not for model training.

2.8.3. Data Processing and Leakage Prevention

Table 2 reports the aggregate preprocessing and split statistics. Missing values were identified on the complete hourly node-time grid of 57 × 30 × 24 = 41,040 expected records, corresponding to 57 sorting centers, 30 days, and 24 hourly observations per day. Among 7759 missing entries, 402 were filled by local linear interpolation, 3909 by node-specific same-hour means, and 3448 by node-level overall means. No missing values remained after imputation, and missingness/imputation indicators were retained as input features.
To prevent information leakage, imputation statistics, normalization parameters, feature-screening decisions, similarity-node selection, similarity-weight estimation, and hyperparameter tuning were determined only from the training and validation periods or from pre-forecast information. The test period was reserved for final evaluation. The 24 h historical window was also constructed within each chronological split, yielding 25,992 training, 5472 validation, and 5472 test supervised samples.
All topology-change variables were constructed from planned route-adjustment information available before the forecasting timestamp, rather than from realized post-change flows, target-period node volumes, or continuous post-change dynamic edge weights.

2.8.4. Parameter Setting and Lightweight Screening

The random forest series is implemented in MATLAB R2021a using staged parameter search. Preliminary screening compares the number of trees, minimum leaf size, and random feature-subset size on the validation set. For the final Core-task model, a full grid search with multi-random-seed repetition is applied after the final feature subset is determined. Mean validation WAPE is used as the primary selection criterion, with standard deviation as a stability reference. The test set is not used for parameter selection or feature screening. Appendix A.2 Table A1 summarizes the candidate search spaces, final settings, and validation-based selection criteria for all comparison models.
Before task-specific activation, candidate predictors are organized into four groups: temporal features, pre-change structural features, topology-change and perturbation-related features, and similarity-support features. The model-ready feature set contains 25 temporal and quality-control features, 12 pre-change structural features, 16 topology-change and perturbation-related features, and 12 similarity-support features. The Base task activates temporal and structural information, the Core task adds topology-change and perturbation-related information, and the Extension task further includes similarity-support information for limited-history nodes.
After task-specific organization, the compact feature-channel dimensions are 36 for the Base task, 52 for the Core task, and 56 for the Extension task. These values describe task-level feature channels, not the final predictor count used by random forest. In the RF implementation, time-indexed variables and selected feature channels are flattened into an expanded predictor matrix after temporal unfolding and validation-only feature screening. Thus, the RF-Core-Proposed setting of mtry = 386 in Appendix A.2 Table A1 refers to the number of predictors sampled at each tree split from the expanded RF predictor matrix, not to the compact 52-channel Core-task dimension. Feature organization, screening, similarity-weight selection, and hyperparameter tuning are conducted only with the training and validation periods. The test set is not used for feature selection, feature screening, similarity-node identification, or parameter tuning. Detailed hyperparameter configurations are reported in Appendix A.2 Table A1.

2.8.5. Comparison Boundary and Fairness

Because continuous post-change dynamic edge weights are unavailable, the main comparison is limited to models implementable under the same pre-forecast information boundary. Dynamic-edge GNN/STGNN variants are therefore excluded from the fair main comparison. Table 4 defines this information-admissible comparison boundary by distinguishing models implementable with pre-forecast information from dynamic-graph models that require unavailable post-change edge weights. Accordingly, the results should be interpreted as evidence on partial-information representation design rather than as a universal ranking of forecasting algorithms.

2.8.6. Implementation Environment and Reproducibility

Data processing, sample construction, and model experiments are conducted mainly in MATLAB R2021a. The workflow includes raw-data cleaning, node-time sample construction, chronological train-validation-test splitting, and supervised-sample generation for the Base, Core, and Extension tasks. All tasks share the same data-preparation workflow; they differ only in input composition. Reproducibility is supported by the unified workflow, shared temporal split, shared evaluation system, leakage-prevention rules, and transparent reporting of parameter settings and implementation details under the available data constraints.

3. Results

The results are organized around the three computational validation settings defined in the unified framework. The Base task evaluates node-level forecasting under stable-network conditions. The Core task examines forecasting under topology perturbation when planned route changes are observable but continuous post-change dynamic edge weights are unavailable. The Extension task evaluates the same forecasting protocol for data-scarce nodes with limited local history. Figure 5 summarizes the main predictive performance across these three settings.

3.1. Performance Under Stable-Network Conditions

Table 5 reports the Base-task comparison under stable-network conditions. RF-Plain (Base), the temporal-feature random forest baseline, achieves a test WAPE of 16.98%, whereas RF-Base, which additionally incorporates pre-change structural features, yields a test WAPE of 17.06%. These results suggest that recent node-level observations and periodic temporal regularities dominate short-term prediction when the topology is stable. Pre-change structural descriptors provide relational context, but their marginal value is limited when recent temporal information is already available.

3.2. Performance Under Topology-Perturbation Conditions

Table 6 reports the Core-task comparison under topology perturbation. RF-Core-Plain obtains a test WAPE of 18.10%; adding topology-change signals in RF-Core-TopoOnly reduces WAPE to 17.11%, and further adding perturbation-response proxies in RF-Core-Proposed reduces it to 16.91%. This two-stage reduction shows that topology-change encodings and perturbation-response proxies provide incremental value under the partial-information boundary by representing structural reconfiguration and rerouting pressure. Because realized post-change edge weights, target-period node volumes, and updated graph states are unavailable, this result should be interpreted as evidence for information-admissible topology-perturbation representation, not as a claim of large-scale error reduction.
Table 7 reports the supplementary TE-GR-MLP reference results under the same information boundary. This comparison provides an additional temporal–graph reference based on node history and the pre-change weighted directed network, without using continuous post-change dynamic edge weights or updated graph states. Together with the benchmark results in Table 6, it supports the value of the proposed information-admissible input representations rather than a universal ranking of forecasting model classes.

Sensitivity Verification of Perturbation-Response Proxy Design

Sensitivity and ablation tests show that the Core-task gains remain stable across different proxy settings. The balancing parameter λ p is used only to construct the perturbation-response proxy and is not intended to recover true post-change edge-level cargo flows. The two-stage WAPE reduction from RF-Core-Plain to RF-Core-TopoOnly and then to RF-Core-Proposed supports the validity of the topology-change and perturbation-response input design within the adopted partial-information boundary.

3.3. Performance Under Data-Scarce Node Conditions

Table 8 reports the Extension-task results for nodes with limited but nonzero local history. RF-LowData-Plain obtains a test WAPE of 29.43%, whereas RF-LowData-Similarity reduces it to 26.68%, corresponding to a 2.75 percentage-point improvement. This result indicates that similarity-assisted support provides useful transitional information for data-scarce nodes while preserving the same forecasting target, sample-construction logic, forecasting backbone, and chronological evaluation protocol.
This experiment should be interpreted as a controlled limited-history simulation, not as a naturally missing-data scenario or a complete cold-start setting. The selected data-scarce nodes retain limited but nonzero local observations, and the method still depends on meaningful reference nodes. Thus, the observed WAPE reduction supports similarity-assisted limited-history forecasting, but not complete cold-start forecasting.

3.4. Statistical Validation and Robustness Checks

The remaining analyses examine whether the gain from RF-Core-Plain to RF-Core-Proposed is systematic across nodes, whether it is related to perturbation-related node states, and whether it transfers to warning-oriented evaluation. These analyses complement aggregate WAPE and assess whether the proposed representations provide robust computational value across the network.

3.4.1. Node-Level Statistical Significance Test

Table 9 provides the paired node-level significance tests and gain coverage for the Core and Extension tasks. In the Core task, mean node-level WAPE decreases from 22.56% to 21.58%, corresponding to a mean gain of 0.98 percentage points; the improvement is statistically significant, with p = 2.17 × 10 5 , rank-biserial r r b = 0.6467 , and 44 of 57 nodes improved. In the Extension task, mean WAPE decreases from 29.55% to 26.72%, corresponding to a mean gain of 2.83 percentage points; the result is also significant, with p = 7.13 × 10 4 , r r b = 0.9346 , and 15 of 17 nodes improved.
Figure 6 illustrates the distribution of node-level WAPE improvements in the Core task. Most nodes obtain positive WAPE reductions, indicating that the enhanced representation benefits a broad set of nodes rather than only a small number of outliers.

3.4.2. Mechanism-Related Heterogeneity of Node-Level Gains

Table 10 reports the Core-task forecasting results by perturbation severity group. The Low group contains 36 nodes, for which mean WAPE improves by 0.55 percentage points, from 23.77% to 23.22%, with a median gain of 0.92 points. The Medium–High group contains 21 nodes, for which mean WAPE improves by 1.70 percentage points, from 20.47% to 18.77%, with a median gain of 1.37 points. Figure 7a further suggests that nodes with higher rerouting pressure generally obtain larger WAPE improvements, although the gains remain heterogeneous across nodes. This supports the interpretation that perturbation-response inputs are especially useful for nodes that are more likely to bear local rerouting pressure.
A supplementary grouping by upstream dependency provides a complementary view. As shown in Figure 7b, nodes with lower upstream dependency tend to obtain larger but more dispersed forecasting gains, whereas nodes with higher upstream dependency show smaller and more concentrated improvements. This pattern suggests that nodes heavily dependent on upstream inputs may be more directly affected by external-flow fluctuations and therefore face greater intrinsic forecasting difficulty.
At the individual-node level, Figure 7c examines the relationship between node-level WAPE improvement and the rerouting pressure index. The scatterplot shows only a weak association rather than a proportional monotonic relationship, indicating that stronger perturbation pressure does not automatically imply proportionally larger forecasting gain. Taken together, Figure 7 suggests that the Core-task improvement is related to both perturbation pressure and structural dependency, but the benefit of perturbation-aware representations remains heterogeneous across nodes.

3.4.3. Warning-Oriented Evaluation of Predictive Improvement

The warning-oriented evaluation examines whether forecasting improvements translate into better high-load identification. As shown in Table 11, under the node-specific 90th percentile threshold, the Core setting reduces miss rate from 0.596 to 0.535, increases recall from 0.404 to 0.465, and decreases continuous weighted loss from 155.92 to 148.31 after adding the proposed inputs. However, the false-warning ratio increases from 0.092 to 0.143, indicating a trade-off between fewer missed high-load events and more false warnings. In the Extension setting, similarity support reduces the miss rate from 0.063 to 0.052, increases recall from 0.937 to 0.948, and decreases continuous weighted loss from 228.55 to 206.08, while the false-warning ratio remains almost unchanged. Overall, the warning results indicate improved high-load coverage rather than uniform improvement across all warning metrics.
These indicators link forecasting accuracy to warning-oriented evaluation. Under the node-specific 90th percentile threshold, the enhanced settings reduce missed high-volume events in both the topology-perturbation and data-scarce-node scenarios. The warning metrics are used only as application-level evaluation indicators, not as training targets.
Figure 8 examines threshold sensitivity by varying the node-specific high-load threshold from the 80th to the 95th percentile of pre-test hourly cargo volume. Because miss rate and false-warning ratio are threshold-dependent, the analysis focuses on these two metrics. Across thresholds, the enhanced Core setting generally reduces miss rate but increases the false-warning ratio, indicating a recall-oriented warning trade-off. In the data-scarce Extension setting, similarity support yields lower or comparable miss rates at most thresholds, while the false-warning ratio remains close to that of the Plain setting. Continuous weighted loss is not treated as a threshold-sensitivity metric because it is a continuous asymmetric prediction-loss measure.

3.4.4. Sensitivity to Low-Data Retention Ratio

Figure 9 reports the sensitivity of test-set WAPE to different retained training ratios under the data-scarce-node setting. RF-LowData-Similarity consistently outperforms RF-LowData-Plain at the 10%, 20%, and 30% retention ratios. Specifically, test WAPE decreases from 29.07% to 26.94% at the 10% retention ratio, from 29.43% to 26.68% at the 20% retention ratio, and from 29.63% to 27.58% at the 30% retention ratio.
These results show that similarity-assisted support is not restricted to a single retained-sample ratio. The consistent improvement across 10%, 20%, and 30% retention settings supports the robustness of the similarity-assisted extension for limited-history nodes.

4. Discussion

The findings suggest that node-level short-term cargo-volume forecasting in logistics sorting networks should be understood as a partial-information computational forecasting problem rather than as a competition in model complexity alone. Under stable-network conditions, temporal dynamics and periodic regularities dominate routine prediction. Under topology perturbation, robustness depends on whether planned route-topology changes can be converted into admissible node-level representations when continuous post-change dynamic edge weights are unavailable. Under data-scarce-node conditions, forecasting performance depends on whether transferable information from similar nodes can be incorporated without changing the forecasting target or evaluation protocol. Across the three tasks, the shared random forest backbone keeps the predictor class stable, allowing the empirical analysis to focus on the value of task-activated representations within a unified computational framework.

4.1. Methodological Interpretation of the Main Findings

First, the Base-task results show that routine 1-h-ahead forecasts are driven mainly by temporal dynamics and periodic patterns, whereas pre-change structural descriptors provide limited additional value under stable-network conditions. This result is consistent with the idea that, when topology does not change, recent node-level observations already contain much of the short-term predictive information.
Second, the Core-task results show that observable route-topology changes and perturbation-response proxies improve forecasting robustness under the adopted information boundary. The reduction in WAPE from 18.10% to 17.11% indicates that basic topology-change signals contain useful predictive information. The further reduction to 16.91% indicates that perturbation-response proxies provide additional value by representing local structural reconfiguration and rerouting pressure.
Third, the Extension-task results show that similarity-assisted support can improve forecasting for nodes with limited but nonzero local history. The reduction in WAPE from 29.43% to 26.68%, together with the improvement of 15 of 17 data-scarce nodes, indicates that similarity information can provide useful transitional support when local training samples are insufficient.

4.2. Relation to Existing Forecasting and Graph-Based Modeling Studies

The Base-task findings are consistent with logistics demand and cargo-volume forecasting studies showing that recent observations, lagged volumes, trends, and periodic patterns are strong predictors under stable operating conditions [22,23,24,25,26]. This study extends that line of work by formulating the target as node-level forecasting in a directed network, rather than as isolated single-node or aggregated demand prediction.
The Core-task findings clarify the relationship between the proposed framework and graph-based spatiotemporal forecasting. Graph neural networks and spatiotemporal graph models are valuable when dynamic graph states, updated edge weights, or node–edge observations are available [14,15,16,17,22,23,24,25,26]. Under the present partial-information boundary, however, continuous post-change edge weights and updated post-change graph states are unavailable before prediction. Therefore, the main computational challenge is not simply to adopt a more complex graph model, but to convert planned topology changes into admissible node-level inputs. The staged gain from topology-change signals to perturbation-response proxies supports the value of this representation strategy.
The Extension task connects the study with data-scarce, few-shot, and transfer-support forecasting research [30,31,32]. In the present framework, data-scarce nodes are not treated as a separate prediction problem with a different target. Instead, similarity support is introduced into the same supervised forecasting protocol, using topological structure, cargo-volume patterns, and periodic regularity to support nodes with limited but nonzero local history.

4.3. Methodological Implications of the Proposed Representations

The results provide four methodological implications. First, temporal representation dominates short-term node-level forecasting under stable-network conditions, while structural features mainly provide relational context when recent local observations and periodic patterns are already informative.
Second, topology-change encoding is useful when post-change dynamic edge weights are unavailable. Because planned route adjustments may be known before realized edge-level transfers, added and removed connections can be encoded as admissible pre-forecast topology inputs. The improvement from RF-Core-Plain to RF-Core-TopoOnly supports the incremental value of these signals.
Third, perturbation-response proxies provide an admissible way to represent local structural reconfiguration and rerouting pressure. They should not be interpreted as reconstructed edge-level flows but as proxy inputs derived from planned topology changes and pre-change network information.
Fourth, similarity-assisted support is useful for data-scarce nodes with limited but nonzero history when meaningful reference nodes can be identified. It should be interpreted as a controlled limited-history extension, not as a complete cold-start solution.
Taken together, the proposed representations are not generic feature engineering. They are designed for a partial-information setting in which only historical node volumes, the pre-change network structure, and planned topology changes are available before prediction. Within this boundary, topology-change encoding and perturbation-response proxies convert planned edge-level route adjustments into admissible node-level forecasting inputs.
The core methodological contribution of this study is therefore not the use of random forest itself. Random forest is used as a controlled forecasting backbone so that the incremental value of the proposed information representations can be evaluated under the same forecasting target, sample-construction logic, chronological split, and evaluation protocol. The main contribution lies in transforming planned topology perturbations into interpretable and information-admissible node-level predictive variables under partial information. In particular, the perturbation-response proxies represent neighborhood reconfiguration, damage to the pre-change connectivity basis, and potential rerouting pressure without reconstructing true post-change edge-level cargo flows.
The empirical gains support this methodological contribution, but they should be interpreted cautiously. In the Core task, topology-change encoding reduces test WAPE from 18.10% to 17.11%, and perturbation-response proxies further reduce it to 16.91%. Node-level paired statistical tests indicate that the improvement is statistically significant, and 44 of 57 nodes are improved. However, these gains remain incremental rather than large-scale. They should not be interpreted as evidence that the proposed RF-based framework universally dominates all forecasting alternatives, especially under full-information dynamic-graph settings where continuous post-change edge weights and updated graph states are available. Table 12 summarizes these scenario-specific interpretations by linking the stable-network, route-perturbation, and data-scarce-node evidence to the corresponding computational and methodological implications.

4.4. Practical Implications and System Integration

The proposed framework translates partial pre-forecast information into three operationally usable outputs: 1-h-ahead node-level cargo-volume forecasts, high-load warning flags, and perturbation-sensitive node lists. These outputs are intended to support short-term operational preparation rather than replace routing or dispatching decisions.
The operational use of these outputs depends on the forecasting scenario. Under stable-network conditions, node-level forecasts support routine monitoring, staffing adjustment, equipment allocation, vehicle organization, and buffer-area preparation. Under planned route perturbations, topology-change encoding and perturbation-response proxies help identify nodes exposed to local reconfiguration or potential rerouting pressure, thereby supporting priority monitoring, temporary staffing, vehicle rescheduling, and capacity-buffer preparation. For data-scarce nodes, similarity-assisted forecasting provides conservative transitional support for newly activated, recently reconfigured, or peripheral nodes with limited but nonzero local history.
The operational meaning of the Core-task WAPE reduction should be interpreted cautiously. The decrease from 18.10% to 17.11% and then to 16.91% is moderate in absolute magnitude and should not be described as a large-scale improvement or evidence of universal superiority. Its value lies in the strict partial-information boundary: forecasts are generated before realized post-change edge flows, continuous dynamic edge weights, and updated graph states become available.
To provide a quantitative indication of deployment cost, runtime was measured using MATLAB 9.10.0.1602886 (R2021a) on a PCWIN64 workstation with a 13th Gen Intel(R) Core(TM) i9-13900HX CPU, 16 GB RAM, and 24 CPU cores detected by MATLAB. Although an NVIDIA GeForce RTX 4060 Laptop GPU was available on the workstation, no GPU acceleration was used in this benchmark. For the final RF-Core-Proposed model with 700 trees, a minimum leaf size of 10, and 386 predictors sampled at each split, final model training required 474.87 ± 15.88 s over three repeated benchmark runs. RF inference for the complete test set of 5472 node-hour samples required 9.52 ± 0.25 s, corresponding to approximately 0.0992 ± 0.0026 s per hourly forecast origin for all 57 sorting centers. This benchmark covers model fitting and RF inference after the required feature tables have been prepared. It does not include raw data I/O, database access, dashboard communication, or full recomputation of structural descriptors from raw operational systems. Online inference assumes cached structural and topology descriptors. When planned route-adjustment records are updated, topology-change encodings and perturbation-response proxies should be recomputed before the next forecasting cycle. These results indicate that the proposed framework has a lightweight online computational profile in the present 57-node setting, although runtime and scalability should be re-evaluated before deployment in larger logistics networks.
The high-load warning results provide a more direct operational interpretation than WAPE alone. In the Core setting, the miss rate decreases from 0.596 to 0.535 and recall increases from 0.404 to 0.465, indicating fewer missed high-load periods. Continuous weighted loss also decreases from 155.92 to 148.31, suggesting lower under-prediction-oriented error. Because this benefit is accompanied by a higher false-warning ratio, the result should be interpreted as improved high-load coverage rather than uniform improvement across all warning metrics. In sorting operations, reducing missed warnings may support more timely staffing, vehicle dispatch, equipment activation, and buffer-area preparation, although the additional false-warning burden should be considered in deployment.
A real-world deployment can follow an hourly rolling forecasting workflow. At each forecast origin, the system updates recent node-level cargo volumes and reads cached pre-change network descriptors. When planned route-topology adjustments are available, the topology-change matrix and perturbation-response proxies are updated. For limited-history nodes, similarity-support features are computed from admissible historical information or retrieved from a precomputed reference-node table. The deployed predictor then outputs 1-h-ahead node-level forecasts, high-load warning flags, and perturbation-sensitive node lists, which can be connected to staffing systems, vehicle-dispatch modules, equipment-activation rules, buffer-area planning tools, and monitoring dashboards. Warning thresholds and false-warning tolerance should be selected according to the local cost balance between missed high-load periods and unnecessary resource activation.
Most computational burden is offline or periodic. The trained forecasting model does not need to be retrained at every hourly forecast origin. Routine retraining can be scheduled daily or weekly, depending on local data-drift monitoring, operational policy, and computational resources. By contrast, topology-change encodings and perturbation-response proxies should be recomputed whenever planned route-adjustment records are updated. Thus, volume-feature refreshing follows the hourly forecasting cycle, model retraining follows a periodic schedule, and topology-related feature recomputation is triggered by route-topology update events.
Several operational constraints remain. The framework requires reliable node identifiers, timely hourly cargo-volume data, planned topology-change records, and stable data interfaces among forecasting, routing, and monitoring systems. It is less suitable for unannounced disruptions when no topology-change information is available before prediction. It is also not a complete cold-start solution, because similarity-assisted forecasting still assumes limited but nonzero local history. In full-information settings where dynamic edge weights and updated graph states are available, dynamic GNN or STGNN models should be evaluated under that richer information boundary. For policymakers and platform managers, the findings highlight the importance of standardized data sharing, route-adjustment logging, and early-warning protocols in logistics sorting networks.

4.5. Generalizability, Boundary, and Failure Conditions

The proposed framework is applicable when historical node-level volumes, the pre-change network structure, and planned or observable route-topology changes are available before forecasting, but continuous post-change dynamic edge weights and regularly updated graph states are not. Under this boundary, the framework provides admissible node-level forecasting inputs rather than attempting to reconstruct the full post-change edge-level cargo-flow process.
The framework may also be relevant to other networked forecasting systems, such as transportation networks, supply-chain networks, service networks, and infrastructure networks, where planned topology changes are known before realized dynamic edge weights are observed. However, this generalizability should be treated as a hypothesis requiring external validation. The present evidence is based on one logistics sorting network and should not be interpreted as direct proof of performance in other networked systems.
The framework is less suitable in several conditions. First, when disruptions occur unexpectedly and no advance topology-change information is available, perturbation-response proxy features may not capture the relevant change. Second, in full-information settings where continuous post-change dynamic edge weights are observable, dynamic graph or spatiotemporal graph models should be compared under that richer information setting rather than excluded. Third, the framework is designed for short-term node-level forecasting, not for long-term structural evolution forecasting. Fourth, if topology changes do not plausibly affect node-level load redistribution, perturbation-response proxy features may provide limited additional value. Fifth, for data-scarce nodes with almost no usable history or no meaningful reference nodes, similarity support may be unstable.
The gains should also be interpreted as conditional rather than uniform. In the Core task, nodes under stronger perturbation pressure obtain larger improvements: the low-perturbation group has a mean improvement of 0.55 percentage points, whereas the medium–high-perturbation group has a mean improvement of 1.70 percentage points. This does not imply a proportional linear relationship between rerouting pressure and forecasting gain. It indicates that perturbation-response proxy features are more useful when topology changes create stronger local load-bearing pressure. In the Extension task, similarity support improves 15 of 17 data-scarce nodes and remains beneficial under 10%, 20%, and 30% retention ratios, with WAPE changing from 29.07% to 26.94%, from 29.43% to 26.68%, and from 29.63% to 27.58%, respectively. Nevertheless, gain magnitudes vary across nodes, so the reported improvements should be interpreted as conditional gains under the defined partial-information setting rather than as universal improvements across all forecasting environments.

4.6. Limitations and Future Research

This study has several limitations. First, the empirical validation is based on a single anonymized logistics sorting network. Although the dataset contains all 57 sorting centers available under the adopted benchmark structure, it should not be interpreted as a statistically representative sample of logistics sorting networks in general. Future research should validate the framework across multiple logistics networks, firms, regions, and operating periods.
Second, the perturbation-response variables are proxy inputs constructed when post-change dynamic edge weights are unavailable. They represent neighborhood reconfiguration, damage to the pre-change connectivity basis, and potential rerouting pressure, but they do not recover true edge-level cargo-flow propagation. Future work should compare these proxies with realized post-change flow data when such data become available.
Third, full-information dynamic graph models, including dynamic GNN and STGNN variants, are excluded from the main comparison because they require post-change edge weights, evolving node–edge states, or updated graph structures that are unavailable before prediction in the present setting. This exclusion reflects the information-admissibility boundary and should not be interpreted as a claim that such models are inherently inferior. Future research should compare the proposed framework with dynamic graph and spatiotemporal graph models under full-information settings.
Fourth, the data-scarce-node extension is a controlled limited-history simulation, not a naturally missing-data scenario or a complete cold-start method. Each data-scarce node retains limited but nonzero local history, and the similarity-support module still depends on meaningful reference-node selection. Future work should examine true cold-start settings, adaptive reference-node selection, and robustness when reference nodes are weakly matched or operationally unstable.

5. Conclusions

Before prediction, the available information includes historical node volumes, the pre-change network structure, and planned route-topology changes, whereas realized post-change edge flows, continuous post-change dynamic edge weights, and updated graph states remain unavailable. This setting differs from full-information dynamic-graph forecasting, where evolving node–edge states or updated graph structures can be observed and directly incorporated into model construction. The central computational problem addressed in this study is therefore how to convert limited pre-forecast information and planned topology changes into admissible node-level predictors for short-term forecasting.
The empirical results demonstrate that different information modules contribute differently across the three forecasting settings. Under stable-network conditions, temporal dynamics and periodic patterns dominate node-level forecasting, while structural descriptors provide only limited additional benefit. Under route perturbation, topology-change signals reduce test WAPE from 18.10% to 17.11%, and perturbation-response proxies further reduce it to 16.91%, with improvements observed for 44 of the 57 nodes. Under the data-scarce-node setting, similarity support reduces test WAPE from 29.43% to 26.68%, with 15 of the 17 data-scarce nodes improved. The gains remain consistent under 10%, 20%, and 30% training-sample retention settings, where WAPE decreases from 29.07% to 26.94%, from 29.43% to 26.68%, and from 29.63% to 27.58%, respectively. These improvements are incremental rather than large-scale, but node-level statistical tests indicate significant gains, and the reductions are practically meaningful under the defined partial-information boundary.
The warning-oriented results further indicate improved high-load coverage. The enhanced settings reduce missed high-load events and increase recall, while the continuous weighted loss decreases, suggesting reduced under-prediction-oriented forecasting error. However, in the Core task, the improvement in recall is accompanied by a higher false-warning ratio, revealing a trade-off between missed-event reduction and false-warning control. Therefore, the warning results should be interpreted as recall-oriented improvement in high-load identification, rather than as uniform improvement across all operational warning metrics.
The methodological novelty of this study does not lie in the random forest algorithm itself. Random forest is used as a controlled forecasting backbone to evaluate the incremental value of the proposed computational representations under the same target, sample-construction logic, and evaluation protocol. The core contribution is the transformation of planned topology perturbations into interpretable and information-admissible node-level predictive variables. Specifically, the perturbation-response proxies represent neighborhood reconfiguration, damage to the pre-change connectivity basis, and rerouting pressure before realized post-change edge-level information becomes available. These proxies do not reconstruct true post-change edge-level flows. The similarity-support module further extends the same forecasting protocol to limited-history nodes without changing the forecasting target, supervised-sample construction rule, or evaluation protocol.
From a practical and computational perspective, the proposed framework provides an information-admissible decision-support layer for hourly node monitoring, high-load warning, short-term staffing, vehicle coordination, equipment allocation, and buffer-area preparation. Its value is particularly relevant before realized post-change dynamic edge weights become available, when logistics managers must still prepare for possible short-term node-level load changes. The framework is not intended to replace richer full-information dynamic graph models when continuous post-change edge weights, updated graph states, and realized flow observations are available. In deployment, warning thresholds and false-warning tolerance should be selected according to the local cost balance between missed high-load periods and unnecessary resource activation.
Several limitations remain. First, the empirical validation is based on a single anonymized logistics sorting network; future work should evaluate the framework across multiple networks, firms, and operating periods. Second, the perturbation-response proxies are not true edge-flow recovery variables; future studies should compare them with realized post-change flow data when such data become available. Third, dynamic GNN and STGNN models are excluded only because their required information is unavailable under the present boundary; future work should compare them with the proposed framework under full-information settings. Fourth, the data-scarce-node experiment is a controlled limited-history simulation, not a naturally missing-data scenario or a complete cold-start method; future research should examine true cold-start settings and adaptive reference-node selection. Future work should also investigate online updating, integration with real-time monitoring systems, and forecast-informed operational optimization.

Author Contributions

Conceptualization, W.L. and Z.Z.; methodology, W.L. and Z.Z.; software, W.L.; validation, Y.W., G.L. and S.Z.; formal analysis, W.L. and Y.W.; investigation, W.L., Y.W., G.L. and S.Z.; data curation, Y.W., G.L. and S.Z.; writing—original draft preparation, W.L.; writing—review and editing, Y.W. and Z.Z.; visualization, S.Z.; supervision, Z.Z.; project administration, Z.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The raw data used in this study were released for the 14th MathorCup Mathematical Application Challenge in 2024, Problem C: Logistics Network Sorting Center Cargo-Volume Forecasting and Personnel Scheduling. The official competition problem statement and data-access information are available from the MathorCup release page: https://www.mathorcup.org/detail/2438 (accessed on 10 June 2026). Access to and redistribution of the original competition data are subject to the rules of the competition organizer or data provider. The derived node-mapping files, directed network matrices, supervised learning samples, parameter-configuration files, and code used in this study are available from the corresponding author upon reasonable request.

Acknowledgments

The authors would like to sincerely thank the editor and reviewers for their helpful comments.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
ARIMAAutoregressive Integrated Moving Average
BiLSTMBidirectional Long Short-Term Memory
GNNGraph Neural Network
LSTMLong Short-Term Memory
MAEMean Absolute Error
MLMachine Learning
MAPEMean Absolute Percentage Error
RFRandom Forest
RMSERoot Mean Square Error
SVRSupport Vector Regression
TE-GR-MLPTemporal–Graph-Structure Reference Model (Temporal Encoder + Graph Residual + MLP)
WAPEWeighted Absolute Percentage Error

Appendix A. Reproducibility Details

Appendix A.1. Core-Task Feature Screening and Parameter Selection

In the Core task, feature screening and parameter tuning were conducted under a fixed random forest backbone. All temporal features were retained, while structural and perturbation-related candidates were screened using validation-set WAPE. Candidate perturbation-related subsets were then compared, and the final retained subset was tuned by multi-seed random forest search. Mean validation WAPE was the primary selection criterion, and the test set was used only once after all screening and tuning decisions had been fixed.

Appendix A.2. Supplementary Results for the Core Task

After validation-only feature screening, the final RF-Core-Proposed model was tuned using multi-seed random forest search. The final configuration was numTrees = 700, minLeaf = 10, and mtry = 386. Across five random seeds (2026–2030), the model achieved a mean validation WAPE of 16.19% and a mean test WAPE of 16.91%, with standard deviations of 0.018 and 0.021 percentage points, respectively. These results indicate that the Core-task findings are stable within the predefined validation-only search procedure. Table A1 reports the search spaces and final settings of the main comparison models.
Table A1. Hyperparameter settings and configurations of the applied models.
Table A1. Hyperparameter settings and configurations of the applied models.
ModelConfigurationFinal SettingCriterion
ARIMANode-wise ARIMA baselineARIMA(1,0,1); fallback to ARIMA(1,0,0) or ARIMA(0,0,0) when estimation failedFixed baseline
SVRFlattened temporal featuresGaussian kernel; C = 1; epsilon = 0.1; kernel scale = autoValidation WAPE
RF-BaseTime + structure featuresNumTrees = 200; MinLeaf = 5; mtry = allValidation WAPE
LSTMSequential temporal features hidden   =   64 ;   dropout   =   0.1 ;   FC   =   32 ;   lr   =   1 × 10 3 ; batch = 128; epochs = 30Validation WAPE
BiLSTMSequential temporal features hidden   =   64 ;   dropout   =   0.1 ;   FC   =   32 ;   lr   =   1 × 10 3 ; batch = 128; epochs = 30Validation WAPE
GMF-BiLSTM (Base)Temporal + structural early fusion BiLSTM   hidden   =   64 ;   FC   =   64 / 32 ;   dropout   =   0.1 ;   lr   =   1 × 10 3 ; batch = 128; epochs = 40Validation WAPE
Fusion-BiLSTM (Base)Time + structure fusion hidden   =   128 ;   FC   =   64 ;   dropout   =   0.2 ;   lr   =   1 × 10 3 ; batch = 128; epochs = 80Validation WAPE
Fusion-BiLSTM (Core)Time + structure fusion + perturbation fusion hidden   =   128 ;   FC   =   64 ;   dropout   =   0.2 ;   lr   =   1 × 10 3 ; batch = 128; epochs = 80Validation WAPE
RF-Core-PlainTime + pre-change structural featuresNumTrees = 200; MaxDepth = 15; MinLeaf = 5Validation WAPE
RF-Core-TopoOnlyTime + structure + basic topology-change indicatorsNumTrees = 200; MaxDepth = 15; MinLeaf = 5Validation WAPE
RF-Core-ProposedTime + selected structure + perturbation-response proxy featuresNumTrees = 700; MinLeaf = 10; mtry = 386Mean validation WAPE across seeds
RF-LowData-PlainLow-data time + structure + perturbation-related featuresNumTrees = 300; MinLeaf = 5; mtry = sqrt(P)Validation WAPE
RF-LowData-SimilarityLow-data features + selected similarity featuresNumTrees = 180 or 300; MinLeaf = 3 or 5; mtry selected from sqrt(P) or P/3Validation WAPE
TE-GR-MLPSnapshot graph residual model hidden _ dim   =   32 ;   lr   =   1 × 10 4 ; dropout = 0Validation WAPE
Note: The task-level dimensions reported in Section 2.8.4 refer to compact feature-channel counts after task-specific organization. In contrast, mtry is a random forest hyperparameter defined on the expanded predictor matrix after temporal unfolding and validation-only feature screening. Thus, the RF-Core-Proposed value mtry = 386 is not directly comparable with the compact Core-task dimension of 52.
Table A2. Illustrative supervised node-hour sample structure and anonymized compact example.
Table A2. Illustrative supervised node-hour sample structure and anonymized compact example.
Feature GroupVariable ExamplesExample ValueDescription
Node-time indexnode_id, timestampnode 17, 27 November 2023, 08:00Identifies node i  and forecast origin time  t .
Forecasting target y i , t + 1 cargo volume at 27 November 2023, 09:001-h-ahead node-level target; not used as input.
Temporal historylag_1h, lag_2h, lag_24h, moving_avg_3hhistorical volumes available up to forecast origin tCaptures recent variation, daily periodicity, and short-term trend.
Calendar and missingnesshour_of_day, missing_flag, imputation_flag8, 0, 0Records time context and observation/imputation status.
Structural featuresin_degree, out_degree, weighted_in_degree, weighted_out_degreegraph-derived node descriptorsDescribes the node’s position in the pre-change directed network.
Topology-change featuresadded_edges, removed_edges, change_flag1, 2, 1Encodes planned local route-topology changes.
Perturbation proxy features C i , I i , P i neighborhood change, intensity, pressure indexRepresents local reconfiguration and potential rerouting pressure under partial information.
Similarity-support featuresreference_node_id, topological_similarity, pattern_similarity, periodic_similarity, composite_similaritynode 42, 0.81, 0.76, 0.69, 0.75Provides auxiliary support for data-scarce-node forecasting.
Source: Authors’ construction based on the anonymized benchmark data and the supervised sample-generation procedure described in Section 2.2. Note: This table shows the structure of a supervised node-hour sample rather than a complete raw data record. Each sample is indexed by node i and forecast origin time t , and the target is the node-level cargo volume at t + 1 . All input variables are constructed using information available before the forecasted period. Target-period cargo volumes, realized post-change edge flows, and updated post-change dynamic graph states are not used in feature construction, thereby preventing information leakage. Similarity-support features are activated only in the data-scarce-node extension task.

Appendix B. Sensitivity Analyses

Appendix B reports sensitivity analyses for the perturbation-intensity balancing parameter λ p and the similarity-combination weights λ 1 , λ 2 , and λ 3 . Table A3 reports the λ p results, and Table A4 reports the similarity-weight results. These one-factor robustness checks use fixed auxiliary settings to examine whether the main conclusions depend on a single parameter or weight configuration. They are not intended to reproduce the fully tuned results in Table 6 and Table 8.

Appendix B.1. Sensitivity Analysis of the Perturbation-Intensity Balancing Parameter

To examine whether the perturbation-intensity proxy depends on a single balancing-parameter setting, λ p is varied over representative values while keeping the same feature-construction rule, chronological split, and random-forest backbone. Table A3 reports the validation and test WAPE results.
The balancing parameter λ p is used to control the strength of the topology-based proxy compensation term in the perturbation-intensity formulation. It is not a causal parameter, not a physical recovery coefficient, and not an estimate of realized post-change cargo redistribution.
Table A3. Sensitivity analysis of the perturbation-intensity balancing parameter λ p .
Table A3. Sensitivity analysis of the perturbation-intensity balancing parameter λ p .
λ p Val WAPE (%)Test WAPE (%)
0.3016.30216.958
0.5016.25316.993
0.7016.31617.044
0.9016.35516.935
The test WAPE values in Table A3 remain close to the final RF-Core-Proposed result reported in Table 6, but they are not expected to be numerically identical because the sensitivity analysis fixes auxiliary settings while varying only λ p .

Appendix B.2. Sensitivity Analysis of the Similarity Weights

Similarity-weight sensitivity is also examined under representative weight settings. The three weights correspond to topological similarity, cargo-volume pattern similarity, and periodic similarity, respectively. Across representative settings, Extension-task test WAPE ranges from 26.859% to 27.088%, indicating that the conclusion on similarity support is not driven by a single weight configuration.
Table A4. Sensitivity analysis settings and results.
Table A4. Sensitivity analysis settings and results.
Similarity Weights λ 1 / λ 2 / λ 3 Val WAPE (%)Test WAPE (%)
0.33/0.33/0.3319.86327.088
0.50/0.25/0.2519.90326.859
0.25/0.50/0.2519.76426.923
0.25/0.25/0.5019.75526.865
These results show that the similarity-assisted extension remains stable across representative similarity-weight configurations. The setting λ 1 = 0.25 , λ 2 = 0.25 and λ 3 = 0.50 is the validation-selected combination used in the main Extension-task configuration. Its test WAPE in the sensitivity analysis may differ slightly from the final RF-LowData-Similarity result reported in Table 8 because Table A4 is a fixed-setting robustness check, whereas Table 8 reports the final Extension-task comparison under the complete selected configuration. Therefore, the improvement observed in the data-scarce-node setting is not dependent on a single arbitrary similarity-weight combination.

References

  1. Ezaki, T.; Imura, N.; Nishinari, K. Towards understanding network topology and robustness of logistics systems. Commun. Transp. Res. 2022, 2, 100064. [Google Scholar] [CrossRef]
  2. Chen, D.; Sun, D.; Yin, Y.; Dhamotharan, L.; Kumar, A.; Guo, Y. The resilience of logistics network against node failures. Int. J. Prod. Econ. 2022, 244, 108373. [Google Scholar] [CrossRef]
  3. Wide, P. Improving decision support for operational disruption management in freight transport. Res. Transp. Bus. Manag. 2020, 37, 100540. [Google Scholar] [CrossRef]
  4. Mardešić, N.; Erdelić, T.; Carić, T.; Đurasević, M. Review of stochastic dynamic vehicle routing in the evolving urban logistics environment. Mathematics 2024, 12, 28. [Google Scholar] [CrossRef]
  5. Feng, J.; Hu, X.; Xu, L.; Luo, S.; Chen, J. Reliable logistics network design joint optimization problem applying demand sensitivity to correlated disruptions. Comput. Ind. Eng. 2025, 209, 111472. [Google Scholar] [CrossRef]
  6. L’Hermitte, C.; Trent, N.M.; Qin, Y.; Friday, D. Capturing the full complexity of postdisaster freight disruptions: An application of the theory of constraints. Transp. Res. Rec. 2025, 2679, 882–898. [Google Scholar] [CrossRef]
  7. Lu, Z.; Zhao, Z.; Huang, G.Q. STAR: Spatial–Temporal Attention Reasoning model for dynamic logistics network routing in Cyber–Physical Internet. Adv. Eng. Inform. 2026, 69, 103830. [Google Scholar] [CrossRef]
  8. Yang, Y.; Wei, Q.; Huo, B. Robust optimization of pharmaceutical emergency logistics network under dynamic demand. Ind. Manag. Data Syst. 2026; ahead of print. [CrossRef]
  9. Habibi, F.; Chakrabortty, R.K.; Abbasi, A. Evaluating supply chain network resilience considering disruption propagation. Comput. Ind. Eng. 2023, 183, 109531. [Google Scholar] [CrossRef]
  10. Vidza, M.-S.; Budka, M.; Chai, W.K.; Thrush, M.; Teixeira Alves, M. Supply network disruption: A framework for assessing vulnerability and implementing resilience strategies. Inf. Sci. 2025, 717, 122336. [Google Scholar] [CrossRef]
  11. Li, Y.; Yu, R.; Shahabi, C.; Liu, Y. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. arXiv 2018, arXiv:1707.01926. [Google Scholar] [CrossRef]
  12. Yu, B.; Yin, H.; Zhu, Z. Spatio-Temporal Graph Convolutional Networks: A Deep Learning Framework for Traffic Forecasting. In Proceedings of the Twenty-Seventh International Joint Conference on Artificial Intelligence (IJCAI-18); IJCAI: San Francisco, CA, USA, 2018; pp. 3634–3640. [Google Scholar] [CrossRef]
  13. Wu, Z.; Pan, S.; Long, G.; Jiang, J.; Zhang, C. Graph WaveNet for Deep Spatial-Temporal Graph Modeling. In Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence (IJCAI-19); IJCAI: San Francisco, CA, USA, 2019; pp. 1907–1913. [Google Scholar] [CrossRef]
  14. Huang, F.; Yi, P.; Wang, J.; Li, M.; Peng, J.; Xiong, X. A dynamical spatial-temporal graph neural network for traffic demand prediction. Inf. Sci. 2022, 594, 286–304. [Google Scholar] [CrossRef]
  15. Wu, J.-L.; Chung, W.-Y.; Wu, Y.-H.; Ho, Y.-N. Dynamic traffic network representation model for improving the prediction performance of passenger flow for mass rapid transit. Knowl. Based Syst. 2024, 304, 112442. [Google Scholar] [CrossRef]
  16. Xu, K.; Pan, B.; Zhang, M.X.; Zhang, X.; Hou, X.Y.; Yu, J.X.; Lu, Z.Z.; Zeng, X.; Jia, Q.Q. Non-end-to-end adaptive graph learning for multi-scale temporal traffic flow prediction. PLoS ONE 2025, 20, e0322145. [Google Scholar] [CrossRef]
  17. Liu, Y.; Feng, T.; Rasouli, S.; Wong, M. ST-DAGCN: A spatiotemporal dual adaptive graph convolutional network model for traffic prediction. Neurocomputing 2024, 601, 128175. [Google Scholar] [CrossRef]
  18. Alqatawna, A.; Abu-Salih, B.; Obeid, N.; Almiani, M. Incorporating time-series forecasting techniques to predict logistics companies’ staffing needs and order volume. Computation 2023, 11, 141. [Google Scholar] [CrossRef]
  19. Al Hajj Hassan, L.; Mahmassani, H.S.; Chen, Y. Reinforcement learning framework for freight demand forecasting to support operational planning decisions. Transp. Res. Part E Logist. Transp. Rev. 2020, 137, 101926. [Google Scholar] [CrossRef]
  20. Sonnleitner, B.; Kourentzes, N.; Ehrig, C.; Pflaum, A. Forecasting for optimization in road freight transport: A review. Transp. Res. Part E Logist. Transp. Rev. 2025, 204, 104378. [Google Scholar] [CrossRef]
  21. Tseremoglou, I.; Bombelli, A.; Santos, B.F. A combined forecasting and packing model for air cargo loading: A risk-averse framework. Transp. Res. Part E Logist. Transp. Rev. 2022, 158, 102579. [Google Scholar] [CrossRef]
  22. Huang, L.; Xie, G.; Zhao, W.; Gu, Y.; Huang, Y. Regional logistics demand forecasting: A BP neural network approach. Complex Intell. Syst. 2023, 9, 2297–2312. [Google Scholar] [CrossRef]
  23. Salais-Fierro, T.E.; Saucedo-Martínez, J.A. Demand forecasting for freight transport applying machine learning into the logistic distribution. Mob. Netw. Appl. 2022, 27, 2172–2181. [Google Scholar] [CrossRef]
  24. Huang, W.; Yin, Y.; Li, H.; Xie, A.; Fan, Y. High-speed railway express delivery volume forecast based on data-driven ensemble forecast approaches: The China case. Expert Syst. Appl. 2024, 258, 125196. [Google Scholar] [CrossRef]
  25. Ma, F.; Wang, S.; Xie, T.; Sun, C. Regional logistics express demand forecasting based on improved GA-BP neural network with indicator data characteristics. Appl. Sci. 2024, 14, 6766. [Google Scholar] [CrossRef]
  26. Mamede, F.P.; da Silva, R.F.; de Brito Junior, I.; Yoshizaki, H.T.Y.; Hino, C.M.; Cugnasca, C.E. Deep learning and statistical models for forecasting transportation demand: A case study of multiple distribution centers. Logistics 2023, 7, 86. [Google Scholar] [CrossRef]
  27. Li, Y.; Wei, Z. Regional Logistics Demand Prediction: A Long Short-Term Memory Network Method. Sustainability 2022, 14, 13478. [Google Scholar] [CrossRef]
  28. Douaioui, K.; Oucheikh, R.; Benmoussa, O.; Mabrouki, C. Machine Learning and Deep Learning Models for Demand Forecasting in Supply Chain Management: A Critical Review. Appl. Syst. Innov. 2024, 7, 93. [Google Scholar] [CrossRef]
  29. Habibi, F.; Chakrabortty, R.K.; Abbasi, A.; Ho, W. Investigating disruption propagation and resilience of supply chain networks: Interplay of tiers and connections. Int. J. Prod. Res. 2025, 63, 6229–6251. [Google Scholar] [CrossRef]
  30. Lyu, H.; Wang, C. ST-CML: A contrastive meta learning framework for spatio-temporal graph few-shot learning with cross-city transfer. Neurocomputing 2025, 656, 131483. [Google Scholar] [CrossRef]
  31. Sun, H.; Li, P.; Li, Y. Traffic flow prediction in data-scarce regions: A transfer learning approach. Comput. Mater. Contin. 2025, 83, 4899–4914. [Google Scholar] [CrossRef]
  32. Jiang, Y.; Liu, M.; Li, Y.; Liu, Y.; Zhang, J.; Liu, Y.; Liu, C. Enhanced neighborhood node graph neural networks for load forecasting in smart grid. Int. J. Mach. Learn. Cybern. 2024, 15, 129–148. [Google Scholar] [CrossRef]
  33. Pan, S.J.; Yang, Q. A Survey on Transfer Learning. IEEE Trans. Knowl. Data Eng. 2010, 22, 1345–1359. [Google Scholar] [CrossRef]
  34. Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
  35. Li, T.; Wang, S.; Nong, T.; Liu, B.; Hu, F.; Chen, Y.; Han, Y. Bayesian Optimization of LSTM-Driven Cold Chain Warehouse Demand Forecasting Application and Optimization. Processes 2025, 13, 3085. [Google Scholar] [CrossRef]
Figure 1. Directed network coupling, topology perturbation, forecasting target, and computational modules in a logistics sorting network. Nodes A–G denote illustrative anonymized sorting centers. Gray arrows indicate regular directed route connections, and the red crossed link indicates a removed or disrupted route connection. Green and orange arrows indicate sequential links between the illustrated modules. The upward and downward “Volume” arrows denote potential node-level cargo-volume increase and decrease, respectively. Different node colors are used for visual distinction.
Figure 1. Directed network coupling, topology perturbation, forecasting target, and computational modules in a logistics sorting network. Nodes A–G denote illustrative anonymized sorting centers. Gray arrows indicate regular directed route connections, and the red crossed link indicates a removed or disrupted route connection. Green and orange arrows indicate sequential links between the illustrated modules. The upward and downward “Volume” arrows denote potential node-level cargo-volume increase and decrease, respectively. Different node colors are used for visual distinction.
Mca 31 00107 g001
Figure 2. Partial-information forecasting workflow and task-specific input organization.
Figure 2. Partial-information forecasting workflow and task-specific input organization.
Mca 31 00107 g002
Figure 3. Workflow for transforming raw records into task-specific datasets. The directed logistics network example illustrates that nodes denote sorting centers, arrows denote directed route connections and cargo-flow directions, the central dark node represents the target sorting center, and the surrounding nodes represent its neighboring centers. The dashed arrow in the topology timeline indicates the planned stage after the change point.
Figure 3. Workflow for transforming raw records into task-specific datasets. The directed logistics network example illustrates that nodes denote sorting centers, arrows denote directed route connections and cargo-flow directions, the central dark node represents the target sorting center, and the surrounding nodes represent its neighboring centers. The dashed arrow in the topology timeline indicates the planned stage after the change point.
Mca 31 00107 g003
Figure 4. Computational interpretation of node-level perturbation-response formation under planned route-topology changes.
Figure 4. Computational interpretation of node-level perturbation-response formation under planned route-topology changes.
Mca 31 00107 g004
Figure 5. Main predictive results across the three computational validation settings. (a) Test-set WAPE comparison under the Base task, corresponding to stable-network conditions; (b) test-set WAPE comparison under the Core task, corresponding to topology-perturbation conditions; (c) test-set WAPE comparison under the Extension task, corresponding to the data-scarce-node setting. Dark-blue bars highlight the main proposed or best-performing RF variants in each setting, while light-gray bars indicate baseline or comparison models.
Figure 5. Main predictive results across the three computational validation settings. (a) Test-set WAPE comparison under the Base task, corresponding to stable-network conditions; (b) test-set WAPE comparison under the Core task, corresponding to topology-perturbation conditions; (c) test-set WAPE comparison under the Extension task, corresponding to the data-scarce-node setting. Dark-blue bars highlight the main proposed or best-performing RF variants in each setting, while light-gray bars indicate baseline or comparison models.
Mca 31 00107 g005
Figure 6. Distribution of node-level WAPE improvements under the Core task.
Figure 6. Distribution of node-level WAPE improvements under the Core task.
Mca 31 00107 g006
Figure 7. Node-level improvement patterns under perturbation-related settings: (a) node-level WAPE improvement grouped by rerouting pressure; (b) node-level WAPE improvement grouped by upstream dependency; (c) relationship between node-level WAPE improvement and the rerouting pressure index. In panels (a,b), boxes indicate group distributions, colored points represent individual node-level observations in each group, and black diamonds indicate group mean values.
Figure 7. Node-level improvement patterns under perturbation-related settings: (a) node-level WAPE improvement grouped by rerouting pressure; (b) node-level WAPE improvement grouped by upstream dependency; (c) relationship between node-level WAPE improvement and the rerouting pressure index. In panels (a,b), boxes indicate group distributions, colored points represent individual node-level observations in each group, and black diamonds indicate group mean values.
Mca 31 00107 g007
Figure 8. Threshold sensitivity of high-load warning performance under different node-specific quantile thresholds: (a) Core-task miss rate; (b) Core-task false-warning ratio; (c) Extension-task miss rate; (d) Extension-task false-warning ratio. The thresholds are defined using pre-test raw hourly cargo-volume quantiles. The continuous weighted loss is evaluated as a threshold-independent asymmetric prediction-loss measure.
Figure 8. Threshold sensitivity of high-load warning performance under different node-specific quantile thresholds: (a) Core-task miss rate; (b) Core-task false-warning ratio; (c) Extension-task miss rate; (d) Extension-task false-warning ratio. The thresholds are defined using pre-test raw hourly cargo-volume quantiles. The continuous weighted loss is evaluated as a threshold-independent asymmetric prediction-loss measure.
Mca 31 00107 g008
Figure 9. Sensitivity of test-set WAPE to different retained training ratios under the data-scarce-node setting.
Figure 9. Sensitivity of test-set WAPE to different retained training ratios under the data-scarce-node setting.
Mca 31 00107 g009
Table 1. Computational gaps in node-level forecasting under partial information.
Table 1. Computational gaps in node-level forecasting under partial information.
StreamKey LimitationThis Study Addresses
Cargo-volume and demand forecastingOften treats nodes as independent time series.Formulates node forecasting as prediction on a directed network.
Graph-based and spatiotemporal forecastingOften requires stable graphs, dynamic edge weights, or updated graph states.Converts observable topology changes into admissible node-level inputs.
Topology perturbation, propagation, and resilienceExplains shock diffusion but rarely supports 1 h forecasting input design.Builds perturbation-response proxies for local load-bearing pressure.
Data-scarce and transfer-support forecastingOften treats data scarcity as a generic small-sample issue.Adds similarity-assisted support for data-scarce nodes under the same forecasting protocol.
Source: Authors’ synthesis based on the reviewed literature streams and the partial-information forecasting gap identified in Section 1.
Table 2. Data scope, preprocessing, and sample construction summary.
Table 2. Data scope, preprocessing, and sample construction summary.
ItemSummary
Data source and node scopeAll data components were derived from the same anonymized benchmark dataset. The 57 sorting centers constitute the complete unified node set, not an author-selected sample.
Observation windowsDaily records cover 1 August–30 November 2023, with 6954 records. Hourly records cover 1 November 00:00–30 November 2023 23:00, with 41,040 expected node-hour records.
Missing-value handlingThe hourly panel contained 7759 missing node-hour entries, corresponding to an 18.91% missing rate. Missing values were filled by local linear interpolation, node-specific same-hour means, and node-level overall means; no missing values remained after imputation.
Network topologyThe pre-change network contained 134 directed edges, and the planned post-change topology contained 122 directed edges. The topology adjustment included 120 retained, 14 removed, and 2 added edges, involving 23 affected nodes.
Chronological splitTraining: 1–20 November 2023, 27,360 rows; validation: 21–25 November 2023, 6840 rows; test: 26–30 November 2023, 6840 rows.
Supervised samples and information boundaryA 24 h within-split historical window produced 36,936 supervised samples: 25,992 training, 5472 validation, and 5472 test samples. Target-period volumes, realized post-change edge flows, and updated post-change graph states were excluded from feature construction.
Source: Authors’ calculation and organization based on the anonymized benchmark dataset and the preprocessing workflow described in Section 2.2. Note: The reduction from 41,040 node-hour records to 36,936 supervised samples results from excluding the first 24 h of each chronological split for within-split historical-window construction.
Table 3. Task design and activated input modules under the unified forecasting protocol.
Table 3. Task design and activated input modules under the unified forecasting protocol.
TaskScenarioShared DesignActivated ModulesTask-Specific Role
BaseStable network1 h node-level target; 24 h window; chronological split; RF backbone; MAE/RMSE/MAPE/WAPETime + structureTests routine forecasting drivers
CoreRoute perturbationSame as BaseTime + structure + perturbation responseTests topology-change and perturbation-proxy gains
ExtensionData-scarce nodesSame as BaseTime + structure + perturbation response + similarity supportTests transferable support for short-history nodes
Source: Authors’ design based on the unified partial-information forecasting protocol proposed in this study. Note: Perturbation-response inputs are proxy variables under partial information, not recovered post-change edge-level flows. The Extension task is a controlled limited-history simulation, not a naturally missing-data scenario or a complete cold-start setting.
Table 4. Model comparison boundary under partial information.
Table 4. Model comparison boundary under partial information.
Model GroupExamplesInput BasisStatus
Statistical/time-series baselinesSame-Hour Mean; ARIMAHistorical node volumeMain comparison
Conventional ML/sequence baselinesSVR; LSTM; BiLSTMHistorical or tabular featuresMain comparison
Feature-fusion deep referencesGMF-BiLSTM; Fusion-BiLSTMTime + available structural/perturbation inputsMain comparison
RF comparison chainRF-Plain; RF-Base; RF-Core variants; RF-LowData variantsTask-specific modulesMain comparison
Temporal–graph referenceTE-GR-MLPTime + pre-change structureSupplementary reference
Dynamic graph models requiring post-change edge weightsFull dynamic-edge GNN/STGNN variantsPost-change dynamic edge weightsExcluded
Note: Dynamic graph models requiring continuous post-change edge weights are excluded from the fair main comparison because such information is unavailable before forecasting.
Table 5. Test-set WAPE comparison across models under the Base task.
Table 5. Test-set WAPE comparison across models under the Base task.
ModelTest WAPE (%)Difference Relative to RF-Plain (Base) (pts)
RF-Plain (Base)16.980.00
RF-Base17.06−0.08
Fusion-BiLSTM (Base)18.43−1.45
GMF-BiLSTM (Base)19.58−2.60
BiLSTM19.82−2.84
SVR22.34−5.36
LSTM27.78−10.80
Same-Hour Mean31.45−14.47
ARIMA33.62−16.64
Note: Positive values indicate lower WAPE than RF-Plain (Base), whereas negative values indicate higher WAPE.
Table 6. Performance comparison across models under the Core task.
Table 6. Performance comparison across models under the Core task.
ModelDescriptionTest WAPE (%)
RF-Core-PlainTime + pre-change structural features18.10
RF-Core-TopoOnlyTime + structure + basic topology-change indicators17.11
RF-Core-ProposedTime + selected structure + perturbation-response proxies16.91
Fusion-BiLSTMDeep fusion baseline18.75
BiLSTMSequence baseline20.28
SVRClassical ML baseline24.57
LSTMSequence baseline27.92
Table 7. Performance of the supplementary temporal–graph reference model under the Core task.
Table 7. Performance of the supplementary temporal–graph reference model under the Core task.
ModelVal WAPE (%)Test WAPE (%)Test MAETest RMSE
TE-GR-MLP19.915120.8311105.9230186.4553
Table 8. Test-set WAPE comparison across models under the data-scarce-node setting.
Table 8. Test-set WAPE comparison across models under the data-scarce-node setting.
ModelTest WAPE (%)Improvement vs. Plain (pts)
RF-LowData-Plain29.430.00
RF-LowData-Similarity26.682.75
Note: The data-scarce-node setting is constructed by retaining only the earliest 20% of training samples for the data-scarce nodes while keeping the validation and test observations complete. RF-LowData-Similarity further incorporates similarity-support information, among which the composite similarity score and periodic similarity score are the most effective features.
Table 9. Node-level significance and gain coverage.
Table 9. Node-level significance and gain coverage.
TaskComparisonNodesPlain WAPE (%)Enhanced WAPE (%)Improvement (pts)Median Gain (pts)p-ValueEffect Size ( r r b )Improved Nodes
Core taskCore-Plain vs. Core-Proposed5722.5621.580.981.122.17 × 10−50.646744/57
Low-data taskLowData-Plain vs. LowData-Similarity1729.5526.722.832.207.13 × 10−40.934615/17
Table 10. Core-task forecasting results by perturbation severity grouping.
Table 10. Core-task forecasting results by perturbation severity grouping.
Perturbation GroupNo. of NodesCore-Plain WAPE (%)Core-Proposed WAPE (%)Mean ΔWAPE (pts)Median ΔWAPE (pts)
Low3623.7723.220.550.92
Medium–High2120.4718.771.701.37
Note: Groups are constructed using rerouting-related state indicators, with the rerouting pressure index as the primary grouping metric. ΔWAPE is calculated as WAPE (Core-Plain) – WAPE (Core-Proposed); therefore, a positive value indicates an improvement of Core-Proposed over Core-Plain.
Table 11. High-load warning performance under the node-specific 90th percentile threshold.
Table 11. High-load warning performance under the node-specific 90th percentile threshold.
TaskSettingMiss RateRecallFalse-Warning RatioContinuous Weighted LossOperational Interpretation
CorePlain0.5960.4040.092155.92Baseline warning support under route perturbation
CoreEnhanced0.5350.4650.143148.31Earlier node warning and resource prepositioning under route perturbation
ExtensionPlain0.0630.9370.233228.55Baseline warning support for data-scarce nodes
ExtensionEnhanced0.0520.9480.236206.08Transitional warning support for operationally important data-scarce nodes
Note: Node-specific high-load thresholds are defined as the 90th empirical quantiles of raw hourly cargo volumes observed before the corresponding test period. Recall is T P / ( T P + F N ) , Miss rate is F N / ( T P + F N ) , and False-warning ratio is F P / ( T P + F P ) , namely the proportion of predicted warning events that are false alarms. Continuous weighted loss is calculated as 1 N [ 2 m a x ( y y ^ , 0 ) + m a x ( y ^ y , 0 ) ] , where under-prediction is penalized twice as heavily as over-prediction. This loss is independent of the warning-threshold quantile and is used only for evaluation.
Table 12. Scenario-specific computational interpretation of the proposed representations.
Table 12. Scenario-specific computational interpretation of the proposed representations.
ScenarioEmpirical EvidenceComputational InterpretationMethodological Implication
Stable-network forecastingRF-Plain (Base) and RF-Base show similar WAPE valuesShort-term temporal dynamics dominate when topology is stableStructural descriptors are useful as context but may add limited marginal predictive value
Route-perturbation forecastingWAPE 18.10%→17.11%→16.91%; 44/57 nodes improved; miss rate 0.596→0.535Topology-change encoding and perturbation-response proxies capture admissible pre-forecast perturbation informationPlanned topology changes can be transformed into node-level computational inputs without using realized post-change edge weights
Data-scarce-node forecastingWAPE 29.43%→26.68%; 15/17 nodes improved; miss rate 0.063→0.052Similarity support provides transitional forecasting when local history is insufficientLimited-history nodes can be supported under the same sample-construction and evaluation protocol
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, W.; Wang, Y.; Li, G.; Zhang, S.; Zhang, Z. Partial-Information Node-Level Forecasting in Directed Logistics Networks via Topology-Perturbation Encoding. Math. Comput. Appl. 2026, 31, 107. https://doi.org/10.3390/mca31030107

AMA Style

Li W, Wang Y, Li G, Zhang S, Zhang Z. Partial-Information Node-Level Forecasting in Directed Logistics Networks via Topology-Perturbation Encoding. Mathematical and Computational Applications. 2026; 31(3):107. https://doi.org/10.3390/mca31030107

Chicago/Turabian Style

Li, Weicheng, Yixian Wang, Guozheng Li, Shunyao Zhang, and Zhongwei Zhang. 2026. "Partial-Information Node-Level Forecasting in Directed Logistics Networks via Topology-Perturbation Encoding" Mathematical and Computational Applications 31, no. 3: 107. https://doi.org/10.3390/mca31030107

APA Style

Li, W., Wang, Y., Li, G., Zhang, S., & Zhang, Z. (2026). Partial-Information Node-Level Forecasting in Directed Logistics Networks via Topology-Perturbation Encoding. Mathematical and Computational Applications, 31(3), 107. https://doi.org/10.3390/mca31030107

Article Metrics

Back to TopTop