Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction

Bielecki, Przemysław; Hachaj, Tomasz; Wąs, Jarosław

doi:10.3390/app152412899

Open AccessArticle

Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction

by

Przemysław Bielecki

,

Tomasz Hachaj

^*

and

Jarosław Wąs

Faculty of Electrical Engineering, Automatics, Computer Science and Biomedical Engineering, AGH University of Krakow, Al. Mickiewicza 30, 30-059 Krakow, Poland

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2025, 15(24), 12899; https://doi.org/10.3390/app152412899

Submission received: 6 November 2025 / Revised: 27 November 2025 / Accepted: 4 December 2025 / Published: 7 December 2025

Download

Browse Figures

Versions Notes

Abstract

The application of graph convolutional neural networks for traffic prediction is a standard procedure; however, this approach is rarely used under the assumption that the exact city plan is unknown and the prediction area is a city-sized region. This paper fills this gap by proposing and evaluating the Sample and Aggregate-Voronoi method (SAGE-Voronoi), which utilizes the novel concept of Voronoi Neighborhood Weighted Graph-based convolutional networks to predict car traffic in cities. It demonstrates the usefulness of this method for short-term predictions using real sensor data from the moderate-sized town of Darmstadt. The results obtained are compared with those of other neural network algorithms, namely pure Long Short-Term Memory, SAGE, Diffusion Convolutional Gated Recurrent Unit (DCGRU), and Spatio-Temporal Graph Convolutional Neural Network (STGCN). SAGE-Voronoi obtained significantly better results than the state-of-the-art approaches. The SAGE-Voronoi graph neural network enables the reliable prediction of varying car traffic among network nodes. The proposed approach is not limited to spatiotemporal traffic data and can be utilized in other similar domains. The source code and dataset used in our experiments are available for download, enabling full reproducibility of the results.

Keywords:

traffic prediction; Voronoi Neighborhood; graph neural network; convolutional network; long short-term memory; sensor data; short-term; city-sized data

1. Introduction

Traffic prediction is one of the most important issues in intelligent transportation systems [1]. Many cities nowadays have sensor-based infrastructure that enables real-time traffic monitoring. Thanks to this high-fidelity data, civil engineers and scientists can visualize actual traffic, reason, and learn from historical data. Contemporary machine learning approaches enable a better understanding of traffic patterns and provide short- and long-term forecasts of future car traffic in the monitored area [2]. For obvious reasons, in urban areas, vehicle motion is constrained by streets, which can be modeled as graph data structures. Therefore, a natural approach to data-driven traffic modeling is to apply graph-based methods such as graph neural networks (GNN).

1.1. State-of-the-Art

Traffic flow and vehicle speed data have complex spatial dependence and temporal correlation [3,4]. This is because traffic flows through individual intersections that are connected by a network of roads. For this reason, there is a natural correlation among individual measurements that the predictive model must account for. From the point of view of urban traffic management, both traffic passenger flow forecasting at transportation hubs, such as subway/bus stations [5], and individual transport-level prediction are essential. Most often, this problem is solved by combining the functionality of a graph neural network with a recurrent neural network such as Gate Recurrent Unit (GRU) [3,6,7,8,9], Long Short-Term Memory (LSTM) [10], or well-known statistical approaches such as Hidden Markov Models (HMM) [11]. The authors proposed the creation of integrated spatio-temporal network units, such as recurrent graph neural networks (Res-RGNN) in [12] and the Dynamic Spatial-Temporal Aware Graph Neural Network (DSTAGNN) [13]. Solutions are also proposed that allow for generalizing the knowledge gathered during the training of this type of GNN and, through transfer learning, adapting it to other urban regions [14]. Data for prediction purposes can be collected in various ways [15]. These are most often inductive sensors, data from surveillance cameras, data collected through wireless communication [16], or even social media [17].

As shown, the application of Graph Neural Networks to city traffic forecasting is a straightforward choice [18,19,20,21]. However, it becomes more challenging if there is no detailed information about the route graph. In this situation, we need to estimate the connections between nodes representing the points where traffic is measured. Suppose we have no information about the topological structure of the real road network to estimate the connection graph. In that case, we can use solutions such as the adaptive graph learning algorithm proposed in [22], which captures node correlations adaptively during training. However, we usually know the exact or approximate locations of measurement sensors, which can be treated as the vertices of a connection graph in a city, with streets as edges.

The problem of connectivity between objects whose spatial coordinates are known is often solved by the Voronoi diagram [23]. Therefore, it is unsurprising that the Voronoi-based approach is frequently used to solve such problems. Over 30 years ago, in 1993, authors in [24] proposed constructing a Voronoi diagram over a set of points representing patterns in feature space to facilitate the derivation of alternative neural network structures for achieving the desired pattern classification.

More contemporary approaches use the Voronoi diagram to produce data-structural models for neural networks. For example, these models have been used for protein modeling [25,26], neighborhood analysis for robotics tactile features related to contact depth [27] and path planning [28], or general-purpose clustering [29]. In paper [30], the spatiotemporal graph convolutional network based on Voronoi diagrams is used for traffic crash prediction. Paper [31] demonstrates the effectiveness of the spatiotemporal-based predictions of the integration of Voronoi tessellations with spatiotemporal deep learning models, such as Long Short-Term Memory (LSTM) [32].

1.2. Novelty of This Paper

As shown in the state-of-the-art survey, the application of graph convolutional neural networks for traffic prediction is a standard procedure; however, it is rarely used when the exact city layout is unknown, and the prediction area spans an entire city. This paper fills this gap by proposing and evaluating a method that uses the novel concept of Voronoi Neighborhood Weighted Graph-based convolutional networks for city-scale traffic prediction, more specifically for forecasting traffic volume at intersections. The technique is demonstrated to be useful for short-term predictions using real sensor data from a moderate-sized city. The results obtained are compared with other neural network algorithms. The proposed approach is not limited to spatiotemporal traffic data and can be utilized in other similar domains.

2. Materials and Methods

The methodology used for the prediction of the car traffic has two components: (i) construction of a Voronoi Neighborhood Weighted Graph (VN-WG) capturing spatial relations between sensors, and (ii) a spatiotemporal neural network combining graph convolution with recurrent modeling. Spatial embeddings are obtained with GraphSAGE, temporal dependencies are modeled with an LSTM layer, and the final prediction is made with a fully connected layer. The following subsections present the GraphSAGE formulation, the VN-WG construction, and the hybrid SAGE-Voronoi model.

2.1. Sample and Aggregate Network Layer

The sample and aggregate method (GraphSAGE), as described in [33,34] is a reliable and popular method for inductive node embedding. It incorporates node features into the learning algorithm and can learn the topological structure of each node’s neighborhood. It can be defined in the following way:

\begin{matrix} S A G E (F, W, A) = R e L U ({[\begin{matrix} {(F_{(n, m, i f)} \times W_{(i f, o f)})}_{(n, m, o f)} \\ p i n (F_{(n, m, i f)}, A) \times W_{(i f, o f)})_{(n, m, o f)} \end{matrix}]}_{(n, m, 2 \cdot o f)}) \end{matrix}

(1)

where: F—input features (tensor of observations), W—weight tensor (trainable parameters), A—adjacency matrix of graph G, n—number of vertices in graph G, m—input sequence length (number of samples in time series),

i f

—input features count (number of features per vertex),

o f

—output features of SAGE count, and

p i n

—permutation invariant pooling operator (often maximum, mean or sum). ReLU (Rectified Linear Unit) introduces nonlinearity to the solution [35].

The SAGE layer produces low-dimensional tensor representations for all graph nodes in the form of a tensor with dimensionality

(n, m, 2 \cdot o f)

, which is a concatenation of the features tensor F multiplied by the weight tensor W and the features tensor from the specific neighborhood of each node aggregated by permutation invariant pooling

p i n

multiplied by the same weight tensor W. Next, the embedding is propagated to a temporal modeling layer, such as an LSTM or a Gated Recurrent Unit (GRU). A fully connected layer forms the final prediction. Various approaches can be used to design a node’s neighborhood in the graph. Assuming that we are dealing with real-world spatiotemporal data, the most intuitive approach is either to utilize the known topology of the graph with a distance-based threshold or, if the graph is unknown, model the graph structure only by distances between nodes, as in [36]:

A_{a b} = \{\begin{matrix} e^{- \frac{d_{a b}^{2}}{σ^{2}}} & i f & a \neq b, e^{- \frac{d_{a b}^{2}}{σ^{2}}} \geq ϵ \\ 0 & o t h e r w i s e \end{matrix}

(2)

where

A_{a b}

is a coefficient in the adjacency matrix between nodes indexed a and b, and

σ

and

ϵ

are domain-specific parameters that depend on the real-world distances d. As it can be challenging to rationally estimate the slope of (2) that is guided by

σ

, the simplified approach is often used:

A_{a b} = \{\begin{matrix} 1 & i f & a \neq b, e^{- \frac{d_{a b}^{2}}{σ^{2}}} \geq ϵ \\ 0 & o t h e r w i s e \end{matrix}

(3)

which produces a binary adjacency matrix. The binary adjacency matrix also simplifies the

p i n

operator in (1) because it does not have to take edge weight into account:

p i n_{a} (F, A) = Θ (F_{b} \forall A_{a b} \neq 0)

(4)

where

Θ

is a permutation invariant pooling function (see explanation under (1)).

However, in that approach, we lose some information about graph topology. We can convey richer information about the neighborhoods of vertices in a graph by using the approach we will discuss in the following sections.

2.2. Voronoi Neighborhood Weighted Graph

Let us recall the definition of a Voronoi diagram, which will be needed later.

Definition 1

(Voronoi diagram [23]). Let S be the set of points in the plane. For two distinct points

s_{1}, s_{2} \in S

the dominance of

s_{1}

over

s_{2}

is defined as the subset of the plane being at least close to

s_{1}

as to

s_{2}

:

d o m i n a n c e (s_{1}, s_{2}) = {x \in R^{2} : d_{E u c l i d e a n} (x, s_{1}) \leq (x, s_{2})}

(5)

where

d_{E u c l i d e a n}

is Euclidean distance; we will cover only

d_{E u c l i d e a n}

in this paper.

The cell of a point

s_{1}

is the portion of the plane lying in all of the dominance over the remaining points in S.

c e l l (s_{1}) = ⋂_{s_{i} \in S - {s_{1}}} d o m i n a n c e (s_{1}, s_{i})

(6)

While considering cells (6) for each each

s \in S

, they create partition of the plane which is called the Voronoi diagram.

Once we have established the terminology used in the definition of a Voronoi diagram, we can define a Voronoi Neighborhood Weighted Graph.

Definition 2

(Voronoi neighborhood weighted graph). Let

P = {p_{1}, \dots, p_{n}}

be the set of points in the plane. Define the Voronoi adjacency graph

G_{V} = (V, E_{V})

, where the set of nodes

V = P

and the edge

{p_{i}, p_{j}}

belong to

E_{V}

if the corresponding Voronoi cells share a boundary (those cells are adjacent to each other). For a parameter

d_{max} \in N

, the Voronoi Neighborhood Graph is defined as

G = (V, E)

with

{p_{i}, p_{j}} \in E \Leftrightarrow {dist}_{G_{V}} {p_{i}, p_{j}} \leq d_{max},

where

{dist}_{G_{V}} {p_{i}, p_{j}}

is the length of the shortest path in

G_{V}

between

p_{i}

and

p_{j}

. A weighted version

G = (V, E, W)

is obtained by assigning to each edge

{p_{i}, p_{j}} \in E

a weight

w_{i j}

derived from

{dist}_{G_{V}} (p_{i}, p_{j})

using one of the scaling rules (7)–(9).

In the next section we will present a recursive algorithm that can be used for Voronoi neighborhood graph calculation.

2.3. Voronoi Neighborhood Graph Calculation

Let us assume that we are registering data at a finite number of points with known coordinates, no pair of points have identical coordinates, and that the data are time series. Also, let us assume that we anticipate the influence of spatial relations between nodes on time series values, and that the strength of the influence is positively correlated with the proximity between points. The intuitive approach to modeling the spatial relationship between these points is to represent them as a graph

G_{V}

derived from the Voronoi diagram. The nodes of this graph are the sensor locations; an edge connects two points if their Voronoi cells share a boundary.

The Voronoi neighborhood graph G is derived from

G_{V}

and has an additional parameter

d_{m a x}

—maximal neighborhood size. G consists of all the nodes from

G_{V}

. Two nodes of G are connected if there is a path in a graph

G_{V}

of length no greater than

d_{m a x}

.

To calculate graphs

G_{V}

and G, we can apply Delaunay triangulation because the Delaunay triangulation of a discrete point set corresponds to the dual graph of the Voronoi diagram [37]. The proposed algorithm for calculating

G_{V}

and G is presented in Algorithm 1.

Algorithm 1 Calculate Voronoi neighborhood graph.

Require: P—a set of n points that represent the spatial position of measurements

(for example, the position of sensors at road crossings),

all points have distinct coordinates:

\forall p_{a}, p_{b} \in P d_{E u c l i d e a n} (p_{a}, p_{b}) = 0 \Leftrightarrow a = b

(

d_{E u c l i d e a n}

is an Euclidean distance function),

d_{m a x}

—maximal neighborhood size.

Begin

T \leftarrow D e l a u n a y (P)

▹ perform Delaunay tessellation, returns data structure T

▹ which for each

p_{i} \in P

holds information

▹ about each

p_{j} \in P

that has a common edge

A \leftarrow {[0]}_{n \times n}

▹ initialize adjacency matrix of size

n \times n

with zeros

▹ for the graph that will be generated for each of the n points in P

procedure Calculate_A(

i, k, T, A, d, d_{m a x}

) ▹ Calculate the adjacency matrix

▹ where i—initial point index,

▹k—neighbor point index,

▹T—Delaunay tessellation structure,

▹A—adjacency matrix,

▹d—actual neighborhood distance.

for

j \in T [k]

do

if (

A [i, j] = 0

or

A [i, j] > d

) and

j \neq i

then

A [i, j] \leftarrow A [j, i] \leftarrow d

end if

if

d < d_{m a x}

then

Calculate_A

(i, j, T, A, d + 1, d_{m a x})

end if

end for

end procedure

for

p_{i} \in P

do ▹ Fill adjacency matrix for each

p_{i} \in P

Calculate_A

(i, i, T, A, 1, d_{m a x})

end for

return A

Figure 1 illustrates the process of calculating the Voronoi neighborhood. In this image, a Voronoi diagram has been generated from a set of points. To calculate the neighborhood of a point indicated in red, we evaluate all cells around it with increasing diameter. The neighborhood level between the red and blue points is color-coded. We repeat this procedure for all points to compute the adjacency matrix of G.

The next step in our approach is to rescale the adjacency matrix A generated by Algorithm 1 so that the farther the path between the nodes in graph

G_{V}

is, the smaller the values in the rescaled adjacency matrix

A^{'}

. In other words, the weights of the edges in

A^{'}

should be inversely proportional to the path length between nodes in

G_{V}

. To achieve this, we can apply one of several possible approaches:

Linear scaling:

$A_{a, b}^{'} = \{\begin{matrix} \frac{1}{A_{a, b}} & i f & A_{a, b} \neq 0 \\ 0 & i f & A_{a, b} = 0 \end{matrix}$

(7)
Exponential scaling:

$A_{a, b}^{'} = e^{1 - A_{a, b}}$

(8)
Binary thresholding:

$A_{a, b}^{'} = \{\begin{matrix} 1 & i f & A_{a, b} \neq 0 \\ 0 & i f & A_{a, b} = 0 \end{matrix}$

(9)

2.4. Voronoi Neighborhood in Graph Neural Network: SAGE-Voronoi

After applying Algorithm 1 and one of the approaches (7)–(9) we can use the adjacency matrix

A^{'}

in SAGE layer (1). In order to do so, we modify the

p i n

operator in (1) so that it takes edge weight into account:

p i n_{V o r o n o i} (F, A) = Θ (F_{b} \cdot A_{a b}^{'})

(10)

The graph convolutional neural network proposed in this paper is composed of a SAGE layer (1) with the

p i n

operator (10) for graph embedding, followed by an LSTM layer for temporal modeling. The final, third layer is the fully connected layer that calculates the network response by performing a linear combination of the LSTM outputs. We will refer to this network later in this paper as SAGE-Voronoi. The loss function used for training is a mean squared error (MSE):

M S E (y, \hat{y}) = \frac{\sum_{i}^{n} {(y_{i} - \hat{y_{i}})}^{2}}{n}

(11)

A “Basic” SAGE network can also be used to perform an ablation study on the influence of the parameter

d_{m a x}

in the SAGE-Voronoi network. In practice, when we replace

A_{a b}^{'}

(7)–(9) in (10) by

A_{a b}

(3) (

A_{a b}

is a binary adjacency matrix), the SAGE layer works the same as a SAGE-Voronoi layer. Parameter

d_{m a x}

plays a role in Algorithm 1 similar to that of

ϵ

in (3); however, when using the scaling functions (7)–(8), we can produce a non-binary adjacency matrix. Replacing (3) with Voronoi neighborhood graph scaled with binary thresholding (9) in (10) changes the spatio-temporal graph problem from a pure distance-based approach to a neighborhood-based approach.

2.5. Dataset

To evaluate the usability of the network we propose, we used a city-scale car traffic dataset from Darmstadt city, which is available to download from https://opendata.darmstadt.de/search/tags/Transport%20und%20Verkehr-24 (accessed on 4 November 2025). Darmstadt is an example of a smart city whose data serve as a reference for various studies in statistics and machine learning [1,38,39,40,41].

The data is updated every minute and is provided in CSV format. The collection contains traffic volume values for individual intersections with known coordinates. The connections (roads) between intersections are not present in the dataset. We will treat this set of intersection coordinates as points P in Definition 2 and in Algorithm 1.

To download the data, we used the script available at https://github.com/browarsoftware/darmstadt_download (accessed on 4 November 2025). We have utilized data from 35 days from 1 March 2024, resampled to 10-min intervals. During this period, 104 crossings with active sensors were included. We added up the measurements from all the sensors at each crossing, so we did not account for the direction of car traffic. As a result, the adjacency matrix

A^{'}

is symmetric. If during the 35 days any sensor did not provide traffic data, we replaced the readings with zeros (we did not apply any procedure to fill in missing data). Our dataset does not contain information on why the car traffic values are missing. There may be two reasons for this: the closure of the road infrastructure and the measurement sensor being turned off, or a malfunction of the measurement sensor. Since we do not know the reasons, excluding this type of data would distort the evaluation of our method, as we assume that we want to test its performance on real-world data. What is more, a common approach to missing data is to approximate it. In practice, this approximation is based on analogous solutions used for prediction [42,43], such as graph neural networks, which are similar to the predictors we evaluate in this paper. We therefore decided that estimating missing data is outside the scope of this paper. It does not affect the evaluation of the proposed method, as the experimental setup for all tested methods is identical. We have split the dataset into the train, validation, and test subsets in proportions

0.5

,

0.2

, and

0.3

, respectively, starting from the earliest to the latest time periods. The train and validation data were used during training. Evaluation of method performance was made on the test dataset. Each subset was randomly shuffled using a fixed seed. We repeated our experiments 10 times, changing the seed each time. Each of the three datasets was standardized by removing the mean and scaling to unit variance.

Figure 2 shows the locations of car crossings on a map of the city of Darmstadt. Red crossings are those discussed in detail in Section 3 and Section 4.

3. Results

We have implemented our method using the Python 3.8 programming language and the machine learning libraries TensorFlow 2.8 [45], Keras 2.8 [46], and SciPy 1.8 [47], which include the Delaunay tessellation. Our implementation of the original SAGE GNN, which we significantly extended, is based on the source code https://keras.io/examples/timeseries/timeseries_traffic_forecasting/ (accessed on 4 November 2025). The source code and dataset for our experiments can be downloaded from https://github.com/bielprze/SAGE-Voronoi (accessed on 4 November 2025), and the experiments are fully reproducible.

To evaluate the proposed SAGE-Voronoi method, we have used the dataset described in Section 2.5. We have considered three short-term forecast horizons: 1-sample horizon (10 min ahead), 2-sample horizons (20 min ahead), and 3-sample horizons (30 min ahead). We have tested three adjacency scalers, as defined by Equations (7)–(9). The results of the SAGE-Voronoi network were compared with those of the original SAGE approach and the simple Pure LSTM approach. Both the SAGE and SAGE-Voronoi used 64 LSTM units, with the length W in (1) set to 10. The networks were trained using the RMSprop optimizer [48] with a learning rate of

0.0002

for 40 epochs. The maximum neighborhood size

d_{m a x}

in Algorithm 1 was set to 5. We have used mean as a pooling function in permutation invariant pooling for both SAGE and SAGE-Voronoi.

A Pure LSTM network consists of an LSTM layer with 200 units, a connected dense layer with 200 units with ReLu activation, and a final dense layer with a size equivalent to the forecast horizon. The network was trained using the Adam optimizer [49] with a learning rate of

0.0001

for 200 epochs. The loss function was mean squared error (MSE). We used the meta-parameters for SAGE-family networks proposed by the creators of the original SAGE implementations.

We have also compared the performance of the proposed solution with two state-of-the-art deep learning approaches to car traffic prediction, namely Diffusion Convolutional Gated Recurrent Unit (DCGRU) [50] and Spatio-Temporal Graph Convolutional Neural Network (STGCN) [36]. For DCGRU, as suggested in the original implementation, we use the MSE loss, the Adam optimizer with a learning rate of

10^{- 3}

, 30 epochs of training, 20 DCGRU units, and a Diffusion Convolution parameter

k = 2

. We have utilized the network implementation available at https://github.com/mensif/DCGRU_Tensorflow2 (accessed on 22 November 2025). For STGCN, as suggested in the original implementation, we use the MSE loss, the Root Mean Square Propagation (RMSprop) optimizer with a learning rate of

10^{- 3}

, 40 epochs of training, and a channel size of

(64, 32, 128)

in the ST-convolution block. We have utilized the network implementation available at https://github.com/Swadesh13/STGCN-Tf2 (accessed on 22 November 2025). Each network was trained for a specified number of epochs until no further loss decrease was observed. When there was no decrease in loss, we stopped training the network.

We have evaluated four error functions—MSE (11), root mean squared error (RMSE)

R M S E (y, \hat{y}) = \sqrt{M S E (y, \hat{y})}

(12)

mean absolute error (MAE)

M A E (y, \hat{y}) = \frac{\sum_{i}^{n} |y_{i} - \hat{y_{i}}|}{n}

(13)

and mean relative error (MRE)

M R E (y, \hat{y}) = \frac{1}{n} \sum_{i}^{n} \frac{|y_{i} - \hat{y_{i}}|}{|y_{i}|}

(14)

Results for the Pure LSTM network are presented in Table 1, for the DCGRU network in Table 2, for STGCN in Table 3, for the SAGE network in Table 4, and for SAGE-Voronoi in Table 5. All results were averaged over 10 repetitions with different random seeds (see Section 2.5); hence, the table headers report “Mean MSE,” “Mean RMSE,” “Mean MAE,” and “Mean MRE” plus-minus standard deviation. In Table 6, Table 7 and Table 8, we present calculations of confidence intervals between means with a level of confidence equal to 0.975 for 1-sample prediction, 2-sample prediction, and 3-sample prediction, respectively, and the mean RMSE is considered. A confidence interval equal to 0 indicates that the difference is not statistically significant, suggesting there may be no real difference between the means. Our experiment aimed to check which network is best at predicting car traffic in three short-term time horizons (1, 2, and 3 samples ahead). In the evaluation, we did not consider individual streets separately but calculated the aggregate average of metrics (11)–(14) for the entire city. In practice, analyzing the network’s performance on individual streets would be impractical for a city the size of Darmstadt and would obscure the overall effectiveness of the tested methods. However, to better visualize the effectiveness of individual networks at the street-level performance in Figure 3 and Figure 4, we present detailed traffic forecast values for three selected crossings that are representative of our dataset. These crossings are A003, A017, and A019. Figure 5 presents evaluation results of all tested neural networks (means with standard deviations are marked as error bars) for 1-sample prediction, and the mean RMSE is considered.

In Table 9 we present a comparison of the mean runtime (training + inference) for all tested approaches (seconds per epoch) and the prediction time (on the whole test dataset). All methods were evaluated on a GPU, except for STGCN predictions, which, due to insufficient GPU RAM, were evaluated on a CPU (on the tested hardware setup, the GPU accelerates calculations approximately 4 times compared to the CPU).

4. Discussion

As shown in Table 1, Table 2, Table 3, Table 4, Table 5 and Figure 5, all graph neural networks outperform the Pure LSTM architecture. Providing node topology information clearly improves the predictive capabilities of the DCGRU, STGCN, SAGE, and SAGE-Voronoi architectures. A non-linear fully connected layer in the Pure LSTM approach is insufficient to deduce this information from the training dataset. The Mean MRE prediction of Pure LSTM never dropped below

0.3

while the Mean RMSE was around 0.9 and the Mean MAE around 0.4.

DCGRU performs slightly better than LSTM; however, it obtains worse results than other graph networks. According to Table 6, Table 7 and Table 8, the difference in performance measured by RMSE is statistically significant for all considered sample ranges. STGCN performs better than LSTM and DCGRU, but has higher variance than SAGE-family networks. In the case when sample size equals 1 (10 min), there is no significant difference between STGCN and SAGE-family networks (see Table 6). In the case of 2 samples (20 min) and 3 samples (30 min), the predictions (see Table 7 and Table 8) show that STGCN performs significantly worse than SAGE-family networks (the values in the corresponding columns are above zero). The RMSEs in Table 3 are 79.8 and 85.8, respectively, while for the SAGE in Table 4, they are 7374 and 8202. We can observe a significant difference between SAGE and SAGE-Voronoi for scaling (7) and (8) for a 1-sample prediction in Table 6, (8) for 2- and 3-sample predictions in Table 7 and Table 8. In all those cases, SAGE-Voronoi performs better than a basic SAGE network.

It is also worth noting that there is a positive correlation between the values of MSE, RMSE, MAE, and MRE. This means that an increase or decrease in one of these metrics is reflected in an increase or decrease in the others (this is obvious for MSE and RMSE). The MRE metric is especially important because it shows the error rate as a percentage of the actual value. In the Darmstadt dataset, traffic varies from dozens to thousands of cars per sample, so a relative measure is more appropriate for assessing prediction quality. Both networks yielded very similar results for SAGE and SAGE-Voronoi. For forecast horizons of 10 min (1 sample) and 30 min (3 samples), the results of SAGE-Voronoi were better across all considered adjacency scaling methods. For one and three samples, prediction Mean MRE in SAGE-Voronoi dropped by

0.012

compared to the SAGE approach while using linear scaling (7). For the 20 min prediction (2 samples), SAGE-Voronoi has slightly worse results in the case of exponential scaling (8) for all error metrics. In comparison, the other two adjacency scalings resulted in better performance across all metrics, except Mean MRE, which has identical values for (9). Therefore, we conclude that, in most cases, applying the Voronoi neighborhood graph improvement proposed in this paper has a positive effect on the SAGE graph neural network’s prediction performance.

Figure 3 and Figure 4 present a more detailed visualization of the networks’ performance on three various crossings. These three crossings were selected because they exhibit significantly different scales of average movement per unit of time. As shown, both SAGE and SAGE-Voronoi perform very similarly; however, the quantitative error measures clearly demonstrate the advantage of the SAGE-Voronoi approach. The Pure LSTM approach is visibly and quantitatively inferior to the other methods. An interesting phenomenon also occurs at the A017 crossing, where there is heavy traffic from April 1 to April 2 (see Figure 4a and the enlarged fragment in in Figure 4b). The SAGE-Voronoi was able to predict the excessive traffic more accurately.

As shown in Table 9, using the Darmstadt dataset as the benchmark, the slowest of the considered algorithms is STGCN. One epoch of training lasts about 10.1 s. This is nearly twice as slow as one training epoch of DCGRU, over three times as slow as LSTM, and four times as slow as SAGE and Sage-Voronoi. The SAGE network is the fastest; however, it is only 8% faster than SAGE-Voronoi. In terms of prediction performance on the entire test dataset, all considered networks are in the same order; however, DCGRU and LSTM operate at nearly identical speeds. In the case of STGCN, prediction is measured on the CPU rather than the GPU due to insufficient GPU RAM (the hardware we used in the experiment on the GPU typically provides about four times acceleration compared to the CPU). We can conclude that the SAGE network is the fastest, while SAGE-Voronoi is the second-fastest. However, we must acknowledge that this comparison might be unreliable—although we used implementations of all methods from their original papers, we cannot guarantee that those implementations are optimal.

Summarizing the proposed SAGE-Voronoi graph neural network allows reliable prediction of varied car traffic among network nodes. It also better fits the non-typical data in our dataset, demonstrating superior generalization performance compared to the basic SAGE network.

5. Conclusions

The Voronoi neighborhood enables modeling real-world scenarios in which measurement sensors are unevenly and sparsely distributed over an area. When those sensors are situated at road crossings, assuming the neighborhood is distance-based (see Equation (10)) may lead to incorrect conclusions about the graph topology. The evaluation results indicate that applying the Voronoi framework nicely improves the predictive ability of a graph neural network. By using various scaling functions (7)–(9) to the adjacency matrix of the Voronoi neighborhood weighted graph, we can scale the influence of each graph vertex’s neighborhood on the calculation of the output signal of the sample and the aggregate layer of the network. The results we have obtained so far are promising, and further research will address another potential application of the proposed Voronoi-based framework. There are no methodological obstacles to applying the proposed methodology to non-symmetric graphs and using other distance metrics than Euclidean. Future work will investigate predictive ability on larger datasets with longer time series. Since the proposed method is not limited to traffic data, it may also be applied and evaluated in other spatiotemporal domains.

Author Contributions

Conceptualization, P.B., T.H. and J.W.; methodology, P.B. and T.H.; software, P.B. and T.H.; validation, P.B.; formal analysis, P.B. and T.H.; investigation, P.B.; resources, P.B. and T.H.; data curation, P.B. and T.H.; writing—original draft preparation, P.B., T.H. and J.W.; writing—review and editing, P.B., T.H. and J.W.; visualization, P.B. and T.H.; supervision, T.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data and source codes in this study can be downloaded from https://github.com/browarsoftware/darmstadt_download and https://github.com/bielprze/SAGE-Voronoi appropriately (accessed on 4 November 2025).

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

DCGRU	Diffusion Convolutional Gated Recurrent Unit
GNN	Graph Neural Networks
GRU	Gated Recurrent Unit
LSTM	Long Short-Term Memory
MAE	Mean absolute error
MRE	Mean relative error
MSE	Mean squared error
ReLU	Rectified Linear Unit
RMSE	Root mean squared error
SAGE	Sample and Aggregate
STGCN	Spatio-Temporal Graph Convolutional Neural Network
VN-WG	Voronoi Neighborhood Weighted Graph

References

Patelli, A.; Hamilton, J.R.; Lush, V.; Ekart, A. A gentler approach to urban traffic modelling and prediction. In Proceedings of the 2022 IEEE Congress on Evolutionary Computation (CEC), Padua, Italy, 18–23 July 2022; IEEE: Piscataway, NJ, USA, 2022; pp. 1–8. [Google Scholar] [CrossRef]
Gomes, B.; Coelho, J.; Aidos, H. A survey on traffic flow prediction and classification. Intell. Syst. Appl. 2023, 20, 200268. [Google Scholar] [CrossRef]
Jiang, W.; Xiao, Y.; Liu, Y.; Liu, Q.; Li, Z. Bi-GRCN: A Spatio-Temporal Traffic Flow Prediction Model Based on Graph Neural Network. J. Adv. Transp. 2022, 2022, 5221362. [Google Scholar] [CrossRef]
Xie, Z.; Lv, W.; Huang, S.; Lu, Z.; Du, B.; Huang, R. Sequential Graph Neural Network for Urban Road Traffic Speed Prediction. IEEE Access 2020, 8, 63349–63358. [Google Scholar] [CrossRef]
Peng, H.; Wang, H.; Du, B.; Bhuiyan, M.Z.A.; Ma, H.; Liu, J.; Wang, L.; Yang, Z.; Du, L.; Wang, S.; et al. Spatial temporal incidence dynamic graph neural networks for traffic flow forecasting. Inf. Sci. 2020, 521, 277–290. [Google Scholar] [CrossRef]
Wang, Y.; Zheng, J.; Du, Y.; Huang, C.; Li, P. Traffic-GGNN: Predicting Traffic Flow via Attentional Spatial-Temporal Gated Graph Neural Networks. IEEE Trans. Intell. Transp. Syst. 2022, 23, 18423–18432. [Google Scholar] [CrossRef]
Zhao, L.; Song, Y.; Zhang, C.; Liu, Y.; Wang, P.; Lin, T.; Deng, M.; Li, H. T-GCN: A Temporal Graph Convolutional Network for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2020, 21, 3848–3858. [Google Scholar] [CrossRef]
Wang, X.; Ma, Y.; Wang, Y.; Jin, W.; Wang, X.; Tang, J.; Jia, C.; Yu, J. Traffic flow prediction via spatial temporal graph neural network. In Proceedings of the Web Conference 2020, Taipei, Taiwan, 20–24 April 2020; pp. 1082–1092. [Google Scholar] [CrossRef]
Guo, K.; Hu, Y.; Qian, Z.; Liu, H.; Zhang, K.; Sun, Y.; Gao, J.; Yin, B. Optimized Graph Convolution Recurrent Neural Network for Traffic Prediction. IEEE Trans. Intell. Transp. Syst. 2021, 22, 1138–1149. [Google Scholar] [CrossRef]
Chen, L.; Shao, W.; Lv, M.; Chen, W.; Zhang, Y.; Yang, C. AARGNN: An Attentive Attributed Recurrent Graph Neural Network for Traffic Flow Prediction Considering Multiple Dynamic Factors. IEEE Trans. Intell. Transp. Syst. 2022, 23, 17201–17211. [Google Scholar] [CrossRef]
Diehl, F.; Brunner, T.; Le, M.T.; Knoll, A. Graph Neural Networks for Modelling Traffic Participant Interaction. In Proceedings of the 2019 IEEE Intelligent Vehicles Symposium (IV), Paris, France, 9–12 June 2019; pp. 695–701. [Google Scholar] [CrossRef]
Chen, C.; Li, K.; Teo, S.G.; Zou, X.; Wang, K.; Wang, J.; Zeng, Z. Gated residual recurrent graph neural networks for traffic prediction. In Proceedings of the AAAI Conference on Artificial Intelligence, Honolulu, HI, USA, 27 January–1 February 2019; Volume 33, pp. 485–492. [Google Scholar] [CrossRef]
Lan, S.; Ma, Y.; Huang, W.; Wang, W.; Yang, H.; Li, P. DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting. In Proceedings of the 39th International Conference on Machine Learning, Baltimore, MD, USA, 17–23 July 2022; Chaudhuri, K., Jegelka, S., Song, L., Szepesvari, C., Niu, G., Sabato, S., Eds.; Proceedings of Machine Learning Research 2022. Volume 162, pp. 11906–11917. [Google Scholar]
Huang, Y.; Song, X.; Zhang, S.; Yu, J.J. Transfer Learning in Traffic Prediction with Graph Neural Networks. In Proceedings of the 2021 IEEE International Intelligent Transportation Systems Conference (ITSC), Indianapolis, IN, USA, 9–22 September 2021; pp. 3732–3737. [Google Scholar] [CrossRef]
Verendel, V.; Yeh, S. Measuring traffic in cities through a large-scale online platform. J. Big Data Anal. Transp. 2019, 1, 161–173. [Google Scholar] [CrossRef]
Zhang, Q.; Yu, K.; Guo, Z.; Garg, S.; Rodrigues, J.J.P.C.; Hassan, M.M.; Guizani, M. Graph Neural Network-Driven Traffic Forecasting for the Connected Internet of Vehicles. IEEE Trans. Netw. Sci. Eng. 2022, 9, 3015–3027. [Google Scholar] [CrossRef]
Wang, S.; He, L.; Stenneth, L.; Yu, P.S.; Li, Z.; Huang, Z. Estimating Urban Traffic Congestions with Multi-sourced Data. In Proceedings of the 2016 17th IEEE International Conference on Mobile Data Management (MDM), Porto, Portugal, 13–16 June 2016; Volume 1, pp. 82–91. [Google Scholar] [CrossRef]
Jiang, W.; Luo, J. Graph neural network for traffic forecasting: A survey. Expert Syst. Appl. 2022, 207, 117921. [Google Scholar] [CrossRef]
Sant’Ana da Silva, E.; Pedrini, H.; Santos, A.L.d. Applying Graph Neural Networks to Support Decision Making on Collective Intelligent Transportation Systems. IEEE Trans. Netw. Serv. Manag. 2023, 20, 4085–4096. [Google Scholar] [CrossRef]
Cheng, S.; Wang, Z.; Yang, B.; Nakano, K. Convolutional Neural Network-Based Lane-Change Strategy via Motion Image Representation for Automated and Connected Vehicles. IEEE Trans. Neural Netw. Learn. Syst. 2024, 35, 12953–12964. [Google Scholar] [CrossRef]
Davis, N.; Raina, G.; Jagannathan, K. Grids Versus Graphs: Partitioning Space for Improved Taxi Demand-Supply Forecasts. IEEE Trans. Intell. Transp. Syst. 2021, 22, 6526–6535. [Google Scholar] [CrossRef]
Zhang, W.; Zhu, F.; Lv, Y.; Tan, C.; Liu, W.; Zhang, X.; Wang, F.Y. AdapGL: An adaptive graph learning algorithm for traffic prediction based on spatiotemporal neural networks. Transp. Res. Part C Emerg. Technol. 2022, 139, 103659. [Google Scholar] [CrossRef]
Aurenhammer, F. Voronoi diagrams—A survey of a fundamental geometric data structure. ACM Comput. Surv. 1991, 23, 345–405. [Google Scholar] [CrossRef]
Bose, N.; Garga, A. Neural network design using Voronoi diagrams. IEEE Trans. Neural Netw. 1993, 4, 778–787. [Google Scholar] [CrossRef]
Olechnovič, K.; Venclovas, Č. VoroIF-GNN: Voronoi tessellation-derived protein–protein interface assessment using a graph neural network. Proteins Struct. Funct. Bioinform. 2023, 91, 1879–1888. [Google Scholar] [CrossRef] [PubMed]
Igashov, I.; Olechnovič, K.; Kadukova, M.; Venclovas, C.; Grudinin, S. VoroCNN: Deep convolutional neural network built on 3D Voronoi tessellation of protein structures. Bioinformatics 2021, 37, 2332–2339. [Google Scholar] [CrossRef] [PubMed]
Fan, W.; Yang, M.; Xing, Y.; Lepora, N.F.; Zhang, D. Tac-VGNN: A Voronoi Graph Neural Network for Pose-Based Tactile Servoing. In Proceedings of the 2023 IEEE International Conference on Robotics and Automation (ICRA), London, UK, 29 May–2 June 2023; pp. 10373–10379. [Google Scholar] [CrossRef]
Qian, F.; Liu, W.; Bao, H.; Shi, X. A CNN-Based Fast Generalized Voronoi Diagrams Framework for Path Planning. In Proceedings of the 2024 International Conference on Networking, Sensing and Control (ICNSC), Hangzhou, China, 18–20 October 2024; pp. 1–5. [Google Scholar] [CrossRef]
Gentile, C.; Sznaier, M. An improved Voronoi-diagram-based neural net for pattern classification. IEEE Trans. Neural Netw. 2001, 12, 1227–1234. [Google Scholar] [CrossRef]
Gan, J.; Yang, Q.; Zhang, D.; Li, L.; Qu, X.; Ran, B. A Novel Voronoi-Based Spatio-Temporal Graph Convolutional Network for Traffic Crash Prediction Considering Geographical Spatial Distributions. IEEE Trans. Intell. Transp. Syst. 2024, 25, 21723–21736. [Google Scholar] [CrossRef]
Wang, H.; Zhou, H.; Cheng, S. Dynamical system prediction from sparse observations using deep neural networks with Voronoi tessellation and physics constraint. Comput. Methods Appl. Mech. Eng. 2024, 432, 117339. [Google Scholar] [CrossRef]
Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar] [CrossRef]
Hamilton, W.L.; Ying, R.; Leskovec, J. Inductive representation learning on large graphs. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Red Hook, NY, USA, 4–9 December 2017; pp. 1025–1035. [Google Scholar]
Chetoui, I.; El Bachari, E.; Ait Lahcen, Y. Creating Semantic Learner Groups in Distance Education Using the GraphSAGE approach. In Proceedings of the E3S Web of Conferences, Meknes, Morocco, 21–22 November 2024; EDP Sciences 2025. Volume 601, p. 00096. [Google Scholar] [CrossRef]
Hara, K.; Saito, D.; Shouno, H. Analysis of function of rectified linear unit used in deep learning. In Proceedings of the 2015 International Joint Conference on Neural Networks (IJCNN), Killarney, Ireland, 12–17 July 2015; pp. 1–8. [Google Scholar] [CrossRef]
Yu, B.; Yin, H.; Zhu, Z. Spatio-temporal graph convolutional networks: A deep learning framework for traffic forecasting. In Proceedings of the 27th International Joint Conference on Artificial Intelligence, Stockholm, Sweden, 13–19 July 2018; pp. 3634–3640. [Google Scholar]
Fortune, S. Voronoi diagrams and Delaunay triangulations. In Computing in Euclidean Geometry; CRC Press: Boca Raton, FL, USA, 1997; pp. 193–233. [Google Scholar] [CrossRef]
Kasaraneni, P.P.; Yellapragada, V.P.K.; Moganti, G.L.K.; Flah, A. Analytical Enumeration of Redundant Data Anomalies in Energy Consumption Readings of Smart Buildings with a Case Study of Darmstadt Smart City in Germany. Sustainability 2022, 14, 10842. [Google Scholar] [CrossRef]
Kodali, Y.; Kumar, Y.V.P. ANOVA-Based Variance Analysis in Smart Home Energy Consumption Data Using a Case Study of Darmstadt Smart City, Germany. Eng. Proc. 2024, 82, 31. [Google Scholar] [CrossRef]
Birkle, C.; Hess, C. Integrating Data Ethics in Smart Cities: Insights from Leading European Cities. In Proceedings of the International Scientific-Practical Conference, Cologne, Germany, 9–10 October 2024; Springer: Berlin/Heidelberg, Germany, 2024; pp. 427–441. [Google Scholar] [CrossRef]
Gosek, Ł.; Muras, F.; Michałek, P.; Wąs, J. Traffic Prediction Based on Modified Nagel-Schreckenberg Model. Case Study for Traffic in the City of Darmstadt. In Proceedings of the International Conference on Parallel Processing and Applied Mathematics, Bialystok, Poland, 8–11 September 2019; Springer: Berlin/Heidelberg, Germany, 2019; pp. 478–488. [Google Scholar] [CrossRef]
Zhang, Y.; Kong, X.; Zhou, W.; Liu, J.; Fu, Y.; Shen, G. A Comprehensive Survey on Traffic Missing Data Imputation. IEEE Trans. Intell. Transp. Syst. 2024, 25, 19252–19275. [Google Scholar] [CrossRef]
Laña, I.; Olabarrieta, I.I.; Vélez, M.; Del Ser, J. On the imputation of missing data for road traffic forecasting: New insights and novel techniques. Transp. Res. Part C Emerg. Technol. 2018, 90, 18–33. [Google Scholar] [CrossRef]
OpenStreetMap Contributors. Planet Dump Retrieved from https://planet.osm.org. 2017. Available online: https://www.openstreetmap.org (accessed on 4 November 2025).
Abadi, M.; Barham, P.; Chen, J.; Chen, Z.; Davis, A.; Dean, J.; Devin, M.; Ghemawat, S.; Irving, G.; Isard, M.; et al. TensorFlow: A System for Large-Scale Machine Learning. In Proceedings of the 12th USENIX Symposium on Operating Systems Design and Implementation (OSDI 16), Savannah, GA, USA, 2–4 November 2016; pp. 265–283. [Google Scholar]
Keras. 2015. Available online: https://keras.io (accessed on 4 November 2025).
Virtanen, P.; Gommers, R.; Oliphant, T.E.; Haberland, M.; Reddy, T.; Cournapeau, D.; Burovski, E.; Peterson, P.; Weckesser, W.; Bright, J.; et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 2020, 17, 261–272. [Google Scholar] [CrossRef] [PubMed]
Elshamy, R.; Abu-Elnasr, O.; Elhoseny, M.; Elmougy, S. Improving the efficiency of RMSProp optimizer by utilizing Nestrove in deep learning. Sci. Rep. 2023, 13, 8814. [Google Scholar] [CrossRef]
Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. arXiv 2017, arXiv:1412.6980. [Google Scholar] [CrossRef]
Li, Y.; Yu, R.; Shahabi, C.; Liu, Y. Diffusion Convolutional Recurrent Neural Network: Data-Driven Traffic Forecasting. In Proceedings of the International Conference on Learning Representations (ICLR’18), Vancouver, BC, Canada, 30 April–3 May 2018. [Google Scholar]

Figure 1. This figure illustrates the concept of calculating the Voronoi neighborhood. The Voronoi diagram is generated from a set of points. To calculate the neighborhood of a point indicated in red, we evaluate all cells around it with increasing radii. The neighborhood level between the red and blue points is color-coded.

Figure 2. Positions of car crossings on the city map of Darmstadt. The red crossings are those discussed in detail in Section 3 and Section 4. Map data copyrighted OpenStreetMap contributors and available from https://www.openstreetmap.org [44] (accessed on 4 November 2025).

Figure 3. The detailed results for the selected crossings. Under each plot, we present mean error values between the original dataset and evaluated networks. The error orders are the same as in Table 1. The networks are ordered as follows: SAGE, SAGE-Voronoi, Pure LSTM. (a) Errors for SAGE (850.383, 29.161, 22.221, 0.150), SAGE-Voronoi (809.970, 28.460, 21.580, 0.145), DCGRU (1611.233, 40.140, 29.601, 0.181), STGCN (1114.347, 33.382, 24.567, 0.159), and Pure LSTM (3391.328, 58.235, 43.948, 0.263); (b) errors for SAGE (35.861, 5.988, 4.402, 0.331), SAGE-Voronoi (34.095, 5.839, 4.283, 0.325), DCGRU (38.102, 6.173, 4.540, 0.342), STGCN (52.081, 7.217, 5.176, 0.534), and Pure LSTM (62.588, 7.911, 5.783, 0.390).

Figure 4. The detailed results for the selected crossings. Under each plot, we present mean error values between the original dataset and evaluated networks. The error orders are the same as in Table 1. The networks are ordered as follows: SAGE, SAGE-Voronoi, Pure LSTM. (a) Errors for SAGE (87,823.811, 296.350, 109.077, 0.234), SAGE-Voronoi (60,322.241, 245.605, 95.932, 0.226), DCGRU (68,537.251, 261.796, 112.676, 0.309), STGCN (59,309.944, 243.536, 99.497, 0.273), and Pure LSTM (83,675.687, 289.267, 165.341, 0.649); (b) errors for SAGE (864,463.345, 929.765, 678.609, 0.547), SAGE-Voronoi (530029.143, 728.031, 525.195, 0.503), textcolorredDCGRU (572,398.523, 756.570, 550.173, 0.537), STGCN (392,170.993, 626.236, 452.298, 0.688), and Pure LSTM (630,720.066, 794.178, 582.447, 0.756).

Figure 5. The evaluation results (mean RMSE is considered) of all tested neural networks on the test dataset described in Section 2.5 (means with standard deviations are marked as error bars) for 1-sample prediction.

Table 1. The evaluation results of the Pure LSTM neural network on the test dataset described in Section 2.5 (means plus-minus standard deviations).

Forecast Horizon	Mean MSE	Mean RMSE	Mean MAE	Mean MRE
10 min (1 sample)	10,793 ± 358	94.8 ± 1.6	39.9 ± 0.6	0.300 ± 0.007
20 min (2 samples)	11,328 ± 470	98.5 ± 1.8	40.6 ± 0.9	0.304 ± 0.007
30 min (3 samples)	11,758 ± 565	101.0 ± 2.4	41.5 ± 0.7	0.314 ± 0.008

Table 2. The evaluation results of the DCGRU neural network on the test dataset described in Section 2.5 (means plus-minus standard deviations).

Forecast Horizon	Mean MSE	Mean RMSE	Mean MAE	Mean MRE
10 min (1 sample)	10,082 ± 303	90.5 ± 1.3	37.7 ± 0.6	0.288 ± 0.007
20 min (2 samples)	10,410 ± 463	94.2 ± 1.9	38.9 ± 0.7	0.304 ± 0.011
30 min (3 samples)	10,735 ± 357	96.4 ± 1.5	40.1 ± 0.7	0.319 ± 0.008

Table 3. The evaluation results of the STGCN neural network on the test dataset described in Section 2.5 (means plus-minus standard deviations).

Forecast Horizon	Mean MSE	Mean RMSE	Mean MAE	Mean MRE
10 min (1 sample)	6436 ± 287	70.7 ± 1.4	28.2 ± 0.7	0.302 ± 0.031
20 min (2 samples)	8135 ± 272	79.8 ± 1.3	31.1 ± 0.8	0.323 ± 0.033
30 min (3 samples)	9370 ± 329	85.8 ± 1.5	33.9 ± 0.9	0.351 ± 0.035

Table 4. The evaluation results for the SAGE neural network on the test dataset described in Section 2.5 (means plus-minus standard deviations).

Forecast Horizon	Mean MSE	Mean RMSE	Mean MAE	Mean MRE
10 min (1 sample)	6357 ± 146	70.7 ± 0.7	25.0 ± 0.4	0.241 ± 0.021
20 min (2 samples)	7374 ± 44	76.7 ± 0.2	27.5 ± 0.3	0.261 ± 0.021
30 min (3 samples)	8202 ± 86	81.1 ± 0.4	30.1 ± 0.3	0.287 ± 0.017

Table 5. The evaluation results of the SAGE-Voronoi neural network on the test dataset described in Section 2.5 (means plus-minus standard deviations).

Forecast Horizon	Adjacency Scaler	Mean MSE	Mean RMSE	Mean MAE	Mean MRE
10 min (1 sample)	(7)	6192.2 ± 70.1	69.9 ± 0.4	24.6 ± 0.1	0.229 ± 0.012
10 min (1 sample)	(8)	6221.8 ± 68.8	70.1 ± 0.4	24.8 ± 0.2	0.232 ± 0.013
10 min (1 sample)	(9)	6266.8 ± 116.2	70.3 ± 0.6	24.7 ± 0.4	0.234 ± 0.023
20 min (2 samples)	(7)	7324.2 ± 53.6	76.4 ± 0.3	27.4 ± 0.5	0.263 ± 0.030
20 min (2 samples)	(8)	7414.3 ± 41.7	76.9 ± 0.1	27.6 ± 0.3	0.263 ± 0.022
20 min (2 samples)	(9)	7326.1 ± 117.7	76.4 ± 0.5	27.4 ± 0.6	0.261 ± 0.038
30 min (3 samples)	(7)	8114.8 ± 70.8	80.7 ± 0.4	29.7 ± 0.2	0.275 ± 0.011
30 min (3 samples)	(8)	8248.9 ± 107.7	81.3 ± 0.4	30.1 ± 0.2	0.283 ± 0.011
30 min (3 samples)	(9)	8063.1 ± 59.6	80.4 ± 0.3	29.5 ± 0.2	0.275 ± 0.016

Table 6. Calculations of confidence interval between means with a level of confidence equal to 0.975 for a 1-sample prediction (mean RMSE is considered). A confidence interval equal to 0 indicates that the difference is not statistically significant, suggesting there may be no real difference between the means. SAGE-V is a SAGE-Voronoi network.

	DCGRU	LSTM	SAGE	SAGE-V (9)	SAGE-V (7)	SAGE-V (8)	STGCN
DCGRU	0.00	8.50	39.61	40.45	41.17	40.89	39.62
LSTM	8.50	0.00	48.12	48.95	49.68	49.39	48.13
SAGE	39.61	48.12	0.00	0.00	1.56	1.27	0.00
SAGE-V (9)	40.45	48.95	0.00	0.00	0.00	0.00	0.00
SAGE-V (7)	41.17	49.68	1.56	0.00	0.00	0.00	0.00
SAGE-V (8)	40.89	49.39	1.27	0.00	0.00	0.00	0.00
STGCN	39.62	48.13	0.00	0.00	0.00	0.00	0.00

Table 7. Calculations of confidence interval between means with a level of confidence equal to 0.975 for a 2-sample prediction (mean RMSE is considered). A confidence interval equal to 0 indicates that the difference is not statistically significant, suggesting there may be no real difference between the means. SAGE-V is a SAGE-Voronoi network.

	DCGRU	LSTM	SAGE	SAGE-V (9)	SAGE-V (7)	SAGE-V (8)	STGCN
DCGRU	0.00	8.53	35.14	35.63	35.66	34.72	28.89
LSTM	8.53	0	43.67	44.16	44.19	43.25	37.42
SAGE	35.14	43.67	0.00	0.00	0.00	0.41	6.25
SAGE-V (9)	35.63	44.16	0.00	0.00	0.00	0.91	6.74
SAGE-V (7)	35.66	44.19	0.00	0.00	0.00	0.94	6.77
SAGE-V (8)	34.72	43.25	0.41	0.91	0.94	0.00	5.83
STGCN	28.89	37.42	6.25	6.74	6.77	5.83	0.00

Table 8. Calculations of confidence interval between means with a level of confidence equal to 0.975 for a 3-sample prediction (mean RMSE is considered). A confidence interval equal to 0 indicates that the difference is not statistically significant, suggesting there may be no real difference between the means. SAGE-V is a SAGE-Voronoi network.

	DCGRU	LSTM	SAGE	SAGE-V (9)	SAGE-V (7)	SAGE-V (8)	STGCN
DCGRU	0.00	9.14	30.58	32.07	31.48	30.27	21.22
LSTM	9.14	0.00	39.72	41.21	40.63	39.41	30.36
SAGE	30.58	39.72	0.00	1.49	0.91	0.00	9.36
SAGE-V (9)	32.07	41.21	1.49	0.00	0.00	1.80	10.85
SAGE-V (7)	31.48	40.63	0.91	0.00	0.00	1.21	10.27
SAGE-V (8)	30.27	39.41	0.00	1.80	1.21	0.00	9.06
STGCN	21.22	30.36	9.36	10.85	10.27	9.06	0.00

Table 9. Comparison of mean runtime (training + inference) of all tested approaches (seconds per epoch) and prediction (prediction on the whole test dataset). All methods were evaluated on a GPU, except for STGCN predictions, which were evaluated on a CPU due to insufficient GPU RAM.

Model	Training [s/Epoch]	Prediction [s/Prediction]
DCGRU	6.0	1.3
LSTM	3.0	1.3
SAGE	2.3	0.9
SAGE-Voronoi (7)–(9)	2.5	1.0
STGCN	10.1	24.2 (on CPU)

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Bielecki, P.; Hachaj, T.; Wąs, J. Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction. Appl. Sci. 2025, 15, 12899. https://doi.org/10.3390/app152412899

AMA Style

Bielecki P, Hachaj T, Wąs J. Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction. Applied Sciences. 2025; 15(24):12899. https://doi.org/10.3390/app152412899

Chicago/Turabian Style

Bielecki, Przemysław, Tomasz Hachaj, and Jarosław Wąs. 2025. "Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction" Applied Sciences 15, no. 24: 12899. https://doi.org/10.3390/app152412899

APA Style

Bielecki, P., Hachaj, T., & Wąs, J. (2025). Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction. Applied Sciences, 15(24), 12899. https://doi.org/10.3390/app152412899

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Sample and Aggregate Voronoi Neighborhood Weighted Graph Neural Network (SAGE-Voronoi) and Its Capability for City-Sized Vehicle Traffic Time Series Prediction

Abstract

1. Introduction

1.1. State-of-the-Art

1.2. Novelty of This Paper

2. Materials and Methods

2.1. Sample and Aggregate Network Layer

2.2. Voronoi Neighborhood Weighted Graph

2.3. Voronoi Neighborhood Graph Calculation

2.4. Voronoi Neighborhood in Graph Neural Network: SAGE-Voronoi

2.5. Dataset

3. Results

4. Discussion

5. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Abbreviations

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI