1. Introduction
In the face of the ongoing climate crisis and increasing regulatory pressure from international institutions, a thorough assessment of greenhouse gas (GHG) emissions and energy consumption in the road transport sector is becoming a key tool for effective environmental policy. Road transport remains one of the largest final energy consumers and a significant source of CO
2 emissions in Europe [
1].
In the European Union, the basis for the inventory of road transport emissions is computational models, the most widespread of which is COPERT (Computer Program to calculate Emissions from Road Transport). Developed by the European Environment Agency, this model is used to estimate emissions based on data on vehicle type, fuel type, fleet age, average driving speed, and driving cycles [
2,
3].
Despite its wide application, numerous studies have highlighted significant limitations of COPERT, especially in the context of the realistic representation of operating conditions. This model uses an average-speed approach, ignoring traffic variability, dynamic accelerations, stops, or the influence of driver driving style [
4,
5]. As a result, the estimates may significantly deviate from the actual values, which has significant implications for the credibility of national emission reports.
The uncertainty of COPERT emission estimates is significant. For example, [
6] showed that imprecision in input data, such as fleet composition or annual mileage, can lead to errors of up to several dozen percent. According to [
3], the uncertainty range in emission estimates can span from −58% to +76%, depending on driving conditions and driving style. In another study, [
2] found that neglecting the actual speed distribution leads to an underestimation of fuel consumption by 25% and NO
x emissions by 36%.
Due to these limitations, increasing attention is being paid to the use of high-resolution data from ITS (Intelligent Transport Systems), GPS, or Internet-of-Vehicles (IoV) systems [
7,
8,
9]. Such sources enable more realistic emission modeling and identification of local emission hotspots in urban agglomerations. Moreover, real-world road surface conditions influence vehicle dynamics, emissions, and energy consumption [
10]. This study demonstrated that using accelerometer data and neural networks enables effective pavement classification, providing valuable inputs for enhancing emission models with dynamic road condition factors. Comparable implementations of neural networks have also been applied in industrial measurement optimizations, such as the modeling of optimal probe measurement times on CNC machine tools [
11], the prediction of emissions from hydrogen-fueled engines [
12] in real driving emission diagnostics [
13], and vehicle condition classification under biodiesel operation scenarios [
14].
Validation of the COPERT model using empirical data remains an indispensable step. The research uses two main methods: laboratory tests on chassis dynamometers and measurements in real conditions using PEMS systems (Portable Emissions Measurement Systems). The results of these studies confirm that the COPERT model tends to underestimate emissions in dynamic situations [
15] and Kousoulidou et al. [
16] showed that NO
x emissions measured using PEMS can be up to 1.6 times higher than model estimates. Similar conclusions were also confirmed in our own studies, where significant discrepancies were shown between emissions recorded in cold-start conditions and values estimated by static models, especially in urban conditions and for alternative fuels [
17]. Additional laboratory studies involving dual-fuel diesel–CNG engines also highlighted the trade-off between NO
x reduction and the increase in unburned hydrocarbon and carbon monoxide emissions due to fuel substitution strategies [
18].
Additionally, energy consumption analyses in electric and hybrid vehicles indicate the significant impact of driving style and operating conditions on energy efficiency, with a visible optimal range of driving speeds [
17,
19]. Similar effects have been confirmed in internal combustion vehicles during chassis dynamometer studies, where energy efficiency was shown to depend strongly on vehicle load and velocity patterns [
20]. Observations in the context of small-scale biomass machinery additionally demonstrated that CO
2 emissions can reach nearly 2 kg/h, depending on operating conditions and machine type [
21]. Complementary work on dust and heavy metal emissions from friction systems, omitted in most models, confirms the need for a comprehensive approach to emission inventory [
22]. The importance of synergy between experimental data and computational methods using AI in a local approach has also been emphasized in studies comparing ICE and BEV vehicles [
23]. Similar findings were reported by [
24], who emphasized that emissions from tribological systems, such as brakes and tires, contribute significantly to total particulate matter and heavy metal emissions and are mainly overlooked in standard emission models such as COPERT.
Discrepancies also apply to CO
2 emissions and energy consumption, and [
25] observed that for LPG-fueled vehicles, CO
2 emissions predicted by COPERT were significantly overestimated compared with the results obtained on the chassis dynamometer. PM emissions differed by up to 90%; [
26] indicated that PN emissions can be up to eight times higher than the catalogue values adopted in COPERT and HBEFA.
In the context of alternative fuels, the impact of their properties on the efficiency of drive systems was also confirmed in research by Mikielewicz et al. [
27], who showed that using biofuels in gas turbines leads to changes in the efficiency of energy cycles. This indicates the need to take into account the fuel characteristics in emission and energy consumption models, especially for vehicles and installations using unconventional fuels.
The importance of driving speed as an emission variable is well documented. Farzaneh et al. [
28] showed that for spark-ignition-engined vehicles, CO
2 emissions increase rapidly at speeds above 65 km/h, with a maximum at 90–105 km/h. In the city, where speeds below 30 km/h predominate, emissions increase due to frequent changes in driving dynamics and limited engine thermal efficiency.
Driver behavior, such as driving style, also directly impacts emissions. Alessandrini et al. [
29] proved that applying eco-driving principles can reduce CO
2 emissions by up to 30%. Suarez et al. [
30] confirmed that acceleration and speed variability translate directly into fuel consumption and emissions, further exposing the shortcomings of models based on average values. Additionally, technical factors such as tire conditions may influence driving dynamics and energy consumption. Orynycz et al. [
31] demonstrated that deteriorated or improperly maintained tires significantly impair braking efficiency, potentially altering driving behavior and affecting fuel use.
Considering the results cited, it is becoming increasingly important to develop and apply next-generation predictive models that can take into account nonlinear dependencies and the local variability of road traffic conditions. Integrating data from measurement systems and using artificial intelligence methods—such as artificial neural networks—offer new perspectives on increasing the accuracy of emission forecasts. Recent developments include XAI-enhanced deep learning for CO
2 prediction in vehicles [
32], two-stream neural architectures for COPERT corrections of NO
x [
33], ensemble NNs handling uncertainty in energy efficiency prediction [
34], PEMS-based COPERT validations refined by NN engine maps [
35], and lightweight surrogate emission models such as NeuralMOVES covering CO
2 estimation under variable traffic [
36]. Other promising approaches include real driving emission prediction via Elman neural networks [
37], DNN-based instantaneous emission forecasting [
38], systematic literature surveys on ML for carbon emission estimation [
39], hybrid DNN/LSTM models for soot prediction in transient cycles [
40], and fine-grained street-level CO
2 emission mapping using ML [
41]. Real-world applications of intelligent transportation systems, where driving speeds are optimized based on real-time data, have already demonstrated significant reductions in CO
2 emissions and energy consumption [
42]. Recent studies by Kozłowski et al. [
43] have demonstrated that instantaneous energy consumption and recuperation events significantly impact the overall energy balance during real-world driving, emphasizing the inadequacy of models relying solely on average-speed approaches.
This work aims to provide a detailed analysis of the discrepancies between the CO2 emission and energy consumption values estimated by the COPERT model and the data measured during vehicle tests in laboratory conditions (WLTP). Additionally, the effectiveness of using artificial neural networks (MLP, SANN) to model the relationship between speed, emissions, and energy consumption was assessed. The results will be used to determine the usefulness of the COPERT model in the local context and formulate recommendations for further calibration. The study not only evaluates the accuracy of the COPERT model but also explores the potential of integrating machine learning methods such as MLP and SANNs into emission modeling. These techniques offer an opportunity to improve the model’s adaptability by enabling local calibration based on empirical data and real-world traffic dynamics.
Global climate policies have significantly tightened vehicle emission regulations, encouraging the development of advanced modeling tools that accurately reflect real driving emissions (RDE). Conventional inventory models such as COPERT, while widely used, rely on averaged driving cycles, which may not capture dynamic driving behaviors. This study introduces artificial neural networks (ANNs) as a flexible alternative for emission prediction, capable of modeling complex nonlinear relationships between speed, acceleration, load, and emissions. Comparable implementations of ANNs have been successfully applied in diverse transport-related contexts, including fuel-saving driver behavior systems [
44], pavement condition classification [
45,
46], and real-time vehicle parameter estimation [
47]. This research fills a gap in integrating laboratory WLTP data with ANN architectures to improve emission estimations and support life-cycle assessment methodologies.
2. Materials and Methods
The tests were carried out on a chassis dynamometer using a vehicle meeting the Euro 6 emission standard and equipped with a spark-ignition engine. The tests were carried out in accordance with the WLTP procedure, but for comparison purposes only those cycle stages were selected that correspond to urban and suburban driving conditions at a maximum speed of up to 80 km/h. This selection allowed for a focus on the conditions of real traffic, which are most important from the point of view of emissions in urban environments. The technical parameters of the tested vehicle are listed in
Table 1.
For the purposes of comparative analysis, the emission and energy consumption values obtained from measurements on the chassis dynamometer were compared with the results of the COPERT model. For this purpose, a vehicle configuration corresponding to the class and technology of the tested car was selected in the COPERT calculation environment, assuming identical average-speed values for the analyzed urban and suburban phases. On this basis, model data on CO2 emissions and fuel consumption were obtained, which were subjected to direct comparison with the experimental results.
In addition, a set of input variables including vehicle speed, instantaneous energy consumption, and selected emission components were used to build predictive models. On this basis, artificial neural networks of the MLP (Multi-Layer Perceptron) and SANN (Self-Associative Neural Networks) types were used, the aim of which was to predict energy consumption and map the relationship between emissions and driving parameters.
The dependent variable in the classical analysis is CO
2 emission [g/km], and in the energy analysis it is energy consumption [MJ/km]. In the case of modeling using neural networks, the instantaneous speed value and emissions were also predicted based on input data representing typical operating profiles (
Figure 1 and
Figure 2).
The next part of this paper presents the results of the emission and energy studies and their comparison with the COPERT model estimates.
The next step was to determine the energy consumed [MJ/km];
Figure 3 presents a comparison of travel speed and energy consumed.
The bag plot graph shown in
Figure 3 shows the dependence of energy consumption (EC) on speed (v). The blue points in this plot (o) represent the actual EC values [MJ/km] for different speeds. The square (■) indicates the median or central value of energy consumption for the analyzed data. The inner darker area (“bag”) contains the central 50% of the observations, meaning that values in this range are most typical. The outer lighter area (“fence”) indicates the area where most of the observations are, but points outside it may be potential outliers, marked with an asterisk.
The EC [MJ/km] values decrease with increasing speed, reaching a minimum in the range of about 50–70 km/h. For higher speeds (75–85 km/h), energy consumption increases again. However, for low speeds (5–10 km/h) energy consumption is much higher and goes beyond the “fence” area. This shows that moving at very low speeds generates disproportionately high energy consumption. The graph does not show any significant outliers for high speeds, which suggests more stable energy consumption in this range. Furthermore, based on
Figure 3, it can be concluded that the most efficient speed range is 50–70 km/h, where the EC [MJ/km] is the lowest. On the other hand, very low and high (80–85 km/h) speeds lead to high energy consumption.
The variations in energy consumption (EC [MJ/km]) with speed observed in
Figure 3 can be explained by the processes occurring in the combustion chamber of the spark- ignition engine. The key factors are combustion efficiency, thermal losses, internal resistance, and torque characteristics.
Figure 4 shows the relationship between vehicle speed (
X-axis) and specific energy consumption (EC [TJ/km],
Y-axis) for the training and validation samples used in the MLP model. A clear decrease in energy consumption is visible with increasing speed, ranging from 10 to about 45 km/h, which corresponds to typical urban and suburban conditions.
In the higher speed range (above 55 km/h), the EC values stabilize at a lower level, which suggests that a more efficient operating point of the drive system has been achieved. This may indicate the optimization of combustion processes and reduction of mechanical losses during stabilized driving in extraurban conditions. The lack of values in the speed range > 85 km/h results from the limitation of the study to city and suburban cycles, consistent with the WLTP test range (without the motorway phase). The distribution of the data confirms the accuracy of their selection for training neural models that are to predict energy consumption in the range of real operating conditions.
Figure 5 shows the relationship between specific energy consumption (EC) and carbon dioxide (CO
2) emissions based on measurement data from chassis dynamometer tests and linear regression results. A positive correlation between these variables is visible, but the linear correlation coefficient (r = 0.21757) indicates a weak linear relationship. The determined linear regression equation has the form:
Despite the clearly positive regression trend, the large scatter of points and wide confidence intervals (95%) indicate the significant variability of results, especially in the low and high energy consumption ranges. This variability may result from differences in engine operating conditions (e.g., warm-up phase, load) and driving dynamics, which strongly affect the instantaneous CO2 emission values.
Due to the detected positive relationship, the analysis indicates the need to use more advanced models capable of capturing nonlinearities and local anomalies.
The need to model nonlinear, complex relationships between many engines’ operating parameters, emissions, speed, and energy consumption prompted the authors to develop a matrix model.
Exhaust gas emission E
i for each substance i ∈ {CO
2, CO, THC, CH
4} depends on the fuel combustion characteristics and the engine load. We can write it as a function:
where
αi—coefficient related to the intensity of fuel combustion,
βi—dependence of emissions on speed,
γ
i—constant component (dependent on engine technology and emission standard).
EC = f(v, v2, dv/dt)—in the matrix equation it appears in the feedback.
The matrix coefficients are determined based on the vehicle physical parameters and empirical emission factors.
The experimental research should be conducted under real-world traffic conditions, covering routes with diverse characteristics, including urban, suburban, and highway segments. Measurements should be carried out using a Portable Emissions Measurement System (PEMS), which enables the direct recording of instantaneous emissions of major pollutants such as carbon dioxide (CO2), carbon monoxide (CO), total hydrocarbons (THC), and methane (CH4). Simultaneously, vehicle speed, acceleration, and instantaneous fuel consumption should be recorded. Traffic data should be obtained using a GPS system integrated with an inertial measurement unit (IMU), while instantaneous fuel consumption should be recorded using a mass flowmeter system.
The collected data should be synchronized in time and then processed to obtain structured input and output vectors. The input data should include values of vehicle speed, its square, acceleration, and instantaneous fuel consumption, whereas the output data would consist of the recorded emission values of the respective pollutants. To describe the relationship between emissions and driving parameters, a linear model of the form:
is applied, where Y(t) is the emission vector, X(t) is the input variable vector including driving and combustion parameters, while matrix A and vector B contain the coefficients estimated during the analysis. The coefficients should be estimated using the least squares method, extending the input data matrix with an additional column of ones to account for the constant terms, following the approach used in the PHEM model [
33,
34] and COPERT [
35,
36]. The developed model allows for a quantitative assessment of the impact of driving parameters on pollutant emission levels under real-world vehicle operation conditions.
Given the limitations in performing large-scale on-road measurements under real-world driving conditions, the present study is based on chassis dynamometer data. Consequently, the experimental analysis was conducted using an alternative methodology, specifically selected to correspond to the characteristics and constraints of the available measurement dataset.
In the initial phase of the WLTP test, covering speeds in the range of 5–20 km/h, increased fuel consumption and CO2 emissions were observed. This phenomenon results primarily from the unstable operation of the drive system after a cold start and frequent speed changes characteristic of city driving. In this range, thermal and mechanical losses dominate, and the engine operates in conditions of reduced efficiency, which results in increased energy consumption per unit of distance.
The lowest energy consumption was recorded in the range of 50–70 km/h, which indicates the existence of optimal operating conditions. In this range, a favorable balance is achieved between aerodynamic drag, mechanical losses, and combustion efficiency. The engine runs stably, and the power demand is low enough to achieve the highest energy efficiency and minimize CO2 emissions.
The increase in energy consumption observed in the upper range of test speeds (up to 80 km/h) can be attributed to increasing aerodynamic drag and increased demand for engine power. Although the operating conditions of the drive system are more stable than at low speeds, increasing motion resistance requires more intensive engine work, which translates into increased fuel consumption and carbon dioxide emissions.
The conclusion is that both in the lower and upper speed ranges, energy consumption increases due to the decrease in the energy efficiency of the drive system. The 50–70 km/h range is the most energy-efficient operating point, which is confirmed by the minimum EC values observed in the tested cycle.
To better understand the complex relationships between operating parameters and emissions, artificial intelligence models were also used, namely MLP and SANN artificial neural networks.
Artificial neural networks were used to model the relationship between exhaust emission parameters and speed and energy consumption. Particular attention was paid to SANNs (Self-Associative Neural Networks), which are used in time series analysis. Their advantage is the ability to capture nonlinear dependencies and time delays, which makes them a more flexible tool compared with classical statistical methods such as linear regression or ARIMA models.
SANNs use self-association, meaning that they learn to map input data based on their own previous states, preserving the temporal context of the phenomenon being analyzed. In practice, this allows them to effectively predict emissions or vehicle speed from training data. By fully connecting the layers, SANNs can effectively model time-dependent variables such as exhaust emissions in dynamic driving conditions.
To ensure proper generalization of the neural network model to unseen data, cross-validation and regularization techniques were systematically applied. During hyperparameter optimization, a k-fold cross-validation procedure (with k = 5) was utilized to partition the dataset into multiple train-validation subsets, allowing the assessment of model stability across varying data splits. This approach minimized the risk of overfitting and provided a reliable estimate of model performance under real-world variability, including differences in driving cycles, traffic intensity, road gradients, and transient engine states.
Additionally, dropout regularization was implemented with a dropout rate of 15% after each hidden layer. This technique randomly deactivates a fraction of neurons during training, forcing the model to avoid excessive reliance on any single subset of features and thus improving its ability to generalize across different operational conditions. The combination of k-fold cross-validation and dropout regularization contributed to stable validation loss curves and consistently small gaps between training and validation errors, indicating robust model learning and limited overfitting.
When applying SANNs to time series regression, the goal is to minimize the prediction error. This can be expressed using a loss function (e.g., mean square error—MSE) [
28,
30]:
where N is the number of samples in the test set; y
i is the actual value on the i-th sample;
is the predicted value by the model; and θ are the model parameters (e.g., weights in the neural network).
In SANNs, the learning process consists of adjusting the weights in the neural network to minimize the difference between the model predictions and the actual results. This is performed using the classical backpropagation algorithm, which calculates the gradient of the loss function with respect to the network parameters and updates them in the direction of reducing the error.
In order to predict time series values, a neural network is trained on a set of training data, which are treated as inputs. In the case of regression, the model’s task is to predict future values based on past data.
The prediction algorithm using SANNs is as follows:
Data preparation: Temporal data is typically transformed into inputs and outputs in the form of pairs (xi, yi), where xi is the input in the form of lagged time series values and yi is the value to be predicted.
Model Training: The SANN neural network is trained to minimize the loss function, based on the training data.
Forecasting: Once the network is trained, the model is used to predict future values based on available data.
The advantage of using SANNs in time series analysis is the ability of neural networks, including SANNs, to capture complex, nonlinear dependencies in time series data that may be difficult to model using classical statistical methods. Moreover, another advantage is adaptability. Neural networks can adapt to changing patterns in time series data, which is useful in the case of nonstationary time series. Additionally, performance in long-term forecasting is observed. Thanks to their auto-association mechanism, SANNs can exploit long-term dependencies in the data.
3. Results
This section presents the results of emission-based vehicle speed prediction using various MLP neural network architectures. Neural networks, especially Self-Associative Neural Networks (SANNs), are a powerful tool for modeling and forecasting time series in the context of regression. SANNs are particularly suitable for modeling nonlinear and nonstationary time series, as they can capture complex dependencies and adapt to evolving data patterns. We applied both Multi-Layer Perceptron (MLP) and Static Artificial Neural Network (SANN) models. Input variables were normalized using Min-Max scaling. Training utilized 70% data for learning, 15% for validation, and 15% for testing. Early stopping and L2 regularization were implemented to prevent overfitting. Hyperparameters were as follows (
Table 3):—Layers: 3 (input-hidden-output)—Neurons: 5-10-1—Learning rate: 0.001—Optimizer: Adam—Epochs: 300.
1: network for predicting v [km/h]
First, a model was created using artificial neural networks of the speed of harmful components emitted in exhaust gases (CO
2, CO, THC, CH
4, NMHC) and energy consumption.
Table 2 presents the input data, and the goal divided into three sets.
The validation data covers a wide speed range, which allows for the assessment of the model’s prediction quality in a variety of driving conditions, both urban and suburban.
The analysis of the CO
2 emission values in
Table 2 indicates higher emission levels at low speeds (10–20 km/h). This phenomenon may be related to the intense variability of the engine load and frequent changes in driving speed in urban conditions. With increasing speed, CO
2 emissions are reduced, reaching lower values in the range of 50–70 km/h, which suggests that in this range the driving conditions are more conducive to the vehicle’s energy stability.
A similar trend is observed for energy consumption (EC [MJ/km]), which reaches its highest values at the lowest speeds and then decreases with increasing speed. In the range of 50–70 km/h, the minimum EC values were recorded, which indicates the existence of an optimal driving range in terms of energy efficiency. Above this range, a new increase in EC is observed, which may be due to the increase in power demand at higher driving speeds.
Table 3.
Correlation coefficients for MLP models.
Table 3.
Correlation coefficients for MLP models.
| Train [km/h] | Test [km/h] | Validation [km/h] |
---|
1.MLP 6-6-1 | 0.958644 | 0.832050 | 0.987982 |
2.MLP 6-4-1 | 0.956190 | 0.832050 | 0.993118 |
3.MLP 6-4-1 | 0.963044 | 0.832050 | 0.997390 |
4.MLP 6-11-1 | 0.959227 | 0.832050 | 0.984440 |
5.MLP 6-10-1 | 0.961213 | 0.832050 | 0.998323 |
Table 4 shows the correlation coefficients for different MLP (Multi-Layer Perceptron) models used to predict vehicle speed (km/h). The train column shows the correlation values on the training set for individual network models (rows). Test shows correlation on the test set. Validation shows correlation on the validation set. The correlation coefficient (R) measures the strength of the relationship between the actual and predicted values. Values close to 1 indicate a strong relationship (the model predicts well). Values close to 0 indicate no relationship, and close to −1 indicate an inverse relationship (which is not the case in our case). Interpreting the values of the correlation coefficients obtained in
Table 4, it can be seen that the values for Train 0.958—0.961 are very high, which indicates that the models learn the relationships in the training set well. The values for Test (0.832) are constant for all models, which means that the model achieves moderate quality on new data. The values for Validation 0.984—0.998 are very high, hence the models generalize well on the validation set. Furthermore, the MLP 6-10-1 model has the highest validation correlation 0.998323, suggesting the best agreement with the actual values.
From the above analysis of
Table 4, it can be concluded that all models are well trained R>0.95 for Train. The model performs worse on the test set R = 0.832, which may indicate insufficient generalization or differences in the test data. As mentioned above, the best model is MLP 6-10-1 because it has the highest correlation coefficient on the validation set 0.998.
Table 4 shows the prediction statistics for different MLP (Multi-Layer Perceptron) models for the vehicle speed target (km/h) in three sets: train—the training set on which the model was trained; test—the test set used to assess the quality of the model; and validation—the validation set, checking the generalization of the model and any gaps (which do not occur in our data). The stability and precision of the MLP 6-10-1 model indicates its potential for applications in predictive emission inventories, especially in urban environments.
Prediction ranges (minimum and maximum predicted values), residuals (differences between actual and predicted values), and standardized residuals (residuals divided by the standard deviation) were analyzed.
Examining the data in
Table 4, it is clear that the models predict values in different ranges. Train: All models learn well, but they differ in the range of values. MLP 6-11-1 has the narrowest range (8.94–68.88 km/h), which may indicate limited model flexibility. MLP 6-6-1 and MLP 6-10-1 have wider ranges, which may be more realistic. Test and Validation: MLP 6-10-1 and MLP 6-6-1 predict values close to the actual validation data; MLP 6-4-1 has the widest range on validation (72.18 km/h), which may indicate greater variability [
28].
The most stable models in terms of prediction range are MLP 6-6-1 and MLP 6-10-1.
Minimum and Maximum Residuals show how wrong the model is. In Train, the best training model is MLP 6-10-1 because it has the smallest residual spread (–13.43 to 16.83), which means that it represents the training data well. In Test, the largest errors are in MLP 6-6-1 and MLP 6-4-1 because they have the largest deviations (–19.11 to 18.27 km/h). The best test model is MLP 6-10-1 (–16.83 to 16.83 km/h), due to its more stable predictions. In Validation, the smallest validation residuals are in MLP 6-11-1 and MLP 6-10-1 because they have the smallest errors, which indicates good generalization [
48,
49].
This table also shows that the best model, this time in terms of residuals on validation, is the 6-10-1 model.
Again, the MLP 6-10-1 model is the best of the analyzed models generated during training on the dataset available from the experiment analyzed in this article. It is characterized by a good fit to the training set, stable residuals in validation and test, and the best prediction range. In addition, it generalizes well to new data.
Figure 6 shows the spatial visualization of the speed estimation error by the MLP 6-10-1 model as a function of the target value (Target), the predicted value (Output), and the energy consumption (EC). The analysis shows that the lowest values of the prediction error (green and blue) occur in the range of medium speeds, especially between 30 and 60 km/h, which confirms the effectiveness of the model in this operating range. An increase in the error (yellow–red colors) is observed at extremely low and high speeds, which may be the result of both greater data variability in these areas and a smaller number of observations in the training set. Despite local extremes, the overall course of the error surface remains relatively regular, which indicates good model stability and its ability to generalize in the analyzed data range [
48,
49].
Figure 7 shows the three-dimensional speed estimation error surface generated by the MLP 6-10-1 model, where the input variable is CO
2 emission. A curvilinear relationship is clearly visible; the lowest CO
2 values (dark green) occur in the medium speed range (approx. 30–60 km/h), both on the target and predicted side. On the other hand, in the low (<20 km/h) and high speed (above 70 km/h) areas, a sharp increase in CO
2 emission is observed (yellow, orange, and red areas), which corresponds to the increased energy demand of the vehicle in these conditions [
48,
49].
This model shows good prediction agreement in the range of average speeds, which confirms the precision of the representation for typical operating conditions. At the same time, the surface structure indicates the increasing difficulty of estimation in the areas of extreme CO
2 values, which are characteristic of dynamic phases of city driving or accelerations on suburban sections. The whole confirms the effectiveness of the MLP 6-10-1 model in representing complex nonlinear dependencies of emissions on variable road traffic [
27].
Figure 8 shows a comparison of vehicle speed prediction results obtained using five MLP (Multi-Layer Perceptron) neural network models with different architectures (marked as MLP 6-6-1, 6-4-1, 6-11-1, 6-10-1) with respect to the actual values (blue line). The values are presented as ordered measurement cases for training and validation sets.
The apparent agreement of the prediction course with the reference curve in the range of 10–75 km/h confirms the good performance of all architectures in reproducing the general trend of speed changes. The MLP 6-10-1 model stands out in particular, which most faithfully reproduces the speed variability in the entire range of cases, with minimal scatter and without visible over- or underestimation in the speed range of 30–55 km/h.
Noticeable deviations of predictions from reference values in some cases—especially for MLP 6-4-1—may indicate insufficient network depth to capture complex nonlinear dependencies between emission and velocity. Differences between architectures highlight the importance of selecting the number of hidden neurons in the context of estimation accuracy. These conclusions support the selection of the MLP 6-10-1 model for further prediction analyses.
Analysis of experimental data obtained on a chassis dynamometer under WLTP test conditions showed a clear variability of pollutant emissions and energy consumption depending on vehicle speed. As shown in
Figure 1, CO
2, CO, THC, and CH
4 emissions take the highest values in the low-speed range (below 20 km/h), which is characteristic of urban operating conditions. In this range, incomplete combustion and high thermal and mechanical losses dominate.
Figure 2 integrates emission data with operating parameters, confirming observed trends and indicating correlations between dynamic speed changes and emission spikes. Energy consumption values (
Figure 3 and
Figure 4) show a clear “U” shape; EC [MJ/km] is highest at very low (5–20 km/h) and high speeds (above 75 km/h), and the minimum occurs in the range of 50–70 km/h.
Further analysis of the linear regression of CO
2 emissions on energy consumption (
Figure 5) showed a moderate positive correlation, but with considerable variability, suggesting the need to use more advanced nonlinear models.
In this context, artificial neural networks of the MLP and SANN type were used, which allowed for mapping the relationship between exhaust emissions, speed, and energy consumption.
Table 4 presents the correlation coefficients of speed prediction, reaching values above 0.98 in the validation set, with the MLP 6-10-1 model showing the best generalization properties.
Figure 6,
Figure 7 and
Figure 8 show spatial visualizations of prediction errors and mapping of curvilinear dependencies of CO
2 emissions and energy consumption with respect to actual values. The MLP 6-10-1 model maintains high prediction accuracy in the speed range of 30–60 km/h, while the largest errors are observed in the range of extreme speeds, which may result from greater variability of operating conditions and smaller representation of these data in the training set.
Neural models outperformed COPERT matrix models in RMSE, MAE, and R2 metrics across all datasets. Cross-validation confirmed minimal overfitting. U-shaped relationships were observed due to low thermal efficiency at low speeds, rising aerodynamic drag at high speeds, and optimal efficiency zones at mid-range velocities.
4. Discussion
The experimental results obtained confirm the limitations of the COPERT model known from the literature in terms of the realistic representation of emissions and energy consumption in dynamic conditions. The overestimation of CO
2 emissions and energy consumption observed in the study is consistent with the findings of Bellin et al. [
25], who showed that COPERT can significantly overestimate emissions, especially for vehicles powered by alternative fuels (e.g., LPG), where the differences from the chassis dynamometer test data exceeded 50–90% for CO
2 and PM. Keuken et al. [
26] also indicated that particulate matter (PN) emissions can actually be up to eight times higher than those estimated by COPERT and HBEFA, which emphasizes the need to verify these models in operating conditions.
One of the key factors of estimation error is the use of average speeds by COPERT, which does not take into account the variability of the microcycle driving. As shown by Lejri et al. [
2], the failure to take into account the full speed distribution results in the underestimation of fuel consumption by 25% and NOx emissions by up to 36%. Our results are consistent with these findings; CO, THC and CH
4 emissions increase significantly in the low-speed range (5–20 km/h), typical for the urban environment, which indicates the dominance of suboptimal combustion conditions (cold engine, intermittent driving, and rich mixture). The results obtained using MLP and SANN models suggest that these approaches can be used to supplement the COPERT model by capturing nonlinear patterns and local anomalies that are not reflected in average-speed-based estimations. This hybrid approach—combining deterministic models with AI-based corrections—could enhance the accuracy and relevance of national emission inventories, especially in urban environments. Similar findings were reported by Gemma et al. [
50], who demonstrated that motorway traffic emissions are significantly affected by stochastic fluctuations in traffic dynamics, and that using average-speed approaches underestimates the variability and magnitude of real-world emissions.
In turn, Dey et al. [
3] drew attention to the enormous sensitivity of the emission model to input data; their analysis showed uncertainty ranging from −58% to +76%. In the context of our research, this confirms the need for local calibration of the COPERT model, especially in conditions deviating from the assumptions of test cycles (e.g., WLTP), as well as the adjustment of model parameters to the realities of the domestic fleet. Analogous conclusions regarding the influence of the choice of operational technologies on energy consumption and emissions were also confirmed in the agricultural sector. Orynycz [
30] showed that using no-tillage technologies in rapeseed cultivation significantly improves the energy balance, which emphasizes the need to consider the characteristics of technological processes when estimating emissions in various sectors of the economy.
The need for local verification is also confirmed by Notter et al. [
6], who emphasize that errors in emission inventories result not only from inaccuracies in input data, but also from not taking into account seasonality and driver behavior variability. In practice, this means that emission models should be supported by real data, both from stationary and mobile PEMS systems. Moreover, the energy balance and emissions associated with fuel production processes should also be considered. Orynycz and Wasiak [
51] demonstrated that the embodied energy of biodiesel production significantly affects its overall energetic effectiveness, highlighting the importance of holistic assessments in emission modeling.
Our results also show that energy consumption increases at both low and high speeds, with a minimum in the 50–70 km/h range. Such characteristics have already been demonstrated in the literature, e.g., by Wen et al. [
4] and Ren et al. [
5], who argue that the actual energy balance of a vehicle strongly depends on dynamic overloads, instantaneous power, and rolling and aerodynamic resistance.
The use of artificial neural networks, such as MLP 6-10-1, allowed for accurate representation of nonlinear relationships between energy consumption, CO
2 emissions, and driving speed. The achieved correlation coefficients (>0.98) in the validation set indicate that machine learning models are a reliable alternative to classical methods based on average values. According to the literature, the use of machine learning in vehicle emission prediction (e.g., SANN, MLP) allows for significantly improving the temporal and spatial resolution of the model and adapting it to specific local conditions [
5]. Similar advantages of machine learning in improving the accuracy of complex data analysis tasks have been demonstrated in other fields, such as reflective ultrasound tomography [
52], suggesting the broader potential of AI-based models over traditional deterministic approaches.
The applied artificial neural network (ANN) models effectively capture transient driving dynamics that are often neglected in average-speed approaches such as COPERT. Incorporating real-world transient operating data substantially improves the predictive accuracy of emission estimates, particularly for urban driving conditions and detailed emission inventories. However, certain limitations of the current study include the restricted size of the test vehicle fleet and the absence of Portable Emissions Measurement System (PEMS) data, which could further enhance model calibration and validation.