Use of Artiﬁcial Neural Networks to Predict Fuel Consumption on the Basis of Technical Parameters of Vehicles

: This publication presents a multi-faceted analysis of the fuel consumption of motor vehicles and the way human impacts the environment, with a particular emphasis on the passenger cars. The adopted research methodology is based on the use of artiﬁcial neural networks in order to create a predictive model on the basis of which fuel consumption of motor vehicles can be determined. A database containing 1750 records, being a set of information on vehicles manufactured in last decade, was used in the process of training the artiﬁcial neural networks. The MLP (Multi-Layer Perceptron) 22-10-3 network has been selected from the created neural networks, which was further subjected to an analysis. In order to determine if the predicted values match the real values, the linear Pearson correlation coefﬁcient r and coefﬁcient of determination R 2 were used. For the MLP 22-10-3 neural network, the calculated coefﬁcient r was within range 0.93–0.95, while the coefﬁcient of determination R 2 assumed a satisfactory value of more than 0.98. Furthermore, a sensitivity analysis of the predictive model was performed, determining the inﬂuence of each input variable on prediction accuracy. Then, a neural network with a reduced number of neurons in the input layer (MLP-20-10-3) was built, retaining a quantity of the hidden and output neurons and the activation functions of the individual layers. The MLP 20-10-3 neural network uses similar values of the r and R 2 coefﬁcients as the MLP 22-10-3 neural network. For the evaluation of both neural networks, the measures of the ex post prediction errors were used. Depending on the predicted variable, the MAPE errors for the validation sets reached satisfactory values in the range of 5–8% for MLP 22-10-3 and 6–10% for MLP 20-10-3 neural network, respectively. The prediction tool described is intended for the design of passenger cars equipped with internal combustion engines.


Introduction
The fuel consumption of motor vehicles powered by internal combustion engines unchangeably constitutes an essential issue in forming not only the development trends of the automotive industry, but is also an important factor in the economics of transport [1][2][3][4]. On the basis of the analysis of a set of the research papers, this problem is certainly a complex issue and must be considered as a many-faceted one. In logistics and business economics, fuel consumption generates costs relating to the transport of materials and products, which are an important component of a company's expenditure [5]. From the viewpoint of enterprise economics, an underlying direction of an optimisation is to minimise the costs linked to fuel consumption [6][7][8].
Climate change and the degradation of the natural environment, related to the human and industrial activities, are an aetiology of an ecological approach to the processes of the consumption of petroleum-based fuels in vehicles powered by internal combustion engines [9][10][11]. The manufacturers from the automotive industry are obliged to fulfil the more and more strict EURO emission standards, which are applicable within the European Union. The assumption of EURO standards consists in protecting the natural environment by limiting the emission of nitrogen oxides, hydrocarbons, carbon oxides and particulates. The objective of the successive tightening up of the exhaust emission standards is to reduce the air pollution caused by vehicles powered by internal combustion engines, especially in large urban agglomerations [12]. According to Regulation (EU) 2019/631 of the European Parliament and of the Council, the fleetwide CO 2 Emission Targets for new passenger cars are 95 g/km (if we want to compare the previous standards in years 2015-2020, it was on the level of 130 g/km). The supply and exhaust systems of the internal combustion engines were the subject of a large amount of research concerning the decrease in fuel consumption and the emission of harmful chemicals [1,[13][14][15][16]. In the case of fuel consumption, optimisation and the reduction in the emission of harmful chemical compounds by vehicles powered by a hybrid powertrain, the genetic algorithms have already been implemented, lowering the values of the optimised variables by over 40% [17].
In the literature, one of the current research problem is the impact of the technical solutions and road conditions on fuel consumption. The publication by C. D. Rakopoulos et al. [18] contains the influence analysis of vegetable-origin additives to diesel oil on the volumetric fuel consumption in a direct injection diesel engine. Relationships between road conditions, driving style and combustion in buses and passenger cars providing taxi services were considered in the research papers authored by Jinghui Wang et al. [3] and Ying Yao et al. [19], respectively. In an article by P. Typaldos et al. [20] attempts were made to determine the conditions and kinematic trajectories of the vehicle that could lead to a minimisation of fuel consumption according to the principles of eco-driving. In the abovementioned papers, the statistical and analytical methods were used to analyse the relationships between the variables under consideration.
Artificial Neural Networks (ANNs) as a research tool for estimating fuel consumption were used in many published scientific studies. They were applied in modelling problems for aircrafts [21], sea vessels [22][23][24][25][26][27], mining trucks [28] and agricultural tractors [29]. In publication [30], the authors implemented artificial neural networks to determine a relationship between the engine velocity and torque and the actual fuel consumption while driving trucks, and the achieved results were encumbered with mean percentage errors at a low level (below 5%). The advantage of this solution was a high accuracy of an estimation of the values of the predicted variables. In the case of passenger cars, research studies where the ANN was used to search for the relations between technical parameters and conditions of the use and fuel consumption were found [31]. In publication [32], the authors used artificial neural networks to predict fuel consumption and torque generated by a spark-ignition engine, obtaining a mean absolute percentage error (MAPE) under 3%.
The research papers presented above were focused on the search for the relationships between the studied variables in a strictly defined vehicle, machine or another technical object. In the paper authored by Zargarnezhad S. et al. [8] an attempt was made to use neural networks in fuel consumption prediction on a database of 51 Toyota and Hyundai vehicles. A network combining five input and one output variables was proposed. On the basis of the correlation coefficient values of training, testing and validation sets, ranging from 0.94-0.99, with mean square error (MSE) at a level not exceeding 0.6, an acceptability of prediction was determined. Table 1 presents the results of the selected studies concerning the research problems related to fuel consumption, engine performance and emission of the chemical compounds generated during the fuel combustion, to which artificial neural networks were applied. Among these results, the analyses focused on the area of research related to engine performance, fuel consumption and the emissions of chemical compounds from compression ignition engines. A common property of most of the analyses compared is the high accuracy of the developed predictive models, where MAPE errors fail to exceed 5%. The essence and the originality of this study is the determination of the synergistic impact of many technical parameters on fuel consumption for a broad range of the motor vehicles-modern passenger cars. The effect of such an approach is the development of a general predictive model for a prediction of fuel consumption. In this study, fuel consumption corresponding to the operating environment of the vehicle is adopted, as reflected by three predictor variables (city, motorway and mixed cycle). The outcomes of the formerly analysed research papers give a opportunity of artificial neural networks application as tools to develop predictive models. The use of artificial neural networks allows for conducting an effective analysis of a very large quantity of the empirical data. Contrary to the former research, the authors of the publication considered a broad range of passenger cars featuring different technical parameters. This paper constitutes a more generalised approach in comparison to the previously published predictive models based on artificial neural networks.

Materials and Methods
Artificial Neural Networks (ANNs) constitute an artificial intelligence tool, a method of data analysis that was widely used as an alternative to the traditional analytical methods. The objective of their use is to create a time-effective and accurate predictive model for the objects or phenomena under investigation [45]. The construction of the artificial neural network is modelled after the human brain, and for this reason, it is composed of neurons and synaptic pathways occurring between them. Each neuron may receive sets of information reflected by a value of a quantity of the variables, but only one piece of information represented by the value of one variable is the output [46]. Among the various types of artificial neural networks, MLP (Multi-Layer Perceptron) networks are the most prevalent and most commonly used. They consist of neurons divided into couple layers: input, hidden (one or more) and output. High accuracy of the predictive models based on MLP networks induced the authors of this research to use this type of artificial neural network to predict fuel consumption.
Studies dedicated to the use of artificial neural networks for prediction of fuel consumption were divided into the following stages of implementation:

•
Selecting the technical parameters having a significant impact on the quantity of fuel consumption; • Developing a database for the neural network learning process; • Creating a set of neural networks and selecting the best of them; • Calculating a correlation and determining the coefficients and carrying out a sensitivity analysis in respect of the input variables; • Making an attempt to simplify the predictive model-an elimination of the input variables for which the sensitivity coefficient takes values less than or is equal to 1.0; • Creating a new set of neural networks from a reduced database and selecting the best of them; • Calculating the correlation and determining the coefficients, as well as performing a sensitivity analysis of the input variables for the reduced model; • Comparing and evaluating an acceptability of the predictive models on the basis of the values of the ex post prediction error measures.
In Figure 1 the proprietary algorithm of using artificial neural networks to create a predictive model, according to the methodology described above, is shown. In Figure 1 the proprietary algorithm of using artificial neural networks to create a predictive model, according to the methodology described above, is shown.

Identification of the Input and Output Variables
The impact of the design solutions and technical parameters of the vehicles on fuel consumption was repeatedly analysed in the earlier studies carried out by other researchers. A significant relation between displacement capacity, vehicle weight, number of cylinders, number of valves and fuel consumption were presented in the paper by Zargarnezhad et al. [8]. On the basis of the literature analysis, a set of input and output variables for further studies was determined and it is presented in Table 2. It contains 12 input variables (7 quantitative and 5 qualitative) and 3 output variables. In the case of quantitative variables, a system of units was defined, while in the case of the individual qualitative variables, sets of admissible nominal values were assigned.

Identification of the Input and Output Variables
The impact of the design solutions and technical parameters of the vehicles on fuel consumption was repeatedly analysed in the earlier studies carried out by other researchers. A significant relation between displacement capacity, vehicle weight, number of cylinders, number of valves and fuel consumption were presented in the paper by Zargarnezhad et al. [8]. On the basis of the literature analysis, a set of input and output variables for further studies was determined and it is presented in Table 2. It contains 12 input variables (7 quantitative and 5 qualitative) and 3 output variables. In the case of quantitative variables, a system of units was defined, while in the case of the individual qualitative variables, sets of admissible nominal values were assigned.

Correlation Coefficients
For the purposes of evaluation of the matching between predicted and real values of FC U , FC H and FC M variables, the commonly used Pearson correlation coefficients r [41,[47][48][49] and the coefficient of determination R 2 [35,36,38,42] were employed. Pearson correlation coefficient is used to evaluate the level of linear relationship between the variables. It takes values from range [−1;1], where values close to 1 mean a positive dependence, close to -1 mean a negative dependence, and 0 means no linear dependence. The coefficient of determination R 2 takes values from the range [0;1]. The relationships enabling the calculation of values of coefficients r and R 2 are presented using the Equations (1) and (2), adjusting them to the needs of carried-out research: where: EFC i -experimental fuel consumption value for vehicle i, PFC i -predicted fuel consumption value for vehicle i and n-number of records.

Sensitivity Analysis
A sensitivity analysis in the neural networks is used to determine the impact of individual input variables on predicted values of output variables [50,51]. The indicator used to determine the sensitivity of the predictive model to the values of the individual input variables is the coefficient W j [52,53] calculated according to formula (3): where: W j -sensitivity coefficient of the model to variable j, MSE-mean squared error of the model prediction with all input variables and MSE j -mean squared error of the model prediction without the variable j.
If the coefficient W j takes a value equal to or less than 1, then there are presumptions to eliminate variable j from the model while maintaining a current or higher accuracy of the predictive model [52].

Prediction Errors
A significant element of the result analysis of the use of artificial neural networks to predict fuel consumption is an evaluation of the admissibility of the predictions received as a result of the implementation of a given predictive model. In the literature [54,55], ex post errors are commonly used to analyse the accuracy of a prediction. The most frequently used measures of the prediction errors are presented below, along with the dependencies adjusted to the essence of the research problem [21,[56][57][58]: • Mean squared error (MSE): • Root mean squared error (RMSE): • Mean absolute percentage error (MAPE):

Database
Artificial neural networks need a lot of data to conduct the learning process. For this purpose, a real database of 1750 records was established. The basis for its creation was data from the automotive industry, describing the technical parameters and experimentally determined quantities of fuel consumption in compliance with declarations made by manufacturers. The data concerns passenger cars manufactured all across the world during the years 2010-2020. The values of the fuel consumption variables were determined according to the NEDC (New European Driving Cycle) and the WLTP (Worldwide Harmonised Light Vehicles Test Procedure) tests [33,[59][60][61][62][63]. The database contains a set of the vehicles featuring very different technical parameters and design solutions. The essence of this approach is to search for the universal relationships between the analysed variables for a broad range of the technical objects. Table 3 presents values obtained from the statistical analysis of the quantitative variables (input and output) from the developed database. The Pearson correlation coefficients between individual input variables presented in Table 4 do not take values close enough to 1 to allow for the elimination of one of pair of correlated variables. The values of the input and output variables, before being used for learning, testing and validation process of the neural network, were subjected to the normalization process, according to Formula (7). All normalized values of the quantitative variables are in the range [0; 1]. Each nominal value of all qualitative variables is assigned a corresponding neuron in the input layer, while the signal coming from such a neuron takes a binary value (0 or 1): where: x s -value of standardized quantitative variable, x-value of a quantitative variable before normalization, x min -minimum value of a quantitative variable and x max -maximum value of the quantitative variable. Figures 2 and 3 show the relationships between fuel consumption (FC U , FC H , FC M ) and the quantity and quality parameters of a vehicle. The graphical visualisation of the input data illustrates the broad range of research carried out and the high multifacetedness of relations between the individual variables of the predictive model. The shapes of the plots are similar to each other within individual input variables. Depending on output variable, the respective shapes come across as being scaled accordingly. This indicates the possibility of developing one predictive model for three predicted variables. This model corresponds to the neural network, containing three neurons in the output layer. The visual effect of scaling the plots will be reflected in the appropriate assignment of weights of the synaptic connections between the neurons of the hidden layer and the output layer.
dicates the possibility of developing one predictive model for three predicted variables. This model corresponds to the neural network, containing three neurons in the output layer. The visual effect of scaling the plots will be reflected in the appropriate assignment of weights of the synaptic connections between the neurons of the hidden layer and the output layer.

Results
To create the artificial neural network, Statistica 13 software was used, in which the proprietary database developed on basis of the technical parameters of passenger cars was analysed. The database was divided at random by the programme into three sets: training (80%), testing (10%) and validation (10%). The neural network derives information from the data transmitted to it and on this basis, it conducts a learning process, i.e., a selection of the appropriate activation functions of the neurons and weights of the pathways between neurons. The use of Statistica software for modelling artificial neural networks accelerated and improved the process of calculation of the training algorithms. The exceptions are papers [22,35,37,42], in which Matlab software was used to create the neural networks.

Results
To create the artificial neural network, Statistica 13 software was used, in which the proprietary database developed on basis of the technical parameters of passenger cars was analysed. The database was divided at random by the programme into three sets: training (80%), testing (10%) and validation (10%). The neural network derives information from the data transmitted to it and on this basis, it conducts a learning process, i.e., a selection of the appropriate activation functions of the neurons and weights of the pathways between neurons. The use of Statistica software for modelling artificial neural networks accelerated and improved the process of calculation of the training algorithms. The exceptions are papers [22,35,37,42], in which Matlab software was used to create the neural networks.

ANN Model with 12 Input Variables
Upon the user issuing a command, the programme automatically generated 1000 artificial neural networks, of which it retained the ten best solutions. Their characteristics are presented in Table 5. All created artificial neural networks have an important common feature: an equal quantity of neurons in the input layer and in the output layer. This is conditioned by the quantity of the input and output variables. Each quantitative variable generates one neuron, while each qualitative variable generates as many neurons as the different values it takes. Among the ten networks retained in the programme, the one with the smallest number of errors in the validation set was selected. This is network No. 8 (MLP 22-10-3), comprising 10 neurons in the hidden layer. The network uses the BFGS learning algorithm and the SOS (sum of squares) error function. The learning, testing and validation errors presented in Table 5

ANN Model with 12 Input Variables
Upon the user issuing a command, the programme automatically generated 1000 artificial neural networks, of which it retained the ten best solutions. Their characteristics are presented in Table 5. All created artificial neural networks have an important common feature: an equal quantity of neurons in the input layer and in the output layer. This is conditioned by the quantity of the input and output variables. Each quantitative variable generates one neuron, while each qualitative variable generates as many neurons as the different values it takes. Among the ten networks retained in the programme, the one with the smallest number of errors in the validation set was selected. This is network No. 8 (MLP 22-10-3), comprising 10 neurons in the hidden layer. The network uses the BFGS learning algorithm and the SOS (sum of squares) error function. The learning, testing and validation errors presented in Table 5 correspond to half of the mean square error calculated from the values of unnormalized input variables for each set. The best selection of the weights of the synaptic pathway was achieved in 84th learning cycle network. The MLP 22-10-3 neural network learning diagram, showing changes in the value of half the mean square error for the normalized values of the variables in individual epochs (learning cycles), is presented in two scales in Figure 4. The activation functions are, respectively, for the hidden layer-exponential function and for the output layer-unipolar sigmoid function. The weights of the synaptic pathways in the MLP 22-10-3 network are presented in Appendix A.

Sets:
Training Test  In Figure 5, a simplified model of the selected MLP 22-10-3 artificial neural network is presented, which has the lowest error value for the test set. The input layer (red) has 22 neurons, the hidden layer (blue)-10 neurons, and the output layer (green)-3 neurons. To each input neuron a quantitative input variable or one nominal value of a qualitative input variable is assigned. Each output neuron corresponds to one assigned output variable describing fuel consumption in a given environment. The synaptic pathways occur between the neurons from the input and the hidden layer, and between the hidden and the output neuron, by combining each neuron from the concerned layer with each neuron from the adjacent layer. In Figure 5, a simplified model of the selected MLP 22-10-3 artificial neural network is presented, which has the lowest error value for the test set. The input layer (red) has 22 neurons, the hidden layer (blue)-10 neurons, and the output layer (green)-3 neurons. To each input neuron a quantitative input variable or one nominal value of a qualitative input variable is assigned. Each output neuron corresponds to one assigned output variable describing fuel consumption in a given environment. The synaptic pathways occur between the neurons from the input and the hidden layer, and between the hidden and the output neuron, by combining each neuron from the concerned layer with each neuron from the adjacent layer. The values of the correlation coefficients r and the coefficient of determination R 2 for the selected neural network MLP 22-10-3 are shown in Table 6. For all variables in each sample, the value of correlation coefficient r was above 0.93, reaching a maximum value of 0.9499 for the FCM variable in the Test trial. The coefficient of determination R 2 for all variables reached values above 0.98. The values of the correlation coefficients r and the coefficient of determination R 2 for the selected neural network MLP 22-10-3 are shown in Table 6. For all variables in each sample, the value of correlation coefficient r was above 0.93, reaching a maximum value of 0.9499 for the FC M variable in the Test trial. The coefficient of determination R 2 for all variables reached values above 0.98.
The plots of dispersion of real values and values of predicted variables FC U , FC H and FC M divided by datasets are presented in Figure 6. Above each of the plots there are the linear regression formulas. The high values of the correlation coefficient r and the coefficient of determination R 2 and a small quantity of outliers in the plots indicate a good matching of the model to the real data.  For the purposes of determining the magnitude of impact of individual input variables on the accuracy of the predictive model, a global sensitivity analysis was conducted. The results of the analysis are presented in Table 7. The sensitivity coefficient Wj of the Gearbox TG variable is equal to exactly 1, which means that this variable has no im-  For the purposes of determining the magnitude of impact of individual input variables on the accuracy of the predictive model, a global sensitivity analysis was conducted. The results of the analysis are presented in Table 7. The sensitivity coefficient W j of the Gearbox T G variable is equal to exactly 1, which means that this variable has no impact on the MSE error of the prediction. Table 7. Global sensitivity analysis of the MLP 22-10-3 artificial neural network.

ANN Model with 11 Input Variables
The next stage of the research is an attempt to optimise the artificial neural network. It is based on the results received in the sensitivity analysis of the input variables. Therefore, a hypothesis was put forward assuming that an elimination of the T G variable from the set of the input variables will not reduce the prediction accuracy by the neural network. Using Statistica software, the process of creating the artificial neural networks was conducted, including 11 input variables and 3 output variables. The quantity of neurons in the input layer was decreased, while at the same time maintaining the quantity of neurons in the hidden layer and the activation functions of the individual layers such as in the MLP 22-10-3 network. In Table 8 the results of the created ten neural networks featuring the MLP 20-10-3 structure are presented. The network with the lowest error on the validation set, assuming the value of 0.8343, is network No. 3. It uses the BFGS learning algorithm, which found the best selection of the synaptic pathway weights in 77 learning cycles. The MLP 20-10-3 neural network learning diagram, showing changes in the value of half of the mean square error for the normalized values of the variables in individual epochs (learning cycles), is presented in two scales in Figure 7. The errors made by the networks with 11 input variables take similar values to those made by the networks with 12 input variables. The weights of the synaptic pathways in the MLP 20-10-3 network are presented in Appendix B.
A simplified model of the MLP 20-10-3 network is shown in Figure 8. The selection of colour markings of the layers corresponds to the model shown in Figure 5, similarly to the manner of the synaptic pathways between the neurons of the adjacent layers. A simplified model of the MLP 20-10-3 network is shown in Figure 8. The selection of colour markings of the layers corresponds to the model shown in Figure 5, similarly to the manner of the synaptic pathways between the neurons of the adjacent layers. The Pearson correlation coefficients r and the coefficients of determination R 2 divided into the output variables and database sets are presented in Table 9. The highest value of the correlation coefficient r is achieved by the FCM variable for the Test set and the lowest value-the FCH variable for the Training set. All values of the r coefficient are in the range 0.93-0.95, whereas the R 2 coefficient, similarly to in the MLP 22-10-3 model, assumed values above 0.98, which is a result similar to the results achieved within the previous studies by other authors [8,32,35,42]. Nevertheless, it should be mentioned that the papers published so far are based on a significantly narrower group of vehicles featuring similar technical parameters, which facilitates the adjustment of the predictive model to the experimental values. A simplified model of the MLP 20-10-3 network is shown in Figure 8. The selection of colour markings of the layers corresponds to the model shown in Figure 5, similarly to the manner of the synaptic pathways between the neurons of the adjacent layers. The Pearson correlation coefficients r and the coefficients of determination R 2 divided into the output variables and database sets are presented in Table 9. The highest value of the correlation coefficient r is achieved by the FCM variable for the Test set and the lowest value-the FCH variable for the Training set. All values of the r coefficient are in the range 0.93-0.95, whereas the R 2 coefficient, similarly to in the MLP 22-10-3 model, assumed values above 0.98, which is a result similar to the results achieved within the previous studies by other authors [8,32,35,42]. Nevertheless, it should be mentioned that the papers published so far are based on a significantly narrower group of vehicles featuring similar technical parameters, which facilitates the adjustment of the predictive model to the experimental values. The Pearson correlation coefficients r and the coefficients of determination R 2 divided into the output variables and database sets are presented in Table 9. The highest value of the correlation coefficient r is achieved by the FC M variable for the Test set and the lowest value-the FC H variable for the Training set. All values of the r coefficient are in the range 0.93-0.95, whereas the R 2 coefficient, similarly to in the MLP 22-10-3 model, assumed values above 0.98, which is a result similar to the results achieved within the previous studies by other authors [8,32,35,42]. Nevertheless, it should be mentioned that the papers published so far are based on a significantly narrower group of vehicles featuring similar technical parameters, which facilitates the adjustment of the predictive model to the experimental values.
In Figure 9 the plots of the dispersion of the experimental values (declared by the manufacturers) and the values predicted by the MLP 20-10-3 artificial neural network for the variables FC U , FC H and FC M , respectively, are shown. Above each of the plots the linear regression formulas are described. The values of the correlation coefficient r and the coefficient of determination R 2 and a small quantity of outliers in the plots indicate a good matching of the model to the real data.  The global sensitivity analysis for the MLP 20-10-3 network, presented in Table 10, indicates the influence of the values of the individual input variables on the accuracy of the predictive model. The neural network has the highest sensitivity to the Type of charge BT variables and the lowest sensitivity to the Quantity of valves NV variables. However, no value is equal to or less than 1. Therefore, no further attempts were made to The global sensitivity analysis for the MLP 20-10-3 network, presented in Table 10, indicates the influence of the values of the individual input variables on the accuracy of the predictive model. The neural network has the highest sensitivity to the Type of charge B T variables and the lowest sensitivity to the Quantity of valves N V variables. However, no value is equal to or less than 1. Therefore, no further attempts were made to decrease the quantity of input variables for the neural network.

Discussion
On the basis of Equations (4)-(6), the ex post prediction errors were calculated for all vehicles used in the database. The results of the calculations are presented in Table 11. The  In the literature [28], the level of acceptability of the prediction based on the MAPE is 10%. In the conducted studies this level for the predicted values of the FC U , FC H and FC M values is in the range of 5-11% for the MLP 22-10-3 network and in the range 6.5-11.5% for the MLP 20-10-3 network. However, for the validation set of all variables in both neural networks, the MAPE failed to exceed the acceptable value of 10%. The lowest value of 5.06% was recorded for the FC M variable in the validation set of the MLP 22-10-3 network. For the accepted level of prediction acceptability, the prediction models developed can be determined as sufficiently accurate.
The value ranges of the predicting error measures confirm the appropriate selection of the explanatory variables used in the process of creating the predictive models based on the artificial neural networks. It should be understood that the technical parameters of motor vehicles that have an impact on the fuel consumption were properly determined. The sensitivity analysis of the predictive models indicated the influence of the individual variables on the accuracy of the prediction. An attempt to simplify the predictive model by rejecting the variable T G with a sensitivity coefficient W j equal to 1 resulted in a minimal difference in the accuracy between the model with 12 variables and the model with 11 input variables. In both models, the E T variable, which corresponds to the engine type, had the greatest impact on fuel consumption, due to the type of fuel used to operate it and the hybridisation aspect of the drive. Furthermore, there were the F I and D E variables, corresponding to the fuel injection method and engine displacement. Their significance in the predictive model is reflected in the development trends of the modern internal combustion engines. Most manufacturers in eth automotive industry are departing from the use of indirect injection engines towards direct injection in order to minimise fuel consumption and further reduce engine capacity (downsizing). A predicting tool, such as the artificial neural networks created, makes it possible to estimate fuel consumption on the basis of the assumptions made and the values of the individual variables for motor vehicles at the design stage.

Conclusions
This publication brings up the research problem of the prediction of the fuel consumption of motor vehicles equipped with internal combustion engines. An attempt was made to develop a prediction model capable of mapping the impact of the individual technical parameters of the vehicle on fuel consumption under different conditions of use. The literature review presented in the introduction indicates the topicality of the undertaken research subject. The objective of the research was to use artificial neural networks for the prediction of fuel consumption on the basis of the selected technical parameters and the design solutions of passenger cars. The research's subject was the results related to fuel consumption obtained during the international tests, according to the declarations of the manufacturers from the automotive industry. Passenger cars with different technical parameters, manufactured in the years 2010-2020, were used as the basis for creating the database used for training the artificial neural networks.
For the purposes of carrying out the research on the possibility of using artificial neural networks in the development of a predictive model of the fuel consumption of the motor vehicles, Statistica computer software was used, supporting the analysis of the statistical data. The created MLP 22-10-3 network was characterised by a high quality of training, testing and validation in the range 0.93-0.95. Furthermore, upon conducting a global sensitivity analysis, an attempt was made to optimise the neural network by eliminating one of the variables. The effect of the optimisation was to create a reduced MLP 20-10-3 artificial neural network characterised by levels of the quality of learning, test and validation similar to those of the MLP 22-10-3 ANN. Analysing the values of the correlation and determination coefficients between the experimental and predicted output variables and comparing them to the results obtained from research published by other authors, a high capability of the created artificial neural network to predict fuel consumption was determined.
Additionally, an analysis of the ex post predicting errors was carried out and used to compare the predicted values with real ones. The values of the MSE, RMSE and MAPE measures indicate the acceptable accuracy of the predictions made with the use of the predictive models: MLP 22-10-3 and MLP 20-10-3. To sum up, the use of the artificial intelligence allows us to estimate fuel consumption of passenger cars driven in different operating environments (urban, extraurban, mixed cycle) with high accuracy. At the same time, the high accuracy of the prediction indicates a proper determination of the technical parameters used as input variables in the predictive model.
The effectiveness of using artificial neural networks to predict the fuel consumption of a passenger car powered by internal combustion engine, already at the stage of its design on the basis of the technical parameters specified in the assumptions for the design is confirmed in this publication. It is a multi-faceted issue, manifested by the tendency of the automotive market to minimise fuel consumption in relation to the reduction of emissions of harmful chemical compounds into the environment. Further research should focus on improving the accuracy of the predictive models by making them more detailed and updating the database on a successive basis.  Tables A1 and A2.

Appendix B
Weights of the synaptic pathways in the MLP 20-10-3 artificial neural network are presented in Tables A3 and A4.