1. Introduction
Although it is a primarily reliant industry globally and a cornerstone of human survival, agriculture is also one of the most energy-intensive industries. Soil cultivation, a fundamental activity in crop production, consumes substantial amounts of energy, primarily through the operation of heavy machinery, impacting productivity and environmental sustainability [
1]. According to Upadhyay and Raghubanshi [
2], energy use in modern agriculture impacts agricultural sustainability and is ultimately very unsustainable. Therefore, knowing and managing agricultural energy use is vital for preserving agriculture’s long-term sustainability and competitiveness [
3]. Accurately predicting energy requirements during soil cultivation is essential to optimize machinery efficiency, reduce fuel consumption, and promote sustainable farming practices [
4].
Over the past few decades, agriculture has been heavily reliant on fossil fuels [
1] and estimating energy inputs in agriculture entails examining the energy inputs required in various farming operations and processes [
5,
6]. It is broadly divided into direct and indirect consumption. Direct energy consumption includes energy used in agricultural production processes, including electricity, refined petroleum products, natural gas-based fuels, and wood chips [
5,
7]. In contrast, indirect energy inputs include energy used for producing fertilizers, pesticides, farm machinery, seeding, and feed [
8,
9]. Although direct energy consumption constitutes less than 50% of total energy consumption in the agricultural sector, optimizing this consumption is an integral component of sustainable agriculture [
8,
10]. Tillage, seeding and planting, fertilization and pesticide, irrigation, and harvesting are the key agricultural energy-consuming activities [
4,
11]. This research is focused on energy consumption relating to soil tillage. Energy consumption during tillage varies and is significantly influenced by soil texture and compaction, farm machinery efficiency, tillage method, and moisture content [
4,
7,
11,
12]. Understanding the energy inputs in these tasks is critical for optimizing farming techniques and lowering environmental footprints.
Traditional methods for estimating energy consumption rely on empirical models, laboratory tests, or field trials that measure draft force and traction efficiency [
13]. While these methods provide useful insights, they are often time-consuming, labor-intensive, and inefficient, and may not capture the spatial variability of soil properties, leading to increased operational costs and energy waste [
4]. As a result, there is a growing demand for innovative methods that can provide precise, real-time estimates of energy requirements while accounting for soil heterogeneity. Recent advancements in sensing technologies, such as Veris systems, Claas Telematics, and electromagnetic induction (EMI), offer innovative approaches to understanding and predicting energy consumption [
14,
15,
16]. These technologies provide real-time, high-resolution data on soil characteristics such as soil compaction, moisture content, and subsurface structure, directly impacting the energy required for mechanical operations [
17,
18]. This enables a shift from generalized energy estimates to site-specific predictions [
15].
In recent years, there has been an observable rise in the utilization of geophysical data within the agricultural sector. The data can be sourced from various devices, including scanners, which utilize electrode-based electrical resistivity tomography (ERT) and electromagnetic induction for measurement purposes. EMI is a technique used to measure the soil’s response to electromagnetic fields, thus determining its electrical conductivity, while ERT measures a soil’s response to induced current, hence resistivity. Apparent electrical conductivity (ECa), defined as the current flow in the soil, is proportional to the total dissolved solids in the soil [
19] and is frequently used to characterize field variability for application in precision agriculture [
20,
21]. The factors influencing soil ECa are soil moisture content, texture, porosity, salinity level, bulk density, organic matter, and temperature [
22,
23,
24]. The scanners employed in agriculture, including the Geonics EM-38 and Veris 3100, are proficient in measuring a range of parameters in addition to ECa. The Geonics EM-38 scanner is also used to measure magnetic susceptibility (MS), defined by Schenck [
25] as a physical property that characterizes a material’s response to an external magnetic field. MS measurement is necessary to establish characteristics, contributing variables, and methods of soil development. In agriculture, the primary function of MS measurement is the identification of soil contamination by heavy metals [
26]. However, it has been demonstrated in previous studies that the use of MS measured in the soil profile in conjunction with ECa improves the efficacy of models for estimating soil compaction based on soil electrical parameters [
27]. The Veris 3100 scanner is armed with an optical sensor capable of measurements in the red light and infrared bands. Depending on variables, including settling times, soil preparation techniques, and pertinent soil properties, the infrared methodology can efficiently examine the distribution of soil particle sizes [
28].
The acquisition of large amounts of data, including in real time, is becoming increasingly straightforward. Contemporary agricultural machinery is equipped with a plethora of sensors, with data being collected via the perception of the Internet of Things (IoT). However, the crux of the matter lies not in the acquisition of data but rather in its effective processing to yield actionable information useful to agricultural practice. The employment of machine learning (ML) is advantageous in this context, as it facilitates the development of highly accurate predictive and classification models through data analysis. The utilization of such models is manifold, encompassing applications such as yield prediction [
29], crop quality assessment [
30,
31], optimization of the storage process for fruit and vegetables [
32], and support for vision systems in agricultural robots [
33]. These methods are also employed in the context of energy consumption estimation in the agricultural sector. Research on this topic is mainly concerned with energy consumption prediction from historical or energy-related data [
10,
34,
35]. However, there is a lack of publications in the contemporary literature concerning the prediction of the energy consumption required to perform agrotechnical operations based on geophysical data. The use of scanning technologies for energy estimation is not only a step toward precision agriculture but a means to achieve sustainability goals. Precise energy predictions have the potential to assist farmers in optimizing machinery utilization, reducing fuel consumption, and minimizing greenhouse gas emissions. Furthermore, these technologies can enable site-specific soil management, allowing for targeted interventions that enhance soil health and productivity while reducing unnecessary energy expenditures.
Previous research has shown that soil compaction can be predicted based on geophysical parameters such as soil electrical conductivity and magnetic susceptibility [
27]. As soil compaction has a significant impact on energy consumption, this study aimed to investigate how to estimate energy usage during soil cultivation, indicated by fuel consumption and productivity, by combining geophysical data with ML. To this end, a research hypothesis was formulated, stating that it is possible to predict energy consumption through the use of ML combined with scanning methods popular in agriculture. Using cutting-edge technologies, it bridges the gap between conventional approaches to energy consumption estimation and the demand for accurate, scalable, cost-effective, and machinery-efficient forecasts. The prevailing methodologies employed for the purpose of predicting energy consumption in agricultural operations are characterized by a reliance on predefined equations and fixed parameters that delineate physical processes, machinery specifications, and operational settings. This renders the models susceptible to inaccuracies in parameter estimation, which can result in substantial prediction errors. Consequently, there is an increasing necessity for more flexible, data-driven approaches that are capable of modeling nonlinear relationships and adapting to complex, evolving agricultural environments. The integration of machine learning with geophysical data represents a promising approach within this context. The study’s goal is to ascertain the two aspects of the development of predictive models. Firstly, the optimal ML method and its hyperparameters must be identified. Secondly, the most suitable set of independent variables for the models must be determined.
2. Materials and Methods
The process of generating machine learning models for energy consumption approximation was broken into two primary phases: gathering and evaluating data for model training, then executing the model training, and assessing the outcomes to determine the best options, as shown in the flowchart in
Figure 1.
2.1. Study Area
Field measurements were conducted in Brest county, Opolskie Voivodeship, Poland, with GNSS coordinates 50°39′41.55″ N/17°21′34.23″ E. The study area is located in one of the warmest climatic regions of Poland—the Wrocław climatic district. The climate is temperate with oceanic characteristics. The average annual temperature is approximately 8.5 °C, with annual precipitation ranging between 500 and 600 mm. Snow cover persists for about 50–60 days per year, and the growing season lasts around 225 days. The predominant winds are from the west and northwest. Approximately 80% of the area is covered by soils classified as protected for agricultural use (classes I–IVa). Clay soils are predominant in the region.
Within a plot of land of approximately 100 ha, a study site of 10 ha was designated for measurements (
Figure 2). In 2022, after harvesting the main crop (spring barley), shallow post-harvest cultivation with a disk harrow was carried out, mustard catch crop sown and chemically eradicated in autumn 2022. No agrotechnical treatments were carried out on the plot before the 2023 survey. The main crop in 2023 on the surveyed plot was maize for grain.
2.2. Sensor Data Acquisition
Using galvanic contact resistivity (GCR) and electromagnetic induction (EMI) techniques, soil geophysical parameters were measured at the same location. Geophysical measurements were carried out at 0.2 m using the Veris 3100 GCR sensor (Veris Technologies, Inc., Salina, KS, USA; VERIS-EC) as a contact scanner and Geonics EM-38 (Geonics Limited, Mississauga, ON, Canada) as a non-contact sensor scanner at a depth of 0.5 m. The two instruments were pulled by a gator separately, ensuring electrical measurements from scanners without electrical conductivity signal interruptions (
Figure 3). The scanner surveys were conducted in an independent manner, with two distinct runs. Initially, the Veris 3100 scanner was utilized for scanning, followed by the EM-38 scanner being connected to the quad, resulting in another run along the delineated lines. A data logging station was mounted on the gator for the entire measurement period, recording data of the measured parameters.
The Veris 3100 soil scanner is a device that records four important soil properties in a single operation. In this electrode-based system, sensors are rolled across the fields to create direct contact with the soil. It is a three-sensor system measuring ECa, organic matter content, altitude, and acidity (pH) in the rooting zone. It also measures soil reflectance (IR) using its optical sensor in infrared spectra by directing IR light onto the soil as it moves across a field. The Veris 3100 used in this study to record measurements consists of 2 pairs of rolling colter disks, giving an electrical conductivity reading every second. To obtain ECa values, one pair of disk electrodes generates current into the soil, and the voltage change across the other pair of disk electrodes is monitored. A Global Navigation Satellite System (GNSS) receiver fitted on the Veris unit tracks the location of each soil electrical conductivity measurement site in the field. In the present study, the Veris 3100 scanner was utilized to measure two parameters: electrical conductivity and soil reflectance.
The Geonics EM-38 EMI conductivity meter is a portable field device used in measuring soil electrical conductivity and magnetic susceptibility in the rooting zone. It is appropriate for large-scale soil conductivity measurements because of its high speed and accuracy. It creates an initial electromagnetic field by ejecting small horizontal electrical currents in the soil, generating a secondary electromagnetic field. It features a built-in receiver coil that detects both fields, and the percentage of these is the measurement of ECa shown in units of mS·m−1. It is pulled on a non-conductive plastic material to prevent signal interruption and is assisted by a GPS receiver that tracks the location of each measurement site in the field. The present study involved the measurement of two parameters using the EM-38 scanner: electrical conductivity and magnetic susceptibility.
On the experimental plot, parallel lines were drawn along its longest edge at a distance of 10 m. For the first pass, GPS coordinates from the beginning to the end of the line were recorded and entered into the quad’s automatic guidance system (see
Figure 4A).
Subsequently, the scanners were connected to the quad’s automatic guidance system, which received the coordinates of the drawn lines. The quad’s velocity was 2.5 m·s−1. Depending on the scanner settings, measurements were made every 3 m for the Veris 3100 and every 10 m for the EM-38.
2.3. The Layout of the Field Measurements of Energy Consumption
The Claas Axion 960TT (CLAAS, Paderborn, Germany) is primarily designed for heavy-duty agricultural tasks such as tillage, enhancing traction. Its Terra Trac system improves traction while protecting soil structure, making it ideal for large-scale and precision farming. It is equipped with a Global Navigation Satellite System (GNSS) receiver that tracks the location of each measurement point on the field. The tractor’s stability and adaptability across various terrains ensure reliable performance.
Claas Telematics is an advanced system designed to optimize efficiency and monitor the performance of agricultural machinery. It allows real-time data collection, analysis, and remote access to machine operations, including fuel consumption, working hours, and GPS-based field mapping. When mounted on a tractor for tillage, it collects real-time data such as fuel consumption, engine load, working speed, and GPS-based field position.
Figure 4B shows the locations of the measurement points. It records tillage patterns, covered areas, and operational efficiency.
After the scan, cultivation was carried out to a depth of 0.25 m. Tillage was carried out with a Claas Axion 960TT semi-truck tractor and a Horsch Tiger 5 AS (HORSCH Maschinen GmbH, Schwandorf, Germany). After the tests, the nearest points of the EM-38 and Veris 3100 scanners were assigned to each recorded tractor point using the least squares method for further analysis.
The Horsch Tiger 5 AS is a robust cultivator engineered for intensive soil cultivation and capable of operating at depths of up to 35 cm. Its four-bar frame design features a tine spacing of 23 cm and a frame height of 85 cm, ensuring thorough mixing of crop residues across the entire working depth. The Horsch Tiger 5 AS also provides a variety of packer options to achieve optimal soil consolidation. With a working width of 4.7 m, a transport height of 3.3 m, and a transport width of 3 m, it requires a power input of 185 kW. It efficiently enhances soil aeration, integrates agricultural residues, and breaks up compacted soil layers. For high-intensity soil cultivation, the Axion 960TT and Tiger 5 AS work well together to provide effective, deep tillage with minimal soil disturbance. In this study, fuel consumption and productivity are the two parameters utilized as energy consumption indicators. Fuel consumption (dm
3∙h
−1) was measured using a flow meter equipped on the tractor. The parameters needed for calculating the tractor’s productivity were measured and recorded by the radar it is equipped with. The working width of the aggregate was measured to be 5 m; however, an overlap of 0.3 m was assumed to avoid uncultivated areas. Therefore, the assumed actual aggregate width was 4.7 m. Productivity was determined by the following equation:
where
Prod is productivity (ha·h
−1),
V is the actual speed (km·h
−1), and
s is the actual aggregate width (m).
The fuel consumption of the tractor was recalculated to express this parameter in (dm
3·ha
−1):
where
FC is fuel consumption (dm
3·ha
−1),
FCTR is fuel consumption measured by tractor (dm
3·h
−1), and
Prod is productivity (ha·h
−1).
2.4. Soil Sampling
The establishment of five management zones (MZ) was facilitated by a proprietary algorithm, devised by the surveying enterprise, which drew upon the findings derived from the EM-38 scanner. Consequently, a reduced number of soil samples were collected to evaluate the moisture content and texture of the soil. Five soil samples, one from each MZ, weighing one kilogram, were randomly taken at depths varying from 0 to 30 cm and put in an airtight plastic bag with a label. Using a mortar and pestle, soil samples were ground after being air-dried for two days at 26 °C and then oven-dried for twenty-four hours at 105 °C. To ascertain the soil texture of the samples, Prószyński’s method [
36] was used. Based on the results, the soil coefficient (
SCoef) was calculated using the Prószyński method according to the following equation:
Using a penetrometer (Eijkelkamp Soil &Water, Giesbeek, The Netherlands), soil moisture was approximated at 27%.
2.5. Machine Learning Modeling Pipeline
Three ML techniques were utilized to model the relationship between the geophysical data, combined with soil properties, fuel consumption, and productivity. These methods included two types of artificial neural networks (ANNs)—multilayer perceptron (MLP) and radial basis function (RBF)—as well as support vector machines (SVMs). These methods are frequently employed in the development of regression models in agriculture, often yielding highly accurate results. Neural networks are inspired by the biological nervous system, where the fundamental unit, an artificial neuron, computes a weighted sum of inputs and applies an activation function to generate an output. ANNs have a layered architecture and do not require explicit programming; instead, they learn from a training dataset. After training, the model is evaluated on a separate test set to determine its generalization capability. Defining the network structure, particularly the number of neurons in the hidden layer, is a crucial step before training.
SVM operates based on support vectors—data points that define the optimal solution. In classification tasks, it identifies a hyperplane that separates different data groups, while in regression tasks, it finds a function that best represents the data. Support vectors play a key role in shaping the hyperplane or regression function. This study employed epsilon-insensitive support vector machine (ε-SVM) regression, which aims to find a function where deviations from target values do not exceed ε for each training data point. The Gaussian radial basis function kernel was selected due to its effectiveness in capturing nonlinear patterns in the data.
The dataset comprised 534 data vectors following the removal of outliers using Interquartile Range method. The data vector incorporated independent variables, namely electrical conductivity and magnetic susceptibility measured by Geonics EM-38 scanner (ECa_EM and MS_EM), electrical conductivity, and soil reflectance in infrared spectra measured by Veris 3100 (ECa_Veris and IR), and soil coefficient (SCoef), as well as independent variables, namely productivity and fuel consumption. The development of two distinct models for fuel consumption and productivity as predicted parameters was undertaken in this study. Furthermore, different combinations of input parameters were tested, according to the five scenarios presented in
Table 1.
The experimental dataset was divided into subsets depending on the ML method. For ANNs, the data was divided into train, validation, and test subsets in a ratio of 70:15:15, with the train and validation subsets being used during the model training stage and the test set being used to assess whether model overfitting had occurred. For SVM models, the data were split into train and test sets in a ratio of 80:20. The optimization of model parameters was conducted through a grid search approach. For ANNs, the optimal number of neurons in the hidden layer was identified. Based on our previous research, a range of the number of neurons in the hidden layer from 10 to 50 was assumed. For MLP networks, an additional evaluation of diverse neuron activation functions in the hidden and output layers was undertaken, encompassing linear, sigmoid, hyperbolic tangent, and exponential types. The optimal parameters of the MLP and RBF models were adjusted after training 20,000 different network configurations with randomly determined initial values of synaptic weights. For the SVM models, the hyperparameters C (inverse regularization parameter), ε, and γ (RBF kernel width) were subject to fine-tuning, with model evaluation being conducted through the utilization of ten-fold cross-validation. Based on preliminary research, the following ranges of hyperparameters were used: C from 1 to 10, ε from 0.1 to 0.5, and γ from 0.1 to 0.5. The MLP, RBF, and SVM machine learning models were developed using Statistica v. 13 (TIBCO Software Inc., Tulsa, OK, USA).
The accuracy of the models was assessed based on three error metrics: the correlation coefficient R between target and predicted values, the mean absolute percentage error (MAPE), and the Root Mean Square Error (RMSE). The calculation of these metrics is described in Equations (4)–(6).
where
Ytar is the absolute target value,
is the mean target value,
Ypred is the absolute predicted value,
is the mean predicted value, and
n is the number of vectors in a dataset.
2.6. Statistical Analysis
To evaluate the performance differences among the regression models, a 10-fold cross-validation procedure was applied. For each model and performance metric (MAPE, RMSE, R), the results from all folds were collected. Prior to conducting statistical tests, the normality of the distribution was assessed using the Shapiro–Wilk test, and homogeneity of variances was verified using Levene’s test. If the assumptions of normality and equal variances were satisfied, a one-way ANOVA was performed to determine statistically significant differences between models. In instances where these assumptions were violated, the non-parametric Kruskal–Wallis test was employed as an alternative. The significance level was set at p < 0.05 for all statistical comparisons.
4. Discussion
Estimating energy usage in soil cultivation has been a growing research focus area, particularly with advancements in precision agricultural technologies. Several studies have explored approaches, including empirical modeling, mechanical sensor-based methods, and remote sensing techniques, to forecast energy consumption. However, integrating geophysical data with machine learning models to predict energy consumption during soil cultivation seems very promising as it considers the variability of the soil and time–cost–environmental effects. It addresses the gap between traditional energy consumption estimation methods and the need for cost-effectiveness, scalability, and accurate predictions using advanced technologies. Our findings support earlier research [
38], which proved that geophysical data in combination with ML techniques enables the prediction of soil compaction, an important factor influencing energy consumption during agricultural operations. For both dependent variables, which we used as indicators of energy consumption (fuel consumption and productivity), the best models were produced by scenarios that used both electrical conductivity and magnetic susceptibility as inputs. More accurate models were obtained for productivity using the SVM algorithm and fuel consumption using the MLP algorithm. Therefore, it can be stated that the accuracy of the models indicated by the MAPE of 4% is appropriate for practical applications in estimating energy consumption. This is especially noticeable in the productivity context, where a very high R-value is correlated with a low MAPE. The superior performance of SVM relative to models based on neural networks may be attributable to the relatively modest size of the training dataset. SVM has been demonstrated to generate more accurate regression models in comparison to neural networks, including MLP and RBF, particularly in scenarios where the quantity of training data is restricted. In contrast to neural networks, which frequently necessitate extensive hyperparameter tuning and substantial datasets for effective generalization, SVMs demonstrate reduced sensitivity to the curse of dimensionality and exhibit robust performance in high-dimensional or sparse feature spaces [
39]. Furthermore, SVM regression models employ a subset of the training data as support vectors, a feature that renders them more data-efficient and less prone to sensitivity to noise. This, in turn, results in improved generalization performance in small-sample scenarios.
Integrating geophysical data with machine learning methodologies facilitates the development of highly accurate predictive models for specific soil properties. Karim et al. [
40] demonstrate that LiDAR sensors provide rapid and accurate data, including the ability to provide precise real-time data for field mapping, a critical factor influencing precision agriculture. This method reduces the need for extensive field sampling and allows for real-time decision-making—a major advantage over traditional mechanical approaches. Diaz-Gonzalez et al. [
41] combined remote sensing data with ML algorithms to estimate soil quality in agricultural systems at local and regional scales. Their findings align with research supporting the use of SVM and ANN in precision agriculture. Similarly, Basso and Antle [
42] demonstrated that integrating unmanned aerial vehicle-based multispectral imaging with AI-driven energy models improved the accuracy of productivity predictions, as AI-based approaches better capture complex interactions. Liu et al. [
43] used Partial Least Squares Regression (PLSR), Random Forest (RF), SVM, and Gaussian Process Regression (GPR) in predicting soil water content. In their findings, the GPR model had the best prediction performance (R
2 ≥ 0.95). Liu et al. [
44], in their study to predict soil nutrients, proposed a sensor array optimization method based on the dynamic feature importance of the RF-Pearson correlation coefficient to identify the optimal sensor combinations for soil nutrients. SVM, RF, MLP, and MLP-RF models were applied to predict the soil nutrients. According to their findings, the MLP-RF model outperforms other models, with a coefficient of determination of 0.94. In researching the potential of utilizing land surface temperature data from MODIS satellites to estimate maximum air temperature across India, Joy et al. [
45] employed extreme gradient boosting (XGBoost), ANN, a generalized additive model, and multiple linear regression models. From their results, XGBoost outperformed the other techniques, achieving the lowest RMSE and R² values of 1.79 °C and 0.90, respectively. In their study to provide an accurate classification of soil texture classes by assessing the capabilities of environmental covariates from Landsat 8 OLI Science products and a digital elevation model, Kaya et al. [
46] employed Decision Tree (DT), RF, and SVM algorithms. The best of the three machine learning models for soil texture classification was RF, with an overall accuracy of 0.63 according to the accuracy evaluation. Deng et al. [
47] aimed to improve the measurement accuracy and application range of low-frequency capacitance moisture sensors by correcting the relationship between low-frequency capacitance and moisture through the detection of soil conductivity. They utilized three modified models: logistic, exponential, and polynomial. The results showed that the Maximum Absolute Measurement Error (MAME) and Mean Absolute Error (MAE) of the logistic model were below 3.55% and 2.50%, respectively, which satisfy most agricultural production requirements for soil moisture detection. Waqas et al. [
48] reviewed the use of machine learning and deep learning for crop selection, land monitoring and management, water, soil, and nutrient management, weed control, as well as harvest and post-harvest practices. They found that machine learning and deep learning facilitate the analysis of complex datasets, enabling data-driven decision-making, reducing reliance on subjective expertise, and improving farm management strategies. The authors noted that advancements in these technologies present significant opportunities to enhance agricultural productivity, sustainability, and resilience.
The utilization of ML algorithms has been demonstrated to facilitate the prediction of energy consumption within the agricultural sector. Trejo-Perea et al. [
49] utilized MLP to forecast energy consumption in greenhouses, with the prediction model incorporating temperature, hour, relative humidity, and load as key variables. The ANN model developed in this research study demonstrated superior performance in comparison to the regression model, achieving a mean absolute percentage error (MAPE) of less than 6% and an R-value greater than 0.9. Ceylan [
50] employed ML methods, including SVM and GPR, to forecast agricultural energy consumption in Turkey. The study utilized population data, combined with gross domestic product share of agriculture, agricultural value-added, and total arable land, as independent variables. The authors developed a GPR model of high accuracy, with a MAPE of 7% and an R-value of 0.97 for the test dataset. In their study, Sharma et al. [
51] used statistical and ML methodologies, namely RF and LSTM, to estimate the seasonal peak demand for electricity consumption in the agricultural sector in India. The models were based on time series data. In that study, statistical methodologies generated more precise models.
It is also noteworthy that contemporary research continues to utilize statistical methodologies and mechanistic models to predict energy consumption in the agricultural sector [
52,
53]. The models presented in this thesis are not without their limitations. The utilization of data-driven models is constrained to the circumstances delineated in the data itself. The research was conducted in a single field, characterized by minimal variability in soil moisture and soil texture at the time of treatment with a specific aggregate. Consequently, the models obtained cannot be utilized under conditions that differ from those of the experiment that provided the training data. In order to obtain more universal models, it is planned that training data will be acquired under a variety of soil conditions and for different agrotechnical treatments.
5. Conclusions
This study establishes electrical parameter-based management as a crucial element of modern agricultural practices, contributing to economic viability and environmental sustainability. While traditional approaches rely on empirical models and field measurements, which are labor-intensive, costly, and prone to errors, modern scanning technologies like the Geonics EM-38 and Veris 3100 provide high-resolution data on soil texture, electrical properties, and spectral indices essential for optimizing resource use, reducing costs and time, and minimizing environmental impacts. The most accurate models were obtained using soil electrical parameters (electrical conductivity and magnetic susceptibility) measured with a Geonics EM-38 scanner, along with electrical conductivity and soil reflectance measured using a Veris 3100 scanner. The models demonstrating high accuracy (MAPE of 3–4%) were produced using the RBF, SVM, and MLP algorithms for productivity and fuel consumption as indicators of energy consumption. The results indicate that employing Geonics EM-38 and Veris 3100 scanners, combined with machine learning, predicts energy use with adequate accuracy for precision agriculture.
Integrating scanning techniques with machine learning-driven analysis enables real-time soil assessment, facilitating adaptive strategies such as variable-depth plowing to reduce fuel consumption and mechanical wear. Based on rapid scanning methods used in soil variability mapping, it is possible to obtain valuable information about soil condition variability, which directly affects draft forces, power requirements, and fuel consumption. This enables more accurate determination of the power needed to operate specific tractor-mounted agricultural implements of given widths and helps assess whether a particular tractor can efficiently work with a chosen implement on a given field. Additionally, this information can support optimal tractor ballasting, for example, by selecting the appropriate front ballast weight to ensure sufficient traction while minimizing soil compaction.