1. Introduction
Indonesia is an agricultural country, and the agricultural sector has a very important role in the national economy. Based on data from the Central Bureau of Statistics (2021) [
1], the percentage of workers in the agricultural sector was 88.43% in 2021. Apart from the planting process, post-harvest yield management greatly affects the quality of agricultural products, one of which is the drying process of agricultural products [
2]. Drying can be performed by using a machine with electrical energy, or manually by relying on sunlight, which only supplies energy during the day. Thus, at night, the drying process cannot be carried out. Drying with electric-powered machines is difficult, especially in remote agricultural areas, because electricity from the State Electricity Company sources is not yet affordable. Therefore, local electricity production by utilizing energy resources such as Renewable Energy Sources (RES) can be implemented; one of these resources is solar energy [
3]. Our group has previously developed “SMART DOME 4.0”, which is an intelligent, self-sufficient solar dome system for drying agricultural products using photovoltaic (PV) energy sources. SMART DOME 4.0 utilizes a Machine Learning algorithm for environmental condition prediction to optimize its electrical power generation under environmental dynamics [
4].
In the Solar Power Plant energy system [
4], the battery plays an important role as a store of electrical energy in electrochemical cells. The importance of battery for an energy storage system goes beyond agricultural applications. Energy storage systems are becoming increasingly important as more renewable energy systems are integrated into the electrical grid. If renewable energy systems are to be increasingly integrated into the electrical (or power utility) grid, low-cost and reliable energy storage is paramount [
4]. Renewable energy systems, such as solar and wind power, are variable in their output and are therefore not always able to meet consumers’ electricity demands. Energy storage systems can help balance the supply and demand of electricity by storing excess energy when it is produced and releasing it when it is needed.
Battery performance can be seen based on several parameters, including the depth of discharge (DoD), number of battery cycles, the efficiency of the battery, the charge and discharge rate, and the temperature of the battery and battery capacity. During its use, the battery will degrade. Battery management plays an important role in battery operation to prevent the battery from being damaged quickly. Accurate estimation is one of the efforts to prevent batteries from operating in an overcharge and over-discharge state, which can cause damage to the battery and cause a short battery life [
5]. Research on the estimation of battery characteristics has been carried out, especially on batteries that are applied to electric motors [
6,
7]. However, there are still not many batteries that can be used as energy storage in solar power plants. While lead-acid batteries have been a popular choice for energy storage in solar power plants due to their low cost and reliability, several other battery technologies are being developed specifically for use in solar power systems. These newer technologies offer advantages over lead-acid batteries in terms of efficiency, capacity, and lifespan [
8].
With the aim of optimizing the battery storage system by describing the battery degradation model based on the state of health (SOH) and estimating the remaining useful lifetime (RUL) of the battery, an approach method with high accuracy is needed. Batteries consist of electrochemical cells that cause parameter changes that are affected by their use. In other words, battery degradation is nonlinear. Therefore, it is possible to estimate accurate results using a machine learning approach, namely Coulomb Counting, Fuzzy Logic, or Artificial Neural Network [
9,
10,
11]. In this research, the methods used were long short-term memory and support vector regression.
There are three types of approaches for battery SOH estimation, namely direct measurement, model-based and data-driven methods. The direct measurement method depends heavily on experimental data to analyze various observable battery metrics, such as voltage, current, and temperature, which will be used to determine the state of a battery. One example is the ampere-hour counting method, also called the Coulomb counting method, which estimates the battery status by cumulatively measuring the charged and discharged current [
12]. This method is more appropriate for laboratory use and has limited practicality for real-world conditions.
The model-based approach involves two crucial elements, namely filtering and modeling. For example, Kalman filter (KF) assessed the optimal value by comparing the “measured value” from the current instrument to the “predicted amount” from the previous hour. Some advantages include real-time data processing and applicability to nonlinear systems. However, its primary limitation is its dependence on the battery model’s accuracy to improve its precision [
13].
The last method is a data-driven approach, which enables the identification of hidden information to estimate SOH using a variety of data learning modes, without needing expertise in battery systems. As a result, it can avoid the intricate issues of model acquisition [
14]. For example, support vector machine (SVM) are often employed as algorithm that estimate SOH by converting nonlinear problems in a low-dimensional space to linear problems in a higher-dimensional space. The training capability for SVM is robust for small samples, and it has low computational complexity. Currently, neural network-based methods are increasingly popular due to their efficiency. The Recurrent Neural Network (RNN) has been employed to estimate SOH owing to its ability to retain critical information between the input data and the SOH value. However, issues of vanishing gradients or explosive gradients hinder RNN utilization for long-term estimation [
13]. To solve this issue, LSTM is introduced since it possesses the capability to selectively remember or forget information through its memory cell architecture and grafting mechanism.
Therefore, this study was conducted to determine the battery capacity and RUL, with the result being that the use of batteries in SMART DOME 4.0 can be operated in optimal conditions and support the performance of the existing system. Battery RUL (remaining useful life) estimation and optimization are critical for the successful commercialization of renewable energy systems through full integration into the power grid [
15]. RUL estimation is the process of predicting how much longer a battery will be able to operate before it needs to be replaced. This information is important for system operators, as it allows them to plan for battery replacement and maintenance, which can help minimize downtime and maximize the lifespan of the battery [
16]. Optimization, on the other hand, involves using data and algorithms to manage the charging and discharging of batteries in a way that maximizes their performance and lifespan. This can help reduce operating costs and improve the overall efficiency of the energy storage system [
17]. By accurately estimating battery RUL and optimizing battery performance, renewable energy systems can be integrated more fully into the power grid, improving grid resilience and enabling the greater use of renewable energy sources. This can help reduce our reliance on fossil fuels and decrease carbon emissions, leading to a more sustainable energy future [
18].
Research on battery characteristics estimation has been carried out previously to determine battery capacity by estimating the State Of Charge (SOC), RUL and degradation model and SOH prediction. Many previous estimates have been made on battery applications in electronics [
12,
13,
14]. However, for applications in Solar Power Plants that use lead acid batteries, not much research has been published. The challenges related to battery lifetime and optimization are not limited to lead-acid batteries only, but also apply to other battery technologies, including lithium-ion batteries (LIBs), that are commonly used in renewable energy systems [
19]. While LIBs have many advantages over lead-acid batteries, such as a higher energy density and better efficiency, they do have a relatively short life compared to that of power generation technology [
20]. This means that the batteries may need to be replaced before the power generation technology, which can be costly and may reduce the overall efficiency of the energy system.
Therefore, the techniques for battery RUL estimation and optimization that we propose in this manuscript are relevant to all types of batteries used in renewable energy systems, including lead-acid and lithium-ion batteries. By accurately estimating the remaining useful life of batteries and optimizing their performance, we can maximize the value of the energy storage system and minimize costs. While our focus in this manuscript is on the use of lead-acid batteries due to the low-cost agricultural needs of the project, the techniques we propose can be applied to any type of battery used in renewable energy systems, making them a valuable tool for the industry as a whole. The condition of a battery can be assessed from two aspects, namely SOC and SOH. While the accurate estimation/prediction of the SOC in lead-acid batteries for an advanced RES has been the focus of many earlier studies [
6,
7,
8], lead-acid batteries’ long-term degradation and the use of machine learning and its potential insights into the reliability/durability or long-term health of the battery system (SOH and RUL) have not been the subject of in-depth investigations in many studies. This manuscript is our attempt to start looking into these important aspects in a more systematic manner, especially for the lead-acid battery as an energy storage system with potential importance in an advanced RES.
The prediction of the battery condition is categorized into four methods, namely direct measurement, bookkeeping estimation, adaptive systems, and hybrid methods [
20]. The adaptive system is an estimate using artificial intelligence (AI) or machine learning. The methods that can be used are neural networks, support vector machines (SVM), fuzzy neural networks and Kalman filters. SVM is a machine learning model with an algorithm that separates data based on the best hyperplane. SVM is used for classification, while for estimation in the form of regression, support vector regression is used; this is part of SVM [
13].
3. Results and Discussion
A correlation analysis between numerical variables is conducted to determine the relationship among variables. Visualization of the correlation between variables is carried out using a heatmap or correlation plot. A heatmap is a visualization that depicts a matrix using colors to represent values. The use of a color scale indicates the values of the matrix, using a spectrum from cooler colors (blue) to warmer colors (red). Based on
Figure 1, the correlation results of variables using a heatmap show that the spectrum of red colors indicates higher correlation values, while blue indicates lower correlation. It can be concluded that variables such as capacity, voltage measured, current measured, temperature measured, and time have a high correlation, indicated by the dark red color with a positive value of 1.
The color intensity in heatmap cells reflects the correlation value between variables; the stronger the correlation, whether positive or negative, the brighter the color. As the correlation value approaches 1, the color will be closer to bright red or a shade of positive correlation. Conversely, as the correlation value approaches −1, the color will be closer to bright blue, and it will become darker when closer to 0.
A strong positive correlation indicates that the variables move together or in the same direction. This means that if the value of one variable increases, the other variables are also likely to increase. On the other hand, a negative correlation indicates that the variables have an inverse relationship, meaning that if the value of one variable increases, the other variable is likely to decrease. A correlation value close to 0 indicates that the two variables do not have a strong linear relationship, meaning that changes in one variable do not consistently affect changes in another variable.
The SVR method built with the RBF kernel shows poor SOH estimation results. This is indicated by the low accuracy value of 0.127. The accuracy value shows the ratio of the correct prediction results to the overall data. With the results obtained, the correct prediction results from the SVR model are very small. Thus, this indicates that the algorithm performance of the built model is poor. Therefore, the use of the prediction results of this model has a very large risk of error, while the algorithm model with the LSTM method that was built showed better estimation results. The calculation results show that the RMSE value obtained is 0.0688.
Figure 1 shows the estimation results of LSTM SOH. The graph form obtained also shows almost the same curve, correctly showing the data pattern of the model studied. The capacity decline, which is mostly connected to the battery’s SOH, is the most obvious sign of battery deterioration. From
Figure 2, we can see that the actual SOH value is higher than the predicted SOH for each cycle. This shows that the current model cannot accurately predict the Remaining Useful Lifetime (RUL) and that the battery may degrade faster than expected. This could be due to a variety of factors, such as variability in the manufacturing process, differences in usage patterns or environmental conditions, or the inherent complexity of battery aging.
The capacity decline, which is mostly connected to the battery’s SOH, is the most obvious sign of battery deterioration. From
Figure 2, we can see that the actual SOH value is higher than the predicted SOH for each cycle. This shows that the Remaining Useful Lifetime (RUL) cannot be accurately predicted by the current model and that the battery may degrade faster than expected. This could be due to a variety of factors, such as variability in the manufacturing process, differences in usage patterns or environmental conditions, or the inherent complexity of battery aging.
To accurately predict RUL, it is important to consider additional factors that may impact battery degradation and incorporate them into the model. This could involve collecting more data on the battery’s usage history, environmental conditions, and other relevant factors, and using advanced modeling techniques to better capture the complex interactions between these variables. Ultimately, the accurate prediction of RUL is critical for optimizing battery performance and minimizing downtime, particularly in applications where batteries play a critical role, such as electric vehicles or grid-scale energy storage systems. Therefore, continued research and development in this area is essential for advancing the state-of-the-art in battery management and improving the reliability and performance of battery systems. With a smaller error value, the LSTM model algorithm was chosen to estimate the battery RUL. The estimation is carried out with the same stages and models, namely sequential, using the initial 50 cycles of data as training data. In the training phase, the model is trained using 200 epochs. This model training is carried out using a regress fit code (x_train, y_train, epochs = 200, batch_size = 25). In this study, one epoch was achieved when 25 batches of training data samples were processed. The fitting results show good model performance. This is indicated by the reduction in the loss value in each epoch.
The prediction results show that the RMSE value is 0.031, which indicates that the error value is close to 0. This means that the predicted results are close to the actual observed data. The RUL graph can be seen in
Figure 3.
In the initial analysis, the Support Vector Regression (SVR) method with the RBF kernel showed poor results, with a low accuracy value of 0.0127 and RMSE 5377. The model was implemented using default parameters (RBF kernel, C = 1.0, epsilon = 0.1, gamma = ‘scale’), and hyperparameter tuning (e.g., grid search) was not conducted due to computational constraints and the exploratory nature of the study. The main goal was to assess the feasibility of using AI to model battery degradation rather than optimizing predictive accuracy. Input features were standardized using StandardScaler to address SVR’s sensitivity to input scale. On the other hand, the Long Short-Term Memory (LSTM) method demonstrated better estimation results with an RMSE value of 0.0688, which is relatively close to 0.
To gain more insights into the relationship between features and the target variable (SOH), it is essential to conduct a thorough analysis of feature correlations and their impact on the model. By conducting a comprehensive analysis of feature statistics and correlations, we can gain valuable insights into the behavior of the battery and identify critical factors affecting SOH. This, in turn, will help refine the LSTM model and make more accurate predictions for the remaining useful lifetime (RUL) of the battery.
The use of the LSTM model algorithm for estimating battery RUL with a smaller error value is a promising development. LSTM models are a type of neural network that is well-suited to sequential data analysis; they have been shown to be effective for time series forecasting. The training of the LSTM model using 200 epochs and a batch size of 25 is a standard approach for deep learning models. The reduction in the loss value in each epoch indicates that the model is learning and improving its performance over time.
The prediction results with an RMSE value of 0.031 indicate that the model’s predictions are very close to the actual observed data. This is a good indication that the model RUL model shown in
Figure 3 accurately captures the complex relationships between the input variables and the battery RUL. However, it is important to note that even small errors can have a significant impact on the performance of the battery, so ongoing monitoring and optimization of the model will be important for maximizing battery lifespan and performance.
The description of the battery capacity is then compared with the use of batteries in Solar Power Plants in Indonesia. This comparison data illustrates the condition of the lead acid battery capacity when used as energy storage for power systems, as well as SMART Solar Dome 4.0 [
4].
While the description of the battery capacity in the previous section focused on a specific study and experimental setup, it is useful to consider how these findings relate to the use of batteries in other applications, such as solar power plants in Indonesia. For instance, the difference rates of SOH or battery health decline as shown in
Figure 4 could be an important insight that we could learn from this specific system. There is clearly a change in the battery health decline rate at about 50 cycles—it was declining at much higher pace from thereon. Whether this may be associated with the system parameters of the battery arrays or part of the larger energy systems, or with more fundamental parameters of the individual batteries, remains an important question that is worth pursuing further in future studies that could benefit from our data analytics in this article.
The initial SOH and predicted SOH have relatively higher values compared to the Solar Power Plant SOH shown in
Figure 4. This shows that the initial SOH and predicted SOH values from the previous section may not be directly comparable to the SOH values observed in solar power plant batteries in Indonesia or other locations. There are several reasons why this may be the case.
- (1)
The battery chemistries used in solar power plant applications may differ from those used in the experimental setup described in the previous section. For example, some solar power plants may use lithium-ion batteries, which have different aging characteristics and performance requirements compared to lead-acid batteries.
- (2)
The operating conditions in solar power plant applications may be different from those in the experimental setup. Solar power plant batteries may be subject to different temperature and humidity conditions, may have different charge and discharge profiles, and may experience different levels of usage and cycling. All of these factors can impact the aging and performance of the batteries and may result in different SOH values.
- (3)
Finally, it is possible that the differences in SOH values between the experimental setup and solar power plant applications may reflect differences in the quality or maintenance of the batteries. Solar power plant batteries may be subject to different maintenance schedules, inspection procedures, and replacement criteria compared to experimental batteries, which could impact their SOH values.
The results of the estimation of the SOH value of the dataset and Solar Power Plants showed a significant difference. With differences in characteristics at the time of measurement, especially temperature differences, the NASA dataset was measured under constant ambient temperature conditions of 24 °C, different from the Solar Power Plant data conditions measured in temperature differences. The graph above also shows the characteristics of the battery type. The type of lithium-ion battery has advantages over lead-acid batteries. Thus, the RUL of the battery also lasts longer.
In general, lead-acid batteries are a common choice for energy storage in solar power systems, including both off-grid and grid-connected applications. These batteries are relatively inexpensive, widely available, and can provide high discharge currents when needed. However, they also have some limitations, including a relatively short lifespan compared to other battery chemistries and the need for regular maintenance to prevent degradation and ensure reliable operation. This manuscript aims to provide important insights from big data analytics approaches that may be useful in future studies to enhance the long-term reliability and lifetime of the lead-acid battery systems—be it from the systemic point of views (energy storage systems) or from the fundamental electrochemistry aspect of the individual battery elements.
In the context of the SMART Solar Dome 4.0 project [
4], it is likely that similar considerations apply. The specific battery capacity requirements and usage patterns will depend on the details of the project, such as the size of the solar array, the expected energy demand, and the local climate and weather conditions. However, some general principles for battery sizing and selection can be applied, such as ensuring that the battery capacity is sufficient to meet the energy demand and taking into account factors such as temperature and depth of discharge.
Monitoring the State of Health (SOH) value of a battery can provide highly valuable information for understanding the physical degradation occurring within the battery [
22]. Decreased capacity is usually an indicator of physical degradation [
23,
24], as the active materials within the battery cell undergo structural changes and are unable to efficiently store energy as they did when the battery was new.
Meanwhile, SOH reflects the battery’s health condition relative to its original capacity. A lower SOH value indicates significant physical degradation within the battery cell. Monitoring the SOH value provides an insight into how charge–discharge cycles affect battery health. Repeated usage and exposure to extreme temperatures during charge cycles can accelerate degradation.
During physical degradation, the battery tends to generate more heat when in use. Monitoring the battery temperature during usage and charge–discharge cycles, detecting unusual temperature changes or excessive temperature rise [
25], which indicates potential physical degradation issues, can be identified more promptly. This is also evident from the graph in
Figure 4, where the SOH of the solar PV system experiences a significant decline due to measurements being taken in an environment with fluctuating temperatures influenced by changing weather conditions and repeated battery usage, leading to temperature elevation.
Hence, envisaging a scenario where the battery temperature remains stable would greatly enhance the effectiveness of battery utilization and extend its lifespan. This represents one way in which the current study has provided meaningful insights into useful studies on battery optimization, especially for lead acid batteries, and for the purposes of battery utilization and lifetime extension. Other possible path forwards stem from the fact that the results in the present study do not fully describe the conditions for using lead acid batteries in a Solar Power Plant due to the limited dataset available in public repositories. Therefore, when they become available in our project SMART DOME 4.0, the use of actual lead acid battery measurement data in the project will better describe lead acid battery degradation for relevant agricultural-integrated solar farm applications.