Next Article in Journal
GenAI-Powered Text Personalization: Natural Language Processing Validation of Adaptation Capabilities
Next Article in Special Issue
Probing Solid-State Interface Kinetics via Alternating Current Electrophoretic Deposition: LiFePO4 Li-Metal Batteries
Previous Article in Journal
Closing Editorial: New Insights into Food Ingredients for Human Health Promotion
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Performance Enhancement of Second-Life Lithium-Ion Batteries Based on Gaussian Mixture Model Clustering and Simulation-Based Evaluation for Energy Storage System Applications

School of Electrical Engineering, Soongsil University, Seoul 06978, Republic of Korea
*
Author to whom correspondence should be addressed.
Appl. Sci. 2025, 15(12), 6787; https://doi.org/10.3390/app15126787
Submission received: 15 May 2025 / Revised: 31 May 2025 / Accepted: 16 June 2025 / Published: 17 June 2025

Abstract

Lithium-ion batteries (LIBs) are widely deployed in electric vehicles due to their high energy density and long cycle life. Even after retirement, typically at around 80% of their rated capacity, LIBs can still be repurposed for second-life applications such as residential energy storage systems (ESSs). However, effectively grouping these heterogeneous cells is crucial to optimizing performance of the module. Retired LIBs can be effectively repurposed for numerous second-life applications such as ESSs, and other power backups. In this paper, we compare four clustering approaches including random grouping, equal-number Support Vector Clustering, K-means, and an equal-number Gaussian Mixture Model (GMM) to organize 60 retired cells into 48 V modules consisting of 15-cell groups. We verify the performance of each method via simulations of a 15S2P configuration, focusing on the standard deviation of final charge voltage, average charge throughput, delta capacity, and coulombic efficiency. Based on the evaluation metrics analyzed after regrouping the battery cells and simulating them for second-life ESS applications, the GMM-based clustering method demonstrates better performance.

1. Introduction

With the increasing global emphasis on decarbonization and sustainable energy solutions, lithium-ion batteries have emerged as the leading choice for energy storage in electric vehicles (EVs) due to their high energy density, long cycle life, and favorable performance characteristics [1,2]. As the EV market continues to grow, the worldwide production and deployment of Li-ion batteries are expected to surge, resulting in a parallel increase in the number of batteries reaching their end of life [3,4]. However, many of these batteries retain sufficient residual capacity and can be repurposed for less-demanding applications rather than being prematurely discarded. This concept of second-life usage provides both economic and environmental benefits by extending battery lifespans and reducing the environmental impact associated with battery manufacturing and disposal.
Among various second-life applications, stationary energy storage systems (ESSs) have gained significant attention. Residential and commercial ESS solutions can buffer fluctuations in renewable energy sources and help to stabilize the power grid [5,6,7,8,9]. In particular, the demand for standardized battery module configurations is growing as it simplifies the integration process, lowers development costs, and ensures compatibility with existing battery management systems (BMSs). One widely adopted standard for residential ESSs is the 48 V module configuration, which provides a safe and modular building block [10,11]. Despite its practical advantages, reassembling used cells into battery modules still faces challenges related to cell-to-cell variability, state of health (SOH) discrepancies, and capacity mismatches. These issues necessitate a robust and effective grouping or clustering strategy to ensure consistent module performance and prolonged operating life.
In light of these challenges, there has been extensive research on methods to optimally regroup cells for second-life usage. Beyond simple heuristic or rule-based approaches, data-driven clustering and classification methods are increasingly being adopted to handle complex, multidimensional battery performance data [12]. For instance, Support Vector Clustering (SVC) leverages the kernel trick to capture non-linear relationships in the data, making it suitable for identifying tightly knit clusters even when the boundaries are not trivially separable. K-means, a classic and widely used partitioning algorithm, remains attractive due to its simplicity and computational efficiency [13]. However, K-means typically assumes spherical or near-spherical cluster shapes, which may not always align with battery performance data distributions [14,15]. Gaussian Mixture Models (GMMs), on the other hand, offer a probabilistic framework that can model more flexible cluster geometries and allow for soft clustering [16,17], which is when data points can belong to multiple clusters with varying degrees of membership.
Despite their theoretical differences, there is a need to systematically compare these clustering methods under realistic operational scenarios, especially for second-life battery modules intended for residential ESS applications. In this work, we offer a theoretical comparison of three clustering algorithms—K-means, Gaussian Mixture Models (GMMs), and Support Vector Clustering (SVC)—by applying them directly to our 60-cell dataset to form four equal 15-cell modules. We then quantify how each method’s cluster assignments affect voltage deviation and coulombic efficiency in 15s2p simulations, showing that GMMs’ probabilistic grouping produces the most uniform performance across cells. By validating these approaches on real battery data, this comparative framework highlights the theoretical basis for choosing clustering techniques to optimize second-life module design. In this study, we build a 48 V module simulation platform designed to replicate real-world applications and thoroughly evaluate SVC, K-means, and GMMs in terms of their ability to form well-balanced groups of second-life Li-ion cells. By simulating the module’s operational metrics, such as the standard deviation in final cell charging voltages, energy throughput, and delta capacity over time, we demonstrate how different clustering strategies can significantly influence overall module performance and battery longevity. Our findings show that while all three methods have their merits, GMM-based clustering tends to yield more balanced cell groupings and better overall performance under a range of load scenarios.

2. Data Description and Pre-Processing

The experimental data used in this work is derived from the MIT–Stanford Toyota Research Center battery dataset, originally measured by Severson et al. [18]. The dataset focuses on the APR18650M1A battery cells, manufactured by A123 Systems, Hangzhou, China. Table 1 provides an overview of the key parameters and usage recommendations for the APR18650M1A battery. Notably, the testing was conducted at a constant temperature of 30 °C, employing a one-step or two-step fast charging strategy. The charging protocol follows a constant-current–constant-voltage (CCCV) approach with an upper cutoff voltage of 3.6 V and a lower cutoff voltage of 2.0 V, as recommended by the manufacturer.
For our study, we select 60 battery samples from the dataset and use the data of their first 400 charge–discharge cycles to validate the proposed clustering approach. All samples undergo 4C-discharge rates to ensure a uniform discharge process. The fundamental goal is to repurpose these retired or partially used cells for second-life applications, making it critical to quantify their current condition accurately and group them accordingly. From the available measurements, we focus on three key parameters [19], capacity, internal resistance (IR), and remaining useful life (RUL), as these factors play a dominant role in determining battery health, performance, and viability for second-life usage.
Battery capacity is a primary indicator of the SOH for lithium-ion batteries. The discharge capacity Cn of the nth cycle can be expressed by the following:
C n = C n H C n L
where C n H denotes the total capacity after the battery is fully discharged and C n L is the capacity at the start of discharge for cycle n. As the number of cycles increases, C n typically declines, adversely affecting the battery’s usable lifetime. Figure 1 illustrates how the discharge capacity decreases over the cycles.
Internal resistance (IR) is another critical parameter that impacts the performance of battery and thermal characteristics. A higher IR leads to increased energy losses and faster degradation. When the battery starts discharging, there is an instantaneous voltage drop [20]. Let t be the moment the battery begins discharging, and U be the interpolated voltage at that instant. If t1 and t2 are two closely spaced sampling points with voltages U1 and U2, the ohmic internal resistance R n during the nth discharge cycle can be approximated by the following:
R n = U n I n   w h e r e   U n = U 2 U 1 t 2 t 1 t
where U n is the effective voltage change caused by internal resistance and I n is the discharge current of the nth cycle. Figure 2 illustrates the rapid voltage–current fluctuations occurring during instantaneous discharge of the power lithium-ion batteries due to internal resistance.
RUL represents the projected service life or the number of additional cycles a battery can deliver before reaching a specified end-of-life threshold. In this dataset, we can calculate RUL directly from the measured capacity. When a battery’s capacity decays to 80% of its rated capacity CR, it is considered retired [21]. That is, the condition for retirement is as follows:
C 0.8 C R
where C is the actual capacity of the battery. Figure 3 shows the variation trend of battery capacity with the number of cycles.
To ensure fair and consistent comparison among the three parameters (capacity, internal resistance, and RUL), we apply a min–max normalization scheme. This transformation maps each parameter to a uniform range of 0 to 1, thereby preventing any single parameter with larger numeric values from dominating the clustering algorithms. The min–max normalization for a given parameter x can be expressed as follows [22]:
x n o r m = x x m i n x max x m i n
where x m i n and x max are the minimum and maximum observed values of x in the dataset, respectively. By performing this step, the capacity, internal resistance, and RUL metrics all lie on a comparable scale, which is beneficial for methods such as SVC, K-means, and the GMM.

3. Clustering Methods

After extracting and normalizing the key battery parameters like capacity, IR, and RUL, we employ three different clustering algorithms to regroup second-life batteries: SVC, K-means, and GMMs. This section offers an overview of each approach and their application to our standardized dataset.

3.1. Support Vector Clustering (SVC)

Support Vector Clustering (SVC) is a kernel-based approach that adapts the principles of support vector machines (SVMs) to unsupervised learning. In SVC, the normalized data points are mapped to a high-dimensional feature space using a kernel function, typically the radial basis function (RBF) [19], which is defined as follows:
K x i , x j = e γ x i x j 2
where γ is a hyperparameter controlling the width of kernel. In this feature space, SVC determines the smallest hypersphere that encloses the majority of the data points.
Upon convergence, the projection of this hypersphere back into the original input space produces a decision boundary. A data point x can be tested by evaluating the following:
f x = Φ x a   2 R 2
where f x < 0 indicates that x is inside the sphere, and f x > 0 signals it is an outlier or lies in another region. When different enclosed regions emerge in the input space, multiple clusters can be formed. In the context of second-life battery regrouping, equal number SVC applies an additional post-processing step to ensure each cluster contains a fixed number of cells. Although SVC excels at discovering arbitrarily shaped clusters, it demands careful parameter tuning. The results of clustered cells with SVC are shown in Figure 4.

3.2. K-Means

In contrast, the K-means algorithm partitions the dataset into a predetermined number of clusters k , by minimizing the within-cluster sum-of-squares. Which can be represented by the following:
J c , u = i = 1 M x u i 2
where x i u i 2 is the squared Euclidean distance between a data point x and the centroid u i [23]. The number of clusters was determined via the elbow method by plotting total within-cluster SSE against K, which revealed a distinct elbow, beyond which SSE improvements became negligible. To improve the stability and convergence speed of the algorithm, we employ K-means++ initialization, which selects initial centroids in a way that maximizes their separation. This approach helps the clusters converge to a more optimal configuration than random initialization would typically allow.
After initialization, each data point is iteratively assigned to the nearest centroid [24]. The centroids are then recalculated as the mean of the assigned points, and this process is repeated until the assignments stabilize or a maximum iteration limit is reached. While K-means++ reduces the risk of poor initial cluster placements, the algorithm itself still assumes roughly spherical clusters and relies on Euclidean distance for defining cluster boundaries. Consequently, K-means struggles to handle overlapping or non-spherical distributions, as is often the case in second-life battery data where cell parameters may not neatly conform to uniform cluster shapes. The clustering of the cells with K-means is shown in Figure 5.

3.3. Gaussian Mixture Model (GMM)

Gaussian Mixture Models (GMMs) offer a probabilistic framework that models the data as a mixture of K Gaussian distributions. Each component in the mixture is characterized by a mean vector μ k , a covariance matrix Σ k , and a mixing coefficient π k and the overall probability density function is given by the following [25]:
p x     Θ = k = 1 K π k N x ; μ k , Σ k
π k 0 a n d k = 1 k π k = 1
N x ; μ k , Σ k = 1 2 π d 2 Σ k 1 2 e exp 1 2 x U 1 T Σ k x μ k
Θ = π k , u k , Σ k k = 1 K
where π k represents the mixing coefficient for the kth component and Θ denotes the set of all model parameters.
A crucial step in fitting a GMM is choosing the initial parameter values, since poor initialization can lead to suboptimal clustering or slow convergence in the EM algorithm. In our case, we initialize the GMM means u k using K-means++, which was proposed by S. Vassilvitskii and D. Arthur in 2007 [26]. We employ K-means++ to initialize the means of the Gaussian components prior to the expectation–maximization procedure. K-means++ selects initial centroids that are more widely dispersed than those from random initialization, which significantly reduces the likelihood of convergence to suboptimal local maxima in the likelihood function. This approach not only accelerates convergence but also improves the stability and reproducibility of the GMM clustering results, as documented in prior studies. Specifically, we first use K-means++ to select K well-spaced centroids. These centroids then serve as the initial means u k . The covariance matrices Σ k can be initialized (for instance) as the identity matrix or estimated from the local neighborhoods of these initial means, while the mixing coefficients π k are set to 1 K by default. This initialization strategy helps to avoid the local maxima often encountered by random initialization.
Once we have initial parameter estimates, the GMM parameters are iteratively refined using the expectation–maximization (EM) algorithm [27]. Each iteration of EM comprises the following two steps:

3.3.1. Expectation Step

Compute the posterior probability γ i k that data point x i belongs to cluster k through the following:
γ i k = p z k   x i = π k N x i ; μ k , Σ k i = 1 N π j N x i ; μ k , Σ k
where z k denotes the event that x i comes from the kth Gaussian component. The responsibilities γ i k quantify soft cluster membership, allowing a single data point to have fractional membership in different clusters.

3.3.2. Maximization Step

Update the model parameters { π k , u k , Σ k } to maximize the expected complete-data log-likelihood. The standard formulas for the updates are as follows:
π k n e w = 1 N i = 1 N γ i k ,   μ k n e w = i = 1 N γ i k x i i = 1 N γ i k   ,   k n e w = i = 1 N γ i k ( x i μ k n e w ) ( x i μ k n e w ) T i = 1 N γ i k
where N is the total number of data points. Each update step re-estimates the mixing coefficients, means, and covariances based on the current responsibilities.
These two steps (E and M) alternate until convergence, typically when changes in the log-likelihoods or parameter values fall below a chosen threshold. After all this process, the resulting clusters of GMMs are shown in Figure 6.
Soft clustering is a defining characteristic of Gaussian Mixture Models (GMMs) that proves especially valuable in second-life battery applications, where cells often exhibit overlapping performance characteristics. Unlike hard clustering methods, which assign each cell to exactly one cluster, a GMM outputs membership probabilities that indicate how strongly each cell belongs to each cluster. In our case, we leverage these soft memberships to form equal-sized modules, by sorting the cells of each cluster in descending order of membership probability and assigning them until the desired module size is reached. If any clusters become over- or underfilled, cells near the boundary of cluster membership are reassigned to maintain the overall equal distribution. This probabilistic approach ensures that each 15-cell module remains compositionally uniform while still respecting the practical requirement of equal-sized modules.

4. Comparative Analysis

4.1. Clustering Methods Evaluation

In our evaluation, we quantitatively compared the clustering performance of SVC, K-means, and GMMs using the following three widely recognized indices: the Silhouette Coefficient (SC), Calinski–Harabasz score (CHS), and Davies–Bouldin score (DBI). The Silhouette Coefficient measures how similar an object is to its own cluster compared to other clusters, defined for each sample as follows:
S C = b a max ( a , b )
where a is the average distance between a sample and all other points in the same cluster and b is the average distance between a sample and the points in the nearest cluster. Values closer to 1 indicate well-separated clusters. The Calinski–Harabasz score, calculated as the ratio of the between-cluster dispersion to the within-cluster dispersion, further validates the distinctness of clusters. The higher the CHS, the more distinct the clusters are [28]. In contrast, the Davies–Bouldin score, which evaluates the average similarity between each cluster and its most similar one, rewards lower values that imply better clustering performance [29].
Figure 7 shows a bar chart comparing SC, CHS, and DBI for SVC, K-means, and GMMs. GMMs consistently outperforms the other two methods across all three metrics. The ability of GMMs to provide soft assignments helps it adapt to inherent variations in the battery data, allowing each cell to be assigned in a way that better balances the 15-cells-per-module requirement while accommodating natural differences in capacity, resistance, and RUL.
In contrast, K-means, which strongly depends on the Euclidean distance to cluster centroids, can struggle with clusters that have non-spherical shapes or uneven data densities. SVC, particularly in its equal-number variant, does allow for more flexible boundaries compared to K-means, but the forced equal-cluster-size constraint means that if the natural distribution of battery parameters is strongly imbalanced, the clustering may not fully capture all nuances of the data. As a result, both SVC and K-means can exhibit a slightly lower SC, a lower CHS, and a higher DBI than GMMs.

4.2. Clustering Performance Evaluation

After determining appropriate cell clusters using SVC, K-means, or GMMs, it is essential to verify that these groupings also perform well under practical operating conditions. A purely data-driven clustering approach can overlook system-level constraints such as voltage, current, and thermal behavior during charge–discharge cycles. Therefore, a MATLAB/Simulink-based simulation block diagram, shown in Figure 8, is a valuable bridge between theoretical clustering outcomes and actual battery pack performance. By modeling the battery modules in software, we can accurately estimate how each cluster will behave as part of a 48 V battery pack.
Residential ESS solutions commonly use a nominal 48 V battery pack design for reasons of safety, modularity, and compatibility. To achieve this nominal voltage using Li-ion cells (with each cell having a nominal voltage of 3.3 V), the typical solution is to connect around 15 cells in a series. In our case, a 15S chain ensures the battery pack can reach the required voltage range under normal conditions. While 15 cells in series fix the pack voltage, adding parallel branches increases the total capacity and extends the usable energy for the module. Depending on the energy needs, the number of parallel branches can be scaled up, allowing for a flexible approach to capacity sizing without changing the nominal voltage. Figure 9 shows the voltage, current, and state of the charge waveforms of the 15S2P (30 cells) simulation for one of the clustering groups.
The effectiveness of each clustering method was evaluated by simulating battery modules using the actual parameters of the clustered cells from the dataset, including internal resistance, actual capacity, SOC-OCV values, etc. Each module was constructed using 15 representative cells per cluster and configured as a 15S2P system. The modules were subjected to constant current (CC) charging until the voltage limit was reached, followed by constant voltage (CV) charging, and then constant current discharging.
The key performance indicators used for comparison were as follows:
  • The standard deviation of the final charging voltages measured across all 15 cells within the battery module, which indicates the degree of voltage imbalance among the cells at the end of the charging process.
  • The average charge throughput of the 15 cells during the charging process, representing the total amount of charge each cell accepted on average, used as a measure of the overall charging performance and uniformity within the module.
  • The difference between the maximum and minimum cell capacities within each cluster reflects the internal consistency of the clustering method in grouping cells with similar energy storage capabilities.
  • The Coulombic efficiency of the battery module, calculated as the ratio of discharge capacity to charge capacity, used to evaluate how effectively the input electrical energy is converted into usable output energy.
These indicators compare the performance across modules formed via SVC-, K-means-, and GMM-based clustering. Each parameter was extracted from the simulation models to evaluate if the inconsistency of the LIBs group is reduced, and if the performance of the groups has been improved.
In this simulation, a 15S2P battery pack underwent a 1.5 A constant-current–constant-voltage (CCCV) charge. The final cell voltages within a group were then monitored, and the standard deviation of these final charge voltages were calculated to gauge the voltage spread across the cells in one group. As shown in Figure 10, comparing the random grouping, SVC, K-Means, and GMMs methods, it is clear that the GMM-based module exhibits the lowest voltage spread. Specifically, clusters B and D achieve standard deviations of 0.01320 and 0.01133, respectively.
In addition to analyzing the final voltage dispersion, we also examined the average charge throughput, which can be seen in the bar graph on Figure 11. As all cells in a series string share the same current, the throughput primarily reflects how uniformly the cells accept charge over the duration of the charging period. In our simulation, we observed that modules clustered via GMMs exhibit more consistent charge throughput, indicating fewer outlier cells that could reduce the overall capacity utilization, as group A and B have an average charge throughput higher than 1 Ah. Conversely, random grouping or other clustering approaches tend to include cells with disproportionately lower or higher acceptance rates, which ultimately lowers the group’s average throughput. By aligning cells of similar condition, GMMs ensure identical charging behavior, thereby optimizing the total charge delivered to the battery pack and enhancing the reliability of second-life energy storage modules.
Another key metric in analysis is the delta capacity in Figure 12, which is the difference between the highest-capacity cell and the lowest-capacity cell within each group. A smaller delta capacity indicates that the cells are more closely matched, thereby minimizing the risk of overcharging or over-discharging the weaker cells during operation. The GMM-based clusters showed notably lower delta capacity values compared to both random groupings and the two other clustering methods.
Finally, we evaluated coulombic efficiency, which measures how effectively each module converts the charged energy into usable discharge energy. The 15S2P configurations derived from GMM-based clustering demonstrated a consistently higher coulombic efficiency of 98.18% relative to other methods, as shown in Figure 13, underscoring more uniform energy utilization and reduced internal losses. Cells with similar capacity and internal resistance avoid mismatch-driven inefficiencies that often occur in packs assembled through less-adaptive clustering approaches.
Overall, these findings demonstrate that the GMM-based clustering approach provides more balanced and efficient battery modules than random grouping, SVC, or K-means. The minimized voltage spread indicates tighter cell matching, while the higher average charge throughput reflects a more uniform charge acceptance across the entire module. In tandem, the reduced delta capacity confirms that the GMM consistently groups cells of similar health, mitigating the risk of over-stressing weaker cells. Finally, the notable coulombic efficiency gain underscores the method’s ability to limit internal losses and capitalize on the available capacity. Taken together, these improvements validate GMMs as a robust clustering strategy for assembling second-life lithium-ion cells into reliable and long-lasting battery packs.

5. Conclusions

In conclusion, this work demonstrates that effectively grouping second-life lithium-ion cells into standardized 48 V modules can significantly enhance their performance in residential ESS applications. By comparing random grouping, equal-number SVC, K-means, and a K-means++-initialized equal-number GMM, we reveal that the soft clustering paradigm of GMMs provides the most balanced clusters. Simulation results from a 15S2P configuration confirm reduced voltage deviations, higher average charge throughput, lower delta capacity, and improved coulombic efficiency for GMM-based modules. The K-means++ initialization assists in mitigating local optima, while the post-processing step ensures each group contains an equal number of cells. These findings underscore the potential of GMMs to accommodate subtle variations in retired EV batteries, extending their useful life and stability in second-life applications. Future work could involve expanding the proposed approach to larger-scale battery packs and validating the technique in field demonstrations to further establish its robustness and versatility.

Author Contributions

Conceptualization, A.S.A. and W.C.; Methodology, A.S.A.; Software, A.S.A.; Validation, A.S.A.; Formal analysis, A.S.A.; Investigation, A.S.A. and W.C.; Resources, W.C.; Writing—original draft, A.S.A.; Writing—review & editing, A.S.A.; Supervision, W.C.; Project administration, W.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The raw data supporting the conclusions of this article will be made available by the authors on request.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Xu, J.; Cai, X.; Cai, S.; Shao, Y.; Hu, C.; Lu, S.; Ding, S. High-energy lithium-ion batteries: Recent progress and a promising future in applications. Energy Environ. Mater. 2023, 6, e12450. [Google Scholar] [CrossRef]
  2. Ramoni; Olalekan, M.; Zhang, H.-C. End-of-life (EOL) issues and options for electric vehicle batteries. Clean Technol. Environ. Policy 2013, 15, 881–891. [Google Scholar] [CrossRef]
  3. Jena, A.; Borra, V.L.; Saida, S.; Venkatesan, P.; Önal, M.A.R.; Borra, C.R. Synergistic hydrometallurgical recycling of Li-ion battery cathode active material, anode copper and waste SmCo magnets. Sustain. Mater. Technol. 2025, 43, e01284. [Google Scholar] [CrossRef]
  4. Qiao, D.; Wang, G.; Gao, T.; Wen, B.; Dai, T. Potential impact of the end-of-life batteries recycling of electric vehicles on lithium demand in China: 2010–2050. Sci. Total Environ. 2021, 764, 142835. [Google Scholar] [CrossRef]
  5. Sarker, M.T.; Haram, M.H.S.M.; Shern, S.J.; Ramasamy, G.; Al Farid, F. Second-life electric vehicle batteries for home photovoltaic systems: Transforming energy storage and sustainability. Energies 2024, 17, 2345. [Google Scholar] [CrossRef]
  6. Mulleriyawage, U.G.; Shen, W. A review of battery energy storage systems for residential DC microgrids and their economical comparisons. DEStech Trans. Environ. Energy Earth Sci. 2019, 3, 1–26. [Google Scholar] [CrossRef]
  7. Ya, L.V. Energy Storage System (ESS) in Residential Applications. In Energy Storage Applications in Power Systems; IntechOpen: London, UK, 2023. [Google Scholar]
  8. Heymans, C.; Walker, S.B.; Young, S.B.; Fowler, M. Economic analysis of second use electric vehicle batteries for residential energy storage and load-levelling. Energy Policy 2014, 71, 22–30. [Google Scholar] [CrossRef]
  9. Duong, V.T.; Waheed, Z.; Choi, W. Non-Isolated Ultra-High Step-Up DC-DC Converter Topology Using Coupled-Inductor Based Inverting Buck-Boost and Voltage Multipliers. Preprints 2025, 2025052240. [Google Scholar] [CrossRef]
  10. Santos, D.; Fonte, P.M.; Pereira, R.; Barata, F.A.; Almeida, P.; Cordeiro, A.; Luís, R.; Pires, V. ESS Design and Management considering Solar PV to fed off-grid EV Charger. In Proceedings of the 2024 12th International Conference on Smart Grid (icSmartGrid), Setubal, Portugal, 27–29 May 2024; pp. 636–641. [Google Scholar] [CrossRef]
  11. Jeong, H.J.; Kwon, M.H.; Choi, S.W. A 48V-400V Non-isolated Bidirectional Soft-switching DC-DC Converter for Residential ESS. Trans. Korean Inst. Power Electron. 2018, 23, 190–198. [Google Scholar]
  12. Shahjalal, M.; Roy, P.K.; Shams, T.; Fly, A.; Chowdhury, J.I.; Ahmed, M.R.; Liu, K. A review on second life of Li-ion batteries: Prospects, challenges, and issues. Energy 2022, 241, 122881. [Google Scholar] [CrossRef]
  13. Li, W.; Chen, S.; Peng, X.; Xiao, M.; Gao, L.; Garg, A.; Bao, N. A comprehensive approach for the clustering of similar-performance cells for the design of a lithium-ion battery module for electric vehicles. Engineering 2019, 5, 795–802. [Google Scholar] [CrossRef]
  14. Jiang, Y.; Wang, Y.; Zhang, C.; Su, G.; Liu, J. Research on group methods of second-use Li-ion batteries based on k-means clustering model. In Proceedings of the 2014 IEEE Conference and Expo Transportation Electrification Asia-Pacific (ITEC Asia-Pacific), Beijing, China, 31 August–3 September 2014; pp. 1–6. [Google Scholar]
  15. Wu, S.; Chow, T.W. Clustering of the self-organizing map using a clustering validity index based on inter-cluster and intra-cluster density. Pattern Recognit. 2004, 37, 175–188. [Google Scholar] [CrossRef]
  16. He, X.; Cai, D.; Shao, Y.; Bao, H.; Han, J. Laplacian Regularized Gaussian Mixture Model for Data Clustering. IEEE Trans. Knowl. Data Eng. 2011, 23, 1406–1418. [Google Scholar] [CrossRef]
  17. Lai, X.; Deng, C.; Tang, X.; Gao, F.; Han, X.; Zheng, Y. Soft clustering of retired lithium-ion batteries for the secondary utilization using Gaussian mixture model based on electrochemical impedance spectroscopy. J. Clean. Prod. 2022, 339, 130786. [Google Scholar] [CrossRef]
  18. Severson, K.A.; Attia, P.M.; Jin, N.; Perkins, N.; Jiang, B.; Yang, Z.; Chen, M.H.; Aykol, M.; Herring, P.K.; Fraggedakis, D.; et al. Data-driven prediction of battery cycle life before capacity degradation. Nat. Energy 2019, 4, 383–391. [Google Scholar] [CrossRef]
  19. Li, C.; Wang, N.; Li, W.; Li, Y.; Zhang, J. Regrouping and Echelon Utilization of Retired Lithium-Ion Batteries Based on a Novel Support Vector Clustering Approach. IEEE Trans. Transp. Electrif. 2022, 8, 3648–3658. [Google Scholar] [CrossRef]
  20. Zhao, S.; Wu, F.; Yang, L.; Gao, L.; Burke, A.F. A measurement method for determination of dc internal resistance of batteries and supercapacitors. Electrochem. Commun. 2010, 12, 242–245. [Google Scholar] [CrossRef]
  21. How to Measure the Remaining Useful Life of a Battery. Battery University. Available online: https://batteryuniversity.com/article/bu-901b-how-to-measure-the-remaining-useful-life-of-a-battery (accessed on 17 March 2025).
  22. Dalatu, P.I.; Midi, H. New approaches to normalization techniques to enhance K-means clustering algorithm. Malays. J. Math. Sci. 2020, 14, 41–62. [Google Scholar]
  23. Yuan, C.; Yang, H. Research on K-Value Selection Method of K-Means Clustering Algorithm. J 2019, 2, 226–235. [Google Scholar] [CrossRef]
  24. Song, R.; Pang, F.; Jiang, H.; Zhu, H. A machine learning based method for constructing group profiles of university students. Heliyon 2024, 10, e29181. [Google Scholar] [CrossRef]
  25. Gaussian Mixture Model—Understanding and Implementation. Data Flair. Available online: https://data-flair.training/blogs/gaussian-mixture-model/ (accessed on 17 March 2025).
  26. Vassilvitskii, S.; Arthur, D. k-means++: The advantages of careful seeding. In Proceedings of the Eighteenth Annual ACM-SIAM Symposium on Discrete Algorithms, New Orleans, LA, USA, 7–9 January 2007; Society for Industrial and Applied Mathematics: Philadelphia, PA, USA, 2006; pp. 1027–1035. [Google Scholar]
  27. Zhang, T.; Kuo, C.C.J. Sound Effects Classification and Retrieval. In Content-Based Audio Classification and Retrieval for Audiovisual Data Parsing; The Springer International Series in Engineering and Computer Science; Springer: Boston, MA, USA, 2021; Volume 606. [Google Scholar] [CrossRef]
  28. Falah, N.; Falah, N.; Solis-Guzman, J.; Marrero, M. An Indicator-Based Framework of Circular Cities Focused on Sustainability Dimensions and Sustainable Development Goal 11 Obtained Using Machine Learning and Text Analytics. Sustain. Cities Soc. 2025, 121, 106219. [Google Scholar] [CrossRef]
  29. Sasithradevi, A.; Perumal, D.A.; Persiya, J. Infrared Perspectives: Computing laptop energy dissipation via thermal imaging and the Stefan-Boltzmann equation. Therm. Sci. Eng. Prog. 2024, 53, 102742. [Google Scholar] [CrossRef]
Figure 1. Relationship between capacity and voltage.
Figure 1. Relationship between capacity and voltage.
Applsci 15 06787 g001
Figure 2. Concept of internal resistance measurement by discharge pulse.
Figure 2. Concept of internal resistance measurement by discharge pulse.
Applsci 15 06787 g002
Figure 3. Battery capacity degradation and RUL approximation.
Figure 3. Battery capacity degradation and RUL approximation.
Applsci 15 06787 g003
Figure 4. Support Vector Clustering results.
Figure 4. Support Vector Clustering results.
Applsci 15 06787 g004
Figure 5. K-Means clustering results.
Figure 5. K-Means clustering results.
Applsci 15 06787 g005
Figure 6. GMM Clustering results.
Figure 6. GMM Clustering results.
Applsci 15 06787 g006
Figure 7. Comparison of evaluation indices for the clustering methods: (a) Silhouette Coefficient; (b) Davies–Bouldin score; and (c) Calinski–Harabasz score.
Figure 7. Comparison of evaluation indices for the clustering methods: (a) Silhouette Coefficient; (b) Davies–Bouldin score; and (c) Calinski–Harabasz score.
Applsci 15 06787 g007
Figure 8. Simulation block diagram.
Figure 8. Simulation block diagram.
Applsci 15 06787 g008
Figure 9. Voltage, current, and SOC waveforms of 15s2p simulation for CCCV of regrouped cells.
Figure 9. Voltage, current, and SOC waveforms of 15s2p simulation for CCCV of regrouped cells.
Applsci 15 06787 g009
Figure 10. Standard deviation of final cell voltages of each module across different clustering methods.
Figure 10. Standard deviation of final cell voltages of each module across different clustering methods.
Applsci 15 06787 g010
Figure 11. Average charge throughput of each module across different clustering methods.
Figure 11. Average charge throughput of each module across different clustering methods.
Applsci 15 06787 g011
Figure 12. Delta capacity of each module across different clustering methods.
Figure 12. Delta capacity of each module across different clustering methods.
Applsci 15 06787 g012
Figure 13. Coulombic efficiency of each module across different clustering methods.
Figure 13. Coulombic efficiency of each module across different clustering methods.
Applsci 15 06787 g013
Table 1. Specifications and cycling conditions of the battery cell.
Table 1. Specifications and cycling conditions of the battery cell.
ParameterValue
Nominal Capacity1.1 Ah
Nominal Voltage3.3 V
Recommended standard charge method1.5 A to 3.6 V (CCCV)
Recommended charge and cut-off voltage at 25°3.6 to 2.0 V
Operating temperature range−30~60 °C
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Akram, A.S.; Choi, W. Performance Enhancement of Second-Life Lithium-Ion Batteries Based on Gaussian Mixture Model Clustering and Simulation-Based Evaluation for Energy Storage System Applications. Appl. Sci. 2025, 15, 6787. https://doi.org/10.3390/app15126787

AMA Style

Akram AS, Choi W. Performance Enhancement of Second-Life Lithium-Ion Batteries Based on Gaussian Mixture Model Clustering and Simulation-Based Evaluation for Energy Storage System Applications. Applied Sciences. 2025; 15(12):6787. https://doi.org/10.3390/app15126787

Chicago/Turabian Style

Akram, Abdul Shakoor, and Woojin Choi. 2025. "Performance Enhancement of Second-Life Lithium-Ion Batteries Based on Gaussian Mixture Model Clustering and Simulation-Based Evaluation for Energy Storage System Applications" Applied Sciences 15, no. 12: 6787. https://doi.org/10.3390/app15126787

APA Style

Akram, A. S., & Choi, W. (2025). Performance Enhancement of Second-Life Lithium-Ion Batteries Based on Gaussian Mixture Model Clustering and Simulation-Based Evaluation for Energy Storage System Applications. Applied Sciences, 15(12), 6787. https://doi.org/10.3390/app15126787

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop