1. Introduction
The global ambition to tackle climate change has led to a renewed focus on the transportation sector [
1]. Further, on national levels, countries have developed goals to achieve the overall reduction targets, crafting laws and regulations to make their ambition binding [
2]. Within the transportation sector, road-based transportation accounts for the majority of greenhouse gas emissions; at the same time, its decarbonization lags behind [
3,
4,
5]. Crafting effective and efficient polices is promising to reduce Greenhouse Gas (GHG)-emissions in the long run; however, it requires a profound understanding of the holistic transportation ecosystem [
6,
7,
8,
9]. As passenger car fleets consist of a wide range of different vehicles, clustering them into manageable subgroups is used to simplify the analysis and communication of findings to the public and parties of interest. Necessarily, the feasibility and precision of current approaches are critical to the validity of the derived findings. Current methods often rely heavily on predefined categories, such as vehicle class or chassis type, which can introduce inconsistencies and variations in key parameters across vehicles within the same segment. These deviations hinder the comparability of vehicles and limit the effectiveness of segmentation for certain analytical tasks. To illustrate this, the Ferrari Purosangue, the Toyota Aygo X, and the Mercedes-Benz Maybach GLS 600 are all classified as SUVs, even though their technical characteristics differ significantly (e.g., the Aygo has a curb weight of 1015 kg and the Maybach GLS a curb weight of 2825 kg). Using weight as a proxy for consumption, which is an important characteristic regarding their environmental performance, the practical usability is questionable.
In this paper, we propose a novel, data-driven approach that focuses exclusively on the technical properties of vehicles for segmentation and classification. Our methodology is built upon a dataset from the ADAC database, covering vehicle specifications from the years 2019 to 2025. By employing both raw and engineered features, we perform clustering on the data to uncover inherent groupings that are statistically more comparable than traditional vehicle categories. This approach allows for the identification of more consistent vehicle segments, free from the biases introduced by conventional segmentation keys.
To achieve this, we first perform data merging and preprocessing steps, followed by feature selection and engineering to enhance the predictive power of the dataset. The resulting feature sets are fed into clustering algorithms, which generate novel vehicle clusters based solely on technical characteristics. These clusters not only improve statistical comparability but also provide valuable insights for future vehicle categorization, especially in contexts where technical performance is the primary consideration.
Our work contributes to the growing body of research in data-driven vehicle market research, offering a new framework that can be adopted by academic researchers, commercial organizations, and public authorities seeking to adapt to the changed reality of the vehicle market. The remainder of this paper is organized as follows:
Section 2 reviews related work in the field,
Section 3 describes the dataset and methodology in detail,
Section 4 presents the results of our clustering experiments, and
Section 5 discusses the implications of our findings.
2. State of the Art
Vehicles can be categorized in different ways. On the highest level, the vehicle classes N, M, L, etc., are set according to Regulation 168/2013 and Directive 2007/46/EC by the European Union [
10,
11]. Within the M class (passenger vehicles), it is further differentiated into M1, M2, and M3, while M1 is of special interest for this study. M2 (>8 passengers, ≤5
) and M3 (>8 passengers, >5
) are types of buses out of this publication’s scope. Besides the classic M1 segment, M1G describes vehicles that are dedicated to off-road use. Such vehicles need to be approved by 2007/46/EG or 2018/858/EG and are listed as M1G [
10,
12]. Without this approval, vehicles with “off-road characteristics” are usually labeled as Sport Utility Vehicle (SUV). Older vehicles (before 1990) as well as vehicles without proper FIN are listed as motorhomes or others. Regarding passenger cars, which are commonly used for individual motorized mobility and everyday use, there are several subsections. The European Commission differentiates them based on the court decision between Hyundai and Kia in 1999 [
13]. With this, the first differentiation was set, dividing the passenger vehicle market into nine segments (A: mini cars, B: small cars, C: medium cars, D: large cars, E: executive cars, F: luxury cars, S: sport coupés, M: multi-purpose cars, and J: sport utility cars (including off-road vehicles)) [
13].
As one of the biggest lobby groups regarding vehicles in the EU, the European Automobile Manufacturers’ Association (ACEA) also uses certain segmentation for its communication. Their classification is similar to the one introduced by the EU in 1999. However, it does not cover sport coupés and refers to SUV rather than sport utility cars [
14]. The EURO NCAP classification uses different classes, distinguishing between family cars, SUV, MPVs, Vans, and some more, all in different sizes from small to large [
15]. The ICCT mentioned their classes in a report regarding a Brazilian vehicle study. In this, they state eight different passenger car classes (subcompact, compact, medium, large, sport, off-road, SUV, and minivan) [
16] Other than ACEA and the EURO NCAP, they provided the technical characteristics used to classify vehicles. For this purpose, they use the system power in
, the power-to-curb-mass ratio, the area of the vehicle, and auxiliary information (e.g., off-road vehicles need to have an all-wheel drive function, minivans need to provide between six and twelve seats). The SUV definition is borrowed from a classification according to the Brazilian automotive marketing company ADK [
16].
In Germany, the Kraftfahrt-Bundesamt (KBA) is the official institution covering topics across the transportation sector. For passenger cars, they differentiate 13 segments: minis, small cars, compact cars, mid-class, upper-mid class, upper class, SUV, off-road vehicles, sports cars, minivans, large vans, utilities, and mobile homes [
17]. These segments are used for enhanced comparability, using optical, technical, and market-specific characteristics, which are not necessarily included in type certification documents. Further, according to the KBA,
vehicles can be divided by certain chassis types such as sedan, hatchback, station wagon, coupé, convertible sedan, commercial vehicle, and passenger car pickup. Moreover, special-purpose vehicles are identified, such as mobile homes, ambulances, emergency vehicles, hearses, bulletproof vehicles, wheelchair accessible vehicles, and others. Finally, it is differentiated whether a vehicle has an open or closed chassis, as a necessary characteristic for a convertible car [
18]. Staying in Germany, the Allgemeiner Deutscher Automobil-Club (ADAC) represents one of the most important German automotive institutions. Contrary to public authorities, the ADAC is a private automobile association, representing the interests of its members [
19]. Similar to the KBA, the ADAC defines a segmentation of passenger cars; however, the segments and chassis variants differ from those the KBA uses. According to an e-mail message from the ADAC, seven different segment types are covered: micro cars, mini cars, small cars, lower middle class, middle class, upper middle class, and upper class. Further, the chassis variants are divided into 15 different types: hatchbacks, notchbacks, station wagons, coupés, convertibles, roadsters, vans, SUV, off-road vehicles, high-roof station wagons, small transporters, transporters, pickups, buses, and motorhomes [
20]. For the classification parameters, such as length, width, height, wheelbase, average engine power, and corresponding basic version purchase prices are used. These criteria are compared with the average values of the corresponding chassis form with the existing segments in Germany. This decision is not influenced by third parties, according to a representative from the ADAC. Lastly, they state a dynamic classification always with respect to the actual available vehicle alternative on the current market.
Related Work
Using vehicle classes as segmentation is commonly used in studies covering fleets of vehicles. While some challenge the classes themselves, they are usually taken as they are and used to evaluate certain parts of a fleet. In the following, we focus on work that targets the classification method itself or proposes a novel approach to cluster existing vehicles. Vehicle and automotive market segmentation has been carried out using technical parameters [
21,
22,
23,
24,
25,
26,
27,
28,
29,
30] and user behavior [
27,
31,
32,
33], using data-driven approaches such as unsupervised learning models, often benchmarked their results against expert groups.
Perr-Sauer et al. [
23] proposed a data-driven clustering approach, as they criticized the need for expert knowledge when crafting custom features for clustering. They combined fleet data with geodata and conducted a two-approach study evaluating hand-crafted features and automatically extracted features, stating comparable results. This indicates that a data-driven approach is potentially feasible for clustering vehicles. However, as their study focused on commercial vehicles driving in the United States, the transferability is questionable.
Niroomand et al. [
26] noted the need for a novel approach to segment vehicles. They introduced a Fuzzy C-means and a k-means approach, showing promising results regarding the performance of the classifiers. In another study, Niroomand et al. [
24] investigated a Semi-Supervised Fuzzy C-means approach to cluster passenger cars based on registration data of Switzerland. They evaluated their results against expert classification and found a better correlation compared with other unsupervised approaches. In 2022, Niroomand et al. [
25] used their previously mentioned approach to analyze inter- and intra-class deviations of known vehicle segmentations with respect to
-emissions of the investigated vehicles. They found that within known classes,
-emission shows especially high variance, and further stated a potential to effectively reduce emissions by taking these variabilities into account. While all three studies show the need for novel segmentation approaches and partially mention ideas towards this (e.g., dividing vehicle classes into SUV and non-SUV), a concrete suggestion is not found in their studies. Nevertheless, they show the potential of data-driven approaches to tackle fuzzy state-of-the-art classifications of the current passenger car fleet. In 2024, Niroomand and Bach [
29] used their proposed Semi-Supervised Fuzzy C-mean approach to predict average mileage values and challenged their results with a polynomial classifier. Finally, they followed up with a study extending their work with machine learning approaches, namely a Deep Fuzzy C-means model, a Support Vector Regression, and a Random Forest approach. They used their models to show their feasibility to predict engine performance as well as the environmental impact. Again, they stressed the need for adapting the state-of-the-art classification of passenger cars [
30]. Vaiti et al. [
28] used historical data collected via the OBD-II interface of vehicles in Canada. They propose five different vehicle classes reflecting different environmental performances. While they identify clusters and compare them to the original vehicle segments, they are limited by six features, missing potentially important features such as size and weight dimensions. Yet, they focus on Internal Combustion Engine (ICE) related features, such as cylinder size and fuel type (regular or premium gasoline, and diesel), neglecting electrified vehicles like BEV or PHEV. Nazari et al. [
27] reviewed clustering strategies for electric vehicles, extending their scope to include infrastructure, user behavior, and battery characteristics. Their results underline the need for further research on the promising use of clustering approaches to segment vehicles, especially in the context of electric vehicles. Ardiansyah et al. [
33] used k-means clustering for a market segmentation study for Wuling vehicles in Indonesia. They used survey data to classify different customer groups, showing another use case utilizing a data-driven approach to obtain valuable insights in the automotive world. Elser et al. [
22] used Fuzzy C-means clustering on PCA-reduced technical features to segment the Swiss vehicle fleet from 1995 to 2022. Their results show shifting vehicle characteristics over time and highlight the limitations of static classifications, supporting the motivation for our work. They demonstrate this with a fixed number of clusters. To elaborate on that, we investigate the influence of the number of clusters on separation. In a previous work, Schockenhoff et al. [
31,
32] proposed their customer-oriented concept assessment tool to utilize certain technical parameters to create user-related features. While they provide a user-friendly approach to compare multiple passenger cars, they neither take them into perspective with existing vehicle classes nor do they extend their concept to develop a novel segmentation approach.
Summarizing the mentioned prior studies, we have identified several limitations. First, we notice a high reliance on state-of-the-art segmentations, which do not reflect the changing market conditions, especially regarding the increasing numbers of electric and hybrid-electric vehicles [
21,
22,
23,
24,
25,
26,
27,
28,
29,
30]. Second, the transferability to the German market is questionable, as the studies relate not to passenger cars [
23] or other areas of investigation [
22,
23,
28,
33]. Lastly, while new approaches are presented, especially by Niroomand and Elster [
22,
24,
25,
26,
29,
30], they used a solely data-driven approach, missing the need for understandability and comprehensiveness of non-experts in order to make the proposed cluster usable.
In this article, we examine the current vehicle classes regarding their usefulness in differentiating between passenger cars. This study uses merged fleet data from the KBA with technical data from the ADAC database. Together with data-driven approaches to preprocess and later cluster the vehicle fleet we use, we compare two clustering methods with two feature selections with different cluster sizes. Raw and engineered features are extracted and used for a k-means clustering and a Ward clustering. With both approaches, we can obtain novel vehicle classes challenging the status quo, addressing the practical implementation currently missing in the field of research. We seek to contribute to the understanding of current classification weaknesses, proposing a novel approach that improves the clustering performance. Using multiple approaches, we redefine current vehicle clusters to obtain more meaningful and differentiable segments. By respecting the rise of electrified vehicles, we tend to eliminate the existing fuzziness of segments defined by out-of-date approaches. Moreover, this work proposes a framework for developing clusters using data-driven approaches and presents reasonable clusters. We show clearly how this approach can be used for different applications depending on the use case. Applying this to fleet-level data bridges gaps in current research and provides valuable insights for institutions utilizing vehicle classes.
The main contributions of this paper can be summarized as follows:
Illustration of the current vehicle classification performance
Using Principal Component (PC)-analysis and silhouette scores to determine the fuzziness of the state-of-the-art classification.
Presentation of a novel segmentation approach
Demonstration of a data-driven methodology utilizing raw and engineered features.
Evaluation of the identified cluster performance
Demonstration of the efficacy of the proposed methodology for the passenger car fleet in Germany.
5. Results
To start out,
Figure 4 presents a comprehensive assessment of the status quo of the classification into vehicle segments as documented in [
20]. The left plot presents the vehicle segments in the principal component space (All plots of this type are available as interactive plots on
https://tumftm.github.io/passenger-vehicle-clustering/ (accessed on 17 September 2025), along with the vehicle models associated, and we highly encourage exploring them for a better understanding of the matter): The first PC on the x-axis explains
% of the variance, while the second PC is on the y-axis and contributes another
%. The data is projected into the PC1/2 space, as this provides the most informative 2D representation for the purposes of this paper.
Each dot represents a vehicle model, and each ellipse indicates the 95 percent confidence region of the cluster covariance in the PC-plane. As
Figure A3 in the appendix demonstrates, PC1 is strongly aligned with features like towing load, curb weight, or fuel consumption. PC2 aligns more strongly with the number of doors, the number of seats, or the vehicle height. Consequently, micro, small, and mini cars are mostly on the left, to the lower-left side of the plane, while higher segments are mostly oriented towards the right. The variance within clusters of higher segments is generally higher, as e.g., sports cars, SUV, and high-priced vans are all within the luxury segment, but are scattered widely across the PC-plane.
The right plot visualizes the silhouette values for the representatives of each cluster as box plots. The low separation quality becomes obvious, as 2 clusters have an average silhouette near zero (mid-size class, upper mid-size class), and luxury cars have a negative silhouette on average. To put this into perspective, more than 75% of luxury cars are more similar to other segments than to other luxury cars, similar to the sports version of a sedan being more similar to a sedan than an expensive SUV.
The goal of this article is to find a cluster partition that demonstrates better silhouettes quantitatively and good semantic similarity within a cluster qualitatively. For this purpose, the clustering methods presented in
Section 4 are evaluated for different numbers of clusters in
Figure 5. Here, the state of the art is represented by the dashed lines, indicating the silhouette of vehicle segments and vehicle chassis types evaluated in the aggregated feature space.
First and foremost, both clustering methods, k-means and Ward, on both feature sets, raw and aggregated, perform superior to the state of the art—for all numbers of clusters, our methods exhibit a higher silhouette score than the partition into segments or chassis types. The silhouette reach of the k-means and Ward clustering results reaches up to for two clusters and declines slightly with increasing number of clusters. A higher number of clusters generally allows for more specific labeling of the subgroups, yet a lower number of clusters increases separation. To select a specific value for further analysis, we opt for the compromise for two reasons: Firstly, it is equal to the number of vehicle segments, and thus maximizes comparability with the state of the art. Secondly, the k-means algorithm on aggregated features exhibits a local maximum in its silhouette here, and thus hints at an inherent partition into seven groups within the dataset.
Within the raw feature set, Ward performs superior to k-means, while k-means outperforms Ward on the aggregated feature set. We thus select both winners for closer examination in
Section 5.1 and
Section 5.2, but the other two results are provided in the appendix. Notably, the clusters are not related to any existing classification of vehicles anymore: to provide a descriptive and memorable working title for those clusterings, we fed 10 randomly drawn vehicles to ChatGPT version 4o. The prompt template used is documented in the appendix.
5.1. Clustering Using Raw Features
Starting with raw features listed in
Figure A1, the Ward approach outperforms k-means with a silhouette score of around
compared with
at
.
Figure 6 shows the clustering result in the PC plane in the left panel and the silhouette scores in the right. Now, all but one cluster exhibit silhouettes that are positive on average. The worst separation is observed for the
All-Terrain Family Utility type (e.g., Land Rover Defender 90, Jeep Wrangler, but also VW T5 Multivan, and Peugeot Partner Kombi), which has a close proximity to
Modern Commuter Workhorses and
Refined Executives for Professional Mobility.
Modern Commuter Workhorses holds vehicles such as a Honda Civic, Opel Zafira, and Audi A6 Avant, while
Refined Executives for Professional Mobility covers cars such as Skoda Superb Combi and BMW 335i, but also a Cadillac Escalade, which is more of an outlier than a centered data point in this cluster. At first sight, these clusters are semantically similar to each other and the ellipses overlap; however, their centers are different on both PC axes.
The cluster for
Elite Roadsters & Performance Icons and
Aging Essentials with Everyday Appeal shows the highest average silhouette scores around
. This is plausible, as their location in the PCA space is also at the lower right and left edges, with a clearer distinction from other clusters. The semantic name for the
Elite Roadsters & Performance Icons cluster makes sense when inspecting the individuals of it, mainly consisting of Porsche, Jaguar, and Mercedes-Benz AMG vehicles.
Aging Essentials with Everyday Appeal contains rather small and compact vehicles such as a FIAT Panda 4×4, VW Golf, and Dacia Duster. The cluster with the highest variance is
Luxury Performance Titans, as the majority of its entities are high-priced vehicles with much engine power. On the bottom part, similar vehicles compared with the
Elite Roadsters & Performance Icons can be found (e.g., Porsche Panamera, Mercedes-Benz AMG SL 63). At the upper part of the cluster, bigger vehicles are located (e.g., Toyota Land Cruiser, Ineos Grenadier, and Mercedes-Benz G280 CDI).
Multi-Purpose Transport Shuttles is located at the upper section of the PCplane, which aligns with the height characteristics of such vehicles (e.g., VW T4 Kombi, Mercedes-Benz Sprinter, or Citroen Jumpy Kombi) and the PC2 axis. In general, the clusters are separated quite well, apart from
All-Terrain Family Utility. Comparing the clustering structure to the k-means results visible in
Figure A4, the clusters are clearly distinguishable from each other, and the low-performing
Executive and Utility Movers cluster is separated into two clusters.
5.2. Clustering Using Engineered Features
Utilizing the engineered features, described in
Table 2, we compute the clusters using k-means and Ward again, yet evaluate the silhouette on the engineered feature space, too. In this context, k-means shows better performance than the Ward approach, with a silhouette score of
vs.
at
. These values indicate meaningful structure in the data, when compared with legacy classifications, which often yield silhouette scores close to 0, suggesting little to no separation in the technical feature space. The PC plane remains the same for comparability, but the larger variance within clusters in the plot hints at the fact that the principal components of the raw feature space do not reflect the engineered feature space ideally anymore. Yet we maintain the visualization to assure comparability with
Figure 6. This is the reason that even clusters spanning the whole PC plane (
Next-Gen Electric Pioneers) can maintain a good separation, as shown in
Figure 7.
The Executive & Shuttle Utility Fleet has a high variance and a slightly negative average silhouette, but all other clusters are positive. Going even further, this is the first approach that ensures that 5 out of seven clusters even have non-negative lower quartiles. Analyzing single vehicles in this cluster, this variance can be explained as on the PC1 axis, almost all vehicles with a value of more than 5 are within this cluster. This area covers especially utility vehicles, e.g., VW Crafter, Mercedes-Benz Sprinter, and Ford Transit. On the lower right side, more SUV-like vehicles are located, e.g., BMW X7, Mercedes-Benz G350, and Land Rover Discovery. Centered vehicles within this cluster are a mixture of larger vehicles in general, e.g., VW Sharan, Audi Q5, and Opel Insignia. Interestingly, the Next-Gen Electric Pioneers has the highest distinction. Inspecting the distribution of the four engineered features, we observe that the ecological factor is significantly higher than in other clusters, with a mean value of . Further, the economic factor is above average with a mean of . Both seem plausible as BEV are superior in terms of emissions and energy efficiency, which also supports the economic performance. Moreover, the clustering algorithm found a cluster consisting only of BEV on its own without utilizing a discrete variable for this. We assume a mix of the low ecological and economic factors is responsible for this cluster. With only one outlier, a VW e-Crafter, the cluster is well described. The overall spread is more along the PC2 axis, spanning from smaller electric vehicles such as the Smart ForTwo Cabrio EQ up to large BEV such as the BMW iX M60 or the Tesla Model X Plaid.
Finally, to mention,
Figure A5 has a lower silhouette score compared with the k-means approach; however, the silhouette scores from all clusters are positive, but the average values are lower. This does not necessarily mean that the clusters are less useful; rather, it indicates the need for a concise metric to evaluate the clustering performance for a real-world problem.
6. Discussion and Summary
This study shows the high variance of the state-of-the-art segmentation of the German passenger car landscape and proposes a novel segmentation approach. We combine the publicly accessible database from the KBA with the ADAC, creating a database that holds fleet size data along with technical characteristics for each vehicle. The HSN/TSN keys are used to map vehicles, allowing for easy and reproducible preprocessing. Two variants of features are derived, once a carefully selected subset of the raw features, and once engineered features that combine certain technical characteristics into more abstract features (see
Section 3). First, the status quo is evaluated using the vehicle class and chassis type as clusters. Finally, utilizing the raw and aggregated features, two clustering algorithms are applied to create new clusters. For raw features, the Wardalgorithm performed better, and for the aggregated feature set, k-means performed better.
With the combination of both datasets at the beginning of the study, we encountered issues regarding the data congruence of both datasets. Within the ADAC database, the HSN/TSN key is not unique, but multiple trims of a model hold the identical key. We thus only included the basic trim of the vehicles in question; however, we lose information at this point. As the KBA database does not hold more specific details regarding certain variants, this assumption is necessary, and the information loss is inevitable. However, putting the number of appearances into account, we find that the overall coverage is good, as our dataset holds
million vehicles. The KBA states that in 2024
million passenger cars were registered in Germany, which indicates a coverage rate of around 87% [
38].
In terms of preprocessing, we adjusted and imputed some of the technical data, sometimes to allow for better comparability over time, such as the adjusted price values, and sometimes to allow for better comparability across different fuel types, such as the adjusted consumption values. Even though making plausible and valid assumptions regarding correction factors and inflation rates, adjusting these values introduces uncertainty to the dataset. It is yet to be evaluated how much the adjustment can contribute to reducing this and creating a fair and comparable dataset. We are aware of this circumstance and refer to future studies in this field.
The choice of clustering methods was determined through an assessment of dataset characteristics and task requirements. Given the need for hard assignments, noise-free labeling, interpretability, and no discretization, only k-means and Ward met all criteria. Both methods let us set the cluster count explicitly and require minimal additional hyperparameter tuning, making them ideal for this study. In evaluating both methods, k-means outperformed Ward on aggregated features, while Ward prevailed on the raw feature space. Feature engineering reduced dimensionality and noise, yielding more compact, convex clusters. In this setting, k-means likely outperforms Ward because its reliance on Euclidean distance implicitly assumes convex, isotropic cluster shapes. In contrast, the hierarchical agglomerative method Ward iteratively merges clusters based on variance minimization and can thus capture more complex relationships between samples. In the high-dimensional raw feature space, noisy, imbalanced, or hierarchically nested relationships are more likely to occur, which could be a reason why Ward generally excels in the raw feature space.
A strength of the method is the transition of the unsupervised learning problem (clustering) into a classification problem: The clustering results can be directly used to train a classifier that sorts newly launched vehicles into one of the seven newly conceived groups. A random forest classifier easily achieved a % F1-score in classifying the k-means results. We are thus confident that newly launched vehicles could be classified effectively.
Besides utilizing raw features, we attempted to create understandable engineered features, namely the usability, performance, ecological, and economic factors. The unique elements of these features are shown in
Table 2. The choice of the number of factors can be varied, as new factors can be added in this methodology. An alternation of the number of factors may have an influence on the clustering performance, possibly resulting in even more separated clusters. At the moment, conventional segmentation partially reflects the factors for usability (e.g., chassis types such as vans and SUV offer more trunk space, and usually more usable range), for performance (e.g., high-end sports cars usually are found in the luxury or upper-mid class), and for economic (e.g., luxury cars’ purchase price is significantly higher than for other classes). While some of the features are correlated with the state-of-the-art segmentation, we find that the variance within the cluster is high, drastically reducing the interpretability and informative value of such clusters. With the new aggregated features, the raw technical characteristics are directly used in a transparent manner, making it more comprehensible. For the weights of the single technical features within one aggregated feature, we started with an equal weighting. We believe that without further information, this assumption is valid. Depending on the aim of a potential follow-up study, these weights can be changed to achieve other results. The value
k, which is set to 5 for this study, was tuned in conjunction with the min and max values to achieve a distribution with minimum skewness. We are aware that using aggregated features may potentially lead to oversimplification. As, however, the current state of the art uses aggregated features as described in
Section 1, we decided to add an aggregated feature approach next to the raw data approach. While simplification could potentially disguise differences, it also helps for an easier understanding of why certain vehicles are located in a certain group. This results in a trade-off between completeness and understandability, and should be carefully handled when using the proposed approach for official statements.
To evaluate the performance of the proposed clustering, we used the silhouette score. As already mentioned in
Section 5, this can be helpful to understand the overall performance between the two approaches, but misses certain details, such as the silhouette score of single clusters. Moreover, we found that the size of the clusters varies. For the raw feature clustering, the biggest cluster holds 5869 individuals, and the smallest 520. For the clustering using aggregated features, the biggest cluster has 4132 different models, and the smallest 246. Introducing a penalty for the large deviation in cluster size can be an interesting improvement to obtain similar-sized clusters.
At the moment, the segmentation of vehicles is rather used as a statistical auxiliary variable than for policy measures or other real-world issues, and relies partially on technical but also on subjective characteristics. We observe that, especially, the term SUV but also other categories are referenced in the press and public discussions. We showed in
Section 5 that the variance within what is officially called a SUV is fuzzy and overlaps significantly with other segments, which reduces the explanatory power of such statements, potentially leading to wrong conclusions. This, together with the fact that the variance has grown over the last years, is why we strongly recommend updating the classification requirements to regain validity. Introducing vehicle classes that also take other features into account (e.g., ecological features) can also help to achieve more comprehensiveness, regardless of whether raw or aggregated features are used. An analysis of the datasets reveals that the average age of the models varies between 11 and 12—this means that the turnover in the fleets is gradual, and shifts in technology and market shares occur over long time frames. A reevaluation of the results after half the average age period is thus reasonable. Notably, the two most promising methods resulting from our research (k-means on aggregated features and Ward on raw features) exhibit different sensitivities to changes in the market. As hierarchical clustering (Ward’s method) does not consider the number of samples at all, it is more stable against changing market shares. The partitioning-based method (k-means), in turn, requires market shares to operate, so the temporal stability is reduced. For future research, the temporal stability of clusters would be a relevant research topic, yet it requires a sound database of vehicle models and ideally registrations dating back at least 15 to 20 years.
The influence of policies, customer behavior, and the differentiation between private and commercial purchases are promising future fields of investigation. Regarding future ecological development, a time-sensitive evaluation of the ecological performance of proposed clusters can potentially offer valuable insights. Incorporating, e.g., improving grid-mixes or V2X systems allows for the determination of, e.g., break-even points regarding environmental performance. Facing the mentioned limitations of the proposed method, the identified clusters are still ready to be utilized by institutions and authorities. Using the proposed method, the overall parameters are easily adaptable, and the composition and normalization of the aggregated features can be changed according to future use cases. We proved that our approach offers better distinction between vehicles and promises clearer communication for society and policymakers. As Europe can be seen as a nationally overarching market, we assume a high transferability of the proposed method. Whether the identified clusters are transferable is highly dependent on the composition of the passenger car fleet and should be investigated for other countries as well. The technical data from the ADAC can be used for this as the HSN/TSN tuples are standardized across the EU. Regarding markets such as the US and China, we assume the methodology to be usable; however, the clusters might not be and will be part of future studies.