Quantitative Emotional Salary and Talent Commitment in Universities: An Unsupervised Machine Learning Approach

Alonso-Sastre, Ana-Isabel; Pardo, Juan; Cortijo, Oscar; Falcó, Antonio

doi:10.3390/merits5020014

¹ Department of Human Resources, Universidad CEU Cardenal Herrera, CEU Universities, Carrer Assegadors, 2, 46115 Alfara del Patriarca (Valencia), Spain

² Department of Mathematics, Physics and Technology, Universidad CEU Cardenal Herrera, CEU Universities, San Bartolomé 55, 46115 Alfara del Patriarca (Valencia), Spain

* Authors to whom correspondence should be addressed.

Merits 2025, 5(2), 14; https://doi.org/10.3390/merits5020014

Submission received: 29 April 2025 / Revised: 3 June 2025 / Accepted: 4 June 2025 / Published: 13 June 2025

Abstract

In the world of academia, talented university professors are highly mobile and move frequently between institutions. This can be a major problem, as universities must retain a minimum level of talent to support their various academic programmes. Identifying the factors that could increase the loyalty of such staff is therefore of great interest to human resource (HR) departments and to the overall administrative management of an organisation. This area, known as People Analytics (PA), has become a powerful tool in human resource management for strategically addressing challenges in talent management. This paper examines talent commitment within the university environment, focusing on identifying key factors that influence the loyalty of professors and researchers. To achieve this, machine learning (ML) techniques are employed: Principal Component Analysis (PCA) for dimensionality reduction and clustering techniques for individual segmentation. This methodological approach allowed us to identify these critical factors, which we have termed Quantitative Emotional Salary (QES) and which go beyond those merely related to compensation. The findings offer a novel data-driven perspective to enhance talent management strategies in academia, promoting long-term engagement and loyalty.

Keywords:

people analytics; machine learning; principal component analysis; clustering; emotional salary; talent engagement

1. Introduction

Over the past several years, the role of data analytics in human resources has evolved into a crucial tool for addressing key challenges in talent management and enhancing institutional effectiveness [1]. When supported by technological advancements and innovative methodologies, strategic talent management can serve as a sustainable competitive advantage for organisations [2].

Research has demonstrated that predictive analytics transforms human capital management [3] and that algorithmic approaches can optimise processes and enhance efficiency [4]. However, the successful adoption of People Analytics (PA) relies on overcoming essential structural and strategic challenges, such as integrating data infrastructure with decision-making processes [5] and addressing key implementation factors in human resource management [6].

One of the earliest adopters of PA was Google, pioneering data-driven strategies in workforce management [7]. Talent retention has since emerged as a priority in PA research [8], with studies demonstrating that advanced human resource (HR) analytics techniques improve employee retention [9]. The application of HR analytics enables organisations to make informed decisions that enhance employee well-being, thereby strengthening institutional reputation and fostering loyalty [10]. Organisations that effectively implement HR analytics can pinpoint key factors affecting employee satisfaction and engagement, ultimately reducing turnover rates [11]. The use of predictive analytics, when built on robust, high-quality databases, represents a powerful tool for strategic decision-making in talent retention [12].

In the higher education sector, academic staff play a pivotal role in institutional success. Their commitment is essential for maintaining excellence, fostering stability, and driving long-term development. The Spanish higher education landscape has experienced significant transformations in recent years, marked by increased faculty mobility due to talent scarcity and the necessity to comply with evolving regulatory frameworks. Unlike traditional corporate structures, universities operate under hierarchical systems with distinct career trajectories, necessitating a nuanced, data-driven approach to understanding the factors influencing faculty loyalty.

The scientific career in Spain is defined by a highly regulated and hierarchical structure. It is built around a sequence of temporary contracts, competitive calls, and stabilisation processes that rely heavily on external evaluation criteria and the availability of public funding. Access to and progression within the national research and academic system are determined by programmes promoted by national and regional government bodies. This career path—particularly during the postdoctoral stage—is marked by extended periods of contractual precariousness, during which researchers are expected to simultaneously manage teaching responsibilities, produce high-impact scientific work, and continuously apply for funding through highly competitive calls [13]. Securing a permanent position within the university system can take a long time, even for candidates with internationally competitive research profiles and strong scientific credentials.

These structural characteristics have a profound impact on researchers’ professional experiences [14], influencing their sense of job security, institutional recognition, and long-term career prospects. Consequently, any attempt to understand the factors that drive academic talent engagement and loyalty in the Spanish context must begin with a critical examination of this regulatory and organisational framework, which differs significantly from other higher education and research systems across Europe.

Given this dynamic and complex environment, talent retention has become a strategic priority for universities. ML techniques offer a sophisticated approach to analysing and predicting faculty commitment, identifying key factors that influence long-term engagement. However, the widespread adoption of PA in Human Resource Management (HRM) faces challenges, including the lack of analytical expertise among HR professionals and ethical concerns surrounding data privacy [1].

The application of predictive analytics in HR has been instrumental in anticipating employee turnover. Studies have shown that HR databases can be leveraged to predict voluntary departures by identifying behavioural patterns indicative of attrition risk [15]. Various ML models have been explored in retention analysis, with logistic regression being one of the most widely applied techniques [16]. More recent research suggests that deep learning techniques further enhance predictive accuracy by capturing complex, non-linear relationships among retention factors [17]. Feature selection methods have also been employed to refine classification models, improving both interpretability and efficiency [18].

Unsupervised ML techniques in PA facilitate the systematic identification of hidden patterns that may not be evident through traditional HR analysis. Feature selection methods streamline predictive models while maintaining high accuracy [19]. Critical factors influencing retention, such as job dissatisfaction, lack of professional growth, and misalignment with organisational culture, have been extensively studied [20,21]. Additionally, research highlights the impact of job satisfaction on performance, with engagement acting as a mediator and employee health as a moderator [22].

Beyond financial compensation, employee satisfaction and engagement are strongly influenced by non-monetary benefits, collectively termed emotional salary. Rooted in classical motivation theories [23,24], emotional salary encompasses factors such as recognition, professional development, autonomy, and work–life balance. Studies underscore its direct impact on employee well-being and retention [25,26].

The “happy and productive worker” model suggests that workplace well-being correlates with job performance, reinforcing the importance of non-monetary incentives in reducing turnover and fostering organisational commitment [27]. In academia, these factors are particularly significant, as they contribute to an environment that nurtures long-term engagement among faculty and researchers.

Recent findings demonstrate that emotional salary has a positive influence on job satisfaction and performance, as it enhances employee motivation and engagement [28]. As a result, this concept has gained prominence in talent management, particularly within higher education institutions.

This study provides a comprehensive framework for identifying key quantitative factors influencing talent commitment in academia through the application of unsupervised ML techniques.

Within the framework of this research, the quantitative emotional salary (QES) sub-dataset refers to a specific set of quantitative variables that capture the non-monetary yet measurable aspects of the work experience that directly influence the satisfaction, motivation, and loyalty of university faculty. The QES sub-dataset serves as an empirical representation of the broader concept of emotional salary, which is traditionally associated with qualitative factors such as recognition, work–life balance, or professional development. However, by focusing exclusively on its quantifiable dimensions, this sub-dataset enables an objective, data-driven analysis that can support strategic decision-making in academic talent management.

By isolating and analysing such variables independently from direct salary measures, the QES subset allows for a deeper understanding of the hidden factors that influence academic talent commitment. Its application makes it possible to go beyond traditional financial incentives and to optimise retention strategies through a more holistic and evidence-based approach.

The objective is to extract a set of interrelated quantitative variables—referred to as quantitative emotional salary (QES)—whose behaviour closely resembles that of economic compensation (EC). This systematic approach aims to pinpoint the most influential retention factors for university faculty, ultimately strengthening the institution’s intellectual capital.

By integrating machine learning methods, this research offers a data-driven perspective on talent commitment in higher education. It highlights the transformative potential of combining innovative technologies with HR analytics to enhance institutional outcomes and faculty engagement.

The following sections outline the methodology employed, present the results obtained, and discuss the implications of these findings for the future of HR analytics in academia.

2. Materials and Methods

This research follows a structured and reproducible approach to ensure the validity and replicability of results. The methodology consists of multiple stages, each employing specialised computational tools and machine learning techniques for data integration, preprocessing, analysis, and modelling.

The data collection and integration process is detailed below. The first step is data collection, which is critical for determining the success of a predictive model [29]. Human resources data is often dispersed across multiple systems, making it necessary to standardise and integrate diverse data sources into a unified structure to ensure consistency and comparability.

The primary datasets for this study were sourced from the University CEU Cardenal Herrera, following compliance with the General Data Protection Regulation (GDPR). The dataset spans a historical record of five academic years (2018–2022) and includes information on university academic staff. The data sources are as follows:

Workday (Workday, Inc. (Pleasanton, CA, USA)) and A3 (Wolters Kluwer (Alphen aan den Rijn, The Netherlands)): Administrative and compensation-related data.
Cornerstone (Cornerstone OnDemand Inc. (Santa Monica, CA, USA)): Employee training, performance, and talent management data.
Own University IT Systems: Organisational structure and faculty employment records.

To ensure semantic coherence, integration required format transformation and standardisation, aligning different fields despite originating from separate platforms. The Extract, Transform, Load (ETL) methodology was applied to clean, transform, and load the data into a unified dataset, allowing for seamless analysis.

Data preprocessing and feature engineering were then carried out, as outlined below. Given the considerable number of records in the dataset, an initial exploratory analysis was conducted to:

Identify and handle outliers, redundancies, and null values.
Correct typographical errors and standardise variable formats.

Following this, feature engineering was performed to enhance data quality. The final dataset included 1144 individuals and 103 features, of which 72 were quantitative. After filtering the dataset to focus on tenure-track faculty, adjunct professors were excluded, resulting in a refined dataset of 599 individuals. A key qualitative variable within the dataset is “commitment,” which indicates whether a faculty member was actively employed at the university as of 31 August 2023.

Data normalisation and correlation analysis were then performed. Since machine learning techniques based on distance measures require data normalisation, the 72 numerical features were standardised to a common scale without distorting their relative differences. To explore relationships between variables, a correlation matrix was computed using Kendall’s correlation coefficient, which measures monotonic associations between quantitative variables. Based on the correlation structure, sub-datasets were constructed, preserving only significantly correlated variables.
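The text does not state which scaling was used. As an illustrative sketch only, the snippet below assumes z-score standardisation (zero mean, unit variance per feature); the function name `standardise` and the toy values are hypothetical and are not taken from the study.

```python
import numpy as np

def standardise(X):
    """Column-wise z-score: subtract the mean, divide by the standard deviation."""
    X = np.asarray(X, dtype=float)
    mean = X.mean(axis=0)
    std = X.std(axis=0)
    std[std == 0] = 1.0  # constant columns become all zeros instead of NaN
    return (X - mean) / std

# Toy data (hypothetical): two features measured on very different scales
X = np.array([[1.0, 20000.0],
              [3.0, 25000.0],
              [5.0, 30000.0],
              [7.0, 35000.0],
              [9.0, 40000.0]])
Z = standardise(X)  # both columns now have mean 0 and standard deviation 1
```

After scaling, both columns contribute comparably to Euclidean distances, which matters for the distance-based methods used later. Kendall's coefficient itself depends only on ranks, so it is unaffected by this rescaling.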

Next, Principal Component Analysis (PCA) was used to reduce the dimensionality of the dataset while preserving key information. This method, introduced by [30] and further developed by [31], transforms correlated variables into uncorrelated principal components ranked by importance. The following considerations guided the PCA application:

Eigenvalues: Used to rank components in terms of information retention. Components with the highest eigenvalues were prioritised.
Eigenvectors: Define the direction of principal components, ensuring they remain orthogonal.
Projection Principles: The first principal component maximises data variance, while subsequent components are perpendicular, forming an orthogonal coordinate system.
Goodness-of-Fit Measure: The number of components was chosen with the elbow method, retaining enough components to explain at least 70% of the variance [32].
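The considerations above can be sketched in a few lines of NumPy. This is a minimal illustration, not the authors' implementation: it selects components with a plain cumulative-variance threshold of 70%, and the function name `pca`, its parameter `min_explained`, and the synthetic data are all assumptions made for the example.

```python
import numpy as np

def pca(X, min_explained=0.70):
    """PCA via eigen-decomposition of the covariance matrix.

    Keeps the smallest number of leading components whose cumulative
    explained-variance ratio reaches `min_explained`.
    """
    X = np.asarray(X, dtype=float)
    Xc = X - X.mean(axis=0)                      # centre each variable
    eigvals, eigvecs = np.linalg.eigh(np.cov(Xc, rowvar=False))
    order = np.argsort(eigvals)[::-1]            # rank by eigenvalue, largest first
    eigvals, eigvecs = eigvals[order], eigvecs[:, order]
    ratio = eigvals / eigvals.sum()              # share of variance per component
    cumulative = np.cumsum(ratio)
    n_comp = min(int(np.searchsorted(cumulative, min_explained)) + 1, len(ratio))
    components = eigvecs[:, :n_comp]             # orthonormal directions
    scores = Xc @ components                     # projection of the observations
    return scores, components, ratio, n_comp

# Synthetic data (hypothetical): three variables driven by one latent factor
# plus one independent variable.
rng = np.random.default_rng(0)
latent = rng.normal(size=200)
X = np.column_stack([
    latent + 0.1 * rng.normal(size=200),
    2 * latent + 0.1 * rng.normal(size=200),
    -latent + 0.1 * rng.normal(size=200),
    rng.normal(size=200),
])
scores, components, ratio, n_comp = pca(X)
```

Because the eigenvectors of a symmetric matrix are orthogonal, the retained directions form an orthogonal coordinate system, and the resulting scores are uncorrelated.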

Following dimensionality reduction, unsupervised machine learning techniques were applied, specifically the K-Means clustering algorithm. This technique groups data points based on similarity, minimising intra-cluster variance while maximising inter-cluster separation.

Several studies indicate that applying K-Means in PCA-transformed subspaces enhances computational efficiency [33,34]. Despite its simplicity, K-Means has been shown to perform competitively against more complex clustering methods [35]. Additionally, research by [36] suggests that there is no universal solution for K-Means, reinforcing the importance of empirical validation.

Key Steps in K-Means Implementation:

Step 1. Setting a Random Seed (Ensures reproducibility).
Step 2. Centroid Initialisation (Centroids are randomly assigned as initial cluster representatives).
Step 3. Cluster Assignment (Each data point is assigned to the nearest centroid based on Euclidean distance).
Step 4. Centroid Update (New centroids are recalculated as the arithmetic mean of points within each cluster).
Step 5. Iteration Until Convergence (Steps 3 and 4 are repeated until centroids stabilise, or a predefined iteration limit is reached).
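The five steps above map directly onto a short NumPy routine. The following is a minimal sketch under stated assumptions rather than the study's code: initial centroids are drawn from the data points themselves, an empty cluster keeps its previous centroid, and the names `kmeans`, `nearest_centroid`, `seed`, and `max_iter`, as well as the toy points, are hypothetical.

```python
import numpy as np

def nearest_centroid(X, centroids):
    """Step 3: index of the closest centroid (Euclidean distance) for each point."""
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)

def kmeans(X, k, seed=0, max_iter=100):
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)                         # Step 1: fixed seed
    centroids = X[rng.choice(len(X), size=k, replace=False)]  # Step 2: random start
    for _ in range(max_iter):                                 # Step 5: iterate
        labels = nearest_centroid(X, centroids)               # Step 3: assignment
        updated = np.array([                                  # Step 4: update means
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(updated, centroids):                   # centroids stabilised
            break
        centroids = updated
    labels = nearest_centroid(X, centroids)
    wcss = float(((X - centroids[labels]) ** 2).sum())        # within-cluster SS
    return labels, centroids, wcss

# Toy points (hypothetical) forming two loose groups
X = np.array([[0.0, 0.2], [0.5, 1.1], [1.2, 0.4], [0.9, 0.9],
              [9.8, 10.1], [10.4, 9.6], [11.0, 10.7], [10.1, 11.2]])
labels, centroids, wcss = kmeans(X, k=2, seed=42)
```

Because the result depends on the random initial centroids, fixing the seed makes a run repeatable, but it does not guarantee that the best partition is found. Running several seeds and keeping the lowest WCSS is a common safeguard.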

The optimal number of clusters (k) was determined using the elbow method, which evaluates intra-cluster variance (Within-Cluster Sum of Squares, WCSS) as a function of k. The optimal k is selected at the inflexion point where adding more clusters does not significantly improve compactness.
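For reference, the quantity evaluated by the elbow method can be written as follows. The symbols are notation introduced here for illustration: C_j denotes the set of points assigned to cluster j, mu_j its centroid, and x_i an individual observation in the PCA-reduced space.

```latex
\[
\mathrm{WCSS}(k) = \sum_{j=1}^{k} \sum_{x_i \in C_j} \lVert x_i - \mu_j \rVert^2,
\qquad
\mu_j = \frac{1}{|C_j|} \sum_{x_i \in C_j} x_i
\]
```

For the best partition at each k, WCSS can only decrease as k grows, which is why the elbow criterion looks for the point where the decrease levels off rather than for a minimum.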

After clustering, the composition of each group was examined based on the qualitative variables from the initial dataset. This analysis provided valuable insights into the characteristics of faculty members with lower commitment, helping to identify key retention factors and patterns that influence long-term faculty engagement.

3. Results

The primary findings of this research are outlined below. The initial dataset, after applying the preprocessing methodologies described earlier, consisted of 103 variables encompassing both qualitative and quantitative attributes. This study focuses exclusively on the 72 quantitative variables, as the analytical techniques employed—such as principal component analysis and correlation analysis—require numerical input to yield meaningful results. Thus, quantitative variables were analysed to identify significant interrelationships. To examine such relationships, a correlation matrix was computed using Kendall’s coefficient. Variables demonstrating a statistically significant correlation with at least one other variable in the dataset were selected for further analysis. A significance threshold of α = 0.05 was established, ensuring that only correlations with a p-value below this threshold were considered meaningful.
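The selection rule described above (keep a variable if it is significantly correlated with at least one other variable at α = 0.05) can be illustrated with a small pure-Python sketch. It makes simplifying assumptions that are not stated in the text: it computes Kendall's tau-a and uses the normal approximation for data without ties. Real data with ties would call for a tie-corrected variant such as the one in `scipy.stats.kendalltau`. All variable names and values are hypothetical.

```python
import math
from itertools import combinations

def kendall_tau(x, y):
    """Kendall's tau-a with a two-sided p-value (normal approximation, no ties)."""
    n = len(x)
    s = 0  # concordant pairs minus discordant pairs
    for i, j in combinations(range(n), 2):
        prod = (x[i] - x[j]) * (y[i] - y[j])
        s += (prod > 0) - (prod < 0)
    tau = s / (n * (n - 1) / 2)
    variance = 2 * (2 * n + 5) / (9 * n * (n - 1))  # variance of tau under H0
    z = tau / math.sqrt(variance)
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return tau, p_value

def significant_variables(data, alpha=0.05):
    """Names of variables significantly correlated with at least one other."""
    keep = set()
    for a, b in combinations(data, 2):
        _, p_value = kendall_tau(data[a], data[b])
        if p_value < alpha:
            keep.update((a, b))
    return sorted(keep)

# Toy data (hypothetical): two monotonically related variables and one unrelated
data = {
    "seniority": [1, 2, 3, 4, 5, 6, 7, 8, 9, 10],
    "training_hours": [2, 4, 5, 7, 8, 11, 12, 15, 16, 20],
    "unrelated": [5, 2, 9, 1, 7, 10, 3, 8, 4, 6],
}
selected = significant_variables(data)  # ['seniority', 'training_hours']
```

In this toy case the first two variables rise together, so their correlation is significant, while the third shows no significant association with either and is dropped.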

To enhance interpretability and reduce redundancy, a filtering process was implemented to retain only variables with statistically significant correlations relevant to our research objectives. As a result, multiple sub-datasets were constructed from the initial dataset based on thematic coherence, enabling a more structured and detailed analysis. Two main sub-datasets were defined, reflecting the two primary dimensions of employee retention: non-monetary and monetary factors, respectively. The list of variables included in the different sub-datasets can be found in Appendix A, Table A1.

Quantitative Emotional Salary (QES)
Economic Compensation (EC)

Within the QES sub-dataset, dimensionality reduction was performed using PCA.

The results indicate that five principal components capture over 70% of the total variance in the data, which was considered sufficient to explain its variability.

Table 1 presents the percentage contribution of each original variable to these components, allowing for the identification of the most influential factors within the QES construct. It shows that Dimension 1 is characterised by educational support between 2018 and 2020, while Dimension 2 highlights continuous training. Dimension 3 is defined by time in position and training, whereas Dimension 4 is associated with seniority and training, and Dimension 5 with time in position and professional development.
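The text does not specify how the percentage contributions were computed. One common convention, assumed here purely for illustration, takes the squared entries of each unit-length eigenvector of the correlation matrix and multiplies them by 100, so that the contributions to each component sum to 100%. The function name `variable_contributions` and the synthetic data are hypothetical.

```python
import numpy as np

def variable_contributions(X, n_components):
    """Percentage contribution of each variable to each leading component.

    Rows correspond to variables, columns to components. Each column sums
    to 100 because the eigenvectors have unit length.
    """
    X = np.asarray(X, dtype=float)
    corr = np.corrcoef(X, rowvar=False)          # PCA on standardised variables
    eigvals, eigvecs = np.linalg.eigh(corr)
    order = np.argsort(eigvals)[::-1][:n_components]
    return 100.0 * eigvecs[:, order] ** 2

# Synthetic data (hypothetical): the first two variables share a common factor
rng = np.random.default_rng(1)
factor = rng.normal(size=150)
X = np.column_stack([
    factor + 0.2 * rng.normal(size=150),
    factor + 0.2 * rng.normal(size=150),
    rng.normal(size=150),
])
contributions = variable_contributions(X, n_components=2)
```

Reading down a column of the result shows which variables dominate a given dimension, which is the kind of interpretation made for Dimensions 1 to 5 above.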

Figure 1 below presents a graphical representation, which was constructed using the first two principal components, where each point represents an individual who was not retained, i.e., a person who left the organisation. Although the analysis focuses on quantitative variables, one qualitative variable from the original set of 103 is incorporated: the academic category at the time of turnover. This variable is used for colour coding in the visualisations, offering additional insights into potential patterns. The variable categories are listed in Appendix A, Table A2.

Figure 1 shows the distribution of non-retained individuals from the QES sub-dataset in the plane defined by the first two principal components of the PCA. In this plot, non-retained individuals are grouped according to their academic category. This type of visualisation helps identify clustering patterns by observing how the observations are positioned based on the indicated variables.

To further explore underlying structures, K-Means clustering was applied within the reduced-dimensional space obtained through PCA. The optimal number of clusters was determined to be four, based on the elbow method. The resulting cluster distribution is depicted in Figure 2 below, where each point represents an individual, positioned according to the first two principal components, facilitating visual differentiation among clusters.

After obtaining the results from Figure 2, an examination of cluster composition concerning commitment levels revealed that Cluster 1 exhibited the highest proportion of non-retained individuals (27.86%), whereas the remaining clusters showed significantly lower percentages, with Cluster 4 containing no non-retained individuals at all. To gain additional insights, the proportion of the grouped current academic category levels within each cluster was analysed. This categorical variable, which is part of the initial database comprising 103 variables, is detailed in Table A2 of Appendix A. The results of this analysis are summarised in Table 2, providing valuable information on the attributes of non-retained individuals in the QES sub-dataset.
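The per-cluster share of non-retained individuals is a simple grouped proportion. The sketch below shows the calculation on made-up labels; the function name `non_retained_share` and the toy values are hypothetical and do not reproduce the study's figures.

```python
from collections import Counter

def non_retained_share(cluster_labels, left_flags):
    """Percentage of individuals in each cluster who left the institution."""
    totals = Counter(cluster_labels)
    leavers = Counter(c for c, left in zip(cluster_labels, left_flags) if left)
    return {c: 100.0 * leavers[c] / totals[c] for c in sorted(totals)}

# Toy data (hypothetical): cluster label and departure flag for ten individuals
clusters = [1, 1, 1, 1, 2, 2, 2, 3, 3, 4]
left = [True, False, False, True, False, False, True, False, False, False]
shares = non_retained_share(clusters, left)
# Cluster 1: 2 of 4 left (50%); cluster 2: 1 of 3; clusters 3 and 4: none
```

The same grouping can be repeated with any qualitative variable, such as the academic category, in place of the departure flag to obtain the cluster compositions reported in the tables.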

This analysis allows for the identification of the composition of each cluster based on different qualitative variables of interest, providing relevant information about the features of non-retained individuals in the QES sub-dataset. In this case, the current category level variable reflects a clear segmentation between academic levels, from the initial levels in Cluster 1 to the advanced levels in Cluster 4.

The EC sub-dataset was similarly analysed using PCA, which revealed that the first two principal components account for approximately 85% of the total variance.

Table 3 details the percentage of each variable’s contribution to the variance explained by these two principal components. It shows that the first principal component is strongly influenced by the variables related to gross compensation across different years, while the second principal component reflects a greater weight of the variables associated with the functional complement.

To visualise the relationships among the variables, a scatter plot was generated using the first two principal components (see Figure 3). This plot specifically includes individuals classified as non-loyal (i.e., those for whom the target variable is “Yes”). Each point is colour-coded based on grouped academic categories, allowing for clear differentiation. This qualitative variable is listed in Appendix A, Table A2.

Next, K-Means clustering was applied to this reduced space to segment the data (see Figure 4). As in the QES sub-dataset, the elbow method was used to determine the optimal number of clusters, which was found to be four.

Figure 4 shows the distribution of the EC sub-dataset in the space defined by the first two principal components after applying the K-Means clustering algorithm. Each colour represents a different cluster, allowing visualisation of how individuals are grouped based on the underlying structure of the data. The clusters are clearly differentiated, indicating that the variables selected for the analysis allow for effective segmentation of individuals into distinct profiles. This visualisation provides an overview of the internal structure of the dataset. The spatial separation between clusters suggests significant differences that can be explored more deeply in subsequent analyses.

Finally, the distribution of qualitative variables across the identified clusters was explored. These qualitative variables are part of the initial dataset, which consists of 103 variables. In particular, the qualitative variable “objective” classifies individuals into two groups: those considered non-loyal, defined by a “Yes” value in the target variable, and those considered loyal, classified with a “No” value. The analysis of cluster composition with respect to commitment levels revealed that Cluster 1 had the highest proportion of non-retained individuals (27.82% of cases), indicating a notable concentration of non-loyal individuals within this cluster. A key observation from this clustering analysis is that Cluster 1 is strongly associated with a lack of “talent engagement”, distinguishing it as a critical group for further investigation. Identifying the unique characteristics of this cluster is essential to understanding the factors contributing to non-retention.

The percentage distribution of individuals based on “current category level” is presented in Table 4, further contextualising the insights derived from the EC sub-dataset analysis.

Table 4 shows the percentage distribution of a qualitative variable that reflects the current academic levels of individuals in each of the four clusters defined within the EC sub-dataset. This distribution allows for the observation of how individuals are grouped based on their academic category across the different clusters. The table reveals that the clusters are primarily grouped according to academic career levels. Clusters 1 and 3 mainly include individuals at the lower levels of the academic career. In contrast, Cluster 2 groups individuals at more advanced levels, with a higher representation of Full Professors.

4. Discussion

This study applied unsupervised machine learning methods to analyse a dataset related to the university academic environment, with a specific focus on university professors and researchers, and their relationship with talent commitment.

One of the key aspects to consider when analysing talent commitment in higher education institutions is the specific nature of the university environment compared to other sectors [37], particularly the business sector. Unlike commercial organisations, universities are subject to additional legal and administrative regulations that significantly influence human resource management. These include public regulations governing recruitment, promotion, and job stability, externally determined salary scales, and an institutional framework in which non-monetary incentives carry considerable weight.

In this regard, although the present study draws on concepts such as Quantitative Emotional Salary (QES), which are inspired by practices from the business world, their application has been carefully contextualised. The aim is not to directly transfer private sector models, but rather to explore internal patterns within the university system that may serve as meaningful indicators of motivation and retention, based on their statistical behaviour resembling that of economic compensation.

Acknowledging these structural differences is essential for a proper interpretation of the results. Thus, the contribution of this study lies in offering a methodological framework that can be adapted to different institutional contexts, always considering the regulatory and organisational constraints inherent to the education sector.

The findings offer valuable insights into the factors influencing talent loyalty within academic institutions. The composition of the variables that make up the two sub-datasets is detailed in Appendix A, Table A1. This table provides a complete description of the variables included in each sub-dataset, offering a clear view of the elements that constitute the data used in the analysis. Through dimensionality reduction and clustering techniques, significant patterns were identified—some aligning with previous research while others providing novel interpretations of professional behaviours and needs.

The proposed methodology proved effective in not only simplifying the inherent complexity of the dataset but also generating actionable insights that can inform strategies to improve talent commitment in academic settings. Various analytical tools were employed, including correlation matrix analysis, which was computed using Kendall’s coefficient, Principal Component Analysis (PCA), and the K-Means clustering algorithm. These techniques helped reduce the dataset’s dimensionality, facilitating the identification of key relationships and sub-datasets of interest.

Two primary sub-datasets emerged from the analysis: Quantitative Emotional Salary (QES) and Economic Compensation (EC). The QES sub-dataset, comprising nineteen correlated variables, highlights the critical role of emotional factors in faculty commitment. In contrast, the EC sub-dataset, consisting of ten correlated variables, demonstrates how economic compensation is also linked to talent commitment.

PCA was instrumental in reducing the dataset’s complexity while preserving its most significant information. In the QES sub-dataset, five principal components explained more than 70% of the total variance, whereas in the EC sub-dataset, just two principal components accounted for 80% of the total variance. Beyond dimensionality reduction, PCA also provided a clearer visualisation of relationships between clusters, enabling a more intuitive interpretation of the segmentation.

The application of clustering techniques, particularly the K-Means algorithm, revealed four distinct employee segments in both sub-datasets. Notably, in both cases the cluster closest to the origin of the coordinate system exhibited lower variability explained by the first two principal components. Each observation represents the set of characteristics that define an individual in the dataset, so individuals in the clusters closest to the origin vary less in the characteristics that describe them and are therefore more similar to one another. This is relevant because talent turnover was concentrated in these low-variability clusters, which suggests that the characteristics their members share are associated with a higher likelihood of faculty departure.

In the QES sub-dataset, segmentation revealed that professors with lower quantitative emotional salary levels were more likely to leave the institution. This finding aligns with previous studies, such as [38], which emphasised the importance of emotional well-being and job satisfaction as key predictors of talent retention. These results suggest that emotional salary factors—such as recognition, work–life balance, and institutional support—play a crucial role in faculty commitment.

The study also found that all academic staff categories responded similarly to the absence of emotional salary, underscoring its universal importance. Additionally, results indicate that professors with lower economic salaries, as well as those in research-intensive positions, were more vulnerable to turnover.

By identifying key factors influencing talent commitment, this research reinforces the growing significance of non-economic well-being as a fundamental pillar for fostering faculty loyalty. While economic compensation remains relevant, the findings highlight that intangible factors, represented by the QES sub-dataset, strongly influence faculty decisions to remain at an institution.

These insights emphasise the need for comprehensive retention strategies that integrate both financial and emotional incentives. Strengthening faculty commitment requires not only improvements in salary structures but also targeted initiatives to enhance the work environment, professional recognition, and overall job satisfaction. By addressing both economic and emotional factors, universities can develop holistic strategies to retain academic talent and foster long-term institutional stability.

5. Conclusions

The findings of this research highlight a significant relationship between Quantitative Emotional Salary (QES) and talent loyalty, with behavioural patterns similar to those observed in the Economic Compensation (EC) sub-dataset. The study also emphasises that QES and EC each contribute independently to academic talent loyalty: both factors influence commitment, but they do so in complementary ways. This reinforces the growing recognition that emotional salary plays a fundamental role in academic talent commitment, alongside economic factors.

The developed methodological framework proved effective in identifying latent patterns in the data while also reducing dimensionality, thereby enhancing interpretability. This improved clarity makes the results more accessible and applicable for decision-makers in higher education institutions.

Nevertheless, it is important to underscore that this research is based on a single institutional case study. As such, its findings are context-specific and should not be generalised across the entire higher education sector without further validation. Institutional structures, regulatory frameworks, and cultural contexts vary widely between universities and national systems, and these factors could significantly affect the relevance and replicability of the observed patterns.

Despite promising results, this study has certain limitations. One key limitation is that the projection of original variables into the PCA space may obscure significant nonlinear relationships, potentially limiting a deeper understanding of underlying data structures. Additionally, while the K-Means clustering algorithm is computationally efficient, it assumes spherical clusters of similar sizes, which may not fully capture the actual distribution of data points.
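To make the pipeline behind these limitations concrete, the following minimal sketch projects standardised data onto the first two principal components and then runs a plain K-Means (Lloyd's algorithm), reporting the Within-Cluster Sum of Squares (WCSS) for several values of k. It is an illustrative sketch under stated assumptions, not the implementation used in this study: standardisation before PCA, the random initialisation, the helper names `pca_scores` and `kmeans`, and the synthetic data are all assumptions. Because each point is assigned to its nearest centroid by Euclidean distance, the resulting partitions favour roughly spherical clusters, which is the limitation noted above.

```python
import numpy as np

def pca_scores(X, n_components=2):
    """Standardise X (assumed preprocessing) and project onto the leading components."""
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    _, S, Vt = np.linalg.svd(Z, full_matrices=False)
    explained = S**2 / np.sum(S**2)  # share of variance per component
    return Z @ Vt[:n_components].T, explained[:n_components]

def assign(points, centroids):
    """Index of the nearest centroid (Euclidean distance) for every point."""
    dists = np.linalg.norm(points[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1)

def kmeans(points, k, n_iter=100, seed=0):
    """Plain Lloyd's algorithm; returns labels, centroids and WCSS."""
    rng = np.random.default_rng(seed)
    centroids = points[rng.choice(len(points), size=k, replace=False)]
    for _ in range(n_iter):
        labels = assign(points, centroids)
        updated = np.array([
            points[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)  # an empty cluster keeps its previous centroid
        ])
        if np.allclose(updated, centroids):
            break
        centroids = updated
    labels = assign(points, centroids)
    wcss = float(((points - centroids[labels]) ** 2).sum())
    return labels, centroids, wcss

# Synthetic data (invented): three groups of 50 observations with 5 variables
rng = np.random.default_rng(42)
X = np.vstack([rng.normal(loc=c, scale=1.0, size=(50, 5)) for c in (0.0, 4.0, 8.0)])

scores, explained = pca_scores(X)
wcss_by_k = {k: kmeans(scores, k)[2] for k in range(1, 7)}
```

Plotting `wcss_by_k` against k gives the usual elbow curve, which is one common heuristic for choosing the number of clusters.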

Another limitation pertains to the possible exclusion of essential qualitative variables during the initial data selection process. This study focused primarily on quantitative dimensions of emotional salary and economic compensation; however, qualitative factors—such as personal perceptions, institutional culture, and professional experiences—could provide additional insights into talent loyalty.

Future research should address these methodological limitations by exploring hybrid approaches that integrate both qualitative and quantitative dimensions. Expanding the scope of analysis to broader and more diverse organisational contexts would allow for a more comprehensive understanding of talent commitment.

This study also opens new pathways for data-driven talent management research. By leveraging data analytics, future studies can explore critical strategic lines in human resource management, helping institutions develop more targeted and effective policies for fostering long-term faculty engagement and commitment.

Author Contributions

Conceptualization, A.-I.A.-S., J.P., O.C. and A.F.; methodology, J.P. and A.F.; software, A.-I.A.-S.; validation, J.P. and A.F.; formal analysis, J.P. and A.F.; investigation, A.-I.A.-S. and O.C.; resources, A.-I.A.-S. and O.C.; data curation, A.-I.A.-S.; writing—original draft preparation, A.-I.A.-S. and J.P.; writing—review and editing, A.-I.A.-S., J.P., O.C. and A.F.; visualization, A.-I.A.-S.; supervision, J.P.; project administration, J.P.; funding acquisition, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Universidad CEU Cardenal Herrera under grant number INDI24/17.

Institutional Review Board Statement

This study did not require approval from an ethics committee, as it was conducted using aggregated data with no individual-level identifiers. The data used in the analysis are fully anonymised and grouped in such a way that it is not possible to trace back any information to individual participants. Therefore, according to applicable regulations and ethical standards, ethical approval was not necessary.

Informed Consent Statement

Informed consent for participation was not required, as the dataset consists exclusively of aggregated and anonymised data with no personal or identifiable information related to health, religion, ethnicity, or sexual orientation. There is no way to trace any data back to individual persons, and thus, the study does not fall within the scope of research involving human subjects as defined by ethics guidelines.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding authors.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:

PA	People Analytics
ML	Machine Learning
PCA	Principal Component Analysis
HR	Human Resources
HRM	Human Resources Management
QES	Quantitative Emotional Salary
EC	Economic Compensation
GDPR	General Data Protection Regulation
ETL	Extract, Transform, Load
WCSS	Within-Cluster Sum of Squares

Appendix A

Table A1. Descriptions of the variables included in the two sub-datasets (QES and EC).

Sub-dataset	Final Features	English Description
EC	Functional Complement 2018	Amount of functional economic complements in 2018.
EC	Functional Complement 2019	Amount of functional economic complements in 2019.
EC	Functional Complement 2020	Amount of functional economic complements in 2020.
EC	Functional Complement 2021	Amount of functional economic complements in 2021.
EC	Functional Complement 2022	Amount of functional economic complements in 2022.
EC	Gross 2018	Employee’s gross salary amount in 2018.
EC	Gross 2019	Employee’s gross salary amount in 2019.
EC	Gross 2020	Employee’s gross salary amount in 2020.
EC	Gross 2021	Employee’s gross salary amount in 2021.
EC	Gross 2022	Employee’s gross salary amount in 2022.
QES	Average hours in position	Average teaching hours during the employee’s time at the university (up to five years).
QES	Average reduction hours	Average hour reduction during the employee’s time at the university (up to five years).
QES	Average training courses	Average number of training courses completed during the employee’s time at the university (up to five years).
QES	Average training hours	Average training hours completed during the employee’s time at the university (up to five years).
QES	Hours last year reduction	Teaching hour reduction in the last year. It is a way to compensate for the additional workload related to responsibilities outside of direct teaching, such as academic management, program coordination, or participation in committees.
QES	Last year position hours	Teaching hours in the last year.
QES	Maximum reduction hours	Maximum hour reduction in an academic year.
QES	Number of years reduction	Number of years the employee had hour reductions.
QES	Rector position year	Last year the employee assumed an executive team position.
QES	Seniority duration	Duration of the employee’s seniority.
QES	Seniority year	Year of the employee’s seniority.
QES	Steps	Number of changes in the employee’s position—either in category level or category—during the employee’s time at the university (up to five years).
QES	Study Aids 2018	Amount of study aid received in 2018.
QES	Study Aids 2019	Amount of study aid received in 2019.
QES	Study Aids 2020	Amount of study aid received in 2020.
QES	Study Aids 2021	Amount of study aid received in 2021.
QES	Study Aids 2022	Amount of study aid received in 2022.
QES	Work day	Indicates the percentage of the employee’s working hours.
QES	Years of training	Number of years the employee has received training.

Table A2. Values taken by the qualitative variables used throughout the study.

Current Cat Grouped	Current Category	Current Category Level
Associate	Associate Professor	Associate Professor Level I
Associate	Associate Professor	Associate Professor Level II
Associate	Associate Professor	Associate Professor Level III
Associate	Associate Professor	Associate Professor Level IV
Associate	Associate Professor	Associate Professor Level V
Associate	Associate Professor	Associate Professor Level VI
Associate	Associate Professor	Associate Professor Level VII
Associate	Associate Professor	Associate Professor Level VIII
Full	Full Professor	Full Professor Level I
Full	Full Professor	Full Professor Level II
Full	Full Professor	Full Professor Level III
Full	Full Professor	Full Professor Level IV
Full	Full Professor	Full Professor Level V
Full	Full Professor	Full Professor Level VI
Full	Full Professor	Full Professor Level VII
Full	Full Professor	Full Professor Level VIII
Research Group	Assistant Professor	Assistant Professor Level I
Research Group	Assistant Professor	Assistant Professor Level II
Research Group	Assistant Professor	Assistant Professor Level III
Research Group	Assistant Professor	Assistant Professor Level IV
Research Group	Assistant Professor	Assistant Professor Level V
Research Group	Assistant Professor	Assistant Professor Level VI
Research Group	Assistant Professor	Assistant Professor Level VII
Research Group	Instructor	Instructor Level I
Research Group	Instructor	Instructor Level II
Research Group	Research Professor	Research Professor

References

1. Tursunbayeva, A.; Di Lauro, S.; Pagliari, C. People analytics—A scoping review of conceptual boundaries and value propositions. Int. J. Inf. Manag. 2018, 43, 224–247.
2. Coculova, J.; Tomcikova, L. Innovative human resource management practices for the talent management implementation. Mark. Manag. Innov. 2021, 5, 47–54.
3. Loscher, G.J.; Bader, V. Creating accountability through HR analytics—An audit society perspective. Hum. Resour. Manag. Rev. 2023, 33, 100974.
4. Meijerink, J.; Bondarouk, T. The duality of algorithmic management: Toward a research agenda on HRM algorithms, autonomy and value creation. Hum. Resour. Manag. Rev. 2023, 33, 100876.
5. Belizón, M.J.; Kieran, S. Human resources analytics: A legitimacy process. Hum. Resour. Manag. J. 2022, 32, 603–630.
6. Shet Sateesh, V.; Poddar, T.; Wamba Samuel, F.; Dwivedi, Y.K. Examining the determinants of successful adoption of data analytics in human resource management—A framework for implications. J. Bus. Res. 2021, 131, 311–326.
7. Shrivastava, S.; Nagdev, K.; Rajesh, A. Redefining HR using people analytics: The case of Google. Hum. Resour. Manag. Int. Dig. 2018, 26, 3–6.
8. Margherita, A. Human resources analytics: A systematization of research topics and directions for future research. Hum. Resour. Manag. Rev. 2022, 32, 100795.
9. Belizón, M.J.; Majarín, D.; Aguado, D. Human resources analytics in practice: A knowledge discovery process. Eur. Manag. Rev. 2024, 21, 659–677.
10. Álvarez-Gutiérrez, F.J.; Stone, D.L.; Castaño, A.M.; García-Izquierdo, A.L. Human Resources Analytics: A systematic Review from a Sustainable Management Approach. Rev. Psicol. Trab. Y Las Organ. 2022, 38, 129–147.
11. Ravesangar, K.; Narayanan, S. Adoption of HR analytics to enhance employee retention in the workplace: A review. Hum. Resour. Manag. Serv. 2024, 6, 3481.
12. Meijerink, J.; Boons, M.; Keegan, A.; Marler, J. Algorithmic human resource management: Synthesizing developments and cross-disciplinary insights on digital HRM. Int. J. Hum. Resour. Manag. 2021, 32, 2545–2562.
13. Castro-Ceacero, D.; Rodriguez-Gomez, D.; Muñoz-Moreno, J.L.; Calatayud, A. The intergenerational climate of Spanish university research. Stud. High. Educ. 2023, 48, 1696–1707.
14. Kennedy, M.R.; Deans, Z.; Ampollini, I.; Breit, E.; Bucchi, M.; Seppel, K.; Vie, K.J.; Meulen, R.t. “It is Very Difficult for us to Separate Ourselves from this System”: Views of European Researchers, Research Managers, Administrators and Governance Advisors on Structural and Institutional Influences on Research Integrity. J. Acad. Ethics 2023, 21, 471–495.
15. Rombaut, E.; Guerry, M.A. Predicting voluntary turnover through human resources database analysis. Manag. Res. Rev. 2018, 41, 96–112.
16. Guerranti, F.; Dimitri, G.M. A Comparison of Machine Learning Approaches for Predicting Employee Attrition. Appl. Sci. 2023, 13, 267.
17. Ben Yahia, N.; Hlel, J.; Colomo-Palacios, R. From Big Data to Deep Data to Support People Analytics for Employee Attrition Prediction. IEEE Access 2021, 9, 60447–60458.
18. Naz, K.; Siddiqui, I.F.; Koo, J.; Khan, M.A.; Qureshi, N.M.F. Predictive Modeling of Employee Churn Analysis for IoT-Enabled Software Industry. Appl. Sci. 2022, 12, 10495.
19. Wardhani, F.H.; Lhaksmana, K.M. Predicting Employee Attrition Using Logistic Regression With Feature Selection. Sinkron 2022, 7, 2214–2222.
20. Fallucchi, F.; Coladangelo, M.; Giuliano, R.; De Luca, E.W. Predicting employee attrition using machine learning techniques. Computers 2020, 9, 86.
21. Sanjeetha, S.; Phani Krishna, C. Analysis of Employee Attrition using for Machine Learning Techniques. Turk. J. Comput. Math. Educ. 2021, 12, 28–31.
22. Stirpe, L.; Profili, S.; Sammarra, A. Satisfaction with HR practices and employee performance: A moderated mediation model of engagement and health. Eur. Manag. J. 2022, 40, 295–305.
23. Herzberg, F.I. Work and the Nature of Man; World Pub: Cleveland, OH, USA, 1966; Available online: https://www.scirp.org/reference/referencespapers?referenceid=1775482 (accessed on 26 October 2024).
24. Deci, E.L.; Ryan, R.M. Intrinsic Motivation and Self-Determination in Human Behavior; Springer: Berlin/Heidelberg, Germany, 1985.
25. Ruíz-Valdés, S.; Ruíz-Tapia, J.A. The emotional salary as a strategy to encourage work commitment and talent retention in organization. J. Int. Econ. 2022, 6, 8–16.
26. Augusto Reis, T.; Rose Campagnolli, D.; Canuto da Silva, T.; Oste Graziano Cremonezi, G.; Author, C. Emotional Salary as a Strategy to Retain Talents. IOSR J. Humanit. Soc. Sci. 2018, 23, 74–80.
27. Peiró, J.M.; Kozusznik, M.; Molina, I.R.; Tordera, N. The happy-productive worker model and beyond: Patterns of wellbeing and performance at work. Int. J. Environ. Res. Public Health 2019, 16, 479.
28. Junça Silva, A.; Burgette, A.R.; Fontes da Costa, J. Toward a Sustainable World: Affective Factors Explain How Emotional Salary Influences Different Performance Indicators. Sustainability 2024, 16, 2198.
29. Krishna, S.; Sidharth, S. HR Analytics: Employee Attrition Analysis using Random Forest. Int. J. Perform. Eng. 2022, 18, 275–281.
30. Pearson, K. On lines and planes of closest fit to systems of points in space. Philos. Mag. 1901, 2, 559–572.
31. Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 1933, 24, 498–520.
32. Niemczynowicz, A.; Kycia, R.A. The analysis of engagement at the workplace of Generation Z—Machine learning in management. In Managing Generation Z: Motivation, Engagement and Loyalty; Taylor and Francis: Oxfordshire, UK, 2023; pp. 83–103.
33. Xu, Q.; Ding, C.; Liu, J.; Luo, B. PCA-guided search for K-means. Pattern Recognit. Lett. 2015, 54, 50–55.
34. Malli, S.; Nagesh, H.R.; Rao, B.D. Approximation to the K-Means Clustering Algorithm using PCA. Int. J. Comput. Appl. 2020, 175, 43–46.
35. Romanuke, V.V. Random centroid initialization for improving centroid-based clustering. Decis. Mak. Appl. Manag. Eng. 2023, 6, 734–746.
36. Ahmed, M.; Seraj, R.; Islam, S.M.S. The k-means algorithm: A comprehensive survey and performance evaluation. Electronics 2020, 9, 1295.
37. Zeng, G.; Liu, F.; Xiong, C.; Huang, Y. Analysis of Innovative Strategies for Talent Team Development from the Perspective of Human Resource Management in Colleges and Universities. Contemp. Educ. Teach. Res. 2024, 5, 430–435.
38. Qamar, Y.; Samad, T.A. Human resource analytics: A review and bibliometric analysis. Pers. Rev. 2022, 51, 251–283.

Figure 1. Distribution of non-retained individuals of QES sub-dataset in the coordinate system of the first two principal components. Associate: Associate Professor. Full: Full Professor. Research Group: Assistant Professor, Instructor, and Research Professor.

Figure 2. K-Means Clusters in the Space of the First Two Principal Components.

Figure 3. Distribution of Non-Retained Observations of EC sub-dataset in the coordinate system of the first two principal components. Associate: Associate Professor. Full: Full Professor. Research Group: Assistant Professor, Instructor, and Research Professor.

Figure 4. K-Means Clusters in the Space of the First Two Principal Components.

Table 1. Contribution to the total variance explained by the principal components in the QES sub-dataset.

Quantitative Emotional Salary	Dim.1	Dim.2	Dim.3	Dim.4	Dim.5
Average hours in position	0.569451	0.539365	15.453060	2.030245	16.625706
Average reduction hours	10.127527	3.922517	8.036510	0.144809	1.121154
Average training courses	2.028594	2.698313	9.886314	11.815590	13.909010
Average training hours	1.839729	2.108883	7.028011	13.228940	22.318579
Hours last year reduction	10.244442	3.189606	7.914184	0.480177	1.166354
Last year position hours	0.793666	0.900228	21.446270	1.237760	6.655441
Maximum reduction hours	10.761794	3.475717	8.122696	0.136399	1.219806
Number of years reduction	9.867864	2.759933	0.418937	0.000030	0.006230
Rector position year	5.453789	1.446501	5.120593	0.201137	1.379636
Seniority duration	6.878744	0.060213	0.920874	29.334620	6.230866
Seniority year	6.878744	0.060213	0.920874	29.334620	6.230866
Steps	0.942224	2.385260	1.317487	2.822608	21.011102
Study aids 2018	2.748775	15.613072	0.245473	0.631967	0.297226
Study aids 2019	3.210014	19.622232	0.278924	1.260249	0.032952
Study aids 2020	4.225638	18.830464	0.124125	1.858693	0.024720
Study aids 2021	6.962728	7.913909	0.016477	0.771763	0.762980
Study aids 2022	6.509201	9.800938	0.000180	0.940578	0.525112
Work day	3.735572	2.489691	4.363779	3.533885	0.386016
Years of training	6.221506	2.182946	8.385231	0.235929	0.096246

N = 599; values are the percentage contribution of each variable to each of the five principal dimensions (each column sums to 100, up to rounding); a higher value indicates a greater influence of the variable on the corresponding dimension.
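For readers who wish to reproduce this type of table, the sketch below shows one common way of computing percentage contributions of variables to principal components: the squared entries of each unit-norm loading vector, multiplied by 100, so that every column sums to 100. This is an assumption about the definition, chosen because it matches the column totals of the table; it is not taken from the authors' code. The function name `variable_contributions`, the standardisation step, and the synthetic data are hypothetical.

```python
import numpy as np

def variable_contributions(X, n_components=2):
    """Percentage contribution of each variable to each principal component.

    Assumes PCA on standardised variables; contributions are defined here as
    100 * (loading ** 2), where each loading vector has unit length.
    """
    Z = (X - X.mean(axis=0)) / X.std(axis=0)
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    loadings = Vt[:n_components].T  # shape: (n_variables, n_components)
    return 100.0 * loadings**2

# Synthetic data (invented): 200 observations of 6 variables
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 6))
X[:, 1] += 0.8 * X[:, 0]  # induce some correlation between two variables

contrib = variable_contributions(X, n_components=2)
column_totals = contrib.sum(axis=0)
```

Each row of `contrib` corresponds to a variable and each column to a dimension, mirroring the layout of the table.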

Table 2. Percentage distribution of a qualitative variable by clusters with QES sub-dataset.

Current Category Level	Cluster 1	Cluster 2	Cluster 3	Cluster 4
Assistant Professor Level I	27.05	-	14.11	-
Assistant Professor Level II	5.74	6.59	7.26	6.25
Assistant Professor Level III	-	3.30	2.42	6.25
Assistant Professor Level IV	0.41	1.10	0.81	-
Assistant Professor Level V	0.41	2.20	0.40	-
Assistant Professor Level VI	-	-	1.21	-
Assistant Professor Level VII	1.23	7.69	2.02	12.50
Associate Professor Level I	3.28	7.69	12.90	-
Associate Professor Level II	2.05	7.69	14.11	-
Associate Professor Level III	1.64	1.10	1.61	-
Associate Professor Level IV	-	2.20	0.40	-
Associate Professor Level V	-	2.20	1.61	-
Associate Professor Level VI	-	1.10	-	-
Associate Professor Level VII	-	-	-	6.25
Associate Professor Level VIII	0.41	2.20	3.23	6.25
Full Professor Level I	-	5.49	2.42	6.25
Full Professor Level II	-	12.09	5.24	6.25
Full Professor Level III	-	13.19	2.82	6.25
Full Professor Level IV	-	4.40	2.42	25.00
Full Professor Level V	-	3.30	2.42	-
Full Professor Level VI	-	5.49	2.02	12.50
Full Professor Level VII	0.41	3.30	1.61	6.25
Full Professor Level VIII	0.41	5.49	1.21	-
Instructor Level I	31.97	-	4.84	-
Instructor Level II	6.56	1.10	12.90	-
Research Professor	18.44	1.10	-	-

N = 599; values are the percentage of each cluster's professors who belong to each category (each column sums to 100, up to rounding); (-) indicates that no professors in the cluster belong to that category.
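A table of this kind can be obtained by counting categories within each cluster and dividing by the cluster size. The following minimal sketch illustrates the calculation with the standard library only; the function name `percent_by_cluster` and the six toy records are hypothetical and do not come from the study's data.

```python
from collections import Counter, defaultdict

def percent_by_cluster(categories, clusters):
    """Percentage of each cluster's members that fall in each category."""
    counts = defaultdict(Counter)
    for category, cluster in zip(categories, clusters):
        counts[cluster][category] += 1
    table = {}
    for cluster, counter in counts.items():
        total = sum(counter.values())  # cluster size
        table[cluster] = {
            category: round(100.0 * n / total, 2)
            for category, n in counter.items()
        }
    return table

# Toy records (invented for illustration only)
categories = [
    "Instructor Level I", "Instructor Level I", "Research Professor",
    "Full Professor Level II", "Full Professor Level II",
    "Associate Professor Level I",
]
clusters = [1, 1, 1, 2, 2, 2]

table = percent_by_cluster(categories, clusters)
```

Categories that do not occur in a cluster are simply absent from that cluster's dictionary, which corresponds to the (-) entries in the table.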

Table 3. Contribution to the total variance explained by the principal components in EC sub-dataset.

Economic Compensation Sub-Dataset	Dim 1	Dim 2
Functional complement 2018	7.053482	8.731980
Functional complement 2019	9.993619	10.174539
Functional complement 2020	10.151388	13.075159
Functional complement 2021	9.899995	12.410984
Functional complement 2022	8.837577	9.658164
Gross 2018	10.008358	10.272620
Gross 2019	11.267491	8.489656
Gross 2020	11.462697	9.554073
Gross 2021	11.145923	9.344951
Gross 2022	10.179470	8.287875

N = 599; values are the percentage contribution of each variable to each of the first two principal dimensions (each column sums to 100, up to rounding); a higher value indicates a greater influence of the variable on the corresponding dimension.

Table 4. Percentage distribution of a qualitative variable by clusters with EC sub-dataset.

Current Category Level	Cluster 1	Cluster 2	Cluster 3	Cluster 4
Assistant Professor Level I	32.61	-	13.07	-
Assistant Professor Level II	3.04	1.92	12.56	7.58
Assistant Professor Level III	-	0.96	4.02	1.52
Assistant Professor Level IV	-	1.92	0.50	1.52
Assistant Professor Level V	0.87	0.96	-	1.52
Assistant Professor Level VI	-	-	1.01	1.52
Assistant Professor Level VII	-	8.65	1.01	9.09
Associate Professor Level I	3.48	1.92	16.58	6.06
Associate Professor Level II	1.74	5.77	16.58	6.06
Associate Professor Level III	0.43	1.92	1.51	4.55
Associate Professor Level IV	-	0.96	0.50	1.52
Associate Professor Level V	-	0.96	1.01	4.55
Associate Professor Level VI	-	0.96	-	-
Associate Professor Level VII	-	-	-	1.52
Associate Professor Level VIII	-	7.69	1.01	3.03
Full Professor Level I	-	3.85	2.01	6.06
Full Professor Level II	-	15.38	1.51	9.09
Full Professor Level III	-	12.50	0.50	9.09
Full Professor Level IV	-	7.69	0.50	7.58
Full Professor Level V	-	7.69	-	1.52
Full Professor Level VI	-	7.69	-	6.06
Full Professor Level VII	-	4.81	0.50	4.55
Full Professor Level VIII	-	5.77	0.50	3.03
Instructor Level I	33.04	-	7.04	-
Instructor Level II	5.22	-	17.59	3.03
Research Professor	19.57	-	0.50	-

N = 599; values are the percentage of each cluster's professors who belong to each category (each column sums to 100, up to rounding); (-) indicates that no professors in the cluster belong to that category.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
