Previous Article in Journal
Mindfulness: A Missing Integral Component of Leadership Preparation Programs
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Quantitative Emotional Salary and Talent Commitment in Universities: An Unsupervised Machine Learning Approach

1
Department of Human Resources, Universidad CEU Cardenal Herrera, CEU Universities, Carrer Assegadors, 2, 46115 Alfara del Patriarca (Valencia), Spain
2
Department of Mathematics, Physics and Technology, Universidad CEU Cardenal Herrera, CEU Universities, San Bartolomé 55, 46115 Alfara del Patriarca (Valencia), Spain
*
Authors to whom correspondence should be addressed.
Merits 2025, 5(2), 14; https://doi.org/10.3390/merits5020014
Submission received: 29 April 2025 / Revised: 3 June 2025 / Accepted: 4 June 2025 / Published: 13 June 2025

Abstract

:
In the world of academia, there is a great mobility of talented university professors with a high level of movement among different entities. This could be a major problem, as universities must retain a minimum level of talent to support their various academic programmes. In this sense, finding out what factors could increase the loyalty of such staff can be of great interest to human resource (HR) departments and the overall administrative management of an organisation. Thus, this area, also known as People Analytics (PA), has become very powerful in human resource management to strategically address challenges in talent management. This paper examines talent commitment within the university environment, focusing on identifying key factors that influence the loyalty of professors and researchers. To achieve this, machine learning (ML) techniques are employed, as Principal Component Analysis (PCA) for dimensionality reduction and clustering techniques for individual segmentation have been employed in such tasks. This methodological approach allowed us to identify such critical factors, which we have termed Quantitative Emotional Salary (QES), enabling us to identify those factors beyond those merely related to compensation. The findings offer a novel data-driven perspective to enhance talent management strategies in academia, promoting long-term engagement and loyalty.

1. Introduction

Over the past several years, the role of data analytics in human resources has evolved into a crucial tool for addressing key challenges in talent management and enhancing institutional effectiveness [1]. When supported by technological advancements and innovative methodologies, strategic talent management can serve as a sustainable competitive advantage for organisations [2].
Research has demonstrated that predictive analytics transforms human capital management [3] and that algorithmic approaches can optimise processes and enhance efficiency [4]. However, the successful adoption of People Analytics (PA) relies on overcoming essential structural and strategic challenges, such as integrating data infrastructure with decision-making processes [5] and addressing key implementation factors in human resource management [6].
One of the earliest adopters of PA was Google, pioneering data-driven strategies in workforce management [7]. Talent retention has since emerged as a priority in PA research [8], with studies demonstrating that advanced human resource (HR) analytics techniques improve employee retention [9]. The application of HR analytics enables organisations to make informed decisions that enhance employee well-being, thereby strengthening institutional reputation and fostering loyalty [10]. Organisations that effectively implement HR analytics can pinpoint key factors affecting employee satisfaction and engagement, ultimately reducing turnover rates [11]. The use of predictive analytics, when built on robust, high-quality databases, represents a powerful tool for strategic decision-making in talent retention [12].
In the higher education sector, academic staff play a pivotal role in institutional success. Their commitment is essential for maintaining excellence, fostering stability, and driving long-term development. The Spanish higher education landscape has experienced significant transformations in recent years, marked by increased faculty mobility due to talent scarcity and the necessity to comply with evolving regulatory frameworks. Unlike traditional corporate structures, universities operate under hierarchical systems with distinct career trajectories, necessitating a nuanced, data-driven approach to understanding the factors influencing faculty loyalty.
The scientific career in Spain is defined by a highly regulated and hierarchical structure. It is built around a sequence of temporary contracts, competitive calls, and stabilisation processes that rely heavily on external evaluation criteria and the availability of public funding. Access to and progression within the national research and academic system are determined by programmes promoted by national and regional government bodies. This career path—particularly during the postdoctoral stage—is marked by extended periods of contractual precariousness, during which researchers are expected to simultaneously manage teaching responsibilities, produce high-impact scientific work, and continuously apply for funding through highly competitive calls [13]. Securing a permanent position within the university system can take a long time, even for candidates with internationally competitive research profiles and strong scientific credentials.
These structural characteristics have a profound impact on researchers’ professional experiences [14], influencing their sense of job security, institutional recognition, and long-term career prospects. Consequently, any attempt to understand the factors that drive academic talent engagement and loyalty in the Spanish context must begin with a critical examination of this regulatory and organisational framework, which differs significantly from other higher education and research systems across Europe.
Given this dynamic and complex environment, talent retention has become a strategic priority for universities. ML techniques offer a sophisticated approach to analysing and predicting faculty commitment, identifying key factors that influence long-term engagement. However, the widespread adoption of PA in Human Resource Management (HRM) faces challenges, including the lack of analytical expertise among HR professionals and ethical concerns surrounding data privacy [1].
The application of predictive analytics in HR has been instrumental in anticipating employee turnover. Studies have shown that HR databases can be leveraged to predict voluntary departures by identifying behavioural patterns indicative of attrition risk [15]. Various ML models have been explored in retention analysis, with logistic regression being one of the most widely applied techniques [16]. More recent research suggests that deep learning techniques further enhance predictive accuracy by capturing complex, non-linear relationships among retention factors [17]. Feature selection methods have also been employed to refine classification models, improving both interpretability and efficiency [18].
Unsupervised ML techniques in PA facilitate the systematic identification of hidden patterns that may not be evident through traditional HR analysis. Feature selection methods streamline predictive models while maintaining high accuracy [19]. Critical factors influencing retention, such as job dissatisfaction, lack of professional growth, and misalignment with organisational culture, have been extensively studied [20,21]. Additionally, research highlights the impact of job satisfaction on performance, with engagement acting as a mediator and employee health as a moderator [22].
Beyond financial compensation, employee satisfaction and engagement are strongly influenced by non-monetary benefits, collectively termed emotional salary. Rooted in classical motivation theories [23,24], emotional salary encompasses factors such as recognition, professional development, autonomy, and work–life balance. Studies underscore its direct impact on employee well-being and retention [25,26].
The “happy and productive worker” model suggests that workplace well-being correlates with job performance, reinforcing the importance of non-monetary incentives in reducing turnover and fostering organisational commitment [27]. In academia, these factors are particularly significant, as they contribute to an environment that nurtures long-term engagement among faculty and researchers.
Recent findings demonstrate that emotional salary has a positive influence on job satisfaction and performance, as it enhances employee motivation and engagement [28]. As a result, this concept has gained prominence in talent management, particularly within higher education institutions.
This study provides a comprehensive framework for identifying key quantitative factors influencing talent commitment in academia through the application of unsupervised ML techniques.
Within the framework of this scientific research, the quantitative emotional salary (QES) sub-dataset refers to a specific set of quantitative variables that capture the non-monetary yet measurable aspects of the work experience that directly influence the satisfaction, motivation, and loyalty of university faculty. The QES sub-dataset serves as an empirical representation of the broader concept of emotional salary, which is traditionally associated with qualitative factors such as recognition, work–life balance, or professional development. However, by focusing exclusively on its quantifiable dimensions, this sub-dataset enables an aim, data-driven analysis that can support strategic decision-making in academic talent management.
By isolating and analysing such variables independently from direct salary measures, the QES subset allows for a deeper understanding of the hidden factors that influence academic talent commitment. Its application makes it possible to go beyond traditional financial incentives and to optimise retention strategies through a more holistic and evidence-based approach.
The objective is to extract a set of interrelated quantitative variables—referred to as quantitative emotional salary (QES)—whose behaviour closely resembles that of economic compensation (EC). This systematic approach aims to pinpoint the most influential retention factors for university faculty, ultimately strengthening the institution’s intellectual capital.
By integrating machine learning methods, this research offers a data-driven perspective on talent commitment in higher education. It highlights the transformative potential of combining innovative technologies with HR analytics to enhance institutional outcomes and faculty engagement.
The following sections outline the methodology employed, present the results obtained, and discuss the implications of these findings for the future of HR analytics in academia.

2. Materials and Methods

This research follows a structured and reproducible approach to ensure the validity and replicability of results. The methodology consists of multiple stages, each employing specialised computational tools and machine learning techniques for data integration, preprocessing, analysis, and modelling.
The data collection and integration process was carried out and is detailed below. The first step is data collection, which is critical for determining the success of a predictive model [29]. Human resources data is often dispersed across multiple systems, making it necessary to standardise and integrate diverse data sources into a unified structure to ensure consistency and comparability.
The primary datasets for this study were sourced from the University CEU Cardenal Herrera, following compliance with the General Data Protection Regulation (GDPR). The dataset spans a historical record of five academic years (2018–2022) and includes information on university academic staff. The data sources are as follows:
  • Workday (Workday, Inc. (Pleasanton, CA, USA)) and A3 (Wolters Kluwer (Alphen aan den Rijn, The Netherlands)): Administrative and compensation-related data.
  • Cornerstone (Cornerstone OnDemand Inc. (Santa Monica, CA, USA)): Employee training, performance, and talent management data.
  • Own University IT Systems: Organisational structure and faculty employment records.
To ensure semantic coherence, integration required format transformation and standardisation, aligning different fields despite originating from separate platforms. The Extract, Transform, Load (ETL) methodology was applied to clean, transform, and load the data into a unified dataset, allowing for seamless analysis.
The process of data preprocessing and feature engineering was conducted, as outlined below. Given the considerable number of records in the dataset, an initial exploratory analysis was conducted to:
  • Identify and handle outliers, redundancies, and null values.
  • Correct typographical errors and standardise variable formats.
Following this, feature engineering was performed to enhance data quality. The final dataset included 1144 individuals and 103 features, of which 72 were quantitative. After filtering the dataset to focus on tenure-track faculty, adjunct professors were excluded, resulting in a refined dataset of 599 individuals. A key qualitative variable within the dataset is “commitment,” which indicates whether a faculty member was actively employed at the university as of 31 August 2023.
Then, Data Normalisation and Correlation Analysis were performed. Since machine learning techniques based on distance measures require data normalisation, the 72 numerical features were standardised to a common scale without distorting their relative differences. To explore relationships between variables, a correlation matrix was computed using Kendall’s correlation coefficient, which measures monotonic associations between quantitative variables. Based on the correlation structure, sub-datasets were constructed, preserving only significantly correlated variables.
Moreover, dimensionality reduction using Principal Component Analysis (PCA) was used to reduce the dataset dimensionality while preserving key information. This method, introduced by [30] and further developed by [31], transforms correlated variables into uncorrelated principal components ranked by importance. The following considerations guided the PCA application:
  • Eigenvalues: Used to rank components in terms of information retention. Components with the highest eigenvalues were prioritised.
  • Eigenvectors: Define the direction of principal components, ensuring they remain orthogonal.
  • Projection Principles: The first principal component maximises data variance, while subsequent components are perpendicular, forming an orthogonal coordinate system.
  • Goodness-of-Fit Measure: The elbow method was applied to determine the optimal number of components by explaining at least 70% of the variance [32].
Following dimensionality reduction, unsupervised machine learning techniques were applied, specifically the K-Means clustering algorithm. This technique groups data points based on similarity, minimizing intra-cluster variance while maximizing inter-cluster separation.
Several studies indicate that applying K-Means in PCA-transformed subspaces enhances computational efficiency [33,34]. Despite its simplicity, K-Means has been shown to perform competitively against more complex clustering methods [35]. Additionally, research by [36] suggests that there is no universal solution for K-Means, reinforcing the importance of empirical validation.
Key Steps in K-Means Implementation:
  • Step 1. Setting a Random Seed (Ensures reproducibility).
  • Step 2. Centroid Initialisation (Centroids are randomly assigned as initial cluster representatives).
  • Step 3. Cluster Assignment (Each data point is assigned to the nearest centroid based on Euclidean distance).
  • Step 4. Centroid Update (New centroids are recalculated as the arithmetic mean of points within each cluster).
  • Step 5: Iteration Until Convergence (Steps 3 and 4 are repeated until centroids stabilise, or a predefined iteration limit is reached).
The optimal number of clusters (k) was determined using the elbow method, which evaluates intra-cluster variance (Within-Cluster Sum of Squares, WCSS) as a function of k. The optimal k is selected at the inflexion point where adding more clusters does not significantly improve compactness.
After clustering, the composition of each group was examined based on the qualitative variables from the initial dataset. This analysis provided valuable insights into the characteristics of faculty members with lower commitment, helping to identify key retention factors and patterns that influence long-term faculty engagement.

3. Results

The primary findings of this research are outlined below. The initial dataset, after applying the preprocessing methodologies described earlier, consisted of 103 variables encompassing both qualitative and quantitative attributes. This study focuses exclusively on the 72 quantitative variables, as the analytical techniques employed—such as principal component analysis and correlation analysis—require numerical input to yield meaningful results. Thus, quantitative variables were analysed to identify significant interrelationships. To examine such relationships, a correlation matrix was computed using Kendall’s coefficient. Variables demonstrating a statistically significant correlation with at least one other variable in the dataset were selected for further analysis. A significance threshold of α = 0.05 was established, ensuring that only correlations with a p-value below this threshold were considered meaningful.
To enhance the interpretability and reduce the redundancy, a filtering process was implemented to retain only variables with statistically significant correlations relevant to our research objectives. As a result, multiple sub-datasets were constructed from the initial dataset based on thematic coherence, enabling a more structured and detailed analysis. Two main sub-datasets were defined, reflecting the two primary dimensions of employee retention—non-monetary and monetary factors, respectively. The list of variables included in the different sub-datasets can be found in Appendix A, Table A1.
  • Quantitative Emotional Salary (QES)
  • Economic Compensation (EC)
Within the QES sub-dataset, dimensionality reduction was performed using PCA.
The results indicate that five principal components optimally encapsulate over 70% of the total variance in the data. And this was sufficient to explain the variability that exists in the data.
Table 1 presents the percentage contribution of each original variable to these components, allowing for the identification of the most influential factors within the QES construct. It shows Dimension 1 is characterised by educational support between 2018–2020, while Dimension 2 highlights continuous training. Dimension 3 is defined by time in position and training, whereas Dimension 4 is associated with seniority and training, and Dimension 5 with time in position and professional development.
Figure 1 below presents a graphical representation, which was constructed using the first two principal components, where each point represents an individual who was not retained, i.e., a person who left the organisation. Although the analysis focuses on quantitative variables, one qualitative variable from the original set of 103 is incorporated: the academic category at the time of turnover. This variable is used for colour coding in the visualisations, offering additional insights into potential patterns. The variable categories are listed in Appendix A, Table A2.
Figure 1 shows the distribution of non-retained individuals from the QES sub-dataset in the plane defined by the first two principal components of the PCA. In this plot, non-retained individuals are grouped according to their academic category. This type of visualisation helps identify clustering patterns by observing how the observations are positioned based on the indicated variables.
To further explore underlying structures, K-Means clustering was applied within the reduced-dimensional space obtained through PCA. The optimal number of clusters was determined to be four, based on the elbow method. The resulting cluster distribution is depicted in Figure 2 below, where each point represents an individual, positioned according to the first two principal components, facilitating visual differentiation among clusters.
After obtaining the results from Figure 2, an examination of cluster composition concerning commitment levels revealed that Cluster 1 exhibited the highest proportion of non-retained individuals (27.86%), whereas the remaining clusters showed significantly lower percentages, with Cluster 4 containing no non-retained individuals at all. To gain additional insights, the proportion of the grouped current academic category levels within each cluster was analysed. This categorical variable, which is part of the initial database comprising 103 variables, is detailed in Table A2 of Appendix A. The results of this analysis are summarised in Table 2, providing valuable information on the attributes of non-retained individuals in the QES sub-dataset.
This analysis allows for the identification of the composition of each cluster based on different qualitative variables of interest, providing relevant information about the features of non-retained individuals in the QES sub-dataset. In this case, the current category level variable reflects a clear segmentation between academic levels, from the initial levels in Cluster 1 to the advanced levels in Cluster 4. The EC sub-dataset was similarly analysed using PCA, which revealed that the first two principal components account for approximately 85% of the total variance.
Table 3 details the percentage of each variable’s contribution to the variance explained by these two principal components. It shows that the first principal component is strongly influenced by the variables related to gross compensation across different years, while the second principal component reflects a greater weight of the variables associated with the functional complement.
To visualise the relationships among the variables, a scatter plot was generated using the first two principal components (see Figure 3). This plot specifically includes individuals classified as non-loyal (i.e., those for whom the target variable is “Yes”). Each point is colour-coded based on grouped academic categories, allowing for clear differentiation. This quantitative variable is listed in Appendix A, Table A2.
Next, K-Means clustering was applied to this reduced space to segment the data (see Figure 4). As in the QES sub-dataset, the elbow method was used to determine the optimal number of clusters, which was found to be four.
Figure 4 shows the distribution of the QES sub-dataset data in the space defined by the first two principal components after applying the K-Means clustering algorithm. Each colour represents a different cluster, allowing visualisation of how individuals are grouped based on the underlying structure of the data. The clusters are clearly differentiated, indicating that the variables selected for the analysis allow for effective segmentation of individuals into distinct profiles. This visualisation provides an overview of the internal structure of the dataset. The spatial separation between clusters suggests significant differences that can be explored more deeply in subsequent analyses.
Finally, an exploration of the distribution of qualitative variables across the identified clusters was performed. These qualitative variables are part of the initial dataset, which consists of 103 variables. In particular, the qualitative variable “objective” classifies individuals into two groups: those considered non-loyal, defined by a “Yes” value in the target variable, and those considered loyal, classified with a “No” value. In this context, the analysis of cluster composition about commitment levels revealed that Cluster 1 had the highest proportion of non-retained individuals, with 27.82% of the cases, suggesting a notable concentration of non-loyal individuals within this cluster. A key observation from this clustering analysis is that Cluster 1 is strongly associated with a lack of “talent engagement”, distinguishing it as a critical group for further investigation. Identifying the unique characteristics of this cluster is essential to understanding the factors contributing to non-retention.
The percentage distribution of individuals based on “current category level” is presented in Table 4, further contextualising the insights derived from the EC sub-dataset analysis.
Table 4 shows the percentage distribution of a qualitative variable that reflects the current academic levels of individuals in each of the four clusters defined within the EC sub-dataset. This distribution allows for the observation of how individuals are grouped based on their academic category across the different clusters. The table reveals that the clusters are primarily grouped according to academic career levels. Clusters 1 and 3 mainly include individuals at the lower levels of the academic career. In contrast, Cluster 2 groups individuals at more advanced levels, with a higher representation of Full Professors.

4. Discussion

This study applied unsupervised machine learning methods to analyse a dataset related to the university academic environment, with a specific focus on university professors and researchers, and their relationship with talent commitment.
One of the key aspects to consider when analysing talent commitment in higher education institutions is the specific nature of the university environment compared to other sectors [37], particularly the business sector. Unlike commercial organisations, universities are subject to additional legal and administrative regulations that significantly influence human resource management. These include public regulations governing recruitment, promotion, and job stability, externally determined salary scales, and an institutional framework in which non-monetary incentives carry considerable weight.
In this regard, although the present study draws on concepts such as Quantitative Emotional Salary (QES), which are inspired by practices from the business world, their application has been carefully contextualised. The aim is not to directly transfer private sector models, but rather to explore internal patterns within the university system that may serve as meaningful indicators of motivation and retention, based on their statistical behaviour resembling that of economic compensation.
Acknowledging these structural differences is essential for a proper interpretation of the results. Thus, the contribution of this study lies in offering a methodological framework that can be adapted to different institutional contexts, always considering the regulatory and organisational constraints inherent to the education sector.
The findings offer valuable insights into the factors influencing talent loyalty within academic institutions. The composition of the variables that make up the two sub-datasets is detailed in Appendix A, Table A1. This table provides a complete description of the variables included in each sub-dataset, offering a clear view of the elements that constitute the data used in the analysis. Through dimensionality reduction and clustering techniques, significant patterns were identified—some aligning with previous research while others providing novel interpretations of professional behaviours and needs.
The proposed methodology proved effective in not only simplifying the inherent complexity of the dataset but also generating actionable insights that can inform strategies to improve talent commitment in academic settings. Various analytical tools were employed, including correlation matrix analysis, which was computed using Kendall’s coefficient, Principal Component Analysis (PCA), and the K-Means clustering algorithm. These techniques helped reduce the dataset’s dimensionality, facilitating the identification of key relationships and sub-datasets of interest.
Two primary sub-datasets emerged from the analysis: Quantitative Emotional Salary (QES) and Economic Compensation (EC). The QES sub-dataset, comprising nineteen correlated variables, highlights the critical role of emotional factors in faculty commitment. In contrast, the EC sub-dataset, consisting of ten correlated variables, demonstrates how economic compensation is also linked to talent commitment.
PCA was instrumental in reducing the dataset’s complexity while preserving its most significant information. In the QES sub-dataset, five principal components explained more than 70% of the total variance, whereas in the EC sub-dataset, just two principal components accounted for 80% of the total variance. Beyond dimensionality reduction, PCA also provided a clearer visualisation of relationships between clusters, enabling a more intuitive interpretation of the segmentation.
The application of clustering techniques, particularly the K-Means algorithm, revealed four distinct employee segments in both sub-datasets. Notably, the cluster closest to the origin of the coordinate system in both cases exhibited lower explained variability by the first two principal components. This suggests that observations within these clusters share less variability in explanatory factors, a finding of relevance since talent turnover was concentrated in these groups. Each observation represents a set of characteristics that define an individual in the dataset. Therefore, within the clusters closest to the origin of the coordinate system, individuals have lower variability in the characteristics that explain them, meaning they share more similarities. This is relevant because, as mentioned, talent turnover was concentrated in these clusters with lower variability, which may indicate that these shared characteristics are linked to the lack of talent retention. This indicates that the characteristics defining these clusters are associated with a higher likelihood of faculty member departure.
In the QES sub-dataset, segmentation revealed that professors with lower quantitative emotional salary levels were more likely to leave the institution. This finding aligns with previous studies, such as [38], which emphasised the importance of emotional well-being and job satisfaction as key predictors of talent retention. These results suggest that emotional salary factors—such as recognition, work–life balance, and institutional support—play a crucial role in faculty commitment.
The study also found that all academic staff categories responded similarly to the absence of emotional salary, underscoring its universal importance. Additionally, results indicate that professors with lower economic salaries, as well as those in research-intensive positions, were more vulnerable to turnover.
By identifying key factors influencing talent commitment, this research reinforces the growing significance of non-economic well-being as a fundamental pillar for fostering faculty loyalty. While economic compensation remains relevant, the findings highlight those intangible factors—represented by the QES sub-dataset—strongly influence faculty decisions to remain at an institution.
These insights emphasise the need for comprehensive retention strategies that integrate both financial and emotional incentives. Strengthening faculty commitment requires not only improvements in salary structures but also targeted initiatives to enhance the work environment, professional recognition, and overall job satisfaction. By addressing both economic and emotional factors, universities can develop holistic strategies to retain academic talent and foster long-term institutional stability.

5. Conclusions

The findings of this research highlight a significant relationship between Quantitative Emotional Salary (QES) and talent loyalty, showing similar behavioural patterns to those observed in the Economic Compensation (EC) sub-dataset. However, this study emphasises, that both QES and EC independently contribute to academic talent loyalty, suggesting that while both factors influence commitment, they do so in complementary ways. This study reinforces the growing recognition that emotional salary plays a fundamental role in academic talent commitment, alongside economic factors.
The developed methodological framework proved effective in identifying latent patterns in the data while also reducing dimensionality, thereby enhancing interpretability. This improved clarity makes the results more accessible and applicable for decision-makers in higher education institutions.
Nevertheless, it is important to underscore that this research is based on a single institutional case study. As such, its findings are context-specific and should not be generalised across the entire higher education sector without further validation. Institutional structures, regulatory frameworks, and cultural contexts vary widely between universities and national systems, and these factors could significantly affect the relevance and replicability of the observed patterns.
Despite promising results, this study has certain limitations. One key limitation is that the projection of original variables into the PCA space may obscure significant nonlinear relationships, potentially limiting a deeper understanding of underlying data structures. Additionally, while the K-Means clustering algorithm is computationally efficient, it assumes spherical clusters of similar sizes, which may not fully capture the actual distribution of data points.
Another limitation pertains to the possible exclusion of essential qualitative variables during the initial data selection process. This study focused primarily on quantitative dimensions of emotional salary and economic compensation; however, qualitative factors—such as personal perceptions, institutional culture, and professional experiences—could provide additional insights into talent loyalty.
Future research should address these methodological limitations by exploring hybrid approaches that integrate both qualitative and quantitative dimensions. Expanding the scope of analysis to broader and more diverse organisational contexts would allow for a more comprehensive understanding of talent commitment.
This study also opens new pathways for data-driven talent management research. By leveraging data analytics, future studies can explore critical strategic lines in human resource management, helping institutions develop more targeted and effective policies for fostering long-term faculty engagement and commitment.

Author Contributions

Conceptualization, A.-I.A.-S., J.P., O.C. and A.F.; methodology, J.P. and A.F.; software, A.-I.A.-S.; validation, J.P. and A.F.; formal analysis, J.P. and A.F.; investigation, A.-I.A.-S. and O.C.; resources, A.-I.A.-S. and O.C.; data curation, A.-I.A.-S.; writing—original draft preparation, A.-I.A.-S. and J.P.; writing—review and editing, A.-I.A.-S., J.P., O.C. and A.F.; visualization, A.-I.A.-S.; supervision, J.P.; project administration, J.P.; funding acquisition, A.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by Universidad CEU Cardenal Herrera with the grant number INDI24/17.

Institutional Review Board Statement

This study did not require approval from an ethics committee, as it was conducted using aggregated data with no individual-level identifiers. The data used in the analysis are fully anonymised and grouped in such a way that it is not possible to trace back any information to individual participants. Therefore, according to applicable regulations and ethical standards, ethical approval was not necessary.

Informed Consent Statement

Informed consent for participation was not required, as the dataset consists exclusively of aggregated and anonymised data with no personal or identifiable information related to health, religion, ethnicity, or sexual orientation. There is no way to trace any data back to individual persons, and thus, the study does not fall within the scope of research involving human subjects as defined by ethics guidelines.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

Abbreviations

The following abbreviations are used in this manuscript:
PAPeople Analytics
MLMachine Learning
PCAPrincipal Component Analysis
HRHuman Resources
HRMHuman Resources Management
QESQuantitative Emotional Salary
ECEconomic Compensation
GDPRGeneral Data Protection Regulation
ETLExtract Transform Load
WCSSWithin-Cluster Sum of Squares

Appendix A

Table A1. Descriptions of the variables included in the two sub-datasets (QES and EC).
Table A1. Descriptions of the variables included in the two sub-datasets (QES and EC).
Sub-datasetFinal FeaturesEnglish Description
ECFunctional Complement 2018Amount of functional economic complements in 2018.
ECFunctional Complement 2019Amount of functional economic complements in 2019.
ECFunctional Complement 2020Amount of functional economic complements in 2020.
ECFunctional Complement 2021Amount of functional economic complements in 2021.
ECFunctional Complement 2022Amount of functional economic complements in 2022.
ECGross 2018Employee’s gross salary amount in 2018.
ECGross 2019Employee’s gross salary amount in 2019.
ECGross 2020Employee’s gross salary amount in 2020.
ECGross 2021Employee’s gross salary amount in 2021.
ECGross 2022Employee’s gross salary amount in 2022.
QESAverage hours in positionAverage teaching hours during the employee’s time at the university (up to five years).
QESAverage reduction hoursAverage hour reduction during the employee’s time at the university (up to five years).
QESAverage training coursesAverage number of training courses completed during the employee’s time at the university (up to five years).
QESAverage training hoursAverage training hours completed during the employee’s time at the university (up to five years).
QESHours last year reductionTeaching hour reduction in the last year. It is a way to compensate for the additional workload related to responsibilities outside of direct teaching, such as academic management, program coordination, or participation in committees.
QESLast year position hoursTeaching hours in the last year.
QESMaximum reduction hoursMaximum hour reduction in an academic year.
QESNumber of years reductionNumber of years the employee had hour reductions.
QESRector position yearLast year the employee assumed an executive team position.
QESSeniority durationDuration of the employee’s seniority.
QESSeniority yearYear of the employee’s seniority.
QESStepsNumber of changes in the employee’s position—either in category level or category—during the employee’s time at the university (up to five years).
QESStudy Aids 2018Amount of study aid received in 2018.
QESStudy Aids 2019Amount of study aid received in 2019.
QESStudy Aids 2020Amount of study aid received in 2020.
QESStudy Aids 2021Amount of study aid received in 2021.
QESStudy Aids 2022Amount of study aid received in 2022.
QESWork dayIndicates the percentage of the employee’s working hours.
QESYears of trainingNumber of years the employee has received training.
Table A2. Values taken by the qualitative variables used throughout the study.
Table A2. Values taken by the qualitative variables used throughout the study.
Current Cat GroupedCurrent CategoryCurrent Category Level
AssociateAssociate ProfessorAssociate Professor Level I
AssociateAssociate ProfessorAssociate Professor Level II
AssociateAssociate ProfessorAssociate Professor Level III
AssociateAssociate ProfessorAssociate Professor Level IV
AssociateAssociate ProfessorAssociate Professor Level V
AssociateAssociate ProfessorAssociate Professor Level VI
AssociateAssociate ProfessorAssociate Professor Level VII
AssociateAssociate ProfessorAssociate Professor Level VIII
FullFull ProfessorFull Professor Level I
FullFull ProfessorFull Professor Level II
FullFull ProfessorFull Professor Level III
FullFull ProfessorFull Professor Level IV
FullFull ProfessorFull Professor Level V
FullFull ProfessorFull Professor Level VI
FullFull ProfessorFull Professor Level VII
FullFull ProfessorFull Professor Level VIII
Research GroupAssistant ProfessorAssistant Professor Level I
Research GroupAssistant ProfessorAssistant Professor Level II
Research GroupAssistant ProfessorAssistant Professor Level III
Research GroupAssistant ProfessorAssistant Professor Level IV
Research GroupAssistant ProfessorAssistant Professor Level V
Research GroupAssistant ProfessorAssistant Professor Level VI
Research GroupAssistant ProfessorAssistant Professor Level VII
Research GroupInstructorInstructor Level I
Research GroupInstructorInstructor Level II
Research GroupResearch ProfessorResearch Professor

References

  1. Tursunbayeva, A.; Di Lauro, S.; Pagliari, C. People analytics—A scoping review of conceptual boundaries and value propositions. Int. J. Inf. Manag. 2018, 43, 224–247. [Google Scholar] [CrossRef]
  2. Coculova, J.; Tomcikova, L. Innovative human resource management practices for the talent management implementation. Mark. Manag. Innov. 2021, 5, 47–54. [Google Scholar] [CrossRef]
  3. Loscher, G.J.; Bader, V. Creating accountability through HR analytics—An audit society perspective. Hum. Resour. Manag. Rev. 2023, 33, 100974. [Google Scholar] [CrossRef]
  4. Meijerink, J.; Bondarouk, T. The duality of algorithmic management: Toward a research agenda on HRM algorithms, autonomy and value creation. Hum. Resour. Manag. Rev. 2023, 33, 100876. [Google Scholar] [CrossRef]
  5. Belizón, M.J.; Kieran, S. Human resources analytics: A legitimacy process. Hum. Resour. Manag. J. 2022, 32, 603–630. [Google Scholar] [CrossRef]
  6. Shet Sateesh, V.; Poddar, T.; Wamba Samuel, F.; Dwivedi, Y.K. Examining the determinants of successful adoption of data analytics in human resource management—A framework for implications. J. Bus. Res. 2021, 131, 311–326. [Google Scholar] [CrossRef]
  7. Shrivastava, S.; Nagdev, K.; Rajesh, A. Redefining HR using people analytics: The case of Google. Hum. Resour. Manag. Int. Dig. 2018, 26, 3–6. [Google Scholar] [CrossRef]
  8. Margherita, A. Human resources analytics: A systematization of research topics and directions for future research. Hum. Resour. Manag. Rev. 2022, 32, 100795. [Google Scholar] [CrossRef]
  9. Belizón, M.J.; Majarín, D.; Aguado, D. Human resources analytics in practice: A knowledge discovery process. Eur. Manag. Rev. 2024, 21, 659–677. [Google Scholar] [CrossRef]
  10. Álvarez-Gutiérrez, F.J.; Stone, D.L.; Castaño, A.M.; García-Izquierdo, A.L. Human Resources Analytics: A systematic Review from a Sustainable Management Approach. Rev. Psicol. Trab. Y Las Organ. 2022, 38, 129–147. [Google Scholar] [CrossRef]
  11. Ravesangar, K.; Narayanan, S. Adoption of HR analytics to enhance employee retention in the workplace: A review. Hum. Resour. Manag. Serv. 2024, 6, 3481. [Google Scholar] [CrossRef]
  12. Meijerink, J.; Boons, M.; Keegan, A.; Marler, J. Algorithmic human resource management: Synthesizing developments and cross-disciplinary insights on digital HRM. Int. J. Hum. Resour. Manag. 2021, 32, 2545–2562. [Google Scholar] [CrossRef]
  13. Castro-Ceacero, D.; Rodriguez-Gomez, D.; Muñoz-Moreno, J.L.; Calatayud, A. The intergenerational climate of Spanish university research. Stud. High. Educ. 2023, 48, 1696–1707. [Google Scholar] [CrossRef]
  14. Kennedy, M.R.; Deans, Z.; Ampollini, I.; Breit, E.; Bucchi, M.; Seppel, K.; Vie, K.J.; Meulen, R.t. “It is Very Difficult for us to Separate Ourselves from this System”: Views of European Researchers, Research Managers, Administrators and Governance Advisors on Structural and Institutional Influences on Research Integrity. J. Acad. Ethics 2023, 21, 471–495. [Google Scholar] [CrossRef]
  15. Rombaut, E.; Guerry, M.A. Predicting voluntary turnover through human resources database analysis. Manag. Res. Rev. 2018, 41, 96–112. [Google Scholar] [CrossRef]
  16. Guerranti, F.; Dimitri, G.M. A Comparison of Machine Learning Approaches for Predicting Employee Attrition. Appl. Sci. 2023, 13, 267. [Google Scholar] [CrossRef]
  17. Ben Yahia, N.; Hlel, J.; Colomo-Palacios, R. From Big Data to Deep Data to Support People Analytics for Employee Attrition Prediction. IEEE Access 2021, 9, 60447–60458. [Google Scholar] [CrossRef]
  18. Naz, K.; Siddiqui, I.F.; Koo, J.; Khan, M.A.; Qureshi, N.M.F. Predictive Modeling of Employee Churn Analysis for IoT-Enabled Software Industry. Appl. Sci. 2022, 12, 10495. [Google Scholar] [CrossRef]
  19. Wardhani, F.H.; Lhaksmana, K.M. Predicting Employee Attrition Using Logistic Regression With Feature Selection. Sinkron 2022, 7, 2214–2222. [Google Scholar] [CrossRef]
  20. Fallucchi, F.; Coladangelo, M.; Giuliano, R.; De Luca, E.W. Predicting employee attrition using machine learning techniques. Computers 2020, 9, 86. [Google Scholar] [CrossRef]
  21. Sanjeetha, S.; Phani Krishna, C. Analysis of Employee Attrition using for Machine Learning Techniques. Turk. J. Comput. Math. Educ. 2021, 12, 28–31. [Google Scholar]
  22. Stirpe, L.; Profili, S.; Sammarra, A. Satisfaction with HR practices and employee performance: A moderated mediation model of engagement and health. Eur. Manag. J. 2022, 40, 295–305. [Google Scholar] [CrossRef]
  23. Herzberg, F.I. Work and the Nature of Man; World Pub: Cleveland, OH, USA, 1966; Available online: https://www.scirp.org/reference/referencespapers?referenceid=1775482 (accessed on 26 October 2024).
  24. Deci, E.L.; Ryan, R.M. Intrinsic Motivation and Self-Determination in Human Behavior; Springer: Berlin/Heidelberg, Germany, 1985. [Google Scholar] [CrossRef]
  25. Ruíz-Valdés, S.; Ruíz-Tapia, J.A. The emotional salary as a strategy to encourage work commitment and talent retention in organization. J. Int. Econ. 2022, 6, 8–16. [Google Scholar] [CrossRef]
  26. Augusto Reis, T.; Rose Campagnolli, D.; Canuto da Silva, T.; Oste Graziano Cremonezi, G.; Author, C. Emotional Salary as a Strategy to Retain Talents. IOSR J. Humanit. Soc. Sci. 2018, 23, 74–80. [Google Scholar] [CrossRef]
  27. Peiró, J.M.; Kozusznik, M.; Molina, I.R.; Tordera, N. The happy-productive worker model and beyond: Patterns of wellbeing and performance atwork. Int. J. Environ. Res. Public Health 2019, 16, 479. [Google Scholar] [CrossRef] [PubMed]
  28. Junça Silva, A.; Burgette, A.R.; Fontes da Costa, J. Toward a Sustainable World: Affective Factors Explain How Emotional Salary Influences Different Performance Indicators. Sustainability 2024, 16, 2198. [Google Scholar] [CrossRef]
  29. Krishna, S.; Sidharth, S. HR Analytics: Employee Attrition Analysis using Random Forest. Int. J. Perform. Eng. 2022, 18, 275–281. [Google Scholar] [CrossRef]
  30. Pearson, K. On lines and planes of closest fit to systems of points in space. Philos. Mag. 1901, 2, 559–572. [Google Scholar] [CrossRef]
  31. Hotelling, H. Analysis of a complex of statistical variables into principal components. J. Educ. Psychol. 1933, 24, 498–520. [Google Scholar] [CrossRef]
  32. Niemczynowicz, A.; Kycia, R.A. The analysis of engagement at the workplace of Generation Z—Machine learning in management. In Managing Generation Z: Motivation, Engagement and Loyalty; Taylor and Francis: Oxfordshire, UK, 2023; pp. 83–103. [Google Scholar] [CrossRef]
  33. Xu, Q.; Ding, C.; Liu, J.; Luo, B. PCA-guided search for K-means. Pattern Recognit. Lett. 2015, 54, 50–55. [Google Scholar] [CrossRef]
  34. Malli, S.; Nagesh, H.R.; Rao, B.D. Approximation to the K-Means Clustering Algorithm using PCA. Int. J. Comput. Appl. 2020, 175, 43–46. [Google Scholar] [CrossRef]
  35. Romanuke, V.V. Random centroid initialization for improving centroid-based clustering. Decis. Mak. Appl. Manag. Eng. 2023, 6, 734–746. [Google Scholar] [CrossRef]
  36. Ahmed, M.; Seraj, R.; Islam, S.M.S. The k-means algorithm: A comprehensive survey and performance evaluation. Electronics 2020, 9, 1295. [Google Scholar] [CrossRef]
  37. Zeng, G.; Liu, F.; Xiong, C.; Huang, Y. Analysis of Innovative Strategies for Talent Team Development from the Perspective of Human Resource Management in Colleges and Universities. Contemp. Educ. Teach. Res. 2024, 5, 430–435. [Google Scholar] [CrossRef]
  38. Qamar, Y.; Samad, T.A. Human resource analytics: A review and bibliometric analysis. Pers. Rev. 2022, 51, 251–283. [Google Scholar] [CrossRef]
Figure 1. Distribution of non-retained individuals of QES sub-dataset in the coordinate system of the first two principal components. Associate: Associate Professor. Full: Full Professor. Research Group: Assistant Professor, Instructor, and Research Professor.
Figure 1. Distribution of non-retained individuals of QES sub-dataset in the coordinate system of the first two principal components. Associate: Associate Professor. Full: Full Professor. Research Group: Assistant Professor, Instructor, and Research Professor.
Merits 05 00014 g001
Figure 2. K-Means Clusters in the Space of the First Two Principal Components.
Figure 2. K-Means Clusters in the Space of the First Two Principal Components.
Merits 05 00014 g002
Figure 3. Distribution of Non-Retained Observations of EC sub-dataset in the coordinate system of the first two principal components. Associate: Associate Professor. Full: Full Professor. Research Group: Assistant Professor, Instructor, and Research Professor.
Figure 3. Distribution of Non-Retained Observations of EC sub-dataset in the coordinate system of the first two principal components. Associate: Associate Professor. Full: Full Professor. Research Group: Assistant Professor, Instructor, and Research Professor.
Merits 05 00014 g003
Figure 4. K-Means Clusters in the Space of the First Two Principal Components.
Figure 4. K-Means Clusters in the Space of the First Two Principal Components.
Merits 05 00014 g004
Table 1. Contribution to the total variance explained by the principal components in the QES sub-dataset.
Table 1. Contribution to the total variance explained by the principal components in the QES sub-dataset.
Quantitative Emotional SalaryDim.1Dim.2Dim.3Dim.4Dim.5
Average hours in position0.5694510.53936515.4530602.03024516.625706
Average reduction hours10.1275273.9225178.0365100.1448091.121154
Average training courses2.0285942.6983139.88631411.81559013.909010
Average training hours1.8397292.1088837.02801113.22894022.318579
Hours last year reduction10.2444423.1896067.9141840.4801771.166354
Last year position hours0.7936660.90022821.4462701.2377606.655441
Maximum reduction hours10.7617943.4757178.1226960.1363991.219806
Number of years reduction9.8678642.7599330.4189370.0000300.006230
Rector position year5.4537891.4465015.1205930.2011371.379636
Seniority duration6.8787440.0602130.92087429.3346206.230866
Seniority year6.8787440.0602130.92087429.3346206.230866
Steps0.9422242.3852601.3174872.82260821.011102
Study aids 20182.74877515.6130720.2454730.6319670.297226
Study aids 20193.21001419.6222320.2789241.2602490.032952
Study aids 20204.22563818.8304640.1241251.8586930.024720
Study aids 20216.9627287.9139090.0164770.7717630.762980
Study aids 20226.5092019.8009380.0001800.9405780.525112
Work day3.7355722.4896914.3637793.5338850.386016
Years of training6.2215062.1829468.3852310.2359290.096246
N = 599; values reflect the contribution of each variable to the five principal dimensions; a higher absolute value indicates a greater influence of the variable in the corresponding dimension.
Table 2. Percentage distribution of a qualitative variable by clusters with QES sub-dataset.
Table 2. Percentage distribution of a qualitative variable by clusters with QES sub-dataset.
Current Category LevelCluster 1Cluster 2Cluster 3Cluster 4
Assistant Professor Level I27.05-14.11-
Assistant Professor Level II5.746.597.266.25
Assistant Professor Level III-3.302.426.25
Assistant Professor Level IV0.411.100.81-
Assistant Professor Level V0.412.200.40-
Assistant Professor Level VI--1.21-
Assistant Professor Level VII1.237.692.0212.50
Associate Professor Level I3.287.6912.90-
Associate Professor Level II2.057.6914.11-
Associate Professor Level III1.641.101.61-
Associate Professor Level IV-2.200.40-
Associate Professor Level V-2.201.61-
Associate Professor Level VI-1.10--
Associate Professor Level VII---6.25
Associate Professor Level VIII0.412.203.236.25
Full Professor Level I-5.492.426.25
Full Professor Level II-12.095.246.25
Full Professor Level III-13.192.826.25
Full Professor Level IV-4.402.4225.00
Full Professor Level V-3.302.42-
Full Professor Level VI-5.492.0212.50
Full Professor Level VII0.413.301.616.25
Full Professor Level VIII0.415.491.21-
Instructor Level I31.97-4.84-
Instructor Level II6.561.1012.90-
Research Professor18.441.10--
N = 599; values represent the percentage of professors in each category within the clusters; (-) indicates the absence of values in the corresponding category within the cluster.
Table 3. Contribution to the total variance explained by the principal components in EC sub-dataset.
Table 3. Contribution to the total variance explained by the principal components in EC sub-dataset.
Economic Compensation Sub-DatasetDim 1Dim 2
Functional complement 20187.0534828.731980
Functional complement 20199.99361910.174539
Functional complement 202010.15138813.075159
Functional complement 20219.89999512.410984
Functional complement 20228.8375779.658164
Gross 201810.00835810.272620
Gross 201911.2674918.489656
Gross 202011.4626979.554073
Gross 202111.1459239.344951
Gross 202210.1794708.287875
N = 599; values reflect the contribution of each variable to the first two principal dimensions; a higher absolute value indicates a greater influence of the variable in the corresponding dimension.
Table 4. Percentage distribution of a qualitative variable by clusters with EC sub-dataset.
Table 4. Percentage distribution of a qualitative variable by clusters with EC sub-dataset.
Current Category LevelCluster 1Cluster 2Cluster 3Cluster 4
Assistant Professor Level I32.61-13.07-
Assistant Professor Level II3.041.9212.567.58
Assistant Professor Level III-0.964.021.52
Assistant Professor Level IV-1.920.501.52
Assistant Professor Level V0.870.96-1.52
Assistant Professor Level VI--1.011.52
Assistant Professor Level VII-8.651.019.09
Associate Professor Level I3.481.9216.586.06
Associate Professor Level II1.745.7716.586.06
Associate Professor Level III0.431.921.514.55
Associate Professor Level IV-0.960.501.52
Associate Professor Level V-0.961.014.55
Associate Professor Level VI-0.96--
Associate Professor Level VII---1.52
Associate Professor Level VIII-7.691.013.03
Full Professor Level I-3.852.016.06
Full Professor Level II-15.381.519.09
Full Professor Level III-12.500.509.09
Full Professor Level IV-7.690.507.58
Full Professor Level V-7.69-1.52
Full Professor Level VI-7.69-6.06
Full Professor Level VII-4.810.504.55
Full Professor Level VIII-5.770.503.03
Instructor Level I33.04-7.04-
Instructor Level II5.22-17.593.03
Research Professor19.57-0.50-
N = 599; values represent the percentage of professors in each category within the clusters; (-) indicates the absence of values in the corresponding category within the cluster.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Alonso-Sastre, A.-I.; Pardo, J.; Cortijo, O.; Falcó, A. Quantitative Emotional Salary and Talent Commitment in Universities: An Unsupervised Machine Learning Approach. Merits 2025, 5, 14. https://doi.org/10.3390/merits5020014

AMA Style

Alonso-Sastre A-I, Pardo J, Cortijo O, Falcó A. Quantitative Emotional Salary and Talent Commitment in Universities: An Unsupervised Machine Learning Approach. Merits. 2025; 5(2):14. https://doi.org/10.3390/merits5020014

Chicago/Turabian Style

Alonso-Sastre, Ana-Isabel, Juan Pardo, Oscar Cortijo, and Antonio Falcó. 2025. "Quantitative Emotional Salary and Talent Commitment in Universities: An Unsupervised Machine Learning Approach" Merits 5, no. 2: 14. https://doi.org/10.3390/merits5020014

APA Style

Alonso-Sastre, A.-I., Pardo, J., Cortijo, O., & Falcó, A. (2025). Quantitative Emotional Salary and Talent Commitment in Universities: An Unsupervised Machine Learning Approach. Merits, 5(2), 14. https://doi.org/10.3390/merits5020014

Article Metrics

Back to TopTop