Next Article in Journal
Leaders, Let’s Get Agile: Examining Project Performance Through Sequential Moderated Mediation of Value Co-Creation and Stakeholder Satisfaction Using the Lens of Agile Leadership
Previous Article in Journal
Venture Capital as a Catalyst for Innovation and Economic Growth in Emerging Economies: A Systematic Review and Future Research Agenda
Previous Article in Special Issue
Determinants of Job Performance in Professional Workers and Moderating Effect of Gender: The Peruvian Case
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Prediction and Ranking of Corporate Diversity in European and American Firms

by
Iñigo Martín-Melero
1,*,
Felipe Hernández-Perlines
1,
Raúl Gómez-Martínez
2 and
María Luisa Medrano-García
2
1
Business Administration Department, University of Castilla-La Mancha, 45071 Toledo, Spain
2
Business Economics Department, Rey Juan Carlos University, 28933 Madrid, Spain
*
Author to whom correspondence should be addressed.
Adm. Sci. 2025, 15(11), 406; https://doi.org/10.3390/admsci15110406
Submission received: 18 August 2025 / Revised: 13 September 2025 / Accepted: 14 October 2025 / Published: 22 October 2025

Abstract

Currently, corporate social responsibility and environmental/social/governance topics are gaining more relevance in business and finance. Attention to corporate diversity in boards and the workforce is included in this trend. Although most studies focus on executive boards and objective scores, the perception of diversity by employees and its rankability are not fully understood or researched. In this paper, we analyze corporate diversity rankings from the perspective of predictive and prescriptive analytics. Inside predictive analytics, the perceived diversity of a sample of 350 European diversity leader companies is predicted by using three different feature sets (raw financial data, ratios and objective diversity variables) and three machine learning algorithms (K Nearest Neighbors, Logistic Regression, Decision Tree). The best performing algorithm is the Decision Tree, and all three feature sets outperform one random dummy algorithm; the best performing set is the financial ratios set. Inside prescriptive analytics, several rankings involving American companies are intersected and compared in three exercises (studying diversity categorization, ethnic origin and comparing diversity with other unrelated metrics). From these, global rankings were built to search for the best possible agreement among the rankings. These results with both predictive and prescriptive analytics encourage managers to strategize and include diversity in management, as well as employ new technologies in their decision-making processes.

1. Introduction

Diversity in boards and in the workplace, within the Corporate Social Responsibility (CSR) paradigm, is a growing demand of society and a topic of scientific interest in business (Frynas & Yamahaki, 2016). As defined by Patrick and Kumar (2012), diversity refers to understanding, respecting and accepting the uniqueness of each individual, especially in the dimensions of race, ethnicity, gender, sexual orientation, socioeconomic status, age, physical abilities and religious, political or other beliefs. Foma (2014) argues that workplace diversity promotes positive points such as the exchange of ideas, development of friendship without discrimination, elimination of stereotyping, conflict management and employee retention; however, drawbacks include communication gaps due to language barriers and resistance to change. Similarly, Fine et al. (1990) suggest that men, women, and people of color have different views on organizational issues; therefore, considering gender and race as cultures might be a useful framework for understanding diversity. The advantages of corporate diversity should theoretically prove its business case; indeed, a diverse workforce is sometimes linked to an improvement in financial performance and innovation (Lorenzo & Reeves, 2018). The literature shows that diverse executive boards are positively correlated with CSR compliance (Harjoto et al., 2015). This is also applicable to gender diversity too (Beji et al., 2021; Rao & Tilt, 2016), which is one of the most researched themes in this field. However, empirical research on the effect of diversity on financial performance has produced mixed results: Jayne and Dipboye (2004) conclude that it is very context-dependent and recommend organizations to build senior management commitment, extensively assess the needs, emphasize team building and tie the business results via metrics with the diversity strategy.
Consequently, diversity is a relevant topic for analysis with important implications for businesses and corporations (Aßländer et al., 2016; Gotsis & Kortezi, 2013). The current academic debate surrounding corporate diversity, CSR, and performance is multifaceted, with ongoing discussions regarding the tangible financial benefits of diversity, meta-analyses and large-scale studies offering conflicting or context-dependent findings. Different perspectives may be adopted when analyzing corporate diversity. One popular perspective is to evaluate the relationship between corporate diversity and financial performance, where there is no clear consensus that the former improves the latter (Opstrup & Villadsen, 2015; Singal & Gerde, 2015). Even though predictive analytics techniques, such as Machine Learning (ML), are proven to be suitable for building robust models in general (Ahmed et al., 2022), literature applying it to the study of this relationship is not abundant and centered mostly on diversity at the board composition level (Behlau et al., 2024; Yang et al., 2024), instead of workforce corporate diversity as a whole and its perception by employees. These predictions bear more complex social outcomes, which remain less common and face validity challenges, inside the a-contextual, unclear and unstable concept of diversity climate (Cachat-Rosset et al., 2019). Another approach to corporate diversity are company rankings, where diversity is increasingly employed as a criterion for ranking and comparing firms (Bouslah et al., 2023; Pasztor, 2019). Even though companies are recurrently ranked in terms of diversity and prescriptive analytics techniques, such as Operations Research (OR), and provide methods to analyze these rankings (Birge & Linetsky, 2007), literature comparing the results of companies among different diversity rankings is very limited. While the focus in business and research is shifting from simple demographic representation to measuring genuine equity and inclusion, robust metrics remain elusive: proliferation of corporate diversity ratings and rankings faces scrutiny regarding methodological consistency, transparency, and potential ‘greenwashing’ or ‘diversity-washing’. The gaps in the literature are, consequently, three-fold. In theory, diversity climate is generally analyzed in terms of how effective HR policies are when set in place, such as the age diversity ones in Boehm et al. (2014), or the differences between the objective reality and perceived diversity, as in Reinwald et al. (2019). However, references linking financial performance and diversity climate are more scarce. Diversity rankings and its relationship to other corporate rankings is also underanalyzed, even though their relevance as a source of institutional legitimacy is defended in Tayar (2017), including how some of them may only reflect superficial elements and not deeply ingrained diversity. Finally, in practice, diversity climate has not been analyzed with ML and OR tools, despite their potential and application in other fields.
Therefore, in this paper, these gaps in the literature on corporate workforce diversity. In the predictive ML part, the relationship between financial performance and corporate workforce diversity is studied by building several ML models to predict employees’ perception of diversity in European corporations using three different feature sets: raw financial variables, preprocessed financial ratios and objective diversity scores as extracted from recognized rankings and databases. The best ML models were Decision Trees fitted with financial ratios, with an accuracy of ∼66%. In the prescriptive, OR part, performed in American companies, the similarity and conflict between the criteria of 11 recognized rankings is explored, employing a mixed integer linear programming formulation to aggregate the grading orders of the individual rankings into a general grading order, which most rankings agree with. The three OR comparison exercises carried out include one on best companies solely according to diversity scores, another on best companies depending on the ethnic origin of the employee, and one crossing the best companies in diversity jointly with other metrics, such as accessibility for new graduates or veterans. Small intersections and moderate–low correlations between individual rankings were obtained, and several solutions for the OR ranking exercises where found.
This paper contributes to the debate on the effects of corporate diversity in several ways. The ML part suggests a fair relationship between the financial performance and the perceived workforce diversity of a firm. Moderately linking corporate diversity to financial performance highlights how profitability can affect the perception employees have of corporate diversity, as well as their engagement. Likewise, not adopting diversity initiatives or having a low corporate diversity score might become a risk in strategy management. In this case, ML becomes useful for extracting decision-making insights regarding diversity, and the fair agreement in the models positions our paper in line with those that suggest a neutral, mildly positive relationship between corporate diversity and firm performance. The OR part manifests the possibility of constructing aggregated ranks of companies from partial rankings built with different methodologies, confirming the importance of formulating comprehensive diversity strategies that address its several dimensions. Aggregated ranks also portray industry-specific insights and allow managers to evaluate their relative positions in diversity policies with respect to industrial peers. Moreover, this work opens additional future research lines on the use of ML and OR in the study of diversity.

2. Literature Review

2.1. Corporate Diversity Theory

The study of board and workforce diversity and its outcomes in organizational and financial performance is of scientific interest; in De Abreu Dos Reis et al. (2007), a literature review of over 50 years of empirical research, corporate diversity is divided into the ethnical-racial, gender, age, group tenure, organizational tenure, functional background and educational background dimensions. Most of this research on corporate diversity consists of econometric analyses, of a deductive nature.
When discussing corporate diversity in terms of nationality, theories like Hofstede’s cultural dimensions explain differences in behavior between people in the business environment (Beugelsdijk et al., 2017). This cultural theory includes six dimensions: power distance, individualism/collectivism, masculinity/femininity, avoidance of uncertainty, long-term orientation and indulgence/restraint. Xu et al. (2024) proves that power distance has a negative effect on green innovation, while individualism, masculinity, uncertainty avoidance, long-term orientation and indulgence have a positive effect, from data of Asian economies in the period 2000–2019. Afzal and Lyu (2025) analyzes 100 construction firms and concludes that female board representation positively influences diversity outcomes, moderated by cultural traits: low power distance, high indulgence and high uncertainty avoidance positively correlated with diversity outcomes.
In the case of ethnic-racial diversity, while some studies argue for its positive effects on performance (Hartenian & Gudmundson, 2000; Richard, 2000), others found a negative relationship (Kirkman et al., 2004; Sacco & Schmitt, 2005); further works found that ethnic-racial diversity had no or negligible effects (Ely, 2004). Similarly, research shows that gender diversity may have a positive (Erhardt et al., 2003), negative (Alagna et al., 1982) or no effect at all (Kochan et al., 2003) on performance.
Similarly, according to Ding and Riccucci (2023), diversity also presents mixed and context-specific results in the public administration sector. While it contributes positively to the operation of public organizations (Nicholson-Crotty et al., 2017), innovation (Choi et al., 2018) and inclusion (Sabharwal, 2014), it also incurs communication costs (Owens & Kukla-Acevedo, 2012) and low commitment from some groups (Ritz & Alfes, 2018). In the CSR field, other empirical results prove that diversity, specifically gender, can create a more stable and rigorous corporate climate and improve corporate soundness and social contributions (Rhee et al., 2023). Furthermore, age and tenure diversity can contribute to adopting more sustainable approaches in corporations’ vision and strategies, improving governance and reducing carbon emissions (Cahyono et al., 2023; Ferrero-Ferrero et al., 2015).
In any case, most studies linking corporate diversity and organizational or financial performance focus on gender or ethnic origin equality in executive boards (Amorelli & García-Sánchez, 2021). Some works argue that boards with larger gender diversity are associated with better environmental and social performance (Buertey, 2021; Orazalin & Baydauletov, 2020), while others conclude that it has no relevant effect (Veltri et al., 2021). In addition, most of these studies used linear or logistic regression. In Carter et al. (2007), these two dimensions are analyzed in a sample of Fortune 500 listed firms over 4 years. A positive link between board diversity and financial performance is found, supporting the economic case of board diversity. However, this positive effect was subtle and complex, suggesting that gender and ethnic origin diversity cannot be treated as a same category. Similarly, Rose et al. (2013) study female and citizenship board representation in the largest listed firms in Germany, Denmark, Sweden, Finland and Norway. Using data from 2010 onwards, several correlations were found between the size of the board and the proportion of females and non-nationals present in them. In the regression analyses, the ROA, ROE and ROCE ratios were employed as dependent variables. The results showed that female board representation was not associated with superior performance, and larger executive boards had a negative impact on performance. Still, having non-nationals from the US, the UK or Australia on the boards had a positive impact on the performance. Campbell and Mínguez-Vera (2008), when researching Spanish executive boards, recognized that investors did not penalize gender diversity in boards through two regression models, one based on Tobin’s Q. Gender diversity did not destroy shareholder value, implying it may generate economic returns. Other works, such as Moreno-Gómez et al. (2018), are more firm on this assertion: their results showed, from a sample of 54 Colombian public businesses studied during 2008–2015, that gender diversity is positively associated with business performance. The presence of women in CEO positions and in top management benefited ROA, while women’s representation in the boardroom was more evident in ROE.
Most previous literature studies corporate diversity through objective scores, quotas, ratios or KPIs. However, the diversity as perceived by the employees of the firm might not match this objective reality, nor be the same for every person. For instance, Kossek and Zonia (1993) found that, compared to white men, white women and racioethnic minorities appreciated an employer’s efforts to promote diversity, holding more favorable attitudes regarding the qualifications of women and racioethnic minorities. In Higgins (2020), diversity climate perception was positively related to work engagement and organizational justice in 230 surveys of employees mainly belonging to the technology, education and healthcare sectors. Similar conclusions were reached by Jauhari and Singh (2013), who found a mediating role of perceived organizational support in the positive relationship between perceived diversity climate and employees’ organizational loyalty. Wolfson et al. (2011) and Madera et al. (2013) associate a perceived diversity climate with organizational commitment and increased job satisfaction, while Seriwatana (2021) considers it key for employee retention.
The Literature also focuses on how diversity climate changes over time, after relevant social, political or economic events happen. For instance, while the COVID-19 pandemic was an opportunity for social innovation to help overcoming the lack of diversity management in countries like Turkey (Palalar Alkan et al., 2022), it also produced an increase in societal polarization and cleavages in the US due to individualism and anti-statism (Bazzi et al., 2021). However, in contrast to the subprime 2007 crisis, economic interventions by the Federal Reserve generated positive spillovers during the pandemic (Cortes et al., 2022). Also in the US, the aftermath of George Floyd’s murder brought increasing societal demands for diversity-promoting policies (Balakrishnan et al., 2023), which in turn produced rejection from specific societal groups a few years later (Rajgopal et al., 2023). ESG is given higher relevance in investment and policy depending on politics (Dantas, 2021; Hilson, 2024), which currently includes a rollback of diversity initiatives creating a climate of fear, exacerbating existing inequalities and intensifying political polarization (Ng et al., 2025).
Some works also examine diversity climate in terms of organizational and financial performance. Regression analysis in Allen et al. (2007) supports the positive link between organizational performance and diversity climate, while recognizing limitations in purely basing the study in perceptual measures and not objective scores (perception versus reality). Lim et al. (2023) found a positive link between diversity perception by employees and a firm’s financial performance, which is positively moderated by diversity at the board management level. McKay et al. (2008) found that diversity climate moderated the sales performance of employees depending on their racial-ethnic differences. In the cultural dimension, openness to linguistics, value, and informational diversity showed strong positive associations with perceived group performance in Lauring and Selmer (2011). In contrast, H. Lee (2019) finds that diversity climate bears good outcomes only in U.S. federal agencies that mainly work on promoting social equity for disadvantaged populations. Moon and Christensen (2020) go one step further, suggesting that racial and tenure diversity have positive relationships with organizational performance, but functional diversity does not.
More recently, due to political changes, the diversity and inclusion paradigm has been challenged by corporate entities and national authorities (McGowan et al., 2025). Some works continue defending it: in 18 interviews analyzed in Beckert and Koch (2025), diversity managers reveal internal factors (increased creativity and productivity) and external pressures (stakeholders expectations and talent attraction) affecting diversity initiatives, and how diversity-washing implies potential credibility loss and reputational damage. On the other hand, Armstrong (2025) suggests a complete and radical reconceptualization of diversity and inclusion, so that it addresses power imbalance. The relationship between diversity climate and the objective diversity of a firm remains the object of debate; as summarized in Herdman and McMillan-Capehart (2010), achieving a diversity climate is positively related to implementing diversity initiatives in a not straightforward manner, hence the relevance of studying it with novel techniques.

2.2. Corporate Diversity Studies with Predictive Models

Even though studies involving board and workforce diversity in business are abundant, references that employ ML techniques are scarce. Ranta and Ylinen (2023) employed 21 features related to board and firm characteristics to predict three workplace diversity labels. These diversity labels were gender equality, inclusiveness/diversity and attitude towards older colleagues, which were extracted from a sample of 250,000 employee reviews from Kununu, a recruiting website similar to Glassdoor. For the ML part, Gradient Boosting trees were employed with a typical training/testing split of 80%/20%. In the training set, hyperparameter tuning was performed using a 5-fold cross-validation GridSearch scheme. The Gradient Boosting trees (Friedman, 2001) were compared and benchmarked against an Ordinary Least Squares (OLS) regressor and the Least Absolute Shrinkage and Selection Operator (LASSO) regressor. SHAP values were also employed in the analysis of the models (Lundberg & Lee, 2017). The Gradient Boosting trees obtained a higher R 2 and lower Mean Square Error (MSE) than OLS and LASSO. Gender diversity in the workplace was a predictor of firm value and was strongly positively associated with equality and inclusivity on boards; however, it was weakly negatively associated with age diversity. Yousaf et al. (2021) coupled board diversity attributes with ML prediction capabilities of corporate financial distress. The final sample consisting of 160 healthy and 135 insolvent Chinese A-listed companies was extracted from Shanghai and Shenzhen Stock Exchanges data from 2007 to 2016. The ML analysis was performed with a training/testing split proportion of 90%/10% (Veganzones & Séverin, 2018), as well as two different feature sets: one formed by 23 economic variables (including accounting, market, growth, macro economic and corporate governance ratios) and another one formed by eight economic variables plus seven diversity variables. Both feature sets were tested using Logistic Regression, Dynamic Hazard, Random Forest, bagging, boosting and K Nearest Neighbors algorithms. The feature set including board diversity slightly outperformed the feature set containing only financial variables. Koseoglu et al. (2025) proved the use of financial indicators to predict diversity scores with R programming, on 873 multinational companies data from 2021.
Other works like Bianchi et al. (2022) argue otherwise. This empirical study researched the board of directors’ diversity (gender, nationality and age) of a sample of 59,229 Italian companies during 2017–2019. This work specifically targeted Small and Medium Enterprises (SMEs) with a total production of less than 50 million euro and less than 250 employees. The ML pipeline included a 5-fold cross validation scheme and a GridSearch hyperparameter tuning of the training set. Linear, LASSO and Ridge regressions were employed in the regression analysis, and the classification included Logistic Regression, Decision Tree and Random Forest (Breiman, 2001). In both sets of cases, ML was not capable of learning how to predict corporate financials from diversity variables, exhibiting high Root Mean Square Errors (RMSEs) and low accuracy values.

2.3. Corporate Diversity Studies with Prescriptive Models

In business management and economic research, rankings are commonly employed to compare business schools (Bickerstaffe & Ridgers, 2007), countries (Malul et al., 2009) and top companies in specific sectors (Klass et al., 2006). The aggregation of different overlapping ranks to form combined, summarizing rankings is useful for simplified decision-making and prioritization in many other disciplines, as it can integrate information from individual genomic studies addressing the same questions in biology (X. Li et al., 2019), as well as building meta-searches and improving search precision on the web (Dwork et al., 2001). In business, ranks are aggregated in Filbeck et al. (2013), who study four different corporate rankings. The cumulative and interactive effects in companies listed in these rankings add short-term and long-term value to business portfolios, especially when a given company is present in more than two or three of these rankings simultaneously. In addition, Martín-Zamora et al. (2025) proved the positive relationship between gender diversity in TMTs and corporate reputation by analyzing rankings.
However, references specifically linking OR and corporate diversity are very limited, despite the necessity to address non-uniformity and heterogeneity in diversity benchmarking methodologies (Foster et al., 2023). Most studies remain purely theoretical and mathematical: Kuo et al. (1993) and Martí et al. (2010) discuss the maximum diversity problem, that consists of selecting a subset of elements from a set such that the sum of distances between the chosen elements is maximized. Bhadury et al. (2000) found that workforce diversity in project teams is maximized by solving the dining problem, within the network-flow domain (Wolsey & Nemhauser, 2014). Nonetheless, no OR technique has been applied to thoroughly study companies as a whole in terms of diversity; hence, the relevance and novelty of this work.

3. Methodology

3.1. Data and Variables

The financial and diversity information of a set of European companies was studied in the ML exercise. These European companies are among the 850 ranked in Europe’s Diversity Leaders 2023, published by Statista R and Financial Times (Vincent, 2022). This ranking is produced by more than 100,000 surveys of European corporations employing at least 250 people, from April to July 2022. The surveys included direct and indirect evaluations of companies through five-point Likert scale statements on general, gender, ethnicity, LGBTQ+ status, age and disability diversity. Higher importance was given to the survey answers coming from the diversity groups. Each one of the 100,000 employees surveyed took an average of 6–9 min to complete the survey, evaluating not only their own employer but also other relevant employers in their industry, resulting in a total of 300,000 evaluations of companies. The result of these evaluations was a numeric score given to each company, namely di. To form the ranking, the companies were ordered from best to worst by their di score. This source represents perceived diversity because it is based on the subjectivity of employees’ responses, rather than objective numeric quotas or ratios involving corporate diversity.
The financial information of these European companies was then extracted from 15 ratios and 11 variables contained in Yahoo Finance through the income statement, balance sheet and cash flow (Yahoo, 2023). As for the diversity information, two scores (pd, wo) out of the eight available were sampled from Refinitiv Eikon (LSEG, 2023), while the remaining one, di, was extracted from the original Financial Times ranking. The variables of this ML exercise are collected and abbreviated in Table 1; they are commonly employed in other ML works involving Finance (D’Amato et al., 2021; Paule-Vianez et al., 2019; Roman & Șargu, 2013). Those companies present in Europe’s Diversity Leaders 2023 that had no financial information on Yahoo Finance or diversity scores on Eikon were excluded from the analysis, like Giorgio Armani or Start People. These data availability issues resulted in a reduction in the final dataset from the potential 850 original companies in the Financial Times ranking to around 350 companies.
The OR exercise consists of analyzing a series of rankings of American companies. Three different studies are proposed: one compares diversity rankings, another focuses on ethnic origin and a third one links diversity with other corporate metrics. The rankings employed in this OR exercise were performed in 2023 and are listed in Table 2, where the number of companies in the lists is also included.
The aim of the diversity study is to compare how diversity is measured differently in three data sources. One of them is America’s Best Employers for Diversity 2023, by Forbes and Statista R (Peachman, 2023c). This ranking was built upon direct and indirect scoring made by 45,000 employees of companies with more than 1000 employees, as well as objective KPIs involving management and engagement. Another ranking employed is Top 50 Companies For Diversity, collected by Fair360 on companies with more than 750 employees. In this ranking, over 1400 factors categorize companies with respect to leadership accountability, talent programs, human capital metrics, workplace practices, supplier fairness and philanthropy (Gray Miller, 2023a). The third pillar of this study is composed of companies’ D&I scores in Glassdoor, with nearly 50,000 verified companies (Landbase, 2025); this online portal is usually studied when researching employee satisfaction within corporations (Das Swain et al., 2020; Dube & Zhu, 2021). These three sources are among the most employed in diversity research in the US context, referenced in works like Filbeck et al. (2017) and Dobbin and Kalev (2022).
The ethnic origin study contrasts if fair opportunities and diversity are applied to all racial backgrounds equally; in this line, some ethnic minorities may be promoted or cared for while others may be marginalized (Van der Meer & Roosblad, 2004). To study this, four different rankings representing the best companies for Asian American, Black, Latino and Native American/Pacific Islander executives were collected. These rankings belong, as specialty lists, to the aforementioned Top 50 Companies For Diversity of Fair360. The companies are ranked for the hiring, promotion, and retention of each ethnic origin separately, as well as their presence in management levels 1–4 and in the 10% highest-paid employees (Gray Miller, 2023b). These rankings also evaluate the participation of ethnic minorities in mentoring and sponsorship programs, as well as leadership commitment to achieving proportional race representation.
The global study evaluates whether companies best at diversity are also outperforming in other different corporate fields. The research sample is formed by another five studies from Forbes and Statista R. The first one is the aforementioned America’s Best Employers for Diversity 2023, which serves as diversity-as-a-whole reference. The second one, America’s Best Employers for Women, surveyed 40,000 women in companies with more than 1000 employees by using four criteria: direct recommendations in general, direct recommendations in topics specifically related to women, indirect recommendations and diversity among top executives and board (Schwarz, 2023a). The other three rankings are not necessarily related to workplace diversity; in one of them, America’s Best Large Employers, 45,000 employees working for companies with at least 1000 people responded on the willingness to recommend one’s own employer and the willingness to recommend other employers, which was translated into a direct and an indirect score (Schwarz, 2023b). In America’s Best Employers For New Grads, 28,000 U.S. young professionals in companies with at least 1000 employees were again queried on direct and indirect recommendations regarding corporate atmosphere and development, image, working conditions, salary wage, workplace, diversity and likelihood of recommendation (Peachman, 2023a). Finally, America’s Best Employers For Veterans surveyed 8500 U.S. veterans in companies with at least 1000 employees with the same direct–indirect scoring structure; the targeted topics in the surveys were the same as with the new grads, except for the specific veteran topics (Peachman, 2023b).

3.2. Predictive Analytics Pipeline

A total of 10 ML simulations were performed using Python and the Sci-kit Learn library (Pedregosa et al., 2011). The targeted label to be predicted by ML is the diversity index of the Financial Times ranking, di, which represents corporate diversity perceived by employees. Several regression, classification and discretization approaches were examined, of which the best performing are kept in the paper. A classification approach was chosen over a regression approach because of its better performance, as illustrated in works comparing both like Strecht et al. (2015) and Martin-Melero et al. (2025). To obtain a classification problem, di can be discretized using different schemes (S. Garcia et al., 2012). In this work, di was discretized into two different levels: high and low. For this application, binning the categories into two levels is optimal (Carmona et al., 2013) and the 45%/55% proportion between classes is ideal to avoid data imbalance problems, which worsens the performance and fitness of ML models (Luque et al., 2019). The ML models were tested on three different feature sets: one composed of financial ratios (variables ps, er, ee, bt, wk, pr, pf, op, cr, qr, ch, dr, ra, re, es), another one on financial information (variables rv, eb, ni, ca, cs, iv, na, cl, nl, wc, eq) and a last one focused on diversity data (variables pd, wo). The first and second feature sets compare the fits of raw financial data versus financial ratios, which are useful in multiple econometric analyses (Delen et al., 2013; Song et al., 2018) but also do present their limitations (Feroz et al., 2003). The third feature set is employed to compare objective diversity metrics from Eikon to the ML label of diversity as perceived by employees, which might not coincide (Hentschel et al., 2013; Shemla et al., 2016). Indeed, the relationship between both is complex (Cachat-Rosset et al., 2019), and benefits from objective diversity are only maximized when individuals perceive the diversity-favoring climate; if not, the benefits are lost (Cox, 1994).
Several ML algorithms were employed and compared with a dummy random classifier in each one of the three feature sets studied in the ML analysis, resulting in the 3 feature sets × 3 algorithms + 1 dummy classifier = 10 ML cases. The K Nearest Neighbors, Logistic Regression and Decision Tree algorithms are selected for the ML exercises because of their good performance in different fields (Sarker, 2021) as well as in financial applications (Dixon et al., 2020). Three of these are very different construction-wise, which provides a varied pool of prediction models. To ensure a fair comparison between the models, all simulations had the same training/testing division of 80%/20%, which is a widely employed proportion in ML studies (Gholamy et al., 2018). Moreover, a hyperparameter tuning analysis was performed on the training set with a 10-fold cross validation GridSearch; Table 3 shows the modified parameters for each model.
The performance metrics employed are expressed in Equations (1)–(6), and include the accuracy, sensitivity (or recall), specificity, precision, F1-Score and area under curve; they are among the most frequent and employed in classification exercises (Canbek et al., 2017, 2022).
Accuracy ( ACC ) = T P + T N T P + T N + F P + F N
Sensitivity ( SEN ) = T P T P + F N
Specificity ( SPE ) = T N T N + F P
Precision ( PRE ) = T P T P + F P
F 1 - Score ( F 1 S ) = 2 × p r e c i s i o n × s e n s i t i v i t y p r e c i s i o n + s e n s i t i v i t y
Area Under Curve ( AUC ) = 0 1 s e n s i t i v i t y s p e c i f i c i t y 1 ( x ) d x
where
  • T P are the True Positive values, high di predicted as so.
  • T N are the True Negative values, low di predicted as so.
  • F P are the False Positive values, low di predicted as high di.
  • F N are the False Negative values, high di predicted as low di.

3.2.1. K Nearest Neighbors

The K Nearest Neighbors (KNN) algorithm is a non-parametric classifier that predicts the grouping of data points by employing proximity. First defined by Cover and Hart (1967), it is based on majority voting, which consists of predicting labels by studying the most frequently represented ones around the data point. The distance between two datapoints x i and x j is expressed in Equation (7), as defined by Minkowski, where each datapoint x comprises a number of attributes a 1 , a 2 , a 3 , , a n (Cunningham & Delany, 2021; Sun & Huang, 2010).
d ( x i , x j ) = b = 1 n w b × ( a b ( x i ) a b ( x j ) ) p 1 p
where
  • n is the dimensionality of the vector, or number of attributes.
  • a b is the bth attribute.
  • w b is the weight of the bth attribute.
  • p is Minkowski’s order.
Different expressions for the distance can be found when altering p. The value p = 1 represents the Manhattan distance, p = 2 the Euclidean distance and when p , Chebyshev’s distance is obtained; the optimization of p has been extensively researched in the literature (Lubis & Lubis, 2020; Prasatha et al., 2017). Therefore, the smaller d ( x i , x j ) is, the more similar the datapoints are. The label is finally assigned to a test data point by the majority vote of its k nearest neighbors, according to Equation (8).
y ( d i ) = a r g max k x z k N N y ( x z , c k )
where
  • d i is a test data point.
  • x z is a k nearest neighbor to d i .
  • y ( x z , c k ) indicates if x z belongs to class c k .

3.2.2. Logistic Regression

The Logistic Regression (LR) algorithm estimates the maximum likelihood; the model is adjusted by finding the parameter values that maximize the likelihood of making the given observations (Dangeti, 2017). LR applies this principle after transforming the label to predict into a natural odds of occurring (or logit) label, with respect to the features. The original LR algorithm predicts its dichotomous labels by outputting the probability of belonging to one class or the other, which is modeled in Kleinbaum et al. (2002) as Equation (9).
P ( z ) = 1 1 + e ( α + β i X i )
where
  • P ( z ) represents the probability of the label belonging to class z.
  • X i are independent variables.
  • α and β i are unknown constant parameters.
Among its business applications, LR has been employed in performance measurement (Wood, 2006), customer satisfaction data analysis (Lawson & Montgomery, 2006) and business failure prediction (H. Li et al., 2013).

3.2.3. Decision Tree

Decision Trees (DTs) or ensembles based on them are among the most popular and powerful ML classifiers (Fernández-Delgado et al., 2014). These sequential models are expressed as recursive partitions of the instance space. DT essentially consists of a series of internal nodes that split the instance space into two or more subspaces, according to a certain threshold (Rokach & Maimon, 2005). The decision nodes where no more splitting is performed, known as leaves, represent the termination of the tree. The construction of a DT consists of two phases. In the growth phase, the training set is iteratively split until each leaf is associated with a single class or is compliant with a certain criteria. In the pruning phase that follows, the DT is generalized by creating a subtree that prevents overfitting of the training data (Kotsiantis, 2013). The node splitting strategy is key for achieving a well-performing DT; Equations (10) and (11) show Entropy and Gini methods (Safavian & Landgrebe, 1991), where P i is the probability of selecting a data point in class i.
E n t r o p y = i = 1 n P i log 2 P i
G i n i = 1 i = 1 n P i 2

3.3. Prescriptive Analytics Pipeline

To adequately compare the agreement and conflict between the different rankings in the studies, an intersection of them was first performed. Therefore, the companies forming the final dataset for each one of the three studies are those companies present in all the rankings belonging to that specific study. Once this intersection is found, each individual ranking is reorganized to maintain the same order as in the original one, but with different numbers. Companies that were not simultaneously present in all rankings belonging to one specific study were discarded from the final dataset.
Different rank correlation metrics have been proposed in works like Blest (2000) and Borroni (2013). In this paper, Kendall’s τ as proposed by Kendall (1938) is employed, being one of the most robust and widely used ones. Kendall’s τ ranges from −1 to +1, medium correlations are considered higher than 0.3, and high correlations are considered from 0.5 onwards (Kuckartz et al., 2013). The two methods for expressing τ are described as follows. Let ( x 1 , y 1 ) , , ( x n , y n ) be a set of observations of criteria X and Y, so that all x i and y i are unique. Any pair of observations ( x i , y i ) and ( x j , y j ) , where i < j , is said to be concordant if the classification order of ( x i , x j ) and ( y i , y j ) agrees; if they disagree, they are said to be discordant. Kendall’s τ can be defined as in Equation (12), or explicitly as in Equation (13).
τ = 2 ( n c n d ) n ( n 1 )
τ = 2 n ( n 1 ) i < j s i g n ( x i x j ) s i g n ( y i y j )
where
  • n c are the concordant pairs.
  • n d are the discordant pairs.
  • n are the total number of elements.
From a business and mathematics perspective, the study of agreement and conflict in rankings is complex (Gordon, 1979; Ray & Triantaphyllou, 1998). In this paper, the linear ordering problem algorithm is employed for ranking comparison purposes; it has been applied to many different fields, like input–output economic analysis (Grötschel et al., 1984; Mitchell & Borchers, 1996), job scheduling in manufacturing (Ascheuer et al., 1993) and archeological seriation (Glover et al., 1974). One of its many applications includes the rankability of data (P. Anderson et al., 2019; Cameron et al., 2021).
The Linear Ordering Problem (LOP), first described by Chenery and Watanabe (1958), is an extensively researched combinatorial optimization problem of NP-hard nature (Baioletti et al., 2018; Santucci, 2021), reflecting the difficulty in solving instances up to optimality. More recently, it has been studied in ML (Mishra & Singh, 2022; Santucci et al., 2020) and textual translations (Kondo et al., 2011; Tromble & Eisner, 2009). This problem seeks to find a permutation of rows and columns that maximizes the sum of the superdiagonal in a squared non-negative matrix.
In the context of this paper, this squared matrix was built by evaluating the relative order between the elements in the different criteria belonging to a ranking. Therefore, the diversity, ethnic origin and general studies are converted into three, four and five criteria rankings, respectively. The LOP can be formulated according to Martí and Reinelt (2011). A preference matrix D is composed of d i j elements that represent the number of times one element i V outperforms another element j V throughout all criteria in a given ranking. The goal of the LOP is to obtain an order that represents the best agreement among the different criteria of the ranking, modeled as the mixed-integer linear problem expressed in Expressions (14)–(17).
max i V j V : j i d i j z i j
s . t . z i j + z j i = 1 i , j V : i < j
z i j + z j k + z k i 2 i , j , k V : i j , j k , i k
z i j { 0 , 1 } i , j V : i j
where z i j is a binary variable that equals 1 when element i is ranked before element j and 0 otherwise. Expression (14) represents the objective function to maximize, which is the sum of the upper diagonal values in the ranked preference matrix. Expression (15) states that element i goes before element j or element j goes before element i, but not both simultaneously. Expression (16) disables forbidden cases; if i goes before j and j goes before k, then k cannot go before i. Lastly, Expression (17) constrains variable z i j to the binary domain.

4. Diversity Prediction with Machine Learning

4.1. Descriptive Statistics of the Data

The most relevant statistics of the ML dataset and the companies included in this study are described in Table A1 and Table A2, respectively, in Appendix A. Table A3, also in Appendix A, shows the sectors and countries of origin of the 350 companies in the ML study. The 25 sectors and 17 countries these companies belong to reflect that the final dataset remains representative of European corporate diversity. Figure 1 represents the histograms of pd and wo, which confirm the diversity feature set and represent Eikon’s percentage of female employees and whether the company has a policy to drive diversity and equal opportunity or not. As shown, pd ranges from 53 to 58 and is relatively balanced: all values represent around 10–20% of the total. The share of women in companies, wo, is distributed along a wider range. More than half of the companies studied have a proportion of women employees higher than 50%; in addition, the majority of the companies have 30–80% female employees.
The histograms of the second feature set, raw financial data, are represented in Figure 2. These values present a larger dispersion, as the range typically starts from 0 and reaches 10 7 or 10 8 . The reason for this is the presence of both large multinational companies (like Dell or Microsoft) and other smaller regional ones (such as Polish Allegro or French Konica Minolta) in the same dataset; practically any metric can be orders of magnitude apart. The patterns observed in these graphs are similar: an accumulation of values towards the left, which represent approximately 10–20% of each variable. The most extreme case is iv, the inventory, which has up to 30% of its data below 10 6 . As for the tails in the right half of the graphs, these parts account for less than ∼10% in all variables. The use of financial ratios as a feature set was partly to avoid this large data dispersion. Even though the raw financial data might be orders of magnitude apart, these differences are considerably reduced when taking ratios. The descriptive histograms of the financial ratios in Figure 3 show more balanced distributions. The ranges are also shorter: while ps, er, ee and es are between 1 and 30 in the widest range (ee), the rest of the variables are contained between −1 and 3, at most. These histograms can be divided into two types. In some cases, there is a clear accumulation of values around several adjacent bars: ps and er approximately lie around 0 and 1, re around 0 and 0.2, and es around 0 and 2. In the second kind of histograms, the distribution is more irregular: for instance, dr has a consistent presence of variables between 0.4 and 0.8, without a clear peak. Similarly, ee and bt do not show a clear peak but rather a constant and consistent distribution of points around two values. The pr variable represents a special case, as it presents two clear peaks: one in 0 and the other one around 0.4. The violinplot and pie chart in Figure 4 show the distributions of the label, di. In its continuous form, it resembles a normal distribution positively skewed; in its discrete form, the “High” and “Low” di labels form a practically balanced dataset.
The Pearson correlation values for the dataset are listed in Table 4. All feature sets present low correlation (less than 30%) with label di; it is surprising that the diversity features are among the least correlated with di (less than 10%). Financial ratios exhibit moderate–low correlations among themselves (except pairs er-ps and cr-qr), while raw financial variables are highly correlated (60–90% for most cases). However, these two groups are not correlated, nor with the diversity features.

4.2. Performance of the Simulations

The results of the ML analysis of the random dummy model and the algorithms applied to the three different feature sets are presented in Table 5. The loss in performance of the ML models between the fitted training sets and the predicted testing sets is approximately 5–15%, which are reasonable values that demonstrate the generalization capabilities of ML and the agreement of the models with the data.
The three feature sets have at least one algorithm that outperforms the dummy random guess in terms of ACC and F1S for both the train and test sets, proving that diversity and financial information have at least some predictive power for corporate diversity perception by employees. In principle, this relationship between the feature sets and the perceived diversity is not strong at all; the accuracy metrics do not surpass 70%, which shows a fair (and not good, or excellent) agreement of the models with the data (Cabitza et al., 2020). In general, the three feature sets have high SPEs and low SENs, manifesting a slight overfit towards predicting labels as the majority class, low d i values. Interestingly, the diversity and financial raw data feature sets share similar metrics: ACCs of approximately 40–55% and high and low PREs, SENs and F1Ss, depending on the ML algorithm. In contrast, the ratios feature set is the best performing of all, achieving consistent ACCs around 60%, PREs of 60–75% and SENs and F1Ss of 30–55%, in the testing set. In a way, this improved performance might be due to the fact that financial ratios correct the effect of the firm size with respect to raw financial data, as they are in relative form (Lev & Sunder, 1979). Similarly, financial ratios are more predictive of the label di than the group of objective diversity features pd-wo, implying how diversity perception may be affected by the firm performance (Allen et al., 2007; Jauhari & Singh, 2013).
As for the ML algorithms, some common patterns were observed across the three feature sets. KNN and LR behave similarly in the training sets; however, the LR is responsible for two exceptionally low SENs (<10%) in the testing sets. In contrast, DT is responsible for the best metrics in the training sets, with ACCs of 60–69% and outperforming the other two by 2–3%. It is also the best in the testing sets, with 66% of ACC and 75% PRE using financial ratio features. These insights are further supported by the Receiver-Operating Characteristic (ROC) curves of the training and testing sets as illustrated in Figure 5 and Figure 6, respectively, as well as their AUC values included in Table 5. For the training set, all feature sets and learning models have higher AUCs than the random dummy, and practically all of them (except for LR with diversity data) are higher than 0.6, considered fair classification by works like Shatnawi et al. (2010). The KNN for financial ratios obtained an AUC of 0.722, which is considered acceptable. However, the ROCs and AUCs on testing sets significantly worsened: the only feature set where all learning models successfully outperform the AUC of the random dummy is the financial ratios. With this feature set, a maximum AUC of 0.631 was achieved with DT, which is once again regarded as fair, and not a strong fit. This further proves the moderate fit of the models with the data, limited by size of the dataset and perhaps the relevant variables of study.
Therefore, this ML exercise suggests a fair relationship between financial performance and corporate diversity, belonging to the literature that defends a mildly positive economic case for diversity initiatives in companies: increased profitability, innovation and talent retention (Latukha et al., 2022). Managers can leverage these insights in recruitment and hiring practices, as diversity appears to be positive from the financial side, in line with Koseoglu et al. (2025). The predictive power of ML also supports the allocation of resources to improve diversity, as well as identifying low diversity scores or non-compliance in diversity as potential corporate risks (Jane Lenard et al., 2014). Management may also employ the conclusions from ML to communicate and convince stakeholders and customers about the importance of corporate diversity. In stakeholder theory, this fair agreement between perceived diversity and financial performance enhances decision-making and fairness among stakeholders: the former among managers and the latter among the hired employees, product of the diversity policies. However, at the same time, unexplainable ML models can also erode transparency and trust in stakeholder relationships, making the inclusion of feedback from stakeholders essential when deploying ML.

5. Diversity Rankings with Operations Research

5.1. Intersection of the Rankings

Intersecting the different targeted rankings and converting them into criteria for the final rankings representing the diversity, ethnic origin and global studies is the first step in the OR pipeline. Table 6 shows the number of companies in common among the different rankings of each study.
For the diversity study, the limitation is marked by Fa_D: this original ranking only contained 50 companies, of which 35 also appeared in the top 500 classified by Fo_D. The Gl_D scores were not a limitation in this case, as they were extracted based on the already intersecting 35. As the rankings of the ethnic origin study were all collected by Fair360, the consistency in methodology enables a nearly full intersection of the companies between the rankings. For instance, top companies in Fa_N also appear in Fa_L; similarly, the totality of companies in Fa_A are present in Fa_L and Fa_N too. In contrast, the intersections in the rankings forming the global study are much more heterogeneous, even though they all come from the same source and a similar methodology. Part of this is due to the different number of companies in each ranking; for example, intersections of the rankings with Fo_V might seem low, but Fo_V ranks only 150 companies, so in most cases the resulting intersection already comprehends 30–50% of the size of Fo_V. On the other hand, this study probes the relationship of diversity with other different corporate themes, hence allowing a larger disparity at the intersections.
In any case, the final sample of companies studied in this OR part is included in Table A7 of the Appendix A, including their State in the US where they are headquartered, the industry they belong to, the abbreviation employed in further tables to refer to them and in which of the three studies they appear. Likewise, Appendix A also contains the preprocessing steps of collecting the intersected companies in each study and reorganizing their numeration, to create the new criteria that conform the three targeted studies. These tables are divided into two parts: the positions of the companies in the original rankings, and their numeration reorganized for the ranking comparisons. Table A4 includes these steps for the diversity study, Table A5 for the ethnic origin study and Table A6 for the global study. In addition, for summarizing purposes, the main body of the paper includes the results of these preprocessing steps. The intersected rankings to be inputted into the OR model are present in Table 7, which uses the companies abbreviations as defined in Appendix A.

5.2. Descriptive Statistics of Rankings

The τ correlation values for the three prescriptive analytics studies are listed in Table 8. The τ correlations were low in the diversity study, as they did not exceed 30%. The correlation Fa_D-Fo_D is very low; Fa_D and Fo_D have both higher correlations with Gl_D but are still low. One relevant reason behind ranking discrepancies are differences in methodology: criteria may be weighed (or even considered) differently. For instance, Shehatta and Mahmood (2016) found moderate–high correlations in six university rankings, while Schütte et al. (2018) found transparency discrepancies between three healthcare system rankings. Ranking methodologies are especially relevant in diversity climate, highly complex and subjective, and some rankings only explore superficial aspects of inclusion with the potential to diversity-wash (Tayar, 2017).
The ethnic origin study presented much higher correlations, reaching around 50–80% as τ maximum values. In contrast to the diversity study, the four original ethnicity rankings from Fair360 employ a same methodology and rank approximately the same set of companies, ensuring higher correlations and explainability among them. However, interestingly, equal opportunities in companies are not the same among all ethnic origins. While top companies for Asian Americans are highly correlated with top companies of Latinos and Native American/Pacific Islanders, the correlation with top companies for Blacks is very low. Similarly, top companies for Blacks have a moderate–low correlation with top companies for Latinos and Native American/Pacific Islanders. In this line, there is a clear difference in correlation with regard to the top companies for Blacks, consistent with Collins (1993) and Bermiss et al. (2024). The global study has mixed results, correlation-wise. The lowest correlations of rankings were found for Fo_D, which was not higher than 20%. In fact, the correlation between Fo_D and Fo_V is slightly negative, meaning that there is a weak pattern between the worst-performing companies in terms of diversity and the best ones for veterans, and vice versa. The best companies for women, Fo_W, is moderately correlated with the best large companies and the best companies for graduates, as in Terjesen et al. (2007), and the Fo_G-Fo_L pair is also fairly correlated, consistent with Murphy and Collins (2015). The correlation between Fo_V and the rest of the rankings (except the aforementioned Fo_D) is moderate to low, ranging around 20–30%, which is consistent with Ainspan and Saboe (2021).
The descriptive statistics of the ranking intersections found previously can be illustrated using radar plots, which are useful tools for visually comparing multicriteria ranked elements. Figure 7, Figure 8 and Figure 9 represent the radar plots for the diversity, ethnic origin and global studies, respectively. Each line corresponds to a different company: the smaller the shape traced by a firm, the better the firm has resulted across the different criteria.
These plots are already somewhat indicative toward building the global orders with the LOP, as well as finding the individual correlation between ranking criteria. In Figure 7, the smallest triangle is obtained by MSCD, whereas the largest and least favourable one corresponds to WALG. Figure 8 is perhaps clearer due to the lower number of companies in the ethnic origin intersections: the best polygon appears to be TOYO, while the largest one seems to be BOEG. As for Figure 9, there is not a clear best and worst firm, but rather the positions fluctuate significantly with the criteria.

5.3. Comparison of Rankings

The optimal orders of the companies obtained using the LOP algorithm are listed in Table 9. Each study includes three different optimal solutions that have the same LOP optimum value. Indeed, LOP problems can have multiple optimals, adding a fairness component (P. E. Anderson et al., 2022). Table 10 shows the τ correlation values of the solutions: all of which are higher than 90%, which demonstrates that the obtained LOP optimal solutions are very similar to each other (Okoye & Hosseini, 2024). Therefore, an effort to normalize and standardize the criteria of several rankings related to diversity into one and obtain a general classification of companies is possible.
The technology and financial sectors rank among the top: MSCD and GOGL have been extensively praised for their diversity programs and studied in works such as Allen and Montgomery (2001) and Ly-Le (2022). In contrast, cutting on diversity and inclusion favors bad positions in aggregated rankings, as seen with BOEG in Dungan (2024). In addition, Table 11 describes the τ correlation values of the optimal solutions with respect to the original rankings. The LOP solutions were fairly to highly correlated with the original rankings in the three studies, with all τ values greater than 30%, except for Fo_D in the global study. This further justifies the usefulness of rank aggregation, as the obtained solutions maintain the similarity with the original rankings while providing a combination of them, which were not always highly correlated by separation.
This OR exercise verifies the existence of a moderate–small intersection or general agreement between rankings built using different methodologies, and proves their adequacy. Ranking aggregation is valuable, even with moderate–small intersections, as it provides a simplification of decision making for stakeholders lacking the time, knowledge or resources to analyze multiple standalone rankings. Aggregated rankings also improve comparability among factors that are difficult to contrast individually, reduce bias towards inconsistent methodologies and allow for the identification of consistent top performer organizations. However, rank aggregation may also present some drawbacks, one of which is the increased computational capacity it requires. Handling incomplete rankings or those with numerous ties, like Newsweek America’s Greatest Workplaces for Inclusion & Diversity, is also problematic (Newsweek & Group, 2025). Rank aggregation might also oversimplify problems by losing information, as well as not fully satisfy any of the individual rankings.
In stakeholder theory, aggregating ranks involves enhanced understanding and fairer representation of diverse stakeholders interests, which increases engagement and induces greater trust and legitimacy. As for corporate diversity, it encompasses several dimensions and encourages managers to take a more integrated and holistic approach (Cachat-Rosset et al., 2019). Strong positions in aggregated diversity rankings can increase a company’s reputation and capacity to attract top talent and socially responsible consumers, while enabling managers to identify them as leaders in diversity initiatives and as an example to follow (Kamal & Ferdousi, 2009). The use of several diversity rankings also drives public image and accountability (Espeland & Sauder, 2008) and allows more tailored communication to internal and external stakeholders (Beckert & Koch, 2025).

6. Conclusions

In this paper, corporate diversity, its perception by employees and its link to financial performance have been analyzed by employing analytics tools to build models. While prescriptive OR enables the formulation of mathematical optimization models, predictive ML is employed to obtain labels from a sample of features.

6.1. Summary of Machine Learning and Operations Research

Specifically, the ML analysis probed the prediction of the relationship between financial performance and corporate diversity perception based on several feature sets. The three feature sets consisted of 11 raw financial variables, 15 financial ratios and 2 diversity features. The raw financial data feature set showed moderate levels of correlation with itself, but the rest of the feature sets were not correlated, nor with the label. All successfully outperformed a dummy model based on random guess for a dataset of 350 European companies, indicating their predictive capabilities on the labels. The best models were based on financial ratios rather than diversity features, and the best ML algorithms was the DT. The DT applied on financial ratios achieved an accuracy of ∼66%, precision of 75% and F1 Score of ∼56% in the test set. This is considered reasonable, yet not a good or excellent fit, implying that other variables not considered in this study also play an important role in predicting corporate diversity.
The comparison of diversity in rankings and scoring was analyzed throughout three OR exercises. In one of them, diversity rankings are collected from different sources. In another, one common source that compares the best companies for four ethnic origins is analyzed. In the third exercise, diversity is compared with other criteria to rank top companies, such as suitability for graduates or veterans. In the three cases, the intersections between the rankings were fairly small, partly due to the different size of each individual ranking. The correlations between the rankings when analyzing diversity was moderate–low, mainly caused by differences in methodology when classifying companies. The ethnic origin study presented higher correlations, but the distribution of equal opportunities was not entirely the same: the top companies among Asian Americans, Native American/Pacific Islanders and Latinos were similar between them and different from those top companies for Blacks. The heterogeneity of rankings in the global study signified moderate–low correlations; this time, the diversity ranking was the least correlated with the others. In any case, the LOP formulation enabled the extraction of several aggregated classifying orders with the best agreement between the individual rankings; for all studies, these optimal orders were similar to each other, proving how different rankings can be homogenized and normalized by employing OR.

6.2. Implications and Impact of Research

These results highlight the relevance of adopting data-driven decisions, which has several implications. In terms of policy, these points confirm the need for regulatory standards for consistent, transparent and standardized diversity reporting. Diversity has been effectively proven as a relevant Environmental/Social/Governance metric that can incentivize investment to adopt more inclusive practices (Hunt et al., 2015). Governments might also consider introducing legislation in the form of incentives or penalties for compliance with corporate diversity standards, or perhaps industry benchmarks (Primec & Belak, 2022). These benchmarks may be implemented at an international level, allowing organizations, companies and public administrations to align their policies with best practices. In addition, procurement strategies in the public and private sectors may prioritize partnerships with companies that excel in diversity.
Moreover, promoting diversity as suggested by the ML and OR sections, supports broader international goals, like Sustainable Development Goals (SDG) 5 and 8 (Gupta & Vegelin, 2016). As proven in M. F. Garcia et al. (2025), managers must strategize about diversity and work-life balance because it mediates the relationship between SDG 5 and social performance; in Singha (2022), it is deemed as essential to develop a powerful and talent-attractive industry, in the case of banking in India. As for SDG 8, Šilenskytė and Rašković (2024) defends its embedding in business school education to promote a more equitable, just, and inclusive economy. From the policy perspective, Asif et al. (2023) recommends the implementation of ISO 3700-2021 (ISO, 2021) to boost firm sustainability and promote a circular economy. As a contribution to these works, our paper supports that companies actively employing advanced analytics in corporate diversity could support in the achievement of these goals. In summary, this paper impacts the following areas.
  • There is a relationship between financial variables and diversity scores, and ML can be used to predict the diversity scores from corporate financial variables, in line with Koseoglu et al. (2025). The performance of ML is fair, suggesting the complexity of the diversity climate. This encourages managers to adopt diversity initiatives and is useful for ESG investors to consider investments in diversity-compliant companies (O. Lee et al., 2022).
  • Generally, methodologies are different across diversity rankings, producing moderate–low correlations between them; as in Tayar (2017), some of these rankings may only focus on superficial aspects of diversity. When comparing ethnic origin, best companies for Blacks are lowly correlated with best companies for Asian American, Latinos and Native American/Pacific Islanders, especially applicable in the current political scenario (Rice et al., 2025). When comparing diversity to other metrics, the best companies for women are moderately correlated with the best large companies and the best companies for new graduates, in line with LLC (2023) and Dennison (2025). Our results catch the rise in women’s rights awareness that new graduates and Generation Z are creating in the workplace, as detected also in Global (2025).
  • The discrepancies between diversity ranking methodologies can be smoothed with rank aggregation tools like the LOP. Treating the different diversity rankings proves how unstable and multifaceted the diversity climate is (Cachat-Rosset et al., 2019). For this paper, LOP solutions were moderately correlated with rankings that were weakly correlated between them, thus providing the LOP a simplified, balanced summary of them. Even though this may imply losing information and perhaps oversimplifying the comparison, it is useful for faster decision-making for managers and ESG investors, as well as for simpler communication of a company’s position across different rankings to both employees and shareholders or investors.

6.3. Limitations and Future Research Directions

This work pioneers the combination of ML and OR to study corporate diversity rankings and the methodology employed in them, saving time in accounting documentation revision and increasing transparency in this scoring process. Nevertheless, limitations in both parts justify the need for further research.
In the ML analysis, the discretization of the problem for classification and the relatively small training set are limitations that could have been potential sources of error and loss of relevant information. The dataset of companies was small due to financial data availability issues and Eikon only providing eight diversity scores; in many companies most of these scores were empty or null, so they were discarded. Employing fewer features from financial raw data and ratios effectively enlarges the possible dataset size, and including environmental or governance scores (more numerous in Eikon) jointly with diversity scores would improve the diversity feature set. Non-financial data, like texts in employee reviews or surveys on workplace environment, could potentially be employed and require more sophisticated ML models capable of handling multi-modal data. As for the ML models, other algorithms could be studied, as well as feature importance via SHAP values to uncover which variables are the most relevant.
As for the OR part, the incorporation of more rankings would enrich the problem, involving other forms of diversity such as linguistic, religious or neurodiversity. A more exhaustive analysis of the complete set of optimal solutions would also be relevant, as well as different rank aggregation optimization problems (or weighing schemes) and solving rank aggregation with ties. Examining different industries or comparing findings across different geographic regions could yield valuable information.
For both ML and OR parts, a longitudinal comparative analysis of how diversity climate and rankings vary over time would offer dynamic insights, which is especially relevant after certain political, economic or social events happen. This would require more data gathering of the same rankings, financial variables and diversity scores from previous years, enabling a two-fold comparison: the snapshot of each specific year, and how it evolved through time, with the current paper being the analysis of the snapshot for the years 2023-2024. The differences in cultural traits among nationalities, more linkable to Hofstede’s cultural theories, would also be interesting to analyze with ML and OR: perhaps ranking correlations or diversity predictability variations depending on the specific national context and their relationship with Hofstede’s six dimensions.

Author Contributions

Conceptualization, I.M.-M. and F.H.-P.; methodology, I.M.-M., R.G.-M. and M.L.M.-G.; software, I.M.-M.; validation, I.M.-M. and F.H.-P.; formal analysis, I.M.-M.; investigation, I.M.-M.; resources, F.H.-P., R.G.-M. and M.L.M.-G.; data curation, I.M.-M.; writing—original draft preparation, I.M.-M.; writing—review and editing, F.H.-P., R.G.-M. and M.L.M.-G.; visualization, I.M.-M.; supervision, F.H.-P., R.G.-M. and M.L.M.-G.; project administration, F.H.-P., R.G.-M. and M.L.M.-G.; funding acquisition, F.H.-P., R.G.-M. and M.L.M.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data that support the findings of this study are available from the corresponding author, upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest in the writing and submission of this paper.

Appendix A. Additional Tables

Table A1. Descriptive statistics of variables in machine learning exercises.
Table A1. Descriptive statistics of variables in machine learning exercises.
MeanStd.Min.Q1 (25%)Q2 (50%)Q3 (75%)Max.
ps2.433.020.000.571.383.2520.51
er2.733.170.020.771.673.4922.24
ee13.2119.56−153.996.1610.3917.90238.68
bt0.990.47−0.080.650.991.272.96
wk0.140.36−0.93−0.070.080.311.83
pr0.711.780.000.190.420.6721.14
pf0.080.12−0.550.030.060.131.07
op0.120.16−1.460.050.100.171.21
cr1.461.200.190.991.211.5615.46
qr1.111.070.170.700.941.2115.46
ch0.510.770.010.190.330.538.55
dr0.630.190.110.500.630.741.40
ra0.050.07−0.520.020.050.080.45
re0.120.92−9.250.060.120.2010.40
es8.37 × 10 3 1.56 × 10 5 −38.960.541.835.102.93 × 10 6
rv3.05 × 10 7 5.37 × 10 7 1.59 × 10 0 4.97 × 10 6 1.32 × 10 7 3.10 × 10 7 5.29 × 10 8
eb6.41 × 10 6 1.35 × 10 7 −6.25 × 10 6 6.66 × 10 5 1.88 × 10 6 6.44 × 10 6 1.19 × 10 8
ni2.93 × 10 6 8.07 × 10 6 −6.44 × 10 6 1.89 × 10 5 7.31 × 10 5 2.71 × 10 6 8.92 × 10 7
ca1.66 × 10 7 2.87 × 10 7 4.07 × 10 4 2.57 × 10 6 6.69 × 10 6 1.87 × 10 7 2.24 × 10 8
cs5.64 × 10 6 1.22 × 10 7 6.70 × 10 3 5.10 × 10 5 1.80 × 10 6 5.73 × 10 6 1.02 × 10 8
iv3.26 × 10 6 6.61 × 10 6 0.00 × 10 0 1.25 × 10 5 1.18 × 10 6 3.72 × 10 6 7.34 × 10 7
na3.13 × 10 7 5.37 × 10 7 2.90 × 10 4 3.51 × 10 6 9.92 × 10 6 3.42 × 10 7 3.41 × 10 8
cl1.36 × 10 7 2.32 × 10 7 7.36 × 10 3 2.01 × 10 6 5.10 × 10 6 1.52 × 10 7 1.83 × 10 8
nl1.66 × 10 7 2.95 × 10 7 4.11 × 10 3 1.63 × 10 6 5.05 × 10 6 1.66 × 10 7 2.18 × 10 8
wc2.97 × 10 6 9.52 × 10 6 −2.15 × 10 7 −1.33 × 10 4 8.50 × 10 5 3.01 × 10 6 8.25 × 10 7
eq1.77 × 10 7 3.45 × 10 7 −1.58 × 10 7 1.95 × 10 6 5.43 × 10 6 1.83 × 10 7 2.61 × 10 8
pd55.041.6752.0054.0055.0056.0061.00
wo54.8122.881.0038.0056.0072.0099.00
di7.480.266.947.297.477.658.45
Table A2. Companies studied in machine learning exercises.
Table A2. Companies studied in machine learning exercises.
InfineonAmazonRoyal BAM GroupIberiaAdecco GroupRedrow
AllegroAsicsVodafoneCBREStora EnsoBombardier Group
Cd ProjektAir France-KLM GroupAir ProductsJohn DeereTF1 GroupAlcoa
Hermès3MGSK (GlaxoSmithKline)BASFMondadoriA2A
Keysight TechnologiesDell TechnologiesBoschNestléRentokil InitialSAS
Prada GroupExpediaKnorr-BremseSamsungTeva PharmaceuticalsNovo Nordisk
Merit MedicalAalberts Surface TreatmentIntelMcDonald’sAsusENGIE
Agilent TechnologiesEssityLenzingFortumMitsubishi ElectricThe Swatch Group
SalesforceFujitsuProcter & GambleSchindlerFraportKonica Minolta
GoogleAccentureKingfisherDaimlerWSPSodexo
Hyatt Hotels CorporationEatonMondiShellRenaultVoestalpine
PayPalEpirocPKN OrlenSanofiHoneywellKONE
AbbVieOrangeTexas InstrumentsOMVMotorola SolutionsSaab Group
MicrosoftCognizantConcentrixBlackRockTAURONWestern Union
Hugo BossSevern TrentSonyTelenorElisaIpsos
alightGivaudanSikaThales GroupMitchells & ButlersMetso
SartoriusPEABScandic HotelsEvonik IndustriesBaxterMorgan Sindall
EnBWJacobs EngineeringSainsbury’sThe Coca-Cola CompanySkanskaLassila & Tikanoja
OrklaMelia Hotels InternationalInditexAuto TraderadessoEstée Lauder
CumminsHapag-LloydAirbusABBBridgestoneNokia
RexeladidasSymriseSchibstedInfosysGrupo Acciona
IBMHusqvarnaRheinmetallEnel GroupSwisscomKSB
RocheNovartisDiageoAonPublicisMagna
AGCOPolsat BoxComputacenterAhold DelhaizeCanonDXC Technology
Boston ScientificHeidelberg Cement GroupAlfa LavalYitHondaArcelorMittal
LillyDeutsche TelekomPhilipsTokmanniKoninklijke KPNSSAB
Arrow ElectronicsWickesBMWJabilPrysmian GroupZeiss
AstraZenecaEricssonSolvayKeringPlastic OmniumStrabag
CiscoL’OréalAvery DennisonUberUnited InternetValeo
ChevronBeiersdorfHitachiHyundai Motor CompanyArkemamastercard
Louis VuittonRELX GroupUnileverSandvikWorldlineSpotify
WhitbreadPandoraOcado GroupBremboCostcoknowit
BookingHenkelGoodyear DunlopPepsicoCloettaSmith & Nephew
SAPMarriott InternationalBD (Becton, Dickinson and Co.)Grupa AzotyFirstGroupDeutsche Post
Volvo Car GroupWillis Towers WatsonTaylor WimpeyColruyt GroupBureau VeritasNetflix
Fnac DartyEssilorLuxotticaBrenntagGreggsGroupe CarrefourSaint-Gobain
MerckPets at Home GroupTelia CompanyMTU Aero EnginesNexityLeonardo
United UtilitiesTietoEVRYAmadeusRepsolOVSGreencore Group
AppleDalata Hotel GroupRB (Reckitt Benckiser)STMicroelectronicsSKF GroupFoot Locker
FacebookBPTakedaAccorHotelsSchaeffler-GruppeOlympus
DowTeslaLufthansaOracleCNH Industrialamplifon
BudimexDuPontPORRBAE SystemsEniFedEx
Hewlett Packard EnterprisePhilip Morris InternationalMarks & SpencerDanoneAlconITV
PfizerLogistaH&M Hennes & MauritzBouyguesNeucaBoeing
Wolters KluwerCarnivaleBayElectroluxValmetLevi Strauss & Co
EasyjetAbbottBT GroupTUIThe Walt Disney CompanyAgfa-Gevaert
NTT DataNissan Motor CorporationAxfoodLonzaAubayXerox
Akamai TechnologiesNikeTelefónica O2SiemensBalfour Beatty
MedtronicPageGroupNktBayerDS Smith
Moody’s CorporationDunelmAdva Optical NetworkingAECOMANDRITZ
SageRWEAtlas CopcoIberdrolaEtteplan
Logitech InternationalMicrochipGeneral ElectricGXO LogisticsCaterpillar
FerrariZalandoNational GridBioNTechBayWa
Johnson & JohnsonSavencia Fromage & DairyAir LiquideCentricaAlstom
SkyColgate-PalmoliveAbout YouThermo Fisher ScientificMTR Corporation
AdobeTescoCapgeminiLegrandArcadis
PUMAFord Motor CompanyRalph LaurenCGIEurofins
Icon PlcSafran GroupStarbucksBroadcomTrelleborgTCS (Tata Consultancy Services)
Hilton Hotels & ResortsAT&TRolls-RoyceHeinzBabcock InternationalUPS (United Parcel Service)
InterContinental Hotels GroupSchneider ElectricCarlsbergTeleperformanceHeinekenSmurfit Kappa
Table A3. Sectors and countries of origin of companies studied in machine learning exercises.
Table A3. Sectors and countries of origin of companies studied in machine learning exercises.
Sector/CountryAustriaBelgiumDenmarkFinlandFranceGermanyIrelandIsraelItalyLuxembourgNetherlandsNorwayPolandSpainSwedenSwitzerlandUnited KingdomTotal
Aerospace, Defence, Manufacture of Transport Equipment 32 1 1 11615
Automotive (Producers and Suppliers) 1 38 2 1 11421
Banking and Financial Services11 1 1 26
Business Services and Supplies 13 149
Clothing and Accessories, Sports Equipment (Manufacturing and Retail) 11 35 2 3 12 18
Construction2 1 1 112 311
Consulting and Accounting 2 2 26
Drugs and Biotechnology 1 1611 1 5622
Engineering, Manufacturing11 21 31413
Food, Soft Beverages, Alcohol and Tobacco 1 2 1 11 13515
Health Care Equipment and Services 231 1 2110
Insurance 11
IT, Internet, Software and Services 1 2375 1 11241138
Manufacture and Processing of Materials, Metals and Paper3 1121 111 13 116
Media and Advertising 3 1 211 412
Oil and Gas Operations, Mining, and Chemicals11 124 1 1 2112421
Packaged Goods 22 1319
Restaurants 44
Retail 1 12 11 111 1019
Semiconductors, Electronics, Electrical Engineering, Hardware 112102 1 2 132429
Telecommunications Services, Cable Supplier 121 11 111312
Transportation and Logistics 1 34 1 312
Travel and Leisure 112 2 11 614
Utilities 113 2 11 413
Wholesale 11 11 4
Total884124459131124203811222893350
Table A4. Positions of companies in the diversity study.
Table A4. Positions of companies in the diversity study.
CompaniesPlace, Original RankingPlace, Ranking Intersection
Fa_DFo_DGl_DFa_DFo_DGl_D
MSCD1181151
MEDT21431121611
HERS32662532325
TOYO44091343113
LILL540085308
KPMG62621962219
DOWW738747274
TIAA837827
HUMN106159315
BOEG1247528103528
CNBC138331111031
CIGN1410622121222
ABBV16272013720
WALM1739933142933
RAND1918526151826
TDBK20231613
KYBK2211018171318
SOUC2421116181916
ECOL2512627191527
NOGR2711912201412
CAPO28220921219
SAFI2946414223314
ALLY31242423624
GEMO3321810242010
TARG34652125921
CENT3718223261723
COPA394262786
UNAI41334528255
P&GG431222942
AMFI4539632302832
WALG4647335313435
ALIC4742830323230
HOND4837934332634
BEBU499417341117
WYNH5029729352429
Total in ranking50500-353535
Table A5. Positions of companies in the ethnic origin study.
Table A5. Positions of companies in the ethnic origin study.
CompaniesPlace, Original RankingPlace, Ranking Intersection
Fa_AFa_BFa_LFa_NFa_AFa_BFa_LFa_N
TOYO22441144
SHIE310332733
MEDT4191131011
HERS59264626
LILL66555355
HILT82812136151011
EYYY918827972
ADPP102191081188
BOEG122622219131515
DOWW13251011101299
CNBC1437711267
ABBT1527191912141414
KPMG171214141381212
HUMN18815171451313
CIGN19713121541110
Total in ranking1928242315151515
Table A6. Positions of companies in the global study.
Table A6. Positions of companies in the global study.
CompaniesPosition in Original RankingPosition in Intersection Of Rankings
Fo_DFo_WFo_GFo_LFo_VFo_DFo_WFo_GFo_LFo_V
PROG1172305949111221816
INTL118210855111225141531
P&GG12547610343322112314
ALLY24191713764426624
ACCT2610417323550531163317
PNCF3539613920786628383026
SALF3641521729716978
MATT461742495775836231723
CISC513738375991281219
UNHE552861771951271032332934
APPL571254636911114181127
GOGL5933115122142
JPMO613051962091361334363136
PFIZ66212501521151415312632
AMEX741361832137158191037
NASA8430218381617811
DELL931678377541723212118
NIKE100211551781311817302835
DELA10313101223196557
MICR12748583320310310
MEDT143299292258482138343515
FIDI15568712225225
SONY17619261401022319271330
CSCH231152135662247201620
IBMM2451181911382259172525
ADDS259204229169932635292728
TEXI264751452681492729133638
3MMM2682813411297282662429
LOCM35398954610292415144
ORCL35439373300653021373721
BMWG357179406631311325199
HOND3792211492521223230323433
FORD385201191222403333283213
HEBB4331127593410413
COST469758257135183922
BOEG4751761388419362724226
SOAI4796267704372012201
CACI487303285311393837353812
Total ranking5004003005001503838383838
Table A7. Companies studied in operations research exercises.
Table A7. Companies studied in operations research exercises.
CompanyStateIndustryAbbrv.Diversity StudyEthnic Origin StudyGeneral Study
ProgressiveOhioInsurancePROG X
IntelCaliforniaSemiconductors, Electronics, Hardware & EquipmentINTL X
AccentureNew YorkProfessional ServicesACCT X
PNC Financial ServicesPennsylvaniaBanking and Financial ServicesPNCF X
Salesforce.com_CaliforniaIT, Internet, Software & ServicesSALF X
Marriott InternationalMarylandTravel & LeisureMATT X
Cisco SystemsCaliforniaIT, Internet, Software & ServicesCISC X
UnitedHealth GroupMinnesotaInsuranceUNHE X
AppleCaliforniaSemiconductors, Electronics, Hardware & EquipmentAPPL X
GoogleCaliforniaIT, Internet, Software & ServicesGOGL X
JPMorgan ChaseNew YorkBanking and Financial ServicesJPMO X
PfizerNew YorkDrugs & BiotechnologyPFIZ X
American ExpressNew YorkBanking and Financial ServicesAMEX X
NASADistrict of ColumbiaAerospace & DefenseNASA X
Dell TechnologiesTexasSemiconductors, Electronics, Hardware & EquipmentDELL X
NikeOregonClothing, Shoes, Sports EquipmentNIKE X
Delta Air LinesGeorgiaTransportation and LogisticsDELA X
MicrosoftWashingtonIT, Internet, Software & ServicesMICR X
Fidelity InvestmentsMassachusettsBanking and Financial ServicesFIDI X
SonyNew YorkSemiconductors, Electronics, Hardware & EquipmentSONY X
Charles SchwabCaliforniaBanking and Financial ServicesCSCH X
IBMNew YorkIT, Internet, Software & ServicesIBMM X
AdidasOregonClothing, Shoes, Sports EquipmentADDS X
Texas InstrumentsTexasSemiconductors, Electronics, Hardware & EquipmentTEXI X
3MMinnesotaPackaged Goods3MMM X
Lockheed MartinMarylandAerospace & DefenseLOCM X
OracleCaliforniaIT, Internet, Software & ServicesORCL X
BMW GroupNew JerseyAutomotive (Automotive and Suppliers)BMWG X
Honda MotorCaliforniaAutomotive (Automotive and Suppliers)HONDX X
Ford MotorMichiganAutomotive (Automotive and Suppliers)FORD X
H-E-BTexasRetail and WholesaleHEBB X
Costco WholesaleWashingtonRetail and WholesaleCOST X
BoeingIllinoisAerospace & DefenseBOEGXXX
Southwest AirlinesTexasTransportation and LogisticsSOAI X
CACI InternationalVirginiaAerospace & DefenseCACI X
MastercardNew YorkBanking and Financial ServicesMSCDX
The Hershey CompanyPennsylvaniaFood, Soft Beverages, Alcohol & TobaccoHERSXX
Toyota North AmericaTexasAutomotive (Automotive and Suppliers)TOYOXX
Eli Lilly and CompanyIndianaDrugs & BiotechnologyLILLXX
KPMGNew YorkProfessional ServicesKPMGXX
DowMichiganConstruction, Oil & Gas Operations, Mining and ChemicalsDOWWXX
TIAANew YorkBanking and Financial ServicesTIAAX
HumanaKentuckyInsuranceHUMNXX
Comcast NBCUniversalPennsylvaniaMedia & AdvertisingCNBCXX
The Cigna GroupConnecticutInsuranceCIGNXX
AbbVieIllinoisDrugs & BiotechnologyABBVX
WalmartArkansasRetail and WholesaleWALMX
RandstadGeorgiaBusiness Services & SuppliesRANDX
TD BankNew JerseyBanking and Financial ServicesTDBKX
KeyBankOhioBanking and Financial ServicesKYBKX
Southern CompanyGeorgiaUtilitiesSOUCX
EcolabMinnesotaBusiness Services & SuppliesECOLX
Northrop GrummanVirginiaAerospace & DefenseNOGRX
Capital OneVirginiaBanking and Financial ServicesCAPOX
Sanofi U.S.New JerseyDrugs & BiotechnologySAFIX
Ally FinancialMichiganBanking and Financial ServicesALLYX X
General MotorsMichiganAutomotive (Automotive and Suppliers)GEMOX
TargetMinnesotaRetail and WholesaleTARGX
Centene CorporationMissouriInsuranceCENTX
Colgate-PalmoliveNew YorkPackaged GoodsCOPAX
United AirlinesIllinoisTransportation and LogisticsUNAIX
Procter & GambleOhioPackaged GoodsP&GGX X
American Family InsuranceWisconsinInsuranceAMFIX
WalgreensIllinoisRetail and WholesaleWALGX
Allstate Insurance CompanyIllinoisInsuranceALICX
Best BuyMinnesotaRetail and WholesaleBEBUX
Wyndham Hotels & ResortsNew JerseyTravel & LeisureWYNHX
Blue Shield of CaliforniaCaliforniaInsuranceSHIE X
MedtronicMinnesotaHealth Care Equipment & ServicesMEDTXXX
HiltonVirginiaTravel & LeisureHILT X
EYNew YorkProfessional ServicesEYYY X
ADPNew JerseyIT, Internet, Software & ServicesADPP X
AbbottIllinoisHealth Care Equipment & ServicesABBT X

References

  1. Afzal, F., & Lyu, K. (2025). Do women leaders promote social sustainability? Exploring the effect of board diversity on equity, diversity, and inclusion practices of project-based organizations. Project Management Journal, 87569728251357495. [Google Scholar] [CrossRef]
  2. Ahmed, S., Alshater, M. M., El Ammari, A., & Hammami, H. (2022). Artificial intelligence and machine learning in finance: A bibliometric review. Research in International Business and Finance, 61, 101646. [Google Scholar] [CrossRef]
  3. Ainspan, N. D., & Saboe, K. N. (2021). Military veteran employment: A guide for the data-driven leader. Oxford University Press. [Google Scholar]
  4. Alagna, S. W., Reddy, D. M., & Collins, D. (1982). Perceptions of functioning in mixed-sex and male medical training groups. Academic Medicine, 57(10), 801–803. [Google Scholar] [CrossRef] [PubMed]
  5. Allen, R. S., Dawson, G., Wheatley, K., & White, C. S. (2007). Perceived diversity and organizational performance. Employee Relations, 30(1), 20–33. [Google Scholar] [CrossRef]
  6. Allen, R. S., & Montgomery, K. A. (2001). Applying an organizational development approach to creating diversity. Organizational Dynamics, 30(2), 149–161. [Google Scholar] [CrossRef]
  7. Amorelli, M.-F., & García-Sánchez, I.-M. (2021). Trends in the dynamic evolution of board gender diversity and corporate social responsibility. Corporate Social Responsibility and Environmental Management, 28(2), 537–554. [Google Scholar] [CrossRef]
  8. Anderson, P., Chartier, T., & Langville, A. (2019). The rankability of data. SIAM Journal on Mathematics of Data Science, 1(1), 121–143. [Google Scholar] [CrossRef]
  9. Anderson, P. E., Chartier, T. P., Langville, A. N., & Pedings-Behling, K. E. (2022). Fairness and the set of optimal rankings for the linear ordering problem. Optimization and Engineering, 23(3), 1289–1317. [Google Scholar] [CrossRef]
  10. Armstrong, A. (2025). Diversity, equity and inclusion work: A difference that makes a difference…? Equality, Diversity and Inclusion: An International Journal, 44(5), 588–601. [Google Scholar] [CrossRef]
  11. Ascheuer, N., Escudero, L. F., Grötschel, M., & Stoer, M. (1993). A cutting plane approach to the sequential ordering problem (with applications to job scheduling in manufacturing). SIAM Journal on Optimization, 3(1), 25–42. [Google Scholar] [CrossRef]
  12. Asif, M., Khan, P. A., Irfan, F., Salim, M., Jan, A., & Khan, M. (2023). Is gender diversity is diversity washing or good governance for firm sustainable development goal performance: A scoping review. Environmental Science and Pollution Research, 30(53), 114690–114705. [Google Scholar] [CrossRef]
  13. Aßländer, M. S., Gössling, T., & Seele, P. (2016). Business ethics in a European perspective: A case for unity in diversity? Journal of Business Ethics, 139, 633–637. [Google Scholar] [CrossRef]
  14. Baioletti, M., Milani, A., & Santucci, V. (2018, July 8–13). Algebraic crossover operators for permutations. 2018 IEEE Congress on Evolutionary Computation (CEC) (pp. 1–8), Rio de Janeiro, Brazil. [Google Scholar]
  15. Balakrishnan, K., Copat, R., De la Parra, D., & Ramesh, K. (2023). Racial diversity exposure and firm responses following the murder of George Floyd. Journal of Accounting Research, 61(3), 737–804. [Google Scholar] [CrossRef]
  16. Bazzi, S., Fiszbein, M., & Gebresilasse, M. (2021). “Rugged individualism” and collective (in) action during the COVID-19 pandemic. Journal of Public Economics, 195, 104357. [Google Scholar] [CrossRef]
  17. Beckert, J., & Koch, T. (2025). Corporate perspectives on diversity: Engagement, communication motives, and the Diversity-Washing Dilemma. Journal of Marketing Communications, 31, 538–555. [Google Scholar] [CrossRef]
  18. Behlau, H., Wobst, J., & Lueg, R. (2024). Measuring board diversity: A systematic literature review of data sources, constructs, pitfalls, and suggestions for future research. Corporate Social Responsibility and Environmental Management, 31(2), 977–992. [Google Scholar] [CrossRef]
  19. Beji, R., Yousfi, O., Loukil, N., & Omri, A. (2021). Board diversity and corporate social responsibility: Empirical evidence from France. Journal of Business Ethics, 173, 133–155. [Google Scholar] [CrossRef]
  20. Bermiss, S., Green, J., & Hand, J. R. (2024). Racial/Ethnic misrepresentation of and bias against minority executives. Journal of Economics, Race, and Policy, 8(1), 74–103. [Google Scholar] [CrossRef]
  21. Beugelsdijk, S., Kostova, T., & Roth, K. (2017). An overview of Hofstede-inspired country-level culture research in international business since 2006. Journal of International Business Studies, 48(1), 30–47. [Google Scholar] [CrossRef]
  22. Bhadury, J., Mighty, E. J., & Damar, H. (2000). Maximizing workforce diversity in project teams: A network flow approach. Omega, 28(2), 143–153. [Google Scholar] [CrossRef]
  23. Bianchi, M. T., Morrone, C., Valerio, M., & Donato, F. (2022). Board diversity and firm performance: An empirical analysis of Italian small-medium enterprise. Corporate Ownership & Control, 19(3), 8–24. [Google Scholar]
  24. Bickerstaffe, G., & Ridgers, B. (2007). Ranking of business schools. Journal of Management Development, 26(1), 61–66. [Google Scholar] [CrossRef]
  25. Birge, J. R., & Linetsky, V. (Eds.). (2007). Handbooks in operations research and management science: Financial engineering. Elsevier. [Google Scholar]
  26. Blest, D. C. (2000). Theory & methods: Rank correlation—An alternative measure. Australian & New Zealand Journal of Statistics, 42(1), 101–111. [Google Scholar]
  27. Boehm, S. A., Kunze, F., & Bruch, H. (2014). Spotlight on age-diversity climate: The impact of age-inclusive HR practices on firm-level outcomes. Personnel Psychology, 67(3), 667–704. [Google Scholar] [CrossRef]
  28. Borroni, C. G. (2013). A new rank correlation measure. Statistical papers, 54, 255–270. [Google Scholar] [CrossRef]
  29. Bouslah, K., Liern, V., Ouenniche, J., & Pérez-Gladish, B. (2023). Ranking firms based on their financial and diversity performance using multiple-stage unweighted TOPSIS. International Transactions in Operational Research, 30(5), 2485–2505. [Google Scholar] [CrossRef]
  30. Breiman, L. (2001). Random forests. Machine Learning, 45, 5–32. [Google Scholar] [CrossRef]
  31. Buertey, S. (2021). Board gender diversity and corporate social responsibility assurance: The moderating effect of ownership concentration. Corporate Social Responsibility and Environmental Management, 28(6), 1579–1590. [Google Scholar] [CrossRef]
  32. Cabitza, F., Campagner, A., Del Zotti, F., Ravizza, A., & Sternini, F. (2020, July 21–25). All you need is higher accuracy? On the quest for minimum acceptable accuracy for medical artificial intelligence. e-Health Procedings of the 12th International Conference on e-Health (pp. 21–23), Online. [Google Scholar]
  33. Cachat-Rosset, G., Carillo, K., & Klarsfeld, A. (2019). Reconstructing the concept of diversity climate—A critical review of its definition, dimensions, and operationalization. European Management Review, 16(4), 863–885. [Google Scholar] [CrossRef]
  34. Cahyono, S., Harymawan, I., & Kamarudin, K. A. (2023). The impacts of tenure diversity on boardroom and corporate carbon emission performance: Exploring from the moderating role of corporate innovation. Corporate Social Responsibility and Environmental Management, 30(5), 2507–2535. [Google Scholar] [CrossRef]
  35. Cameron, T. R., Charmot, S., & Pulaj, J. (2021). On the linear ordering problem and the rankability of data. arXiv, arXiv:2104.05816. [Google Scholar] [CrossRef]
  36. Campbell, K., & Mínguez-Vera, A. (2008). Gender diversity in the boardroom and firm financial performance. Journal of Business Ethics, 83, 435–451. [Google Scholar] [CrossRef]
  37. Canbek, G., Sagiroglu, S., Temizel, T. T., & Baykal, N. (2017, October 5–8). Binary classification performance measures/metrics: A comprehensive visualized roadmap to gain new insights. 2017 International Conference on Computer Science and Engineering (UBMK) (pp. 821–826), Antalya, Turkey. [Google Scholar]
  38. Canbek, G., Taskaya Temizel, T., & Sagiroglu, S. (2022). PToPI: A comprehensive review, analysis, and knowledge representation of binary classification performance measures/metrics. SN Computer Science, 4(1), 13. [Google Scholar] [CrossRef] [PubMed]
  39. Carmona, M. A. Á., Ochoa, J. A. C., & Trinidad, J. F. M. (2013, November 11–15). Combining techniques to find the number of bins for discretization. 2013 32nd International Conference of the Chilean Computer Science Society (SCCC) (pp. 54–57), Temuco, Chile. [Google Scholar]
  40. Carter, D., D’Souza, F. P., Simkins, B. J., & Simpson, W. G. (2007). The diversity of corporate board committees and firm financial performance. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=972763 (accessed on 21 May 2025).
  41. Chenery, H. B., & Watanabe, T. (1958). International comparisons of the structure of production. Econometrica: Journal of the Econometric Society, 26(4), 487–521. [Google Scholar] [CrossRef]
  42. Choi, H., Hong, S., & Lee, J. W. (2018). Does increasing gender representativeness and diversity improve organizational integrity? Public Personnel Management, 47(1), 73–92. [Google Scholar] [CrossRef]
  43. Collins, S. M. (1993). Blacks on the bubble: The vulnerability of black executives in white corporations. The Sociological Quarterly, 34(3), 429–447. [Google Scholar] [CrossRef]
  44. Cortes, G. S., Gao, G. P., Silva, F. B., & Song, Z. (2022). Unconventional monetary policy and disaster risk: Evidence from the subprime and COVID–19 crises. Journal of International Money and Finance, 122, 102543. [Google Scholar] [CrossRef]
  45. Cover, T., & Hart, P. (1967). Nearest neighbor pattern classification. IEEE Transactions on Information Theory, 13, 21–27. [Google Scholar] [CrossRef]
  46. Cox, T. (1994). Cultural diversity in organizations: Theory, research and practice. Berrett-Koehler Publishers. [Google Scholar]
  47. Cunningham, P., & Delany, S. J. (2021). k-Nearest neighbour classifiers—A tutorial. ACM Computing Surveys, 54, 1–25. [Google Scholar] [CrossRef]
  48. D’Amato, V., D’Ecclesia, R., & Levantesi, S. (2021). Fundamental ratios as predictors of ESG scores: A machine learning approach. Decisions in Economics and Finance, 44(2), 1087–1110. [Google Scholar] [CrossRef]
  49. Dangeti, P. (2017). Statistics for machine learning. Packt Publishing Ltd. [Google Scholar]
  50. Dantas, M. (2021). Are ESG funds more transparent? Available at SSRN 3269939. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=3269939 (accessed on 21 June 2025).
  51. Das Swain, V., Saha, K., Reddy, M. D., Rajvanshy, H., Abowd, G. D., & De Choudhury, M. (2020, April 25–30). Modeling organizational culture with workplace experiences shared on Glassdoor. 2020 CHI Conference on Human Factors in Computing Systems (pp. 1–15), Honolulu, HI, USA. [Google Scholar]
  52. De Abreu Dos Reis, C. R., Sastre Castillo, M. Á., & Roig Dobón, S. (2007). Diversity and business performance: 50 years of research. Service Business, 1, 257–274. [Google Scholar] [CrossRef]
  53. Delen, D., Kuzey, C., & Uyar, A. (2013). Measuring firm performance using financial ratios: A decision tree approach. Expert Systems with Applications, 40(10), 3970–3983. [Google Scholar] [CrossRef]
  54. Dennison, K. (2025, February 10). Companies that value DEIA perform better. Forbes. Available online: https://www.forbes.com/sites/karadennison/2025/02/10/companies-that-value-deia-perform-better/ (accessed on 13 September 2025).
  55. Ding, F., & Riccucci, N. M. (2023). How does diversity affect public organizational performance? A meta-analysis. Public Administration, 101(4), 1367–1393. [Google Scholar] [CrossRef]
  56. Dixon, M. F., Halperin, I., & Bilokon, P. (2020). Machine learning in finance (Vol. 1170). Springer International Publishing. [Google Scholar]
  57. Dobbin, F., & Kalev, A. (2022). Getting to diversity: What works and what doesn’t. Harvard University Press. [Google Scholar]
  58. Dube, S., & Zhu, C. (2021). The disciplinary effect of social media: Evidence from firms’ responses to Glassdoor reviews. Journal of Accounting Research, 59(5), 1783–1825. [Google Scholar] [CrossRef]
  59. Dungan, R. (2024, November 1). ‘Opportunities not outcomes’|Boeing disbands DEI department to focus on a ‘merit-based performance system’. HR Grapevine USA. Available online: https://www.hrgrapevine.com/us/content/article/2024-11-01-aircraft-giant-ditches-dei-efforts-as-survey-underlines-consistent-employee-support (accessed on 12 May 2025).
  60. Dwork, C., Kumar, R., Naor, M., & Sivakumar, D. (2001, May 1–5). Rank aggregation methods for the web. 10th international conference on World Wide Web (pp. 613–622), Hong Kong, China. [Google Scholar]
  61. Ely, R. J. (2004). A field study of group diversity, participation in diversity education programs, and performance. Journal of Organizational Behavior: The International Journal of Industrial, Occupational and Organizational Psychology and Behavior, 25(6), 755–780. [Google Scholar] [CrossRef]
  62. Erhardt, N. L., Werbel, J. D., & Shrader, C. B. (2003). Board of director diversity and firm financial performance. Corporate Governance: An International Review, 11(2), 102–111. [Google Scholar] [CrossRef]
  63. Espeland, W., & Sauder, M. (2008). Rankings and diversity. Southern California Review of Law and Social Justice, 18, 587. [Google Scholar]
  64. Fernández-Delgado, M., Cernadas, E., Barro, S., & Amorim, D. (2014). Do we need hundreds of classifiers to solve real world classification problems? The Journal of Machine Learning Research, 15(1), 3133–3181. [Google Scholar]
  65. Feroz, E. H., Kim, S., & Raab, R. L. (2003). Financial statement analysis: A data envelopment analysis approach. Journal of the operational Research Society, 54(1), 48–58. [Google Scholar] [CrossRef]
  66. Ferrero-Ferrero, I., Fernández-Izquierdo, M. Á., & Muñoz-Torres, M. J. (2015). Integrating sustainability into corporate governance: An empirical study on board diversity. Corporate Social Responsibility and Environmental Management, 22(4), 193–207. [Google Scholar] [CrossRef]
  67. Filbeck, G., Foster, B., Preece, D., & Zhao, X. (2017). Does diversity improve profits and shareholder returns? Evidence from top rated companies for diversity by DiversityInc. Advances in Accounting, 37, 94–102. [Google Scholar] [CrossRef]
  68. Filbeck, G., Gorman, R., & Zhao, X. (2013). Are the best of the best better than the rest? The effect of multiple rankings on company value. Review of Quantitative Finance and Accounting, 41, 695–722. [Google Scholar] [CrossRef]
  69. Fine, M. G., Johnson, F. L., & Ryan, M. S. (1990). Cultural diversity in the workplace. Public Personnel Management, 19(3), 305–320. [Google Scholar] [CrossRef]
  70. Foma, E. (2014). Impact of workplace diversity. Review of Integrative Business and Economics Research, 3(1), 382. [Google Scholar]
  71. Foster, B. P., Manikas, A. S., & Kroes, J. R. (2023). Which diversity measures best capture public company value? Corporate Social Responsibility and Environmental Management, 30(1), 236–247. [Google Scholar] [CrossRef]
  72. Friedman, J. H. (2001). Greedy function approximation: A gradient boosting machine. Annals of Statistics, 1189–1232. [Google Scholar]
  73. Frynas, J. G., & Yamahaki, C. (2016). Corporate social responsibility: Review and roadmap of theoretical perspectives. Business Ethics: A European Review, 25(3), 258–285. [Google Scholar] [CrossRef]
  74. Garcia, M. F., Liu, H. F., Triana, M. d. C., & Treviño, L. J. (2025). Support for sustainable development goal 5 and social performance: The role of diversity targets, work-life balance practices, and female representation. Human Resource Management, 64(5), 1457–1479. [Google Scholar] [CrossRef]
  75. Garcia, S., Luengo, J., Sáez, J. A., Lopez, V., & Herrera, F. (2012). A survey of discretization techniques: Taxonomy and empirical analysis in supervised learning. IEEE Transactions on Knowledge and Data Engineering, 25(4), 734–750. [Google Scholar] [CrossRef]
  76. Gholamy, A., Kreinovich, V., & Kosheleva, O. (2018). Why 70/30 or 80/20 relation between training and testing sets: A pedagogical explanation. International Journal of Intelligent Technologies and Applied Statistics, 11(2), 105–111. [Google Scholar]
  77. Global, D. (2025, May 14). 2025 gen z and millennial survey: Focused on growth and learning (Tech. Rep.). Deloitte. Available online: https://www.deloitte.com/global/en/issues/work/genz-millennial-survey.html (accessed on 13 September 2025).
  78. Glover, F., Klastorin, T., & Kongman, D. (1974). Optimal weighted ancestry relationships. Management Science, 20(8), 1190–1193. [Google Scholar] [CrossRef]
  79. Gordon, A. D. (1979). A measure of the agreement between rankings. Biometrika, 66(1), 7–15. [Google Scholar] [CrossRef]
  80. Gotsis, G., & Kortezi, Z. (2013). Ethical paradigms as potential foundations of diversity management initiatives in business organizations. Journal of Organizational Change Management, 26(6), 948–976. [Google Scholar] [CrossRef]
  81. Gray Miller, J. (2023a). 2023 top 50 companies. Available online: https://www.fair360.com/top-50-list/2023/ (accessed on 21 June 2025).
  82. Gray Miller, J. (2023b). Methodology for fair360’s top companies rankings. Available online: https://www.fair360.com/methodology-for-diversityincs-top-companies-rankings/ (accessed on 21 June 2025).
  83. Grötschel, M., Jünger, M., & Reinelt, G. (1984). A cutting plane algorithm for the linear ordering problem. Operations Research, 32(6), 1195–1220. [Google Scholar] [CrossRef]
  84. Gupta, J., & Vegelin, C. (2016). Sustainable development goals and inclusive development. International Environmental Agreements: Politics, Law and Economics, 16, 433–448. [Google Scholar] [CrossRef]
  85. Harjoto, M., Laksmana, I., & Lee, R. (2015). Board diversity and corporate social responsibility. Journal of Business Ethics, 132, 641–660. [Google Scholar] [CrossRef]
  86. Hartenian, L. S., & Gudmundson, D. E. (2000). Cultural diversity in small business: Implications for firm performance. Journal of Developmental Entrepreneurship, 5(3), 209. [Google Scholar]
  87. Hentschel, T., Shemla, M., Wegge, J., & Kearney, E. (2013). Perceived diversity and team functioning: The role of diversity beliefs and affect. Small Group Research, 44(1), 33–61. [Google Scholar] [CrossRef]
  88. Herdman, A. O., & McMillan-Capehart, A. (2010). Establishing a diversity program is not enough: Exploring the determinants of diversity climate. Journal of Business and Psychology, 25, 39–53. [Google Scholar] [CrossRef]
  89. Higgins, A. (2020). The relationship between diversity climate perceptions and organizational citizenship behavior and work engagement: The mediating role of overall organizational justice [Unpublished master’s thesis, San Jose State University]. [Google Scholar]
  90. Hilson, C. (2024). Climate change and the politicization of ESG in the US. Frontiers in Political Science, 6, 1332399. [Google Scholar] [CrossRef]
  91. Hunt, V., Layton, D., & Prince, S. (2015). Diversity matters. McKinsey & Company, 1(1), 15–29. [Google Scholar]
  92. ISO. (2021). ISO 37000:2021 Governance of organizations. Guidance. Available online: https://www.iso.org/standard/65036.html (accessed on 20 June 2025).
  93. Jane Lenard, M., Yu, B., Anne York, E., & Wu, S. (2014). Impact of board gender diversity on firm risk. Managerial Finance, 40(8), 787–803. [Google Scholar] [CrossRef]
  94. Jauhari, H., & Singh, S. (2013). Perceived diversity climate and employees’ organizational loyalty. Equality, Diversity and Inclusion: An International Journal, 32(3), 262–276. [Google Scholar] [CrossRef]
  95. Jayne, M. E., & Dipboye, R. L. (2004). Leveraging diversity to improve business performance: Research findings and recommendations for organizations. Human Resource Management: Published in Cooperation with the School of Business Administration, The University of Michigan and in alliance with the Society of Human Resources Management, 43(4), 409–424. [Google Scholar] [CrossRef]
  96. Kamal, Y., & Ferdousi, M. (2009). Managing diversity at workplace: A case study of hp. ASA University Review, 3(2), 157–170. [Google Scholar]
  97. Kendall, M. G. (1938). A new measure of rank correlation. Biometrika, 30(1–2), 81–93. [Google Scholar] [CrossRef]
  98. Kirkman, B. L., Tesluk, P. E., & Rosen, B. (2004). The impact of demographic heterogeneity and team leader-team member demographic fit on team empowerment and effectiveness. Group & Organization Management, 29(3), 334–368. [Google Scholar] [CrossRef]
  99. Klass, O. S., Biham, O., Levy, M., Malcai, O., & Solomon, S. (2006). The Forbes 400 and the Pareto wealth distribution. Economics Letters, 90(2), 290–295. [Google Scholar] [CrossRef]
  100. Kleinbaum, D. G., Dietz, K., Gail, M., & Klein, M. (2002). Logistic regression. Springer. [Google Scholar]
  101. Kochan, T., Bezrukova, K., Ely, R., Jackson, S., Joshi, A., Jehn, K., Leonard, J., Levine, D., & Thomas, D. (2003). The effects of diversity on business performance: Report of the diversity research network. Human Resource Management: Published in Cooperation with the School of Business Administration, The University of Michigan and in Alliance with the Society of Human Resources Management, 42(1), 3–21. [Google Scholar] [CrossRef]
  102. Kondo, S., Komachi, M., Matsumoto, Y., Sudoh, K., Duh, K., & Tsukada, H. (2011, December 6–9). Learning of linear ordering problems and its application to JE patent translation in NTCIR-9 PatentMT. NTCIR-9 Workshop Meeting, Tokyo, Japan. [Google Scholar]
  103. Koseoglu, M. A., Arici, H. E., Saydam, M. B., & Olorunsola, V. O. (2025). Financial predictors of firms’ diversity scores: A machine learning approach. Equality, Diversity and Inclusion: An International Journal. [Google Scholar] [CrossRef]
  104. Kossek, E. E., & Zonia, S. C. (1993). Assessing diversity climate: A field study of reactions to employer efforts to promote diversity. Journal of Organizational Behavior, 14(1), 61–81. [Google Scholar] [CrossRef]
  105. Kotsiantis, S. B. (2013). Decision trees: A recent overview. Artificial Intelligence Review, 39, 261–283. [Google Scholar] [CrossRef]
  106. Kuckartz, U., Rädiker, S., Ebert, T., & Schehl, J. (2013). Statistik: Eine verständliche Einführung. Springer. [Google Scholar]
  107. Kuo, C.-C., Glover, F., & Dhir, K. S. (1993). Analyzing and modeling the maximum diversity problem by zero-one programming. Decision Sciences, 24(6), 1171–1185. [Google Scholar] [CrossRef]
  108. Landbase. (2025). Companies using glassdoor by glassdoor, inc in 2025. Available online: https://data.landbase.com/technology/glassdoor/ (accessed on 12 September 2025).
  109. Latukha, M., Kriklivetc, A., & Podgainyi, F. (2022). Generation diverse talent management practices: Main determinants and its influence on firm performance. Journal of East-West Business, 28(4), 291–322. [Google Scholar] [CrossRef]
  110. Lauring, J., & Selmer, J. (2011). Multicultural organizations: Does a positive diversity climate promote performance? European Management Review, 8(2), 81–93. [Google Scholar] [CrossRef]
  111. Lawson, C., & Montgomery, D. C. (2006). Logistic regression analysis of customer satisfaction data. Quality and Reliability Engineering International, 22(8), 971–984. [Google Scholar] [CrossRef]
  112. Lee, H. (2019). Does increasing racial minority representation contribute to overall organizational performance? The role of organizational mission and diversity climate. The American Review of Public Administration, 49(4), 454–468. [Google Scholar] [CrossRef]
  113. Lee, O., Joo, H., Choi, H., & Cheon, M. (2022). Proposing an integrated approach to analyzing ESG data via machine learning and deep learning algorithms. Sustainability, 14(14), 8745. [Google Scholar] [CrossRef]
  114. Lev, B., & Sunder, S. (1979). Methodological issues in the use of financial ratios. Journal of Accounting and Economics, 1(3), 187–210. [Google Scholar] [CrossRef]
  115. Li, H., Sun, J., Li, J. C., & Yan, X. Y. (2013). Forecasting business failure using two-stage ensemble of multivariate discriminant analysis and logistic regression. Expert Systems, 30(5), 385–397. [Google Scholar] [CrossRef]
  116. Li, X., Wang, X., & Xiao, G. (2019). A comparative study of rank aggregation methods for partial and top ranked lists in genomic applications. Briefings in Bioinformatics, 20(1), 178–189. [Google Scholar] [CrossRef]
  117. Lim, J., Vaughan, Y., & Jang, J. (2023). Do employees’ perceptions of diversity management enhance firm’s financial performance: The moderating role of board members’ diversity level. International Journal of Contemporary Hospitality Management, 35(11), 3990–4009. [Google Scholar] [CrossRef]
  118. LLC, B. P. (2023, July). 2023 diversity, equity, and inclusion barometer: The value of diversity, equity & inclusion initiatives: Boom or bust? (Tech. Rep.). Report based on survey of 400 C-Suite and HR leaders. Bridge Partners LLC. Available online: https://bridgepartnersllc.com/insight/survey-2023-diversity-equity-and-inclusion-barometer/ (accessed on 13 September 2025).
  119. Lorenzo, R., & Reeves, M. (2018). How and where diversity drives financial performance. Available online: https://hbr.org/2018/01/how-and-where-diversity-drives-financial-performance (accessed on 21 June 2025).
  120. LSEG. (2023). Ftse diversity and inclusion index. Available online: https://www.lseg.com/en/ftse-russell/indices/diversity-and-inclusion-index#t-methodology (accessed on 21 June 2025).
  121. Lubis, A. R., & Lubis, M. (2020). Optimization of distance formula in K-Nearest Neighbor method. Bulletin of Electrical Engineering and Informatics, 9, 326–338. [Google Scholar] [CrossRef]
  122. Lundberg, S. M., & Lee, S. I. (2017). A unified approach to interpreting model predictions. In Advances in neural information processing systems (Vol. 30). The MIT Press. [Google Scholar]
  123. Luque, A., Carrasco, A., Martín, A., & de Las Heras, A. (2019). The impact of class imbalance in classification performance metrics based on the binary confusion matrix. Pattern Recognition, 91, 216–231. [Google Scholar] [CrossRef]
  124. Ly-Le, T.-M. (2022). Hiring for gender diversity in tech. Journal of Management Development, 41(6), 393–403. [Google Scholar] [CrossRef]
  125. Madera, J. M., Dawson, M., & Neal, J. A. (2013). Hotel managers’ perceived diversity climate and job satisfaction: The mediating effects of role ambiguity and conflict. International Journal of Hospitality Management, 35, 28–34. [Google Scholar] [CrossRef]
  126. Malul, M., Hadad, Y., & Ben-Yair, A. (2009). Measuring and ranking of economic, environmental and social efficiency of countries. International Journal of Social Economics, 36(8), 832–843. [Google Scholar] [CrossRef]
  127. Martin-Melero, I., Gomez-Martinez, R., Medrano-Garcia, M. L., & Hernandez-Perlines, F. (2025). Comparison of sectorial and financial data for ESG scoring of mutual funds with machine learning. Financial Innovation, 11(1), 84. [Google Scholar] [CrossRef]
  128. Martí, R., Gallego, M., & Duarte, A. (2010). An exact method for the maximum diversity problem. European Journal of Operational Research, 200(1), 36–44. [Google Scholar] [CrossRef]
  129. Martí, R., & Reinelt, G. (2011). The linear ordering problem: Exact and heuristic methods in combinatorial optimization (Vol. 175). Springer Science & Business Media. [Google Scholar]
  130. Martín-Zamora, M.-P., Borralho, J. M. C., & Hernández-Linares, R. (2025). Gender diversity in top management teams and corporate reputation: Evidence from Spanish listed companies. Gender, Work & Organization, 32(3), 1144–1168. [Google Scholar]
  131. McGowan, B. L., Hopson, R., Epperson, L., & Leopold, M. (2025). Navigating the backlash and reimagining diversity, equity, and inclusion in a changing sociopolitical and legal landscape. Journal of College and Character, 26(1), 1–11. [Google Scholar] [CrossRef]
  132. McKay, P. F., Avery, D. R., & Morris, M. A. (2008). Mean racial-ethnic differences in employee sales performance: The moderating role of diversity climate. Personnel Psychology, 61(2), 349–374. [Google Scholar] [CrossRef]
  133. Mishra, N. K., & Singh, P. K. (2022). Linear ordering problem based classifier chain using genetic algorithm for multi-label classification. Applied Soft Computing, 117, 108395. [Google Scholar] [CrossRef]
  134. Mitchell, J. E., & Borchers, B. (1996). Solving real-world linear ordering problems using a primal-dual interior point cutting plane method. Annals of Operations Research, 62(1), 253–276. [Google Scholar] [CrossRef]
  135. Moon, K.-K., & Christensen, R. K. (2020). Realizing the performance benefits of workforce diversity in the US federal government: The moderating role of diversity climate. Public Personnel Management, 49(1), 141–165. [Google Scholar] [CrossRef]
  136. Moreno-Gómez, J., Lafuente, E., & Vaillant, Y. (2018). Gender diversity in the board, women’s leadership and business performance. Gender in Management, 33(2), 104–122. [Google Scholar] [CrossRef]
  137. Murphy, A. J., & Collins, J. M. (2015). The relevance of diversity in the job attribute preferences of college students. College Student Journal, 49(2), 199–216. [Google Scholar]
  138. Newsweek, & Group, P. I. (2025). America’s greatest workplaces for inclusion & diversity 2025. Available online: https://rankings.newsweek.com/americas-greatest-workplaces-diversity-2025 (accessed on 21 June 2025).
  139. Ng, E., Fitzsimmons, T., Kulkarni, M., Ozturk, M. B., April, K., Banerjee, R., & Muhr, S. L. (2025). The anti-DEI agenda: Navigating the impact of Trump’s second term on diversity, equity and inclusion. Equality, Diversity and Inclusion: An International Journal, 44(2), 137–150. [Google Scholar] [CrossRef]
  140. Nicholson-Crotty, S., Nicholson-Crotty, J., & Fernandez, S. (2017). Will more black cops matter? Officer race and police-involved homicides of black citizens. Public Administration Review, 77(2), 206–216. [Google Scholar] [CrossRef]
  141. Okoye, K., & Hosseini, S. (2024). Correlation tests in R: Pearson cor, kendall’s tau, and spearman’s rho. In R programming. Springer. [Google Scholar]
  142. Opstrup, N., & Villadsen, A. R. (2015). The right mix? Gender diversity in top management teams and financial performance. Public Administration Review, 75(2), 291–301. [Google Scholar] [CrossRef]
  143. Orazalin, N., & Baydauletov, M. (2020). Corporate social responsibility strategy and corporate environmental and social performance: The moderating role of board gender diversity. Corporate Social Responsibility and Environmental Management, 27(4), 1664–1676. [Google Scholar] [CrossRef]
  144. Owens, C. T., & Kukla-Acevedo, S. (2012). Network diversity and the ability of public managers to influence performance. The American Review of Public Administration, 42(2), 226–245. [Google Scholar] [CrossRef]
  145. Palalar Alkan, D., Ozbilgin, M., & Kamasak, R. (2022). Social innovation in managing diversity: COVID-19 as a catalyst for change. Equality, Diversity and Inclusion: An International Journal, 41(5), 709–725. [Google Scholar] [CrossRef]
  146. Pasztor, S. K. (2019). Exploring the framing of diversity rhetoric in “top-rated in diversity” organizations. International Journal of Business Communication, 56(4), 455–475. [Google Scholar] [CrossRef]
  147. Patrick, H. A., & Kumar, V. R. (2012). Managing workplace diversity: Issues and challenges. Sage Open, 2(2), 2158244012444615. [Google Scholar] [CrossRef]
  148. Paule-Vianez, J., Gutiérrez-Fernández, M., & Coca-Pérez, J. L. (2019). Prediction of financial distress in the Spanish banking system: An application using artificial neural networks. Applied Economic Analysis, 28(82), 69–87. [Google Scholar] [CrossRef]
  149. Peachman, R. R. (2023a). America’s best employers for new grads. Available online: https://www.forbes.com/lists/best-employers-for-new-grads/?sh=7f55c3bd203a (accessed on 21 June 2025).
  150. Peachman, R. R. (2023b). America’s best employers for veterans. Available online: https://www.forbes.com/lists/best-employers-for-veterans/?sh=2e1b00313606 (accessed on 21 June 2025).
  151. Peachman, R. R. (2023c). Meet america’s best employers for diversity 2023. Available online: https://www.forbes.com/sites/rachelpeachman/2023/04/25/meet-americas-best-employers-for-diversity-2023/?sh=424561ac50af (accessed on 21 June 2025).
  152. Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., & Duchesnay, É. (2011). Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research, 12, 2825–2830. [Google Scholar]
  153. Prasatha, V. S., Alfeilate, H. A. A., Hassanate, A. B., Lasassmehe, O., Tarawnehf, A. S., Alhasanatg, M. B., & Salmane, H. S. E. (2017). Effects of distance measure choice on knn classifier performance—A review. arXiv, arXiv:1708.04321. [Google Scholar]
  154. Primec, A., & Belak, J. (2022). Sustainable CSR: Legal and managerial demands of the new EU legislation (CSRD) for the future corporate governance practices. Sustainability, 14(24), 16648. [Google Scholar] [CrossRef]
  155. Rajgopal, S., Srivastava, A., & Zhao, R. (2023). Do political anti-ESG sanctions have any economic substance. The Case of Texas Law Mandating Divestment from ESG Asset Management Companies. SSRN Electron. J. Available online: https://papers.ssrn.com/sol3/papers.cfm?abstract_id=4386268 (accessed on 21 June 2025). [CrossRef]
  156. Ranta, M., & Ylinen, M. (2023). Board gender diversity and workplace diversity: A machine learning approach. Corporate Governance: The International Journal of Business in Society, 23(5), 995–1018. [Google Scholar] [CrossRef]
  157. Rao, K., & Tilt, C. (2016). Board composition and corporate social responsibility: The role of diversity, gender, strategy and decision making. Journal of Business Ethics, 138, 327–347. [Google Scholar] [CrossRef]
  158. Ray, T. G., & Triantaphyllou, E. (1998). Evaluation of rankings with regard to the possible number of agreements and conflicts. European Journal of Operational Research, 106(1), 129–136. [Google Scholar] [CrossRef]
  159. Reinwald, M., Huettermann, H., & Bruch, H. (2019). Beyond the mean: Understanding firm-level consequences of variability in diversity climate perceptions. Journal of Organizational Behavior, 40(4), 472–491. [Google Scholar] [CrossRef]
  160. Rhee, C. S., Woo, S., & Rhee, H. (2023). Effect of gender diversity on corporate soundness and social contribution. Corporate Social Responsibility and Environmental Management, 30(1), 419–430. [Google Scholar] [CrossRef]
  161. Rice, D. B., Young, N. C., Taylor, R. M., & Leonard, S. R. (2025). Politics and race in the workplace: Understanding how and when trump-supporting managers hinder black employees from thriving at work. Human Resource Management Journal, 35(1), 256–275. [Google Scholar] [CrossRef]
  162. Richard, O. C. (2000). Racial diversity, business strategy, and firm performance: A resource-based view. Academy of Management Journal, 43(2), 164–177. [Google Scholar] [CrossRef]
  163. Ritz, A., & Alfes, K. (2018). Multicultural public administration: Effects of language diversity and dissimilarity on public employees’ attachment to employment. Public Administration, 96(1), 84–103. [Google Scholar] [CrossRef]
  164. Rokach, L., & Maimon, O. (2005). Decision trees. In Data mining and knowledge discovery handbook (pp. 165–192). Springer. [Google Scholar]
  165. Roman, A., & Șargu, A. C. (2013). Analysing the financial soundness of the commercial banks in Romania: An approach based on the CAMELS framework. Procedia Economics and Finance, 6, 703–712. [Google Scholar] [CrossRef]
  166. Rose, C., Munch-Madsen, P., & Funch, M. (2013). Does board diversity really matter? Gender does not, but citizenship does. International Journal of Business Science & Applied Management (IJBSAM), 8(1), 16–27. [Google Scholar]
  167. Sabharwal, M. (2014). Is diversity management sufficient? Organizational inclusion to further performance. Public Personnel Management, 43(2), 197–217. [Google Scholar] [CrossRef]
  168. Sacco, J. M., & Schmitt, N. (2005). A dynamic multilevel model of demographic diversity and misfit effects. Journal of Applied Psychology, 90(2), 203. [Google Scholar] [CrossRef] [PubMed]
  169. Safavian, S. R., & Landgrebe, D. (1991). A survey of decision tree classifier methodology. IEEE Transactions on Systems, Man, and Cybernetics, 21(3), 660–674. [Google Scholar] [CrossRef]
  170. Santucci, V. (2021, July 1–June 28). Is algebraic differential evolution really a differential evolution scheme? 2021 IEEE Congress on Evolutionary Computation (CEC) (pp. 9–16), virtual. [Google Scholar]
  171. Santucci, V., Ceberio, J., & Baioletti, M. (2020, July 8–12). Gradient search in the space of permutations: An application for the linear ordering problem. 2020 Genetic and Evolutionary Computation Conference Companion (pp. 1704–1711), Cancún, Mexico. [Google Scholar]
  172. Sarker, I. H. (2021). Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Computer Science, 2(3), 160. [Google Scholar] [CrossRef] [PubMed]
  173. Schütte, S., Acevedo, P. N. M., & Flahault, A. (2018). Health systems around the world–a comparison of existing health system rankings. Journal of Global Health, 8(1), 010407. [Google Scholar] [CrossRef]
  174. Schwarz, A. (2023a). America’s best employers for women. Available online: https://www.forbes.com/lists/best-employers-women/?sh=497565c466c3 (accessed on 21 June 2025).
  175. Schwarz, A. (2023b). America’s best large employers 2023: The top 100. Available online: https://www.forbes.com/sites/forbesstaff/2023/03/22/americas-best-large-employers-2023-the-top-100/?sh=11efc18f1d81 (accessed on 21 June 2025).
  176. Seriwatana, P. (2021). Diversity climate as a key to employee retention: The moderating role of perceived cultural difference. Asian Administration & Management Review, 4(2). [Google Scholar]
  177. Shatnawi, R., Li, W., Swain, J., & Newman, T. (2010). Finding software metrics threshold values using ROC curves. Journal of Software Maintenance and Evolution: Research and Practice, 22(1), 1–16. [Google Scholar] [CrossRef]
  178. Shehatta, I., & Mahmood, K. (2016). Correlation among top 100 universities in the major six global rankings: Policy implications. Scientometrics, 109(2), 1231–1254. [Google Scholar] [CrossRef]
  179. Shemla, M., Meyer, B., Greer, L., & Jehn, K. A. (2016). A review of perceived diversity in teams: Does how members perceive their team’s composition affect team processes and outcomes? Journal of Organizational Behavior, 37, S89–S106. [Google Scholar] [CrossRef]
  180. Singal, M., & Gerde, V. W. (2015). Is diversity management related to financial performance in family firms? Family Business Review, 28(3), 243–259. [Google Scholar] [CrossRef]
  181. Singha, S. (2022). Social inclusion, equality, leadership, and diversity to attain sustainable development goal 5 in the Indian banking industry. Journal of International Women’s Studies, 23(5), 135–141. [Google Scholar]
  182. Song, Y.-g., Cao, Q.-l., & Zhang, C. (2018). Towards a new approach to predict business performance using machine learning. Cognitive Systems Research, 52, 1004–1012. [Google Scholar] [CrossRef]
  183. Strecht, P., Cruz, L., Soares, C., Mendes-Moreira, J., & Abreu, R. (2015, June 26-29). A comparative study of classification and regression algorithms for modelling students’ academic performance. International Educational Data Mining Society, Madrid, Spain. [Google Scholar]
  184. Sun, S., & Huang, R. (2010, August 10–12). An adaptive k-nearest neighbor algorithm. Seventh International Conference on Fuzzy Systems and Knowledge Discovery (Vol. 1, pp. 91–94), Yantai, China. [Google Scholar]
  185. Šilenskytė, A., & Rašković, M. (2024). Embedding diversity, equity, and inclusion (DEI) in international business education. In International business and sdg 8: Exploring the relationship between ib and society (pp. 299–318). Springer. [Google Scholar]
  186. Tayar, M. (2017). Ranking LGBT inclusion: Diversity ranking systems as institutional archetypes. Canadian Journal of Administrative Sciences/Revue Canadienne des Sciences de l’Administration, 34(2), 198–210. [Google Scholar] [CrossRef]
  187. Terjesen, S., Vinnicombe, S., & Freeman, C. (2007). Attracting Generation Y graduates: Organisational attributes, likelihood to apply and sex differences. Career Development International, 12(6), 504–522. [Google Scholar] [CrossRef]
  188. Tromble, R., & Eisner, J. (2009, August 6–7). Learning linear ordering problems for better translation. 2009 Conference on Empirical Methods in Natural Language Processing (pp. 1007–1016), Singapore. [Google Scholar]
  189. Van der Meer, M., & Roosblad, J. (2004). Overcoming marginalisation?: Gender and ethnic segregation in the dutch construction, health, IT and printing industries. Available online: https://hdl.handle.net/11245/1.427072 (accessed on 21 June 2025).
  190. Veganzones, D., & Séverin, E. (2018). An investigation of bankruptcy prediction in imbalanced datasets. Decision Support Systems, 112, 111–124. [Google Scholar] [CrossRef]
  191. Veltri, S., Mazzotta, R., & Rubino, F. E. (2021). Board diversity and corporate social performance: Does the family firm status matter? Corporate Social Responsibility and Environmental Management, 28(6), 1664–1679. [Google Scholar] [CrossRef]
  192. Vincent, M. (2022). Investors and workers in diversity data challenge. Available online: https://www.ft.com/content/a2c1ceeb-a5bd-400b-b475-a1c742884720 (accessed on 21 June 2025).
  193. Wolfson, N., Kraiger, K., & Finkelstein, L. (2011). The relationship between diversity climate perceptions and workplace attitudes. The Psychologist-Manager Journal, 14(3), 161–176. [Google Scholar] [CrossRef]
  194. Wolsey, L. A., & Nemhauser, G. L. (2014). Integer and combinatorial optimization. John Wiley & Sons, Inc. [Google Scholar]
  195. Wood, E. H. (2006). The internal predictors of business performance in small firms: A logistic regression analysis. Journal of Small Business and Enterprise Development, 13(3), 441–453. [Google Scholar] [CrossRef]
  196. Xu, R., Farooq, U., Alam, M. M., & Dai, J. (2024). How does cultural diversity determine green innovation? New empirical evidence from Asia region. Environmental Impact Assessment Review, 106, 107458. [Google Scholar] [CrossRef]
  197. Yahoo. (2023). Yahoo finance. Available online: https://finance.yahoo.com/ (accessed on 21 June 2025).
  198. Yang, L.-W., Nguyen, T. T. B., & Young, W.-J. (2024). Performance and Board Diversity: A Practical AI Perspective. Big Data and Cognitive Computing, 8(9), 106. [Google Scholar] [CrossRef]
  199. Yousaf, U. B., Jebran, K., & Wang, M. (2021). Can board diversity predict the risk of financial distress? Corporate Governance: The International Journal of Business in Society, 21(4), 663–684. [Google Scholar]
Figure 1. Histograms and density plots of diversity features.
Figure 1. Histograms and density plots of diversity features.
Admsci 15 00406 g001
Figure 2. Histograms and density plots of financial data.
Figure 2. Histograms and density plots of financial data.
Admsci 15 00406 g002
Figure 3. Histograms and density plots of financial ratios.
Figure 3. Histograms and density plots of financial ratios.
Admsci 15 00406 g003
Figure 4. Violinplot and pie chart of the continuous and discrete diversity index.
Figure 4. Violinplot and pie chart of the continuous and discrete diversity index.
Admsci 15 00406 g004
Figure 5. ROC curves of training sets.
Figure 5. ROC curves of training sets.
Admsci 15 00406 g005
Figure 6. ROC curves of testing sets.
Figure 6. ROC curves of testing sets.
Admsci 15 00406 g006
Figure 7. Radar plot of the diversity study.
Figure 7. Radar plot of the diversity study.
Admsci 15 00406 g007
Figure 8. Radar plot of the ethnic origin study.
Figure 8. Radar plot of the ethnic origin study.
Admsci 15 00406 g008
Figure 9. Radar plot of the global study.
Figure 9. Radar plot of the global study.
Admsci 15 00406 g009
Table 1. Variables in the machine learning study.
Table 1. Variables in the machine learning study.
Financial ratios
Full nameAbbrv.Full nameAbbrv.
Price/SalespsCurrent ratiocr
Enterprise Value/RevenueerQuick ratioqr
Enterprise Value/EBITDAeeCash ratioch
Beta (5Y Monthly)btDebt ratiodr
52-Week ChangewkROAra
Payout RatioprROEre
Profit MarginpfEPSes
Operating Marginop
Financial information
Full nameAbbrv.Full nameAbbrv.
RevenuervNon Current Assetsna
Normalized EBITDAebCurrent Liabilitiescl
Net IncomeniNon Current Liabilitiesnl
Current AssetscaWorking Capitalwc
CashcsEquityeq
Inventoryiv
Diversity variables
Full nameAbbrv.Full nameAbbrv.
Policy DiversitypdDiversity Indexdi
Women Employeeswo
Table 2. Rankings and scores included in the operation research study.
Table 2. Rankings and scores included in the operation research study.
StudyData SourceNameAbbrv.Companies
1. DiversityFair360Top 50 Companies For DiversityFa_D50
ForbesAmerica’s Best Employers for DiversityFo_D500
GlassdoorDiversity and Inclusion scoreGl_D49,233
2. Ethic originFair360Top Companies for Asian American ExecutivesFa_A19
Fair360Top Companies for Black ExecutivesFa_B28
Fair360Top Companies for Latino ExecutivesFa_L24
Fair360Top Companies for Native American/Pacific Islander ExecutivesFa_N23
3. GlobalForbesAmerica’s Best Employers for DiversityFo_D500
ForbesAmerica’s Best Employers for WomenFo_W400
ForbesAmerica’s Best Employers for New GradsFo_G300
ForbesAmerica’s Best Large EmployersFo_L500
ForbesAmerica’s Best Employers for VeteransFo_V150
Table 3. Hyperparameters tuned in GridSearch.
Table 3. Hyperparameters tuned in GridSearch.
AlgorithmParameterValues
K Nearest Neighborsn_neighbors[1, 5, 10, 15, 20]
metric[euclidean, manhattan, minkowski]
Logistic Regressionpenalty[l1, l2, elasticnet]
C[0.1, 1, 10]
Decision Treecriterion[gini, entropy, log_loss]
max_depth[2, 4, 8, 16, 32, 64, 128]
Table 4. Pearson correlation matrix of the financial and diversity variables (%).
Table 4. Pearson correlation matrix of the financial and diversity variables (%).
psereebtwkprpfopcrqrchdrrareesrvebnicacsivnaclnlwceqpdwodi
ps
er97
ee4049
bt−18−18−4
wk3129108
pr61437−8−11
pf56507−1527−15
op535527−1315141
cr241760−221916
qr30237−321232090
ch35278−12024228693
dr−15−9−3199−4−131−39−29−31
ra464011−1210−73330141415−14
re15145−79187331−525
es2018407−62811171816126−1
rv32−1−412070−7−3−2311511
eb21202−810−22311-324−32591587
ni29275−611−42715057−43511167292
ca650−212−193−22258413868067
cs18151−116−416841014−51551383857991
iv−8−85133−4−5−2−9−77−5185938227049
na68−1−1173126−51104512848262817653
cl21−1−310−161−12−7−715541187766195817083
nl580−863128−5201927127572527462529382
wc151013111126242523−2112213485552697640414424
eq109−1−1092125278−23821180826681864988726570
pd873−5−1435357−71−13-32234301−283
wo23−95−3−332−5−5−492−100−102−14−110409
di272717−614316174811−514061018211117−513910101529
Table 5. Performance metrics of the machine learning simulations.
Table 5. Performance metrics of the machine learning simulations.
ApproachModelTraining SetTesting Set
ACCPRESENSPEF1SAUCACCPRESENSPEF1SAUC
DummyRandom0.4890.4550.5000.4800.4760.5000.4140.3500.4830.3660.4060.500
Financial dataKNN0.6430.6540.4240.8190.5150.6440.4860.4500.2650.6940.3330.567
LR0.6180.5940.4560.7480.5160.6320.3430.1250.0590.6110.0800.245
DT0.6860.7680.4240.8970.5460.6960.5430.5500.3240.7500.4070.578
Financial ratiosKNN0.6680.6670.5120.7940.5790.7220.5860.6320.3530.8060.4530.603
LR0.6570.6790.4400.8320.5340.6640.6000.6670.3530.8330.4620.556
DT0.6500.6960.3840.8650.4950.6880.6570.7500.4410.8610.5560.631
Diversity dataKNN0.5790.5510.3040.8000.3920.6230.4140.2670.1180.6940.1630.455
LR0.5360.2220.0160.9550.0300.5390.5430.7500.0880.9720.1580.516
DT0.5960.5680.4000.7550.4690.6350.5430.5420.3820.6940.4480.522
Table 6. Intersection matrix of the individual rankings.
Table 6. Intersection matrix of the individual rankings.
Diversity StudyEthnic Origin StudyGlobal Study
Fa_DFo_DGl_D Fa_AFa_BFa_LFa_N Fo_DFo_WFo_GFo_LFo_V
Fa_D- Fa_A- Fo_D-
Fo_D35- Fa_B15- Fo_W231-
Gl_D3535-Fa_L1918- Fo_G174169-
Fa_N191823-Fo_L269224192-
Fo_V76596492-
Table 7. Positions of companies in each individual ranking inside the diversity, ethnic origin and global studies.
Table 7. Positions of companies in each individual ranking inside the diversity, ethnic origin and global studies.
Diversity Study RankingsEthnic Origin Study RankingsGlobal Study Rankings
Companies Fa_D Fo_D Gl_D Companies Fa_A Fa_B Fa_L Fa_N Companies Fo_D Fo_W Fo_G Fo_L Fo_V
MSCD151TOYO1144PROG111221816
MEDT21611SHIE2733INTL225141531
HERS32325MEDT31011P&GG322112314
TOYO43113HERS4626ALLY4426624
LILL5308LILL5355ACCT531163317
KPMG62219HILT6151011PNCF628383026
DOWW7274EYYY7972SALF716978
TIAA827ADPP81188MATT836231723
HUMN9315BOEG9131515CISC91281219
BOEG103528DOWW101299UNHE1032332934
CNBC111031CNBC11267APPL1114181127
CIGN121222ABBT12141414GOGL122142
ABBV13720KPMG1381212JPMO1334363136
WALM142933HUMN1451313PFIZ1415312632
RAND151826CIGN1541110AMEX158191037
TDBK1613 NASA1617811
KYBK171318 DELL1723212118
SOUC181916 NIKE1817302835
ECOL191527 DELA196557
NOGR201412 MICR20310310
CAPO21219 MEDT2138343515
SAFI223314 FIDI225225
ALLY23624 SONY2319271330
GEMO242010 CSCH247201620
TARG25921 IBMM259172525
CENT261723 ADDS2635292728
COPA2786 TEXI2729133638
UNAI28255 3MMM282662429
P&GG2942 LOCM292415144
AMFI302832 ORCL3021373721
WALG313435 BMWG311325199
ALIC323230 HOND3230323433
HOND332634 FORD3333283213
BEBU341117 HEBB3410413
WYNH352429 COST35183922
BOEG362724226
SOAI372012201
CACI3837353812
Table 8. Kendall’s τ correlation coefficients (%) for intersected rankings.
Table 8. Kendall’s τ correlation coefficients (%) for intersected rankings.
Diversity StudyEthnic Origin StudyGlobal Study
τ Fa_DFo_DGl_D τ Fa_AFa_BFa_LFa_N τ Fo_DFo_WFo_GFo_LFo_V
Fa_D- Fa_A- Fo_D-
Fo_D8- Fa_B6- Fo_W11-
Gl_D1828-Fa_L5631- Fo_G140-
Fa_N502682-Fo_L115854-
Fo_V-15193330-
Table 9. Optimal solutions and LOP values for the operations research studies.
Table 9. Optimal solutions and LOP values for the operations research studies.
StudyAlternativesOptimum OrderLOP Optimum
DiversityD.A[MSCD, TDBK, TIAA, P&GG, COPA, MEDT, DOWW, LILL, TOYO, HUMN, ABBV, CNBC,
CIGN, KYBK NOGR, SOUC, CAPO, ALLY, GEMO, BEBU, KPMG, TARG, CENT, HERS,
RAND, ECOL, UNAI, SAFI, BOEG, WYNH, AMFI, WALM, ALIC, HOND, WALG]
1381
D.B[MSCD, TDBK, P&GG, DOWW, TIAA, COPA, MEDT, LILL, TOYO, HUMN, ABBV, CNBC,
CIGN, KYBK, NOGR, SOUC, CAPO, ALLY, GEMO, TARG, BEBU, KPMG, CENT, HERS,
RAND, ECOL, UNAI, SAFI, BOEG, WYNH, AMFI, WALM, ALIC, HOND, WALG]
1381
D.C[MSCD, TDBK, TIAA, P&GG, MEDT, DOWW, LILL, TOYO, HUMN, ABBV, COPA, CNBC,
CIGN, KYBK, NOGR, SOUC, CAPO, GEMO, KPMG, SAFI, ALLY, TARG, BEBU, CENT,
HERS, RAND, ECOL, UNAI, BOEG, WYNH, AMFI, WALM, ALIC, HOND, WALG]
1381
Ethnic OriginEO.A[MEDT, SHIE, TOYO, HERS, LILL, CNBC, EYYY, ADPP, DOWW, CIGN, HILT, KPMG,
HUMN, ABBT, BOEG]
354
EO.B[MEDT, SHIE, TOYO, HERS, LILL, EYYY, CNBC, ADPP, DOWW, CIGN, HILT, KPMG,
HUMN, ABBT, BOEG]
354
EO.C[MEDT, SHIE, TOYO, HERS, LILL, CNBC, EYYY, ADPP, DOWW, HILT, CIGN, KPMG,
HUMN, ABBT, BOEG]
354
GlobalG.A[GOGL, FIDI, HEBB, DELA, SALF, NASA, MICR, PROG, ALLY, CISC, COST, APPL,
SOAI, P&GG, LOCM, INTL, ACCT, AMEX, CSCH, DELL, MATT, IBMM, BMWG, SONY,
3MMM, BOEG, PFIZ, NIKE, PNCF, FORD, ADDS, HOND, UNHE, JPMO, MEDT, TEXI,
ORCL, CACI]
2715
G.B[GOGL, FIDI, HEBB, DELA, SALF, NASA, MICR, PROG, ALLY, CISC, COST, P&GG,
APPL, SOAI, LOCM, INTL, ACCT, AMEX, CSCH, DELL, MATT, IBMM, BMWG, SONY,
3MMM, BOEG, PFIZ, NIKE, PNCF, FORD, ADDS, HOND, UNHE, JPMO, MEDT, TEXI,
ORCL, CACI]
2715
G.C[GOGL, FIDI, HEBB, DELA, SALF, NASA, MICR, PROG, ALLY, CISC, COST, SOAI,
P&GG, APPL, LOCM, INTL, ACCT, AMEX, CSCH, DELL, MATT, IBMM, BMWG, SONY,
3MMM, BOEG, PFIZ, NIKE, PNCF, FORD, ADDS, HOND, UNHE, JPMO, MEDT, TEXI,
ORCL, CACI]
2715
Table 10. Kendall’s τ correlation coefficients (%) for LOP solutions.
Table 10. Kendall’s τ correlation coefficients (%) for LOP solutions.
Diversity StudyEthnic Origin StudyGlobal Study
τ D.AD.BD.C τ EO.AEO.BEO.C τ G.AG.BG.C
D.A- EO.A- G.A-
D.B98- EO.B98- G.B99-
D.C9493-EO.C9896-G.C9999-
Table 11. Kendall’s τ correlation coefficients (%) for LOP and original rankings.
Table 11. Kendall’s τ correlation coefficients (%) for LOP and original rankings.
Diversity StudyEthnic Origin StudyGlobal Study
τ D.AD.BD.C τ EO.AEO.BEO.C τ G.AG.BG.C
Fa_D434347Fa_A586060Fo_D212221
Fo_D545450Fa_B333131Fo_W605959
Gl_D676767Fa_L949296Fo_G697069
Fa_N899087Fo_L787777
Fo_V444444
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Martín-Melero, I.; Hernández-Perlines, F.; Gómez-Martínez, R.; Medrano-García, M.L. Prediction and Ranking of Corporate Diversity in European and American Firms. Adm. Sci. 2025, 15, 406. https://doi.org/10.3390/admsci15110406

AMA Style

Martín-Melero I, Hernández-Perlines F, Gómez-Martínez R, Medrano-García ML. Prediction and Ranking of Corporate Diversity in European and American Firms. Administrative Sciences. 2025; 15(11):406. https://doi.org/10.3390/admsci15110406

Chicago/Turabian Style

Martín-Melero, Iñigo, Felipe Hernández-Perlines, Raúl Gómez-Martínez, and María Luisa Medrano-García. 2025. "Prediction and Ranking of Corporate Diversity in European and American Firms" Administrative Sciences 15, no. 11: 406. https://doi.org/10.3390/admsci15110406

APA Style

Martín-Melero, I., Hernández-Perlines, F., Gómez-Martínez, R., & Medrano-García, M. L. (2025). Prediction and Ranking of Corporate Diversity in European and American Firms. Administrative Sciences, 15(11), 406. https://doi.org/10.3390/admsci15110406

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop