Integrating Machine Learning with Multi-Criteria Decision-Making Models for Sustainable Supplier Selection in Dynamic Supply Chains

Gidiagba, Osheyor Joachim; Tartibu, Lagouge; Okwu, Modestus

doi:10.3390/logistics9040152

Open Access Article


by Osheyor Joachim Gidiagba *, Lagouge Tartibu and Modestus Okwu

Department of Mechanical and Industrial Engineering, University of Johannesburg, Johannesburg 2092, South Africa

* Author to whom correspondence should be addressed.

Logistics 2025, 9(4), 152; https://doi.org/10.3390/logistics9040152

Submission received: 15 September 2025 / Revised: 15 October 2025 / Accepted: 21 October 2025 / Published: 24 October 2025

(This article belongs to the Special Issue Multi-Criteria Decision-Making and Its Application in Sustainable Smart Logistics—2nd Edition)


Abstract

Background: Supplier evaluation and selection are pivotal processes in supply chain management, profoundly influencing organisational efficiency and sustainability. This study addresses the limitations of traditional multi-criteria decision-making approaches, particularly the Technique for Order Preference by Similarity to an Ideal Solution, which often lacks dimensional reduction capability and assumes uniform weight distribution across criteria. Methods: To overcome these challenges, a hybrid model integrating non-negative matrix factorisation, random forest, and the Technique for Order Preference by Similarity to an Ideal Solution is developed for supplier evaluation in the pharmaceutical sector. The method first applies non-negative matrix factorisation to condense twenty-four evaluation criteria into eight core dimensions, enhancing analytical efficiency and reducing complexity. Random forest is then employed to derive data-driven weights for each criterion, ensuring accurate prioritisation. Finally, the Technique for Order Preference by Similarity to an Ideal Solution ranks suppliers and provides actionable insights for decision-makers. Results: Results from real-world pharmaceutical data validate the model’s effectiveness and demonstrate superior performance over conventional evaluation methods. Conclusions: The findings confirm that integrating machine learning techniques with established decision-making frameworks enhances precision, interpretability, and sustainability in supplier selection while requiring adequate data quality and computational resources for implementation.

Keywords: supplier evaluation; sustainable supplier selection; random forest; MCDM; machine learning

1. Introduction

Supplier selection is a crucial step in supply chain management, through which decision-makers select the best suppliers for the services or products they wish to purchase [1]. To increase performance and sustain relationships over time, businesses usually look for the best suppliers. Furthermore, in many industries, the cost of raw materials accounts for most manufacturing expenses [2], so choosing a supplier is essential to a company’s financial health. Raw materials and services required to make a product typically account for 70% of a manufacturer’s selling price [3,4,5]. Consequently, a cost-effective supplier can drastically lower the supply chain’s expenses. Furthermore, suppliers have a direct impact on an organisation’s profitability [6,7].

The supplier selection (SS) process begins with the identification and verification of candidate suppliers and ends with the signing of a contract. Although choosing a supplier may appear simple, it is one of the most important steps in the supply chain [8]. To remain competitive, profitable, and secure, a company must therefore choose its suppliers carefully, according to the requirements of the situation [9,10,11]. The question that recurs throughout the SS process is which supplier is most likely to meet those requirements. The answer depends on several qualitative and quantitative parameters [12,13], because supplier selection is a multi-criteria problem [14,15,16,17]. Selecting the best supplier requires balancing these criteria and the trade-offs between them. In practice, however, it is not feasible to trade off every potential criterion, and not all criteria contribute equally to the decision. Irrelevant criteria make the SS process more complex and increase the chance of a wrong decision, while an overly broad set of factors can misguide the selection and harm the business’s performance and earnings. Identifying the significant criteria within the broad spectrum of candidates is therefore crucial. Because this is a multi-criteria problem, Multi-Criteria Decision-Making (MCDM) procedures are a natural fit [18]. MCDM offers an assessment framework that addresses real-world problems by using scientific analytical techniques to help decision-makers (DMs) arrive at workable answers [19,20].

Sorting and analysing big data is one of the most difficult tasks for large manufacturing-oriented businesses when evaluating suppliers, because the many criteria and options make up a massive data system. The question posed is which criteria most efficiently reveal the benefits and drawbacks of suppliers [21]. Data mining is a comprehensive approach to analysing large data. Data mining and knowledge discovery are computer science subfields that extract important information from massive amounts of data and transform it into concepts and structures that DMs can easily grasp [22]. According to Saura [23], businesses can waste a lot of effort creating, organising, and cleaning databases (which may contain information from users, suppliers, and consumers), and they can make better use of that time by relying on pertinent metrics and performance criteria when constructing and reviewing databases. The same work showed that one of the key responsibilities of data mining is identifying the fundamental criteria.

Furthermore, the volume of supplier performance history data, which records each supplier’s past performance, keeps growing over time. Compared to traditional methods, Machine Learning (ML) techniques can handle vast amounts of data more efficiently [24]. ML can therefore be more effective than traditional multi-criteria selection methods, which are occasionally unable to identify the true patterns in large datasets [25]. As an artificial intelligence (AI) technology, ML enables the effective selection of suppliers based on their prior performance [26,27]. Every supplier is judged on a set of qualities or standards, and in ML these qualities are represented as features. A feature selection (FS) algorithm can then be applied to determine the crucial supplier selection criteria. Further understanding of how the identified critical criteria affect the other criteria is also required, because the selection of suppliers depends on them.

While ML and MCDM offer different ranges of benefits, they also have different requirements. Therefore, combining them can offer opportunities to address more needs and gain access to additional advantages, as well as lessen the disadvantages that each type of strategy has on its own. The purpose of this study is to propose such an integrated strategy by combining machine learning techniques for dimension reduction with the computation of selection criteria weights, which are subsequently applied to supplier ranking. An actual case study involving a pharmaceutical corporation validates the suggested methodology. The following is a summary of this study’s contributions:

A novel integrated supplier selection strategy that combines the TOPSIS method with machine learning approaches.
A comparative analysis of various machine learning algorithms to ascertain their applicability within the framework of the suggested methodology.
A case study that illustrates the suggested methodology in a real-world scenario.

This paper’s remaining sections are organised as follows. The pertinent background information is compiled in Section 2, which also offers a targeted summary of important literature on integrated approaches to supplier selection. The proposed integrated method is discussed in Section 3. In this section, the TOPSIS approach used to rank suppliers is detailed, along with the machine learning techniques used for dimension reduction and determining the weight of criteria. To validate the suggested approach, a case study involving a pharmaceutical organisation is presented in Section 4. Section 5 wraps up by outlining several potential lines of inquiry for further study.

2. Literature Review

Suppliers give the supply chain system the materials, parts, and technology it needs to function. Supplier procurement operations can have a substantial impact on a company’s profitability, as procurement expenditures account for between 70% and 80% of most companies’ production costs. Additionally, the company’s turnover is heavily reliant on the resources and capabilities of these suppliers [28]. The main objectives of any supply chain management system are to effectively manage the flow of resources, data, and money to meet client demands and accomplish overarching corporate objectives [29,30]. As the primary operational motor, suppliers can either accelerate or decrease the efficacy of the supply chain [31]. Nevertheless, there are other downsides that make the supplier selection process more difficult. One of the main problems with SS is figuring out what criteria to include in the evaluation and selection process that are both acceptable and relevant [32]. The selection of suppliers ought to be guided by the criteria of objectivity, specificity, and comprehensiveness. Businesses must create a thorough and precise evaluation system before choosing suppliers. The criteria obtained through a literature review include financial capabilities, equipment management, human resource development, quality control, cost control, technology development, user happiness, delivery agreements, and environmental awareness [33]. To find suppliers for the cold supply chain (CSC), Ullah and Yousaf [34] examined fifteen different essential criteria. They discovered that “utilisation of resources” is the most crucial.

MCDM methods consider preferences across a variety of quantitative and qualitative criteria, which are typically conflicting and difficult to reconcile, making it more difficult to come to a consensus. Behavioural decision theory, computer science, economics, and information systems are among the fields that influence the creation of MCDM methodologies [1]. Numerous MCDM strategies have been put out in earlier research to assist businesses in choosing qualified suppliers. These include fuzzy set theory (FST) [35], data envelopment analysis (DEA) [36], the technique for order preference by similarity to ideal solution (TOPSIS) [37], the analytical hierarchy process (AHP) [38], and multi-objective programming [39,40]. To meet the demands of the decision-making scenarios, researchers have improved or combined these well-liked and traditional methods [41,42,43,44]. Most research, however, concentrated on supplier selection theories and methodologies, ignoring the development of criteria systems or only qualitatively evaluating them using pre-existing literature or professional judgement [45]. The calibre of the decision-making in the earlier stages has a significant impact on the calibre of the supplier that is ultimately chosen [46]. The complexity of supplier assessment difficulties and the unpredictability of human thought have led to a considerable increase in the number of studies on SS employing traditional approaches in the literature in recent years [47,48,49,50]. However, the SS process can be handled by machine learning [51,52].

ML primarily offers reliable information since it accurately forecasts the circumstances and assists in identifying the optimal course of action among the several options created during the study [53,54]. ML algorithms have been carefully examined in a number of studies. These techniques span supervised and unsupervised learning and include k-means, principal component analysis (PCA), random forest (RF), support vector machines (SVMs), artificial neural networks (ANNs), and others [55,56,57,58]. In a variety of domains, such as medical imaging, image classification, speech recognition, and other industrial contexts, machine learning techniques are effectively applied with remarkable outcomes on object detection problems [59] and dimension reduction [60]. Nonnegative Matrix Factorization (NMF) is an effective dimension reduction technique that outperforms classic linear methods and other techniques [61,62,63]. One of its primary strengths is its capacity to handle non-negative constraints, making it ideal for datasets with only positive values. Using this trait, NMF can extract parts-based and additive representations, exposing underlying patterns and features in data [64]. Furthermore, NMF’s inherent sparsity-promoting nature enables it to automatically choose relevant features, significantly lowering data dimensionality while retaining critical information. Unlike certain linear approaches, which may struggle with high-dimensional and complicated datasets, NMF is robust and scalable in such settings [65]. NMF’s interpretability is important because it allows researchers to obtain relevant insights into the data’s latent structure, which facilitates data exploration and analysis [66]. Overall, the combination of nonnegativity, sparsity, interpretability, and scalability makes NMF a versatile and compelling strategy for dimension reduction tasks, offering a viable alternative to other methods in the field [67].

Despite ML’s ability to handle complicated problems, its application to SS has been rather limited [68]. Moreover, Huo et al. [69] claim that RF feature selection models offer the most accurate predictions that closely match the real one. For example, RF helped assess a green supplier by exposing the pairwise correlations between the criteria [70]. Furthermore, RF aids in supplier ranking according to performance [71]. Additionally, RF establishes the process’s flexibility and versatility and makes supplier evaluation trustworthy [72].

There has been an extensive amount of study conducted regarding the application of several MCDM techniques combined with ML techniques to solve the drawbacks of each strategy in the supplier selection process. This section offers a selection of related research on the supplier selection and assessment challenge that has been addressed through the integration of ML algorithms with MCDM methodologies.

To highlight the specific research gap that the methodology suggested in Section 3 aims to fill, a critical review is provided, with a primary focus on recently published research. Using vast amounts of historical data, Neji et al. [73] showed how to apply data-driven MCDM to green supplier selection problems. Using Random Forests, they examined the connections between various supplier selection factors. After that, they utilised a combination of DEMATEL and ANP to determine the criteria weights. Multi-objective optimisation and ratio analysis were then used to evaluate suppliers by calculating the difference between ideal and current suppliers. Through a case study of a green supplier selection procedure used by a Taiwanese electronics company, the effectiveness of the approach was confirmed.

The strategy used by Cheng et al. [74] integrates ML models with several MCDM techniques. DEA and TOPSIS are combined to identify suppliers, and the resulting labelled dataset is then used to build a Support Vector Regression model that can flag undesired suppliers. A case study on a manufacturer of automation and electronic systems demonstrated the method’s accuracy and resilience. To evaluate customer satisfaction and pinpoint important supplier components, the authors of [75] investigate a combination of the Supply Chain Operations Reference (SCOR) 4.0 model and the Best-Worst Method (BWM), concentrating on sustainability and resilience in the pharmaceutical sector. A gradient boosting machine learning model is then used to categorise and rank suppliers according to their acceptability score, and the outcomes show the algorithm’s efficacy. Considering supplier selection in uncertain environments, the authors of [76] investigated the integration of fuzzy Delphi and fuzzy BWM to refine and weigh criteria and to prioritise suppliers under information uncertainty, and then used TOPSIS and Grey Correlation to select the best supplier and distribute orders.

A prevalent feature of hybrid MCDM/ML methodologies in the literature is that they prioritise performance over adaptability [1]. The low adoption of these solutions by supply chain stakeholders may also be explained by the fact that they are not easily integrated into procurement processes, even though they can successfully identify suitable suppliers more efficiently than typical MCDM approaches [77]. The main obstacle to adoption is frequently the inability to justify the choices made by an ML-based or MCDM/ML hybrid strategy. Abdulla et al. [78] have previously investigated interpretable machine learning techniques in conjunction with AHP to carry out supplier selection in this setting. To determine the most crucial selection criteria and weights that are then utilised to rank suppliers using AHP, a decision tree method was utilised. The results showed that by concentrating just on a subset of selection criteria, the decision tree algorithm could effectively determine the most crucial criteria, hence lessening the strain of applying the AHP approach. By investigating a greater range of machine learning algorithms outside of decision trees and considering a more modern MCDM approach created to address AHP’s problems, the methodology described in this work and detailed in the following section is continuing along the same trajectory.

3. Proposed Model

This section outlines the methodology employed and provides a detailed explanation of the computational process and solution procedure, which incorporates NMF, RF, and TOPSIS. As depicted in Figure 1, supplier performance criteria data is first processed using NMF to reduce dimensionality and identify the core criteria. Subsequently, RF is applied, based on input from decision-makers, to assign weights to the core criteria, reflecting their significance to the case company within its operational context. Lastly, the framework evaluates potential new suppliers, utilising TOPSIS to consolidate the evaluation data and rank the suppliers. The detailed calculation steps are presented in the following sections.

3.1. Identification of Criteria and Data Preprocessing

The six primary factors used by pharmaceutical firms are supplier profile, cost, quality, services, delivery, and overall staff competencies [79]. Table 1 shows how these main criteria are broken down into sub-criteria. A questionnaire is used to collect information on all 24 criteria from industrial experts. Business managers of 34 pharmaceutical companies were asked to rate the supplier selection criteria (c₁, c₂, …, c₂₄) from 0 (the least important) to 10 (the most important).

3.2. Dimension Reduction with NMF Method

3.2.1. Matrix Construction and Notation

In this analysis, the supplier selection process is represented by a matrix $X$ of dimensions $m \times n$, where $m$ is the number of suppliers and $n$ is the number of evaluation criteria. Each column $c_j$ (e.g., c₁, c₂, etc.) represents a unique criterion used to assess suppliers, such as quality, cost efficiency, or delivery performance [80].

This matrix X forms the basis for applying Non-negative Matrix Factorization (NMF).

NMF is a dimensionality reduction technique that factors the input matrix X into two non-negative matrices, W and H, giving the following [81]:

$$X \approx W H \tag{1}$$

Here:

$W \in \mathbb{R}^{m \times r}$ contains the weights of each supplier in $r$ latent components, representing supplier profiles.
$H \in \mathbb{R}^{r \times n}$ contains the contribution of each criterion in these components, capturing the importance of each criterion in forming these profiles.

The rank r (or number of components) controls the complexity of the model, balancing the fidelity of the approximation with interpretability and computational efficiency.

3.2.2. Optimisation Objective

The decomposition is achieved by minimising the reconstruction error, measured by the Frobenius norm of the difference between $X$ and its approximation $W H$, subject to both factors remaining non-negative:

$$\min_{W \geq 0,\, H \geq 0} \|X - W H\|_{F}^{2} \tag{2}$$

where $\|\cdot\|_F$ denotes the Frobenius norm, defined as follows:

$$\|X - W H\|_{F} = \sqrt{\sum_{i=1}^{m} \sum_{j=1}^{n} \left(X_{ij} - (W H)_{ij}\right)^{2}} \tag{3}$$

This objective ensures that the factorisation captures as much of the original data’s structure as possible, which is particularly useful when negative values have no interpretive meaning, as in supplier evaluation scores [82].
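The objective in Equations (2) and (3) can be illustrated with a minimal, dependency-free sketch. It uses the classic multiplicative update rules, which is an assumption: the text does not say which solver the authors used, and in practice a library implementation would normally be preferred. The score matrix, the rank `r=2`, and the helper names `matmul`, `transpose`, `frobenius_error` and `nmf` are all hypothetical.

```python
import random


def matmul(A, B):
    """Plain-Python product of A (p x q) and B (q x s)."""
    cols = list(zip(*B))
    return [[sum(a * b for a, b in zip(row, col)) for col in cols] for row in A]


def transpose(A):
    return [list(col) for col in zip(*A)]


def frobenius_error(X, W, H):
    """||X - WH||_F, the quantity defined in Equation (3)."""
    WH = matmul(W, H)
    return sum((x - y) ** 2 for rx, ry in zip(X, WH) for x, y in zip(rx, ry)) ** 0.5


def nmf(X, r, iterations=200, seed=0, eps=1e-9):
    """Factor a non-negative m x n matrix X into W (m x r) and H (r x n)."""
    rng = random.Random(seed)
    m, n = len(X), len(X[0])
    # Strictly positive random start so the multiplicative updates stay non-negative.
    W = [[rng.random() + 0.1 for _ in range(r)] for _ in range(m)]
    H = [[rng.random() + 0.1 for _ in range(n)] for _ in range(r)]
    errors = [frobenius_error(X, W, H)]
    for _ in range(iterations):
        # H <- H * (W^T X) / (W^T W H), element-wise
        Wt = transpose(W)
        num, den = matmul(Wt, X), matmul(matmul(Wt, W), H)
        H = [[h * a / (b + eps) for h, a, b in zip(hr, nr, dr)]
             for hr, nr, dr in zip(H, num, den)]
        # W <- W * (X H^T) / (W H H^T), element-wise
        Ht = transpose(H)
        num, den = matmul(X, Ht), matmul(W, matmul(H, Ht))
        W = [[w * a / (b + eps) for w, a, b in zip(wr, nr, dr)]
             for wr, nr, dr in zip(W, num, den)]
        errors.append(frobenius_error(X, W, H))
    return W, H, errors


# Hypothetical scores: 4 suppliers (rows) x 3 criteria (columns), scale 0-10.
X = [[8, 7, 2], [9, 8, 1], [2, 3, 9], [1, 2, 8]]
W, H, errors = nmf(X, r=2)
print(len(W), len(W[0]), len(H), len(H[0]))  # shapes of W and H: 4 2 2 3
print(errors[0], errors[-1])                 # reconstruction error before and after fitting
```

Rows of `W` play the role of supplier profiles and rows of `H` show how strongly each criterion loads on each latent component, mirroring the description of $W$ and $H$ above.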

3.2.3. Selecting the Optimal Rank r Using the Elbow Method and KneeLocator

Determining the optimal rank r is crucial for effective dimensionality reduction. A common approach is the “elbow method,” where the reconstruction error is plotted as a function of r. The point where the error reduction slows significantly, forming an “elbow,” indicates a rank that balances accuracy and simplicity [83]. To automate this process, the KneeLocator algorithm is employed, which detects the “elbow” or “knee” in a curve by identifying the point of maximum curvature.
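The idea behind automated elbow detection can be sketched without any dependencies. The heuristic below picks the point that lies farthest from the straight line joining the two ends of the curve; it is a simplified stand-in and not the KneeLocator implementation itself. The function name `find_elbow` and the error values are hypothetical and chosen only to produce a visible bend.

```python
def find_elbow(ranks, errors):
    """Return the rank farthest from the chord joining the curve's end points.

    Assumes errors decrease as the rank grows (errors[0] > errors[-1]).
    """
    x0, x1 = ranks[0], ranks[-1]
    y0, y1 = errors[0], errors[-1]
    # Rescale both axes to [0, 1] so neither one dominates the distance.
    xs = [(x - x0) / (x1 - x0) for x in ranks]
    ys = [(y - y1) / (y0 - y1) for y in errors]
    # After rescaling the chord is the line x + y = 1, so 1 - (x + y)
    # is proportional to how far each point sits below that chord.
    gaps = [1 - (x + y) for x, y in zip(xs, ys)]
    best = max(range(len(ranks)), key=lambda i: gaps[i])
    return ranks[best]


# Hypothetical reconstruction errors for ranks 4..12.
ranks = [4, 5, 6, 7, 8, 9, 10, 11, 12]
errors = [20.0, 17.0, 14.0, 11.0, 8.0, 7.5, 7.1, 6.8, 6.6]
print(find_elbow(ranks, errors))  # 8
```

With these illustrative numbers the error falls steeply up to rank 8 and flattens afterwards, so the heuristic returns 8.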

3.3. Random Forest Feature Selection

The following steps outline the intuition underlying the RF feature selection [84,85].

Step 1: From the original training dataset, construct $K$ classification trees.

Step 2: For each tree $t$ in the Random Forest, consider the associated out-of-bag sample $OOB_t$. The error of tree $t$ on this sample is denoted $\mathrm{errOOB}_t$.

Step 3: Randomly permute the values of criterion $C_j$ in $OOB_t$ to create a perturbed sample, denoted $OOB_t^{j}$, on which tree $t$ has error $\mathrm{errOOB}_t^{j}$. The importance of the feature (criterion) $C_j$ is then as follows [86]:

$$I(C_j) = \frac{1}{N_{tree}} \sum_{t=1}^{N_{tree}} \left(\mathrm{errOOB}_t^{j} - \mathrm{errOOB}_t\right) \tag{4}$$

where $N_{tree} = K$ is the number of trees in the forest.

From Equation (4), the importance of each feature can be measured to select the critical criteria.
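The permutation step can be made concrete with a small sketch of the averaged error difference over trees. It assumes each "tree" is simply a callable that maps a row to a class label and that each tree comes with its own out-of-bag rows and labels; the two threshold rules and the data below are hypothetical stand-ins for fitted trees, and the names `error_rate` and `permutation_importance` are illustrative.

```python
import random


def error_rate(predict, rows, labels):
    """Fraction of rows that the tree misclassifies."""
    return sum(predict(r) != y for r, y in zip(rows, labels)) / len(rows)


def permutation_importance(trees, oob_sets, j, seed=0):
    """Mean over trees of (error with column j permuted) - (baseline OOB error)."""
    rng = random.Random(seed)
    total = 0.0
    for predict, (rows, labels) in zip(trees, oob_sets):
        baseline = error_rate(predict, rows, labels)
        column = [r[j] for r in rows]
        rng.shuffle(column)  # break the link between criterion j and the label
        permuted = [r[:j] + [v] + r[j + 1:] for r, v in zip(rows, column)]
        total += error_rate(predict, permuted, labels) - baseline
    return total / len(trees)


# Hypothetical "trees": both look only at criterion 0 and ignore criterion 1.
def tree_a(row):
    return int(row[0] > 5)


def tree_b(row):
    return int(row[0] > 4)


trees = [tree_a, tree_b]
oob_sets = [
    ([[8, 3], [2, 7], [9, 1], [1, 6]], [1, 0, 1, 0]),
    ([[7, 2], [3, 9], [6, 4], [2, 5]], [1, 0, 1, 0]),
]
imp0 = permutation_importance(trees, oob_sets, j=0)
imp1 = permutation_importance(trees, oob_sets, j=1)
print(imp0, imp1)
```

Because neither toy tree reads criterion 1, shuffling it cannot change any prediction and its importance is exactly zero, whereas shuffling criterion 0 can only keep or raise the error.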

The effectiveness of an ML model is indicated by performance measures. Despite the abundance of such measures, most earlier research relied on accuracy and F-score to assess model quality [87,88,89]. Therefore, in this work, RF classifier performance is measured using accuracy and F-score, given by Equations (5) and (6).

$$\mathrm{Accuracy} = \frac{T_{p} + T_{n}}{T_{p} + F_{p} + F_{n} + T_{n}} \tag{5}$$

$$F\text{-}\mathrm{score} = \frac{2 \times \mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}} \tag{6}$$

where $T_p$, $T_n$, $F_p$ and $F_n$ are the numbers of true positives, true negatives, false positives and false negatives, $\mathrm{Precision} = T_p / (T_p + F_p)$ and $\mathrm{Recall} = T_p / (T_p + F_n)$.
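As a quick worked illustration of the two measures, the sketch below computes accuracy and the balanced F-score (the harmonic mean of precision and recall) from confusion-matrix counts. The counts are hypothetical and are not taken from the study.

```python
def accuracy(tp, tn, fp, fn):
    """Share of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)


def f_score(tp, fp, fn):
    """Harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical confusion-matrix counts for a binary supplier classifier.
tp, tn, fp, fn = 40, 45, 5, 10
print(round(accuracy(tp, tn, fp, fn), 4))  # 0.85
print(round(f_score(tp, fp, fn), 4))       # 0.8421
```

The two numbers differ because accuracy rewards correct negatives, while the F-score depends only on how the positive class is handled.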

3.4. TOPSIS

Throughout the past few decades, the TOPSIS model has been widely used in a variety of research domains to help with decision-making by rating multiple options according to how close or similar they are to an ideal answer [90,91,92]. The TOPSIS model is used in this study to rank the suppliers after evaluating their performances in relation to sustainability aspects. The steps below are used to formulate the TOPSIS model:

Step 1: Evaluate supplier performance in relation to sustainability considerations:

The questionnaires created specifically for this study are used for this. The respondents used the linguistic scale in Table 2 to rate the vendors’ performance on each question.

Step 2: Normalise the decision matrix:

The normalised decision matrix can be computed as follows:

$$D = [d_{ij}]_{m \times n} \tag{7}$$

where $i = 1, \dots, m$ and $j = 1, \dots, n$, and $D$ is the normalised decision matrix with $m$ rows and $n$ columns.

The sustainability criteria are divided into two categories: “benefit criteria,” for which a higher rating is better, and “cost criteria,” for which a lower rating is better. Each rating is expressed as a triple $(r_{ij}, m_{ij}, q_{ij})$ taken from the linguistic scale.

Benefit criteria:

$$d_{ij} = \left(\frac{r_{ij}}{q_{j}^{+}}, \frac{m_{ij}}{q_{j}^{+}}, \frac{q_{ij}}{q_{j}^{+}}\right) \tag{8}$$

where $q_{j}^{+} = \max_{i} q_{ij}$.

Cost criteria:

$$d_{ij} = \left(\frac{r_{j}^{-}}{q_{ij}}, \frac{r_{j}^{-}}{m_{ij}}, \frac{r_{j}^{-}}{r_{ij}}\right) \tag{9}$$

where $r_{j}^{-} = \min_{i} r_{ij}$.

Step 3: Determine the weighted normalised decision matrix:

The weighted normalised decision matrix is calculated as follows:

$$W = [w_{ij}]_{m \times n} \tag{10}$$

where $i = 1, \dots, m$ and $j = 1, \dots, n$.

$$w_{ij} = d_{ij} \times v_{j} \tag{11}$$

where $v_{j}$ is the weight of the $j$th criterion, applied to the rating of the $i$th supplier alternative.

Step 4: Calculate the positive ideal solution ($P^{+}$) and negative ideal solution ($P^{-}$):

Benefit criteria:

$$P^{+} = (w_{1}^{+}, w_{2}^{+}, \dots, w_{n}^{+}) \tag{12}$$

$$P^{-} = (w_{1}^{-}, w_{2}^{-}, \dots, w_{n}^{-}) \tag{13}$$

Cost criteria:

$$P^{+} = (w_{1}^{-}, w_{2}^{-}, \dots, w_{n}^{-}) \tag{14}$$

$$P^{-} = (w_{1}^{+}, w_{2}^{+}, \dots, w_{n}^{+}) \tag{15}$$

where $w_{j}^{+} = \max_{i} \{w_{ij}\}$ and $w_{j}^{-} = \min_{i} \{w_{ij}\}$; $i = 1, \dots, m$ and $j = 1, \dots, n$.

Step 5: Determine the separation measures of each supplier alternative from the positive and negative ideal solutions:

$$b_{i}^{+} = \sum_{j=1}^{n} b_{t}(w_{ij}, w_{j}^{+}), \quad i = 1, \dots, m \tag{16}$$

$$b_{i}^{-} = \sum_{j=1}^{n} b_{t}(w_{ij}, w_{j}^{-}), \quad i = 1, \dots, m \tag{17}$$

where $b_{t}(\cdot, \cdot)$ is the distance between two corresponding numbers on the linguistic scale.

Step 6: Computation of the closeness coefficient ($A_i$) for each supplier alternative:

$$A_{i} = \frac{b_{i}^{-}}{b_{i}^{+} + b_{i}^{-}}, \quad i = 1, \dots, m \tag{18}$$

Step 7: Prioritisation of supplier alternatives:

The supplier options are ranked according to the closeness coefficient: the best option is the one closest to the positive ideal solution and farthest from the negative ideal solution (highest closeness coefficient), and the worst option is the one closest to the negative ideal solution (lowest closeness coefficient). Once these procedures are finished, a ranking of all potential suppliers has been determined. If stakeholders are interested in evaluating several possibilities, this ranking can be the direct output of the approach; if just one supplier is to be selected, the output can simply be the supplier with the highest score.
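The seven steps can be traced end to end with a small sketch. It makes several simplifying assumptions that are not from the source: ratings are single crisp numbers instead of linguistic triples, the distance between two values is their absolute difference, and because cost columns are rescaled as minimum over value, every normalised column is treated as "larger is better" when the ideal solutions are formed. The supplier names, scores and weights are hypothetical.

```python
def topsis(scores, weights, is_benefit):
    """Return one closeness coefficient per supplier (row of scores)."""
    m, n = len(scores), len(weights)
    cols = list(zip(*scores))
    # Step 2: benefit columns are divided by their maximum,
    # cost columns are rescaled as minimum / value.
    norm = [[scores[i][j] / max(cols[j]) if is_benefit[j]
             else min(cols[j]) / scores[i][j]
             for j in range(n)] for i in range(m)]
    # Step 3: apply the criterion weights.
    weighted = [[norm[i][j] * weights[j] for j in range(n)] for i in range(m)]
    # Step 4: ideal and anti-ideal values per criterion.
    wcols = list(zip(*weighted))
    ideal = [max(c) for c in wcols]
    anti = [min(c) for c in wcols]
    # Steps 5 and 6: summed distances and closeness coefficient.
    closeness = []
    for row in weighted:
        d_plus = sum(abs(v - p) for v, p in zip(row, ideal))
        d_minus = sum(abs(v - p) for v, p in zip(row, anti))
        closeness.append(d_minus / (d_plus + d_minus))
    return closeness


# Hypothetical data: columns are quality (benefit) and price (cost).
names = ["A", "B", "C"]
scores = [[9, 100], [7, 80], [5, 120]]
weights = [0.6, 0.4]
is_benefit = [True, False]

closeness = topsis(scores, weights, is_benefit)
# Step 7: rank suppliers from highest to lowest closeness coefficient.
ranking = [name for _, name in sorted(zip(closeness, names), reverse=True)]
print([round(a, 4) for a in closeness])  # [0.8, 0.6667, 0.0]
print(ranking)                           # ['A', 'B', 'C']
```

Supplier C scores worst on both criteria, so it coincides with the negative ideal solution and receives a closeness coefficient of zero. The sketch does not guard against the degenerate case in which all suppliers are identical.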

4. Case Study

A real-life case study was considered to illustrate the approach and analyse its effectiveness. A questionnaire is used to collect information about the importance of 24 criteria across thirty-four (34) pharmaceutical companies. This section provides a detailed explanation of how the proposed approach was applied to the data. To rate the supplier selection criteria (c₁, c₂, …, c₂₄) from 0 (the least important) to 10 (the most important), the business managers of 34 pharmaceutical companies responded as shown in Table 3:

4.1. Model Establishment and Calculation of NMF

To identify the most impactful criteria for supplier selection, an overall score was computed for each criterion based on its contribution across all latent factors derived from the NMF decomposition as shown in Table 4. Specifically, the contributions of each criterion were summed across the 7 selected factors, resulting in a cumulative score that reflects the overall importance of each criterion within the model.

NMF is adopted for dimension reduction, identifying the core criteria for the evaluation framework. Figure 2 shows the 8 criteria obtained from the original 24, which are the main criteria for classifying suppliers’ ratings. As a result, the original 34 × 24 data matrix was reduced to 34 × 8. To ensure reliable dimensionality reduction, the parameter settings for Non-negative Matrix Factorization were determined through an iterative evaluation of reconstruction accuracy and interpretability. The number of latent components (k, the rank r of Section 3.2) was varied between 6 and 12, and the model’s performance was assessed based on the Frobenius norm of the reconstruction error. The configuration k = 8 was selected because it produced the lowest reconstruction error while maintaining clear interpretability of the resulting factors in the context of supplier evaluation. This significantly reduces the amount of data and suppresses noise, which improves the accuracy of the evaluation.
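To make the size of the reduction concrete, the shapes can be written out for 34 rows, 24 criteria and 8 components, as stated above. The entry counts below are simple arithmetic added for illustration and are not figures reported by the authors.

```latex
X_{34 \times 24} \approx W_{34 \times 8} \, H_{8 \times 24}
\qquad
\underbrace{34 \times 24}_{\text{entries of } X} = 816,
\qquad
\underbrace{34 \times 8}_{\text{entries of } W} + \underbrace{8 \times 24}_{\text{entries of } H} = 272 + 192 = 464
```

Downstream steps then work with the 34 × 8 representation in place of the full 34 × 24 matrix.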

Using the elbow method, the cumulative scores were analysed to find the point beyond which additional criteria provided diminishing returns in contribution. This approach allowed the selection of the top eight (8) criteria that collectively accounted for the most significant influence on supplier selection decisions. The graphical representation of the selected criteria using the elbow method is presented in Figure 3.

The main critical criteria identified by the NMF analysis are as follows:

Product reliability, which evaluates the overall quality of the product offered by the supplier.
Record history, which considers the supplier’s documented history of performance, including reliability and compliance.
Purchase price, which assesses the cost-effectiveness of the supplier’s products, an essential factor in budget considerations.
Management quality, which reviews the supplier’s management structure and organisational efficiency, both of which affect reliability.
Technical competence, which examines the supplier’s technical competencies, including specialised skills and technologies.
Payment conditions, which looks at the flexibility and conditions of payment, influencing financial feasibility.
Customer relations, which evaluates the supplier’s approach to managing customer relationships.
Financial strength, which considers the financial health of the supplier, critical for assessing stability.

These criteria represent a balanced view of the supplier’s capabilities, covering quality, cost, management, technical strength, financial strength, and customer relations. Selecting these top criteria based on their cumulative scores helps in constructing a robust evaluation framework that prioritises the most influential factors in supplier selection.

4.2. Using Random Forest to Obtain Criteria Weights

To determine the importance of each criterion in the supplier selection process, a machine learning classification approach is employed. This process involves constructing pipelines that combine data scaling methods with various classifiers, followed by hyperparameter tuning and feature importance extraction. Two scaling techniques were adopted, StandardScaler and MinMaxScaler, to normalise the data so that each feature contributes on a comparable scale. Four classifiers were applied: Random Forest Classifier, Support Vector Classifier (SVC), K-Nearest Neighbours (KNN), and Logistic Regression. For each combination of scaler and classifier, a pipeline was created to streamline preprocessing and model fitting.

GridSearchCV was utilised to perform hyperparameter tuning, testing various parameter values to optimise each classifier’s performance. The hyperparameter grid was specific to each model, with parameters such as the number of estimators and maximum depth for Random Forest, and the regularisation parameter C for SVC. Cross-validation was used with accuracy as the scoring metric to identify the best-performing pipeline for each combination. To optimise the Random Forest model and ensure robust feature weighting, a 10-fold cross-validation procedure was implemented. Parameter tuning was performed using a grid search approach, varying the number of trees (from 100 to 1000) and maximum tree depth (from 5 to 15). After the best pipeline was identified, it was further evaluated by extracting feature importances, which are available for models such as Random Forest. The feature importance indicates the relative weight of each criterion in the model, enabling a ranked prioritisation of criteria. The results showed that StandardScaler with the Random Forest Classifier achieved the highest accuracy. The extracted feature importances revealed that technical competence (c₁₅), management quality (c₁₄), and product reliability (c₄) had the greatest impact, suggesting these factors are crucial for accurate supplier classification. This machine-learning-driven weighting provides an evidence-based approach for prioritising criteria, enhancing the robustness of the supplier selection framework.
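The mechanics of grid search with k-fold cross-validation can be shown with a dependency-free sketch. This is not the authors' pipeline: for brevity it tunes the neighbour count of a tiny k-nearest-neighbours classifier (one of the four model families named above) in place of a Random Forest, and it scores each candidate by pooled held-out accuracy. The data, the grid and the function names are hypothetical.

```python
import random


def knn_predict(train_X, train_y, x, k):
    """Majority vote among the k training points nearest to x."""
    order = sorted(range(len(train_X)),
                   key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], x)))
    votes = [train_y[i] for i in order[:k]]
    return max(set(votes), key=votes.count)


def cv_accuracy(X, y, k, n_folds=10, seed=0):
    """Pooled held-out accuracy of k-NN across n_folds folds."""
    idx = list(range(len(X)))
    random.Random(seed).shuffle(idx)
    folds = [idx[f::n_folds] for f in range(n_folds)]
    correct = 0
    for fold in folds:
        held_out = set(fold)
        train = [i for i in idx if i not in held_out]
        train_X = [X[i] for i in train]
        train_y = [y[i] for i in train]
        # Score only the rows that were held out of training for this fold.
        correct += sum(knn_predict(train_X, train_y, X[i], k) == y[i] for i in fold)
    return correct / len(X)


def grid_search(X, y, grid, n_folds=10):
    """Evaluate every candidate in the grid and return the best one with all scores."""
    scores = {k: cv_accuracy(X, y, k, n_folds) for k in grid}
    best = max(scores, key=scores.get)
    return best, scores


# Hypothetical, well-separated toy data: two features per row, binary label.
X = [[0.1 * i, 0.0] for i in range(10)] + [[10 + 0.1 * i, 0.0] for i in range(10)]
y = [0] * 10 + [1] * 10
best_k, scores = grid_search(X, y, grid=[1, 3, 5])
print(best_k, scores)
```

The same loop structure applies to any classifier and any grid: each candidate setting is fitted on nine tenths of the data, scored on the remaining tenth, and the setting with the highest cross-validated accuracy is kept.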

Four models were compared to determine criteria importance: Random Forest, SVM, Logistic Regression, and KNN. The accuracy of each model is reported in Table 5, which shows Random Forest to be the most accurate, with an accuracy score of 84.3%.

The weight distribution in Table 6 shows that technical competence is the most important criterion, with a weight of 0.2366, followed by management quality with a weight of 0.1564 and product reliability with a weight of 0.1457. The three lowest-weighted criteria are record history, financial strength, and purchase price.

4.3. Using TOPSIS to Integrate the Performance of Suppliers and Their Priority Ranking

Finally, TOPSIS is used to integrate the suppliers’ performance data into a final performance score, which determines the ranking of the suppliers. In this case, performance data for four suppliers were collected, as shown in Table 7, Table 8, Table 9 and Table 10. The decision-maker needed to assess the suppliers only on the core criteria, which saved considerable time and investigation cost.

The results indicate that supplier S1* is the most appropriate supplier, with a performance score of 0.7089, and S3* is rated second, with a performance score of 0.6355. If a third supplier needs to be included, S2* can be selected (Table 11).
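For readers unfamiliar with the ranking step, the following is a minimal sketch of textbook TOPSIS with vector normalisation and Euclidean distances. It assumes NumPy; the function name `topsis`, the score matrix, the weights, and the benefit/cost flags are all hypothetical and do not reproduce Tables 7 to 11 or the exact normalisation used in the study.

```python
import numpy as np

def topsis(matrix, weights, benefit):
    """Return closeness coefficients (higher is better), one per alternative.

    matrix  : alternatives x criteria performance scores
    weights : criterion weights (rescaled here to sum to 1)
    benefit : True for criteria to maximise, False for criteria to minimise
    """
    X = np.asarray(matrix, dtype=float)
    w = np.asarray(weights, dtype=float)
    w = w / w.sum()
    benefit = np.asarray(benefit, dtype=bool)

    # Step 1: vector-normalise each criterion column.
    R = X / np.linalg.norm(X, axis=0)
    # Step 2: apply the criterion weights.
    V = R * w
    # Step 3: ideal and anti-ideal solutions, respecting benefit/cost direction.
    ideal = np.where(benefit, V.max(axis=0), V.min(axis=0))
    anti = np.where(benefit, V.min(axis=0), V.max(axis=0))
    # Step 4: Euclidean distance of each alternative to both reference points.
    d_pos = np.linalg.norm(V - ideal, axis=1)
    d_neg = np.linalg.norm(V - anti, axis=1)
    # Step 5: relative closeness to the ideal solution.
    # (Undefined if all alternatives are identical, since both distances are 0.)
    return d_neg / (d_pos + d_neg)

# Hypothetical scores for four suppliers on three criteria:
# technical competence, management quality (benefit) and defective rate (cost).
scores = [[8, 7, 2],
          [6, 8, 4],
          [7, 6, 3],
          [5, 5, 5]]
weights = [0.5, 0.3, 0.2]          # illustrative weights, not Table 6
benefit = [True, True, False]

closeness = topsis(scores, weights, benefit)
order = np.argsort(-closeness)     # indices from best to worst
for rank, idx in enumerate(order, start=1):
    print(f"rank {rank}: S{idx + 1} (closeness = {closeness[idx]:.4f})")
```

A closeness coefficient of 1 means the supplier coincides with the ideal solution on every criterion, and 0 means it coincides with the anti-ideal solution.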

5. Discussion

The supplier ranking results demonstrate strong stability even when moderate variations are introduced into the importance weights of evaluation criteria. This robustness indicates that the proposed hybrid model is not overly sensitive to small fluctuations in input parameters, thereby enhancing its practical reliability and making it suitable for real-world procurement decision-making under uncertain conditions. The findings confirm that supplier evaluation in contemporary supply chains extends beyond economic considerations to encompass technical, managerial, and sustainability-related factors.
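One way such a stability check could be carried out is sketched below. This is an assumed procedure, not the study's documented one: each weight is perturbed independently by up to ±10% (the magnitude is an assumption, since the text says only "moderate variations"), TOPSIS is recomputed, and the share of trials that reproduce the baseline ranking is recorded. The score matrix and weights are hypothetical.

```python
import numpy as np

def topsis(X, w, benefit):
    """Compact textbook TOPSIS: closeness coefficients, higher is better."""
    R = X / np.linalg.norm(X, axis=0)
    V = R * (w / w.sum())
    ideal = np.where(benefit, V.max(axis=0), V.min(axis=0))
    anti = np.where(benefit, V.min(axis=0), V.max(axis=0))
    d_pos = np.linalg.norm(V - ideal, axis=1)
    d_neg = np.linalg.norm(V - anti, axis=1)
    return d_neg / (d_pos + d_neg)

# Hypothetical data: four suppliers, three criteria (the last one is a cost).
X = np.array([[8, 7, 2],
              [6, 8, 4],
              [7, 6, 3],
              [5, 5, 5]], dtype=float)
w = np.array([0.5, 0.3, 0.2])
benefit = np.array([True, True, False])

rng = np.random.default_rng(0)
baseline = np.argsort(-topsis(X, w, benefit))  # ranking with unperturbed weights

n_trials = 1000
unchanged = 0
for _ in range(n_trials):
    # Scale each weight by a random factor in [0.9, 1.1]; topsis() renormalises.
    w_perturbed = w * rng.uniform(0.9, 1.1, size=w.size)
    order = np.argsort(-topsis(X, w_perturbed, benefit))
    unchanged += int(np.array_equal(order, baseline))

stability = unchanged / n_trials
print(f"share of trials with an unchanged ranking: {stability:.3f}")
```

A share close to 1 would indicate that the ranking is insensitive to weight changes of this size; widening the perturbation interval shows where the ranking starts to change.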

The analysis revealed that technical competence (c₁₅), management quality (c₁₄), and product reliability (c₄) carry the highest importance weights as determined by the Random Forest model. These attributes are essential for ensuring long-term supplier competence, process reliability, and innovation, all critical enablers of sustainable supply chain performance. From a sustainability perspective, strong technical and managerial capacities support environmental compliance, resource efficiency, and quality assurance systems that reduce waste and improve operational resilience.

In addition to technical and managerial dimensions, the study recognises that environmental and social sustainability aspects, including adherence to environmental management standards, occupational health and safety practices, and corporate social responsibility are integral to sustainable supplier selection. By embedding these dimensions within the evaluation framework, the proposed model promotes a balanced approach that values both profitability and sustainability, aligning with global supply chain sustainability objectives.

The integration of the Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS) further enhances the interpretability and objectivity of supplier ranking. The hybrid data-driven approach, which combines Non-negative Matrix Factorization for dimensionality reduction and Random Forest for data-derived weighting, minimises human bias and enhances analytical transparency. This positions the model as an intelligent decision-support system capable of improving supplier evaluation accuracy, strengthening sustainability performance assessment, and supporting strategic sourcing decisions. Overall, the study underscores that sustainable supplier evaluation should integrate economic efficiency, technical capability, environmental stewardship, and social responsibility. The proposed hybrid framework provides procurement professionals with a reliable and scalable tool that advances both operational excellence and sustainability performance across the supply chain.

5.1. Theoretical Implications

This study contributes to the literature on multi-criteria decision-making (MCDM) and supplier selection by proposing a hybrid model that combines machine learning techniques with classical MCDM tools. The novelty of this study lies in its unique combination of Random Forest (RF) for feature importance with Non-negative Matrix Factorization (NMF) for dimensionality reduction, followed by Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) for final decision-making. While previous studies have explored hybrid frameworks such as RF combined with Analytic Hierarchy Process (AHP) [93] or RF integrated with DEMATEL [94], these models rely on expert judgement for pairwise comparisons or influence relationships, which introduces subjectivity and limits scalability. In contrast, the proposed model offers a fully data-driven pipeline where RF automates the determination of criteria importance, NMF eliminates noise and redundancy in high-dimensional data, and TOPSIS ensures interpretability and structured ranking. This eliminates dependency on extensive stakeholder inputs and improves computational efficiency, which is particularly advantageous in industrial contexts where procurement data is abundant but expert availability is limited. Specifically:

(i): It bridges the gap between traditional supplier selection approaches and intelligent decision support systems, integrating machine learning to automate weight generation and dimension reduction through RF and NMF, respectively.
(ii): The approach introduces an interpretable and systematic method of weighting supplier evaluation criteria using RF, addressing the subjective biases associated with expert-based methods.
(iii): The study further validates the application of tree-based machine learning models within MCDM frameworks, enhancing the interpretability and justification of decision processes.
(iv): By validating the proposed model against human expert decisions and comparing it to other established MCDM techniques, the study strengthens the theoretical reliability and replicability of hybrid evaluation methods.
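The dimension-reduction stage of the pipeline can be illustrated with a short sketch, assuming scikit-learn's `NMF` and random non-negative data in place of the real supplier records. The reduction from 24 criteria to 8 dimensions follows the study; the candidate ranks, the `nndsvda` initialisation, and the heuristic of labelling each dimension by its largest-loading criterion are assumptions made for illustration.

```python
import numpy as np
from sklearn.decomposition import NMF

# Hypothetical non-negative ratings: 100 supplier records x 24 criteria.
rng = np.random.default_rng(0)
X = rng.uniform(1, 10, size=(100, 24))

# Reconstruction error for several candidate ranks (cf. an error-vs-rank plot).
errors = {}
models = {}
for rank in [2, 4, 6, 8, 10, 12]:
    model = NMF(n_components=rank, init="nndsvda", max_iter=1000, random_state=0)
    model.fit(X)
    errors[rank] = model.reconstruction_err_
    models[rank] = model

for rank, err in errors.items():
    print(f"rank {rank:2d}: reconstruction error = {err:.3f}")

# Factorisation at the rank used in the study: X is approximated by W @ H.
nmf = models[8]
W = nmf.transform(X)      # (100, 8): each record expressed in 8 dimensions
H = nmf.components_       # (8, 24): loading of each criterion on each dimension

# Assumed heuristic: label each dimension by its largest-loading criterion.
# Two dimensions may share the same criterion, so the labels need not be unique.
dominant = H.argmax(axis=1)
print("dominant criterion index per dimension:", dominant.tolist())
```

Because both factors are constrained to be non-negative, each dimension is an additive combination of criteria, which makes the loadings in `H` easier to read than signed components.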

5.2. Managerial Implications

For practitioners in the pharmaceutical sector, this study offers practical and actionable guidance for improving supplier selection and procurement strategy:

(i): Efficiency and Cost Reduction: The hybrid model significantly reduces decision-making time and cost by automating data pre-processing, feature selection, and ranking through the integration of Non-negative Matrix Factorization and Random Forest. In the pharmaceutical case study, this automation simplified the evaluation of 24 complex supplier criteria into 8 key performance dimensions, enabling procurement teams to make faster, evidence-based decisions without compromising accuracy.
(ii): Sustainability-Oriented Decision Support: The model enables procurement managers to prioritise suppliers using multidimensional sustainability criteria, covering technical capability, environmental compliance, and social responsibility. This is especially critical in the pharmaceutical industry, where supplier quality directly affects regulatory compliance, patient safety, and sustainable production practices.
(iii): System Integration and Scalability: The framework is designed to be ERP-compatible and can be embedded into existing procurement systems such as SAP Ariba, Oracle SCM, or Microsoft Dynamics through API integration or data import modules. The model’s outputs (supplier rankings and performance weights) can be periodically updated using live procurement data, allowing for continuous monitoring and dynamic supplier performance evaluation.
(iv): Reducing Subjectivity and Enhancing Transparency: By using objective, data-driven weighting and ranking, the model minimises cognitive bias in supplier assessment. Procurement managers can rely on transparent, repeatable evaluation logic that aligns supplier performance with strategic and sustainability goals specific to regulated industries.
(v): Cross-Industry Applicability: While validated using pharmaceutical supplier data, the model’s modular architecture allows for easy adaptation to other sectors such as mining, manufacturing, and energy, where complex supplier networks and sustainability compliance are equally critical. This positions the framework as a versatile tool for organisations aiming to institutionalise intelligent, data-driven supplier management practices.

6. Conclusions

This study developed a hybrid supplier ranking and selection framework that integrates data-driven machine learning techniques with a structured multi-criteria decision-making approach. The model effectively addresses the long-standing challenges of identifying relevant evaluation criteria and assigning objective weights by automating these processes through dimension reduction and feature importance analysis using historical procurement data. This automation enhances efficiency, transparency, and reliability in supplier assessment, reducing dependence on subjective stakeholder inputs and time-consuming consensus-building procedures. The proposed framework was validated through a pharmaceutical sector case study, demonstrating its ability to produce supplier rankings consistent with both expert judgments and results from established MCDM methods. Importantly, the findings reveal that sustainable supplier selection extends beyond economic factors to include technical capability, managerial effectiveness, and compliance with environmental and social responsibility standards. By embedding sustainability criteria directly into the evaluation process, the model supports organisations in aligning procurement decisions with broader corporate sustainability and regulatory objectives. Overall, this study contributes a robust, interpretable, and sustainability-oriented decision-support tool for supplier evaluation. The hybrid model not only strengthens analytical rigour but also promotes responsible sourcing practices, offering a scalable framework adaptable to diverse industrial contexts such as pharmaceuticals, mining, and manufacturing.

Research Limitations and Future Direction

While the proposed hybrid supplier selection framework demonstrates strong potential, several limitations should be acknowledged when considering its wider application. First, since the model integrates machine learning techniques, it requires access to adequate and high-quality data. Its performance may be constrained in organisations with limited digitalization or where data availability is restricted due to confidentiality or inconsistent recording practices. Secondly, the model is particularly beneficial in contexts involving many evaluation criteria, where feature selection and dimensionality reduction yield clear efficiency gains. In supplier selection settings with only a few criteria, the computational advantage of applying data-driven algorithms may be marginal.

Looking ahead, future research can be structured around four prioritised directions:

1: Cross-sectoral Validation:

Further case studies in sectors such as food, mining, automotive, and oil and gas are recommended to validate the model’s adaptability and performance across diverse supply chain environments.

2: System Integration and Practical Deployment:

Future work should explore how the hybrid model can be seamlessly embedded within enterprise procurement workflows, particularly through integration with ERP systems (e.g., SAP, Oracle, or Microsoft Dynamics). Developing the model as a service-based decision-support module would enhance its accessibility and enable continuous, automated supplier evaluation.

3: Expansion of Decision Scope:

Extending the framework to include supplier order allocation, logistics coordination, and delivery performance could transform it into a more comprehensive business intelligence tool for strategic sourcing and supply management.

4: Incorporation of Natural Language and Large Language Models (LLMs):

Future studies should investigate how natural language processing (NLP) and LLMs can complement traditional data-driven criteria by analysing unstructured information such as supplier reports, audits, and sustainability disclosures. While recent studies have highlighted the promise of NLP–MCDM integrations for text-based decision support, practical challenges remain. These include data privacy, model explainability, and ensuring contextual accuracy when interpreting qualitative supplier data. Addressing these challenges will be key to ensuring responsible and transparent adoption of LLM-assisted decision frameworks.

In summary, the proposed research trajectory prioritises validation, system integration, scope expansion, and intelligent automation. Collectively, these directions aim to evolve the current model into a robust, interpretable, and context-aware decision-support ecosystem for sustainable supplier selection across multiple industries.

Author Contributions

Conceptualization, O.J.G.; Methodology, O.J.G.; Writing—original draft, O.J.G.; Writing—review & editing, L.T. and M.O. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Abdulla, A.; Baryannis, G.; Badi, I. An integrated machine learning and MARCOS method for supplier evaluation and selection. Decis. Anal. J. 2023, 9, 100342.
2. Gergin, R.E.; Peker, İ.; Kısa, A.C.G. Supplier selection by integrated IFDEMATELIFTOPSIS method: A case study of automotive supply industry. Decis. Mak. Appl. Manag. Eng. 2022, 5, 169–193.
3. Ghobadian, A.; Stainer, A.; Liu, J.; Kiss, T. A computerised vendor rating system. Dev. Logist. Supply Chain Manag. 2016, 103–112.
4. Azeem, M.; Haleem, A.; Bahl, S.; Javaid, M.; Suman, R.; Nandan, D. Big data applications to take up major challenges across manufacturing industries: A brief review. Mater. Today Proc. 2021, 49, 339–348.
5. Gregory, T.D.; Perry, M.L.; Albertus, P. Cost and price projections of synthetic active materials for redox flow batteries. J. Power Sources 2021, 499, 229965.
6. Memari, A.; Dargi, A.; Akbari Jokar, M.R.; Ahmad, R.; Abdul Rahim, A.R. Sustainable supplier selection: A multi-criteria intuitionistic fuzzy TOPSIS method. J. Manuf. Syst. 2019, 50, 9–24.
7. Buyukozkan, G.; Gocer, F. A novel approach integrating AHP and COPRAS under pythagorean fuzzy sets for digital supply chain partner selection. IEEE Trans. Eng. Manag. 2019, 68, 1486–1503.
8. Ali, M.R.; Nipu, S.M.A.; Khan, S.A. A decision support system for classifying supplier selection criteria using machine learning and random forest approach. Decis. Anal. J. 2023, 7, 100238.
9. Chen, K.S.; Wang, C.H.; Tan, K.H. Developing a fuzzy green supplier selection model using six sigma quality indices. Int. J. Prod. Econ. 2019, 212, 1–7.
10. Liu, Y.; Eckert, C.; Yannou-Le Bris, G.; Petit, G. A fuzzy decision tool to evaluate the sustainable performance of suppliers in an agrifood value chain. Comput. Ind. Eng. 2019, 127, 196–212.
11. Sumrit, D. Supplier selection for vendor-managed inventory in healthcare using fuzzy multi-criteria decision-making approach. Decis. Sci. Lett. 2020, 9, 233–256.
12. Liao, C.N.; Kao, H.P. An integrated fuzzy TOPSIS and MCGP approach to supplier selection in supply chain management. Expert. Syst. Appl. 2011, 38, 10803–10811.
13. Govindan, K.; Rajendran, S.; Sarkis, J.; Murugesan, P. Multi criteria decision making approaches for green supplier evaluation and selection: A literature review. J. Clean. Prod. 2015, 98, 66–83.
14. Ho, W.; Xu, X.; Dey, P.K. Multi-criteria decision making approaches for supplier evaluation and selection: A literature review. Eur. J. Oper. Res. 2010, 202, 16–24.
15. Zakeri, S.; Chatterjee, P.; Cheikhrouhou, N.; Konstantas, D. Ranking based on optimal points and win-loss-draw multi-criteria decision-making with application to supplier evaluation problem. Expert. Syst. Appl. 2022, 191, 116258.
16. Dweiri, F.; Kumar, S.; Khan, S.A.; Jain, V. Designing an integrated AHP based decision support system for supplier selection in automotive industry. Expert. Syst. Appl. 2016, 62, 273–283.
17. Quan, M.Y.; Wang, Z.L.; Liu, H.C.; Shi, H. A hybrid MCDM approach for large group green supplier selection with uncertain linguistic information. IEEE Access 2018, 6, 50372–50383.
18. Kannan, D.; Mina, H.; Nosrati-Abarghooee, S.; Khosrojerdi, G. Sustainable circular supplier selection: A novel hybrid approach. Sci. Total Environ. 2020, 722, 137936.
19. Yang, J.J.; Lo, H.W.; Chao, C.S.; Shen, C.C.; Yang, C.C. Establishing a sustainable sports tourism evaluation framework with a hybrid multi-criteria decision making model to explore potential sports tourism attractions in Taiwan. Sustainability 2020, 12, 1673.
20. Li, H.; Wang, W.; Fan, L.; Li, Q.; Chen, X. A novel hybrid MCDM model for machine tool selection using fuzzy DEMATEL, entropy weighting and later defuzzification VIKOR. Appl. Soft Comput. 2020, 91, 106207.
21. Liou, J.J.; Chang, M.H.; Lo, H.W.; Hsu, M.H. Application of an MCDM model with data mining techniques for green supplier evaluation and selection. Appl. Soft Comput. 2021, 109, 107534.
22. Hastie, T.; Tibshirani, R.; Friedman, J. The Elements of Statistical Learning: Data Mining, Inference, and Prediction; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2009.
23. Saura, J.R. Using data sciences in digital marketing: Framework, methods, and performance metrics. J. Innov. Knowl. 2020, 6, 92–102.
24. Gupta, S.; Modgil, S.; Bhattacharyya, S.; Bose, I. Artificial intelligence for decision support systems in the field of operations research: Review and future scope of research. Ann. Oper. Res. 2022, 308, 215–274.
25. Tirkolaee, E.B.; Sadeghi, S.; Mooseloo, F.M.; Vandchali, H.R.; Aeini, S. Application of machine learning in supply chain management: A comprehensive overview of the main areas. Math. Probl. Eng. 2021, 2021, 1476043.
26. Luan, J.; Yao, Z.; Zhao, F.; Song, X. A novel method to solve supplier selection problem: Hybrid algorithm of genetic algorithm and ant colony optimization. Math. Comput. Simul. 2019, 156, 294–309.
27. Dogan, A.; Birant, D. Machine learning and data mining in manufacturing. Expert. Syst. Appl. 2021, 166, 114060.
28. Lo, H.W. A data-driven decision support system for sustainable supplier evaluation in the Industry 5.0 era: A case study for medical equipment manufacturing. Adv. Eng. Inform. 2023, 56, 101998.
29. Khan, S.A.; Yu, Z.; Golpira, H.; Sharif, A.; Mardani, A. A state-of-the-art review and meta-analysis on sustainable supply chain management: Future research directions. J. Clean. Prod. 2021, 278, 123357.
30. Saberi, S.; Kouhizadeh, M.; Sarkis, J.; Shen, L. Blockchain technology and its relationships to sustainable supply chain management. Int. J. Prod. Res. 2018, 57, 2117–2135.
31. Yazdani, M.; Pamucar, D.; Chatterjee, P.; Ebadi, A. A multi-tier sustainable food supplier selection model under uncertainty. Oper. Manag. Res. 2022, 15, 116–145.
32. Xue, M.; Fu, C.; Feng, N.P.; Lu, G.Y.; Chang, W.J.; Yang, S.L. Evaluation of supplier performance of high-speed train based on multi-stage multi-criteria decision-making method. Knowl.-Based Syst. 2018, 162, 238–251.
33. Piwowar-Sulej, K. Human resources development as an element of sustainable HRM–with the focus on production engineers. J. Clean. Prod. 2021, 278, 124008.
34. Ullah, A.; Yousaf, K. Sustainable supplier selection for the cold supply chain (CSC) in the context of a developing country. Environ. Dev. Sustain. 2021, 23, 13135–13164.
35. Khan, S.A.; Kusi-Sarpong, S.; Arhin, F.K.; Kusi-Sarpong, H. Supplier sustainability performance evaluation and selection: A framework and methodology. J. Clean. Prod. 2018, 205, 964–979.
36. Goswami, M.; Daultani, Y.; Chan, F.T.S.; Pratap, S. Assessing the impact of supplier benchmarking in manufacturing value chains: An intelligent decision support system for original equipment manufacturers. Int. J. Prod. Res. 2022, 60, 7411–7435.
37. Chen, Y.J. Structured methodology for supplier selection and evaluation in a supply chain. Inform. Sci. 2011, 181, 1651–1670.
38. Yang, Y.H.; Hui, Y.V.; Leung, L.C.; Chen, G. An analytic network process approach to the selection of logistics service providers for air cargo. J. Oper. Res. Soc. 2010, 61, 1365–1376.
39. Nurjanni, K.P.; Carvalho, M.S.; Costa, L. Green supply chain design: A mathematical modeling approach based on a multi-objective optimization model. Int. J. Prod. Econ. 2017, 183, 421–432.
40. Shao, Y.; Barnes, D.; Wu, C. External R&D supplier evaluation and selection: A three-stage integrated funnel model. IEEE Trans. Eng. Manag. 2022, 71, 4101–4115.
41. Wu, C.; Barnes, D. Partner selection in green supply chains using PSO—A practical approach. Prod. Plan. Control. 2016, 27, 1041–1061.
42. Ghorbani, M.; Arabzad, S.M.; Shahin, A. A novel approach for supplier selection based on the Kano model and fuzzy MCDM. Int. J. Prod. Res. 2013, 51, 5469–5484.
43. Lima Junior, F.R.; Osiro, L.; Carpinetti, L.C.R. A comparison between fuzzy AHP and fuzzy TOPSIS methods to supplier selection. Appl. Soft Comput. 2014, 21, 194–209.
44. Bai, C.; Zhu, Q.; Sarkis, J. Supplier portfolio selection and order allocation under carbon neutrality: Introducing a cooling model. Comput. Ind. Eng. 2022, 170, 108335.
45. Van der Rhee, B.; Verma, R.; Plaschka, G. Understanding trade-offs in the supplier selection process: The role of flexibility, delivery, and value-added services/support. Int. J. Prod. Econ. 2009, 120, 30–41.
46. Wu, C.; Barnes, D. Formulating partner selection criteria for agile supply chains: A Dempster–Shafer belief acceptability optimisation approach. Int. J. Prod. Econ. 2010, 125, 284–293.
47. Wu, Z.; Xu, J.; Jiang, X.; Zhong, L. Two MAGDM models based on hesitant fuzzy linguistic term sets with possibility distributions: VIKOR and TOPSIS. Inform. Sci. 2019, 473, 101–120.
48. Chen, S.M.; Han, W.H. An improved MADM method using interval-valued intuitionistic fuzzy values. Inform. Sci. 2018, 467, 489–505.
49. Li, C.C.; Rodríguez, R.M.; Martínez, L.; Dong, Y.; Herrera, F. Consistency of hesitant fuzzy linguistic preference relations: An interval consistency index. Inform. Sci. 2018, 432, 347–361.
50. Liu, P.; Chen, S.M. Multiattribute group decision making based on intuitionistic 2-tuple linguistic information. Inform. Sci. 2018, 430, 599–619.
51. Islam, S.; Amin, S.H.; Wardley, L.J. Machine learning and optimization models for supplier selection and order allocation planning. Int. J. Prod. Econ. 2021, 242, 108315.
52. Khan, M.M.; Bashar, I.; Minhaj, G.M.; Wasi, A.I.; Hossain, N.U.I. Resilient and sustainable supplier selection: An integration of SCOR 4.0 and machine learning approach. Sustain. Resil. Infrastruct. 2023, 8, 453–469.
53. Zangaro, F.; Minner, S.; Battini, D. A supervised machine learning approach for the optimisation of the assembly line feeding mode selection. Int. J. Prod. Res. 2020, 59, 4881–4902.
54. Duan, Y.; Edwards, J.S.; Dwivedi, Y.K. Artificial intelligence for decision making in the era of big data–evolution, challenges and research agenda. Int. J. Inf. Manag. 2019, 48, 63–71.
55. Elbadawi, M.; Gaisford, S.; Basit, A.W. Advanced machine-learning techniques in drug discovery. Drug Discov. Today 2021, 26, 769–777.
56. Usama, M.; Qadir, J.; Raza, A.; Arif, H.; Yau, K.-L.A.; Elkhatib, Y.; Hussain, A.; Al-Fuqaha, A. Unsupervised machine learning for networking: Techniques, applications and research challenges. IEEE Access 2019, 7, 65579–65615.
57. Nasteski, V. An overview of the supervised machine learning methods. Horiz. B 2017, 4, 51–62.
58. Shuja, J.; Bilal, K.; Alasmary, W.; Sinky, H.; Alanazi, E. Applying machine learning techniques for caching in next-generation edge networks: A comprehensive survey. J. Netw. Comput. Appl. 2021, 181, 103005.
59. Chhay, L.; Hossain, A.; Rathny, R.; Rafiqul, S.; Manik, I. Municipal solid waste generation in China: Influencing factor analysis and multi-model forecasting. J. Mater. Cycles Waste Manag. 2018, 20, 1761–1770.
60. Gan, T.; Liu, L.; Li Zhang, J. Non-negative matrix factorization: A survey. Comput. J. 2021, 64, 1080–1092.
61. He, Z.; Xie, S.; Zdunek, R.; Zhou, G.; Cichocki, A. Symmetric nonnegative matrix factorization: Algorithms and applications to probabilistic clustering. IEEE Trans. Neural Netw. 2011, 22, 2117–2131.
62. Yang, Z.; Oja, E. Linear and nonlinear projective nonnegative matrix factorization. IEEE Trans. Neural Netw. 2010, 21, 734–749.
63. Yi, Y.; Wang, J.; Zhou, W.; Zheng, C.; Kong, J.; Qiao, S. Non-Negative Matrix Factorization with Locality Constrained Adaptive Graph. IEEE Trans. Circuits Syst. Video Technol. 2020, 30, 427–441.
64. Li, Z.; Tang, J.; He, X. Robust Structured Nonnegative Matrix Factorization for Image Representation. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 1947–1960.
65. Pei, X.; Chen, C.; Gong, W. Concept Factorization with Adaptive Neighbours for Document Clustering. IEEE Trans. Neural Netw. Learn. Syst. 2018, 29, 343–352.
66. Peng, S.; Yang, Z.; Ling, B.W.; Chen, B.; Lin, Z. Dual semi-supervised convex nonnegative matrix factorization for data representation. Inform. Sci. 2022, 585, 571–593.
67. Saberi-Movahed, F.; Berahmand, K.; Sheikhpour, R.; Li, Y.; Pan, S. Nonnegative Matrix Factorization in Dimensionality Reduction: A Survey. Proc. ACM Meas. Anal. Comput. Syst. 2024, 37, 111.
68. Brunton, S.L.; Kutz, J.N. Data-Driven Science and Engineering: Machine Learning, Dynamical Systems, and Control; Cambridge University Press: Cambridge, UK, 2022.
69. Huo, W.; Li, W.; Zhang, Z.; Sun, C.; Zhou, F.; Gong, G. Performance prediction of proton-exchange membrane fuel cell based on convolutional neural network and random forest feature selection. Energy Convers. Manag. 2021, 243, 114367.
70. Liou, J.J.H.; Chuang, Y.C.; Zavadskas, E.K.; Tzeng, G.H. Data-driven hybrid multiple attribute decision-making model for green supplier evaluation and performance improvement. J. Clean. Prod. 2019, 241, 118321.
71. Wilson, V.H.; NS, A.P.; Shankharan, A.; Kapoor, S.; Rajan, J. Ranking of supplier performance using machine learning algorithm of random forest. Int. J. Adv. Res. Eng. Technol. 2020, 11, 298–308.
72. Guo, Y. Application research of supplier evaluation based on random forest. In Proceedings of the 6th International Conference on Software and Computer Applications, Bangkok, Thailand, 26–28 February 2017; pp. 316–323.
73. Neji, H.; Rekik, M.; Souifi, L.; Rodriguez, I.B. A Systematic Review of Sustainable Supplier Selection Using Advanced Artificial Intelligence Methods. In Proceedings of the International Conference on Agents and Artificial Intelligence 2025, Porto, Portugal, 23–25 February 2025; Volume 3, pp. 451–460.
74. Cheng, Y.; Peng, J.; Gu, X.; Zhang, X.; Liu, W.; Zhou, Z.; Yang, Y.; Huang, Z. An intelligent supplier evaluation model based on data-driven support vector regression in global supply chain. Comput. Ind. Eng. 2020, 139, 105834.
75. Lostakova, H.; Pecinova, Z.; Branska, L.; Vlckova, V.; Patak, M. Evaluation of Supplier Performance from the Perspective of Customers by their Attitudes. Appl. Mech. Mater. 2015, 708, 39–46.
76. Goodarzi, F.; Abdollahzadeh, V.; Zeinalnezhad, M. An integrated multi-criteria decision-making and multi-objective optimization framework for green supplier evaluation and optimal order allocation under uncertainty. Decis. Anal. J. 2022, 4, 100087.
77. Baryannis, G.; Dani, S.; Validi, S.; Antoniou, G. Decision support systems and artificial intelligence in supply chain risk management. In Revisiting Supply Chain Risk; Zsidisin, G.A., Henke, M., Eds.; Springer International Publishing: Berlin/Heidelberg, Germany, 2019; pp. 53–71.
78. Abdulla, A.; Baryannis, G.; Badi, I. Weighting the key features affecting supplier selection using machine learning techniques. In Proceedings of the 7th International Conference on Transport and Logistics, Niš, Serbia, 6 December 2019; pp. 15–20.
79. Forghani, A.; Sadjadi, S.J.; Farhang, M.B. A supplier selection model in pharmaceutical supply chain using PCA, Z-TOPSIS and MILP: A case study. PLoS ONE 2018, 13, e0201604.
80. Chen, Z.; Fu, A.; Deng, R.H.; Liu, X.; Yang, Y.; Zhang, Y. Secure and verifiable outsourced data dimension reduction on dynamic data. Inform. Sci. 2021, 573, 182–193.
81. Li, Y.; Yang, M.; Zhang, Z. Coordinate ranking regularized non-negative matrix factorization. In Proceedings of the 2nd IAPR Asian Conference on Pattern Recognition, Naha, Japan, 5–8 November 2013.
82. Lee, D.D.; Seung, H.S. Learning the parts of objects by non-negative matrix factorization. Nature 1999, 401, 788–791.
83. Satopaa, V.; Albrecht, J.; Irwin, D.; Raghavan, B. Finding a “kneedle” in a haystack: Detecting knee points in system behaviour. In Proceedings of the 31st International Conference on Distributed Computing Systems Workshops, Minneapolis, MN, USA, 20–24 June 2011; pp. 166–171.
84. Niu, D.; Wang, K.; Sun, L.; Wu, J.; Xu, X. Short-term photovoltaic power generation forecasting based on random forest feature selection and CEEMD: A case study. Appl. Soft Comput. 2020, 93, 106389.
85. Hasan, M.M.; Nasser, M.; Ahmad, S.; Molla, K.I. Feature selection for intrusion detection using random forest. J. Inf. Secur. 2016, 7, 129–140.
86. Li, X.K.; Chen, W.; Zhang, Q.; Wu, L. Building auto-encoder intrusion detection system based on random forest feature selection. J. Comput. Secur. 2020, 95, 101851.
87. Sharma, A.; Guleria, K.; Goyal, N. Prediction of diabetes disease using machine learning model. In Proceedings of the International Conference on Communication, Computing and Electronics Systems (ICCCES), Coimbatore, India, 21–22 October 2020; pp. 683–692.
88. Rezk, N.G.; Hemdan, E.E.D.; Attia, A.F.; El-Sayed, A.; El-Rashidy, M.A. An efficient IoT based smart farming system using machine learning algorithms. Multimed. Tools Appl. 2021, 80, 773–797.
89. Kakhki, F.D.; Freeman, S.A.; Mosher, G.A. Evaluating machine learning performance in predicting injury severity in agribusiness industries. Saf. Sci. 2019, 117, 257–262.
90. Rashidi, K.; Cullinane, K. A comparison of fuzzy DEA and fuzzy TOPSIS in sustainable supplier selection: Implications for sourcing strategy. Expert. Syst. Appl. 2019, 121, 266–281.
91. Shahin, A.; Masoomi, B.; Shafiei, M.A. Ranking the obstacles of green supply chain management using fuzzy approaches of TOPSIS and DEMATEL with a case study in a pharmaceutical industry. Int. J. Logist. Syst. Manag. 2019, 33, 404–419.
92. Pourjavad, E.; Shahin, A. A hybrid model for analyzing the risks of green supply chain in a fuzzy environment. J. Ind. Prod. Eng. 2020, 37, 422–433.
93. Golnam, A.; Regev, G.; Wegmann, A.; Kyriakopoulou, S. The integration of an RE method and AHP: A pilot study in a large swiss bank. In Proceedings of the IEEE International Requirements Engineering Conference (RE), Rio de Janeiro, Brasil, 15–19 July 2013.
94. Singha, C.; Swain, K.C.; Pradhan, B.; Rusia, D.K.; Moghimi, A.; Ranjgar, B. Mapping groundwater potential zone in the subarnarekha basin, india, using a novel hybrid multi-criteria approach in google earth engine. Heliyon 2024, 10, e24308.

Figure 1. Proposed hybrid ML-TOPSIS supplier selection model.

Figure 2. Weight distribution for selected criteria.

Figure 3. Reconstruction error vs. rank in NMF decomposition.

Table 1. Supplier selection criteria in pharmaceutical companies [79].

Main Criteria	Code	Sub-Criteria
Cost	c₁	Purchase price
	c₂	Payment conditions
	c₃	Transport cost
Quality	c₄	Product reliability
	c₅	Defective rate
	c₆	Package and label quality
	c₇	ISO 9001 certification
	c₈	Innovation capability
Services	c₉	Customer relations
	c₁₀	After-sales support
Delivery	c₁₁	Supplier location
	c₁₂	Delivery reliability
Supplier profile	c₁₃	Financial strength
	c₁₄	Management quality
	c₁₅	Technical competence
	c₁₆	Facility adequacy
	c₁₇	Production capacity
	c₁₈	Record history
	c₁₉	GMP compliance
	c₂₀	ISO 14001 certification
	c₂₁	OHSAS 18001 certification
	c₂₂	Risk management system
Overall personnel capabilities	c₂₃	Workforce skill
	c₂₄	Employee experience

Table 2. Linguistic scale for TOPSIS model.

Semantic Attributes	Corresponding Values
Very high	5
High	4
Moderate	3
Low	2
Very low	1

Table 3. Questionnaire [79].

	c₁	c₂	c₃	c₄	c₅	c₆	c₇	c₈	c₉	c₁₀	c₁₁	c₁₂	c₁₃	c₁₄	c₁₅	c₁₆	c₁₇	c₁₈	c₁₉	c₂₀	c₂₁	c₂₂	c₂₃	c₂₄
x₁	10	5	6	10	3	4	4	3	9	7	3	4	6	5	7	4	2	10	10	5	3	3	6	6
x₂	9	9	8	9	6	7	8	4	10	5	8	8	10	8	7	7	7	0	10	6	6	6	9	8
x₃	10	6	6	10	6	6	6	5	8	8	2	6	6	6	7	6	6	9	10	7	4	4	6	5
x₄	9	5	4	10	2	1	4	3	10	9	0	4	7	5	6	6	5	10	10	4	3	1	6	6
x₅	8	6	6	9	6	6	6	5	7	2	4	6	6	6	8	6	6	9	10	6	4	4	6	5
x₆	10	6	7	10	3	5	2	4	9	6	5	1	6	6	4	4	4	8	10	5	2	1	7	4
x₇	10	6	6	10	5	5	5	4	8	7	3	5	6	5	7	5	5	9	10	6	4	3	7	5
x₈	10	10	10	10	8	6	8	8	9	9	10	10	8	8	7	8	8	10	10	8	6	3	5	5
x₉	10	7	7	10	5	6	4	4	10	6	2	4	7	6	7	4	4	9	10	6	2	0	5	6
x₁₀	9	9	10	10	7	7	6	10	10	8	7	10	7	7	8	8	8	10	10	7	6	3	5	5
x₁₁	10	10	10	10	10	10	8	10	10	10	6	10	6	6	7	8	8	7	10	6	4	4	6	5
x₁₂	10	8	6	10	4	3	3	6	7	5	0	6	6	5	8	4	4	9	10	5	3	2	4	6
x₁₃	10	8	8	10	7	7	7	6	8	8	4	7	8	7	8	7	7	10	10	8	6	5	8	8
x₁₄	10	3	6	10	3	6	6	1	9	6	0	2	4	2	6	2	5	8	10	5	3	3	5	7
x₁₅	9	5	4	10	2	1	4	2	10	9	1	3	7	5	8	6	6	9	10	4	3	1	7	6
x₁₆	10	8	4	10	7	4	3	1	8	5	0	1	4	7	7	4	5	10	10	5	4	4	5	5
x₁₇	9	7	6	10	10	10	10	8	10	10	3	9	8	6	10	7	9	9	10	10	9	9	10	10
x₁₈	10	8	9	10	7	7	7	9	9	8	7	8	6	7	7	9	9	10	10	7	7	7	8	7
x₁₉	9	6	4	10	5	4	5	3	10	4	1	7	6	6	8	5	5	10	10	6	3	2	7	6
x₂₀	10	7	7	10	6	6	6	5	10	7	2	6	7	7	8	6	6	10	10	5	3	3	7	7
x₂₁	10	4	5	10	4	2	5	4	8	7	2	3	5	3	6	4	2	8	10	6	3	1	5	5
x₂₂	10	10	10	10	10	9	10	10	10	10	7	10	10	10	10	10	10	10	10	10	8	10	10	10
x₂₃	9	6	7	10	6	9	8	6	9	9	8	9	7	6	8	7	7	9	10	8	8	7	6	7
x₂₄	10	6	7	10	9	10	8	6	10	10	5	9	8	7	10	7	7	9	10	8	8	7	6	7
x₂₅	7	6	7	10	6	7	7	5	8	7	3	7	7	7	7	6	6	7	10	6	4	4	6	6
x₂₆	10	5	6	10	3	4	4	3	9	5	1	2	3	8	7	5	5	8	10	6	3	2	6	7
x₂₇	9	5	6	10	6	8	10	7	10	8	6	5	8	10	10	10	9	10	10	10	10	8	8	7
x₂₈	10	7	7	10	6	6	6	5	10	7	3	6	7	6	8	6	6	9	10	7	5	4	7	7
x₂₉	10	7	7	10	7	7	7	6	10	8	4	7	7	7	8	7	7	6	10	7	5	5	7	7
x₃₀	9	6	3	10	6	3	4	4	9	4	2	7	5	6	6	5	4	8	10	7	3	2	7	6
x₃₁	9	6	5	10	6	3	4	1	8	6	1	1	5	7	8	4	4	8	10	5	4	5	5	5
x₃₂	10	7	7	10	6	6	6	5	10	7	3	6	7	6	8	6	6	9	10	7	5	4	7	7
x₃₃	10	7	7	10	10	10	7	8	10	10	6	10	10	9	10	10	10	9	10	7	6	7	10	10
x₃₄	9	8	7	10	9	9	9	8	8	8	8	8	8	8	10	9	9	7	10	7	7	9	8	9

Table 4. Weight distribution for dimension reduction using NMF.

S/N	Criteria	Score
1	c₄	3.8385
2	c₁₈	3.7611
3	c₁	3.3129
4	c₁₄	2.7282
5	c₁₅	2.4766
6	c₂	2.3853
7	c₉	2.3461
8	c₁₃	2.0366
9	c₁₆	2.0308
10	c₁₀	2.0007
11	c₁₇	1.8113
12	c₂₃	1.6775
13	c₅	1.6415
14	c₁₂	1.6260
15	c₃	1.5692
16	c₂₂	1.3988
17	c₈	1.3980
18	c₂₄	1.3975
19	c₇	1.3531
20	c₆	1.3188
21	c₂₀	1.2628
22	c₁₁	1.2258
23	c₂₁	0.9946
24	c₁₉	0.0000
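
The abstract reports that the twenty-four criteria are condensed to eight. A minimal sketch of that selection step follows, assuming the retained criteria are simply the eight highest-scoring entries of Table 4; the cut-off rule and the names `nmf_scores` and `N_RETAINED` are illustrative assumptions, not taken from the authors' implementation.

```python
# NMF-derived scores from Table 4, keyed by criterion label.
nmf_scores = {
    "c4": 3.8385, "c18": 3.7611, "c1": 3.3129, "c14": 2.7282,
    "c15": 2.4766, "c2": 2.3853, "c9": 2.3461, "c13": 2.0366,
    "c16": 2.0308, "c10": 2.0007, "c17": 1.8113, "c23": 1.6775,
    "c5": 1.6415, "c12": 1.6260, "c3": 1.5692, "c22": 1.3988,
    "c8": 1.3980, "c24": 1.3975, "c7": 1.3531, "c6": 1.3188,
    "c20": 1.2628, "c11": 1.2258, "c21": 0.9946, "c19": 0.0000,
}

N_RETAINED = 8  # assumption: keep the eight highest-scoring criteria

# Sort criterion labels by descending score and keep the top entries.
ranked = sorted(nmf_scores, key=nmf_scores.get, reverse=True)
retained = ranked[:N_RETAINED]
print(retained)
```

Under this assumption the retained set coincides with the eight criteria that receive random forest weights in Table 6.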

Table 5. Machine learning model accuracy.

ML Technique	Random Forest	SVC	KNN	Logistic Regression
CV Score	0.8429	0.7476	0.8190	0.7476

Table 6. Criteria weight determination using random forest.

S/N	Criteria	Weight
1	c₁₅	0.2366
2	c₁₄	0.1564
3	c₄	0.1457
4	c₂	0.1385
5	c₉	0.1122
6	c₁₈	0.0937
7	c₁₃	0.0637
8	c₁	0.0532
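
Because these weights feed directly into the TOPSIS weighting step, it is worth confirming that they form a normalised distribution. The short check below uses only the values printed in Table 6; the variable names are illustrative.

```python
# Random forest criterion weights as listed in Table 6.
weights = {
    "c15": 0.2366, "c14": 0.1564, "c4": 0.1457, "c2": 0.1385,
    "c9": 0.1122, "c18": 0.0937, "c13": 0.0637, "c1": 0.0532,
}

# The weights should sum to 1 (up to floating-point error).
total = sum(weights.values())
assert abs(total - 1.0) < 1e-6, "weights should sum to 1"

# Cumulative share of the total weight, in descending order of weight.
running = 0.0
cumulative = {}
for criterion, w in sorted(weights.items(), key=lambda kv: kv[1], reverse=True):
    running += w
    cumulative[criterion] = round(running, 4)
print(cumulative)
```

The cumulative shares show that the three heaviest criteria (c₁₅, c₁₄ and c₄) together carry a little over half of the total weight.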

Table 7. Rating of four suppliers by DM4 for all criteria.

Criteria	Supplier 1	Supplier 2	Supplier 3	Supplier 4
c₁₅	5	4	5	3
c₁₄	4	4	4	3
c₄	4	5	4	5
c₂	5	5	4	4
c₉	4	5	3	4
c₁₈	4	3	4	3
c₁₃	4	4	5	4
c₁	5	4	3	4

Table 8. Normalised decision matrix.

Criteria	Supplier 1	Supplier 2	Supplier 3	Supplier 4
c₁₅	0.5774	0.4619	0.5774	0.3464
c₁₄	0.5298	0.5298	0.5298	0.3974
c₄	0.4417	0.5522	0.4417	0.5522
c₂	0.5522	0.5522	0.4417	0.4417
c₉	0.4924	0.6155	0.3693	0.4924
c₁₈	0.5657	0.4243	0.5657	0.4243
c₁₃	0.4682	0.4682	0.5852	0.4682
c₁	0.6155	0.4924	0.3693	0.4924
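
The entries of Table 8 are consistent with the usual TOPSIS vector normalisation, in which each rating is divided by the Euclidean norm of its criterion row. The sketch below applies that rule to the ratings of Table 7; the function name `vector_normalise` is illustrative.

```python
import math

criteria = ["c15", "c14", "c4", "c2", "c9", "c18", "c13", "c1"]

# Ratings from Table 7: one row per criterion, one column per supplier.
ratings = [
    [5, 4, 5, 3],
    [4, 4, 4, 3],
    [4, 5, 4, 5],
    [5, 5, 4, 4],
    [4, 5, 3, 4],
    [4, 3, 4, 3],
    [4, 4, 5, 4],
    [5, 4, 3, 4],
]


def vector_normalise(row):
    """Divide each rating by the Euclidean norm of its criterion row."""
    norm = math.sqrt(sum(x * x for x in row))
    return [x / norm for x in row]


normalised = [vector_normalise(row) for row in ratings]
for name, row in zip(criteria, normalised):
    print(name, [round(v, 4) for v in row])
```

For c₁₅, for example, the norm is √(25 + 16 + 25 + 9) = √75 ≈ 8.6603, so a rating of 5 becomes 5/8.6603 ≈ 0.5774.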

Table 9. Weighted normalised decision matrix.

Criteria	Weight	Supplier 1	Supplier 2	Supplier 3	Supplier 4
c₁₅	0.2366	0.1366	0.1093	0.1366	0.0820
c₁₄	0.1564	0.0829	0.0829	0.0829	0.0621
c₄	0.1457	0.0644	0.0804	0.0644	0.0804
c₂	0.1385	0.0765	0.0765	0.0612	0.0612
c₉	0.1122	0.0552	0.0691	0.0414	0.0552
c₁₈	0.0937	0.0530	0.0398	0.0530	0.0398
c₁₃	0.0637	0.0298	0.0298	0.0373	0.0298
c₁	0.0532	0.0327	0.0262	0.0196	0.0262
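
Each entry of Table 9 is the product of a criterion weight from Table 6 and the corresponding normalised value from Table 8. The sketch below repeats that multiplication from the printed four-decimal values, so individual results may differ from Table 9 by one unit in the fourth decimal place.

```python
criteria = ["c15", "c14", "c4", "c2", "c9", "c18", "c13", "c1"]

# Criterion weights from Table 6, in the same order as `criteria`.
weights = [0.2366, 0.1564, 0.1457, 0.1385, 0.1122, 0.0937, 0.0637, 0.0532]

# Normalised decision matrix from Table 8 (rows: criteria, columns: suppliers 1-4).
normalised = [
    [0.5774, 0.4619, 0.5774, 0.3464],
    [0.5298, 0.5298, 0.5298, 0.3974],
    [0.4417, 0.5522, 0.4417, 0.5522],
    [0.5522, 0.5522, 0.4417, 0.4417],
    [0.4924, 0.6155, 0.3693, 0.4924],
    [0.5657, 0.4243, 0.5657, 0.4243],
    [0.4682, 0.4682, 0.5852, 0.4682],
    [0.6155, 0.4924, 0.3693, 0.4924],
]

# Scale every row of the normalised matrix by its criterion weight.
weighted = [[w * v for v in row] for w, row in zip(weights, normalised)]
for name, row in zip(criteria, weighted):
    print(name, [round(v, 4) for v in row])
```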

Table 10. Ideal best to ideal worst.

	c₁₅	c₁₄	c₄	c₂	c₉	c₁₈	c₁₃	c₁
V⁺	0.1366	0.0829	0.0804	0.0765	0.0691	0.0530	0.0373	0.0196
V⁻	0.0820	0.0621	0.0644	0.0612	0.0414	0.0398	0.0298	0.0327

Table 11. Supplier performance.

Measure	Supplier 1	Supplier 2	Supplier 3	Supplier 4
S⁺	0.0260	0.0497	0.0354	0.0641
S⁻	0.0634	0.0497	0.0618	0.0222
S⁺ + S⁻	0.0894	0.0995	0.0972	0.0863
Performance score (P⁺)	0.7089	0.5000	0.6355	0.2571
Rank	1	3	2	4
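
The remaining TOPSIS steps (ideal solutions, separation measures, closeness score and ranking) can be sketched from the weighted values of Table 9. The cost or benefit orientation of each criterion is not given in the tables, so the sketch assumes that only c₁ (purchase price) is a cost criterion and that all others are benefit criteria. Under that assumption it reproduces the tabulated S⁺ and S⁻ for Suppliers 1, 3 and 4 to within rounding. For Supplier 2 it yields S⁺ ≈ 0.032 rather than the tabulated 0.0497, which changes that supplier's score but not the final ranking.

```python
import math

criteria = ["c15", "c14", "c4", "c2", "c9", "c18", "c13", "c1"]

# Weighted normalised values from Table 9 (rows: criteria, columns: suppliers 1-4).
weighted = [
    [0.1366, 0.1093, 0.1366, 0.0820],
    [0.0829, 0.0829, 0.0829, 0.0621],
    [0.0644, 0.0804, 0.0644, 0.0804],
    [0.0765, 0.0765, 0.0612, 0.0612],
    [0.0552, 0.0691, 0.0414, 0.0552],
    [0.0530, 0.0398, 0.0530, 0.0398],
    [0.0298, 0.0298, 0.0373, 0.0298],
    [0.0327, 0.0262, 0.0196, 0.0262],
]

# Assumption (not stated in the tables): only c1 is a cost criterion.
cost_criteria = {"c1"}

# Ideal best and worst per criterion: max/min for benefit, min/max for cost.
ideal_best, ideal_worst = [], []
for name, row in zip(criteria, weighted):
    if name in cost_criteria:
        best, worst = min(row), max(row)
    else:
        best, worst = max(row), min(row)
    ideal_best.append(best)
    ideal_worst.append(worst)

# Euclidean separation from each ideal, then relative closeness per supplier.
n_suppliers = len(weighted[0])
closeness = []
for j in range(n_suppliers):
    column = [row[j] for row in weighted]
    s_plus = math.dist(column, ideal_best)
    s_minus = math.dist(column, ideal_worst)
    closeness.append(s_minus / (s_plus + s_minus))

# Rank 1 goes to the supplier with the highest closeness score.
order = sorted(range(n_suppliers), key=lambda j: closeness[j], reverse=True)
ranks = [0] * n_suppliers
for position, j in enumerate(order, start=1):
    ranks[j] = position
print([round(c, 4) for c in closeness], ranks)
```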

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Gidiagba, O.J.; Tartibu, L.; Okwu, M. Integrating Machine Learning with Multi-Criteria Decision-Making Models for Sustainable Supplier Selection in Dynamic Supply Chains. Logistics 2025, 9, 152. https://doi.org/10.3390/logistics9040152

