Next Article in Journal
Assisting the Human Embryo Viability Assessment by Deep Learning for In Vitro Fertilization
Previous Article in Journal
Sustainable Supplier Selection and Order Allocation Using an Integrated ROG-Based Type-2 Fuzzy Decision-Making Approach
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Supervised Learning by Evolutionary Computation Tuning: An Application to Blockchain-Based Pharmaceutical Supply Chain Cost Model

1
Mechanical Engineering Department, École de Technologie Supérieure, Montreal, QC H3C1K3, Canada
2
Department of Systems Engineering, École de Technologie Supérieure, Montreal, QC H3C1K3, Canada
*
Author to whom correspondence should be addressed.
Mathematics 2023, 11(9), 2021; https://doi.org/10.3390/math11092021
Submission received: 13 March 2023 / Revised: 16 April 2023 / Accepted: 18 April 2023 / Published: 24 April 2023

Abstract

:
A pharmaceutical supply chain (PSC) is a system of processes, operations, and organisations for drug delivery. This paper provides a new PSC mathematical cost model, which includes Blockchain technology (BT), that can improve the safety, performance, and transparency of medical information sharing in a healthcare system. We aim to estimate the costs of the BT-based PSC model, select algorithms with minimum prediction errors, and determine the cost components of the model. After the data generation, we applied four Supervised Learning algorithms (k-nearest neighbour, decision tree, support vector machine, and naive Bayes) combined with two Evolutionary Computation algorithms (ant colony optimization and the firefly algorithm). We also used the Feature Weighting approach to assign appropriate weights to all cost model components, revealing their importance. Four performance metrics were used to evaluate the cost model, and the total ranking score (TRS) was used to determine the most reliable predictive algorithms. Our findings show that the ACO-NB and FA-NB algorithms perform better than the other six algorithms in estimating the costs of the model with lower errors, whereas ACO-DT and FA-DT show the worst performance. The findings also indicate that the shortage cost, holding cost, and expired medication cost more strongly influence the cost model than other cost components.

1. Introduction

The supply chain system (SCS) is an accepted approach to increase profit margins, protect the pharmaceutical industry against introduced pressures, and overcome obstacles for obtaining high efficiency while taking the limited available resources into account [1]. Conversely, the pharmaceutical supply chain (PSC) is a system of processes, operations, and organisations involved in drug discovery, development, and production. PSC processes are crucial for ensuring medication quality and favourable final patient outcomes [2]. As a system of processes, operations, and organisations, the PSC plays a significant role in delivering the right medication to the right customers (patients) at the right time and in the right conditions. In the current SCS, pharmacies and manufacturers cannot track their products and have no clear system visibility. Recalls are costly and complicated in the SCS, making follow-up with patients difficult for companies. Therefore, the current SCS in the pharmaceutical industry appears to be outdated and may not provide visibility and control for manufacturers and regulatory authority over drug distribution [3]. In particular, it cannot withstand 21st century cybersecurity threats [3]. The use of a Blockchain-based pharmaceutical supply chain (BT-based PSC) appears to be necessary for any pharmacy system. A BT-based PSC helps the system improve the safety, performance, and transparency of medical information sharing and the data transformation cost/time as well as the manufacturing process, the distribution of the materials/drugs, and the tracking of the materials/drugs sourced for manufacturing. BT in a PSC system can develop patient data cards for other medical practitioners’ centres, especially in hospitals, leading to saved time and improved healthcare service. In this system, patients and healthcare centres can have different accessibility choices with the PSC data. In addition, any block in BT contains the medical information with a hash connecting it to another block.
In contrast to several previous studies that have reported on the advantages and disadvantages of using BT in a PSC, the present study seeks to address the cost problem of a BT-based PSC. Other studies do not provide a cost mathematical model or related cost components for a PSC system based on the BT approach. This paper differs from others by introducing a cost mathematical model for the BT-based PSC system and providing the cost components of the system. The cost factor is important to managers because the knowledge of costs helps them to control all the financial resources employed in the performance of the system, control the cash flow, identify the rate of return and profitability, and correctly decide whether the new system benefits their organisation. Moreover, the knowledge of costs helps to monitor the business financial health, optimize the institution’s financial planning, reduce expenses, stay within the budget, and analyse the information to identify unnecessary costs and better business opportunities. Another important contribution of this study is to provide a PSC system with BT. BT can improve the safety, performance, and transparency of medical information sharing in a healthcare system, minimise the data transformation cost and time, and maintain the financial statements in hospitals.
The purpose of this study is to estimate the costs of the BT-based PSC model in a hospital, select algorithms with the minimum prediction errors, and determine the cost components of the BT-based PSC model in a hospital. To understand the importance of the BT-based PSC cost model, determining the cost components of the model is essential. Thus, this paper also aims to measure the importance of each cost component (feature) of the model, which is the degree of relevance of each feature to the model. To achieve these objectives, this research attempts to respond to the following research questions: (i) What are the cost components of the BT-based PSC cost model in a hospital, and what is the mathematical cost model? (ii) Which algorithms show better performance in minimising the prediction errors of the BT-based PSC cost model? (iii) What are the important cost components of the model? The research questions were answered using the following directions. First, we designed a mathematical BT-based PSC cost model after determining the cost components. Then, following the data generation, we applied four Supervised Learning (SL) algorithms (k-nearest neighbors (KNN), decision tree (DT), support vector machine (SVM), and naive Bayes (NB)) combined with two Evolutionary Computation (EC) algorithms (ant colony optimization (ACO) and the firefly algorithm (FA)) for a total of eight algorithms. These algorithms were selected because they are well-known algorithms that have been successfully applied to solve many engineering problems, which can facilitate the discussion of their behaviours in our new cost model. Finally, four performance metrics were used to evaluate the cost model, and the total ranking score (TRS), which is a score-based ranking system, was used to determine the most reliable predictive algorithms.
The rest of the paper is organised as follows. First, we provide an overview of BT in a PSC, EC, and SL in Section 2. Next, we discuss the methodology and data generation used to optimize the estimation of the BT-based PSC cost model in Section 3. Then, the design of the mathematical cost model for the BT-based PSC is outlined in Section 4. All experiments and results are described in Section 5. Then, the results, limitations, and future research are discussed in Section 6, which is called Discussion. Finally, we briefly present the conclusions in Section 7.

2. Literature Review

This background explains the related literature regarding the PSC and its components for a hospital, how BT drives PSC, Evolutionary Computation, and Supervised Learning (ACO and FA; KNN, DT, NB, and SVM).

2.1. PSC and Its Components for a Hospital

The SCS is required for any industry that moves materials and goods in any way; on the other hand, the PSC is important for tracking the materials and goods sourced for manufacturing and for the distribution of the products [4]. PSC processes affect the quality of medication and patient outcomes [2]. As an accepted approach, the SCS protects the pharmaceutical industry against the introduced pressures, increases profit margins, and overcomes efficiency issues [1]. PSCs seek to ensure that the right people receive the right medication at the right time and in the right conditions [5]. These responsibilities of PSCs are complex and increase their vulnerability and the probability of distribution [5]. A PSC can be defined as “the integration of all activities associated with the flow and transformation of drugs from raw materials through to the end user, as well as the associated information flows, through improved SC relationships to achieve a sustainable competitive advantage” [6]. The pharmaceutical industry is a system of processes, operations, and organisations involved in drug detection, development, and production [7]. In [7], the PSC is described as an approach with a suitable quality that distributes drugs at the right time and place to reach the customers. The healthcare sector includes publicly traded companies supporting all facets of the healthcare sector [3]. The authors of [3] state that the healthcare sector consists of clinical, preventive, treatment, and therapeutic services and providers, including doctors, nurses, hospitals, drugs, medical equipment suppliers, and health insurance companies in addition to other private, government, and voluntary institutions such as residential, educational, dental, domestic health, medical, surgical, and ambulatory institutions, and medical and diagnostic laboratories. The PSC includes three significant players: producers, purchasers, and pharmaceutical providers [6]. The authors of [6] describe the producers as the pharmaceutical companies, medical–surgical product companies, device manufacturers, capital equipment manufacturers, and information systems manufacturers. According to Uthayakumar and Priyan [6], purchasers comprise the grouped purchasing organisations, the pharmaceutical wholesalers, the medical–surgical distributors, independently contracted distributors, and the product representatives. The authors of [6] also explain that providers include hospitals and their systems, integrated delivery networks, and alternative site facilities. The BT-enabled PSC cost model in this article contains eight elements: (a) regular purchases cost, (b) emergency purchases cost, (c) shipping cost, (d) expired medication cost, (f) holding cost, (g) shortage cost, (h) Blockchain transaction cost, (i) Blockchain installation cost.

2.2. BT Drives PSC

The current PCS of the pharmaceutical industry appears to be outdated, does not provide visibility and control for manufacturers and drug distributions, and cannot withstand current cybersecurity threats [3]. BT is a cutting-edge technology that has been used in different applications such as cryptocurrency, financial services, risk management, and public and social services [8]. BT can be public, private, hybrid, or part of a consortium. Each BT type has various advantages and disadvantages that influence its optimal applications. According to Haq and Esuka [3], the defects of the SCS are as follows: information is not shared between systems, manufacturers cannot track their products, the drug regulatory authority has no visibility of the system, recalls are complicated and costly, and the healthcare system cannot follow up with patients. Haq and Esuka [3] also mention that the products in a PSC are verifiable without any information about the manufacturer’s private techniques.
Conversely, Haq and Esuka [3] believe that it is possible to share the patient’s medical record with various participants on the network without disclosing the patient’s private data. Several players move a product throughout the PSC: (i) primary manufacturers, (ii) secondary manufacturers, (iii) distribution centers/wholesalers, and (iv) retailers (i.e., pharmacies/hospitals) [9]. BT improves safety, displays information, achieves transparency, and is used for health record keeping, clinical trials, and patient monitoring [10]. According to Zahiri et al. [9], BT maintains financial statements in the hospitals and minimises the time and cost of data transformation. Haleem et al. [10] highlight that BT preserves and exchanges patient data via hospitals, diagnostic laboratories, pharmacy firms, and physicians in a healthcare system. BT in the PSC can detect fake medicines by facilitating the proper control over the supply and demand of the drugs and can enable pharmaceutical companies to control fake and unregistered medicines [11]. Kumar Badhotiya et al. [11] assert that fake and unregistered medicines with no medical recovery pose a significant threat to human life, causing many side effects leading to severe damage to health or even death. Therefore, BT’s advantages improve the performance, security, and transparency of medical data sharing in the healthcare system [10]. Haleem et al. [10] also state that BT gains insight and enhances the analysis of medical records in medical institutions. BT applied within a PSC enables data integration, secure transactions, serialisation, and traceability [8]. Importantly, Haq and Esuka [3] note that visibility and privacy are largely contradictory, and to obtain one, the other is often lost. They [3] clarify that BT can guarantee the verification of the origin of data that are made available publicly while keeping the private data of an entity secret without compromising privacy. According to these authors [3], the decentralized nature of BT allows patients, doctors, and healthcare providers to share data quickly and securely. Hosseini Bamakan et al. [8] also show that traceability plays a significant role in securing drugs and is the basis for the reliance by the consumer on the PSC and its products. They [8] continue that the traceability of BT enables the PSC to verify the background of a product and tracks the path of all the locations and the participants that handle it. They also state that [8] BT can also provide transparency to the PSC and considers the needs of the suppliers, producers, logistics, distributors, and customers in the PSC. They [8] assert that all pharmaceutical institutions adhere to patient protection maintenance, and intelligent contracts can facilitate this process if a system applies BT. Among all the factors of a BT-enabled PSC, the cost factor is significant for an organisation. Supervised Learning (SL) algorithms can predict the costs of the system and Evolutionary Computation is applied to optimize the hyperparameters of SL to build a model, allowing the exploration of the possible combinations of parameters.

2.3. Evolutionary Computation and Supervised Learning

The Evolutionary Computation (EC) algorithm is the main object of interest in evolutionary computation [12]. The scientific community has demonstrated that metaheuristics are a viable and often superior alternative to the more traditional (exact) methods of mixed-integer optimization such as branch and bound algorithms and dynamic programming [13]. Metaheuristics often offer a better trade-off between solution quality and computing time, particularly for complicated problems or large problem instances [13]. Using metaheuristic techniques, reasonably good solutions are obtained without exploring the whole solution space [14]. Rather than searching for the global optimum solution, these techniques aim to find sufficiently “good” solutions to efficiently exploit the characteristics of the problem and provide an attractive alternative for large-scale applications [15]. SL tries to predict the output feature’s value based on the input features’ values [16]. The authors of [16] point out that SL learns the relationship between the target feature and the input features from the training data for which the target feature value is already known.

2.3.1. ACO and FA

Metaheuristics are powerful techniques for solving complex real-world problems in many application domains [17]. The behaviour and performance of EC algorithms depend strongly on their ability to efficiently explore and exploit the search space [17]. The authors of [17] state that ACO is a well-known EC that was inspired by the collective performance of real-life ant colonies and has been used to solve many engineering problems. The ACO algorithm employs a colony of simple cooperating agents and solves combinatorial optimization problems [18]. The FA is mainly inspired by the light connection between fireflies [7]. Goodarzian et al. [7] explain that in swarm intelligence the cooperation (and possibly the competition) of more straightforward and less intelligent members creates a higher degree of intelligence that certainly cannot be achievable by any of the components alone. Thus, according to Goodarzian et al. [7], the FA is inspired by the natural species behaviour to optimize nonlinear functions that simultaneously use low-cost algorithms. Each member of the fireflies’ group in the FA moves to a point where their most efficient outcome has occurred [19].

2.3.2. KNN, DT, NB, and SVM

SL is defined as the use of labeled datasets to train algorithms that classify datasets or predict outcomes accurately [12]. Zhang et al. [12] mention that SL adjusts the weights until the model is fitted appropriately after inputting the data into the model. According to Zhang et al. [12], the cross-validation process ensures that the model can avoid overfitting and under-fitting. They [12] present SL using several methods, including neural networks, naïve Bayes (NB), linear regression, logistic regression, random forest, and support vector machine (SVM).
K-nearest neighbors (KNN) is an SL algorithm used in classification and regression problems [20]. The authors of [20] highlight that KNN is applied in a variety of applications such as text categorisation, agriculture, medicine, finance, facial recognition, economic forecasting, and heart disease diagnosis. They [20] express that KNN calculates the distance between each unlabelled data point and all other points in the dataset to classify the unlabelled data. Then, KNN, according to these authors [20], assigns each unlabelled data point to the class of the most identically labelled data by finding patterns in the dataset.
A DT is an SL algorithm primarily used to analyse data and perform regression and classification problems [21]. The authors of [21] explain that DT contains decision nodes (to test the value of an attribute), edges (to determine the outcome of a test and connect with the next node), and a leaf node (to predict the result), all of which are combined and comprise a complete structure of the DT. In the DT, according to these authors [21], each dataset attribute is treated as a node, and a special and unique node is a root node. The process starts with a unique node and proceeds down the tree to satisfy the parameters and decision [21]. The authors of [21] believe this procedure is carried out until a terminal node is encountered.
An NB classifier is a simple probabilistic classifier that has been widely used due to its high efficiency, solid theoretical foundation, and good generalisation ability [22]. According to Ren, et al. [22], NB assumes that the attribute variables are conditionally independent when class variables are given. To classify the given item, they [22] state that NB determines the probability of each category appearing under the condition of the occurrence of this item and classifies the item that belongs to the category with the highest probability [22].
An SVM is a binary classification model that has been widely used due to its global optimization capability and good robustness in fields such as environment, medicine, and finance [23]. The authors of [23] explain that the basic principle of SVM requires that, when solving a classification problem, the distance from the nearest sample point to the decision surface is the largest; that is, the minimum distance maximises the two classes of the sample points to separate the edges. A straight line in a two-dimensional space makes it the most suitable segmentation line in the middle of the two data classes, and SVM finds an optimal decision plane as the classification benchmark in the high-dimensional dataset [23]. The next section explains the methodology of the study and the procedure to generate data.

3. Methodology and Data Generation

In this section, we introduce the procedure of data generation as well as the methodology for the optimization of the estimation of the BT-based PSC cost model in a hospital. We first used Python software to generate raw data for the nonlinear BT-based PSC cost model, including the cost components of a hospital. The widely known tool for the generation of random data in Python is its random module, and we applied randint() as an inbuilt function of the random module. This module returns a random integer value from the inclusive range between the two lower and higher limits (including both limits) provided as two parameters. In the data generation step, the following features, which are the components of the model, were generated using a Python program: Cp,Regular_Purchases; Cp,Emergency_Purchases; Cp,Shipping; Cp,Expired_Medication; Cp,Holding; Cp,Shortage; Cp,BT_Transaction; Cp,BT_Installation. A total of 5000 series of the generated raw data for all 8 components of the BT-based PSC cost model and the total cost were uploaded to https://data.mendeley.com/datasets/jxv5jrydnc (accessed on 1 November 2022) [24]. The research method selected in this paper is to combine two approaches (EC and SL) for the evaluation of the BT-based PSC cost model: for EC, the ACO and FA are used, and for SL, the KNN, DT, SVM, and NB algorithms are used. These algorithms are well-known and can be successfully applied to solve many engineering problems, which can facilitate the discussion of their behaviours in our new cost model. Here, ACO and the FA are used to improve the parameters of the KNN, DT, SVM, and NB algorithms, as well as to minimise the model prediction errors. The parameters of SL algorithms are usually set empirically, and it takes much time to test and find the best predictive performance of the model. Therefore, the EC algorithms explore the possible combinations of parameters, optimize the hyperparameters of the SL algorithms, and reduce the prediction errors of the SL algorithms. Therefore, the EC algorithms play a significant role in enhancing the performance of the selected SL algorithms. Thus, EC, combined with four algorithms (KNN, DT, SVM, and NB) is used to reduce prediction errors. The generated dataset has eight features (including Cp,Regular_Purchases; Cp,Emergency_Purchases; Cp,Shipping; Cp,Expired_Medication; Cp,Holding; Cp,Shortage; Cp,BT_Transaction; and Cp,BT_Installation), and has the total cost (CTotal) as the label in the regression process (see https://data.mendeley.com/datasets/jxv5jrydnc (accessed on 1 November 2022) [24]). Although the dataset was generated using Python, the implementation of the algorithm, as illustrated in the flowchart presented in Figure 1, was carried out in MATLAB. Figure 1 illustrates the flowchart of the methodology for the four SL algorithms and two EC algorithms. The flowchart starts with creating the population and initialising the parameters. In the next step, the Feature Weighting (FW) approach, which is one of the most efficient approaches, is applied to evaluate the importance of features, assign an appropriate weight to each feature, and estimate the degree of relevance of each feature to the model. The FW process is executed by multiplying the value of every instance of all the features and orders them by their values [25]. FW is considered to be more efficacious than the Feature Selection process in several problems and cases because the features are very sensitive, so removing these kinds of features may negatively affect the classification performance [25]. Traditionally, all of the selected features are equally important when estimating the output, but if some features have a higher weight than others, the results can be strongly influenced by them, affecting the performance and the accuracy of the overall algorithm. The dataset was then participated with 70% of the dataset used for training, while the remaining 30% was used for testing. Four different SL algorithms were used to find the optimal method to estimate the cost of the BT-based PSC cost model, including KNN, DT, SVM, and NB. In the next step, we applied two EC algorithms, ACO and the FA, to improve the performance of the SL algorithms and optimize their hyperparameters, reducing the prediction errors. Four metrics were used to evaluate the cost model, including the mean square error (MSE), root mean square error (RMSE), mean absolute error (MAE), and correlation coefficient (R2). Therefore, this approach produced results for the following metrics: eight FWs (one weight for each feature), MSE, RMES, MAE, and R2. Eventually, a score-based ranking system called total ranking score (TRS) was used to determine the most reliable predictive algorithms. In TRS, each method received a score based on the calculated MSE, RMES, MAE, and R2 values. Finally, the ranking position of each model was assigned based on the sum of all obtained score states. The following section models the casts of the BT-based PSC system based on the literature review and the methodology.

4. Mathematical Cost Model for BT-Based PSC

This section proposes the nonlinear BT-based PSC cost model for a hospital. The model focuses on the costs of a pharmaceutical supply chain that used a public Blockchain as a modern, trustworthy, and traceable database. Inspired by recent articles by Weraikat, Kazemi Zanjani, and Lehoux [26], Franco and Alfonso-Lizarazo [27], and Havaeji, Dao, and Wong [28], we determine that the cost components of the BT-based PSC model are regular purchases cost, emergency purchases cost, shipping cost, expired medication cost, holding cost, Shortage cost, Blockchain transaction cost, and Blockchain installation cost. Based on these references and our knowledge, this research proposes a mathematical model and assumptions to estimate the costs of a PSC based on Blockchain technology in a hospital. Equations (1) and (2) show the cost components of the mathematical model. The assumptions of the mathematical model in the approach are as follows:
A 1. The planning horizon is one year.
A 2. The drug expiration date is constant, and the producer ships drugs with a long life cycle to the hospital to reduce the likelihood of expiration.
A 3. The hospital orders medicines in bulk packs (and not units).
A 4. At the beginning of the planning horizon, the age and the number of medications available in the hospital (including the initial stock) are zero.
A 5. Demand should be satisfied at all levels in a hospital. Emergency purchases are made if the hospital’s pharmacy has a drug shortage. The price of emergency medicine is equal to or higher than its regular purchase price.
A 6. Medicines in the form of both regular and emergency purchases are available.
A 7. The producer ships only fresh medications to the hospital to minimise the likelihood of expiration.
A 8. The cost of medicines is determined at the cycle time when a decision to purchase is made. This is because drug prices can change every period.
A 9. The medicines are used to satisfy the demand and/or are kept in the inventory after arrival.
Hospitals are highly recommended to have a high level of technology and infrastructure that would facilitate the future implementation of a technological system supporting BT. As the main customers of medications, hospitals adopt conservative inventory control policies by keeping large quantities of drugs in stock [26]. The demand of the hospital should be fulfilled by the producer over the planning horizon because of the importance of the medications [26]. In addition, the producer communicates with the hospital to decide on a minimum amount of each medication that must be available in the hospital stock at all times in the safety stock level [26]. Medications move from the producer, via a transportation provider, to the hospital site to satisfy its demand in each period of the planning horizon [26]. Leaving expired medications at customer zones and disposing them improperly can both generate a significant negative environmental footprint [26]. Once the medicine arrives, it can be used to satisfy the demand and/or be kept in inventory by using the age-based inventory constraints and the age of medicines to model the perishability [27]. After a number of cycles, some medicines expire during the time that passes; therefore, if there is not enough medicine on inventory to satisfy the demand, which is also a random element, an emergency purchase can be made to satisfy the demand, but the purchase will be made at a higher price [27]. Any medication that reaches the end of its shelf life is quarantined and then shipped to government safe disposal sites, while unexpired medications remain at the hospital to be used in a next period [27]. The parameters, variables, and constraints used in the model are listed in Table 1.
Equations (1) and (2) describe the components of the model mathematically.
Min   TC = M i n ( p = 1 M [ C p , R e g u l a r _ P u r c h a s e s + C p , E m e r g e n c y _ P u r c h a s e s + C p , S h i p p i n g + C p , E x p i r e d _ M e d i c a t i o n + C p , H o l d i n g + C p , S h o r t a g e + C p , B T _ T r a n s a c t i o n + C p , B T _ I n s t a l l a t i o n ] )
Min   TC = M i n p = 1 M [ r p × q p +   e p × eq p + ( eq p   +   q p   + s p )   ×   t p   +   ( ex p   +   tex p )   ×   exq p   +   h p   ×   i p + ( π p   +   o p )   ×   s p ]   +   G u   ×   g   ×   365 + s   ×   C s +   C fixed   +   ( C onboarding   ×   U   +   C mc   +   C mo )   ×   ( eq p   +   q p   +   S p )
Equation (2) specifies the objective function that seeks to minimise the producer costs that involve the following elements:
Cp,Regular_Purchases: The regular purchases cost is the cost of buying different types of medicines (rp × qp).
(1)
Cp,Emergency_Purchases: The emergency purchases cost, at a higher price than the regular cost, can be made to satisfy the demand if there is not a sufficient amount of drugs in the inventory or if some medicines are expired (ep × eqp).
(2)
Cp,Shipping: The shipping cost of medications from the producer site to the hospital site is ((eqp + qp + sp) × tp). During shipping, it is necessary to keep some medications in certain conditions (such as temperature, light, or humidity). Therefore, the transportation cost varies with each medication type, although the distance between the producer and the hospital is constant.
(3)
Cp,Expired_Medication: The expired medication cost involves the safe disposal fees for different types of expired medication at government sites and the cost of shipping from the hospital to the safe government disposal sites ( ( exp + texp) × exqp). Expired medications incur governmental penalties and environmental forfeits due to their negative impact on the environment.
(4)
Cp,Holding: The holding cost is the inventory cost of the different types of medications at the hospital site (hp × ip).
(5)
Cp,Shortage: The Shortage cost is the cost that producers pay to the hospital due to the shortage in the supply of different types of medications (unsatisfied demand) and due to the outsourced medication cost satisfied by another pharmaceutical producer ( ( πp + op) × sp).
(6)
Cp,BT_transaction: The BT transaction cost consists of gasCost (gasUsed × gasPrice) and Storage cost (Gu × g × 365 + s × Cs) [24,30,31,32]. Gu × g is the BT transaction cost per day, and s × Cs is the storage cost per year. A secured cloud-based warehouse stores the actual data off-chain. An IBM Cloud website was used to calculate the storage cost [29].
(7)
Cp,BT_Installation: The BT installation cost includes the Fixed cost (cfixed), Onboarding cost (conboarding), Maintenance cost (cmc), and Monitoring cost (cmo) (cfixed + (conboarding × U + cmc + cmo) × (eqp + qp + sp)) [24,33].
The most widely used consensus protocol in the public Blockchain is Proof-of-Work (PoW) (used in, e.g., Bitcoin and Ethereum) [34]. The cost of the Blockchain transaction consists of gasLimit and gasPrice. The gasLimit is the maximum amount of gas used to execute the transaction and is purchased from the sender’s account balance. At the end of the transaction, any unused gas is refunded to the sender’s account [24,32]. According to Wood [32], gasPrice (a scalar value) is the number of Wei to be paid for each unit of gas and consists of all computation costs incurred as a result of the execution of this transaction. The ETH Gas Station is a suitable place to calculate Gu × g and incentivise computation within the network [31,35]. The gWei includes the cost of a transaction on the Ethereum Blockchain as well as the cost of the transaction validators and the network. To convert gWei to USD, we can use the ETH Gas Station website based on the current price of Ethereum [35]. In Table 1, to calculate the Gu × g cost and convert the gWei cost to USD via the ETH Gas Station website, the gasUsed (as the total gas used in transactions) is 21,000 (a scalar value), and the range of the gasPrice is between 20 and 60 gWei. The range of Gu × g is between USD 1.31 and USD 3.94. This transaction cost was calculated using the ETH Gas Station website [35].
The cost of Blockchain installation consists of the Fixed cost, Onboarding cost, Maintenance cost, and Monitoring cost [24,33]. The Onboarding cost is the cost of training clients and suppliers to become active users of a service or product (training cost) and is the cost related to integrating new employees into a company to learn and train in BT. The cmc and cmo costs are 15–25% of the project value per year [33,36]. After designing the mathematical model, it is necessary to evaluate it statistically, which is discussed in the next section.

5. Results

This section comprehensively presents all experiments and the obtained results and shows the performance metrics of eight algorithms on the generated dataset in addition to the weights of the cost features of the BT-based PSC cost model. The numerical examples examined in this research validate the proposed methodology and the cost model and demonstrate the performance of the proposed approach. We designed and executed the proposed method using MATLAB software to validate the BT-based PSC cost model. We used the FA and ACO (EC algorithms) to improve the results of the KNN, DT, SVM, and NB SL algorithms. To reduce prediction errors and improve the SL results, this combination provides eight algorithms, namely, FA-KNN, FA-DT, FA-SVM, FA-NB, ACO-KNN, ACO-DT, ACO-SVM, and ACO-NB. The MSE, RMSE, MAE, and R2 performance metrics were used to evaluate the efficiency of the proposed algorithms. We also used the FW approach to estimate the influencing features for the generated dataset in the cost model. FW has an important role in analysis without changing the initial data content. The authors ran each algorithm (FA-KNN, FA-DT, FA-SVM, FA-NB, ACO-KNN, ACO-DT, ACO-SVM, and ACO-NB) for 10 runs (80 runs in total) with 1000 iterations. These algorithms helped us validate our results. This experiment assessed the ‘‘average’’ of the performance metrics and the ‘‘average’’ of the weight of the cost features to improve the reliability of all methods in eighty (4 × 10 + 4 × 10) runs. These averages are used because the runs can have various outcomes, and they can help us achieve stability and reliability in behavioural data. Each run was performed for, at most, 1000 iterations. Therefore, instead of comparing the predictions of the BT-based PSC cost model in each run, we compared the average of every 10 runs. This research finally used TRS to determine the most reliable predictive algorithms for the BT-based PSC cost model. Table 2, Table 3, Table 4, Table 5, Table 6, Table 7, Table 8 and Table 9 present the related results.

5.1. FA Combined with Four SL

In this step, the FA was combined with four SL algorithms (FA-KNN, FA-DT, FA-SVM, and FA-NB) and was run 40 times) each algorithm was executed 10 times) in 1000 iterations. Four performance metrics and the weights of eight cost features were evaluated for the BT-based PSC cost model in the 40 runs. Table 2 presents the 8 weights of the cost features and the 4 performance metrics for the 4 algorithms in 40 runs. The average values for each performance metric and each cost feature are presented in Table 2. Table 3 and Table 4 are derived from Table 2.
Table 2. Feature weighting and performance evaluation for FA combined with four SL algorithms in 10 runs for each.
Table 2. Feature weighting and performance evaluation for FA combined with four SL algorithms in 10 runs for each.
FA_
KNN
RunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.427610.703720.40190.778050.664410.543030.38980.755496,410,531.1812531.90271911.48670.93669
20.380970.406960.671080.071360.577760.966330.256490.715046,543,559.6362558.03821844.53670.95009
30.399180.227440.8819510.457450.609810.283450.496677,109,497.6392666.36411895.780.94165
40.484720.704270.117420.561760.603250.554160.727560.701654,820,413.6612195.5441861.42670.94784
50.442930.36279110.2784110.533270.666438,836,400.8622972.60842380.510.95397
60.571360.430840.0868560.802610.388240.619530.72070.569453,341,730.0691828.041563.430.96499
70.299710.255150.259570.568990.696760.899020.379480.581597,298,087.422701.49732191.10330.93169
80.519980.392270.112920.502070.309360.739010.257180.710754,636,038.3672153.14621730.30670.95829
90.380970.406960.671080.071360.577760.966330.256490.715046,543,559.6362558.03821844.53670.95009
100.399180.227440.8819510.457450.609810.283450.496677,109,497.6392666.36411895.780.94165
Average0.4306610.4117840.50847260.635620.5010850.7507030.4087870.6408786,264,931.6112483.154321911.889680.947695
FA_
DT
RunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.47321100.550290.0779320.9812900.7649920,931,129.394575.05513459.96210.78428
20.6810300.2235400.861190.380030.886030.6445235,006,777.185916.65254540.53660.71695
30.154660.56860.549320.76120.588960.7938100.222415,666,495.593958.09243054.38020.69542
40.696630.444510.602970.454210.664560.240650.2454019,723,753.674441.14333129.70530.82664
50.0192540.209550.566310.743430.661300.058820.9297830,911,374.485559.79994279.75940.70113
60.885110.144170.8037300.86510.454870.462280.685516,236,754.94029.48573445.52850.8264
70.47321100.550290.0779320.9812900.7649920,931,129.394575.05513459.96210.78428
80.6810300.2235400.861190.380030.886030.6445235,006,777.185916.65254540.53660.71695
90.154660.56860.549320.76120.588960.7938100.2224815,666,495.593958.09243054.38020.69542
100.696630.444510.602970.454210.664560.240650.2454019,723,753.674441.14333129.70530.82664
Average0.49154240.4379940.412170.4274830.59116840.5246430.2783960.48791822,980,444.114737.117223609.445630.757411
FA_
SVM
RunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.231750.22590.530190.000116580.989630.999710.373480.32717173,343.7829416.3457333.25850.99958
20.498270.12130.0443410.374690.0789560.0763380.840560.025916123,519.554351.4535295.53380.99944
30.231750.22590.530190.000116580.989630.999710.373480.32717173,343.7829416.3457333.25850.99958
40.498270.12130.0443410.374690.0789560.0763380.840560.025916123,519.554351.4535295.53380.99944
50.111450.182450.800710.915710.33370.278080.9990.1206187,904.5992433.4796345.02640.99973
60.0625050.138310.176540.838710.558520.436340.210790.072986133,984.4159366.0388304.28861
70.125350.107580.888760.999710.280390.381970.997970.11452217,426.4673466.2901349.16490.99967
80.0696190.117320.221290.891670.58060.53680.594390.075245170,350.8234412.7358348.60530.9999
90.0706160.150110.190640.954250.42040.925130.229810.082785119,067.7801345.062276.88730.99995
100.0388190.373210.0981690.30570.175060.191890.308620.044591167,675.474409.482356.80050.99984
Average0.19383990.1763380.35251710.5655363160.44858420.49023060.5768660.1216899159,013.6234396.86867323.835760.999713
FA_
NB
RunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.0325310.0417450.080320.98080.109750.1819510.0313380.027880.166970.12571
20.0104570.0159390.0337510.991430.0562180.058160.966780.0110130.0163160.127730.106971
30.0609310.0848560.182780.914420.594130.31980.300780.0633570.013690.1170.0962421
40.0609310.0848560.182780.914420.594130.31980.300780.0633570.013690.1170.0962421
50.0299860.0522860.0867250.999940.141850.241680.317560.0334750.0194940.139620.114911
60.0151730.0176650.0384830.999920.722790.0775460.223720.0144340.0351560.18750.136051
70.0325310.0417450.080320.98080.109750.1819510.0313380.027880.166970.12571
80.0104570.0159390.0337510.991430.0562180.058160.966780.0110130.0163160.127730.106971
90.0609310.0848560.182780.914420.594130.31980.300780.0633570.013690.1170.0962421
100.0325310.0417450.080320.98080.109750.1819510.0313380.027880.166970.12571
Average0.03464590.04816320.0982010.9668380.30887160.19407960.6377180.0354020.02119920.1434490.11307261
Table 3 compares the average of four performance evaluation metrics for each method. In Table 3, FA-DT has the weakest result in terms of all performance metrics (Ave-MSE = 22,980,444.11, Ave-RMSE = 5916.65, Ave-MAE = 3609.44), and R2 = 0.75, while FA-NB demonstrates robust behaviour with the best average R2 of 1 and a minimum Ave-MSE = 0.021, Ave-RMSE = 0.143, Ave-MAE = 0.113 among the four methods. Although the averages R2 values for all methods are acceptable (ranging from 0.757 to 1), the average MSE, RMSE, and MAE values for FA-KNN, FA-DT, and FA-SVM are the worst (ranging from 323.83 to 22,980,444.11). Therefore, the abovementioned metrics indicate that the FA-NB algorithm has better performance for the BT-based PSC cost model than the other proposed algorithms.
Table 3. Performance metrics evaluation for FA combined with four SL algorithms.
Table 3. Performance metrics evaluation for FA combined with four SL algorithms.
Performance Metrics
MethodsAve_MSEAve_RMSEAve_MAEAve_R2
FA_KNN6,264,931.6112483.154321911.889680.930856
FA_DT22,980,444.115916.65253609.445630.757411
FA_SVM159,013.6234396.86867323.835760.999713
FA_NB0.02119920.1434490.11307261
We also focused on the average of the weights of each cost feature via the FW approach for the FA-KNN, FA-DT, FA-SVM, and FA-NB algorithms, as shown in Table 4. Among these four methods, FA-NB provides the highest average weight for the expired medication cost feature (0.966) and the lowest average weight for the regular purchases cost feature (0.034). The second highest average weight is allocated to the BT installation cost feature by the FA-KNN algorithm (0.849) followed by the holding cost feature by the FA-DT algorithm (0.591). The BT installation cost feature also has the second lowest average weight with FA-SVM (0.121). In addition, the BT transaction cost fluctuates because it has the average weight of 0.576 and 0.278 for FA-SVM and FA-DT, respectively.
Table 4. FW criteria for FA combined with four SL algorithms.
Table 4. FW criteria for FA combined with four SL algorithms.
Feature Weighting
MethodsMax_Ave_WeightingMin_ Ave_Weighting
FA_KNNW_(BT_Installation) = 0.84988W_(Expired_Medication) = 0.34659
FA_DTW_(Holding) = 0.5911684W_(BT_Transaction) = 0.278396
FA_SVMW_(BT_Transaction) = 0.576866W_(BT_Installation) = 0.1216899
FA_NBW_(Expired_Medication) = 0.966838W_(Regular_Purchases) = 0.0346459

5.2. ACO Combined with Four SL Algorithms

In the second step, ACO combined with 4 SL algorithms (ACO -KNN, ACO -DT, ACO -SVM, and ACO -NB) was executed 40 times (each algorithm ran 10 times). Then, the 4 performance metrics and the weights of the 8 cost features were evaluated for the BT-based PSC cost model in the 40 runs. Table 5 illustrates 8 weightings of cost features and performance metrics for 4 algorithms in 10 runs for each algorithm. Table 5 presents the average values of the performance metrics and the average values of the weights of the cost features. Table 6 and Table 7 are also derived from the data presented in Table 5.
Table 5. FW and performance evaluation for ACO combined with four SL algorithms in 10 runs for each.
Table 5. FW and performance evaluation for ACO combined with four SL algorithms in 10 runs for each.
ACO_KNNRunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.356950.925490.8903910.0098560.189120.472270.999035,543,172.3642354.39431949.46330.92596
210.764340.4367600.8869710.3467114,695,227.882166.84751864.39330.95763
30.920090.625120.873410.7667500.867760.840054,692,998.9432166.33311818.20330.93246
40.703240.649350.478640.0217110.597720.265980.366320.457786,719,708.6262592.24012065.38330.89936
510.6679500.156660.7226510.555660.503756,928,149.8332632.13792027.50670.94752
60.488510.973330.6589200.3037110.4104416,221,307.4722494.25492142.38330.92888
70.454460.519320.493370.381960.9348510.342290.980864,565,317.9182136.66051844.12670.94594
80.363260.3437410.472940.912250.47440.205980.996514,394,372.3172096.27581794.39330.92742
90.520390.703820.07773800.6959700.92397110,706,939.153272.1462655.270.90856
100.408420.363880.218830.432670.9583310.612360.720775,833,434.9562415.25052088.69670.93483
Average0.6215320.6536340.51280480.34659410.67890560.592950.5103760.8498756,030,062.9462432.654062024.981990.930856
ACO_DTRunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.368310.6531300010.48975027,725,280.245265.48013918.3450.8088
20.78548100.35183000.966790.7807717,268,337.624155.51893358.66050.73903
30.391390.30496010000.07832920,862,939.194567.59673543.85080.78735
410.70817010.619450.153740.364630.3827614,114,845.233756.97293134.3790.79848
50.78077000.39765000122,127,369.334703.97383421.13340.77706
60.119780.4658610.43738010.86613015,853,729.53981.67422765.3020.83271
70.41597100010113,566,853.13683.32092900.37030.82056
80.729630.5107000.4432700.58999118,853,360.554342.04573218.80450.80982
90.3545300.503040.260060.29699000.1022319,129,278.44373.70313173.79690.83869
100.356280.3777100.8925400.4444500.2721223,472,850.574844.87883620.12480.8251
Average0.5302140.5020530.1503040.4339460.1359710.3598190.3277290.461620919297484.374367.516513305.476720.80376
RunFeature WeightingPerformance Metrics
W_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
ACO_SVM10.132050.179190.472130.997970.410980.534240.940920.16085214,988.4336463.6685372.70330.99976
20.982370.263470.970170.00542870.650090.995630.694190.52966118,716.9279344.5532291.89310.99858
30.363410.075290.441520.9420510.135310.353340.8421285,394.6138292.2236239.07610.99908
40.318390.218780.8019300.884180.863790.343680.59425178,499.0214422.4914337.95360.99957
50.224780.999940.909230.999960.680760.623220.281970.2350379,662.2247282.245220.61870.99944
60.18910.1810.81952110.629850.999130.17333282,360.1537531.3757415.76380.99926
70.150830.329860.404810.999840.969170.933840.986620.21624170,985.299413.5037343.6920.9999
80.976050.200160.475140.99650.989310.852580.998170.18793168,623.3192410.6377350.23680.99905
90.318390.218780.8019300.884180.863790.343680.59425178,499.0214422.4914337.95360.99957
100.224780.999940.909230.999960.680760.623220.281970.2350379,662.2247282.245220.61870.99944
Average0.3880150.3666410.7005610.694170870.8149430.7055470.6223670.376869155,739.1239386.54352313.050970.999365
RunFeature WeightingPerformance Metrics
ACO_NBW_(Regular_Purchases)W_(Emergency_Purchases)W_(Shipping)W_(Expired_Medication)W_(Holding)W_(Shortage)W_(BT_Transaction)W_(BT_Installation)MSERMSEMAER2
10.0662660.0834970.1618810.221710.367690.999740.0646890.0289750.170220.128331
20.0516760.0765680.1573910.398030.2580610.0550180.0168520.129810.105611
30.0367120.0559010.0837550.9998310.189250.191810.0417790.014590.120790.0929241
40.0520310.06550.112170.999770.321180.202820.999530.0476360.0207840.144170.121461
50.0662660.0834970.1618810.221710.367690.999740.0646890.0289750.170220.128331
60.0516760.0765680.1573910.398030.2580610.0550180.0168520.129810.105611
70.0367120.0559010.0837550.9998310.189250.191810.0417790.014590.120790.0929241
80.0520310.06550.112170.999770.321180.202820.999530.0476360.0207840.144170.121461
90.057250.072260.1489110.441110.528770.0586020.0329840.181610.13561
100.0459110.0802620.132010.999840.230870.3364910.0537640.0126420.112440.0984231
Average0.05165310.07154540.131130.9999040.5112710.2813240.7910930.0530610.02080280.1424030.11306711
Table 6 presents the average values of the performance metrics considered in the evaluation of the proposed ACO combined with four SL algorithms. In Table 6, ACO-DT has the weakest result in terms of all performance metrics (Ave-MSE = 19,297,484.37, Ave-RMSE = 4367.51, Ave-MAE = 3305.47, and R2 = 0.80), while ACO-NB performs better with an average R2 of 1 and minimums of Ave-MSE = 0.020, Ave-RMSE = 0.142, and Ave-MAE = 0.113 among the four methods. Although the average R2 values are acceptable for all methods (ranging from 0.803 to 1), the average MSE, RMSE, and MAE values for ACO-KNN, ACO-DT, and ACO-SVM are the worst (ranging from 313.05 to 19,297,484.37). Therefore, we consider the ACO-NB results to be more reliable than the results obtained by the other proposed methods, all of which are shown in the following table.
Table 6. Performance metrics evaluation for ACO combined with four SL algorithms.
Table 6. Performance metrics evaluation for ACO combined with four SL algorithms.
Feature Weighting
MethodsAve_MSEAve_RMSEAve_ MAEAve_R2
ACO_KNN6,030,062.9462432.654062024.981990.930856
ACO_DT19,297,484.374367.516513305.476720.80376
ACO_SVM155,739.1239386.54352313.050970.999365
ACO_NB0.02080280.1424030.11306711
We also investigated the average weights of each cost feature via the FW approach in the ACO-KNN, ACO-DT, ACO-SVM, and ACO-NB algorithms (see Table 7). For the ACO-NB algorithm, the maximum average weight of 0.999 is allocated to the expired medication cost feature, and the minimum average weight of 0.051 is obtained for the regular purchases cost feature. For the ACO-KNN algorithm, the second maximum average weight is obtained for the BT installation cost feature (0.849) followed by the holding cost feature (0.814) obtained by ACO-SVM (see Table 7). Accordingly, the ACO-SVM algorithm gives the second lowest average weight of 0.135 for the holding cost feature. In addition, a variation in the average weight of the expired medication cost feature is observed, which is obtained as 0.346 and 0.999 by ACO-KNN and ACO-NB, respectively. Similarly, the holding cost feature weight varies as 0.135 and 0.814 obtained by ACO-DT and ACO-SVM, respectively.
Table 7. FW criteria for ACO combined with four SL algorithms.
Table 7. FW criteria for ACO combined with four SL algorithms.
Feature Weighting
MethodsMax_Ave_WeightingMin_Ave_Weighting
ACO_KNNW_(BT_Installation) = 0.849875W_(Expired_Medication) = 0.3465941
ACO_DTW_(Regular_Purchases) = 0.530214W_(Holding) = 0.135971
ACO_SVMW_(Holding) = 0.814943W_(Emergency_Purchases) = 0.366641
ACO_NBW_(Expired_Medication) = 0.999904W_(Regular_Purchases) = 0.0516531

5.3. Determining Reliable Algorithms for BT-Based PSC Cost Model

Table 8 summarises the TRS of eight algorithms based on the obtained Ave-MSE, Ave-RMSE, Ave-MAE, and Ave-R2 values. In TRS, the lowest Ave-MSE, Ave-RMSE, and Ave-MAE receive the highest scores, and the highest Ave-R2 obtains the highest score (and vice versa). Overall, the ACO-NB algorithm outperforms the other compared algorithms and achieves the first position among all the algorithms (a TRS of 32), followed by FA-NB with a TRS of 29, ACO-SVM with a TRS of 24, and FA-SVM with a TRS of 22. FA-DT and ACO-DT achieve the worst scores of 6 (rank 8th) and 10 (rank 7th), respectively.
Table 8. Ranking of eight selected algorithms based on TRS scores using performance metrics.
Table 8. Ranking of eight selected algorithms based on TRS scores using performance metrics.
Performance Metrics
MethodsAve_MSEAve_RMSEAve_MAEAve_R2TRSRank
FA_KNN6,264,931.6112483.154321911.889680.930856
FA_DT22,980,444.115916.65253609.445630.757411
FA_SVM159,013.6234396.86867323.835760.999713
FA_NB0.02119920.1434490.11307261
ACO_KNN6,030,062.9462432.654062024.981990.930856
ACO_DT19,297,484.374367.516513305.476720.80376
ACO_SVM155,739.1239386.54352313.050970.999365
ACO_NB0.02080280.1424030.11306711
Ranking ScoreFA_KNN3345156
FA_DT111368
FA_SVM5557224
FA_NB7778292
ACO_KNN4435165
ACO_DT2224107
ACO_SVM6666243
ACO_NB8888321
Table 9 shows the ranking of the average weights of eight costs features, namely, regular purchases, emergency purchases, shipping, expired medication, holding, shortage, BT transaction, and BT installation for the FA-KNN, FA-DT, FA-SVM, FA-NB, ACO-KNN, ACO-DT, ACO-SVM, and ACO-NB algorithms. This table assigns an appropriate weight to each cost feature to show their importance. The average weight with higher TRS receives a higher priority in the TRS process (and vice versa). Overall, the Shortage cost obtains the best TRS of 46 for the average weight followed by the holding cost with a TRS of 45 and the expired medication cost with a TRS of 43. The minimum TRS for the average weight is allocated to the emergency purchases cost (TRS = 28) followed by both the regular purchases cost and the shipping cost, which have the same rank of 5th with TRS = 30.
Table 9. FW ranking based on TRS scores for eight selected algorithms.
Table 9. FW ranking based on TRS scores for eight selected algorithms.
Methods
Ave_FWFA_
KNN
FA_DTFA_
SVM
FA_NBACO_
KNN
ACO_
DT
ACO_
SVM
ACO_
NB
TRSRank
W_(Regular_Purchases)0.4306610.49154240.19383990.03464590.6215320.5302140.3880150.0516531
W_(Emergency_Purchases)0.4117840.4379940.1763380.04816320.6536340.5020530.3666410.0715454
W_(Shipping)0.50847260.412170.35251710.0982010.51280480.1503040.7005610.13113
W_(Expired_Medication)0.635620.4274830.565536310.9668380.34659410.4339460.694170870.999904
W_(Holding)0.5010850.59116840.44858420.30887160.67890560.1359710.8149430.511271
W_(Shortage)0.7507030.5246430.49023060.19407960.592950.3598190.7055470.281324
W_(BT_Transaction)0.4087870.2783960.5768660.6377180.5103760.3277290.6223670.791093
W_(BT_Installation)0.6408780.4879180.12168990.0354020.8498750.46162090.3768690.053061
Ranking ScoreW_(Regular_Purchases)36315831305
W_(Emergency_Purchases)24236713286
W_(Shipping)52443264305
W_(Expired_Medication)63781558433
W_(Holding)48567186452
W_(Shortage)87654475461
W_(BT_Transaction)11872347334
W_(BT_Installation)75128622334

6. Discussion

This section discusses the results for the proposed eight algorithms that minimise the prediction errors of the BT-based PSC cost model. The proposed algorithms aim to answer the three research questions mentioned in the Introduction. As stated in Section 3, there are eight components in the BT-based PSC cost model: the regular purchases cost, the emergency purchases cost, the shipping cost, the expired medication cost, the holding cost, the shortage cost, the BT transaction cost, and the BT installation cost. This provides the answer to our first research question. Regarding the second research question, among the eight examined algorithms, we select some algorithms that demonstrate better performance in minimising the prediction errors of the BT-based PSC cost model. Figure 2 is derived from the data presented in Table 8 and shows the TRSs for all eight studied algorithms (FA-KNN, FA-DT, FA-SVM, FA-NB, ACO-KNN, ACO-DT, ACO-SVM, and ACO-NB). Four performance metrics (MSE, RMSE, MAE, and R2) evaluate the algorithm efficiency. The overall results of this study show that both ACO-NB (first position) and FA-NB (second position) algorithms outperform the other algorithms. This means that NB combined with either the FA or ACO is considered to be the most effective in performing the regression. The EC algorithms (FA and ACO) also play a significant role in optimizing the hyperparameters of the selected SL algorithms. Moreover, the SVM algorithm, combined with the FA and ACO, shows the second position with respect to TRS. The DT algorithm, combined with the FA and ACO, cannot predict the costs of the BT-based PSC model well. Thus, we have determined the most reliable predictive algorithms for our cost model using TRS and the four performance metrics.
The third research question is to determine the significant cost components of the model. The FW approach measures the importance of the features and assigns an appropriate weight to each feature. Figure 3 is derived from Table 9 and shows the TRS for the weights of all cost components (features) of the BT-based PSC model. These weights estimate the degree of relevance that each feature has for extracting the cost prediction. The shortage cost, the holding cost, and the expired medication cost strongly influence the cost model. The remaining five cost features have relatively the same position based on the weights (regular purchases cost, emergency purchases cost, shipping cost, BT transaction cost, and BT installation cost).
Compared to previous work that modelled the costs of PSC, this study developed a new mathematical cost model called the BT-based PSC that includes BT transaction cost and BT installation cost. Researchers in other fields can also use BT cost formulation (BT transaction cost and BT installation cost) in their mathematical models to estimate the SC costs integrated with BT. The practical significance of the study lies in providing the most reliable predictive algorithms for the BT-based PSC cost model, the cost components of the model, the degree of relevance of each component to the cost model, and the components of BT in the SC model.
However, similarly to the other studies, this research is subject to two limitations. The first significant limitation is the inaccessibility of real data because the use of BT in the PSC is a new area of research. Therefore, the eight algorithms studied in this work applied the generated data to validate the proposed BT-based PSC cost model. The use of generated data, rather than real data, may influence the outcomes and conclusions of the research. The second limitation is related to the model design. To design the cost model, we first selected some parts of the mathematical model inspired by other papers and then added BT costs. This means that the study may not cover some cost components of a real case, hindering the comparison of the results of this research with the results of other studies.
Finally, future research may extend the BT-based PSC cost model to a multi-objective model, which, for example, includes the uncertainty demand in PSCs. Another promising direction is to use other EC algorithms to enhance the performance of the SL algorithms or to test different SL algorithms to predict costs. The use of Feature Selection (instead of FW) is another direction that should be investigated to compare the optimization process of this model. The last potential future research direction is to determine the cost components of the private BT and formulate the private BT instead of the public BT used in the current study.

7. Conclusions

The BT-based PSC enables traceability and transparency of the drugs’ movement and of the stakeholders in the supply chain and can affect medication quality and final patient outcomes. This paper presents a mathematical cost model for a BT-based PSC system to estimate the costs of the model. This study is important because it provides a PSC system with BT (BT transaction cost and BT installation cost) that can improve the safety, performance, and transparency of medical information sharing in a healthcare system. One of the main contributions of this research is to formulate this cost problem and apply a combination of EC (ACO and FA) and SL (KNN, DT, SVM, and NB) algorithms and use four performance metrics (MSE, RMSE, MAE, and R2) to evaluate the efficiency of the proposed algorithms. This combination of EC and SL algorithms provides eight algorithms (as the regression producer), reducing prediction errors and improving the SL results. Overall, the ACO-NB and FA-NB algorithms outperform the other six algorithms in estimating the costs of the model with lower errors. ACO-DT and FA-DT show the worst performance for this comparison, showing that the DT algorithm is not an appropriate predictive approach for the current cost model. The findings also show that the shortage cost, the holding cost, and the expired medication cost strongly influence the cost model more than the other cost components that have almost the same effect on the model (regular purchases cost, emergency purchases cost, shipping cost, BT tansaction cost, and BT installation cost). This selection of components is derived from the allocation of an appropriate weight to each cost feature to show their importance using the FW approach. Therefore, the statistical outcomes on the generated dataset show that some of the proposed algorithms can obtain satisfactory results and assign appropriate feature weights. In the real world, managers in the field of healthcare services can use this model practically to control financial resources, stay within the budget, analyse information, and identify unnecessary costs, particularly if they decide to use BT in the system. The important contribution of this research is to provide a PSC system with BT. Selected SL algorithms can also help managers estimate costs with the minimum prediction errors and correctly decide whether the new system benefits their organisation. As the cost factor is important to managers, this study also determines and measures the importance of each cost component of the BT-based PSC model.

Author Contributions

Conceptualization, H.H. and T.-M.D.; methodology, H.H., T.W. and T.-M.D.; software, H.H. and T.W.; validation, H.H. and T.-M.D.; formal analysis, H.H. and T.W.; investigation, H.H.; resources, H.H.; data curation, H.H. and T.W.; writing—original draft preparation, H.H.; writing—review and editing, H.H., T.W. and T.-M.D.; visualization, T.-M.D.; supervision, T.W. and T.-M.D.; project administration, T.-M.D.; funding acquisition, T.-M.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

The data that support the findings of this study are openly available in https://data.mendeley.com/datasets/jxv5jrydnc (accessed on 1 November 2022). This sentence was stated in Section 3 of the manuscript to show data availability: “5000 series of the generated raw data for all eight components of the BT-based PSC cost model and the total cost were uploaded to https://data.mendeley.com/datasets/jxv5jrydnc (accessed on 1 November 2022)”.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Ahmadi, A.; Mousazadeh, M.; Torabi, S.A.; Pishvaee, M.S. OR Applications in Pharmaceutical Supply Chain Management. Oper. Res. Appl. Health Care Manag. 2017, 262, 461–491. [Google Scholar]
  2. Chircu, A.; Sultanow, E.; Saraswat, S.P. Healthcare RFID In Germany: An Integrated Pharmaceutical Supply Chain Perspective. J. Appl. Bus. Res. 2014, 30, 737–752. [Google Scholar] [CrossRef]
  3. Haq, I.; Esuka, O. Blockchain Technology in Pharmaceutical Industry to Prevent Counterfeit Drugs. Int. J. Comput. Appl. 2018, 180, 8–12. [Google Scholar] [CrossRef]
  4. Boulos, M.N.K.; Wilson, J.T.; Clauson, K.A. Geospatial blockchain: Promises, challenges, and scenarios in health and healthcare. Int. J. Health Geogr. 2018, 17, 25. [Google Scholar] [CrossRef] [PubMed]
  5. Salehi, V.; Salehi, R.; Mirzayi, M.; Akhavizadegan, F. Performance optimization of pharmaceutical supply chain by a unique resilience engineering and fuzzy mathematical framework. Hum. Factors Ergon. Manuf. Serv. Ind. 2020, 30, 336–348. [Google Scholar] [CrossRef]
  6. Uthayakumar, R.; Priyan, S. Pharmaceutical supply chain and inventory management strategies: Optimization for a pharmaceutical company and a hospital. Oper. Res. Health Care 2013, 2, 52–64. [Google Scholar] [CrossRef]
  7. Goodarzian, F.; Hosseini-Nasab, H.; Muñuzuri, J.; Fakhrzad, M.-B. A multi-objective pharmaceutical supply chain network based on a robust fuzzy model: A comparison of meta-heuristics. Appl. Soft Comput. 2020, 92, 106331. [Google Scholar] [CrossRef]
  8. Bamakan, S.M.H.; Moghaddam, S.G.; Manshadi, S.D. Blockchain-enabled pharmaceutical cold chain: Applications, key challenges, and future trends. J. Clean. Prod. 2021, 302, 127021. [Google Scholar] [CrossRef]
  9. Zahiri, B.; Jula, P.; Tavakkoli-Moghaddam, R. Design of a pharmaceutical supply chain network under uncertainty considering perishability and substitutability of products. Inf. Sci. 2018, 423, 257–283. [Google Scholar] [CrossRef]
  10. Haleem, A.; Javaid, M.; Singh, R.P.; Suman, R.; Rab, S. Blockchain technology applications in healthcare: An overview. Int. J. Intell. Netw. 2021, 2, 130–139. [Google Scholar] [CrossRef]
  11. Badhotiya, G.K.; Sharma, V.P.; Prakash, S.; Kalluri, V.; Singh, R. Investigation and assessment of blockchain technology adoption in the pharmaceutical supply chain. Mater. Today Proc. 2021, 46, 10776–10780. [Google Scholar] [CrossRef]
  12. Zhang, W.; Gu, X.; Tang, L.; Yin, Y.; Liu, D.; Zhang, Y. Application of machine learning, deep learning and optimization algorithms in geoengineering and geoscience: Comprehensive review and future challenge. Gondwana Res. 2022, 109, 1–17. [Google Scholar] [CrossRef]
  13. Glover, F.; Sörensen, K. Metaheuristics. Scholarpedia 2015, 10, 6532. [Google Scholar] [CrossRef]
  14. Yusta, S.C. Different metaheuristic strategies to solve the feature selection problem. Pattern Recognit. Lett. 2009, 30, 525–534. [Google Scholar] [CrossRef]
  15. Garg, P. A Comparison between Memetic algorithm and Genetic algorithm forthe cryptanalysis of Simplified Data Encryption Standard algorithm. Int. J. Netw. Secur. Its Appl. 2009, 1, 34–42. [Google Scholar]
  16. Abd-Alsabour, N. Investigating the effect of fixing the subset length on the performance of ant colony optimization for feature selection for supervised learning. Comput. Electr. Eng. 2015, 45, 1–9. [Google Scholar] [CrossRef]
  17. Rojas-Morales, N.; Riff, M.-C.; Neveu, B. Learning and focusing strategies to improve ACO that solves CSP. Eng. Appl. Artif. Intell. 2021, 105, 104408. [Google Scholar] [CrossRef]
  18. Nourelfath, M.; Nahas, N.; Montreuil, B. Coupling ant colony optimization and the extended great deluge algorithm for the discrete facility layout problem. Eng. Optim. 2007, 39, 953–968. [Google Scholar] [CrossRef]
  19. Mashhour, E.M.; Houby, E.M.; TawfikWassif, K.; IbrahimSalah, A. A Novel Classifier based on Firefly Algorithm. J. King Saud Univ.—Comput. Inf. Sci. 2020, 32, 1173–1181. [Google Scholar] [CrossRef]
  20. Almomany, A.; Ayyad, W.R.; Jarrah, A. Optimized implementation of an improved KNN classification algorithm using Intel FPGA platform: Covid-19 case study. J. King Saud Univ.—Comput. Inf. Sci. 2022, 34, 3815–3827. [Google Scholar] [CrossRef]
  21. Kushwah, J.S.; Kumar, A.; Patel, S.; Soni, R.; Gawande, A.; Gupta, S. Comparative study of regressor and classifier with decision tree using modern tools. Mater. Today Proc. 2022, 56, 3571–3576. [Google Scholar] [CrossRef]
  22. Ren, Q.; Zhang, H.; Zhang, D.; Zhao, X.; Yan, L.; Rui, J.; Zeng, F.; Zhu, X. A framework of active learning and semi-supervised learning for lithology identification based on improved naive Bayes. Expert Syst. Appl. 2022, 202, 117278. [Google Scholar] [CrossRef]
  23. Zhang, H.; Shi, Y.; Yang, X.; Zhou, R. A firefly algorithm modified support vector machine for the credit risk assessment of supply chain finance. Res. Int. Bus. Financ. 2021, 58, 101482. [Google Scholar] [CrossRef]
  24. Havaeji, H.; Dao, T.-M.; Wong, T. The Generated Raw Data to Evaluate the Blockchain-Based Pharmaceutical Supply Chain Cost Model. 1 November 2022. Available online: https://data.mendeley.com/datasets/jxv5jrydnc (accessed on 1 November 2022).
  25. Al-Zoubi, A.M.; Hassonah, M.A.; Heidari, A.A.; Faris, H.; Mafarja, M.; Aljarah, I. Evolutionary competitive swarm exploring optimal support vector machines and feature weighting. Methodol. Appl. 2021, 25, 3335–3352. [Google Scholar] [CrossRef]
  26. Weraikat, D.; Zanjani, M.K.; Lehoux, N. Improving sustainability in a two-level pharmaceutical supply chain through Vendor-Managed Inventory system. Oper. Res. Health Care 2019, 21, 44–55. [Google Scholar] [CrossRef]
  27. Franco, C.; Alfonso-Lizarazo, E. Optimization under uncertainty of the pharmaceutical supply chain in hospitals. Comput. Chem. Eng. 2020, 135, 106689. [Google Scholar] [CrossRef]
  28. Havaeji, H.; Dao, T.-M.; Wong, T. Cost Optimization of Blockchain Technology-enabled Supply Chain System using Evolutionary Computation Approaches: A Healthcare Case Study. World Wide J. Multidiscip. Res. Dev. 2022, 8, 27–42. [Google Scholar]
  29. IBM. IBM Cloud. 2022. Available online: https://cloud.ibm.com/objectstorage/create#pricing (accessed on 1 February 2022).
  30. Longo, F.; Nicoletti, L.; Padovano, A.; d’Atri, G.; Forte, M. Blockchain-enabled supply chain: An experimental study. Comput. Ind. Eng. 2019, 136, 57–69. [Google Scholar] [CrossRef]
  31. Jabbar, A.; Dani, S. Investigating the link between transaction and computational costs in a blockchain environment. Int. J. Prod. Res. 2020, 58, 3423–3436. [Google Scholar] [CrossRef]
  32. Wood, G. Ethereum: A Secure Decentralised Generalised Transaction Ledger. Ethereum Proj. Yellow Pap. 2014, 151, 1–32. [Google Scholar]
  33. Gopalakrishnan, P.K.; Hall, J.; Behdad, S. Cost analysis and optimization of Blockchain-based solid waste management traceability system. Waste Manag. 2021, 120, 594–607. [Google Scholar] [CrossRef] [PubMed]
  34. Wang, X.; Ni, W.; Zha, X.; Yu, G.; Liu, R.P.; Georgalas, N.; Reeves, A. Capacity analysis of public blockchain. Comput. Commun. 2021, 177, 112–124. [Google Scholar] [CrossRef]
  35. ETH Gas Station. 2022. Available online: https://ethgasstation.info/ (accessed on 1 February 2022).
  36. Takyar, A. How to Determine the Cost of Blockchain Implementation? 2021. Available online: https://www.leewayhertz.com/cost-of-blockchain-implementation/ (accessed on 1 November 2022).
Figure 1. Flowchart of the methodology.
Figure 1. Flowchart of the methodology.
Mathematics 11 02021 g001
Figure 2. TRS of all algorithms.
Figure 2. TRS of all algorithms.
Mathematics 11 02021 g002
Figure 3. TRS for feature weights.
Figure 3. TRS for feature weights.
Mathematics 11 02021 g003
Table 1. Parameters, variables, and constraints of the BT-based PSC cost model.
Table 1. Parameters, variables, and constraints of the BT-based PSC cost model.
ParametersExplanationConstraints
MNumber of medication types in the PSC variables (p = 1, 2, 3, …, M)45
UThe number of Blockchain users4
CsCost of storage per year (USD/TB) for public outbound bandwidth service [29]USD 20 × 12 = 2400 USD/yr
GuThe amount of ether gasUsed per dayUSD 1.31 ≤ Gu × g ≤ USD 3.94
gNumber of gWei to be paid for gasUsed per day
ipInventory level of medication type p25 ≤ ip ≤ 140 (integer)
qpOrder quantity for the pth medicine product per year10 ≤ qp ≤ 100 (integer)
sThe storage size to store the data2 TB/yr ≤ s ≤ 5 TB/yr
eqpNumber of lots of medicine types p purchased in case of emergency1 ≤ eqp ≤ 40 (integer)
exqpQuantity of expired medication type p sent to the government disposal site1 ≤ exqp ≤ 25 (integer)
spShortage quantity of medication type p that is needed to be outsourced1 ≤ sp ≤ 40 (integer)
rpRegular cost of medicine type p (USD)15 ≤ rp ≤ 250
rpep
epEmergency cost of medicine type p (USD)20 ≤ ep ≤ 300
tpShipping cost of medication type p shipped to the hospital (USD) 5 ≤ tp ≤ 35
expCosts obligated by governments for each unit of medication type p disposed at their sites (USD)5 ≤ exp ≤ 10
texpShipping cost of expired medication type p sent to the government disposal site (USD)1 ≤ texp ≤ 15
hpHolding cost of medication type p at the hospital site (USD)20 ≤ hp ≤ 30
πpPenalty that the producer pays to the hospital for each unit of shortage in the supply of medication type p (USD)10 ≤ πp ≤ 20
opCost of outsourced medication type p that the producer could not satisfy (USD)12 ≤ op ≤ 18
cfixedThe initial fixed cost per year580 ≤ cfixed ≤ 680
conboardingThe onboarding costUSD 20 ≤ conboarding ≤ USD 28
cmcThe unit maintenance cost; cmc + cmo is 15%–25% of the project valueUSD 15 ≤ cmc + cmo ≤ USD 25
cmoThe unit monitoring cost; cmc + cmo is 15%–25% of the project value
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Havaeji, H.; Dao, T.-M.; Wong, T. Supervised Learning by Evolutionary Computation Tuning: An Application to Blockchain-Based Pharmaceutical Supply Chain Cost Model. Mathematics 2023, 11, 2021. https://doi.org/10.3390/math11092021

AMA Style

Havaeji H, Dao T-M, Wong T. Supervised Learning by Evolutionary Computation Tuning: An Application to Blockchain-Based Pharmaceutical Supply Chain Cost Model. Mathematics. 2023; 11(9):2021. https://doi.org/10.3390/math11092021

Chicago/Turabian Style

Havaeji, Hossein, Thien-My Dao, and Tony Wong. 2023. "Supervised Learning by Evolutionary Computation Tuning: An Application to Blockchain-Based Pharmaceutical Supply Chain Cost Model" Mathematics 11, no. 9: 2021. https://doi.org/10.3390/math11092021

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop