1. Introduction
A supply chain (SC) is a collection of organizations’ operations that are linked and interact, both directly and indirectly, to transform inputs into outputs that are provided to the ultimate customer [
1]. The field of supply chain management concerns itself with the collaboration and coordination of several stakeholders to optimize the flow of goods, information, and finance along the entire SC [
2]. The financial supply chain differs from the physical supply chain in that it deals with the flow of cash rather than commodities and is in the opposite direction, according to Popa (2013) [
3]. Financial flows along the SC form an essential part of the continuum of the business operation [
4,
5], and despite the potential of supply chain management (SCM), relatively few companies utilize interorganizational network settings to drive financial performance in a collaborative way [
6].
Caldentey and Chen [
7] believed that planning, managing, and controlling financial flows along SCs positively affect SC profitability. In recent decades, the frameworks of financial supply chain management (FSCM) and supply chain finance (SCF) have advanced dramatically.
Supply chain finance strives to coordinate several stakeholders in order to optimize the flow of cash and internal and external funding along the supply chain. This paper aims to maximize the profitability of the entire whole supply chain from a regulatory point of view.
The rest of the paper is organized as follows: In
Section 2, we provide a review of related literature. In
Section 3, a discrete-time linear dynamical model of the entities within the supply chain network is proposed. In
Section 4, a constrained finite-horizon LQR problem is presented to demonstrate how decision variables are optimally obtained to reach the specified goals with given priorities implemented in the cost function. In
Section 5, the results of numerical simulation which validate our results are shown. A brief conclusion is rendered in
Section 6.
2. Literature Review
In recent years, academics have paid increased attention to the issue of integration and coordination among SCF members. FSCM is commonly referred to in the literature as programs developed to focus on supplier–buyer relationships and manage the financial flow throughout a SC; a flow that is in the opposite direction of physical and informational flows [
3,
8,
9]. SCF includes a range of financial instruments implemented to finance members of a supply chain and is generally considered a subset of FSCM [
10].
As Hofmann et al. [
11] stated, traditionally, the majority of SC actors are preoccupied with their own concerns and objectives, failing to coordinate financially with their SC partners. The lack of a shared vision among SC partners is a big impediment for SCF [
4]. FSCM thus requires collaboration beyond the firm’s borders with service providers (such as banks), suppliers, and customers [
8]. The working capital state of a firm is affected by the firm’s operational environment and its SC dynamics. Monto et al. [
12] stated that with efficient working capital management, a company could reduce the need for finance, free up cash, increase profitability, improve liquidity, increase the efficiency of operations, and decrease (financing) costs. However, Kouvelis and Zhao [
13] pointed out that to achieve a successful FSCM plan and its advantages, active working capital coordination among SC parties is needed.
There are two widely used types of financing a firm can accept: Bank Credit Financing (BCF) and Trade Credit Financing (TCF) [
14,
15]. The former refers to short-term financing provided by an external financier regarding the scope of a SC. The latter is used when short-term loans are acquired by extending payment terms between exchange partners within the SC. It is harder for small and medium enterprises (SMEs) to receive financing from banks due to their high levels of credit risk; thus, trade credit is considered as an alternative for financing SMEs. TCF has been proven to be more effective than BCF only when the production costs are relatively low [
16].
The measure of the cash conversion cycle (CCC) was introduced by Richards and Laughlin [
17] to evaluate and control a firm’s working capital management efficiency. CCC is composed of the cycle time of inventories (DIO), cycle time of accounts receivable (DRO), and the cycle time of accounts payable (DPO). Deloof [
18] suggested that shortening the CCC of a firm is positively correlated with its profitability. According to Hofmann and Kotzab [
19], improving a firm’s CCC (extending payables period and shortening receivables period) might be achieved at the expense of its SC partners; therefore, they proposed a measure named collaborative cash conversion cycle (CCCC), which is the sum of CCC of all exchange partners.
Numerous techniques have been applied to optimize the working capital within the SC [
19,
20,
21,
22,
23,
24]. There are numerous methods to model SC management problems; some of the most popular include linear programming [
25], non-linear programming, multi-objective programming [
26], and stochastic programming [
26] to account for uncertainty [
27]. These techniques have been demonstrated to be the most effective solutions for solving quantitative decision-making difficulties while still providing an optimal solution. Furthermore, constraints handling is straightforward and limited resources can be allocated based on the criteria formulated in the given cost function.
Linear programming is a popular approach in different areas of SC optimization. It is an efficient tool to model and analyze investment, production planning, and scheduling [
28,
29]. Sahay and Ierapetritou [
30] introduced an optimization model based on linear programming to manage the orders among different entities inside a SC. Recursive discrete-time equations are used to capture the dynamic nature of the inventory levels. Park and Kim [
31] proposed a framework to study operational planning of global SCs by formulating a linear programming problem considering the production lot sizes, order quantity, and ordering decisions of distribution centers (DCs). Kirilmaz and Erol [
32] presented a five-level approach based on a linear programming model to build a robust SC in terms of supply-side risk criteria, such as disruption or delay in supplier production. Non-linear programming has the capability to model more detailed aspects of the SCs, as well as non-linear constraints. Tabrizi and Razmi [
33] proposed the mixed-integer non-linear fuzzy approach to model the SC network by considering uncertainty in the supply, demand, and processing sides. In order to account for demand unpredictability, Applequist et al. [
34] solved a mixed-integer non-linear programming problem in which the risk premium is offered as a metric to explicitly incorporate economic risk into the model. Stochastic programming is another optimization framework that is specifically developed to solve problems involving uncertainty. Carneiro et al. [
35] applied a stochastic programming model to maximize the expected net present value. Sodhi [
36] employed stochastic modeling in the tactical SC planning to determine future shipments to customers over the pre-defined horizon. Klibi and Martel [
37] proposed several stochastic programming approaches to design resilient supply networks in the location–transportation context. In some cases, there may be contradictory objectives to address in SC management optimization models. Trisna et al. [
26] conducted a literature review in the area of multi-objective programming for SC management. Among these studies, D’Amore and Bezzo [
38] combined this idea with other programming techniques. They proposed a general multi-objective mixed-integer linear programming framework in order to determine the strategic planning decisions and design of a SC in a bioethanol context.
Although using optimization for modeling SC management and risk mitigation is common, most of these methods do not consider the dynamical nature of the system under study [
32,
38,
39]. Some researchers have focused on how to consider the dynamical behaviors of entities in the model. Petri Nets are mathematical modeling frameworks with a formal graphical structure for analyzing concurrent distributed systems. Kabir and Papadopoulos [
40] monitored dynamical changes of each node in SC systems. Timed Petri Net is one of the main extensions used for conducting risk analysis and real-time control of SC networks (Tuncel and Apan [
41]). The colored Petri Nets are another extension capable of describing information classification based on defining various colorful tags. Liu et al. [
42] implemented this method to classify information referring to environment, product quality, and processing data for a perishable SC network in which time is a crucial factor.
Regarding the topology of the SC understudy, Nair and Vidal [
43] pointed out that the topology of a supply network has a significant impact on its risk and resilience. Different types of networks have been investigated in the literature. The choice of topology is related to the purpose of the study. Two-level SCs are popular among game-theoretical approaches to study buyer–supplier relationships to determine decisions about wholesaling price, order quantity, credit term, and financing rates [
44,
45,
46,
47]. Complex topologies are examined to investigate SC finance design [
48].
The study of default propagation within the SC is another research stream that is utilized by complex SCs. Zhao et al. [
49] took advantage of a complex network to investigate credit risk diffusion within a SC. Some approaches considered a dynamic network topology to monitor the effect of different strategies implemented by SC entities on the network topology [
50,
51].
While optimization models could solve different SC-related problems, simulation is an efficient computer-based tool for evaluating behaviors of various entities in a SC network, from the conceptual development phase to SC usage in real-life scenarios [
27,
52,
53]. Simulation techniques are divided into four categories: Wu et al. [
54] conducted agent-based simulation to study stock-out effects in the two-stage SC. Elleuch et al. [
55] applied discrete-event simulation to experiment with risk impacts and mitigation policies in a pharmacy. System Dynamic was proposed by Forrester [
56] to model highly complex systems subject to great levels of uncertainty. For example, a System Dynamic model was introduced to analyze the behavior of the post-seismic SC through simulation [
57]. Liew and Lee [
58] performed a Monte Carlo simulation to quantify marine/offshore industry risks. The same approach for testing the resilience of SC management models toward risk is adopted by Olson and Wu [
59].
An optimization model is derived as a modeling framework in the optimization methodologies for SC management. Optimization problems, in general, have a static nature, and the performance of the proposed optimization models is studied under specific simulation settings. Furthermore, there is no guarantee for global optimality [
60]. In this paper, we propose a novel approach toward modeling the entities in the SC. The financial state of each agent is modeled by a system of discrete-time linear dynamical equations by considering the incoming order frequency of each entity, the daily production amount, short-term financing, and debt repayments to upstream entities. This model is implemented on two topologies: the first is a fully connected SC with six nodes and the second is a simple topology of the Iranian pharmaceutical SC.
The practical constraints in the SC are captured by introducing a set of inequalities. For the purpose of gaining profit for the SC members, a constrained finite-horizon (LQR) problem is formulated [
61]. We will show this problem can be transformed to quadratic programming with convex settings. Hence, the optimal global solution for managing the production level of entities and financial flow in the SC network is obtained in a centralized framework.
3. Model Description
Since SCF variables such as available working capital, inventory value, debt to other entities, and debt to the bank have memory (i.e., their future value is a function of their current value), modeling them by a dynamical system is appropriate. In addition, financial information and financial decisions are usually updated periodically. As a result, the state variables, control inputs, and SC network parameters listed in
Table 1,
Table 2 and
Table 3 are enough to represent each entity in the SC network using a set of discrete-time linear dynamical equations. The state variables are the variables sufficient to implement the total behavior of each entity and are recursively updated each day based on the previous state and the current input signals. The input signals are the decision variables that manipulate the corresponding states.
In
Table 1, state variables describe the amount of available working capital and profit value, inventory level, and debt value of each entity.
The production and payment value of each member and short-term financing received by each entity are considered input signals and are shown in
Table 2.
The external inputs necessary to model the physical and order flow among entities in the SC network are displayed in
Table 3.
The other parameters considered in the model and the entities within a SC network are presented in
Table 4.
Mathematical Formulation
In this section, the dynamical behavior of each SC entity’s working capital, inventory, and debt value is formulated with a discrete-time linear dynamical model.
Equation (1) calculates inventory value at the beginning of each day,
by considering the inventory value the day before,
and changes to the inventory value due to production and shipment of goods,
on the previous day.
Equation (2) calculates
i-th entity working capital at the beginning of day
k by reducing the fixed and variable costs, planned payments to the upstream entities, and the bank from the previous day working capital of entity
i.
Working capital is also increased by received short-term borrowing and payments from downstream businesses the day before. Short-term financing has a
T-day payment duration in the proposed paradigm. Hence, the value of the short-term financing is compounded for a
T-day period. Equations (3) and (4) keep track of the debt of entity
i to its upstream entities and the bank, respectively.
In Equation (5), the debt of each entity on the previous day is compounded daily, with new debts and repayments of the previous accounts taken into account to calculate i-th entity’s debt at the beginning of day k. Equation (6) has the same structure; only the compounding occurs to take the T-day payment term to the bank into account.
The model of working capital state can be modified to incorporate the profit value of each entity in the SC. Therefore, the profit value is obtained from Equation (2) by omitting the effects of received short-term financing.
The obtained profit equation until the
k-th day has resulted from the profit achieved until the prior day by reducing fixed and variable costs of each entity, payments to the upstream entities, and payments to the bank due to the received short-term financing. Received payments from downstream entities increase the profit value of each entity in the SC network. Recursively computing the profit at each day from the initial planning day yields the following equation for formulating the total profit value of each entity at each planning period.
The suggested model’s state and input variables must adhere to certain limitations. Equations (7)–(13) formulate these constraints:
Equation (7) shows that each entity must plan its production within a certain range to avoid high production fluctuation. The inventory level of an entity can never be negative, and it is also bounded Equation (8). Similarly, entities’ working capital, debt to upstream agents, short-term financing, and payment amounts take non-negative values and have upper bounds (Equations (9)–(13)). In addition, the indebtedness of an entity to the bank takes non-negative values because we do not consider risk-free investment of the nodes in our model. The upper bound for Equations (10)–(12) is considered the initial working capital of entities.
4. Solution Strategy
Consider the following linear time-invariant dynamical model:
where
and
are the state and input vectors, respectively, and the matrices
and
denote the state and input matrices, respectively, which operate on the present values of the state and input vectors to make the best linear estimation for the values of the state vector for the next period, as shown in Equation (14). Additionally, the pair
are controllable, which means the vector of input variables,
can steer a system in a given model, from any initial state vector
to any arbitrary state vector
infinite horizon [
62]. To optimally minimize the states and input of Equation (14), the following constrained finite-time LQR problem is formulated.
where
is the positive semi-definite state penalty matrix,
is the positive definite input penalty matrix, and
denotes the final state positive semi-definite penalty matrix [
63].
In order to implement the designer’s priorities toward reaching the optimal solution, these matrices should be chosen in diagonal form. The values of the diagonal elements demonstrate the relative weight that the optimizer considers in minimizing the total cost function. The state and input variables are assumed to be constrained to the closed set (
D) and have zero points in the interior. It is straightforward to show that the problem formulated in Equation (15) could yield to the convex QP with the globally optimal solution [
64].
Problem Formulation
This section seeks to determine the optimal financial flow and production level for the SC’s entities in order to maximize profit. As a result, we will illustrate how we may use Equation (14) to represent the complete SC network and formulate the appropriate constrained finite-time LQR to meet SC’s intended goal in the centralized view. According to this approach, part of the entity’s behavior is considered collectively in terms of joint input signals [
65]. Thus, the connected network in the SC obtains the optimal joint input variables and the optimal individual control variables for each entity. To model the SC with Equation (14), let us define
and
vectors as the augmented state and input vectors as follows:
in which the state variables
collect the state variables of entities at each day k as follows:
where
is the difference term between the profit value and the pre-defined profit level for
i-th entity
until the
k-th day. In addition,
collect the input signals of entities similar to the structure presented in Equation (18) as follows:
By considering collected state and input vectors as in Equation (16), the linear time-invariant dynamical model for the whole SC is obtained with an additional known parameter
as
, where
A and
B are the state and input matrices with sufficient dimensions, respectively, and
gathers the known time-varying and fixed parameters of the network. In practical financial SC problems, the amount of
is sufficiently small for all days, such that it cannot cause infeasibility [
66].
Now, by having the linear time-invariant model of the SC at hand, we will demonstrate how we can formulate the centralized constrained finite-time LQR for the SC network. Formulating the desired goals for each entity with the corresponding quadratic cost functions leads to Equation (15).
The cost function should implement the design goals of the SC. Therefore, to minimize the debt values, short-term financing received, and the inventory level among the SC members in the planning horizon, the following cost function is considered:
The terms
are positive weights corresponding to the goal of minimizing the debt of each entity to upstream members, the debt of each entity to the bank, the inventory level, and the amount of financing received, respectively. Furthermore, we must add an extra term
with relatively little amounts for
in comparison with other gains, only to make the corresponding
R matrix in Equation (15) be positive definite and ensure stability of the states as the following:
The resulted constrained finite-horizon LQR setting for
i-th entity is as follows:
Subject to:
Equations (1) and (3) to Equation (13).
where
is the stage cost that contains state and input variables of the model and is computed each day.
corresponds to the final state cost function to consider profitability at the end of the horizon for each entity and is defined as follows:
By considering state and input vectors as in Equation (16), the state penalty matrix
and the input matrix penalty
) result. Furthermore,
is equal to the final state penalty matrix in Equation (15), with “diag” denoting the diagonal matrix for corresponding cost coefficients. To acquire the input signals of entities in the SC network optimally using a centralized view, let us describe the network’s total cost function as an accumulation of the separately allocated costs for each entity as follows:
Subject to:
Equations (1) and (3) to Equation (13).
To solve the resulting optimization problem formulated for the centralized SC with batch approach, we define
as the accumulated state vector and the accumulated decision variables vector as
during the planning horizon (
N), respectively. The state and input penalty matrices are positive semi-definite and positive definite by construction, respectively. By writing dynamical Equations (1)–(5) that appeared in the constraints set of Equation (23) recursively into the form
and substituting it in the problem defined in Equation (18), it yields to the following general inequality constrained QP [
63]:
where
is the vector of initial state values and the fixed parameters related to the model and orders. Based on the facts provided in this section for the problem formulated with Equation (23), it implies that
is a positive definite matrix [
63]. If any
U satisfies the constraint set of problem Equation (24), the problem is feasible and the convexity yields the optimal solution.
5. Simulation
In order to demonstrate the validity of our proposed method, we consider two scenarios in which the SC members try to reach their pre-defined profit level at the end of the planning horizon. Topology 1 is a fully connected SC with six nodes and pre-defined values for parameters. Topology 2 is based on the Iranian pharmaceutical SC, with financial values extracted from the financial statement.
Among the several methodologies for solving quadratic programming, active-set methods and interior-point methods are the most frequently utilized. However, interior-point approaches are preferred due to their scalability when the number of constraints and decision variables is not small [
67].
The problem formulation and optimization modeling were carried out in the MATLAB optimization toolbox with its quadratic programming solver (quadprog), which uses the interior-point algorithm. In both cases, the solver converged to the optimal global solution after a few iterations since the matrix P was positive-definite which rendered the problem convex. The simulations were performed on a server with two processors, a 2.4 GHz CPU, and 32 GB of RAM.
5.1. Topology 1
A three-level SC network with two entities at each level is considered for the simulation. The SC topology is depicted in
Figure 1. Each entity in the SC is considered as a node, and the connections among nodes in which the downstream nodes only send orders to their upstream neighbors are demonstrated with arrows. The orders differ in amount (financial value of the order) and frequency.
Table 5 summarizes the amount and frequency of orders sent to the suppliers by each node in tuples of the form (amount, frequency).
Orders received (by Nodes 1 and 2) and sent (by Nodes 5 and 6) to the outside of the network by nodes in the output and input layer are described in
Table 6.
The initial inventory value and fixed-parameter values that are specific to each entity are presented in
Table 7. Entities plan their daily production to prepare for the complete delivery of downstream orders. To avoid inconsistency, upper and lower limits are defined for production. Fixed and variable costs are set to leave each entity a profit margin.
Parameter values that are common among all entities in the simulation are shown in
Table 8.
The results of the simulation rely heavily on the values of relative cost coefficients. The cost coefficients determine the priorities for reaching the optimal state of entities. Therefore, due to our scenario regarding maximizing the cost of bank financing due to banking constraints, the cost coefficient of bank financing is chosen to be relatively high in comparison with other cost coefficients.
Table 9 shows the simulated cost coefficient for each entity in the SC with Topology 1. The cost coefficients of the available working capital are considered the second priority to be minimized in the cost function. This is accomplished by maximizing the amount of
Ii. Additionally, we do not place a premium on minimizing the bank’s debt value in this simulation.
The value of the inventory value cost coefficient (gMi) is chosen relative to as it could minimize the inventory level of each entity. Besides, to satisfy the positive definiteness of the input penalty matrix in LQR, we assign small values (2) for production in the network, which makes no difference in the final expected response as it is relatively low in comparison with other coefficients related to our goal.
Because of the simulation constraint, the simulated debt value to the bank and bank financing results are presented in
Figure 2a, in which repetitive bank financing is avoided by entities due to the unnecessary cost of bank financing in comparison to profit cost. Moreover, various model assumptions, such as entity absolute profitability and the
T-day payment duration, have a role in this pattern. As shown in the results, the entities may seek bank funding and tolerate its high expenses in the early days in order to increase their working capital.
Figure 2b shows the planned production and inventory values. The behavior of inventory values follows the orders received by each entity, with the production optimally scheduled to minimize the inventory level in the planning horizon.
We fixed the lag period for payment of received orders,
to a constant value of 5 because we prefer to minimize indebtedness among entities.
Figure 2c depicts the working capital and profit behaviors among entities.
Since all entities run profitable operations, the working capital trajectory of entities plotted in
Figure 2c shows a general uptrend in most cases. This trend is more significant for those entities that receive higher order amounts. It is worth mentioning that the marginal profits of nodes are equal within each layer due to the same values of fixed and variable costs, as shown in
Table 7. According to Equation (2) and
Figure 2c, when the working capital of each node approaches zero, if the entity could not create liquidity by selling any goods from inventory, the short-term financing occurs to achieve business continuity.
5.2. Topology 2
This topology is related to the SC of the Iranian pharmaceutical industry. Topology 2 is created using the actual financial statements of companies in this SC (
Figure 3).
Tables of this topology, such as Topology 1, contain the following information.
Table 10 summarizes the amount and frequency of orders sent to the suppliers by each node in tuples of the form (amount, frequency).
Orders received (by Nodes 1 and 2) and sent (by Nodes 4 and 5) to outside the network by nodes in the output and input layer are described in
Table 11.
The initial inventory value and fixed-parameter values specific to each entity are presented in
Table 12. Fixed and variable costs are set to leave each entity a profit margin.
Parameter values that are common among all entities in the simulation are shown in
Table 13.
Table 14 shows the simulated cost coefficient for each entity in the SC with Topology 2.
The results of the simulation for Topology 2 are as follows.
The simulation results in Topology 2 in
Figure 4 are similar to the results in Topology 1. The results of Topology 2 in
Figure 4a show that each entity is initially more willing to take a loan from the bank and over time, by increasing profitability, their desire to receive a loan decreases and the required working capital is provided through profits. Moreover,
Figure 4b show the upward trend of the production value until it reaches its maximum value in all nodes. In addition, the results in
Figure 4c regarding the two variables of working capital and final profit show that with the passage of time and the activity of the company, these two variables have increased and the amount of short-term financing from the bank has decreased.
By simulating the aforementioned scenarios, each entity’s ideal production, funding by banks, and debt repayment to upstream businesses are scheduled. We arrive at a globally optimal solution for the aggregate profitability of all entities using the convex QP optimization model presented in Equation (24).
6. Conclusions
This paper proposes a constrained finite-time LQR approach for determining an entity’s optimal profit state in a SC. We focused on the optimization of the total profit in two different topologies of SC networks. Our model’s foundation presupposes a non-competitive environment in which the quantity and timing of orders are certain. The main issue in this framework is optimizing the total profit of the network by controlling external financing, internal financing, and inventory value. The problem is solved from a SC regulatory point of view, meaning the optimization is carried out to maximize profit of the entire SC.
Regarding the formulation of the model presented in this paper, variables are of two types. State variables are defined in the model to capture the dynamic behaviors of a system. Input variables are the means to control the model in our desired state, and they play the role of decision variables. Optimization is a relative concept in this framework. The chosen cost coefficients determine our priorities for reaching the optimal state of entities.
The results of simulated inventory and production values are shown in
Figure 2b and
Figure 4b. As is evident in both topology, the inventory value of entities presented in the first and last layers follows a clear repetitive pattern. This phenomenon happens because fixed order sizes are issued between every two entities in different layers throughout the simulation horizon. Although entities are allowed to postpone payment to upper entities by bearing the relative penalty cost, this behavior is not observed in the model due to the high priority of profit maximization and debt minimization.
The approach of dynamical systems in the solving of the model presented in this paper well answers the question of how the necessary changes on the control variables lead to determination of the appropriate values for the state variables and, ultimately, total profit maximization. According to the modeling and simulation results, which are clearly dependent on the entities’ characteristics (network parameters), it is inferred that proper control of production variables and internal and external financing causes an uninterrupted production flow by receiving and sending appropriate input and output flows. This leads to a lack of stop production and creates critical points in the supply chain. It manages the supply chain’s entities liquidity risk and maximizes total profit by using the potentials in the chain.
Based on the previous discussion, this helps policymakers who own the entire chain to plan for external financing and financial communication between members of the chain so that the production process is smooth and commensurate with the capacity of each entity and avoids the accumulation or a misplaced deficiency of production factors.
Since two issues, topology type and the characteristics of entities in the supply chain, have an important effect on the simulation results, sensitivity analysis on these cases can be the subject of future study in this topic.
Further development of this model can include considering demand uncertainty faced by first-layer SC members. Trade credit assumptions can also be added to this model in order to optimize the contracts regarding the payment delays of entities considering the whole SC. Risk-free investment opportunities and inventory costs could also be added to the model.