Four sections compose the conceptual classification. They are literature reviews (LRs), deterministic optimization (DO) models, uncertain optimization (UO) models, and game-theoretic (GT) models. Models of deterministic optimization are mathematical structures used to optimize solutions to well-defined problems with known variables and constraints. Uncertain optimization models, on the other hand, strive to discover optimal solutions while considering the variability in uncertain parameters. Game-theoretic models analyze strategic interactions between multiple decision-makers, considering their preferences, actions, and prospective outcomes to determine optimal strategies. The classification of references is shown in
Table 1.
2.1.1. Literature Reviews
Some literature review papers on CLSCs and RLs are discussed in this section. Ritola et al. [
10] and Amin et al. [
1] converge on the transformative potential of integrating sophisticated information systems and advanced operations research methodologies to enhance CLSC efficiency. Their findings are indicative of a broader acknowledgment within the literature of the need for robust methodological approaches that can capture the complex dynamics of CLSCs. This is complemented by the work of Simonetto et al. [
12], which echoes this sentiment by highlighting the role of Industry 4.0 technologies in transforming risks into opportunities within CLSCs. The insights provided by these authors do not stand in isolation but interlock to form a narrative that champions a progressive shift in the CLSC paradigm—from traditional, linear models to dynamic, circular, and technologically empowered systems.
In parallel, the push towards a circular economy, as scrutinized by MahmoumGonbadi et al. [
11], reveals a concerted effort to transcend the prevailing monetary-centric performance measures. Their critical analysis indicates that current CLSC models may not fully encapsulate the principles of circularity, thereby advocating for a realignment of design strategies to embody both economic and environmental imperatives. Supporting this perspective, Gunasekara et al. [
13] focus on the practicalities of the acquisition, sorting and disposition of used products within CLSCs. They highlight the necessity of efficient return forecasting and judicious channel selection as essential to upholding the circular economy ethos. Complementing these operational insights, Peng et al. [
9] delve deeper into the inherent uncertainties of CLSCs, such as those in the acquisition and market stages, underscoring the complexities they introduce to managing returns and optimizing processes. Complementing these operational insights, Peng et al. [
9] extend the discourse on uncertainty within CLSCs, analyzing uncertainties across various stages, including the acquisition and market stages, which align with and further complicate the concerns of forecasting returns, optimizing acquisition efforts and selecting appropriate return channels.
A thematic analysis has revealed that game-theoretic models, examined by De Giovanni and Zaccour [
5] and Shekarian [
3], serve as a cornerstone for understanding stakeholder interactions within CLSCs. The focus on return functions, product recovery and remanufacturing strategies underscores the significance of strategic decision-making in achieving sustainable and efficient CLSC operations. Furthermore, investigation into contracts and coordination mechanisms points to a need for a holistic perspective that recognizes the multifaceted nature of stakeholder dynamics within CLSCs.
In summary, these reviews highlight key gaps in CLSC research and unify calls for a holistic, technologically advanced, and circular approach. Such integration is essential for future explorations, particularly regarding cost diversity, inventory management and return quality, to further the field.
2.1.2. Deterministic Optimization Models
This section provides a broad collection of research that uses deterministic mathematical models for CLSCs with predetermined parameters to improve different aspects of supply chains, facility layouts and network architectures in the field of deterministic optimization models. Collectively, these studies demonstrate the effectiveness of optimization strategies in promoting sustainability and efficiency in a variety of scenarios.
Allehashemi et al. [
19] focused on a dynamic cellphone network situation in Ontario, Canada. This research improved the facility layout within a CLSCN using an MILP model, intending to reduce overall expenses. The model’s multi-objective formulation considers factors like CO
2 emissions and quality. Another noteworthy inclusion is the utilization of the fuzzy Quality Function Deployment (QFD) method for managing qualitative elements. The findings show that objective function weights have a considerable influence on facility selection and product flows among them. Valderrama et al. [
15] shifted their focus onto a significant issue in the mining industry: reducing Greenhouse gas (GHG) emissions across the supply chain (SC). Their study presents a multi-product, multi-echelon, multi-period environmental mining supply chain network design (SCND) model. This technique, based on an Emissions Trading Scheme (ETS), aims to reduce GHG emissions while optimizing investment, transportation, operating expenses and carbon credits. Valderrama’s study gives insights into how certain configurations influence costs and emissions by including ore grade concerns into the SC design, highlighting the usefulness of ETS in decreasing both economic and environmental consequences across the mining SC.
Research by Tirkolaee et al. [
7] unveils an MILP model for a sustainable mask CLSC network, where sustainable development is investigated in terms of concurrently reducing total costs, total pollutants and total human risk at the same time. Salehi-Amiri et al. [
16] developed an MILP model to construct a cost-effective CLSC network for the walnut industry. The suggested model is validated and improved using exact, metaheuristic and hybrid approaches. The findings demonstrate the importance of changing inventory holding costs, the impact of transportation costs on opening costs and the linear influence of demand on supply chain expenses. Salehi-Amiri et al. [
18] designed a closed-loop supply chain (CLSC) for the avocado industry, incorporating avocado seed recycling and compost utilization. That paper shows that the demand’s effect on the network strongly affects both cost and employment efficiency objectives through a real-world case study in Puebla, Mexico, using GAMS software and sensitivity analysis.
Santander et al. [
14] addressed plastic waste management for open-source 3D printing technology using distributed plastic recycling. An MILP model is employed to assess the economic and environmental sustainability of this dispersed recycling network. The model’s effectiveness was shown using a case study from a university. Ahmed et al. [
25] introduced a CLSC network for the tire industry, and the model was applied to the Greater Toronto Area in Canada. They addressed both economic and environmental objectives. That paper innovatively incorporates a multi-criteria decision-making method, integrating spherical fuzzy logic to determine supplier weighting factors based on qualitative criteria.
In the realm of green closed-loop supply chain networks (GCLSCNs) during the pandemic, Abbasi et al. [
17] demonstrated a model that navigates the trade-offs between cost and CO2 emissions, focusing on the Iranian automotive industry. Their research highlights the possibility of maintaining supply chain sustainability despite increased operational costs from enhanced hygiene practices, underscoring the resilience and adaptability of these systems during global disruptions.
2.1.3. Uncertain Optimization Models
Uncertainties can arise from a variety of factors, such as shifts in demand, technological advances, disruptions and even lockouts, making supply chains susceptible to unforeseen events. These risks are frequently perceived as both expected and unexpected occurrences, highlighting the dynamic and complex nature of supply chain management [
43]. Readers are encouraged to refer to the work of Peng et al. [
9] for a comprehensive understanding of the various uncertainty factors affecting CLSCs. They conducted an extensive review of 302 papers, providing insights into the causes of uncertainties at various stages of CLSCs. The sources of uncertainty in the reviewed CLSC papers of this article are shown in
Table 2 and
Figure 2.
Facility location models have recently integrated Sustainable Environmental Strategies (SESs) aimed at minimizing a company’s environmental footprint and cost simultaneously [
28]. Ruiz-Torres et al. [
36] contributed to the literature on CLSCs by presenting a unique model that accounts for numerous suppliers and return sources in a remanufacturing system. In contrast to earlier research, it uses a nonlinear function to simulate return behaviour based on incentives while accounting for uncertainty. In addition, that article investigates a mix of decisions in both forward and reverse flows, emphasizing the importance of supplier portfolio selection and returner incentive methods in improving cost-efficient closed-loop supply chains. Additionally, Wang et al. [
33] channeled their efforts into managing hazardous household waste through an RL network that includes collection, treatment, processing and disposal facilities. Multi-objective deterministic and stochastic mathematical models are introduced to optimize facility selection, route planning and waste allocation, aiming to minimize transportation costs and dangers and maximize convenience and participation. With a case study centered around paint waste in the City of Toronto, these models incorporate stochastic parameters such as paint waste generation, recycling rates and diversion rates.
In terms of CLSC models in the mining industry, the study by Akbari-Kasgari et al. [
8] is a step up from the study by Valderrama et al. [
15] on iron ore, which took a deterministic approach and focused on reducing costs and greenhouse gas emissions without looking at uncertainty or the social side of sustainable development. The model of Akbari-Kasgari et al. [
8] includes uncertain parameters and attempts to maximize supply chain profit, reduce water consumption and air pollution, and promote equitable activity allocation in a variety of socioeconomic regions. That study compares two versions of the model (one with and one without backup suppliers) and finds that including backup providers improves supply chain responsiveness and socioeconomic performance while increasing negative environmental externalities. In this vein of advancing CLSCs under uncertain conditions, Xu et al. [
44] develop a two-stage stochastic model that tackles the volatility of market demand and carbon pricing within a structured carbon trading scheme. This approach, situated within an eco-friendly CLSC context, offers a flexible and strategic methodology that can be adapted by various industries beyond the aluminum sector, including stainless steel manufacturing and plastic production. The model’s robustness in addressing cost and emission management under fluctuating market conditions, as evidenced by the aluminum case simulation results, serves as a guidepost for industries aiming to meet their emission targets amidst uncertainty.
Tosarkani and Amin [
28] created a robust, adaptable stochastic model for constructing wastewater treatment networks in hydraulic fracturing operations where costs, demand and resource capacity are uncertain. To address environmental concerns, that paper proposed a bi-objective optimization model that considers both total cost and CO
2 emissions. Its application in Alberta, Canada, was displayed. Similarly, Fathollahi-Fard et al. [
27] developed a socially and environmentally responsible water supply network utilizing the Social Engineering Optimizer, a specific optimization technique. The authors claim that this is the first study in the literature to construct a wastewater collection system under uncertain conditions. The study’s application tackles real-world water scarcity challenges, namely those in Iran’s Urmia Lake.
Research by Khorshidvand et al. [
34] revealed a unique hybrid approach, including pricing, greening and advertising options. That study discovered the best levels of advertising and greening decisions to guarantee the chain’s profitability. That paper navigated uncertainties and achieved improved outcomes by including a robust scenario-based stochastic programming model, while a Lagrangian relaxation technique enables the effective resolution of large-scale examples. Kchaou-Boujelben [
40] created a two-stage stochastic programming model with an unknown return quantity/quality and investigated the implications of return changes on the network’s performance and structure. They studied the trade-off between profit maximization and CO
2 emission minimization objectives. They compared their metaheuristic approach to tackling the problem with the
ε-constraint technique. Numerical tests show that network setup and performance are sensitive to differences in return quality and quantity, particularly when return processing penalties are significant.
Khalili Nasr et al. [
32] presented a two-stage approach for building a SCLSC using the fuzzy best–worst technique and multi-objective mixed-integer linear programming. They did this by integrating economic, environmental, social and circular aspects into supplier selection and order allocation. A case study in the garment manufacturing and distribution industry supports the method, which aims to reduce network costs, environmental impacts and missed sales while enhancing employment opportunities and long-term supplier purchasing. Meanwhile, Shekarian et al. [
26] offered a unique mixed-integer linear optimization model for a soybean supply chain network composed of producers, agricultural facilities, distributors and customers. Their approach optimizes profit under uncertain parameters, utilizing a pioneering possibilistic technique. The model was expanded to a bi-objective formulation to account for organic practices, with a case study in Ontario, Canada. It was highlighted that supply chain management approaches may successfully boost consumer satisfaction while lowering costs in food supply chains. For an investigation of the relevant literature on food supply chain management and uncertainty, readers can refer to Shekarian et al. [
26] as well as Alinezhad et al. [
38], since they offer valuable insights into addressing uncertainties and promoting sustainability in the food supply chain domain. Alinezhad et al. [
38] contributed to the field by configuring a sustainable closed-loop supply chain network under uncertain return rate and demand conditions using fuzzy theory, which was validated through a case study in the dairy industry.
2.1.4. Game-Theoretic Models
This section investigates game-theoretic models for CLSCs. Game theoretic models have emerged as important tools for examining complex interactions and decision-making processes across a wide range of fields. In game-theoretic models that explore the interactions and decisions of multiple stakeholders within the framework of a supply chain, strategic decision-making analysis is applied. Researchers have employed game theory to disentangle the complex dynamics of supply chain management, providing possibilities for more sustainable practices and improved supply chain performance.
Zhou et al. [
24] developed an equilibrium model for CLSC networks under various remanufacturing approaches while incorporating green factors and allowing decision-makers to choose between in-house and authorized remanufacturing approaches with factors like carbon trading and consumer preferences. Kharaji Manouchehrabadi and Yaghoubi [
49] investigated a solar cell supply chain with a closed loop that would recover old solar panels and cause less damage to the environment. Their model is based on dye-sensitized and perovskite solar cells, and the supply chain is made up of a seller, a 3PL provider, and a manufacturer. In the experts’ Stackelberg game model of this paper, the 3PL is a follower, while the provider and assembler are chain leaders. The effectiveness of government incentives in promoting solar cell returns was examined. The results indicated that government action significantly improved the situation. Meanwhile, Chai et al. [
51] focused on the Electric Vehicle (EV) industry in China, where EV batteries are replaced when their capacity decays to about 80%, generating a significant number of retired batteries. They formulated a Stackleberg game model that consisted of an upstream supplier and a downstream manufacturer. A three-stage model under three different investment schemes was developed. They concluded that process innovation techniques affecting green product remanufacturing may successfully increase remanufacturing performance while raising the manufacturer’s recovery rate.
Notably, Fander and Yaghoubi [
53] introduced a novel stochastic game model for a closed-loop automotive supply chain, incorporating both static and dynamic fuel considerations. Compared with the authors’ previous work [
6] that focused on low-consumption cars, this study offers a more comprehensive analysis, emphasizing the dynamic approach’s effectiveness in decision-making. The authors significantly extended their previous models on automotive supply chains by introducing a unique stochastic game model, including optimum capacity allocation, cooperative mechanisms for worn-out car collection and the impact of governmental interventions on fuel-efficient technology. Similarly, Lee [
47] investigated sustainable strategies in a CLSC, featuring a manufacturer, a retailer and a collector. That paper considers scenarios where the manufacturer and the retailer drive innovation separately or collaboratively, and they examined six different game models using pairings of green innovation strategies and market leadership responsibilities. In-depth game models and analytical solutions revealed the optimal tactics for coordinating green innovation efforts among supply chain participants to achieve a win–win conclusion.
This collection of research also includes studies addressing real-world complexities, such as a study by Hosseini-Motlagh et al. [
46], who offer a circular economy-based closed-loop system that incorporates sustainability issues by focusing on a real-world pharmaceutical scenario. The authors introduced an analytical coordination model to manage conflicts resulting from competitive dynamics between a manufacturer and two retailers. A Nash-bargaining game model and a profit-sharing contract were applied to assure the coordination strategy’s equitable operation. On a larger scale, Luo et al. [
54] addressed the complex relationship between carbon tax policy, manufacturing activities and remanufacturing decisions within closed-loop systems. They developed four game-theoretic models to analyze the impact of the carbon tax on decision-making in both centralized and decentralized contexts. With the evaluation of the impact of three collection strategies (no collection, partial collection and full collection), it was shown that a carbon tax might encourage investment in carbon reduction or remanufactured products to lower carbon emissions.
The literature also includes dual-channel supply chain dynamics, as demonstrated by work by Pal and Sana [
55] and Mondal et al. [
50]. A study by Pal and Sana [
55] digs into the intricacies of a dual-channel supply chain model for eco-friendly goods. The optimal price, rewards for returned products and levels of green innovation are examined using a variety of mathematical models for both centralized and decentralized situations while accounting for competitive channel dynamics. The findings indicate that coordinated decisions regarding green innovation have a positive impact on customers’ propensity to return products, and they provide evidence that strategic decision-making and consideration of consumer goodwill may improve market performance. Mondal et al. [
50], on the other hand, dig into pricing and greening tactics by dissecting dual-channel supply chain dynamics using several Stackelberg and Nash game models. Situations such as centralized, manufacturer-led decentralized, retailer-led decentralized and Nash games were investigated for pricing and greening strategies. It was found that the centralized method leads to higher retail prices, but the retailer-led decentralized policy yields the highest supply chain profit.
In contrast with earlier papers that have focused on return policies and environmental aspects, Quan et al. [
52] presented a novel perspective by addressing the interplay between trade-in services and direct sales in a two-period CLSC game involving a manufacturer and a retailer. That study explores two scenarios: one that is manufacturer-operated (Scenario M) and another featuring retailer-outsourced trade-in services (Scenario R). The authors calculated the relevant Stackelberg equilibrium for each scenario. They studied both scenarios as leader–follower games, looking into decisions regarding rebate rates as well as wholesale and retail prices. Genc and De Giovanni [
45] further enrich this framework with the integration of innovation-led lean programs within CLSCs, uncovering that strategic components coupled with process innovation significantly bolster supply chain performance. They posit a novel finding that consumers positively respond to environmentally conscious practices and enhanced operational responsiveness, incentivizing manufacturers and suppliers to adopt strategic lean approaches over purely process innovation-centric approaches. Additionally, the study indicates that centralized systems, free from the constraints of double marginalization, show a clear preference for strategic lean programs, underlining their effectiveness in improving both sustainability and profitability within CLSC frameworks. Zhao et al. [
48] offered another angle, focusing on the importance of component reuse. Their CLSC includes a producer, a supplier of new components and a supplier of recycled components. The results demonstrated that product characteristics, particularly for items with low price elasticity, have an impact on this strategy’s effectiveness.