1. Introduction
Because remanufacturing used products both reduces costs and benefits the environment, many manufacturers, such as HP, Lenovo, Apple, and Xerox, have adopted remanufacturing closed-loop supply chain (CLSC) strategies [1,2,3]. In modern supply chains, used products are widely dispersed, which creates uncertainty in both the quantity and the quality of returns [4,5,6]. As a result, how the manufacturer should collect used products from customers under such uncertainty is an essential problem [7].
In traditional operations management, decision-makers are commonly assumed to be rational, which means that they care only about their own payoffs and not about other players' payoffs. However, a growing body of research argues that supply chain players may have fairness concerns, especially when the supply chain has one leader and several followers [8,9,10,11]. The Stackelberg leader typically holds distributive power over the followers, which can make the followers feel unfairly treated. Many studies have addressed operations management problems in supply chains with fairness concerns. Fehr and Schmidt [8] and Cui et al. [9] studied how fairness may affect the equilibrium of the supply chain and how to coordinate such a supply chain. Loch and Wu [10] designed an experiment to study the influence of social preferences on supply chain decisions. This paper deals with the used-product collecting problem in a closed-loop supply chain in which the retailer is fairness-concerned.
Over the past two decades, a growing number of scholars have focused on used-product return and remanufacturing because of the importance of remanufacturing. Atasu et al. [12], Govindan et al. [13], and Govindan and Soleimani [14] summarized the recent achievements and possible directions in this area. Our paper is related to three streams of literature: reverse channel management, dynamic return problems, and operations management with fairness concerns.
Reverse channel choice is an essential problem in CLSC operations management. Savaskan et al. [15] made the first attempt to formulate the reverse channel choice problem in the CLSC using game-theoretic models. They formulated three typical reverse channels: manufacturer collection, retailer collection, and third-party collection. Their results showed that the retailer collection channel might be the best reverse channel for the CLSC system. Savaskan and Van Wassenhove [2] further studied the reverse channel choice problem in the presence of competing retailers. Huang et al. [16] studied the product return problem in a CLSC in which the third party and the retailer collect used products simultaneously. De Giovanni and Zaccour [17] investigated the manufacturer's optimal outsourcing of used-product collection in a CLSC, where either a third-party firm or the retailer can engage in the collection activities. These papers mainly focused on the reverse channel choice problem in a CLSC with only one supply chain member involved in the return activities. Some papers have looked at the sharing of responsibility for used-product collection among CLSC members, such as Jacobs and Subramanian [18], Subramanian et al. [19], Jena and Sarmah [20], and Ma et al. [21]. These papers focused on how to improve used-product return efficiency through cost sharing or cooperation among the CLSC members.
The above papers mainly adopt static used-product return models, which ignore the dynamic characteristics of the collecting process. Some researchers have begun to investigate the effect of dynamic characteristics on used-product collecting in CLSCs. Guide and Wassenhove [22] discussed the used-product return problem taking quality uncertainty into consideration. Nakashima et al. [23] explored dynamic control decisions in remanufacturing systems. Fallah et al. [24] investigated the product return problem with two competing CLSCs in the presence of uncertainty. These papers focused on the uncertainty of quality or timing in the used-product return process.
Regarding the dynamics of the collection process, Huang et al. [6], De Giovanni and Zaccour [25], and De Giovanni et al. [26] developed differential game models to investigate dynamic used-product collection control problems in different CLSCs. Huang et al. [6] considered stochastic disturbances in the return process, formulated the corresponding stochastic differential game, and derived the equilibrium return control strategy when the manufacturer collects in the CLSC. De Giovanni and Zaccour [25] adopted a differential game model to investigate the used-product return problem in a CLSC and designed a cost- and revenue-sharing contract for the CLSC. To better motivate the CLSC members to invest in product collection activities, De Giovanni et al. [26] studied incentive mechanisms in the CLSC using a differential game model. Also using a differential game model, our paper focuses on the used-product return problem in a CLSC with retailer collecting and a fairness-concerned retailer.
Our study is also related to research on fairness concerns in supply chain management. Cui et al. [9] studied how to coordinate a dyadic supply chain in the presence of fairness concerns, and they showed that the traditional wholesale price can coordinate the supply chain when the supply chain members are fairness-concerned. Caliskan-Demirag et al. [27] extended the model of Cui et al. to nonlinear demand functions. Du et al. [28] investigated the newsvendor problem in a dyadic supply chain where both the manufacturer and the retailer were concerned with fairness. These studies mainly focused on distributional fairness concerns in the supply chain; peer-induced fairness concerns are also receiving attention. Ho and Su [11] were the first to consider distributional fairness and peer-induced fairness in the ultimatum game model. Ho et al. [29] incorporated their model into the supply chain setting and discussed the contract design problem when the two retailers are concerned with peer-induced fairness. Nie and Du [30] further considered quantity discount contracts with peer-induced fairness and distributional fairness in the supply chain. Shu et al. [31] adopted a static model and considered the pricing and collection problem in a CLSC in which the collectors are concerned with both distributional fairness and peer-induced fairness. Our study incorporates the concept of fairness into the retailer-collecting closed-loop supply chain and investigates how the presence of fairness concerns affects the equilibrium strategies and profitability of the supply chain members in a stochastic model setting. Xiao and Huang [32] also considered the stochastic collection problem in a CLSC, but they mainly focused on the third-party collection channel, whereas we focus on the retailer collection channel.
In this paper, we consider a closed-loop supply chain with one manufacturer and one retailer, where the manufacturer sells new products and simultaneously collects used products through the retailer. The manufacturer is the Stackelberg leader in the channel, and the retailer is the follower and is assumed to be concerned about distributional unfairness. We consider two types of retailer with different fairness preferences, i.e., the gap fairness concern retailer and the self-due fairness concern retailer. The unfairness felt by the gap fairness concern retailer comes from the profit gap between the retailer and the manufacturer, a formulation that is widely used in the literature, such as Nie and Du [30], Li et al. [33], and Li and Li [34]. The unfairness felt by the self-due fairness concern retailer comes from the difference between the profit the retailer actually receives and the profit the retailer considers he deserves. Following Du et al. [28], we take the Nash bargaining profit as the self-due profit for the retailer. Therefore, the main difference between the gap fairness concern and the self-due fairness concern is the fairness reference point. The gap fairness retailer takes the leader's profit as the fairness reference point, while the self-due fairness retailer takes his Nash bargaining profit as the fairness reference point.
Our main results are as follows. First, we investigate the Markov equilibrium for the scenarios with a retailer who has no fairness concern, a gap fairness concern retailer, and a self-due fairness concern retailer. We find that a feedback Markov equilibrium exists only under a specific condition for a given closed-loop supply chain system, and that the expected return rate approaches a stable state regardless of the type of fairness concern. Second, we compare the expected equilibrium results for the supply chain members in the different scenarios. We find that the retailer's fairness concern affects neither the equilibrium strategies of the retailer nor those of the manufacturer. The self-due type of fairness concern is a more reasonable way for the retailer to express its concern about fairness and is more acceptable for the manufacturer when considering its profit shifting to the retailer. Third, we design a hybrid coordinating contract for the manufacturer to coordinate with the retailer.
The remainder of this paper is organized as follows. Section 2 presents our modeling framework. Section 3 derives the feedback equilibrium of the stochastic differential game when the retailer is fairness neutral. Section 4 presents the equilibrium analysis for the stochastic differential game models with a fairness-concerned retailer: Section 4.1 analyzes the gap fairness concern retailer and Section 4.2 the self-due fairness concern retailer. Section 5 conducts a numerical analysis to compare the gap fairness model with the self-due fairness model. Section 6 designs a hybrid contract for the closed-loop supply chain. Section 7 concludes the paper.
  2. Problem Formulation and Model Setup
Consider a closed-loop supply chain system, consisting of one manufacturer and one retailer. The manufacturer distributes its new products and collects the used products through the retailer. The unit production cost for the manufacturer is 
 when only the raw material is used to make the new product. The manufacturer also makes use of the used products to make the new product, with a unit production cost 
. It is reasonable to assume 
, which means that remanufacturing reduces the manufacturer's production cost. This assumption can also be found in Savaskan and Van Wassenhove [2], Huang et al. [6], and Savaskan et al. [15]. Denote 
 as the unit cost savings from remanufacturing a used product. We assume that products made from used products are identical to those made from raw materials. Thus, the case in which products made from used products are differentiated from those made from raw materials is beyond the scope of this paper. The notation is summarized in Table 1.
The planning horizon of the CLSC members is infinite, i.e., .  is the return rate at time , which represents the percentage of products that are made from used products rather than raw materials. Denote  as the collection effort level of the retailer at time , which indicates the effort that the retailer invests in collecting activities, such as collection advertising and maintenance of collection facilities. The cost function of the retailer for investing in collecting activities is assumed to be , where  is a scaling parameter that represents the retailer's cost coefficient for collecting used products.
Following the setting in Huang et al. [6] and Xiao and Huang [32], we formulate the return rate by the Itô equation as
      
      where 
 is the effect of collection efforts on the return rate; 
 measures the decaying rate of the return rate. 
 is the initial return rate of the CLSC system. 
 is a variance term and 
 is a standard Wiener process. Equation (1) captures the dynamics and stochastic disturbance in the used-product return process.
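For orientation, a state equation consistent with the parameter descriptions above could take the following form. This is a sketch only: the symbols $\tau$, $A$, $\alpha$, $\delta$, $\tau_0$, $\sigma(\cdot)$ and $z$ are assumed names chosen for illustration, and the linear drift is an assumption inferred from the verbal description rather than a transcription of Equation (1).

```latex
% Assumed notation (illustrative, not taken from the source):
%   \tau(t)    return rate at time t
%   A(t)       collection effort of the retailer at time t
%   \alpha     effect of collection effort on the return rate
%   \delta     decay rate of the return rate
%   \tau_0     initial return rate
%   \sigma(.)  variance term,  z(t) standard Wiener process
\begin{equation*}
  d\tau(t) = \bigl[\alpha A(t) - \delta\,\tau(t)\bigr]\,dt
           + \sigma\bigl(\tau(t)\bigr)\,dz(t),
  \qquad \tau(0) = \tau_0 .
\end{equation*}
```

In a form like this, the drift pulls the return rate up with effort and down with decay, while the state-dependent variance term scales the random shocks.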
To ensure that the return rate satisfies 
, 
 and 
 should be continuous functions. Similar to Huang et al. [6], we adopt 
 for the sake of mathematical simplicity. It can be verified that 
 when 
. Thus, the requirement on the return rate 
 can be met.
The manufacturer announces the wholesale price  and distributes new products to the retailer. The retailer then sets the retail price  for selling new products to consumers and chooses its collecting effort  for collecting used products from consumers, which it transfers to the manufacturer for remanufacturing. For every unit of used product collected, the retailer receives a subsidy .
The demand of the retailer at time 
 is denoted by 
. We adopt a standard linear demand function [6,14], which is given by
      
      where 
 represents the market potential of the product, and 
 defines the elasticity of demand with respect to price.
The discount rate for the supply chain members is denoted as 
. 
 is the profit rate of player 
 at time 
, where 
 and 
 represent the manufacturer and the retailer, respectively.
      
Denote 
 as the objective function for player 
 under the model 
, where 
 and 
 represent the manufacturer and the retailer, respectively. 
 represents the benchmark model with no fairness concern retailer, 
 represents the model with gap fairness concern retailer, 
 represents the model with self-due fairness concern retailer, 
 represents the centralized supply chain decision model, and 
 represents the coordinated supply chain model. We deal with these models in the following sections. Section 3 investigates the model with no fairness concern, which serves as the benchmark. Section 4 discusses the models with a fairness-concerned retailer: Section 4.1 covers the gap fairness concern retailer and Section 4.2 the self-due fairness concern retailer. Section 6 designs a coordinating contract for the manufacturer to coordinate the supply chain.
  3. Benchmark: NF Model-No Fairness Concern
In this section, we consider retailer collecting in the closed-loop supply chain under stochastic disturbance when the retailer has no fairness concern. The objective function of the manufacturer is formulated as
      
The objective function of the retailer is formulated as
      
The supply chain members seek to maximize their expected discounted profit stream subject to the system dynamics in Equation (1).
  3.1. The Feedback Equilibrium Strategies
Denoting 
 as the value functions of the supply chain members, we formulate the Hamilton–Jacobi–Bellman (HJB) equation for the retailer as
        
The best response of the retailer is obtained from the first-order conditions as
        
The retail price is decreasing in the return rate, which means that the retail price goes down when the return rate goes up. The collecting effort depends on the marginal value of the return rate to the retailer. The HJB equation of the manufacturer is formulated as
        
Substituting the best response of the retailer into the value function of the manufacturer, the optimal wholesale price control strategy is calculated as
        
Thus, the equilibrium control strategy of the retailer is derived as
        
Inserting the equilibrium control strategies into the HJB equations yields
        
As the value functions are quadratic in terms of the return rate after substituting the equilibrium control strategies into the value functions, we conjecture the value functions of the supply chain members as 
 and 
. Then we have 
, 
, 
, 
. Substituting the value functions and their derivatives back into the HJB equations yields the coefficient equations to be solved,
        
Denote , the coefficients are calculated by
Proposition 1 characterizes the equilibrium strategies when the retailer has no fairness concern.
Proposition 1.  When , the NF model has a unique feedback Stackelberg Markov equilibrium. The equilibrium wholesale price is calculated by
The equilibrium retail price is
The equilibrium collecting control strategy is
Proof.  The equilibrium collection control strategy 
 should be positive, i.e.,
        
If ,  would be negative when  or  takes small positive values. Since the equilibrium control strategy  must be positive,  is required to be positive, which means 
The equation regarding 
 is
		
Solving the equation yields,
        
Therefore,
(a) When 
, from 
 we can infer that 
, which is 
. This is counterintuitive, as the collection cost coefficient should not be very small (Savaskan and Van Wassenhove [2] and Savaskan et al. [15]).
(b) When , from  we can infer that , which is . This is consistent with reality: the collection cost coefficient cannot be very small, since otherwise the collecting firm would collect all the used products.
Consequently, we rule out the larger root by assuming . When , we can verify that , and ,  □
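The coefficient-matching step behind this proof can be sketched generically. The block below assumes a quadratic conjecture with hypothetical coefficient names $q_i$, $r_i$, $s_i$ and writes the return rate as $\tau$; these names are illustrative and may differ from the parameterization used in the paper.

```latex
% Hypothetical coefficient names q_i, r_i, s_i; i = M (manufacturer), R (retailer).
\begin{align*}
  V_i(\tau)   &= q_i\,\tau^{2} + r_i\,\tau + s_i, \\
  V_i'(\tau)  &= 2\,q_i\,\tau + r_i, \\
  V_i''(\tau) &= 2\,q_i .
\end{align*}
% Substituting V_i, V_i' and V_i'' into each HJB equation gives a polynomial
% in \tau that must vanish for every admissible \tau. Setting the coefficients
% of \tau^2, \tau and the constant term to zero yields three algebraic
% equations per player in the unknowns (q_i, r_i, s_i).
```

Under such a conjecture, the equation for the highest-order coefficient is typically quadratic, so two candidate roots arise and one of them has to be ruled out, as is done in the proof above.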
 The equilibrium wholesale price in Proposition 1 can be rewritten as
        
It is evident that the manufacturer raises its wholesale price in line with the transfer subsidy . Thus, on the one hand, the manufacturer gives a collecting subsidy to the retailer; on the other hand, it raises the wholesale price by , which equals the subsidy. The net unit profit for the retailer is , which is independent of the transfer subsidy. As a result, the transfer subsidy has no impact on the profits of the retailer and the manufacturer.
  3.2. The Evolutionary Path of the Return Rate
We derive the evolutionary path of the return rate under the equilibrium control strategies in this subsection. Inserting the equilibrium collecting strategy into Equation (1),
        
Since 
, we have 
. Denote 
, and let 
, thus
        
Using the stochastic integral equation and taking the expectation,
        
The above can be seen as an ordinary differential equation in 
 with 
. Solving the equation, we have the following result,
        
We assume that  to ensure that the long-run return rate is smaller than 1. Proposition 2 characterizes the expected evolutionary path of the stochastic return rate.
Proposition 2.  In model NF, the expected evolutionary path of return rate is calculated by,
        
 The long-run stable expected return rate can be calculated as,
        
  The return rate itself is not stable because of the stochastic disturbance. However, a stable long-run expected return rate exists for a given closed-loop supply chain system. The long-run expected return rate is unique, which means that the system converges in expectation to a specific state. From the expected evolutionary path of the return rate, we have:
When , ; when , . The expected return rate may increase or decrease over time, depending on the initial return rate of the system. The long-run stable state is the best for the system, even when the initial return rate is above it. That is to say, maintaining a high initial return rate is not as good as letting it fall to the long-run stable state.
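The expectation argument of this subsection can be written out under a generic linear closed-loop drift. The constants $a$ and $b > 0$ below are hypothetical lumped parameters standing in for the drift obtained after inserting the equilibrium collecting strategy, and $\tau$, $\tau_0$, $\sigma(\cdot)$, $z$ are assumed names for the return rate, its initial value, the variance term and the Wiener process; none of these is taken from the paper's notation.

```latex
% Assumed closed-loop dynamics with hypothetical constants a and b > 0:
%   d\tau(t) = (a - b\,\tau(t))\,dt + \sigma(\tau(t))\,dz(t),  \tau(0) = \tau_0 .
\begin{align*}
  \tau(t) &= \tau_0 + \int_0^t \bigl(a - b\,\tau(s)\bigr)\,ds
             + \int_0^t \sigma\bigl(\tau(s)\bigr)\,dz(s), \\
  m(t) := \mathbb{E}[\tau(t)] &= \tau_0 + \int_0^t \bigl(a - b\,m(s)\bigr)\,ds
  \quad\text{(the It\^o integral has zero mean)}, \\
  m'(t) &= a - b\,m(t), \qquad m(0) = \tau_0, \\
  m(t)  &= \frac{a}{b} + \Bigl(\tau_0 - \frac{a}{b}\Bigr) e^{-b t}
  \;\xrightarrow[t \to \infty]{}\; \frac{a}{b}.
\end{align*}
```

In this sketch the expected path rises monotonically when $\tau_0 < a/b$ and falls monotonically when $\tau_0 > a/b$, which matches the two cases described above.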
  3.3. The Numerical Analysis with No Fairness Concern
In this subsection, we conduct a numerical analysis of the NF model to illustrate our theoretical results. The system parameters are set as 
. We utilize the following equation to approximate the system dynamics under equilibrium collecting control strategy,
        
        where 
 are independent and identically distributed (i.i.d.) standard normal random variables. The time step 
 is set to 0.01.
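A discretization of this kind can be sketched as an Euler–Maruyama scheme. The code below is a minimal illustration under stated assumptions, not the authors' implementation: the closed-loop drift is taken to be linear with hypothetical lumped constants `a` and `b`, the variance term is assumed to be `sigma * sqrt(tau)`, the parameter values in the usage note are made up, and clipping the state to [0, 1] is an added safeguard.

```python
import math
import random


def simulate_return_rate(tau0, a, b, sigma, dt=0.01, steps=1000, seed=0):
    """Euler-Maruyama path of the assumed closed-loop dynamics
    d tau = (a - b * tau) dt + sigma * sqrt(tau) dz.

    a, b and sigma are hypothetical lumped parameters (assumptions).
    """
    rng = random.Random(seed)  # seeded generator for reproducible paths
    tau = tau0
    path = [tau]
    for _ in range(steps):
        xi = rng.gauss(0.0, 1.0)  # i.i.d. standard normal shock
        drift = (a - b * tau) * dt
        shock = sigma * math.sqrt(max(tau, 0.0)) * math.sqrt(dt) * xi
        # Clipping to [0, 1] is an added safeguard, not part of the source model.
        tau = min(max(tau + drift + shock, 0.0), 1.0)
        path.append(tau)
    return path


def expected_return_rate(tau0, a, b, t):
    """Closed-form mean of the assumed linear dynamics at time t."""
    tau_inf = a / b  # long-run expected return rate under the assumption b > 0
    return tau_inf + (tau0 - tau_inf) * math.exp(-b * t)
```

For example, `simulate_return_rate(0.1, 0.3, 0.6, 0.2)` produces one noisy path starting below the assumed long-run level `a / b = 0.5`, and `expected_return_rate(0.1, 0.3, 0.6, t)` gives the smooth curve around which such paths fluctuate. Setting `sigma=0.0` recovers the deterministic path, which is a convenient check on the discretization.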
Figure 1 illustrates the evolutionary path of the return rate in the presence of stochastic disturbances. In terms of expected value, the return rate may increase or decrease over time. The expectation of the return rate converges to a stable state over time, whatever the initial return rate is. The optimal strategy for the retailer is to keep the system state as close as possible to the stable state, even when the initial return rate is above the stable state. The return rate always fluctuates around its expectation as a result of the stochastic disturbance.
 Figure 2 shows the corresponding evolutionary path of the retail price over time. Because the retail price decreases in the return rate, its evolutionary path mirrors that of the return rate. Similarly, the retail price converges to a stable state in expectation.
 Figure 3 shows the impact of disturbance intensity on the collecting effort and the return rate. The retailer raises its collecting effort level as the stochastic disturbance intensity increases. As a result, the expected return rate increases with the collecting effort. We can therefore expect the profit rates of the supply chain members to benefit from a higher disturbance intensity. A possible explanation is that the retailer has to raise its collecting effort to prevent the return rate from deviating too far from the stable state, which results in a higher return rate, a lower retail price for the closed-loop supply chain system, and thus a higher profit rate for the supply chain members. However, this effect is not prominent, as the size of the increments shows.
  5. Numerical Analysis of Different Fairness Types
In this section, we investigate how the manufacturer shifts profit to the retailer under different types of retailer fairness concern. The approximation is the same as that used in Section 3.3. We compare the expected long-run equilibrium profit rates of the manufacturer and the retailer in Figure 4, Figure 5 and Figure 6. Figure 4 shows the comparison results of 
, which represents the scenario in which the Nash bargaining power of the retailer is small. Figure 5 and Figure 6 represent the scenarios with equal and high Nash bargaining power, respectively.
Figure 4, Figure 5 and Figure 6 illustrate that the more the retailer cares about fairness, the more profit the manufacturer shifts to the retailer, irrespective of whether the retailer has a gap or a self-due fairness concern. Under gap fairness concern, the retailer's profit rate increases in the degree of fairness concern faster than under self-due fairness concern, which makes the gap fairness concern retailer the more aggressive type.
 Figure 4 and Figure 5 demonstrate that gap fairness concern brings more profit shifting to the retailer when the Nash bargaining power of the retailer is below 0.5. When the Nash bargaining power of the retailer is above 0.5, self-due fairness concern may bring more profit shifting to the retailer. In model SF, the retailer sets the fairness reference point at the Nash bargaining point, according to his bargaining power. The retailer accepts a small share when his channel power is small and requests a larger share as his channel power grows. In essence, the self-due fairness concern retailer expresses his feeling of unfairness through the Nash bargaining point, which seems quite reasonable for both the retailer and the manufacturer.
 In contrast, the gap fairness concern retailer is aggressive when his channel power is low and conservative when his channel power is high. Although gap fairness concern may bring more profit shifting to the retailer when his channel power is low, it may displease the manufacturer, who may regard the retailer as too greedy and thus decide to disregard the retailer's feeling of unfairness.
Thus, we conclude that the self-due type of fairness concern is a more reasonable way for the retailer to express its concern about fairness and is more acceptable for the manufacturer when considering its profit shifting to the retailer.
  7. Conclusions
This paper addresses the stochastic collecting control problem in a closed-loop supply chain consisting of one manufacturer and one fairness-concerned retailer. Stochastic differential game models are formulated to study the optimal return problem with dynamic characteristics and random disturbance, and the feedback equilibria are derived using the HJB equation method. We also derived the evolutionary path of the return rate under the equilibrium control strategies for the models with different fairness concern types. Furthermore, we designed a coordinating contract for the manufacturer to coordinate with the retailer.
We have found that a unique feedback Markov equilibrium for the closed-loop supply chain exists only under a specific condition. The existence conditions for the decentralized supply chain models are the same, whatever the fairness concern type of the retailer. The equilibrium wholesale price and retail price strategies decrease in the return rate, and the equilibrium collecting control strategy increases in the return rate. The evolutionary path of the return rate cannot be predicted precisely because of the stochastic disturbance in the collecting process. We derived the expectation of the stochastic return rate, which approaches a stable state over time. Whether the expectation increases or decreases depends on the initial value of the return rate, and whatever the initial return rate is, the expectation approaches the same stable state for a given CLSC system. The manufacturer and the retailer can benefit from a higher disturbance intensity, although the effect on profit is small.
We further investigated how the retailer's fairness concern affects the supply chain system. We derived the feedback equilibria for two fairness concern types of the retailer, i.e., the gap fairness concern retailer and the self-due fairness concern retailer. The results indicate that neither gap fairness concern nor self-due fairness concern affects the equilibrium feedback control strategies in terms of retail price and collecting effort. The manufacturer has to shift profit to the retailer when the follower is fairness-concerned. The gap fairness concern retailer is aggressive when his channel power is low and conservative when his channel power is high. In contrast, the self-due type of fairness concern is a more reasonable way for the retailer to express its concern about fairness and is more acceptable for the manufacturer when considering its profit shifting to the retailer.
We found that neither the traditional two-part tariff nor the revenue-sharing contract can coordinate the closed-loop supply chain. We designed a hybrid contract with a wholesale price, a franchise fee and collecting expenditure sharing for the manufacturer to coordinate with the retailer. The collecting expenditure sharing percentage equals the ratio of the marginal value of the retailer to the total marginal value of the supply chain members.
The managerial implications for the supply chain members are as follows. From a long-run perspective, the manufacturer who leads the CLSC should shift some revenue to the retailer who is engaged in collecting used products, in order to relieve the retailer's fairness concern. A retailer acting as the gap fairness concern type would be too aggressive, while the self-due fairness concern type makes the fairness appeal more acceptable to both the manufacturer and the retailer. Although the manufacturer cannot simply adopt a two-part tariff or a revenue-sharing contract to coordinate the CLSC, it can employ the hybrid contract to do so.
There are several limitations to this research. First, because of the complexity of the stochastic differential game model, we did not study the CLSC scenario with competing retailers. However, competing retailers are more common in a supply chain led by the manufacturer. Moreover, the presence of competition would raise the peer-induced fairness problem. Therefore, it would be interesting to study how competition and peer-induced fairness concern jointly affect the equilibrium strategies and the stable state of the system. Second, we only consider the case in which the retailer collects the used products on his own. However, the manufacturer could employ incentive programs to better motivate the retailer to collect used products. As such, which kind of incentive program the manufacturer should adopt, and how it affects the system return rate and the profits of the supply chain members, would be meaningful questions.