The Presence of a Powerful Retailer on Dynamic Collecting Closed-Loop Supply Chain From a Sustainable Innovation Perspective

In this study, the presence of a powerful retailer in the dynamic collecting closed-loop supply chain is considered. The supply chain consists of a manufacturer and a retailer who is responsible for the used-product collection. Specifically, we have considered three settings, that is, no channel leader, the manufacturer as the leader and the retailer as the leader. The steady equilibrium and the optimal control strategies are derived in different channel power structures utilizing the open-loop control strategy method. By the comparison of three models, we found that the system with manufacturer leading results in the lowest return rate in the three models. If the transfer price of the used-product is low, the presence of a power retailer would result in a higher return rate, which is beneficial for decreasing the market price for the consumers. When the transfer price is quite high, the scenario with no channel leader in the supply chain would result in a higher return rate. However, the numerical comparison results of the profit rate of the supply chain members indicate that both manufacturer and retailer prefer themselves to be the channel leader rather than the other channel member as the leader. This means that although the presence of a power retailer is beneficial for the collecting efficiency, it is not enough to increase the profit of the supply chain members. The manufacturer should transfer all unit cost savings to the retailer in the scenario of manufacturer leading and no one leading.


Introduction
Recently, the closed-loop supply chain has drawn increasing attention from academic, as it can significantly reduce the firm's production cost as well as reduce the pressure on the environment.Some large firms, such as Xerox, Kodak and HP, have already started to comprehensively employ the remanufacture strategy [1,2].Collecting and the remanufacturing the end-of-life product is a good way to close the loop for the supply chain and it can also reduce waste emissions and resource consumption which meets the goal of sustainability (Mota et al.) [3].Currently, the firms which employ a remanufacture strategy can be found in almost all kinds of industries, such as the computer industry, printer industry, ink-cartage industry, furniture industry, auto parts industry, medical equipment industry, etc. (Ferguson and Toktay, 2006) [4].To employ a remanufacturing strategy, the most important procedure is to collect the used-products since used-products serve as materials for the remanufacturing.
However, used-product collecting is quite difficult because the used-products disperse over a large geography area.In reality, there are three different collecting modes across a supply chain, including, retailer collecting, manufacturer collecting and third party collecting.Figuring out the best collecting mode is a focus which has been intensely studied by the researchers during the last decade.Decentralization of the supply chain structure makes the program even harder to carry out.In recent years, as the advances of the production technology, the competition between the manufacturers is increasing, which makes the retailer more important in the supply system.Additionally, the presence of large retailers such as Wal-mart, Carrefour, and the online retailers such as Amazon and JD.com has increased the importance of the retailers in the supply chain and changed the channel structure of the supply chain.The manufacturers have to rely on large retailers for their distribution channel; thus, the retailers are dominant.Therefore, within a supply chain, either the manufacturers or the retailers can be dominant, or comparable.To this end, what is the impact on used-product collecting and which is the optimal collecting mode under such circumstances?To answer these questions, this study employs the dynamic collecting model to research this problem under different supply chain power structures.
There are increasing numbers of studies focusing on the closed-loop supply chain operation.We refer to Govindan et al. [5] for a comprehensive review on closed-loop supply chain management.The main focus of the researchers is on the collecting strategy of the supply chain members as well as the optimal reverse channel across the supply chain.In 2004, Savaskan et al. [1] first set up the reverse channel design framework from the perspective of the supply chain.Their results indicated the retailer-collecting model is preferred compared to manufacturer-and third-party-collecting models.Savaskan and Wassenhove [2] further investigated the reverse channel design problem of competing retailers.Savaskan's series of papers provided a classic framework to investigate the reverse channel design problem of closed-loop supply chain, and many other scholars have further extended their model to more complicated scenarios (Hong and Yeh [6], Huang and Nie [7], Wei and Zhao [8], Ma et al. [9]).Recently, some other studies have focused on the supply chain network design or recycle decisions.For example, Tsao et al. [10] proposed a closed-loop supply chain network design considering RFID adoption.Jena et al. [11] investigated the impact of sharing or not sharing advertisement cost on the used-products collecting activities.Bhattacharya et al. [12] investigated the used-products price optimization problem in a closed-loop supply chain system.These studies are mainly focused on the reverse channel design problem in different scenarios but have ignored the dynamic characteristics of the used-product collecting process.The retailers who aim to collect the used products must cope with the dynamic characteristics.
The above studies have significantly enriched the theoretical results on the closed-loop supply chain, yet they were all based on static model, which did not consider the dynamic characteristics of the collection process.In the dynamic collection model, Kiesmüller [13] investigated a dynamic collecting system with lead times, and applied the maximum principle to resolve the optimal control strategies of the system.Hosoda and Disney [14] also investigated the dynamics of a closed-loop supply chain with arbitrary lead times and a proportional random yield in the triage of returns.Geyer et al. [15] studied the dynamic remanufacturing strategy considering the components durability and limited life-cycle.These studies mainly concern on the dynamic feature of the inventory characteristics, and focus on centralized supply chain structure which handles the overall optimization problem of a system.These studies mainly focused on the inventory dynamic characteristics and ignored the dynamics in the collection process.
The activities which have been invested to collect the used-products have their dynamic characteristics, such as marketing type return advertising to increase the environmental awareness of customers as well as the knowledge about return policy [16] and operational type building recycling facilities, collecting, reprocessing and so on [17,18].Both marketing type activities and the operational activities have long term and accumulative effects, which means the return rate of the used-product reflects the accumulative invest history.Consequently, the dynamic optimal control model could be applied to resolve the collecting problem.
De Giovanni and Zaccour [18] proposed a dynamic return model which utilized differential game methods to characterize the feedback equilibrium of the closed-loop supply chain with one manufacturer and one retailer.A cost-revenue sharing contract is proposed, and they found the retailer who acts as the leader always prefers the contract while the manufacturer only prefers it under particular conditions.Huang and Nie [7] also proposed a dynamic closed-loop supply chain model which captured the dynamic characteristics of the collecting process.They derived the open-loop control strategies of both the manufacturer and the retailer in a manufacturer lead Stackelberg game.Huang et al. [19] developed the dynamic model and proposed a stochastic differential game model to investigate the uncertainty factors which impact the collection strategy of a remanufacturing closed-loop supply chain [20].These studies mainly considered the optimal control strategy problem in the supply chain where the manufacturer leads the channel.
Our research differs from the literature in the area of closed-loop supply chains in two aspects.The first is that we consider the dynamic process in the collection activities.The basis of a remanufacturing closed-loop supply chain is the collection of used-products, without which the remanufacturing of new products is out of the question.However, most researches utilized the static model to model the collection process which ignores the dynamic character of the activities.The second one is that we consider the impact of supply chain channel power structure on the closed-loop supply chain.The presence of powerful retailer has become a phenomenon in practice.How it would affect the collection activities for the closed-loop supply chain is an important question.
Thus, utilizing the dynamic collection model, this paper considered three different channel power structures to further study the optimal closed-loop supply channel collecting strategy, i.e., no channel leader, the manufacturer as the leader, the retailer as the leader.Furthermore, we discussed the effect of the different channel power structures on the optimal collecting strategy [21][22][23][24][25][26][27][28] .
The rest of the paper is organized as follows.Section 2 presents the modeling framework.Section 3 derived the open-loop control strategy, respectively, for three models (N, M and R models).Section 4 conducts the comparison analysis for the three models.Section 5 concludes the paper.

The Model
The notations throughout the paper are summarized in Table 1.
Table 1.Notations throughout the paper.

Notation Definition ρ
The impact coefficient of collect effort on return rate.δ The deprecation ratio of the return rate.φ The market volume for the product.β The price sensitivity coefficient.c m , c r The unit production cost with new raw material and used-product.

∆
The unit cost savings of used-product.b The transfer price pay by manufacturer to retailer.k The collecting cost coefficient of the retailer.r The discount rate for manufacturer and retailer.

τ(t)
The return rate of used-product at time t.

A(t)
The collect effort level at time t.p(t) The product price at time t.

D(t)
The demand rate at time t.w(t) The wholesale price at time t.
Profit rate of manufacturer and retailer.
The cumulative profit in the planning interval of manufacturer and retailer.
The current value Hamiltonian for manufacturer and retailer.
The activities which have been put into the used-product collecting process, such as return advertising, recycling facilities building and maintaining, and technology for reprocessing, all have long-term and accumulative effects, which means the return rate of the used-product reflects the accumulative invest history.Following the dynamic return model of Huang and Nie [7], Huang et al. [19], we consider dynamic characteristics in the recycling process and define the return rate variation function as τ(t) < 1 represents the used-product return rate at time t, which is the system state variable.A(t) indicates the collect effort level at time t, which represents the efforts made by the retailer in the collecting activities, such as cost of recycling facilities, maintenance and advertising.ρ shows the effect of collect effort on return rate, and the larger of ρ, the faster change of return rate.δ indicates the deprecation ratio of the return rate.Equation (1) clearly depicts the dynamic process of the return rate with collecting effort of the retailer, which suppose that the return rate has a cumulative effect and is formulated as a continuous and weighted average of past return activities, with a decaying weighting function.
The manufacturer manufactures the product and passes it to the retailer for sale, the demand rate of the product is given by φ > 0 represents market volume, β > 0 indicates the price sensitivity coefficient.The product either produced by recycling material or new raw material are exactly the same.What should be noted is that it is more suitable for the case the manufacturer-made use of the used-product for new production instead of just refurbish the used-product.
The unit production cost with new raw material is c m , while the unit production cost with recycling material is c r , and ∆ = c m − c r > 0, which indicates the remanufacturing must be cost saving otherwise the manufacturer would not employ it.The manufacturer would pay b to the retailer for collecting the used-product back, and we have b ≤ ∆.When the return transfer payment through retailer is greater than the total cost saving, the manufacturer would not choose to collect used-products for remanufacturing.
The cost function of the retailer for collecting the used-products is kA(t) 2 /2, where k represent the collecting cost coefficient.In reality used-product may located in any place of retailer's distribution zone, thus, the retailer needs to perform advertising of their collection policy and to set collection facility over the entire distribution zone.As a result, the retailer may experience a high cost coefficient.
The retailer pays σ to the customers who return back the used-product for compensation.For simplicity, we set σ = 0, which should be noted that this assumption only simplified the mathematical complexity and will not change the main results.
The decision period for the members is [0, ∞), and the discount rate is r.We carry out the analysis in the two-stage supply chain with the retailer taking charge of the collection activities.The retailer simultaneously decides the retailer price p(t) and the collect effort level A(t).The objective function for the retailer is given below max The manufacturer decide the wholesale price w(t) to the retailer, and its objective function is given by max The constraint is Equation (1).According to the power structure of the supply chain members, there are three kinds of supply chain structure, i.e., no channel leader (model N), manufacturer as channel leader (model M), and retailer as channel leader (model R).In the next section, we apply the differential game theory to solve the open-loop control strategy [29][30][31][32][33][34][35].

Model Analysis
We utilize the open-loop control strategy method for solving the differential game formulated in Section 2. One can refer to Sethi and Thompson [20] and Nair and Narasimhan [36] for details about the open-loop control strategy method for solving the differential games.We derive the equilibrium solution by constructing the Hamiltonians for the supply chain members [37].Denote µ m and µ r to be the costate variable for the manufacturer and retailer.The current value Hamiltonian for the retailer is formulated by And the current value Hamiltonian for the manufacturer is given by

Nash Differential Game-Model N
In such a scenario, the manufacturer and the retailer have equal channel power; thus, there are no channel leaders in the supply chain and the game they play is a Nash differential game.The decision sequence for the Nash differential game is as follows: both the manufacturer and the retailer choose their decision separately and simultaneously.The manufacturer decides the wholesale price w(t), the retailer decides the retail price p(t) as well as the collection effort level A(t).According to retailer's current value Hamiltonian (5), the necessary conditions for retailer's optimization problem is given by Inserting p = m + w into manufacturer's current value Hamiltonian (6) we have The necessary conditions for manufacturer's optimization problem are given by .
From Equations ( 7) and ( 12) can derive the equilibrium strategies of manufacturer and retailer And we can derive the equilibrium collecting effort strategy from Equation ( 8) Substituting the equilibrium strategies into Equations ( 9) and ( 10) yields the differential equations about the state variable τ and the costate variable µ r as below . where . Proposition 1 gives the condition under which the steady equilibrium exists in the Nash setting.
, the steady equilibrium of Model N exists and the steady equilibrium is given by The wholesale price of the manufacturer when the system is steady state is calculated as The retail price and collecting effort level of the retailer are calculated as Proof.The steady state of return rate and costate variable can be obtained by solving the differential Equation (16).The characteristic root of differential Equation ( 16) is calculated by As long as the equations have real characteristic root, we always have λ 1 > 0. Thus, when the differential equations have convergence solution, it will be a saddle point equilibrium.When 4k 2 δr + 4k 2 δ 2 + 4kρ 2 E 1 < 0, it is obvious that λ 2 > 0, and in this case the solution of differential equations is divergence.When 4k 2 δr + 4k 2 δ 2 + 4kρ 2 E 1 > 0, it is obvious that λ 2 < 0, and in this case the solution of the differential equations is saddle point equilibrium, which means when kδ(r + δ) > −ρ 2 E 1 , there is steady state equilibrium for the system.We can derive the steady state of the system from the differential equations by .τ = 0 and .µ r = 0.And in addition, as the return rate of the system cannot exceed 1, thus the steady state of return rate should satisfy τ < 1, i.e., kδ(r Proposition 1 indicates if retailer experiencing a relatively high collection cost coefficient, the system will reach a steady state, otherwise when the cost coefficient is small, the system will be unstable.Furthermore, when the cost coefficient is too small, the retailer has incentive to collect all the used-product.However, the distribution of used product is relatively dispersed and combined with high transportation expense, thus it is very unlikely for retailer to collect all the used-product.Consequently, in reality, the collection cost coefficient is reality large, which could satisfy the stable equilibrium condition of the differential equations.Substitute the steady state results of Proposition 1 into profit function of supply chain member, we have the steady state profit rate of retailer.
And the steady state profit rate of manufacturer The following proposition characterizes the open-loop equilibrium control strategies for the supply chain members.Proposition 2. Under the setting of Nash differential game (Model N), the optimal wholesale price of manufacturer is given by The optimal collecting effort control strategy of retailer is The optimal retail price control strategy of retailer is Proof.Equation ( 16) is differential equations, as the problem we consider in this paper is infinite time limit, let the terminal condition as lim T→∞ τ(t) = τ N , where τ N is the steady state of the state variable.Combining the initial condition τ(0) = τ 0 , the solution of the differential equations is solved as Substituting Equation ( 26) into the equilibrium control strategies of the supply chain members can yield the optimal control strategies in Proposition 2. The proof is complete.

Manufacturer as Leader Stackelberg Differential Game-Model M
In this scenario, manufacturer is the channel leader in the supply chain and retailer be as the follower, thus the game is a manufacturer leader Stackelberg differential game.The decision sequence of Model M is as follows: the manufacturer first decides the wholesale price w(t), and then the retailer chooses its retail price p(t) and collecting effort level A(t).The necessary conditions are still Equations ( 7)- (10), and from which can derive the response function of retailer According to the retailer's response function, the current value Hamiltonian of the manufacturer could be formulated as According to the necessary optimality condition ∂H m /∂w = 0, we have Substituting back into the response function of retailer yields Substituting the equilibrium strategies of manufacturer and retailer into Equations ( 9) and ( 10) yields the differential equations about the state variable τ and the costate variable µ r as follows . where . Proposition 3 gives the condition under which the steady equilibrium exists in the manufacturer leader Stackelberg game setting.
, the steady equilibrium of Model M exists and the steady equilibrium is given by The wholesale price of the manufacturer when the system is steady state is And the retail price and collecting effort level of the retailer are Proof.The proof is similar to that of Proposition 1 and thus we omit here.
Proposition 3 also indicates that the system reaches a steady state only with the high collection cost coefficient, the supply chain is under the lead of manufacturer.Likewise, we have the steady state profit rate of retailer.
And the steady state profit rate of manufacturer is Proposition 4 gives the open-loop equilibrium control strategies for the supply chain members of model M. Proposition 4.Under the setting of manufacturer leader Stackelberg differential game (Model M), the optimal wholesale price of manufacturer is given by w The optimal collecting effort control strategy of retailer is calculated as And the optimal retail price control strategy of retailer is Proof.The proof is similar to that of Proposition 2 and, thus, we omit here.

Retailer as Leader Stackelberg Differential Game-Model R
In this scenario, the retailer is the channel leader in the supply chain and the manufacturer be as the follower, thus, the game is a retailer leader Stackelberg differential game.The decision sequence of Model R is as follows: the retailer first decides the retail price p(t) and the collecting effort level A(t), and then the manufacturer decides the wholesale price w(t).According to the backward induction, the current value Hamilton of manufacturer is formulated as Equation (11), and from the necessary optimality condition (12) can derive the response function of the manufacturer as following Considering manufacturer's response function, the current value Hamilton of retailer is formulated as The necessary optimality conditions are given by According to Equations ( 42) and (43) can yield the equilibrium strategies of retailer Taking back into the response function of the manufacturer yields the equilibrium strategy of manufacturer Substituting the equilibrium strategies of manufacturer and retailer into Equations ( 44) and (45) yields the differential equations about the state variable τ and the costate variable µ r as follows .
Proposition 5. When kδ(r + δ) > −ρ 2 (M 1 + N 1 ), the steady equilibrium of Model R exists and the steady equilibrium is given by The wholesale price of the manufacturer when the system is steady state is And the retail price and collecting effort level of the retailer are Proof.The proof is similar to that of Proposition 1 and thus we omit here.
Substitute the steady state results of Proposition 5 into profit function of supply chain member, we have the steady state profit rate of retailer.
And also the steady state profit rate of manufacturer The following proposition 6 gives the open-loop equilibrium control strategies for the supply chain members of model R. Proposition 6.Under the setting of retailer leader Stackelberg differential game (Model R), the optimal wholesale price of manufacturer is given by w The optimal collecting effort control strategy of retailer is calculated as And the optimal retail price control strategy of retailer is Proof.The proof is similar to that of Proposition 2 and thus we omit here.

Comparison Analysis under Different Models
Sections 3.1-3.3analyze the equilibrium control strategies for the supply chain members with no channel leader (model N), manufacturer as channel leader (model M), and retailer as channel leader (model R), respectively.In this section, we will conduct the comparison analysis on three models to study how the presence of the power retailer would affect the system price, product collecting rate as well as member profitability.We assume that all three models can reach steady state equilibrium conditions.
Proof.From the expressions of τ R and τ M , it is calculated as Similarly from τ N and τ M , we have And from τ N and τ R , we have Proposition 7 indicates that when in the steady state, the manufacturer acts as channel leader and this would result in the lowest return rate compared with the other two models.Regarding the retailer leader setting and no leader setting, when the transfer price of used-product is relatively high (b > 3∆/4), model N (no channel leader) would result in higher return rate.Furthermore, when the transfer price of used-product is not that high (b < 3∆/4), model R (retailer as channel leader) would result in higher return rate.The results indicate that when the manufacturer leads the channel, the retailer has the lowest incentive to collect in a dynamic return model.As the transfer price is high, the collecting efficiency of the retailer is higher under the scenario where there is no leader in the supply chain.However, when the transfer price is low, the retailer will have an incentive to collect only if the retailer were channel leader.
It is obvious to derive the conclusion in Proposition 8.The proof is complete.
Proposition 8 indicates, when the transfer price of used-product is high (b > 3∆/4), the retailer will put more effort into collecting activities in the setting that there is no channel leader.When the transfer price is not that attractive (b < 3∆/4), the retailer will put more collecting effort only when it leads the channel.
Proposition 9 indicates that the presence of the power retailer would lead to the lowest product market price when the transfer price of the used-product is low (b < ρ 2 ∆ 2 β − kδ(r + δ) / ρ 2 β∆ ); and when the transfer price is relatively high (b > ρ 2 ∆ 2 β − kδ(r + δ) / ρ 2 β∆ ), the product market price will be lowest under the setting of no channel leader (model N).The results show that the manufacturer lead setting (model M) would results in a highest product market price, which means it does not contribute to lowering of the market price which would bring about increasing of sales volume and consumer's utility.
The comparison of w M and w N is in the similar way.Compare with w N and w As the profit functions under different models are too complex to deal with by an analytical approach, we will conduct a numerical comparison instead in the following part.The parameters are chosen as: τ 0 = 0, ρ = 2, ∆ = 3, φ = 50, β = 0.8, c m = 6, δ = 1, r = 0.15, k = 150.The transfer price b will be limited in a reasonable section as b < ∆ to make the numerical results be consistent with the actual situations.
Figure 1 shows that π R r (t) > π N r (t), and Figure 2 shows that π N r (t) > π M r (t), consequently we can derive π R r (t) > π N r (t) > π M r (t).The result is quite intuitive: the retailer prefers itself to be the leader in the channel mostly in the three settings and does not prefer the manufacturer to be the leader.  .The transfer price b will be limited in a reasonable section as b < Δ to make the numerical results be consistent with the actual situations.
The result is quite intuitive: the retailer prefers itself to be the leader in the channel mostly in the three settings and does not prefer the manufacturer to be the leader.
The result is quite similar to the retailer's, which means both manufacturer and retailer prefer the setting that is led by themselves while they dislike the setting that is led by other supply chain members.Figure 3 shows that π M m (t) > π N m (t), and Figure 4 shows that π N m (t) > π R m (t); consequently, we can derive π M m (t) > π N m (t) > π R m (t).The result is quite similar to the retailer's, which means both manufacturer and retailer prefer the setting that is led by themselves while they dislike the setting that is led by other supply chain members.
The result is quite similar to the retailer's, which means both manufacturer and retailer prefer the setting that is led by themselves while they dislike the setting that is led by other supply chain members.
The result is quite similar to the retailer's, which means both manufacturer and retailer prefer the setting that is led by themselves while they dislike the setting that is led by other supply chain members.In the discussion above, we have not investigated the choice of the transfer price b for the manufacturer.With a higher transfer price, the retailer would have more enthusiasm to invest in collection activities, while the manufacturer would result in lower benefits from used-product.Proof.It is very easy to verify that ∂π N m /∂b > 0, ∂π M m /∂b > 0 and ∂π R m /∂b = 0.And as the transfer price has a range in [0, ∆], the manufacturer could easily choose the transfer price which is given in Proposition 11.The proof is complete.
Proposition 11 indicates that the manufacturer should transfer all the unit cost savings to the retailer when the manufacturer leads the supply chain, or no one leads the supply chain.Although the manufacturer transfers all the unit cost savings to the retailer, the manufacturer could benefit from increasing product sales.It is interesting that the transfer price has no impact on the steady state profit rate of the manufacturer when the retailer leads the supply chain.That is mainly because the manufacturer could not affect the decision of the retailer in the presence of power retailers.Therefore, the transfer price has lost the incentive to the retailer.Proposition 12. Let b = ∆ in all the three models, τ Proof.The proof is similar to that of Propositions 7-10 and thus we omit here.
When the manufacturer transfers all the unit cost savings to the retailer, the manufacturer leading the supply chain would result in the same return rate and product price.That means the manufacturers would incentive the retailer in the collection activities by transferring all the unit cost savings when they lead.Although the return rate is highest when no one leads the supply chain, the profitability is determined by channel power more than the product return efficiency.However, from a sustainability perspective, the channel power enhancement of the retailer would be beneficial to the environment with low waste emissions.The manufacturer would give away some channel power to increase the product reuse efficiency when it leads the supply chain.

Conclusions
In this study, we considered the presence of the power retailer on the dynamic collecting problem in a closed-loop supply chain, which consists of a manufacturer and a retailer who is responsible for the used-product collecting.For a closed-loop supply chain, the key procedure is the collection of the used-product which are to be used for the production of new products.The main focus of this study is the dynamic characteristics of the collecting process and the presence of the power retailer on the reverse channel.Firstly, we formulated the dynamic collecting model by the differential equation; and then we derived the equilibrium of three settings, i.e., no channel leader Nash differential game (model N), manufacturer leader Stackelberg differential game (model M), and retailer leader Stackelberg differential game (model R).Thirdly, we conducted the comparison analysis on return rate, price and profit rate in the three models.
The equilibrium results show that the collection cost coefficient must be relatively high or else the dynamic system cannot reach a steady state and will be unstable, regardless of the model setting, i.e., model N, M or R.
The comparison results show that when the manufacturer leads the channel, the retailer has the lowest incentive to collect in a dynamic return model.That means the presence of a power retailer is beneficial to the reverse channel of the closed-loop supply chain.When the transfer price is high, the collecting efficiency of the retailer is higher under the scenario there is no leader in the supply chain.That means when the transfer price the manufacturer affords is relatively high, the retailer no longer needs the channel power.However, when the transfer price is low, the retailer will have the incentive to collect only if the retailer itself were channel leader.That means when the transfer price is unattractive, the retailer needs the channel power to ensure the benefit of the collecting activities.
To the market price, the results show that manufacturer-led setting (model M) would result in a higher product market price.Furthermore, when the transfer price is low, the retailer leader setting can lead to lowest market price, as a result, the retailer puts more efforts in collecting used-products under this scenario.
To the profit rate of manufacturer and retailer, the comparison results of three models are quite similar.The results show that both the manufacturer and the retailer prefer to be the channel leader while they dislike the other channel members to be the leaders.This means that although the presence of power retailer is beneficial to the collecting efficiency in the closed-loop supply chain, it is not enough to increase the profit level of the manufacturer.However, from a sustainable innovation perspective, the presence of power retailer would be beneficial to the environments as it has high efficiency of used-products reuse and low waste emissions.
The proof is complete.□As the profit functions under different models are too complex to deal with by an analytical approach, we will conduct a numerical comparison instead in the following part.The parameters are chosen as:

Figure 1 .
Figure 1.Retailer's profit rate difference between model R and N.

Figure 1 . 18 Figure 2 .
Figure 1.Retailer's profit rate difference between model R and N. Sustainability 2018, 10, x FOR PEER REVIEW 14 of 18

Figure 2 .
Figure 2. Retailer's profit rate difference between model N and M.

Figure 2 .
Figure 2. Retailer's profit rate difference between model N and M.

Figure 3
Figure 3 shows that

Figure 3 .
Figure 3. Manufacturer's profit rate difference between model M and N.

Figure 4 .
Figure 4. Manufacturer's profit rate difference between model N and R.

Figure 3 .
Figure 3. Manufacturer's profit rate difference between model M and N.

Figure 2 .
Figure 2. Retailer's profit rate difference between model N and M.

Figure 3
Figure 3 shows that

Figure 3 .
Figure 3. Manufacturer's profit rate difference between model M and N.

Figure 4 .
Figure 4. Manufacturer's profit rate difference between model N and R.

Figure 4 .
Figure 4. Manufacturer's profit rate difference between model N and R.

Proposition 11 .
In model N and model M, the optimal transfer price of the used-product is b N * = b M * = ∆, while in model R, the optimal transfer price of the used-product is b R * ∈ [0, ∆].