Construction of Equilibria in Strategic Stackelberg Games in Multi-Period Supply Chain Contracts

Reza Azad Gholami; Leif Kristoffer Sandal; Jan Ubøe

doi:10.3390/g13060070

Abstract

Almost every supplier faces uncertain and time-varying demand. E-commerce and online shopping have given suppliers unprecedented access to data on customers’ behavior, which sheds light on demand uncertainty. The main purpose of this research project is to provide an analytic tool for decentralized supply channel members to devise optimal long-term (multi-period) supply, pricing, and timing strategies while catering to stochastic demand in a diverse set of market scenarios. Despite its ubiquity in potential applications, the time-dependent channel optimization problem in its general form has received limited attention in the literature due to its complexity and the highly nested structure of its ensuing equilibrium problems. However, there are many scenarios where a single-period channel optimization solution may turn out to be myopic as it does not consider the after-effects of current pricing on future demand. To remedy this typical shortcoming, using general memory functions, we include the strategic customers’ cognitive bias toward pricing history in the supply channel equilibrium problem. In the form of two constructive theorems, we provide explicit solution algorithms for the ensuing Nash–Stackelberg equilibrium problems. In particular, we prove that our recursive solution algorithm can find equilibria in the multi-periodic variation of many standard supply channel contracts such as wholesale, buyback, and revenue-sharing contracts.

Keywords:

game theory; Stackelberg games; supply chain management; stochastic demand; behavioral newsvendor; price-setting newsvendor

MSC:

90-10; 91A10; 91A15; 91A25; 91A50

1. Introduction

Almost every vendor faces uncertain and time-varying demand. The uncertainty of demand may be of different natures and varying levels of tractability for statistical modeling. The demand uncertainty for a specific commodity may stem from consumer behavior or the economic development condition for that commodity. For instance, the stochasticity of demand for commodities such as sports apparel may arise from changing trends of fashion; while for electronic devices or computer software, it may be caused by better products being rolled out. Anticipating future trends of theee market and addressing stochastic demand remains a challenge for manufacturers and vendors. In general, the uncertain demand for a specific product is price-dependent and dynamic in the sense that it evolves through time. The main goal of this paper is to demonstrate that a general structure for stochastic dynamic demand can be utilized for decision-making purposes. This general structure, as we will see, is such that many other demand models turn into sub-classes of our formulation.

The single-period (static) supply channel optimization problem while facing price-dependent uncertain demand has long been studied in the literature. For a comprehensive survey, see [1]. However, there are a variety of scenarios in which decision-makers need a multi-period perspective and analysis of stochastic demand for a commodity as its price (and price-elasticity of demand) changes. Among them, we enumerate a few below, and provide concrete examples in Section 7.

Consider an entrant price-setting supplier trying to secure a niche in a market with uncertain demand. The demand-based revenue optimization problem faced at the beginning of the market penetration campaign may be quite different from the one at a later time when she would have become an incumbent vendor. Initial market penetration campaigns are typically driven by demand-boosting strategies such as temporary offerings of free trial versions or low-price commodities [2,3]. Such strategies may incur huge losses at the beginning. Thus, a time-dependent analysis becomes necessary to determine the optimal length of the free distribution phase, and the pricing strategy for various times afterward.

Strategic customers, aware of the marketing plans or seasonal trends in pricing, may either expedite their purchase to benefit from those marketing schemes, or postpone it until they see a price lower than their reservation price. Khouja et al. (2020) demonstrate that in both cases a retailer would be better off by catering to these thrift consumers even by offering off-price commodities [4]. Levin et al. (2009) define strategic customers as those who are “aware that pricing is dynamic". Thus, obviously, incorporating the expectations of market-savvy customers in the supply channel’s revenue optimization problem also requires a dynamic (i.e., time-dependent) demand and pricing analysis [5]. An example of this type is brought in Section 7.3 where the solution algorithm provides optimal supply quantities and prices at various times as well as the optimal timing for offering off-price goods.

Demand for certain innovative products is built-up as time goes by, through word-of-mouth marketing (or network effect). See the classic work of [6], or that of [7]. Modeling such markets where demand requires a massive number of buyers to take off also necessities a time-dependent analysis. These type of markets have been covered in the example Section 7.4 and our results are consistent with predictions made by the Baas diffusion model for new product adoption.

The scenarios described above, despite demonstrating different demand patterns, have one feature in common—in all of those settings, the price-setting supplier(s) must come to a balance between immediate profits and maintaining future demand. A static (single-period) channel optimization method typically does not include such a balance as, for example, it does not guarantee that the current price will not stifle future demand. In our analysis, we take this point of view from marketing and behavioral economics that previous prices scale demand, for example, by affecting the number of customers taking interest in the product.

It should be noted that the same algorithm is used to model and solve the different optimization problems in each of the various scenarios described above. That is, in our model, a market is typically defined by its memory-based demand functionals. The solution algorithm, then, fed by these varying demand structures provides optimal supply quantities and pricing for different times. Thus, obviously, the algorithm’s scope of applicability is not limited to these example scenarios and may be expanded to many time-dependent supply and pricing optimization problems. Moreover, it is imperative to note that the section dedicated to the numerical implementation of the model (Section 7) is only intended to familiarize the reader with the scope of applicability of our solution algorithm and its implementation steps. This section uses simplified functionals demonstrating different demand patterns. The flexility of our model enables it to provide time-dependent prescriptive analyses for highly different economic and inventory management settings. Another feature of our model of dynamic price-dependent demand is that it can be embedded in a game theoretic setting where two vendors cater to the demand within a vertical supply channel. The general solution algorithm presented in the constructive Theorem 1 thus provides the optimal level of inventory and optimal prices for both of the channel members at each period.

The main challenge in a multi-period discrete time model of demand in which demand at the present may be affected by that of previous periods is the implicit interdependence of values of all periods. In order to emphasize the nestedness caused by this interdependence, we introduce the notion of memory functions. These memory functions carry the effects of present demand onto the future. They are generally price- and time-dependent and can be adjusted to model markets with stronger or weaker memories. By embedding memory effects in the model, not only do we emphasize this nestedness, but also cover the downstream (customer-side) effects of pricing. Thus, our model will be able to systematically cover price elasticity of demand as it changes through periods. Otherwise, by solving the same problem for demand distribution at every period, we cannot see the after effects of the pricing scheme on demand. Moreover, in Appendix A, we find the necessary conditions for the well-posedness of a decentralized supply channel equilibrium problem with respect to our model. Any supply channel contract satisfying these criteria can be decoupled and solved by our proposed method. In Appendix B, we prove that many conventional supply channel coordination contracts indeed meet these requirements.

The paper is organized as follows. We start with a single-supplier model. In Section 3.1, we introduce the multi-period expected revenue optimization problem of a supplier addressing a time-varying uncertain demand. In Section 3.2, we briefly sketch our solution algorithm for the (single-vendor) optimization problem using a backward induction approach. The structure of the uncertain demand in these sections remains arbitrary and no assumption is made about its statistical distribution. Next, in Section 4, we analyze the uncertain demand structure and how it is affected by market memory.

In Section 5, combining the two previously introduced segments of the model, we embed the memory-based demand structure in a general time-dependent profit and inventory optimization problem. Setting the stage for a complete solution algorithm, in Proposition 1, we prove that many conventional decentralized channel coordination contracts are of certain structure making them compatible with our backward induction solution approach. In the next section, we use this feature to decouple the nested equilibria problems with

3 n

unknown variables into

3 n

decoupled equations, each with only one unknown variable.

Finally, in Section 6, we extend the solution algorithm to a supply channel composed of two vendors competing in a Stackelberg framework. The direction of generalization in this section is based on the number of periods: first, in Section 6.1, the static single-period equilibrium problem is solved, and, finally, in Section 6.3, the general solution algorithm for the dynamic game is presented. The final theoretical results for equilibria problems are stated in Theorem 1 and its Corollary 1. The final solution algorithm yields the numerical values for optimal decision variables at different times while considering all the model parameters to also be time-dependent, thereby ensuring full non-autonomy of the model.

While this article should be regarded as a methodology paper, in Section 7, we provide examples of decision-making problems using our model. It is important to emphasize that the simplified demand functionals offered in this section are merely illustrative and not, for instance, the results of empirical studies. Through these examples, we will see how the model can be implemented in strategic games where the parties must balance immediate profits with future earnings. Among the scenarios in this section, we analyze a special case wherein the two suppliers integrate to form a centralized channel (Section 7.2). The diagram of section dependency can be found in Scheme 1.

Scheme 1. Diagram of Section Dependency.

2. Related Works

In this section, we survey different models employed by researchers in the analysis of supply channels addressing uncertain demand. We divide the large body literature on the subject into two classes. The first class of papers deem the distribution of stochastic demand to be unknown. The second group of researchers consider certain characteristics of the demand distribution to be part of the a priori knowledge of the decision-makers about the market. In many research works, a continuous probability distribution is assumed for the uncertain demand.

In the first group, ref. [8] formulates the problem of stochastic demand in a Bayesian setting. Assuming a known prior distribution for the uncertain demand, the newly gained information is incorporated into the posterior distribution with unknown parameters. These unknown parameters constitute a multi dimensional state space. The dimensionality of the resulting problem is then reduced such that the solution of the Bayesian model can be obtained by solving another dynamic program with a one-dimensional state space. Analogously, the study in [9] provides a Bayesian inventory management analysis in which demand distribution belongs to a parametric family of distributions. However, they assume that the unmet demand is lost and unobserved. Bensoussan et al. (2007), in their analysis of the multi-period newsvendor model, also consider demand distribution to be the state of their stochastic programming problem [10]. In their model, demand is a stationary Markov process with a known transition probability. Using the unnormalized probabilities, they convert the state transition equation to a linear one. A non-parametric Monte Carlo sampling algorithm is used in [11] to garner information about the underlying distribution of demand. Assuming that the demands in all periods are independent and identically distributed (i.i.d) random variables, they do not consider the inter-dependence of demand level in the current period to the future ones. All of these papers, addressing only the inventory management problem, consider the demand to be independent of the price. Therefore, they do not address the optimal pricing strategy problem.

In the second camp [12], analyzing a multi-period newvendor model, discretize the problem as a multi-stage stochastic programming problem. The stochasticity of demand in their model is formulated as a set of a finite number of scenarios and the occurrence of each scenario is associated with a probability. Moreover, they use a discrete probability distribution (Poisson) to represent the demand. The ensuing scenario-based stochastic problems can then be treated as discrete deterministic optimization problems. In their model, demand distribution is known to the newsvendor and is not price dependent. In the seminal analysis of the single-period newsvendor problem presented in [13], considers uncertain price-dependent demand for a perishable good to be of a general probability density function. The decision variables are the order quantity and the pricing policy, which are obtained by solving the maximization problem for the expected profit.

Gümüş et al. study a dyadic channel coordination problem in which two channel partners supply used goods to a peer-to-peer web-based market [14]. They construct a Stackelberg structure between the manufacture (leader) and the retailer (follower), and analyze the necessary conditions for return policies to constitute the equilibrium strategy. In their model, the time frame is comprised of two periods. In the first period, the degree of uncertainty in customer valuation for a specific product is assumed to be heterogenous and follow a uniform distribution. Trying to formulate the inter-dependence of demands at the two periods, they assume the demand potential for the second period to be positively correlated to the realization of demand in the first period, which becomes the known history when the second period starts. Khouja et al. argue that a supplier addressing uncertain demand must also consider the utility-optimizing attitude of strategic customers, in addition to the distribution of uncertain demand [4]. They prove that a supplier would be better off by complying with the expectations of the strategic customers who may expedite or postpone their purchase expecting certain pricing trends. The multi-period bi-level channel optimization models in [15,16] also employ a memory-based framework wherein different dynamic demand structures are represented and solve a basic supply channel coordination problem, including a price-setting newsvendor supplier.

We consider the mean and variance of demand to be arbitrary functions of time and price history. The dependence of demand mean and variance to the current price can be obtained from microeconomics theory or empirical data, while the inter-dependence of current demand to the prices of the past periods are represented by memory functions. The solution scheme that we propose is, however, independent of the functions representing the demand distribution. Moreover, based on the model presented in [17], we consider the current demand to be a function of current price, pricing strategy in the past, and demand history. Thereby, we avoid neglecting the downstream effect of pricing strategies. That is, our model also considers the fact that the pricing scheme set by the decision-maker(s) will affect the availability of the commodity to the costumers, which in turn, will affect their purchase decision, future demand, and the expected profit for the vendor(s). As a result of such a nested, price-dependent demand structure, the decision-makers will be able to engineer the demand through devising the optimal pricing strategy.

3. Problem Description

A decentralized supply channeladdresses the demand for a perishable commodity over a given timespan. The demand for the commodity is uncertain with a price-dependent distribution. The suppliers are risk-neutral as they seek to maximize their expected discounted total profit within the entire time horizon. We consider a discrete time frame for our model, in which the timespan is divided into n intervals referred to as periods. Each period is denoted by k;

k \in {1, \dots, n}

. The periods may be of different lengths and, hence, differently value-discounted. We assume all model parameters and variables to be constant within each period. The proposed solution algorithm is such that the centralized channel becomes a special (trivial) case of the general decentralized format (see Section 7.2).

3.1. Multi-Period Profit and Inventory Level Optimization Catering to Uncertain Demand

In this section, we outline a brief sketch of our general multi-period inventory management and profit optimization algorithm for a supplier who faces uncertain demand. For simplicity, we start off by analyzing the case with only one supplier. Later, in Section 6, we extend the analysis to a game theoretic scenario within a decentralized supply channel.

A price-setting supplier has to cater to an uncertain

D_{k}

within each period k in a timespan composed of n periods. Her decision variables are the retail prices

r_{k}

s and supply quantities

q_{k}

s. We denote the net running value of the profit obtained at period k, by

Π_{k}

, and its expected value with respect to a given distribution by

{\bar{Π}}_{k}

. The supply quantity

q_{k}

is to address the stochastic demand

D_{k}

. Luckily, there exists a classical closed form solution for each optimal

q_{k}

as a function of

r_{k}

—the solution for the single-period price-setting newsvendor problem [18]. Thus, the expected profit at period k will be of the following general form:

E [Π_{k}] = {\bar{Π}}_{k} (D_{k}, r_{k}, q_{k} (r_{k}), {parameters}_{k}) = {\bar{Π}}_{k} (D_{k}, r_{k}, {parameters}_{k})

(1)

It should be noted that in our model, in addition to the decision variables in this single-vendor model, i.e.,

r_{k}

and

q_{k}

, all of the given parameters are considered time-dependent entities. This full-blown non-autonomy with respect to both the variables and the parameters is kept throughout the entire analysis and is present in the expression of the final results in Theorem 1 and its Corollary 1 for general double-vendor models. Assuming the total number of periods to be n, we have the expected discounted total profit of the whole n periods as below.

J : = E [Π_{1}] + α_{2} E [Π_{2} ∣ D_{1}] + \dots + α_{n} E [Π_{n} ∣ D_{1}, \dots, D_{n - 1}]

(2)

where

0 < α_{k} \leq 1

are given discount factors (we set

α_{1} = 1

).

Now, we have to solve the following nested n-variable optimization problem,

max_{r_{n}} J = \sum_{k = 1}^{n} α_{k} {\bar{Π}}_{k} (r_{k}) (The barred symbol indicate expected value .)

(3)

where the price sequence at period k,

r_{k} = {r_{j}, j = 1, \dots, k} .

Moreover, throughout this paper, a hat superscript will denote an optimal decision variable. Solving (3), we seek to obtain the vector of optimal prices denoted by

{\hat{r}}_{k}

.

3.2. A General Solution Scheme Based on Backward Induction

In the final period, there is no need to worry about future demand. Moreover, by the time the decisions are being made for the last period, all the previous decision variables and the demands themselves are common knowledge. Moreover, because no term other than

{\bar{Π}}_{n}

in J as expressed in (3) depends on

r_{n}

, the problem of finding the nth argmax of J boils down to the single-variable problem of finding the

r_{n}

that maximizes

{\bar{Π}}_{n}

.

max_{r_{n}} J \equiv max_{r_{n}} {\bar{Π}}_{n}

(4)

Hence, given

r_{n - 1}

and

D_{n - 1}

, and assuming that the mapping

r_{n} \mapsto E [Π_{n} ∣ D_{n - 1}]

has a global maximum, we can construct a function

{\hat{r}}_{n} = {\hat{r}}_{n} (r_{n - 1}, D_{n - 1})

that maximizes this conditional expected value. Inserting this function in (3) and iterating the same procedure for

max_{r_{n - 1}} J, \dots, max_{r_{1}} J

, we obtain the vector

{\hat{r}}_{n}

with each

{\hat{r}}_{k}

being the global argmax to J at the kth period. Note that unless the profit at period k is independent from the previous demands and thereby from the previous prices,

{\hat{r}}_{k}

is not necessarily an argmax to the corresponding

{\bar{Π}}_{k} .

Later, in Section 6, we extend the procedure to solve the equilibrium problem when there are two suppliers (a leader and a follower) in the supply channel. The ensuing bilevel optimization problem will be of the following general form.

\begin{matrix} \begin{matrix} max_{{l_{i}}} (J^{Leader} ∣ {{\hat{f}}_{j}}) \\ s . t . & {{\hat{f}}_{j}} \subseteq \{A r g m a x (J^{Follower} ∣ {{\hat{l}}_{i}})\} \end{matrix} \end{matrix}

(5)

where the sets

{l_{i}}

and

{f_{j}}

denote the sets of decision variables for the leader and the follower, respectively.

4. Demand Structure

The expected profit as expressed in (1) is a function of demand. In the following three sections, we outline the general structure of time-dependent stochastic demand that the supply channel has to address. This demand expression is ultimately embedded in the optimization objective functions, i.e., the suppliers’ expected profits. In Section 4.1, we outline the demand structure with respect to its dependence on time and price. In Section 4.2, we delineate the stochastic nature of demand. Finally, in Section 4.3, combining the features, we introduce the memory-based demand structure.

4.1. Demand’s Dependence on Time and Price

It is classical to assume that the demand at present depends on the current price, i.e.,

D_{k} = ψ (r_{k})

. However, not all markets behave in such a simple manner. Many markets have some kind of memory, in the sense that pricing in the past may affect demand at present. In a dynamic market, the customers may become anchored to past prices, and this may affect their purchasing behavior. Consider a market in which a commodity with a limited lifespan is supplied to a base of potential customers. It is natural to assume that strategic customers are sensitive to previous prices when comparing them to the current price. Thus, one can conclude that, in general, in addition to the current price, previous prices may have a bearing on the current customer base by scaling the demand. For example, in a specific scenario, a price increase by 20% may reduce the customer base by, for example, 10%. We argue that a general time-dependent model of supply and price optimization should also consider the effect of anchoring to the past prices on current demand.

We build our time-dependent model of uncertain demand on the simple premise that the probability of an item being sold at time k for the price of

r_{k}

depends on the customers’ interest, which, in general, may depend on the past prices.

\begin{matrix} D_{k} & = Φ_{k} (r_{k - 1}, \dots, r_{1}) \cdot ψ_{k} (r_{k}) \end{matrix}

(6)

where the functional form

Φ

represents price history. We refer to the functional structure

Φ_{k} (r_{k - 1})

as the memory function.

4.2. Stochastic Demand

This section introduces uncertainty into the demand structure. We consider the uncertainty of demand at each period to be of a general i.e. additive-multiplicative form. Such a demand structure is comprised of two deterministic parts each a function of time and price history and a stochastic variable with a given distribution at each period. We consider demand at period k,

D_{k}

to be of the following general form.

D_{k} = {\tilde{μ}}_{k} (r_{k}) + {\tilde{σ}}_{k} (r_{k}) \cdot ϵ_{k}

(7)

where:

$ϵ_{k}$ ’s are independent and identically distributed random variables of arbitrary distribution,
normalized such that $E [ϵ_{k}] = 0$ and $Var [ϵ_{k}] = 1, \forall k$ and supported on intervals with continuous density function, $f_{ϵ_{k}} > 0,$ a.e. on its support.
$r_{k} =$ price per unit at period k.

It is obvious that with the normalization described above, we will get

{\tilde{μ}}_{k} = E [D_{k}]

and

{\tilde{σ}}_{k} = SD [D_{k}],

where:

{\tilde{μ}}_{k} (r_{k}) =

given function, representing mean of demand at period k, and

{\tilde{σ}}_{k} (r_{k}) =

given function, the standard deviation of demand at period at k.

4.3. Memory-Based Uncertain Demand

In this section, we formulate the demand scaling factors and the customers’ cognitive bias to the past prices (anchoring effects), as described in Section 4.1, within memory functions. These memory functions carry the effects of the past prices onto current demand. They are generally price- and time-dependent and can be adjusted to model markets with stronger or weaker memories. Thus, the demand expression equipped with memory functions at each period is a function of not only price at that period, but also can be affected by pricing policies in the previous periods.

In the emerging line of research referred to as behavioral newsvendor, the strategic customers are included in the demand analysis—those market-savvy customers who, aware of seasonal trends or marketing schemes, may postpone their purchase until observing a lower price, or may expedite it to exploit the initial low-price demand-boosting distributions. This implies the necessity of including the customers’ biases (anchoring) toward past prices in the supply channel’s optimization problem [19,20,21]. We argue that including the customers’ cognitive biases in the supply channel revenue optimization problem necessitates the inclusion of their memory of the past prices. Therefore, a dynamic (time-dependent) and memory-based analysis becomes necessary. For a comprehensive introduction of the anchoring effects, see [22,23]. The memory function structure implemented here is based on the more generalized models proposed in [24].

Combining (6) for memory-based demand and (7) for stochastic demand, we arrive at the general expression for demand as below.

\begin{matrix} \begin{matrix} D_{k} (r_{k}) = {\tilde{μ}}_{k} (r_{k}) + {\tilde{σ}}_{k} (r_{k}) ϵ_{k} = Φ_{k} (r_{k - 1}) \overset{ψ_{k} (r_{k})}{\overset{⏞}{(μ_{k} (r_{k}) + σ_{k} (r_{k}) ϵ_{k})}} \end{matrix} \end{matrix}

(8)

such that:

\begin{matrix} \begin{matrix} {\tilde{μ}}_{k} (r_{k}) & = Φ_{k} (r_{k - 1}) μ_{k} (r_{k}) \\ {\tilde{σ}}_{k} (r_{k}) & = Φ_{k} (r_{k - 1}) σ_{k} (r_{k}) . \end{matrix} \end{matrix}

(9)

Expressions for

μ_{k}^{'}

s and

σ_{k}^{'}

s are typically given by microeconomic theory, and

Φ

can be obtained through behavioral data anlayses. In Section 7, where numerical examples are analyzed, we present simple functional forms for both

μ_{k}

,

σ_{k}

and

Φ_{k}

. The memory function for the

k + 1

st period,

Φ_{k + 1} (r_{k})

retains the pricing information from the previous period, while being affected by the last piece of information that has become available, i.e.,

r_{k}

. The level of retainment of the price history information (determining the potential buyers’ anchoring to the past prices) may vary depending on the market and the behavior of strategic buyers. That is,

\frac{Φ_{k + 1}}{Φ_{k}} : = ϕ_{k} (r_{k})

(10)

we call these

ϕ_{k} (r_{k})

s the memory elements. Notice that the possibility of having different functional forms for

ϕ_{k}

s in different periods enables our demand structure to cover more non-autonomy. With the general memory structure in (10), we will have:

Φ_{k} (r_{k - 1}) = \prod_{i = 2}^{k} ϕ_{i} (r_{i - 1})

(11)

These memory functions, as we will see, are adjustable such that they can enable the model to represent different levels of influence from the past. For example,

Φ_{i} = 1, \forall i \in \{2, \dots, n\}

represent a memory-less market in which

D_{i}

s are decoupled from each other.

5. Embedding the Memory-Based Demand in the Expected Profit Expression

The general construction outlined in (3) and (4) is sufficiently explicit to enable solutions of the problem for most choices of functions

{\tilde{μ}}_{k}

and

{\tilde{σ}}_{k}

. However, the problem is so deeply nested that one cannot expect to find an analytical solution. The importance of our memory-based structure of demand, as described in Section 4.3, is that in many classical supply chain optimization problems, as it has been shown in the Appendix A, the running expected profit at each period, as outlined in (1), has the following form.

Proposition 1.

A decentralized supply channel composed of two channel members is to address a stochastic demand as given by (7) at each period

k \in {1, \dots, n}

. The channel members are bound by classical channel coordination contracts such as wholesale price, buyback, and revenue-sharing contracts (as defined in [1], and described in Appendix B), or their linear combinations. The running expected profit at each period, as outlined in (1), will then be of the following general structure.

{\bar{Π}}_{k} (r_{k}) = Ψ_{k} (r_{k}) {\tilde{μ}}_{k}^{p} (r_{k}) + Θ_{k} (r_{k}) {\tilde{σ}}_{k}^{p} (r_{k})

(12)

Proof.

See Appendix B.1, Appendix B.2 and Appendix B.3, where the functional structures for

Ψ_{k} (r_{k})

and

Θ_{k} (r_{k})

are obtained. □

The power p in these classical contracts is equal to 1. With the memory structure introduced in (8), we will have:

\begin{matrix} {\tilde{μ}}_{k}^{p} (r_{k}) = μ_{k}^{p} (r_{k}) Φ_{k}^{p} (r_{k - 1}) and \end{matrix}

(13)

\begin{matrix} {\tilde{σ}}_{k}^{p} (r_{k}) = σ_{k}^{p} (r_{k}) Φ_{k}^{p} (r_{k - 1}) \end{matrix}

(14)

and can recast (12) as below.

{\bar{Π}}_{k} = \overset{: = {\tilde{Π}}_{k} (r_{k})}{\overset{⏞}{(Ψ_{k} (r_{k}) μ_{k}^{p} (r_{k}) + Θ_{k} (r_{k}) σ_{k}^{p} (r_{k}))}} Φ_{k}^{p} (r_{k - 1}) = {\tilde{Π}}_{k} (r_{k}) Φ_{k}^{p} (r_{k - 1})

(15)

Now, the profit optimization problem in (3) can be simplified as follows.

J = \sum_{k = 1}^{n} α_{k} Φ_{k}^{p} (r_{k - 1}) {\tilde{Π}}_{k} (r_{k})

(16)

The multiplier effect in (16) is the crucial observation in this paper, as it reduces the nested n-variable optimization problem to n single-variable optimization problems.

This decoupling effect is shown (17). Again, starting from the final period, we observe that the only term in J containing

r_{n}

is

{\tilde{Π}}_{n}

. Thus, we have:

\begin{matrix} max_{r_{n}} J \equiv max_{r_{n}} {\tilde{Π}}_{n} \end{matrix}

(17)

This optimization problem is much more straightforward to solve compared to the general case in (4), as here, we can immediately obtain the numerical value for

{\hat{r}}_{n}

. Substituting this value in (16), we solve the next single-variable maximization problem with respect to

r_{n - 1}

. Continuing the same procedure backward in time, we can obtain all the optimal values of

{\hat{r}}_{k}

s.

6. Equilibria in Decentralized Channels with Two Suppliers

Having outlined our general model of memory-based stochastic demand, and embedding it in a single-vendor profit optimization problem, we now extend the scope of the analysis to problems where two vendors facing stochastic demand try to maximize their own respective profits. For the analysis of our proposed demand structure, we begin with the newsvendor model as it epitomizes the problem of inventory management when the demand for a commodity with short lifespan is stochastic.

We assume that goods are produced by a manufacturer and sold to a retailer. We also assume that the manufacturer and the retailer are risk neutral in the sense that they try to maximize expected discounted total profit. We consider a multi-period Stackelberg game between the manufacturer and the retailer where the actions of the two parties affect the actions of a third party, the customers. In this Stackelberg structure, the upstream vendor (the manufacturer), as the leader, has to find a sequence of optimal wholesale prices at different periods (

w_{k}

s) to ensure her maximum profit. The downstream vendor (the retailer), who is the follower, then faces the wholesale price and accordingly decides on the number of products to be ordered to the manufacturer (and supplied to the market) and the sequence of optimal retail prices (

r_{k}

s).

The dynamics of prices in game theoretical settings have been discussed in several publications by K. Bagwell, we mention [25,26]. The analysis presnted in [27] considers multi-period cases with price-dependent demand, and show how to adapt such models to include backorders. However, they do not discuss Stackelberg competition. Pricing strategies for retailers have been discussed intensively in the marketing literature—we mention [28,29].

The discussion in [30] partly explains why general multi-period problems are difficult to solve. Some types may admit numerical solutions, but the general problem is difficult to compute or analyze even in the two-period case. By comparison, the discrete version we consider in this paper is transparent. Our memory scheme decouples a multi-period problem into a sequence of one-period problems, each of which is fairly easy to solve. Our model retains the main essence of the problem itself, while simultaneously providing a solution that can be analyzed without the need for advanced optimizing techniques.

6.1. The Basic Model: The Game in Single-Period

The solution to the single-period newsvendor problem epitomizes a supply chain coordination scenario while facing stochastic demand. Therefore, in this section, we review some properties of the single-period model and in the next sections, we propose our multi-period model based on it.

Model Variables and Parameters

D = demand (random)
w = wholesale price per unit (chosen by the manufacturer)
r = retail price per unit (chosen by the retailer)
q = order quantity (chosen by the retailer)
$c_{m}$ = manufacturing cost per unit (fixed)
s = salvage price per unit (fixed)
$Π^{r}$ = profit for the retailer
$Π^{m}$ = profit for the manufacturer

In the classical newsvendor model, the manufacturer sets the wholesale price w for one unit of a certain commodity that needs to be sold within a short timespan. The retailer orders a quantity q units of the commodity to the manufacturer and plans to sell them for the price r (per unit) in a market with stochastic demand D. Any unsold item can be salvaged at the price

s < r

. The retailer’s profit

Π^{r}

is calculated as below.

\begin{matrix} Π^{r} & = r \min (D, q) + s {(q - D)}^{+} - w q \\ = r \min (D, q) + s (q - \min (D, q)) - w q \\ = (r - s) \min (D, q) - (w - s) q . \end{matrix}

(18)

From this expression, we obtain the expected profit for the retailer:

E [Π^{r}] = (r - s) E [min (D, q)] - (w - s) q .

(19)

In our model, we consider the additive–multiplicative model for the demand as given in (7). For a given r, it is well known that the maximum expected profit is obtained when:

P (D \leq q) = \frac{r - w}{r - s}

(20)

Inserting the general expression for the demand in (7) into (19) and using (20), we can prove the following proposition where

F_{ϵ}

denotes the cumulative distribution of

ϵ

.

Proposition 2.

Assume ϵ has a continuous distribution, supported on an interval, with density

f_{ϵ} > 0

a.e. on its support. Given r and w,

r \geq w > s

, the retailer will make an order:

q = μ (r) + σ (r) F_{ϵ}^{- 1} (\frac{r - w}{r - s}) .

(21)

in which case, they obtain the expected profit:

{\bar{Π}}^{r} = E [Π^{r}] = (r - w) μ (r, k) + L_{ϵ} (r, w) σ (r, k)

(22)

where

L_{ϵ}

is defined by:

\begin{matrix} L_{ϵ} (r, w) = (r - s) \int_{- \infty}^{z} ζ f_{ϵ} (ζ) d ζ z = F_{ϵ}^{- 1} (\frac{r - w}{r - s}) \end{matrix}

(23)

Proof.

See the Appendix C. □

The result in Proposition 2 is more or less well known within the literature. In our normalization, we assume that

E [ϵ] = \int_{- \infty}^{\infty} ζ f_{ϵ} (ζ) d ζ = 0

, and hence,

L_{ϵ} (r, w) \leq 0 .

The term

L_{ϵ} . σ

is typically referred to as the expected loss due to randomness.

It should be noted that the channel under study is considered to be a segment of a more complete market, such that a segmentation of the pool of customers are addressed by it. The market demand structure, in general, is an aggregation of the individual demands from possibly heterogenous consumers who may be affected by the supply of competing products from other vendors. This feature is embedded in D through the choice of

μ [r, k]

and

σ [r, k]

. Therefore, although the manufacturer and the retailer in our model are basically monopolistic suppliers, the model considers competition via demand structure.

In the one-period newsvendor model, to formulate a Stackelberg game, we assume that both parties are risk neutral. The manufacturer (leader) offers a wholesale price w. We ignore the possibility that the retailer can negotiate this wholesale price. Given w, the retailer (follower) then solves (22) to find the

\hat{r}

, which maximizes

{\bar{Π}}^{r}

, and then, substituting this

\hat{r}

into (21) to find out the optimum order quantity

\hat{q}

. The manufacturer also knows that the retailer is going to choose

\hat{q}

to maximize the expected profit. Therefore, given each possible value of w, the manufacturer can anticipate the resulting order quantity

\hat{q} = \hat{q} (w)

, and so chooses

\hat{w}

to maximize the expected profit (which happens be to be deterministic in this case). The manufacturer’s profit is given by:

Π^{m} = (\hat{w} - c_{m}) \hat{q} .

(24)

6.2. Multi-Period Vertical Contracting

Having discussed the solution to the single-period problem, we are now ready to provide a theoretical analysis of the multi-period Stackelberg game. Specifically, we consider the situation where demand in a future period is scaled by a factor based on price and demand now. Essentially, this is a Markovian assumption since it requires only knowledge of the current situation.

In the multi-period game, we assume that the parties are risk neutral and try to maximize their discounted expected profits:

J^{r} = {\bar{Π}}_{1}^{r} + α_{2} {\bar{Π}}_{2}^{r} + \dots + α_{n} {\bar{Π}}_{n}^{r}

(25)

J^{m} = {\bar{Π}}_{1}^{m} + α_{2} {\bar{Π}}_{2}^{m} + \dots + α_{n} {\bar{Π}}_{n}^{m}

(26)

where n is the number of periods.

6.3. Multi-Period Games with Memory Functions

Whereas it is straightforward to formulate an n-period game in the general case, numerical solutions are difficult to obtain even if n is moderately large. The nonlinear structure of the problem branching into separate cases for each particular choice made on every level quickly renders the problem computationally intractable.

In this section, we show how to generalize the memory-based approach described in the previous section to multi-period problems. First, we discuss an important technical issue. Consider a general three-period problem. Substituting the memories from (11), we will have the following structure.

\begin{matrix} D_{1} = μ_{1} (r_{1}) + σ_{1} (r_{1}) ϵ_{1} \end{matrix}

(27)

\begin{matrix} D_{2} = ϕ_{2} (r_{1}) (μ_{2} (r_{2}) + σ_{2} (r_{2}) ϵ_{2}) \end{matrix}

(28)

\begin{matrix} D_{3} = ϕ_{2} (r_{1}) ϕ_{3} (r_{2}) (μ_{3} (r_{3}) + σ_{3} (r_{3}) ϵ_{3}) \end{matrix}

(29)

In the following analysis, we consider only the retailer’s profit optimization procedure. The same arguments also hold true for the manufacturer’s. In the presence of the memory functions, the maximization problem expressed in (25), turns into the following.

\begin{matrix} max_{r_{1}, r_{2}, r_{3}} J^{r} & = {\bar{Π}}_{1} (r_{1}) + α_{2} ϕ_{2} (r_{1}) {\bar{Π}}_{2} (r_{2}) + α_{3} ϕ_{2} (r_{1}) ϕ_{3} (r_{2}) {\bar{Π}}_{3} (r_{3}) \end{matrix}

(30)

Starting the backward induction process from the final period, we define

J_{k}^{r}

as the expected discounted profit earned within the interval between period k and

n = 3

, inclusive.

\begin{matrix} J_{3}^{r} & = α_{3} ϕ_{2} (r_{1}) ϕ_{3} (r_{2}) {\bar{Π}}_{3} (r_{3}) \end{matrix}

(31)

\begin{matrix} J_{2}^{r} & = α_{2} ϕ_{2} (r_{1}) {\bar{Π}}_{2} (r_{2}) + J_{3}^{r} \end{matrix}

(32)

\begin{matrix} J_{1}^{r} & = J^{r} = {\bar{Π}}_{1} (r_{1}) + J_{2}^{r} \end{matrix}

(33)

where according to (22), the running expected profit obtained at each period is:

{\bar{Π}}_{k} (r_{k}) = (r_{k} - w_{k}) μ_{k} (r_{k}) + L_{ϵ_{k}} (r_{k}, w_{k}) σ_{k} (r_{k}) .

Thus,

J_{3}^{r} = α_{3} ϕ_{2} (r_{1}) ϕ_{3} (r_{2}) ((r_{3} - w_{3}) μ_{3} (r_{3}) + L_{ϵ_{3}} (r_{3}, w_{3}) σ_{3} (r_{3}))

(34)

Because period 3 is the final period, there is no need to worry about future demand, and therefore, given

w_{3}

, the retailer chooses the optimal

r_{3}

to maximize

J_{3}^{r}

. Note that because

r_{1}

and

r_{2}

have happened in the past, they are not considered as decision variables at period 3 and the optimal values of

r_{3}

and

w_{3}

are independent of them. Thus, the optimization problem reduces to the single-variable problem of maximizing

{\bar{Π}}_{3} (r_{3})

.

Assuming that

{\hat{r}}_{3}

is the (global) argmax value of

{\bar{Π}}_{3} (r_{3})

, we set

{\hat{Π}}_{3} = {\bar{Π}}_{3} ({\hat{r}}_{3})

. Then, the backward induction proceeds to the next subproblem, i.e., the problem of maximizing the expected profit in the second period. From (32):

\begin{matrix} max_{r_{2}} J_{2}^{r} & = ϕ_{2} (r_{1}) (α_{2} {\bar{Π}}_{2} (r_{2}) + α_{3} ϕ_{3} (r_{2}) {\hat{Π}}_{3}) \\ = α_{2} ϕ_{2} (r_{1}) ([(r_{2} - w_{2}) μ_{2} (r_{2}) + L_{ϵ_{2}} (r_{2}, w_{2}) σ_{2} (r_{2})] + \frac{α_{3}}{α_{2}} ϕ_{3} (r_{2}) {\hat{Π}}_{3}) \end{matrix}

(35)

Notice that in (35), similar to the case in (34), the only decision variable for the retailer is

r_{2}

, as

r_{1}

has happened in the past. Therefore, the retailer faces another single-variable optimization problem.

The same procedure is applied backward until all three optimal decision variables are found. Assuming that

{\hat{r}}_{2}

is the global argmax of

J_{2}^{r}

, as optimized with respect to

r_{2}

, from (35), we set:

{\hat{Π}}_{2} = {\bar{Π}}_{2} ({\hat{r}}_{2}) + \frac{α_{3}}{α_{2}} ϕ_{3} ({\hat{r}}_{2}) {\hat{Π}}_{3} = \frac{J_{2}^{r} ({\hat{r}}_{2})}{α_{2} ϕ_{2} (r_{1})}

(36)

Now, the remaining single-variable optimization problem is derived from (33) as below.

max_{r_{1}} J_{1}^{r} = {\bar{Π}}_{1} (r_{1}) + α_{2} ϕ_{2} (r_{1}) {\hat{Π}}_{2}

(37)

Generalizing the same procedure for an n-period game

(n > 3)

, we start by solving for the final period to obtain expected profits

{\hat{Π}}_{n}^{r}

and

{\hat{Π}}_{n}^{m}

. Once these values are known, the profit values in the previous period can be computed through backward induction processes. That produces numerical values of

{\hat{Π}}_{n - 1}^{r}

and

{\hat{Π}}_{n - 1}^{m}

. To determine the strategy for the

(n - 2)

nd period, we consider the problem:

\begin{matrix} max_{r_{n - 2}} J_{n - 2}^{r} = & ((r_{n - 2} - w_{n - 2}) μ_{n - 2} (r_{n - 2}) + L_{ϵ_{n - 2}} (r_{n - 2}, w_{n - 2}) σ_{n - 2} (w_{n - 2}) \\ + \frac{α_{n - 1}}{α_{n - 2}} ϕ_{n - 1} (r_{n - 2}) {\hat{Π}}_{n - 1}^{r}) α_{n - 2} \prod_{i = 2}^{n - 2} ϕ_{i} (r_{i - 1}) \end{matrix}

(38)

\begin{matrix} max_{w_{n - 2}} J_{n - 2}^{m} = & ((w_{n - 2} - c_{m_{n - 2}}) [μ_{n - 2} (r_{n - 2}) + σ_{n - 2} (r_{n - 2}) F_{ϵ_{n - 2}}^{- 1} (\frac{r_{n - 2} - w_{n - 2}}{r_{n - 2} - s})] \\ + \frac{α_{n - 1}}{α_{n - 2}} ϕ_{n - 1} (r_{n - 2}) {\hat{Π}}_{n - 1}^{m})) α_{n - 2} \prod_{i = 2}^{n - 2} ϕ_{i} (r_{i - 1}) \end{matrix}

(39)

We also set

α_{1}

and

ϕ_{1}

equal to 1. Note that in (38) and (39), the term

\prod_{i = 2}^{n - 2} ϕ_{i} (r_{i - 1}) = Φ_{n} (r_{n - 1})

represents the previous prices and has no bearing on the optimization problem. Thus, the equilibrium problem for period

n - 2

is reduced to a single-period problem that only involves

r_{n - 2}

and

w_{n - 2}

. The only difference from the problem for period

n - 1

, is that the values of (

{\hat{Π}}_{n - 1}^{r}, {\hat{Π}}_{n - 1}^{m}

) are different from the values

({\hat{Π}}_{n}^{r}, {\hat{Π}}_{n}^{m})

. Hence, all we have to do to solve this problem is repeat the previous step with updated values for

({\hat{Π}}^{r}, {\hat{Π}}^{m})

.

To simplify notation, we have suppressed dependence on arguments that are not yet active;

μ_{n - 2}

and

σ_{n - 2}

are in general functions of (

r_{n - 3}

) but according to our assumptions, this dependence enters as an independent multiplicative factor and can hence be factored out of the optimization problem. (See Equations (34) and (35) for example.)

By using the argument above repeatedly, it is clear that we can solve this problem for any value of n. Starting with the values

({\hat{Π}}_{n + 1}^{r}, {\hat{Π}}_{n + 1}^{m}) = 0

in the final period, we essentially solve the same problem in all periods. The values of

({\hat{Π}}^{r}, {\hat{Π}}^{m})

are updated as the construction progresses backwards, but those updated values come for free from the solution of the previous step. We state the generalized result as follows.

Theorem 1.

Let n be the number of periods and assume that demand in period k is given by:

D_{k} = (μ_{k} (r_{k}) + σ_{k} (r_{k}) ϵ_{k}) Φ_{k} (r_{k - 1})

(40)

where,

Φ_{1} = 1, Φ_{k} (r_{k - 1}) = \prod_{i = 2}^{k} ϕ_{i} (r_{i - 1}) k > 1

and

ϵ_{1}, \dots, ϵ_{n}

are continuously distributed with E

[ϵ_{k}] = 0

and Var

[ϵ_{k}] = 1

for all k, with

f_{ϵ_{k}} > 0

a.e. on their supports. If for each k, the one-period Stackelberg problem below has a unique equilibrium at

r_{k} = {\hat{r}}_{k}, w_{k} = {\hat{w}}_{k}

.

\begin{matrix} J_{k}^{r} (r_{k}) = ((r_{k} - w_{k}) μ_{k} (r_{k}) + L_{ϵ_{k}} (r_{k}, w_{k}) σ_{k} (r_{k}) + \frac{α_{k + 1}}{α_{k}} ϕ_{k + 1} (r_{k}) {\hat{Π}}_{k + 1}^{r}) α_{k} Φ_{k} (r_{k - 1}) \\ J_{k}^{m} (w_{k}) = ((w_{k} - c_{m_{k}}) (μ_{k} (r_{k}) + σ_{k} (r_{k}) F_{ϵ_{k}}^{- 1} [\frac{r_{k} - w_{k}}{r_{k} - s_{k}}]) \\ + \frac{α_{k + 1}}{α_{k}} ϕ_{k + 1} (r_{k}) {\hat{Π}}_{k + 1}^{m}) α_{k} Φ_{k} (r_{k - 1}) \end{matrix}

(41)

where

{\bar{Π}}_{k}^{r}

and

{\bar{Π}}_{k}^{m}

are found recursively from:

\begin{matrix} {\hat{Π}}_{n + 1}^{r} = 0 {\hat{Π}}_{n + 1}^{m} = 0 \end{matrix}

(42)

\begin{matrix} {\hat{Π}}_{k}^{r} = \frac{J_{k}^{r} ({\hat{r}}_{k})}{α_{k} Φ_{k} (r_{k - 1})} {\hat{Π}}_{k}^{m} = \frac{J_{k}^{m} ({\hat{w}}_{k})}{α_{k} Φ_{k} (r_{k - 1})}, k = 1, 2, \dots, n, α_{1} = Φ_{1} = 1 \end{matrix}

(43)

then the problem of maximizing,

J^{r} = {\bar{Π}}_{1}^{r} + α_{2} Φ_{2} (r_{1}) {\bar{Π}}_{2}^{r} + \dots + α_{n} Φ_{n} (r_{n - 1}) {\bar{Π}}_{n}^{r}

(44)

J^{m} = {\bar{Π}}_{1}^{m} + α_{2} Φ_{2} (r_{1}) {\bar{Π}}_{2}^{m} + \dots + α_{n} Φ_{n} (r_{n - 1}) {\bar{Π}}_{n}^{m}

(45)

has a unique equilibrium at

\hat{r} = ({\hat{r}}_{1}, {\hat{r}}_{2}, \dots, {\hat{r}}_{n}), \hat{w} = ({\hat{w}}_{1}, {\hat{w}}_{2}, \dots, {\hat{w}}_{n})

. The optimal order quantity at k is then calculated as below.

{\hat{q}}_{k} = Φ_{k} ({\hat{r}}_{k - 1}) [μ_{k} ({\hat{r}}_{k}) + σ_{k} ({\hat{r}}_{k}) F_{ϵ_{k}}^{- 1} (\frac{{\hat{r}}_{k} - {\hat{w}}_{k} - c_{r_{k}}}{{\hat{r}}_{k}})]

(46)

Theorem 1 delineates how the memory structure can decouple nested equilibria problems. The result of the theorem can be generalized to all optimization or equilibrium problems in which the running (single period) profit expression is of the structure stated in (12). In the Appendix B, we show that the expected profit expressions in all the classical coordination contracts (and their combinations), including the wholesale price contracts, the buyback contracts, and the revenue-sharing contracts are indeed of this structure.

Corollary 1.

Let n be the number of periods and assume that demand in period k is given by:

D_{k} = (μ_{k} (r_{k}) + σ_{k} (r_{k}) ϵ_{k}) Φ_{k} (r_{k - 1})

(47)

where:

Φ_{1} = 1, Φ_{k} (r_{k - 1}) = \prod_{i = 2}^{k} ϕ_{i} (r_{i - 1}) k > 1

and

ϵ_{1}, \dots, ϵ_{n}

are continuously distributed with E

[ϵ_{k}] = 0

and Var

[ϵ_{k}] = 1

for all k with

f_{ϵ_{k}} > 0

a.e. on their supports. Assuming that the running expected profit for the retailer and the manufacturer at period k can be written in the following formats. (In classical supply-chain optimization contracts,

p = 1 .

)

{\bar{Π}}_{k}^{r} (r_{k}) = Ψ_{k}^{r} (r_{k}) μ_{k}^{p} (r_{k}) + Θ_{k}^{r} (r_{k}) σ_{k}^{p} (r_{k})

{\bar{Π}}_{k}^{m} (r_{k}) = Ψ_{k}^{m} (r_{k}) μ_{k}^{p} (r_{k}) + Θ_{k}^{m} (r_{k}) σ_{k}^{p} (r_{k})

if for each k, the one-period Stackelberg problem below has a unique equilibrium at

r_{k} = {\hat{r}}_{k}, w_{k} = {\hat{w}}_{k} .

\begin{matrix} J_{k}^{r} (r_{k}) = (Ψ_{k}^{r} (r_{k}) μ_{k}^{p} (r_{k}) + Θ_{k}^{r} (r_{k}) σ_{k}^{p} (r_{k}) + \frac{α_{k + 1}}{α_{k}} ϕ_{k + 1}^{p} (r_{k}) {\hat{Π}}_{k + 1}^{r}) α_{k} Φ_{k} (r_{k - 1}) \\ J_{k}^{m} (w_{k}) = (Ψ_{k}^{m} (r_{k}) μ_{k}^{p} (r_{k}) + Θ_{k}^{m} (r_{k}) σ_{k}^{p} (r_{k}) + \frac{α_{k + 1}}{α_{k}} ϕ_{k + 1}^{p} (r_{k}) {\hat{Π}}_{k + 1}^{m}) α_{k} Φ_{k} (r_{k - 1}) \end{matrix}

(48)

where

{\hat{Π}}_{k}^{r}

and

{\hat{Π}}_{k}^{m}

are found recursively from:

\begin{matrix} {\hat{Π}}_{n + 1}^{r} = 0 {\hat{Π}}_{n + 1}^{m} = 0 \end{matrix}

(49)

\begin{matrix} {\hat{Π}}_{k}^{r} = \frac{J_{k}^{r} ({\hat{r}}_{k})}{α_{k} Φ_{k} (r_{k - 1})} {\hat{Π}}_{k}^{m} = \frac{J_{k}^{m} ({\hat{w}}_{k})}{α_{k} Φ_{k} (r_{k - 1})}, k = 1, 2, \dots, n, α_{1} = Φ_{1} = 1 \end{matrix}

(50)

then the problem of maximizing:

J^{r} = {\bar{Π}}_{1}^{r} + α_{2} Φ_{2} (r_{1}) {\bar{Π}}_{2}^{r} + \dots + α_{n} Φ_{n} (r_{n - 1}) {\bar{Π}}_{n}^{r}

(51)

J^{m} = {\bar{Π}}_{1}^{m} + α_{2} Φ_{2} (r_{1}) {\bar{Π}}_{2}^{m} + \dots + α_{n} Φ_{n} (r_{n - 1}) {\bar{Π}}_{n}^{m}

(52)

has a unique equilibrium at

\hat{r} = ({\hat{r}}_{1}, {\hat{r}}_{2}, \dots, {\hat{r}}_{n}), \hat{w} = ({\hat{w}}_{1}, {\hat{w}}_{2}, \dots, {\hat{w}}_{n})

.

Remark 1.

The Corollary 1 is a generalization of Theorem 1 based on general functional structures introduced in Section 5. In Appendix B.1, Appendix B.2 and Appendix B.3 we calculate the structures

Ψ_{k}^{r, m}

s and

Θ_{k}^{r, m}

s for some of the conventional supply chain contracts. These functional structures substituted in the procedure outlined in Corollary 1 yield the optimal results for a supply channel bound to the associated contract.

Remark 2.

In multi-variable problems such as the ones discussed here, multiple local maxima are detrimental to computational performance. The strength of Theorem 1, however, is that it reduces the dimension of the search space to one, and maxima for functions of one variable can always be handled by an exhaustive search.

Remark 3.

Theorem 1 and its corollary state that the uniqueness of the multi-period equilibria is contingent upon the uniqueness of the associated single-period equilibrium results. The unimodality of the multi-period equilibria solutions is determined by the unimodality of each of the decoupled single-period (i.e., single-variable) optimization problems. Finding necessary conditions to guarantee the unimodality of the single-period price-setting newsvendor problem has been exhaustively studied in the literature; see, for example, [31,32].

6.4. The Infinite Horizon Case

According to Theorem 1, in the infinite horizon problem, for given values of

{\hat{Π}}_{k}^{r}

and

{\hat{Π}}_{k}^{m}

, the parties try to optimize the following.

\begin{matrix} J_{k}^{r} (r_{k}) & = ((r_{k} - w_{k}) μ_{k} (r_{k}) + L_{ϵ_{k}} (r_{k}, w_{k}) σ_{k} (r_{k}) + α {\hat{Π}}_{k + 1}^{r} ϕ_{k + 1} (r_{k})) α Φ_{k} (r_{k - 1}) \end{matrix}

(53)

\begin{matrix} \begin{matrix} J_{k}^{m} (w_{k}) & = ((w_{k} - c_{m_{k}}) (μ_{k} (r_{k}) + σ_{k} (r_{k}) F_{ϵ_{k}}^{- 1} [\frac{r_{k} - w_{k}}{r_{k} - s_{k}}]) \\ + α {\hat{Π}}_{k + 1}^{m} ϕ_{k + 1} (r_{k})) α Φ_{k} (r_{k - 1}) \end{matrix} \end{matrix}

(54)

Here,

α

is the fixed discount factor and remains constant for the whole duration of the problem from period 1 to n. (The reason for this restrictions is that in the infinite-horizon case, a certain degree of autonomy is necessary for convergence to happen).

The first-order conditions for this problem yield two equations for the two unknowns

r_{k}

and

w_{k}

. In the multi-period case, we start by using

{\hat{Π}}_{n}^{r} = 0

and

{\hat{Π}}_{n}^{m} = 0

and iterate backward until we reach the starting period. However, if the horizon is infinite, this approach fails because an infinite number of iterations is needed to reach the start.

If

μ (r)

,

σ (r)

,

ϕ (r)

, and

ϵ

do not depend on k, or

lim_{k \to \infty} (μ (r, k), σ (r, k), ϕ (r, k), ϵ_{k}) = (μ (r), σ (r), ϕ (r), ϵ),

i.e., the same functions are used for any k, then cases with an infinite horizon can be solved. To do so, one needs a steady state for the system, i.e., one must find

{\hat{Π}}^{r}

and

{\hat{Π}}_{m}^{r}

such that:

{\hat{Π}}^{r} = (r - w) μ (r) + L_{ϵ} (r, w) + α {\hat{Π}}^{r} ϕ (r)

(55)

{\hat{Π}}^{m} = (w - M) (μ (r) + σ (r) F_{ϵ}^{- 1} [\frac{r - w}{r - s}]) + α {\hat{Π}}^{m} ϕ (r) .

(56)

The first-order conditions applied to (53) and (54), together with (55) and (56), yield four equations with the four unknowns,

r, w, {\hat{Π}}^{r}

, and

{\hat{Π}}^{m}

.

7. Numerical Implementation of the Model

In this section, we illustrate the theory in Section 6.2 by explicit examples. In these examples, we use a Cobb–Douglas demand function structure with a normally distributed random term. The problem is as easily solved when using other functional forms. The problem (given w) is reduced to finding maxima for a function of one variable, which is straightforward for almost any choice of

μ_{k}

,

σ_{k}

, and

ϕ_{k} .

In addition to many examples covering decentralized supply channels where the channel members compete within a Stackelberg framework, an example of cooperative behavior of the suppliers is offered in Section 7.2, where the two vendors integrate to form a centralized supply chain.

We remark that the purpose of this entire section is to offer practical advice on how our theory can be implemented in some special cases to prescribe optimal decision variables. To take full advantage of the model, one should try to vary scaling factors and functional forms in a systematic way. This makes it possible to model a wide range of economic contexts. A full discussion of the model and all the variations it has to offer, is, however, beyond the scope of this paper.

7.1. Multi-Period Buyback Contracts

As evidenced in Theorem 1, once we have an algorithm that solves the two-period case, the same algorithm can be used repeatedly to solve n-period problems. We merely have to update the remaining profits as the construction progresses backward. We consider the case in which demand in period k is given by:

D_{k} = ψ_{k} (r_{k}) \prod_{i = 2}^{k} ϕ_{i} (r_{i - 1})

(57)

ψ_{k} (r_{k}) = μ_{k} (r_{k}) + σ_{k} (r_{k}) ϵ_{k}

(58)

where

ϵ_{k}

is

N (0, 1)

. Because a normally distributed variable can take negative values, we must impose restrictions to exclude artificial cases. If q, as given by (21), is negative, we set

q = 0

. Likewise, if the expected profit in (22) is negative, we assume

q = 0

.

Setting the model parameters

As discussed in Section 3, in general, all the parameters and variables of the model are time-dependent. In examples 1 to 3, we set the manufacturing cost at period k to be decreasing as time goes on;

c_{m_{k}} = 2 - 0.01 k,

the buyback price

b_{k} = 0.3 \times c_{m_{k}}

, and the salvage price,

s_{k} = 0.2, \forall k .

Note that in a buyback contract, as described in [1], the remaining units at the end of each period need not be physically returned to the manufacturer. Instead, it may be such that the manufacturer credits the retailer with a price

b_{k}

(here equal to

30 %

of the manufacturing cost) for each unsold unit at the end of period

k .

In principle, the scaling factors

ϕ_{k}

s can change with k. For illustration purposes, we consider only cases in which the expected scaling factors satisfy the following:

ϕ_{k} (r_{k}) = {[1 + γ_{k} (κ_{k} - r_{k})]}^{+} \approx e^{γ_{k} (κ_{k} - r_{k})} for small values of γ_{k}

(59)

where

γ_{k} > 0

, the memory strength factor, and

κ_{k} > 0

are given parameters. The parameter

κ_{k}

can be interpreted as a price cap, i.e., any price above

κ_{k}

reduces demand, whereas demand is more likely to increase if

r_{k} < κ_{k}

. More complicated expressions can be computed without problems. For simplicity, in example 1 to 3, we set

γ_{k}

,

κ_{k}

, and

α_{k}

to remain constant through periods (hence, the subscript k has been dropped.)

Example 1.

In addition to the parameters determined earlier, we set

α = 1

(no discounting),

γ = 0.01

,

κ = 3

,

n = 40

,

μ_{k} (r_{k}) = \frac{1000}{r_{k}^{(2 - β \frac{n - k}{n})}}

,

β = 0.8

, and

σ_{k} (r_{k}) = \frac{1}{2} μ_{k} + \frac{5000}{r_{k}^{3}}

.The optimal pricing variables in each period

({\hat{r}}_{k}, {\hat{w}}_{k})

, as well as the values for

{\bar{D}}_{k}

(expected demand at k) and

{\hat{q}}_{k}

(optimal order quantity at k), are shown in Figure 1.

Figure 1. Pricing and supply quantities at equilibrium states in Example 1.

In this scenario, for the retailer, the optimal strategy is to increase demand by letting

r_{1} = \dots = r_{7} = 0

, then start selling in period 8. Defining the blow-up factor as

η : = α \cdot ϕ

, we have

max (η) = α max (ϕ) = α (1 + γ κ) = 1.03

. To obtain increased profits from an initial strategy in which

r_{1} = 0

, it is clearly necessary that

max (η) = α max (ϕ) > 1

. The total expected profits for the manufacturer and the retailer,

J^{m}

and

J^{r}

, respectively, are found as below.

J^{m} = 596.39, J^{r} = 659.61

The relationship between the optimal supplied quantity (

{\hat{q}}_{k}

) and expected demand (

{\bar{D}}_{k}

) can be interpreted as representative of the level of risk taken by the retailer (i.e., the newsvendor member of the channel). As seen in Figure 1, in this example, the optimal strategy for the retailer is to over-supply the market with respect to the expected demand

{\hat{q}}_{k} > {\bar{D}}_{k}, \forall k .

Remark 4.

The optimality of an over-supply strategy is not a general phenomenon; as seen in Example 4 (Section 7.3), where the retailer is prescribed to under-supply the market in all periods. In more complicated scenarios, the newsvendor (the downstream channel member) may be prescribed to apply a regime change in its supply strategy with respect to the expected demand. (See a secanrio with both over and under supplying regimes in Example 5, Section 7.4.)

The average over-supplying ratio can be measured as

\frac{1}{n} \sum_{k = 1}^{n} \frac{{\hat{q}}_{k} - {\bar{D}}_{k}}{{\hat{q}}_{k}} .

For Example 1, this value is calculated as

22.11 %

.

Example 2.

In this case, we analyze the previous example subject to discounting:

α = α_{k} = 0.95, \forall k

. The rest of the parameters and functional structures are the same as those of Example 1 (Section 1).

In this case,

max (η) = α max (ϕ) = α (1 + γ K) = 0.98 < 1

. This blow-up factor is not big enough to justify an initial retail price

r_{1} = 0

. Therefore, sales take place in all periods. Figure 2 shows the equilibrium prices and corresponding total expected profits for this case. For this case, we find

J^{m} = 368.49

and

J^{r} = 416.81 .

Again, a strategy of over-supplying is prescribed as optimal, with an average over-supply ratio of

21.9 %

.

Figure 2. Pricing and supply quantities at equilibrium states in Example 2.

7.2. Cooperative Agents: Centralized Channel, No Double Marginalization

So far, we have analyzed the equilibria in a Stackelberg framework. However, it is possible for the two parties to merge into a centralized channel. Note that, as outlined in (5), a Stackelberg equilibrium problem is essentially a bilevel optimization problem wherein the leader optimizes her objective function while being constrained by the optimality of the follower’s solution. Thus, the multi-period single-vendor price-setting newsvendor problem turns into an unconstrained special case of our general model. Such a deviation from the Stackelberg game is implemented by considering the two agents as a single decision-maker, substitution of

w_{k} = c_{m_{k}}

in Theorem 1, and optimizing only

J^{r}

with respect to

r_{k}

s.

Example 3.

Here, we consider

α = 1

,

γ = 0.01

,

K = 3

,

n = 40

,

c_{m_{k}} = 2 - 0.01 k, s_{k} = 0.2 \forall k

,

μ_{k} (r) = \frac{1000}{r^{(2 - β \frac{n - k}{n})}}

,

β = 0.8

, and

σ_{k} (r) = 0.5 μ_{k} (r) + \frac{5000}{r^{3}}

. Thus, the integrated channel in Example 3 faces the same market as the one in Example 1.

The results are illustrated in Figure 3. Due to vertical competition between its members, a decentralized channel suffers from double marginalization leading to its lower performance compared to a centralized supply channel facing the same market. Comparing the results of the centralized channel with its decentralized counterpart analyzed in Example 1, we observe that while the centralized channel charges comparatively lower prices in all periods, its overall expected profit is higher than the sum of expected profits for the two agents in a decentralized channel in the same market. Denoting the total expected profit for the centralized channel by

J^{c}

and the expected profits for the members of the corresponding decentralized channel by

J^{r}

and

J^{m}

(obtained in Example 1), we have:

J^{c} = 5836.41 > J^{m} + J^{r} = 596.39 + 659.61 = 1256.22

Figure 3. Pricing and supply quantities at equilibrium states in a centralized channel.

The centralized channel outperforms its decentralized counterpart while taking a higher risk in its supply quantity; the average over-supply ratio for the integrated channel is 31.74%, compared to 21.9% in Example 1. In addition, relatively lower prices offered by the centralized channel leads to a lower level of demand suppression (compare the expected demands in Figure 1 and Figure 3). Thus, unlike the decentralized channel in Example 1, the integrated channel does not need to resort to an early campaign of free distribution of products in order to boost the demand—as observed in Figure 3, in the case of this integrated channel, sale takes place in all periods.

7.3. Markets with Strategic Customers

The generality of the problem described in Section 6 and the flexibility of the solution algorithms presented in Theorem 1 and Corollary 1 allows our model to cover diverse sets of market scenarios. The model as a result can be fed with market representation functional much more complex than the ones presented in Examples 1 to 3.

Example 4.

Consider a market with two different sets of customer bases. One group, referred to as the strategic customers, postpones their purchase until they see a price lower than their “reservation price". The other group is less sensitive to the offered prices. We assume that the reservation price is known to the channel members. However, the assumption that there exists only one such reservation price among the customers is not necessary and has been adopted for simplification purposes. In general, the customer base can be divided into many factions in the model.

μ_{k} (R_{k}) = \{\begin{matrix} \frac{2000}{r_{k}^{2 + 0.1 (k - 1)}} & if r_{k} > r_{reservation} \\ \frac{2000}{r_{k}^{2 + 0.1 (k - 1)}} + \frac{30000}{r_{k}^{5 + 0.1 (k - 1)}} & if r_{k} \leq r_{reservation} \end{matrix}

σ_{k} (r_{k}) = 0.1 μ_{k} (R_{k}) + \frac{100}{r_{k}^{3}}

,

n = 40

,

α = 1

,

γ_{k} = 0.01

,

κ_{k} = 5

,

c_{m_{k}} = 2

,

s_{k} = 1

,

r_{reservation} = 2.5 .

As illustrated in Figure 4, the solution algorithm determines the optimal values for the three decision variables through the periods, and, proposes the optimal timing to begin the sales season (

k = 31

). Compare this feature with the prescription of the optimal time to end the free distribution phase in Example 1, Section 1.

Figure 4. Market with strategic customers: pricing and supply quantities at equilibrium states.

7.4. Positive Network Effects on Demand

Example 5.

Demand for certain innovative products is built-up as time goes by through the word-of-mouth effect (see [6,7]). In our model, this network effect, like other externalities is embedded in the demand function via time-dependent functionals. Unlike previous examples, in Example 5, we consider a time-dependent price cap, based on the Bass diffusion model for new-product-adaption:

κ_{k} = 100 exp (- 0.2 k) / {(1 + 10 exp (- 0.2 k))}^{2} .

Initially, the price cap is increasing with time, reflecting the customers’ gradually increasing willingness to purchase as time goes by (hence a decreasing sensitivity to prices). Again, we use a simple Cobb–Douglas function for the demand mean functional:

μ_{k} (r, k) = \frac{10000}{r^{4 - 0.01 k}}

.

σ_{k} (r_{k}) = 0.5 μ_{k} (r_{k}) + \frac{100}{r_{k}^{3}}

,

α = 0.9

,

n = 80

,

γ = 0.01

,

c_{m_{k}} = 2 - 0.01 k

,

s_{k} = 0.3

,

b_{k} = 0.5

.

The shape of the demand results in Figure 5 are consistent with typical shapes obtained by sales diffusion models. In addition to yielding optimal pricing strategies for the supply channel members at different times, the solution algorithm provides the optimal timing for a change in the supply regime. In the first phase, from period 1 until period 66, the retailer (newsvendor) is better off by under-supplying the market (

{\hat{q}}_{k} < {\bar{D}}_{k}

); while from period 67, a strategy of over-supplying is found to be optimal.

Figure 5. Market with positive network effect at equilibrium states.

8. Concluding Remarks

In this paper, we considered a decentralized supply chain composed of a manufacturer and a retailer within a bilevel (Stackelberg) structure. The manufacturer is the (global) leader and the retailer is the follower. The supply chain is to address an uncertain demand within a discretized time horizon. The two agents must strategically solve their respective optimization problems over the entire span of all periods. That is, the final solution algorithm is to give decision variables that are optimizers of the aggregate expected profit over n periods. The ensuing Nash–Stackelberg equilibria, thus, becomes highly interdependent in time. This nestedness has been a challenge in solving Nash–Stackelberg equilibrium problems in a multi-periodic setting within supply chain contracts. Static single-periodic versions of the problem have been solved in the literature. Although mathematically convenient, such reduced static models may turn out to be myopic as they do not cover the after-effects of current pricing on future demand.

In Theorem 1, we introduce a decoupling algorithm that breaks this nestedness and solves the ensuing three interdependent n-dimensional equilibria problems to three sequences of n single-variable equations. In doing so, we prove the existence of a Nash–Stackelberg for a general bilevel optimization problem for the two suppliers in the contract. Next, in the appendices, we show that many supply channel contracts, such as revenue-sharing and buyback contracts indeed produce the same general equilibrium problem structure and hence can be decoupled by our proposed algorithm.

To include strategic customers’ behavior in our model, we implemented a memory-based demand structure. For example, the potential buyers may have become anchored to the past prices and their decision to purchase a product may, to various degrees, be affected by the history of pricing. Such behavioral features are embedded in the model through the introduction of memory functions.

To demonstrate how such Nash–Stackelberg equilibria can be obtained by the recursive algorithm outlined in Theorem 1 and its corollary, we provided numerical solutions to a variety of special cases. Note, however, that our framework is not limited to such special cases. The numerical illustrations raise questions of interest for future research.

In the numerical section, we demonstrated an interesting link to marketing. Under certain conditions, an optimal strategy is to give away products in a pre-sales period. This stimulates demand, and the parties benefit from increased demand in the remaining time periods. Many high-tech products such as mobile phones and computers have a very short lifespan. Our paper, hence, offers a new framework where the optimality of sales strategies for such products can be discussed and analyzed. An example based on the Bass diffusion model for demand is given in Section 7.4.

The parameters and functional structures that we have used to illustrate the scope of applicability of the solution procedures are merely speculative. To take full advantage of the model, one should try to vary scaling factors and functional forms in a systematic way, for example, through the use of data obtained from empirical studies. Exploring the potential of our modeling approach is a topic for future research.

Author Contributions

Conceptualization, R.A.G., L.K.S. and J.U.; methodology, R.A.G., L.K.S. and J.U.; software, R.A.G.; validation, R.A.G., L.K.S. and J.U.; formal analysis, R.A.G., L.K.S. and J.U.; investigation, R.A.G., L.K.S. and J.U.; resources, R.A.G.; data curation, R.A.G.; writing—original draft preparation, R.A.G.; writing—review and editing, R.A.G.; visualization, R.A.G.; supervision, L.K.S. and J.U.; project administration, R.A.G., L.K.S. and J.U.; funding acquisition, L.K.S. and J.U. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Profit Structures Compatible with the Use of Multiplicative Memory Functions

In this section, we find the sufficient conditions for the profit structures enabling the memory-based algorithm to decouple nested optimization/equilibrium problems. In Appendix B, we show that many conventional supply channel contracts indeed follow these structures.

Let the following be the demand expression at period

k \in {1, \dots, n}

, where n is the number of periods.

D_{k} (r_{k}) = {\tilde{μ}}_{k} (r_{k}) + {\tilde{σ}}_{k} (r_{k}) ϵ_{k}

(A1)

Now, assume that the running profit expression for each period is as below.

{\bar{Π}}_{k}^{r} (r_{k}) = Ψ_{k} (r_{k}) {\tilde{μ}}_{k}^{p} (r_{k}) + Θ_{k} (r_{k}) {\tilde{σ}}_{k}^{p} (r_{k})

(A2)

Using the memory function structure in (8) and (9), the expression in (A1) turns into the following.

D_{k} (r_{k}) = (μ_{k} (r_{k}) + σ_{k} (r_{k}) ϵ_{k}) Φ_{k} (r_{k - 1})

(A3)

Therefore, for (A2), we will have:

\begin{matrix} {\bar{Π}}_{k}^{r} (r_{k}) & = (Ψ_{k} (r_{k}) μ_{k}^{p} (r_{k}) + Θ_{k} (r_{k}) σ_{k}^{p} (r_{k})) Φ_{k}^{p} (r_{k - 1}) \end{matrix}

(A4)

\begin{matrix} = {\tilde{Π}}_{k} (r_{k}) Φ_{k}^{p} (r_{k - 1}) \end{matrix}

(A5)

Thus, when solving the multi-variable optimization problem

max_{r_{n}} J = \sum_{k = 1}^{n} α_{k} {\bar{Π}}_{k}^{r}

, finding the kth armgax,

{\hat{r}}_{k}

, will be equivalent to finding the argmax of the single-variable function

{\tilde{Π}}_{k} (r_{k})

. Whence decoupling becomes possible.

The same argument applies to the expected profit for the manufacturer. In Appendix B, we will see that a special case of the structure in (A2), where

p = 1

, indeed appears in many supply chain optimization contracts.

Appendix B. The Expected Profit Structure in Supply Chain Coordination Problems

In Appendix A, we realized that in order for the decoupling scheme (using memory functions) to work in multiple periods, the single-period profits (for either vendor) must be of specific structures. In other words, in such cases, the multiplicative memory functions decouple the highly nested n-variable optimization (or equilibrium) problem, by turning it into n single-variable problems.

In the following sections, we prove that the desired general structures indeed appear in many classical channel coordination contracts; hence, making our theoretical formwork applicable to a large variety of multi-period optimization/coordinatin problems dealing with uncertain demand. This class of contracts include, e.g., The wholesale Price Contracts, The Buyback Contracts, and, The revenue-sharing Contracts. We have borrowed the nomenclature from [1]. Other sources may use different names; for example, [13] refers to buyback contracts as return policies.

Model Variables and Parameters

\begin{matrix} D = uncertain demand \\ μ = E [D] expected value of uncertain demand \\ r = retail price per unit \\ w = wholesale price per unit \\ q = order quantity \\ s = salvage price per unit \\ g_{r} = retailer ’ s goodwill penalty per unit, incurred for each unmet demand unit \\ g_{m} = manufacturer ’ s goodwill penalty per unit \\ c_{r} = retailer ’ s marginal \cos t per unit \\ c_{m} = manufacturer ’ s production \cos t per unit \end{matrix}

Appendix B.1. The Wholesale Price Contract

We start with analyzing the general newsvendor problem, also known as the wholesale price contract. In this contract, the retailer profit is obtained as below.

\begin{matrix} Π^{r} (r, q) & = r min (D, q) + s {(q - D)}^{+} - c_{r} q - g_{r} {(D - q)}^{+} - w q \\ = r min (D, q) + s (q - min (D, q)) - c_{r} q - g_{r} (D - min (D, q)) - w q \\ = (r - s + g_{r}) min (D, q) - (c_{r} + w - s) q - g_{r} D \end{matrix}

(A6)

In the analysis of all of the contracts presented here, our strategy is as follows.

Obtain the expression for the expected value of retailer’s profit as a function of r and q.
Apply the First Order Condition (F.O.C.) with respect to q on ${\bar{Π}}^{r}$ , i.e., $\frac{\partial {\bar{Π}}^{r}}{\partial q} = 0$ to obtain $q^{*}$ as a function of r.
Check the concavity of ${\bar{Π}}^{r}$ with respect to $q^{*} (r)$ .
Substitute the obtained $q^{*}$ as a function of r in the expression for ${\bar{Π}}^{r}$ and see if it is of the general structure in (A2).
Substitute the obtained $q^{*}$ as a function of r in the expression for ${\bar{Π}}^{m}$ and see if it is of the general structure in (A2).

To obtain the expected value of the retailer’s profit as stated in (A6), we need to obtain the expected sales,

S (q)

, i.e., the expected value of

min (D, q) .

For simplicity, we start with a distribution function,

f_{D}

, for D, instead of

f_{ϵ}

, bearing in mind that for

D = μ + σ ϵ

, we have:

F_{D}^{- 1} (\cdot) = μ + σ F_{ϵ}^{- 1} (\cdot) .

\begin{matrix} S (q) = E [min (D, q)] & = \int_{0}^{\infty} ζ f_{D} (ζ) d ζ = \int_{0}^{q} ζ f_{D} (ζ) d ζ + \int_{q}^{\infty} q f_{D} (ζ) d ζ \\ = q - \int_{0}^{q} F_{D} (ζ) d ζ \end{matrix}

(A7)

Alternatively, using our normalization based on

f_{ϵ}

instead of

f_{D}

, we obtain the following statement for the expected sales.

S (q) = E [min (D, q)] = μ + σ \int_{- \infty}^{\infty} ζ f_{ϵ} (ζ) d ζ = q - (q - μ) F_{ϵ} (\frac{q - μ}{σ}) + σ \int_{- \infty}^{\frac{q - μ}{σ}} ζ f_{ϵ} (ζ) d ζ

(A8)

we define

I (q) = E [{(q - D)}^{+}] = q - S (q)

as the expected left-over inventory. Similarly, let

L (q) = E [{(D - q)}^{+}] = μ - S (q)

be the expected lost sales function. Further, observe that

\frac{d S (q)}{d q} = 1 - F_{D} (q)

. Now, we can calculate the expected value of the retailer’s profit, as expressed in (A6).

{\bar{Π}}^{r} (r, q) = (r - s + g_{r}) S (q) - (c_{r} - s + w) q - g_{r} μ

(A9)

The Strict concavity of

{\bar{Π}}^{r} (q)

with respect to q is obvious:

\frac{\partial^{2} {\bar{Π}}^{r} (q)}{\partial q^{2}} = - (r - s + g_{r}) \times f_{D} (q)

(A10)

Thus, we apply the F.O.C. with respect to q and obtain

q^{*} (r)

as below.

q^{*} (r) = F_{D}^{- 1} (\frac{r - w + g_{r} - c_{r}}{r - s + g_{r}}) = μ (r) + σ (r) F_{ϵ}^{- 1} (\frac{r - w + g_{r} - c_{r}}{r - s + g_{r}})

(A11)

Substituting (A11) in (A9) and using (A8) we have:

{\bar{Π}}^{r} (r) = (r - w - c_{r}) μ (r) + (r - s + g_{r}) (\int_{- \infty}^{F_{ϵ}^{- 1} (\frac{r - w + g_{r} - c_{r}}{r - s + g_{r}})} ζ f_{ϵ} (ζ) d ζ) σ (r)

(A12)

In a single-vendor system, the F.O.C. applied to (A12) yields the

r^{*}

. However, in our Stackelberg framework, a numerical solution to

\frac{\partial {\bar{Π}}^{r}}{\partial r} = 0

provides us with

r^{*}

as a function of w:

r^{*} (w) .

The optimal order quantity,

q^{*}

can then be obtained using

r^{*} .

Let us now analyze the expected value of the manufacturer’s profit.

\begin{matrix} {\bar{Π}}^{m} (r^{*} (w), w) & = w q^{*} (w) - c_{m} q^{*} (w) + g_{m} S (q^{*} (w)) - g_{m} μ (r^{*} (w)) \\ = q^{*} (w - c_{m} + g_{m}) - g_{m} (σ F_{ϵ}^{- 1} (y) \times y + μ + \int_{- \infty}^{F_{ϵ}^{- 1} (y)} ζ f_{ϵ} (ζ) d ζ) \\ = μ (w) (w - c_{m}) + σ (w) (F_{ϵ}^{- 1} (y) (w - c_{m} - g_{m} (y - 1)) \\ + g_{m} \int_{- \infty}^{F_{ϵ}^{- 1} (y)} ζ f_{ϵ} (ζ) d ζ) \\ where y = \frac{r - w + g_{r} - c_{r}}{r - s + g_{r}} \end{matrix}

(A13)

The structure of

{\bar{Π}}^{m} (w)

, as stated in (A13) is also of the desired type.

Appendix B.2. The Buyback Contract

In a buyback contract, the manufacturer pays the retailer

b \leq w_{b}

per unit remaining at the end of the selling season (period).

{\bar{Π}}^{r} (r, q) = (r - s + g_{r} - b) S (q) - (w_{b} - b + c_{r} - s) q - g_{r} μ

(A14)

Similarly, assuming that

r - s + g_{r} - b > 0,

the strict concavity of

{\bar{Π}}^{r} (q)

with respect to q is obvious. Applying the F.O.C. with respect to q, we obtain the following for

q^{*} (r)

.

q^{*} (r) = F_{D}^{- 1} (\frac{r - w_{b} + g_{r} - c_{r}}{r - s + g_{r} - b}) = μ (r) + σ (r) F_{ϵ}^{- 1} (\frac{r - w_{b} + g_{r} - c_{r}}{r - s + g_{r} - b})

(A15)

Substituting (A15) in (A14), we obtain the following expression for the retailer’s expected profit.

{\bar{Π}}^{r} (r) = (r - w_{b} - c_{r}) μ (r) + (r - s + g_{r} - b) (\int_{- \infty}^{F_{ϵ}^{- 1} (\frac{r - w_{b} + g_{r} - c_{r}}{r - s + g_{r} - b})} ζ f_{ϵ} (ζ) d ζ) σ (r)

(A16)

Notice that, due to our normalization in which we set

E [ϵ] = 0

, the integral term

Γ : = \int_{- \infty}^{F_{ϵ}^{- 1} (\frac{r + g_{r} - c_{r} - w_{b}}{r - s + g_{r} - b})} ζ f_{ϵ} (ζ) d ζ

is always negative, making

(r - s + g_{r} - b) \cdot Γ \cdot σ (r)

in (A16) also negative. In the literature, the latter term is called the loss due to stochasticity. Similarly, we obtain the expected value of the manufacturer’s profit as below.

\begin{matrix} {\bar{Π}}^{m} (w) & = μ (w) (w_{b} - c_{m}) + σ (w) (F_{ϵ}^{- 1} (y) (w - c_{m} + g_{m} (1 - y) - b y) \\ + (g_{m} + b) \int_{- \infty}^{F_{ϵ}^{- 1} (y)} ζ f (ζ) d ζ) \\ where y = (\frac{r - w_{b} + g_{r} - c_{r}}{r - s + g_{r} - b}) \end{matrix}

(A17)

Appendix B.3. The Revenue-Sharing Contract

With a revenue-sharing contract, the manufacturer charges

w_{r}

per unit purchased and the retailer gives the manufacturer a percentage of their revenue. Let

θ

be the fraction of supply chain revenue the retailer keeps, so

(1 - θ)

is the fraction given to the manufacturer. The retailer’s expected profit function is:

{\bar{Π}}^{r} (r, q) = (θ (r - s) + g_{r}) S (q) - (w_{r} + c_{r} - θ s) q - g_{r} μ .

(A18)

we observe that

{\bar{Π}}^{r}

is concave with respect to q because

θ (r - s) + g_{r} > 0 .

Applying F.O.C. with respect to q yields:

q^{*} (r) = F_{D}^{- 1} (\frac{θ r - w_{r} + g_{r} - c_{r}}{θ (r - s) + g_{r}}) = μ (r) + σ (r) F_{ϵ}^{- 1} (\frac{θ r - w_{r} + g_{r} - c_{r}}{θ (r - s) + g_{r}})

(A19)

Substituting (A19) into (A18), we have:

{\bar{Π}}^{r} (r) = (θ r - w_{r} - c_{r}) μ (r) + ((θ (r - s) + g_{r}) \int_{- \infty}^{F_{ϵ}^{- 1} (\frac{θ r - w_{r} + g_{r} - c_{r}}{θ (r - s) + g_{r}})} ζ f_{ϵ} (ζ) d ζ) σ (r) .

(A20)

Furthermore, notice how

θ = 1

turns (A20) into (A12).

Next, we obtain the manufacturer’s expected profit as below.

\begin{matrix} {\bar{Π}}^{m} & = (g_{m} + (1 - θ) (r - s)) S (q) + (w_{r} + (1 - θ) s - c_{m}) q - g_{m} μ \\ = μ ((1 - θ) r + w_{r} - c_{m}) \\ + σ (F_{ϵ}^{- 1} (y) (g_{m} + (1 - θ) r + w_{r} - c_{m} - y (g_{m} + (r - s) (1 - θ))) \\ + g_{m} \int_{- \infty}^{F_{ϵ}^{- 1} (y)} ζ f_{ϵ} (ζ) d ζ) \\ where y = \frac{θ r - w_{r} + g_{r} + c_{r}}{θ (r - s) + g_{r}} . \end{matrix}

(A21)

Notice how

θ = 1

turns (A21) into (A13).

Appendix C. Proof of Proposition 2

Proof.

Let

F_{ϵ}

denote the cumulative distribution of

ϵ

. Since

ϵ

is continuous and supported on an interval, with density

f_{ϵ} > 0

a.e. on its support, the expected profit

{\bar{Π}}^{r}

is strictly concave in q on the support of D, and the order quantity q from (20) is unique. It is clear that:

q = μ (r, k) + σ (r, k) F_{ϵ}^{- 1} (\frac{r - w}{r - s}) .

(A22)

Using (7) and (20), we obtain:

\begin{matrix} E [Π^{r}] & = (r - s) E [\min (D, q)] - (w - s) q \\ = (r - s) (μ (r, k) + E [min (σ (r, k) ϵ, σ (r, k) F_{ϵ}^{- 1} (\frac{r - w}{r - s}))]) \\ - (w - s) (μ (r, k) + σ (r, k) F_{ϵ}^{- 1} (\frac{r - w}{r - s})) . \end{matrix}

(A23)

Equations (7) and(A22) indicate that:

\begin{matrix} E [\min (ϵ, F_{ϵ}^{- 1} (\frac{r - w}{r - s}))] \end{matrix}

(A24)

\begin{matrix} = \int_{- \infty}^{F_{ϵ}^{- 1} (\frac{r - w}{r - s})} ζ f_{ϵ} (ζ) d ζ + F_{ϵ}^{- 1} (\frac{r - w}{r - s}) P (ϵ \geq F_{ϵ}^{- 1} (\frac{r - w}{r - s})) \end{matrix}

(A25)

\begin{matrix} = \int_{- \infty}^{F_{ϵ}^{- 1} (\frac{r - w}{r - s})} ζ f_{ϵ} (ζ) d ζ + F_{ϵ}^{- 1} (\frac{r - w}{r - s}) (1 - \frac{r - w}{r - s}) . \end{matrix}

(A26)

Inserting (A26) into (A23) and simplifying the resulting expression yields:

{\bar{Π}}^{r} = E [Π^{r}] = (r - w) μ (r, k) + L_{ϵ} (r, w) σ (r, k)

(A27)

where

L_{ϵ}

is defined as:

L_{ϵ} (r, w) = (r - s) \int_{- \infty}^{z} ζ f_{ϵ} (ζ) d ζ z = F_{ϵ}^{- 1} (\frac{r - w}{r - s}) .

(A28)

□

References

Cachon, G.P. Supply chain coordination with contracts. In The Handbook of Operations Research and Management Science: Supply Chain Management: Design, Coordination and Operation; de Kok, A.G., Graves, S.C., Eds.; Elsevier: Amsterdam, The Amsterdam, 2003; Chapter 6; pp. 229–340. [Google Scholar]
Holmes, B. The Economics of Freemium. Wall Street Jorunal, 3 March 2013. Available online: http://blogs.wsj.com/accelerators/2013/03/03/the-economics-of-freemium/ (accessed on 25 August 2022).
Chowdhry, A. Samsung’s ‘Ultimate Test Drive’ Offers Free Trial For Note 5, S6 Edge+ And S6 Edge To iPhone Users. Forbes, 21 August 2015. Available online: http://www.forbes.com/sites/amitchowdhry/2015/08/21/samsung-ultimate-test/ (accessed on 25 August 2022).
Khouja, M.; Liu, X.; Zhou, J. To sell or not to sell to an off-price retailer in the presence of strategic consumers. Omega 2020, 90, 102002. [Google Scholar] [CrossRef]
Levin, Y.; McGill, J.; Nediak, M. Dynamic pricing in the presence of strategic consumers and oligopolistic competition. Manag. Sci. 2009, 55, 32–46. [Google Scholar] [CrossRef]
Bass, F.M. A new product growth for model consumer durables. Manag. Sci. 1969, 15, 215–227. [Google Scholar] [CrossRef]
Mahajan, V.; Muller, E.; Bass, F.M. New product diffusion models in marketing: A review and directions for research. J. Mark. 1990, 54, 1–26. [Google Scholar] [CrossRef]
Azoury, K.S. Bayes solution to dynamic inventory models under unknown demand distribution. Manag. Sci. 1985, 31, 1150–1160. [Google Scholar] [CrossRef]
Lariviere, M.A.; Porteus, E.L. Stalking information: Bayesian inventory management with unobserved lost sales. Manag. Sci. 1999, 45, 346–363. [Google Scholar] [CrossRef]
Bensoussan, A.; Çakanyildirim, M.; Sethi, S.P. A multiperiod newsvendor problem with partially observed demand. Math. Oper. Res. 2007, 32, 322–344. [Google Scholar] [CrossRef]
Levi, R.; Roundy, R.O.; Shmoys, D.B. Provably near-optimal sampling-based policies for stochastic inventory control models. Math. Oper. Res. 2007, 32, 821–839. [Google Scholar] [CrossRef]
Kim, G.; Wu, K.; Huang, E. Optimal inventory control in a multi-period newsvendor problem with non-stationary demand. Adv. Eng. Inform. 2015, 29, 139–145. [Google Scholar] [CrossRef]
Pasternack, B.A. Optimal pricing and return policies for perishable commodities. Mark. Sci. 1985, 4, 166–176. [Google Scholar] [CrossRef]
Gümüş, M.; Ray, S.; Yin, S. Returns policies between channel partners for durable products. Mark. Sci. 2013, 32, 622–643. [Google Scholar] [CrossRef]
Azad Gholami, R.; Sandal, L.K.; Ubøe, J. Channel Coordination in a Multi-Period Newsvendor Model with Dynamic, Price-Dependent Stochastic Demand; NHH Dept. of Business and Management Science Discussion Paper; Department of Business and Management Science NHH: Bergen, Norway, 2016. [Google Scholar]
Gholami, R.A.; Sandal, L.K.; Ubøe, J. A solution algorithm for multi-period bi-level channel optimization with dynamic price-dependent stochastic demand. Omega 2021, 102, 102297. [Google Scholar] [CrossRef]
Azad Gholami, R.; Sandal, L.K.; Ubøe, J. Markets With Memory: Dynamic Channel Optimization Models with Price-Dependent Stochastic Demand; NHH Dept. of Business and Management Science Discussion Paper; Department of Business and Management Science NHH: Bergen, Norway, 2019. [Google Scholar]
Whitin, T.M. Inventory control and price theory. Manag. Sci. 1955, 2, 61–68. [Google Scholar] [CrossRef]
Surti, C.; Celani, A.; Gajpal, Y. The Newsvendor Problem: The Role of Prospect Theory and Feedback. Eur. J. Oper. Res. 2020, 287, 251–261. [Google Scholar] [CrossRef]
Vipin, B.; Amit, R.K. Describing decision bias in the newsvendor problem: A prospect theory model. Omega 2019, 82, 132–141. [Google Scholar] [CrossRef]
Wu, M.; Bai, T.; Zhu, S.X. A loss averse competitive newsvendor problem with anchoring. Omega 2018, 81, 99–111. [Google Scholar] [CrossRef]
Sherif, M.; Taub, D.; Hovland, C.I. Assimilation and contrast effects of anchoring stimuli on judgments. J. Exp. Psychol. 1958, 55, 150. [Google Scholar] [CrossRef]
Tversky, A.; Kahneman, D. Judgment under Uncertainty: Heuristics and Biases: Biases in judgments reveal some heuristics of thinking under uncertainty. Science 1974, 185, 1124–1131. [Google Scholar] [CrossRef]
Azad Gholami, R.; Sandal, L.K.; Ubøe, J. Who is Benefiting from Postponement in Multi-Periodic Supply Channel Optimization? Available online: https://ssrn.com/abstract=4204363 (accessed on 25 August 2022).
Bagwell, K. Introductory Price as a Signal of Cost in a Model of Repeat Business. Rev. Econ. Stud. 1987, 54, 365–384. [Google Scholar]
Bagwell, K. The Economic Analysis of Advertising. In Handbook of Industrial Organization; Armstrong, M., Porter, R., Eds.; North-Holland: Amsterdam, The Amsterdam, 2007; Volume 3, pp. 1701–1844. [Google Scholar]
Petruzzi, C.N.; Dada, M. Pricing and the newsvendor problem: A review with extensions. Oper. Res. 1999, 47, 183–194. [Google Scholar] [CrossRef]
Rao, V.R. Pricing research in marketing: The state of the art. J. Bus. 1984, 57, S39–S60. [Google Scholar] [CrossRef]
Fassnacht, M.; El Husseini, S. EDLP vs. Hi-Lo Pricing Strategies in Retailing—A State of the Art Article. J. Bus. Econ. 2013, 83, 259–289. [Google Scholar]
Øksendal, B.; Sandal, L.; Ubøe, J. Stochastic Stackelberg equilibria with applications to time-dependent newsvendor models. J. Econ. Dyn. Control. 2013, 37, 1284–1299. [Google Scholar] [CrossRef]
Rubio-Herrero, J.; Baykal-Gürsoy, M. On the unimodality of the price-setting newsvendor problem with additive demand under risk considerations. Eur. J. Oper. Res. 2018, 265, 962–974. [Google Scholar] [CrossRef]
Xu, X.; Cai, X.; Chen, Y. Unimodality of price setting newsvendor’s objective function with multiplicative demand and its applications. Int. J. Prod. Econ. 2011, 133, 653–661. [Google Scholar] [CrossRef]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.