Article

Multistage Adaptive Robust Binary Optimization: Uncertainty Set Lifting versus Partitioning through Breakpoints Optimization

Department of Chemical & Materials Engineering, University of Alberta, Edmonton, AB T6G 1H9, Canada
* Author to whom correspondence should be addressed.
Mathematics 2023, 11(18), 3883; https://doi.org/10.3390/math11183883
Submission received: 22 August 2023 / Revised: 4 September 2023 / Accepted: 9 September 2023 / Published: 12 September 2023
(This article belongs to the Section Engineering Mathematics)

Abstract

Two methods for multistage adaptive robust binary optimization are investigated in this work. These methods, referred to as binary decision rule and finite adaptability, inherently share similarities in dividing the uncertainty set into subsets. In the binary decision rule method, the uncertainty is lifted using indicator functions, which results in a nonconvex lifted uncertainty set. The linear decision rule is then applied to a convexified version of the lifted uncertainty set. In the finite adaptability method, the uncertainty set is divided into partitions and a constant decision is applied for each partition. In both methods, breakpoints are utilized, either to define the indicator functions in the lifting method or to partition the uncertainty set in the finite adaptability method. In this work, we propose variable breakpoint location optimization for both methods. An extensive computational study on an illustrating example and a larger case study is conducted, and the performance of the binary decision rule and finite adaptability methods under fixed and variable breakpoint approaches is compared.

1. Introduction

Multistage decision making under uncertainty has practical applications in many areas, such as finance, engineering, and operations management. To mention a few examples, Goulart et al. [1] applied stochastic optimization for the robust control of linear discrete-time systems. Skaf and Boyd [2] designed an affine controller for linear dynamic systems. Ben-Tal et al. [3] addressed the problem of minimizing the overall cost of a supply chain under demand uncertainty. See and Sim [4] proposed a robust optimization method to tackle an inventory control problem where only limited demand information is available. Gounaris et al. [5] studied the robust vehicle routing problem to minimize the delivery costs of a product to geographically dispersed customers using capacity-constrained vehicles. Calafiore [6] presented an affine control method for dynamic asset allocation. Fadda et al. [7] studied the multi-period stochastic assignment problem for social engagement.
There are challenges to solving multistage adaptive robust optimization problems. As Shapiro and Nemirovski [8] pointed out, multistage adaptive robust optimization problems including real-valued and binary decision variables are computationally intractable in general. Dyer and Stougie [9] demonstrated that obtaining the optimal solution for the class of single-stage uncertain problems involving only real-valued decisions is already #P-hard. One popular solution approach to this problem is to use decision rules, where variables are modeled as functions of uncertain parameters. The application of decision rules for real-valued functions in stochastic programming goes back to 1974 [10]. However, only recently did the decision rule-based approach receive major attention, with the research advances made in robust optimization [11,12]. Ben-Tal et al. [13] formulated the real-valued functions as affine functions of uncertain parameters. The simple structure of linear decision rules may result in some optimality loss. However, it has the advantage of providing the scalability required to deal with multistage stochastic adaptive problems. It should be mentioned that linear decision rules are shown to be optimal in some problem instances. For instance, Bertsimas et al. [14] showed the optimality of affine control policies under one-dimensional uncertainty within the robust optimization context. The reader can refer to [5,15] for other cases.
In order to reduce the loss of optimality due to linear decision rules, various nonlinear decision rule structures were proposed. Motivated by the success of linear decision rules in providing favorable scalability for multistage problems, the nonlinear decision rules are formulated as $x(\xi) = X O(\xi)$, where $x$ is the decision variable, $\xi$ denotes the uncertain parameters, $X$ represents the decision rule coefficients, and $O : \mathbb{R}^m \to \mathbb{R}^{m'}$ is a nonlinear operator that defines the basis terms in the decision rule. This structure considerably improves the solution optimality while retaining the scalability property. Various decision rule structures have been suggested, such as linear [1,2,13,16], piecewise linear [17,18,19,20], multilinear [19], quadratic [19,21] and polynomial. Kuhn et al. [16] proposed a method to estimate the approximation error introduced by linear decision rules and argued that the method is applicable to problems with random recourse and multiple decision stages. Chen et al. [17] addressed uncertain problems where only limited information on the underlying stochastic parameters is available. The authors discussed that linear decision rules are inadequate for this type of problem and can result in infeasible solutions; they suggested an alternative second-order conic optimization model that can be solved efficiently. Chen and Zhang [18] presented an extended affinely adjustable robust counterpart method to solve multistage uncertain linear problems and illustrated that the potential of their proposed method is beyond what was presented in the literature. Georghiou et al. [19] proposed a lifting technique that provides tighter upper and lower bounds compared to the case where the linear decision rule is directly applied to the original stochastic problem. Their structured lifting method gives rise to flexible piecewise linear and nonlinear decision rules.
Goh and Sim [20] developed new piecewise linear decision rules that provide a more flexible formulation of the original uncertain problem and result in better bounds on the objective. Bertsimas et al. [22] proposed a framework for tackling linear dynamic systems under uncertainty, introducing a hierarchy of polynomial control policies that exhibit strong numerical performance at a moderate computational expense.
Although there is a wealth of literature available for real-valued decision rules, the available literature for binary decision rules is relatively scarce. Bertsimas and Georghiou [23] proposed a structure for binary decision rules that models binary variables as piecewise constant functions and can provide high-quality solutions; however, the computational expense is significant. Bertsimas and Caramanis [24] proposed a structure for integer decision rules formulated as $y(\xi) = \lceil y^\top \xi \rceil$, where $y \in \mathbb{Z}^k$ and $\lceil \cdot \rceil$ is the ceiling function. In their work, the resulting problem is approximated and solved using a randomized algorithm [25] that provides only a limited guarantee on solution feasibility. Hanasusanto et al. [26] proposed a so-called "k-adaptable" structure that can only be applied to two-stage uncertain binary problems, where the decision maker pre-commits to $k$ second-stage policies and implements the best one once the uncertain parameters are revealed. Recently, Bertsimas and Georghiou [27] proposed a systematic lifting method for binary decision rules that trades off scalability and optimality and can be applied to large multistage problems. They demonstrated that the method is highly scalable, provides high-quality solutions, and can readily be used along with real-valued decision rules of the general structure $x(\xi) = X O(\xi)$, $O : \mathbb{R}^m \to \mathbb{R}^{m'}$. Postek and Den Hertog [28] proposed a method to iteratively split the uncertainty set into subsets based on some heuristics; the method keeps the computational complexity at the same level as the static robust optimization problem. Bertsimas and Dunning [29] extended the work on finite adaptability and presented a partition-and-bound method for the multistage adaptive robust mixed integer optimization problem.
While there are many possible ways for uncertainty lifting and uncertainty set partitioning, we focus on the grid partition-based method in this work, considering its simplicity of implementation. This means that, for the lifting method, we lift each uncertain parameter individually instead of any aggregated version of them, whereas, for the uncertainty set partition method, we divide the uncertainty set using hyper-rectangles. Under this assumption, the binary decision rule (lifting) method and the finite adaptability method (with grid partitioning) for addressing multistage adaptive robust optimization problems inherently share similarities. The goal of this study is to compare the solution complexity of these two methods in order to obtain insights into the advantages and limitations of each. The contribution of this work is summarized below:
  • Point out the similarity and difference between the lifting method and the finite adaptability (partitioning) method. We demonstrate that, under an equivalent setting, the partitioning method in general leads to better solution quality, since the lifting method has less flexibility due to the restriction of the linear decision rule.
  • Propose novel breakpoint optimization formulations for both the lifting and partitioning methods. It is shown that breakpoint optimization leads to improved solution quality at the cost of solving mixed integer nonlinear problems (MINLP) instead of the mixed integer linear problems (MILP) obtained under a fixed breakpoint setting.
  • Conduct an extensive computational study to investigate the advantages of the lifting and finite adaptability methods under fixed and variable breakpoint settings. We highlight the tractability of the lifting method for handling large cases with many time stages, and the limitation of the finite adaptability method caused by the rapid growth in model size.
This paper is organized as follows. Section 2 presents the general multistage adaptive robust binary optimization problem formulation, and the traditional scenario tree-based method is applied to an illustrating example. Section 3 provides detailed formulations of the binary decision rule method based on the lifting technique, and the variable breakpoint technique is explained. Section 4 provides the detailed formulation for the finite adaptability method based on uncertainty set partitioning, and the variable breakpoint-based formulation is presented. Section 5 presents an extensive computational study using an inventory control problem and, finally, Section 6 concludes the paper.

2. Multistage Adaptive Robust Binary Optimization

The multistage adaptive robust binary optimization problem to be addressed is as follows:
$$\min \; \mathbb{E}_{\xi}\left[\sum_{t} (D_t \xi_{[t]})^\top y_t(\xi_{[t]})\right] \tag{1a}$$
$$\text{s.t.} \quad \sum_{\tau=1}^{t} B_{t,\tau}\, y_\tau(\xi_{[\tau]}) \le E_t \xi_{[t]} \quad \forall \xi \in \Xi,\; t = 1, \ldots, T \tag{1b}$$
$$y_t(\xi_{[t]}) \in \{0,1\}^{N_{y_t}} \tag{1c}$$
where $D_t$, $B_{t,\tau}$ and $E_t$ are constant vectors/matrices, and $y_t(\xi_{[t]})$ is the adaptive binary decision variable, expressed as a function of the uncertainty vector $\xi_{[t]}$, based on the following notation:
$\xi_{q,t}$: scalar, the $q$-th uncertain parameter of stage $t$
$\xi_t$: vector of uncertain parameters of stage $t$: $[\xi_{1,t}, \ldots, \xi_{\bar q_t, t}]$, where $\bar q_t$ is the number of uncertain parameters in stage $t$
$\xi_{[t]}$: vector of uncertain parameters from stage 1 to $t$: $[1, \xi_1, \ldots, \xi_t]$
$\xi$: vector of all uncertain parameters (from stage 1 to $T$): $[1, \xi_1, \ldots, \xi_T]$, that is, $\xi_{[T]}$
In this work, we consider a polyhedral uncertainty set for $\xi \in \mathbb{R}^{|\xi_{[T]}|}$ (where $|\xi_{[T]}|$ is the size of $\xi$):
$$\Xi = \{\xi : J\xi \le h\} \tag{2}$$
where $J$ and $h$ are a constant matrix and vector, respectively. For each uncertain parameter, we assume that it is subject to an interval $\Xi_{q,t} = \{\xi_{q,t} : \underline{\xi}_{q,t} \le \xi_{q,t} \le \bar{\xi}_{q,t}\}$.
Illustrating example
Throughout this paper, we will study the following illustrating example while presenting the various solution methods. The problem has two stages ($t = 1, 2$), and each stage includes only one uncertain parameter. The first-stage decision $y_1$ can depend on $\xi_1$, and the second-stage decision $y_2$ can depend on both $\xi_1$ and $\xi_2$. The problem is formulated as:
$$\min \; \mathbb{E}_{\xi}\left(-y_1(\xi) - y_2(\xi)\right) \tag{3a}$$
$$\text{s.t.} \quad 2 y_1(\xi) \le 1 + 2\xi_1 \quad \forall \xi \in \Xi \tag{3b}$$
$$3 y_1(\xi) + 2 y_2(\xi) \le 1 + 2\xi_1 + \xi_2 \quad \forall \xi \in \Xi \tag{3c}$$
$$y_t(\xi) \in \{0,1\} \quad \forall \xi \in \Xi,\; t = 1, 2 \tag{3d}$$
Assume that each uncertain parameter follows an independent uniform distribution in a certain interval and the uncertainty set Ξ is given as:
$$\Xi = \{\xi : \xi_1 \in [0, 3],\; \xi_2 \in [0, 6]\}$$
The above numerical example can be cast into formulation (1a)–(1c) with $B_{11} = 2$, $B_{21} = 3$, $B_{22} = 2$, $E_1 = [1, 2]$, $E_2 = [1, 2, 1]$, $D_1 = [-1, 0]$, $D_2 = [-1, 0, 0]$, $\xi_{[1]} = [1, \xi_1]$, $\xi_{[2]} = [1, \xi_1, \xi_2]$.
Before presenting the lifting and finite adaptability methods, the traditional scenario tree-based method is applied to the illustrating problem in order to investigate its true optimal solution. When the number of scenarios is reasonably large, we can find an approximate optimal solution of the adaptive optimization problem. Figure 1 illustrates the scenario tree for two time stages where each node includes three branches.
In the following scenario tree-based model, $y_k$ represents the decision made at node $k$, $a(k)$ denotes the parent node of $k$, and $p_k$ is the unconditional probability of node $k$. Equations (4a)–(4d) present the nodal formulation. For the scenario tree shown in Figure 1, $K_1 = \{1, 2, 3\}$ and $K_2 = \{4, \ldots, 12\}$, which denote the sets of nodes at the first and second time stages, respectively.
$$\min \; -\sum_{k \in K_1 \cup K_2} p_k y_k \tag{4a}$$
$$\text{s.t.} \quad 2 y_k \le 1 + 2\xi_k \quad \forall k \in K_1 \tag{4b}$$
$$3 y_{a(k)} + 2 y_k \le 1 + 2\xi_{a(k)} + \xi_k \quad \forall k \in K_2 \tag{4c}$$
$$y_k \in \{0,1\} \quad \forall k \in K_1 \cup K_2 \tag{4d}$$
In this work, all the mixed-integer linear optimization problems were modeled in GAMS 25 and solved using the CPLEX 12 solver on a workstation (Intel Xeon Dual 20 Core 2.0 GHz Processor, 128 GB DDR4 ECC RAM). Table 1 presents the results of the above model for 4, 11, 31 and 99 branches per node. Figure 2 and Figure 3 illustrate the binary solutions for 4 and 31 branches per node, respectively. In these figures, the black squares indicate a value of 1 and the white squares indicate a value of 0. Table 1 shows that, by increasing the number of scenarios, the optimal objective value converges to a value around $-1.6$. We use this as a benchmark for comparing the different methods to be discussed.
Regarding the recourse decisions, Figure 3 illustrates that, for $y_1(\xi_1)$, there is a change point around 1 for $\xi_1$. For $y_2(\xi_1, \xi_2)$, there are three change points at around 0.5, 1 and 2 for $\xi_1$, and two change points at around 1 and 2 for $\xi_2$. These values can be used as a basis for comparison when the lifting and partitioning methods are implemented.
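For this small example, the nodal model can even be solved by direct enumeration, because the tree decouples by first-stage node: fix the decision at a first-stage node, then every child node independently picks its best feasible second-stage decision. A minimal Python sketch, assuming each interval is discretized by the midpoints of equal-width subintervals with equal branch probabilities (the grid and probabilities are assumptions of this sketch; the paper does not state its discretization):

```python
def solve_scenario_tree(n_branches):
    """Solve the nodal model for the illustrating example by enumeration:
    for each first-stage node try both values of y1, let every child node
    pick its best feasible y2, and keep the better subtree value."""
    # Assumed discretization (not stated in the paper): midpoints of
    # equal-width subintervals, all branches equally likely.
    xi1 = [3.0 * (i + 0.5) / n_branches for i in range(n_branches)]
    xi2 = [6.0 * (j + 0.5) / n_branches for j in range(n_branches)]
    p1 = 1.0 / n_branches        # unconditional probability, stage-1 node
    p2 = 1.0 / n_branches ** 2   # unconditional probability, stage-2 node

    obj = 0.0
    for x1 in xi1:
        best = None
        for y1 in (0, 1):
            if 2 * y1 > 1 + 2 * x1:          # first-stage constraint (4b)
                continue
            val, feasible = p1 * y1, True
            for x2 in xi2:
                rhs = 1 + 2 * x1 + x2
                if 3 * y1 > rhs:             # even y2 = 0 violates (4c)
                    feasible = False
                    break
                val += p2 * (1 if 3 * y1 + 2 <= rhs else 0)  # best y2
            if feasible and (best is None or val > best):
                best = val
        obj += best
    return -obj                  # objective minimizes -sum p_k * y_k

print(round(solve_scenario_tree(3), 4))   # -1.8889 with 3 branches
print(round(solve_scenario_tree(99), 4))  # close to the -1.6 benchmark
```

With 99 branches per node the returned objective is roughly $-1.6$, consistent with the benchmark reported in Table 1; a coarse 3-branch tree is noticeably more optimistic.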

3. Binary Decision Rule with Lifted Uncertainty

3.1. Uncertainty Lifting

Bertsimas and Georghiou [27] proposed a decision rule method for adaptive binary variables. This method enforces a linear relation between the binary variable and the lifted uncertain parameters. In this method, 0–1 indicator functions are defined based on a set of breakpoints for each uncertain parameter. The utilization of the indicator functions results in a nonconvex lifted uncertainty set. To resolve this problem, a convex overestimation is applied to the lifted nonconvex set in order to obtain a convex set. The accuracy of the solution can be improved by increasing the number of breakpoints in the lifted set. While traditional scenario tree-based methods result in an exponential growth of model size, which restricts their application to large-scale problems, the lifting method provides the scalability and tractability required for large-scale problems.
Consider a single uncertain parameter $\xi_{q,t}$ subject to the interval $\Xi_{q,t} = \{\xi_{q,t} : \underline{\xi}_{q,t} \le \xi_{q,t} \le \bar{\xi}_{q,t}\}$, and assume that the interval is divided into $(\bar r_{q,t} + 1)$ subintervals using $\bar r_{q,t}$ breakpoints $\alpha_{r,q,t}$, $r = 1, \ldots, \bar r_{q,t}$. To lift the uncertainty, the indicator functions $Q_{r,q,t}(\cdot)$ of the uncertain parameters are used. The following list summarizes the notation used for lifting a single uncertain parameter $\xi_{q,t}$:
$\bar r_{q,t}$: scalar, number of breakpoints applied for $\xi_{q,t}$
$\alpha_{r,q,t}$: scalar, value of the $r$-th breakpoint for $\xi_{q,t}$, $r = 1, \ldots, \bar r_{q,t}$; as a generalization, we denote the bounds as $\alpha_{0,q,t} \equiv \underline{\xi}_{q,t}$ and $\alpha_{\bar r_{q,t}+1,q,t} \equiv \bar{\xi}_{q,t}$, with $\alpha_{0,q,t} < \alpha_{1,q,t} < \ldots < \alpha_{\bar r_{q,t},q,t} < \alpha_{\bar r_{q,t}+1,q,t}$
$Q_{r,q,t}$: the $r$-th lifted uncertain parameter for $\xi_{q,t}$ (an indicator function of $\xi_{q,t}$),
$$Q_{r,q,t}(\xi_{q,t}) = \begin{cases} 1, & \text{if } \xi_{q,t} \ge \alpha_{r,q,t} \\ 0, & \text{if } \xi_{q,t} < \alpha_{r,q,t} \end{cases}$$
$Q_{q,t}$: vector of all lifted parameters for $\xi_{q,t}$: $[Q_{1,q,t}, \ldots, Q_{\bar r_{q,t},q,t}]$
$Q_t$: vector of all lifted parameters for $\xi_t$: $[Q_{1,t}, \ldots, Q_{\bar q_t,t}]$
$Q_{[t]}$: vector of all lifted parameters for $\xi_{[t]}$: $[1, Q_1, \ldots, Q_t]$; the first element 1 is used for the intercept term while implementing the linear decision rule
$\xi'_{q,t}$: vector of overall (original + lifted) uncertainty related to parameter $\xi_{q,t}$: $[\xi_{q,t}, Q_{q,t}]$
$\xi'_t$: vector of overall uncertainty for stage $t$: $[\xi'_{1,t}, \ldots, \xi'_{\bar q_t,t}]$
$\xi'_{[t]}$: vector of overall uncertainty from stage 1 to $t$: $[1, \xi'_1, \ldots, \xi'_t]$
$\xi'$: vector of overall uncertainty from stage 1 to $T$: $[1, \xi'_1, \ldots, \xi'_T]$
$\Xi'_{p,q,t}$: the $p$-th line segment of the lifted uncertainty set for $\xi_{q,t}$, $p = 1, \ldots, \bar r_{q,t} + 1$
$\Xi'_{q,t}$: the lifted uncertainty set (nonconvex) for $\xi_{q,t}$
$\Xi'$: the lifted uncertainty set (nonconvex) for $\xi'$
$\nu_{i,p,q,t}$: the two extreme points of $\Xi'_{p,q,t}$, $i = 1, 2$:
$$\nu_{1,p,q,t} = [\alpha_{p-1,q,t}, \underbrace{1, \ldots, 1}_{p-1 \text{ times}}, \underbrace{0, \ldots, 0}_{\bar r_{q,t}-p+1 \text{ times}}], \quad p = 1, \ldots, \bar r_{q,t} + 1$$
$$\nu_{2,p,q,t} = [\alpha_{p,q,t}, \underbrace{1, \ldots, 1}_{p-1 \text{ times}}, \underbrace{0, \ldots, 0}_{\bar r_{q,t}-p+1 \text{ times}}], \quad p = 1, \ldots, \bar r_{q,t} + 1$$
$\eta_{i,p,q,t}$: scalar, coefficient of the extreme points in the convex hull formulation
Figure 4 illustrates the lifted uncertainty set $\Xi'_{q,t}$ for a single uncertain parameter $\xi_{q,t}$ based on 1 and 2 breakpoints. It is clear that the lifted uncertainty set is nonconvex, since it contains disconnected pieces $(\Xi'_{1,q,t}, \ldots, \Xi'_{\bar r_{q,t}+1,q,t})$ in a higher dimensional space.
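As a concrete illustration of the indicator lifting, a minimal Python sketch for one parameter (the function name is ours; the breakpoint values match those used later in the illustrating example):

```python
def lift(xi, breakpoints):
    """Return the lifted vector [xi, Q_1(xi), ..., Q_rbar(xi)] for one
    uncertain parameter, with Q_r(xi) = 1 if xi >= alpha_r, else 0."""
    return [xi] + [1 if xi >= a else 0 for a in breakpoints]

# One parameter on [0, 3] with breakpoints alpha = (1, 2): 3 subintervals
print(lift(0.5, [1, 2]))  # [0.5, 0, 0]
print(lift(1.5, [1, 2]))  # [1.5, 1, 0]
print(lift(2.5, [1, 2]))  # [2.5, 1, 1]
```

Each realization of the parameter lands on exactly one of the disconnected line segments of the lifted set, which is the nonconvexity visible in Figure 4.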
In addition, projection matrices $P_{\xi[t]} \in \mathbb{R}^{|\xi_{[t]}| \times |\xi'|}$ and $P_{Q[t]} \in \mathbb{R}^{|Q_{[t]}| \times |\xi'|}$ are used in order to obtain $\xi_{[t]}$ and $Q_{[t]}(\xi)$ from the overall uncertainty vector, as follows:
$$\xi_{[t]} = P_{\xi[t]} \xi' \quad t = 1, \ldots, T \tag{5a}$$
$$Q_{[t]}(\xi) = P_{Q[t]} \xi' \quad t = 1, \ldots, T \tag{5b}$$
Based on the above notation and the original uncertainty set definition in Equation (2), the lifted nonconvex uncertainty set for $\xi'$ can be written as:
$$\Xi' = \{\xi' : P_{\xi[T]} \xi' \in \Xi;\; \xi'_{q,t} \in \Xi'_{q,t},\; t = 1, \ldots, T,\; q = 1, \ldots, \bar q_t\} \tag{6}$$
The nonconvexity of the lifted uncertainty set poses challenges to the optimization problem. The lifted set is therefore convexified such that the semi-infinite constraints can be addressed. Given the $2(\bar r_{q,t} + 1)$ extreme points, the convex hull of $\Xi'_{q,t}$ can be constructed as:
$$\mathrm{conv}(\Xi'_{q,t}) = \left\{ \xi'_{q,t} = [\xi_{q,t}, Q_{q,t}] \;:\; \exists\, \eta_{i,p,q,t},\;\; \xi'_{q,t} = \sum_{p=1}^{\bar r_{q,t}+1} \sum_{i=1}^{2} \eta_{i,p,q,t}\, \nu_{i,p,q,t},\;\; \sum_{p=1}^{\bar r_{q,t}+1} \sum_{i=1}^{2} \eta_{i,p,q,t} = 1,\;\; \eta_{i,p,q,t} \ge 0,\; i = 1, 2;\; p = 1, \ldots, \bar r_{q,t}+1 \right\}$$
Based on the above convex hull, we define the following convex overestimation of the overall uncertainty set $\Xi'$ defined in Equation (6):
$$\hat\Xi' = \{(\xi', \eta) : P_{\xi[T]} \xi' \in \Xi;\; (\xi'_{q,t}, \eta_{q,t}) \in \mathrm{conv}(\Xi'_{q,t}),\; t = 1, \ldots, T,\; q = 1, \ldots, \bar q_t\} \tag{8}$$
Notice that the convex hull of $\Xi'$, $\mathrm{conv}(\Xi')$, is a subset of the above overestimation. Figure 5 illustrates the relation between $\mathrm{conv}(\Xi')$ and the overestimation $\hat\Xi'$. The left figure illustrates $\Xi'$ according to Equation (6). $\Xi'$ is obtained by intersecting the original uncertainty set $\Xi$ (the black triangle) with the nonconvex sets $\Xi'_1$ and $\Xi'_2$, denoted by the discontinuous red line segments. Notice that the sets $\Xi'_1$ and $\Xi'_2$ are at least two dimensional, where the dimension depends on the number of breakpoints used in the definition of the lifted set; however, for illustration purposes, the dimension of $\Xi'_1$ and $\Xi'_2$ is taken to be one, since using two dimensions on each axis would make the visualization impossible. The blue shaded area shows $\mathrm{conv}(\Xi')$. In the right figure, the blue triangle corresponds to the overestimation set $\hat\Xi'$ (Equation (8)), which is obtained by intersecting the original uncertainty set $\Xi$ with $\mathrm{conv}(\Xi'_1) \times \mathrm{conv}(\Xi'_2)$. As the figure shows, $\hat\Xi'$ is an overestimation of $\mathrm{conv}(\Xi')$.
Finally, for simplicity in derivation, we project the polyhedral uncertainty set $\hat\Xi'$ onto the space of $\xi'$ and compactly write it as:
$$\hat\Xi' = \{\xi' : J'\xi' \le h'\} \tag{9}$$
where $J'$ and $h'$ are a constant matrix and vector calculated from the breakpoint values and the original uncertainty set parameters $J, h$.
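The extreme points $\nu_{1,p}$ and $\nu_{2,p}$ above can be generated mechanically from the breakpoints; a short sketch for a single parameter (helper name is ours):

```python
def segment_extreme_points(lb, ub, breakpoints):
    """Extreme points (nu_1p, nu_2p) of each line segment of the lifted
    set for one parameter: [alpha_{p-1}, 1,...,1, 0,...,0] and
    [alpha_p, 1,...,1, 0,...,0], each with p-1 leading ones."""
    alphas = [lb] + list(breakpoints) + [ub]   # alpha_0, ..., alpha_{rbar+1}
    rbar = len(breakpoints)
    pts = []
    for p in range(1, rbar + 2):               # p = 1, ..., rbar + 1
        lifted = [1] * (p - 1) + [0] * (rbar - p + 1)
        pts.append(([alphas[p - 1]] + lifted, [alphas[p]] + lifted))
    return pts

# One parameter on [0, 3] with breakpoints (1, 2)
for v1, v2 in segment_extreme_points(0, 3, [1, 2]):
    print(v1, v2)
# [0, 0, 0] [1, 0, 0]
# [1, 1, 0] [2, 1, 0]
# [2, 1, 1] [3, 1, 1]
```

For $\xi_1 \in [0, 3]$ with breakpoints $(1, 2)$, this reproduces the six vertices that appear in the convex hulls of the illustrating example.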
Illustrating example (cont.)
In this section, the lifting method is applied to the illustrating example. Two breakpoints ($r = 1, 2$) are considered for each uncertain parameter:
$$\text{for } \xi_1:\;\; \alpha_{1,1,1} = 1,\; \alpha_{2,1,1} = 2 \qquad \text{for } \xi_2:\;\; \alpha_{1,1,2} = 2,\; \alpha_{2,1,2} = 4$$
The lifted uncertainty vector is formulated as:
$$\xi' = \left[1,\; \xi_1,\; Q_{1,1,1}(\xi_1),\; Q_{2,1,1}(\xi_1),\; \xi_2,\; Q_{1,1,2}(\xi_2),\; Q_{2,1,2}(\xi_2)\right]$$
The 0–1 indicator functions are defined as:
$$Q_{1,1,1}(\xi_1) = \begin{cases} 1, & \text{if } \xi_1 \ge 1 \\ 0, & \text{if } \xi_1 < 1 \end{cases} \qquad Q_{2,1,1}(\xi_1) = \begin{cases} 1, & \text{if } \xi_1 \ge 2 \\ 0, & \text{if } \xi_1 < 2 \end{cases}$$
$$Q_{1,1,2}(\xi_2) = \begin{cases} 1, & \text{if } \xi_2 \ge 2 \\ 0, & \text{if } \xi_2 < 2 \end{cases} \qquad Q_{2,1,2}(\xi_2) = \begin{cases} 1, & \text{if } \xi_2 \ge 4 \\ 0, & \text{if } \xi_2 < 4 \end{cases}$$
The associated projection matrices and the corresponding (original or lifted) uncertainty vectors are:
$$P_{\xi[1]} = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 \end{bmatrix}, \quad \xi_{[1]} = P_{\xi[1]} \xi' = [1, \xi_1]$$
$$P_{\xi[2]} = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 1 & 0 & 0 \end{bmatrix}, \quad \xi_{[2]} = P_{\xi[2]} \xi' = [1, \xi_1, \xi_2]$$
$$P_{Q[1]} = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 & 0 \end{bmatrix}, \quad Q_{[1]}(\xi) = P_{Q[1]} \xi' = [1, Q_{1,1,1}, Q_{2,1,1}]$$
$$P_{Q[2]} = \begin{bmatrix} 1 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 \end{bmatrix}, \quad Q_{[2]}(\xi) = P_{Q[2]} \xi' = [1, Q_{1,1,1}, Q_{2,1,1}, Q_{1,1,2}, Q_{2,1,2}]$$
The lifted uncertainty set is defined as $\Xi' = \{\xi' \in \mathbb{R}^{7} : P_{\xi[2]} \xi' \in \Xi,\; \xi'_{q,t} \in \Xi'_{q,t}\}$, where $\Xi = \{\xi : \xi_1 \in [0, 3], \xi_2 \in [0, 6]\}$. Next, the convex hull $\mathrm{conv}(\Xi'_{q,t})$ for each single lifted uncertain parameter is constructed. For simplicity of presentation, $\eta_{i,p,1,1}$ and $\eta_{i,p,1,2}$ are represented by $\eta_{p,i}$ and $\eta'_{p,i}$ in the following equations, respectively.
$$\mathrm{conv}(\Xi'_{1,1}) = \left\{ \xi'_{1,1} : \begin{array}{l} \eta_{1,1}, \eta_{1,2}, \eta_{2,1}, \eta_{2,2}, \eta_{3,1}, \eta_{3,2} \ge 0, \quad \eta_{1,1} + \eta_{1,2} + \eta_{2,1} + \eta_{2,2} + \eta_{3,1} + \eta_{3,2} = 1, \\[4pt] \begin{bmatrix} \xi_1 \\ Q_{1,1,1} \\ Q_{2,1,1} \end{bmatrix} = \eta_{1,1} \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix} + \eta_{1,2} \begin{bmatrix} \alpha_{1,1,1} \\ 0 \\ 0 \end{bmatrix} + \eta_{2,1} \begin{bmatrix} \alpha_{1,1,1} \\ 1 \\ 0 \end{bmatrix} + \eta_{2,2} \begin{bmatrix} \alpha_{2,1,1} \\ 1 \\ 0 \end{bmatrix} + \eta_{3,1} \begin{bmatrix} \alpha_{2,1,1} \\ 1 \\ 1 \end{bmatrix} + \eta_{3,2} \begin{bmatrix} 3 \\ 1 \\ 1 \end{bmatrix} \end{array} \right\}$$
$$\mathrm{conv}(\Xi'_{1,2}) = \left\{ \xi'_{1,2} : \begin{array}{l} \eta'_{1,1}, \eta'_{1,2}, \eta'_{2,1}, \eta'_{2,2}, \eta'_{3,1}, \eta'_{3,2} \ge 0, \quad \eta'_{1,1} + \eta'_{1,2} + \eta'_{2,1} + \eta'_{2,2} + \eta'_{3,1} + \eta'_{3,2} = 1, \\[4pt] \begin{bmatrix} \xi_2 \\ Q_{1,1,2} \\ Q_{2,1,2} \end{bmatrix} = \eta'_{1,1} \begin{bmatrix} 0 \\ 0 \\ 0 \end{bmatrix} + \eta'_{1,2} \begin{bmatrix} \alpha_{1,1,2} \\ 0 \\ 0 \end{bmatrix} + \eta'_{2,1} \begin{bmatrix} \alpha_{1,1,2} \\ 1 \\ 0 \end{bmatrix} + \eta'_{2,2} \begin{bmatrix} \alpha_{2,1,2} \\ 1 \\ 0 \end{bmatrix} + \eta'_{3,1} \begin{bmatrix} \alpha_{2,1,2} \\ 1 \\ 1 \end{bmatrix} + \eta'_{3,2} \begin{bmatrix} 6 \\ 1 \\ 1 \end{bmatrix} \end{array} \right\}$$
The overall convex hull $\mathrm{conv}(\Xi')$ is a subset of the following overestimated set: $\hat\Xi' = \{\xi' \in \mathbb{R}^{7} : P_{\xi[2]} \xi' \in \Xi,\; \xi'_{q,t} \in \mathrm{conv}(\Xi'_{q,t}),\; q = 1,\; t = 1, 2\}$.

3.2. Binary Decision Rule

To approximate the optimal adaptive binary solution, the binary decision rule is employed. It enforces a linear relation with respect to the lifted uncertainty (the indicator functions). The binary decision $y_t$ depends on the uncertainty up to time stage $t$. In the binary decision rule, $y_t$ is approximated by a linear combination of the indicator functions from stage 1 to stage $t$, $Q_{[t]}(\xi)$:
$$y_t(\xi_{[t]}) = Y_t Q_{[t]}(\xi) \quad t = 1, \ldots, T$$
where the coefficients only take the following possible integer values: $Y_t \in \{-1, 0, 1\}^{|y_t| \times |Q_{[t]}|}$. Furthermore, the binary restriction on the original variables needs to be enforced, hence:
$$0 \le Y_t Q_{[t]}(\xi) \le e \quad \forall \xi \in \Xi,\; t = 1, \ldots, T$$
where $e$ is the vector of all ones. By applying the binary decision rule to the general stochastic formulation, a semi-infinite optimization problem with a finite number of variables but an infinite number of constraints is obtained:
$$\min \; \mathbb{E}_{\xi}\left[\sum_t (D_t \xi_{[t]})^\top Y_t Q_{[t]}(\xi)\right]$$
$$\text{s.t.} \quad \sum_{\tau=1}^{t} B_{t,\tau} Y_\tau Q_{[\tau]}(\xi) \le E_t \xi_{[t]} \quad \forall \xi \in \Xi,\; t = 1, \ldots, T$$
$$0 \le Y_t Q_{[t]}(\xi) \le e \quad \forall \xi \in \Xi,\; t = 1, \ldots, T$$
In the above model, the constraints contain nonconvex indicator functions of the original uncertain parameters. Using Equations (5a) and (5b), the stochastic model can be written as the following model, whose constraints are linear with respect to the lifted uncertainty $\xi'$:
$$\min \; \mathbb{E}_{\xi'}\left[\sum_t (D_t P_{\xi[t]} \xi')^\top Y_t P_{Q[t]} \xi'\right]$$
$$\text{s.t.} \quad \sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} \xi' \le E_t P_{\xi[t]} \xi' \quad \forall \xi' \in \Xi',\; t = 1, \ldots, T$$
$$0 \le Y_t P_{Q[t]} \xi' \le e \quad \forall \xi' \in \Xi',\; t = 1, \ldots, T$$
Since the lifted uncertainty set $\Xi'$ is nonconvex, the problem is still intractable. The problem can be conservatively approximated by using the convex overestimation $\hat\Xi'$ of the lifted uncertainty set, as defined in Equation (9):
$$\min \; \mathbb{E}_{\xi'}\left[\sum_t (D_t P_{\xi[t]} \xi')^\top Y_t P_{Q[t]} \xi'\right] \tag{14a}$$
$$\text{s.t.} \quad \sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} \xi' \le E_t P_{\xi[t]} \xi' \quad \forall \xi' \in \hat\Xi',\; t = 1, \ldots, T \tag{14b}$$
$$0 \le Y_t P_{Q[t]} \xi' \le e \quad \forall \xi' \in \hat\Xi',\; t = 1, \ldots, T \tag{14c}$$
Since constraints (14b) and (14c) are linear with respect to $\xi'$, and the uncertainty set $\hat\Xi'$ is a polyhedral set, the duality theorem can be used to convert this semi-infinite problem into its deterministic robust counterpart. As an example, constraint (14b) can be written as
$$\left(\sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} - E_t P_{\xi[t]}\right) \xi' \le 0 \quad \forall \xi' \in \hat\Xi',\; t = 1, \ldots, T$$
or, equivalently, as follows, based on the uncertainty set definition in Equation (9):
$$\max_{J'\xi' \le h'} \left(\sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} - E_t P_{\xi[t]}\right) \xi' \le 0 \quad t = 1, \ldots, T$$
Its deterministic counterpart is obtained as follows after applying duality to the inner maximization problem:
$$h'^\top \theta_t \le 0 \quad \forall t$$
$$J'^\top \theta_t = \left(\sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} - E_t P_{\xi[t]}\right)^\top \quad \forall t$$
$$\theta_t \ge 0 \quad \forall t$$
Similarly, constraint (14c) can be divided into two parts, $0 \le Y_t P_{Q[t]} \xi'$ and $Y_t P_{Q[t]} \xi' \le e$, and the robust counterpart is derived accordingly. The overall deterministic counterpart formulation is given by
$$\min \; \sum_t \mathrm{Tr}\left(\mathbb{E}[\xi'\xi'^\top]\, P_{\xi[t]}^\top D_t^\top Y_t P_{Q[t]}\right) \tag{15a}$$
$$\text{s.t.} \quad h'^\top \theta_t \le 0 \quad \forall t \tag{15b}$$
$$J'^\top \theta_t = \left(\sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} - E_t P_{\xi[t]}\right)^\top \quad \forall t \tag{15c}$$
$$\theta_t \ge 0 \quad \forall t \tag{15d}$$
$$h'^\top \lambda_t \le 0 \quad \forall t \tag{15e}$$
$$J'^\top \lambda_t = -\left(Y_t P_{Q[t]}\right)^\top \quad \forall t \tag{15f}$$
$$\lambda_t \ge 0 \quad \forall t \tag{15g}$$
$$h'^\top \phi_t \le e^\top \quad \forall t \tag{15h}$$
$$J'^\top \phi_t = \left(Y_t P_{Q[t]}\right)^\top \quad \forall t \tag{15i}$$
$$\phi_t \ge 0 \quad \forall t \tag{15j}$$
$$Y_t \in \{-1, 0, 1\}^{|y_t| \times |Q_{[t]}|}$$
where $\mathrm{Tr}(\cdot)$ is the trace operator and $\mathbb{E}[\xi'\xi'^\top]$ can be derived from the known distribution of the uncertainty.
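The duality step underlying this counterpart can be sanity-checked numerically. The sketch below uses a plain box set written as $J'\xi' \le h'$ with $J' = [I; -I]$, $h' = [u; -l]$ (a simplifying assumption; $\hat\Xi'$ in the text is a general polyhedron): the worst-case value of a linear function over the box equals the value certified by an explicitly constructed dual feasible point, illustrating why the inner maximization can be replaced by dual feasibility conditions.

```python
def worst_case_primal(c, lo, hi):
    """max c.xi over the box lo <= xi <= hi (attained at a vertex)."""
    return sum(ci * (hi[i] if ci > 0 else lo[i]) for i, ci in enumerate(c))

def worst_case_dual(c, lo, hi):
    """Dual bound h.theta for the box written as J xi <= h with
    J = [I; -I], h = [hi; -lo]; theta = (max(c, 0), max(-c, 0))
    satisfies J^T theta = c and theta >= 0."""
    th_up = [max(ci, 0.0) for ci in c]    # multipliers on  xi <= hi
    th_lo = [max(-ci, 0.0) for ci in c]   # multipliers on -xi <= -lo
    return (sum(h * t for h, t in zip(hi, th_up))
            + sum(-l * t for l, t in zip(lo, th_lo)))

c, lo, hi = [2.0, -1.0, 0.5], [0.0, 0.0, -1.0], [3.0, 6.0, 1.0]
print(worst_case_primal(c, lo, hi))  # 6.5
print(worst_case_dual(c, lo, hi))    # 6.5 -- strong LP duality holds
```

For a box the optimal dual multipliers are available in closed form; for the general polyhedral $\hat\Xi'$ the same equality is what allows constraints (15b)–(15j) to replace the semi-infinite constraints exactly.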
Illustrating example (cont.)
Table 2 presents the results of the above lifting method for different numbers of breakpoints. For the 1-breakpoint case, the breakpoint values are set as 1.5 for $\xi_1$ and 3 for $\xi_2$. For the 2-breakpoint case, the breakpoint values are set as $(1, 2)$ for $\xi_1$ and $(2, 4)$ for $\xi_2$. For the 9-breakpoint case, the breakpoint values are set as $(0.3, 0.6, 0.9, \ldots, 2.7)$ for $\xi_1$ and $(0.6, 1.2, \ldots, 5.4)$ for $\xi_2$. The objective did not improve beyond $-1.333$ even with nine breakpoints. This observation indicates that the lifting solution quality may be restricted. Using the definition of the lifting method for the binary variables, we can plot the $y_1(\xi_1)$ and $y_2(\xi_1, \xi_2)$ variables. For the case of 2 breakpoints, the model solution is $Y_1 = [0, 1, 0]$ and $Y_2 = [0, 0, 0, 1, 0]$, and the adaptive binary variables are expressed as:
$$y_1(\xi_1) = Y_1 Q_{[1]} = [0, 1, 0] \times [1, Q_{1,1,1}(\xi_1), Q_{2,1,1}(\xi_1)]^\top = Q_{1,1,1}(\xi_1)$$
$$y_2(\xi_1, \xi_2) = Y_2 Q_{[2]} = [0, 0, 0, 1, 0] \times [1, Q_{1,1,1}(\xi_1), Q_{2,1,1}(\xi_1), Q_{1,1,2}(\xi_2), Q_{2,1,2}(\xi_2)]^\top = Q_{1,1,2}(\xi_2)$$
As shown in Figure 6, the binary variable y 2 is only a function of parameter ξ 2 which indicates a restricted solution quality.
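The quality of this fixed-breakpoint rule can be checked by simulating $y_1 = Q_{1,1,1}(\xi_1)$ and $y_2 = Q_{1,1,2}(\xi_2)$ under the uniform distributions of the example; a Monte Carlo sketch (sample size and seed are arbitrary choices of ours):

```python
import random

def Q(xi, alpha):
    """0-1 indicator lifting function."""
    return 1 if xi >= alpha else 0

def objective_mc(n=200_000, seed=0):
    """Monte Carlo estimate of E[-y1 - y2] for the rule
    y1 = Q_{1,1,1}(xi1) with alpha = 1, y2 = Q_{1,1,2}(xi2) with alpha = 2,
    under xi1 ~ U[0, 3] and xi2 ~ U[0, 6]."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n):
        xi1, xi2 = rng.uniform(0, 3), rng.uniform(0, 6)
        total -= Q(xi1, 1) + Q(xi2, 2)
    return total / n

print(objective_mc())  # close to -(2/3 + 2/3) = -4/3
```

The estimate is close to $-(P(\xi_1 \ge 1) + P(\xi_2 \ge 2)) = -(2/3 + 2/3) = -4/3 \approx -1.333$, consistent with the objective reported in Table 2 and well short of the $-1.6$ scenario-tree benchmark.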

3.3. Breakpoint Optimization for Lifting Method

In this section, we assume that the locations of the breakpoints are not pre-fixed but are optimized instead. Based on Equation (9), the breakpoint information is contained in the parameters $J', h'$, which can be formulated as:
$$J' = J'_0 + J'_1 \alpha \qquad h' = h'_0 + h'_1 \alpha$$
where $J'_0, J'_1, h'_0, h'_1$ are known constant matrices/vectors and $\alpha$ is the vector collecting the location information of all breakpoints. The above formulated $J'$ and $h'$ are substituted into the dual counterpart formulation (Equations (15a)–(15j)) to complete the variable breakpoint lifting technique:
$$\min \; \sum_t \mathrm{Tr}\left(\mathbb{E}[\xi'\xi'^\top]\, P_{\xi[t]}^\top D_t^\top Y_t P_{Q[t]}\right)$$
$$\text{s.t.} \quad (h'_0 + h'_1\alpha)^\top \theta_t \le 0 \quad \forall t$$
$$(J'_0 + J'_1\alpha)^\top \theta_t = \left(\sum_{\tau=1}^{t} B_{t,\tau} Y_\tau P_{Q[\tau]} - E_t P_{\xi[t]}\right)^\top \quad \forall t$$
$$\theta_t \ge 0 \quad \forall t$$
$$(h'_0 + h'_1\alpha)^\top \lambda_t \le 0 \quad \forall t$$
$$(J'_0 + J'_1\alpha)^\top \lambda_t = -\left(Y_t P_{Q[t]}\right)^\top \quad \forall t$$
$$\lambda_t \ge 0 \quad \forall t$$
$$(h'_0 + h'_1\alpha)^\top \phi_t \le e^\top \quad \forall t$$
$$(J'_0 + J'_1\alpha)^\top \phi_t = \left(Y_t P_{Q[t]}\right)^\top \quad \forall t$$
$$\phi_t \ge 0 \quad \forall t$$
$$Y_t \in \{-1, 0, 1\}^{|y_t| \times |Q_{[t]}|}$$
Notice that the expectation of the lifted uncertainty $\mathbb{E}(\xi')$ also depends on the locations of the variable breakpoints, so $\mathbb{E}[\xi'\xi'^\top]$ is also a function of $\alpha$ (which can be analytically evaluated based on the distribution information). The resulting model is a mixed integer nonlinear optimization problem, where the integer variables are the binary decision rule coefficients $Y_t$, and the continuous variables include the dual variables $\theta, \lambda, \phi$ and the breakpoint locations $\alpha$.
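For a uniformly distributed parameter, the entries of $\mathbb{E}[\xi'\xi'^\top]$ involving that parameter indeed have simple closed forms in the breakpoint locations, e.g. $\mathbb{E}[Q_r] = (b - \alpha_r)/(b - a)$ and $\mathbb{E}[\xi Q_r] = (b^2 - \alpha_r^2)/(2(b - a))$ for $\xi \sim U[a, b]$ with $a \le \alpha_r \le b$. A small sketch (the helper name is ours):

```python
def lifted_moments(a, b, alphas):
    """Closed-form first and second moments of the lifted vector
    [xi, Q_1(xi), ..., Q_rbar(xi)] for xi ~ Uniform[a, b]: every entry of
    E[xi' xi'^T] involving this parameter is a function of the
    breakpoint locations alphas."""
    w = b - a
    E_xi = (a + b) / 2
    E_xi2 = (a * a + a * b + b * b) / 3
    E_Q = [(b - al) / w for al in alphas]                   # P(xi >= alpha_r)
    E_xiQ = [(b * b - al * al) / (2 * w) for al in alphas]  # E[xi * Q_r]
    E_QQ = [[(b - max(ar, as_)) / w for as_ in alphas] for ar in alphas]
    return E_xi, E_xi2, E_Q, E_xiQ, E_QQ

# xi1 ~ U[0, 3] with breakpoints alpha = (1, 2)
E_xi, E_xi2, E_Q, E_xiQ, E_QQ = lifted_moments(0.0, 3.0, [1.0, 2.0])
print(E_Q)    # [2/3, 1/3]
print(E_xiQ)  # [4/3, 5/6]
```

Because terms such as $\alpha_r^2$ appear, substituting these expressions into the objective makes the breakpoint-optimization model nonlinear in $\alpha$, which is why the variable-breakpoint formulation becomes an MINLP.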
In this work, all MINLP problems were solved using the ANTIGONE 1.0 solver [30] on the GAMS 25 platform on a workstation (Intel Xeon Dual 20 Core 2.0 GHz Processor, 128 GB DDR4 ECC RAM) with a time resource limit of 10 h. The reported solutions have zero optimality gap unless noted otherwise. Using a single variable breakpoint in the illustrating example, the obtained objective ($-1.333$) is better than that obtained using a single fixed breakpoint ($-1.000$). This shows the advantage of variable breakpoint lifting compared to the fixed breakpoint method. However, in this example, using more than one variable breakpoint did not further improve the objective, as shown in Table 3. This observation shows that the lifting method's solution quality is restricted.
The optimized breakpoint values are summarized below:
  • For the 1-breakpoint case, 1 for $\xi_1$, 2 for $\xi_2$
  • For the 2-breakpoint case, $(0, 1)$ for $\xi_1$, $(0, 2)$ for $\xi_2$
  • For the 9-breakpoint case, $(1.5, 1.5, 1.5, 1.5, 1.5, 1.5, 3, 3, 3)$ for $\xi_1$, $(1, 1, 1, 1, 2.52, 3.6, 4.2, 4.8, 5.4)$ for $\xi_2$
Notice that the variable breakpoints are not forced to be different from each other, so the solution contains overlapping breakpoints as the number of breakpoints increases.

4. Finite Adaptability with Uncertainty Set Partitioning

4.1. Uncertainty Set Partitioning

In this method, the uncertainty set is divided into small partitions and a constant binary decision is assumed for each partition. Similarly to the lifting method, we define breakpoints for each uncertain parameter, and each subset is a box-type uncertainty set. Figure 7 illustrates the partitioning of a two-dimensional rectangular uncertainty set and the corresponding scenario tree, with each branch representing a subinterval of a parameter. In this figure, the interval of each uncertain parameter is divided into three segments, such that there are three nodes in the first stage and nine nodes in the second stage. The following notation is used in this method:
$s$: a scenario; each scenario is represented by a matrix structure (allowing empty elements) whose element $s_{q,t}$ gives the subinterval index of the uncertain parameter $\xi_{q,t}$ under this scenario
$\bar r_{q,t}$: scalar, number of breakpoints for $\xi_{q,t}$
$\alpha_{s_{q,t},q,t}$: scalar, upper bound value of $\xi_{q,t}$ under scenario $s$
As an example, consider a two-stage problem: the first stage has uncertain parameters ξ 1 , 1 , ξ 2 , 1 and the second stage has one uncertain parameter ξ 1 , 2 . Assume no breakpoint is applied to ξ 1 , 1 , one breakpoint is applied to ξ 2 , 1 , and two breakpoints are applied to ξ 1 , 2 . Then, the set S is:
\[
S = \left\{
\begin{bmatrix} 1 & 1 \\ 1 & * \end{bmatrix},
\begin{bmatrix} 1 & 1 \\ 2 & * \end{bmatrix},
\begin{bmatrix} 1 & 2 \\ 1 & * \end{bmatrix},
\begin{bmatrix} 1 & 2 \\ 2 & * \end{bmatrix},
\begin{bmatrix} 1 & 3 \\ 1 & * \end{bmatrix},
\begin{bmatrix} 1 & 3 \\ 2 & * \end{bmatrix}
\right\}
\]
where “*” denotes that the corresponding element does not exist.
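As a sketch, the scenario set S can be enumerated as the Cartesian product of the subinterval indices, one per uncertain parameter. The Python representation below is our own illustration (the function name and the dictionary encoding are assumptions): omitted dictionary keys play the role of the "*" entries.

```python
from itertools import product

def enumerate_scenarios(num_breakpoints):
    """Enumerate the scenario set S.

    num_breakpoints[(q, t)] is the number of interior breakpoints for
    parameter xi_{q,t}; each parameter contributes (breakpoints + 1)
    subintervals.  A parameter that does not exist at a stage is simply
    omitted from the dictionary (the "*" entries in the text).
    """
    keys = sorted(num_breakpoints)
    ranges = [range(1, num_breakpoints[k] + 2) for k in keys]
    return [dict(zip(keys, combo)) for combo in product(*ranges)]

# The example from the text: no breakpoint on xi_{1,1}, one breakpoint
# on xi_{2,1}, two breakpoints on xi_{1,2}  ->  1 * 2 * 3 = 6 scenarios.
S = enumerate_scenarios({(1, 1): 0, (2, 1): 1, (1, 2): 2})
```

Each dictionary in `S` corresponds to one matrix in the set above, mapping (q, t) to the subinterval index s_{q,t}.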
For each scenario s ∈ S, the uncertainty set is a hyper-rectangle (a subset of the original set Ξ):
\[
\Xi_s = \{\xi : \alpha_{s_{q,t}-1,q,t} \le \xi_{q,t} \le \alpha_{s_{q,t},q,t};\; t = 1,\dots,T,\; q = 1,\dots,\bar{q}_t \}
\]
which can be written compactly as
\[
\Xi_s = \{\xi : W_s \xi \le V_s \} \tag{17}
\]
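The compact form can be sketched by stacking the box bounds into inequality form. The helper below is a minimal illustration (the function name and the ≤ orientation of W_s ξ ≤ V_s are our conventions):

```python
import numpy as np

def box_to_Wv(lower, upper):
    """Write the box {xi : lower <= xi <= upper} as {xi : W xi <= v}.

    Stacking  I xi <= upper  and  -I xi <= -lower  yields the compact
    description used for each partition Xi_s.
    """
    n = len(lower)
    W = np.vstack([np.eye(n), -np.eye(n)])
    v = np.concatenate([np.asarray(upper, float), -np.asarray(lower, float)])
    return W, v

# Example partition with xi_1 in [0, 1] and xi_2 in [2, 4].
W, v = box_to_Wv([0.0, 2.0], [1.0, 4.0])
```

A point inside the box satisfies all 2n rows of W ξ ≤ v; a point outside violates at least one.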

4.2. Finite Adaptability

Next, the finite adaptability method (also denoted as the "partitioning method" in this work) is applied to the original problem (1a)–(1c). The idea combines a scenario tree-based stochastic formulation with robust optimization. For each scenario, we enforce constraint satisfaction over an uncertainty subset as defined in Equation (17), instead of at a single point in the uncertainty space. The resulting model can be cast as:
\[
\min_{y} \; \sum_{s \in S} p_s \sum_{t=1}^{T} D_t \, \mathbb{E}_{\Xi_s}(\xi) \, y_{t,s}
\]
\[
\text{s.t.} \quad \sum_{\tau=1}^{t} A_{t\tau} \, y_{\tau,s} \ge E_t \xi_t \quad \forall s, t, \; \forall \xi \in \Xi_s
\]
\[
y_{t,s} = y_{t,s'} \quad \forall (t, s, s') \in S_P
\]
where the last constraint enforces non-anticipativity and S_P is the set of all scenario pairs that share the same path up to time t:
\[
S_P = \{ (t, s, s') : s_{q,\tau} = s'_{q,\tau}, \; \tau = 1, \dots, t, \; q = 1, \dots, \bar{q}_\tau \}
\]
Reformulating the semi-infinite constraint using linear programming duality, the corresponding robust scenario-based formulation is
\[
\begin{aligned}
\min_{y} \;& \sum_{s \in S} p_s \sum_{t=1}^{T} D_t \, \mathbb{E}_{\Xi_s}(\xi) \, y_{t,s} \\
\text{s.t.} \;& V_s^{\top} \theta_{t,s} \le \sum_{\tau=1}^{t} A_{t\tau} \, y_{\tau,s} && \forall s, t \\
& W_s^{\top} \theta_{t,s} = E_t && \forall s, t \\
& \theta_{t,s} \ge 0 && \forall s, t \\
& y_{t,s} = y_{t,s'} && \forall (t, s, s') \in S_P
\end{aligned}
\]
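The duality step above can be checked numerically on a single partition: by strong LP duality, the optimal value of the dual LP (min V_s^T θ subject to W_s^T θ = E_t, θ ≥ 0) equals the worst-case value max_{ξ ∈ Ξ_s} E_t ξ, so bounding the decision by the dual objective is sufficient for the semi-infinite constraint. A small self-contained check, with illustrative numbers of our own choosing:

```python
import numpy as np
from scipy.optimize import linprog

# Box partition Xi_s = {xi : 0 <= xi_1 <= 1, 2 <= xi_2 <= 4},
# written as W xi <= v with W = [I; -I], v = [1, 4, 0, -2].
W = np.vstack([np.eye(2), -np.eye(2)])
v = np.array([1.0, 4.0, 0.0, -2.0])
E = np.array([3.0, 1.0])  # an illustrative row of E_t

# Primal worst case: max E xi over the box (linprog minimizes, so negate).
primal = linprog(-E, A_ub=W, b_ub=v, bounds=[(None, None)] * 2)

# Dual certificate: min v^T theta  s.t.  W^T theta = E, theta >= 0.
dual = linprog(v, A_eq=W.T, b_eq=E, bounds=[(0, None)] * 4)

# Strong LP duality: the two optima coincide (here, 3*1 + 1*4 = 7).
```

Here the worst case is attained at the corner ξ = (1, 4), and the dual LP reproduces the same value 7 without enumerating corners.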
Illustrating example (cont.)
In this section, the partitioning method is applied to the illustrating example (3a)–(3d) and the results are presented. Figure 7 illustrates the partitioning and the corresponding scenario tree for the illustrating example.
The partitioning of the uncertainty set is based on two equally spaced breakpoints for each uncertain parameter: for ξ_{1,1} ∈ [0, 3], the breakpoints are set such that 0 < 1 < 2 < 3; for ξ_{1,2} ∈ [0, 6], they are set such that 0 < 2 < 4 < 6. Hence, S = {[1,1], [1,2], [1,3], [2,1], [2,2], [2,3], [3,1], [3,2], [3,3]}.
The non-anticipativity condition set S_P contains the following elements:
\[
\begin{aligned}
&(1, [1,1], [1,2]),\; (1, [1,1], [1,3]),\; (1, [1,2], [1,3]),\\
&(1, [2,1], [2,2]),\; (1, [2,1], [2,3]),\; (1, [2,2], [2,3]),\\
&(1, [3,1], [3,2]),\; (1, [3,1], [3,3]),\; (1, [3,2], [3,3])
\end{aligned}
\]
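These pairs can be generated mechanically from the definition of S_P. A minimal sketch (the list-of-lists scenario encoding is our own):

```python
from itertools import combinations, product

# Scenarios [i, j]: i = stage-1 subinterval of xi_1, j = stage-2
# subinterval of xi_2, with three subintervals for each parameter.
S = [list(s) for s in product([1, 2, 3], repeat=2)]

# Non-anticipativity pairs at t = 1: scenarios sharing the same stage-1
# subinterval must take the same stage-1 decision.
SP = [(1, s, sp) for s, sp in combinations(S, 2) if s[0] == sp[0]]
```

Three stage-1 groups with C(3, 2) = 3 pairs each give the nine elements listed above.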
The partitioning-based model is an MILP problem. Table 4 presents the results from the finite adaptability method. As the number of partitions increases, the objective improves until it approaches the optimal solution obtained from the scenario method (−1.594 for 99 branches per node). For comparison, the branches and breakpoints are evenly distributed in the scenario and finite adaptability methods, respectively. For 29 breakpoints in the partitioning method and 31 branches in the scenario method, the optimal objectives are −1.589 and −1.605, respectively. Figure 8 and Figure 9 illustrate the partitioning solutions for 2 and 29 breakpoints. As the figures demonstrate, there is a close match between the partitioning and scenario solutions.

4.3. Breakpoint Optimization in Partitioning Method

In the variable breakpoint partitioning technique, the locations of the breakpoints are not fixed a priori; they are optimized instead. The breakpoint locations enter the V_s matrix of Equation (17), which is parameterized as
\[
V_s = V_s^0 + V_s^1 \alpha \quad \forall s \in S
\]
where V_s^0 and V_s^1 are constant matrices. The corresponding robust scenario-based formulation is
\[
\begin{aligned}
\min_{y} \;& \sum_{s \in S} p_s \sum_{t=1}^{T} D_t \, \mathbb{E}_{\Xi_s}(\xi) \, y_{t,s} \\
\text{s.t.} \;& (V_s^0 + V_s^1 \alpha)^{\top} \theta_{t,s} \le \sum_{\tau=1}^{t} A_{t\tau} \, y_{\tau,s} && \forall s, t \\
& W_s^{\top} \theta_{t,s} = E_t && \forall s, t \\
& \theta_{t,s} \ge 0 && \forall s, t \\
& y_{t,s} = y_{t,s'} && \forall (t, s, s') \in S_P
\end{aligned}
\]
In this formulation, the probability of occurrence of each scenario depends on the breakpoint locations, since the length of each dimension of each partition depends on them. The probability p_s of each scenario and E_{Ξ_s}(ξ) are both functions of α and can be evaluated from the distribution of the uncertainty.
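For instance, under a uniform distribution a partition's probability is its relative length and its conditional mean is its midpoint. A one-dimensional sketch (the function name is ours; the multi-dimensional p_s is the product of the per-dimension probabilities):

```python
import numpy as np

def partition_stats(bounds, breakpoints):
    """Probability p_s and conditional mean of each 1-D partition.

    Assumes the uncertain parameter is uniform on [l, u], as in the
    illustrating example.  `breakpoints` holds the interior breakpoint
    locations alpha; for a uniform density each partition's probability
    is its length over (u - l) and its conditional mean is its midpoint.
    """
    l, u = bounds
    edges = np.concatenate([[l], np.sort(breakpoints), [u]])
    lengths = np.diff(edges)
    p = lengths / (u - l)
    mean = (edges[:-1] + edges[1:]) / 2.0
    return p, mean

# xi_1 uniform on [0, 3] with breakpoints at 1 and 2 (fixed setting).
p, m = partition_stats((0.0, 3.0), [1.0, 2.0])
```

Because the edges depend on α, both outputs are functions of the breakpoint locations, which is exactly what makes the variable breakpoint model nonlinear.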
Table 5 summarizes the results of the variable partitioning technique. For the same number of partitions, the variable partitioning method provides a better objective than the fixed breakpoint method. For instance, with 2 breakpoints per uncertain parameter, the variable and fixed methods' objectives are −1.500 and −1.444, respectively. The variable method even provides a better objective with just three breakpoints than the fixed method does with nine. However, for a large number of partitions, the fixed breakpoint method is much faster: it provides a better objective using 29 breakpoints in less than 1 s, whereas the variable method with 5 breakpoints requires about 18 min of run time.

4.4. Flexibility Comparison

We can observe from Table 2 that, for the illustrating problem, the lifting method's objective does not improve beyond −1.333, even for nine breakpoints, while the partitioning method provides a better objective of −1.444 with two breakpoints per parameter. In this section, we investigate why the lifting method leads to restricted solution quality compared to the partitioning method. For this purpose, the solution from the partitioning method is substituted into the decision rule solution from the lifting method. Note that the same breakpoints are applied in both methods (as shown in Figure 7). This results in a linear system of equations. If the system has no solution, the lifting method has restricted flexibility in the sense that it cannot reproduce the solution of the partitioning method.
Figure 8 presents the partitioning solution. In this problem, the intervals for ξ_1, ξ_2 are each divided into three equal pieces. Based on the binary decision rule equations:
\[
y_t = Y_t Q^{[t]}, \quad
Q^{[1]} = [1, Q_{1,1,1}, Q_{2,1,1}], \quad
Q^{[2]} = [1, Q_{1,1,1}, Q_{2,1,1}, Q_{1,1,2}, Q_{2,1,2}]
\]
For the three nodes at stage 1, the lifted uncertainty vector Q^{[1]} takes the following three values:
\[
\text{Node 1: } Y_1 [1, 0, 0]^{\top} = 0, \quad
\text{Node 2: } Y_1 [1, 1, 0]^{\top} = 1, \quad
\text{Node 3: } Y_1 [1, 1, 1]^{\top} = 1
\]
The system has a solution, Y_1 = [0, 1, 0]. For the nine nodes at stage 2, the lifted uncertainty vector takes nine different values (nodes 4 to 12):
\[
\begin{aligned}
&\text{Node 4: } Y_2 [1,0,0,0,0]^{\top} = 0, &&\text{Node 5: } Y_2 [1,0,0,1,0]^{\top} = 1, &&\text{Node 6: } Y_2 [1,0,0,1,1]^{\top} = 1,\\
&\text{Node 7: } Y_2 [1,1,0,0,0]^{\top} = 0, &&\text{Node 8: } Y_2 [1,1,0,1,0]^{\top} = 1, &&\text{Node 9: } Y_2 [1,1,0,1,1]^{\top} = 1,\\
&\text{Node 10: } Y_2 [1,1,1,0,0]^{\top} = 1, &&\text{Node 11: } Y_2 [1,1,1,1,0]^{\top} = 1, &&\text{Node 12: } Y_2 [1,1,1,1,1]^{\top} = 1
\end{aligned}
\]
No Y_2 satisfies this linear system (nine equations in five unknowns). This confirms that the lifting method's solution is restricted compared to the partitioning method; the limited solution flexibility is due to the affine decision rule over the lifted uncertainty.
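This infeasibility argument can be reproduced with a least-squares residual test on the two linear systems above: the residual is zero exactly when an exact solution exists.

```python
import numpy as np

# Stage-1 system: the three lifted vectors Q^[1] and target node values.
Q1 = np.array([[1, 0, 0],
               [1, 1, 0],
               [1, 1, 1]], dtype=float)
b1 = np.array([0.0, 1.0, 1.0])

# Stage-2 system: nine lifted vectors Q^[2] (nodes 4-12), five unknowns.
Q2 = np.array([[1, 0, 0, 0, 0],
               [1, 0, 0, 1, 0],
               [1, 0, 0, 1, 1],
               [1, 1, 0, 0, 0],
               [1, 1, 0, 1, 0],
               [1, 1, 0, 1, 1],
               [1, 1, 1, 0, 0],
               [1, 1, 1, 1, 0],
               [1, 1, 1, 1, 1]], dtype=float)
b2 = np.array([0, 1, 1, 0, 1, 1, 1, 1, 1], dtype=float)

def residual(A, b):
    """Least-squares residual norm; zero iff an exact solution exists."""
    x, *_ = np.linalg.lstsq(A, b, rcond=None)
    return np.linalg.norm(A @ x - b)

# Stage 1 is exactly solvable (Y1 = [0, 1, 0]); stage 2 is not.
```

The stage-1 residual vanishes while the stage-2 residual stays bounded away from zero, matching the argument in the text.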

5. Case Study: Inventory Control Problem

In this section, an inventory control problem with discrete ordering decisions, adapted from [27], is studied. The problem can be formulated as a multistage adaptive robust optimization problem with fixed recourse and can be described as follows. At the beginning of each time period t ∈ T = {1, …, T}, the product demand ξ_t is observed. This demand can be satisfied in two ways: (1) by pre-ordering at period t using at most N lots (binary variable z_{n,t} expresses whether lot n is ordered in time period t), each delivering a fixed quantity q_z at the beginning of period t for a unit cost of c_z; (2) by placing a recourse order during period t using at most N lots (binary variable y_{n,t} is introduced for this decision), each delivering immediately a fixed quantity q_y for a unit cost of c_y. The pre-ordering cost is always less than the immediate ordering cost (c_z < c_y). If the ordered quantity exceeds the demand, the excess units are stored in a warehouse at a unit holding cost of c_h and can be used to satisfy future demand. Furthermore, the cumulative volume of pre-orders ∑_{τ=1}^{t} ∑_{n=1}^{N} q_z z_{n,τ} cannot exceed the ordering budget B̄_t. The objective is to minimize the total ordering and holding costs over the planning horizon by determining the optimal decisions z_{n,t} and y_{n,t}(ξ^{[t]}). Equations (21a)–(21e) express the problem formulation.
\[
\begin{aligned}
\min \;& \mathbb{E}_{\xi}\!\left[ \sum_{t=1}^{T} \left( \sum_{n=1}^{N} \left( c_z q_z z_{n,t} + c_y q_y y_{n,t}(\xi^{[t]}) \right) + c_h I_t(\xi^{[t]}) \right) \right] && \text{(21a)} \\
\text{s.t.} \;& I_t(\xi^{[t]}) = I_0 + \sum_{\tau=1}^{t} \left( \sum_{n=1}^{N} \left( q_z z_{n,\tau} + q_y y_{n,\tau}(\xi^{[\tau]}) \right) - \xi_\tau \right) && \forall t \in T,\; \xi \in \Xi \quad \text{(21b)} \\
& I_t(\xi^{[t]}) \ge 0 && \forall t \in T,\; \xi \in \Xi \quad \text{(21c)} \\
& \sum_{\tau=1}^{t} \sum_{n=1}^{N} q_z z_{n,\tau} \le \bar{B}_t && \forall t \in T \quad \text{(21d)} \\
& z_{n,t} \in \{0,1\}, \; y_{n,t}(\xi^{[t]}) \in \{0,1\} && \forall n, t \quad \text{(21e)}
\end{aligned}
\]
Notice that z_{n,t} is a static binary decision variable, while y_{n,t} is an adaptive binary decision variable dependent on the realized uncertainty ξ^{[t]}. The uncertainty set is the box Ξ = {ξ ∈ R^T : l ≤ ξ ≤ u}, and a uniform distribution is assumed for all uncertain parameters. The bounds for each random parameter are chosen from l_t ∈ [0, 5] and u_t ∈ [10, 15] for t = 1, …, T. The cumulative ordering budget is B̄_t = 10t for t = 1, …, T. It is also assumed that q_z = q_y = 15/N and that the initial inventory is zero (I_0 = 0).
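The inventory balance (21b) and the cost (21a) can be simulated for a fixed demand realization. The sketch below is our own illustration: the decision matrices are arbitrary feasible choices (not an optimized policy), and the parameter values follow the first configuration of Table 6.

```python
import numpy as np

def inventory_cost(z, y, xi, c_z, c_y, c_h, q_z, q_y, I0=0.0):
    """Total cost and inventory path for one demand realization.

    Implements the balance
        I_t = I_{t-1} + q_z * sum_n z[n, t] + q_y * sum_n y[n, t] - xi_t
    and accumulates ordering plus holding costs for given binary order
    matrices z[n, t] and y[n, t].  A negative I_t would flag an
    infeasible (unmet-demand) realization.
    """
    T = len(xi)
    I, cost, path = I0, 0.0, []
    for t in range(T):
        I += q_z * z[:, t].sum() + q_y * y[:, t].sum() - xi[t]
        cost += c_z * q_z * z[:, t].sum() + c_y * q_y * y[:, t].sum() + c_h * I
        path.append(I)
    return cost, path

# First configuration: N = 2, c_z = 2, c_y = 3, c_h = 4, q_z = q_y = 7.5.
z = np.array([[1, 0], [0, 0]])   # one pre-order lot in period 1
y = np.array([[0, 1], [0, 0]])   # one recourse lot in period 2
cost, path = inventory_cost(z, y, xi=[5.0, 8.0],
                            c_z=2, c_y=3, c_h=4, q_z=7.5, q_y=7.5)
```

For this realization the inventory stays non-negative in both periods, so the choice of lots is feasible for the sampled demand.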
Two configurations of the inventory problem are studied in this work (Table 6). In the second configuration, the unit cost of immediate ordering (c_y), the unit holding cost (c_h), and the maximum number of delivery lots (N) are greater than in the first configuration. Since the immediate ordering and holding costs are higher in the second configuration, it is more economical to satisfy demand using first-stage decisions. The lifting and partitioning methods with both fixed and variable breakpoints are applied to the case study, and the obtained results are discussed below.
First, the comparison is made between lifting and partitioning methods using same fixed breakpoint setting. The following results are obtained:
  • The number of variables grows exponentially in the partitioning method, while it grows linearly in the lifting method. For a large number of time stages (T) and breakpoints (Br), the number of variables in the partitioning method can be prohibitively large, to the point where the model cannot be run (Table 7 and Table 8). Therefore, considering time and computational resource limitations, the lifting method is suggested for large models (experiments at and beyond T = 10, Br = 2 in this case study; Table 9 and Table 10). The ability to handle large models can thus be considered the main advantage of the lifting method.
  • It is observed that for experiments of small and medium model size (up to T = 10, Br = 1 in this case study), the partitioning method provides a better objective value in a shorter run time than the lifting method. For instance, for experiments with 5 time steps, the partitioning method finds a better solution in 9.5 s (Table 7, T = 5, Br = 2) than the lifting method does in a 10 h run (Table 9, T = 5, Br = 15). Figure 10 and Figure 11 compare the fixed breakpoint lifting and finite adaptability (partitioning) methods for the first and second configurations, respectively.
  • In general, in both the partitioning and lifting methods, the objective improves as the number of breakpoints increases, except for large problems where the optimality gap remains large after the 10 h run time limit (Table 9 and Table 10). Thus, for large models, a large number of breakpoints is not needed to obtain the best objective within a limited run time.
Next, the comparison is made between fixed and variable breakpoint methods. The following observations are made:
  • Variable breakpoint lifting and partitioning techniques introduce additional variables and require mixed-integer nonlinear optimization, compared to mixed-integer linear optimization in the fixed breakpoint case. As shown in Table 7, Table 9, Table 11 and Table 12, and Figure 12 and Figure 13, the variable breakpoint techniques can provide a better objective than the fixed methods for small-to-medium models (T = 2 and T = 5), at the cost of longer run times. For a large number of time steps and breakpoints, the problem becomes too large to run under the computational resource restrictions.
  • For small-to-medium size experiments (T = 2, T = 5), the fixed breakpoint partitioning method is recommended, since it provides the best objective within the shortest run time under the computational resource restrictions.
  • For large-size problems ( T = 10 , T = 20 ), the lifting method under fixed breakpoints is the only method that results in a feasible solution considering the computational resource limitations. The partitioning technique leads to large-size problems such that the solver could not find a solution in the 10 h time limit.

6. Conclusions

In this work, the lifting and partitioning methods for multistage adaptive robust binary optimization problems were studied. Formulations based on fixed and variable breakpoint settings were presented for each method, and computational studies were conducted to compare computational performance and solution quality. The following conclusions can be drawn. First, the binary decision rule (lifting) method and the finite adaptability (partitioning) method share a similar idea of splitting the uncertainty set using breakpoints for each uncertain parameter. While the lifting method has less solution flexibility than the partitioning method (under the same breakpoint setting), it offers superior computational tractability and scalability for large problems. When computational resources are the major concern (especially for problems with a large number of stages), the lifting method with fixed breakpoints is suggested, with a moderately large number of breakpoints to avoid inferior solution quality. Otherwise, the partitioning method is suggested. Variable breakpoints can be implemented for the partitioning method, especially when both the number of stages and the number of breakpoints are small. As a future research direction, the number of breakpoints could be optimized to avoid unnecessary model complexity.

Author Contributions

Methodology, F.M.N. and Z.L.; Software, F.M.N.; Validation, F.M.N.; Writing—original draft, F.M.N.; Writing—review & editing, Z.L.; Supervision, Z.L.; Funding acquisition, Z.L. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data is contained within the article.

Acknowledgments

The authors gratefully acknowledge the financial support from the Natural Sciences and Engineering Research Council of Canada (NSERC).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Goulart, P.J.; Kerrigan, E.C.; Maciejowski, J.M. Optimization over state feedback policies for robust control with constraints. Automatica 2006, 42, 523–533. [Google Scholar] [CrossRef]
  2. Skaf, J.; Boyd, S.P. Design of affine controllers via convex optimization. IEEE Trans. Autom. Control 2010, 55, 2476–2487. [Google Scholar] [CrossRef]
  3. Aharon, B.T.; Boaz, G.; Shimrit, S. Robust multi-echelon multi-period inventory control. Eur. J. Oper. Res. 2009, 199, 922–935. [Google Scholar] [CrossRef]
  4. See, C.T.; Sim, M. Robust approximation to multiperiod inventory management. Oper. Res. 2010, 58, 583–594. [Google Scholar] [CrossRef]
  5. Gounaris, C.E.; Wiesemann, W.; Floudas, C.A. The robust capacitated vehicle routing problem under demand uncertainty. Oper. Res. 2013, 61, 677–693. [Google Scholar] [CrossRef]
  6. Calafiore, G.C. An affine control method for optimal dynamic asset allocation with transaction costs. SIAM J. Control Optim. 2009, 48, 2254–2274. [Google Scholar] [CrossRef]
  7. Fadda, E.; Perboli, G.; Tadei, R. A progressive hedging method for the optimization of social engagement and opportunistic IoT problems. Eur. J. Oper. Res. 2019, 277, 643–652. [Google Scholar] [CrossRef]
  8. Shapiro, A.; Nemirovski, A. On complexity of stochastic programming problems. In Continuous Optimization; Springer: Berlin/Heidelberg, Germany, 2005; pp. 111–146. [Google Scholar]
  9. Dyer, M.; Stougie, L. Computational complexity of stochastic programming problems. Math. Program. 2006, 106, 423–432. [Google Scholar] [CrossRef]
  10. Garstka, S.J.; Wets, R.J.B. On decision rules in stochastic programming. Math. Program. 1974, 7, 117–143. [Google Scholar] [CrossRef]
  11. Ben-Tal, A.; El Ghaoui, L.; Nemirovski, A. Robust Optimization; Princeton University Press: Princeton, NJ, USA, 2009; Volume 28. [Google Scholar]
  12. Ben-Tal, A.; Nemirovski, A. Robust convex optimization. Math. Oper. Res. 1998, 23, 769–805. [Google Scholar] [CrossRef]
  13. Ben-Tal, A.; Goryashko, A.; Guslitzer, E.; Nemirovski, A. Adjustable robust solutions of uncertain linear programs. Math. Program. 2004, 99, 351–376. [Google Scholar] [CrossRef]
  14. Bertsimas, D.; Iancu, D.A.; Parrilo, P.A. Optimality of affine policies in multistage robust optimization. Math. Oper. Res. 2010, 35, 363–394. [Google Scholar] [CrossRef]
  15. Anderson, B.D.; Moore, J.B. Optimal Control: Linear Quadratic Methods; Courier Corporation: North Chelmsford, MA, USA, 2007. [Google Scholar]
  16. Kuhn, D.; Wiesemann, W.; Georghiou, A. Primal and dual linear decision rules in stochastic and robust optimization. Math. Program. 2011, 130, 177–209. [Google Scholar] [CrossRef]
  17. Chen, X.; Sim, M.; Sun, P.; Zhang, J. A linear decision-based approximation approach to stochastic programming. Oper. Res. 2008, 56, 344–357. [Google Scholar] [CrossRef]
  18. Chen, X.; Zhang, Y. Uncertain linear programs: Extended affinely adjustable robust counterparts. Oper. Res. 2009, 57, 1469–1482. [Google Scholar] [CrossRef]
  19. Georghiou, A.; Wiesemann, W.; Kuhn, D. Generalized decision rule approximations for stochastic programming via liftings. Math. Program. 2015, 152, 301–338. [Google Scholar] [CrossRef]
  20. Goh, J.; Sim, M. Distributionally robust optimization and its tractable approximations. Oper. Res. 2010, 58, 902–917. [Google Scholar] [CrossRef]
  21. Ben-Tal, A.; Den Hertog, D. Immunizing conic quadratic optimization problems against implementation errors. SSRN Electron. J. 2011. [Google Scholar] [CrossRef]
  22. Bertsimas, D.; Iancu, D.A.; Parrilo, P.A. A hierarchy of near-optimal policies for multistage adaptive optimization. IEEE Trans. Autom. Control 2011, 56, 2809–2824. [Google Scholar] [CrossRef]
  23. Bertsimas, D.; Georghiou, A. Design of near optimal decision rules in multistage adaptive mixed-integer optimization. Oper. Res. 2015, 63, 610–627. [Google Scholar] [CrossRef]
  24. Bertsimas, D.; Caramanis, C. Adaptability via sampling. In Proceedings of the 2007 46th IEEE Conference on Decision and Control, New Orleans, LA, USA, 12–14 December 2007; pp. 4717–4722. [Google Scholar]
  25. Caramanis, C.C.M. Adaptable Optimization: Theory and Algorithms. Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, 2006. [Google Scholar]
  26. Hanasusanto, G.A.; Kuhn, D.; Wiesemann, W. K-adaptability in two-stage robust binary programming. Oper. Res. 2015, 63, 877–891. [Google Scholar] [CrossRef]
  27. Bertsimas, D.; Georghiou, A. Binary decision rules for multistage adaptive mixed-integer optimization. Math. Program. 2018, 167, 395–433. [Google Scholar] [CrossRef]
  28. Postek, K.; Hertog, D.D. Multistage adjustable robust mixed-integer optimization via iterative splitting of the uncertainty set. INFORMS J. Comput. 2016, 28, 553–574. [Google Scholar] [CrossRef]
  29. Bertsimas, D.; Dunning, I. Multistage robust mixed-integer optimization with adaptive partitions. Oper. Res. 2016, 64, 980–998. [Google Scholar] [CrossRef]
  30. Misener, R.; Floudas, C.A. ANTIGONE: Algorithms for continuous/integer global optimization of nonlinear equations. J. Glob. Optim. 2014, 59, 503–526. [Google Scholar] [CrossRef]
Figure 1. Scenario tree for two-time stages and 3 branches for each node.
Figure 2. Solution under the scenario tree with 4 branches per node.
Figure 3. Solution under the scenario tree with 31 branches per node.
Figure 4. Lifting scheme for 1 breakpoint (left) and 2 breakpoints (right) on a single uncertain parameter ξ q , t .
Figure 5. Illustration of the relation between c o n v ( Ξ ) and its convex overestimation Ξ ^ .
Figure 6. y 1 ( ξ 1 ) , y 2 ( ξ 1 , ξ 2 ) solution under 2 breakpoints: ( 1 , 2 ) for ξ 1 , and ( 2 , 4 ) for ξ 2 .
Figure 7. Partitioning of the two-dimensional uncertainty set (left figure) and scenario tree representation (right figure).
Figure 8. Solution from finite adaptability method using two breakpoints for each parameter.
Figure 9. Solution from finite adaptability method using 29 breakpoints for each parameter.
Figure 10. Comparison of lifting and finite adaptability method using the same fixed breakpoints setting (first configuration).
Figure 11. Comparison of lifting and finite adaptability method using the same fixed breakpoints setting (second configuration).
Figure 12. Comparison of the solution from the lifting method using fixed breakpoints and optimized breakpoints.
Figure 13. Comparison of solutions from finite adaptability method using fixed breakpoints and optimized breakpoints.
Table 1. Results of scenario tree method for the illustrating example.

| Branches | Objective | Run Time | Binary Variables |
|---|---|---|---|
| 4 | −1.625 | 0.032 s | 20 |
| 11 | −1.562 | 0.041 s | 132 |
| 31 | −1.605 | 0.078 s | 993 |
| 99 | −1.594 | 0.433 s | 9900 |
Table 2. Solution from lifting method with different numbers of fixed breakpoints.

| Number of Breakpoints | Number of Variables | Objective | Run Time |
|---|---|---|---|
| 1 | 5 integer, 159 continuous | −1.000 | 0.022 s |
| 2 | 8 integer, 199 continuous | −1.333 | 0.078 s |
| 9 | 29 integer, 514 continuous | −1.333 | 0.094 s |
Table 3. Solution statistics for variable breakpoint lifting.

| Number of Breakpoints | Number of Variables | Objective | Run Time |
|---|---|---|---|
| 1 | 5 integer, 513 continuous | −1.333 | 1.075 s |
| 2 | 8 integer, 876 continuous | −1.333 | 2.375 s |
| 9 | 29 integer, 6112 continuous | −1.333 | 2 min 14 s |
Table 4. Solution of finite adaptability method.

| Number of Breakpoints | Number of Variables | Objective | Run Time |
|---|---|---|---|
| 2 | 18 binary, 109 continuous | −1.444 | 0.015 s |
| 9 | 200 binary, 1201 continuous | −1.510 | 0.016 s |
| 29 | 1800 binary, 10,801 continuous | −1.589 | 0.078 s |
Table 5. Variable breakpoint partitioning applied to the illustrating example.

| Number of Breakpoints | Number of Variables | Objective | Run Time |
|---|---|---|---|
| 1 | 8 binary, 83 continuous | −1.333 | 0.682 s |
| 2 | 18 binary, 180 continuous | −1.500 | 0.604 s |
| 3 | 32 binary, 315 continuous | −1.528 | 0.928 s |
| 5 | 72 binary, 699 continuous | −1.562 | 18 min 54 s |
Table 6. Inventory problem parameters.

| Configuration | N | c_z | c_y | c_h | q_y | q_z |
|---|---|---|---|---|---|---|
| First | 2 | 2 | 3 | 4 | 7.5 | 7.5 |
| Second | 3 | 2 | 5 | 6 | 5 | 5 |
Table 7. Results for the partitioning method, first configuration.

| T, Br | Objective | Run Time | Opt. Gap | Constraints | Cont. Vars | Discrete Vars |
|---|---|---|---|---|---|---|
| T = 2, Br = 1 | 118.0 | 8.42 s | 0 | 89 | 71 | 20 |
| T = 2, Br = 2 | 100.5 | 8.8 s | 0 | 197 | 151 | 40 |
| T = 2, Br = 3 | 96.43 | 8.64 s | 0 | 349 | 263 | 68 |
| T = 2, Br = 5 | 87.76 | 8.533 s | 0 | 785 | 583 | 148 |
| T = 2, Br = 7 | 85.30 | 8.48 s | 0 | 1397 | 1031 | 260 |
| T = 2, Br = 15 | 79.53 | 8.69 s | 0 | 5605 | 4103 | 1028 |
| T = 5, Br = 1 | 378.17 | 8.82 s | 0 | 3244 | 2253 | 330 |
| T = 5, Br = 2 | 320.30 | 9.52 s | 0 | 24,797 | 17,023 | 2440 |
| T = 5, Br = 3 | 291.70 | 11.65 s | 0 | 104,800 | 71,693 | 10,250 |
| T = 5, Br = 5 | 272.34 | 1 min 22 s | 0 | 797,828 | 544,333 | 77,770 |
| T = 5, Br = 7 | 243.10 | 25 min 10 s | 0 | 3,365,752 | 2,293,773 | 327,690 |
| T = 10, Br = 1 | 903.15 | 19.22 s | 0 | 364,561 | 245,783 | 20,500 |

Note: "T" denotes the number of stages, "Br" denotes the number of breakpoints.
Table 8. Results for the partitioning method, second configuration.

| T, Br | Objective | Run Time | Opt. Gap | Constraints | Cont. Vars | Discrete Vars |
|---|---|---|---|---|---|---|
| T = 2, Br = 1 | 124.5 | 8.41 s | 0 | 91 | 81 | 30 |
| T = 2, Br = 3 | 108.56 | 8.78 s | 0 | 361 | 297 | 102 |
| T = 2, Br = 5 | 101.37 | 8.71 s | 0 | 815 | 657 | 222 |
| T = 2, Br = 7 | 97.07 | 8.40 s | 0 | 1453 | 1161 | 390 |
| T = 2, Br = 15 | 93.21 | 8.49 s | 0 | 5845 | 4617 | 1542 |
| T = 5, Br = 1 | 476.75 | 8.9 s | 0 | 3342 | 2418 | 495 |
| T = 5, Br = 3 | 359.45 | 11.94 s | 0 | 108,556 | 76,818 | 15,375 |
| T = 5, Br = 5 | 323.29 | 1 min 56 s | 0 | 827,378 | 583,218 | 116,655 |
| T = 5, Br = 7 | 291.70 | 38 min 16 s | 0 | 3,492,144 | 2,457,618 | 491,535 |
| T = 10, Br = 1 | 1184.07 | 21.4 s | 0 | 372,755 | 256,033 | 30,750 |
Table 9. Results for the lifting method, first configuration.

| T, Br | Objective | Run Time | Opt. Gap | Constraints | Cont. Vars | Discrete Vars |
|---|---|---|---|---|---|---|
| T = 2, Br = 1 | 118.0 | 0.04 s | 0 | 405 | 277 | 14 |
| T = 2, Br = 2 | 118.0 | 0.04 s | 0 | 545 | 363 | 20 |
| T = 2, Br = 3 | 103.0 | 0.06 s | 0 | 685 | 449 | 26 |
| T = 2, Br = 5 | 99.25 | 0.08 s | 0 | 965 | 621 | 38 |
| T = 2, Br = 7 | 99.25 | 0.27 s | 0 | 1245 | 793 | 50 |
| T = 2, Br = 15 | 94.09 | 17.42 s | 0 | 2365 | 1481 | 98 |
| T = 5, Br = 1 | 403.25 | 0.11 s | 0 | 2358 | 1603 | 50 |
| T = 5, Br = 2 | 399.50 | 6.30 s | 0 | 3233 | 2133 | 80 |
| T = 5, Br = 3 | 390.12 | 29.53 s | 0 | 4108 | 2663 | 110 |
| T = 5, Br = 5 | 378.87 | 1 h 57 min | 0 | 5858 | 3723 | 170 |
| T = 5, Br = 7 | 371.37 | 10 h | 6.05% | 7608 | 4783 | 230 |
| T = 5, Br = 15 | 369.03 | 10 h | 22.98% | 14,608 | 9023 | 470 |
| T = 10, Br = 1 | 1115.25 | 18.83 s | 0 | 10,893 | 5768 | 155 |
| T = 10, Br = 3 | 1012.12 | 10 h | 6.13% | 16,213 | 10,473 | 370 |
| T = 10, Br = 5 | 1004.62 | 10 h | 12.83% | 23,213 | 14,693 | 590 |
| T = 10, Br = 7 | 1004.62 | 10 h | 20.23% | 30,213 | 18,913 | 810 |
| T = 10, Br = 15 | 1073.06 | 10 h | 25.76% | 58,213 | 35,557 | 1454 |
| T = 20, Br = 1 | 3575.0 | 10 h | 1.55% | 36,423 | 24,813 | 610 |
| T = 20, Br = 3 | 3421.25 | 10 h | 11.4% | 64,423 | 41,873 | 1670 |
| T = 20, Br = 5 | 3441.87 | 10 h | 19.68% | 92,423 | 58,889 | 2686 |
| T = 20, Br = 7 | 3892.81 | 10 h | 29.35% | 120,423 | 75,069 | 2866 |
| T = 20, Br = 15 | 4549.99 | 10 h | 21.66% | 232,423 | 139,657 | 3454 |
Table 10. Results for lifting method, second configuration.

| T, Br | Objective | Run Time | Opt. Gap | Constraints | Cont. Vars | Discrete Vars |
|---|---|---|---|---|---|---|
| T = 2, Br = 1 | 124.5 | 0.11 s | 0 | 565 | 388 | 21 |
| T = 2, Br = 3 | 118.25 | 0.11 s | 0 | 957 | 630 | 39 |
| T = 2, Br = 5 | 107.62 | 0.23 s | 0 | 1349 | 872 | 57 |
| T = 2, Br = 7 | 107.62 | 1.38 s | 0 | 1741 | 1114 | 75 |
| T = 2, Br = 15 | 107.62 | 18 min 52 s | 0 | 3309 | 2082 | 147 |
| T = 5, Br = 1 | 498.0 | 0.13 s | 0 | 3298 | 2248 | 75 |
| T = 5, Br = 3 | 461.75 | 26 min 1 s | 0 | 5748 | 3738 | 165 |
| T = 5, Br = 5 | 433.0 | 10 h | 2.64% | 8198 | 5228 | 255 |
| T = 5, Br = 7 | 418.62 | 10 h | 6.28% | 10,648 | 6718 | 345 |
| T = 5, Br = 15 | 409.87 | 10 h | 23.56% | 20,448 | 12,678 | 705 |
| T = 10, Br = 1 | 1306 | 3.7 s | 0 | 12,893 | 8768 | 225 |
| T = 10, Br = 3 | 1202.25 | 10 h | 5.64% | 22,693 | 14,698 | 555 |
| T = 10, Br = 5 | 1149.12 | 10 h | 11.95% | 32,493 | 20,628 | 885 |
| T = 10, Br = 7 | 1196.62 | 10 h | 19.27% | 42,293 | 26,558 | 1215 |
| T = 10, Br = 15 | 1309.12 | 10 h | 26.77% | 81,493 | 49,924 | 2181 |
| T = 20, Br = 1 | 4456.52 | 10 h | 3.27% | 50,983 | 34,798 | 915 |
| T = 20, Br = 3 | 4091.24 | 10 h | 9.5% | 90,183 | 58,788 | 2505 |
| T = 20, Br = 5 | 4006.25 | 10 h | 19.27% | 129,383 | 82,712 | 4029 |
| T = 20, Br = 7 | 4984.37 | 10 h | 31.27% | 168,583 | 105,382 | 4299 |
| T = 20, Br = 15 | 6457.18 | 10 h | 27.51% | 325,383 | 195,864 | 5181 |
Table 11. Results for lifting method with breakpoint optimization, first configuration.

| T, Br | Objective | Run Time | Opt. Gap | Constraints | Cont. Vars | Discrete Vars |
|---|---|---|---|---|---|---|
| T = 2, Br = 1 | 94.07 | 5.98 s | 0 | 756 | 630 | 14 |
| T = 2, Br = 2 | 94.07 | 2 min 40 s | 0 | 1212 | 1032 | 20 |
| T = 2, Br = 3 | 90.95 | 48 min 48 s | 0 | 1764 | 1530 | 26 |
| T = 2, Br = 5 | 90.94 | 10 h | 15.27% | 3156 | 2814 | 38 |
| T = 2, Br = 7 | 90.94 | 10 h | 32.03% | 4932 | 4482 | 50 |
| T = 2, Br = 15 | 94.02 | 10 h | 48.63% | 15,876 | 14,994 | 98 |
| T = 5, Br = 1 | 383.37 | 5 h | 19.26% | 4311 | 3561 | 50 |
| T = 5, Br = 2 | 384.55 | 10 h | 34.93% | 7056 | 5961 | 80 |
| T = 5, Br = 3 | 428.56 | 10 h | 49.75% | 10,401 | 9861 | 110 |
| T = 5, Br = 5 | 479.42 | 10 h | 62.52% | 18,891 | 16,761 | 170 |
| T = 10, Br = 1 | 1089.23 | 10 h | 30.6% | 16,716 | 13,766 | 150 |
| T = 10, Br = 2 | 1126.13 | 10 h | 49.30% | 27,556 | 23,216 | 260 |
Table 12. Results for partitioning method with breakpoint optimization, first configuration.

| T, Br | Objective | Run Time | Opt. Gap | Constraints | Cont. Vars | Discrete Vars |
|---|---|---|---|---|---|---|
| T = 2, Br = 1 | 94.07 | 1.92 s | 0 | 137 | 117 | 20 |
| T = 2, Br = 2 | 86.26 | 1 min 47 s | 0 | 297 | 249 | 40 |
| T = 2, Br = 3 | 80.01 | 10 h | 1.28% | 521 | 433 | 68 |
| T = 2, Br = 5 | 78.06 | 10 h | 2.72% | 1161 | 957 | 148 |
| T = 2, Br = 7 | 79.03 | 10 h | 34.61% | 2057 | 1689 | 260 |
| T = 2, Br = 15 | 101.66 | 10 h | 56.63% | 8201 | 6697 | 1028 |
| T = 5, Br = 1 | 371.86 | 10 h | 17.22% | 3872 | 2876 | 330 |
Motamed Nasab, F.; Li, Z. Multistage Adaptive Robust Binary Optimization: Uncertainty Set Lifting versus Partitioning through Breakpoints Optimization. Mathematics 2023, 11, 3883. https://doi.org/10.3390/math11183883