Abstract
Large-scale binary integer programs occur frequently in many real-world applications. For some binary integer problems, finding an optimal solution or even a feasible solution is computationally expensive. In this paper, we develop a discrete meta-control procedure to approximately solve large-scale binary integer programs efficiently. The key idea is to map the vector of n binary decision variables into a scalar function defined over a time interval and construct a linear quadratic tracking (LQT) problem that can be solved efficiently. We prove that an LQT formulation has an optimal binary solution, analogous to a classical bang-bang control in continuous time. Our LQT approach can provide advantages in reducing computation while generating a good approximate solution. Numerical examples are presented to demonstrate the usefulness of the proposed method.
1. Introduction
Many decision problems in economics and engineering can be formulated as binary integer programming (BIP) problems. These BIP problems are often easy to state but difficult to solve due to the fact that many of them are NP-hard [], and even finding a feasible solution is considered NP-complete [,]. Because of their importance in formulating many practical problems, BIP algorithms have been widely studied. These algorithms can be classified into exact and approximate algorithms as follows []:
(1) Exact algorithms: The exact algorithms are guaranteed either to find an optimal solution or prove that the problem is infeasible, but they are usually computationally expensive. Major methods for BIP problems include branch and bound [], branch-and-cut [], branch-and-price [], dynamic programming methods [], and semidefinite relaxations [].
(2) Approximate algorithms: The approximate algorithms are used to achieve efficient running time with a sacrifice in the quality of the solution found. Examples of well-known metaheuristics, as an approximate approach, are simulated annealing [], annealing adaptive search [], cross entropy [], genetic algorithms [] and nested partitions []. Moreover, many hybrid approaches that combine both the exact and approximate algorithms have been studied to exploit the benefits of each []. For additional references regarding large-scale BIP algorithms, see [,,,].
Another effective heuristic technique that transforms discrete optimization problems into problems falling in the control theory and information theory or signal processing domains has also been studied recently. In [,], circuit related techniques are used to transform unconstrained discrete quadratic programming problems and provide high quality suboptimal solutions. Our focus is on problems with linear objective functions, instead of quadratic, and linear equality constraints, instead of unconstrained.
In our previous work [], we introduced an approach to approximating a BIP solution using continuous optimal control theory, which showed promise for large-scale problems. The key innovation to our optimal control approach is to map the vector of n binary decision variables into a scalar function defined over a time interval and define a linear quadratic tracking (LQT) problem that can be solved efficiently. In this paper, we use the same mapping, but instead of solving the LQT problem in continuous time, we explore solving the LQT problem in discrete time, because the time index in our reformulation of the BIP represents the dimension of the problem, , and a discrete time approach more accurately represents the partial summing reformulation than the continuous approach. In addition, in our previous work, the transformation into a continuous LQT problem was based on a reduced set of constraints, and a least squares approach was used to estimate the error due to the constraint reduction. The algorithm iteratively solved the LQT problem and the least squares problem until convergence conditions were satisfied. In this paper, instead of iteratively solving the LQT problem based on a reduced set of constraints, we solve the LQT problem only once for the full state space. This approach improves the flow of information for convergence.
We have chosen a quadratic criterion for our approach because its formalism includes a measure of the residual entropy of the dynamics of the algorithm as it computes successive approximation to a solution. Because of the mapping used in our algorithm, the information measure is given by the inverse of the Riccati equation that we solve. That inverse of the solution of the Riccati equation is a Fisher information matrix of the algorithm as a dynamical system [,]. The information from the algorithm in the criterion determines the quality of the solution.
The computational complexity for solving the LQT problem is polynomial in the time horizon, the dimension of the state space and the number of control variables. In our LQT problem, the time horizon is n, the dimension of the state space is the number of constraints m, and the number of control variables is 1. Our meta-control approach solves the LQT problem to obtain an efficient approximate solution to the original BIP problem.
2. Development of the Meta-Control Algorithm for BIP Problems
The original BIP problem is:
Problem 1.
where for are binary decision variables. We assume , and are real known values for and and there exists at least one feasible point.
2.1. Partial Summing Formulation
We start by defining partial summing variables as in [] from the original BIP problem as
for and , with initial conditions
For ease of notation, we create a new vector and the element of is denoted for and for We also define the vector for , and the vector , where the element of b is denoted for . We define Problem 2 as follows, with initial conditions as a vector of zeros:
Problem 2.
Proposition 1.
Problem 2 exactly represents Problem 1.
The proof is straight-forward; the constraints ensure feasibility and the objective function is equivalent to Problem 1.
2.2. Construct the LQT Problem
We construct an LQT problem, Problem 3, by first defining an error term, as a measure of unsatisfied constraints, an vector for , as
We develop the dynamics in terms of the measure , by combining Equation (10) with Equation (6), yielding
and note that , given initial conditions . The criterion is to minimize the measure of unsatisfied constraints using a terminal penalty for infeasibility and objective function value, which is given by
We also relax constraint (9) with .
The parameters and F are positive semi-definite and user-specified. The matrix is used to penalize the unsatisfied constraints. The matrix F is used to penalize the terminating conditions and aid in minimizing the original objective function.
We now summarize our discrete LQT problem with the criterion in Equation (12) as follows:
Problem 3.
It is known that solving Problem 3 directly is numerically unstable []. However, Theorem 1 suggests an algorithmic approach to solving Problem 3, by making a discrete analog to a bang-bang control with a switching function.
Theorem 1.
Analogous to a bang-bang control in continuous time, Problem 3 has an optimal binary solution with for discrete times with non-singular arcs.
Proof.
We first construct the Hamiltonian function [] as follows
where is the costate vector, for , and it satisfies
Let and be the optimal solution, by the necessary conditions for the optimality [], we have:
Thus, we have
☐
If , binary values for are determined by Equation (20). When , the arc is singular, and we may reintroduce constraint (9), , to force a binary solution.
To get an intuitive understanding of the singularity issue, suppose all , and the element at row 1, column 1 of matrix F equals zero. Then Problem 3 reduces to minimize the infeasibility penalty term, . If this term equals zero, then , satisfying all of the original constraints (2), and from Equation (18), and because , all . Then for all j. However, if and the first element of F have positive values, then may be positive or negative and Equation (20) is useful. An auxiliary problem to determine values for and F that resolve the singularity will be explored in future research.
To create an LQT problem that is practical to solve, we introduce a penalty term in the criterion, where is a Lagrangian multiplier associated with constraint (9):
Problem 4.
The optimal control for Problem 4 can be solved by the standard dynamic programming method [] (see appendix for details). The computation associated with solving Problem 4 is . We then obtain an approximate binary solution to the original BIP problem as follows:
for .
Motivated by the successive overrelaxation method [], we introduce a weighting factor ω to improve the stability of our proposed method. Rather than applying quantization at the final step as shown in Equation (24), we did quantization at each step and propagate the binary value during the dynamic programming procedure (see appendix for details). At the final step, we then replace in Equation (24) with to get the approximate binary solution.
3. Numerical Results
We explore the limits of the algorithm with some test problems obtained from MIPLIB []. MIPLIB is a standard and widely used benchmark for comparing the performance of various mixed integer programming algorithms, and most of the problems in the MIPLIB arise from real-world applications. We have presented 6 tests in our numerical result section, where , , , and are airline crew scheduling type problems. The dimensions and the optimal solutions for the test problems and the numerical results are shown in Table 1. The CPU time is given for a single run with branch-and-cut with CPLEX, branch-and-bound in MATLAB, and our method in MATLAB. In Table 1, the feasibility measure is the summation of the absolute differences of feasibility over all constraints, and the optimality measure is defined as [], where denotes the true objective function value, denotes the function value found by our proposed method and denotes the worst (largest) function value. All tests are done on an Intel(R) Core(TM) i3 CPU @2.4 GHz machine under 64bit Windows7 with 4 GB RAM.
Table 1.
Test Problems from MIPLIB.
| Problem | n | m | Time(sec) with branch-and-cut in CPLEX | Time(sec) with branch-and-bound in MATLAB | Time(sec) with our method in MATLAB | Feasibility measure | Optimality measure (%) |
|---|---|---|---|---|---|---|---|
| enigma | 21 | 100 | 0.23 | 4.02 | 0.03 | 18 | 0 |
| air01 | 771 | 23 | 0.28 | 2.86 | 0.22 | 13 | 2.55% |
| air03 | 124 | 10,757 | 1.05 | 17.64 | 34.00 | 138 | -11.68% |
| air04 | 8,904 | 823 | 34.35 | too large to run | 3231.5 | 811 | 1.43% |
| air05 | 426 | 7,195 | 26.66 | too large to run | 698.6 | 322 | -0.55% |
| nw04 | 87,482 | 36 | 9.83 | too large to run | 37.9 | 19 | 1.36% |
In the numerical tests, we experimented with different values for parameters , and F on the small problems and . The diagonal elements of were set to 0, 1 and 10, and we found that smaller values were better, so we report results with in Table 1. We also tested values for parameter set to 1, 10, 100 and 1000, and there was not much difference in performance, so we set . As for parameter F, we found that bigger values were better, so we set the diagonal elements of F to . The parameters penalize the intermediate error values whereas the parameter F penalizes the terminal error at n. Since the terminal error better reflects the original BIP optimality and infeasibility measures, intuitively, it makes sense to set and F large.
Values for the weighting factor ω ranged between to in our exploratory tests, and the best results were typically for ω between and .
CPLEX ran very quickly and always found an optimal solution; branch-and-bound in MATLAB was slower and only found a feasible solution for , and ; our method in MATLAB ran slower than CPLEX, but generally faster than branch-and-bound in MATLAB. Even though our numerical results are “worse" than CPLEX, our methodology has a potential for extension with polynomial computational complexity.
4. Summary and Conclusion
The meta-control algorithm for approximately solving large-scale BIPs shows much promise because the computational complexity is linear in n (the number of variables) and polynomial in m (the number of constraints), specifically on the order of . An LQT approach is suggested by the result in Theorem 1, which proves the existence of an optimal binary solution to the LQT problem. We provide numerical results with experimentally chosen parameter values that demonstrate the effectiveness of our approach.
In our future research, we will develop an auxiliary iterative method that can provide an explicit algorithm for detecting valid parameter values automatically and investigate other ways to integrate the quantization into the meta-control algorithm to improve the performance of this algorithm. We will also develop a stochastic decomposition method to reduce the computation time.
Acknowledgements
This research is sponsored, in part, by the National Science Foundation through Grant CMMI-0908317.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix
We solve for in Problem 4 using a dynamic programming approach. We write the cost-to-go equation as:
with , and equate it to the Riccati form
where represents a symmetric positive-definite matrix, is a positive vector, and is a positive scalar.
Combining the Equations (25), (26) and the dynamics in Equation (22), we have
In order to minimize this expression we isolate the terms with in them
and take the derivative with respect to and set the value to 0,
This yields the solution for the optimal control
In order to simplify notation, we let
and we can now write
We equate the Riccati form Equation (26) with the value function in Equation (27) evaluated at from Equation (31), yielding
We now solve for and by separating the quadratic terms from the linear terms in . Isolating the quadratic terms in , we have
which yields the Riccati equation corresponding to
Isolating the linear terms in , we have
and factoring out , the tracking equation for is
Therefore, and can be found backwards in time by Equations (32) and (33) from initial conditions
Given and , we can calculate from Equations (28), (22) and (23). To calculate for our implementation with quantization, we use the same and , but introduce rounding to the nearest integer in Equations (28), (22) and (23) to obtain:
and
with .
References
- Wolsey, L.A. Integer Programming; Wiley: New York, NY, USA, 1998. [Google Scholar]
- Danna, E.; Fenelon, M.; Gu, Z.; Wunderling, R. Generating Multiple Solutions for Mixed Integer Programming Problems. In Integer Programming and Combinatorial Optimization; Fischetti, M., Williamson, D.P., Eds.; Springer: Berlin, Germany, 2007; pp. 280–294. [Google Scholar]
- Jarre, F. Relating Max-Cut Problems and Binary Linear Feasibility Problems. Available online: http://www.optimization-online.org (accessed on 15 June 2013).
- Bertsimas, D.; Tsitsiklis, J.N. Introduction to Linear Optimization; Athena Scientific: Nashua, NH, USA, 1997. [Google Scholar]
- Mitten, L.G. Branch-and-bound methods: General formulation and properties. Oper. Res. 1970, 18, 24–34. [Google Scholar] [CrossRef]
- Caprara, A.; Fischetti, M. Branch-and-Cut Algorithms. In Annotated Bibliographies in Combinatorial Optimization; Wiley: Chichester, UK, 1997; pp. 45–64. [Google Scholar]
- Barnhart, C.; Johnson, E.L.; Nemhauser, G.L.; Savelsbergh, M.W.P.; Vance, P.H. Branch-and-price: Column generation for solving huge integer programs. Oper. Res. 1998, 46, 316–329. [Google Scholar] [CrossRef]
- Lew, A.; Holger, M. Dynamic Programming: A Computational Tool; Springer: New York, NY, USA, 2007; Volume 38. [Google Scholar]
- Jünger, M.; Liebling, T.; Naddef, D.; Nemhauser, G.; Pulleyblank, W.; Reinelt, G.; Rinaldi, G.; Wolsey, L. 50 Years of Integer Programming 1958–2008: From the Early Years to the State-of-the-Art; Springer: Berlin, Germany, 2009. [Google Scholar]
- Kirkpatrick, S.; Gelatt, C.D., Jr.; Vecchi, M.P. Optimization by simulated annealing. Science 1983, 220, 671–680. [Google Scholar] [CrossRef] [PubMed]
- Zabinsky, Z.B. Stochastic Adaptive Search for Global Optimization; Kluwer Academic Publishers: Boston, MA, USA, 2003. [Google Scholar]
- Rubinstein, R.Y.; Kroese, D.P. The Cross Entropy Method: A Unified Combinatorial Approach to Combinatorial Optimization, Monte-Carlo Simulation and Machine Learning; Springer: Berlin, Germany, 2004. [Google Scholar]
- Haupt, R.L.; Sue, E.H. Practical Genetic Algorithms; Wiley: New York, NY, USA, 2004. [Google Scholar]
- Shi, L.; Ólafsson, S. Nested partitions method for global optimization. Oper. Res. 2000, 48, 390–407. [Google Scholar] [CrossRef]
- Hoffman, K.L. Combinatorial optimization: Current successes and directions for the future. J. Comput. Appl. Math. 2000, 124, 341–360. [Google Scholar] [CrossRef]
- Grötschel, M.; Krumke, S.O.; Rambau, J. Online Optimization of Large Scale Systems: State of the Art; Springer: Berlin, Germany, 2001. [Google Scholar]
- Martin, R.K. Large Scale Linear and Integer Optimization; Kluwer: Hingham, MA, USA, 1998. [Google Scholar]
- Schrijver, A. Combinatorial Optimization: Polyhedra and Efficiency; Springer: Berlin, Germany, 2003. [Google Scholar]
- Callegari, S.; Bizzarri, F.; Rovatti, R.; Setti, G. On the Approximate solution of a class of large discrete quadratic programming problems by ΔΣ modulation: The case of circulant quadratic forms. IEEE Trans. Signal Process. 2010, 58, 6126–6139. [Google Scholar] [CrossRef]
- Callegari, S.; Bizzarri, F. A Heuristic Solution to the Optimisation of Flutter Control in Compression Systems (and to Some More Binary Quadratic Programming Problems) via ΔΣ Modulation Circuits. In Proceedings of the 2010 IEEE International Symposium Circuits and Systems (ISCAS), Paris, France, 30 May–2 June 2010; pp. 1815–1818.
- Von Haartman, K.; Kohn, W.; Zabinsky, Z.B. A meta-control algorithm for generating approximate solutions to binary programming problems. Nonlinear Anal. Hybrid Syst 2008, 2, 1232–1244. [Google Scholar] [CrossRef]
- Frieden, B.R. Science from Fisher Information: A Unification; Cambridge University Press: Cambridge, UK, 2004. [Google Scholar]
- Zhen, S.; Chen, Y.; Sastry, C.; Tas, N.C. Optimal Observation for Cyber-Physical Systems: A Fisher-Information-Matrix-Based Approach; Springer: Berlin, Germany, 2009. [Google Scholar]
- Lewis, F.L.; Syrmos, V.L. Optimal Control; Wiley: New York, NY, USA, 1995. [Google Scholar]
- Bertsekas, D.P. Dynamic Programming and Optimal Control, 3rd ed.; Athena Scientific: Nashua, NH, USA, 2005; Volume I. [Google Scholar]
- Varga, R.S. Matrix Iterative Analysis; Springer: Berlin, Germany, 2000. [Google Scholar]
- MIPLIB—Mixed Integer Problem Library. Available online: http://miplib.zib.de/ (accessed on 15 June 2013).
- Ali, M.M.; Khompatraporn, C.; Zabinsky, Z.B. A numerical evaluation of several stochastic algorithms on selected continuous global optimization test problems. J. Glob. Optim. 2005, 31, 635–672. [Google Scholar] [CrossRef]
© 2013 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).