1. Introduction
Quantum annealing (QA) computers such as the commercially available D-Wave 2X and D-Wave 2000Q are designed to seek solutions to problems that are hard for the conventional computers, such as many NP-hard problems [
1]. The type of problems such computers can solve
directly in their quantum hardware is minimizing a quadratic form of the type
where variables
are either in the set
, in which case the problem is called an
Ising problem, or in the set
, in which case it is called a
Quadratic Unconstrained Binary Optimization (
QUBO) problem. Both versions are NP-hard [
1,
2] and they can easily be converted into each other by a linear variable transformation. To minimize
Q, QA computers are trying to find a low-energy state of the quantum system whose Hamiltonian corresponds to
Q [
3,
4].
Solving an optimization problem using a quantum annealer involves the following several steps.
(i) Represent the optimization problem of interest as a QUBO or Ising problem. Many well-known NP-hard problems such as Maximum Clique, Maximum Independent Set, Maximum Cut, and Graph Coloring have simple formulations as quadratic constrained binary optimization problems, i.e., problems with objective function of type (
1) plus constraints [
5]. A typical such constraint is of the form
for a QUBO, or
for an Ising problem, where
, meaning that exactly
k of the decision variables should be set to 1 (true). To have the same representation valid both for the QUBO and the Ising formulations of the constraint, we rewrite it as
As an example, the graph partitioning problem [
6], asking to divide the vertices of an
n-vertex graph into two equal (with difference in sizes at most one) parts so that the number of edges with endpoints in different parts is minimized, can be formulated as the Ising problem:
we will later discuss approaches for dealing with such constraints.
(ii) Map the Ising problem onto the annealer’s hardware. To define the mapping, we consider two graphs, the graph
P describing the structure of the Ising (the program graph), and the graph
H, describing the interconnections between the qubits in the hardware (the hardware graph). Specifically,
P has a vertex
i for each variable
and an edge
if
, while
H describes the connections between the qubits in the quantum processing unit. In the current D-Wave architecture,
H has the structure of a
Chimera graph, which consists of an array of
unit cells, where each unit cell is a complete
bipartite graph, and the cells are connected as illustrated on
Figure 1. Please note that with the current technology, due to errors during manufacturing and/or calibration, some qubits and couplers are defective and therefore disabled. We assume for now that the Chimera graph is complete, but in
Section 8.1.1 we describe a method for dealing with incomplete Chimera graphs.
Since P is almost never a subgraph of H, P has to be minor-embedded into H, which means that each vertex of P is mapped onto a connected set of vertices (qubits) of H called chain, and the parameters of the edges (couplers) connecting the qubits in a chain are chosen so that all such qubits are getting the same value in an optimal solution, with a high probability. While this approach allows any graph P to be embedded into a sufficiently large Chimera graph H, it comes at a cost. If the Ising/QUBO problem is dense, embedding a problem of n variables requires number of qubits proportional to .
(iii) Run the annealer in order to find values of the variables producing a low energy of the quantum system modeling the Ising, which corresponds to a low value of the objective function. Due to errors caused by hardware imperfections, thermal noise, decoherence, and the short anneal times (in the order of microseconds), usually this operation is being performed many thousands of times and the best of the proposed solutions is chosen as an answer.
Since a QUBO problem cannot, by definition, contain constraints, any constraints have to be added to the objective as penalties. For instance, constraint (
2) can be transformed into a penalty term
that is added to the objective of a QUBO. If
M is chosen large enough, then for any optimal solution the constraint (
2) should be satisfied, assuming the problem is a minimization one. A drawback of such an approach is that such squared penalty will result in a QUBO or Ising with
quadratic terms (nonzero coefficients
) and a complete graph
P even if the original objective was sparse (with
quadratic terms). Hence, our goal is to design more efficient ways to represent constraints for optimization with quantum annealers such as D-Wave.
Previous work on this problem includes [
7], where the constraint embedding problem is formulated as a mixed integer optimization problem, which is solved by using MathSat solver [
8], a Satisfiability Modulo Theory (SMT) [
9] type solver, in a combination with using dynamic programming and exploiting the special structure of the Chimera graph. They use their algorithm to find an embedding of the Ising constraint
,
, producing an embedding with 8 qubits, compared to the 12 qubits required by the squared constraint penalty. However, their approach is not scalable and is restricted to problems where
n is no more than 10–20 [
7]. There also has been a lot of work on solving constrained optimization problems on classical computers. The approach adopted in this paper and commonly used for solving constrained problems on the D-Wave annealer belongs to the class of penalty methods, which includes barrier methods [
10,
11]. Another approach is the Augmented Lagrangian Method [
12], which combines the penalty function with Lagrange multipliers in a single objective function. However, since the Lagrange multipliers are real numbers, the resulting problem cannot be directly solved on the D-Wave annealer, which only accepts binary variables. For further discussion of constrained methods, see [
10,
13,
14].
For minor-embedding a general graph into the Chimera graph, Vicky Choi [
15,
16] show that a Chimera graph consisting of
array of unit cells allows an embedding of a complete graph of
vertices. Klymko et al. [
17] design a polynomial-time algorithm for finding embedding of a complete graphs into a subgraph if a Chimera graph, which is relevant for the case of hardware graphs with faulty qubits and/or couplers. Boothy et al. [
18] improve on that result by increasing in some case the sizes of the cliques that can be embedded and reducing the chain sizes. Cai et al. [
19] design a heuristic algorithm that constructs an embedding of any graph (not necessarily clique) into the Chimera graph. Rieffel et al. [
20] explore alternative ways of mapping operational planning optimization problems to QUBO problems and embedding them into the D-Wave hardware, and analyze how different mappings and parameter choices affect the effectiveness of the annealers. Bian et al. [
21] develop a locally structured approach for embedding Boolean constraint satisfaction problems on the Chimera hardware and and propose decomposition algorithms for solving problems too large to fit into the hardware. Hamilton et el. [
22] study what graphs can be embedded into a complete bipartite virtual hardware by defining a minor set cover (MSC), which is a is a set of minors for a given graph
G, and show that the complete bipartite graph
has a MSC of
n minors. Goodrich et al. [
23] use a graph-theoretical approach and develop an odd cycle traversal decomposition (OCT)-based embedding algorithm by exploiting the bipartite structure of the Chimera graph. Date et al. [
24] propose an embedding algorithm for a Chimera graph without faults that is efficient and uses fewer qubits and produces solutions of the QUBO problem closer to the global minimum, compared to the current D-Wave embedder.
In this paper, we propose a method that allows embedding in the Chimera graph linear constraints of any number of variables. The idea of our approach is to define several patterns of variable assignments for a single cell of
H, we call “tiles”, so that there is one-to-one correspondence between the variable assignments and the tilings of
H. Our result implies that there exists an implementation of the constraint (
2) with
qubits on an
-qubit Chimera graph with
gap, or difference between feasible and infeasible solutions, equal to 2. Another advantage of our method is that the computation time required to compute the embedding does not depend on the number of the variables, but on the number of the tile types, so it is a constant with respect to
n. Although this result implies
-qubit implementation for the special case
, we describe a separate algorithm with smaller constant factors.
The paper is organized as follows. In
Section 2, we formally introduce the problem we want to solve and in
Section 3 we present an optimization-based approach for directly solving that problem, which is however not scalable. In the next section,
Section 4, we outline our proposed two-level approach that is scalable to constraints of type (
2) of any size. In
Section 5 and
Section 6 we describe details of the implementation. Finally, in
Section 7 and
Section 8 we propose a better solution for the special case
and describe the results of our experimental analysis. In the conclusion, we give a summary and offer some open problems.
2. Problem Formulation
Let
be the set of all binary variables
satisfying constraint (
2) and
, where
. The typical implementation of the constraint (
2) is as a penalty term of the type
where
. Ignoring the scaling factor
M, the squared difference term
has the following relevant properties:
- (i)
for each , ;
- (ii)
for each , for some ( in this case).
Clearly, the larger the value of the parameter
, which is called
gap, for which (ii) is satisfied, the larger the “penalty” to the objective, if a solution to the optimization problem violates the constraint. Hence, one would like to solve the following problem:
where the QUBO problem is normalized with specific restrictions on the values of its coefficients. For such restrictions, we will use the maximum values of biases allowed on qubits and couplers by D-Wave hardware, as discussed later.
To allows values of
that are better than (
3), one can use ancillary binary variables
and consider a quadratic binary form of both
and
variables of the type
to encode the constraint. Please note that linear terms
and
have been replaced in (
5) by quadratic terms
and
in order to simplify notation, since the variables
and
are in
. Then, the above optimization problem is replaced by the following one [
7,
25,
26].
Problem 1. Find coefficients , , and such that the QUBO problem (5) satisfies the conditions - (C1)
s.t. ,
- (C2)
,
- (C3)
,
for some . (Please note that we use a parameter μ to account for the lack of constant terms in (5) and (1). Conditions (C1) and (C2) correspond to (i) and (C3) corresponds to (ii). We also take into account the physical properties of the quantum processing unit (QPU), which includes constraints on the minimum and maximum values of the coefficients
,
, and
, which describe the range of values that the QPU hardware allows, and complying with the topology of the Chimera graph, which means that only a specific small subset of
,
, and
coefficients are allowed to be non-zeros. Please note that while the standard penalty version (
3) of the constraint does not explicitly include ancillary variables, such variables implicitly appear when
is embedded in the QPU, since the embedding represents most of the variables by chains (sets) of qubits, i.e., uses extra variables. Moreover, the number of these extra (ancillary) variables grows asymptotically as the square of the problem variables, i.e.,
, which is very inefficient. Therefore, we will be looking for solutions in which
m is a subquadratic function of
n; specifically, we propose a solution where
, and
if
k is bounded by a constant.
4. A Two-Level Approach
Next, we describe a solution for arbitrary large
n that uses
ancillary variables. Unless stated otherwise, we consider from now on the Ising case, i.e.,
, since it will make it easier to establish why and how it works. Our goal is to compute the coefficients
,
, and
as defined in (
5). However, since in the Ising version
for each
, i.e., it is a constant, we can use (in order to avoid the introduction of new variables) the diagonal values
and
as coefficients for the linear terms
and
in the Ising problem. Hence, the Ising version of (
5) is
Keys to our approach are the concepts of a gadget and a tile, discussed below, that allow any assignment of values in to the qubits of a Chimera graph to be analyzed as a geometric configuration built using several types of “tiles”, or patterns of value assignments, describing the individual unit cells of the Chimera. Depending on the types and positions of the tiles, one is able to derive the value of the Ising program for that assignment or an estimation of it, and, eventually, evidence that conditions (C1)–(C3) are satisfied. Intuitively, the difference between gadgets and tiles is that a gadget refers to the Ising coefficients of a cell, while a tile refers to a specific assignment to the variables related to a given gadget.
Next, we describe the approach in more detail. We will assume we have a fixed Chimera graph and we refer to the Ising program implementing the embedding that we are going to construct as , while its projection to a single cell c or to a set S of cells as and , respectively.
4.1. Gadgets
A gadget is an Ising program defined on variables corresponding to the qubits of a unit cell, or, equivalently, the values of the , , and coefficients of the cell. We will next describe several gadgets and, in each gadget, we are going to distinguish between three types of variables–problem, interface, and hidden variables–depending on their function. A variable can have more than one type/function. These types are the following.
Problem variables: variables in
, i.e., variables used in the original constraint (
2).
Interface variables: variables in or used for interaction and exchange of information between neighboring cells by being located on the endpoints of an edge (coupler) joining the cells.
Hidden variables: ancillary “working” variables in used to ensure that the given gadget/tile satisfies desired properties.
We define the following three types of gadgets depending on their variables and positions in the region R of cells implementing the constraint:
Internal gadget—for cells in the interior of
R and denoted by
;
Problem gadget—containing a problem variable (and the only type that contains such a variable) and denoted by
;
Boundary gadget—for cells on the boundary of the region
R of the Chimera graph implementing the constraint. The three possible orientations of this gadget will be denoted by
,
, and
.
Figure 2a,b shows an illustration of an internal type gadget. Please note that only the four couplers denoted with red lines (the slightly longer ones in b&w) can carry information to/from the neighboring cells; the other couplers between cells will have values zero, meaning they will not be used in such implementation. A problem type gadget is illustrated on
Figure 2d,e. Please note that
x is both a problem and an interface variable. The boundary gadget type is similar to the problem one, except that the
x variable is replaced by a
t variable whose linear term takes a maximum allowed value of
, forcing the variable to take value
in an optimal solution. So we can think of the problem and boundary tiles as holding an
x or
t interface variable fixed at some value:
for red, and
for green.
4.2. Tiles
A
tile is a combination of a gadget type and an assignment of values to the gadget’s variables. A tile has the same type as its corresponding gadget.
Figure 2c illustrates an internal type tile and
Figure 2f illustrates a problem type tile.
In addition to the interface and problem variables taking values that give the “color” of the tiles, the hidden variables also get values in
. We distinguish between
good and
bad tiles based on both the interface and the hidden variables. In a good tile, the values of the hidden variables of the corresponding cell
c have to minimize
. In addition, the values of the interface variables should belong to one of several predetermined patterns. Specifically,
Figure 3 illustrates all good tiles that we are going to need in order to implement the constraint (
2). As in
Figure 2, green corresponds to value −1 of the interface variable on the corresponding cell boundary, and red corresponds to value +1. For the problem tiles, the color also indicates the value of the corresponding problem variable
x. Recall that for the good tiles, the hidden
t variables are set to values that minimize the value of the cell Ising program. To denote good tile of certain type throughout the text, we will use an icon with the image of the corresponding tile as in
Figure 3, e.g.,
.
The idea of our approach is to assign gadget types to the cells of the Chimera graph so that in any variables assignment, if all tiles are good and the interface variables between neighboring cells are consistent with each other, then that assignment minimizes , if not, then the Ising program has a value that exceeds the minimum by at least . Next we outline the procedure in more detail.
4.3. From Variables Assignments to Configurations of Tiles
Given a constraint of the type (
2), determined by a pair of values
, we first assign a specific gadget type to each cell of an
region of cells
R of the Chimera graph, and then assign values to the
,
, and
coefficients for each cell
c defined by its gadget type, resulting in an Ising program
for the region
R. Determining the pattern of gadgets for
R and the coefficients of each gadget type will be detailed in the next sections.
Let us assume now we are given an arbitrary assignment
of values to the variables in the region
R. For each cell
c of
R, the projection of
to
c defines a tile that either is one of the types defined in
Figure 3, or it is a bad one. Therefore,
defines a unique tiling of
R. Thus, instead of analyzing all possible assignments
, we can study all possible tilings of
R, a much smaller problem. Moreover, our gadget and tile types will be designed in such a way that analyzing the tilings of
R can yield useful information about the minima of
. Specifically, any tiling that consists of only good tiles and such that each tile’s adjacent tiles are in appropriate colors (
good tiling) will correspond to a minimum value (optimal assignment) and, moreover, have a set of problem (
x) variable values satisfying (
2). On the other hand, any bad tiling (that has at least one bad tile or inconsistent colors of any pair of adjacent tiles) will have value at least
above the minimum and its problem variables will violate (
2). Hence, variable values satisfying (
2) will produce value of the Ising program at least
below the value of any problem variables violating (
2).
6. Solving the Optimization Problem
For each gadget type, we need to define a single-cell Ising program so that conditions (
6) are satisfied for all possible tiles
for that gadget, and the value of the gap parameter
is maximized. We will give details for the internal gadget
since it is the most interesting. For this gadget, we have 4 good tiles
,
,
, and
, and all others are bad tiles. Denote this Ising program by
and its coefficients by
(since internal gadgets do not have
variables, their coefficients in front of the
and
terms are zeros). Define sets
of all 4-tuples
of interface variables of good internal tiles (see
Figure 2) and
of all remaining 4-tuples. Then we formulate the following optimization problem.
Problem 2. Find values for the coefficients and parameters μ and γ such that satisfies the conditions
- (C1)
For all s.t. ,
- (C2)
For all ,
- (C3)
For all
and γ is maximized.
This is the same type of problem as Problem 1, but of much smaller size. We model Problem 2 as a mixed-integer programming (MIP) problem. We replace each variable
by a binary variable
using the linear transformation
. Since
is linear with respect to variables
, conditions (C2
) and (C3
) of Problem 2 can be easily encoded as linear constraints. To encode condition (C1
), we introduce, for each
, new binary variables
,
corresponding to the values
from condition (i). This introduces
binary variables. Then (C1
) is replaced by
This removes the ∃ predicate, but since the Ising program coefficients are variables, the resulting constraint is cubic (there are cubic terms of the type ).
We linearize each such product
by introducing a new real variable
and adding the constraints
and we use a similar technique to linearize quadratic terms of the type
, where
is real and
is binary.
We implemented Problem 2 using the AMPL modeling language [
27] and the Gurobi solver [
28]. The resulting linear program has 16 binary variables, 234 continuous variables, and 884 constraints. Gurobi found the optimal solution in a tiny fraction of a second, with
and
. Please note that a value
would have been sufficient, since the penalty for mismatched adjacent tiles is only 2 and the gap for the entire Ising program is the minimum of the gap between good and bad tiles and the gap between good and bad tile pairs.
Corollary 1. The Ising program defined in Section 5 is a solution of Problem 1 with .