Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex

Iqbal, Muhammad Faisal; Ahmed, Faizan

doi:10.3390/math10101683

Open AccessArticle

Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex

by

Muhammad Faisal Iqbal

¹ and

Faizan Ahmed

^2,*

¹

Department of Applied Mathematics and Statistics, Institute of Space Technology, Islamabad 44000, Pakistan

²

Formal Methods and Tools Group, University of Twente, 7522 NB Enschede, The Netherlands

^*

Author to whom correspondence should be addressed.

Mathematics 2022, 10(10), 1683; https://doi.org/10.3390/math10101683

Submission received: 1 April 2022 / Revised: 10 May 2022 / Accepted: 10 May 2022 / Published: 14 May 2022

(This article belongs to the Section E: Applied Mathematics)

Download Versions Notes

Abstract

:

In this paper, we discuss the cone of copositive tensors and its approximation. We describe some basic properties of copositive tensors and positive semidefinite tensors. Specifically, we show that a non-positive tensor (or Z-tensor) is copositive if and only if it is positive semidefinite. We also describe cone hierarchies that approximate the copositive cone. These hierarchies are based on the sum of squares conditions and the non-negativity of polynomial coefficients. We provide a compact representation for the approximation based on the non-negativity of polynomial coefficients. As an immediate consequence of this representation, we show that the approximation based on the non-negativity of polynomial coefficients is polyhedral. Furthermore, these hierarchies are used to provide approximation results for optimizing a (homogeneous) polynomial over the simplex.

Keywords:

copositive tensor; positive semidefinite tensor; sum of squares; approximation hierarchies; polynomial optimization; simplex

MSC:

15A69; 15B48; 52A27; 11E25; 90C23

1. Introduction

Multidimensional arrays or tensors arise naturally as an extension of matrices. They occur in applications where one needs to represent multidimensional data such as in signal processing [1,2,3], machine learning [2,4,5], material science [6], and speech recognition [7]. For example, any homogeneous polynomial of degree d in n-variables is associated with a symmetric tensor of order d and dimension n. In polynomial optimization and, likewise, in control theory, checking the non-negativity of a polynomial is a fundamental problem [8,9]. Nonnegativity of a polynomial over the real space or its non-negative orthant results in positive semidefinite and copositive tensors, respectively.

The copositive and completely positive cones of matrices, which are tensors of order two, are very well explored (see, e.g., [10,11,12] and also [13] for a list of open problems). Therefore, it seems natural to study similar results for copositive tensors. The generalization from matrix to tensor is not trivial since a higher dimension usually destroys the nice structure present at a lower dimension.

The current research on copositive tensor is focused on describing properties that can be generalized from the quadratic case to higher dimensional case. The area is not very well explored. Similar to its matrix analog a characterization of copositive tensors using eigenvectors of principal sub-tensors is described in [14]. Moreover, Qi and co-authors have discussed several basic properties of copositive tensors in a series of papers (see e.g., [15] and the book [16]).

The set of copositive tensors forms the copositive cone. The copositive cone is used to reformulate hard combinatorial optimization problem as a linear conic optimization problem. Reformulation of polynomial optimization as copositive program does not reduce its complexity. However, the complexity is packaged in the copositivity constraint, which is known to be NP-hard. To approximate copositive cone several tractable approximation hierarchies are developed. These approximation hierarchies are based on sum-of-squares conditions, non-negativity of polynomial coefficients, simplicial partition, or rational griding of the simplex. For instance, Parrilo [17] had provided a hierarchy of linear and semi-definite inner approximations for copositive cone (see also [18]). Moreover, Bomze and de Klerk [19] developed a hierarchy for copositive matrices based on non-negativity of polynomial coefficients.

In this paper, we describe approximation hierarchies for the cone of copositive tensors. We extend the hierarchies presented by Parrilo [17] (represented by

K_{n, d}^{r}

) an Bomze and de Kelerk [19] (represented by

C_{n, d}^{r}

) for higher dimension. Earlier work was focused on extending polynomial approximation schemes from lower to higher dimension. For example, Bomze and de Kelerk [19] focused on improving the approximation result as presented by Nesterov [20]. They first presented a polyhedral representation of

C_{n, 2}^{r}

and then used this representation to derive an approximation scheme for polynomial optimization over the simplex. The approximation scheme was extended to a higher-degree fixed-degree polynomial by De Klerk, Laurent and Parrilo [21]. However, they used rational griding of the simplex to derive this polynomial time approximation scheme. The results was further refined using the Bernstein approximation in [22]. Since these approximation schemes relied on rational griding, they have therefore also presented an error analysis based on multivariate hyper-geometric distribution (see [23] and also [24] for a note on the convergence rate).

Furthermore, we develop a compact representation of

C_{n, d}^{(r)}

with an aim to extend the initial results as presented in Bomze and de Kelerk [19]. To the best of our knowledge, the closest attempt in this direction is by [25]. However, their results are restricted to the tensor of order four. Secondly, the results rely heavily on breaking the tensor of dimension four into tensors of lower dimensions. The representation process was tedious and had no obvious generalization to higher dimension cases. Our representation is general and holds for all dimensions. These hierarchies can be used to develop polynomial time approximation schemes for polynomial optimization over the simplex. Moreover, we have provided results in this direction (see Section 5).

The main contributions of this paper are as follows. (a) To discuss basic properties of the copositive tensor cone. Moreover, we show that every Z-tensor is copositive if and only if it is positive semidefinite. (b) To describe approximation hierarchies for the copositive cone of tensors based on sum of square decomposition and non-negativity of the polynomial coefficients.Moreover, we present the polyhedral representation of the approximation hierarchical cone

C_{n, d}^{(r)}

based on non-negative coefficients. The representation is compact and has not appeared in the literature. (c) application of approximation hierarchies to the polynomial optimization over the simplex.

The article is arranged as follows. Section 2 is comprised of the basic definitions and notations. In Section 3, we define tensor cones and related results. In Section 4, we present approximation hierarchies for copositive cone of tensors. We discuss special cases and characterization of these hierarchies. Section 5 provides approximation results based on these hierarchies. In Section 6, we provide a conclusion and future directions.

2. Preliminaries

Throughout this article, the n-dimensional Euclidean space and its non-negative orthant are denoted by

R^{n}

and

R_{+}^{n}

, respectively. The set of natural numbers is denoted by

N

and the set of first n natural number is denoted by

[1 : n]

. The set of whole numbers is denoted by

N_{0} = {0, 1, 2, \dots}

while the set of first

n + 1

whole numbers is denoted by

[0 : n]

. For any

α \in {[0 : d]}^{n}

we define

| α | : = \sum_{i = 1}^{n} α_{i}

. We also define the index set

I^{n} (d) = {α \in {[0 : d]}^{n} : | α | = d} .

(1)

The cardinality of

I^{n} (d)

is

| I^{n} (d) | = (\binom{n + d - 1}{d})

(see, e.g., [22]).

A tensor is a multi-dimensional array of real numbers. Specifically, an n-dimensional dth order tensor is given by

A = {(a_{i_{_{1}} i_{_{2}} \dots i_{_{d}}})}_{_{1 \leq i_{_{1}}, \dots, i_{_{d}} \leq n}} .

Moreover, a tensor

A

is said to be symmetric if

a_{_{i_{_{1}} i_{_{2}} i_{_{3}} \dots i_{_{d}}}} = a_{_{i_{_{σ (1)}} i_{_{σ (2)}} i_{_{σ (3)}} \dots i_{_{σ (d)}}}} for all permutations σ o n {1, 2, \dots, d} .

The set of all symmetric tensors is denoted by

S_{n, d}

. For brevity of notation, if some index

i_{j}

of an element

a_{i_{_{1}} i_{_{2}} \dots i_{_{d}}} \in A

is repeated k-times, we write it as

{(i_{j})}^{k}

i.e.,

a_{_{i_{1} i_{2} \dots {(i_{j})}^{k} \dots i_{d - k}}} = a_{_{i_{1} i_{2} \dots \underset{k - t i m e s}{\underset{⏟}{i_{j} i_{j} \dots i_{j}}} \dots i_{d - k}}} .

Using above notation the i^th diagonal element of the tensor

A \in S_{n, d}

is denoted by

a_{{(i)}^{d}}

.

The inner product of two tensors

A, B \in S_{n, d}

is defined as follows

\begin{matrix} 〈A, B〉 = \sum_{i_{1}, i_{2}, \dots, i_{d} = 1}^{n} a_{i_{1}, i_{2}, \dots, i_{d}} b_{i_{1}, i_{2}, \dots, i_{d}} . \end{matrix}

(2)

For

α \in I^{^{n}} (d)

and

x \in R^{n}

,

x^{α} : = {x_{1}}^{α_{1}} {x_{2}}^{α_{2}} \dots {x_{n}}^{α_{n}} = \prod_{i = 1}^{n} x_{i}^{α_{i}}

represent a monomial. The maximum degree among all the monomials of

p (x)

is called the degree of a polynomial. A polynomial whose all monomials have the same degree is termed as a ‘form’ or a ‘homogeneous polynomial’. Moreover, for

x \in R^{n}

,

x^{d}

denotes a symmetric tensor of dimension n and degree d and is given below,

x^{d} = \underset{d - t i m e s}{\underset{⏟}{x ⊙ x ⊙ \dots ⊙ x}} = {(x_{_{i_{1}}} x_{_{i_{2}}} \dots x_{_{i_{d}}})}_{_{1 \leq i_{1}, \dots, i_{d} \leq n}} \in S_{n, d} .

Notice that, the entries of tensor

x^{d}

are the monomials of degree d in n-variables. Thus for any symmetric tensor

A \in S_{n, d}

its associated form can be written as

\begin{matrix} h_{A} (x) : = A x^{d} = \sum_{i_{1}, \dots, i_{d} = 1}^{n} a_{i_{1}, \dots, i_{d}} x_{i_{1}} \dots x_{i_{d}} = \sum_{α \in I^{n} (d)} c (α) A_{α} x^{α} . \end{matrix}

(3)

where

A_{α} = a_{i_{1}, \dots, i_{d}}

denote the coefficient of monomial

x^{α}

in

h_{_{A}} (x)

and

\begin{matrix} c (α) = \{\begin{matrix} \frac{| α |!}{\prod_{i}^{n} (α_{i}!)} & i f α \in N_{0}^{n} \\ 0 & i f α \in R^{n} \ N_{0}^{n} \end{matrix} \end{matrix}

(4)

and

α_{i}!

denotes the factorial of

α_{i}

.

For

A \in S_{n, d}

a subset

K \subseteq S_{n, d}

is said to be a cone if for each tensor

A \in K

the scalar product

λ A \in K

for all

λ \geq 0

. Moreover, the cone K is said to be a convex cone if for

A, B \in K

and for non-negative scalars

λ_{1}, λ_{2} \in R

we have,

λ_{1} A + λ_{2} B \in K

. The dual

K^{*}

of the cone K is defined as

\begin{matrix} K^{*} : = \{U \in S_{n, d} : 〈U, V〉 \geq 0, for all V \in K\} . \end{matrix}

(5)

A convex cone K is said to be pointed if

{K} \cap {- K} = {0}

, and K is said to be solid if its interior is nonempty. A convex cone which is closed, pointed, and solid is termed as proper cone. A convex cone K is said to be a polyhedral cone if it is finitely generated.

The cone of entry-wise non-negative tensors is denoted by

N_{n, d}

. Finally, a tensor with non-positive off-diagonal entries is termed as Z-tensor (it is also called essentially non-positive tensor see e.g., [26]). Finally, the standard simplex

Δ_{n}

in n-dimensional Euclidean space

R^{n}

is

Δ_{n} : = \{x \in R^{n} : e^{T} x = 1\}

(6)

where

e^{T} = [1, 1, \dots, 1] \in R^{n}

.

3. Positive Semidefinite Tensors, Copositive Tensors and Their Duals

The cone of dth-order (with d even) n-dimensional, positive semidefinite tensors is denoted by

S_{n, d}^{+}

and is given below

\begin{matrix} S_{n, d}^{+} & : = \{A \in S_{n, d} : h_{A} (x) = A x^{d} \geq 0, \forall x \in R^{n}\} . \end{matrix}

(7)

For a tensor

A \in S_{n, d}^{+}

, the polynomial

h_{_{A}} (x)

is called PSD polynomial. The dual of PSD cone

S_{n, d}^{+}

is the cone of completely positive semidefinite tensors, denoted by

S_{n, d}^{+^{*}}

, which is defined as.

\begin{matrix} S_{n, d}^{+^{*}} : = \{X \in S_{n, d} : 〈A, X〉 \geq 0, \forall A \in S_{n, d}^{+}\} = \{\sum_{k = 1}^{N} {(x_{_{k}})}^{d} : x_{_{k}} \in R^{n}, N \in N\} . \end{matrix}

(8)

For

d = 2

, PSD cone is self dual i.e.,

S_{n, 2}^{+} = S_{n, 2}^{+^{*}}

(cf. [16]). However, for

d \geq 4

we have

S_{n, d}^{+} \neq S_{n, d}^{+^{*}}

in general (see [27], Example 4.5).

It is well known that a function is convex if and only if its Hessian matrix is positive semidefinite (see, e.g., [28], Theorem 4.5). Therefore the convexity of homogeneous polynomial defined in (3) amounts to checking if

▿^{2} h_{A} (x) \in S_{n, 2}^{+}

. It has been shown that if a polynomial

h_{A} (x)

is convex then its associated tensor

A

is positive semidefinite (see [27], Proposition 5.10). However, the converse need not to be true in general (see [27] (Example 5.11) and [29]).

A tensor

A \in S_{n, d}

is said to be copositive if

h_{A} (x) = A x^{d} \geq 0

for all

x \in R_{+}^{n}

, whereas a tensor is called strictly copositive if

h_{A} (x) = A x^{d} > 0

for all

x \in R_{+}^{n} \ {0}

. The set of n-dimensional, d^th-order copositive tensors defines a cone given below,

\begin{matrix} C_{n, d} : = \{A \in S_{n, d} : A x^{d} \geq 0, \forall x \in R_{+}^{n}\} . \end{matrix}

(9)

It is well known that, a tensor

A \in i n t (C_{n, d})

if and only if it is strictly copositive, where

i n t (C_{n, d})

denotes the interior of the set

C_{n, d}

. The dual of copositive cone

C_{n, d}

is completely positive cone denoted by

C_{n, d}^{*}

(see e.g., [16] (Theorem 6.9), [30]), which is defined below,

\begin{matrix} C_{n, d}^{*} & = \{x^{d} \in S_{n, d} : 〈A, x^{d}〉 \geq 0, \forall A \in C_{n, d} and x \in R_{+}^{n}\} \end{matrix}

(10)

\begin{matrix} = \{\sum_{k = 1}^{N} {(x_{k})}^{d} : x_{k} \in R_{+}^{n}, N \in N\} . \end{matrix}

(11)

It is clear that if a tensor is positive semidefinite then it is copositive also. A copositive tensor need not to be positive semidefinite (cf. Example 1). The question arises in which case a copositive tensor is also positive semidefinite. In the following theorem, we describe one such case (for

d = 2

see [31], Lemma 2.6).

Theorem 1.

Let

A \in S_{n, d}

be a Z-tensor, then

A

is copositive if and only if it is positive semidefinite, where d is even.

Proof.

Let us take

A \in C_{n, d}

and since

A

is Z-tensor, we can write

A = P - Q

where

P, Q \in N_{n, d}

such that,

\begin{matrix} \begin{matrix} p_{_{i_{1} \dots i_{d}}} : = \{\begin{matrix} \begin{matrix} a_{{(i)}^{d}} & if i_{1} = \dots = i_{d} = i \\ 0 & otherwise \end{matrix} \end{matrix} & q_{i_{1} \dots i_{d}} : = \{\begin{matrix} \begin{matrix} 0 & if i_{1} = \dots = i_{d} = i \\ | a_{_{i_{1} \dots i_{d}}} | & otherwise \end{matrix} \end{matrix} \end{matrix} \end{matrix}

(12)

To show that

A \in S_{n, d}^{+}

take

x \in R^{n}

and consider,

\begin{matrix} h_{A} (x) & = A x^{d} = (P - Q) x^{d} = P x^{d} - Q x^{d} \forall x \in R^{n} . \end{matrix}

(13)

Since d is even, we have

h_{P} (x) : = P x^{d} = \sum_{i}^{n} a_{{(i)}^{d}} x_{i}^{d} \geq 0

for all

x \in R^{n}

. However,

h_{Q} (x) : = Q x^{d}

can be positive or negative. Note that if

h_{Q} (x) \leq 0

for some

x \in R^{n}

then clearly from (13) we have,

h_{A} (x) = P x^{d} + | Q x^{d} | \geq 0

.

So, the only case left is when

h_{Q} (x) \geq 0

for some

x \in R^{n}

. To show that

h_{A} (x) \geq 0

in this case also, we define

x^{+} \in R_{+}^{n}

such that

\begin{matrix} x_{i}^{+} = \{\begin{matrix} x_{i} & x_{i} \geq 0 \\ - x_{i} & x_{i} < 0 \end{matrix} \end{matrix}

. To show that

h_{Q} (x) \leq h_{Q} (x^{+})

consider,

S : = \{α \in I^{n} (d) : x^{α} < 0\}

and note that for

α \in S

we have

x^{α} = - {(x^{+})}^{α}

. Clearly, we have

\begin{matrix} 0 \leq h_{Q} (x) & = \sum_{α \in I^{n} (d)} c (α) Q_{α} x^{α} = \sum_{α \in I^{n} (d) ∖ S} c (α) Q_{α} x^{α} + \sum_{α \in S} c (α) Q_{α} x^{α} \\ = \underset{\geq 0}{\underset{⏟}{\sum_{α \in I^{n} (d) ∖ S} c (α) | A_{α} | {(x^{+})}^{α}}} - \underset{\geq 0}{\underset{⏟}{\sum_{α \in S} c (α) | A_{α} | {(x^{+})}^{α}}} \\ \leq \sum_{α \in I^{n} (d) ∖ S} c (α) | A_{α} | {(x^{+})}^{α} + \sum_{α \in S} c (α) | A_{α} | {(x^{+})}^{α} = h_{Q} (x^{+}) . \end{matrix}

(14)

Since d is even therefore we have

h_{P} (x) = h_{P} (x^{+})

. Furthermore,

A

is copositive and

x^{+} \in R_{+}^{n}

implying

h_{P} (x^{+}) \geq h_{Q} (x^{+})

. Thus, we have

h_{P} (x) = h_{P} (x^{+}) \geq h_{Q} (x^{+}) \geq h_{Q} (x) .

(15)

From (13) and (15) we deduce that

h_{_{A}} (x) \geq 0

in this case also. Hence,

A

is positive semidefinite.

Converse is obvious since every positive semidefinite tensor is also copositive. □

Note that the above result also appeared in Zhang et al. [32] (Theorem 3.5(e) and Theorem 3.12), where the proof is constructed based on the spectral properties of the so called M-tensors. The proof given above is self-contained and does not require any extra structure.

4. Approximation Hierarchies for the Copositive Cone

Recall that a tensor

A \in S_{n, d}

is copositive if

h_{A} (x) : = \sum_{α \in I^{n} (d)} A_{α} x^{α} \geq 0

for all

x \in R_{+}^{n}

. Notice that for any

x \in R_{+}^{n}

, we can write

x = z \circ z \in R_{+}^{n}

for some

z \in R^{n}

, where ∘ indicates the component wise (Hadamard) product, giving

\begin{matrix} h_{A} (z \circ z) : = \sum_{α \in I^{n} (d)} A_{α} {(z \circ z)}^{α} = \sum_{i_{1}, \dots, i_{d} = 1}^{n} a_{_{i_{1} i_{2} \dots i_{d}}} z_{i_{1}}^{2} z_{i_{2}}^{2} \dots z_{i_{d}}^{2} . \end{matrix}

(16)

Thus, the copositivity condition translates to

h_{A} (z \circ z) \geq 0

for all

z \in R^{n}

, for which a sufficient condition is that if (16) can be written as a sum of squares (SOS). Let us illustrate this with an example:

Example 1.

Consider a tensor

A \in S_{2, 4}

such that

a_{1111} = 0, a_{1112} = 1 / 4, a_{1122} = 1 / 6, a_{1222} = - 1 / 2, a_{2222} = 1

. Then for

x \in R^{2}

, we have

h_{A} (x) : = {(x_{1} - x_{2})}^{2} x_{2}^{2} + x_{1}^{3} x_{2} = x_{1}^{3} x_{2} + x_{1}^{2} x_{2}^{2} - 2 x_{1} x_{2}^{3} + x_{2}^{4} .

Clearly,

A

is not positive semidefinite since

h_{A} ({[- 5, 2]}^{T}) = - 54

. Let’s consider

x = z \circ z \in R_{+}^{2}

where

z \in R^{2}

with

x_{1} = z_{1}^{2}, x_{2} = z_{2}^{2}

, then we have,

\begin{matrix} h_{A} (z \circ z) = z_{1}^{6} z_{2}^{2} + z_{1}^{4} z_{2}^{4} - 2 z_{1}^{2} z_{2}^{6} + z_{2}^{8} = {((z_{1}^{2} - z_{2}^{2}) z_{2}^{2})}^{2} + {(z_{1}^{3} z_{2})}^{2} . \end{matrix}

(17)

Thus,

A

is copositive.

In the above example

h_{A} (z \circ z)

can be written as sum of squares. However, this is not the case in general (see Example 2). Therefore, to develop higher order sufficient conditions, the following polynomial, introduced by Parrilo [17], is most often used.

\begin{matrix} P^{(r)} (z) : = h_{A} (z \circ z) {(\sum_{k = 1}^{n} z_{k}^{2})}^{r} f o r a l l z \in R^{n} . \end{matrix}

(18)

Clearly,

P^{(r)} (z)

is a polynomial of degree

2 (r + d)

. Based on (18), one can define two cone approximations for the copositive cone (as we will see),

\begin{matrix} K_{n, d}^{(r)} & = \{A \in S_{n, d} : P^{(r)} (z) has an SOS decomposition\} \end{matrix}

(19)

\begin{matrix} C_{n, d}^{(r)} & = \{A \in S_{n, d} : P^{(r)} (z) has non - negative coefficients\} . \end{matrix}

(20)

For the tensor

A \in S_{2, 4}

given in Example 1, we have

A \in K_{2, 4}^{(0)}

. It is clear from

()

that

h_{A} (z \circ z)

has a negative coefficient implying

A \notin C_{n, d}^{(0)}

. However, one can show that

A \in C_{2, 4}^{(11)}

.

Inclusions

K_{n, d}^{(r)} \subseteq K_{n, d}^{(r + 1)}

and

C_{n, d}^{(r)} \subseteq C_{n, d}^{(r + 1)}

for

r \in N_{0}

are evident from the following,

\begin{matrix} P^{(r + 1)} (z) & = h_{_{A}} (z \circ z) {(\sum_{k = 1}^{n} z_{k}^{2})}^{r + 1} = (\sum_{k = 1}^{n} z_{k}^{2}) (h_{_{A}} (z \circ z) {(\sum_{k = 1}^{n} z_{k}^{2})}^{r}) \end{matrix}

(21)

\begin{matrix} = (\sum_{k = 1}^{n} z_{k}^{2}) P^{(r)} (z) . \end{matrix}

(22)

The cone hierarchies

K_{n, d}^{(r)}

and

C_{n, d}^{(r)}

approximates the copositive cone from the inside. For instance if

A \in K_{n, d}^{(r)}

then

P^{(r)} (z)

being sum of squares is non-negative i.e.,

P^{(r)} (z) \geq 0

for all

z \in R^{n}

, which in turn imply that

h_{_{A}} (z \circ z) \geq 0

for all

z \in R^{n}

, hence

A \in C_{n, d}

. Similarly, one can show that if

A \in C_{n, d}^{(r)}

for some

r \in N_{0}

then

A

is copositive. Thus, we have

\cup_{r = 0}^{\infty} K_{n, d}^{(r)} \subseteq C_{n, d} and \cup_{r = 0}^{\infty} C_{n, d}^{(r)} \subseteq C_{n, d} .

(23)

Referring to Polya’s theorem (see e.g., [25], Theorem 2.1), which states that for a tensor

A \in i n t (C_{n, d})

there exists a large enough r such that

A \in C_{n, d}^{(r)}

. This further implies that, for some

r \in N_{0}

the strictly copositive tensor

A \in i n t (C_{n, d})

allows

P^{(r)} (z)

to have sum of squares decomposition, that is

A \in K_{n, d}^{(r)}

. Therefore, the infinite union of these cones contains the interior of copositive cone i.e.,

i n t (C_{n, d}) \subseteq ⋃_{r = 0}^{\infty} C_{n, d}^{(r)} and i n t (C_{n, d}) \subseteq ⋃_{r = 0}^{\infty} K_{n, d}^{(r)} .

(24)

For the tensor

A \in S_{2, 4}

as given in Example 1 it holds

A \notin i n t (C_{2, 4})

since

h_{_{A}} ({[1, 0]}^{T}) = 0

. Thus, neither

\cup_{r = 0}^{\infty} K_{n, d}^{(r)} ⊈ i n t (C_{n, d})

nor

\cup_{r = 0}^{\infty} C_{n, d}^{(r)} ⊈ i n t (C_{n, d})

holds.

4.1. The Case $R = 0$

The case

r = 0

is interesting and require further exploration. It is clear that

C_{n, d}^{(0)} = N_{n, d}

, which require no further exploration. For

d = 2

, the tensor

A \in K_{n, 2}^{(0)}

is often characterized in terms of the decomposition

A = S + T

, where

S \in S_{n, 2}^{+}

and

T \in N_{n, 2}

. However, for

d > 4

only one direction is possible, as shown below.

Theorem 2.

Let

A \in S_{n, d}

and if

A \in K_{n, d}^{(0)}

then

A = S + T

, where

S \in S_{n, d}^{+}

and

T \in N_{n, d}

i.e.,

K_{n, d}^{(0)} \subseteq S_{n, d}^{+} + N_{n, d}

.

Proof.

The proof for the matrix case can be easily generalized to accommodate for higher order tensors see e.g., ([19], Theorem 2.1). □

The converse of the above theorem is not true in general. The following is a counter example,

Example 2.

Let

A \in S_{3, 6}

be such that

\begin{matrix} a_{_{i_{_{1}} i_{_{2}} i_{_{3}} i_{_{4}} i_{_{5}} i_{_{6}}}} = \{\begin{matrix} 1 & i_{_{1}} = i_{_{2}} = \dots = i_{_{6}} \in {1, 2, 3} \\ 1 / 30 & i_{_{1}} = i_{_{2}} = i, i_{_{3}} = i_{_{4}} = j, i_{_{5}} = i_{_{6}} = k, i \neq j \neq k \in {1, 2, 3} \\ - 1 / 15 & i_{_{1}} = i_{_{2}} = i_{_{3}} = i_{_{4}} = i, i_{_{5}} = i_{_{6}} = j, i \neq j \in {1, 2, 3} \\ 0 & otherwise \end{matrix} \end{matrix}

(25)

The associated polynomial is given by (cf. (3))

\begin{matrix} \begin{matrix} h_{A} (x) = x_{1}^{6} + x_{2}^{6} + x_{3}^{6} + 3 x_{1}^{2} x_{2}^{2} x_{3}^{2} - (x_{1}^{4} x_{2}^{2} + x_{1}^{4} x_{3}^{2} + x_{1}^{2} x_{2}^{4} + x_{1}^{2} x_{3}^{4} + x_{2}^{4} x_{3}^{2} + x_{2}^{2} x_{3}^{4}) \end{matrix} \end{matrix}

(26)

The polynomial given in (26) is the well known Robinson’s polynomial [33]. It is well known that

h_{A} (x) \geq 0

for all

x \in R^{3}

. Therefore, trivially

A

can be written as sum of positive semidefinite and non-negative (actually zero) tensor. It is also well known that Robinson polynomial cannot be written as SOS (see [33] for a proof) i.e.,

A \notin K_{3, 6}^{(0)}

.

To find special cases in which the converse of Theorem 2 holds note that from

A = S + T

, we have

\begin{matrix} h_{_{A}} (z \circ z) = 〈(S + T), {(z \circ z)}^{d}〉 = \underset{= h_{S} (z \circ z)}{\underset{⏟}{〈S, {(z \circ z)}^{d}〉}} + 〈T, {(z \circ z)}^{d}〉 \forall z \in R^{n} \end{matrix}

(27)

Clearly,

〈T, {(z \circ z)}^{d}〉

is SOS since

T

is non-negative. Thus, if

h_{S} (z \circ z)

can be written as SOS the converse of the Theorem 2 holds. It is well known that a matrix (i.e.,

d = 2

) is positive semidefinite if an only if its associated polynomial can be written as sum of squares. Therefore, the converse of Theorem 2 holds for

d = 2

(see [19] (Theorem 2.1) for a proof). Moreover, for

n = 2

and

d = 4

a tensor is positive semidefinite if and only if its associated form is sum-of-squares, see, e.g., [33]. Therefore, the converse of Theorem 2 holds in this case also. We close this discussion by showing that the converse of Theorem 2 holds for the Z-tensors.

Theorem 3.

Let

A \in S_{n, d}

be a Z-tensor and

A = S + T

, where

S \in S_{n, d}^{+}

and

T \in N_{n, d}

then

A \in K_{n, d}^{(0)}

.

Proof.

Note that since

A

is a Z-tensor and

T

is non-negative therefore

S

is also a Z-tensor. It is well-known that a Z-tensor is positive semidefinite if and only if it is sum of squares. (see e.g., [26] (Proposition 2.1), [34] (Theorem 11)). Hence (27) can be written as sum of square. □

4.2. Characterization of $K_{n, d}^{(r)}$

In this subsection, we formulate a characterization for

K_{n, d}^{(r)}

. Before presenting this characterization, we provide bounds on the value of coefficients for the polynomial

P^{(r)} (z)

. For this consider,

\begin{matrix} P^{(r)} (z) & = h_{A} (z \circ z) {(\sum_{k = 1}^{n} z_{k}^{2})}^{r} = (\sum_{i_{1}, \dots, i_{d} = 1}^{n} a_{i_{1} i_{2} \dots i_{d}} z_{i_{1}}^{2} z_{i_{2}}^{2} \dots z_{i_{d}}^{2}) (\sum_{α \in I^{n} (r)}^{} c (α) z^{2 α}) \end{matrix}

(28)

Note that

z_{i_{j}}^{2}

can be written as

z^{2 e_{i_{j}}}

where

e_{i_{j}}

is a unit vector and

1 \leq j \leq n

. Using this notation (28) can be written as,

\begin{matrix} P^{(r)} (z) & = (\sum_{i_{1}, \dots, i_{d} = 1}^{n} a_{i_{1} \dots i_{d}} z^{2 e_{i_{1}}} z^{2 e_{i_{2}}} \dots z^{2 e_{i_{d}}}) (\sum_{α \in I^{n} (r)}^{} c (α) z^{2 α}) \end{matrix}

(29)

Denote by

θ : = (e_{i_{_{1}}} + e_{i_{_{2}}} \dots + e_{i_{_{d}}}) \in I^{n} (d)

, then

ω : = α + θ \in I^{n} (d + r)

, and with these notations we can write (29) as follows,

\begin{matrix} P^{(r)} (z) & = \sum_{ω \in I^{n} (r + d)} [B_{_{ω}} (A)] z^{2 ω} where B_{ω} (A) : = \sum_{θ \in I^{n} (d)} (\frac{r!}{\prod_{i = 1}^{n} (ω_{i} - θ_{i})!}) c (θ) A_{θ} \end{matrix}

(30)

where

ω_{i} - θ_{i} \geq 0

for all

i = 1, \dots, n

.

Note that the maximum value of

\prod_{i = 1}^{n} (ω_{i} - θ_{i})!

is

r!

(Observe that

\sum_{i}^{n} (ω_{i} - θ_{i}) = r + d - d = r

. Take

r_{i} = ω_{i} - θ_{i}

then maximum value occurs when

r_{i} = r

and

r_{j} = 0

for all

i \neq j

). Moreover, the minimum value of

\prod_{i = 1}^{n} (ω_{i} - θ_{i})!

occurs when

ω_{i} - θ_{i} \in {0, 1}

for all

i \in [1 : n]

and the minimum value is 1. Thus, we have

1 \leq \frac{r!}{\prod_{i = 1}^{n} (ω_{i} - θ_{i})!} \leq r!, where ω_{i} - θ_{i} \geq 0 \forall i \in [1 : n] .

(31)

To show that the upper bound in (31) is sharp take

n = r + d

and

ω = {(1, 1, \dots, 1)}^{T} \in I^{n} (d + r)

then for

θ_{i} = i, 1 \leq i \leq d

, we have

(ω_{i} - θ_{i}) \in {0, 1}

, thus the denominator reduces to 1 and the upper bound is exactly

r!

. In the following theorem, we describe a characterization of

K_{n, d}^{(r)}

.

Theorem 4

(cf. [19] for

d = 2

). The tensor

A \in K_{n, d}^{(r)}

if and only if there exists a PSD matrix

\tilde{M} \in S_{_{{\tilde{n}}_{_{r}}, 2}}^{+}

associated with the polynomial (18)

P^{(r)} (z) = {\tilde{z}}^{T} \tilde{M} \tilde{z} where \tilde{z} = {[z^{α}]}_{_{α \in I^{^{n}} (d + r)}} \in R^{{\tilde{n}}_{r}} with {\tilde{n}}_{r} = (\begin{matrix} n + r + d - 1 \\ r + d \end{matrix})

such that

\begin{matrix} \sum_{(β_{_{i}}, β_{_{j}}) \in I^{^{n}} (d + r) \times I^{^{n}} (d + r) : β_{_{i}} + β_{_{j}} = 2 ω} {\tilde{m}}_{_{i, j}} & : = B_{_{ω}} (A) for all ω \in I^{^{n}} (d + r) \end{matrix}

(32)

\begin{matrix} \sum_{(β_{_{i}}, β_{_{j}}) \in I^{^{n}} (d + r) \times I^{^{n}} (d + r) : β_{_{i}} + β_{_{j}} = u} {\tilde{m}}_{_{i, j}} & : = 0 for all u \in I^{^{n}} (2 d + 2 r) \ 2 I^{^{n}} (d + r) . \end{matrix}

(33)

Proof.

The proof is an easy generalization of matrix case (Theorem 2.2, [19]), which is presented here for the sake of completeness. Let us consider the polynomial

P^{(r)} (z)

described in (30) and using its matrix formulation as follows

\begin{matrix} P^{(r)} (z) = {\tilde{z}}^{T} \tilde{M} \tilde{z} where \tilde{z} = {[z^{α}]}_{_{α \in I^{^{n}} (d + r)}} \in R^{{\tilde{n}}_{r}} with {\tilde{n}}_{r} = (\begin{matrix} n + d + r - 1 \\ d + 1 \end{matrix}) \end{matrix}

(34)

Taking

A \in K_{n, d}^{(r)}

implies that its associate polynomial

P^{(r)} (z)

allows a sum of squares decomposition, which further implies that the matrix

\tilde{M}

is PSD. For converse we assume that, the matrix

\tilde{M}

in (34) is PSD. So the following decomposition is evident

\begin{matrix} \tilde{M} = \sum_{k = 1}^{m_{r}} {\tilde{u}}_{k} {\tilde{u}}_{k}^{T} where m_{r} is the rank of \tilde{M} . \end{matrix}

(35)

Combining (34) and (35) gives,

\begin{matrix} P^{(r)} (z) & = \sum_{k = 1}^{m_{r}} {({\tilde{u}}_{k}^{T} \tilde{z})}^{T} ({\tilde{u}}_{k}^{T} \tilde{z}) where \tilde{z} = {[z^{α}]}_{_{α \in I^{^{n}} (d + r)}} \in R^{{\tilde{n}}_{r}} \end{matrix}

(36)

\begin{matrix} = \sum_{k = 1}^{m_{r}} {({\tilde{u}}_{k}^{T} \tilde{z})}^{2} where \tilde{z} = {[z^{α}]}_{_{α \in I^{^{n}} (d + r)}} \in R^{{\tilde{n}}_{r}} \end{matrix}

(37)

By comparing (30) and (34), we obtain the following

\begin{matrix} {\tilde{z}}^{T} \tilde{M} \tilde{z} = \sum_{ω \in I^{^{n}} (d + r)} [B_{_{ω}} (A)] z^{^{2 ω}} \end{matrix}

(38)

Comparing the coefficient of

z^{2 ω}

in (38) gives,

\begin{matrix} \sum_{(β_{_{i}}, β_{_{j}}) \in I^{^{n}} (d + r) \times I^{^{n}} (d + r) : β_{_{i}} + β_{_{j}} = 2 ω} {\tilde{m}}_{_{i, j}} = B_{_{ω}} (A) for all ω \in I^{^{n}} (d + r) . \end{matrix}

(39)

However, for

u \in I^{^{n}} (2 d + 2 r) \ 2 I^{^{n}} (d + r)

the coefficients of

z^{u}

on R.H.S. of (38) are all zero, thus we have

\sum_{(β_{_{i}}, β_{_{j}}) \in I^{^{n}} (d + r) \times I^{^{n}} (d + r) : β_{_{i}} + β_{_{j}} = u} {\tilde{m}}_{_{i, j}} = 0 for all u \in I^{^{n}} (2 d + 2 r) \ 2 I^{^{n}} (d + r) .

Hence the proof is complete. □

From Theorem 4, it is clear that the matrix

\tilde{M}

is sparse. To deal with the sparsity Ahmadi and Majumdar [35] has introduced polyhedral approximations using the observation that a diagonally dominant matrix is positive semidefinite.

4.3. Polyhedral Characterization of $C_{n, d}^{(r)}$

In this section, we present a compact representation for the cone

C_{n, d}^{(r)}

. The representation is useful in deducing polyhedral characterization of

C_{n, d}^{(r)}

. For this, recall that for any

ω \in I^{n} (r + d)

(30) can be re-written as follows

\begin{matrix} B_{_{ω}} (A) & = \sum_{θ \in I^{n} (d)} c (θ) c (ω - θ) A_{θ} \\ = \sum_{θ \in I^{n} (d)} c (θ) \frac{(| ω - θ |)!}{(ω_{1} - θ_{1})! (ω_{2} - θ_{2})! \dots (ω_{n} - θ_{n})!} A_{θ} \\ = r! \sum_{θ \in I^{n} (d)} c (θ) \frac{\begin{matrix} [ω_{1} (ω_{1} - 1) \dots (ω_{_{1}} - (θ_{_{1}} - 1))] \dots [ω_{_{n}} (ω_{_{n}} - 1) \dots (ω_{_{n}} - (θ_{_{n}} - 1))] \end{matrix}}{(ω_{_{1}})! \dots (ω_{_{n}})!} A_{θ} \\ = r! \sum_{θ \in I^{n} (d)} c (θ) \prod_{k = 1}^{n} \frac{\begin{matrix} [ω_{k} (ω_{k} - 1) \dots (ω_{_{k}} - (θ_{_{k}} - 1))] \end{matrix}}{(ω_{_{k}})!} A_{θ} \\ = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} \sum_{θ \in I^{n} (d)} c (θ) A_{θ} (\prod_{k = 1}^{n} [ω_{k} (ω_{k} - 1) \dots (ω_{_{k}} - (θ_{_{k}} - 1))]) \end{matrix}

(40)

Note that the first equality follows by using the definition of polynomial coefficient (see (4)). Recognize that the product

[ω_{k} (ω_{k} - 1) \dots (ω_{_{k}} - (θ_{_{k}} - 1))]

is linked with the falling factorials. The falling factorials can be represented using the Stirling number of the first kind as follows

ω_{k} (ω_{k} - 1) \dots (ω_{_{k}} - (θ_{_{k}} - 1)) = \sum_{m = 0}^{θ_{_{k}}} s (θ_{_{k}}, m) ω_{_{k}}^{m}

(41)

where

s (θ_{_{k}}, m) : = {(- 1)}^{θ_{k} - m} [\begin{matrix} θ_{_{k}} \\ m \end{matrix}]

is well known Stirling number of the first kind (see e.g., [36], Chapter 6.1). Using (41) in (40) gives

\begin{matrix} B_{_{ω}} (A) & = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} \sum_{θ \in I^{n} (d)} c (θ) A_{θ} (\prod_{k = 1}^{n} \sum_{m = 0}^{θ_{_{k}}} s (θ_{_{k}}, m) ω_{_{k}}^{m}) \end{matrix}

(42)

For

θ \in I^{n} (d)

and

α \in I^{n} (t)

the inequality

α \leq θ

is defined to hold element wise i.e., the inequality is true if

α_{i} \leq θ_{i}

for all

1 \leq i \leq n

. Furthermore,

s (θ, α) = s (θ_{1}, α_{1}) s (θ_{2}, α_{2}) \dots s (θ_{2}, α_{2})

. Observe that, if

θ_{k} = m

then

[\begin{matrix} θ_{_{k}} \\ m \end{matrix}] = 1

and also

[\begin{matrix} θ_{_{k}} \\ m \end{matrix}] = 0

if

θ_{k} < m

. The observation leads to the following simplification for each

θ \in I^{n} (d)

and

α \in I^{n} (t)

for all

t \in [0 : d]

\begin{matrix} \prod_{k = 1}^{n} \sum_{m = 0}^{θ_{_{k}}} s (θ_{_{k}}, m) ω_{_{k}}^{m} = & = \sum_{m_{1}, m_{2}, \dots, m_{n} = 0}^{θ_{_{1}}, θ_{_{2}}, \dots, θ_{_{n}}} s (θ_{_{1}}, m_{1}) s (θ_{_{k}}, m_{2}) \dots s (θ_{_{n}}, m_{n}) ω_{_{1}}^{m_{1}} ω_{_{2}}^{m_{2}} \dots ω_{_{n}}^{m_{n}} \\ = \sum_{α \leq θ}^{} s (θ, α) ω^{α} \end{matrix}

(43)

Combining (42) and (43) leads to

\begin{matrix} B_{_{ω}} (A) & = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} \sum_{θ \in I^{n} (d)} c (θ) A_{θ} (\sum_{\begin{matrix} α \in I^{n} (t), t \in [0 : d] \\ α \leq θ \end{matrix}}^{} s (θ, α) ω^{α}) \end{matrix}

(44)

We define the tensors

S (t)

and

W (t)

of order

| α | = t

and dimension n as follows,

S (t) = {(s (θ, α))}_{_{α \in I^{^{n}} (t)}} a n d W (t) = {(ω^{^{α}})}_{_{α \in I^{^{n}} (t)}} for all t \in [0 : d] .

(45)

Remark 1.

Interestingly for each

α \in I^{n} (d)

we have

\begin{matrix} {(S (d))}_{α} : = \{\begin{matrix} 1 & i f α = θ \\ 0 & o t h e r w i s e \end{matrix} . \end{matrix}

(46)

Consequently, the above notations lead to the following simplified representation of (44)

\begin{matrix} B_{_{ω}} (A) & = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} \sum_{θ \in I^{n} (d)} c (θ) A_{θ} (\sum_{t = 0}^{d} {〈S (t), W (t)〉}_{θ}) . \end{matrix}

(47)

Finally, we define the notation

Y_{θ}^{t} : = {〈S (t), W (t)〉}_{θ}

. The notation leads to define the tensor

Y^{t} : = {(Y_{θ}^{t})}_{θ \in I^{n} (d)}

(48)

of order d and dimension n, whose entries are either forms of degree t or zero that is,

Y_{θ}^{t} : = \{\begin{matrix} \sum_{α \in I^{n} (t)} s (θ, α) ω^{α} & α \leq θ \\ 0 & o t h e r w i s e \end{matrix}

(49)

Note that

Y_{θ}^{t} \in R

and

Y^{t} \in S_{n, d}

. Moreover, from (49) one can easily assert that

Y_{θ}^{d} = ω^{d}

.

Remark 2.

Recall that

θ \in I^{n} (d)

implying θ can be written as a linear combination of unit vectors as follows

θ = β_{1} e_{1} + \dots + β_{n} e_{n}

,

β \in {[0 : d]}^{n}

. Since,

θ \in I^{n} (d)

the maximum number of non-zero elements of β will be

m i n {d, n}

. Similarly, for

α \in I^{n} (t)

, we can have at most

m i n {n, t}

non-zero coefficients in the linear combination of the basis. Based on this observation we can obtain an explicit representation of the tensor

Y^{t} \in S_{n, d}

for all

t \in [1 : d - 1]

. Thus, for each

θ \in I^{n} (d)

the entries of tensor

Y^{t}

are described as

\begin{matrix} Y_{θ}^{t} : = \{\begin{matrix} s (d, t) ω_{i}^{t} & i f \begin{matrix} t e_{i} = α \leq θ = d e_{i} \\ \forall i \in [1 : n] \end{matrix} \\ \sum_{\begin{matrix} α_{i_{1}}, α_{i_{2}} = 1 \\ s . t . α_{i_{1}} + α_{i_{2}} = t \end{matrix}}^{t} s (θ_{i_{1}}, α_{i_{1}}) s (θ_{i_{2}}, α_{i_{2}}) ω_{i_{1}}^{α_{i_{1}}} ω_{i_{2}}^{α_{i_{2}}} & i f \{\begin{matrix} α \leq θ \\ α = α_{i_{1}} e_{i_{1}} + α_{i_{2}} e_{i_{2}} \\ θ = θ_{i_{1}} e_{i_{1}} + θ_{i_{2}} e_{i_{2}} \\ \forall i_{1} \neq i_{2} \in [1 : n] \end{matrix}\} \\ \sum_{\begin{matrix} α_{i_{1}}, α_{i_{2}}, α_{i_{3}} = 1 \\ s . t . α_{i_{1}} + α_{i_{2}} + α_{i_{3}} = t \end{matrix}}^{t} s (θ_{i_{1}}, α_{i_{1}}) s (θ_{i_{2}}, α_{i_{2}}) s (θ_{i_{3}}, α_{i_{3}}) ω_{i_{1}}^{α_{i_{1}}} ω_{i_{2}}^{α_{i_{2}}} ω_{i_{3}}^{α_{i_{3}}} & i f \{\begin{matrix} α \leq θ \\ α = α_{i_{1}} e_{i_{1}} + α_{i_{2}} e_{i_{2}} + α_{i_{3}} e_{i_{3}} \\ θ = θ_{i_{1}} e_{i_{1}} + θ_{i_{2}} e_{i_{2}} + θ_{i_{3}} e_{i_{3}} \\ \forall i_{1} \neq i_{2} \neq i_{3} \in [1 : n] \end{matrix}\} \\ ⋮ & ⋮ \\ \sum_{\begin{matrix} α_{i_{1}}, \dots, α_{i_{m}} = 1 \\ s . t . α_{i_{1}} + \dots + α_{i_{m}} = t \\ w h e r e m : = min {t, n} \end{matrix}}^{t} s (θ_{i_{1}}, α_{i_{1}}) \dots s (θ_{i_{m}}, α_{i_{m}}) ω_{i_{1}}^{α_{i_{1}}} \dots ω_{i_{m}}^{α_{i_{m}}} & i f \{\begin{matrix} α \leq θ \\ α = α_{i_{1}} e_{i_{1}} + \dots + α_{i_{m}} e_{i_{m}} \\ θ = θ_{i_{1}} e_{i_{1}} + \dots + θ_{i_{m}} e_{i_{m}} \\ \forall i_{1} \neq \dots \neq i_{3} \in [1 : n] \end{matrix}\} \\ 0 & o t h e r w i s e \end{matrix} \end{matrix}

(50)

Thus, from (45) and (48) one obtain the following notionally convenient formulation of (44),

\begin{matrix} B_{_{ω}} (A) & = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} (〈A, Y^{0} + Y^{1} + \dots + Y^{d - 1} + Y^{d}〉) \\ = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} (〈A, ω^{d}〉 + \sum_{t = 0}^{(d - 1)} 〈A, Y^{t}〉) . \end{matrix}

(51)

Remark 3.

For sanity check we consider a special case, the tensor of all ones

A = E

, that is

a_{_{i_{1}, \dots, i_{d}}} = 1

for all

i_{1}, i_{2}, \dots, i_{d} \in [1 : n]

. Note that

〈 E, ω^{d} 〉 = {| ω |}^{d}

and from (51) we have

\begin{matrix} B_{_{ω}} (E) & = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} (〈E, ω^{d}〉 + 〈E, Y^{_{d - 1}}〉 + \dots + 〈E, Y^{_{1}}〉 + 〈E, Y^{_{0}}〉) \\ = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} ({| ω |}^{d} + {s (d, d - 1) | ω |}^{d - 1} + \dots + s (d, 1) {| ω |}^{1}) \\ = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} ((r + d) (r + (d - 1)) \dots (r + 2) (r + 1)) \\ = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} \frac{(r + d)!}{r!} = \frac{(r + d)!}{\prod_{k = 1}^{n} (ω_{_{k}})!} = c (ω) \end{matrix}

(52)

The above representation leads to a polyhedral representation of the cone hierarchy

C_{n, d}^{(r)}

and is presented in the following theorem (cf. [19], Theorem 2.4).

Theorem 5.

For all

r \in N_{0}

and

n \in N

the polyhedral representation of cones

C_{n, d}^{(r)}

is given as

C_{n, d}^{(r)} = \{A \in S_{n, d} | 〈A, ω^{d} + \sum_{t = 0}^{(d - 1)} Y^{t}〉 \geq 0 for all ω \in I^{n} (r + d)\}

where

Y^{t} : = {(〈S (t), W (t)〉)}_{θ \in I^{n} (d)}

for all

t \in [0 : d - 1]

.

Proof.

The proof follows immediately from (19) and (51). □

5. Approximating Polynomial Optimization over the Simplex

In this section, we consider the homogeneous polynomial optimization over the simplex

\begin{matrix} min h_{A} (x) s u b j e c t t o : h_{B} (x) = 1 f o r x \in R_{+}^{n} \end{matrix}

(53)

where

A, B \in S_{n, d}

. It is well-known (see e.g., [18,19,25]) that (53) can be equivalently reformulated as a copositive program over the cone of completely positive tensors. The reformulation is as follows,

\begin{matrix} min 〈A, x^{d}〉 s u b j e c t t o : 〈B, x^{d}〉 = 1, x^{d} \in C_{n, d}^{*} . \end{matrix}

(54)

The dual formulation of (54) is also a conic program over the cone of copositive tensors, which is given below.

\begin{matrix} max_{λ \in R} λ s u b j e c t t o : (A - λ B) \in C_{n, d} \end{matrix}

(55)

We consider a special case where

B = E

. Obviously, the feasible set

{E x^{d} = 1, x \in R_{+}^{n}}

is precisely the simplex

Δ_{n}

. Thus, the minimum (maximum) value of (53) in this special case, that is optimization of the homogeneous polynomial over the simplex

Δ_{n}

is

\begin{matrix} h_{A}^{min (Δ_{n})} : = min_{x \in Δ_{n}} h_{A} (x) (h_{A}^{max (Δ_{n})} : = max_{x \in Δ_{n}} h_{A} (x)) \end{matrix}

(56)

As mentioned before testing if a tensor is copositive is co-NP-hard. To find an approximate solution we replace the cone

C_{n, d}^{}

(for the special case

B = E

) in (55) by it’s approximation

C_{n, d}^{(r)}

where

r \in N_{0}

,

\begin{matrix} v_{_{C}}^{(r)} : = max \{λ | A - λ E \in C_{n, d}^{(r)}, λ \in R\} \end{matrix}

(57)

We are interested to compute the bound on the difference of approximate solution (to the dual program)

v_{_{C}}^{(r)}

and the exact solution

h_{A}^{min (Δ_{n})}

. For this we use rational girding of the simplex

Δ_{n}

i.e., for non-negative integer

r \in N_{0}

we have,

Δ_{n} (r) : = {x \in Δ_{n} : (r + d) x \in N_{0}^{n}} = \frac{I^{n} (r + d)}{(r + d)} .

(58)

The rational grid

Δ_{n} (r)

is a discretization of

Δ_{_{n}}

, which leads to a natural approximation of (56), i.e.,

\begin{matrix} h_{A}^{min (Δ_{n} (r))} : = min_{x \in Δ_{n} (r)} {h_{A} (x) : = 〈A, x^{d}〉} \end{matrix}

(59)

Note that

v_{_{C}}^{(r)}

approximates the dual while

h_{A}^{min (Δ_{n} (r))}

approximates the primal as given in (56). It is interesting to investigate the connection between these two approximations. The connection is given below (cf. [19] (Theorem 3.1), for

d = 2

and [25] (Theorem 3.1) for

d = 4

),

Theorem 6.

Let

Δ_{n} (r)

be a rational discretization of simplex

Δ_{n}

as given in (58) for any

r \in N_{0}

, then for

Q \in S_{n, d}

we have

\begin{matrix} v_{_{C}}^{(r)} = \frac{r! {(r + d)}^{d}}{(r + d)!} min_{x \in Δ_{_{n}} (r)} 〈Q, x^{d} + \sum_{t = 1}^{(d - 1)} \frac{X^{t}}{{(r + d)}^{d - t}}〉 \end{matrix}

(60)

where

X^{t} \in S_{n, d}

.

Proof.

Let

A : = Q - λ E

be a feasible point of the program given in (57) then for

ω \in I^{n} (r + d)

it follows, from (51) and the definition of

C_{n, d}^{(r)}

, that,

\begin{matrix} 0 \leq B_{_{ω}} (A) & = B_{_{ω}} (Q) - λ B_{_{ω}} (E) \end{matrix}

(61)

\begin{matrix} = \frac{r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} (〈Q, ω^{d}〉 + \sum_{t = 1}^{(d - 1)} 〈Q, Y^{t}〉) - λ c (ω) \end{matrix}

(62)

\begin{matrix} = \frac{(r + d) (r + d - 1) \dots (r + 1) r!}{\prod_{k = 1}^{n} (ω_{_{k}})!} (\frac{〈Q, ω^{d}〉 + \sum_{t = 1}^{(d - 1)} 〈Q, Y^{t}〉}{(r + d) (r + d - 1) \dots (r + 1)}) - λ c (ω) \end{matrix}

(63)

\begin{matrix} = c (ω) (\frac{〈Q, ω^{d}〉 + \sum_{t = 1}^{(d - 1)} 〈Q, Y^{t}〉}{(r + d) (r + d - 1) \dots (r + 1)} - λ) . \end{matrix}

(64)

Giving,

\frac{〈Q, ω^{d}〉 + \sum_{t = 1}^{(d - 1)} 〈Q, Y^{t}〉}{(r + d) (r + d - 1) \dots (r + 1)} \geq λ .

The above imply that the maximum value of

λ

in (57) is attained at the minimum value of

(\frac{〈Q, ω^{d}〉 + \sum_{t = 1}^{(d - 1)} 〈Q, Y^{t}〉}{(r + d) (r + d - 1) \dots (r + 1)})

. Thus, (57) can be equivalently written as follows,

\begin{matrix} v_{_{C}}^{(r)} & = min_{ω \in I^{n} (d + r)} \{\frac{{(r + d)}^{d - 1}}{(r + d - 1) \dots (r + 1)} (\frac{〈Q, ω^{d}〉 + \sum_{t = 1}^{(d - 1)} 〈Q, Y^{t}〉}{{(r + d)}^{d}})\} \\ = \frac{r! {(r + d)}^{d}}{(r + d)!} min_{ω \in I^{n} (d + r)} 〈Q, \frac{ω^{d}}{{(r + d)}^{d}} + \sum_{t = 1}^{(d - 1)} \frac{Y^{t}}{{(r + d)}^{d}}〉 . \end{matrix}

(65)

A mere change of variable

ω = (r + d) x

where

x \in Δ_{_{n}} (r)

and

Y^{t} = {(r + d)}^{t} X^{t}

in (65) yields the required expression, i.e.,

v_{_{C}}^{(r)} = \frac{r! {(r + d)}^{d}}{(r + d)!} min_{x \in Δ_{n} (r)} 〈Q, x^{d} + \sum_{t = 1}^{(d - 1)} \frac{X^{t}}{{(r + d)}^{d - t}}〉 .

(66)

□

Note that, for any

r \in N_{0}

we have the relation

h_{Q}^{min (Δ_{n} (r))} \geq h_{Q}^{min (Δ_{n})}

. However, in (66) a correction term

\sum_{t = 0}^{d - 1} \frac{〈 Q, X^{t} 〉}{{(r + d)}^{d - t}}

is deducted from the actual objective

〈 Q, x^{d} 〉

for obtaining a closer approximation to

h_{Q}^{min (Δ_{n})}

. Clearly, for increasing

r \in N_{0}

the value

v_{_{C}}^{(r)}

surpass the value

h_{Δ_{n}}^{(r)}

invariably. However, one has to compensate with the factor

\frac{r! {(r + d)}^{d}}{(r + d)!} = 1 + O (\frac{1}{r^{d - 1}}) > 1

. It would be interesting to find bounds on the difference between two approximations namely

v_{_{C}}^{(r)}

and

h_{Q}^{min (Δ_{n} (r))}

. To compute the bound we define some notations. First recall that

[\begin{matrix} θ \\ α \end{matrix}]

denotes Stirling number. In addition, for

x \in Δ_{n}, t \in [0 : d]

we define a function

q_{t} (x)

as follows,

\begin{matrix} q_{t} (x) & : = \frac{1}{{(r + d)}^{d - t}} \sum_{\begin{matrix} α \leq θ \\ θ \in I^{n} (d), α \in I^{n} (t) \end{matrix}} [\begin{matrix} θ \\ α \end{matrix}] c (θ) Q_{θ} x^{α} \end{matrix}

(67)

\begin{matrix} : = \frac{1}{{(r + d)}^{d - t}} \{\sum_{\begin{matrix} α \leq θ, Q_{θ} \geq 0 \\ θ \in I^{n} (d), α \in I^{n} (t) \end{matrix}} [\begin{matrix} θ \\ α \end{matrix}] c (θ) | Q_{θ} | x^{α} - \sum_{\begin{matrix} α \leq θ, Q_{θ} < 0 \\ θ \in I^{n} (d), α \in I^{n} (t) \end{matrix}} [\begin{matrix} θ \\ α \end{matrix}] c (θ) | Q_{θ} | x^{α}\} \end{matrix}

(68)

\begin{matrix} : = q_{t} {(x)}^{+} - q_{t} {(x)}^{-} \end{matrix}

(69)

If there is no dependence on the variable

x

we simply write

q_{t}

that is,

q_{t}^{+} : = \frac{1}{{(r + d)}^{d - t}} \sum_{\begin{matrix} α \leq θ, Q_{θ} \geq 0 \\ θ \in I^{n} (d), α \in I^{n} (t) \end{matrix}} [\begin{matrix} θ \\ α \end{matrix}] c (θ) | Q_{θ} |

One can define

q_{t}^{+}

analogously.

Theorem 7.

Let

Δ_{n} (r)

be a rational discretization of simplex

Δ_{n}

as given in (58) for any

r \in N_{0}

, then for

Q \in S_{n, d}

we have

\begin{matrix} - (\sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t}^{-} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t}^{+}) & \leq \frac{(r + d)!}{r! {(r + d)}^{d}} v_{_{C}}^{(r)} - h_{Q}^{min (Δ_{n} (r))} \end{matrix}

(70)

\begin{matrix} \leq (\sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t}^{+} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t}^{-}) \end{matrix}

(71)

Proof.

From the expression given in Equation (66),

\begin{matrix} \frac{(r + d)!}{r! {(r + d)}^{d}} v_{_{C}}^{(r)} & = min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + 〈Q, \sum_{t = 0}^{(d - 1)} \frac{X^{t}}{{(r + d)}^{d - t}}〉\} \end{matrix}

(72)

\begin{matrix} = min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + \sum_{t = 0}^{(d - 1)} \frac{\{\sum_{θ \in I^{n} (d)}^{} c (θ) Q_{θ} X_{θ}^{t}\}}{{(r + d)}^{d - t}}\} \end{matrix}

(73)

\begin{matrix} = min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + \sum_{t = 0}^{(d - 1)} \frac{\{\underset{θ \in I^{n} (d) & α \in I^{n} (t)}{\sum_{α \leq θ}^{}} [\begin{matrix} θ \\ α \end{matrix}] c (θ) Q_{θ} x^{α}\}}{{(- 1)}^{t} {(r + d)}^{d - t}}\} \end{matrix}

(74)

Notice that

x \in Δ_{n}

then for all

α \in I^{n} (d)

, we have

x^{α} \leq 1

. From this observation, we have that

q_{t} {(x)}^{\pm} \leq q_{t}^{\pm}

. This observation leads to the following,

\begin{matrix} \frac{(r + d)!}{r! {(r + d)}^{d}} v_{_{C}}^{(r)} & = min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + \sum_{t = 0}^{(d - 1)} q_{t} (x)\} \end{matrix}

(75)

\begin{matrix} = min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + (\begin{matrix} \sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t} {(x)}^{+} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t} {(x)}^{-} \\ - (\sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t} {(x)}^{-} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t} {(x)}^{+}) \end{matrix})\} \end{matrix}

(76)

\begin{matrix} \leq min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + (\begin{matrix} \sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t} {(x)}^{+} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t} {(x)}^{-} \end{matrix})\} \end{matrix}

(77)

\begin{matrix} \leq min_{x \in Δ_{_{n}} (r)} \{〈Q, x^{d}〉 + (\sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t}^{+} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t}^{-})\} \end{matrix}

(78)

\begin{matrix} \leq h_{Q}^{min (Δ_{n} (r))} + \sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t}^{+} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t}^{-} \end{matrix}

(79)

Thus, we have

\begin{matrix} \frac{(r + d)!}{r! {(r + d)}^{d}} v_{_{C}}^{(r)} - h_{Q}^{min (Δ_{n} (r))} \leq \sum_{t \in [0 : (d - 1)] \cap N_{E v e n}} q_{t}^{+} + \sum_{t \in [0 : (d - 1)] \cap N_{O d d}} q_{t}^{-} \end{matrix}

(80)

The lower bound could be done similar manner. □

6. Conclusions

The paper was focused on describing the copositive tensor cone and its approximations. We have shown that every Z-tensor is copositive if and only if it is positive definite. The result has appeared already in [32] (Theorem 3.5(e) and Theorem 3.12). However, the proof given by Zhang et al. relies heavily on the notion of M-tensor and convex analysis. The proof we have provided is simpler and self-contained. We had discussed some approximation hierarchies for the copositive cone, focusing on providing a compact representation of these hierarchies. For the Parrilo cone,

K_{n, d}^{r}

the proof techniques are a straightforward generalization to a high dimensional case. For the cone,

C_{n, d}^{r}

a more rigorous approach is used to derive the representation. Most notions used are unique and have not appeared in the literature. We have illustrated this by applying our compact representation to the polynomial optimization over the simplex. We have compared the approximation obtained by our representation

C_{n, d}^{r}

with the approximation based on the rational griding. The bounds are proved between the two approximations. Moreover, the characterization helped to simplify the proofs and results related to approximating polynomial optimization over the simplex. In future it would be interesting to investigate the convergence rate of these approximations.

In the future, we work towards utilizing these hierarchies for providing approximation results related to copositive optimization, especially to recover approximation results for polynomial optimization over the simplex as obtained by De Klerk and co-authors [21,22,23,24]. Furthermore, our aim is to use these approximation hierarchies to develop numerical algorithms for application domains such as approximating clique numbers for uniform hypergraphs (see, e.g., [37,38]).

Author Contributions

Writing—original draft, M.F.I.; Writing—review & editing, F.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Maricic, B.; Luo, Z.; Davidson, T. Blind constant modulus equalization via convex optimization. IEEE Trans. Signal Process. 2003, 51, 805–818. [Google Scholar] [CrossRef]
Sidiropoulos, N.D.; De Lathauwer, L.; Fu, X.; Huang, K.; Papalexakis, E.E.; Faloutsos, C. Tensor decomposition for signal processing and machine learning. IEEE Trans. Signal Process. 2017, 65, 3551–3582. [Google Scholar] [CrossRef]
Weiland, S.; van Belzen, F. Singular Value Decompositions and Low Rank Approximations of Tensors. Signal Process. IEEE Trans. 2010, 58, 1171–1182. [Google Scholar] [CrossRef]
Cohen, N.; Sharir, O.; Shashua, A. On the expressive power of deep learning: A tensor analysis. In Proceedings of the Conference on Learning Theory, New York, NY, USA, 23–26 June 2016; pp. 698–728. [Google Scholar]
Kolda, T.G.; Bader, B.W. Tensor decompositions and applications. SIAM Rev. 2009, 51, 455–500. [Google Scholar] [CrossRef]
Soare, S.; Yoon, J.W.; Cazacu, O. On the use of homogeneous polynomials to develop anisotropic yield functions with applications to sheet forming. Int. J. Plast. 2008, 24, 915–944. [Google Scholar] [CrossRef]
Micchelli, C.A.; Olsen, P. Penalized maximum-likelihood estimation, the Baum-Welch algorithm, diagonal balancing of symmetric matrices and applications to training acoustic data. J. Comput. Appl. Math. 2000, 119, 301–331. [Google Scholar] [CrossRef] [Green Version]
Hamadneh, T.; Ali, M.; AL-Zoubi, H. Linear optimization of polynomial rational functions: Applications for positivity analysis. Mathematics 2020, 8, 283. [Google Scholar] [CrossRef] [Green Version]
Henrion, D.; Garulli, A. Positive Polynomials in Control; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2005; Volume 312. [Google Scholar]
Berman, A.; Shaked-Monderer, N. Completely Positive Matrices; World Scientific Publishing Company Pte Limited: Singapore, 2003. [Google Scholar]
Dickinson, P.J.C. The Copositive Cone, the Completely Positive Cone and Their Generalisations. Ph.D. Thesis, Groningen University, Groningen, The Netherlands, 2013. [Google Scholar]
Kostyukova, O.; Tchemisova, T. Structural Properties of Faces of the Cone of Copositive Matrices. Mathematics 2021, 9, 2698. [Google Scholar] [CrossRef]
Avi Berman, M.D.; Shaked-mondered, N. Open Problems in the Theory of Completely Positive and Copositive Matrices. Electron. J. Linear Algebra 2015, 29, 46–58. [Google Scholar] [CrossRef]
Song, Y.; Qi, L. Necessary and sufficient conditions for copositive tensors. Linear Multilinear Algebra 2015, 63, 120–131. [Google Scholar] [CrossRef] [Green Version]
Qi, L. Symmetric nonnegative tensors and copositive tensors. Linear Algebra Its Appl. 2013, 439, 228–238. [Google Scholar] [CrossRef]
Qi, L.; Luo, Z. Tensor Analysis: Spectral Theory and Special Tensors; SIAM: Philadelphia, PA, USA, 2017. [Google Scholar]
Parrilo, P.A. Structured Semidefinite Programs and Semialgebraic Geometry Methods in Robustness and Optimization. Ph.D. Thesis, California Institute of Information Technology, Pasadena, CA, USA, 2000. [Google Scholar]
de Klerk, E.; Pasechnik, D.V. Approximation of the Stability Number of a Graph via Copositive Programming. SIAM J. Optim. 2002, 12, 875–892. [Google Scholar] [CrossRef]
Bomze, I.M.; De Klerk, E. Solving standard quadratic optimization problems via linear, semidefinite and copositive programming. J. Glob. Optim. 2002, 24, 163–185. [Google Scholar] [CrossRef]
Nesterov, Y. Global quadratic optimization on the sets with simplex structure. In LIDAM Discussion Papers CORE 1999015; Université Catholique de Louvain, Center for Operations Research and Econometrics (CORE): Louvain-la-Neuve, Belgium, 1999. [Google Scholar]
de Klerk, E.; Laurent, M.; Parrilo, P.A. A PTAS for the minimization of polynomials of fixed degree over the simplex. Theor. Comput. Sci. 2006, 361, 210–225. [Google Scholar] [CrossRef] [Green Version]
de Klerk, E.; Laurent, M.; Sun, Z. An alternative proof of a PTAS for fixed-degree polynomial optimization over the simplex. Math. Program. 2015, 151, 433–457. [Google Scholar] [CrossRef] [Green Version]
de Klerk, E.; Laurent, M.; Sun, Z. An Error Analysis for Polynomial Optimization over the Simplex Based on the Multivariate Hypergeometric Distribution. SIAM J. Optim. 2015, 25, 1498–1514. [Google Scholar] [CrossRef] [Green Version]
de Klerk, E.; Laurent, M.; Sun, Z.; Vera, J.C. On the convergence rate of grid search for polynomial optimization over the simplex. Optim. Lett. 2017, 11, 597–608. [Google Scholar] [CrossRef] [Green Version]
Ling, C.; He, H.; Qi, L. Improved approximation results on standard quartic polynomial optimization. Optim. Lett. 2017, 11, 1767–1782. [Google Scholar] [CrossRef]
Hu, S.; Li, G.; Qi, L. A tensor analogy of Yuan’s theorem of the alternative and polynomial optimization with sign structure. J. Optim. Theory Appl. 2016, 168, 446–474. [Google Scholar] [CrossRef] [Green Version]
Luo, Z.; Qi, L.; Ye, Y. Linear operators and positive semidefiniteness of symmetric tensor spaces. Sci. China Math. 2015, 58, 197–212. [Google Scholar] [CrossRef] [Green Version]
Rockafellar, R. Convex Analysis; Princeton Landmarks in Mathematics and Physics; Princeton University Press: Princeton, NJ, USA, 1970. [Google Scholar]
Ahmadi, A.A.; Parrilo, P.A. A convex polynomial that is not SOS-convex. Math. Program. 2012, 135, 275–292. [Google Scholar] [CrossRef]
Peña, J.; Vera, J.C.; Zuluaga, L.F. Completely positive reformulations for polynomial optimization. Math. Program. 2015, 151, 405–431. [Google Scholar] [CrossRef] [Green Version]
Ahmed, F. Copositive Programming and Related Problems. Ph.D. Thesis, University of Twente, Twente, The Netherlands, 2014. [Google Scholar]
Zhang, L.; Qi, L.; Zhou, G. M-tensors and some applications. SIAM J. Matrix Anal. Appl. 2014, 35, 437–452. [Google Scholar] [CrossRef]
Reznick, B. Uniform denominators in Hilbert’s seventeenth problem. Math. Z. 1995, 220, 75–97. [Google Scholar] [CrossRef]
Chen, H.; Wang, Y.; Zhou, G. High-order sum-of-squares structured tensors: Theory and applications. Front. Math. China 2020, 15, 255–284. [Google Scholar] [CrossRef]
Ahmadi, A.A.; Majumdar, A. DSOS and SDSOS optimization: More tractable alternatives to sum of squares and semidefinite optimization. SIAM J. Appl. Algebra Geom. 2019, 3, 193–230. [Google Scholar] [CrossRef]
Graham, R.L.; Knuth, D.E.; Patashnik, O. Concrete Mathematics: A Foundation for Computer Science; Addison-Wesley: Boston, MA, USA, 1994. [Google Scholar]
Ahmed, F.; Still, G. Two methods for the maximization of homogeneous polynomials over the simplex. Comput. Optim. Appl. 2021, 80, 523–548. [Google Scholar] [CrossRef]
Rota Bulò, S.; Pelillo, M. A generalization of the Motzkin–Straus theorem to hypergraphs. Optim. Lett. 2009, 3, 287–295. [Google Scholar] [CrossRef] [Green Version]

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Iqbal, M.F.; Ahmed, F. Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex. Mathematics 2022, 10, 1683. https://doi.org/10.3390/math10101683

AMA Style

Iqbal MF, Ahmed F. Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex. Mathematics. 2022; 10(10):1683. https://doi.org/10.3390/math10101683

Chicago/Turabian Style

Iqbal, Muhammad Faisal, and Faizan Ahmed. 2022. "Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex" Mathematics 10, no. 10: 1683. https://doi.org/10.3390/math10101683

APA Style

Iqbal, M. F., & Ahmed, F. (2022). Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex. Mathematics, 10(10), 1683. https://doi.org/10.3390/math10101683

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex

Abstract

1. Introduction

2. Preliminaries

3. Positive Semidefinite Tensors, Copositive Tensors and Their Duals

4. Approximation Hierarchies for the Copositive Cone

4.1. The Case $R = 0$

4.2. Characterization of $K_{n, d}^{(r)}$

4.3. Polyhedral Characterization of $C_{n, d}^{(r)}$

5. Approximating Polynomial Optimization over the Simplex

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Approximation Hierarchies for the Copositive Tensor Cone and Their Application to the Polynomial Optimization over the Simplex

Abstract

1. Introduction

2. Preliminaries

3. Positive Semidefinite Tensors, Copositive Tensors and Their Duals

4. Approximation Hierarchies for the Copositive Cone

4.1. The Case R = 0

4.2. Characterization of K n , d ( r )

4.3. Polyhedral Characterization of C n , d ( r )

5. Approximating Polynomial Optimization over the Simplex

6. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.1. The Case $R = 0$

4.2. Characterization of $K_{n, d}^{(r)}$

4.3. Polyhedral Characterization of $C_{n, d}^{(r)}$