Cutting Convex Polytopes by Hyperplanes

Takayuki Hibi; Nan Li

doi:10.3390/math7050381

and

¹

Department of Pure and Applied Mathematics, Graduate School of Information Science and Technology, Osaka University, Toyonaka, Osaka 560-0043, Japan

²

Department of Mathematics, Massachusetts Institute of Technology, Cambridge, MA 02139, USA

^*

Author to whom correspondence should be addressed.

Mathematics2019, 7(5), 381;https://doi.org/10.3390/math7050381

Version Notes

Order Reprints

Abstract

Cutting a polytope is a very natural way to produce new classes of interesting polytopes. Moreover, it has been very enlightening to explore which algebraic and combinatorial properties of the original polytope are hereditary to its subpolytopes obtained by a cut. In this work, we devote our attention to all the separating hyperplanes for some given polytope (integral and convex) and study the existence and classification of such hyperplanes. We prove the existence of separating hyperplanes for the order and chain polytopes for any finite posets that are not a single chain, and prove there are no such hyperplanes for any Birkhoff polytopes. Moreover, we give a complete separating hyperplane classification for the unit cube and its subpolytopes obtained by one cut, together with some partial classification results for order and chain polytopes.

Keywords:

separating hyperplane; order polytopes; chain polytopes; Birkhoff polytopes

MSC:

Primary 52B05; Secondary 06A07

1. Introduction

Let

P \subset R^{n}

be a convex polytope of dimension d and

\partial P

its boundary. If

H \subset R^{n}

is a hyperplane, then we write

H^{(+)}

and

H^{(-)}

for the closed half-spaces of

R^{n}

with

H^{(+)} \cap H^{(-)} = H

. We say that

H

cuts

P

if

H \cap (P \ \partial P) \neq ⌀

and if each vertex of the convex polytopes

P \cap H^{(+)}

and

P \cap H^{(-)}

is a vertex of

P

. When

H \cap (P \ \partial P) \neq ⌀

, it follows that

H

cuts

P

if and only if, for each edge

e = conv ({v, v^{'}})

of

P

, where v and

v^{'}

are vertices of

P

, and not

e \subset H

, one has

H \cap e \subset {v, v^{'}}

. Cutting a polytope is a very natural way to produce new classes of interesting polytopes. For example, the hypersimplices are obtained from cutting the unit n cube by hyperplanes of the form

x_{1} + \dots + x_{n} = k, k + 1

, for some integer

0 \leq k < n

, which is a class of very interesting and well-studied polytopes (see for example [1,2,3]). A similar class of interesting polytopes obtained from cutting permutahedrons and in general any graphical zonotopes are studied in [4]. In general, it is a very interesting problem to explore which algebraic and combinatorial properties of

P

are hereditary to

P \cap H^{(+)}

and

P \cap H^{(-)}

. For example, in [5] the study on separating hyperplanes of the edge polytope

P_{G}

of a finite connected simple graph G is achieved and it is shown that

P_{G}

is normal if and only if each of

P_{G} \cap H^{(+)}

and

P_{G} \cap H^{(-)}

is normal.

In this paper, we look at the problem from another perspective, focusing more on the hyperplane that cuts the polytope. We are interested in the existence and classification of such hyperplanes.

If

H

cuts

P

, then we call

H

a separating hyperplane of

P

. If

H

is a separating hyperplane of

P

, then the decomposition of

P

via

H

is

P = (P \cap H^{(+)}) \cup (P \cap H^{(-)}) .

For example, if

{[0, 1]}^{3} \subset R^{3}

is the unit cube, then the hyperplane

H \subset R^{3}

defined by the equation

x_{i} + x_{j} = 1

with

1 \leq i < j \leq 3

is a separating hyperplane of

{[0, 1]}^{3}

.

Unless

n = d

, where

d = dim P

, two different separating hyperplanes

H

and

H^{'}

of

P

might yield the same decomposition of

P

. For example, if

{[0, 1]}^{2} \subset R^{3}

is the square, then its separating hyperplane defined by

x_{1} + x_{2} + x_{3} = 1

and that defined by

x_{1} + x_{2} - x_{3} = 1

clearly yield the same decomposition of

{[0, 1]}^{2}

.

An integral convex polytope is a convex polytope whose vertices have integer coordinates. Let

P \subset R^{n}

be an integral convex polytope of dimension d and suppose that

\partial P \cap Z^{n}

is the set of vertices of

P

. It then follows that a hyperplane

H \subset R^{n}

is a separating hyperplane of

P

if and only if each of the subpolytopes

P \cap H^{(+)}

and

P \cap H^{(-)}

is integral of dimension d.

The study of existence and classification for any general internal convex polytopes can be very hard. In the present paper, we focus our study on the following classes of polytopes: The unit cube and its subpolytopes cut by one hyperplane, order and chain polytopes, and Birkhoff polytopes. We prove the existence of separating hyperplanes for the order and chain polytopes for any finite posets that are not a single chain (Theorem 2), and prove there are no separating hyperplanes for any Birkhoff polytopes (Theorem 3). Moreover, we give a complete separating hyperplane classification for the unit cube and its subpolytopes cut by one hyperplane (Section 1), together with partial classification results for order and chain polytopes (Section 2).

2. The Unit Cube

Let

{[0, 1]}^{d} \subset R^{d}

be the unit cube with

d \geq 2

. In the study of its separating hyperplane

H

it is assumed that

H

passes through the origin of

R^{d}

. First, we discuss the question when a hyperplane

H

of

R^{d}

passing through the origin

\begin{matrix} H : a_{1} x_{1} + \dots + a_{d} x_{d} = 0, \end{matrix}

(1)

where each

a_{i} \in Q

, is a separating hyperplane of

{[0, 1]}^{d}

.

Lemma 1.

A hyperplane (1) is a separating hyperplane of

{[0, 1]}^{d}

if and only if there exist p and q with

a_{p} > 0

and

a_{q} < 0

and all nonzero coefficients of

H

have the same absolute value.

Proof.

(“If”) Let e be an edge of

{[0, 1]}^{d}

. Then there is

1 \leq i \leq d

with

e = {(x_{1}, \dots, x_{d}) : 0 \leq x_{i} \leq 1 and x_{j} = ε_{j} for all j \neq i},

where each

ε_{j}

belongs to

{0, 1}

. Suppose that there exist p and q with

a_{p} > 0

and

a_{q} < 0

and that all nonzero coefficients of

H

have the same absolute value. By relabeling the subscripts (if necessary), we may assume that

H : x_{1} + \dots + x_{s} - x_{s + 1} - \dots - x_{s + t} = 0,

where

s > 0

,

t > 0

and

s + t \leq d

. If

i > s + t

, then either

H \cap e = e

or

H \cap e = ⌀

. If

i \leq s + t

, then either

H \cap e = ⌀

or

H \cap e

is a vertex of

{[0, 1]}^{d}

. Thus, each of

{[0, 1]}^{d} \cap H^{(+)}

and

{[0, 1]}^{d} \cap H^{(-)}

is integral. Moreover, since

s > 0

and

t > 0

, it follows that

H \cap {(0, 1)}^{d} \neq ⌀

. Hence

H

is a separating hyperplane of

{[0, 1]}^{d}

.

(“Only if”) If every coefficient

a_{i}

of (1) is nonnegative, then

H \cap {[0, 1]}^{d}

consists only of the origin. Hence

H

cannot be a separating hyperplane of

{[0, 1]}^{d}

. Thus, there exists p and q with

a_{p} > 0

and

a_{q} < 0

.

Now, suppose that there exist

i \neq j

with

a_{i} \neq 0

,

a_{j} \neq 0

and

| a_{i} | \neq | a_{j} |

. Let, say,

| a_{i} | < | a_{j} |

. Let e be the edge defined by

x_{i} = 1

and

x_{k} = 0

for all k with

k \notin {i, j}

. If

a_{i} a_{j} < 0

, then

0 < - a_{i} / a_{j} < 1

and

v = (v_{1}, \dots, v_{n}) \in e

with

v_{j} = - a_{i} / a_{j}

belongs to

H

. Thus,

H

cannot be a separating hyperplane of

{[0, 1]}^{d}

. Hence

a_{i} a_{j} > 0

. In particular

| a_{p} | = | a_{q} |

. Let

1 \leq k \leq d

with

a_{k} > 0

. Then, since

a_{k} a_{q} < 0

, it follows that

| a_{k} | = | a_{q} |

. Similarly, if

a_{k} < 0

, then

| a_{k} | = | a_{p} |

. Consequently,

| a_{k} | = | a_{p} | (= | a_{p} |)

for all k with

a_{k} \neq 0

, as desired. □

By virtue of Lemma 1, it follows that up to permuting the coordinates, a separating hyperplane of

{[0, 1]}^{d}

passing through the origin is of the form

\begin{matrix} x_{1} + \dots + x_{s} - x_{s + 1} - \dots - x_{s + t} = 0 \end{matrix}

(2)

with

s > 0

,

t > 0

and

s + t \leq d

. Moreover, in (2), by replacing

x_{s + i}

with

1 - x_{s + i}

for

1 \leq i \leq t

, we can work with a separating hyperplane of

{[0, 1]}^{d}

of the form

\begin{matrix} x_{1} + \dots + x_{s} + x_{s + 1} + \dots + x_{s + t} = t . \end{matrix}

(3)

Finally, Equation (3) can be rewritten as

\begin{matrix} x_{1} + \dots + x_{k} = ℓ, 2 \leq k \leq d, 1 \leq ℓ < k . \end{matrix}

(4)

If

H_{k, ℓ}

is a separating hyperplane defined by (4) of

{[0, 1]}^{d}

, then

{[0, 1]}^{d} \cap H_{k, ℓ}^{(+)} = {(x_{1}, \dots, x_{d}) \in {[0, 1]}^{d} : x_{1} + \dots + x_{k} \leq ℓ},

{[0, 1]}^{d} \cap H_{k, ℓ}^{(-)} = {(x_{1}, \dots, x_{d}) \in {[0, 1]}^{d} : x_{1} + \dots + x_{k} \geq ℓ} .

In

{[0, 1]}^{d} \cap H_{k, ℓ}^{(-)}

, again, by replacing

x_{i}

with

1 - x_{i}

for

1 \leq i \leq k

, it follows that since

0 < k - ℓ < k

, each of the subpolytopes

{[0, 1]}^{d} \cap H_{k, ℓ}^{(\pm)}

is, up to unimodular equivalence, of the form

Q_{k, ℓ} = {(x_{1}, \dots, x_{d}) \in {[0, 1]}^{d} : x_{1} + \dots + x_{k} \leq ℓ}, 2 \leq k \leq d, 1 \leq ℓ < k .

(5)

It follows easily that when

(k, ℓ) \neq (k^{'}, ℓ^{'})

, two subpolytopes

Q_{k, ℓ}

and

Q_{k^{'}, ℓ^{'}}

cannot be unimodularly equivalent. Hence

Corollary 1.

The number of convex polytopes of the form

{[0, 1]}^{d} \cap H^{(\pm)}

, where

H

is a separating hyperplane of

{[0, 1]}^{d}

is, up to unimodular equivalence,

d (d - 1) / 2

.

We now turn to the problem of finding separating hyperplanes of the subpolytope

Q_{k, ℓ}

of

{[0, 1]}^{d}

. We say that a separating hyperplane of

Q_{k, ℓ}

(of the form (5)) is a second separating hyperplane of

{[0, 1]}^{d}

following the hyperplane

H_{k, ℓ}

.

Lemma 2.

Each of the separating hyperplanes of

{[0, 1]}^{d}

is of the form

\sum_{i \in I} x_{i} - \sum_{j \in J} x_{j} = h,

where

⌀ \neq I \subset [d]

,

⌀ \neq J \subset [d]

,

I \cap J = ⌀

and where

h \geq 0

is an integer with

0 \leq h < ♯ (I)

.

Proof.

Let

v = (v_{1}, \dots, v_{d})

be a vertex of

{[0, 1]}^{d}

. Let

I \subset [d]

and

J \subset [d]

with

I \cup J = [d]

and

I \cap J = ⌀

such that

v_{i} = 0

if

i \in I

and

v_{j} = 1

if

j \in J

. Let

\begin{matrix} H : a_{1} x_{1} + \dots + a_{d} x_{d} = \sum_{j \in J} a_{j}, \end{matrix}

(6)

with each

a_{i} \in Q

, be a separating hyperplane of

{[0, 1]}^{d}

passing through v. In (6) replace

x_{j}

with

1 - x_{j}

for

j \in J

, and the hyperplane

\begin{matrix} H^{'} : \sum_{i \in I} a_{i} x_{i} + \sum_{j \in J} (- a_{j}) x_{j} = 0 \end{matrix}

(7)

is a separating hyperplane of

{[0, 1]}^{d}

passing through the origin. It then follows from Lemma 1 that all nonzero coefficients of (7) have the same absolute value. Thus, each of

a_{i}

’s and

a_{j}

’ belongs to

{0, \pm 1}

.

It turns out that Equation (6) is

H : \sum_{p \in I^{'}} x_{p} - \sum_{q \in J^{'}} x_{q} = h,

where

⌀ \neq I^{'} \subset [d]

,

⌀ \neq J^{'} \subset [d]

,

I \cap J = ⌀

and where

h \geq 0

is an integer. If

h \geq ♯ (I^{'})

, then

{[0, 1]}^{d} \subset H^{(+)}

or

{[0, 1]}^{d} \subset H^{(-)}

. Hence

0 \leq h < ♯ (I^{'})

. If

0 \leq h < ♯ (I^{'})

, then

(H^{(+)} \ H) \cap {[0, 1]}^{d} \neq ⌀, (H^{(-)} \ H) \cap {[0, 1]}^{d} \neq ⌀ .

Thus,

H

is, in fact, a separating hyperplane of

{[0, 1]}^{d}

. □

Let

H^{'} \subset R^{d}

be a second separating hyperplane of

{[0, 1]}^{d}

following (4). Clearly

H^{'}

is a separating hyperplane of

{[0, 1]}^{d}

. It then follows from Theorem 2 that

\begin{matrix} H^{'} : \sum_{i \in I} x_{i} - \sum_{j \in J} x_{j} = h, \end{matrix}

(8)

where

⌀ \neq I \subset [d]

,

⌀ \neq J \subset [d]

,

I \cap J = ⌀

and where

h \geq 0

is an integer with

0 \leq h < ♯ (I)

.

Theorem 1.

A hyperplane

H^{'}

of (8) is a second separating hyperplane of

{[0, 1]}^{d}

following (4) if and only if one of the following conditions is satisfied:

$♯ (J) + h + k - ♯ (X) \leq ℓ$ ;
$♯ (I) - h + k - ♯ (Y) \leq ℓ$ ,

where

X = I \cap [k]

and

Y = J \cap [k]

.

Proof.

Let

P \subset R^{d}

denote the subpolytope (5) of

{[0, 1]}^{d}

. Then a hyperplane

H^{'}

of (8) is a second separating hyperplane of

{[0, 1]}^{d}

following (4) if and only if one has

H^{'} \cap {[0, 1]}^{d} \subset P

.

(“If”) Let

v = (v_{1}, \dots, v_{d}) \in {[0, 1]}^{d}

belong to

H^{'}

, i.e.,

\sum_{i \in I} v_{i} - \sum_{j \in J} v_{j} = h

. If

♯ (J) + h + k - ♯ (X) \leq ℓ

, then

\begin{matrix} v_{1} + \dots + v_{k} & = & \sum_{i \in X} v_{i} + \sum_{i \in [k] \ X} v_{i} \\ \leq & \sum_{i \in I} v_{i} + \sum_{i \in [k] \ X} v_{i} \\ = & h + \sum_{j \in J} v_{j} + \sum_{i \in [k] \ X} v_{i} \\ \leq & h + ♯ (J) + k - ♯ (X) \leq ℓ . \end{matrix}

Hence

v \in P

. If

♯ (I) - h + k - ♯ (Y) \leq ℓ

, then

\begin{matrix} v_{1} + \dots + v_{k} & = & \sum_{i \in Y} v_{i} + \sum_{i \in [k] \ Y} v_{i} \\ \leq & \sum_{i \in J} v_{i} + \sum_{i \in [k] \ Y} v_{i} \\ = & - h + \sum_{j \in I} v_{j} + \sum_{i \in [k] \ Y} v_{i} \\ \leq & - h + ♯ (I) + k - ♯ (Y) \leq ℓ . \end{matrix}

Hence

v \in P

.

(“Only if”) Let

♯ (J) + h + k - ♯ (X) > ℓ

and

♯ (I) - h + k - ♯ (Y) > ℓ

. We claim the existence of

v = (v_{1}, \dots, v_{d}) \in {[0, 1]}^{d}

with

v \in H^{'}

such that

v_{1} + \dots + v_{k} > ℓ

.

Let

♯ (I) \leq ♯ (J) + h

. Then

0 \leq h < ♯ (I) \leq ♯ (J) + h

. Thus, there is

v \in {[0, 1]}^{d}

belonging to

H^{'}

with

v_{i} = 1

for all

i \in I

such that if

j \in Y

and

j^{'} \in J \ Y

then

v_{j} \geq v_{j^{'}}

. Such

v \in H^{'} \cap {[0, 1]}^{d}

can be chosen with

v_{i} = 1

for all

[k] \ (X \cup Y)

. Then

\begin{matrix} v_{1} + \dots + v_{k} & \geq & ♯ (X) + min {♯ (I) - h, ♯ (Y)} + ♯ ([k] \ (X \cup Y)) \\ = & ♯ (X) + min {♯ (I) - h, ♯ (Y)} + k - ♯ (X) - ♯ (Y) \\ = & min {♯ (I) - h, ♯ (Y)} + k - ♯ (Y) \\ = & min {♯ (I) - h + k - ♯ (Y), k} . \end{matrix}

Since

♯ (I) - h + k - ♯ (Y) > ℓ

and

k > ℓ

, it follows that

v_{1} + \dots + v_{k} > ℓ

.

Let

♯ (I) > ♯ (J) + h

. Then there is

v \in {[0, 1]}^{d}

belonging to

H^{'}

with

v_{j} = 1

for all

j \in J

such that if

i \in X

and

i^{'} \in I \ X

then

v_{i} \geq v_{i^{'}}

. Such

v \in H^{'} \cap {[0, 1]}^{d}

can be chosen with

v_{i} = 1

for all

[k] \ (X \cup Y)

. Then

\begin{matrix} v_{1} + \dots + v_{k} & \geq & ♯ (Y) + min {♯ (J) + h, ♯ (X)} + ♯ ([k] \ (X \cup Y)) \\ = & ♯ (Y) + min {♯ (J) + h, ♯ (X)} + k - ♯ (X) - ♯ (Y) \\ = & min {♯ (J) + h, ♯ (X)} + k - ♯ (X) \\ = & min {♯ (J) + h + k - ♯ (X), k} . \end{matrix}

Since

♯ (J) + h + k - ♯ (X) > ℓ

and

k > ℓ

, it follows that

v_{1} + \dots + v_{k} > ℓ

. □

Corollary 2.

Let

\begin{matrix} x_{1} + \dots + x_{d} = ℓ, 1 \leq ℓ < d \end{matrix}

(9)

be a separating hyperplane of

{[0, 1]}^{d}

. Then a hyperplane

x_{1} + \dots + x_{s} = x_{s + 1} + \dots + x_{s + t} + h, 0 \leq h < s

is a second separating hyperplane of

{[0, 1]}^{d}

following (9) if and only if one of the following conditions is satisfied:

$d - ℓ \leq s - (t + h)$ ;
$d - ℓ \leq (t + h) - s$ .

3. Order and Chain Polytopes

Let

P = {ξ_{1}, \dots, ξ_{d}}

be a finite partially ordered set ( poset for short). To each subset

W \subset P

, we associate

ρ (W) = \sum_{ξ_{i} \in W} e_{i} \in R^{d}

, where

e_{1}, \dots, e_{d}

are the unit coordinate vectors of

R^{d}

. In particular

ρ (⌀)

is the origin of

R^{d}

. A poset ideal of P is a subset I of P such that for all

ξ_{i}

and

ξ_{j}

with

ξ_{i} \in I

and

ξ_{j} \leq ξ_{i}

, one has

ξ_{j} \in I

. An antichain of P is a subset A of P such that

ξ_{i}

and

ξ_{j}

belonging to A with

i \neq j

are incomparable. We say that

ξ_{j}

covers

ξ_{i}

if

ξ_{i} < ξ_{j}

and

ξ_{i} < ξ_{k} < ξ_{j}

for no

ξ_{k} \in P

. A chain

ξ_{j_{1}} < ξ_{j_{2}} < \dots < ξ_{j_{ℓ}}

of P is called saturated if

ξ_{j_{q}}

covers

ξ_{j_{q - 1}}

for

1 < q \leq ℓ

. A maximal chain is a saturated chain such that

ξ_{j_{1}}

is a minimal element and

ξ_{j_{ℓ}}

is a maximal element of the poset.

The order polytope of P is the convex polytope

O (P) \subset R^{d}

which consists of those

(x_{1}, \dots, x_{d}) \in R^{d}

such that

0 \leq x_{i} \leq 1

for every

1 \leq i \leq d

together with

x_{i} \geq x_{j}

if

ξ_{i} \leq ξ_{j}

in P.

The chain polytope of P is the convex polytope

C (P) \subset R^{d}

which consists of those

(x_{1}, \dots, x_{d}) \in R^{d}

such that

x_{i} \geq 0

for every

1 \leq i \leq d

together with

x_{i_{1}} + x_{i_{2}} + \dots + x_{i_{k}} \leq 1

for every maximal chain

ξ_{i_{1}} < ξ_{i_{2}} < \dots < ξ_{i_{k}}

of P.

One has

dim O (P) = dim C (P) = d

. The number of vertices of

O (P)

is equal to that of

C (P)

. Moreover, the volume of

O (P)

and that of

C (P)

are equal to

e (P) / d!

, where

e (P)

is the number of linear extensions of P ([6] (Corollary 4.2)). It also follows from [6] that the facets of

O (P)

are the following:

$x_{i} = 0$ , where $ξ_{i} \in P$ is maximal;
$x_{j} = 1$ , where $ξ_{j} \in P$ is minimal;
$x_{i} = x_{j}$ , where $ξ_{j}$ covers $ξ_{i}$ .

And that the facets of

C (P)

are the following:

$x_{i} = 0$ for all $ξ_{i} \in P$ ;
$x_{i_{1}} + \dots + x_{i_{k}} = 1$ , where $ξ_{i_{1}} < \dots < ξ_{i_{k}}$ is a maximal chain of P.

Moreover, we have the following descriptions for vertices, which will be used frequently in this section.

Lemma 3 ([6]).

(1): Each vertex of the order polytope $O (P)$ is of the form $ρ (I)$ , where I is a poset ideal of P.
(2): Each vertex of the chain polytope $C (P)$ is of the form $ρ (A)$ , where A is an antichain of P.

Please note that in [6], “dual poset ideals” are employed instead of poset ideals. However, no essential difference arises.

3.1. Existence of Separating Hyperplanes for Order and Chain Polytopes

In this subsection, we study the existence of separating hyperplanes of order polytopes and chain polytopes (Theorem 2). First we need an explicit description of edges in terms of vertices.

Recall that the comparability graph

Com (P)

of P is the finite simple graph on the vertex set

{ξ_{1}, \dots, ξ_{d}}

whose edges are those

{ξ_{i}, ξ_{j}}

with

i \neq j

for which

ξ_{i}

and

ξ_{j}

are comparable in P. In general, we say that a nonempty subset

Q = {ξ_{k_{1}}, \dots, ξ_{k_{q}}}

of P is connected inP if the induced subgraph of

Com (P)

on

{ξ_{k_{1}}, \dots, ξ_{k_{q}}}

is connected.

Lemma 4.

Let I and J be poset ideals of P with

I \neq J

. Then

conv ({ρ (I), ρ (J)})

forms an edge of

O (P)

if and only if

I \subset J

and

J \ I

is connected in P.

Proof.

If there exists a maximal element

ξ_{i}

of P not belonging to

I \cup J

, then

conv ({ρ (I), ρ (J)})

lies in the facet

x_{i} = 0

. If there exists a minimal element

ξ_{j}

of P belonging to

I \cap J

, then

conv ({ρ (I), ρ (J)})

lies in the facet

x_{j} = 1

. Hence, working with induction on d, we may assume that

I \cup J = P

and

I \cap J = ⌀

.

Let neither

I = ⌀

nor

J = ⌀

. Then P is the disjoint union of I and J. Now, suppose that

conv ({ρ (I), ρ (J)})

is an edge of

O (P)

. Then there exists a supporting hyperplane

H

of

O (P)

defined by the equation

h (x) = \sum_{i = 1}^{d} a_{i} x_{i} = 1

with each

a_{i} \in Q

such that

H \cap O (P) = conv ({ρ (I), ρ (J)})

. Since

\sum_{x_{i} \in I} a_{i} = \sum_{x_{j} \in J} a_{j} = 1

, one has

\sum_{i = 1}^{d} a_{i} = 2

. In particular

h (ρ (P)) > 1

and

h (⌀) < 1

. Thus,

H

cannot be a supporting hyperplane of P. In other words,

conv ({ρ (I), ρ (J)})

cannot be an edge of P.Hence, if

conv ({ρ (I), ρ (J)})

is an edge of P, then either

I = ⌀

or

J = ⌀

. Let

I = ⌀

and

J = P

. Suppose that P is disconnected and that

conv ({ρ (⌀), ρ (P)})

is an edge of P. Again, there exists a supporting hyperplane

H

of

O (P)

defined by the equation

h (x) = \sum_{i = 1}^{d} a_{i} x_{i} = 0

with each

a_{i} \in Q

such that

H \cap O (P) = conv ({ρ (⌀), ρ (P)})

. Let, say,

h (ρ (I)) > 0

for those poset ideals I with

I \neq ⌀

and

I \neq P

. Since P is disconnected, there exist poset ideals

I^{'}

and

J^{'}

with

I^{'} \cap J^{'} = ⌀

and

I^{'} \cup J^{'} = P

. Since

h (ρ (I^{'})) > 0

and

h (ρ (J^{'})) > 0

, it follows that

h (ρ (P)) = h (ρ (I^{'})) + h (ρ (J^{'})) > 0

, a contradiction. Thus, P must be connected.

Conversely, suppose that

I = ⌀

and

J = P

and that P is connected. Let

ξ_{i_{1}}, \dots, ξ_{i_{q}}

be the maximal elements of P and

A_{i_{j}}

the set of those elements

y \in P

with

y < ξ_{i_{j}}

. Let

k \notin {i_{1}, \dots, i_{q}}

. Then we write

b_{k}

for the number of

i_{j}

’s with

ξ_{k} \in A_{i_{j}}

. Let

b_{i_{j}} = - ♯ (A_{i_{j}})

. We then claim that the hyperplane

H

of

R^{d}

defined by the equation

h (x) = \sum_{i = 1}^{d} b_{i} x_{i} = 0

is a supporting hyperplane of

O (P)

with

H \cap O (P) = conv ({ρ (⌀), ρ (P)})

. Clearly

h (ρ (P)) = h (ρ (⌀)) = 0

. Let I be a poset ideal of P with

I \neq ⌀

and

I \neq P

. What we must prove is

h (ρ (I)) > 0

. To simplify the notation, suppose that

I \cap {ξ_{i_{1}}, \dots, ξ_{i_{q}}} = {ξ_{i_{1}}, \dots, ξ_{i_{r}}}

, where

0 \leq r < q

. If

r = 0

, then

h (ρ (I)) > 0

. Let

1 \leq r < q

and

J = \cup_{j = 1}^{r} (A_{i_{j}} \cup ξ_{i_{j}})

. Then J is a poset ideal of P and

h (ρ (J)) \leq h (ρ (I))

. We claim

h (ρ (J)) > 0

. One has

h (ρ (J)) \geq 0

. Moreover,

h (ρ (J)) = 0

if and only if no

z \in J

belongs to

A_{i_{r + 1}} \cup \dots \cup A_{i_{q}}

. Now, since P is connected, if follows that there exists

z \in J

with

z \in A_{i_{r + 1}} \cup \dots \cup A_{i_{q}}

. Hence

h (ρ (J)) > 0

. Thus,

h (ρ (I)) > 0

, as desired. □

Lemma 5.

Let A and B be antichains of Pwith

A \neq B

. Then

conv ({ρ (A), ρ (B)})

forms an edge of

C (P)

if and only if

(A \ B) \cup (B \ A)

is connected in P.

Proof.

If

A \cup B \neq P

and if

ξ_{i} \notin A \cup B

, then

conv ({ρ (A), ρ (B)})

lies in the facet

x_{i} = 0

. Furthermore, if

A \cup B = P

and

A \cap B \neq ⌀

, then

ξ_{j} \in A \cap B

is isolated in P and

ξ_{j}

itself is a maximal chain of P. Thus,

conv ({ρ (A), ρ (B)})

lies in the facet

x_{j} = 1

. Now, suppose that

A \cup B = P

and

A \cap B = ⌀

. Then

(A \ B) \cup (B \ A) = A \cup B = P

.

Let

conv ({ρ (A), ρ (B)})

be an edge of

C (P)

and

H

a supporting hyperplane of

C (P)

defined by

h (x) = \sum_{i = 1}^{d} a_{i} x_{i} = 1

, where each

a_{i} \in Q

, with

H \cap C (P) = conv ({ρ (A), ρ (B)})

and

C (P) \subset H^{(+)}

. If P is disconnected and if

A_{1} \cup B_{1}

and

A_{2} \cup B_{2}

are antichains of P, where A is the disjoint union of

A_{1} \cup A_{2}

and B is the disjoint union of

B_{1} \cup B_{2}

, then

h (ρ (A_{1} \cup B_{1})) < 1

and

h (ρ (A_{2} \cup B_{2})) < 1

. Hence

h (ρ (A \cup B) < 2

. However, since

h (ρ (A)) = 1

and

h (ρ (B)) = 1

, one has

h (ρ (A \cup B)) = 2

, a contradiction. Thus,

conv ({ρ (A), ρ (B)})

cannot be an edge of

C (P)

. Hence P must be connected if

conv ({ρ (A), ρ (B)})

is an edge of

C (P)

.

Now, suppose that P is connected. If there exist

x, x^{'} \in A

and

y, y^{'} \in B

with

x < y

and

y^{'} < x^{'}

, then P cannot be connected. We assume

y < x

if

x \in A

and

y \in B

are comparable. For each

ξ_{i} \in A

we write

a_{i}

for the number of elements

y \in B

with

y < ξ_{i}

. For each

ξ_{j} \in B

we write

b_{j}

for the number of elements

z \in A

with

ξ_{j} < z

. Clearly

\sum_{ξ_{i} \in A} a_{i} = \sum_{ξ_{j} \in B} b_{j} = q

, where q is the number of pairs

(x, y)

with

x \in A

,

y \in B

and

x < y

. Let

h (x) = \sum_{ξ_{i} \in A} a_{i} x_{i} + \sum_{ξ_{j} \in B} b_{j} x_{j}

and

H

the hyperplane of

R^{d}

defined by

h (x) = d

. Then

h (ρ (A)) = h (ρ (B)) = q

. We claim that for any antichain C of P with

C \neq A

and

C \neq B

, one has

h (ρ (C)) < q

. Let

C = A^{'} \cup B^{'}

with

A^{'} \subset A

and

B^{'} \subset B

. Since

P = A \cup B

is connected and since C is an antichain of P, it follows that

\sum_{ξ_{i} \in A^{'}} a_{i} + \sum_{ξ_{j} \in B^{'}} b_{j} < q

. Thus,

h (ρ (C)) < q

, as desired. □

Now we ask the question whether there exists a separating hyperplane of an order polytope as well as that of a chain polytope.

Lemma 6.

Let

ξ_{i}, ξ_{j} \in P

with

ξ_{i} \neq ξ_{j}

and

H_{i, j}

the hyperplane of

R^{d}

defined by the equation

x_{i} = x_{j}

. Then the following conditions are equivalent:

(i): $H_{i, j}$ is a separating hyperplane of $O (P)$ ;
(ii): $H_{i, j}$ intersects the interior of $O (P)$ ;
(iii): $ξ_{i}$ and $ξ_{j}$ are incomparable in P.

Proof.

The implication (i) ⇒ (ii) is obvious. Suppose (ii). Then there exist poset ideals I and J of P with

ρ (I) \in H_{i, j}^{(+)} \ H_{i, j}

and

ρ (J) \in H_{i, j}^{(-)} \ H_{i, j}

. In other words, there exist poset ideals I and J of P with

ξ_{i} \in I \ J

and

ξ_{j} \in J \ I

. Thus, in particular

ξ_{i}

and

ξ_{j}

are incomparable in P. Hence (ii) ⇒ (iii) follows.

Suppose (iii). Let I be the poset ideal of P consisting of those

y \in P

with

y \leq ξ_{i}

and J the poset ideal of P consisting of those

y \in P

with

y \leq ξ_{j}

. Since

ξ_{i}

and

ξ_{j}

are incomparable in P, it follows that

ξ_{i} \notin J

and

ξ_{j} \notin I

. Thus

ρ (I) \in H_{i, j}^{(+)} \ H_{i, j}

and

ρ (J) \in H_{i, j}^{(-)} \ H_{i, j}

. Hence

H_{i, j}

intersects the interior of

O (P)

. Let, in general,

I^{'}

and

J^{'}

be poset ideals of P with

ρ (I^{'}) \in H_{i, j}^{(+)} \ H

and

ρ (J^{'}) \in H_{i, j}^{(-)} \ H

. In other words,

ξ_{i} \in I \ J

and

ξ_{j} \in J \ I

. Hence

I \neg \subset J

and

J \neg \subset I

. Lemma 4 then guarantees that

conv ({ρ (I), ρ (J)})

cannot be an edge of

O (P)

. Hence

H_{i, j}

is a separating hyperplane of

O (P)

, as desired. □

Lemma 7.

Let

H

be the hyperplane of

R^{d}

defined by the equation

\sum_{i = 1}^{d} x_{i} - 1 = 0

. Then the following conditions are equivalent:

(i): $H$ is a separating hyperplane of $C (P)$ ;
(ii): $H$ intersects the interior of $C (P)$ ;
(iii): P is not a chain.

Proof.

The implication (i) ⇒ (ii) is obvious. Suppose (ii). Since the origin

ρ (⌀)

of

R^{d}

belongs to

H^{(-)} \ H

, there is an antichain A of P with

ρ (A) \in H^{(+)} \ H

. Then

♯ (A) \geq 2

. Thus, P cannot be a chain. Hence (ii) ⇒ (iii) follows.

Suppose (iii). One has an antichain A of P with

♯ (A) \geq 2

. Then

ρ (A) \in H^{(+)} \ H

and

ρ (⌀) \in H^{(-)} \ H

. Hence

H

intersects the interior of

C (P)

. Clearly

ρ (⌀)

is a unique vertex of

C (P)

belonging to

H^{(-)} \ H

. Let B be an antichain of P with

ρ (B) \in H^{(+)} \ H

. Thus,

♯ (B) \geq 2

. Since

B = (⌀ \ B) \cup (B \ ⌀)

is disconnected in P, Lemma 5 says that

conv ({ρ (⌀), ρ (B)})

cannot be an edge. Hence

H

is a separating hyperplane of

C (P)

, as desired. □

By virtue of Lemmas 6 and 7, it follows immediately that

Theorem 2.

Let P be a finite poset, but not a chain. Then each of the order polytope

O (P)

and the chain polytope

C (P)

possesses a separating hyperplane.

3.2. Description of Separating Hyperplanes for Order and Chain Polytopes

In this subsection, we study the necessary and sufficient conditions for the hyperplane

H

defined by

h (x) = c_{1} x_{1} + c_{2} x_{2} + \dots + c_{d} x_{d} = 0

to become a separating hyperplane for certain order polytopes and for certain chain polytopes. This study can be very difficult for general posets. Therefore, we focus on the following three basic posets: Disjoint chains; binary trees (assume connected); and zigzag posets (assume connected, a zigzag poset is a poset where its graph looks like a zigzag path, see examples of zigzag posets in Proposition 3). Notice that there are no “X” shape in all the three classes of posets, therefore their chain polytopes and order polytopes are unimodular equivalent ([7]). In this subsection, we will focus on order polytopes, and all results are also true for chain polytopes.

First, by the definition of separating hyperplanes, together with Lemmas 3 and 4 about the descriptions of the vertices and edges for order polytopes, we have the following description.

Lemma 8.

H

is a separating hyperplane for

O (P)

if and only if the following two properties are satisfied:

1.: there exist two poset ideals I and J such that $h (ρ (I)) > 0$ and $h (ρ (J)) < 0$ (getting two nontrivial subpolytopes);
2.: $h (ρ (I)) h (ρ (J)) \geq 0$ , for each pair of poset ideals I and J such that $(I \ J) \cup (J \ I)$ is connected in P.

We call a pair of poset ideals I and J that does not satisfy the second property in Lemma 8 a bad pair for h, i.e.,

h (ρ (I)) h (ρ (J)) < 0

and

(I \ J) \cup (J \ I)

is connected in P. In the rest of this subsection, we will prove most necessary conditions for being a separating hyperplane by constructing bad pairs. We are looking for posets which have the following property.

Consider the following three properties of the hyperplane

H

defined by

h (x) = 0

.

Property 1.

1.: There exist two minimal elements $ξ_{i}$ and $ξ_{j}$ such that $c_{i} > 0$ and $c_{j} < 0$ ;
2.: All nonzero coefficients have the same absolute value, i.e., $c_{i} \in {0, 1, - 1}$ after rescaling, for all $i = 1, 2, \dots, d$ ;
3.: All the coefficients $c_{i}$ , where $ξ_{i}$ is minimal element of P, uniquely determine the other coefficients.

In what follows, we discuss whether Property 1 can be a necessary and sufficient condition for

H

to be a separating hyperplane of the order polytope

O (P)

.

Note that once Property 1 is a necessary and sufficient condition for

H

to be a separating hyperplane of

O (P)

, we can easily check whether a given hyperplane is a separating hyperplane of

O (P)

. Moreover, assume all coefficients for minimal elements are not zero, then the total number of separating hyperplanes of

O (P)

will be

2^{# {of \min elements in P}}

, since each minimal element can only choose to be positive or negative, and then all other coefficients are uniquely determined by the minimal elements. Among the three classes of posets we mentioned: disjoint chains, connected binary trees and connected zigzag posets, only disjoint chains satisfy Property 1. We will provide counter examples for the other two posets and give the best possible results under certain conditions.

Proposition 1.

For the order polytope

O (P)

, where P consists of disjoint chains, Property 1 is a necessary and sufficient condition for

H

to be a separating hyperplane.

Proof.

We first prove that all three conditions listed in Property 1 are necessary for

H

to be a separating hyperplane.

By Lemma 8 (1), there exists one order ideal I of $P$ , such that $h (ρ (I)) > 0$ . We assume I is connected, otherwise we look at the chain decomposition of $P = C_{1} \cup \dots \cup C_{r}$ and consider $I \cap C_{i}$ , for $i = 1, \dots, r$ . At least one of the intersections is nonempty and satisfies $h (ρ (I \cap C_{i})) > 0$ . Now back to the case when I is connected. Since I is a chain, there exists a unique minimal element i in I. We claim that $c_{i} \geq 0$ , where $c_{i}$ is the coefficient of $x_{i}$ in $H$ . In fact, if $c_{i} < 0$ , I and $J = {i}$ is a bad pair. Actually, here we can assume $c_{i} > 0$ , since in the case $c_{i} = 0$ , we can simply throw this element away from the poset and look at the new minimal element in the subposet $P \ {i}$ . Since the whole I cannot have all coefficients zero, we will just assume $c_{i} \neq 0$ . Similarly, we also have another minimal element j with $c_{j} < 0$ .
We first prove that nonzero coefficients of the minimal elements need to have the same absolute value. For example, consider the following poset.

Without loss of generality, pick $c_{a} > 0$ , $c_{b} < 0$ . Suppose $| c_{b} | > | c_{a} |$ . Let $I = {a}$ , $J = {b, a}$ . Then $(I, J)$ is a bad pair. So, we need $| c_{b} | = | c_{a} |$ . Considering all pairs of minimal elements with opposite signs, it follows that their coefficients have the same absolute value.
Now consider the pair $I = {a}$ , $J = {a, d}$ , to make $(I, J)$ not bad, we need $c_{d} \geq - c_{a} = c_{b}$ . Consider the pair $I = {b}$ , $J = {b, a, d}$ , we have $c_{d} \leq 0$ . Then consider the pair $I = {a, d}$ , $J = {b, a, d}$ , since we want to avoid zero coefficient, assume $c_{d} \neq 0$ , therefore we have $c_{d} \leq - c_{a}$ . Therefore, we need $c_{d} = - c_{a}$ . For the same reason, we have $c_{g} = c_{a}$ . Now consider $c_{e}$ . Similar to above, the pairs $({a}, {a, d, e})$ , $({b}, {b, a, d, e})$ and $({a, d, e}, {b, a, d, e})$ provide $c_{e} = c_{a}$ . Continuing this way, we can show that the signs along each chain need to alternate and their coefficients have the same absolute value.
We have just shown in the previous part that given the coefficients of the minimal elements, there exists a unique way to extend the coefficients to other elements (assume avoiding zero coefficients), which is exactly Property 1 (3).

Now we want to show that if a hyperplane

H

satisfies the three conditions listed in Property 1, then

H

is a separating hyperplane. Condition (1) guarantees part (1) in Lemma 8. Now we want to show that there is no bad pair. For any pair of poset ideals

(I, J)

, if

J \ I

is connected, then

J \ I

is a segment in a chain. By the necessary conditions on the coefficients of

H

,

\sum_{i \in J \ I} c_{i} \in {- 1, 0, 1}

. As a result, no matter what the value of

h (v_{I})

is, we always have

h (v_{I}) h (v_{J}) \geq 0

. □

Proposition 2.

For binary trees, the following are true:

1.: Property 1 (1) is necessary.
2.: Property 1 (2) is not necessary.
3.: Assume a separating hyperplane $H$ satisfying Property 1 (1) and (2), then (3) is also necessary.
4.: However, all three conditions in Property 1 together are not sufficient for a hyperplane to be a separating hyperplane.

Proof.

We want to show that there exist two minimal elements i and j such that $c_{i} > 0$ and $c_{j} < 0$ . The argument in the proof for the disjoint union of chains also works here. The key point is that for any connected poset ideal I in the binary tree and one of its minimal element i, $I \ {i}$ is still connected in $P$ .
The argument that all the minimal elements have the same absolute value still holds as in the disjoint union of chains. However, it is possible that not all elements have the same absolute value. For example, consider the hyperplane as the following labelled represented poset, where the label for an element i in P is the coefficient $c_{i}$ in $H$ . We can check that there are no bad pairs for $H$ , thus $H$ is a separating hyperplane. However, not all coefficients in $H$ have the same absolute value.
Now assume all coefficients have the same absolute value, and thus can only take value from ${- 1, 0, 1}$ after rescaling. So here we only need to talk about the sign for an element i in P (+ refers to $c_{i} = 1$ and − refers to $c_{i} = - 1$ ). Now we want to show that the sign of an element is determined by the sign of its two children. Here “the sign of the child" refers to the sign of the poset ideal generated by that child. In particular, there are exactly six local sign patterns:

Notice that 0 appears if and only if its children have a + and a −. For two elements $a, b$ with a common parent d,
(a)
suppose $c_{b} = c_{a} = 1$ . Let e be a minimal element with $c_{e} = - 1$ . Then by the pair $(I = {e}, J = ⟨ d, e ⟩)$ (J is the poset ideal generated by d and e), we have $h (ρ (J)) \geq 0$ , and thus $c_{d} = - 1$ . This corresponds to the second tree above, and the same for the first tree.
(b)
suppose $c_{b} = - c_{a} > 0$ . Then by the pair $({b}, {a, b, d})$ and $({a}, {a, b, d})$ , we have $c_{d} = 0$ , which corresponds to the third tree above.
(c)
suppose $c_{b} > 0$ and $c_{a} = 0$ . This indicates that a is larger than some minimal element e with $c_{e} < 0$ . Then by the pair $({e}, ⟨ d ⟩)$ and $({b}, ⟨ d ⟩)$ , we have $h (ρ (⟨ d ⟩)) = 0$ , thus $c_{d} = - c_{b}$ , which corresponds to the forth tree above. The fifth and the sixth tree can be obtained in a similar way.
Following the above rule will not always result in a separating hyperplane. For example, consider the hyperplane represented by the following labelled poset.

One can easily check that the above hyperplane follows the six local rules listed above as well the other two conditions in Property 1. However, for example, $I = ⟨ g, e, f ⟩$ and $J = ⟨ a, b, c, d, e, f ⟩$ is a bad pair.

□

Proposition 3.

For the zigzag posets, Property 1 (1) is not necessary for

H

to be a separating hyperplane. However, for any hyperplane

H

with Property 1 (1), the other two conditions listed in Property 1 are necessary and sufficient conditions for

H

to be a separating hyperplane.

Proof.

The following example is a separating hyperplane but does not satisfy Property 1 (1).

Now assume

H

is a hyperplane satisfying Property 1 (1). We first prove that if

H

is a separating hyperplane, then both Property 1 (2) and (3) are true.

We want to prove that all the nonzero coefficients in any separating hyperplane for a zigzag poset have the same absolute value. First notice that all the minimal elements have the same absolute value, as proved in Proposition 1. Following the same proposition, all the non-maximal elements (if nonzero) have the same absolute value. As for the maximal elements, let us have a closer look at the zigzag poset. One maximal element m covers at most two minimal elements $p, q$ . For the case m only covers one minimal element, the coefficient $c_{m}$ needs to have the same absolute value for the same reason as disjoint chains proved in Proposition 1. Now there are two cases when m covers two minimal elements $p, q$ :
(a)
$c_{p} \cdot c_{q} < 0$ . Let $I = ⟨ m ⟩$ be the poset ideal generated by m. Consider the pair I and J, where $J = {p}$ or ${q}$ . We have $h (ρ (I)) = 0$ , which implies $| c_{m} | \leq 1$ .
(b)
$c_{p} \cdot c_{q} > 0$ . Say $c_{p} = c_{q} = 1$ . Let n be a maximal element adjacent to m that covers two minimal elements with different signs. For example,

Consider the poset ideal $I = ⟨ m, n ⟩$ . Similar to the previous case, we have $h (ρ (I)) = 0$ , which still implies $| c_{m} | \leq 1$ .
Since $H$ satisfies conditions (1) and (2) in Property 1, once we fix the signs of all the minimal elements, all elements except those maximal are uniquely determined the same way as the disjoint chains (Proposition 1). As for the maximal elements, they are uniquely determined by the signs of their two children the same as the binary trees (Proposition 2).

Now we want to prove that any hyperplane

h (x) = 0

satisfying the three conditions listed in Property 1 is a separating hyperplane. The condition (1) in Property 1 implies condition (1) in Lemma 8. Now we want to show that there are no bad pairs. Notice that by the rules described above, any connected component has value sum to

{1, 0, - 1}

. In the case

I \subset J

, if

h (ρ (I)) < 0

, then

h (ρ (J)) = h (ρ (I)) + h (ρ J \ I) < 1

, since

h (ρ (J \ I)) < 1

. Now we claim that for the zigzag poset, the condition that

(I \ J) \cup (J \ I)

is connected, implies that

I \ J

or

J \ I

is empty. Consider a generic connected subposet

S = (I \ J) \cup (J \ I)

. We want to show that

S \subset I

or

S \subset J

. If S only has one maximal element, then it is clear that all the elements belong to the same order ideal as the maximal element (either I or J). If there are more than one maximal element, see the following example.

Consider two adjacent maximal elements (they are a and b in the example). These two maximal elements cover a common minimal element d, because this subposet is connected. Then d belongs to the same poset ideal as both a and b. Therefore, both a and b belong to the same poset ideal. This shows that S belongs to either I or J. □

4. Birkhoff Polytopes

The Birkhoff polytope

B_{n}

is defined to be the convex hull of all

n \times n

nonnegative real matrices with row sum and column sum equal to one. These matrices are known as the doubly stochastic matrices. Here we consider an

n \times n

matrix as a

n^{2}

-vector. The Birkhoff polytope is a well-studied polytope and have many applications, in combinatorial optimization and Bayesian statistics, for example [8,9]. In this section, we look for separating hyperplanes for

B_{n}

(Theorem 3).

In the rest of the section, we assume the hyperplanes have the form

h (x) = c_{1} x_{1} + \dots + c_{n^{2}} x_{n^{2}} = 0,

but actually all the results hold for general hyperplanes

h (x) = r

for any constant r. We start with the following known properties of the Birkhoff polytope

B_{n}

. Here we use both the one-line notation and the cycle notation for a permutation. For example,

w = 34256187

is the one-line notation for the permutation sending

1 \to 3

,

2 \to 4

,

3 \to 2

,

4 \to 5

,

5 \to 6

,

6 \to 1

,

7 \to 8

and

8 \to 7

. The cycle notation for w is

(132456) (78)

, thus w has two cycles.

$dim B_{n} = {(n - 1)}^{2}$ ;
$B_{n}$ has $n!$ vertices, which are the $n \times n$ permutation matrices;
permutations w and u form an edge in $B_{n}$ if and only if $w^{- 1} u$ has one cycle (excluding the fixed points), [10].

In particularly, for

n = 3

,

w^{- 1} u

has one cycle for any

w, u \in S_{3}

with

w \neq u

. In other words, the skeleton graph for

B_{3}

is the complete graph

K_{6}

. Therefore, there are no separating hyperplanes for

B_{3}

. Moreover, we have

Lemma 9.

B_{4}

has no separating hyperplanes.

Proof.

Suppose there exists a separating hyperplane with coefficients indicated in the following matrix:

(\begin{matrix} a & b & c & d \\ e & f & g & h \\ i & j & k & ℓ \\ m & n & o & p \end{matrix}) .

We use

x_{w}

to represent the vector corresponding to the permutation matrix for a permutation w. By symmetry, assume

h (x_{id}) > 0

. The identity permutation is connected with all other permutations except for three with two cycles

(12) (34)

,

(13) (24)

and

(14) (23)

. Then for any permutation w that is not the above three, we have

h (x_{w}) \geq 0

, and the only possible u’s with

h (x_{u}) < 0

are among the above three. Without loss of generality, assume

h (x_{(12) (34)}) < 0

. Then note that the permutation

(12) (34)

is connected to all other permutations except for id,

(13) (24)

and

(14) (23)

. Therefore,

h (x_{v}) = 0

for all permutations v with one cycle.

Now notice that

h (x_{(12) (34)}) + h (x_{(13) (24)}) = h (x_{2143}) + h (x_{3412}) = (e + b + o + ℓ) + (i + n + c + h) = (e + n + c + ℓ) + (i + b + o + h) = h (x_{2413}) + h (x_{3142}) = h (x_{(1243)}) + h (x_{(1342)}) = 0

, therefore,

h (x_{(13) (24)}) > 0

. Similarly, we can get

h (x_{(14) (23)}) > 0

. However, then

0 < h (x_{(13) (24)}) + h (x_{(14) (23)}) = (i + n + c + h) + (m + j + g + d) = (i + n + g + d) + (m + j + c + h) = h (x_{(1324)}) + h (x_{(1423)}) = 0

, a contradiction. Therefore, there does not exist any separating hyperplane. □

Remark 1.

Even though Lemma 9 is a special case of Theorem 3, we still state it separately as a lemma, since its proof provides a good example for Theorem 3.

Theorem 3.

B_{n}

has no separating hyperplanes.

Proof.

Assume there is a hyperplane

h (x) = 0

. By symmetry, assume

h (x_{id}) > 0

. Since all permutations with one cycle are connected with id, we have

h (x_{u}) \geq 0

for all u with one cycle. Suppose

h (x_{v}) < 0

for some permutation v with k cycles. Assume k is the smallest such number. In other words,

h (x_{w}) \geq 0

, for all w with fewer than k cycles. Notice that

k > 1

. First notice that

h (x_{σ}) = 0

, for all

σ

connected with v, and have fewer cycles than v. In fact, since

σ

has fewer than k cycles, we have

h (x_{σ}) \geq 0

. On the other hand, since

σ

is connected with v,

h (x_{σ}) > 0

cannot happen. Therefore,

h (x_{σ}) = 0

.

Now we apply the method in Lemma 9 to show that

h (x_{v}) < 0

cannot happen. Write in cycle notation

v = (C_{1}) (C_{2}) (C_{3}) \dots (C_{k})

, where each

C_{i}

is some sequence of numbers. Without loss of generality, assume

C_{1} = 125 A

and

C_{2} = 34 B

, where A and B are sequences of numbers. First consider the permutation

τ_{1} = (325 A) (14 B) C_{3} \dots C_{k}

. Notice that

h (x_{v}) + h (x_{τ_{1}}) = h (x_{τ_{2}}) + h (x_{τ_{3}}),

where

τ_{2} = (125 A 34 B) C_{3} \dots C_{k}

and

τ_{3} = (325 A 14 B) C_{3} \dots C_{k}

. One can check that

τ_{2}

and

τ_{3}

are both connected with v, in fact

τ_{2}

differs with v by

(13)

and

τ_{3}

differs with v by

(24)

. Since

τ_{2}

and

τ_{3}

also have fewer than k cycles, we proved earlier that

h (x_{τ_{2}}) = 0

and

h (x_{τ_{3}}) = 0

. Therefore,

h (x_{τ_{1}}) > 0

.

Now consider the permutation

σ_{1} = (C_{3}) \dots (C_{k})

. Since it has fewer than k cycles, we have

h (x_{σ_{1}}) \geq 0

. Notice that

h (x_{τ_{1}}) + h (x_{σ_{1}}) = h (x_{σ_{2}}) + h (x_{σ_{3}}),

where

σ_{2} = (325 A) C_{3} \dots C_{k}

and

σ_{3} = (14 B) C_{3} \dots C_{k}

. One can check that

σ_{2}

and

σ_{3}

are both connected with v. Since

σ_{2}

and

σ_{3}

both have fewer cycles than v, we have

h (x_{(σ_{2})}) = 0

and

h (x_{σ_{3}}) = 0

. This is a contradiction, since

h (x_{τ_{1}}) > 0

and

h (x_{σ_{1}}) \geq 0

. □

Author Contributions

Both authors contribution equally on the result.

Funding

The first author is partially supported by JSPS KAKENHI 19H00637.

Acknowledgments

The authors are grateful for reviewers’ comments.

Conflicts of Interest

The authors declare no conflict of interest.

References

Li, N. Ehrhart h^*-vectors of hypersimplices. Discret. Comput. Geom. 2012, 48, 847–878. [Google Scholar] [CrossRef][Green Version]
Lam, T.; Postnikov, A. Alcoved polytopes I. Discret. Comput. Geom. 2007, 38, 453–478. [Google Scholar] [CrossRef]
Stanley, R. Eulerian partitions of a unit hypercube. In Higher Combinatorics; Aigner, M., Ed.; Reidel: Dordrecht, The Netherlands; Boston, MA, USA, 1977; p. 49. [Google Scholar]
Li, N.; Postnikov, A. Slices of graphical zonotopes. in preparation.
Hibi, T.; Li, N.; Zhang, Y. Separating hyperplanes of edge polytopes. J. Comb. Theory Ser. A 2013, 120, 218–231. [Google Scholar] [CrossRef]
Stanley, R. Two poset polytopes. Discret. Comput. Geom. 1986, 1, 9–23. [Google Scholar] [CrossRef]
Hibi, T.; Li, N. Unimodular equivalence of order and chain polytopes. Math. Scand. 2016, 118, 5–12. [Google Scholar] [CrossRef]
Linderman, S.; Mena, G.; Cooper, H.; Paninski, L.; Cunningham, J. Reparameterizing the Birkhoff Polytope for Variational Permutation Inference. In Proceedings of the 21st International Conference on Artificial Intelligence and Statistics (AISTATS), Lanzarote, Canary Islands, Spain, 9–11 April 2018. [Google Scholar]
Lim, C.; Wright, S. Beyond the Birkhoff Polytope: Convex Relaxations for Vector Permutation Problems. In Proceedings of the Neural Information Processing Systems Conference 2014, Montreal, QC, Canada, 8–13 December 2014. [Google Scholar]
Brualdi, R.; Gibson, P. Convex polyhedra of doubly stochastic matrices. I. Applications of the permanent function. J. Comb. Theory Ser. A 1977, 22, 194–230. [Google Scholar] [CrossRef]

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Cutting Convex Polytopes by Hyperplanes

Abstract

1. Introduction

2. The Unit Cube

3. Order and Chain Polytopes

3.1. Existence of Separating Hyperplanes for Order and Chain Polytopes

3.2. Description of Separating Hyperplanes for Order and Chain Polytopes

4. Birkhoff Polytopes

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics