A Numerical Approach for the Filtered Generalized Čech Complex

Espinoza, Jesús F.; Hernández-Amador, Rosalía; Hernández-Hernández, Héctor A.; Ramonetti-Valencia, Beatriz

doi:10.3390/a13010011

Open AccessArticle

A Numerical Approach for the Filtered Generalized Čech Complex

by

Jesús F. Espinoza

^*

,

Rosalía Hernández-Amador

,

Héctor A. Hernández-Hernández

and

Beatriz Ramonetti-Valencia

Departamento de Matemáticas, Universidad de Sonora, C.P. 83000, Hermosillo, Mexico

^*

Author to whom correspondence should be addressed.

Algorithms 2020, 13(1), 11; https://doi.org/10.3390/a13010011

Submission received: 24 November 2019 / Revised: 24 December 2019 / Accepted: 27 December 2019 / Published: 30 December 2019

(This article belongs to the Special Issue Topological Data Analysis)

Download

Browse Figures

Versions Notes

Abstract

In this paper, we present an algorithm to compute the filtered generalized Čech complex for a finite collection of disks in the plane, which do not necessarily have the same radius. The key step behind the algorithm is to calculate the minimum scale factor needed to ensure rescaled disks have a nonempty intersection, through a numerical approach, whose convergence is guaranteed by a generalization of the well-known Vietoris–Rips Lemma, which we also prove in an alternative way, using elementary geometric arguments. We give an algorithm for computing the 2-dimensional filtered generalized Čech complex of a finite collection of d-dimensional disks in

R^{d}

, and we show the performance of our algorithm.

Keywords:

disk system; generalized Čech complex; Čech scale; generalized Vietoris–Rips Lemma; miniball problem

MSC:

68U05; 55U05; 68W40; 65D18

Graphical Abstract

1. Introduction

Recently, in the study of data point clouds from a topological approach (cf. [1,2,3,4,5]), the need to develop algorithms to construct different simplicial structures has arisen, such as the Vietoris–Rips complex, the Čech complex, the piecewise linear lower star complex, etc. (cf. [6,7]).

Of particular interest to us is the generalized Čech complex structure, whereas the standard Čech complex is induced by the intersection of a collection of disks with fixed radius, the generalized version admits different radii (see [8]); when radii are rescaled, using the same scale factor each time, the corresponding simplicial complexes forms the filtered generalized Čech complex.

There exist efficient algorithms to calculate the standard Čech complex (see, e.g., in [9]), and software currently available to obtain the associated filtration (cf. [10,11]); also, in [12] the authors propose an algorithm to approximate the Čech filtration. On the other hand, we can find algorithms to calculate the generalized Čech complex (see, e.g., in [8]); however, as far as we know, there are neither algorithms nor software to provide the filtered generalized Čech complex. In the present work, we show an algorithm to compute the filtered generalized Čech complex for a finite collection of disks, specifically, in the plane. Actually, we also show an algorithm to build up to the 2-dimensional filtered generalized structure (or 2-skeleton), for higher-dimensional disk systems, which many applications only require, as we can see in [13,14,15,16,17].

The key step behind these proposed algorithms, is to calculate the minimum scale factor (called Čech scale) needed to ensure that the rescaled disks have a nonempty intersection; the generalized Vietoris–Rips lemma over multiple radii will allow us to calculate these scales numerically.

We must emphasize that our main algorithm (Algorithm 3) is only generalizable to higher-dimensional disk systems to obtain the 2-dimensional filtered generalized Čech structure, as we show as an application. Additionally, we show how our algorithm yields the minimal enclosing ball for a finite set of points in the plane.

This paper is organized as follows. In Section 2, we introduce basic notions and notation which will be used throughout the paper. We define the Vietoris–Rips system and the Čech system, associated to a finite collection of closed disks in the euclidean space (or disk system) in terms of their intersection. We also introduce the fundamental notions of Vietoris–Rips scale and Čech scale for a disk system, as the infimum over all rescaling factors such that the disk system becomes a Vietoris–Rips system or a Čech system, respectively. In Lemma 2, we state and prove a generalization, over multiple radii, of the well-known Vietoris–Rips Lemma [17] (Theorem 2.5) using elementary geometric arguments. In [18] there is a proof in the generalized case, following the ideas in [17].

In Section 3 we describe the generalized versions of standard Vietoris–Rips and Čech simplicial complex structures, to the case of disk systems with different radii. We explain how their respective filtrations are induced by weight functions, and we propose an algorithm to obtain the Čech-weight function of a given disk system, associating to each Čech simplex its corresponding Čech scale.

Section 4 focuses on studying the intersection properties of collections of disks in the plane. We define a real-valuated function associated to each disk system in the plane, such that, if it turns out to be non-negative, then its Čech scale agrees with its Vietoris–Rips scale, being then easy to compute; otherwise, the Čech scale will correspond to a root of such function, and we propose a numerical approach to obtain this Čech scale (Section 5), supported on the generalized Vietoris–Rips Lemma which provides appropriated bounds.

Section 5 contains our main result, the Cech.scale algorithm, whose input is a disk system in the plane, and the output is the corresponding Čech scale, as well as the unique intersection point of the rescaled disk system at its Čech scale (see Lemma 1). We show as a example the miniball problem, to show how our Cech.scale algorithm yields the minimal enclosing ball for a finite point cloud in the plane.

Finally, we conclude the paper illustrating in Section 6 an algorithm for computing the Čech filtration of the 2-skeleton of the generalized Čech complex structure for a d-dimensional disk systems in an arbitrary euclidean space

R^{d}

.

2. Vietoris–Rips and Čech systems

Throughout this paper, a finite collection of closed d-disks in the euclidean space

R^{d}

, with positive radius,

M = {D_{i} (c_{i}; r_{i}) \subset R^{d} ∣ r_{i} > 0, 1 \leq i \leq m}

(1)

will be called d-disk system, or simply disk system when there is no risk of confusion. In this section, we introduce and analyze two fundamental subclasses of disk systems, namely, the Vietoris–Rips systems and the Čech systems. We study the infimum of those scales that turn a disk system into a Vietoris–Rips or Čech system. We conclude this section presenting a generalized version of the Vietoris–Rips Lemma, extended to disk systems.

Definition 1.

Let

M = {D_{1}, D_{2}, \dots, D_{m}}

be a disk system. We say M is a Vietoris–Rips system if

D_{i} \cap D_{j} \neq \emptyset

for each pair

i, j \in {1, 2, \dots, m}

. Moreover, if the disk system M has the nonempty intersection property

⋂_{D_{i} \in M} D_{i} \neq \emptyset

, then M is called a Čech system.

For each

λ \geq 0

, and disk system M as in (1) we define the collection

M_{λ} : = {D_{i} (c_{i}; λ r_{i}) \subset R^{d} ∣ D_{i} \in M}

and say that

λ

is a scale. Geometrically, the set

M_{λ}

consists of disks with the same centers than those in M, but with rescaled radii by

λ

. Clearly, only when

λ > 0

the set

M_{λ}

will be again a disk system. Note that

M_{1} = M

, and

M_{0}

is the set consisting of the centers of the disks in M.

Definition 2.

Let M be a disk system. The Vietoris–Rips scale of M is defined by,

ν_{M} : = inf {λ \in R ∣ M_{λ} i s a V i e t o r i s - R i p s s y s t e m} .

Analogously, the Čech scale of M is defined by,

μ_{M} : = inf {λ \in R ∣ M_{λ} i s a Č e c h s y s t e m} .

Let

μ_{M}

be the Čech scale of the disk system M, then we have that

⋂_{D_{i} \in M} D_{i} (c_{i}; μ_{M} r_{i}) \neq \emptyset

. Essentialy, this is a consequence of the completeness of the euclidean space, the fact that

⋂_{D_{i} \in M} D_{i} (c_{i}; λ r_{i}) \subset R^{d}

is a closed subset for every scale

λ

in the set

{λ \in R ∣ M_{λ} is a Čech system}

and

⋂_{D_{i} \in M} D_{i} (c_{i}; λ^{'} r_{i}) \subset ⋂_{D_{i} \in M} D_{i} (c_{i}; λ r_{i})

for

λ^{'} < λ

.

A straightforward calculation shows the following characterizations. M is a Vietoris–Rips system if and only if,

ν_{M} \leq 1

(in particular,

ν_{M_{ν_{M}}} = 1

); similarly, M is a Čech system if and only if,

μ_{M} \leq 1

.

Note that it is easy to calculate the Vietoris–Rips scale

ν_{M}

for a given disk system

M = {D_{1}, D_{2}, \dots, D_{m}}

: if

r_{i}

denotes the radius of

D_{i}

, and

∥ c_{i} - c_{j} ∥

represents the distance between the center of

D_{i}

and

D_{j}

, then

ν_{M} = {max}_{i < j} {∥ c_{i} - c_{j} ∥ / (r_{i} + r_{j})}

.

For a disk system M with just one disk, its Vietoris–Rips scale is

ν_{M} = 0

; if

M = {D_{1}, D_{2}}

has two disks, then

ν_{M} = ∥ c_{1} - c_{2} ∥ / (r_{1} + r_{2})

. Actually, in both cases the Vietoris–Rips scale agrees with the Čech scale.

On the other hand, calculating the Čech scale is a more complicated issue if the disk system has at least three disks. Concerning to Čech scales, we have the following lemma, which will become important for our implementations.

Lemma 1.

Let

μ \geq 0

be a scale and let M be a disk system. Then, μ is the Čech scale of M if and only if, the μ-rescaled system

M_{μ}

has only one intersection point, i.e., the set

⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i})

is unitary.

Such point in

⋂_{D_{i} \in M} D_{i} (c_{i}; μ_{M} r_{i})

will be denoted by

c_{M}

.

Proof.

The case

μ = 0

happens only when the disk system consists of a single disk or is a collection of concentric disks. In this case, the claim of the lemma is evident.

Let

μ > 0

be the Čech scale of M and suppose there exist a couple of points

p_{1}, p_{2} \in ⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i})

such that

p_{1} \neq p_{2}

. By convexity of the disks, it follows that the middle point

\bar{p} = \frac{1}{2} (p_{1} + p_{2})

must belong to every disk

D_{i} (c_{i}; μ r_{i})

. On the other hand,

∥ \bar{p} - c_{i} ∥ < max {∥ p_{1} - c_{i} ∥, ∥ p_{2} - c_{i} ∥}

for any center

c_{i}

in the disk system. Let

μ_{i} < μ

be a scale such that

\bar{p} \in D_{i} (c_{i}; μ_{i} r_{i})

for every disk in M. It follows that

\bar{μ} = max {μ_{i}} < μ

and

\bar{p} \in ⋂_{D_{i} \in M} D_{i} (c_{i}; \bar{μ} r_{i})

which contradicts the minimality of the Čech scale

μ

. Therefore, the set

⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i})

is unitary.

Now, suppose

⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i})

is unitary and consider the set

S = {λ \in R ∣ M_{λ} is a Čech system}

.

If

⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i}) = {p}

, then

p \in \partial D_{i_{0}} (c_{i_{0}}; μ r_{i_{0}})

for some

D_{i_{0}} \in M

, because otherwise there would exist a neighborhood of p entirely contained in

⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i})

.

The fact that

μ = inf S

is a consequence of the following.

Let $λ \in S$ be a scale such that $⋂_{D_{i} \in M} D_{i} (c_{i}; λ r_{i}) \neq \emptyset$ . If $λ < μ$ , then $D_{i} (c_{i}; λ r_{i}) \subset D_{i} (c_{i}; μ r_{i})$ for any $D_{i} \in M$ , and $p \notin D_{i_{0}} (c_{i_{0}}; λ r_{i_{0}})$ ; thus, $⋂_{D_{i} \in M} D_{i} (c_{i}; λ r_{i}) \subset ⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i}) = {p}$ and $p \notin ⋂_{D_{i} \in M} D_{i} (c_{i}; λ r_{i})$ ; therefore, $⋂_{D_{i} \in M} D_{i} (c_{i}; λ r_{i}) = \emptyset$ . Then, $λ \geq μ$ , for any $λ \in S$ .
For every $ε > 0$ , we have ${p} \in D_{i} (c_{i}; μ r_{i}) \subset D_{i} (c_{i}; (μ + ε) r_{i})$ . Then, $⋂_{D_{i} \in M} D_{i} (c_{i}; (μ + ε) r_{i}) \neq \emptyset$ , i.e., $μ + ε \in S$ .

□

Example 1.

Figure 1 shows (left picture) the following disk system in

R^{2}

,

M = {D_{1} ((- 3, 0); 4), D_{2} ((1, 3); 3), D_{3} ((2, - 1); 2)} .

This 2-disk system is a Vietoris–Rips system and also a Čech system. In this case, we have

ν_{M} = \sqrt{26} / 6 \approx 0.8498

, and, in actuality, the

ν_{M}

-rescaled 2-disk system

M_{ν_{M}}

(center picture) has an empty intersection, i.e.,

⋂_{D_{i} \in M} D_{i} (c_{i}; ν_{M} r_{i}) = \emptyset

, so it corresponds to a Vietoris Rips system which is not a Čech system. For such 2-disk system, theCech.scalealgorithm in Section 5 yields to

μ_{M} = 0.9188

and the Čech system

M_{μ_{M}}

is shown in the right picture.

There exists a close relationship between Vietoris–Rips systems and Čech systems. Obviously, every Čech system is also a Vietoris–Rips system, but the opposite statement does not hold in general (as we saw in the example above). However, we have the following result which extends the standard Vietoris–Rips Lemma ([17], Theorem 2.5), to any Vietoris–Rips system; such result can be found in ([18], Theorem 3.2) in the context of weighted simplicial complexes.

The standard Vietoris–Rips Lemma is established in the context of a disk system in which all radii are equal, and corresponds to a reformulation of the well-known Jung’s Lemma [19]. On the other hand, for a disk system in general, the Vietoris–Rips Lemma is also valid, although it does not follow directly from Jung’s Lemma. Next, we propose a proof of the Vietoris–Rips Lemma, for an arbitrary disk system, using elementary geometrical arguments.

Lemma 2.

Let

M = {D_{i} (c_{i}; r_{i}) \subset R^{d} ∣ r_{i} > 0}

be a finite set of closed disks in

R^{d}

. If

D_{i} (c_{i}; r_{i}) \cap D_{j} (c_{j}; r_{j}) \neq \emptyset

for every pair of disks in M, then

⋂_{D_{i} \in M} D_{i} (c_{i}; \sqrt{2 d / (d + 1)} r_{i}) \neq \emptyset .

In other words, for every Vietoris–Rips system M in

R^{d}

, the

\sqrt{2 d / (d + 1)}

-rescaled disk system

M_{\sqrt{2 d / (d + 1)}}

is a Čech system.

Proof.

First, we prove the result for the case where M has at most

d + 1

disks, say

M = {D_{1}, \dots, D_{d^{'}}}

,

d^{'} \leq d + 1

. Note that we need to prove that

μ_{M} \leq \sqrt{\frac{2 d}{d + 1}}

.

Let

{c_{M}} = ⋂_{i} D_{i} (c_{i}; μ_{M} r_{i})

be the unique intersection point of the

μ_{M}

-rescaled disk system (see Lemma 1). Without loss of generality, we assume that

∥ c_{i} - c_{M} ∥ = μ_{M} r_{i}

for

c_{1}, \dots, c_{m}

and

m \leq d^{'}

.

Then,

c_{M}

belongs to the convex hull of the set

{c_{1}, \dots, c_{m}}

, for if this were not true, there would exist an hyperplane across

c_{M}

such that the set

{c_{1}, \dots, c_{m}}

is completely contained at one side, let v be the normal vector for such hyperplane in the opposite direction, then

〈 v, c_{i} - c_{M} 〉 > 0

for all

i = 1, \dots, m

, therefore

∥ c_{i} - c_{M} ∥^{2} = ∥ c_{i} - (c_{M} + t v) ∥^{2} + 2 t 〈 v, c_{i} - c_{M} 〉 - t^{2} {∥ v ∥}^{2} > {∥ c_{i} - (c_{M} + t v) ∥}^{2}

for every

t \in I_{v} : = (0, 2 〈 v, c_{i} - c_{M} 〉 / {∥ v ∥}^{2})

; this implies that

c_{M} + t v \in D (c_{i}; μ_{M} r_{i})

for any

i = 1, \dots, m

and

t \in I_{v}

, which is a contradiction since

c_{M}

is the only point in the intersection

⋂_{i} D (c_{i}; μ_{M} r_{i})

. Therefore,

c_{M}

is in the convex hull.

Now, define

{\hat{c}}_{i} : = c_{i} - c_{M}

and let

θ_{i j}

denote the angle between vectors

{\hat{c}}_{i}

and

{\hat{c}}_{j}

. As

c_{M}

is in the convex hull of

{c_{1}, \dots, c_{m}}

, then the vector

0 \in R^{d}

can be written as a convex combination

\sum_{j = 1}^{m} a_{j} {\hat{c}}_{j} = 0

. Thus,

〈 \sum_{j = 1}^{m} a_{j} {\hat{c}}_{j}, {\hat{c}}_{i} 〉 = 0

for any

i = 1, \dots, m

, and

\sum_{j = 1}^{m} 〈 a_{j} {\hat{c}}_{j}, {\hat{c}}_{i} 〉 = \sum_{j = 1}^{m} a_{j} ∥ {\hat{c}}_{j} ∥ ∥ {\hat{c}}_{i} ∥ cos θ_{j i} = 0 .

Taking out the common factor

∥ {\hat{c}}_{i} ∥

, we have

\sum_{j = 1}^{m} a_{j} ∥ {\hat{c}}_{j} ∥ cos θ_{j i} = 0

. Now, taking the sum over i, we deduce that

\sum_{i = 1}^{m} \sum_{j = 1}^{m} a_{j} ∥ {\hat{c}}_{j} ∥ cos θ_{j i} = \sum_{j = 1}^{m} a_{j} ∥ {\hat{c}}_{j} ∥ \sum_{i = 1}^{m} cos θ_{j i} = 0 .

(2)

Note that

cos θ_{i i} = 1

. On the other hand, if were

cos θ_{j i} > - \frac{1}{m - 1}

for all

1 \leq i, j \leq m

,

i \neq j

, then for each j we should have

\sum_{i = 1}^{m} cos θ_{j i} = 1 + \sum_{j = 1, j \neq i}^{m} cos θ_{j i} > 0

. However, this contradicts (2) because

\sum_{j = 1}^{m} a_{j} = 1

and

a_{j} \geq 0

. Therefore, there must exist

i \neq j

, say

i = 1

and

j = 2

, such that

cos θ_{12} \leq - \frac{1}{m - 1} \leq - \frac{1}{d}

, then

0 \leq \frac{d}{d - 1} (1 + cos (θ_{12})) \leq 1 .

(3)

It follows from inequality above and from the AM-GM inequality, that

\sqrt{\frac{d}{d - 1} (1 + cos (θ_{12}))} \cdot \sqrt{r_{1} r_{2}} \leq \sqrt{r_{1} r_{2}} \leq \frac{r_{1} + r_{2}}{2} .

(4)

A straightforward calculation on (4) leads us to the following inequality

(d + 1) {(r_{1} + r_{2})}^{2} \leq 2 d (r_{1}^{2} + r_{2}^{2} - 2 r_{1} r_{2} cos (θ_{12}))

so,

\begin{matrix} (d + 1) μ_{M}^{2} {(r_{1} + r_{2})}^{2} & \leq 2 d (μ_{M}^{2} r_{1}^{2} + μ_{M}^{2} r_{2}^{2} - 2 μ_{M}^{2} r_{1} r_{2} cos (θ_{12})) \\ = 2 d (∥ {\hat{c}}_{1} ∥^{2} + ∥ {\hat{c}}_{2} ∥^{2} - 2 ∥ {\hat{c}}_{1} ∥ ∥ {\hat{c}}_{2} ∥ cos (θ_{12})) \\ = 2 d (∥ {\hat{c}}_{1} ∥^{2} + ∥ {\hat{c}}_{2} ∥^{2} - 2 〈 \hat{c_{1}}, \hat{c_{2}} 〉) = 2 d ∥ {\hat{c}}_{1} - {\hat{c}}_{2} ∥^{2} \end{matrix}

which implies,

μ_{M}^{2} \leq \frac{2 d}{d + 1} \cdot \frac{∥ {\hat{c}}_{1} - {\hat{c}}_{2} ∥^{2}}{{(r_{1} + r_{2})}^{2}} = \frac{2 d}{d + 1} \cdot \frac{∥ c_{1} - c_{2} ∥^{2}}{{(r_{1} + r_{2})}^{2}} \leq \frac{2 d}{d + 1}

(the last inequality holds because M is a Vietoris–Rips system) or equivalently

μ_{M} \leq \sqrt{\frac{2 d}{d + 1}}

.

For a collection with more than

d + 1

disks, the claim of the lemma is a consequence of the Helly’s Theorem (see [20], Problem 29), which establishes that for any finite collection, with at least

d + 1

convex subsets of the d-dimensional euclidean space

R^{d}

, if the intersection of every subcolection with

d + 1

of such sets is nonempty, then the whole collection has a nonempty intersection. This concludes the proof. □

The upper bound

\sqrt{2 d / (d + 1)}

in Lemma 2 is optimal: it suffices to take a disk system with

d + 1

disks of equal radii and pairwise tangents (cf. [6], Section III.2).

In the Example 1 we can see what the Vietoris–Rips Lemma claim for the 2-disk system M:

μ_{M} = 0.9188 < 0.9812 = \sqrt{4 / 3} ν_{M}

.

To conclude this section, notice that as for a Čech system

μ_{M} \leq 1

, then the Vietoris–Rips Lemma implies the following result.

Corollary 1.

If M is an arbitrary d-disk system and

ν_{M}

is its Vietoris–Rips scale, then its Čech scale satisfies that

μ_{M} \in [ν_{M}, \sqrt{2 d / (d + 1)} ν_{M}]

. In consequence, for every d-disk system M, the rescaled disk system

M_{\sqrt{2 d / (d + 1)} ν_{M}}

is always a Čech system.

In particular, if

\sqrt{2 d / (d + 1)} ν_{M} \leq 1

then

M_{ν_{M}}

is a Čech system.

3. Filtered Generalized Simplicial Structures for Disk Systems

In this section, we introduce two simplicial structures associated with a disk system M, as well as the filtration induced by rescaling the system M. The importance of these notions lies in their relation to the topological analysis through persistent homology of filtered simplicial structures, induced by point clouds with nonhomogeneous neighborhoods.

Let M be a disk system. Denote by

VR (M)

the family of all Vietoris–Rips subsystems of M, this is,

VR (M) = {σ \subset M ∣ D_{i} \cap D_{j} \neq \emptyset, D_{i}, D_{j} \in σ} .

Analogously, denote by

C (M)

the set of all Čech subsystems,

C (M) = {σ \subset M ∣ \cap_{D_{i} \in σ} D_{i} \neq \emptyset} .

On the other hand, recall that a simplicial structure on a (finite) set V is defined as a family

Δ (V) \subset 2^{V}

of subsets of V such that if

σ \in Δ (V)

and

τ \subset σ

, then

τ \in Δ (V)

. Thus, for any

σ \in VR (M)

every disk subsystem

τ \subset σ

is also in the family

τ \in VR (M)

. The same property is valid for the family

C (M)

. These properties imply that

VR (M)

and

C (M)

are simplicial complexes.

We refer to

VR (M)

as the generalized Vietoris–Rips complex associated to the disk system M, and to

C (M)

as the generalized Čech complex of M.

The above construction allows us to perform topological data analysis of point cloud data through the persistent homology of the generalized Vietoris–Rips or Čech complexes. However, to perform such analysis it is necessary to construct a filtered simplicial structure. We will define a filtration through weight functions.

Let

Δ

be a simplicial complex and let

ω : Δ \to R

be a function. We call

ω

a weight function over the simplicial complex

Δ

if

τ, σ \in Δ

and

τ \subset σ

, implies

ω (τ) \leq ω (σ)

.

For example, to the generalized Čech complex

C (M)

of the disk system M, the function

ω : C (M) \to R, σ \mapsto μ_{σ}

which assigns the Čech scale to any Čech subsystem

σ \subset M

, is a weight function, called the Čech-weight function. The analogous property holds for the Vietoris–Rips complex and the Vietoris–Rips scale (see [18] for the construction of the filtered generalized Čech complex using weighted point clouds).

Moreover, from the definition we have for the Čech-weight function and to every non-negative scale

λ \geq 0

, that

ω^{- 1} ((- \infty, λ]) = C (M_{λ}) .

We denote by

C_{M} (λ)

the family

C (M_{λ})

for

λ \geq 0

, i.e., the family of all Čech subsystems of the

λ

-rescaled disk system

M_{λ}

, in order to make the dependence explicit with respect the the parameter. We establish the analogous definition for

{VR}_{M} (λ)

, for any

λ \geq 0

.

Note that there is no restriction on the scale

λ \geq 0

, additional to the non-negativity, i.e., we allow greater values of

λ

than 1, in the interest of studying the generalized Čech complex of rescaled disk systems beyond the original.

For

λ^{'} \leq λ

we have the families contention:

{VR}_{M} (λ^{'}) \subset {VR}_{M} (λ)

and

C_{M} (λ^{'}) \subset C_{M} (λ)

. In general, given a simplicial complex

Δ

and a weight function

ω : Δ \to R

, any increasing sequence

λ_{1} < \dots < λ_{s}

of real numbers induces a simplicial filtration:

Δ_{1} \subset \dots \subset Δ_{s}

for

Δ_{i} : = ω^{- 1} ((- \infty, λ_{i}])

. Thus, for any disk system M, the generalized Čech complex

C (M)

has a filtered simplicial complex structure,

C_{M} (0) \subset C (λ_{1}) \subset \dots \subset C_{M} (λ_{s}) .

Of course, when we vary the scale

λ

on a interval the above filtration contains only a finite number of different sets. Moreover, those sets only change when the Čech scale of some disk system is reached, and therefore it is enough to compute all sets corresponding to Čech scales to characterize entirely the filtration. The goal of the next sections is the construction of algorithms to numerically estimate the Čech scale of every Čech subsystem of M.

The filtered generalized Čech complex can be “approximated” by the Vietoris–Rips structure, in the following sense. The inclusion

C (M) \subset VR (M)

holds clearly; in consequence,

C_{M} (λ) \subset {VR}_{M} (λ)

for any scale

λ \geq 0

, then by Lemma 2 any Vietoris–Rips d-system

σ \in VR (M)

rescaled by a factor of

\sqrt{2 d / (d + 1)}

is also a Čech d-system:

σ_{\sqrt{2 d / (d + 1)}} \in C (M)

. Therefore, for any d-disk system M the following relation is fulfilled:

{VR}_{M} (λ^{'}) \subset C_{M} (λ) \subset {VR}_{M} (λ),

where

\sqrt{2 d / (d + 1)} \cdot λ^{'} \leq λ

.

To any disk system M, the simplicial substructure

C {(M)}^{(1)}

given by the 1-skeleton of the generalized Čech complex of M is a basic combinatorial structure (actually, a graph) that can be easily defined, it just takes the relationship into account if every two vertices are neighbors: the set of vertices is M, and there exists an edge

{D_{i}, D_{j}}

whenever

D_{i} \cap D_{j} \neq \emptyset

. The Čech-weight function restricted to

C {(M)}^{(1)}

is, in fact:

ω ({D_{i}}) = 0

to every vertice, and

ω ({D_{i}, D_{j}}) = ∥ c_{i} - c_{j} ∥ / (r_{i} + r_{j})

to any edge.

In Algorithm 1, we calculate the Čech-weight function

ω : C_{M} {(λ)}^{(d i m)} \to R

, for the dim-skeleton of a

λ

-rescaled disk system M. To do this, we assume an arbitrary linear order in the disk system M, and for every disk

D \in M

we consider the following set.

λ

-LowerNbrs

(D) = {\tilde{D} \in M ∣ \tilde{D} < D, ω ({D, \tilde{D}}) \leq λ}

.

The following algorithm (based on work in [7]), is a standard expansion algorithm for simplicial complexes, and we are including the Čech-weight function value of each simplex when it is calculated.

Algorithm 1: Čech-weight function of a d-disk system.

We conclude the section with an application of Algorithm 1 to a 2-disk system.

Example 2.

Let M be the following 2-disk system,

\begin{matrix} M = { & D_{1} ((2.99, 0.56); 1.5), D_{2} ((0.99, 0.11); 1.0), \\ D_{3} ((1.69, 1.30); 0.6), D_{4} ((1.07, 1.93); 0.4), \\ D_{5} ((1.96, 2.64); 0.8)} . \end{matrix}

The output of Algorithm 1 applied to M, with

d = 2

,

λ = 1

and

d i m = 2

gives the Čech scales indicated next to every edge and in the triangle, in Figure 2. The Čech scale of the 2-disk system

{D_{1}, D_{2}, D_{3}}

was calculated with theCech.scalescript from Algorithm 3.

4. Intersection Properties of Disk Systems

In this section, we focus on studying disk systems in the plane, i.e., 2-disk systems. As we have seen in the last section, the study of the Čech scale is a key aspect to the construction and study of filtered generalized Čech complex. In this section, we establish several intersection properties of 2-disk systems, which will lead us to be able to calculate the Čech scale.

Let

\partial D_{i} (c_{i}; r_{i}) : = {x \in R^{2} ∣ ∥ x - c_{i} ∥ = r_{i}}

be the boundary of the closed 2-dimensional disk

D_{i} (c_{i}, r_{i}) \subset R^{2}

.

Let

D_{i}

and

D_{j}

be two closed disks in the plane, such that

D_{i} \cap D_{j} \neq \emptyset

. We define

D_{i} ⊓ D_{j}

to be the unitary set

{d_{i j}}

constructed as follows.

If $\partial D_{i} \cap \partial D_{j} \neq \emptyset$ , then $d_{i j} \in \partial D_{i} \cap \partial D_{j}$ is the only one point with the property $〈 d_{i j} - c_{i}, n_{i j} 〉 \geq 0$ , where $n_{i j} = (- b, a)$ is the normal vector to $c_{j} - c_{i} = (a, b)$ ,
If $\partial D_{i} \cap \partial D_{j} = \emptyset$ , we define $d_{i j}$ as the unique intersection point in $\partial D_{i} (c_{i}; λ r_{i}) \cap \partial D_{j} (c_{j}; λ r_{j})$ , for $λ$ given as the minimal scale such that $D_{i} (c_{i}; λ r_{i}) \subset D_{j} (c_{j}; λ r_{j})$ or $D_{j} (c_{j}; λ r_{j}) \subset D_{i} (c_{i}; λ r_{i})$ , i.e., $λ = ∥ c_{i} - c_{j} ∥ / | r_{i} - r_{j} |$ .

Clearly, if

\partial D_{i} \cap \partial D_{j} = \emptyset

, then

d_{i j} = d_{j i}

. In particular, when

D_{i}

and

D_{j}

are concentric, then

D_{i} ⊓ D_{j} = {c_{i}} = {c_{j}}

. On the other hand, if the closed disks

D_{i}

and

D_{j}

are internally or externally tangent, then

d_{i j} = d_{j i}

. We can think about

d_{i j}

, when

\partial D_{i} \cap \partial D_{j}

is not empty, as the intersection point of the boundaries at the left of the vector from

c_{i}

to

c_{j}

. Figure 3 shows the above construction.

We will denote by

d_{i j} (λ)

, instead of simply

d_{i j}

, for the intersection point of the

λ

-rescaled disks

D_{i} (c_{i}; λ r_{i})

and

D_{j} (c_{j}; λ r_{j})

.

In order to study Čech systems, we give the following characterization, according the intersection points

d_{i j}

.

Lemma 3.

Let

M = {D_{1}, D_{2}, \dots, D_{m}}

be a 2-disk system. Then M is a Čech system if and only if, there exist

D_{i}, D_{j} \in M

such that

d_{i j} \in D_{i} ⊓ D_{j}

satisfies

d_{i j} \in D_{k}

for all

1 \leq k \leq m

.

Proof.

Suppose M is a Čech system. Define

A : = ⋂_{1 \leq i \leq m} D_{i} \neq \emptyset

; then, A has only one of the following geometries.

(i): $A = {c_{M}}$ ,
(ii): A is a region bounded by more than one circumference arc,
(iii): $A = D_{i_{0}}$ for some $i_{0} \in {1, \dots, m}$ .

In the first case, necessarily

c_{M}

belongs to the boundary of two or more disks. Let

D_{i}

and

D_{j}

be two disks in M such that

c_{M} \in \partial D_{i} \cap \partial D_{j}

, it follows that

c_{M} = d_{i j}

or

c_{M} = d_{j i}

, in both cases the lemma holds.

For the second case, if

a \in \partial A \subset A

belongs to the boundary and is in the intersection of two arcs, say

\partial D_{i}

and

\partial D_{j}

, then

a = d_{i j}

or

a = d_{j i}

, and it satisfy

a \in D_{k}

for every

1 \leq k \leq m

.

For the last case, if

A = D_{i_{0}}

for some

i_{0}

, then for each

j \neq i_{0}

we have

d_{i_{0} j} \in D_{i_{0}} = A

and all of these points belongs to

D_{k}

for all

D_{k} \in M

.

Therefore, in any case, there exists such point

d_{i j}

. The converse is clear by definition of a Čech system. □

This criterion was presented in ([8], Section III) for a 2-disk system.

Next, we define the map

ρ

, a key tool for the rest of this work. This map will allow us to discern the minimal scale in which a 2-disk system has the nonempty intersection property.

Definition 3.

Let

M = {D_{1}, D_{2}, \dots, D_{m}}

be a Vietoris–Rips system in the plane, with

m \geq 3

. We define

ρ (M) : = max_{1 \leq i, j \leq m} \{min_{k \neq i, j} {r_{k} - ∥ d_{i j} - c_{k} ∥}\} .

If

ν_{M}

is the Vietoris–Rips scale of M, then we define the map

ρ_{M} : [ν_{M}, \infty) \to R

,

λ \mapsto ρ_{M} (λ) = ρ (M_{λ})

.

Given three disks

D_{i} (c_{i}; r_{i})

,

D_{j} (c_{j}; r_{j})

and

D_{k} (c_{k}; r_{k})

in the 2-disk system

M = {D_{1}, \dots, D_{m}}

, with Vietoris–Rips scale

ν_{M}

, denote by

Λ_{i, j}^{k} : [ν_{M}, \infty) \to R

the map

λ \mapsto λ r_{k} - ∥ d_{i j} (λ) - c_{k} ∥

, where

d_{i j} (λ)

is the element in

D_{i} (c_{i}; λ r_{i}) ⊓ D_{j} (c_{j}; λ r_{j})

. In other words,

Λ_{i, j}^{k} (λ)

is the signed distance from the point

d_{i j} (λ)

to the set

D_{k} (c_{k}; λ r_{k})

.

If

r_{i} \neq r_{j}

, then for each

k \neq i, j

the map

λ \mapsto λ r_{k} - ∥ d_{i j} (λ) - c_{k} ∥

is defined and continuous in the closed interval

[∥ c_{i} - c_{j} ∥ / (r_{i} + r_{j}), ∥ c_{i} - c_{j} ∥ / | r_{i} - r_{j} |]

, as it is the signed distance from an intersection point of two continuously deforming curves (therefore its position vary continuously as long as the intersection exists) to the continuously deforming set

D_{k} (c_{k}; λ r_{k})

with respect to

λ

. Also, the map

λ \mapsto λ r_{k} - ∥ d_{i j} (λ) - c_{k} ∥

vary linearly in the range

[∥ c_{i} - c_{j} ∥ / | r_{i} - r_{j} |, \infty)

because for

λ \geq ∥ c_{i} - c_{j} ∥ / | r_{i} - r_{j} |

, the term

∥ d_{i j} (λ) - c_{k} ∥

remains constant. The left picture in Figure 4 shows in bold red color the geometric place of

{d_{i j} (λ), d_{j i} (λ)}

, which vary continuously respect to the parameter

λ

and also the distance from it to the fix point

c_{k}

.

On the other hand, for

r_{i} = r_{j}

, the points

{d_{i j} (λ), d_{j i} (λ)}

vary continuously respect to

λ

on the line showed in the right picture of Figure 4. Therefore,

λ r_{k} - ∥ d_{i j} (λ) - c_{k} ∥

also depend continuously of

λ

.

From the above argument, each map

Λ_{i, j}^{k}

is continuous in the interval

[ν_{M}, \infty)

, and by the continuity of the min-max functions and that

ρ_{M} (λ) = max_{1 \leq i, j \leq m} \{min_{k \neq i, j} Λ_{i, j}^{k} (λ)\},

it follows that

ρ_{M}

is also a continuous map in the interval

[ν_{M}, \infty)

. However, the map

ρ_{M}

is not differentiable, in general, to every point in such interval.

The map

ρ_{M}

plays a key role in the rest of this work. We present the next characterization of Čech systems in terms of

ρ_{M}

.

Lemma 4.

Let M be a 2-disk system. Then

M_{λ}

is a Čech system if and only if,

ρ_{M} (λ) \geq 0

. In particular

ρ_{M} (\sqrt{4 / 3} ν_{M}) \geq 0

.

Proof.

By Lemma 3,

M_{λ}

is a Čech system in the plane if and only if, there exists

d_{i j} (λ)

such that

d_{i j} (λ) \in D_{k} (c_{k}; λ r_{k})

for every

k \neq i, j

, i.e.,

Λ_{i, j}^{k} (λ) \geq 0

for every

k \neq i, j

, which is equivalent to

ρ_{M} (λ) \geq 0

.

On the other hand, from Corollary 1 and taking

d = 2

, the rescaled system

M_{\sqrt{4 / 3} ν_{M}}

is a Čech system, then by the first assertion,

ρ_{M} (\sqrt{4 / 3} ν_{M}) \geq 0

. □

5. The `Cech.scale` Algorithm

Our main algorithm (Algorithm 3) computes the Čech scale of a given 2-disk system M. The key aspect on which this algorithm is based, is precisely the function

ρ_{M}

. Before we describe the algorithm, we need to analyze additional properties of

ρ_{M}

.

It follows immediately from Lemma 4 that

ρ_{M} (λ) \geq 0

for every

λ \geq μ_{M}

. Also, if at the Vietoris–Rips scale it holds that

ρ_{M} (ν_{M}) \geq 0

, then

μ_{M} = ν_{M}

by the minimality of the Čech scale. We conclude that in this case (this is,

ρ_{M} (ν_{M}) \geq 0

), the Čech scale is easily computable.

On the other hand, if

ρ_{M} (ν_{M}) < 0

then the Čech scale satisfies

μ_{M} \in (ν_{M}, \sqrt{4 / 3} ν_{M}]

and moreover

ρ_{M} (μ_{M}) = 0

. This is a consequence of the continuity of

ρ_{M}

, and the fact that

ρ_{M} (λ) < 0

for every

ν_{M} \leq λ < μ_{M}

and

ρ_{M} (λ) \geq 0

for

μ_{M} \leq λ

. Thus, to find the Čech scale of a 2-disk system M for which

ρ_{M} (ν_{M}) < 0

, we need to solve the equation

ρ_{M} (λ) = 0

.

We propose a numerical approach to solve the equation

ρ_{M} (λ) = 0

and calculate the Čech scale under the hypothesis

ρ_{M} (ν_{M}) < 0

, as in this case, we actually know that

μ_{M} \in (ν_{M}, \sqrt{4 / 3} ν_{M}]

(see Section 2) as consequence of the generalized Vietoris–Rips Lemma. We chose the bisection method for this purpose. We will denote the implementation of bisection method for the map

ρ_{M}

through the interval

[a, b]

, by bisection

(ρ_{M}, a, b)

. The output of bisection

(ρ_{M}, a, b)

is a real number

λ \in [a, b]

such that

ρ_{M} (λ) = 0

. For the numerical method we are working with a precision of

10^{- 12}

.

It is important to mention that the numerical method regula falsi was also used instead of the numerical method of bisection, in order to calculating the Čech scale. However, in our context, the efficiency of the program using the regula falsi numerical method is not better than if the numerical method of bisection is used.

The Algorithm 2 (below) has as input a 2-disk system M, and produces as output the Čech scale

μ_{M}

as well as the intersection point

{c_{M}} = ⋂_{D_{i} \in M} D_{i} (c_{i}; μ_{M} r_{i})

. This algorithm takes a naive approach to calculate the Čech scale, and is established to completeness and to be a reference for the principal algorithm (Algorithm 3).

Algorithm 2: The Čech scale calculation for a 2-disk system.

The following lemma claims that the Algorithm 2 is consistent.

Lemma 5.

For any 2-disk system M, the Algorithm 2 has as output the Čech scale

μ_{M}

of M, and the unique intersection point

{c_{M}} = ⋂_{D_{i} \in M} D_{i} (c_{i}; μ_{M} r_{i})

.

Proof.

In the case

ρ_{M} (ν_{M}) \geq 0

, it is clear that the algorithm has generated asseverated data (steps (2)–(3)). In otherwise, for the case

ρ_{M} (ν_{M}) < 0

, we assign

μ^{*} : = \sqrt{4 / 3} ν_{M}

.

Then,

ρ_{M} (ν_{M}) \cdot ρ_{M} (μ^{*}) \leq 0

and lets call again

μ^{*}

the output root in step (7). To check if

μ^{*}

is the Čech scale we are looking for, we calculate in step (8) the set of pairwise intersection points of the

μ^{*}

-rescaled system, contents in

⋂_{D_{i} \in M} D_{i} (c_{i}; μ^{*} r_{i})

.

If the set

⊓ M_{μ^{*}}

is unitary, then necessarily

⋂_{D_{i} \in M} D_{i} (c_{i}; μ^{*} r_{i})

is unitary, due the geometry of its boundary (see proof of Lemma 3). In such case (negative validation of step (9)) the steps (10)–(13) are omitted and, from Lemma 1, the algorithm returns the Čech scale as well as the intersection point

⊓ M_{μ^{*}}

at step (14); in otherwise (positive validation of step (9)), the root

μ^{*}

is not the Čech scale (see Figure 5), and then we should find another scale

μ^{'} \in (ν_{M}, μ^{*})

such that

ρ_{M} (μ^{'}) > 0

, and repeat from step (7). It is possible, for some configurations of the 2-disk system, that the map

ρ_{M}

has a behavior as in Figure 5.

The last iterative part is a finite process because

ρ_{M}

is algebraic over

Q

, then eventually the set

⊓ M_{μ^{*}}

will be unitary and the Čech scale will be calculated. □

The step (8) in Algorithm 2 is necessary, as show the following example, in which the map

ρ_{M}

has another root along side the Čech scale in the interval

[ν_{M}, \sqrt{4 / 3} ν_{M}]

.

Example 3.

Let

M = {D_{1}, D_{2}, D_{3}}

be the 2-disk system in Figure 6. A direct calculation, yields that

ν_{M} = μ_{M} = 0.8947

. On the other hand, we also have that

ρ_{M} (ν_{M}) = ρ_{M} (μ_{M}) = ρ_{M} (λ) = 0

for

λ = 1

. Therefore, the map

ρ_{M}

has more than one root on the interval

[ν_{M}, \sqrt{4 / 3} ν_{M}] = [0.8947, 1.0331]

.

In Example 3, the Vietoris–Rips scale

ν_{M}

, of the 2-disk system M, agrees with the Čech scale

μ_{M}

; however is possible to construct more sophisticated (and symmetric) disk system M such that

ν_{M} < μ_{M}

and

ρ_{M} (ν_{M}) < 0 < ρ_{M} (μ_{M})

, for which there exists

μ^{'} \in (ν, \sqrt{4 / 3} ν_{M})

with

ρ_{M} (μ^{'}) = 0

.

On the other hand, if the 2-disk system M consists of just three disks and

ρ_{M} (ν_{M}) < 0

, then its Čech scale can be computed with only one application of the numerical method, as we asseverate in the following lemma.

Lemma 6.

Let

M = {D_{1}, D_{2}, D_{3}}

be a 2-disk system such that

ρ_{M} (ν_{M}) < 0

. Then, there exists a unique root of the map

ρ_{M}

in

[ν_{M}, \sqrt{4 / 3} ν_{M}]

. Thus,

μ_{M}

will be the output ofbisection

(ρ_{M}, ν_{M}, \sqrt{4 / 3} ν_{M})

.

Proof.

It is straightforward to verify that

ρ_{M} (ν_{M}) \geq 0

for any configuration with

c_{1}

,

c_{2}

and

c_{3}

collinear. Thus, it follows

{c_{1}, c_{2}, c_{3}}

is in general position.

Let

μ_{M}

be the Čech scale of the 2-disk system M and

c_{M}

the intersection point. Define

A (λ) : = \cap_{i = 1}^{3} D_{i} (c_{i}; λ r_{i})

. Note that

A (μ_{M}) = {c_{M}}

.

We claim that there exist at least two distinct points p and q, in the set

{d_{i j} (λ) \in D_{i} (c_{i}; λ r_{i}) ⊓ D_{j} (c_{j}; λ r_{j}) ∣ D_{i}, D_{j} \in M_{λ}} \subset \partial A (λ) .

This is evident if

\partial A (λ)

is given by two or more circumference arcs. On the other hand, if

\partial A (λ) = \partial D_{i} (c_{i}; λ r_{i})

for some

1 \leq i \leq 3

, then

d_{i j} (λ), d_{i k} (λ) \in D_{i} (c_{i}; λ r_{i})

for

j \neq i

and

k \neq i

. Moreover,

d_{i j} (λ) \neq d_{i k} (λ)

since

{c_{1}, c_{2}, c_{3}}

is not a collinear set.

If both points p and q belongs to each boundary of the three disks,

p, q \in \cap_{i = 1}^{3} \partial D_{i} (c_{i}; λ r_{i})

, then

{c_{1}, c_{2}, c_{3}}

would be also a collinear set. Without loss of generality, we suppose that

p = d_{23} (λ) \notin \partial D_{1} (c_{1}; λ r_{1})

. Then,

ρ_{M} (λ) \geq λ r_{1} - ∥ d_{23} (λ) - c_{1} ∥ > 0

, and the lemma follows. Of course, the choose of the indexes depend of the value of

λ

, but the above arguments show that always there exist such combination which guarantee that

ρ_{M} (λ)

is positive for

λ > μ_{M}

. □

The following algorithm takes advantage of the unicity property for the root of

ρ_{M}

, in a 2-disk system with three disks. Essentially, the algorithm consist in iterating the Algorithm 2 systematically over every triplet of disks from M.

Algorithm 3:Cech.scale.

Theorem 1.

For any 2-disk system M, the Algorithm 3 has as output the Čech scale

μ_{M}

of M, and the unique intersection point

{c_{M}} = ⋂_{D_{i} \in M} D_{i} (c_{i}; μ_{M} r_{i})

.

Proof.

If

ρ_{M} (ν_{M}) \geq 0

, the algorithm returns the right data: steps (1)–(5).

On the other hand, by Helly’s Theorem (cf. [20]) the 2-disk system

M = {D_{1}, \dots, D_{m}}

, as a finite family of convex sets in the plane, has a nonempty intersection

⋂_{D_{i} \in M} D_{i}

if, and only if,

D_{i} \cap D_{j} \cap D_{k} \neq \emptyset

for every triplet

1 \leq i < j < k \leq m

. Let

μ

be the maximal Čech scale over every triplet in the disk system M, i.e.,

μ = max {μ_{N} ∣ N = {D_{i}, D_{j}, D_{k}} \subset M} .

It follows that every

μ

-rescaled triplet has a nonempty intersection. Therefore, the

μ

-rescaled 2-disk system

M_{μ}

also has the nonempty intersection property. Moreover,

⋂_{D_{i} \in M} D_{i} (c_{i}; μ r_{i}) \subset ⋂_{D_{i} \in N \subset M} D_{i} (c_{i}; μ r_{i})

for every triplet

N \subset M

. Therefore,

μ

is actually the Čech scale of the 2-disk system M, this is,

μ_{M} = μ

.

In steps (6)–(19), the algorithm search the scale

μ

systematically, over every triplet

{D_{i}, D_{j}, D_{k}} \subset M

, updating the maximal scale found if necessary in steps (15)–(17). By Lemma 6, every Čech scale calculation over any triplet, requires just one application of the bisection method. This implies the correctness of the algorithm.

Additionally, the condition in step (9) avoids calculating unnecessary Čech scales of triplets

N = {D_{i}, D_{j}, D_{k}}

. In effect, if

λ^{*}

is the maximal Čech scale found until the verification of the triplet N, and the condition in step (9) does not satisfy, i.e.,

μ_{N} \leq \sqrt{4 / 3} ν_{M} < λ^{*},

then, whatever is the Čech scale of N, it would be not greater than

λ^{*}

. □

The computational evidence to support the Algorithm 3 is more efficient than Algorithm 2, is given in Figure 7. The graphic shows the average time (in seconds) to computation of both algorithms, with respect to the number of disks in a randomly generated 2-disk system (see Remark 1).

An Example: The Miniball Problem

The miniball problem or smallest-circle problem in the euclidean space is a classical problem, proposed by James J. Sylvester in 1857.

Given a finite point cloud

N \subset R^{d}

, the miniball problem consists in finding the center

c \in R^{d}

and minimum radius

r \in R^{+}

of a d-disk

D = D (c; r) \subset R^{d}

such that

N \subset D

.

There exist many different approaches to solve this problem, and a variety of algorithms to reach the miniball data (e.g., [21,22]). In fact, the Čech scale has a close relation with the miniball problem, as we establish in the next lemma.

Lemma 7.

Let N be a finite point cloud in

R^{d}

, and let

N_{1}

be the associated d-disk system defined by

N_{1} : = {D_{i} (c_{i}; 1) \subset R^{d} ∣ c_{i} \in N} .

Then, the Čech scale

μ_{N_{1}}

is the radius of the minimal enclosing ball of N, and the intersection point

{c_{N_{1}}} = ⋂_{c_{i} \in N} D_{i} (c_{i}; μ_{N_{1}})

its center.

Proof.

Let

μ_{N_{1}}

be the Čech scale of the disk system

N_{1}

, and let

c_{N_{1}}

be the intersection point of the

μ_{M}

-rescaled disk system. Then, the point

c_{N_{1}}

belongs to every disk

D_{i} (c_{i}; μ_{N_{1}} \cdot 1)

, i.e.,

∥ c_{i} - c_{N_{1}} ∥ \leq μ_{N_{1}}

for any

c_{i} \in N

; thus,

N \subset D (c_{N_{1}}; μ_{N_{1}})

. On the other hand, by definition of Čech scale,

μ_{N_{1}}

is the minimal radius (scale) with such property. Therefore, by uniqueness,

D (c_{N_{1}}; μ_{N_{1}})

must be the minimal ball enclosing the point cloud N. □

In particular, for a point cloud N in the plane we can apply our algorithm Cech.scale (Algorithm 3) to the 2-disk system

N_{1}

, and get the minimal enclosing ball of N. However, the Čech scale of an arbitrary 2-disk system

M = {D_{1} (c_{1}; r_{1}), \dots, D_{m} (c_{m}; r_{m})}

cannot be obtained from the minimal enclosing ball data of the point cloud

M_{0} = {c_{1}, \dots, c_{m}}

.

In Figure 8, we show a point cloud N (black dots) and the 2-disk system

N_{1}

(blue circles). Applying the Cech.scale script to

N_{1}

we get the Čech scale

μ_{N_{1}}

(radio of the red circle) and the point

c_{N_{1}}

(red point, center of the red circle).

For the miniball problem there are many efficient algorithms available online, which are easy to find. For example, the C++ script in [23] can compute the miniball for point clouds in any dimension (efficiently up to dimension 10,000). Such algorithms are not comparable with the Cech.scale algorithm because if only disk systems with equal (and unitary) radii were considered, several issues that were addressed in the case of different radii would be avoided.

6. The Algorithm `Cech.scale` for Higher-Dimensional Disk Systems

It is not clear how to generalize the Algorithm 3 to determine the Čech scale of a disk system in

R^{d}

with

d > 2

. However, it is possible to calculate the Čech scale if the d-disk system consists of only three disks. This makes it possible to calculate the 2-skeleton associated with a d-disk system in an arbitrary dimension.

The relevance of this application lies in the possibility of calculating the 2-dimensional filtered simplicial Čech structure of a disk system immersed in a high-dimensional euclidean space. Many applications in topological data analysis concerns to the study of low-dimensional topological features associated to a point data cloud immersed in a high-dimensional representation space.

The key observation is that

⋂_{i = 1}^{3} D (c_{i}; r_{i}) \neq \emptyset \Leftrightarrow ⋂_{i = 1}^{3} (D (c_{i}; r_{i}) \cap P) \neq \emptyset,

where P is the affine plane generated by the set

{c_{1}, c_{2}, c_{3}}

.

Thus, the problem of determining whether

⋂_{i = 1}^{3} (D (c_{i}; r_{i}) \cap P)

is empty or not, can be treated as one in the plane, constructing a disk system in

R^{2}

that preserves the affine configuration of the points

{c_{1}, c_{2}, c_{3}}

in the affine space

P \subset R^{d}

. To do this, we set the first center

c_{1}

as the origin in

R^{2}

, and “translate” the others centers preserving their original configuration, taking care of moving the second center on the x-axis, as in Figure 9.

More precisely, to any d-disk system with three elements, say

M = {D_{1} (c_{1}; r_{1}), D_{2} (c_{2}; r_{2}), D_{3} (c_{3}; r_{3})}

, we associate the following 2-disk system, which clearly preserves the affine configuration of the original centers

{c_{1}, c_{2}, c_{3}}

,

Aff (M) = {D_{1} ({\tilde{c}}_{1}; r_{1}), D_{2} ({\tilde{c}}_{2}; r_{2}), D_{3} ({\tilde{c}}_{3}; r_{3})}

where

{\tilde{c}}_{1} : = (0, 0)

,

{\tilde{c}}_{2} : = (∥ c_{2} - c_{1} ∥, 0)

,

{\tilde{c}}_{3} : = (∥ c_{3} - c_{1} ∥ cos (θ), ∥ c_{3} - c_{1} ∥ sin (θ))

, where

θ

is the angle between the vectors

c_{2} - c_{1}

and

c_{3} - c_{1}

, which satisfies the following relationship:

cos θ = \frac{〈 c_{2} - c_{1}, c_{3} - c_{1} 〉}{∥ c_{2} - c_{1} ∥ \cdot ∥ c_{3} - c_{1} ∥} .

The Algorithm 4 is a variant of Algorithm 1, taking as input a d-disk system in

R^{d}

and a non-negative parameter

λ

, and as output the Čech weight function of the 2-skeleton of the generalized Čech complex structure. The algorithm first preprocess each triplet of d-disks as a 2-disk system, then the Čech scale is calculated.

Algorithm 4: 2-skeletal Čech-weight function.

Figure 10 shows the performance (in

10^{- 6}

s) of the C/C++ script Cech.scale (available in [24]) and the preprocessing of the d-disk system to a 2-disk system.

Remark 1.

All our timings were done on a 64-bit GNU/Linux machine with two Intel Xeon processors (3.40 GHz), although our script were not threaded and only one core was used per process. We measured all the timings withclock()from the Standard C library. The average times in both graphics (Figure 7 and Figure 10) are the mean times for

10^{4}

repetitions of each algorithm, for every number of disks multiple of 10 in Figure 7 from 10 to 500, and for every dimension multiple of 200 in Figure 10 from 200 to 10,000.

Author Contributions

Conceptualization, J.F.E., R.H.-A. and B.R.-V.; methodology, J.F.E., R.H.-A. and B.R.-V.; software, J.F.E., R.H.-A., H.A.H.-H. and B.R.-V.; validation, J.F.E., H.A.H.-H. and B.R.-V.; formal analysis, J.F.E., R.H.-A., H.A.H.-H. and B.R.-V.; investigation, J.F.E., R.H.-A., H.A.H.-H. and B.R.-V.; data curation, J.F.E. and B.R.-V.; writing-original draft preparation, J.F.E., R.H.-A., H.A.H.-H. and B.R.-V.; writing-review and editing, J.F.E., R.H.-A., H.A.H.-H. and B.R.-V.; visualization, J.F.E. and B.R.-V.; supervision, J.F.E.; project administration, J.F.E.; funding acquisition, J.F.E. All authors have read and agreed to the published version of the manuscript.

Funding

This research received funding of the project “Métodos de Topología Combinatoria en el Análisis de Datos” supported by PRODEP.

Acknowledgments

The author Jesús F. Espinoza acknowledges the financial support of PRODEP and of the Universidad de Sonora, as well as the ACARUS (High Performance Computing Area) for the support in the access to the clusters.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Carlsson, G. Topology and data. Bull. Am. Math. Soc. 2009, 46, 255–308. [Google Scholar] [CrossRef]
Carlsson, G. Topological pattern recognition for point cloud data. Acta Numer. 2014, 23, 289–368. [Google Scholar] [CrossRef]
Ghrist, R. Barcodes: The persistent topology of data. Bull. Am. Math. Soc. 2008, 45, 61–75. [Google Scholar] [CrossRef]
Lum, P.Y.; Singh, G.; Lehman, A.; Ishkanov, T.; Vejdemo-Johansson, M.; Alagappan, M.; Carlsson, J.; Carlsson, G. Extracting insights from the shape of complex data using topology. Sci. Rep. 2013, 3. [Google Scholar] [CrossRef] [PubMed]
Zomorodian, A.; Carlsson, G. Computing persistent homology. Discret. Comput. Geom. 2005, 33, 249–274. [Google Scholar] [CrossRef]
Edelsbrunner, H.; Harer, J. Computational Topology: An Introduction; American Mathematical Society: Providence, RI, USA, 2010. [Google Scholar]
Zomorodian, A. Fast construction of the Vietoris-Rips complex. Comput. Graph. 2010, 34, 263–271. [Google Scholar] [CrossRef]
Le, N.K.; Martins, P.; Decreusefond, L.; Vergne, A. Construction of the Generalized Čech Complex. In Proceedings of the 2015 IEEE 81st Vehicular Technology Conference (VTC Spring), Glasgow, UK, 11–14 May 2015; pp. 1–5. [Google Scholar] [CrossRef]
Dantchev, S.; Ivrissimtzis, I. Efficient construction of the Čech complex. Comput. Graph. 2012, 36, 708–713. [Google Scholar] [CrossRef]
Morozov, D. Dionysus 2. 2019. Available online: http://mrzv.org/software/dionysus2/ (accessed on 23 October 2019).
Otter, N.; Porter, M.A.; Tillmann, U.; Grindrod, P.; Harrington, H.A. A roadmap for the computation of persistent homology. EPJ Data Sci. 2017, 6, 17. [Google Scholar] [CrossRef]
Kerber, M.; Sharathkumar, R. Approximate Čech complex in low and high dimensions. In Algorithms and Computation; Cai, L., Cheng, S.W., Lam, T.W., Eds.; Springer: Berlin/Heidelberg, Germany, 2013; pp. 666–676. [Google Scholar]
Bendich, P.; Marron, J.S.; Miller, E.; Pieloch, A.; Skwerer, S. Persistent homology analysis of brain artery trees. Ann. Appl. Stat. 2016, 10, 198–218. [Google Scholar] [CrossRef] [PubMed]
Goldfarb, D. An application of topological data analysis to hockey analytics. arXiv 2014, arXiv:1409.7635v1. [Google Scholar]
Pokorny, F.T.; Hawasly, M.; Ramamoorthy, S. Topological trajectory classification with filtrations of simplicial complexes and persistent homology. Int. J. Robot. Res. 2016, 35, 204–223. [Google Scholar] [CrossRef]
Robins, V.; Turner, K. Principal component analysis of persistent homology rank functions with case studies of spatial point patterns, sphere packing and colloids. Phys. D Nonlinear Phenom. 2016, 334, 99–117. [Google Scholar] [CrossRef]
deSilva, V.; Ghrist, R. Coverage in sensor networks via persistent homology. Algebr. Geom. Topol. 2007, 7, 339–358. [Google Scholar] [CrossRef]
Bell, G.; Lawson, A.; Martin, J.; Rudzinski, J.; Smyth, C. Weighted persistent homology. arXiv 2017, arXiv:1709.00097v1. [Google Scholar] [CrossRef]
Jung, H. Ueber die kleinste Kugel, die eine räumliche Figur einschliesst. J. Reine Angew. Math. 1901, 123, 241–257. [Google Scholar] [CrossRef]
Bollobas, B.; Bollobás, B. The Art of Mathematics: Coffee Time in Memphis; Cambridge University Press: Cambridge, UK, 2006. [Google Scholar]
Fischer, K.; Gärtner, B.; Kutz, M. Fast smallest-enclosing-ball computation in high dimensions. In Algorithms—ESA 2003; Di Battista, G., Zwick, U., Eds.; Lecture Notes in Computer Science; Springer: Berlin/Heidelberg, Germany, 2003; Volume 2832, pp. 630–641. [Google Scholar] [CrossRef]
Welzl, E. Smallest enclosing disks (balls and ellipsoids). In New Results and New Trends in Computer Science; Maurer, H., Ed.; Springer: Berlin/Heidelberg, Germany, 1991; pp. 359–370. [Google Scholar]
Fischer, K.; Gärtner, B.; Kutz, M. Miniball (GitHub Repository). 2018. Available online: https://github.com/hbf/miniball/ (accessed on 23 October 2019).
Research Group in Geometric and Combinatorial Structures. Mathematics Department, University of Sonora—México. 2018. Available online: www.gcs.mat.uson.mx (accessed on 23 October 2019).

Figure 1. Vietoris–Rips and Čech systems in the plane.

Figure 2. The Čech-weight function of the 2-disk system M.

Figure 3. Intersection point

d_{i j}

.

Figure 3. Intersection point

d_{i j}

.

Figure 4. Geometric place of

d_{i j} (λ)

.

Figure 4. Geometric place of

d_{i j} (λ)

.

Figure 5. Plot of the map

ρ : [ν_{M}, \infty) \to R

.

Figure 5. Plot of the map

ρ : [ν_{M}, \infty) \to R

.

Figure 6. The 2-disk system M.

Figure 7. Average time of the script Cech.scale.

Figure 8. The miniball of the point cloud N.

Figure 9. Affin configuration of the d-disk system.

Figure 10. Average time (

μ

s) in high dimensions of the Cech.scale script and preprocessing disk systems.

Figure 10. Average time (

μ

s) in high dimensions of the Cech.scale script and preprocessing disk systems.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Espinoza, J.F.; Hernández-Amador, R.; Hernández-Hernández, H.A.; Ramonetti-Valencia, B. A Numerical Approach for the Filtered Generalized Čech Complex. Algorithms 2020, 13, 11. https://doi.org/10.3390/a13010011

AMA Style

Espinoza JF, Hernández-Amador R, Hernández-Hernández HA, Ramonetti-Valencia B. A Numerical Approach for the Filtered Generalized Čech Complex. Algorithms. 2020; 13(1):11. https://doi.org/10.3390/a13010011

Chicago/Turabian Style

Espinoza, Jesús F., Rosalía Hernández-Amador, Héctor A. Hernández-Hernández, and Beatriz Ramonetti-Valencia. 2020. "A Numerical Approach for the Filtered Generalized Čech Complex" Algorithms 13, no. 1: 11. https://doi.org/10.3390/a13010011

APA Style

Espinoza, J. F., Hernández-Amador, R., Hernández-Hernández, H. A., & Ramonetti-Valencia, B. (2020). A Numerical Approach for the Filtered Generalized Čech Complex. Algorithms, 13(1), 11. https://doi.org/10.3390/a13010011

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Numerical Approach for the Filtered Generalized Čech Complex

Abstract

1. Introduction

2. Vietoris–Rips and Čech systems

3. Filtered Generalized Simplicial Structures for Disk Systems

4. Intersection Properties of Disk Systems

5. The `Cech.scale` Algorithm

An Example: The Miniball Problem

6. The Algorithm `Cech.scale` for Higher-Dimensional Disk Systems

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Numerical Approach for the Filtered Generalized Čech Complex

Abstract

1. Introduction

2. Vietoris–Rips and Čech systems

3. Filtered Generalized Simplicial Structures for Disk Systems

4. Intersection Properties of Disk Systems

5. The Cech.scale Algorithm

An Example: The Miniball Problem

6. The Algorithm Cech.scale for Higher-Dimensional Disk Systems

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

5. The `Cech.scale` Algorithm

6. The Algorithm `Cech.scale` for Higher-Dimensional Disk Systems