A New Geometric Metric in the Shape and Size Space of Curves in 
          
            
              
                R
                n

Epifanio, Irene; Gimeno, Vicent; Gual-Arnau, Ximo; Ibáñez-Gual, M. Victoria

doi:10.3390/math8101691

Open AccessArticle

A New Geometric Metric in the Shape and Size Space of Curves in R n

¹

Department of Mathematics-IF, Universitat Jaume I, 12071 Castelló, Spain

²

Department of Mathematics-IMAC, Universitat Jaume I, 12071 Castelló, Spain

³

Department of Mathematics-INIT, Universitat Jaume I, 12071 Castelló, Spain

^*

Author to whom correspondence should be addressed.

^†

All authors contributed equally to this work.

Mathematics 2020, 8(10), 1691; https://doi.org/10.3390/math8101691

Submission received: 31 August 2020 / Revised: 21 September 2020 / Accepted: 23 September 2020 / Published: 1 October 2020

Download

Browse Figures

Versions Notes

Abstract

:

Shape analysis of curves in

R^{n}

is an active research topic in computer vision. While shape itself is important in many applications, there is also a need to study shape in conjunction with other features, such as scale and orientation. The combination of these features, shape, orientation and scale (size), gives different geometrical spaces. In this work, we define a new metric in the shape and size space,

S_{2}

, which allows us to decompose

S_{2}

into a product space consisting of two components:

S_{4} \times R

, where

S_{4}

is the shape space. This new metric will be associated with a distance function, which will clearly distinguish the contribution that the difference in shape and the difference in size of the elements considered makes to the distance in

S_{2}

, unlike the previous proposals. The performance of this metric is checked on a simulated data set, where our proposal performs better than other alternatives and shows its advantages, such as its invariance to changes of scale. Finally, we propose a procedure to detect outlier contours in

S_{2}

considering the square-root velocity function (SRVF) representation. For the first time, this problem has been addressed with nearest-neighbor techniques. Our proposal is applied to a novel data set of foot contours. Foot outliers can help shoe designers improve their designs.

Keywords:

shape space; square-root velocity function (SRVF); outliers

1. Introduction

Shape analysis of curves in

R^{n}

, where

n \geq 2

, is an important branch in many applications, including computer vision and medical imaging. Using the landmark representation of objects, Dryden et al. [1] studied the joint shape and size features of objects. However, an over-abundance of digital data, especially image data, is prompting the need for a different kind of shape analysis. In particular, the representation of shapes as elements of infinite-dimensional Riemannian manifolds with a given metric is of interest at this time and has important applications [2,3]. More recently, Srivastava et al. [4] presented a special representation of curves, called the square-root velocity function, or SRVF, under which a specific elastic metric becomes an

L^{2}

metric and simplifies the shape analysis. This approach was analyzed by Kurtek et al. [5] on different scenarios, corresponding to different combinations of physical properties of the curves: shape, size, location and orientation. When the metric used in these infinite-dimensional spaces is invariant with respect to scaling, translation, rotation and reparameterization, the Riemannian manifold that represents the space of curves is known as the shape space, and following the notation of [5] will be denoted as

S_{4}

. However, in many of the applications, other features such as orientation or size (scale) also play important roles and need to be incorporated into the underlying framework. A prime example is medical imaging, where the size of the anatomical structure of interest can provide important diagnostic information. In the case where the curve length (size) is considered, the feature space is called the shape and size (or shape and scale) space and will be denoted as

S_{2}

[5]. Other spaces denoted as

S_{1}

and

S_{3}

consider also changes in the orientations of the curves [5]. In this work, we will focus on the shape space

S_{4}

and the shape and size space

S_{2}

, which are in general, completely different infinite-dimensional Riemannian manifolds. Curves in

S_{2}

are different elements of the space if their shape or scale are different. Curves in

S_{4}

are different elements of the space if their shape is different.

It seems natural, however, that the distance between two curves in space

S_{2}

should be related to the distance of these same curves in space

S_{4}

. We can thereby discern whether the distance between the curves in space

S_{2}

is due, to a greater extent, to their difference in size or to their difference in shape.

In this sense, in [6], the Sobolev-type metric given in [3] for the shape space of planar closed curves is extended to the space of all planar closed curves where the metric considered exhibits a decomposition of the space of closed planar curves into a product space consisting of three components; that is, centroid translations, scale changes and curves in the shape space.

In this approach, we will consider representations of curves in

R^{n}

from square-root velocity functions (SRVF). Using these representations, we will consider two feature spaces studied in [5]: the shape space

S_{4}

and the shape and size space

S_{2}

. The metric in

S_{4}

will be the same as in [5]; however, we propose a new metric in

S_{2}

, which is completely different to the metric considered in [5]. This metric enjoys the property that

S_{2}

can be decomposed into a product space consisting of two components:

S_{4} \times R

, where the second space is related to the length (size) of the curve.

The outline of the paper is as follows: In Section 2, we review the SRVF representation of curves and the standard elastic metrics. In Section 3, we introduce the new metric in

S_{2}

. The mean shape and geodesics with this new metric are introduced in Section 4 and Section 5, respectively. A comparison of the proposed metric and the standard elastic metric is carried out in Section 6 in a controlled setting with simulated curves, where we show the advantages of our proposal. We propose a procedure to detect outlier contours in

S_{2}

considering the SRVF representation. To the best of our knowledge, this is the first time this problem has been addressed with nearest-neighbor (NN) techniques. This is introduced in Section 7. Not only that, but so far outlier detection (in the multivariate context) in Anthropometry has only been used as a cleaning technique, for correcting or removing the outliers before analyzing data in the multivariate context [7,8]. However, outliers report very valuable information in the footwear design process: outliers can indicate which kinds of feet are more different from the rest and could therefore cause fitting problems in footwear if the design is not appropriate. In Section 8, our proposal is applied to a novel data set of foot contours. Finally, Section 9 contains the conclusions.

The code and data for reproducing the results are available at http://www3.uji.es/~epifanio/RESEARCH/metric.zip.

2. Classical Spaces of Curves in $R^{n}$ for the SRVF Representation

In this section, we review some results from [9]. In particular, we consider the SRVF representation of curves in

R^{n}

and we summarize the main results for the shape space,

S_{4}

, and for the shape and size space,

S_{2}

, with the standard elastic metrics.

Let

β : [0, 1] ⟶ R^{n}

be a parameterized curve that is absolutely continuous on

[0, 1]

. The square-root velocity (SRVF) of

β

is defined as the function

q : [0, 1] ⟶ R^{n}

given by:

q (t) = \frac{β^{'} (t)}{\sqrt{| β^{'} (t) |}} .

(1)

As it can be seen in [9], this representation exists even where

| β^{'} (t) | = 0

.

For every

q \in L^{2} ([0, 1], R^{n})

, there is a curve

β

(unique up to translation) such that the given q is the SRVF function of that

β

. In fact,

β (t) = \int_{0}^{t} q (s) | q (s) | d s .

(2)

If a curve

β

is of length one, then

\int_{0}^{1} {| q (t) |}^{2} d t = 1

. Furthermore, the hypersphere

C^{0} = \{q : [0, 1] ⟶ R^{n} | \int_{0}^{1} {| q (t) |}^{2} d t = 1\},

(3)

is a Hilbert manifold.

One way to study the shape and size (scale) space of open curves is to consider as a pre-shape space

L^{2} = L^{2} ([0, 1], R^{n})

with the usual inner product. To take care of the rotation and reparameterization of the curve

β

, we remember that a rotation is an element of

S O (n)

, the special orthogonal group of

n \times n

matrices; and a reparameterization is an element of

Γ = {γ : [0, 1] \to [0, 1] | : γ (0) = 0, γ (1) = 1, γ

is a diffeomorphism}.

The action of a reparameterization

γ \in Γ

transforms the curve

β : [0, 1] \to R^{n}

to the curve

t \mapsto β (γ (t))

. Hence, by the definition of the SRVF of a curve, we define the action of

γ \in Γ

in

C_{0}

by

γ (q (t)) : = q (γ (t)) \sqrt{γ^{'} (t)} .

Likewise, the action of

O \in S O (n)

on

q \in C^{0}

is just

O (q) (t) = O (q (t)) .

We shall denote the combined action

(O, γ) q (t) : = O (q (γ (t))) \sqrt{γ^{'} (t)}, O \in S O (n), γ \in Γ .

The orbit of a function

q \in L^{2}

is

[q] = \{(O, γ) (q) | (O, γ) \in S O (n) \times Γ\} .

(4)

If we consider the metric in

L^{2}

given by the usual inner product

{〈 v_{1}, v_{2} 〉}_{L^{2}} = \int_{0}^{1} 〈 v_{1} (t), v_{2} (t) 〉 d t,

(5)

the feature space of interest is:

S_{2} = \{[q] / q \in L^{2}\},

(6)

and the distance in

S_{2}

is:

d_{2} ([q_{1}], [q_{2}]) = inf_{O \in S O (n), γ \in Γ} {| q_{1} - (O, γ) (q_{2}) |}_{L^{2}} .

(7)

Then, the geodesic between

q_{1}

and the optimal reparameterization of

q_{2}

, which is denoted as

q_{2}^{*} = (O, γ^{*}) (q_{2}),

is

Ψ_{τ} = (1 - τ) q_{1} + τ q_{2}^{*}, τ \in R .

(8)

On the other hand, the shape space (without considering the size (scale) of the curves) is

S_{4} = {[q] | q \in C^{0}} .

(9)

The distance in

S_{4}

is given by

d_{4} ([q_{1}], [q_{2}]) = inf_{O \in S O (n), γ \in Γ} {cos}^{- 1} ({〈 q_{1}, (O, γ) (q_{2}) 〉}_{L^{2}}),

(10)

and the geodesic between

q_{1}

and

q_{2}^{*}

is

α (τ) = \frac{1}{sin θ} (sin (θ (1 - τ)) q_{1} + sin (θ τ) q_{2}^{*}),

(11)

where

θ = {cos}^{- 1} {〈 q_{1}, q_{2}^{*} 〉}_{L^{2}}

.

3. A New Metric in the Shape and Size Space of Curves in $R^{n}$

When

S_{2}

is considered as shape and size space, it is difficult to distinguish whether the distance between two shapes

[q_{1}]

and

[q_{2}]

is due to the difference in shape or to the difference in size between the corresponding curves

β_{1}

and

β_{2}

. We are therefore going to consider another shape-size space for curves that will be isometric to

S_{2}

with another appropriate product metric.

Instead of considering in

L^{2}

the usual

L^{2}

-metric given in Equation (5), if

q \in L^{2} ([0, 1], R^{n})

, for any two vectors

v_{1}, v_{2}

in

T_{q} L^{2} ([0, 1]) \equiv L^{2} ([0, 1], R^{n})

, we will consider the following metric to endow

L^{2} ([0, 1])

with a Riemannian structure,

\hat{g} (v_{1}, v_{2}) : = \frac{1}{R^{2} (q)} {〈 v_{1}, v_{2} 〉}_{L^{2}} .

(12)

where

R (q) : = {(\int_{0}^{1} {| q (t) |}^{2} d t)}^{\frac{1}{2}} .

The case

R (q) = 0

will be excluded, which will mean that curves of length 0 are not considered in our space.

From this metric, we will endow (Theorem 1)

C^{0} \times R

with a Riemannian structure in such a way that

C^{0} \times R

will be isometric to

(L^{2} ([0, 1]), \hat{g}) .

This isometry will be exported (Theorem 2) to an isometry between

S_{2}

and

S_{4} \times R

.

Therefore, we obtain a new metric which enjoys the property that

S_{2}

can be decomposed into a product space consisting of two components:

S_{4} \times R

, where the second space is related with the length (size) of the curve. This new metric is associated with a distance function, see Corollary 3, given by

d_{new} ([p], [q]) = \sqrt{d_{4}^{2} ([\frac{p}{R (p)}], [\frac{q}{R (q)}]) + {ln}^{2} (\frac{R (p)}{R (q)})} .

(13)

This distance is invariant under rotations and under rescaling in the sense that

d_{new} (O ([p]), O ([q])) = d_{new} ([p], [q])

and

d_{new} (λ [p], λ [q]) = d_{new} ([p], [q])

for any O in

S O (n)

and any

λ > 0

.

3.1. An Isometry between $L^{2} ([0, 1], R^{n}) \ {\vec{0}}$ and $C^{0} \times R$

We will begin this section defining a function F which will provide an isometry between

L^{2} ([0, 1], R^{n}) \ {\vec{0}}

and

C^{0} \times R

. We consider the smooth map

\begin{matrix} F : L^{2} ([0, 1], R^{n}) & \ {\vec{0}} \to C^{0} \times R, \\ q & \mapsto F (q) : = (\frac{q}{R (q)}, ln R (q)) . \end{matrix}

Observe that F is well defined for any

L^{2} ([0, 1], R^{n})) \ {\vec{0}}

where

{\vec{0}} : = \{q \in L^{2} ([0, 1], R^{n}) / \int_{0}^{1} {| q (t) |}^{2} d t = 0\} = R^{- 1} (0) .

The function F has immediate smooth inverse given by

\begin{matrix} F^{- 1} : C^{0} & \times R \to L^{2} ([0, 1], R^{n}) \ {\vec{0}}, \\ (q, t) \mapsto F^{- 1} (q, t) = e^{t} q . \end{matrix}

The Functions R, $π$ and F and Their Properties

Now we state some properties of the function R, some properties of

π (q) : = \frac{1}{R (q)} q

from

L^{2}

to

C^{0}

, that is the natural projection given by the normalization of an SRVF using its norm, and some properties of the function F:

Proposition 1.

Properties of the functions R, π and F:

1.: Given a curve $β (t)$ and $q (t) = \frac{β^{'} (t)}{\sqrt{| β^{'} (t) |}}$ . Then, $R (q) = \sqrt{L (β)}$ is the square root of the length of the curve β.
2.: For any $γ \in Γ$ ,

$R (q (γ (t)) \sqrt{γ^{'} (t)})) = R (q (t)) .$
3.: For any $O \in SO (n),$

$R (O (q (t)) = R (q (t)) .$
4.: For any $O \in SO (n)$ ,

$π (O q) = O π (q) .$

Namely, the rotation group commutes with the projection π.
5.: For any $λ > 0$

$π (λ q) = π (q)$
6.: Given $(O, γ) \in SO (n) \times Γ$ , and $q \in L^{2} ([0, 1], R^{n}) \ {\vec{0}}$ then

$F (O (q (γ (t))) \sqrt{γ^{'} (t)}) = (O (π (q (γ (t))), ln R (q)) .$
7.: F is smooth and admits smooth inverse with never vanishing differential map.

From the properties of

π

, R and F, we can conclude the following diffeomorphisms.

Corollary 1.

From the function F we have that

1.: $L^{2} ([0, 1], R^{n}) \ {\vec{0}} \overset{diff}{\sim} C^{0} \times R .$
2.: Since $S_{2} = L^{2} / Γ \times SO (n)$ , then

$S_{2} \ {\vec{0}} \overset{diff}{\sim} (C^{0} / Γ \times SO (n)) \times R \overset{diff}{\sim} S_{4} \times R .$

As already mentioned at the beginning of the section, if

q \in L^{2} ([0, 1], R^{n}) \ {0}

, the tangent space at q can be identified with

L^{2} ([0, 1], R^{n})

itself,

T_{q} L^{2} ([0, 1], R^{n}) \ {0} \equiv L^{2} ([0, 1], R^{n})

and for any two vectors

v_{1}, v_{2}

in

T_{q} L^{2} ([0, 1], R^{n}) \ {0}

we will use the following metric to endow

L^{2} ([0, 1], R^{n}) \ {0}

with a Riemannian structure,

\hat{g} (v_{1}, v_{2}) : = \frac{1}{R^{2} (q)} {〈 v_{1}, v_{2} 〉}_{L^{2}} .

(14)

Therefore, using

F^{- 1} : C^{0} \times R \to L^{2} ([0, 1], R^{n}) \ {\vec{0}}

, we can pullback the metric

\hat{g}

to

{\hat{g}}^{*}

in

C^{0} \times R

by the differential map

d F^{- 1} : T_{(q, t)} C^{0} \times R \to T_{e^{t} p} L^{2} ([0, 1],, R^{n}) \ {0}

in order to endow

C^{0} \times R

with a Riemannian structure in such a way that

(C^{0} \times R, {\hat{g}}^{*})

will be isometric to

(L^{2} ([0, 1], R^{n}) \ {0}, \hat{g})

.

Theorem 1.

There is an isometry

F : (L^{2} ([0, 1], R^{n}) \ {\vec{0}}, \hat{g}) ⟶ C^{0} \times R

given by

F (p) = (\frac{p}{R (p)}, ln R (p)),

(15)

where the usual product metric is considered in

C^{0} \times R

.

Proof.

We have shown in Corollary 1 that F is a diffeomorphism; therefore, we only have to prove that the pullback

{\hat{g}}^{*}

of the metric

\hat{g}

is the usual product metric in

C^{0} \times R

.

T_{(q, t)} C^{0} \times R = T_{q} C^{0} \oplus T_{t} R

Given two vectors in

v_{1}, v_{2} \in T_{(q, t)} C^{0} \times R

the pullback

{\hat{g}}^{*} (v_{1}, v_{2})

is given by

\begin{matrix} {\hat{g}}^{*} (v_{1}, v_{2}) & = \hat{g} (d F^{- 1} (v_{1}), d F^{- 1} (v_{2})) = \\ = \frac{1}{R^{2} (e^{t} q)} {〈 d F^{- 1} (v_{1}), d F^{- 1} (v_{2}) 〉}_{L_{2}} . \end{matrix}

But

R^{2} (e^{t} q) = \int_{0}^{1} | e^{t} q (s) |^{2} d s = e^{2 t} \int_{0}^{1} {| q |}^{2} d s = e^{2 t},

because

q \in C^{0}

. Therefore, for any two vectors

v_{1}, v_{2} \in T_{(q, t)} C^{0} \times R

{\hat{g}}^{*} (v_{1}, v_{2}) = e^{- 2 t} {〈 d F^{- 1} (v_{1}), d F^{- 1} (v_{2}) 〉}_{L_{2}} .

Now, first of all, we need to prove that in the tangent space to

C^{0} \times R

at

(q, t)

,

T_{q} C^{0}

is orthogonal to

T_{t} R

. In order to do that, consider two vectors

v_{1} \in T_{q} C^{0}

and

v_{2} \in T_{t} R

and two curves

γ_{1}, γ_{2} : (- ϵ, ϵ) \to C^{0} \times R

, such that

\{\begin{matrix} γ_{1} (s) = (γ_{1}^{1} (s), t), γ_{1} (0) = (q, t), \frac{d}{d s} γ_{1} {(s) |}_{s = 0} = v_{1} \\ γ_{2} (s) = (q, γ_{2}^{2} (s)), γ_{1} (0) = (q, t), \frac{d}{d s} γ_{2} {(s) |}_{s = 0} = v_{2} \end{matrix}

Then,

d F^{- 1} (v_{1}) = \frac{d}{d s} F^{- 1} (γ_{1} (s)) {|_{s = 0} = \frac{d}{d s} (e^{t} γ_{1}^{1} (s)) |}_{s = 0} = e^{t} v_{1}

with

v_{1} \in T_{q} C^{0}

. Likewise,

\begin{matrix} d F^{- 1} (v_{2}) = & \frac{d}{d s} F^{- 1} (γ_{2} (s)) {|_{s = 0} = \frac{d}{d s} (e^{γ_{2}^{2} (s)} q) |}_{s = 0} \\ = & v_{2} e^{t} q \end{matrix}

where

v_{2} \in R, q \in C^{0}

. Hence,

g^{*} (v_{1}, v_{2}) = e^{- 2 t} {〈 e^{t} v_{1}, v_{2} e^{t} q 〉}_{L_{2}} = v_{2} {〈 v_{1}, q 〉}_{L_{2}} = 0

because

{〈 v_{1}, q 〉}_{L_{2}} = 0

for any

v_{1} \in T_{q} C^{0}

.

Similarly if

v_{1}, v_{2} \in T_{q} C^{0}

,

{\hat{g}}^{*} (v_{1}, v_{2}) = e^{- 2 t} {〈 e^{t} v_{1}, e^{t} v_{2} 〉}_{L_{2}} = {〈 v_{1}, v_{2} 〉}_{L_{2}}

and if

v_{1}, v_{2} \in T_{t} R

{\hat{g}}^{*} (v_{1}, v_{2}) = e^{- 2 t} {〈 v_{1} e^{t} q, v_{2} e^{t} q 〉}_{L_{2}} = v_{1} v_{2} {〈 q, q 〉}_{L_{2}} = v_{1} v_{2} .

□

Since we are using the usual product metric in

C^{0} \times R

, we conclude the following result:

Corollary 2.

Let

d_{\hat{g}}

denote the distance function in

(L^{2} ([0, 1], R^{n}) \ {\vec{0}}, \hat{g})

, then

\begin{matrix} d_{\hat{g}} (p, q) = & \sqrt{d_{C^{0}}^{2} (π (p), π (q)) + d_{R}^{2} (ln R (p), ln R (q))} \\ = & \sqrt{d_{C^{0}}^{2} (\frac{p}{R (p)}, \frac{q}{R (q)}) + {ln}^{2} (\frac{R (p)}{R (q)})} \end{matrix}

(16)

where

d_{C^{0}}

is the usual distance in

C^{0}

.

From this explicit expression of the distance, it is easy to see that the metric

\hat{g}

is invariant under the action of reparameterizations and rotations.

Proposition 2.

The group

S O (n) \times Γ

acts isometrically on

(L^{2} ([0, 1], R^{n}) \ {\vec{0}}, \hat{g})

.

Proof.

We need to prove that

\begin{matrix} d_{\hat{g}} ((O, γ) (p), (O, γ) (q)) = & \sqrt{d_{C^{0}}^{2} (\frac{(O, γ) (p)}{R ((O, γ) (p))}, \frac{(O, γ) (q)}{R ((O, γ) (q))}) + {ln}^{2} (\frac{R ((O, γ) (p))}{R ((O, γ) (q))})} \\ = & d_{\hat{g}} (p, q), \forall (O, γ) \in S O (n) \times Γ . \end{matrix}

By Proposition 1 we know that

R ((O, γ) (p)) = R (p)

(and

R ((O, γ) (q)) = R (q)

). Hence, the proposition follows because

\begin{matrix} d_{C^{0}}^{2} (\frac{(O, γ) (p)}{R ((O, γ) (p))}, \frac{(O, γ) (q)}{R ((O, γ) (q))}) = & d_{C^{0}}^{2} (\frac{(O, γ) (p)}{R (p)}, \frac{(O, γ) (q)}{R (q)}) \\ = & \int_{0}^{1} {∥\frac{(O, γ) (p) (t)}{R (p)} - \frac{(O, γ) (q) (t)}{R (q)}∥}^{2} d t \\ = & \int_{0}^{1} {∥O ((\frac{p (γ (t))}{R (p)} - \frac{q (γ (t))}{R (q)}) \sqrt{γ^{'} (t)})∥}^{2} d t \\ = & \int_{0}^{1} {∥\frac{p (γ (t))}{R (p)} - \frac{q (γ (t))}{R (q)}∥}^{2} γ^{'} (t) d t = \int_{0}^{1} {∥\frac{p (τ)}{R (p)} - \frac{q (τ)}{R (q)}∥}^{2} d τ \\ = & d_{C^{0}}^{2} (\frac{p}{R (p)}, \frac{q}{R (q)}) \end{matrix}

□

3.2. The Isometry between $S_{2}$ and $S_{4} \times R$

Using the isometry given by Theorem 1, an isometry between

S_{2}

and

S_{4} \times R

can be constructed as stated in the following theorem:

Theorem 2.

The isometry F can be exported to an isometry

[F]

by using the following commutative diagram

\begin{matrix} L^{2} ([0, 1], R^{n}) \ {\vec{0}} & \overset{F}{\to} & C^{0} \times R \\ ↓ Π_{1} & ↓ Π_{2} \\ S_{2} \ {0} & \overset{[F]}{\to} & S_{4} \times R \end{matrix}

where

Π_{1} (q) = [q]

and

Π_{2} (q, t) = ([q], t)

. Namely,

[F] [q] = Π_{2} (F (q))

for any

q \in Π_{1}^{- 1} ([q])

.

Proof.

From Proposition 2 we know that

SO (n) \times Γ

acts by isometries on

(L^{2} \ {\vec{0}}, \hat{g})

. Therefore, since the action of the group

Γ \times S O (n)

on

(L^{2} \ {\vec{0}}, \hat{g})

and on

C^{0}

is by isometries, and bearing in mind the diffeomorphisms in Corollary 1, we obtain the result. □

The isometry

[F]

of the above theorem can be used to obtain the expression of the new distance function.

Corollary 3.

Let

d_{new}

denote the distance function in

S_{2} \ {\vec{0}}

when the isometry

[F]

of Theorem 2 is considered, then

\begin{matrix} d_{new} ([p], [q]) = & \sqrt{d_{4}^{2} ([π (p)], [π (q)]) + d_{R}^{2} (ln R (p), ln R (q))} \\ = & \sqrt{d_{4}^{2} ([\frac{p}{R (p)}], [\frac{q}{R (q)}]) + {ln}^{2} (\frac{R (p)}{R (q)})} . \end{matrix}

(17)

In the following proposition we are proving that

d_{new}

is a well defined distance function.

Proposition 3.

Let

d_{new}

denote the distance function in

S_{2} \ {\vec{0}}

when the isometry

[F]

of Theorem 2 is considered, then

1.: $d_{new} ([p], [q]) = d_{new} ([q], [p])$ for all $[p], [q] \in S_{2} \ \vec{0}$ .
2.: $d_{new} ([p], [q]) = 0$ , if and only if, $d_{4} ([\frac{p}{R (p)}], [\frac{q}{R (q)}]) = 0$ and $R (p) = R (q)$ .
3.: For any $[p], [q], [r] \in S_{2} \ \vec{0}$

$d_{new} ([p], [r]) \leq d_{new} ([p], [q]) + d_{new} ([q], [r]) .$
4.: $d_{new} (λ [p], λ [q]) = d_{new} ([q], [p])$ for all $[p], [q] \in S_{2} \ \vec{0}$ , $λ \in R, λ \neq 0$ .

Proof.

Most of the statements of the proposition follows directly from the definition and the properties of

d_{4}

and R. We shall prove the triangle inequality for the sake of completeness. Let us denote by

\vec{v}, \vec{w} \in R^{2}

the vectors given by

\vec{v} : = (d_{4} ([\frac{p}{R (p)}], [\frac{q}{R (q)}]), ln (\frac{R (p)}{R (q)})), \vec{w} : = (d_{4} ([\frac{q}{R (q)}], [\frac{r}{R (r)}]), ln (\frac{R (q)}{R (r)}))

Then, by applying the triangle inequality for

d_{4}

,

\begin{matrix} {(d_{new} ([p], [r]))}^{2} = & d_{4}^{2} ([\frac{p}{R (p)}], [\frac{r}{R (r)}]) + {(ln (\frac{R (p)}{R (r)}))}^{2} \\ \leq & {(d_{4} ([\frac{p}{R (p)}], [\frac{q}{R (q)}]) + d_{4} ([\frac{q}{R (q)}], [\frac{r}{R (r)}]))}^{2} + {(ln (\frac{R (p)}{R (q)}) + ln (\frac{R (q)}{R (r)}))}^{2} \\ = & ∥ \vec{v} + \vec{w} ∥^{2} = ∥ \vec{v} ∥^{2} + ∥ \vec{w} ∥^{2} + 2 ∥ \vec{v} ∥ ∥ \vec{w} ∥ cos (θ) \leq ∥ \vec{v} ∥^{2} + ∥ \vec{w} ∥^{2} + 2 ∥ \vec{v} ∥ ∥ \vec{w} ∥ = {(∥ \vec{v} ∥ + ∥ \vec{w} ∥)}^{2} \\ = & {(d_{new} ([p], [q]) + d_{new} ([q], [r]))}^{2} . \end{matrix}

□

4. The Mean Shape

Given

{β_{1}, \dots, β_{n}}

, a sample of parameterized curves, and their corresponding SRVF,

{q_{1}, \dots, q_{n}}

, the Karcher mean shape regarding the new metric

d_{new}

is defined as

[{\hat{μ}}_{n e w}] = arg m i n_{q} \sum_{i = 1}^{n} d_{new} {([q], [q_{i}])}^{2} = arg m i n_{q} \sum_{i = 1}^{n} (d_{4}^{2} ([\frac{q}{R (q)}], [\frac{q_{i}}{R (q_{i})}]) + {ln}^{2} (\frac{R (q)}{R (q_{i})})) .

The value of

R (q)

that minimizes

\sum_{i = 1}^{n} {ln}^{2} (\frac{R (q)}{R (q_{i})})

is the geometric mean of the

{R (q_{1}), \dots, R (q_{n})}

, i.e.,

\sqrt[n]{\prod_{i = 1}^{n} R (q_{i})},

and a gradient-based approach for finding the value of

q / R (q)

that minimizes

\sum_{i = 1}^{n} d_{4}^{2} ([\frac{q}{R (q)}], [\frac{q_{i}}{R (q_{i})}])

can be found in [10,11]. The detailed algorithm to find the Karcher mean in the shape space

S_{2}

can be found in [12].

Given

{\hat{μ}}_{C_{0}}

the Karcher mean of

{q_{i} / R (q_{i})}_{i = 1, \dots, n}

in the shape space, the Karcher mean in the shape and size space with the new metric is obtained as

{\hat{μ}}_{n e w} = {\hat{μ}}_{C_{0}} \sqrt[n]{\prod_{i = 1}^{n} R (q_{i})} .

Hence, applying Equation (2), the mean curve is

{\hat{β}}_{new} (t) = {(\prod_{i = 1}^{n} L (β_{i}))}^{\frac{1}{n}} \int_{0}^{t} {\hat{μ}}_{C_{0}} (s) | {\hat{μ}}_{C_{0}} (s) | d s

(18)

where

L (β_{i})

is the length of the curve

β_{i}

.

5. Geodesics

Moreover, we can use the isometry given by the Theorem 2 to provide an explicit expression for the geodesics in

(L^{2} ([0, 1]) \ {\vec{0}}, \hat{g})

.

Corollary 4.

Any geodesic in

C^{0} \times R

is obtained as

t \mapsto (α (t), a + b t)

where α is a geodesic in

C^{0}

and

a, b \in R

. Therefore any geodesic in

(L^{2} ([0, 1]) \ {\vec{0}}, \hat{g})

can be written as

C (t) = A e^{b t} α (t)

with

A, b \in R

and α a geodesic in

C^{0}

.

For any p and q in

L^{2} ([0, 1]) \ {\vec{0}}

, the above corollary allows us to obtain the geodesic segment (with respect to

\hat{g}

) joining p and q. Namely, we want a geodesic curve (in the new metric)

γ : [0, 1] \to L^{2} ([0, 1]) \ {\vec{0}}

such that

γ (0) = p

and

γ (1) = q

. By using the corollary, we only have to consider the geodesic segment in

C^{0}

with

α (0) = \frac{p}{R (p)}

and

α (1) = \frac{q}{R (q)}

and the geodesic segment in

R

joining

ln R (p)

and

ln R (p)

, i.e.,

τ \mapsto ln R (p) + τ (ln R (q) - ln R (p))

Therefore, the geodesic segment joining p and q is

γ (τ) = e^{ln R (p) + τ (ln R (q) - ln R (p))} α (τ) = R {(p)}^{1 - τ} R {(q)}^{τ} α (τ), τ \in [0, 1]

This geodesic segment in

L^{2} ([0, 1]) \ {\vec{0}}

can be understood as a family of deformations of curves in the following sense: if we have two curves

β_{1} : [0, 1] \to R^{n}

and

β_{2} : [0, 1] \to R^{n}

, we obtain two points in

L^{2} ([0, 1]) \ {\vec{0}}

p (t) = \frac{β_{1}^{'} (t)}{\sqrt{| β_{1}^{'} (t) |}} and q (t) = \frac{β_{2}^{'} (t)}{\sqrt{| β_{2}^{'} (t) |}}

with

R (p (t)) = \sqrt{L (β_{1})}

and

R (q (t)) = \sqrt{L (β_{2})}

. The geodesic segment

α

in

C_{0}

joining

\frac{p (t)}{R (p (t))}

and

\frac{q (t)}{R (q (t))}

is a family of length-one curves which can be labeled with

τ \in [0, 1]

,

α_{τ} : [0, 1] \to R^{n}, α_{0} (t) = \frac{p (t)}{R (p (t))}, α_{1} (t) = \frac{q (t)}{R (q (t))}

Finally, we obtain the family of curves

{\tilde{γ}}_{τ} (t) = L^{1 - τ} (β_{1}) L^{τ} (β_{1}) \int_{0}^{t} α_{τ} (s) | α_{τ} (s) | d s, τ \in [0, 1]

with

{\tilde{γ}}_{0} (t) = β_{1} (t), {\tilde{γ}}_{1} (t) = β_{2} (t) .

6. Application to a Simulated Data Set

In order to check the performance of the new metric (Equation (13)), we have simulated several curves with different shapes and sizes. In particular, we have simulated ten 3D cylindric spirals,

β_{1 i} (t)

i = 1, \dots, 10

,

t \in [0, 1]

, and ten circumferences,

β_{2 i} (t)

i = 1, \dots, 10

,

t \in [0, 1]

, from:

\begin{matrix} x_{1 i} = a_{i} cos (8 π t); y_{1 i} = a_{i} sin (8 π t); z_{1 i} = b_{i} t; \\ x_{2 i} = r_{i} cos (2 π t); y_{2 i} = r_{i} sin (2 π t); z_{2 i} = 0; \end{matrix}

where

t \in [0, 1]

and

\forall i \in {1, \dots, 10}

,

b_{i} = 10 + e_{1 i}

,

e_{1 i} \sim N (0, 1)

,

a_{i} = \sqrt{\frac{L^{2} - b_{i}^{2}}{64 π^{2}}}

,

L = 270

,

r_{i} = 15 + e_{2 i}

,

e_{2 i} \sim N (0, 2.5)

(so all these spirals will have different shape and the same length, and all the circumferences will have the same shape but different lengths). Figure 1a,b shows the simulations obtained.

The Karcher means of the ten spirals and of the ten circumferences are computed with the new metric (

{\hat{β}}_{n e w}

, Equation (18)) and by using the distance proposed by [5] in the shape and size space

S_{2}

. These means are shown in Figure 2, where the original curves are plotted in light blue;

{\hat{β}}_{n e w}

is plotted in black color and

{\hat{β}}_{2}

, the Karcher-mean using the distance

d_{2}

, is plotted in red color. As can be seen, in Figure 2a,b, there is a very slight difference between the means

{\hat{μ}}_{n e w}

and

{\hat{μ}}_{2}

of the ten spirals. Figure 2c,d show that the means coincide in the case of the circumferences.

An example comparing the geodesics obtained with

d_{n e w}

and

d_{2}

, can be seen in Figure 3, without great differences among them.

Finally, the distance matrices

D_{n e w}

and

D_{2}

between the twenty curves are computed using both metrics, and in order to compare the performance of

d_{2}

and

d_{n e w}

, a multidimensional scaling (MDS) analysis [13] has been carried out. The MDS algorithm is a descriptive data reduction procedure to display the information contained in a

(m \times m)

-distance matrix, D, in a low-dimensional space such that the between-object distances are preserved as well as possible. Then, for each distance matrix D, the method looks for a set of orthogonal variables

{y_{1}, \dots, y_{p}}

,

p < m

such that the Euclidean distances of the elements with respect to these variables are as close as possible to the distances given in the original matrix D. In Figure 4, MDS has been applied to the distance matrices computed with both metrics. In both graphics (Figure 4a,b), the black points represent the twenty spirals

α_{1 i}

shown in Figure 1a, and the green points represent the twenty circumferences

α_{2 i}

shown in Figure 1b.

As can be seen, there are slights differences among the MDS representations of both metrics. If we perform a k-means cluster with k = 2 from

D_{n e w}

and

D_{2}

, in both cases the two groups are perfectly recovered. We also recover the two groups if we apply DBSCAN [14,15].

If we re-scale the twenty figures, multiplying them by 50, and we consider the twenty resulting curves jointly to the twenty original ones, we can compute again the distance matrices

D_{n e w}

and

D_{2}

between the 40 curves. The MDS scaling representation of these distance matrices can be found in Figure 5. In both graphics, the initial spirals

{β_{i 1}}_{i = 1, \dots, 10}

are plotted in green; the circumferences

{β_{i 2}}_{i = 1, \dots, 10}

are plotted in black, and their scaled versions,

{50 β_{i 1}}_{i = 1, \dots, 10}

and

{50 β_{i 2}}_{i = 1, \dots, 10}

are plotted in red and blue color, respectively.

Figure 5 shows one important difference between the performance of both metrics. By definition,

d_{n e w}

is invariant to changes of scale i.e.,

d_{n e w} (β_{i l}, β_{j m}) = d_{n e w} (k β_{i l}, k β_{j m})

,

\forall k \in R

and this equality does not hold for

d_{2}

, where the distance among curves increases with the scaling factor.

If we perform a k-means cluster analysis with k = 4 from the distance matrices, the four groups are recovered from

D_{n e w}

, but for

D_{2}

, the distance among the scaled circumferences increases regarding to the distance among the initial circumferences, so the group of large circumferences is splitted into two clusters while the initial (short) curves (spirals and circumferences) are joined in a unique cluster (Figure 6). However, the algorithm DBSCAN applied on both distance matrices, allow us in both cases recover again the four initial groups.

As a third step of the simulation study, let us consider a broader data set with the initial spirals

{β_{i 1}}_{i = 1, \dots, 10}

, the circumferences

{β_{i 2}}_{i = 1, \dots, 10}

, their scaled versions

{50 β_{i 1}}_{i = 1, \dots, 10}

and

{50 β_{i 2}}_{i = 1, \dots, 10}

, jointly with two new re-scaled sets

{250 β_{i 1}}_{i = 1, \dots, 10}

and

{250 β_{i 2}}_{i = 1, \dots, 10}

. The distance matrices

D_{n e w}

and

D_{2}

between the 60 curves are computed and the MDS scaling representation of these distance matrices can be found in Figure 7.

A k-means cluster analysis with

k = 6

so as the DBSCAN algorithm applied on

D_{n e w}

recovers the 6 simulated groups. However, the DBSCAN algorithm applied on

D_{2}

provides 5 clusters on this data set joining in a single cluster the initial (short) curves (spirals and circumferences) and distinguishing the other groups (Figure 7d). The k-means algorithm with

k = 5

provides the same result, but if the k-means algorithm is applied with

k = 6

clusters, the set of the largest circumferences is split into two groups (Figure 7c). Once again, it can be clearly seen that the distance

d_{2}

among shapes increases with the scaling factor.

7. Detection of Outliers

Although there are a variety of techniques for outlier detection for different types of data in any metric space based on nearest-neighbor techniques (see [16] for a detailed explanation), they have not been fully exploited in the shape and size space of curves. Some of the main references are based on box-plots of the distances to the median to detect outliers, such as [12,17], and more recently the method based on elastic depths proposed by [18].

We propose a technique for outlier detection based on the proposed distance. Nearest-neighbor techniques are very popular due to their good results, conceptual simplicity and interpretability in the classic multivariate case [19]. We consider this idea for the shape and size space of curves. The k-NN Anomaly Detection algorithm searches for the nearest k-neighbors, i.e., the k closest curves, for every element in the database, and calculates the average distance of the k-neighbors. In the multivariate case, the Euclidean distance is used, but here we use the proposed distance to find the neighbors. This procedure returns outlier scores; as usual, the highest score denotes the highest degree of outlierness. A way to establish a binary decision about whether or not to label a point as an outlier is to use a box-plot with the outlier scores and to consider the points detected as outliers by the box-plot as anomalies.

We compare our procedure with that introduced in [12,17,18] using the data sets of open curves used in [12,17], which are available from [20]. For the Example 1 considered in [17] formed by 70 spirals, Ref. [12] found 6 outliers, Ref. [17] also found 6 outliers (2 scale outliers and 4 mild shape outliers) and [18] found 4 outliers (3 due to amplitude and 1 due to phase) with the recommended value of k = 2, which is the boxplot multiplier, while 9 outliers are found with the classical k = 1.5. However, with our methodology, we detect 8 outliers (the results are stable, we obtain the same outliers with k = 5, 10 or 15). We have also computed the square of the distance in Equation (17) (

d {(p, q)}^{2}

) and we have computed the contribution in percentage due to shape

d_{C^{0}}^{2} (π (p), π (q)) / d {(p, q)}^{2}

and due to size

d_{R}^{2} (ln R (p), ln R (q)) / d {(p, q)}^{2}

, for each outlier. The percentages of contribution due to shape for the 8 outliers are: 21%, 29%, 34%, 35%, 41%, 50%, 50% and 74%. For the Example 3 in [17] formed by 176 fiber tracts in the human brain extracted from a diffusion tensor magnetic resonance image (DT-MRI), we detect the same 11 outliers also detected by [12,17], and all are due to shape, with percentages of contribution due to shape of 90%. However, [18] with k = 2 returned 62 outliers (62 due to amplitude, 23 of them are also outliers due to phase), i.e., 35% of points of the sample are considered outliers.

8. Application to a Real Data Set

Footwear design relies greatly on knowledge of foot size and shape. Proper fit is an essential condition for potential shoe buyers, besides the fact that poorly fitting footwear can cause foot pain and deformity, especially in women. Although people with extreme feet (very different from the rest) may be the most likely customers with poor fit, in anthropometric studies they are not usually searched. However, outliers report very valuable information for the footwear design process, since they can help shoe designers adjust their designs to a larger part of the population and can increase their awareness of customers characteristics that will make them uncomfortable to wear, whether when considering a range of special sizes or modifying any shoe feature to fit more users.

The aim of this section was to detect the outliers in an anthropometric foot database. We carry out a separate analysis for men and women, since gender foot shape differences are well-known [21,22]. Furthermore, footwear designers usually propose different types of shoes for women and men.

8.1. Foot Database

A total of 770 3D right foot scans were carried out. A total of 389 men and 381 women representing the Spanish adult female and male population were measured. The data were collected in different regions across Spain at shoe shops and workplaces using an INFOOT laser scanner [23]. The scanning process is carried out while the participant stands upright placing equal weight on each foot, in a specific position and orientation (see Figure 8). The result is a 3D point cloud representing the complete outer surface of the foot, including the sole of the foot.

3D foot shapes were registered using the method described by [24] with a template made up of 5000 vertices, with five foot landmarks (i.e., 1st and 2nd toe tips; 1st and 5th metatarsal heads; and pternion; see Figure 9). This set of landmarks is automatically located on 3D foot scans and allows the extraction of foot measurements and contours, according to the definitions used by the Human Shape Lab of the Biomechanics Institute of Valencia (IBV), which comply with standards and are compatible with the accepted definitions found in the literature [25,26,27,28]. In particular, we consider the longitudinal contour passing through the Ball Position. The mean shapes for men and women in

S_{2}

are displayed in Figure 10.

8.2. Detection of Foot Outliers

We have applied our outlier procedure to the curves of men and women with k = 10. A total of 24 and 18 outliers are detected for men and women, respectively. In order to briefly describe the outlier curves detected, we show the percentiles of each outlier for the four variables that could most influence shoe fit according to shoe design experts. Specifically, these variables are: Foot Length, FL (distance between the rear and foremost point the foot axis); Ball Girth, BG (perimeter of the ball section); Ball Width, BW (maximal distance between the extreme points of the ball section projected onto the ground plane); and Instep Height, IH (maximal height of the instep section, located at 50% of the foot length). Table 1 and Table 2 show the percentile profiles of the outliers found for women and men, respectively. Note that for some of the outliers some of the variables show extreme percentiles, i.e., very high or very low percentiles. However, in many other cases, outliers do not show extreme values in these variables. Therefore, outliers can be due to the particular combination of the variables or due to the particular configuration of the curve that cannot be summarized by these four variables. In summary, with the proposed procedure, we can detect feet that are “not normal”, which may not be detected with a classic multivariate analysis. Figure 11 shows the most outlier feet for men and women. For men, the outlier feet are the 14th and 9th, while for the women the outliers are the 7th and 17th. Note that those feet do not have really extreme percentiles.

We have also computed the contributions of shape and size. The main contribution is due to shape for both men and women. The mean is 96% for men and 97% for women.

9. Conclusions

We have proposed a new metric in the shape and size space

S_{2}

that, unlike the previous proposals, allows us to distinguish whether the distance between two shapes

[q_{1}]

and

[q_{2}]

is due to the difference in shape or to the difference in size between the corresponding curves

β_{1}

and

β_{2}

. It has been compared with the metric proposed by [5] in a simulation study, where our proposal is shown to perform better. Furthermore, we also show the advantages of the new metric, such as its invariance to changes of scale.

For the first time, we have also proposed a procedure based on the distances and NN techniques in

S_{2}

for finding outlier curves in

S_{2}

. We have applied it to a novel industrial data set. The foot outliers found by considering their contour can help shoe designers improve their designs in order to provide customers with a better fit.

In future work, in regards to the theory, closed curves could be considered and an the appropriate metric defined. Furthermore, the new metric could be used in other types of statistical problems besides outlier detection, such as classification, clustering, or new ones, where curves in

S_{2}

have never been used before, such as archetype analysis [29] or archetypoid analysis [30]. Finally, in regards to the footwear application, the outlier procedure with the new metric could be applied to other kinds of foot contours, such as the Ball Girth, and of course, scopes for other fields of application.

Author Contributions

Conceptualization and methodology: X.G.-A., I.E., V.G.; software, I.E. and M.V.I.-G.; validation, writing—original draft preparation and writing—review and editing, I.E., V.G., X.G.-A. and M.V.I.-G. All authors have read and agreed to the published version of the manuscript.

Funding

This work is supported by the following grants: DPI2017-87333-R from the Spanish Ministry of Science, Innovation and Universities (AEI/FEDER, EU) and UJI-B2017-13 from Universitat Jaume I.

Acknowledgments

The authors would like to thank Sebastian Kurtek for providing us with the code from [5] and the “Biomechanics Institute of Valencia” for providing us with the foot database.

Conflicts of Interest

The authors declare no conflict of interest.

References

Dryden, I.; Mardia, K. Size and shape analysis of landmark data. Biometrika 1992, 79, 57–68. [Google Scholar] [CrossRef]
Klassen, E.; Srivastava, A.; Mio, M.; Joshi, S.H. Analysis of planar shapes using geodesic paths on shape spaces. IEEE Trans. Pattern Anal. Mach. Intell. 2004, 26, 372–383. [Google Scholar] [CrossRef] [PubMed]
Younes, L.; Michor, P.; Shah, J.; Mumford, D. A Metric on Shape Space With Explicit Geodesics. Rend. Lincei Mat. Appl. 2008, 19, 25–57. [Google Scholar] [CrossRef] [Green Version]
Srivastava, A.; Klassen, E.; Joshi, S.H.; Jermyn, I.H. Shape analysis of elastic curves in euclidean spaces. IEEE Trans. Pattern Anal. Mach. Intell. 2011, 33, 1415–1428. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Kurtek, S.; Srivastava, A.; Klassen, E.; Ding, Z. Statistical Modeling of Curves Using Shapes and Related Features. J. Am. Stat. Assoc. 2012, 107, 1152–1165. [Google Scholar] [CrossRef]
Sundaramoorthi, G.; Mennucci, A.; Soatto, S.; Yezzi, A. A new geometric metric in the space of curves, and applications to tracking deforming objects by prediction and filtering. SIAM J. Imaging Sci. 2011, 4, 109–145. [Google Scholar]
Kouchi, M. 3—Anthropometric methods for apparel design: Body measurement devices and techniques. In Anthropometry, Apparel Sizing and Design; Gupta, D., Zakaria, N., Eds.; Woodhead Publishing: Cambridge, UK, 2014; pp. 67–94. [Google Scholar]
Kuehnapfel, A.; Ahnert, P.; Loeffler, M.; Broda, A.; Scholz, M. Reliability of 3D laser-based anthropometry and comparison with classical anthropometry. Sci. Rep. 2016, 6, 26672. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Srivastava, A.; Klassen, E.P. Functional and Shape Data Analysis; Springer: Berlin/Heidelberg, Germany, 2016. [Google Scholar]
Pennec, X. Intrinsic Statistics on Riemannian Manifolds: Basic Tools for Geometric Measurements. J. Math. Imaging Vis. 2006, 25, 127–154. [Google Scholar] [CrossRef] [Green Version]
Le, H. Locating Fréchet means with application to shape spaces. Adv. Appl. Probab. 2001, 33, 324–338. [Google Scholar] [CrossRef]
Kurtek, S.; Su, J.; Grimm, C.; Vaughan, M.; Sowell, R.; Srivastava, A. Statistical analysis of manual segmentations of structures in medical images. Comput. Vis. Image Underst. 2013, 117, 1036–1050. [Google Scholar] [CrossRef]
Cox, T.F.; Cox, M.A. Multidimensional Scaling; CRC Press: Boca Raton, FL, USA, 2000. [Google Scholar]
Ester, M.; Kriegel, H.P.; Sander, J.; Xu, X. A density-based algorithm for discovering clusters in large spatial databases with noise. In Proceedings of the Kdd, Portland, OR, USA, 2–4 August 1996; Volume 96, pp. 226–231. [Google Scholar]
Hahsler, M.; Piekenbrock, M.; Doran, D. dbscan: Fast density-based clustering with R. J. Stat. Softw. 2019, 91, 1–30. [Google Scholar] [CrossRef] [Green Version]
Aggarwal, C.C. Outlier Analysis, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
Xie, W.; Chkrebtii, O.; Kurtek, S. Visualization and Outlier Detection for Multivariate Elastic Curve Data. IEEE Trans. Vis. Comput. Graph. 2019. [Google Scholar] [CrossRef]
Harris, T.; Tucker, J.D.; Li, B.; Shand, L. Elastic depths for detecting shape anomalies in functional data. Technometrics 2020, 1–25. [Google Scholar] [CrossRef]
Goldstein, M.; Uchida, S. A comparative evaluation of unsupervised anomaly detection algorithms for multivariate data. PLoS ONE 2016, 11, e0152173. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Cho, M.H.; Asiaee, A.; Kurtek, S. Elastic Statistical Shape Analysis of Biological Structures with Case Studies: A Tutorial. Bull. Math. Biol. 2019, 81, 2052–2073. [Google Scholar] [CrossRef] [PubMed]
Krauss, I.; Langbein, C.; Horstmann, T.; Grau, S. Sex-related differences in foot shape of adult Caucasians—A follow-up study focusing on long and short feet. Ergonomics 2011, 54, 294–300. [Google Scholar] [CrossRef] [PubMed]
Saghazadeh, M.; Kitano, N.; Okura, T. Gender differences of foot characteristics in older Japanese adults using a 3D foot scanner. J. Foot Ankle Res. 2015, 8, 29. [Google Scholar] [CrossRef] [PubMed] [Green Version]
I-Ware Laboratory. Available online: http://www.i-ware.co.jp/ (accessed on 24 August 2020).
Allen, B.; Curless, B.; Popović, Z. The space of human body shapes: Reconstruction and parameterization from range scans. ACM Trans. Graph. (TOG) 2003, 22, 587–594. [Google Scholar] [CrossRef]
Rossi, W.A.; Tennant, R. Professional Shoe Fitting; National Shoe Retailers Association: Bicester, UK, 2013. [Google Scholar]
Ramiro, J.; Alcántara, E.; Forner, A.; Ferrandis, R.; García-Belenguer, A.; Durá, J.; Vera, P.; Brizuela, G.; Llana, S. Guía de Recomendaciones Para el Diseño de Calzado; Instituto de Biomecánica de Valencia: Valencia, Spain, 1995; pp. 135–151. [Google Scholar]
Goonetilleke, R.S. The Science of Footwear; CRC Press: Boca Raton, FL, USA, 2012. [Google Scholar]
Luximon, A. Handbook of Footwear Design and Manufacture; Elsevier: Amsterdam, The Netherlands, 2013. [Google Scholar]
Cutler, A.; Breiman, L. Archetypal Analysis. Technometrics 1994, 36, 338–347. [Google Scholar] [CrossRef]
Vinué, G.; Epifanio, I.; Alemany, S. Archetypoids: A new approach to define representative archetypal data. Comput. Stat. Data Anal. 2015, 87, 102–115. [Google Scholar]

Figure 1. Simulated curves. (a) Ten spirals with different shapes and common length. (b) Ten circumferences with common shape and different lengths.

Figure 2. (a,c) Simulated curves and their corresponding means. (b,d) Comparison of the Karcher means obtained with the two different distances.

Figure 3. Comparison of the geodesic obtained using the two different distances.

Figure 4. Multidimensional scaling (MDS) applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten spirals are marked in green and the ten circumferences are represented with black asterisks.

Figure 4. Multidimensional scaling (MDS) applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten spirals are marked in green and the ten circumferences are represented with black asterisks.

Figure 5. MDS applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten initial spirals are marked in green and the ten initial circumferences are represented with black asterisk, the spirals re-scaled by 50 are the red asterisks and the circumferences are plotted in blue.

Figure 5. MDS applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten initial spirals are marked in green and the ten initial circumferences are represented with black asterisk, the spirals re-scaled by 50 are the red asterisks and the circumferences are plotted in blue.

Figure 6. MDS applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten initial spirals are marked with green asterisks and the ten initial circumferences are represented with black asterisks, the enlarged spirals are plotted in red and the enlarged circumferences are plotted in blue.

Figure 6. MDS applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten initial spirals are marked with green asterisks and the ten initial circumferences are represented with black asterisks, the enlarged spirals are plotted in red and the enlarged circumferences are plotted in blue.

Figure 7. MDS applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten initial spirals are marked in green and the ten initial circumferences are represented with black asterisks. The enlarged spirals are plotted in red (factor 50) and magenta (factor 250) and the enlarged circumferences are plotted in blue (factor 50) and cyan (factor 250). The clusters obtained on

d_{2}

are plotted: (c) using k-means algorithm with

k = 6

, (d) using DBSCAN and k-means,

k = 5

.

Figure 7. MDS applied to the distance matrices: (a) using

d_{n e w}

, (b) using

d_{2}

. The ten initial spirals are marked in green and the ten initial circumferences are represented with black asterisks. The enlarged spirals are plotted in red (factor 50) and magenta (factor 250) and the enlarged circumferences are plotted in blue (factor 50) and cyan (factor 250). The clusters obtained on

d_{2}

are plotted: (c) using k-means algorithm with

k = 6

, (d) using DBSCAN and k-means,

k = 5

.

Figure 8. Infoot^® scanner.

Figure 9. Foot landmarks used for registration in the database and foot template topology (the last image).

Figure 10. Mean shapes of contours for men (left) and women (right).

Figure 11. The most outlier feet for men (first row) and women (second row).

Table 1. Percentile profiles of outliers of foot shape variables for women.

FL	BG	BW	IH
64	44	70	84
36	90	61	65
25	39	17	24
61	95	100	100
66	57	83	70
28	35	73	53
70	58	100	99
66	49	76	46
63	47	79	88
69	88	67	94
48	50	84	64
79	66	23	22
42	54	45	11
73	40	68	35
100	99	68	98
43	4	41	23
30	31	32	13
57	17	35	77

Table 2. Percentile profiles of outliers of foot shape variables for men.

FL	BG	BW	IH
69	51	45	53
65	49	38	77
25	39	17	24
75	17	24	14
37	25	60	37
86	74	2	13
47	85	15	21
58	32	3	11
43	59	11	1
72	39	31	48
10	2	8	4
99	78	43	43
7	30	30	82
56	30	12	2
82	39	40	28
20	7	11	54
96	58	53	28
42	54	44	11
53	80	91	100
53	58	28	49
88	72	38	84
3	3	86	79
19	54	84	97
13	37	99	91

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Epifanio, I.; Gimeno, V.; Gual-Arnau, X.; Ibáñez-Gual, M.V. A New Geometric Metric in the Shape and Size Space of Curves in R n . Mathematics 2020, 8, 1691. https://doi.org/10.3390/math8101691

AMA Style

Epifanio I, Gimeno V, Gual-Arnau X, Ibáñez-Gual MV. A New Geometric Metric in the Shape and Size Space of Curves in R n . Mathematics. 2020; 8(10):1691. https://doi.org/10.3390/math8101691

Chicago/Turabian Style

Epifanio, Irene, Vicent Gimeno, Ximo Gual-Arnau, and M. Victoria Ibáñez-Gual. 2020. "A New Geometric Metric in the Shape and Size Space of Curves in R n " Mathematics 8, no. 10: 1691. https://doi.org/10.3390/math8101691

APA Style

Epifanio, I., Gimeno, V., Gual-Arnau, X., & Ibáñez-Gual, M. V. (2020). A New Geometric Metric in the Shape and Size Space of Curves in R n . Mathematics, 8(10), 1691. https://doi.org/10.3390/math8101691

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A New Geometric Metric in the Shape and Size Space of Curves in R n

Abstract

1. Introduction

2. Classical Spaces of Curves in $R^{n}$ for the SRVF Representation

3. A New Metric in the Shape and Size Space of Curves in $R^{n}$

3.1. An Isometry between $L^{2} ([0, 1], R^{n}) \ {\vec{0}}$ and $C^{0} \times R$

The Functions R, $π$ and F and Their Properties

3.2. The Isometry between $S_{2}$ and $S_{4} \times R$

4. The Mean Shape

5. Geodesics

6. Application to a Simulated Data Set

7. Detection of Outliers

8. Application to a Real Data Set

8.1. Foot Database

8.2. Detection of Foot Outliers

9. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

FL	BG	BW	IH
64	44	70	84
36	90	61	65
25	39	17	24
61	95	100	100
66	57	83	70
28	35	73	53
70	58	100	99
66	49	76	46
63	47	79	88
69	88	67	94
48	50	84	64
79	66	23	22
42	54	45	11
73	40	68	35
100	99	68	98
43	4	41	23
30	31	32	13
57	17	35	77

FL	BG	BW	IH
69	51	45	53
65	49	38	77
25	39	17	24
75	17	24	14
37	25	60	37
86	74	2	13
47	85	15	21
58	32	3	11
43	59	11	1
72	39	31	48
10	2	8	4
99	78	43	43
7	30	30	82
56	30	12	2
82	39	40	28
20	7	11	54
96	58	53	28
42	54	44	11
53	80	91	100
53	58	28	49
88	72	38	84
3	3	86	79
19	54	84	97
13	37	99	91

FL	BG	BW	IH
64	44	70	84
36	90	61	65
25	39	17	24
61	95	100	100
66	57	83	70
28	35	73	53
70	58	100	99
66	49	76	46
63	47	79	88
69	88	67	94
48	50	84	64
79	66	23	22
42	54	45	11
73	40	68	35
100	99	68	98
43	4	41	23
30	31	32	13
57	17	35	77

FL	BG	BW	IH
69	51	45	53
65	49	38	77
25	39	17	24
75	17	24	14
37	25	60	37
86	74	2	13
47	85	15	21
58	32	3	11
43	59	11	1
72	39	31	48
10	2	8	4
99	78	43	43
7	30	30	82
56	30	12	2
82	39	40	28
20	7	11	54
96	58	53	28
42	54	44	11
53	80	91	100
53	58	28	49
88	72	38	84
3	3	86	79
19	54	84	97
13	37	99	91

Article Menu

A New Geometric Metric in the Shape and Size Space of Curves in R n

Abstract

1. Introduction

2. Classical Spaces of Curves in R n for the SRVF Representation

3. A New Metric in the Shape and Size Space of Curves in R n

3.1. An Isometry between L 2 ( [ 0 , 1 ] , R n ) \ { 0 → } and C 0 × R

The Functions R, π and F and Their Properties

3.2. The Isometry between S 2 and S 4 × R

4. The Mean Shape

5. Geodesics

6. Application to a Simulated Data Set

7. Detection of Outliers

8. Application to a Real Data Set

8.1. Foot Database

8.2. Detection of Foot Outliers

9. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2. Classical Spaces of Curves in $R^{n}$ for the SRVF Representation

3. A New Metric in the Shape and Size Space of Curves in $R^{n}$

3.1. An Isometry between $L^{2} ([0, 1], R^{n}) \ {\vec{0}}$ and $C^{0} \times R$

The Functions R, $π$ and F and Their Properties

3.2. The Isometry between $S_{2}$ and $S_{4} \times R$

FL	BG	BW	IH
64	44	70	84
36	90	61	65
25	39	17	24
61	95	100	100
66	57	83	70
28	35	73	53
70	58	100	99
66	49	76	46
63	47	79	88
69	88	67	94
48	50	84	64
79	66	23	22
42	54	45	11
73	40	68	35
100	99	68	98
43	4	41	23
30	31	32	13
57	17	35	77

FL	BG	BW	IH
69	51	45	53
65	49	38	77
25	39	17	24
75	17	24	14
37	25	60	37
86	74	2	13
47	85	15	21
58	32	3	11
43	59	11	1
72	39	31	48
10	2	8	4
99	78	43	43
7	30	30	82
56	30	12	2
82	39	40	28
20	7	11	54
96	58	53	28
42	54	44	11
53	80	91	100
53	58	28	49
88	72	38	84
3	3	86	79
19	54	84	97
13	37	99	91