A Metric for Finite Power Multisets of Positive Real Numbers Based on Minimal Matching

Ray-Ming Chen

doi:10.3390/axioms7040094

Abstract

In this article, we show how to define a metric on the finite power multisets of positive real numbers. The metric, based on the minimal matching, consists of two parts: the matched part and the mismatched part. We also give some concrete applications and examples to demonstrate the validity of this metric.

Keywords:

metric; minimal matching; positive multisets

1. Introduction

A multiset, unlike a Cantorian set, is a collection of elements whose instances might be multiple (the number of its instances of an element is named multiplicity). The cardinality of a multiset A is defined by the sum of the multiplicities with respect to their corresponding elements and is denoted by

{| A |}_{m}

. For example, the cardinality of multiset

A = {2, 2, 3, 3, 3, 6, 11}

is 7, i.e.,

{| A |}_{m} = 7

. Though unconventional, the theory of multisets has well been developed (see Reference [1]) and it also has various applications in many situations (see Reference [2]). From practical point of view, multisets are easier to represent or simulate than mathematical objects with multiple instances. In this article, we mainly focus on the finite power multisets of positive real numbers.

Let

R^{+}

denote the set of all positive real numbers. Let

N_{0}

denote the set of natural numbers including 0. Let power multiset

MP (R^{+})

denote the set of all the sub-multisets of

R^{+}

. Suppose

K \subseteq MP (R^{+})

is an arbitrary set of some sub-multisets in

R^{+}

( each multiset is finite) of

MP (R^{+})

. We call

K

a finite power multiset of positive real numbers. The main result in this article is to define a metric on

K

based on the concept of minimal matching. The distance between any two multisets consists of two separated parts: the matched part and mismatched part. Matching has been an important problem and has wide applications in the fields of artificial intelligence, graph theories, and operation research (see References [3,4,5]). In this article, we come up with a new metric which is based on the concept of minimal matching. This metric is used to measure the distance between any two finite multisets of positive real numbers. Though what we define in this article is a standard metric, the whole setting could also be extended to other generalized metrics, for example,

G

–metric (see Reference [6]).

2. Definitions

In this section, we introduce and present multisets via the forms of functions. The basic concepts could be found in many textbooks or journals (see, e.g., References [7,8]). Let

Γ

denote the set

R^{+} \to N_{0}

, i.e., the set of all the functions from

R^{+}

to

N_{0}

. Let

D_{f}

denote the domain of a function f. Let set

D_{f}^{*} = {r \in R^{+} : f (r) \neq 0}

be the non-zero domain of f.

2.1. Multisets

Let

Γ^{<}

denote all the finite multi-subsets of

R^{+}

, i.e.,

Γ^{<} = {f \in Γ : | D_{f}^{*} | < \infty}

. Each element in

Γ^{<}

is simply named a multiset in this article. If for all

x \in R^{+}

,

f (x) \leq g (x)

, we say f is a multi-subset of g, denoted by

f \leq g

. Let

f, g \in Γ^{<}

be arbitrary.

Definition 1.

(Empty Multiset) We call the zero function in

Γ^{<}

the empty multiset.

Definition 2.

(Equality =)

f = g

if and only if

f \leq g

and

g \leq f

.

Definition 3.

(Intersection ∧) The intersection of f and g, denoted by the function

f \land g : R^{+} \to N_{0}

, is defined by

(f \land g) (a) : = m i n {f (a), g (a)}

for all

a \in R^{+}

.

Definition 4.

(Union ∨) The union of f and g, denoted by the function

f \lor g : R^{+} \to N_{0}

, is defined by

(f \lor g) (a) : = m a x {f (a), g (a)}

for all

a \in R^{+}

.

Definition 5.

(Difference ⊖) Exclusion of g from f, denoted by the function

f ⊖ g : R^{+} \to N_{0}

, is defined by

(f ⊖ g) (a) : = f (a) - (f \land g) (a)

.

Each multiset f in

Γ^{<}

could be uniquely represented by the following descending form (named a representative descending form):

f^{-} = (a_{1}^{f (a_{1})}, a_{2}^{f (a_{2})}, \dots, a_{n}^{f (a_{n})}),

(1)

or in brief

f^{-} = a_{1}^{f (a_{1})} a_{2}^{f (a_{2})} \dots a_{n}^{f (a_{n})}

; or by the following ascending form (named a representative ascending form):

f^{+} = (a_{n}^{f (a_{n})} a_{n - 1}^{f (a_{n - 1})}, \dots, a_{2}^{f (a_{2})} a_{1}^{f (a_{1})}),

(2)

or in brief

f^{+} = a_{n}^{f (a_{n})} a_{n - 1}^{f (a_{n - 1})} \dots a_{1}^{f (a_{1})}

, where

a_{1} > a_{2} > a_{3} \dots > a_{n} > 0

and

a_{1}, a_{2}, \dots, a_{n} \in D_{f}^{*}

and

f (a_{v}) > 0

for all

1 \leq v \leq n

. Let

| f | = f (a_{1}) + f (a_{2}) + \dots + f (a_{n})

.

Definition 6.

(Descending) Define the

p - t h

element in f by function

O D

as follows:

O D (p, f) : = \{\begin{matrix} a_{1} & i f 1 \leq p \leq f (a_{1}); \\ a_{j} & i f \sum_{l = 1}^{j - 1} f (a_{l}) < p \leq \sum_{l = 1}^{j} f (a_{l}) a n d | D_{f}^{*} | \geq j \geq 2; \\ 0 & otherwise . \end{matrix}

Definition 7.

(Ascending) Define the

p - t h

element in f by function

O A

as follows:

O A (p, f) : = \{\begin{matrix} a_{n} & i f 1 \leq p \leq f (a_{n}); \\ a_{n - j} & i f \sum_{l = 0}^{j - 1} f (a_{n - l}) < p \leq \sum_{l = 0}^{j} f (a_{n - l}) a n d | D_{f}^{*} | \geq j \geq 1; \\ 0 & otherwise . \end{matrix}

2.2. Background

In this article, we show how to define a metric on

K

(see Introduction). For any Cantorian set S, we use

| S |

to denote the cardinality of S. Let d be an arbitrary metric on

R^{+}

satisfying

\begin{matrix} d (a, b) \leq a + b, \end{matrix}

(3)

\begin{matrix} d (a, b) + a \geq b, \end{matrix}

(4)

\begin{matrix} d (a, b) + b \geq a, \end{matrix}

(5)

for all

a, b, c \in R^{+}

. Observe that d is a metric (for our generalization purpose) on

R^{+} \times R^{+}

, which lays a foundation for our latter definition of a metric on

K

. Let

A, B, C \in K

be arbitrary. Let

A \to B

denote the set of all the functions from A to B, in which the repeated elements are deemed distinct.

Example 1.

Suppose

A = {2, 2, 3, 5}

and

B = {6, 8, 8, 8, 8, 9, 12, 12}

, and

ρ (2) = 9, ρ (2) = 6, ρ (3) = 8,

ρ (5) = 12

. Then,

ρ \in A \to B

. For clarity, one could simply associate A and B with their ranked multiplicities as follows:

A = {(2, 1), (2, 2), (3, 1), (5, 1)}

and

B = {(6, 1), (8, 1), (8, 2), (8, 3), (8, 4), (9, 1), (12, 1), (12, 2)}

and ρ could also be represented by

ρ (2, 1) = (9, 1), ρ (2, 2) = (6, 1), ρ (3, 1) = (8, 1), ρ (5, 1) = (12, 1)

. To save space, we simply use

ρ (2_{1}) = 9_{1}, ρ (2_{2}) = 6_{1}, ρ (3_{1}) = 8_{1}, ρ (5_{1}) = 12_{1}

for the representation in this article.

For any function

φ

, we use

D_{φ}

and

R_{φ}

to denote its domain and codomain, respectively. For the previous example,

D_{ρ} = A

and

R_{ρ} = B

. We use

φ (S)

to denote the image

{φ (s) : s \in S}

, in particular

φ (D_{φ})

to denote the image of

φ

and

φ^{- 1} (S)

to denote the pre-image of S. If

S \subseteq D_{φ}

, we use

φ | S

to denote

φ

whose domain is restricted to S. One candidate in mind is

d (a, b) : = | a - b |

.

Definition 8.

(Bijective embeddings) Define

B F [A \to B] : = {φ \in A \to B : | A |_{m} = | φ (A) |_{m}},

B F [B \to A] : = {φ \in B \to A : | B |_{m} = | φ (B) |_{m}},

B F [A, B] : = B F [A \to B] \cup B F [B \to A] .

Example 2.

Suppose ρ is defined in Example 1. Since

{| A |}_{m} = 4 = {| ρ (A) |}_{m}

, by the above definition, one has

ρ \in B F [A \to B]

. On the other hand, suppose

κ (2_{1}) = 6_{1}, κ (2_{2}) = 6_{1}, κ (3_{1}) = 8, κ (5_{1}) = 12

, then

κ \notin B F [A \to B]

, since

{| A |}_{m} = 4 \neq {| κ (A) |}_{m} = 3

. Note that

B F [A, B] = B F [B, A]

. Moreover, if

{| A |}_{m} > {| B |}_{m}

, then

B F [A \to B] = \emptyset

; similarly, if

{| B |}_{m} > {| A |}_{m}

, then

B F [B \to A] = \emptyset

. Take A and B in Example 1 for example. One has

B F [B \to A] = \emptyset

.

Definition 9.

For any function

φ \in B F [A \to B]

, we call it a a matched function. We call

(a, φ (a))

a matched pair. Every remaining element in

B - φ (A)

is called a mismatched element.

On this basis, we could define the distance for the matched elements and the distance for the mismatched elements as follows:

Definition 10.

For any

φ \in B F [A, B]

, define

‖ φ ‖ : = \sum_{e \in D_{φ}} {d (e, φ (e)) a n d ‖ φ ‖}^{-} : = \sum_{e \in R_{φ} - φ (D_{φ})} e,

where

D_{φ}

and

R_{φ}

and denote the domain and codomain of φ, respectively.

‖ φ ‖

represents the distance of all the matched elements (or the sum of the distances of all the matched pairs), while

{‖ φ ‖}^{-}

represents the distance of all the mismatched elements in the range.

{‖ φ ‖}^{-} = 0

iff

| A | = | B |

. For example, if

A = {1, 1, 2, 3, 1}, B = {2, 4, 6, 2}

and

φ : A \to B

is defined by

φ (1_{1}) = 2, φ (1_{2}) = 6_{1}, φ (2) = 2_{1}, φ (3) = 2_{2}

, then the matched part yields

‖ φ ‖ = | 1 - 2 | + | 1 - 6 | + | 2 - 2 | + | 3 - 2 | = 7

, where

i_{n}

denotes the

n -

th repetition of i and the mismatched part

{‖ φ ‖}^{-} = 1

. Next, we define the set of all minimal distances consisting of the matched parts and the mismatched parts.

Definition 11.

(Minimal matched functions) Define

\begin{matrix} B F_{*} [A \to B] \\ : = {φ \in B F [A \to B] : ‖ φ ‖ + ‖ φ | |^{-} \leq ‖ ψ ‖ + ‖ ψ | |^{-}, \forall ψ \in B F [A \to B]}, \\ B F_{*} [B \to A] \\ : = {φ \in B F [B \to A] : ‖ φ ‖ + ‖ φ ‖^{-} \leq ‖ ψ ‖ + ‖ ψ ‖^{-}, \forall ψ \in B F [B \to A]}, \\ B F_{*} [A, B] \\ : = {φ \in B F [A, B] : ‖ φ ‖ + ‖ φ ‖^{-} \leq ‖ ψ ‖ + ‖ ψ ‖^{-}, \forall ψ \in B F [A, B]} . \end{matrix}

Definition 12.

(Distance function) Define

δ : K \times K \to R^{+}

by

δ (A, B) : = m i n {‖ φ ‖ + ‖ φ ‖^{-} : φ \in B F [A, B]} .

(6)

By the definition, one has

δ (A, B) = ‖ φ ‖ + {‖ φ ‖}^{-}

for any

φ \in B F_{*} [A, B]

. In the following, we show that

δ

is indeed a metric. The reasoning will be proceeded by their relations (i.e., larger, less than and equal to) between cardinalities of

A, B

, and C, i.e.,

{| A |}_{m}, {| B |}_{m}

, and

{| C |}_{m}

. To validate that

δ

is a metric, we need to consider all the 27 relations between

{| A |}_{m}, {| B |}_{m}

and

{| C |}_{m}

: for example,

{| A |}_{m} {> | B |}_{m} {> | C |}_{m} {, | A |}_{m} = {| B |}_{m} < {| C |}_{m}, e t c

. In order to facilitate our computing, we encode the 27 relations by the following set

{(n_{1}, n_{2}, n_{3}) : n_{1}, n_{2}, n_{3} \in {1, 2, 3}},

in which each

(n_{1}, n_{2}, n_{3})

represents the relation

{| A |}_{m} n_{1} {| B |}_{m} {, | B |}_{m} n_{2} {| C |}_{m}

and

{| A |}_{m} n_{3} {| C |}_{m}

, respectively, where

1, 2

, and 3 represent the relation

<, =

and > correspondingly. For example,

(1, 2, 3)

represents the relation

{| A |}_{m} {< | B |}_{m} {, | B |}_{m} = {| C |}_{m}

, and

{| A |}_{m} > {| C |}_{m}

. By the transitivity of their cardinalities, only 13 of the 27 relations are valid (shown in Lemma 1). Moreover, these 13 relations could be further reduced to 8 relations by the symmetry of

δ

, i.e.,

δ (A, B) + δ (B, C) \geq δ (A, C) \Leftrightarrow δ (C, B) + δ (B, A) \geq δ (C, A),

(7)

as shown in Corollary 1. If

φ

is a bijective function, we use

φ^{- 1}

to denote its inverse function. In the following, let

φ \in B F_{*} [A, B], \tilde{φ} \in B F_{*} [B, C]

, and

\tilde{\tilde{φ}} \in B F_{*} [A, C]

be arbitrary. Before we proceed further, we have the definitions:

We use $B_{A}$ to denote $φ (A)$ , if ${| A |}_{m} \leq {| B |}_{m}$ and $A_{B}$ to denote $φ (B)$ , if ${| A |}_{m} > {| B |}_{m}$ .
We use $C_{B}$ to denote $\tilde{φ} (B)$ , if ${| B |}_{m} \leq {| C |}_{m}$ and $B_{C}$ to denote $\tilde{φ} (C)$ , if ${| B |}_{m} > {| C |}_{m}$ .
We use $C_{A}$ to denote $\tilde{\tilde{φ}} (A)$ , if ${| A |}_{m} \leq {| C |}_{m}$ and $A_{C}$ to denote $\tilde{\tilde{φ}} (C)$ , if ${| C |}_{m} < {| A |}_{m}$ .

Though there are 27 relations between the cardinalities of

A, B

, and C, only 13 of them are valid as shown in the following lemma.

Lemma 1.

There are only 13 relations which do not violate the transitivity property in terms of their cardinalities:

\begin{matrix} (1, 1, 1), (1, 2, 1), (1, 3, 1), (1, 3, 2), (1, 3, 3), (2, 1, 1), \\ (2, 2, 2), (2, 3, 3), (3, 1, 1), (3, 1, 2), (3, 1, 3), (3, 2, 3), (3, 3, 3) . \end{matrix}

Proof.

The result follows immediately from their relations. Take the relation

(1, 1, 1)

for example. Recall that

(1, 1, 1)

represents the relation

{| A |}_{m} {< | B |}_{m} < {| C |}_{m}

, in which the property of transitivity

{| A |}_{m} < {| C |}_{m}

holds. One could verify that each of the other 12 relations also holds the transitivity property. However, the other 15 relations fail the transitivity property: for example

(1, 1, 3)

( i.e.,

{| A |}_{m} {< | B |}_{m} {, | B |}_{m} {< | C |}_{m} {, | A |}_{m} > {| C |}_{m}

). ☐

Lemma 2.

(Non-negative, symmetric)

$δ (A, B) \geq 0$ .
$δ (A, B) = 0$ iff $A = B$ .
$δ (A, B) = δ (B, A)$ .

Proof.

The first statement follows immediately from the definition and the third one follows from the fact that

B F [A, B] = B F [B, A]

. Here we show the second one. Suppose

A = B

, then

δ (A, B) = ‖ I ‖ = 0

, where I is the identity function. Suppose

A \neq B

. Then, there are two cases: either

{| A |}_{m} \neq {| B |}_{m}

or

{| A |}_{m} = {| B |}_{m}

. For the former one, one has

\forall φ \in B F [A, B] (‖ φ ‖^{-} > 0)

, i.e.,

δ (A, B) > 0

. For the latter one, one has

I \notin B F [A, B]

and thus

\forall φ \in B F [A, B] (‖ φ ‖ > 0)

, i.e.,

δ (A, B) > 0

. Hence, we have shown

δ (A, B) = 0

iff

A = B

. ☐

In the following, we show the triangle inequality of

δ

. Let us show the following corollary first.

Corollary 1.

To show δ satisfy the triangle inequality, it suffices to consider the following eight relations:

\begin{matrix} (2, 2, 2), (2, 3, 3), (2, 1, 1), (3, 1, 2), \\ (1, 3, 2), (1, 1, 1), (3, 1, 1), (1, 3, 1) . \end{matrix}

Proof.

By Equation (7) and Lemma 1, A and C are interchangeable, i.e., the relations

(1, 1, 1), (2, 3, 3), (1, 3, 1), (2, 1, 1), (3, 1, 1)

are equivalent to (respectively)

(3, 3, 3), (1, 2, 1), (1, 3, 3), (3, 2, 3), (3, 1, 3) .

☐

By this corollary, we only need to consider the triangle inequality of the above-mentioned eight relations.

Lemma 3.

(Relation (2, 2, 2))

If

{| A |}_{m} = {| B |}_{m} = {| C |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

Since

{| A |}_{m} = {| B |}_{m} = {| C |}_{m}

, it follows

δ (A, B) = ‖ φ ‖ = \sum_{e \in A} d (e, φ (e))

and

δ (B, C) = ‖ \tilde{φ} ‖ = \sum_{h \in A} d (h, \tilde{φ} (h)) = \sum_{e \in A} d (φ (e), \tilde{φ} \circ φ (e)) .

Since

\tilde{φ} \circ φ \in B F (A, C)

, by the definition of d, it follows

δ (A, B) + δ (B, C) \geq \sum_{e \in A} d (e, \tilde{φ} \circ φ (e)) \geq \sum_{e \in A} d (e, \tilde{\tilde{φ}} (e)),

i.e.,

δ (A, B) + δ (B, C) \geq δ (A, C)

. ☐

Lemma 4.

(Relation (2, 3, 3))

If

{| A |}_{m} = {| B |}_{m} > {| C |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

By the definition

δ

, it follows

δ (A, C) = \sum_{a \in A - A_{C}} a + \sum_{a \in A_{C}} d (a, \tilde{\tilde{φ}} (a)) \leq \sum_{a \in A - A_{B_{C}}} a + \sum_{a \in A_{B_{C}}} d (a, \tilde{φ} \circ φ (a)),

where

A_{B_{C}}

denotes

φ^{- 1} (B_{C})

. Furthermore,

δ (A, B) = \sum_{a \in A - A_{B_{C}}} d (a, φ (a)) + \sum_{a \in A_{B_{C}}} d (a, φ (a)),

\begin{matrix} δ (B, C) = \sum_{b \in B - B_{C}} b + \sum_{b \in B_{C}} d (b, \tilde{φ} (b)) \\ = \sum_{a \in A - A_{B_{C}}} φ (a) + \sum_{a \in A_{B_{C}}} d (φ (a), \tilde{φ} \circ φ (a)) . \end{matrix}

Henceforth, by the properties of d and the definition of

δ

\begin{matrix} δ (A, B) + δ (B, C) \\ = \sum_{a \in A - A_{B_{C}}} [d (a, φ (a)) + φ (a)] + \sum_{a \in A_{B_{C}}} [d (a, φ (a)) + d (φ (a), \tilde{φ} \circ φ (a))] \\ \geq \sum_{a \in A - A_{B_{C}}} a + \sum_{a \in A_{B_{C}}} d (a, \tilde{φ} \circ φ (a)) \geq δ (A, C) . \end{matrix}

☐

Lemma 5.

(Relation (2, 1, 1))

If

{| A |}_{m} = {| B |}_{m} < {| C |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

By the definition

δ

, it follows

δ (A, C) = \sum_{c \in C - C_{A}} c + \sum_{a \in A} d (a, \tilde{\tilde{φ}} (a)) \leq \sum_{c \in C - C_{B}} c + \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)),

δ (A, B) = \sum_{a \in A} d (a, φ (a)),

δ (B, C) = \sum_{c \in C - C_{B}} c + \sum_{b \in B} d (b, \tilde{φ} (b)) = \sum_{c \in C - C_{B}} c + \sum_{a \in A} d (φ (a), \tilde{φ} \circ φ (a)) .

Henceforth, by the triangle inequality of d

\begin{matrix} δ (A, B) + δ (B, C) \\ = \sum_{a \in A} d (a, φ (a)) + \sum_{c \in C - C_{B}} c + \sum_{b \in B} d (b, \tilde{φ} (b)) \\ = \sum_{a \in A} d (a, φ (a)) + \sum_{c \in C - C_{B}} c + \sum_{a \in A} d (φ (a), \tilde{φ} \circ φ (a)) \\ \geq \sum_{c \in C - C_{B}} c + \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)) \geq \sum_{c \in C - C_{A}} c + \sum_{a \in A} d (a, \tilde{\tilde{φ}} (a)) \\ = δ (A, C) . \end{matrix}

☐

Lemma 6.

(Relation (3, 1, 2))

If

{| A |}_{m} = {| C |}_{m} > {| B |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

Since

δ (A, C) = \sum_{a \in A} d (a, \tilde{\tilde{φ}} (a)),

δ (A, B) = \sum_{a \in A - A_{B}} a + \sum_{a \in A_{B}} d (a, φ (a)),

\begin{matrix} δ (B, C) = \sum_{c \in C - C_{B}} c + \sum_{b \in B} d (b, \tilde{φ} (b)) = \sum_{c \in C - C_{B}} c + \sum_{a \in A_{B}} d (φ (a), \tilde{φ} \circ φ (a)) . \end{matrix}

By the triangle inequality of d and the definitions of

δ

\begin{matrix} δ (A, B) + δ (B, C) \\ \geq \sum_{A - A_{B}} a + \sum_{c \in C - C_{B}} c + \sum_{a \in A_{B}} [d (a, φ (a)) + d (φ (a), \tilde{φ} \circ φ (a))] \\ \geq \sum_{a \in A - A_{B}} [a + ψ (a)] + \sum_{a \in A_{B}} [d (a, \tilde{φ} \circ φ (a))] for some bijective function ψ \\ between A - A_{B} and C - C_{B} \\ \geq \sum_{a \in A - A_{B}} d (a, ψ (a)) + \sum_{a \in A_{B}} [d (a, \tilde{φ} \circ φ (a))] for some bijective function ψ \\ between A - A_{B} and C - C_{B} \\ \geq δ (A, C) (since the coupling of {ψ |}_{A - A_{B}} {and φ |}_{A_{B}} lies in B F (A, C)) . \end{matrix}

☐

Lemma 7.

(Relation (1, 3, 2))

If

{| A |}_{m} = {| C |}_{m} < {| B |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

By the definitions,

\begin{matrix} δ (A, B) = \sum_{b \in B - B_{A}} b + \sum_{a \in A} d (a, φ (a)) \\ = \sum_{b \in B - B_{A}} b + \sum_{a \in A_{B_{A} \cap B_{C}}} d (a, φ (a)) + \sum_{a \in A - A_{B_{A} \cap B_{C}}} d (a, φ (a)) \\ \geq \sum_{b \in B_{C} - B_{A} \cap B_{C}} b + \sum_{a \in A_{B_{A} \cap B_{C}}} d (a, φ (a)) + \sum_{a \in A - A_{B_{A} \cap B_{C}}} d (a, φ (a)) \\ = \sum_{b \in B_{C} - B_{A} \cap B_{C}} b + \sum_{a \in A_{B_{A} \cap B_{C}}} d (a, φ (a)) + \sum_{a \in A - A_{B_{A} \cap B_{C}}} d (a, φ (a)) \end{matrix}

where

A_{B_{A} \cap B_{C}}

denotes

φ^{- 1} (B_{A} \cap B_{C})

. Moreover,

\begin{matrix} δ (B, C) \\ = \sum_{b \in B - B_{C}} b + \sum_{b \in B_{C}} d (b, \tilde{φ} (b)) \\ = \sum_{b \in B - B_{C}} b + \sum_{b \in B_{C} - B_{A} \cap B_{C}} d (b, \tilde{φ} (b)) + \sum_{b \in B_{A} \cap B_{C}} d (b, \tilde{φ} (b)) \\ \geq \sum_{b \in B_{A} - B_{A} \cap B_{C}} b + \sum_{b \in B_{C} - B_{A} \cap B_{C}} d (b, \tilde{φ} (b)) + \sum_{b \in B_{A} \cap B_{C}} d (b, \tilde{φ} (b)) \\ = \sum_{a \in A - A_{B_{A} \cap B_{C}}} φ (a) + \sum_{b \in B_{C} - B_{A} \cap B_{C}} d (b, \tilde{φ} (b)) + \sum_{a \in A_{B_{A} \cap B_{C}}} d (φ (a), \tilde{φ} \circ φ (a)) . \end{matrix}

Hence, by the triangle inequality of d and the definitions of

δ

\begin{matrix} δ (A, B) + δ (B, C) \\ \geq \sum_{a \in A - A_{B_{A} \cap B_{C}}} [φ (a) + d (a, φ (a))] + \sum_{b \in B_{C} - B_{A} \cap B_{C}} [b + d (b, \tilde{φ} (b))] \\ \sum_{a \in A_{B_{A} \cap B_{C}}} [d (a, φ (a)) + d (φ (a), \tilde{φ} \circ φ (a))] \\ \geq \sum_{a \in A - A_{B_{A} \cap B_{C}}} a + \sum_{b \in B_{C} - B_{A} \cap B_{C}} \tilde{φ} (b) + \sum_{a \in A_{B_{A} \cap B_{C}}} d (a, \tilde{φ} \circ φ (a)) \\ \geq \sum_{a \in A - A_{B_{A} \cap B_{C}}} d (a, ψ (a)) + \sum_{a \in A_{B_{A} \cap B_{C}}} d (a, \tilde{φ} \circ φ (a)), \\ where ψ \in B F (A - A_{B_{A} \cap B_{C}}, B_{C} - B_{A} \cap B_{C}) \\ \geq \sum_{a \in A} d (a, \tilde{\tilde{φ}} (a)) = δ (A, C) . \end{matrix}

☐

Lemma 8.

(Relation (1, 1, 1))

If

{| A |}_{m} {< | B |}_{m} < {| C |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

By the definitions of

δ

δ (A, C) = \sum_{c \in C - C_{A}} c + \sum_{a \in A} d (a, \tilde{\tilde{φ}} (a)) \leq \sum_{c \in C - C_{B_{A}}} c + \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)),

where

C_{B_{A}}

denotes

\tilde{φ} (B_{A})

;

δ (A, B) = \sum_{b \in B - B_{A}} b + \sum_{a \in A} d (a, φ (a)) .

Furthermore,

\begin{matrix} δ (B, C) = \sum_{c \in C - C_{B}} c + \sum_{b \in B} d (b, \tilde{φ} (b)) \\ = \sum_{c \in C - C_{B}} c + \sum_{b \in B_{A}} d (b, \tilde{φ} (b)) + \sum_{b \in B - B_{A}} d (b, \tilde{φ} (b)) \\ = \sum_{c \in C - C_{B}} c + \sum_{a \in A} d (φ (a), \tilde{φ} \circ φ (a)) + \sum_{b \in B - B_{A}} d (b, \tilde{φ} (b)) . \end{matrix}

Then, by the triangle inequality of d and the definitions of

δ

\begin{matrix} δ (A, B) + δ (B, C) \\ \geq \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)) + \sum_{b \in B - B_{A}} [b + d (b, \tilde{φ} (b))] + \sum_{c \in C - C_{B}} c \\ \geq \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)) + \sum_{b \in B - B_{A}} \tilde{φ} (b) + \sum_{c \in C - C_{B}} c (by Equation (4)) \\ = \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)) + \sum_{c \in C_{B} - C_{B_{A}}} c + \sum_{c \in C - C_{B}} c \\ \geq \sum_{a \in A} d (a, \tilde{φ} \circ φ (a)) + \sum_{c \in C - C_{B_{A}}} c \geq δ (A, C) . \end{matrix}

☐

Lemma 9.

(Relation (3, 1, 1))

If

{| C |}_{m} {> | A |}_{m} > {| B |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

We derive the three components one by one. Firstly, suppose

\bar{\bar{φ}} \in B M (A \to B)

is a function satisfying

\bar{\bar{φ}} | (A_{B}) = C_{B}

(as shown in Figure 1), i.e.,

\bar{\bar{φ}} (a) = \tilde{φ} \circ φ^{- 1} (a)

for all

a \in A_{B}

. By the definitions of

δ

\begin{matrix} δ (A, C) = \sum_{c \in C - C_{A}} c + \sum_{a \in A} d (a, \tilde{\tilde{φ}} (a)) \\ \leq \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{a \in A} d (a, \bar{\bar{φ}} (a)) \\ = \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{a \in A_{B}} d (a, \bar{\bar{φ}} (a)) + \sum_{a \in A - A_{B}} d (a, \bar{\bar{φ}} (a)) \end{matrix}

Figure 1. Triangle Inequality for

(3, 1, 1)

case.

Secondly,

δ (A, B) = \sum_{a \in A - A_{B}} a + \sum_{a \in A_{B}} d (a, φ^{- 1} (a)) .

Thirdly,

\begin{matrix} δ (B, C) = \sum_{c \in C - C_{B}} c + \sum_{b \in B} d (b, \tilde{φ} (b)) \\ = \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{c \in \bar{\bar{φ}} (A) - \bar{\bar{φ}} (A_{B})} c + \sum_{b \in B} d (b, \tilde{φ} (b)) \\ = \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{c \in \bar{\bar{φ}} (A) - \bar{\bar{φ}} (A_{B})} c + \sum_{a \in A_{B}} d (φ^{- 1} (a), \tilde{φ} \circ φ^{- 1} (a)) \\ = \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{c \in \bar{\bar{φ}} (A) - \bar{\bar{φ}} (A_{B})} c + \sum_{a \in A_{B}} d (φ^{- 1} (a), \bar{\bar{φ}} (a)) \\ = \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{a \in A - A_{B}} \bar{\bar{φ}} (a) + \sum_{a \in A_{B}} d (φ^{- 1} (a), \bar{\bar{φ}} (a)) \end{matrix}

Hence,

\begin{matrix} δ (A, B) + δ (B, C) \geq \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{a \in A_{B}} d (a, \bar{\bar{φ}} (a)) \\ + \sum_{a \in A - A_{B}} [a + \bar{\bar{φ}} (a)] \\ \geq \sum_{c \in C - \bar{\bar{φ}} (A)} c + \sum_{a \in A_{B}} d (a, \bar{\bar{φ}} (a)) + \sum_{a \in A - A_{B}} d (a, \bar{\bar{φ}} (a)) (Equation (3)) \\ \geq δ (A, C) . \end{matrix}

☐

Lemma 10.

(Relation (1, 3, 1))

If

{| B |}_{m} {> | C |}_{m} > {| A |}_{m}

, then

δ (A, B) + δ (B, C) \geq δ (A, C)

.

Proof.

Suppose

A_{2} = φ^{- 1} (B_{A} \cap B_{C}) \equiv A_{B_{A} \cap B_{C}}

. Suppose

A_{1} = A - A_{2}

. Suppose

B_{A_{1}} = φ (A_{1}) = B_{A} - B_{A} \cap B_{C}, B_{A_{2}} = φ (A_{2}) = B_{A} \cap B_{C} .

Choose a function

ψ \in B F (A, B_{C})

, in which

\forall a \in A_{2} [ψ (a) = φ (a)]

, i.e.,

ψ | A_{2} = φ | A_{2}

. Suppose

\bar{B} = B_{A_{2}} \cup ψ (A_{1}) \equiv B_{A_{2}} \cup B_{A_{1}}^{*}

, where

B_{A_{1}}^{*} \equiv ψ (A_{1})

. Let

ρ

denote the composition

{\tilde{φ}}^{- 1} | B \circ ψ

(or simply

{\tilde{φ}}^{- 1} \circ ψ

). Then,

ρ \in B F [A, C]

and

ρ (A) = {\tilde{φ}}^{- 1} \circ ψ (A) = {\tilde{φ}}^{- 1} \circ ψ (A_{1}) \cup {\tilde{φ}}^{- 1} \circ ψ (A_{2}) = {\tilde{φ}}^{- 1} \circ ψ (A_{1}) \cup {\tilde{φ}}^{- 1} (B_{A_{2}}),

as shown in Figure 2. Furthermore, by the definition of

δ

,

\begin{matrix} δ (A, C) = \sum_{c \in C - C_{A}} c + \sum_{c \in C_{A}} d (a, \tilde{\tilde{φ}} (a)) \leq \sum_{c \in C - ρ (A)} c + \sum_{a \in A} d (a, ρ (a)) \\ = \sum_{c \in C - ρ (A)} c + \sum_{a \in A_{1}} d (a, ρ (a)) + \sum_{a \in A_{2}} d (a, ρ (a)); \end{matrix}

\begin{matrix} δ (A, B) = \sum_{b \in B - B_{A}} b + \sum_{a \in A} d (a, φ (a)) \\ = \sum_{b \in B - B_{A}} b + \sum_{a \in A_{1}} d (a, φ (a)) + \sum_{a \in A_{2}} d (a, φ (a)) \\ = \sum_{b \in B - B_{A} \cup B_{C}} b + \sum_{b \in B_{C} - B_{A_{1}} \cup B_{A_{2}}} b + \sum_{b \in B_{A_{1}}} b \\ + \sum_{a \in A_{1}} d (a, φ (a)) + \sum_{a \in A_{2}} d (a, φ (a)) \\ \geq \sum_{b \in B_{C} - \bar{B}} b + \sum_{b \in B_{A_{1}}^{*}} b + \sum_{a \in A_{1}} d (a, φ (a)) + \sum_{a \in A_{2}} d (a, φ (a)) \end{matrix}

\begin{matrix} δ (B, C) = \sum_{b \in B - B_{C}} b + \sum_{b \in B_{C}} d (b, {\tilde{φ}}^{- 1} (b)) \\ = \sum_{b \in B - B_{C}} b + \sum_{b \in B_{A_{2}}} d (b, {\tilde{φ}}^{- 1} (b)) + \sum_{b \in B_{A_{1}}^{*}} d (b, {\tilde{φ}}^{- 1} (b)) + \sum_{b \in B_{C} - \bar{B}} d (b, {\tilde{φ}}^{- 1} (b)) \\ \geq \sum_{a \in A_{1}} φ (a) + \sum_{b \in B_{A_{2}}} d (b, {\tilde{φ}}^{- 1} (b)) + \sum_{b \in B_{A_{1}}^{*}} d (b, {\tilde{φ}}^{- 1} (b)) + \sum_{b \in B_{C} - \bar{B}} d (b, {\tilde{φ}}^{- 1} (b)) \\ = \sum_{a \in A_{1}} φ (a) + \sum_{a \in A_{2}} d (φ (a), ρ (a)) + \sum_{b \in B_{A_{1}}^{*}} d (b, {\tilde{φ}}^{- 1} (b)) + \sum_{b \in B_{C} - \bar{B}} d (b, {\tilde{φ}}^{- 1} (b)) . \end{matrix}

Figure 2. Triangle Inequality for

(1, 3, 1)

case.

Henceforth, by Equations (3)–(5), it follows

\begin{matrix} δ (A, B) + δ (B, C) \geq \sum_{b \in B_{C} - \bar{B}} [b + d (b, {\tilde{φ}}^{- 1} (b))] \\ + \sum_{b \in B_{A_{1}}^{*}} [b + d (b, {\tilde{φ}}^{- 1} (b))] + \sum_{a \in A_{1}} [d (a, φ (a)) + φ (a)] \\ + \sum_{a \in A_{2}} [d (a, φ (a)) + d (φ (a), ρ (a))] \\ \geq \sum_{b \in B_{C} - \bar{B}} {\tilde{φ}}^{- 1} (b) + \sum_{b \in B_{A_{1}}^{*}} {\tilde{φ}}^{- 1} (b) + \sum_{a \in A_{1}} a + \sum_{a \in A_{2}} d (a, ρ (a)) \\ = \sum_{b \in B_{C} - \bar{B}} {\tilde{φ}}^{- 1} (b) + \sum_{a \in A_{1}} [(ρ (a) + a] + \sum_{a \in A_{2}} d (a, ρ (a)) \\ \geq \sum_{b \in B_{C} - \bar{B}} {\tilde{φ}}^{- 1} (b) + \sum_{a \in A_{1}} d (a, ρ (a)) + \sum_{a \in A_{2}} d (a, ρ (a)) \\ = \sum_{c \in C - ρ (A)} c + \sum_{a \in A_{1}} d (a, ρ (a)) + \sum_{a \in A_{2}} d (a, ρ (a)) \\ \geq δ (A, C) . \end{matrix}

☐

Theorem 1.

(K, δ)

is a metric space.

Proof.

By Lemmas 2–10 and Corollary 1, the result follows immediately. ☐

3. Applications and Computations

In this section, we give a group of numerical data and demonstrate how to compute their distances (or adjacency matrix) via the metric

δ

. In order to facilitate our computing, we show the following lemmas first. Let

a_{1}, a_{2}, b_{1}, b_{2} \in R

be arbitrary.

3.1. Lemmas

Lemma 11.

If

a_{1} \leq a_{2}

and

b_{1} \leq b_{2}

, then

| a_{1} - b_{1} | + | a_{2} - b_{2} | \leq | a_{1} - b_{2} | + | a_{2} - b_{1} |

.

Proof.

Suppose

a_{2} = a_{1} + λ_{a}

, suppose

b_{2} = b_{1} + λ_{b}

, where

λ_{a}, λ_{b} \geq 0

. Let

k = a_{1} - b_{1}

. Then,

\begin{matrix} | a_{1} - b_{2} | + | a_{2} - b_{1} | - | a_{1} - b_{1} | - | a_{2} - b_{2} | \\ = | a_{1} - b_{1} - λ_{b} | + | a_{1} - b_{1} + λ_{a} | - | a_{1} - b_{1} | - | a_{1} - b_{1} + λ_{a} - λ_{b} | \\ = | k - λ_{b} | + | k + λ_{a} | - | k | - | k + λ_{a} - λ_{b} | . \end{matrix}

Furthermore, we consider the following cases:

$k = 0$ : Then,

$\begin{matrix} | k - λ_{b} | + | k + λ_{a} | - | k | - | k + λ_{a} - λ_{b} | \\ = λ_{b} + λ_{a} - | λ_{a} - λ_{b} | \geq 0; \end{matrix}$
$k > 0$ : Then,

$\begin{matrix} | k - λ_{b} | + | k + λ_{a} | - | k | - | k + λ_{a} - λ_{b} | \\ = | k - λ_{b} | + k + λ_{a} - k - | k + λ_{a} - λ_{b} | \geq 0; \end{matrix}$
$k < 0$ : Then,

$\begin{matrix} | k - λ_{b} | + | k + λ_{a} | - | k | - | k + λ_{a} - λ_{b} | \\ = - k + λ_{b} + | k + λ_{a} | + k - | k + λ_{a} - λ_{b} | \geq 0 . \end{matrix}$

Hence, we have shown $| a_{1} - b_{2} | + | a_{2} - b_{1} | - | a_{1} - b_{1} | - | a_{2} - b_{2} | \geq 0 .$

☐

Lemma 12.

Let

ρ \in B F_{*} (A, B)

be arbitrary. Let

e_{1}, e_{2} \in D_{ρ}

be arbitrary such that

e_{1} \leq e_{2}

. Then,

\exists η \in B F_{*} [A, B]

such that

η (e) = ρ (e)

for all

e \in D_{ρ} - {e_{1}, e_{2}}

and

η (e_{1}) \leq η (e_{2})

.

Proof.

If

ρ (e_{1}) \leq ρ (e_{2})

, then one simply chooses

η

to be

ρ

. If

ρ (e_{1}) > ρ (e_{2})

, then one could choose

η (e) = ρ (e)

for all

e \in D_{ρ} - {e_{1}, e_{2}}

and

η (e_{1}) : = ρ (e_{2})

and

η (e_{2}) : = ρ (e_{1})

. Then, one has

η (e_{1}) < η (e_{2})

. By Lemma 11, one has

| e_{1} - ρ (e_{2}) | + | e_{2} - ρ (e_{1}) | \leq | e_{1} - ρ (e_{1}) | + | e_{2} - ρ (e_{2}) |,

which together with

ρ \in B F_{*} [A, B]

yields

| e_{1} - ρ (e_{2}) | + | e_{2} - ρ (e_{1}) | = | e_{1} - ρ (e_{1}) | + | e_{2} - ρ (e_{2}) |,

i.e.,

| e_{1} - η (e_{1}) | + | e_{2} - η (e_{2}) | = | e_{1} - ρ (e_{1}) | + | e_{2} - ρ (e_{2}) |,

i.e.,

‖ η ‖ + {‖ η ‖}^{-} = ‖ ρ ‖ + {‖ ρ ‖}^{-}

, i.e.,

η \in B F_{*} [A, B]

. ☐

Corollary 2.

If

ρ \in B F_{*} [A, B]

and

D_{ρ} = {e_{1}, e_{2}, \dots, e_{n}}

with

e_{1} \leq e_{2} \leq \dots \leq e_{n}

, then

η \in B F_{*} [A, B]

, where

D_{η} = D_{ρ}

and

η (e_{1}) = m i n [ρ (D_{ρ})], η (e_{2}) = m i n [ρ (D_{ρ}) - {η (e_{1})}], \dots, η (k + 1) = m i n [ρ (D_{ρ}) - {η (e_{1}), η (e_{2}), \dots, η (e_{k})}]

for all

k \leq n - 1

.

Proof.

By applying Lemma 12 repeatedly, the result follows immediately. ☐

This corollary directly facilitates our computation in the next section. In addition, one could also simplify and redefine the metric

δ

by the result of this corollary.

3.2. Computation

In the following, we demonstrate the computation of our metric

δ

via a group of simulated data. Suppose

K = {A_{1}, A_{2}, A_{3}, A_{4}, A_{5}, A_{6}} \subseteq MP (R^{+})

is defined as follows:

$A_{1} = {91.67, 2, 39.53, 98.34, 8.78}$ ;
$A_{2} = {1.99, 62, 7, 9.52, 9, 8.11}$ ;
$A_{3} = {2.1, 6.22, 27.1, 9.67, 9.19, 81.29, 5.55, 12.41, 1.67, 11.08, 51.15, 0.33}$ ;
$A_{4} = {22.21, 61.26, 71.12, 29.61, 29.19, 29.29, 35.3, 40}$ ;
$A_{5} = {17.19, 2, 70.56, 9.52, 9.45, 18.16, 40}$ ;
$A_{6} = {1.26, 0.19, 2, 4.70, 8.56, 9.09}$ .

Suppose the distance function over

R^{+}

is defined by

d (e, φ (e)) : = | e - φ (e) | .

Then, our metric (defined in Equation (6)) derived from this d could be applied. We could then obtain the adjacency matrix of

K

via the following two methods:

(Method One) List all the permutations and find the optimal permutation and its associated distance, which is the summation of the matched and mismatched parts.
(Method Two) List all the combinations and find the optimal combination and its associated distance, which is the summation of the matched and mismatched parts.

Method One comes directly from the definition. By Corollary 2, Method Two is also justified. To demonstrate this, let us first compute

δ (A_{2}, A_{3})

. If Method One is applied, then one has to compute all the

P (12, 6) =

665,280 permutations, and measure the matched distances between these permutations and

A_{2}

and the mismatched distances between these permutations and

A_{3}

. If Method Two is applied, then one sorts the set

A_{2}

first, and then sorts each of the

C (12, 6) = 924

combinations to measure the matched part between each sorted combination and sorted

A_{2}

, and the mismatched part between the sorted combination and

A_{3}

. Both methods agree as follows:

δ (A_{2}, A_{3}) = m i n {‖ φ ‖ + {‖ φ ‖}^{-} : φ \in B F (A_{2}, A_{3})} = 120.14,

in which the matched distance is

‖ φ ‖ = 69.6

and the mismatched distance

{‖ φ ‖}^{-} = 50.54

, where the optimal

φ

is defined as follows:

φ (1.99) = 2.1, φ (62) = 81.29, φ (7) = 9.19, φ (9.52) = 51.15,

φ (9) = 12.41, φ (8.11) = 11.08

. Proceed similarly, the distances for other pairs

(A_{i}, A_{j})

could also be obtained and the resulting adjacency matrix is demonstrated in Figure 3.

Figure 3. Adjacency Matrix

{[δ (A_{i}, A_{j})]}_{i, j = 1}^{6}

.

One could verify that this adjacency matrix satisfies all the metric axioms, in particular,

δ (A_{i}, A_{j}) + δ (A_{J}, A_{k}) \geq δ (A_{i}, A_{k})

for all

i, j, k \in {1, 2, 3, 4, 5, 6}

.

4. Real World Applications

In addition to some trivial applications, one could consider other handy applications, for example, by replacing the usual Euclidean metric with our metric in the following fields:

k

–means, clustering analysis, graph comparisons, etc. (see Reference [2]). These are frequently-used techniques in analyzing data or theoretical computations. The author has also succeeded in defining a novel metric for graphs based on the metric defined in this article. This enables one to measure the distances between any two graphical structures or networks. The idea for the derived metric is to measure the differences between any two graphs by induction on vertexes. Suppose there are two graphs

G_{1}

and

G_{2}

with the same set of vertices V. For each vertex

v \in V

, one could then generate two multisets whose elements are the lengths between v and its respective set of endpoints in

G_{1}

and

G_{2}

. Then, he could compute the distances via the minimal sum of matched elements and mismatched elements as defined in this article. This approach yields a new metric for graphs.

4.1. Example

Let us consider a concrete example. Suppose the government in a country is trying to associate a village (among three candidate villages: VL1,VL2,VL3) which produces maize with the wholesalers which sell maize. In VL1, there are five farmers; in VL2, there are six farmers; in VL3, there are 10 farmers. The expected annual yields of maize for each farmer in VL1 are

3.2, 5.1, 7.6, 3.2, 8.8

tons; the ones in VL2 are

1.2, 2.1, 3.6, 7.9, 12.1, 6.4

tons; and the ones in VL3 are

2.6, 4.6, 8.1, 5.1, 2.2, 5, 7.9, 11.1, 12, 4.5

tons. On the other hand, suppose there are four wholesalers whose annual demands are

7.9, 9.2, 11.6, 8.3

tons, respectively. The government policy is to associate a village with the wholesalers based on the criterion that the total discrepancy between the village and the wholesalers must be minimal and the condition that each farmer could only exclusively sign the contract with exactly one wholesaler. Assume the government adopts the metric defined in this article. The results could be computed as follows Table 1.

Table 1. Analysis of Optimal Matchings.

Since the total discrepancy (i.e., matched part plus mismatched part) between VL2 and the wholesalers is minimal (or

11.3

), the government should associate VL2 with the wholesalers. Henceforth, the government should pick VL2 to sign the contract with the four wholesalers exclusively. In doing so, the total dissatisfaction (or discrepancy) from both the farmer and the wholesalers would be minimal.

4.2. Characteristic and Analysis

The main characteristic of our metric is that it takes the minimal discrepancy into consideration. For the usual metrics, one hardly associates a metric with the minimal matching via combinations or permutations of all sorts of choices. Our method successfully combines the usual definition of a metric with the concept of an optimal choice. With these two concepts combined, one could pick up an optimal decision purely based on the metric defined in this article. This approach gives one a much more direct decision-making process. In addition, since this metric consists of two parts: the matched and mismatched parts, it would provide one with much more insightful knowledge of the discrepancy between mathematical objects.

5. Conclusions

We have defined a metric on a finite power multiset of positive real numbers via the concept of minimal matchings, in which the distances of any two multisets consist of two parts: the distance of the matched part and the distance of the mismatched part. We also implement this metric by an adjacency matrix. A concrete example is also included in this article. In addition to the adjacency matrix, we show another definitional computation to facilitate our computing of the metric. The metric defined in this article could be further applied in some real problems regarding artificial intelligence, clustering, or some other theoretical mathematical research.

Funding

This research was funded by the Natural Science Foundation of Fujian Province of China (Grant No. 2017J01566).

Conflicts of Interest

The authors declare no conflict of interest.

References

Blizard, W.D. Multiset Theory. Notre Dame J. Form. Log. 1988, 30, 36–66. [Google Scholar] [CrossRef]
Singh, D.; Ibrahim, A.M.; Yohanna, T.; Singh, J.N. An overview of the applications of multisets. Novi Sad J. Math. 2007, 37, 73–92. [Google Scholar]
Mémoli, F. Gromov–Wasserstein distances and the metric approach to object matching. Found. Comput. Math. 2011, 11, 417–487. [Google Scholar] [CrossRef]
Bondy, A.; Ram, M.M. Graph Theory; Springer: London, UK, 2008. [Google Scholar]
Chong, E.K.P.; Zak, S.H. An Introduction to Optimization; Jonh Wiley and Sons, Inc.: New York, NY, USA, 2013. [Google Scholar]
Karapınar, E.; Yıldız-Ulus, A.; Erhan, İ.M. Cyclic Contractions on G-Metric Spaces. Abstr. Appl. Anal. 2012, 2012, 182947. [Google Scholar] [CrossRef]
Syropoulos, A. Mathematics of Multisets; Multiset Processing; Springer: Berlin/Heidelberg, Germany, 2001; pp. 347–358. [Google Scholar]
Blizard, W.D. Real-valued multisets and fuzzy sets. Fuzzy Sets Syst. 1989, 33, 77–97. [Google Scholar] [CrossRef]

Figure 1. Triangle Inequality for

(3, 1, 1)

case.

Figure 1. Triangle Inequality for

(3, 1, 1)

case.

Figure 2. Triangle Inequality for

(1, 3, 1)

case.

Figure 2. Triangle Inequality for

(1, 3, 1)

case.

Figure 3. Adjacency Matrix

{[δ (A_{i}, A_{j})]}_{i, j = 1}^{6}

.

Figure 3. Adjacency Matrix

{[δ (A_{i}, A_{j})]}_{i, j = 1}^{6}

.

Table 1. Analysis of Optimal Matchings.

Villages	Expected Annual Yield (tons)	Matched	Mismatched	Total Discrepancy
VL1	${3.2, 5.1, 7.6, 3.2, 8.8}$	$12.3$	$3.2$	$15.5$
VL2	${1.2, 2.1, 3.6, 7.9, 12.1, 6.4}$	$8.0$	$3.3$	$11.3$
VL3	${2.6, 4.6, 8.1, 5.1, 2.2, 5, 7.9, 11.1, 12, 4.5}$	$2.5$	24	$26.5$

© 2018 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Article Metrics

Citations

Article Access Statistics

Journal Statistics

Multiple requests from the same IP address are counted as one view.