Next Article in Journal
(L)-Semigroup Sums
Previous Article in Journal
Diffeomorphisms with Shadowable Measures
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Metric for Finite Power Multisets of Positive Real Numbers Based on Minimal Matching

School of Mathematics and Statistics, Baise University, No.21 Zhongshan Road, Baise 533000, Guangxi, China
Axioms 2018, 7(4), 94; https://doi.org/10.3390/axioms7040094
Submission received: 17 September 2018 / Revised: 2 December 2018 / Accepted: 12 December 2018 / Published: 14 December 2018

Abstract

:
In this article, we show how to define a metric on the finite power multisets of positive real numbers. The metric, based on the minimal matching, consists of two parts: the matched part and the mismatched part. We also give some concrete applications and examples to demonstrate the validity of this metric.

1. Introduction

A multiset, unlike a Cantorian set, is a collection of elements whose instances might be multiple (the number of its instances of an element is named multiplicity). The cardinality of a multiset A is defined by the sum of the multiplicities with respect to their corresponding elements and is denoted by | A | m . For example, the cardinality of multiset A = { 2 , 2 , 3 , 3 , 3 , 6 , 11 } is 7, i.e., | A | m = 7 . Though unconventional, the theory of multisets has well been developed (see Reference [1]) and it also has various applications in many situations (see Reference [2]). From practical point of view, multisets are easier to represent or simulate than mathematical objects with multiple instances. In this article, we mainly focus on the finite power multisets of positive real numbers.
Let R + denote the set of all positive real numbers. Let N 0 denote the set of natural numbers including 0. Let power multiset MP ( R + ) denote the set of all the sub-multisets of R + . Suppose K MP ( R + ) is an arbitrary set of some sub-multisets in R + ( each multiset is finite) of MP ( R + ) . We call K a finite power multiset of positive real numbers. The main result in this article is to define a metric on K based on the concept of minimal matching. The distance between any two multisets consists of two separated parts: the matched part and mismatched part. Matching has been an important problem and has wide applications in the fields of artificial intelligence, graph theories, and operation research (see References [3,4,5]). In this article, we come up with a new metric which is based on the concept of minimal matching. This metric is used to measure the distance between any two finite multisets of positive real numbers. Though what we define in this article is a standard metric, the whole setting could also be extended to other generalized metrics, for example, G –metric (see Reference [6]).

2. Definitions

In this section, we introduce and present multisets via the forms of functions. The basic concepts could be found in many textbooks or journals (see, e.g., References [7,8]). Let Γ denote the set R + N 0 , i.e., the set of all the functions from R + to N 0 . Let D f denote the domain of a function f. Let set D f * = { r R + : f ( r ) 0 } be the non-zero domain of f.

2.1. Multisets

Let Γ < denote all the finite multi-subsets of R + , i.e., Γ < = { f Γ : | D f * | < } . Each element in Γ < is simply named a multiset in this article. If for all x R + , f ( x ) g ( x ) , we say f is a multi-subset of g, denoted by f g . Let f , g Γ < be arbitrary.
Definition 1.
(Empty Multiset) We call the zero function in Γ < the empty multiset.
Definition 2.
(Equality =) f = g if and only if f g and g f .
Definition 3.
(Intersection ) The intersection of f and g, denoted by the function f g : R + N 0 , is defined by ( f g ) ( a ) : = m i n { f ( a ) , g ( a ) } for all a R + .
Definition 4.
(Union ) The union of f and g, denoted by the function f g : R + N 0 , is defined by ( f g ) ( a ) : = m a x { f ( a ) , g ( a ) } for all a R + .
Definition 5.
(Difference ) Exclusion of g from f, denoted by the function f g : R + N 0 , is defined by ( f g ) ( a ) : = f ( a ) ( f g ) ( a ) .
Each multiset f in Γ < could be uniquely represented by the following descending form (named a representative descending form):
f = ( a 1 f ( a 1 ) , a 2 f ( a 2 ) , , a n f ( a n ) ) ,
or in brief f = a 1 f ( a 1 ) a 2 f ( a 2 ) a n f ( a n ) ; or by the following ascending form (named a representative ascending form):
f + = ( a n f ( a n ) a n 1 f ( a n 1 ) , , a 2 f ( a 2 ) a 1 f ( a 1 ) ) ,
or in brief f + = a n f ( a n ) a n 1 f ( a n 1 ) a 1 f ( a 1 ) , where a 1 > a 2 > a 3     > a n > 0 and a 1 , a 2 , , a n D f * and f ( a v ) > 0 for all 1 v n . Let | f | = f ( a 1 ) + f ( a 2 ) + + f ( a n ) .
Definition 6.
(Descending) Define the p - t h element in f by function O D as follows: O D ( p , f ) : = a 1 i f 1 p f ( a 1 ) ; a j i f l = 1 j 1 f ( a l ) < p l = 1 j f ( a l ) a n d | D f * | j 2 ; 0 otherwise .
Definition 7.
(Ascending) Define the p - t h element in f by function O A as follows: O A ( p , f ) : = a n i f 1 p f ( a n ) ; a n j i f l = 0 j 1 f ( a n l ) < p l = 0 j f ( a n l ) a n d | D f * | j 1 ; 0 otherwise .

2.2. Background

In this article, we show how to define a metric on K (see Introduction). For any Cantorian set S, we use | S | to denote the cardinality of S. Let d be an arbitrary metric on R + satisfying
d ( a , b ) a + b ,
d ( a , b ) + a b ,
d ( a , b ) + b a ,
for all a , b , c R + . Observe that d is a metric (for our generalization purpose) on R + × R + , which lays a foundation for our latter definition of a metric on K . Let A , B , C K be arbitrary. Let A B denote the set of all the functions from A to B, in which the repeated elements are deemed distinct.
Example 1.
Suppose A = { 2 , 2 , 3 , 5 } and B = { 6 , 8 , 8 , 8 , 8 , 9 , 12 , 12 } , and ρ ( 2 ) = 9 , ρ ( 2 ) = 6 , ρ ( 3 ) = 8 , ρ ( 5 ) = 12 . Then, ρ A B . For clarity, one could simply associate A and B with their ranked multiplicities as follows: A = { ( 2 , 1 ) , ( 2 , 2 ) , ( 3 , 1 ) , ( 5 , 1 ) } and B = { ( 6 , 1 ) , ( 8 , 1 ) , ( 8 , 2 ) , ( 8 , 3 ) , ( 8 , 4 ) , ( 9 , 1 ) , ( 12 , 1 ) , ( 12 , 2 ) } and ρ could also be represented by ρ ( 2 , 1 ) = ( 9 , 1 ) , ρ ( 2 , 2 ) = ( 6 , 1 ) , ρ ( 3 , 1 ) = ( 8 , 1 ) , ρ ( 5 , 1 ) = ( 12 , 1 ) . To save space, we simply use ρ ( 2 1 ) = 9 1 , ρ ( 2 2 ) = 6 1 , ρ ( 3 1 ) = 8 1 , ρ ( 5 1 ) = 12 1 for the representation in this article.
For any function φ , we use D φ and R φ to denote its domain and codomain, respectively. For the previous example, D ρ = A and R ρ = B . We use φ ( S ) to denote the image { φ ( s ) : s S } , in particular φ ( D φ ) to denote the image of φ and φ 1 ( S ) to denote the pre-image of S. If S D φ , we use φ | S to denote φ whose domain is restricted to S. One candidate in mind is d ( a , b ) : = | a b | .
Definition 8.
(Bijective embeddings) Define
B F [ A B ] : = { φ A B : | A | m = | φ ( A ) | m } ,
B F [ B A ] : = { φ B A : | B | m = | φ ( B ) | m } ,
B F [ A , B ] : = B F [ A B ] B F [ B A ] .
Example 2.
Suppose ρ is defined in Example 1. Since | A | m = 4 = | ρ ( A ) | m , by the above definition, one has ρ B F [ A B ] . On the other hand, suppose κ ( 2 1 ) = 6 1 , κ ( 2 2 ) = 6 1 , κ ( 3 1 ) = 8 , κ ( 5 1 ) = 12 , then κ B F [ A B ] , since | A | m = 4 | κ ( A ) | m = 3 . Note that B F [ A , B ] = B F [ B , A ] . Moreover, if | A | m > | B | m , then B F [ A B ] = ; similarly, if | B | m > | A | m , then B F [ B A ] = . Take A and B in Example 1 for example. One has B F [ B A ] = .
Definition 9.
For any function φ B F [ A B ] , we call it a a matched function. We call ( a , φ ( a ) ) a matched pair. Every remaining element in B φ ( A ) is called a mismatched element.
On this basis, we could define the distance for the matched elements and the distance for the mismatched elements as follows:
Definition 10.
For any φ B F [ A , B ] , define
φ : = e D φ d ( e , φ ( e ) ) a n d φ : = e R φ φ ( D φ ) e ,
where D φ and R φ and denote the domain and codomain of φ, respectively. φ represents the distance of all the matched elements (or the sum of the distances of all the matched pairs), while φ represents the distance of all the mismatched elements in the range. φ = 0 iff | A | = | B | . For example, if A = { 1 , 1 , 2 , 3 , 1 } , B = { 2 , 4 , 6 , 2 } and φ : A B is defined by φ ( 1 1 ) = 2 , φ ( 1 2 ) = 6 1 , φ ( 2 ) = 2 1 , φ ( 3 ) = 2 2 , then the matched part yields φ = | 1 2 | + | 1 6 | + | 2 2 | + | 3 2 | = 7 , where i n denotes the n - th repetition of i and the mismatched part φ = 1 . Next, we define the set of all minimal distances consisting of the matched parts and the mismatched parts.
Definition 11.
(Minimal matched functions) Define
B F * [ A B ] : = { φ B F [ A B ] : φ + φ | | ψ + ψ | | , ψ B F [ A B ] } , B F * [ B A ] : = { φ B F [ B A ] : φ + φ ψ + ψ , ψ B F [ B A ] } , B F * [ A , B ] : = { φ B F [ A , B ] : φ + φ ψ + ψ , ψ B F [ A , B ] } .
Definition 12.
(Distance function) Define δ : K × K R + by
δ ( A , B ) : = m i n { φ + φ : φ B F [ A , B ] } .
By the definition, one has δ ( A , B ) = φ + φ for any φ B F * [ A , B ] . In the following, we show that δ is indeed a metric. The reasoning will be proceeded by their relations (i.e., larger, less than and equal to) between cardinalities of A , B , and C, i.e., | A | m , | B | m , and | C | m . To validate that δ is a metric, we need to consider all the 27 relations between | A | m , | B | m and | C | m : for example, | A | m > | B | m > | C | m , | A | m = | B | m < | C | m , e t c . In order to facilitate our computing, we encode the 27 relations by the following set
{ ( n 1 , n 2 , n 3 ) : n 1 , n 2 , n 3 { 1 , 2 , 3 } } ,
in which each ( n 1 , n 2 , n 3 ) represents the relation | A | m n 1 | B | m , | B | m n 2 | C | m and | A | m n 3 | C | m , respectively, where 1 , 2 , and 3 represent the relation < , = and > correspondingly. For example, ( 1 , 2 , 3 ) represents the relation | A | m < | B | m , | B | m = | C | m , and | A | m > | C | m . By the transitivity of their cardinalities, only 13 of the 27 relations are valid (shown in Lemma 1). Moreover, these 13 relations could be further reduced to 8 relations by the symmetry of δ , i.e.,
δ ( A , B ) + δ ( B , C ) δ ( A , C ) δ ( C , B ) + δ ( B , A ) δ ( C , A ) ,
as shown in Corollary 1. If φ is a bijective function, we use φ 1 to denote its inverse function. In the following, let φ B F * [ A , B ] , φ ˜ B F * [ B , C ] , and φ ˜ ˜ B F * [ A , C ] be arbitrary. Before we proceed further, we have the definitions:
  • We use B A to denote φ ( A ) , if | A | m | B | m and A B to denote φ ( B ) , if | A | m > | B | m .
  • We use C B to denote φ ˜ ( B ) , if | B | m | C | m and B C to denote φ ˜ ( C ) , if | B | m > | C | m .
  • We use C A to denote φ ˜ ˜ ( A ) , if | A | m | C | m and A C to denote φ ˜ ˜ ( C ) , if | C | m < | A | m .
Though there are 27 relations between the cardinalities of A , B , and C, only 13 of them are valid as shown in the following lemma.
Lemma 1.
There are only 13 relations which do not violate the transitivity property in terms of their cardinalities:
( 1 , 1 , 1 ) , ( 1 , 2 , 1 ) , ( 1 , 3 , 1 ) , ( 1 , 3 , 2 ) , ( 1 , 3 , 3 ) , ( 2 , 1 , 1 ) , ( 2 , 2 , 2 ) , ( 2 , 3 , 3 ) , ( 3 , 1 , 1 ) , ( 3 , 1 , 2 ) , ( 3 , 1 , 3 ) , ( 3 , 2 , 3 ) , ( 3 , 3 , 3 ) .
Proof. 
The result follows immediately from their relations. Take the relation ( 1 , 1 , 1 ) for example. Recall that ( 1 , 1 , 1 ) represents the relation | A | m < | B | m < | C | m , in which the property of transitivity | A | m < | C | m holds. One could verify that each of the other 12 relations also holds the transitivity property. However, the other 15 relations fail the transitivity property: for example ( 1 , 1 , 3 ) ( i.e., | A | m < | B | m , | B | m < | C | m , | A | m > | C | m ). ☐
Lemma 2.
(Non-negative, symmetric)
  • δ ( A , B ) 0 .
  • δ ( A , B ) = 0 iff A = B .
  • δ ( A , B ) = δ ( B , A ) .
Proof. 
The first statement follows immediately from the definition and the third one follows from the fact that B F [ A , B ] = B F [ B , A ] . Here we show the second one. Suppose A = B , then δ ( A , B ) = I = 0 , where I is the identity function. Suppose A B . Then, there are two cases: either | A | m | B | m or | A | m = | B | m . For the former one, one has φ B F [ A , B ] ( φ > 0 ) , i.e., δ ( A , B ) > 0 . For the latter one, one has I B F [ A , B ] and thus φ B F [ A , B ] ( φ > 0 ) , i.e., δ ( A , B ) > 0 . Hence, we have shown δ ( A , B ) = 0 iff A = B . ☐
In the following, we show the triangle inequality of δ . Let us show the following corollary first.
Corollary 1.
To show δ satisfy the triangle inequality, it suffices to consider the following eight relations:
( 2 , 2 , 2 ) , ( 2 , 3 , 3 ) , ( 2 , 1 , 1 ) , ( 3 , 1 , 2 ) , ( 1 , 3 , 2 ) , ( 1 , 1 , 1 ) , ( 3 , 1 , 1 ) , ( 1 , 3 , 1 ) .
Proof. 
By Equation (7) and Lemma 1, A and C are interchangeable, i.e., the relations
( 1 , 1 , 1 ) , ( 2 , 3 , 3 ) , ( 1 , 3 , 1 ) , ( 2 , 1 , 1 ) , ( 3 , 1 , 1 )
are equivalent to (respectively)
( 3 , 3 , 3 ) , ( 1 , 2 , 1 ) , ( 1 , 3 , 3 ) , ( 3 , 2 , 3 ) , ( 3 , 1 , 3 ) .
 ☐
By this corollary, we only need to consider the triangle inequality of the above-mentioned eight relations.
Lemma 3.
( Relation ( 2 , 2 , 2 ) )
If | A | m = | B | m = | C | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
Since | A | m = | B | m = | C | m , it follows
δ ( A , B ) = φ = e A d ( e , φ ( e ) )
and
δ ( B , C ) = φ ˜ = h A d ( h , φ ˜ ( h ) ) = e A d ( φ ( e ) , φ ˜ φ ( e ) ) .
Since φ ˜ φ B F ( A , C ) , by the definition of d, it follows
δ ( A , B ) + δ ( B , C ) e A d ( e , φ ˜ φ ( e ) ) e A d ( e , φ ˜ ˜ ( e ) ) ,
i.e., δ ( A , B ) + δ ( B , C ) δ ( A , C ) . ☐
Lemma 4.
( Relation ( 2 , 3 , 3 ) )
If | A | m = | B | m > | C | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
By the definition δ , it follows
δ ( A , C ) = a A A C a + a A C d ( a , φ ˜ ˜ ( a ) ) a A A B C a + a A B C d ( a , φ ˜ φ ( a ) ) ,
where A B C denotes φ 1 ( B C ) . Furthermore,
δ ( A , B ) = a A A B C d ( a , φ ( a ) ) + a A B C d ( a , φ ( a ) ) ,
δ ( B , C ) = b B B C b + b B C d ( b , φ ˜ ( b ) ) = a A A B C φ ( a ) + a A B C d ( φ ( a ) , φ ˜ φ ( a ) ) .
Henceforth, by the properties of d and the definition of δ
δ ( A , B ) + δ ( B , C ) = a A A B C [ d ( a , φ ( a ) ) + φ ( a ) ] + a A B C [ d ( a , φ ( a ) ) + d ( φ ( a ) , φ ˜ φ ( a ) ) ] a A A B C a + a A B C d ( a , φ ˜ φ ( a ) ) δ ( A , C ) .
 ☐
Lemma 5.
( Relation ( 2 , 1 , 1 ) )
If | A | m = | B | m < | C | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
By the definition δ , it follows
δ ( A , C ) = c C C A c + a A d ( a , φ ˜ ˜ ( a ) ) c C C B c + a A d ( a , φ ˜ φ ( a ) ) ,
δ ( A , B ) = a A d ( a , φ ( a ) ) ,
δ ( B , C ) = c C C B c + b B d ( b , φ ˜ ( b ) ) = c C C B c + a A d ( φ ( a ) , φ ˜ φ ( a ) ) .
Henceforth, by the triangle inequality of d
δ ( A , B ) + δ ( B , C ) = a A d ( a , φ ( a ) ) + c C C B c + b B d ( b , φ ˜ ( b ) ) = a A d ( a , φ ( a ) ) + c C C B c + a A d ( φ ( a ) , φ ˜ φ ( a ) ) c C C B c + a A d ( a , φ ˜ φ ( a ) ) c C C A c + a A d ( a , φ ˜ ˜ ( a ) ) = δ ( A , C ) .
 ☐
Lemma 6.
( Relation ( 3 , 1 , 2 ) )
If | A | m = | C | m > | B | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
Since
δ ( A , C ) = a A d ( a , φ ˜ ˜ ( a ) ) ,
δ ( A , B ) = a A A B a + a A B d ( a , φ ( a ) ) ,
δ ( B , C ) = c C C B c + b B d ( b , φ ˜ ( b ) ) = c C C B c + a A B d ( φ ( a ) , φ ˜ φ ( a ) ) .
By the triangle inequality of d and the definitions of δ
δ ( A , B ) + δ ( B , C ) A A B a + c C C B c + a A B [ d ( a , φ ( a ) ) + d ( φ ( a ) , φ ˜ φ ( a ) ) ] a A A B [ a + ψ ( a ) ] + a A B [ d ( a , φ ˜ φ ( a ) ) ] for some bijective function ψ between A A B and C C B a A A B d ( a , ψ ( a ) ) + a A B [ d ( a , φ ˜ φ ( a ) ) ] for some bijective function ψ between A A B and C C B δ ( A , C ) ( since the coupling of ψ | A A B and φ | A B lies in B F ( A , C ) ) .
 ☐
Lemma 7.
( Relation ( 1 , 3 , 2 ) )
If | A | m = | C | m < | B | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
By the definitions,
δ ( A , B ) = b B B A b + a A d ( a , φ ( a ) ) = b B B A b + a A B A B C d ( a , φ ( a ) ) + a A A B A B C d ( a , φ ( a ) ) b B C B A B C b + a A B A B C d ( a , φ ( a ) ) + a A A B A B C d ( a , φ ( a ) ) = b B C B A B C b + a A B A B C d ( a , φ ( a ) ) + a A A B A B C d ( a , φ ( a ) )
where A B A B C denotes φ 1 ( B A B C ) . Moreover,
δ ( B , C ) = b B B C b + b B C d ( b , φ ˜ ( b ) ) = b B B C b + b B C B A B C d ( b , φ ˜ ( b ) ) + b B A B C d ( b , φ ˜ ( b ) ) b B A B A B C b + b B C B A B C d ( b , φ ˜ ( b ) ) + b B A B C d ( b , φ ˜ ( b ) ) = a A A B A B C φ ( a ) + b B C B A B C d ( b , φ ˜ ( b ) ) + a A B A B C d ( φ ( a ) , φ ˜ φ ( a ) ) .
Hence, by the triangle inequality of d and the definitions of δ
δ ( A , B ) + δ ( B , C ) a A A B A B C [ φ ( a ) + d ( a , φ ( a ) ) ] + b B C B A B C [ b + d ( b , φ ˜ ( b ) ) ] a A B A B C [ d ( a , φ ( a ) ) + d ( φ ( a ) , φ ˜ φ ( a ) ) ] a A A B A B C a + b B C B A B C φ ˜ ( b ) + a A B A B C d ( a , φ ˜ φ ( a ) ) a A A B A B C d ( a , ψ ( a ) ) + a A B A B C d ( a , φ ˜ φ ( a ) ) , where ψ B F ( A A B A B C , B C B A B C ) a A d ( a , φ ˜ ˜ ( a ) ) = δ ( A , C ) .
 ☐
Lemma 8.
( Relation ( 1 , 1 , 1 ) )
If | A | m < | B | m < | C | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
By the definitions of δ
δ ( A , C ) = c C C A c + a A d ( a , φ ˜ ˜ ( a ) ) c C C B A c + a A d ( a , φ ˜ φ ( a ) ) ,
where C B A denotes φ ˜ ( B A ) ;
δ ( A , B ) = b B B A b + a A d ( a , φ ( a ) ) .
Furthermore,
δ ( B , C ) = c C C B c + b B d ( b , φ ˜ ( b ) ) = c C C B c + b B A d ( b , φ ˜ ( b ) ) + b B B A d ( b , φ ˜ ( b ) ) = c C C B c + a A d ( φ ( a ) , φ ˜ φ ( a ) ) + b B B A d ( b , φ ˜ ( b ) ) .
Then, by the triangle inequality of d and the definitions of δ
δ ( A , B ) + δ ( B , C ) a A d ( a , φ ˜ φ ( a ) ) + b B B A [ b + d ( b , φ ˜ ( b ) ) ] + c C C B c a A d ( a , φ ˜ φ ( a ) ) + b B B A φ ˜ ( b ) + c C C B c ( by Equation ( 4 ) ) = a A d ( a , φ ˜ φ ( a ) ) + c C B C B A c + c C C B c a A d ( a , φ ˜ φ ( a ) ) + c C C B A c δ ( A , C ) .
 ☐
Lemma 9.
( Relation ( 3 , 1 , 1 ) )
If | C | m > | A | m > | B | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
We derive the three components one by one. Firstly, suppose φ ¯ ¯ B M ( A B ) is a function satisfying φ ¯ ¯ | ( A B ) = C B (as shown in Figure 1), i.e., φ ¯ ¯ ( a ) = φ ˜ φ 1 ( a ) for all a A B . By the definitions of δ
δ ( A , C ) = c C C A c + a A d ( a , φ ˜ ˜ ( a ) ) c C φ ¯ ¯ ( A ) c + a A d ( a , φ ¯ ¯ ( a ) ) = c C φ ¯ ¯ ( A ) c + a A B d ( a , φ ¯ ¯ ( a ) ) + a A A B d ( a , φ ¯ ¯ ( a ) )
Secondly,
δ ( A , B ) = a A A B a + a A B d ( a , φ 1 ( a ) ) .
Thirdly,
δ ( B , C ) = c C C B c + b B d ( b , φ ˜ ( b ) ) = c C φ ¯ ¯ ( A ) c + c φ ¯ ¯ ( A ) φ ¯ ¯ ( A B ) c + b B d ( b , φ ˜ ( b ) ) = c C φ ¯ ¯ ( A ) c + c φ ¯ ¯ ( A ) φ ¯ ¯ ( A B ) c + a A B d ( φ 1 ( a ) , φ ˜ φ 1 ( a ) ) = c C φ ¯ ¯ ( A ) c + c φ ¯ ¯ ( A ) φ ¯ ¯ ( A B ) c + a A B d ( φ 1 ( a ) , φ ¯ ¯ ( a ) ) = c C φ ¯ ¯ ( A ) c + a A A B φ ¯ ¯ ( a ) + a A B d ( φ 1 ( a ) , φ ¯ ¯ ( a ) )
Hence,
δ ( A , B ) + δ ( B , C ) c C φ ¯ ¯ ( A ) c + a A B d ( a , φ ¯ ¯ ( a ) ) + a A A B [ a + φ ¯ ¯ ( a ) ] c C φ ¯ ¯ ( A ) c + a A B d ( a , φ ¯ ¯ ( a ) ) + a A A B d ( a , φ ¯ ¯ ( a ) ) ( Equation ( 3 ) ) δ ( A , C ) .
 ☐
Lemma 10.
( Relation ( 1 , 3 , 1 ) )
If | B | m > | C | m > | A | m , then δ ( A , B ) + δ ( B , C ) δ ( A , C ) .
Proof. 
Suppose A 2 = φ 1 ( B A B C ) A B A B C . Suppose A 1 = A A 2 . Suppose B A 1 = φ ( A 1 ) = B A B A B C , B A 2 = φ ( A 2 ) = B A B C . Choose a function ψ B F ( A , B C ) , in which a A 2 [ ψ ( a ) = φ ( a ) ] , i.e., ψ | A 2 = φ | A 2 . Suppose B ¯ = B A 2 ψ ( A 1 ) B A 2 B A 1 * , where B A 1 * ψ ( A 1 ) . Let ρ denote the composition φ ˜ 1 | B ψ (or simply φ ˜ 1 ψ ). Then, ρ B F [ A , C ] and
ρ ( A ) = φ ˜ 1 ψ ( A ) = φ ˜ 1 ψ ( A 1 ) φ ˜ 1 ψ ( A 2 ) = φ ˜ 1 ψ ( A 1 ) φ ˜ 1 ( B A 2 ) ,
as shown in Figure 2. Furthermore, by the definition of δ ,
δ ( A , C ) = c C C A c + c C A d ( a , φ ˜ ˜ ( a ) ) c C ρ ( A ) c + a A d ( a , ρ ( a ) ) = c C ρ ( A ) c + a A 1 d ( a , ρ ( a ) ) + a A 2 d ( a , ρ ( a ) ) ;
δ ( A , B ) = b B B A b + a A d ( a , φ ( a ) ) = b B B A b + a A 1 d ( a , φ ( a ) ) + a A 2 d ( a , φ ( a ) ) = b B B A B C b + b B C B A 1 B A 2 b + b B A 1 b + a A 1 d ( a , φ ( a ) ) + a A 2 d ( a , φ ( a ) ) b B C B ¯ b + b B A 1 * b + a A 1 d ( a , φ ( a ) ) + a A 2 d ( a , φ ( a ) )
δ ( B , C ) = b B B C b + b B C d ( b , φ ˜ 1 ( b ) ) = b B B C b + b B A 2 d ( b , φ ˜ 1 ( b ) ) + b B A 1 * d ( b , φ ˜ 1 ( b ) ) + b B C B ¯ d ( b , φ ˜ 1 ( b ) ) a A 1 φ ( a ) + b B A 2 d ( b , φ ˜ 1 ( b ) ) + b B A 1 * d ( b , φ ˜ 1 ( b ) ) + b B C B ¯ d ( b , φ ˜ 1 ( b ) ) = a A 1 φ ( a ) + a A 2 d ( φ ( a ) , ρ ( a ) ) + b B A 1 * d ( b , φ ˜ 1 ( b ) ) + b B C B ¯ d ( b , φ ˜ 1 ( b ) ) .
Henceforth, by Equations (3)–(5), it follows
δ ( A , B ) + δ ( B , C ) b B C B ¯ [ b + d ( b , φ ˜ 1 ( b ) ) ] + b B A 1 * [ b + d ( b , φ ˜ 1 ( b ) ) ] + a A 1 [ d ( a , φ ( a ) ) + φ ( a ) ] + a A 2 [ d ( a , φ ( a ) ) + d ( φ ( a ) , ρ ( a ) ) ] b B C B ¯ φ ˜ 1 ( b ) + b B A 1 * φ ˜ 1 ( b ) + a A 1 a + a A 2 d ( a , ρ ( a ) ) = b B C B ¯ φ ˜ 1 ( b ) + a A 1 [ ( ρ ( a ) + a ] + a A 2 d ( a , ρ ( a ) ) b B C B ¯ φ ˜ 1 ( b ) + a A 1 d ( a , ρ ( a ) ) + a A 2 d ( a , ρ ( a ) ) = c C ρ ( A ) c + a A 1 d ( a , ρ ( a ) ) + a A 2 d ( a , ρ ( a ) ) δ ( A , C ) .
 ☐
Theorem 1.
( K , δ ) is a metric space.
Proof. 
By Lemmas 2–10 and Corollary 1, the result follows immediately. ☐

3. Applications and Computations

In this section, we give a group of numerical data and demonstrate how to compute their distances (or adjacency matrix) via the metric δ . In order to facilitate our computing, we show the following lemmas first. Let a 1 , a 2 , b 1 , b 2 R be arbitrary.

3.1. Lemmas

Lemma 11.
If a 1 a 2 and b 1 b 2 , then | a 1 b 1 | + | a 2 b 2 | | a 1 b 2 | + | a 2 b 1 | .
Proof. 
Suppose a 2 = a 1 + λ a , suppose b 2 = b 1 + λ b , where λ a , λ b 0 . Let k = a 1 b 1 . Then,
| a 1 b 2 | + | a 2 b 1 | | a 1 b 1 | | a 2 b 2 | = | a 1 b 1 λ b | + | a 1 b 1 + λ a | | a 1 b 1 | | a 1 b 1 + λ a λ b | = | k λ b | + | k + λ a | | k | | k + λ a λ b | .
Furthermore, we consider the following cases:
  • k = 0 : Then,
    | k λ b | + | k + λ a | | k | | k + λ a λ b | = λ b + λ a | λ a λ b | 0 ;
  • k > 0 : Then,
    | k λ b | + | k + λ a | | k | | k + λ a λ b | = | k λ b | + k + λ a k | k + λ a λ b | 0 ;
  • k < 0 : Then,
    | k λ b | + | k + λ a | | k | | k + λ a λ b | = k + λ b + | k + λ a | + k | k + λ a λ b | 0 .
    Hence, we have shown | a 1 b 2 | + | a 2 b 1 | | a 1 b 1 | | a 2 b 2 | 0 .
 ☐
Lemma 12.
Let ρ B F * ( A , B ) be arbitrary. Let e 1 , e 2 D ρ be arbitrary such that e 1 e 2 . Then, η B F * [ A , B ] such that η ( e ) = ρ ( e ) for all e D ρ { e 1 , e 2 } and η ( e 1 ) η ( e 2 ) .
Proof. 
If ρ ( e 1 ) ρ ( e 2 ) , then one simply chooses η to be ρ . If ρ ( e 1 ) > ρ ( e 2 ) , then one could choose η ( e ) = ρ ( e ) for all e D ρ { e 1 , e 2 } and η ( e 1 ) : = ρ ( e 2 ) and η ( e 2 ) : = ρ ( e 1 ) . Then, one has η ( e 1 ) < η ( e 2 ) . By Lemma 11, one has
| e 1 ρ ( e 2 ) | + | e 2 ρ ( e 1 ) | | e 1 ρ ( e 1 ) | + | e 2 ρ ( e 2 ) | ,
which together with ρ B F * [ A , B ] yields
| e 1 ρ ( e 2 ) | + | e 2 ρ ( e 1 ) | = | e 1 ρ ( e 1 ) | + | e 2 ρ ( e 2 ) | ,
i.e.,
| e 1 η ( e 1 ) | + | e 2 η ( e 2 ) | = | e 1 ρ ( e 1 ) | + | e 2 ρ ( e 2 ) | ,
i.e., η + η = ρ + ρ , i.e., η B F * [ A , B ] . ☐
Corollary 2.
If ρ B F * [ A , B ] and D ρ = { e 1 , e 2 , , e n } with e 1 e 2 e n , then η B F * [ A , B ] , where D η = D ρ and η ( e 1 ) = m i n [ ρ ( D ρ ) ] , η ( e 2 ) = m i n [ ρ ( D ρ ) { η ( e 1 ) } ] , , η ( k + 1 ) = m i n [ ρ ( D ρ ) { η ( e 1 ) , η ( e 2 ) , , η ( e k ) } ] for all k n 1 .
Proof. 
By applying Lemma 12 repeatedly, the result follows immediately. ☐
This corollary directly facilitates our computation in the next section. In addition, one could also simplify and redefine the metric δ by the result of this corollary.

3.2. Computation

In the following, we demonstrate the computation of our metric δ via a group of simulated data. Suppose
K = { A 1 , A 2 , A 3 , A 4 , A 5 , A 6 } MP ( R + )
is defined as follows:
  • A 1 = { 91.67 , 2 , 39.53 , 98.34 , 8.78 } ;
  • A 2 = { 1.99 , 62 , 7 , 9.52 , 9 , 8.11 } ;
  • A 3 = { 2.1 , 6.22 , 27.1 , 9.67 , 9.19 , 81.29 , 5.55 , 12.41 , 1.67 , 11.08 , 51.15 , 0.33 } ;
  • A 4 = { 22.21 , 61.26 , 71.12 , 29.61 , 29.19 , 29.29 , 35.3 , 40 } ;
  • A 5 = { 17.19 , 2 , 70.56 , 9.52 , 9.45 , 18.16 , 40 } ;
  • A 6 = { 1.26 , 0.19 , 2 , 4.70 , 8.56 , 9.09 } .
Suppose the distance function over R + is defined by
d ( e , φ ( e ) ) : = | e φ ( e ) | .
Then, our metric (defined in Equation (6)) derived from this d could be applied. We could then obtain the adjacency matrix of K via the following two methods:
  • (Method One) List all the permutations and find the optimal permutation and its associated distance, which is the summation of the matched and mismatched parts.
  • (Method Two) List all the combinations and find the optimal combination and its associated distance, which is the summation of the matched and mismatched parts.
Method One comes directly from the definition. By Corollary 2, Method Two is also justified. To demonstrate this, let us first compute δ ( A 2 , A 3 ) . If Method One is applied, then one has to compute all the P ( 12 , 6 ) = 665,280 permutations, and measure the matched distances between these permutations and A 2 and the mismatched distances between these permutations and A 3 . If Method Two is applied, then one sorts the set A 2 first, and then sorts each of the C ( 12 , 6 ) = 924 combinations to measure the matched part between each sorted combination and sorted A 2 , and the mismatched part between the sorted combination and A 3 . Both methods agree as follows:
δ ( A 2 , A 3 ) = m i n { φ + φ : φ B F ( A 2 , A 3 ) } = 120.14 ,
in which the matched distance is φ = 69.6 and the mismatched distance φ = 50.54 , where the optimal φ is defined as follows: φ ( 1.99 ) = 2.1 , φ ( 62 ) = 81.29 , φ ( 7 ) = 9.19 , φ ( 9.52 ) = 51.15 , φ ( 9 ) = 12.41 , φ ( 8.11 ) = 11.08 . Proceed similarly, the distances for other pairs ( A i , A j ) could also be obtained and the resulting adjacency matrix is demonstrated in Figure 3.
One could verify that this adjacency matrix satisfies all the metric axioms, in particular, δ ( A i , A j ) + δ ( A J , A k ) δ ( A i , A k ) for all i , j , k { 1 , 2 , 3 , 4 , 5 , 6 } .

4. Real World Applications

In addition to some trivial applications, one could consider other handy applications, for example, by replacing the usual Euclidean metric with our metric in the following fields: k –means, clustering analysis, graph comparisons, etc. (see Reference [2]). These are frequently-used techniques in analyzing data or theoretical computations. The author has also succeeded in defining a novel metric for graphs based on the metric defined in this article. This enables one to measure the distances between any two graphical structures or networks. The idea for the derived metric is to measure the differences between any two graphs by induction on vertexes. Suppose there are two graphs G 1 and G 2 with the same set of vertices V. For each vertex v V , one could then generate two multisets whose elements are the lengths between v and its respective set of endpoints in G 1 and G 2 . Then, he could compute the distances via the minimal sum of matched elements and mismatched elements as defined in this article. This approach yields a new metric for graphs.

4.1. Example

Let us consider a concrete example. Suppose the government in a country is trying to associate a village (among three candidate villages: VL1,VL2,VL3) which produces maize with the wholesalers which sell maize. In VL1, there are five farmers; in VL2, there are six farmers; in VL3, there are 10 farmers. The expected annual yields of maize for each farmer in VL1 are 3.2 , 5.1 , 7.6 , 3.2 , 8.8 tons; the ones in VL2 are 1.2 , 2.1 , 3.6 , 7.9 , 12.1 , 6.4 tons; and the ones in VL3 are 2.6 , 4.6 , 8.1 , 5.1 , 2.2 , 5 , 7.9 , 11.1 , 12 , 4.5 tons. On the other hand, suppose there are four wholesalers whose annual demands are 7.9 , 9.2 , 11.6 , 8.3 tons, respectively. The government policy is to associate a village with the wholesalers based on the criterion that the total discrepancy between the village and the wholesalers must be minimal and the condition that each farmer could only exclusively sign the contract with exactly one wholesaler. Assume the government adopts the metric defined in this article. The results could be computed as follows Table 1.
Since the total discrepancy (i.e., matched part plus mismatched part) between VL2 and the wholesalers is minimal (or 11.3 ), the government should associate VL2 with the wholesalers. Henceforth, the government should pick VL2 to sign the contract with the four wholesalers exclusively. In doing so, the total dissatisfaction (or discrepancy) from both the farmer and the wholesalers would be minimal.

4.2. Characteristic and Analysis

The main characteristic of our metric is that it takes the minimal discrepancy into consideration. For the usual metrics, one hardly associates a metric with the minimal matching via combinations or permutations of all sorts of choices. Our method successfully combines the usual definition of a metric with the concept of an optimal choice. With these two concepts combined, one could pick up an optimal decision purely based on the metric defined in this article. This approach gives one a much more direct decision-making process. In addition, since this metric consists of two parts: the matched and mismatched parts, it would provide one with much more insightful knowledge of the discrepancy between mathematical objects.

5. Conclusions

We have defined a metric on a finite power multiset of positive real numbers via the concept of minimal matchings, in which the distances of any two multisets consist of two parts: the distance of the matched part and the distance of the mismatched part. We also implement this metric by an adjacency matrix. A concrete example is also included in this article. In addition to the adjacency matrix, we show another definitional computation to facilitate our computing of the metric. The metric defined in this article could be further applied in some real problems regarding artificial intelligence, clustering, or some other theoretical mathematical research.

Funding

This research was funded by the Natural Science Foundation of Fujian Province of China (Grant No. 2017J01566).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Blizard, W.D. Multiset Theory. Notre Dame J. Form. Log. 1988, 30, 36–66. [Google Scholar] [CrossRef]
  2. Singh, D.; Ibrahim, A.M.; Yohanna, T.; Singh, J.N. An overview of the applications of multisets. Novi Sad J. Math. 2007, 37, 73–92. [Google Scholar]
  3. Mémoli, F. Gromov–Wasserstein distances and the metric approach to object matching. Found. Comput. Math. 2011, 11, 417–487. [Google Scholar] [CrossRef]
  4. Bondy, A.; Ram, M.M. Graph Theory; Springer: London, UK, 2008. [Google Scholar]
  5. Chong, E.K.P.; Zak, S.H. An Introduction to Optimization; Jonh Wiley and Sons, Inc.: New York, NY, USA, 2013. [Google Scholar]
  6. Karapınar, E.; Yıldız-Ulus, A.; Erhan, İ.M. Cyclic Contractions on G-Metric Spaces. Abstr. Appl. Anal. 2012, 2012, 182947. [Google Scholar] [CrossRef]
  7. Syropoulos, A. Mathematics of Multisets; Multiset Processing; Springer: Berlin/Heidelberg, Germany, 2001; pp. 347–358. [Google Scholar]
  8. Blizard, W.D. Real-valued multisets and fuzzy sets. Fuzzy Sets Syst. 1989, 33, 77–97. [Google Scholar] [CrossRef]
Figure 1. Triangle Inequality for ( 3 , 1 , 1 ) case.
Figure 1. Triangle Inequality for ( 3 , 1 , 1 ) case.
Axioms 07 00094 g001
Figure 2. Triangle Inequality for ( 1 , 3 , 1 ) case.
Figure 2. Triangle Inequality for ( 1 , 3 , 1 ) case.
Axioms 07 00094 g002
Figure 3. Adjacency Matrix [ δ ( A i , A j ) ] i , j = 1 6 .
Figure 3. Adjacency Matrix [ δ ( A i , A j ) ] i , j = 1 6 .
Axioms 07 00094 g003
Table 1. Analysis of Optimal Matchings.
Table 1. Analysis of Optimal Matchings.
VillagesExpected Annual Yield (tons)MatchedMismatchedTotal Discrepancy
VL1 { 3.2 , 5.1 , 7.6 , 3.2 , 8.8 } 12.3 3.2 15.5
VL2 { 1.2 , 2.1 , 3.6 , 7.9 , 12.1 , 6.4 } 8.0 3.3 11.3
VL3 { 2.6 , 4.6 , 8.1 , 5.1 , 2.2 , 5 , 7.9 , 11.1 , 12 , 4.5 } 2.5 24 26.5

Share and Cite

MDPI and ACS Style

Chen, R.-M. A Metric for Finite Power Multisets of Positive Real Numbers Based on Minimal Matching. Axioms 2018, 7, 94. https://doi.org/10.3390/axioms7040094

AMA Style

Chen R-M. A Metric for Finite Power Multisets of Positive Real Numbers Based on Minimal Matching. Axioms. 2018; 7(4):94. https://doi.org/10.3390/axioms7040094

Chicago/Turabian Style

Chen, Ray-Ming. 2018. "A Metric for Finite Power Multisets of Positive Real Numbers Based on Minimal Matching" Axioms 7, no. 4: 94. https://doi.org/10.3390/axioms7040094

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop