Reciprocal Convex Costs for Ratio Matching: Axiomatic Characterization

Washburn, Jonathan; Rahnamai Barghi, Amir

doi:10.3390/axioms15020151

Open Access Article


by Jonathan Washburn and Amir Rahnamai Barghi *

Recognition Physics Research Institute, Austin, TX 78701, USA

* Author to whom correspondence should be addressed.

Axioms 2026, 15(2), 151; https://doi.org/10.3390/axioms15020151

Submission received: 22 January 2026 / Revised: 9 February 2026 / Accepted: 13 February 2026 / Published: 19 February 2026


Abstract

We study ratio-induced mismatch cost functions of the form $c (s, o) = J (ι_{S} (s) / ι_{O} (o))$ built from positive scale maps $ι_{S} : S \to R_{> 0}$ and $ι_{O} : O \to R_{> 0}$ and a penalty $J : (0, \infty) \to [0, \infty)$. Assuming inversion symmetry, strict convexity, coercivity, normalization at 1, and a multiplicative d’Alembert identity, we show that $f (u) : = 1 + J (e^{u})$ is continuous and satisfies the additive d’Alembert equation; hence, by a classical classification theorem, there exists $a > 0$ such that $J (x) = cosh (a log x) - 1 = \frac{1}{2} (x^{a} + x^{- a}) - 1$ for $x > 0$. We then analyze the associated argmin mapping over feasible scale sets: existence under explicit subspace-closedness assumptions, an explicit geometric-mean decision geometry for finite dictionaries with stability away from boundaries, exact compositionality for product models, and an optimal sequential mediation principle described by a geometric mean (or its log-space projection when infeasible). The paper is purely mathematical; any semantic interpretation is optional and external to the theorems proved here.

Keywords:

functional equations; d’Alembert equation; reciprocal convex cost; ratio-based optimization; geometric-mean decision boundaries; compositionality; sequential mediation

MSC:

39B52; 49J40; 26A51; 90C25; 94A17; 90C31

1. Introduction

This section introduces the optimization-based model of reference, fixes terminology and standing assumptions, and outlines the main results and organization.

The paper’s goal is to make the ratio-matching paradigm mathematically explicit. We fix ratio-induced costs of the form

c (s, o) = J (ι_{S} (s) / ι_{O} (o))

and define meanings by the argmin rule. The first question is structural: under inversion symmetry, convexity/regularity, and a multiplicative compatibility axiom, which mismatch penalties J are admissible, and how canonical is the resulting form? The second question is geometric: once J is fixed, what decision boundaries and stability properties are forced for finite dictionaries, and how do these behave under products and sequential mediation? The intended contribution is a self-contained set of theorems that separate what is proved inside the axioms from any external semantic or empirical interpretation (see, e.g., Wigner [1] for a classic discussion of the relation between mathematics and empirical applicability).

We start with two sets:

a configuration (token) space S (words, codes, internal states, messages, …),
an object space O (candidate referents, concepts, states of affairs, …).

Terminology.

Throughout, we use configuration (or token) for an arbitrary element

s \in S

. We reserve the phrase “symbol for o” for a configuration s satisfying the predicate in Definition 8, i.e.,

o \in Mean (s)

together with the compression inequality

J_{S} (s) < J_{O} (o)

.

Model ingredients and notation.

For quick reference, the functionals and maps used throughout are organized as follows:

Scale maps: $ι_{S} : S \to R_{> 0}$ and $ι_{O} : O \to R_{> 0}$ .
Mismatch penalty: $J : (0, \infty) \to [0, \infty)$ (axioms in Definition 1, explicit choice in Definition 2).
Reference cost: $c (s, o) : = J (ι_{S} (s) / ι_{O} (o))$ (1).
Meaning set (argmin rule): $Mean (s) : = {arg min}_{o \in O} c (s, o)$ (Definition 7).
Intrinsic costs: $J_{S} (s)$ and $J_{O} (o)$ (Definition 3); in the canonical setting these are induced by scales via $J_{S} (s) = J (ι_{S} (s))$ and $J_{O} (o) = J (ι_{O} (o))$ (see Definition 6).
Symbol predicate: s is a symbol for o if $o \in Mean (s)$ and the compression inequality $J_{S} (s) < J_{O} (o)$ holds (Definition 8).

Each space is equipped with a positive scale map

ι_{S} : S \to R_{> 0}

and

ι_{O} : O \to R_{> 0}

, interpreted as an intrinsic “size/complexity” in a common currency. A cost functional

J : (0, \infty) \to [0, \infty)

is fixed with the properties stated in Section 2 (symmetry under inversion, strict convexity, and a unique minimum at 1). We then define a ratio-induced reference cost

c (s, o) : = J (\frac{ι_{S} (s)}{ι_{O} (o)}), (s, o) \in S \times O .

(1)

Meaning as minimization.

The meaning set of a configuration s is the set of objects achieving minimal cost:

Mean (s) : = \underset{o \in O}{arg min} c (s, o) .

Equivalently,

o \in Mean (s)

iff

c (s, o) \leq c (s, o^{'})

for all

o^{'} \in O

(Definition 7). Ties are allowed: meaning is set-valued unless uniqueness is proved under additional hypotheses.

Interpretive content (and its limits).

Because J is minimized at 1, low reference cost forces scale matching: a configuration can only refer cheaply to objects whose scale is close to its own. This yields an explicit, checkable constraint on admissible reference patterns. The framework is deliberately axiomatic: the scale maps and the chosen J are inputs.

1.1. A Toy Example: Three-Object Dictionary

Let

O = {o_{1}, o_{2}, o_{3}}

with scales

y_{i} : = ι_{O} (o_{i})

satisfying

0 < y_{1} < y_{2} < y_{3}

. For a configuration s with scale

x : = ι_{S} (s)

, the meaning rule compares the three costs

J (x / y_{i})

. For the explicit functional (3), the boundary between preferring

o_{1}

and

o_{2}

occurs at the geometric mean

\sqrt{y_{1} y_{2}}

, and similarly between

o_{2}

and

o_{3}

at

\sqrt{y_{2} y_{3}}

(Theorem 8). Thus, the model induces a piecewise-constant semantic partition of the positive line in the configuration ratio x, with stability away from the boundary points.
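The partition in this toy example can be checked numerically. The following is a minimal sketch, assuming the explicit functional (3) and a hypothetical dictionary with scales 1, 4, 9; the helper name `meaning_indices` and the tie tolerance `tol` are illustrative choices and do not appear in the paper.

```python
import math


def J(x: float) -> float:
    """Explicit mismatch penalty (3): J(x) = (x + 1/x)/2 - 1 for x > 0."""
    return 0.5 * (x + 1.0 / x) - 1.0


def meaning_indices(x: float, object_scales: list[float], tol: float = 1e-12) -> list[int]:
    """Indices i minimizing J(x / y_i); ties within tol are all returned."""
    costs = [J(x / y) for y in object_scales]
    best = min(costs)
    return [i for i, cost in enumerate(costs) if cost - best <= tol]


# Hypothetical object scales y_1 < y_2 < y_3 (an assumption for illustration).
scales = [1.0, 4.0, 9.0]

# Decision boundaries predicted by the geometric-mean rule: sqrt(y_i * y_{i+1}).
boundaries = [math.sqrt(scales[i] * scales[i + 1]) for i in range(len(scales) - 1)]

# Probe the argmin on either side of each boundary and exactly on the first one.
for x in (1.9, 2.0, 2.1, 5.9, 6.1):
    print(x, meaning_indices(x, scales))
```

With these scales the boundaries are 2 and 6, so configurations with scale below 2 select the first object, those between 2 and 6 select the second, and a configuration exactly at 2 is tied between the first two, which illustrates why the meaning set is allowed to be multi-valued.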

1.2. Relation to Prior Work

Classical analyses of reference emphasize logical form and truth conditions (e.g., Frege and Russell) [2,3]. The symbol-grounding literature highlights that purely formal symbol manipulation does not by itself determine what symbols are about [4]. The present paper does not attempt to resolve these debates empirically. Instead, it isolates a mathematically tractable selection principle: aboutness is determined by minimizing an explicit mismatch cost. For comparison with contemporary subject matter/aboutness and truthmaker-semantics accounts (e.g., Yablo [5], Hawke [6], and the Philosophical Studies symposium discussion [7,8,9]), see Section 9. The intended payoff is that, once scales are fixed, aboutness becomes a tractable variational problem with explicit decision boundaries and composition theorems.

This paper adopts an optimization-first viewpoint: once a mismatch cost is fixed, semantic meaning is defined by an argmin rule (Definition 7). A closely related measurement-first stance appears in Recognition Geometry [10], which takes recognition events as primitive and derives observable space as a quotient under an operational indistinguishability relation ([10], Def. 4). In the same spirit, the present framework treats mismatch costs as primitive measurements and regards stable meanings as effective equivalence classes of cost minimization events. Both viewpoints emphasize operationally defined structure over a priori metaphysical commitments, and both isolate exactly which axioms must be validated when connecting the formalism to an empirical domain.

1.3. Contributions and What Is Proved

Within the ratio-induced model (1) (and the explicit choice (3) used throughout), we establish the following structural facts under clearly stated hypotheses:

Existence. If the feasible scale set $ι_{O} (O) \subset R_{> 0}$ is nonempty and closed in the usual topology on $(0, \infty)$ , then the minimum in the argmin rule is attained and every configuration admits at least one meaning (Theorem 2).
Finite-dictionary decision geometry. For finite ordered dictionaries, decision boundaries are given by geometric means of adjacent object scales, and meanings are locally stable away from these boundaries (Theorem 8 and Corollary 7).
Compositionality. For product symbol/object spaces with separable scales, meaning factorizes componentwise (Theorem 5).
Mediation. For sequential reference through an intermediate representation, the set of optimal mediator ratios is characterized explicitly in log-coordinates. Whenever the balance-point ratio $b_{geo}$ is feasible, mediation weakly decreases the total mismatch cost relative to direct reference (Theorem 6 and Corollary 3).

1.4. Organization

Section 2 states the axioms for J and fixes the explicit mismatch functional (3). Section 3 defines costed spaces, ratio-induced reference, and the meaning relation. Section 4 contains the principal theorems, followed by compositionality (Section 5), extensions, and examples.

2. The Mismatch Functional $J$

This section fixes the scalar mismatch functional

J : (0, \infty) \to [0, \infty)

used throughout to compare configuration and object scales via the ratio-induced cost (1). The role of J here is purely mathematical: it is an explicit penalty for scale mismatch, and no physical, cognitive, or linguistic interpretation is assumed.

2.1. Standard Properties and Canonicity

The conditions below are recorded as a compact axiom package for the mismatch penalty. They encode inversion symmetry, strict convexity, and a multiplicative compatibility under scale multiplication. After a log change of variables, the compatibility axiom becomes d’Alembert’s functional equation, so the resulting class of penalties is classical. We include a tailored derivation in Appendix A to keep the paper self-contained and to emphasize that the axioms are used only as mathematical assumptions, not as a claim of novelty.

Definition 1 (Cost Functional Axioms).

A mismatch functional is a function

J : (0, \infty) \to [0, \infty)

satisfying:

1.: Normalization: $J (1) = 0$ .
2.: Strict convexity: J is strictly convex on $(0, \infty)$ .
3.: Multiplicative d’Alembert identity: for all $x, y > 0$ ,

$J (x y) + J (x / y) = 2 J (x) + 2 J (y) + 2 J (x) J (y) .$

(2)

The d’Alembert identity (2) is the dominant structural constraint. Inversion symmetry is not assumed as an axiom; it is derived from (2) and normalization in Lemma 1. We invoke strict convexity only in statements where uniqueness is required; existence and attainment statements are formulated without using strict convexity.

Lemma 1 (Derived inversion symmetry).

Assume J satisfies normalization

J (1) = 0

and the multiplicative d’Alembert identity (2). Then for every

x > 0

one has

J (x) = J (x^{- 1})

.

Proof.

Set

y = 1

in (2). Using

J (1) = 0

, obtain

J (x) + J (x^{- 1}) = 2 J (x) + 2 J (1) + 2 J (x) J (1) = 2 J (x),

hence

J (x^{- 1}) = J (x)

. □

We first record a basic consequence that is used repeatedly: under strict convexity, the normalization point $x = 1$ is the unique zero of J.

Lemma 2 (Uniqueness of the zero-cost point).

If J satisfies Definition 1, then

J (x) = 0

implies

x = 1

.

Proof.

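Since J takes values in $[0, \infty)$ and $J (1) = 0$ by normalization (item 1 of Definition 1), J attains its minimum value 0 at

x = 1

. By strict convexity (item 2 of Definition 1), the minimizer is unique. Hence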

J (x) = 0

forces

x = 1

. □

2.2. The Explicit Choice Used in This Paper

Definition 2 (The functional fixed below).

In the remainder of this paper, we fix the explicit functional

J (x) = \frac{1}{2} (x + x^{- 1}) - 1 = \frac{{(x - 1)}^{2}}{2 x} (x > 0) .

(3)

The next proposition verifies that the explicit functional indeed satisfies the axioms, so subsequent sections can treat Definition 1 as established.

Proposition 1 (Verification of the axioms).

Function (3) satisfies Definition 1.

Proof.

Normalization and inversion symmetry are immediate from (3), and (3) shows

J (x) \geq 0

for all

x > 0

. Differentiating

J (x) = \frac{1}{2} (x + x^{- 1}) - 1

gives

J^{'} (x) = \frac{1}{2} - \frac{1}{2 x^{2}}, J^{″} (x) = \frac{1}{x^{3}} > 0 (x > 0),

so J is strictly convex on

(0, \infty)

. For the multiplicative d’Alembert identity (item 3 of Definition 1), set

C (x) = 1 + J (x) = \frac{1}{2} (x + x^{- 1})

. Then

C (x y) + C (x / y) = \frac{1}{2} (x y + \frac{1}{x y} + \frac{x}{y} + \frac{y}{x}) = \frac{1}{2} (x + \frac{1}{x}) (y + \frac{1}{y}) = 2 C (x) C (y),

which is equivalent to (2) after substituting

C = 1 + J

and expanding. □
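As a numerical sanity check of Proposition 1 (not a substitute for the proof), the sketch below evaluates the residual of identity (2) and the inversion-symmetry gap for the explicit functional (3) on randomly sampled positive pairs. The sampling range, the seed, and the helper name `dalembert_residual` are assumptions made for illustration.

```python
import math
import random


def J(x: float) -> float:
    """Explicit mismatch penalty (3)."""
    return 0.5 * (x + 1.0 / x) - 1.0


def dalembert_residual(x: float, y: float) -> float:
    """Left side minus right side of the multiplicative d'Alembert identity (2)."""
    lhs = J(x * y) + J(x / y)
    rhs = 2.0 * J(x) + 2.0 * J(y) + 2.0 * J(x) * J(y)
    return lhs - rhs


random.seed(0)  # fixed seed so the check is reproducible
# Sample x, y log-uniformly in [e^-3, e^3] (an arbitrary illustrative range).
samples = [
    (math.exp(random.uniform(-3.0, 3.0)), math.exp(random.uniform(-3.0, 3.0)))
    for _ in range(1000)
]

max_residual = max(abs(dalembert_residual(x, y)) for x, y in samples)
max_symmetry_gap = max(abs(J(x) - J(1.0 / x)) for x, _ in samples)
print(max_residual, max_symmetry_gap)
```

Both quantities should be at the level of floating-point rounding, consistent with (2) and with Lemma 1.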

Proposition 2 (Classical characterization of J).

Assume

J : (0, \infty) \to [0, \infty)

satisfies Definition 1. Then there exists a constant

a > 0

such that for all

x > 0

,

J (x) = cosh (a log x) - 1 = \frac{1}{2} (x^{a} + x^{- a}) - 1 .

Moreover, if we replace the scale maps by

{\tilde{ι}}_{S} : = ι_{S}^{a}

and

{\tilde{ι}}_{O} : = ι_{O}^{a}

, then the ratio-induced model with parameter a becomes the same model written with parameter 1. Consequently, one may take

a = 1

without loss of generality at the level of the induced reference costs.

Proof.

See Appendix A. □
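The reduction to $a = 1$ in Proposition 2 can be illustrated numerically. The sketch below assumes the stated form $J_{a} (x) = cosh (a log x) - 1$ and compares the cost computed with parameter a on the original scales against the cost computed with parameter 1 on the rescaled maps $ι^{a}$; the numerical values and the function names `J_a` and `J_1` are hypothetical.

```python
import math


def J_a(x: float, a: float) -> float:
    """Penalty from Proposition 2: cosh(a log x) - 1."""
    return math.cosh(a * math.log(x)) - 1.0


def J_1(x: float) -> float:
    """The case a = 1, i.e. the explicit functional (3)."""
    return 0.5 * (x + 1.0 / x) - 1.0


# Hypothetical parameter and scales, chosen only for illustration.
a = 2.5
config_scale, object_scale = 3.0, 0.7

# Cost in the model with parameter a and the original scale maps.
cost_with_a = J_a(config_scale / object_scale, a)

# Cost in the model with parameter 1 and the rescaled maps iota^a.
cost_rescaled = J_1(config_scale**a / object_scale**a)

print(cost_with_a, cost_rescaled)
```

The two values agree up to rounding because $(x / y)^{a} = x^{a} / y^{a}$, which is the entire content of the "without loss of generality" step.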

Example 1 (Small-mismatch regime).

For

| u | ≪ 1

, one has

J (1 + u) = \frac{u^{2}}{2} + O (u^{3}),

so near balance the mismatch cost behaves like a quadratic penalty in the relative deviation.
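The expansion follows directly from the closed form in (3). A short worked derivation, using only the geometric series for $| u | < 1$ , also identifies the cubic correction term:

```latex
J(1+u) \;=\; \frac{u^{2}}{2(1+u)}
       \;=\; \frac{u^{2}}{2}\left(1 - u + u^{2} - \cdots\right)
       \;=\; \frac{u^{2}}{2} - \frac{u^{3}}{2} + O(u^{4}),
       \qquad |u| < 1 .
```

The negative cubic term reflects the asymmetry of J in the additive deviation u: for the same $| u |$ , a ratio below 1 is penalized more than a ratio above 1.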

3. Costed Spaces and Reference Structures

We now formalize the axioms of the model introduced in Section 1. Throughout, the mismatch functional J is fixed as in Section 2. The intent is to make precise which pieces of data are inputs (configuration/object spaces and their scale maps) and which pieces are derived (reference costs and meaning).

3.1. Costed Spaces

Definition 3 (Costed space).

Fix a mismatch functional

J : (0, \infty) \to [0, \infty)

(Section 2). A costed space is a triple

(C, J_{C}, ι_{C})

consisting of

a set C of configurations,
a map $ι_{C} : C \to R_{> 0}$ called the scale map ,
a cost function $J_{C} : C \to R_{\geq 0}$ satisfying $J_{C} (c) = J (ι_{C} (c))$ for all $c \in C$ .

Equivalently, once

ι_{C}

is fixed,

J_{C}

is determined by J; we retain

J_{C}

in the notation since later statements compare configuration costs and object costs directly.

Notation 1.

We write

S = (S, J_{S}, ι_{S})

for a configuration (token) costed space and

O = (O, J_{O}, ι_{O})

for an object costed space.

Throughout, we identify

R_{> 0}

with

(0, \infty)

and equip

R_{> 0}

(and

{(R_{> 0})}^{d}

) with the usual Euclidean topology on

(0, \infty)

(equivalently, the Euclidean subspace topology inherited from

R

). Accordingly, when we say that a set

Y \subset R_{> 0}

is closed, we mean closed in the usual topology on

(0, \infty)

(equivalently,

Y = (0, \infty) \cap F

for some closed

F \subset R

). Likewise, for

Y \subset {(R_{> 0})}^{d}

the term closed means closed in the usual topology on

{(0, \infty)}^{d}

.

Example 2 (Ratio space).

The canonical example is

C = R_{> 0}

with

ι_{C} = id

and

J_{C} = J

.

The next example isolates a small neighborhood of the balanced point; it serves as a convenient test class for stability statements.

Example 3 (Near-balanced configurations).

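For

0 < ϵ < 1

, let

C_{ϵ} : = {x \in R_{> 0} : | x - 1 | < ϵ}

. Since J is strictly decreasing on $(0, 1]$ , strictly increasing on $[1, \infty)$ , and $J (1 - ϵ) = \frac{ϵ^{2}}{2 (1 - ϵ)} > \frac{ϵ^{2}}{2 (1 + ϵ)} = J (1 + ϵ)$ , every

c \in C_{ϵ}

satisfies

J_{C} (c) = J (c) < J (1 - ϵ)

.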

3.2. Reference Structures

Definition 4 (Reference structure).

A reference structure from

S

to

O

is a function

c_{R} : S \times O \to R_{\geq 0},

(4)

called the reference cost. It assigns to each pair

(s, o)

the cost of using s to refer to o.

In the remainder of the paper, we focus on the ratio-induced costs generated by J and the scale maps.

Definition 5 (Ratio-induced reference).

Given scale maps

ι_{S}

and

ι_{O}

, the ratio-induced reference structure is defined by

c_{R}^{J} (s, o) : = J (\frac{ι_{S} (s)}{ι_{O} (o)}) .

(5)

This is the cost used in the Introduction (Equation (1)).

Link to comparative recognizers.

The ratio-induced reference cost (5) can be viewed as a specific instantiation of a comparative recognizer in the sense of Recognition Geometry ([10], Axiom 5 (RG4)). In that framework, a comparative recognizer maps pairs of configurations to an event space ([10], Axiom 2 (RG1)) so as to induce comparative structure (order/distance) from observable events. Here, the “event” is the scalar mismatch value

J (ι_{S} (s) / ι_{O} (o))

, and the induced indistinguishability relation ([10], Def. 4) corresponds to the zero-cost condition

J (ι_{S} (s) / ι_{O} (o)) = 0

, which forces exact scale match

ι_{S} (s) = ι_{O} (o)

by Lemma 2.

The following admissibility condition specifies when the reference cost is exactly the canonical ratio penalty.

Definition 6 (Admissible reference structure).

A reference structure

R

from

S

to

O

is called admissible (with respect to J and the scale maps

ι_{S}, ι_{O}

) if it is ratio-induced, i.e.,

c_{R} (s, o) = J (\frac{ι_{S} (s)}{ι_{O} (o)}) \forall (s, o) \in S \times O .

(6)

Unless stated otherwise, we work with admissible reference structures.

Admissibility transfers the symmetry properties of J to the reference cost; we record this for later use.

Proposition 3 (Inversion symmetry of the reference cost).

If

R

is admissible, then for all

(s, o) \in S \times O

one has

c_{R} (s, o) = J (\frac{ι_{S} (s)}{ι_{O} (o)}) = J (\frac{ι_{O} (o)}{ι_{S} (s)}) .

Proof.

Immediate from admissibility and inversion symmetry

J (x) = J (x^{- 1})

(Lemma 1). □

3.3. Meaning and the Symbol Predicate

Definition 7 (Meaning).

Let

R

be a reference structure from

S

to

O

. A configuration

s \in S

means an object

o \in O

, written

{Mean}_{R} (s, o)

, if o minimizes the reference cost among all objects:

{Mean}_{R} (s, o) \Leftrightarrow \forall o^{'} \in O, c_{R} (s, o) \leq c_{R} (s, o^{'}) .

(7)

For each

s \in S

, we write

{Mean}_{R} (s) : = {o \in O : {Mean}_{R} (s, o)}

for the (possibly multi-valued) meaning set. If

R

is admissible, then equivalently

{Mean}_{R} (s) = \underset{o \in O}{arg min} J (\frac{ι_{S} (s)}{ι_{O} (o)}) .

Definition 8 (Symbol).

Let

R

be a reference structure from

S

to

O

. A configuration

s \in S

is a symbol for an object

o \in O

(relative to

R

) if

1.: Reference: ${Mean}_{R} (s, o)$ .
2.: Compression: $J_{S} (s) < J_{O} (o)$ .

The compression requirement is a modeling assumption: it enforces that symbols are lower-cost encodings than their referents in the common currency induced by J. No empirical interpretation is asserted; the condition is simply part of the definition used in later results.

4. Main Theorems

This section collects the main mathematical consequences of the ratio-induced reference model. Throughout, we fix the explicit mismatch functional

J (x) = \frac{{(x - 1)}^{2}}{2 x} = \frac{1}{2} (x + x^{- 1}) - 1 (x > 0)

(8)

which satisfies Definition 1, and we assume the reference structure is admissible:

c_{R} (s, o) = J (\frac{ι_{S} (s)}{ι_{O} (o)}) .

(9)

Thus, for each

s \in S

, the meaning set

{Mean}_{R} (s)

is the set of minimizers of

o \mapsto J (ι_{S} (s) / ι_{O} (o))

.

4.1. Sublevel Geometry of the Explicit Mismatch Cost

Lemma 3 (Sublevel intervals).

Assume J is given by (3) (equivalently (8)). For each

ϵ > 0

, the sublevel set

L_{ϵ} : = {x \in R_{> 0} : J (x) \leq ϵ}

coincides with the closed interval

[a_{ϵ}, b_{ϵ}]

, where

b_{ϵ} : = (1 + ϵ) + \sqrt{ϵ (2 + ϵ)}, a_{ϵ} : = (1 + ϵ) - \sqrt{ϵ (2 + ϵ)} = \frac{1}{b_{ϵ}} .

Proof.

Using

J (x) = \frac{{(x - 1)}^{2}}{2 x}

, the inequality

J (x) \leq ϵ

is equivalent (after multiplying by

2 x > 0

) to

{(x - 1)}^{2} \leq 2 ϵ x ⟺ x^{2} - 2 (1 + ϵ) x + 1 \leq 0 .

The quadratic has discriminant

Δ = 4 ϵ (2 + ϵ)

and roots

x_{\pm} = (1 + ϵ) \pm \sqrt{ϵ (2 + ϵ)}

. Since it opens upward, the inequality holds exactly for

x \in [x_{-}, x_{+}]

. Set

a_{ϵ} : = x_{-}

and

b_{ϵ} : = x_{+}

. Then

a_{ϵ} b_{ϵ} = {(1 + ϵ)}^{2} - ϵ (2 + ϵ) = 1

, so

a_{ϵ} = 1 / b_{ϵ}

. □
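The endpoints in Lemma 3 are straightforward to compute. The following sketch assumes the explicit functional (3); the function name `sublevel_interval` is illustrative, and the lower endpoint is computed as $1 / b_{ϵ}$ rather than by subtraction, which is an implementation choice made here to avoid cancellation for large ϵ.

```python
import math


def J(x: float) -> float:
    """Explicit mismatch penalty (3)."""
    return 0.5 * (x + 1.0 / x) - 1.0


def sublevel_interval(eps: float) -> tuple[float, float]:
    """Endpoints (a_eps, b_eps) of {x > 0 : J(x) <= eps} from Lemma 3."""
    if eps <= 0.0:
        raise ValueError("eps must be positive")
    b = (1.0 + eps) + math.sqrt(eps * (2.0 + eps))
    a = 1.0 / b  # equals (1 + eps) - sqrt(eps (2 + eps)) since a * b = 1
    return a, b


a_eps, b_eps = sublevel_interval(0.25)
print(a_eps, b_eps, J(a_eps), J(b_eps))
```

For $ϵ = 0.25$ the interval is $[0.5, 2]$ , and J equals 0.25 at both endpoints, as the lemma predicts.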

4.2. Meaning Constraints from a Balanced Baseline

Theorem 1 (Scale window for meanings of low-cost configurations).

Assume

1 \in Y : = ι_{O} (O)

and choose

o_{0} \in O

with

ι_{O} (o_{0}) = 1

. Let

s \in S

and let

o \in {Mean}_{R} (s)

. Then

c_{R} (s, o) \leq c_{R} (s, o_{0}) = J (ι_{S} (s)) = J_{S} (s) .

(10)

In particular, for every

ϵ > 0

, if

J_{S} (s) \leq ϵ

then

\frac{ι_{S} (s)}{ι_{O} (o)} \in [a_{ϵ}, b_{ϵ}]

(11)

and hence

\frac{ι_{S} (s)}{b_{ϵ}} \leq ι_{O} (o) \leq \frac{ι_{S} (s)}{a_{ϵ}},

(12)

where

[a_{ϵ}, b_{ϵ}]

is as in Lemma 3.

Proof.

Since

o \in {Mean}_{R} (s)

, by definition

c_{R} (s, o) \leq c_{R} (s, o_{0})

. By admissibility (9) and

ι_{O} (o_{0}) = 1

,

c_{R} (s, o_{0}) = J (ι_{S} (s)) = J_{S} (s)

, which gives (10). If

J_{S} (s) \leq ϵ

, then (10) implies

J (ι_{S} (s) / ι_{O} (o)) \leq ϵ

, hence (11) by Lemma 3. Rearranging yields (12). □

Corollary 1 (Near-balanced configurations force near-balanced meanings).

Under the hypotheses of Theorem 1, if

J_{S} (s) \leq ϵ

and

o \in {Mean}_{R} (s)

, then

ι_{O} (o) \in [\frac{1}{b_{ϵ}^{2}}, b_{ϵ}^{2}] .

In particular, as

ϵ ↓ 0

, any meaning of an ϵ-cheap configuration (one with $J_{S} (s) \leq ϵ$ ) must satisfy

ι_{O} (o) \to 1

.

Proof.

From

J_{S} (s) = J (ι_{S} (s)) \leq ϵ

and Lemma 3, we have

ι_{S} (s) \in [a_{ϵ}, b_{ϵ}]

. Combining this with (12) and

a_{ϵ} = 1 / b_{ϵ}

gives the stated bounds. □
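The windows in Theorem 1 and Corollary 1 can be illustrated on a small finite dictionary. This is a sketch under stated assumptions: the object scales, the configuration scale, and the value of ϵ are hypothetical, the dictionary contains the scale 1 as Theorem 1 requires, and the helper names are not from the paper.

```python
import math


def J(x: float) -> float:
    """Explicit mismatch penalty (3)."""
    return 0.5 * (x + 1.0 / x) - 1.0


def sublevel_interval(eps: float) -> tuple[float, float]:
    """Endpoints (a_eps, b_eps) from Lemma 3."""
    b = (1.0 + eps) + math.sqrt(eps * (2.0 + eps))
    return 1.0 / b, b


def meaning_scales(x: float, object_scales: list[float], tol: float = 1e-12) -> list[float]:
    """Object scales minimizing J(x / y); ties within tol are all returned."""
    costs = [J(x / y) for y in object_scales]
    best = min(costs)
    return [y for y, cost in zip(object_scales, costs) if cost - best <= tol]


# Hypothetical data; the scale 1.0 is included so that Theorem 1 applies.
object_scales = [0.2, 0.8, 1.0, 1.3, 5.0]
x = 1.1      # configuration scale iota_S(s)
eps = 0.01   # any eps with J_S(s) = J(x) <= eps

a_eps, b_eps = sublevel_interval(eps)
theorem_window = (x / b_eps, x / a_eps)             # inequality (12)
corollary_window = (1.0 / b_eps**2, b_eps**2)       # Corollary 1
meanings = meaning_scales(x, object_scales)
print(meanings, theorem_window, corollary_window)
```

Here the only meaning has scale 1.0, which lies inside both windows; the Corollary 1 window is wider because it does not use the specific value of x, only the bound $J_{S} (s) \leq ϵ$ .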

4.3. Existence of Meanings Under Attainment Hypotheses

Lemma 4 (Coercivity of J).

Assume J is given by (3). Then

J (x) \to \infty

as

x \to 0^{+}

and as

x \to \infty

. In particular, for each

M \geq 0

the sublevel set

{x \in R_{> 0} : J (x) \leq M}

is compact in

R

.

Proof.

From (8),

J (x) = \frac{1}{2} (x + x^{- 1}) - 1

. As

x \to \infty

the term

\frac{1}{2} x

dominates, and as

x \to 0^{+}

the term

\frac{1}{2} x^{- 1}

dominates, so in both limits

J (x) \to \infty

. If

J (x) \leq M

then

x + x^{- 1} \leq 2 (M + 1)

, hence both x and

x^{- 1}

are bounded; the sublevel set is therefore closed and bounded away from 0 and ∞, hence compact. □

Theorem 2 (Existence of meanings for ratio-induced reference).

Assume

R

is admissible and that J is given by (3). Let

Y : = ι_{O} (O) \subset R_{> 0}

be nonempty and closed in the usual topology on

(0, \infty)

. Then for every

s \in S

there exists

o \in O

such that

{Mean}_{R} (s, o)

(equivalently,

{Mean}_{R} (s) \neq \emptyset

). Moreover, if

x : = ι_{S} (s) \in Y

, then any

o \in O

with

ι_{O} (o) = x

is a meaning and satisfies

c_{R} (s, o) = 0

.

Proof.

Fix s and set

x : = ι_{S} (s)

. Consider

f : Y \to R_{\geq 0}

defined by

f (y) : = J (x / y)

. The map f is continuous. By Lemma 4,

f (y) \to \infty

as

y \to 0^{+}

or

y \to \infty

, so the infimum of f over Y is achieved on a compact sublevel set. Concretely, choose a minimizing sequence

y_{n} \in Y

with

f (y_{n}) ↓ {inf}_{Y} f

. Coercivity implies

(y_{n})

is bounded away from 0 and ∞, hence has a convergent subsequence; since Y is closed in

(0, \infty)

, the limit

y_{*} \in Y

, and continuity gives

f (y_{*}) = {inf}_{Y} f

. Choose

o \in O

with

ι_{O} (o) = y_{*}

. Then

c_{R} (s, o) = f (y_{*}) \leq f (ι_{O} (o^{'})) = c_{R} (s, o^{'})

for all

o^{'} \in O

, i.e.,

{Mean}_{R} (s, o)

. If

x \in Y

, take

y_{*} = x

; then

J (x / x) = J (1) = 0

, so any o with

ι_{O} (o) = x

is a meaning with zero reference cost. □

Remark 1.

If

Y = ι_{O} (O)

is not closed in

(0, \infty)

, the minimum need not be attained; in that case

{Mean}_{R} (s)

may be empty even though the infimum exists.
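A concrete instance of Remark 1, offered as an illustration rather than taken from the paper: let $Y = {1 + 1 / n : n \geq 1}$ , which is not closed in $(0, \infty)$ because its limit point 1 is missing, and take a configuration with scale $x = 1$ . The costs $J (x / y)$ are positive and decrease to 0 along Y, so the infimum 0 is not attained. The sketch below evaluates a finite truncation of this sequence; the truncation length is arbitrary.

```python
def J(x: float) -> float:
    """Explicit mismatch penalty (3), written as (x - 1)^2 / (2x) for accuracy near 1."""
    return (x - 1.0) ** 2 / (2.0 * x)


def costs_along_sequence(x: float, n_max: int) -> list[float]:
    """Costs J(x / y_n) for the hypothetical object scales y_n = 1 + 1/n, n = 1..n_max."""
    return [J(x / (1.0 + 1.0 / n)) for n in range(1, n_max + 1)]


costs = costs_along_sequence(1.0, 200)
print(costs[0], costs[-1])
```

Every cost in the list is strictly positive and strictly smaller than the previous one, so no element of this Y is a minimizer; adding the point 1 to Y restores closedness and yields a meaning with zero cost, in line with Theorem 2.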

4.4. A Simple Total-Cost Benchmark

Theorem 3 (Balanced reference minimizes the intrinsic + reference sum).

Assume admissible reference (9) and intrinsic costs

J_{S} (s) = J (ι_{S} (s))

,

J_{O} (o) = J (ι_{O} (o))

. Define

C (s, o) : = J_{S} (s) + J_{O} (o) + c_{R} (s, o) .

Then

C (s, o) \geq 0

for all

(s, o) \in S \times O

, and

C (s, o) = 0 \Leftrightarrow ι_{S} (s) = 1 and ι_{O} (o) = 1 .

In particular, if there exist

s_{0} \in S

and

o_{0} \in O

with

ι_{S} (s_{0}) = ι_{O} (o_{0}) = 1

, then

(s_{0}, o_{0})

is a global minimizer of C over

S \times O

.

Proof.

Each term in C is nonnegative, hence

C \geq 0

. If

C (s, o) = 0

, then all three terms vanish; by Lemma 2 this forces

ι_{S} (s) = ι_{O} (o) = 1

. The converse is immediate from

J (1) = 0

. □
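Theorem 3 can be checked on a small grid. The sketch assumes the explicit functional (3) and a hypothetical grid of scales that are powers of 2, chosen so that the floating-point arithmetic is exact; `total_cost` is an illustrative name for the functional C.

```python
def J(x: float) -> float:
    """Explicit mismatch penalty (3)."""
    return 0.5 * (x + 1.0 / x) - 1.0


def total_cost(x: float, y: float) -> float:
    """C(s, o) = J_S(s) + J_O(o) + c_R(s, o) with x = iota_S(s), y = iota_O(o)."""
    return J(x) + J(y) + J(x / y)


# Hypothetical grid of scales (powers of 2, so every value below is exact).
grid = [0.25, 0.5, 1.0, 2.0, 4.0]
values = {(x, y): total_cost(x, y) for x in grid for y in grid}
zero_points = [pair for pair, value in values.items() if value == 0.0]
print(zero_points, min(values.values()))
```

On this grid the total cost vanishes only at the balanced pair (1, 1). Note that matched but unbalanced pairs such as (2, 2) have zero reference cost yet positive total cost, because the intrinsic terms do not vanish.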

4.5. A Backbone Window for Near-Balanced Configuration Classes

Definition 9 (Referential capacity).

Given a reference structure

R

from

S

to

O

, define the referential capacity to be

Cap (S, O; R) : = |{o \in O : \exists s \in S with o \in {Mean}_{R} (s)}| .

(If O is infinite, this cardinality may be infinite.)
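For finite configuration and object sets, the referential capacity of Definition 9 can be computed by brute force. The following sketch assumes admissible (ratio-induced) reference with the explicit functional (3); the scales are hypothetical, objects are identified by their index so that two objects sharing a scale are counted separately, and `referential_capacity` is an illustrative name.

```python
def J(x: float) -> float:
    """Explicit mismatch penalty (3)."""
    return 0.5 * (x + 1.0 / x) - 1.0


def referential_capacity(
    config_scales: list[float], object_scales: list[float], tol: float = 1e-12
) -> int:
    """Number of objects (by index) that are a meaning of at least one configuration."""
    reached: set[int] = set()
    for x in config_scales:
        costs = [J(x / y) for y in object_scales]
        best = min(costs)
        # Collect every minimizer, including ties within tol.
        reached.update(i for i, cost in enumerate(costs) if cost - best <= tol)
    return len(reached)


# Hypothetical near-balanced configurations and a four-object dictionary.
config_scales = [0.9, 1.0, 1.05, 1.15]
object_scales = [0.1, 1.0, 1.2, 10.0]
capacity = referential_capacity(config_scales, object_scales)
print(capacity)
```

In this example only the objects with scales 1.0 and 1.2 are ever selected, so the capacity is 2 even though the dictionary has four objects: the objects at 0.1 and 10.0 are too far in scale from every configuration.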

We now show that restricting to near-balanced configurations forces all attainable meanings to lie in an explicit scale window.

Theorem 4 (Backbone window for near-balanced configurations).

Let

S_{δ} = (S_{δ}, J_{δ}, ι_{δ})

be the near-balanced ratio space

S_{δ} : = {x \in R_{> 0} : | x - 1 | < δ}, ι_{δ} = id, J_{δ} {= J |}_{S_{δ}} .

Let

O = (O, J_{O}, ι_{O})

be a costed space such that

Y : = ι_{O} (O) \subset R_{> 0}

is nonempty, closed in the usual topology on

(0, \infty)

, and contains 1. Assume

R

is admissible and J is given by (3).

Assume

0 < δ < 1

and set

ϵ_{δ} : = max {J (1 - δ), J (1 + δ)} = J (1 - δ)

(the two values satisfy $J (1 - δ) = \frac{δ^{2}}{2 (1 - δ)} > \frac{δ^{2}}{2 (1 + δ)} = J (1 + δ)$ ), and let

[a_{ϵ_{δ}}, b_{ϵ_{δ}}]

be as in Lemma 3. Define the window

I_{δ} : = [\frac{1 - δ}{b_{ϵ_{δ}}}, \frac{1 + δ}{a_{ϵ_{δ}}}] .

Then

1.: For every $s \in S_{δ}$ the meaning set ${Mean}_{R} (s)$ is nonempty.
2.: If $s \in S_{δ}$ and $o \in {Mean}_{R} (s)$ , then $ι_{O} (o) \in I_{δ}$ . Equivalently, if $ι_{O} (o) \notin I_{δ}$ , then no $s \in S_{δ}$ can mean o under admissible reference.

In particular,

Cap (S_{δ}, O; R) \leq |{o \in O : ι_{O} (o) \in I_{δ}}| .

Proof.

(1) is a direct application of Theorem 2 to the closed (in

(0, \infty)

) nonempty set Y.

For (2), fix

s \in S_{δ}

and write

x : = ι_{δ} (s) \in (1 - δ, 1 + δ)

. Since J is decreasing on $(0, 1]$ and increasing on $[1, \infty)$ , we have $J (x) \leq max {J (1 - δ), J (1 + δ)} = ϵ_{δ}$ . Let

o \in {Mean}_{R} (s)

and choose

o_{0} \in O

with

ι_{O} (o_{0}) = 1

(possible since

1 \in Y

). By Theorem 1,

J (\frac{x}{ι_{O} (o)}) = c_{R} (s, o) \leq c_{R} (s, o_{0}) = J (x) \leq ϵ_{δ} .

Applying Lemma 3 gives

x / ι_{O} (o) \in [a_{ϵ_{δ}}, b_{ϵ_{δ}}]

, hence

\frac{x}{b_{ϵ_{δ}}} \leq ι_{O} (o) \leq \frac{x}{a_{ϵ_{δ}}} .

Using

x \in [1 - δ, 1 + δ]

yields

ι_{O} (o) \in I_{δ}

.

For the capacity bound, any object counted in

Cap (S_{δ}, O; R)

lies in

{Mean}_{R} (s)

for some

s \in S_{δ}

, hence satisfies

ι_{O} (o) \in I_{δ}

by (2). □

5. Compositionality

This section records two elementary composition mechanisms for reference costs: (i) product composition (independent coordinates) and (ii) sequential mediation through an intermediate space. Both are purely variational constructions: they introduce no semantic primitive beyond the cost function(s).

5.1. Product Reference and Coordinatewise Meaning

Definition 10 (Product reference).

Let

R_{1}

be a reference structure from a configuration (token) set

S_{1}

to an object set

O_{1}

, and let

R_{2}

be a reference structure from a configuration (token) set

S_{2}

to an object set

O_{2}

. Write their costs as

c_{R_{i}}

. The product reference structure

R_{1} \otimes R_{2} : S_{1} \times S_{2} \to O_{1} \times O_{2}

is defined by

c_{R_{1} \otimes R_{2}} ((s_{1}, s_{2}), (o_{1}, o_{2})) : = c_{R_{1}} (s_{1}, o_{1}) + c_{R_{2}} (s_{2}, o_{2}) .

(13)

With the product cost in hand, meaning decomposes coordinatewise; the next theorem makes this precise.

Theorem 5 (Compositionality of product meaning).

For any reference structures

R_{1}, R_{2}

and their product

R_{1} \otimes R_{2}

, and for every

(s_{1}, s_{2}) \in S_{1} \times S_{2}

, the meaning set in the product structure factorizes as the Cartesian product

{Mean}_{R_{1} \otimes R_{2}} (s_{1}, s_{2}) = {Mean}_{R_{1}} (s_{1}) \times {Mean}_{R_{2}} (s_{2}) .

Equivalently, viewing meaning as a relation

{Mean}_{R_{i}} \subseteq S_{i} \times O_{i}

, one has equality of relations inside

(S_{1} \times S_{2}) \times (O_{1} \times O_{2})

:

{Mean}_{R_{1} \otimes R_{2}} = {Mean}_{R_{1}} \times {Mean}_{R_{2}},

where the right-hand side denotes the Cartesian product relation.

Proof.

Fix

(s_{1}, s_{2}) \in S_{1} \times S_{2}

and write

A : = {Mean}_{R_{1} \otimes R_{2}} (s_{1}, s_{2}) \subseteq O_{1} \times O_{2}, A_{i} : = {Mean}_{R_{i}} (s_{i}) \subseteq O_{i} (i = 1, 2) .

By definition of the product reference structure, for every

(o_{1}^{'}, o_{2}^{'}) \in O_{1} \times O_{2}

,

c_{R_{1} \otimes R_{2}} ((s_{1}, s_{2}), (o_{1}^{'}, o_{2}^{'})) = c_{R_{1}} (s_{1}, o_{1}^{'}) + c_{R_{2}} (s_{2}, o_{2}^{'}) .

Inclusion

A \subseteq A_{1} \times A_{2}

. Let

(o_{1}, o_{2}) \in A

. Then for all

(o_{1}^{'}, o_{2}^{'}) \in O_{1} \times O_{2}

,

c_{R_{1}} (s_{1}, o_{1}) + c_{R_{2}} (s_{2}, o_{2}) \leq c_{R_{1}} (s_{1}, o_{1}^{'}) + c_{R_{2}} (s_{2}, o_{2}^{'}) .

Specializing to

o_{2}^{'} = o_{2}

gives, for all

o_{1}^{'} \in O_{1}

,

c_{R_{1}} (s_{1}, o_{1}) \leq c_{R_{1}} (s_{1}, o_{1}^{'}),

so

o_{1} \in A_{1}

. Similarly, specializing to

o_{1}^{'} = o_{1}

gives

o_{2} \in A_{2}

. Hence

(o_{1}, o_{2}) \in A_{1} \times A_{2}

.

Inclusion $A_{1} \times A_{2} \subseteq A$ . Let $o_{1} \in A_{1}$ and $o_{2} \in A_{2}$ . Then for all $o_{1}^{'} \in O_{1}$ and all $o_{2}^{'} \in O_{2}$ ,

c_{R_{1}} (s_{1}, o_{1}) \leq c_{R_{1}} (s_{1}, o_{1}^{'}), c_{R_{2}} (s_{2}, o_{2}) \leq c_{R_{2}} (s_{2}, o_{2}^{'}) .

Adding yields, for all

(o_{1}^{'}, o_{2}^{'}) \in O_{1} \times O_{2}

,

c_{R_{1}} (s_{1}, o_{1}) + c_{R_{2}} (s_{2}, o_{2}) \leq c_{R_{1}} (s_{1}, o_{1}^{'}) + c_{R_{2}} (s_{2}, o_{2}^{'}),

which is exactly the defining inequality for

(o_{1}, o_{2}) \in A

in the product structure. Thus

A_{1} \times A_{2} \subseteq A

. Combining the two inclusions gives

A = A_{1} \times A_{2}

, i.e.,

{Mean}_{R_{1} \otimes R_{2}} (s_{1}, s_{2}) = {Mean}_{R_{1}} (s_{1}) \times {Mean}_{R_{2}} (s_{2})

. □
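Because Theorem 5 holds for arbitrary reference structures, it can be checked by brute force on any finite cost tables. The sketch below fixes one pair of configurations and uses hypothetical integer costs (so that ties are exact); the labels and the helper `argmin_set` are illustrative.

```python
from itertools import product


def argmin_set(costs: dict) -> set:
    """Keys attaining the minimum value of a finite cost table."""
    best = min(costs.values())
    return {key for key, value in costs.items() if value == best}


# Hypothetical costs c_{R_1}(s_1, .) and c_{R_2}(s_2, .) for fixed s_1, s_2.
cost_1 = {"a": 3, "b": 1, "c": 1}   # two tied minimizers in the first factor
cost_2 = {"p": 2, "q": 5}

# Product cost (13): the sum of the component costs.
product_cost = {
    (o1, o2): cost_1[o1] + cost_2[o2] for o1, o2 in product(cost_1, cost_2)
}

product_meaning = argmin_set(product_cost)
factorized_meaning = set(product(argmin_set(cost_1), argmin_set(cost_2)))
print(product_meaning == factorized_meaning, sorted(product_meaning))
```

The tie in the first factor propagates to a tie in the product, which shows that the factorization is an equality of sets and not only a statement about unique minimizers.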

Corollary 2 (Existence of product meanings under the explicit mismatch cost).

Assume the explicit mismatch cost (8) and admissible reference on each component. If, for

i = 1, 2

, the object ratio set

Y_{O_{i}} : = ι_{O_{i}} (O_{i}) \subset R_{> 0}

is nonempty and closed in the usual topology on

(0, \infty)

, then for every

(s_{1}, s_{2}) \in S_{1} \times S_{2}

the product meaning set

{Mean}_{R_{1} \otimes R_{2}} (s_{1}, s_{2})

is nonempty.

Proof.

Under the stated hypotheses, Theorem 2 implies

{Mean}_{R_{i}} (s_{i}) \neq \emptyset

for each i. Pick

o_{i} \in {Mean}_{R_{i}} (s_{i})

. Then Theorem 5 yields

(o_{1}, o_{2}) \in {Mean}_{R_{1} \otimes R_{2}} (s_{1}, s_{2})

. □

5.2. Sequential Mediation

Definition 11 (Sequential reference).

Let

R_{1} : S \to M

and

R_{2} : M \to O

be reference structures. Their sequential composition

R_{2} \circ R_{1} : S \to O

is defined by the infimal convolution

c_{R_{2} \circ R_{1}} (s, o) = inf_{m \in M} [c_{R_{1}} (s, m) + c_{R_{2}} (m, o)] .

(14)

A mediator m is optimal for

(s, o)

if it attains the infimum in (14).

We next compute the optimal mediator explicitly under the canonical mismatch cost.

Theorem 6 (Geometric-mean mediator for the explicit mismatch cost).

Assume the explicit mismatch functional (8) and admissible reference for

R_{1} : S \to M

and

R_{2} : M \to O

with scale maps

ι_{S}, ι_{M}, ι_{O}

. Fix

s \in S

and

o \in O

and set

a : = ι_{S} (s)

and

c : = ι_{O} (o)

. Let

Y_{M} : = ι_{M} (M) \subset R_{> 0}

. Assume that

Y_{M}

is nonempty and closed in the usual topology on

(0, \infty)

. Set

b_{geo} : = \sqrt{a c}

and

U : = {log b : b \in Y_{M}} \subset R

. Then the infimum in (14) is attained by at least one mediator

m_{*} \in M

. Moreover, a mediator

m \in M

with

b : = ι_{M} (m)

is optimal if and only if

log b

minimizes

| log b - log b_{geo} |

over U (equivalently, b minimizes

| log (b / b_{geo}) |

over

Y_{M}

). If

b_{geo} \notin Y_{M}

, write

u_{0} : = log b_{geo}

and let

δ : = dist (u_{0}, U)

. Let

u_{*} \in U

be a closest point to

u_{0}

(so

| u_{*} - u_{0} | = δ

) and set

b_{*} : = e^{u_{*}}

. Then, for the explicit cost, the constrained optimum value admits the closed form

\begin{matrix} c_{R_{2} \circ R_{1}} (s, o) & = (cosh (\frac{1}{2} log (a / c) + δ) - 1) + (cosh (\frac{1}{2} log (a / c) - δ) - 1) \\ & = 2 cosh (\frac{1}{2} log \frac{a}{c}) cosh (δ) - 2 . \end{matrix}

In particular, the suboptimality gap relative to the unconstrained geometric mean (i.e., relative to

δ = 0

) is

c_{R_{2} \circ R_{1}} (s, o) - 2 J (\sqrt{\frac{a}{c}}) = 2 cosh (\frac{1}{2} log \frac{a}{c}) (cosh (δ) - 1) \geq 0 .

On the other hand, if

b_{geo} \in Y_{M}

, then the optimal mediator ratio is unique and equals

b_{geo}

; in that case, choosing

m_{*} \in M

with

ι_{M} (m_{*}) = b_{geo}

gives

c_{R_{2} \circ R_{1}} (s, o) = J (\frac{a}{b_{geo}}) + J (\frac{b_{geo}}{c}) = 2 J (\sqrt{\frac{a}{c}}) .

Proof.

Under admissibility, the objective in (14) depends on m only through

b : = ι_{M} (m) \in Y_{M}

, namely

F (b) : = J (\frac{a}{b}) + J (\frac{b}{c}) .

For the explicit penalty (3), one has

J (x) = cosh (log x) - 1

. Writing

t : = log a

,

s : = log c

, and

u : = log b

, we obtain

F (b) = (cosh (t - u) - 1) + (cosh (u - s) - 1) = cosh (t - u) + cosh (u - s) - 2 .

Using

cosh (α) + cosh (β) = 2 cosh (\frac{α + β}{2}) cosh (\frac{α - β}{2})

with

α = t - u

and

β = u - s

gives

F (b) = 2 cosh (\frac{t - s}{2}) cosh (u - \frac{t + s}{2}) - 2 = 2 cosh (\frac{log (a / c)}{2}) cosh (u - log b_{geo}) - 2 .

Since

cosh (\frac{log (a / c)}{2}) > 0

is constant and cosh is even and strictly increasing on

[0, \infty)

, minimizing F over

b \in Y_{M}

is equivalent to minimizing

| u - log b_{geo} |

over

u \in U = log Y_{M}

. Because

log : (0, \infty) \to R

is a homeomorphism and

Y_{M}

is closed and nonempty, the set U is closed and nonempty in

R

, hence the distance function

u \mapsto | u - log b_{geo} |

attains its minimum on U. This proves existence of at least one minimizer

u_{*} \in U

, and the stated characterization of optimal ratios. If

b_{geo} \in Y_{M}

(equivalently

log b_{geo} \in U

), then the unique minimizer of

u \mapsto | u - log b_{geo} |

on U is

u = log b_{geo}

, hence the optimal mediator ratio is unique and equals

b_{geo}

. Substituting

b_{geo} = \sqrt{a c}

yields

J (a / b_{geo}) = J (\sqrt{a / c}) = J (b_{geo} / c)

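and the displayed formula. In the general case, the factored expression for F depends on u only through

| u - log b_{geo} |

, which equals δ at any minimizer

u_{*}

; substituting gives the closed form for the constrained optimum, and subtracting its value at

δ = 0

gives the suboptimality gap. □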
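
As a numerical illustration of Theorem 6, the following Python sketch treats a finite mediator ratio set that does not contain the geometric mean. The function names and the sample values of a, c, and the mediator set are assumptions made here for illustration and are not part of the paper. The sketch selects the ratio closest to the geometric mean in log-distance and compares a brute-force minimum of F with the closed form.

```python
import math

def J(x: float) -> float:
    """Explicit mismatch penalty J(x) = (x + 1/x)/2 - 1 for x > 0."""
    return 0.5 * (x + 1.0 / x) - 1.0

def sequential_cost(a: float, b: float, c: float) -> float:
    """F(b) = J(a/b) + J(b/c): cost of routing through a mediator of ratio b."""
    return J(a / b) + J(b / c)

def best_mediator_ratio(a: float, c: float, mediator_ratios):
    """Return (b_star, delta): the ratio nearest to sqrt(a*c) in log-distance."""
    u0 = 0.5 * (math.log(a) + math.log(c))  # log of the geometric mean
    b_star = min(mediator_ratios, key=lambda b: abs(math.log(b) - u0))
    return b_star, abs(math.log(b_star) - u0)

# Hypothetical data: b_geo = sqrt(4 * 0.25) = 1 is NOT in the mediator set.
a, c = 4.0, 0.25
Y_M = [0.1, 0.5, 3.0, 10.0]

b_star, delta = best_mediator_ratio(a, c, Y_M)
brute_force = min(sequential_cost(a, b, c) for b in Y_M)
closed_form = 2.0 * math.cosh(0.5 * math.log(a / c)) * math.cosh(delta) - 2.0
gap = closed_form - 2.0 * J(math.sqrt(a / c))  # suboptimality gap vs. delta = 0
```

With these sample values the selected ratio is 1/2, at log-distance log 2 from the geometric mean, and both computations give 53/16; the gap relative to the unconstrained optimum 9/4 is 17/16.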

Corollary 3 (Mediation can strictly reduce mismatch).

For every

x > 0

one has

2 J (\sqrt{x}) \leq J (x),

with equality if and only if

x = 1

. Consequently, in the setting of Theorem 6, if

b_{geo} \in Y_{M}

and a direct admissible reference

R : S \to O

is available (built from the same J and scale maps), then

c_{R_{2} \circ R_{1}} (s, o) \leq c_{R} (s, o),

with equality if and only if

ι_{S} (s) = ι_{O} (o)

.

Proof.

Let

t : = \sqrt{x} > 0

. Using (3), a direct calculation gives

J (t^{2}) - 2 J (t) = \frac{1}{2} ({(t - 1)}^{2} + {(t^{- 1} - 1)}^{2}) \geq 0,

with equality if and only if

t = 1

, i.e.,

x = 1

. If

b_{geo} \in Y_{M}

, Theorem 6 gives

c_{R_{2} \circ R_{1}} (s, o) = 2 J (\sqrt{x})

with

x = ι_{S} (s) / ι_{O} (o)

; comparing with

c_{R} (s, o) = J (x)

yields the stated inequality together with its equality case. □
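
The "direct calculation" in the proof can be checked in exact rational arithmetic. The sketch below is illustrative only; the helper names and the sample points are assumptions introduced here. It verifies the sum-of-squares identity and the equality case at t = 1 on a handful of rational values.

```python
from fractions import Fraction

def J(x: Fraction) -> Fraction:
    """Explicit mismatch penalty J(x) = (x + 1/x)/2 - 1, exact on rationals."""
    return (x + 1 / x) / 2 - 1

def mediation_gain(t: Fraction) -> Fraction:
    """J(t^2) - 2 J(t): direct cost minus optimally mediated cost for x = t^2."""
    return J(t * t) - 2 * J(t)

def sum_of_squares(t: Fraction) -> Fraction:
    """Right-hand side of the identity: ((t - 1)^2 + (1/t - 1)^2) / 2."""
    return ((t - 1) ** 2 + (1 / t - 1) ** 2) / 2

# Hypothetical sample points, including the equality case t = 1.
samples = [Fraction(1, 5), Fraction(1, 2), Fraction(1), Fraction(3, 2), Fraction(4)]
identity_holds = all(mediation_gain(t) == sum_of_squares(t) for t in samples)
zero_only_at_one = all((mediation_gain(t) == 0) == (t == 1) for t in samples)
```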

6. Extensions: Multi-Dimensional Scales and Robustness

The core framework above uses a single positive scale coordinate

ι (\cdot) \in R_{> 0}

. In some applications one may want a finite list of independent scale coordinates (for instance, a configuration might carry multiple features, each measured in the same “cost currency” through J). This section records a minimal extension of the model to d coordinates and a simple robustness lemma for finite dictionaries.

6.1. Multi-Dimensional Costed Spaces

Definition 12 (Multi-dimensional costed space).

Let

d \in N

. A d-dimensional costed space is a triple

(C, J_{C}, ι_{C})

where

C is a set,
$ι_{C} : C \to {(R_{> 0})}^{d}$ is a scale map, and
$J_{C} : C \to R_{\geq 0}$ is the induced (separable) cost

$J_{C} (c) : = \sum_{i = 1}^{d} J (ι_{C} {(c)}_{i}), c \in C .$

We extend admissible reference by taking a separable, coordinatewise ratio penalty.

Definition 13 (Multi-dimensional admissible reference).

Let

(S, J_{S}, ι_{S})

and

(O, J_{O}, ι_{O})

be d-dimensional costed spaces. A reference structure

R

from S to O is multi-dimensionally admissible if its reference cost is the coordinatewise ratio cost

c_{R} (s, o) = \sum_{i = 1}^{d} J (\frac{ι_{S} {(s)}_{i}}{ι_{O} {(o)}_{i}}), (s, o) \in S \times O .

(15)

The separable form immediately implies that meanings factor coordinatewise.

Corollary 4 (Coordinatewise meaning for product models).

Assume

S = \prod_{i = 1}^{d} S_{i}

and

O = \prod_{i = 1}^{d} O_{i}

and that the scale maps factor coordinatewise:

ι_{S} {(s)}_{i} = ι_{S_{i}} (s_{i})

and

ι_{O} {(o)}_{i} = ι_{O_{i}} (o_{i})

. If

R

is multi-dimensionally admissible, then

(o_{1}, \dots, o_{d}) \in {Mean}_{R} (s_{1}, \dots, s_{d}) ⟺ \forall i, o_{i} \in {Mean}_{R_{i}} (s_{i}),

where

R_{i}

denotes the induced one-dimensional admissible reference on

(S_{i}, O_{i})

.

Proof.

By (15) the cost is a separable sum of d nonnegative terms, each depending only on

(s_{i}, o_{i})

. Thus, minimizing over

O = \prod_{i} O_{i}

is equivalent to minimizing each summand over its coordinate; this is the same argument as in Theorem 5. □
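
The factorization in Corollary 4 can be observed on a small example. The following sketch is a hedged illustration: the two coordinate dictionaries, the configuration ratios, and the helper `argmin_set` are assumptions chosen here, not objects from the paper. It compares the argmin over the product dictionary with the Cartesian product of the coordinatewise argmins, including a tie in the second coordinate.

```python
from itertools import product

def J(x: float) -> float:
    """Explicit mismatch penalty J(x) = (x + 1/x)/2 - 1 for x > 0."""
    return 0.5 * (x + 1.0 / x) - 1.0

def argmin_set(costs: dict, tol: float = 1e-12) -> set:
    """All keys whose cost is within tol of the minimum (floating-point ties)."""
    best = min(costs.values())
    return {k for k, v in costs.items() if v - best <= tol}

# Hypothetical two-coordinate model: object ratios available in each coordinate.
object_ratios = [[0.5, 1.0, 2.0], [0.25, 4.0]]
x = (1.6, 1.0)  # configuration ratios; the second coordinate sits on a tie

# Joint argmin over the product dictionary, using the separable cost (15).
joint_costs = {
    y: sum(J(xi / yi) for xi, yi in zip(x, y)) for y in product(*object_ratios)
}
joint = argmin_set(joint_costs)

# Coordinatewise argmins, then their Cartesian product.
per_coord = [
    argmin_set({yi: J(xi / yi) for yi in ratios})
    for xi, ratios in zip(x, object_ratios)
]
factored = set(product(*per_coord))
```

In this example both constructions return the same two-element set, because the tie in the second coordinate is inherited by the product.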

6.2. Log-Space Geometry for the Explicit Mismatch Cost

In this subsection, we specialize to the explicit mismatch functional

J (x) = \frac{1}{2} (x + x^{- 1}) - 1 (x > 0),

(16)

already used in Sections 2–5.

Lemma 5 (Log-coordinate form).

For all

t \in R

one has

J (e^{t}) = cosh (t) - 1

.

Proof.

Immediate from (16):

J (e^{t}) = \frac{1}{2} (e^{t} + e^{- t}) - 1 = cosh (t) - 1

. □

Proposition 4 (Quadratic regime with explicit remainder).

For all

t \in R

,

0 \leq J (e^{t}) - \frac{t^{2}}{2} \leq \frac{t^{4}}{24} cosh (| t |) .

In particular, for

| t | \leq 1

,

\frac{t^{2}}{2} \leq J (e^{t}) \leq \frac{t^{2}}{2} + \frac{cosh (1)}{24} t^{4} .

Proof.

By Lemma 5 it suffices to estimate

cosh (t) - 1 - \frac{1}{2} t^{2}

. Taylor’s theorem at 0 with remainder gives

cosh (t) = 1 + \frac{t^{2}}{2} + \frac{t^{4}}{24} cosh (ξ)

for some

ξ

between 0 and t. Since cosh is even and increasing on

R_{\geq 0}

, one has

cosh (ξ) \leq cosh (| t |)

, yielding the upper bound. Nonnegativity follows since

cosh (ξ) > 0

. □
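
A quick numerical sanity check of the two-sided bound in Proposition 4 is sketched below. The grid of test points and the small floating-point tolerance are assumptions of this illustration; the check is not a substitute for the proof.

```python
import math

def J(x: float) -> float:
    """Explicit mismatch penalty J(x) = (x + 1/x)/2 - 1 for x > 0."""
    return 0.5 * (x + 1.0 / x) - 1.0

def quadratic_remainder(t: float) -> float:
    """J(e^t) - t^2/2, the quantity bounded in Proposition 4."""
    return J(math.exp(t)) - 0.5 * t * t

def remainder_bound(t: float) -> float:
    """Upper bound t^4/24 * cosh(|t|) from the Lagrange remainder."""
    return t ** 4 / 24.0 * math.cosh(abs(t))

# Hypothetical grid on [-3, 3]; tol absorbs rounding in the subtraction.
tol = 1e-12
grid = [k / 10.0 for k in range(-30, 31)]
bounds_hold = all(
    -tol <= quadratic_remainder(t) <= remainder_bound(t) + tol for t in grid
)
```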

Corollary 5 (Local Euclidean geometry in log-ratio).

For the explicit mismatch cost (16), set

x : = ι_{S} (s)

and

y : = ι_{O} (o)

. If

| log (x / y) | \leq 1

, then

\frac{1}{2} {(log (x / y))}^{2} \leq c_{R} (s, o) \leq \frac{1}{2} {(log (x / y))}^{2} + \frac{cosh (1)}{24} {(log (x / y))}^{4} .

Thus, in the small-mismatch regime, meanings behave like nearest neighbors in the log-ratio metric.

Proof.

For admissible reference,

c_{R} (s, o) = J (x / y)

with

x : = ι_{S} (s)

and

y : = ι_{O} (o)

. Write

t : = log (x / y)

. Then

x / y = e^{t}

and

| t | \leq 1

by hypothesis. Apply Proposition 4 to obtain

\frac{1}{2} t^{2} \leq J (e^{t}) \leq \frac{1}{2} t^{2} + \frac{cosh (1)}{24} t^{4}

, and substitute

t = log (x / y)

. □

6.3. Margin Stability for Finite Dictionaries

Definition 14 (Decision margin).

Fix a configuration

s \in S

and a finite object dictionary

O = {o_{1}, \dots, o_{N}}

. Write

C_{k} : = c_{R} (s, o_{k})

and let

M : = {min}_{1 \leq k \leq N} C_{k}

. The decision margin at s is

Δ (s) : = min {C_{k} - M : 1 \leq k \leq N, C_{k} > M} \in (0, \infty],

with the convention

Δ (s) = \infty

if all

C_{k}

are equal.

The margin parameter controls how stable the argmin is under perturbations of the cost values.

Proposition 5 (Robustness under bounded perturbations).

In the setting of Definition 14, suppose the costs

C_{k}

are perturbed to numbers

{\tilde{C}}_{k}

satisfying

max_{1 \leq k \leq N} | {\tilde{C}}_{k} - C_{k} | \leq η .

If

Δ (s) > 2 η

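, then no new minimizer can appear:

{k : {\tilde{C}}_{k} = min_{j} {\tilde{C}}_{j}} \subset {k : C_{k} = min_{j} C_{j}} .

In particular, if the original minimizer is unique, then the set of minimizers is unchanged. (If several original minimizers tie, a perturbation may break the tie, so only the inclusion holds in general.)

Proof.

Let

I : = {k : C_{k} = M}

be the (nonempty) set of original minimizers. For

k \in I

one has

{\tilde{C}}_{k} \leq M + η

. If

k \notin I

, then

C_{k} \geq M + Δ (s)

by definition of

Δ (s)

, hence

{\tilde{C}}_{k} \geq M + Δ (s) - η

. If

Δ (s) > 2 η

then

M + Δ (s) - η > M + η

, so every index outside I has a strictly larger perturbed cost than every index in I, and every perturbed minimizer must lie in I. If I is a singleton, the nonempty perturbed argmin therefore equals I. □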
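
Definition 14 and Proposition 5 can be illustrated on a three-entry cost vector with a unique minimizer. The cost values, the perturbation, and the helper names below are hypothetical choices made for this sketch.

```python
import math

def decision_margin(costs):
    """Gap between the minimum cost and the next distinct cost (inf if all equal)."""
    m = min(costs)
    gaps = [c - m for c in costs if c > m]
    return min(gaps) if gaps else math.inf

def argmin_indices(costs):
    """Set of (0-based) indices attaining the minimum cost."""
    m = min(costs)
    return {k for k, c in enumerate(costs) if c == m}

# Hypothetical costs with a unique minimizer at index 1.
costs = [0.82, 0.08, 0.52]
eta = 0.2
# Each entry is moved by at most eta, in the direction least favorable to index 1.
perturbed = [0.82 - 0.2, 0.08 + 0.2, 0.52 - 0.15]

margin = decision_margin(costs)            # about 0.44, which exceeds 2 * eta
original_argmin = argmin_indices(costs)
perturbed_argmin = argmin_indices(perturbed)
```

Here the margin exceeds twice the perturbation size, and the minimizing index is the same before and after the perturbation.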

6.4. Existence (And Optional Uniqueness) in d Dimensions

Here we discuss the multi-dimensional analogue of Theorem 2; it follows by the same attainment argument under the multi-dimensional admissibility and closedness hypotheses.

Corollary 6.

Let

d \in N

and let

(S, J_{S}, ι_{S})

and

(O, J_{O}, ι_{O})

be d-dimensional costed spaces. Assume

R

is multi-dimensionally admissible in the sense of Definition 13. Let

Y : = ι_{O} (O) \subset {(R_{> 0})}^{d}

be nonempty and closed in the usual topology on

{(0, \infty)}^{d}

. Then for every

s \in S

the meaning set

{Mean}_{R} (s)

is nonempty. Moreover, if

x : = ι_{S} (s)

lies in Y, then any

o \in O

with

ι_{O} (o) = x

is a meaning and satisfies

c_{R} (s, o) = 0

.

Proof.

Fix

s \in S

and write

x : = ι_{S} (s) \in {(R_{> 0})}^{d}

. Consider the continuous objective on Y,

F_{x} (y) : = \sum_{i = 1}^{d} J (\frac{x_{i}}{y_{i}}), y = (y_{1}, \dots, y_{d}) \in Y .

By Lemma 4, for each

M \geq 0

the one-dimensional sublevel set

K_{M} : = {z > 0 : J (z) \leq M}

is compact. Hence there exist

0 < a_{M} \leq 1 \leq b_{M} < \infty

such that

K_{M} \subset [a_{M}, b_{M}]

. If

F_{x} (y) \leq M

then each term satisfies

J (x_{i} / y_{i}) \leq M

, so

x_{i} / y_{i} \in K_{M} \subset [a_{M}, b_{M}]

, i.e.,

\frac{x_{i}}{b_{M}} \leq y_{i} \leq \frac{x_{i}}{a_{M}} (i = 1, \dots, d) .

Therefore, the sublevel set

{y \in Y : F_{x} (y) \leq M}

is closed and contained in the bounded box

\prod_{i} [x_{i} / b_{M}, x_{i} / a_{M}]

, so it is compact (Heine–Borel). Thus,

F_{x}

attains its minimum on Y at some

y_{*} \in Y

. Choose

o \in O

with

ι_{O} (o) = y_{*}

; then

o \in {Mean}_{R} (s)

by (15).

If

x \in Y

, then

F_{x} (x) = \sum_{i} J (1) = 0

. Since each term is nonnegative, 0 is the global minimum, so any o with

ι_{O} (o) = x

is a meaning. □

Definition 15 (Log-image and log-convexity).

For

Y \subset {(R_{> 0})}^{d}

define

log Y : = {(log y_{1}, \dots, log y_{d}) : y \in Y} \subset R^{d} .

We call Y log-convex if

log Y

is convex.

When the log-image of the dictionary is convex, strict convexity yields uniqueness and continuity of the optimizer.

Theorem 7 (Uniqueness and continuity for log-convex dictionaries).

Assume the explicit mismatch cost (16) and the hypotheses of Corollary 6. If

U : = log Y \subset R^{d}

is closed and convex, then the minimizer

y_{*} (x) \in Y

of

F_{x}

is unique. Equivalently, the meaning set

{Mean}_{R} (s)

equals the fiber

{o \in O : ι_{O} (o) = y_{*} (ι_{S} (s))}

. Moreover, the optimizer is continuous in log-coordinates: the map

t \mapsto u_{*} (t)

is continuous, where

t : = log x

and

u_{*} (t) : = log y_{*} (e^{t}) \in U

.

Proof.

Let

t : = log x \in R^{d}

and write

u : = log y \in U

. By Lemma 5,

F_{x} (y) = \sum_{i = 1}^{d} (cosh (t_{i} - u_{i}) - 1) = : G_{t} (u) .

For each i, the map

u_{i} \mapsto cosh (t_{i} - u_{i}) - 1

is strictly convex, hence

G_{t}

is strictly convex on

R^{d}

. Restricting to the convex set U preserves strict convexity, so

G_{t}

has at most one minimizer on U; existence follows from Corollary 6. Thus, the optimizer

u_{*} (t)

is unique, and so is

y_{*} (x) = e^{u_{*} (log x)}

.

For continuity, let

t_{n} \to t

and set

u_{n} : = u_{*} (t_{n}) \in U

. Fix

u_{0} \in U

. Since

u_{n}

minimizes

G_{t_{n}}

on U, one has

G_{t_{n}} (u_{n}) \leq G_{t_{n}} (u_{0})

. The right-hand side is bounded because

(t, u) \mapsto G_{t} (u)

is continuous and

t_{n} \to t

. As in the sublevel-set argument in the proof of Corollary 6, boundedness of

G_{t_{n}} (u_{n})

implies boundedness of

{u_{n}}

in

R^{d}

. Passing to a convergent subsequence (still denoted

u_{n}

) with limit

\bar{u} \in U

(closedness), continuity gives

G_{t} (\bar{u}) = {lim}_{n} G_{t_{n}} (u_{n}) \leq {lim}_{n} G_{t_{n}} (u) = G_{t} (u)

for all

u \in U

. Hence

\bar{u}

minimizes

G_{t}

on U, and by uniqueness

\bar{u} = u_{*} (t)

. Since the sequence is bounded and every convergent subsequence has this same limit,

u_{n} \to u_{*} (t)

and continuity holds. □
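
For one concrete log-convex dictionary, suppose the log-image is a closed axis-aligned box. This is an assumption made here for illustration; the theorem covers any closed convex log-image. Because the objective is separable and each term increases with the distance between the two log-coordinates, the unique minimizer is the coordinatewise clamp of t to the box, which is continuous in t. The sketch below, with hypothetical box bounds and helper names, compares the clamp with a grid search.

```python
import math

def G(t, u):
    """Log-coordinate objective G_t(u) = sum_i (cosh(t_i - u_i) - 1)."""
    return sum(math.cosh(ti - ui) - 1.0 for ti, ui in zip(t, u))

def clamp_to_box(t, lower, upper):
    """Coordinatewise projection of t onto the box prod_i [lower_i, upper_i]."""
    return tuple(min(max(ti, lo), hi) for ti, lo, hi in zip(t, lower, upper))

# Hypothetical box U = [-1, 1] x [0, 2] in log-coordinates.
lower, upper = (-1.0, 0.0), (1.0, 2.0)
t = (1.7, -0.4)
u_star = clamp_to_box(t, lower, upper)

# Grid search over U with step 0.05 in each coordinate, as a cross-check.
grid = [
    (lower[0] + 0.05 * i, lower[1] + 0.05 * j)
    for i in range(41)
    for j in range(41)
]
grid_best = min(grid, key=lambda u: G(t, u))
clamp_is_optimal = all(G(t, u_star) <= G(t, u) + 1e-12 for u in grid)
```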

7. Worked Examples

This section gives explicit computations in simple settings. The purpose is not to add new axioms but to make the definition of meaning

{Mean}_{R} (s) = {arg min}_{o \in O} J (ι_{S} (s) / ι_{O} (o))

concrete and to illustrate the decision-geometry proved earlier.

7.1. Continuous Ratio Model

Proposition 6 (Meaning in the continuous ratio model).

Let

S = O = R_{> 0}

with

ι_{S} = ι_{O} = id

and intrinsic costs

J_{S} = J_{O} = J

. Let

R

be admissible (Definition 6), so that

c_{R} (s, o) = J (\frac{s}{o}) .

Then for every

s \in R_{> 0}

there exists a unique meaning, namely

{Mean}_{R} (s) = {s}

, and the minimum reference cost equals 0.

Proof.

By Lemma 2, one has

J (x) \geq 0

for all

x > 0

with equality if and only if

x = 1

. Hence

c_{R} (s, o) = J (s / o) \geq 0

with equality if and only if

s / o = 1

, i.e.,

o = s

. Therefore,

o = s

is the unique minimizer and the minimum cost is 0. □

7.2. Finite Dictionaries and Boundary Points

Example 4 (Finite object dictionary).

Let

O = {o_{1}, \dots, o_{n}}

be finite, set

y_{i} : = ι_{O} (o_{i})

, and keep

S = R_{> 0}

with

ι_{S} = id

. Under admissible reference, for a given configuration s with ratio

x : = ι_{S} (s)

the meaning set is

{Mean}_{R} (s) = \{o_{i} : J (\frac{x}{y_{i}}) = min_{1 \leq j \leq n} J (\frac{x}{y_{j}})\} .

In general, boundary points (where the meaning set is not a singleton) occur when two or more of the values

J (x / y_{i})

tie.
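
The meaning set of Example 4 is a plain argmin with ties, which can be sketched as follows. The dictionary ratios and the function name `meaning_set` are assumptions of this illustration; exact rational arithmetic is used so that ties are detected without a tolerance.

```python
from fractions import Fraction

def J(x: Fraction) -> Fraction:
    """Explicit mismatch penalty J(x) = (x + 1/x)/2 - 1, exact on rationals."""
    return (x + 1 / x) / 2 - 1

def meaning_set(x: Fraction, ratios):
    """0-based indices i minimizing J(x / y_i); all tied indices are returned."""
    costs = [J(x / y) for y in ratios]
    best = min(costs)
    return [i for i, c in enumerate(costs) if c == best]

# Hypothetical dictionary ratios y_1 < y_2 < y_3.
ratios = [Fraction(1, 4), Fraction(1), Fraction(4)]
interior = meaning_set(Fraction(3, 2), ratios)  # a single meaning
boundary = meaning_set(Fraction(2), ratios)     # two costs tie at x = 2
```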

7.3. Geometric-Mean Boundaries for the Explicit Mismatch Cost

Theorem 8 (Geometric-mean decision boundaries for the explicit mismatch cost).

Assume the explicit mismatch functional (3) and admissible (ratio-induced) reference

c_{R} (s, o) = J (ι_{S} (s) / ι_{O} (o))

. Let

O = {o_{1}, \dots, o_{N}}

be a finite object set such that the ratios

y_{i} : = ι_{O} (o_{i})

are pairwise distinct and ordered

0 < y_{1} < \dots < y_{N}

. For

x : = ι_{S} (s) \in R_{> 0}

, define the boundary points

m_{i} : = \sqrt{y_{i} y_{i + 1}} (i = 1, \dots, N - 1),

and set

m_{0} : = 0

,

m_{N} : = + \infty

. Then

If $m_{k - 1} < x < m_{k}$ for some $k \in {1, \dots, N}$ , then $o_{k}$ is the unique meaning of s.
If $x = m_{k}$ for some $k \in {1, \dots, N - 1}$ , then s has exactly two meanings, namely $o_{k}$ and $o_{k + 1}$ .

Equivalently, the map

x \mapsto {arg min}_{i} J (x / y_{i})

is piecewise constant on the open intervals

(m_{k - 1}, m_{k})

.

Proof.

Using (3) one computes, for each i,

c_{R} (s, o_{i}) = J (\frac{x}{y_{i}}) = \frac{{(\frac{x}{y_{i}} - 1)}^{2}}{2 (x / y_{i})} = \frac{{(x - y_{i})}^{2}}{2 x y_{i}} .

Fix

i \in {1, \dots, N - 1}

and define the adjacent difference

Δ_{i} (x) : = c_{R} (s, o_{i + 1}) - c_{R} (s, o_{i}) .

Multiplying by

2 x > 0

and simplifying gives

2 x Δ_{i} (x) = (y_{i + 1} - y_{i}) (1 - \frac{x^{2}}{y_{i} y_{i + 1}}) .

Hence

Δ_{i} (x) = 0

if and only if

x^{2} = y_{i} y_{i + 1}

, i.e.,

x = m_{i}

. Moreover,

Δ_{i} (x) > 0

when

x < m_{i}

and

Δ_{i} (x) < 0

when

x > m_{i}

. Therefore,

if $x < m_{i}$ , then $c_{R} (s, o_{i}) < c_{R} (s, o_{i + 1})$ (so the adjacent comparison favors $o_{i}$ ),
if $x > m_{i}$ , then $c_{R} (s, o_{i + 1}) < c_{R} (s, o_{i})$ (so it favors $o_{i + 1}$ ).

Fix

k \in {1, \dots, N}

such that

m_{k - 1} < x < m_{k}

. For every

i \leq k - 1

we have

x > m_{i}

, hence

Δ_{i} (x) < 0

, so

c_{R} (s, o_{i + 1}) < c_{R} (s, o_{i})

. Iterating these strict inequalities yields

c_{R} (s, o_{k}) < c_{R} (s, o_{i})

for all

i < k

. For every

i \geq k

, we have

x < m_{i}

, hence

Δ_{i} (x) > 0

, so

c_{R} (s, o_{i + 1}) > c_{R} (s, o_{i})

. Iterating yields

c_{R} (s, o_{k}) < c_{R} (s, o_{j})

for all

j > k

. Therefore,

o_{k}

is the unique minimizer.

If

x = m_{k}

for some

k \in {1, \dots, N - 1}

, then for every

i < k

we still have

x > m_{i}

and the costs strictly decrease up to index k, while for every

i \geq k + 1

we have

x < m_{i}

and the costs strictly increase from index

k + 1

onward. At

i = k

, one has

Δ_{k} (m_{k}) = 0

, i.e.,

c_{R} (s, o_{k}) = c_{R} (s, o_{k + 1})

. Hence the argmin consists of exactly two meanings,

{o_{k}, o_{k + 1}}

. □

Corollary 7 (Stability away from boundaries).

Under the hypotheses of Theorem 8, if

m_{k - 1} < x < m_{k}

then there exists

δ > 0

such that every

x^{'}

with

| x^{'} - x | < δ

satisfies

m_{k - 1} < x^{'} < m_{k}

and hence has the same unique meaning

o_{k}

.

Proof.

Since

(m_{k - 1}, m_{k})

is open and contains x, choose

δ : = min {x - m_{k - 1}, m_{k} - x} / 2 > 0

. Then

| x^{'} - x | < δ

implies

x^{'} \in (m_{k - 1}, m_{k})

, and the conclusion follows from Theorem 8. □

Finite local resolution and discrete meaning cells.

The emergence of stable decision regions around geometric means (Theorem 8) provides a concrete realization of the Finite Local Resolution axiom of Recognition Geometry ([10], Axiom 4 (RG3)). While classical geometry typically assumes the idealization of infinite measurement precision, Recognition Geometry posits that local distinguishing power is always finite ([10], Axiom 4 (RG3)). Our results show that, under a cost minimization dynamic with a finite dictionary

ι_{O} (O) = {y_{1}, \dots, y_{N}}

, this discreteness emerges naturally: the continuous ratio axis

R_{> 0}

is partitioned into open intervals on which the argmin is constant, separated by the discrete boundary set of geometric means

{\sqrt{y_{i} y_{i + 1}}}

. In particular, meanings form discrete stable cells with a positive stability margin away from boundaries (Corollary 7).

7.4. Numerical Micro-Example (Three-Object Dictionary)

Take

O = {o_{1}, o_{2}, o_{3}}

with ratios

y_{1} = \frac{1}{4} < y_{2} = 1 < y_{3} = 4

, and keep

S = R_{> 0}

with

ι_{S} = id

. The boundary points are

m_{1} = \sqrt{y_{1} y_{2}} = \frac{1}{2}

and

m_{2} = \sqrt{y_{2} y_{3}} = 2

. Thus, a configuration with ratio x means

o_{1}

for

0 < x < \frac{1}{2}

, means

o_{2}

for

\frac{1}{2} < x < 2

, and means

o_{3}

for

x > 2

(with ties at the boundary points).

x	$c_{R} (s, o_{1})$	$c_{R} (s, o_{2})$	$c_{R} (s, o_{3})$	meaning(s)
$\frac{3}{10}$	$\frac{1}{60}$	$\frac{49}{60}$	$\frac{1369}{240}$	$o_{1}$
$\frac{3}{2}$	$\frac{25}{12}$	$\frac{1}{12}$	$\frac{25}{48}$	$o_{2}$
3	$\frac{121}{24}$	$\frac{2}{3}$	$\frac{1}{24}$	$o_{3}$
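
The table entries can be reproduced exactly, and the boundary rule of Theorem 8 can be applied without square roots by comparing the square of x with the product of adjacent ratios. The sketch below is illustrative; the function names are assumptions introduced here.

```python
from fractions import Fraction

def cost(x: Fraction, y: Fraction) -> Fraction:
    """c_R(s, o) = J(x / y) = (x - y)^2 / (2 x y) for the explicit penalty."""
    return (x - y) ** 2 / (2 * x * y)

def meaning_by_boundaries(x: Fraction, ratios):
    """1-based indices of the meanings, via comparisons of x^2 with y_i * y_{i+1}.

    `ratios` must be strictly increasing. Comparing squares keeps the test
    exact for rational inputs.
    """
    k = 1
    for i in range(len(ratios) - 1):
        boundary_sq = ratios[i] * ratios[i + 1]  # square of the boundary point
        if x * x > boundary_sq:
            k = i + 2  # x lies strictly to the right of this boundary
        elif x * x == boundary_sq:
            return [i + 1, i + 2]  # x sits exactly on this boundary
    return [k]

ratios = [Fraction(1, 4), Fraction(1), Fraction(4)]
rows = {
    x: ([cost(x, y) for y in ratios], meaning_by_boundaries(x, ratios))
    for x in (Fraction(3, 10), Fraction(3, 2), Fraction(3))
}
```

Evaluating `meaning_by_boundaries(Fraction(2), ratios)` returns both adjacent indices, matching the tie at the boundary point 2.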

Example 5 (Mediation can sharply reduce cost in a toy case).

Let

a : = ι_{S} (s) = 4

and

c : = ι_{O} (o) = \frac{1}{4}

, so the direct admissible reference cost is

J (a / c) = J (16) = \frac{225}{32}

. If the mediator space contains a configuration m with ratio

b_{geo} : = \sqrt{a c} = 1

, then Theorem 6 gives an optimal sequential cost

c_{R_{2} \circ R_{1}} (s, o) = 2 J (\sqrt{\frac{a}{c}}) = 2 J (4) = \frac{9}{4},

which is strictly smaller, in accordance with Corollary 3.

8. Applications

This section collects immediate, checkable consequences of the formal development, together with short interpretive remarks. Each statement below follows from earlier definitions and theorems; no empirical or metaphysical claim is made beyond the stated axioms. The meaning rule is an optimization rule (Definition 7) driven by the canonical mismatch penalty J (Definition 2); the axiomatic characterization of J is classical and recorded for completeness in Appendix A.

8.1. Symbol Grounding as a Criterion

We treat “grounding” as an internal consistency condition in this model: a token s is grounded for an object o when (i) o is a meaning of s (Definition 7) and (ii) the symbol condition

J_{S} (s) < J_{O} (o)

holds (Definition 8).

Corollary 8 (Grounding criterion under admissible reference).

Fix an admissible reference structure

R

(Definition 6). Then, for

s \in S

and

o \in O

,

(s, o) is a symbol (Definition 8) ⟺ o \in {Mean}_{R} (s) and J_{S} (s) < J_{O} (o) .

Proof.

This is immediate from Definition 8 and Definition 7. □

Corollary 9 (Grounding rule for finite object dictionaries).

Assume the finite-dictionary hypotheses of Theorem 8. As the configuration ratio

x = ι_{S} (s)

varies, the meaning set

{Mean}_{R} (s)

is piecewise constant: it is a singleton on each interval

(m_{i - 1}, m_{i})

and can change only at the geometric-mean boundaries

m_{i} = \sqrt{y_{i} y_{i + 1}}

. In particular, away from the boundaries the meaning is stable under small perturbations (Corollary 7).

Proof.

Immediate from Theorem 8 and Corollary 7. □

8.2. Mathematical Effectiveness via Low-Cost Primitives

The next corollary records a purely internal “near-balance” restriction: if a configuration has small intrinsic cost, then any of its meanings must lie in the corresponding low-mismatch window determined by the sublevel sets of J.

Corollary 10 (Near-balance restricts possible referents).

Assume

R

is admissible and that the hypotheses of Theorem 4 hold. If

s \in S

satisfies

J_{S} (s) \leq ϵ

, and if o is a meaning of s, then

J (\frac{ι_{S} (s)}{ι_{O} (o)}) \leq ϵ,

so

ι_{O} (o)

must lie in the corresponding bounded sublevel window determined by ϵ (as in Theorem 4).

Proof.

This is a direct restatement of Theorem 4. □

Remark 2 (Compositional “range expansion” (model-dependent)).

In a continuous ratio model where ratios can be realized densely (e.g.,

S = O = R_{> 0}

with

ι = id

as in Proposition 6), large mismatches can be decomposed into many small mismatches: write a target ratio

r = e^{t}

as a product

r = {(e^{t / k})}^{k}

. Since

J (e^{u}) = cosh (u) - 1 \to 0

as

u \to 0

, choosing k large makes each primitive step low-cost. Coupled with the compositionality results (Theorem 5) and optimal mediation (Theorem 6 and Corollary 3), this shows that, in the continuous ratio model, large ratios can be factored into many small-ratio steps, each incurring small mismatch cost. This is an interpretive program; empirical relevance depends on what ratios are actually realizable in the intended application domain.
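
The arithmetic behind Remark 2 can be made concrete. The sketch below assumes a hypothetical target ratio of 16 and splits it into k equal multiplicative steps; the function name `chained_cost` is introduced here for illustration. The summed cost of the k steps decreases as k grows, roughly like the square of t divided by 2k for large k.

```python
import math

def J(x: float) -> float:
    """Explicit mismatch penalty J(x) = (x + 1/x)/2 - 1 for x > 0."""
    return 0.5 * (x + 1.0 / x) - 1.0

def chained_cost(t: float, k: int) -> float:
    """Summed cost of k equal steps, each with ratio e^(t/k)."""
    return k * J(math.exp(t / k))

t = math.log(16.0)  # hypothetical target ratio r = 16
step_counts = (1, 2, 4, 8, 16, 64)
totals = {k: chained_cost(t, k) for k in step_counts}
```

For this target, one step costs 225/32, two steps cost 9/4 in total, and four steps cost 1 in total.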

8.3. Information-Theoretic Interpretation

Although our framework is stated in intrinsic-cost terms, the canonical mismatch penalty admits a simple log-ratio form. We record the identity as a proposition; any further links to coding/learning are interpretive and not used in the proofs.

Proposition 7 (Log-ratio form of the canonical mismatch cost).

For

x > 0

write

x = e^{t}

. Then the canonical cost satisfies

J (x) = J (e^{t}) = cosh (t) - 1 .

In particular, J is a convex even function of the log-ratio

t = log x

and vanishes exactly at

t = 0

.

Proof.

Substitute

x = e^{t}

into

J (x) = \frac{1}{2} (x + x^{- 1}) - 1

(Definition 2). □

9. Related Work and Positioning

This section places the framework in context, highlighting connections to aboutness in formal semantics, truthmaker-style ideas, and compression-based modeling, and clarifying what is new in the present optimization-based formulation.

Relation to Recognition Geometry.

Recognition Geometry [10] develops an axiomatic recognition-first framework in which observable space is derived from recognition events via an operational quotient construction. While the present paper does not attempt to construct an ambient geometry, it shares the same operational posture: the fundamental primitive is a measurable comparison (here the mismatch cost), and the induced semantic categories are those determined by minimizing or equating that comparison. The comparative recognizer formalism of [10] provides a natural abstract home for the reference costs used here; we make this link explicit in Section 3.

We do not present the mismatch penalty as novel: the axiom package in Definition 1 is a convenient specification whose solutions are classical (Appendix A). The contribution of the paper is instead the explicit optimization semantics (Definition 7) and the structural theorems derived from it (existence, stability geometry, compositionality, and mediation).

Symbol grounding and operational meaning rules.

The symbol grounding problem concerns how tokens acquire meaning without a homunculus [4]. The present work is compatible with grounding motivations, but it is formulated as a mathematical model: the meaning of s is defined as an argmin under an explicit cost. Any interpretation as a cognitive mechanism requires extra hypotheses beyond those stated.

Compression principles.

The general idea that effective representations trade off succinctness and fidelity is classical in information theory (Shannon [11]) and in algorithmic notions of complexity [12]; MDL makes this tradeoff concrete in model selection [13]. Our setup uses a different primitive: a ratio map

ι

into

R_{> 0}

and a fixed mismatch penalty J, with compression enforced by the symbol condition

J_{S} (s) < J_{O} (o)

. Within this model, reference and compositional behavior become theorem-level consequences.

Remark 3 (Coding/learning viewpoint).

In coding theory and learning, one often selects representations by minimizing a tradeoff between description length and distortion (e.g., Shannon [11] and MDL [13]). Our framework instantiates a specific distortion—

J (ι_{S} (s) / ι_{O} (o))

—that is symmetric in under-/over-shooting and naturally expressed in log-scale (Proposition 7). This suggests interpreting meanings as “best matches” under a fixed mismatch penalty, with compression enforced by the symbol condition

J_{S} (s) < J_{O} (o)

.

Subject matter/aboutness and truthmaker-semantics literature.

There is a substantial contemporary literature on “aboutness”/“subject matter” in semantics and logic, including Yablo’s monograph [5] and subsequent discussion and refinements (e.g., Rothschild [7], Fine [8], and Yablo’s reply [9]); see also Hawke’s survey [6]. Related frameworks connect hyperintensional content with truthmakers/truthmaker semantics (e.g., Fine [14]). The present paper does not attempt to adjudicate between these accounts. Rather, it provides an explicit optimization layer which, once a modeling choice of scale maps is made, selects a subject matter/referent by minimizing a mismatch cost.

Novelty signal and conceptual payoff.

Many of the analytic lemmas are consequences of the specific penalty J and convexity. The intended novelty is the resulting checkable decision geometry and compositional calculus for meanings: finite dictionaries induce geometric-mean boundaries and stability margins, product models factorize exactly, and sequential mediation admits an explicit optimizer. These consequences are the main mathematical payoffs of the framework, and they make clear which modeling assumptions (the scale maps and admissibility hypotheses) must be checked in any intended application.

What is mathematically concrete here.

Two examples of explicit structure are: (i) for finite object dictionaries under the canonical mismatch penalty, decision boundaries occur at geometric means (Theorem 8) and meanings are locally stable away from them (Corollary 7); (ii) for sequential mediation, the optimal intermediate ratio is explicit (Theorem 6) and strictly improves over direct reference when the mediator set contains the balance point (Corollary 3).

Interpretation layer.

Section 8 and Section 10 illustrate how the proved statements can be read once a modeling choice for

ι

is fixed. These illustrations are optional: removing them does not affect the correctness of theorems.

10. Discussion

This section clarifies scope and interpretation: which statements are mathematical consequences of the axioms, which are modeling choices, and what additional assumptions would be needed to connect the formalism to empirical systems. It also records limitations and concrete mathematical extensions.

10.1. What Is Proved vs. What Is Modeled

The core mathematical content consists of the definitions and theorems in Sections 2–7. In particular, meaning is defined by optimization (Definition 7); existence is conditional on an attainment hypothesis (Theorem 2); and explicit geometry, stability, compositionality, and mediation statements follow for admissible reference structures and the canonical mismatch penalty (Theorems 5, 6 and 8).

By contrast, any claim that a given real-world domain does admit a scale map

ι

with the required properties, or that agents compute meaning by solving the optimization problem, is an interpretation and is outside the theorem-level scope of this paper.

10.2. Limitations

1. Ratio embedding: Our framework requires configurations to embed into $R_{> 0}$ via a ratio map. Not all semantic domains naturally admit such embeddings.
2. Single penalty: We work with the canonical mismatch penalty J. Alternative penalties may be appropriate in domains where under- and over-shooting are not symmetric.
3. Static analysis: The theory is synchronic. Incorporating learning or time-evolution requires additional structure (e.g., dynamics for $ι$ or for admissible reference classes).

10.3. Open Problems

To make the forward-looking agenda explicit, we record a few concrete open problems aligned with the motivation above.

1. Penalty universality beyond d’Alembert. Identify alternative axiom packages (weaker than Definition 1(3)) that still force a small, classifiable family of penalties, and determine which decision-geometry and compositionality results remain valid.
2. Structure of argmin ties. Characterize, in terms of $ι_{O} (O)$ and J, when the meaning set ${Mean}_{R} (s)$ is multi-valued and how tie sets propagate under products and sequential mediation.
3. Stability under perturbations of $ι$. Quantify how errors in the scale maps affect decision boundaries and compositionality: derive uniform Lipschitz/margin bounds in log-space over admissible reference classes.

10.4. Future Directions

1. Broader admissible reference. Classify reference structures beyond the ratio-induced form (Definition 6) for which analogues of the stability and compositionality theorems remain true.
2. Multi-dimensional ratios. Extend the decision-geometry and boundary descriptions to $ι : C \to {(R_{> 0})}^{d}$ with non-separable penalties, and quantify how coupling between coordinates affects stability margins.
3. Learning the scale map. Given data of successful/unsuccessful references, formulate and analyze estimation procedures for $ι$ (and admissible reference parameters) that preserve the proved invariances.

11. Conclusions

This section summarizes the contributions and limitations of the model and records a few directions for refinement and application within the axioms fixed above.

We developed a mathematical model of reference grounded in cost minimization. The theorem-level contributions are internal to the stated axioms and hypotheses.

We summarize the main points:

1. Reference as compression: Symbols are low-cost encodings of high-cost objects.
2. Canonical mismatch geometry: The canonical penalty $J (x) = \frac{1}{2} (x + x^{- 1}) - 1$ yields explicit decision boundaries and stability regions for finite dictionaries (Theorem 8).
3. Universal backbone: Near-balanced configurations provide a provable backbone window around balance under admissible reference (Theorem 4). Global descriptive reach is obtained by composing many such low-cost primitives (Section 5).
4. Compositionality: Reference structures compose via products and sequences.
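As a hedged illustration of the decision geometry in item 2, the following Python sketch evaluates the canonical penalty on a hypothetical three-object dictionary and checks numerically that the boundary between two adjacent object scales sits at their geometric mean. The scale values and the helper names `J` and `meaning` are illustrative assumptions and are not taken from the paper.

```python
import math

def J(x: float) -> float:
    """Canonical penalty J(x) = (x + 1/x)/2 - 1 for x > 0."""
    return 0.5 * (x + 1.0 / x) - 1.0

def meaning(s_scale: float, object_scales: list[float]) -> list[float]:
    """Return every object scale o minimizing J(s_scale / o); ties are kept."""
    costs = [J(s_scale / o) for o in object_scales]
    best = min(costs)
    return [
        o for o, c in zip(object_scales, costs)
        if math.isclose(c, best, rel_tol=0.0, abs_tol=1e-12)
    ]

# Hypothetical dictionary of object scales (illustrative values only).
scales = [1.0, 4.0, 16.0]
boundary = math.sqrt(scales[0] * scales[1])  # geometric mean of 1 and 4

print(meaning(1.9, scales))       # just below the boundary
print(meaning(2.1, scales))       # just above the boundary
print(meaning(boundary, scales))  # on the boundary: both neighbours tie
```

The tie at the geometric mean follows from inversion symmetry: $J (s / o_{1}) = J (s / o_{2})$ with $o_{1} < s < o_{2}$ forces $s / o_{1} = o_{2} / s$, that is, $s^{2} = o_{1} o_{2}$.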

The framework connects a simple optimization semantics with explicit geometric and compositional structure. Any application to a specific empirical domain requires specifying an appropriate scale map $ι$ and verifying that the admissibility assumptions reasonably match that domain.

Author Contributions

Conceptualization, J.W.; formal analysis, J.W. and A.R.B.; writing—original draft preparation, J.W.; writing—review and editing, J.W. and A.R.B. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

No new data were created or analyzed in this study. Data sharing is not applicable.

Acknowledgments

We briefly acknowledge contributions and feedback that improved the exposition. We thank colleagues and readers for helpful discussions and feedback on earlier drafts.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A. Classical Characterization of the Mismatch Penalty

This appendix records a classical functional-equation characterization showing that the explicit mismatch penalty used in the paper is essentially forced (up to scale) by the stated axioms.

We prove Proposition 2. The underlying functional-equation step is classical; see, for example, Aczél [15] or Kuczma [16]. We include the argument here to keep the paper self-contained and to clarify that the mismatch penalty is not introduced as a new object.

Lemma A1 (Convexity implies continuity).

Let $I \subset \mathbb{R}$ be an open interval and let $g : I \to \mathbb{R}$ be finite-valued and convex. Then $g$ is continuous on $I$. (See, e.g., Rockafellar ([17], Thm. 10.1).)

We apply this standard convexity fact to the mismatch penalty to obtain the regularity needed for the functional-equation classification.

Lemma A2 (Regularity for the log-transformed d’Alembert equation).

Assume $J$ satisfies Definition 1. Define $C : (0, \infty) \to \mathbb{R}$ by $C (x) := 1 + J (x)$ and $f : \mathbb{R} \to \mathbb{R}$ by $f (u) := C (e^{u})$. Then $f$ is continuous, satisfies

$$f (u + v) + f (u - v) = 2 f (u) f (v) \qquad (u, v \in \mathbb{R}),$$

and obeys $f (0) = 1$. In particular, the hypotheses of Lemma A3 (and of the classical theorems of Aczél and Kuczma) apply to $f$.

Proof.

By strict convexity (Definition 1(2)), $J$ is convex and finite-valued on $(0, \infty)$, hence continuous by Lemma A1; therefore, $C = 1 + J$ and $f (u) = C (e^{u})$ are continuous. The multiplicative identity in Definition 1(3) is equivalent to (A1) for $C$, and substituting $x = e^{u}$, $y = e^{v}$ yields the displayed d’Alembert equation for $f$. Finally, $f (0) = C (1) = 1$ by normalization. □
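The substitution step can be written out explicitly. The worked computation below assumes only the identity (A1) for $C = 1 + J$, as stated in the proof of Proposition 2. Its last line, the expanded form in terms of $J$, is derived here from (A1) as a consistency check and is not a quotation of Definition 1(3).

```latex
\begin{align*}
f(u+v) + f(u-v) &= C(e^{u+v}) + C(e^{u-v}) = C(xy) + C(x/y) && (x = e^{u},\ y = e^{v}) \\
                &= 2\,C(x)\,C(y) = 2\,f(u)\,f(v), \\[4pt]
\text{and, with } C = 1 + J:\quad
J(xy) + J(x/y)  &= 2\,J(x)\,J(y) + 2\,J(x) + 2\,J(y).
\end{align*}
```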

Lemma A3 (Continuous solutions of d’Alembert’s equation).

Let $f : \mathbb{R} \to \mathbb{R}$ be continuous and satisfy

$$f (t + s) + f (t - s) = 2 f (t) f (s) \qquad (t, s \in \mathbb{R}),$$

with $f (0) = 1$. Then either $f \equiv 1$, or there exists $a > 0$ such that $f (t) = \cos (a t)$ for all $t \in \mathbb{R}$, or there exists $a > 0$ such that $f (t) = \cosh (a t)$ for all $t \in \mathbb{R}$.

Proof.

This classification is classical; see Aczél ([15], Ch. 2) or Kuczma ([16], Ch. 13). □
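As a numerical sanity check (not a proof), the following Python sketch evaluates the d’Alembert residual $f (t + s) + f (t - s) - 2 f (t) f (s)$ on random points for the three families in Lemma A3 and, for contrast, for a function that is not a solution. The parameter value `a = 1.7`, the sampling range, and the helper names are arbitrary illustrative choices.

```python
import math
import random

def dalembert_residual(f, t: float, s: float) -> float:
    """Residual f(t+s) + f(t-s) - 2 f(t) f(s); zero for exact solutions."""
    return f(t + s) + f(t - s) - 2.0 * f(t) * f(s)

def max_residual(f, samples: int = 1000, span: float = 3.0, seed: int = 0) -> float:
    """Largest absolute residual over random points (t, s) in [-span, span]^2."""
    rng = random.Random(seed)
    return max(
        abs(dalembert_residual(f, rng.uniform(-span, span), rng.uniform(-span, span)))
        for _ in range(samples)
    )

a = 1.7  # arbitrary illustrative parameter
candidates = {
    "constant 1": lambda t: 1.0,
    "cos(a t)": lambda t: math.cos(a * t),
    "cosh(a t)": lambda t: math.cosh(a * t),
    "1 + t^2 (not a solution)": lambda t: 1.0 + t * t,
}
for name, f in candidates.items():
    print(f"{name}: max residual = {max_residual(f):.2e}")
```

The first three residuals should vanish up to floating-point rounding, while for $1 + t^{2}$ the residual equals $- 2 t^{2} s^{2}$ and is visibly non-zero.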

Proof of Proposition 2.

Let $J$ satisfy Definition 1. Define $C (x) := 1 + J (x)$ for $x > 0$. Then (2) is equivalent to the multiplicative identity

$$C (x y) + C (x / y) = 2 C (x) C (y) \qquad (x, y > 0). \tag{A1}$$

Define $f : \mathbb{R} \to \mathbb{R}$ by $f (t) := C (e^{t})$. By Lemma A2, $f$ is continuous. Substituting $x = e^{t}$ and $y = e^{s}$ into (A1) yields d’Alembert’s functional equation

$$f (t + s) + f (t - s) = 2 f (t) f (s) \qquad (t, s \in \mathbb{R}). \tag{A2}$$

Moreover, $f (0) = C (1) = 1$ and $f (t) \geq 1$ for all $t$ since $J \geq 0$.

By Lemma A3, the continuous solutions of (A2) with $f (0) = 1$ are $f \equiv 1$, $f (t) = \cos (a t)$, or $f (t) = \cosh (a t)$ (for some $a > 0$, with the constant solution corresponding to $a = 0$). The constraint $f (t) \geq 1$ rules out the cosine family, since $\cos (a t)$ takes the value $- 1$ at $t = \pi / a$ for every $a > 0$, and strict convexity of $J$ rules out the constant solution $f \equiv 1$ (that is, $J \equiv 0$). Hence, there exists $a > 0$ such that $f (t) = \cosh (a t)$ for all $t$.

Undoing the change of variables gives

$$C (x) = f (\log x) = \cosh (a \log x), \qquad x > 0,$$

and therefore

$$J (x) = C (x) - 1 = \cosh (a \log x) - 1 = \frac{1}{2} (x^{a} + x^{- a}) - 1.$$

Finally, note that

$$\cosh (a \log (ι_{S} / ι_{O})) - 1 = \cosh (\log (ι_{S}^{a} / ι_{O}^{a})) - 1,$$

so replacing $ι_{S}, ι_{O}$ by ${\tilde{ι}}_{S} := ι_{S}^{a}$ and ${\tilde{ι}}_{O} := ι_{O}^{a}$ absorbs the parameter $a$ into the scale maps and produces the normalized choice $a = 1$ at the level of ratio-induced reference costs. □
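The closed form and the absorption of $a$ into the scale maps can be spot-checked numerically. The Python sketch below is a minimal illustration under assumed values: the exponent `a = 2.5`, the scale values `iota_s` and `iota_o`, and all function names are hypothetical choices made for this example only.

```python
import math

def J_a(x: float, a: float) -> float:
    """Penalty family J(x) = cosh(a log x) - 1 for x > 0 and a > 0."""
    return math.cosh(a * math.log(x)) - 1.0

def J_power_form(x: float, a: float) -> float:
    """Equivalent power form (x^a + x^-a)/2 - 1."""
    return 0.5 * (x**a + x ** (-a)) - 1.0

def cost(iota_s: float, iota_o: float, a: float) -> float:
    """Ratio-induced cost J_a(iota_s / iota_o)."""
    return J_a(iota_s / iota_o, a)

a = 2.5                    # illustrative exponent
iota_s, iota_o = 3.0, 0.8  # illustrative positive scale values

original = cost(iota_s, iota_o, a)          # parameter a kept in the penalty
rescaled = cost(iota_s**a, iota_o**a, 1.0)  # a absorbed into the scale maps

print(original, rescaled)
```

The two printed costs should agree up to rounding, which mirrors the statement that the normalized choice $a = 1$ loses no generality once the scale maps are allowed to change.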

References

1. Wigner, E. The unreasonable effectiveness of mathematics in the natural sciences. Commun. Pure Appl. Math. 1960, 13, 1–14.
2. Frege, G. Über Sinn und Bedeutung. Z. Philos. Philos. Krit. 1892, 100, 25–50.
3. Russell, B. On denoting. Mind 1905, 14, 479–493.
4. Harnad, S. The symbol grounding problem. Phys. D Nonlinear Phenom. 1990, 42, 335–346.
5. Yablo, S. Aboutness; Princeton University Press: Princeton, NJ, USA, 2014.
6. Hawke, P. Theories of aboutness. Australas. J. Philos. 2018, 96, 697–723.
7. Rothschild, D. Yablo’s semantic machinery. Philos. Stud. 2017, 174, 787–796.
8. Fine, K. Yablo on subject-matter. Philos. Stud. 2020, 177, 129–171.
9. Yablo, S. Reply to Fine on aboutness. Philos. Stud. 2018, 175, 1495–1512.
10. Washburn, J.; Zlatanović, M.; Allahyarov, E. Recognition Geometry. Axioms 2026, 15, 90.
11. Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J. 1948, 27, 379–423; 623–656.
12. Kolmogorov, A.N. Three approaches to the quantitative definition of information. Probl. Inf. Transm. 1965, 1, 3–11.
13. Rissanen, J. Modeling by shortest data description. Automatica 1978, 14, 465–471.
14. Fine, K. Truth-maker semantics for intuitionistic logic. J. Philos. Log. 2014, 43, 549–577.
15. Aczél, J. Lectures on Functional Equations and Their Applications; Academic Press: Cambridge, MA, USA, 1966.
16. Kuczma, M. An Introduction to the Theory of Functional Equations and Inequalities: Cauchy’s Equation and Jensen’s Inequality, 2nd ed.; Birkhäuser: Basel, Switzerland, 2009.
17. Rockafellar, R.T. Convex Analysis; Princeton University Press: Princeton, NJ, USA, 1970.

2. The Mismatch Functional $J$