# Divergence and Sufficiency for Convex Optimization

## Abstract

## 1. Introduction

## 2. Structure of the State Space

## 3. Optimization

**Definition**

**1.**

**Proposition**

**1.**

- ${D}_{F}\left(s,a\right)\ge 0$ with equality if $a$ is optimal for $s$.
- $s\to {D}_{F}\left(s,a\right)$ is a convex function.
- If $\overline{a}$ is optimal for the state $\overline{s}=\sum {t}_{i}\cdot {s}_{i}$, where $\left({t}_{1},{t}_{2},\dots ,{t}_{\ell}\right)$ is a probability vector, then $$\sum {t}_{i}\cdot {D}_{F}\left({s}_{i},a\right)=\sum {t}_{i}\cdot {D}_{F}\left({s}_{i},\overline{a}\right)+{D}_{F}\left(\overline{s},a\right).$$
- $\sum {t}_{i}\cdot {D}_{F}\left({s}_{i},a\right)$ is minimal if $a$ is optimal for $\overline{s}=\sum {t}_{i}\cdot {s}_{i}$.
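
The identity in the third item can be checked numerically. The sketch below is only an illustration under stated assumptions: actions are represented as payoff vectors, the value function is taken to be $F(s)=\max_a \langle a,s\rangle$ over a finite action set, and the regret is taken to be $D_F(s,a)=F(s)-\langle a,s\rangle$. These definitions, the random data, and the helper names `F`, `regret`, and `mixture_regret` are assumptions made for the example and are not quoted from the text above.

```python
import numpy as np

def F(s, actions):
    """Assumed value of state s: the best payoff <a, s> over the action set."""
    return float(np.max(actions @ s))

def regret(s, a, actions):
    """Assumed regret D_F(s, a) = F(s) - <a, s>."""
    return F(s, actions) - float(a @ s)

rng = np.random.default_rng(0)
actions = rng.normal(size=(6, 3))        # six hypothetical actions (payoff vectors)
states = rng.dirichlet(np.ones(3), 4)    # four states s_i on the probability simplex
t = rng.dirichlet(np.ones(4))            # probability vector (t_1, ..., t_4)

s_bar = t @ states                            # mixed state: sum of t_i * s_i
a_bar = actions[np.argmax(actions @ s_bar)]   # an action that is optimal for s_bar

def mixture_regret(a):
    """Weighted regret: sum of t_i * D_F(s_i, a)."""
    return sum(ti * regret(si, a, actions) for ti, si in zip(t, states))

for a in actions:
    lhs = mixture_regret(a)
    rhs = mixture_regret(a_bar) + regret(s_bar, a, actions)
    assert abs(lhs - rhs) < 1e-12                  # identity from the third item
    assert regret(s_bar, a, actions) >= 0          # non-negativity (first item)
    assert lhs >= mixture_regret(a_bar) - 1e-12    # minimality (fourth item)
```

Under these assumptions the fourth item follows from the third, since the extra term $D_F(\overline{s},a)$ is non-negative and vanishes for $a=\overline{a}$.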

**Definition**

**2.**

**Proposition**

**2.**

**Proposition**

**3.**

- ${D}_{F}\left({s}_{1},{s}_{0}\right)\ge 0$ with equality if there exists an action $a$ that is optimal for both ${s}_{1}$ and ${s}_{0}$.
- ${s}_{1}\to {D}_{F}\left({s}_{1},{s}_{0}\right)$ is a convex function.

- ${D}_{F}\left({s}_{1},{s}_{0}\right)=0$ implies ${s}_{1}={s}_{0}$.
- The function $F$ is strictly convex.

**Example**

**1.**

**Proposition**

**4.**

- The function $F$ is differentiable in the interior of any face of $\mathcal{S}$.
- The regret ${D}_{F}$ is a Bregman divergence.
- The Bregman identity (5) is always satisfied.
- For any probability vector $\left({t}_{1},{t}_{2},\dots ,{t}_{n}\right)$ the sum $\sum {t}_{i}\cdot {D}_{F}\left({s}_{i},s\right)$ is always minimal when $s=\sum {t}_{i}\cdot {s}_{i}$.
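
The last item can be illustrated with a small numerical experiment. The sketch below assumes the standard form of a Bregman divergence, $D_F(s_1,s_0)=F(s_1)-F(s_0)-\langle \nabla F(s_0),s_1-s_0\rangle$, and uses the squared Euclidean norm as $F$ purely for illustration. The choice of $F$, the random states, and the helper names `bregman` and `weighted_divergence` are assumptions of the example rather than part of the proposition.

```python
import numpy as np

def bregman(F, grad_F, s1, s0):
    """Bregman divergence F(s1) - F(s0) - <grad F(s0), s1 - s0>."""
    return F(s1) - F(s0) - float(grad_F(s0) @ (s1 - s0))

# Illustrative strictly convex, differentiable F (an assumption of this sketch).
F = lambda s: float(s @ s)
grad_F = lambda s: 2.0 * s

rng = np.random.default_rng(1)
states = rng.dirichlet(np.ones(3), 5)   # five states s_i on the simplex
t = rng.dirichlet(np.ones(5))           # probability vector (t_1, ..., t_5)
mean_state = t @ states                 # s = sum of t_i * s_i

def weighted_divergence(s):
    """Sum of t_i * D_F(s_i, s) for a candidate state s."""
    return sum(ti * bregman(F, grad_F, si, s) for ti, si in zip(t, states))

best = weighted_divergence(mean_state)

# No randomly drawn candidate should beat the mean state.
candidates = rng.dirichlet(np.ones(3), 1000)
assert all(weighted_divergence(s) >= best - 1e-12 for s in candidates)
```

A random search of this kind cannot prove minimality; it only fails to find a counterexample, which is consistent with the statement.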

## 4. Examples

#### 4.1. Information Theory

#### 4.2. Scoring Rules

**Example**

**2.**

#### 4.3. Statistical Mechanics

#### 4.4. Portfolio Theory

**Example**

**3.**

**Definition**

**3.**

**Example**

**4.**

**Lemma**

**1.**

**Proof.**

**Theorem**

**1.**

**Proof.**

## 5. Sufficiency Conditions

**Theorem**

**2.**

- The function $F$ equals entropy times a negative constant plus an affine function.
- The regret ${D}_{F}$ is proportional to information divergence.
- The regret is monotone.
- The regret satisfies sufficiency.
- The regret is local.
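
The link between the first two items can be checked numerically on a three-point probability simplex. The sketch below assumes that $F(p)=-c\,H(p)+\langle b,p\rangle+d$ with Shannon entropy $H$ and $c>0$, and that the regret is the Bregman divergence of $F$. The constants `c`, `b`, `d` and the function names are hypothetical values chosen for the example. It only illustrates that the first item leads to the second in this setting, not the full set of relations between the five items.

```python
import numpy as np

def entropy(p):
    """Shannon entropy H(p) = -sum p_i log p_i (natural logarithm)."""
    return -float(np.sum(p * np.log(p)))

def kl(p, q):
    """Information divergence D(p || q) = sum p_i log(p_i / q_i)."""
    return float(np.sum(p * np.log(p / q)))

# Hypothetical constants: F is -c * entropy plus the affine function <b, p> + d.
c = 2.5
b = np.array([0.3, -1.0, 4.0])
d = 7.0

def F(p):
    return -c * entropy(p) + float(b @ p) + d

def grad_F(p):
    return c * (np.log(p) + 1.0) + b

def regret(p, q):
    """Bregman divergence of F, used here as the regret D_F(p, q)."""
    return F(p) - F(q) - float(grad_F(q) @ (p - q))

rng = np.random.default_rng(2)
for _ in range(100):
    p, q = rng.dirichlet(np.ones(3), 2)   # two random probability vectors
    # The affine part cancels and the regret is c times the information divergence.
    assert abs(regret(p, q) - c * kl(p, q)) < 1e-9
```

The affine term contributes nothing because a Bregman divergence is unchanged when an affine function is added to $F$, so only the constant $c$ survives as the factor of proportionality.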

#### 5.1. Entropy and Information Divergence

**Definition**

**4.**

**Definition**

**5.**

#### 5.2. Monotonicity

**Proposition**

**5** (The principle of lost opportunities).

**Proof.**

**Corollary**

**1** (Semi-monotonicity).

**Proof.**

**Definition**

**6.**

**Proposition**

**6.**

**Proof.**

**Theorem**

**3.**

**Proof.**

#### 5.3. Sufficiency

**Definition**

**7.**

**Proposition**

**7.**

**Proof.**

**Definition**

**8.**

**Proposition**

**8.**

**Proof.**

#### 5.4. Locality

**Definition**

**9.**

**Example**

**5.**

**Proposition**

**9.**

**Proof.**

**Theorem**

**4.**

**Proof.**

## 6. Applications

#### 6.1. Information Theory

**Theorem**

**5.**

**Proof.**

#### 6.2. Statistics

**Definition**

**10.**

**Theorem**

**6.**

**Proof.**

**Corollary**

**2.**

**Example**

**6.**

**Example**

**7.**

#### 6.3. Statistical Mechanics

#### 6.4. Monotone Regret for Portfolios

**Theorem**

**7.**

**Proof.**

**Example**

**8.**

**Corollary**

**3.**

**Proof.**

## 7. Concluding Remarks

## Acknowledgments

## Conflicts of Interest

