POVMs and the Two Theorems of Naimark and Sz.-Nagy

James D. Malley; Anthony R. Fletcher

doi:10.3390/axioms4030400

Center for Information Technology, NIH, Bethesda, MD 20892, USA

Author to whom correspondence should be addressed.

Axioms 2015, 4(3), 400-411; https://doi.org/10.3390/axioms4030400

Abstract

In 1940 Naimark showed that if a set of quantum observables are positive semi-definite and sum to the identity then, on a larger space, they have a joint resolution as commuting projectors. In 1955 Sz.-Nagy showed that any set of observables could be so resolved, with the resolution respecting all linear sums. Crucially, both resolutions return the correct Born probabilities for the original observables. Here, an alternative proof of the Sz.-Nagy result is given using elementary inner product spaces. A version of the resolution is then shown to respect all products of observables on the base space. Practical and theoretical consequences are indicated. For example, quantum statistical inference problems that involve any algebraic functionals can now be studied using classical statistical methods over commuting observables. The estimation of quantum states is a problem of this type. Further, as theoretical objects, classical and quantum systems are now distinguished by only more or less degrees of freedom.

Keywords:

Naimark dilation; Sz.-Nagy resolution; quantum statistical inference

1. Introduction

For a finite dimensional quantum system in an arbitrary state consider a set of positive semi-definite observables that sum to the identity. These define a POVM, or, positive operator valued measure. Their central importance in quantum theory and practice flows from a beautiful result of Naimark, derived in 1940; see [1,2]. It shows that any POVM can be realized as commuting projectors on a larger space, and such that the projectors return the correct Born probabilities for any state of the system. Also, on this larger space the realizations of the original observables allow measurements over them without disturbing the state, and the realizations do not depend on the state. For these reasons POVMs are often assigned the coveted status of most general type of quantum measurement possible, and are often a starting point for foundational discussions in quantum information theory. Background POVM details are given in [3,4,5,6,7,8,9,10,11,12,13,14,15,16,17] and proofs of Naimark’s result appear in [3,4,5,6,7,8].

Following Naimark’s insight, Sz.-Nagy showed in 1955 how all observables on a finite system could be simultaneously realized as simple linear functions of commuting projectors on a single larger space; see the Appendix in [3]. It is paradoxical that this equally wonderful result of Sz.-Nagy is rarely discussed. In summary, the Sz.-Nagy resolutions act as surrogate classical random variables with respect to all linear functions of observables. As shown below, the resolutions are also surrogate classical random variables with respect to all algebraic functions of observables.

In this project the result of Sz.-Nagy is re-derived and discussed using an elementary inner product space construction. It is motivated by a proof of Naimark’s result as was sketched in [8], but where [8] itself makes no mention of Sz.-Nagy’s result.

There are three other goals in this project.

First, along theoretical lines, it is shown that a version of the resolution respects products of arbitrary—possibly noncommuting—observables on the base space. Under this scheme all the resolutions of the observables commute, exactly as in the Naimark and Sz.-Nagy results. However, this product property is not part of either result. Therefore, as the resolution retains the algebraic structure of the observables on the base space, acquires commutativity for them, and has the correct Born probabilities, the notion of a POVM as representing the most general quantum measurement might be open to further discussion.

Second, along practical lines, as with the Naimark and Sz.-Nagy results, the resolution captures the marginal and conditional probabilities on the base space. This, coupled with the product property, suggests applications of classical statistical methods to quantum statistical inference problems. For example, consider a search for solutions of a functional parametric equation written over sums and products as defined by a quantum statistical inference problem on the base space. Utilizing the product property these functionals can be studied using entirely classical statistical methods over commuting observables in the resolution. Specific statistical inference examples of this process are discussed below.

Third, the details presented here are much more than necessary for the essential arguments. This project is therefore an effort in learning and teaching, an attempt to parse the machinery of the methods, at least for the sake of the authors. It is hoped that doing so makes these two distinctive results, these happy inventions, more transparent and accessible.

2. Naimark’s Theorem and the Sz.-Nagy Extension

Recall that a POVM is a set of positive semi-definite observables that sum to the identity. The original result of Naimark is the scheme by which any POVM, or, generalized resolution of the identity, is realized as a set of commuting projectors on a larger space; see [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17]. Shortly after the derivation by Naimark, an extension was presented by Sz.-Nagy [3] such that:

(i): The original observables need not be positive semi-definite or sum to the identity;
(ii): The resolution of any observable on the base system is given as a linear sum over orthogonal projectors on the larger space;
(iii): The derivation is state independent;
(iv): The resolution returns the correct Born probabilities with respect to the state of the base system.

As necessary background for the extension of Sz.-Nagy, the original Naimark result is this:

Naimark’s Theorem (1940 [1]; 1943 [2]; and see the Appendix in [3]). Suppose given a quantum system in a specific state, and a finite set of observables that are positive semi-definite and sum to the identity. Then the system can be embedded in a larger one such that the observables are simultaneously realized as commuting projectors that return the original Born probabilities with respect to the state.

A matrix-based proof of the result is outlined in [5]; an alternative matrix derivation is given in [6]; a spectral measure proof is presented in [7]; and an inner product space construction sketched in [8]. The discussion in [5] is self-contained, offering a proof of Naimark and a completely worked example for a two-dimensional base space. The proof in [6] offers an approach that enables other matrix examples of the Naimark scheme; see especially [6] (pp. 80–83).

Using the proof of Naimark given in [8], the result of Sz.-Nagy is re-derived. It leads to a proof that all products of observables are respected on a suitable subspace of the Naimark resolution.

The re-derivation is itself a slight extension of the Sz.-Nagy result, and is given by:

Theorem. Suppose given any two sets of observables $\{A_{j} : 1 \leq j \leq m\}$ and $\{B_{i} : 1 \leq i \leq k\}$ acting on a finite dimensional Hilbert space H. Then:

(1): There exists an embedding of H in a larger Hilbert space, $\overset{⌣}{H},$ and extensions of $\{A_{j}\}$ and $\{B_{i}\}$ to operators $\{{\overset{⌣}{A}}_{j}\}$ and $\{{\overset{⌣}{B}}_{i}\}$ acting on $\overset{⌣}{H},$ such that all the ${\overset{⌣}{B}}_{i}$ commute pairwise;
(2): The embedding is an isometry and is trace-preserving in this sense:

$t r [A_{j} B_{i}] = t r [{\overset{⌣}{A}}_{j} {\overset{⌣}{B}}_{i}], \quad 1 \leq j \leq m, \; 1 \leq i \leq k$

(1)

Proof. If $Σ B_{i} \neq I$ then introduce $B_{k + 1} = I - Σ B_{i}.$ If extensions $\{{\overset{⌣}{B}}_{i} : 1 \leq i \leq k + 1\}$ are found such that the ${\overset{⌣}{B}}_{i}$ commute pairwise, then the same is true of the subset $\{{\overset{⌣}{B}}_{i} : 1 \leq i \leq k\}.$ Moreover, given some extensions ${\overset{⌣}{A}}_{j},$ if $t r [{\overset{⌣}{A}}_{j} {\overset{⌣}{B}}_{i}] = t r [A_{j} B_{i}]$ for all $1 \leq j \leq m, 1 \leq i \leq k + 1,$ then the same is true for all the ${\overset{⌣}{A}}_{j}$ and the subset $\{{\overset{⌣}{B}}_{i} : 1 \leq i \leq k\}.$ Hence without loss of generality re-number the $\{B_{i} : 1 \leq i \leq k + 1\}$ as $\{B_{i} : 1 \leq i \leq k\},$ and assume that $Σ B_{i} = I.$

Next, let a be any real number such that $a > \max | λ |,$ for $λ$ equal to any eigenvalue of any $B_{i}.$ It follows that

${\tilde{B}}_{i} = (B_{i} + a I) / (1 + k a), \quad 1 \leq i \leq k$

(2)

is positive definite and $Σ {\tilde{B}}_{i} = I.$ Again, without loss of generality, re-label every ${\tilde{B}}_{i}$ as $B_{i}.$
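The shift-and-rescale step of Equation (2) can be checked numerically. The following is a minimal sketch, assuming NumPy and a hypothetical pair of indefinite 2×2 Hermitian matrices chosen only for illustration. The helper name `rescale_to_povm` and the margin added to a are assumptions, not part of the source.

```python
import numpy as np

def rescale_to_povm(ops):
    """Apply Equation (2): B~_i = (B_i + a I) / (1 + k a), assuming sum(ops) = I."""
    k = len(ops)
    dim = ops[0].shape[0]
    # Any a strictly larger than every |eigenvalue| works; the +1 is an arbitrary margin.
    a = max(np.abs(np.linalg.eigvalsh(B)).max() for B in ops) + 1.0
    return [(B + a * np.eye(dim)) / (1.0 + k * a) for B in ops], a

# Hypothetical example: two Hermitian, indefinite matrices that sum to the identity.
B1 = np.array([[2.0, 1.0], [1.0, -1.0]])
B2 = np.eye(2) - B1
tilde, a = rescale_to_povm([B1, B2])

total = sum(tilde)
min_eig = min(np.linalg.eigvalsh(T).min() for T in tilde)
print("a =", a)
print("sum equals identity:", np.allclose(total, np.eye(2)))
print("smallest eigenvalue is positive:", min_eig > 0)
```

Each rescaled operator is an affine function of the original, so any probability computed for the rescaled operator can be mapped back to the original by undoing the shift and scale.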

Introduce the tensor product space $\overset{⌣}{H} \equiv H \otimes H_{E}$ with $H_{E}$ a Hilbert space of dimension k. Let any positive definite inner product on H be given by $(φ, ζ),$ and let $\{ω_{i}\}$ be any basis for $H_{E}.$ For $φ$ in H define a mapping $φ \to \overset{⌣}{φ}$ into $\overset{⌣}{H}$ by

$\overset{⌣}{φ} = Σ [φ \otimes ω_{i}] = φ \otimes \bar{ω},$

where $\bar{ω} = Σ ω_{i}.$ Since an arbitrary element in $\overset{⌣}{H}$ can be written as $ψ = Σ [φ_{i} \otimes ω_{i}],$ for any selected set of elements $φ_{i}$ in H, introduce the inner product on $\overset{⌣}{H}$ given by

${(ψ, ψ)}_{\overset{⌣}{H}} \equiv (ψ, ψ \overset{⌣}{)} \equiv Σ (φ_{i}, B_{i} φ_{i})$

(3)

that averages over the inner products induced by each $B_{i}$ on H. Because $Σ B_{i} = I$ this embedding of H in $\overset{⌣}{H}$ is such that:

$(\overset{⌣}{φ}, \overset{⌣}{φ} \overset{⌣}{)} = (φ \otimes \bar{ω}, φ \otimes \bar{ω} \overset{⌣}{)} = Σ (φ, B_{i} φ) = (φ, (Σ B_{i}) φ) = (φ, φ)$

(4)

The embedding is therefore an isometry. Define projection operators $E_{i}$ acting on $\overset{⌣}{H}$ by

$E_{i} (Σ [φ_{t} \otimes ω_{t}]) \equiv φ_{i} \otimes ω_{i}$

(5)

By inspection these are orthogonal and $Σ E_{i} = \overset{⌣}{I}.$ Most importantly,

$(\overset{⌣}{φ}, E_{i} \overset{⌣}{φ} \overset{⌣}{)} = (\overset{⌣}{φ}, E_{i} (φ \otimes \bar{ω}) \overset{⌣}{)} = (\overset{⌣}{φ}, φ \otimes ω_{i} \overset{⌣}{)} = (φ, B_{i} φ)$

(6)

so that each $E_{i}$ is an inner product conserving extension of each $B_{i}.$

Next, let $A_{j} = (φ_{j}) {(φ_{j})}^{*}$ be a one-dimensional observable acting on H, and define ${\overset{⌣}{A}}_{j}$ by

${\overset{⌣}{A}}_{j} = (φ_{j} \otimes \bar{ω}) {(φ_{j} \otimes \bar{ω})}^{*} = ({\overset{⌣}{φ}}_{j}) {({\overset{⌣}{φ}}_{j})}^{*}$

(7)

where the conjugate transpose is with respect to the product in $\overset{⌣}{H}$ as at (4). Finally, check that for every $E_{i}$:

$t r [{\overset{⌣}{A}}_{j} E_{i}] = t r [({\overset{⌣}{φ}}_{j}) {({\overset{⌣}{φ}}_{j})}^{*} E_{i}] = ({\overset{⌣}{φ}}_{j}, E_{i} {\overset{⌣}{φ}}_{j} \overset{⌣}{)} = (φ_{j}, B_{i} φ_{j}) = t r [A_{j} B_{i}]$

(8)

As any operator $A_{j}$ can be written as a sum over projectors, by linearity the proof is complete.
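The construction in the proof can be traced numerically. The sketch below assumes NumPy, a hypothetical randomly generated positive definite POVM on a two-dimensional space, and the standard basis for $H_{E}.$ In these coordinates the inner product (3) is represented by a block Gram matrix, here called `M`, and the adjoint in Equation (7) is taken with respect to `M`. The variable and function names are illustrative and not from the source.

```python
import numpy as np

rng = np.random.default_rng(0)
n, k = 2, 3  # dim H and number of POVM elements (illustrative choices)

# Build a hypothetical positive definite POVM {B_i} on C^n with sum = I.
raw = []
for _ in range(k):
    g = rng.normal(size=(n, n)) + 1j * rng.normal(size=(n, n))
    raw.append(g @ g.conj().T + np.eye(n))        # positive definite
w, v = np.linalg.eigh(sum(raw))
S_inv_sqrt = v @ np.diag(w ** -0.5) @ v.conj().T
B = [S_inv_sqrt @ R @ S_inv_sqrt for R in raw]

# Coordinates on H (x) H_E: omega_i is the i-th standard basis vector of C^k.
omega = np.eye(k)
omega_bar = omega.sum(axis=0)                      # omega_bar = sum_i omega_i
P = [np.outer(omega[i], omega[i]) for i in range(k)]

# Gram matrix of the inner product (3): (psi, psi) = sum_i (phi_i, B_i phi_i).
M = sum(np.kron(B[i], P[i]) for i in range(k))
E = [np.kron(np.eye(n), P[i]) for i in range(k)]   # projectors of Equation (5)

def embed(phi):
    """Map phi to phi (x) omega_bar."""
    return np.kron(phi, omega_bar)

def inner(x, y):
    """Inner product on the larger space, taken through the Gram matrix M."""
    return x.conj() @ M @ y

phi = rng.normal(size=n) + 1j * rng.normal(size=n)
phi /= np.linalg.norm(phi)
phi_up = embed(phi)

# Equation (4): the embedding preserves the norm.
isometry_gap = abs(inner(phi_up, phi_up) - phi.conj() @ phi)
# Equation (6): E_i reproduces the quadratic form of B_i.
born_gap = max(abs(inner(phi_up, E[i] @ phi_up) - phi.conj() @ B[i] @ phi)
               for i in range(k))
# Equations (7) and (8): A-extension uses the adjoint with respect to M.
A = np.outer(phi, phi.conj())
A_up = np.outer(phi_up, phi_up.conj()) @ M
trace_gap = max(abs(np.trace(A_up @ E[i]) - np.trace(A @ B[i])) for i in range(k))
print(isometry_gap, born_gap, trace_gap)
```

All three gaps should vanish up to floating point error, which mirrors Equations (4), (6) and (8) for this one example and is not a substitute for the proof.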

The notion of Naimark space is next introduced, followed by the definition of Naimark model.

3. Naimark Spaces and Naimark Models

Acting on the extension space $\overset{⌣}{H},$ let N be defined as the family of observables that is spanned by the commutative realizations $\{{\overset{⌣}{B}}_{i}\}$ of the observables $\{B_{i}\}.$ Call the space N the Naimark space and write $N = N (\{{\overset{⌣}{B}}_{i}\})$ for this set of observables. Several comments are in order.

First, letting m = 1 and $\{A_{j}\} = \{ρ\}$ for density $ρ$ on the base space, and assuming the set $\{B_{i}\}$ consists of positive definite observables that sum to the identity, the Theorem now implies the original Naimark result. Note, importantly, the proof of the Theorem does not depend on the $\{A_{j}\}$ and thus also does not depend on the state of the base system. The flexibility in the Theorem for an arbitrary finite family of observables $\{A_{j}\}$ for $1 \leq j \leq m,$ with $1 < m,$ allows for problems that have many possible densities on the base system under consideration. An example of the need for such a family is the problem of quantum state discrimination over multiple, possibly unknown, states.

Second, suppose the $\{B_{i}\}$ are a spanning set of operators on H, but are not necessarily linearly independent, or positive semi-definite, or commuting. Since every operator on H now has a linear realization over a commutative space of projectors, this implies that there always exists a single classical joint distribution function over these commuting extensions of the original observables. This joint probability distribution is exactly that given by the standard von Neumann spectral resolution result for commuting observables, and the Born marginal probabilities for the realizations agree with those in the base space.

Third, the two sets of observables, $\{A_{j}\}$ and $\{B_{i}\},$ in the Theorem do not extend in the same way. That is, arbitrary observables $B_{i}$ extend to linear sums over the projectors $E_{i},$ while observables given by states $ρ = (φ) {(φ)}^{*}$ extend to operators $(φ) {(φ)}^{*} \otimes \bar{P} = ρ \otimes \bar{P},$ for $\bar{P}$ defined as the projector on the one dimensional space spanned by $\bar{ω},$ and by linearity otherwise. By inspection, the conclusion of the Theorem also obtains by using the alternative embedding that extends states $ρ = (φ) {(φ)}^{*}$ to operators of the form $(φ) {(φ)}^{*} \otimes I = ρ \otimes I.$ Thus, a Naimark space will not necessarily render states as commutative in the larger space if they are given as observables in the set $\{A_{j}\},$ but will do so if they appear as observables in the set $\{B_{i}\}.$

Fourth, a given Naimark space, $N = N (\{{\overset{⌣}{B}}_{i}\}),$ is not uniquely specified by the base operators $\{B_{i}\}$ acting on H, so that $N = N (\{{\overset{⌣}{B}}_{i}\}) = N (\{{\overset{⌣}{C}}_{i *}\})$ is possible with $\{B_{i}\} \neq \{C_{i *}\}.$

Additional technical facts and distinctions are these:

(a): Consider the subspace in $\overset{⌣}{H}$ given by $H \otimes \{ω_{i}\},$ where $\{ω_{i}\}$ is the space in $H_{E}$ spanned by $ω_{i}.$ This space is a copy in $\overset{⌣}{H}$ of the base space H. Assume now that the $ω_{i}$ form an orthogonal basis for $H_{E}.$ Then the projector $P_{i}$ from $H_{E}$ onto the space $\{ω_{i}\}$ can be written as $P_{i} = (ω_{i}) {(ω_{i})}^{*},$ where the transpose here is with respect to the inner product on $H_{E}.$ By inspection it follows that: $E_{i} = I \otimes P_{i};$
(b): Note that the inner product on $\overset{⌣}{H}$ is not the same as the multiplication over the separate inner products H and $H_{E} .$ That is, in general,

$(φ \otimes ω, ζ \otimes ν \overset{⌣}{)} = {(φ \otimes ω, ζ \otimes ν)}_{\overset{⌣}{H}} \neq {(φ, ζ)}_{H} {(ω, ν)}_{E}$

(9)

for $φ, ζ$ in H and $ω, ν$ in $H_{E};$
(c): As further indication of the distinction just noted in (b), check that the projectors $E_{i} = I \otimes P_{i}$ as defined in the Theorem can also be written as:

$E_{i} = {Σ_{i}} \otimes P_{i}$

(10)

where

$Σ_{i} = Σ (φ_{i (j)}) (φ_{i (j)} \overset{⌣}{)}$ and $P_{i} = (ω_{i}) {(ω_{i})}^{*}$

(11)

and where $\{φ_{i (j)}\},$ for $1 \leq j \leq \dim H,$ is an orthogonal basis of H with respect to the inner product induced by the specific observable $B_{i},$ so that:

$E_{i} = ((Σ φ_{i (j)}) \otimes ω_{i}) ((Σ φ_{i (j)}) \otimes ω_{i} \overset{⌣}{)}$

(12)
(d): For any operator X on H and any U in the Naimark space, that is for $U = Σ α_{i} E_{i},$ with complex coefficients $α_{i},$ inspection shows that $(X \otimes I_{E}) U = U (X \otimes I_{E}),$ since $Σ P_{i} = I_{E},$ for $I_{E}$ the identity on $H_{E},$ and $E_{i} = I_{H} \otimes P_{i}$ for $I_{H}$ the identity on H.

4. Naimark Models

It is advantageous to formalize the embedding of a copy of the base space in the resolving space. As above, introduce a basis for $\overset{⌣}{H}$ that begins with an isomorphic copy of the base system defined as $H \otimes \{\bar{ω}\},$ where $\{\bar{ω}\}$ denotes the one dimensional space spanned by the vector $\bar{ω}$ in $H_{E}.$ Continuing, for any operator U on $\overset{⌣}{H},$ introduce the observable $U_{H}$ acting on $H \otimes \{\bar{ω}\},$ where:

${[U_{H}]}_{i j} = {[U]}_{i j}, \quad 1 \leq i, j \leq n = \dim H$

(13)

Call $U_{H}$ the Naimark component of U, and call the set of all such observables a Naimark model. The model contains, for example, the projections onto $H \otimes \{\bar{ω}\}$ of all the observables in N.

Some clarifications are these:

(a): The term Naimark model is only a convenient name introduced here for operators in the Naimark space that fix the subspace $H \otimes \{\bar{ω}\},$ and Naimark space is itself an introduced term. Both directly follow from the combined results of Naimark and Sz.-Nagy. Since $\dim H = \dim (H \otimes \{\bar{ω}\}),$ the space on which operators in the Naimark model act has the same dimension as the base space H. On the other hand the space $H \otimes \{\bar{ω}\},$ viewed as a subspace of $\overset{⌣}{H},$ is equipped with the inner product constructed in the Theorem, and this is distinct from whatever product is defined on H; see (b) in Section 3 above for details;
(b): For any observable X acting on H, and any U in the Naimark space:

$t r [(X \otimes I) U] = t r [(X \otimes \bar{P}) U] = t r [X U_{H}]$

(14)

It follows that every observable in the Naimark model correctly returns the Born probability for the associated observable $U_{H}$ on the base space. This is the same probability as that given for the observable on H, for which observable U is the Naimark resolution acting on $\overset{⌣}{H} .$ In simpler terms, Naimark models are probability preserving;
(c): For any pair of observables on the base system, in state D, the quantum conditional probabilities are given by

$\begin{array}{l} \Pr [A | B] = \Pr_{D} [A | B] = t r [B D B A] / t r [D B] \\ \Pr [B | A] = \Pr_{D} [B | A] = t r [A D A B] / t r [D A] \end{array}$

(15)

Consequently

$\Pr [A | B] = t r [D C_{A | B}], \Pr [B | A] = t r [D C_{B | A}]$

(16)

for the two observables on the base system as defined by

$C_{A | B} = B A B / t r [D B], C_{B | A} = A B A / t r [D A]$

(17)

Since any observables in the base system are expressible as simple linear sums over any spanning set, it follows that their resolutions in the Naimark model are also thus expressible, in terms of the commuting projectors in the Naimark model. In particular the pair of observables in Equation (17) have linear resolutions in the model. On the other hand, from classical probability any joint distribution on a pair of random variables is specified by the two marginal probabilities and the two conditional probabilities. Since elements of the Naimark model correctly return all marginal Born probabilities for observables on the base system, from Equations (16) and (17) it now follows that the model also correctly returns the correlation structure for any pair of observables on the base system;
(d): The Naimark component is defined for any operator U on $\overset{⌣}{H}$ and not only for those in a Naimark space, N;
(e): For any U acting on $\overset{⌣}{H},$ $(U_{H} \overset{⌣}{)}$ is always in N; and if U is in N then $(U_{H} \overset{⌣}{)} = U;$
(f): For any B acting on H: ${(\overset{⌣}{B})}_{H} = B .$
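The identity behind Equations (15) to (17) in item (c) is the cyclic property of the trace, $t r [B D B A] = t r [D B A B].$ The sketch below checks it numerically, assuming NumPy and taking A and B to be randomly generated projectors, which is an assumption made here so that the ratios are genuine probabilities. The helper names are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(2)
n = 3  # illustrative dimension of the base space

def random_density(dim):
    """Hypothetical helper: a random full-rank density matrix."""
    g = rng.normal(size=(dim, dim)) + 1j * rng.normal(size=(dim, dim))
    rho = g @ g.conj().T
    return rho / np.trace(rho).real

def random_projector(dim, rank):
    """Hypothetical helper: projector onto a random subspace of the given rank."""
    g = rng.normal(size=(dim, rank)) + 1j * rng.normal(size=(dim, rank))
    q, _ = np.linalg.qr(g)          # orthonormal columns
    return q @ q.conj().T

D = random_density(n)
A = random_projector(n, 1)
B = random_projector(n, 2)

p_B = np.trace(D @ B).real
pr_A_given_B = np.trace(B @ D @ B @ A).real / p_B   # Equation (15)
C_A_given_B = B @ A @ B / p_B                       # Equation (17)
pr_via_C = np.trace(D @ C_A_given_B).real           # Equation (16)
print(pr_A_given_B, pr_via_C)
```

The two printed values agree, so the conditional probability is the Born expectation of a single observable on the base space, which is what allows it to be resolved linearly in the model.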

The product property of Naimark models is next presented.

5. Products and Naimark Models

Given the number, k, of observables in the set $\{B_{i}\},$ and the dimension, m, of the base space H, it is convenient to expand the size of H. This is most simply done by tensoring H over k copies of H. Then the dimension of the base space becomes $t = m k,$ and the dimension of $\overset{⌣}{H}$ becomes $n = m k^{2}.$ Further, the observables $\{B_{i}\}$ can be trivially extended to observables of the form $\{B_{i} \otimes I\}$ on the expanded base space. But not so trivial is this: the extension of an observable to an operator on the expanded base space is not the same as its resolution in the Naimark space, or in the Naimark model.

One possible objective for an expansion of the base space is proposed in [8], if the dimension of the base space is not a multiple of k. Doing so as in [8] yields a simpler, block matrix representation of the projectors $E_{i}.$ However, the goal of the expansion used here is different, and the utility of this particular increase in the size of H is given by the following facts, later applied in Section 6.

Begin by letting G be the projector of $\overset{⌣}{H}$ onto the base space, H, where that space now has adjusted dimension $t = m k.$ Then, the following several conclusions obtain:

(i): By definition ${(G)}_{H} = I_{H},$ the identity on H. Recall that $I_{H} = Σ B_{i},$ for ${B_{i}}$ as in the Theorem. Also $(I_{H} \overset{⌣}{)} = (Σ B_{i} \overset{⌣}{)} = Σ E_{i} = I_{\overset{⌣}{H}} = \overset{⌣}{I} .$ It follows that $(G_{H} \overset{⌣}{)} = \overset{⌣}{I} \neq G .$
(ii): Consider any two observables $C_{1} = A_{1} \otimes B_{1}, C_{2} = A_{2} \otimes B_{2},$ acting on $\overset{⌣}{H},$ such that $A_{1}, A_{2}$ act on $H \otimes {\bar{ω}},$ and $B_{1}, B_{2}$ act on $H_{E} .$ If $A_{1} A_{2} = A_{2} A_{1},$ and $B_{1} B_{2} = B_{2} B_{1}$ then trivially: $C_{1} C_{2} = C_{2} C_{1} .$
Next, under the base space dimension adjustment just described it follows that the projector G can be written as $G = Z \otimes I_{k},$ for a matrix Z, of order $m k,$ having the identity matrix $I_{m}$ in the upper left corner, the zero matrix ${(0)}_{s}$ in the lower right corner, with $s = m (k - 1),$ and zeros elsewhere. With the adjusted dimension, the projector G is an operator on $\overset{⌣}{H},$ with Z in the Naimark model, that is, an operator acting on $H \otimes \{\bar{ω}\},$ a space of dimension mk, and with $I_{k}$ acting on $H_{E},$ a space of dimension k;
(iii): Using (d) in Section 3, and (ii) just given, and for any U in the Naimark space N: $G U = U G.$ Note that G is not necessarily in N, but upon using the set $\{B_{i}\} = \{G, N\}$ and then applying the Theorem, the resolutions of G and all elements of N would then commute;
(iv): From (iii) just given, and for any U, V in the Naimark space N: ${(U V)}_{H} = U_{H} V_{H} .$ That is

${(U V)}_{H} = G (U V) G = (G U G) (G V G) = U_{H} V_{H}$

(18)

This is the product property of the Naimark model mentioned in the Introduction. It immediately extends to any linear function over finite products of observables in the model. Moreover, for U, V in N it is always true that $U V = V U,$ and this implies: $U_{H} V_{H} = V_{H} U_{H};$
(v): The extensions of the base space operators from ${B_{i}}$ to ${B_{i} \otimes I_{k m}}$ work consistently with respect to the expansion of the base space. That is, a sum over operators of the form $B_{i} \otimes I_{k m}$ has the same form, since $B = Σ B_{i}$ implies: $Σ (B_{i} \otimes I_{k m}) = (Σ B_{i}) \otimes I_{k m} = B \otimes I_{k m};$
(vi): Every observable $U_{H}$ in the Naimark model is by definition the Naimark component of the observable U in the associated Naimark space N. However, if an observable already fixes the base space it will not necessarily have a resolution in the Naimark space whose Naimark component is the original observable. Yet, the Naimark component will still return the correct Born probability for the original observable;
(vii): Given observable $U_{0}$ acting on the base space H, it follows that ${\overset{⌣}{U}}_{0}$ is in the associated Naimark space N, and ${({\overset{⌣}{U}}_{0})}_{H}$ is in the associated Naimark model;
(viii): Finally, every observable in the Naimark model correctly returns the Born probabilities:

$t r [ρ {({\overset{⌣}{U}}_{0})}_{H}] = t r [(ρ \otimes I) {\overset{⌣}{U}}_{0}]$

(19)
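The product property of Equation (18) rests on two facts from the list above: G is a projector, and G commutes with every element of the Naimark space. The following sketch, assuming NumPy, the coordinate forms $G = Z \otimes I_{k}$ and $E_{i} = I \otimes P_{i},$ and small illustrative values of m and k, checks the identity for two randomly weighted elements U and V. The helper `naimark_element` is a hypothetical name.

```python
import numpy as np

rng = np.random.default_rng(3)
m, k = 2, 2          # illustrative sizes
t = m * k            # adjusted dimension of the base space

# Z has I_m in the upper left corner and zeros elsewhere; G = Z (x) I_k.
Z = np.zeros((t, t))
Z[:m, :m] = np.eye(m)
G = np.kron(Z, np.eye(k))

# Projectors E_i = I (x) P_i with P_i the coordinate projectors on H_E.
P = [np.diag(np.eye(k)[i]) for i in range(k)]
E = [np.kron(np.eye(t), P_i) for P_i in P]

def naimark_element(coeffs):
    """Linear combination sum_i c_i E_i, an element of the Naimark space."""
    return sum(c * E_i for c, E_i in zip(coeffs, E))

U = naimark_element(rng.normal(size=k))
V = naimark_element(rng.normal(size=k))

lhs = G @ (U @ V) @ G                 # (UV)_H as written in Equation (18)
rhs = (G @ U @ G) @ (G @ V @ G)       # U_H V_H
print(np.allclose(lhs, rhs), np.allclose(G @ U, U @ G))
```

Because G commutes with U and V and satisfies $G^{2} = G,$ the inner factors of G in the right-hand side collapse, which is all the identity requires.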

6. Applications of Naimark Models

Consider any outcomes for any finite set of observables on the base space, $X = \{X_{i}\},$ and a multivariate polynomial functional equation of the form $f (X, β) = 0.$ Using the outcomes considered as a data vector X, an estimated value of the parameter $β$ is required, such that it solves the equation to sufficient accuracy. By definition any such equation is a linear sum over products of the observables. Hence by extending the equation on the base space, the observables in the data vector are simply replaced by their resolutions in the Naimark model. And then using the product property of the extension, as above, it follows that:

$0 = f (X, β \overset{⌣}{)} = f (\overset{⌣}{X}, β)$

(20)

In the Naimark model the extended observables $\overset{⌣}{X}$ act as classical random variables, and have linear representations over a set of commuting projectors. Therefore, if a statistical solution with sufficient accuracy can be found in the Naimark model, then the product property, result (iv) in Section 5 above, can be applied so that exactly, or at least approximately:

$0 = f {(\overset{⌣}{X}, β)}_{H} = f ({\overset{⌣}{X}}_{H}, β) = f (X, β)$

(21)

Significant to note here is that any polynomial equation over the data in the Naimark model, obtained by just adding degrees of freedom as in result (v) of Section 5 above, reduces to a solution of the equation with the same form in the base space. That is:

$0 = f (X \otimes I, β)$ if and only if $0 = f (X, β) \otimes I$ if and only if $0 = f (X, β)$

(22)

In still other words, the Naimark model respects all algebraic constraints posed by functional equations on the base space, and the observables resolved in the Naimark model are all commuting.
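Equation (22) can be illustrated with a small noncommuting example. The sketch below assumes NumPy and a hypothetical polynomial functional $f (X, β) = X_{1} X_{2} - β X_{2} X_{1}$ over the Pauli matrices $σ_{x}$ and $σ_{z},$ for which $β = -1$ is a solution because the two matrices anticommute. It covers only the trivial extension $X \otimes I,$ not the Naimark resolution itself.

```python
import numpy as np

def f(X1, X2, beta):
    """Hypothetical polynomial functional f(X, beta) = X1 X2 - beta X2 X1."""
    return X1 @ X2 - beta * (X2 @ X1)

sigma_x = np.array([[0.0, 1.0], [1.0, 0.0]])
sigma_z = np.array([[1.0, 0.0], [0.0, -1.0]])
I_extra = np.eye(3)   # added degrees of freedom (size chosen arbitrarily)

def extend(X):
    """Trivial extension X -> X (x) I."""
    return np.kron(X, I_extra)

results = {}
for beta in (-1.0, 0.5):
    base = f(sigma_x, sigma_z, beta)
    lifted = f(extend(sigma_x), extend(sigma_z), beta)
    results[beta] = {
        "same_form": np.allclose(lifted, np.kron(base, I_extra)),
        "base_is_zero": np.allclose(base, 0),
        "lifted_is_zero": np.allclose(lifted, 0),
    }
print(results)
```

For both values of β the lifted functional equals $f (X, β) \otimes I,$ so it vanishes exactly when the base functional does.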

As one example of this process consider using the Expectation-Maximization (EM) algorithm, a scheme from classical statistics, for study of neutron absorption tomography, as given in [12] (Section 10.3). An alternative path here is introduction of the Naimark model, followed by application of the classical EM algorithm to the set of classical random variables that represent the commuting observables in the model. For more detail on the classical EM algorithm itself see, for example ([18] Chapter 4).

As another example of a quantum statistical problem that starts from an equation of the form $f (X, β) = 0$ on the base space, consider quantum state discrimination, or, state estimation. A Bayesian solution to this problem was derived by Holevo, and by Yuen et al. For a detailed discussion see [5,12].

Important to notice here is that a Naimark model also respects any required algebraic side conditions necessary for a quantum statistical problem that involve the original—or possibly additional—observables in the data vector. And this is exactly the case in the solution for the Bayesian state estimation problem just mentioned, where a certain semi-positivity side condition for these observables on the larger Naimark model is required; see [5,12]. Using the commutativity of the observables realized over the Naimark model, and since the observables in the Naimark space project onto the Naimark model, the resolving observables in the Naimark model must also respect the semi-positivity condition.

A more detailed resolution of this problem of Bayesian quantum state estimation using a Naimark model will be given elsewhere; see [19].

Continuing, two other classical and widely deployed statistical estimation techniques are generalized linear models and generalized estimating equations, about which see [20]. In these schemes the functional equation $f (X, β) = 0$ is replaced by:

$f (Y, X, β) = 0$

(23)

where Y is an outcome for which a good approximation or prediction is sought, using observations X as mediated by a parameter $β$ that is to be estimated in (23). Both these methods are now, in principle, applicable to quantum estimation problems and such that a classical Naimark model solution over classical random variables is then obtained.

Another application of the Theorem is this. Consider two given POVMs, $\{P_{i}\}$ and $\{Q_{j}\},$ where the number of observables in each set need not be the same. In [11] the following problem is stated: is it possible to identify a single POVM, $\{R_{i}\},$ such that its elements contain all members of the original two POVMs?

If the purpose of any single POVM is to describe a set of observables having a resolution as commuting observables on a larger system, then the Theorem already provides that. In this case the resolution over the union of the observables in the two POVMs on a larger system jointly resolves all elements of the two POVMs as commuting observables.

On the other hand, if the task is to identify a single POVM on the base space that contains both sets of observables, then the following can be applied. As each POVM separately sums to the identity, the sum over all the observables in both POVMs must equal twice the identity. Now multiply all observables by ½. Then the full list of observables sums to the identity, and the Theorem applies. The result is a resolution that, apart from the factor ½, jointly returns the Born probabilities and is composed of commuting observables on the larger space. The central point here is that no loss in generality of quantum measurement is incurred in this scheme.
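The halving step can be illustrated on a qubit. The sketch below assumes NumPy and two hypothetical projective POVMs, one in the Z basis and one in the X basis, together with an arbitrarily chosen density matrix. It checks that the merged list is again a POVM and that its Born probabilities are the original ones scaled by ½.

```python
import numpy as np

# Hypothetical qubit POVMs: P measures in the Z basis, Q in the X basis.
P = [np.diag([1.0, 0.0]), np.diag([0.0, 1.0])]
plus = np.array([1.0, 1.0]) / np.sqrt(2)
minus = np.array([1.0, -1.0]) / np.sqrt(2)
Q = [np.outer(plus, plus), np.outer(minus, minus)]

# Merge the two lists and multiply every element by 1/2.
R = [0.5 * X for X in P + Q]

# Arbitrary illustrative state (unit trace, positive definite).
rho = np.array([[0.75, 0.25], [0.25, 0.25]])

probs_P = np.array([np.trace(rho @ X).real for X in P])
probs_Q = np.array([np.trace(rho @ X).real for X in Q])
probs_R = np.array([np.trace(rho @ X).real for X in R])

sums_to_identity = np.allclose(sum(R), np.eye(2))
all_psd = all(np.linalg.eigvalsh(X).min() >= -1e-12 for X in R)
print(sums_to_identity, all_psd, probs_R)
```

Doubling the first two entries of `probs_R` recovers `probs_P`, and doubling the last two recovers `probs_Q`, so neither measurement loses information under the merge.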

Finally, here is a method that uses only the original version of Naimark ([1,2]), but now applied twice. Let the first POVM be labeled P and the second Q. Using the original Naimark result, embed the original space in a larger one, such that the extensions of the elements of P commute among themselves and sum to the identity on the larger space. Further, the elements of Q, as with all other observables apart from those in P, will extend in the usual way to observables that are tensor products of the elements of Q with the identity on the larger space.

At this point inspection of the Naimark result shows that all elements of the extensions of P and Q must commute. That is the extensions of P are linear sums over projectors on the larger space and these commute with the extensions of Q that are simple tensors by the identity; see the Third note in Section 3.

However, at this stage, elements of the extensions of Q need not be commuting among themselves. Thus, for the next step, apply the classical Naimark result to the observables in the extension of Q on the larger space. The extensions of the original observables in P, on the larger space, will extend in the usual way for tensor products, to observables on the still larger space that also commute, since they did so on the larger space. Finally, note that the doubly extended forms of P and Q must jointly commute on the double extension, and the two together sum to the identity.

In effect, this method uses the Theorem above, where elements of the POVMs Q and P are respectively the subsets $\{A_{j}\}$ and $\{B_{i}\}.$

7. A Theoretical Perspective for Naimark Spaces

The construction of the resolving Naimark space shows that it, itself, lives in the larger space where—also by construction—there are observables that are not necessarily commuting. Hence every such Naimark resolution is nested in a space that again reveals quantum and not just classical behavior. In turn, the family of all observables in the resolving space is in principle realizable in a still larger space, for which the resolved observables would all commute and act classically. Yet this entails introduction of still other noncommuting observables, thus continuing the sequence.

The scheme presented here using Naimark spaces and models suggests that for finite systems there may not be a sharply defined or even usefully declared boundary between quantum and classical phenomena. If a system is seen as classical in one Naimark representation then there are necessarily other observables in the embedding space that are noncommuting and show quantum behavior. In other words, classical and quantum phenomena are nested within each other.

On this view, quantum systems differ from classical ones simply by having different degrees of freedom. Further, the introduction of Naimark models shows something else. The Naimark model has the same dimension as the base space, and this is strictly less than the dimension of the supervening Naimark space. Also, the observables on the base space, as trivially extended to those on the Naimark model, are now all commuting. Finally, all algebraic functionals or constraints given over observables on the base space remain valid in the model.

In words, algebraic functionality on the base space is invariant with respect to its representation as a Naimark model, and the Naimark model observables are commuting and in this sense act classically.

8. Conclusions

The introduction of Naimark spaces leads to a canonical procedure by which any finite dimensional quantum system can be rendered as a classical system, such that it appears as a subspace of a larger quantum system, and such that all the original observables now have commutative versions, and possess a single classical joint distribution. Notably, under the extension it is not required that the original observables on the base space be projectors, or positive semi-definite, or sum to the identity, or be commuting. This much is all valid using the original results of Naimark and Sz.-Nagy.

As shown above, the resolving Naimark model also respects algebraic constraints over any observables on the base space, and this property is not part of the original Naimark result or the Sz.-Nagy extension. Thus, the family of jointly measurable observables in a Naimark model can resolve some forms of quantum statistical estimation and detection problems, but now using entirely classical procedures over commuting observables.

Naturally, any experimental implementation of the results of Naimark and Sz.-Nagy, and of solutions found in a Naimark model, may not be feasible. Hence for some problems, the resolving classical systems and the derived classical inference solutions might remain theoretical constructions. Still, the increasing use of so-called ancilla and ancillary systems suggests that the engineering task of invoking Naimark models is increasingly cost efficient.

Finally, the schemes presented here suggest that the boundary between classical and quantum systems is more porous and more fluid than suspected, as the systems are nested within each other and distinguished only by counting degrees of freedom. In still other words, quantum behavior arises by restriction of classical behavior, classical systems lurk within quantum ones, and only an engineering task separates them.

Acknowledgments

This work was supported by the Intramural Research Program of the National Institutes of Health.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Naimark, M.A. Spectral functions of a symmetric operator. Izv. Akad. Nauk SSSR Ser. Mat. 1940, 4, 277–318.
Naimark, M.A. On a representation of additive operator set functions. C. R. Acad. Sci. URSS 1943, 41, 359–361.
Riesz, F.; Sz.-Nagy, B. Functional Analysis; Dover Publications: New York, NY, USA, 1990; Originally published by Frederick Ungar Publications: New York, NY, USA, 1955.
Holevo, A.S. Statistical problems in quantum physics. In Second Japan-USSR Symposium on Probability Theory; Maruyama, G., Prokhorov, Y.V., Eds.; Springer-Verlag: Berlin, Germany, 1973.
Helstrom, C.W. Quantum Detection and Estimation Theory; Academic Press: Waltham, MA, USA, 1976.
Peres, A. Quantum Theory: Concepts and Methods; Kluwer Academic Publishers: New York, NY, USA, 1995.
Akhiezer, N.I.; Glazman, I.M. Theory of Linear Operators in Hilbert Space; Dover Publications: New York, NY, USA, 1993.
Holevo, A.S. Probabilistic and Statistical Aspects of Quantum Theory; North-Holland Publishing: Amsterdam, The Netherlands, 1982.
Holevo, A.S. Statistical Structure of Quantum Theory; Springer-Verlag: Berlin, Germany, 2001.
Auletta, G. Foundations and Interpretation of Quantum Mechanics; World Scientific Publishing: Singapore, 2001.
De Muynck, W.M. Foundations of Quantum Mechanics, an Empiricist Approach; Springer-Verlag: Berlin, Germany, 2002.
Paris, M.G.A.; Řeháček, J. Quantum State Estimation; Springer-Verlag: Berlin, Germany, 2004.
Jaeger, G. Quantum Information: An Overview; Springer-Verlag: Berlin, Germany, 2010.
Hayashi, M. Quantum Information: An Introduction; Springer-Verlag: Berlin, Germany, 2010.
Nielsen, M.A.; Chuang, I.L. Quantum Computation and Quantum Information; Cambridge University Press: Cambridge, UK, 2010.
Beneduci, R. Mathematical Structure of Positive Operator Valued Measures and Applications. Ph.D. Thesis, University of Debrecen, Debrecen, Hungary, December 2014.
Beneduci, R. Joint measurability through Naimark’s theorem. ArXiv E-Prints 2014. arXiv:1404.1477.
Malley, J.D. Statistical Applications of Jordan Algebras. In Lecture Notes in Statistics; Springer-Verlag: Berlin, Germany, 1994; Volume 91.
Malley, J.D.; Fletcher, A. A Universal Bayesian Solution to Quantum State Estimation. Phys. Rev. A 2015. Submitted.
Hardin, J.W.; Hilbe, J.M. Generalized Estimating Equations; Chapman & Hall: Boca Raton, FL, USA, 2002.

© 2015 by the authors; licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/4.0/).