Abstract
In this paper, we consider the classical capacity problem for Gaussian measurement channels. We establish Gaussianity of the average state of the optimal ensemble in the general case and discuss the Hypothesis of Gaussian Maximizers concerning the structure of the ensemble. Then, we consider the case of one mode in detail, including the dual problem of accessible information of a Gaussian ensemble. Our findings are relevant to practical situations in quantum communications where the receiver is Gaussian (say, a general-dyne detection) and concatenation of the Gaussian channel and the receiver can be considered as one Gaussian measurement channel. Our efforts in this and preceding papers are then aimed at establishing full Gaussianity of the optimal ensemble (usually taken as an assumption) in such schemes.
1. Introduction
From the viewpoint of information theory, measurements are hybrid communication channels that transform input quantum states into classical output data. As such, they are described by the classical information capacity which is the most fundamental quantity characterizing their ultimate information-processing performance [1,2,3,4]. Channels with continuous output, such as bosonic Gaussian measurements, do not admit direct embedding into properly quantum channels and, hence, require separate treatment. In particular, their output entropy is the Shannon differential entropy, instead of the quantum entropy, which completely changes the pattern of the capacity formulas. The classical capacity of multimode Gaussian measurement channels was computed in Reference [5] under so-called threshold condition (which includes phase-insensitive or gauge covariant channels as a special case [6]). The essence of this condition is that it reduces the classical capacity problem to the minimum output differential entropy problem solved in Reference [7] (in the context of quantum Gaussian channels, a similar condition was introduced and studied in References [8,9]; also see references therein).
In this paper, we approach the classical capacity problem for Gaussian measurement channels without imposing any kind of threshold condition. In particular, in the framework of quantum communication, this means that both (noisy) heterodyne and (noisy/noiseless) homodyne measurements [10,11] are treated from a common viewpoint. We prove Gaussianity of the average state of the optimal ensemble in general and discuss the Hypothesis of Gaussian Maximizers (HGM) concerning the structure of the ensemble. The proof uses the approach of the paper of Wolf, Giedke, and Cirac [12] applied to the convex closure of the output differential entropy. Then, we discuss the case of one mode in detail, including the dual problem of accessible information of a Gaussian ensemble.
In quantum communications, there are several studies of the classical capacity in the transmission scheme where not only the Gaussian channel but also the receiver is fixed, and the optimization is performed over certain set of the input ensembles (see References [10,13,14,15] and references therein). These studies are practically important in view of greater complexity of the optimal receiver in the Quantum Channel Coding (HSW) theorem (see, e.g., Reference [16]). Our findings are relevant to such a situation where the receiver is Gaussian and concatenation of the channel and the receiver can be considered as one Gaussian measurement channel. Our efforts in this and preceding papers are then aimed at establishing full Gaussianity of the optimal ensemble (usually taken as a key assumption) in such schemes.
2. The Measurement Channel and Its Classical Capacity
An ensemble consists of probability measure on a standard measurable space and a measurable family of density operators (quantum states) on the Hilbert space of the quantum system. The average state of the ensemble is the barycenter of this measure:
the integral existing in the strong sense in the Banach space of trace-class operators on .
Let be an observable (POVM) on with the outcome standard measurable space . There exists a finite measure such that, for any density operator , the probability measure is absolutely continuous w.r.t. thus having the probability density (one can take , where is a nondegenerate density operator). The affine map will be called the measurement channel.
The joint probability distribution of on is uniquely defined by the relation
where A is an arbitrary Borel subset of , and B is that of The classical Shannon information between is equal to
In what follows, we will consider POVMs having (uniformly) bounded operator density, with so that the probability densities are uniformly bounded, . (The probability densities corresponding to Gaussian observables we will be dealing with possess this property). Moreover, without loss of generality [6] we can assume Then, the output differential entropy
is well defined with values in (see Reference [6] for the details). The output differential entropy is concave lower semicontinuous (w.r.t. trace norm) functional of a density operator . The concavity follows from the fact that the function is concave. Lower semicontinuity follows by an application of the Fatou-Lebesgue lemma from the fact that this function is nonnegative, continuous, and
Next, we define the convex closure of the output differential entropy (1):
which is the “measurement channel analog” of the convex closure of the output entropy for a quantum channel [17].
Lemma 1.
The functional is convex, lower semicontinuous and strongly superadditive:
Proof.
The lower semicontinuity follows from the similar property of the output differential entropy much in the same way as in the case of quantum channels, treated in Reference [17], Proposition 4; also see Reference [18], Proposition 1.
Let H be a Hamiltonian in the Hilbert space of the quantum system, E a positive number. Then, the energy-constrained classical capacity of the channel M is equal to
where maximization is over the input ensembles of states satisfying the energy constraint , as shown in Reference [5], proposition 1.
3. Gaussian Maximizers for Multimode Bosonic Gaussian Observable
Consider now multimode bosonic Gaussian system with the quadratic Hamiltonian where is the energy matrix, and is the row vector of the bosonic position-momentum observables, satisfying the canonical commutation relation
(see, e.g., References [11,16]). This describes quantization of a linear classical system with s degrees of freedom, such as finite number of physically relevant electromagnetic modes on the receiver’s aperture in quantum optics.
From now on, we will consider only states with finite second moments. By , we denote the set of all states with the fixed correlation matrix
For centered states (i.e., states with vanishing first moments), the covariance matrix and the matrix of second moments coincide. We denote by centered Gaussian state with the correlation matrix . For states , we have by the maximum entropy principle.
The energy constraint reduces to
(We denote Sp trace of -matrices as distinct from trace of operators on .)
For a fixed correlation matrix , we will study the -constrained capacity
With the Hamiltonian the energy-constrained classical capacity of observable M is
We will be interested in the approximate position-momentum measurement (observable, POVM)
where is centered Gaussian density operator with the covariance matrix and
are the unitary displacement operators. Thus, and the operator-valued density of POVM (11) is In quantum optics, some authors [11,19] call such measurements (noisy) general-dyne detections.
In what follows, we will consider n independent copies of our bosonic system on the Hilbert space We will supply all the quantities related to th copy () with upper index , and we will use tilde to denote quantities related to the whole collection on n copies. Thus,
and
Lemma 2.
Let be a real orthogonal matrix and U—the unitary operator on implementing the linear symplectic transformation
so that
Then, for any state on ,
Proof.
The covariance matrix of is block-diagonal, ; hence, . Thus, we have and taking into account (12),
Therefore, for any state on , the output probability density of the measurement channel corresponding to the input state is
Hence, by using orthogonal invariance of the Lebesgue measure,
If then and taking in the previous formula, we deduce
hence, (13) follows. □
Lemma 3.
Let M be the Gaussian measurement (11). For any state ρ with finite second moments, , where α is the covariance matrix of ρ.
Proof.
The proof follows the pattern of Lemma 1 from the paper of Wolf, Giedke, and Cirac [12]. Without loss of generality, we can assume that is centered. We have
where with symplectic unitary U in corresponding to an orthogonal matrix O as in Lemma 2, and is the th partial state of
Step (1) follows from the additivity (4). Step (2) follows from Lemma 2, and step (3) follows from the superadditivity of (Lemma 1). The final step of the proof,
uses ingeniously constructed U from Reference [12] and lower semicontinuity of (Lemma 1). Namely, and U corresponds via (12) to the following special orthogonal matrix
Every row of the matrix O, except the first one which has all the elements 1, has elements equal to 1 and elements equal to −1. Then, the quantum characteristic function of the states is equal to , where is the quantum characteristic function of the state This allows to apply Quantum Central Limit Theorem [20] to show that as in a uniform way, implying (16); see Reference [12] for details. □
Theorem 1.
Proof.
Lemma 3 implies that, for any with finite second moments, , where is the covariance matrix of . On the other hand, by the maximum entropy principle, . Hence, (17) is maximized by a Gaussian density operator. □
Remark 1.
The proof of Lemma 2 and, hence, of Theorem 1 can be extended to a general Gaussian observable M in the sense of References [16,21], defined via operator-valued characteristic function of the form
where K is a scaling matrix, γ is the measurement noise covariance matrix, and . Then, the Fourier transform of the measurement probability density is equal to , and one can use this function to obtain generalization of the relation (14) for the measurement probability densities. The case (11) corresponds to the type 1 Gaussian observable [21] with . However, (19) also includes type 2 and 3 observables (noisy and noiseless multimode homodyning), in which case K is a projection onto an isotropic subspace of Z (i.e., one on which the symplectic form Δ vanish.)
Remark 2.
Theorem 1 establishes Gaussianity of the average state of the optimal ensemble for a general Gaussian measurement channel. However, Gaussian average state can appear in a non-Gaussian ensemble. An immediate example is thermal state represented as a mixture of the Fock states with geometric distribution. Thus, Theorem 1 does not necessarily imply full Gaussianity of the optimal ensemble as formulated in the following conjecture.
Hypothesis of Gaussian Maximizers (HGM).
Let M be an arbitrary Gaussian measurement channel. Then, there exists an optimal Gaussian ensemble for the convex closure of the output differential entropy (2) with Gaussian ρ and, hence, for the energy-constrained classical capacity (6) of the channel M. More explicitly, the ensemble consists of (properly squeezed) coherent states with the displacement parameter having Gaussian probability distribution.
For Gaussian measurement channels of the type 1 (essentially of the form (11), see Reference [21] for complete classification) and Gaussian states satisfying the “threshold condition”, we have
with the minimum attained on a squeezed coherent state, which implies the validity of the HGM and an efficient computation of ; see Reference [5]. On the other hand, the problem remains open in the case where the “threshold condition” is violated, and in particular, for all Gaussian measurement channels of the type 2 (noisy homodyning), with the generic example of the energy-constrained approximate measurement of the position subject to Gaussian noise (see Reference [22], where the entanglement-assisted capacity of such a measurement was computed). In the following section, we will touch upon the HGM in this case for one mode system.
4. Gaussian Measurements in One Mode
Our framework in this section will be one bosonic mode described by the canonical position and momentum operators . We recall that
are the unitary displacement operators.
We will be interested in the observable
where is centered Gaussian density operator with the covariance matrix
Let be a centered Gaussian density operator with the covariance matrix
The problem is, to compute and, hence, the classical capacity for the oscillator Hamiltonian (as shown in the Appendix of Reference [22], we can restrict to Gaussian states with the diagonal covariance matrix in this case). The energy constraint (9) takes the form
The measurement channel corresponding to POVM (21) acts on the centered Gaussian state by the formula
so that
In this expression, c is a fixed constant depending on the normalization of the underlying measure in (1). It does not enter the information quantities which are differences of the two differential entropies.
Assuming validity of the HGM, we will optimize over ensembles of squeezed coherent states
where is centered Gaussian state with correlation matrix and the vector has centered Gaussian distribution with covariance matrix Then, the average state of the ensemble is centered Gaussian with the covariance matrix (23), where
hence,
For this ensemble,
Then, the hypothetical value:
The derivative of the minimized expression vanishes for Thus, depending on the position of this value with respect to the interval (27), we obtain three possibilities):
Here, the column C corresponds to the case where the “threshold condition” holds, implying (20). Then the full validity of the HGM in much more general multimode situation was established in Reference [5]. All the quantities in this column, as well as the value of in the central column of Table 2, were obtained in that paper as an example. On the other hand, the HGM remains open in the cases of mutually symmetric columns L and R (for the derivation of the quantities in column L of Table 1 and Table 2 see Appendix A).
Table 1.
The three parameter ranges.
Maximizing over which satisfy the energy constraint (24) (with the equality): , we obtain depending on the signal energy E and the measurement noise variances
where we introduced the “energy threshold function”
Table 2.
The values of the capacity .
Table 2.
The values of the capacity .
| L: HGM open | C: HGM valid [5] | R: HGM open |
In the gauge invariant case when , the threshold condition amounts to , which is fulfilled by definition, and the capacity formula gives the expression equivalent to one obtained in Hall’s 1994 paper [13].
Let us stress that, opposite to column C, the values of in the L and R columns are hypothetic, conditional upon validity of the HGM. Looking into the left column, one can see that and do not depend at all on Thus, we can let the variance of the momentum p measurement noise and, in fact, set which is equivalent to the approximate measurement only of the position q described by POVM
which belongs to type 2 according to the classification of Reference [21]. In other words, one makes the “classical” measurement of the observable
with the quantum energy constraint .
The measurement channel corresponding to POVM (29) acts on the centered Gaussian state by the formula
In this case, we have
which differ from the values in the case of finite by the absence of the factor under the logarithms, while the difference and the capacity have the same expressions as in that case (column L).
5. The Dual Problem: Accessible Information
Let us sketch here ensemble-observable duality [1,2,4] (see Reference [6] for details of mathematically rigorous description in the infinite dimensional case).
Let be an ensemble, a finite measure and an observable having operator density with values in the algebra of bounded operators in . The dual pair ensemble-observable is defined by the relations
Then, the average states of both ensembles coincide
and the joint distribution of is the same for both pairs and so that
Moreover,
where the supremum in the right-hand side is taken over all ensembles satisfying the condition . It can be shown (Reference [6], Proposition 4), that the supremum in the lefthand side remains the same if it is taken over all observables M (not only of the special kind with the density we started with), and then it is called the accessible information of the ensemble . Thus,
Since the application of the duality to the pair results in the initial pair we also have
Coming to the case of bosonic mode, we fix the Gaussian state and restrict to ensembles with Let M be the measurement channel corresponding to POVM (21). Then, according to formulas (34), the dual ensemble where is the Gaussian probability density (25) and
By using the formula for , where are Gaussian operators (see Reference [24] and also Corollary in the Appendix of Reference [25]), we obtain
where
and
Since then, from second and third equations in (39), we obtain By denoting , the density of this normal distribution, we can equivalently rewrite the ensemble as with the average state Then, HGM is equivalent to the statement
where the values of are given in Table 1; however, they should be reexpressed in terms of the ensemble parameters . In Reference [25], we treated the case C in multimode situation, establishing that the optimal measurement is Gaussian, and described it. Here, we will discuss the case L (R is similar) and show that, for large (including ), the HGM is equivalent to the following: the value of the accessible information
is attained on the sharp position measurement (in fact, this refers to the whole domain L: which, however, has rather cumbersome description in the new variables , cf. Reference [25]).
In the one mode case we are considering, the matrix is given by (23), —by (22), and so that Computations according to (39) and (40) give
But under the sharp position measurement one has (in the formulas below, means that is Gaussian probability density with mean m and variance ):
while (note that ), and
which is identical to the expression in (41).
In the case of the position measurement channel M corresponding to POVM (29) (, we have otherwise, the argument is essentially the same. Thus, we obtain that the HGM concerning in case L is equivalent to the following:
The accessible information of a Gaussian ensemble where
is given by the expression (43) and attained on the sharp position measurement
6. Discussion
In this paper, we investigated the classical capacity problem for Gaussian measurement channels. We established Gaussianity of the average state of the optimal ensemble in full generality and discussed the Hypothesis of Gaussian Maximizers concerning the detailed structure of the ensemble. Gaussian systems form the backbone of information theory with continuous variables, both in the classical and in the quantum case. Starting from them, other, non-linear models can be constructed and investigated. Therefore, the quantum Gaussian models must be studied exhaustively. Despite the progress made, there are still intriguing gaps along this way. A major problem remains the proof (or refutation) of the hypothesis of Gaussian optimizers for various entropy characteristics of quantum Gaussian systems and channels. So far, the proof of this hypothesis in special cases required tricky and special constructions, such as in the path-breaking paper [7] concerning gauge-covariant channels, or in Section 3 of the present work concerning general Gaussian measurement channels. It seems plausible that quantum Gaussian systems may have some as yet undiscovered structural property, from which a proof of this hypothesis in its maximum generality would follow in a natural way.
Funding
This work was performed at the Steklov International Mathematical Center and supported by the Ministry of Science and Higher Education of the Russian Federation (agreement no. 075-15-2019-1614).
Acknowledgments
The author is grateful to M. J. W. Hall for sending a copy of his paper [13], and to M. E. Shirokov for the comments improving the presentation.
Conflicts of Interest
The author declares no conflict of interest.
Appendix A. Case L in Tables 1 and 2
By taking the Gaussian ensemble parameters in (28) as
we get the hypothetic value
hence taking into account (26),
The Gaussian constrained capacity is
where, in the second line, we took the maximal value . Differentiating, we obtain the equation for the optimal value :
the positive solution of which is
whence
The parameters of the optimal Gaussian ensemble are obtained by substituting the value (A5) into (A1) with .
The above derivation concerns the measurement (21) ( The case of the measurement (29) ( is treated similarly, with (A2), (26) replaced by (32), (31). Notably, in this case, the expression (A6) coincides with the one obtained in Reference [13] by optimizing the information from applying sharp position measurement to noisy optimally squeezed states (the author is indebted to M. J. W. Hall for this observation).
References
- Hall, M.J.W. Quantum information and correlation bounds. Phys. Rev. A 1997, 55, 1050–2947. [Google Scholar] [CrossRef]
- Dall’Arno, M.; D’Ariano, G.M.; Sacchi, M.F. Informational power of quantum measurements. Phys. Rev. A 2011, 83, 062304. [Google Scholar] [CrossRef]
- Oreshkov, O.; Calsamiglia, J.; Munoz-Tapia, R.; Bagan, E. Optimal signal states for quantum detectors. New J. Phys. 2011, 13, 073032. [Google Scholar] [CrossRef]
- Holevo, A.S. Information capacity of quantum observable. Probl. Inform. Transm. 2012, 48, 1–10. [Google Scholar] [CrossRef][Green Version]
- Holevo, A.S.; Kuznetsova, A.A. Information capacity of continuous variable measurement channel. J. Phys. A Math. Theor. 2020, 53, 175304. [Google Scholar] [CrossRef]
- Holevo, A.S. Gaussian maximizers for quantum Gaussian observables and ensembles. IEEE Trans. Inform. Theory 2020, 66, 5634–5641. [Google Scholar] [CrossRef]
- Giovannetti, V.; Holevo, A.S.; Mari, A. Majorization and additivity for multimode bosonic Gaussian channels. Theor. Math. Phys. 2015, 182, 284–293. [Google Scholar] [CrossRef]
- Schäfer, J.; Karpov, E.; Garćia-Patrón, R.; Pilyavets, O.V.; Cerf, N.J. Equivalence Relations for the Classical Capacity of Single-Mode Gaussian Quantum Channels. Phys. Rev. Lett. 2013, 111, 030503. [Google Scholar] [CrossRef]
- Holevo, A.S. On the constrained classical capacity of infinite-dimensional covariant channels. J. Math. Phys. 2016, 57, 15203. [Google Scholar] [CrossRef]
- Caves, C.M.; Drummond, P.D. Quantum limits on bosonic communication rates. Rev. Mod. Phys. 1994, 68, 481–537. [Google Scholar] [CrossRef]
- Serafini, A. Quantum Continuous Variables: A Primer of Theoretical Methods; CRC Press: Boca Raton, FL, USA, 2017. [Google Scholar]
- Wolf, M.M.; Giedke, G.; Cirac, J.I. Extremality of Gaussian quantum states. Phys. Rev. Lett. 2006, 96, 080502. [Google Scholar] [CrossRef] [PubMed]
- Hall, M.J.W. Gaussian noise and quantum optical communication. Phys. Rev. A 1994, 50, 3295–3303. [Google Scholar] [CrossRef] [PubMed]
- Takeoka, M.; Guha, S. Capacity of optical communication in loss and noise with general Gaussian receivers. Phys. Rev. A 2014, 89, 042309. [Google Scholar] [CrossRef]
- Lee, J.; Ji, S.W.; Park, J.; Nha, H. Gaussian benchmark for optical communication aiming towards ultimate capacity. Phys. Rev. A 2016, 93, 050302. [Google Scholar] [CrossRef]
- Holevo, A.S. Quantum Systems, Channels, Information: A Mathematical Introduction, 2nd ed.; De Gruyter: Berlin, Germany; Boston, MA, USA, 2019. [Google Scholar]
- Shirokov, M.E. On entropic quantities related to the classical capacity of infinite dimensional quantum channels. Theor. Probab. Appl. 2007, 52, 250–276. [Google Scholar] [CrossRef]
- Shirokov, M.E. On properties of the space of quantum states and their application to the construction of entanglement monotones. Izv. Math. 2010, 74, 849–882. [Google Scholar] [CrossRef]
- Wiseman, H.M.; Milburn, G.J. Quantum Measurement and Control; Cambridge University Press: New York, NY, USA, 2010. [Google Scholar]
- Cushen, C.D.; Hudson, R.L. A quantum mechanical central limit theorem. J. Appl. Prob. 1971, 8, 454–469. [Google Scholar] [CrossRef]
- Holevo, A.S. The structure of general quantum Gaussian observable. arXiv 2020, arXiv:2007.02340. [Google Scholar]
- Holevo, A.S.; Yashin, V.I. Maximum information gain of approximate quantum position measurement. Quantum Inf. Process. 2021, 20, 97. [Google Scholar] [CrossRef]
- Hall, M.J.W. Information exclusion principle for complementary observables. Phys. Rev. Lett. 1995, 74, 3307. [Google Scholar] [CrossRef] [PubMed]
- Lami, L.; Das, S.; Wilde, M.M. Approximate reversal of quantum Gaussian dynamics. J. Phys. A 2018, 51, 125301. [Google Scholar] [CrossRef]
- Holevo, A.S. Accessible information of a general quantum Gaussian ensemble. arXiv 2021, arXiv:2102.01981. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).