The Role of Auxiliary Stages in Gaussian Quantum Metrology

Triggiani, Danilo; Facchi, Paolo; Tamma, Vincenzo

doi:10.3390/photonics9050345

Open AccessReview

The Role of Auxiliary Stages in Gaussian Quantum Metrology

by

Danilo Triggiani

¹

,

Paolo Facchi

^2,3

and

Vincenzo Tamma

^1,4,*

¹

School of Mathematics and Physics, University of Portsmouth, Portsmouth PO1 3QL, UK

²

Dipartimento di Fisica and MECENAS, Università di Bari, I-70126 Bari, Italy

³

INFN, Sezione di Bari, I-70126 Bari, Italy

⁴

Institute of Cosmology and Gravitation, University of Portsmouth, Portsmouth PO1 3FX, UK

^*

Author to whom correspondence should be addressed.

Photonics 2022, 9(5), 345; https://doi.org/10.3390/photonics9050345

Submission received: 30 March 2022 / Revised: 5 May 2022 / Accepted: 6 May 2022 / Published: 14 May 2022

(This article belongs to the Special Issue Quantum Optics: Entanglement and Coherence in Photonic Systems)

Download

Browse Figures

Versions Notes

Abstract

:

The optimization of the passive and linear networks employed in quantum metrology, the field that studies and devises quantum estimation strategies to overcome the levels of precision achievable via classical means, appears to be an essential step in certain metrological protocols achieving the ultimate Heisenberg-scaling sensitivity. This optimization is generally performed by adding degrees of freedom by means of auxiliary stages, to optimize the probe before or after the interferometric evolution, and the choice of these stages ultimately determines the possibility to achieve a quantum enhancement. In this work we review the role of the auxiliary stages and of the extra degrees of freedom in estimation schemes, achieving the ultimate Heisenberg limit, which employ a squeezed-vacuum state and homodyne detection. We see that, after the optimization for the quantum enhancement has been performed, the extra degrees of freedom have a minor impact on the precision achieved by the setup, which remains essentially unaffected for networks with a larger number of channels. These degrees of freedom can thus be employed to manipulate how the information about the structure of the network is encoded into the probe, allowing us to perform quantum-enhanced estimations of linear and non-linear functions of independent parameters.

Keywords:

quantum metrology; quantum sensing; distributed parameter; heisenberg limit; typicality; gaussian metrology; squeezing; estimation of functions

1. Introduction

In recent years, much attention has been put in the study of metrological schemes that exploit quantum resources, such as entanglement and squeezing, to enhance the sensitivity in the estimation of physical properties beyond the possibilities of classical strategies, with applications to imaging [1,2], thermometry [3,4], mapping of magnetic fields [5,6] and gravitational waves detection [7], among others. One of the most emblematic quantum enhancements sought in quantum metrology is the renown Heisenberg limit [8,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28], which consists in achieving a scaling of the estimation error in the number N of probes (typically photons, or atoms) of order of

1 / N

, which surpasses the classical (or shot-noise) limit

1 / \sqrt{N}

.

Gaussian metrology, which specializes in the study of estimation schemes employing Gaussian states of light and squeezing as metrological resource [29,30,31,32], represents a promising path towards a feasible quantum-enhancement in estimation strategies and the Heisenberg-scaling sensitivity [33,34,35,36,37,38,39,40,41]. It exploits the possibility to reduce the intrinsic noise of the electromagnetic field quadratures below the quantum fluctuations of the vacuum. Such a reduced noise, together with relatively easy-to-implement experimental procedures to produce these squeezed-noise states, and their increased robustness to decoherence compared to entangled states, make the Gaussian approach of great interest for short-term applications of quantum technologies. A particular case analysed by quantum metrology is the estimation of a single unknown parameter that appears within a given optical linear network multiple times, affecting for example different interferometric components [34,35,36,38,42,43,44,45,46,47,48] (see Figure 1). This is the case of unknown temperatures or magnitudes of the electromagnetic field, which modifies the physical properties of the optical parts composing the network within the regime of passive and linear evolution of the probe. The field investigating this type of schemes is generally referred to as distributed metrology, since the unknown parameter is effectively distributed among multiple components of the network.

On the other hand, estimation schemes based on Gaussian metrology usually incur in the challenge of adaptivity, i.e., the fact that the protocol depends on the value of the parameter that tries to estimate [44,49,50,51,52]. A typical approach to deal with adaptivity in Gaussian metrology consists in limiting the values that the unknown parameter can take, for example restricting the working range of the estimation scheme only to small values of the parameter, a condition that is common in a typical interferometric setup [43,46,48,53]. However, this solution unfortunately excludes certain experimental situations which require the ability to perform a quantum-enhanced estimation of the unknown parameter without imposing restrictions on its value.

Remarkably, it has been found in a recent work that is possible to achieve Heisenberg-scaling sensitivity in the estimation of a given unknown parameter distributed in an arbitrary M-channel network with a Gaussian scheme that only requires a classical knowledge of the parameter to optimize the network, i.e., the unknown parameter must be known with a prior precision that can be achieved with a classical estimation strategy [35]. To implement this scheme, a single auxiliary optical network is required in order to correctly refocus the probe, a squeezed-vacuum state, into the only output port observed through homodyne detection after the interferometric evolution, and a classical knowledge on the unknown parameter is required to engineer this auxiliary network. In other words, since in general an arbitrary M-channel linear network which encodes an unknown parameter does not refocus the probe into a single output port, some degrees of freedom must be introduced in the network by adding an auxiliary stage, which need to be optimized through a classical estimation strategy, which assures that the refocusing is correctly performed [35].

Within this scheme, provided that the optimization of the degrees of freedom has been performed, and the probe is correctly refocused in the only observed output port, it is always possible to add a second auxiliary network, which represents further extra degrees of freedom introduced in the optical network. One may wonder how the choice of this further auxiliary stage, and thus the presence of extra degrees of freedom, can influence the estimation scheme. Remarkably, in Ref. [36] it has been shown that the extra degrees of freedom introduced with a second auxiliary network do not have a major impact on the precision of the estimation of the unknown parameter. In particular, for networks with a large number of channels, it was shown that a random choice of the non-optimized auxiliary network leaves the precision of the estimation scheme essentially unaltered [36]. Therefore, this auxiliary stage and the extra degrees of freedom can be used to manipulate the way the information on the structure of the network is encoded in the probe.

In Ref. [37] it has been shown that it is possible to employ the previous scheme to achieve the Heisenberg-scaling sensitivity in the estimation of suitable functions of multiple parameters, and to exploit the degrees of freedom introduced by the non-optimized network to control the form of the function that can be estimated. Despite, in principle, complications may arise due to the presence of multiple parameters, representing multiple sources of uncertainty, which require a more complex mathematical formalism to describe a multi-parameter scenario, it is possible to employ the further degrees of freedom to manipulate the way the information on the structure of the network is encoded into the probe, and ultimately control the function of the parameters that is estimated at the Heisenberg-scaling sensitivity [37]. It is worth mentioning that, although the task of estimating functions of unknown parameters can be easily performed estimating separately each single parameter, and then evaluating the function in the data post-processing analysis, the ability to directly estimate a global property (e.g., spatial average of a field, field-gradients, or non-linear functions) allows one to avoid the waste of resources to obtain superfluous information on each single parameter, hence its relevance in applications, such as evaluation of averages of magnetic fields or temperatures.

This review is organized as follows. First, we briefly review the general scheme presented in Ref. [35] for the estimation of a single distributed parameter in Section 2. In Section 3 we discuss the effect of the presence of a second auxiliary network on the precision of the estimation, showing that the extra degrees of freedom introduced after the optimization of the refocusing has been performed do not essentially affect the precision of the estimation scheme [36]. Lastly, we will see in Section 4 how it is possible to exploit the exceeding degrees of freedom of the auxiliary stages which are not employed to optimize the network in a network with multiple parameters to change the function that can be estimated at the Heisenberg-scaling precision [37]. We conclude presenting two examples. The first is a 2-channel network which allows to estimate a function of three parameters (two optical phases and a beam-splitter reflectivity) parametrized by some quantities that can be chosen arbitrarily through the auxiliary stages. We will see that, according to the choice of the auxiliary networks, the function estimated can be linear or non-linear in the three parameters. The second is a scheme for the estimation of any linear combination of parameters with positive weights. In particular, we will show how it is possible to employ this scheme when the unknown parameter are not only phase-shifts, but also reflectivities of beam-splitters, or more in general phases acquired through complex local networks.

2. Distributed-Parameter Quantum-Enhanced Estimation

We will start introducing the Gaussian estimation scheme for M-channel networks which achieves the Heisenberg-scaling sensitivity recently proposed in Ref. [35]. Let us consider an arbitrary linear passive network

{\hat{U}}_{φ}

which depends on a single parameter

φ

possibly distributed among several components of the network. The preparation of the input probe consists in the injection of a single-mode squeezed vacuum state in the first port of an auxiliary linear and passive network

{\hat{V}}_{in}

, which is used to scatter the photons injected among all the modes. The input state in our protocol is therefore given by

∣ ψ_{0} 〉 = {\hat{V}}_{in} {\hat{S}}_{1} (r) ∣ vac 〉

(1)

where

{\hat{S}}_{1} (r) = e^{\frac{r}{2} ({\hat{a}}_{1}^{2} - {\hat{a}}_{1}^{† 2})}

is the squeezing operator associated with the first channel with squeezing parameter

r > 0

, and

∣ vac 〉

is the M-channel vacuum state. The average number N of photons injected in the network is thus

N = {sinh}^{2} r

. At the output of

{\hat{U}}_{φ}

, a further auxiliary network

{\hat{V}}_{out}

is employed to refocus all the photons into a single mode, namely the first one, in order to capture all the information about the parameter in a single channel, at which homodyne detection is performed with a given local oscillator phase

θ

. For a linear passive unitary

\hat{U}

, it is possible to introduce the

M \times M

unitary matrix U whose elements

U_{i j}

represent the single-photon transition amplitudes from the i-th input port to the j-th output port, defined by the map

{\hat{U}}^{†} {\hat{a}}_{i}^{†} \hat{U} = \sum_{j = 1}^{M} U_{i j} {\hat{a}}_{j}^{†} .

(2)

We now introduce the

M \times M

unitary matrix

u_{φ} = V_{out} U_{φ} V_{in}

associated with the evolution through the whole network

{\hat{V}}_{out} {\hat{U}}_{φ} {\hat{V}}_{in}

. Then, we can write the probability

P_{φ} = | {(u_{φ})}_{11} |^{2} = {| {(V_{out} U_{φ} V_{in})}_{11} |}^{2}

(3)

that a photon injected in the first port of

{\hat{V}}_{in}

comes out from the first port of

{\hat{V}}_{out}

, and the phase

γ_{φ} = arg [{(u_{φ})}_{11}] = arg [{(V_{out} U_{φ} V_{in})}_{11}]

(4)

accumulated through this interferometric evolution. One can show, by employing Cramér-Rao analysis [54,55], that it is possible to achieve Heisenberg-scaling sensitivity in the estimation of

φ

if the conditions

\begin{matrix} γ_{φ} - θ = \pm \frac{π}{2} + \frac{k}{N} + O (\frac{1}{N^{2}}), \end{matrix}

(5a)

\begin{matrix} P_{φ} = 1 - \frac{ℓ}{N} + O (\frac{1}{N^{2}}), \end{matrix}

(5b)

are satisfied, where

ℓ ⩾ 0

and

k \neq 0

are arbitrary but both independent of N, and where

θ

is an optimal choice for the local oscillator phase. Under conditions (5), the ultimate precision achievable by this scheme with any estimator

\tilde{φ}

after

ν

iterations of the measurement is given by

Var [\tilde{φ}] ⩾ \frac{1}{ν F (φ)} = \frac{1}{8 ν ϱ (k, ℓ) {(\partial_{φ} γ_{φ})}^{2} N^{2}},

(6)

where

F (φ)

is the Fisher information associated with this estimation scheme, and

ϱ (k, ℓ) = {(\frac{8 k}{1 + 16 k^{2} + 4 ℓ})}^{2},

(7)

is an N-independent factor which reaches its maximum

ϱ = 1

at

k = 1 / 4

and

ℓ = 0

.

One can show that it is possible to optimize the refocusing network

{\hat{V}}_{out}

with only classical prior information on the parameter

φ

, namely after a classical strategy is employed to perform a prior coarse estimation of the unknown parameter.

In the following we will focus on the relation between the non-optimized auxiliary network

{\hat{V}}_{in}

, which yields further degrees of freedom in the linear network of the scheme, and the precision shown in Equation (6). Then, we will show how it is possible to employ these extra degrees of freedom to manipulate the function of multiple parameters that can be estimated with this scheme.

3. Typicality of Quantum Enhanced Sensitivity

In the previous section we have presented a generic protocol that allows us to achieve the Heisenberg limit in the estimation of a parameter

φ

distributed in an arbitrary network

{\hat{U}}_{φ}

when conditions (5) are met. In particular, in order to satisfy condition (5b), either the scattering stage

{\hat{V}}_{in}

or the refocusing stage

{\hat{V}}_{out}

needs to be optimized after a classical prior estimation of the parameter

φ

is carried out. The remaining auxiliary stage is thus left completely arbitrary, and one may wonder how the choice of this stage can influence the precision of the estimation, and in particular the pre-factor

{(\partial_{φ} γ_{φ})}^{2}

appearing in the sensitivity in Equation (6). More precisely, it may happen that a particularly unfortunate choice of the non-optimized stage causes this pre-factor to vanish, for example if this auxiliary stage transforms the optical mode of the probe into a mode which is insensible on or independent of the value of the parameter (A trivial example is the case where the unitary matrix describing the network is

U_{φ} = diag (1, exp (i φ))

, and the auxiliary stage is the identity

V_{in} = 𝟙_{2}

, for

M = 2

. In this case the probe is left in the first channel of the network, which does not depend on

φ

.) In this section we will see that, for an arbitrary choice of the non-optimized auxiliary stage, the pre-factor

{(\partial_{φ} γ_{φ})}^{2}

tends to be far from zero, meaning that small values are mostly unlikely, especially for networks with a large number M of channels [36]. For simplicity, we will explicitly consider the case in which the refocusing stage

{\hat{V}}_{out}

has been optimized to satisfy condition (5b) while

{\hat{V}}_{in}

is left arbitrary, but similar considerations can be done in the opposite scenario due to the symmetry of the problem.

3.1. The Role of the Generator $G_{φ}$

We can link the pre-factor

{(\partial_{φ} γ_{φ})}^{2}

, appearing in Equation (6), to the derivative of the matrix element

{(u_{φ})}_{11}

\begin{matrix} | {(\partial_{φ} u_{φ})}_{11} |^{2} & = | (\partial_{φ} \sqrt{P_{φ}} + i (\partial_{φ} γ_{φ}) \sqrt{P_{φ}}) e^{i γ_{φ}} |^{2} \\ = {(\partial_{φ} \sqrt{P_{φ}})}^{2} + {(\partial_{φ} γ_{φ})}^{2} P_{φ} . \end{matrix}

(8)

When condition (5b) holds, Equation (8) simplifies to

{(\partial_{φ} γ_{φ})}^{2} = {| {(\partial_{φ} u_{φ})}_{11} |}^{2} + O (\frac{1}{N}),

(9)

so that the two quantities are equal up to a term of order

1 / N

. Condition (5b) can be recast in terms of a constraint on the form of

V_{out}

{(V_{out})}_{1 i} = {(V_{in}^{†} U_{φ}^{†})}_{1 i} + O (\frac{1}{\sqrt{N}})

(10)

If we now introduce the (generally

φ

-dependent) generator

G_{φ} : = i U_{φ}^{†} \frac{\partial U_{φ}}{\partial φ}

(11)

of the unitary matrix

U_{φ}

, we can further manipulate the pre-factor

{(\partial_{φ} γ_{φ})}^{2}

. Employing the definition of

G_{φ}

in Equation (11), and the relation in Equation (10), we can write

\begin{matrix} | {(\partial_{φ} u_{φ})}_{11} |^{2} & = | {(V_{out} \frac{\partial U_{φ}}{\partial φ} V_{in})}_{11} |^{2} \\ = | {(V_{out} U_{φ} G_{φ} V_{in})}_{11} |^{2} \\ = {(V_{in}^{†} G_{φ} V_{in})}_{11}^{2} + O (\frac{1}{N}) \end{matrix}

(12)

Equations (9) and (12) conveniently express the pre-factor

{(\partial_{φ} γ_{φ})}^{2}

as a function of the generator

G_{φ}

of the network and a unitary matrix

U = V_{in}

independent of the optimized stage

V_{out}

, so that ultimately we can write for large N

{(\partial_{φ} γ_{φ})}^{2} \sim f (U, G_{φ}) = {(U^{†} G_{φ} U)}_{11}^{2}

(13)

and thus the Fisher information appearing in the sensitivity in Equation (6) becomes

F (φ) \sim 8 ϱ (k, ℓ) f (U, G_{φ}) N^{2} .

(14)

It is easy to see that the pre-factor

f (U, G_{φ})

can be written as the square of a convex combination of the eigenvalues

{g_{i}}_{i = 1, \dots, M}

of the generator

G_{φ}

. In fact, if we call

V_{φ}

the matrix whose columns are the eigenvectors of

G_{φ}

, so that

D \equiv diag (g_{1}, \dots, g_{M}) = V_{φ}^{†} G_{φ} V_{φ}

, we can recast the pre-factor in terms of the eigenvalues

{g_{i}}_{i = 1, \dots, M}

{(U^{†} G_{φ} U)}_{11}^{2} = {(\sum_{j = 1}^{M} w_{j} g_{j})}^{2},

(15)

where

w_{j} = {| {(V_{φ}^{†} U)}_{j 1} |}^{2}

, with

\sum_{j} w_{j} = 1

by the unitarity of

V_{φ}^{†} U

. If we suppose, without lack of generality, that the eigenvalues

g_{i}

are ordered so that

| g_{i} | ⩾ | g_{i + 1} |

, the maximum value of the pre-factor is achieved for

w_{1} \equiv {| {(V_{φ}^{†} U)}_{11} |}^{2} = 1,

(16)

namely when the first column of U coincides (up to a complex phase) with the first eigenvector of

G_{φ}

, belonging to the eigenvalue with the larger absolute value. Recalling that U represents the action of the input auxiliary stage (see Equation (12)), and that the elements of its first column coincides with the transition amplitudes from the first input channel (see Equation (2)), we can understand the meaning of the condition (16): the input stage must be chosen in order to maximize the effect of the network

{\hat{U}}_{φ}

on the probe, which must evolve under the optical mode (not necessarily coinciding with a physical channel) which is most sensitive to the variations of

φ

—i.e., corresponding to the eigenvalue of

G_{φ}

with maximum absolute value. For a choice of U satisfying condition (16) (e.g., for

U = V_{φ}

), the pre-factor in Equation (13) coincides with the highest eigenvalue of the generator squared

max_{U} f (U, G_{φ}) \equiv f_{\max} = g_{1}^{2} \equiv ‖ G_{φ} ‖_{2},

(17)

namely the squared norm of the generator

G_{φ}

. This coincides with the pre-factor of the maximum Quantum Fisher information for Gaussian states found in Ref. [44], meaning that the scheme presented in Section 2 is the optimal Gaussian strategy when also the auxiliary stage

{\hat{V}}_{in} \equiv U

is optimized according to Equation (16).

3.2. Typical Behaviour of the Pre-Factor in the Heisenberg Scaling

Although the condition for the optimal choice of the unitary U has been easily found, the eigenvectors of

G_{φ}

in general depend on the value of the distributed parameter

φ

, and on the structure of the network

{\hat{U}}_{φ}

, and so do the solutions of Equation (16). On the other hand, one may be interested in the case in which no prior knowledge on the structure of

{\hat{U}}_{φ}

—and thus on

G_{φ}

—is given. In general, with these assumptions, finding the optimal stage that maximizes the pre-factor

f (U, G_{φ})

and satisfies condition (16) is impossible. Instead, in this scenario, it becomes more appropriate to study the behaviour of the pre-factor for arbitrary choices of the auxiliary network, and ultimately of the non-fixed degrees of freedoms introduced with it. In particular, one may be interested in knowing how likely the value of

f (U, G_{φ})

is equal or close to zero, when the auxiliary stage is chosen at random. However, to introduce concepts such as ‘how likely’ and ‘at random’, we must somehow endow the set of all the auxiliary linear network with a probability measure. Conveniently, there is already a mathematical structure which well represents this set, which we have already extensively employed: every M-channel linear and passive network is described through the action of a

M \times M

unitary matrix, and the ‘composition rules’ of linear networks is well represented by the composition rules of U(M), the group of

M \times M

unitary matrices.

Since we are supposing that we do not possess any prior information on the structure of

U_{φ}

, all the auxiliary stages U are equivalent candidate to achieve the optimal value for the pre-factor. For this reason, we can endow U(M) with its Haar measure

P

[56]. This is the measure on the space of the

M \times M

unitary matrices that generalizes the uniform distribution on finite intervals of

R

. In fact, the Haar measure on U(M) is defined so that the measure

P

of any open subset

U \subset U (M)

is invariant under left or right unitary transformations, namely

P (U) = P (U^{'} U) = P (U U^{'})

, for every

U^{'} \in U (M)

. In other words, it is a measure that only depends on the ‘size’ of the subsets of U(M). Once we have chosen the prescription to sample the network U, we are able to evaluate statistical properties of the pre-factor

f (U, G_{φ})

. In particular, we show in Appendix A that

E_{P} [f (U, G_{φ})] = \frac{Tr (G_{φ}^{2}) + {Tr (G_{φ})}^{2}}{M (M + 1)}

(18)

is the expectation value of the pre-factor (13) over random choices of the matrix U, with respect to the Haar measure. In the case of a generator proportional to the identity

G_{φ} = ‖ G_{φ} ‖ 𝟙_{M}

, which corresponds to a network made of M identical copies of single-channel unitaries acting in parallel (This condition is satisfied for example for the metrological scheme introduced in Ref. [12], although in such a case the generator

G_{φ}

is independent of

φ

), we have

{Tr (G_{φ})}^{2} = M^{2} {‖ G_{φ} ‖}^{2}

and

Tr (G_{φ}^{2}) = M {‖ G_{φ} ‖}^{2}

, so that the average value of the pre-factor equals the maximum

f_{\max} = {G_{φ}}^{2}

in Equation (17). Indeed, with this kind of network, the auxiliary stage which distribute the probe on all M channels becomes irrelevant, since the network acts identically in each channel. Indeed, in such a case any unitary matrix

V_{φ} \equiv U

diagonalizes the generator

G_{φ}

and therefore Equation (16) is satisfied for any

V_{in} \equiv U

.

In general, an average value of the pre-factor close to the maximum in Equation (17) is favourable to a smaller value, since it implies a better Fisher information and a better precision in average. The only case for which

E_{P} [f (U, G_{φ})]

is equal to zero is when the whole generator is vanishing, occurrence happening only for networks

U_{φ}

which do not actually depend on the unknown parameter. A generator

G_{φ}

with small eigenvalues—i.e., a network which is not very sensitive to the variations of the parameter

φ

—would cause the average in Equation (18) to decrease, diminishing the average precision of the estimation scheme. However, it is possible to find a lower bound on the average value (18) using Jensen’s inequality

E [X^{2}] ⩾ E {[X]}^{2}

to obtain

E_{P} [f (U, G_{φ})] ⩾ E_{P} {[{(U^{†} G_{φ} U)}_{11}]}^{2} = {[\frac{Tr (G_{φ})}{M}]}^{2}

(19)

(see Appendix A). We notice how the right-hand side term in Equation (19) is the squared average of the eigenvalues of

G_{φ}

. This means that if we are able to control the value of the average of the generator, say in such a way that it is larger of a certain fraction

R > 0

of the norm of the generator, i.e.,

Tr [G_{φ}] / M ⩾ R ‖ G_{φ} ‖

, then it follows from Equation (19) that we can assure that the average of the pre-factor is larger than a fraction

R^{2}

of its maximum value

E_{P} [f (U, G_{φ})] ⩾ R^{2} {‖ G_{φ} ‖}^{2} \equiv R^{2} f_{\max}

.

Even though we may be able to control the average of

f (U, G_{φ})

through Equation (19), it may happen that the typical values that the pre-factor takes are far from

E_{P} [f (U, G_{φ})]

, for random choices of the unitary U. A paradigmatic example of a random variable which typically takes very different values than its average is a quantity which can only be equal to 0 or 1 with equal probabilities: in such case, its average

1 / 2

is never an effective outcome of the random variable, let alone typical. Fortunately, this is not the case for the pre-factor

f (U, G_{φ})

, which is instead a well-behaved function of the random unitary U. In fact, we see in Appendix A that

f (U, G_{φ})

is instead typical for networks with many channels, i.e., that it is possible to apply results on the concentration of measure in high-dimensional spaces, which assure that

f (U, G_{φ})

becomes almost constant for random choices of

U \in U (M)

for large M, and thus it concentrates around its average value

P (| f - E_{P} [f] | ⩾ ε) \leq 2 exp (- \frac{A M}{{‖ G_{φ} ‖}^{4}} ε^{2}), \forall ε > 0

(20)

with

A = {(72 π^{3})}^{- 1}

. This results tell us that, for an arbitrary choice of auxiliary stage

U = V_{in}

, the value of the pre-factor

{(\partial_{φ} γ_{φ})}^{2} \equiv f (U, G_{φ})

appearing in the Fisher information in Equation (14) is with overwhelming probability close to its average, for networks with a large enough number of channels M (see Figure 2). Moreover, Equation (19) shows that it is possible to bound from below the average

E_{P} (f (U, G_{φ}))

of the pre-factor, if some control on the average of the eigenvalues of

G_{φ}

is possible. This shows that, beside very unlikely exceptions, the choice of the non-optimized stage is mostly irrelevant as for the precision of the estimation scheme. In the next section, we will see how it is possible to exploit the additional degrees of freedom introduced by the non-optimized network to manipulate how the information on the structure of the network is encoded in the probe. This will give us some freedom in choosing the function of a given set of parameters that can be estimated with the Heisenberg limit precision.

4. Estimation of Functions of Parameters

In the previous section, we have seen that it is always possible to add a

φ

-independent auxiliary stage to the estimation setup shown in Section 2, which introduces degrees of freedom that essentially do not affect the precision of the estimation scheme, especially for networks with a large number of channels. A natural question that may arise is whether the same setup can be employed in achieving the Heisenberg limit when the number of unknown parameters affecting the arbitrary network increases, and if it is possible to employ these degrees of freedom to select a specific function of the parameters that can be measured with Heisenberg-scaling sensitivity. Although such task can be performed estimating separately each single parameter, and then evaluating the function during the data analysis, the ability to directly estimate a global property (e.g., spatial average of a field, field-gradients, or non-linear functions) allows us to not waste resources to obtain superfluous information on each single parameter. However, it cannot be excluded in principle that complications may arise due to the presence of multiple sources of uncertainty, complications which already materialize starting from the more complex mathematical formalism required to describe the multi-parameter scenario [57,58].

In this section we will describe a scheme for the estimation of functions of multiple parameters encoded in a generic linear passive network [37]. We will show that, employing a single squeezed vacuum state and a single homodyne detector, i.e., the same probe and measurement of the setup shown in Section 2, it is possible to reach the Heisenberg limit in the estimation of functions of the parameters, satisfying conditions that are similar to the ones found for the single-parameter scheme. We find also in this scenario that a classical knowledge on the unknown parameters is required to optimize the network through the use of auxiliary stages, allowing us to conceive two-steps protocols achieving the Heisenberg limit. Moreover, we will see how the exceeding degrees of freedom of the auxiliary stages which are not employed to optimize the network can be used to change the functional dependence between the quantity that can be estimated and the unknown parameters.

Once we will have described the estimation scheme and discussed the conditions that need to be met in order to reach the Heisenberg limit, we will present two examples. The first is a 2-channel network which allows us to estimate a function of three parameters, of which two optical phases and a beam-splitter reflectivity, parametrized by some quantities that can be chosen arbitrarily through the auxiliary stages employed. We will see that, according to the choice of the auxiliary networks, the function estimated can be linear or non-linear in the three parameters. The second is a scheme for the estimation of any linear combination of parameters with positive weights. In particular, we will show how it is possible to employ this scheme when the unknown parameter are not only phase-shifts, but also reflectivities of beam-splitters, or more in general phases acquired through complex local networks.

4.1. Setup

Let us consider a M-channel linear and passive network

{\hat{U}}_{φ}

which depends on p unknown parameters

φ = (φ_{1}, \dots, φ_{p})

. The parameters

φ

may represent certain physical properties associated with each component of the network, such as reflectivities of beam-splitters, or phase-shift magnitudes, or they may be the values of external non-uniform fields which influence several components of the network, such as the temperature and the electromagnetic field (see Figure 3). The action of the network can be described with the usual unitary matrix representation

U_{φ}

, defined through Equation (2). The probe employed is the same as shown in Equation (1), injected in a single input port of an auxiliary network

{\hat{V}}_{in}

, say the first, with

N = {sinh}^{2} r

number of photons in average in the probe. In order to infer some information on the parameters from the interferometric evolution of the probe, we will perform homodyne measurements at a single output port, say the first, of the quadrature

{\hat{x}}_{θ}

, where

θ

is the phase of the local oscillator. Similarly to the setup described in Section 2, we will consider in this model the presence of a further refocusing network

{\hat{V}}_{out}

acting on the probe after the evolution given by the network

{\hat{U}}_{φ}

respectively. Intuitively, the role of the stage

{\hat{V}}_{in}

is to distribute the probe among all the channels of the network, while

{\hat{V}}_{out}

refocuses the probe into the only output port which is observed. However, in light of the results of typicality presented in Section 3, which showed that the overall precision of the single-parameter estimation scheme is essentially not affected by the choice of the non-optimized network for a large number M of channels, we will exploit the remaining degrees of freedom in the two auxiliary stages to manipulate how the information about the structure of the network

{\hat{U}}_{φ}

and on the parameters

φ

is encoded into the probe.

Since the photons of the probe are all injected in the first channel, and only the first output port is observed, the only relevant element of the unitary matrix

u_{φ} = V_{out} U_{φ} V_{in}

—representing the action of the whole setup on the probe—is

{(u_{φ})}_{11}

, namely the transition amplitude from the first input to the first output port. We will employ the parametrization

{(u_{φ})}_{11} \equiv {(V_{out} U_{φ} V_{in})}_{11} = \sqrt{P_{φ}} e^{i f (φ)},

(21)

which emphasizes the two relevant physical quantities, i.e., the transition probability

P_{φ} : = {| {(u_{φ})}_{11} |}^{2}

and the phase

f (φ) : = arg {(u_{φ})}_{11}

acquired by the probe through the network, which is in general a function of the unknown parameters

φ

. We will see later that the function

f (φ)

can be estimated at the Heisenberg limit. After the interferometric evolution, the squeezed variance hence becomes

{\hat{x}}_{f (φ) + π / 2}

. If the quadrature

{\hat{x}}_{θ}

is observed through homodyne detection, the probability distribution

p_{φ} (x)

which governs the outcomes of the measurement is Gaussian, due to the Gaussian nature of the probe [30], and centred in zero, due to the absence of displacement. We thus write the Gaussian probability distribution

p_{φ} (x) = \frac{1}{\sqrt{2 π σ_{φ}^{2}}} e^{- \frac{x^{2}}{2 σ_{φ}^{2}}},

(22)

with variance

σ_{φ}

calculated in Appendix B

σ_{φ}^{2} = \frac{1 - P_{φ}}{2} + \frac{P_{φ}}{2} [cosh (2 r) + cos (2 f (φ) - 2 θ) sinh (2 r)],

(23)

and can be thought as the average between the noises of the vacuum and of the squeezed states, weighted by the factors

1 - P_{φ}

and

P_{φ}

respectively.

The presence of multiple independent parameters imposes the multi-parameter approach for the analysis of the ultimate precisions achievable with this setup, and the use of the Fisher information matrix [54,55],

F {(φ)}_{i j} = \int d x p_{φ} (x) (\frac{\partial}{\partial φ_{i}} ln p_{φ} (x)) (\frac{\partial}{\partial φ_{j}} ln p_{φ} (x))

(24)

By plugging Equation (22) into Equation (24) one gets

F (φ) = \frac{1}{2 σ_{φ}^{4}} (\nabla σ_{φ}^{2}) {(\nabla σ_{φ}^{2})}^{T},

(25)

where

\nabla = \nabla_{φ} = {(\frac{\partial}{\partial φ_{1}}, \dots, \frac{\partial}{\partial φ_{p}})}^{T}

. Having the probability distribution in Equation (22) a null displacement, all the information on the parameters

φ

is encoded in the variance

σ_{φ}

through the transition probability

P_{φ}

and the phase acquired

f (φ)

. It is thus convenient to separate the derivative of the variance in Equation (23) in two contributions

\nabla σ_{φ}^{2} = (\partial_{P} σ_{φ}^{2}) \nabla P_{φ} + (\partial_{f} σ_{φ}^{2}) \nabla f (φ),

(26)

where

\partial_{P}

and

\partial_{f}

denote the partial derivatives with respect to P and f, so that we easily obtain from Equation (23)

\begin{matrix} \partial_{P} σ_{φ}^{2} & = \frac{1}{2} (- 1 + cosh (2 r) + cos (2 f (φ) - 2 θ) sinh (2 r)), \end{matrix}

(27a)

\begin{matrix} \partial_{f} σ_{φ}^{2} & = - P_{φ} sin (2 f (φ) - 2 θ) sinh (2 r) . \end{matrix}

(27b)

4.2. Heisenberg Scaling

We can see from Equations (23)–(27) that, if no specific conditions are imposed on the network

u_{φ}

, the Fisher information matrix

F (φ)

cannot generally reach the Heisenberg limit. This is due to the presence of the squared variance

σ_{φ}^{4}

at the denominator of Equation (25), which may grow with order

N^{2}

, with N number of photons injected, since

\begin{matrix} σ_{φ}^{2} & = \frac{1 - P_{φ}}{2} + \frac{P_{φ}}{2} [1 + 2 N + 2 cos (2 f (φ) - 2 θ) \sqrt{N (N + 1)}] \\ = N P_{φ} (1 + cos (2 f (φ) - 2 θ)) + O (1), \end{matrix}

(28)

while the derivatives in Equation (27) only contain terms of the type

sinh (2 r)

and

cosh (2 r)

, which are of order N. Thus it arises the need to impose conditions which prevent the variance of the observed quadrature to grow with the number of photons. We show in Appendix C that this setup reaches the Heisenberg limit in the estimation of

f (φ)

if similar conditions to the ones in Equation (5) for the single-parameter scenario are met. In particular, the conditions are

\begin{matrix} f (φ) - θ = \pm \frac{π}{2} + \frac{k}{N} + O (\frac{1}{N^{2}}), k \neq 0, \end{matrix}

(29a)

\begin{matrix} P_{φ} = 1 - \frac{ℓ}{N} + O (\frac{1}{N^{2}}), ℓ \geq 0, \end{matrix}

(29b)

with k and ℓ arbitrary factors which are independent of N. In Section 4.3, we will discuss about the meaning of the conditions in Equation (29), exploring their consequences and highlighting the similarities with the single-parameter protocol in Section 2. In Appendix C we show that, when conditions (29) are met, the Fisher information matrix in Equation (25) becomes

F (φ) \sim 8 ϱ (k, ℓ) N^{2} (\nabla f (φ)) {(\nabla f (φ))}^{T},

(30)

where

ϱ (k, ℓ)

is the same N-independent constant factor defined in Equation (7) for the single-parameter estimation.

Despite the presence of the factor

N^{2}

in the Fisher information matrix in Equation (30), it is generally impossible to reach the Heisenberg limit in the estimation of each of the p parameters

φ

. In fact, we can easily see that the (asymptotic) expression of the Fisher information matrix (30) is non-invertible: the (column) vector

\nabla f (φ)

is trivially the only eigenvector associated with a non-vanishing eigenvalue of the Fisher information matrix. As discussed in Refs. [59,60], a singular Fisher information matrix is symptomatic of the presence of parameters which do not admit estimators with finite variances, and the traditional multi-parameter Cramér-Rao bound [54,55]

Cov ({\tilde{φ}}_{i}, {\tilde{φ}}_{j}) ⩾ \frac{1}{ν} F {(φ)}_{i j}^{- 1}

(31)

is not applicable due to the non-invertibility of

F (φ)

. In fact, only specific functions

ψ (φ)

of the parameters

φ

admit unbiased estimators with finite variance [59,60]. In particular, specialized to the Fisher information matrix of the problem at hand in Equation (30), the functions

ψ (φ)

which admit finite variance are the ones satisfying [59]

\nabla ψ (φ) \propto \nabla f (φ) .

(32)

The solutions of Equation (32) for all the possible values of

φ

are of the form

ψ (φ) \equiv a_{1} f (φ) + a_{2}

, with

a_{1 / 2}

independent of

φ

, and in particular

ψ (φ) \equiv f (φ)

belongs to this family, which entails the fact that the function

f (φ)

can be estimated with finite variance. We can finally evaluate the Cramér-Rao bound associated with any estimator

\tilde{f}

of

f (φ)

, obtaining in the asymptotic regime (see Appendix E)

{Var}_{φ} [\tilde{f}] ⩾ \frac{1}{ν} \frac{1}{8 ϱ (k, ℓ) N^{2}},

(33)

which reaches the Heisenberg limit in the mean number of photons N.

Lastly, it is worth to mention that we can saturate the Cramér-Rao bound in the limit of large samples, namely it is always possible to find an estimator, the maximum-likelihood estimator

{\tilde{f}}_{MLE}

, which is unbiased and efficient in the asymptotic regime of large samples

ν \to + \infty

[55]. In order to find the maximum-likelihood estimator, we need to maximize the Likelihood function

L (φ; x) = \prod_{i = 1}^{ν} p_{φ} (x_{i}) = \frac{1}{{(2 π σ_{φ}^{2})}^{ν / 2}} exp (- \frac{{| x |}^{2}}{2 σ_{φ}^{2}})

(34)

In Appendix D we see that the solution which maximizes the Likelihood function (34) is given by the estimator

{\tilde{f}}_{MLE}

satisfying

σ^{2} ({\tilde{f}}_{MLE}) = S^{2} (x),

(35)

where

σ^{2} (f)

is the variance in Equation (23) as a function of

f \equiv f (φ)

, supposing that

P_{φ}

satisfies Equation (29b) and is known, while

S {(x)}^{2}

is the usual sample variance

S^{2} (x) = \frac{1}{ν} \sum_{i = 1}^{ν} x_{i}^{2} .

(36)

Inverting the function

σ^{2} (f)

, we can obtain the explicit expression of the maximum-likelihood estimator in this regime

\begin{matrix} {\tilde{f}}_{MLE} (x) = θ + \frac{1}{2} (2 n π \pm arccos (\frac{(2 S {(x)}^{2} - 1) - 2 P_{φ} {sinh}^{2} r}{2 P_{φ} sinh r cosh r})), \end{matrix}

(37)

with n integer. We notice that the presence of the cosine function, which is invertible only on intervals of its argument of the type

[n π, (n + 1) π]

, requests a prior knowledge on the argument

2 f (φ) - 2 θ

in order to choose the correct value of n in Equation (37). However, we will discuss in the next section that a classical coarse estimation of the parameters

φ

is required in order to optimize the network and satisfy condition (29b). In other words, the error

δ φ

committed in the prior coarse estimation must be of order

1 / \sqrt{N}

, decreasing at the SQL with the number of photons N. For a large enough N,

δ φ

will be small enough to unequivocally choose the correct interval of invertibility of the cosine, and thus the correct n in Equation (37).

4.3. On the Conditions for the Heisenberg Scaling

Despite the presence of multiple parameters, the substantial similarity of conditions (29) with the single-parameter counterparts in Equation (5) allows us to draw the same consideration discussed for the single-parameter scheme in Ref. [35]. Condition (29a) is a minimum resolution requirement on the tuning of local-oscillator phase

θ

, which must be controlled with steps of order

1 / N

. Moreover, the requirement

k \neq 0

in Equation (29a) implies that

θ

must be tuned in such a way to measure a quadrature field

{\hat{x}}_{θ}

which is slightly different from the minimum-variance quadrature

{\hat{x}}_{f (φ) \pm π / 2}

. This can be explained by the fact that the minimum variance—i.e.,

σ_{φ}^{2}

for

θ = f (φ) \pm π / 2

—is a stationary point for variations of

f (φ)

, and hence its gradient in Equation (26) is vanishing for

P_{φ}

close to its maximum and

σ_{φ}^{2}

to its minimum.

Condition (29b) is a requirement on the refocusing of the probe into the only observed channel. In order for the variance in Equation (23) of the observed quadrature to be ‘squeezed’—i.e.,

σ_{φ}^{2} \sim 1 / N

—the contribution of the vacuum

(1 - P_{φ}) / 2

must be of order

1 / N

. Moreover, similar considerations also for the prior knowledge on the parameter required for the optimization of the refocusing network can be drawn. First of all, condition (29b) can be satisfied by only optimizing a single auxiliary stage, while choosing arbitrarily the other. We can call

φ_{cl}

the result of a prior coarse estimation of the parameters

φ

required to optimize a single auxiliary stage, say

{\hat{V}}_{out} \equiv {\hat{V}}_{out} (φ_{cl})

. The single-photon probability transition

P_{φ}

can be written as the squared modulus of the scalar product of the two vectors

U_{φ} V_{in} e_{1}

and

V_{out} {(φ_{cl})}^{†} e_{1}

, with

e_{1} = {(1, 0, \dots, 0)}^{T}

P_{φ} = {| e_{1}^{T} V_{out} (φ_{cl}) U_{φ} V_{in} e_{1} |}^{2} \equiv η (φ, φ_{cl}),

(38)

where

η (φ, φ_{cl})

is a smooth function of

φ

and

φ_{cl}

which is maximized for

φ_{cl} = φ

, since

{\hat{V}}_{out} (φ_{cl} = φ)

would satisfy

η (φ, φ) \equiv 1

. For small deviations

δ φ = φ - φ_{cl}

of the coarse estimation from the true values of the parameters

φ

, we can expand Equation (38)

\begin{matrix} η (φ, φ - δ φ) = 1 - \sum_{i = 1}^{p} \underset{0}{\underset{︸}{\frac{\partial η (φ, x)}{\partial x_{i}} |_{x = φ}}} δ φ_{i} + \frac{1}{2} \sum_{i, j = 1}^{p} \frac{\partial^{2} η (φ, x)}{\partial x_{i} \partial x_{j}} |_{x = φ} δ φ_{i} δ φ_{j} + {(O (δ φ))}^{3}, \end{matrix}

(39)

where the gradient

\nabla_{x} η (φ, x)

is zero for

x = φ = φ_{cl}

. Thus, also in the presence of multiple unknown parameter

φ

, if the errors

δ φ

in the prior estimations

φ_{cl}

are of order

1 / \sqrt{N}

, it becomes possible to engineer the refocusing stage

{\hat{V}}_{out} (φ_{cl})

that satisfies condition (29b), and thus that allows to reach the Heisenberg limit, similarly to the single-parameter estimation protocol.

4.4. Examples of Quantum-Enhanced Estimation of Functions

The presence of two auxiliary stages

{\hat{V}}_{in}

and

{\hat{V}}_{out}

, and the need to optimize only one of them in order to satisfy condition (29b)—and ultimately to reach the Heisenberg limit—entail the possibility to exploit the remaining degrees of freedom in the network to manipulate how the information on the parameters

φ

are encoded in the probe. In this section, two examples are proposed, which make use of these degrees of freedom to allow us to choose the function to be estimated from a family of functions of some parameters

φ

. The first example is a 2-channel network for the estimation at the Heisenberg limit of non-linear functions of three parameters, of which two are magnitudes of phase-shifts and one is the reflectivity of a beam-splitter. The second is a network for the estimation of linear combination with positive weights of an arbitrary number of parameters.

4.4.1. Non-Linear Functions

We here consider a 2-channel network

{\hat{U}}_{φ}

for the estimation of non-linear functions

f (φ; α)

of the reflectivity of a beam-splitter

φ_{1}

and the magnitudes of two phase-shifts

φ_{2 / 3}

, with reference to Figure 4. The functions

f (φ; α)

are parametrized by the quantities

α = (α_{1}, α_{2}, α_{3}, α_{4})

, which can be implemented arbitrarily, with the only condition that

α_{1} - α_{2} = α_{4} - α_{3} \equiv Δ α

. The possibility to choose the quantities

α

stems from the presence of abundant degrees of freedom in the auxiliary networks

{\hat{V}}_{in}

and

{\hat{V}}_{out}

, which are not employed for the optimization of the network needed to satisfy Equation (29b). The protocol employed is the same considered in Section 4.1, with a single squeezed-vacuum state with

N = {sinh}^{2} r

average number of photons injected in the first input port of the overall network, with r squeezing parameter of the probe, and only the first output channel observed through homodyne detection, to measure the quadrature

{\hat{x}}_{θ}

, where

θ

is the phase of the homodyne local oscillator.

We can write the matrices representing the action of the beam-splitter and the phase-shifts in the network

{\hat{U}}_{φ}

as

\begin{matrix} U_{BS} (φ_{1}) & = exp (i φ_{1} σ_{2}) = (\begin{matrix} cos φ_{1} & sin φ_{1} \\ - sin φ_{1} & cos φ_{1} \end{matrix}) \\ U_{PS} (φ_{2}, φ_{3}) & = exp (i \frac{φ_{2} + φ_{3}}{2} 𝟙_{2} + i \frac{φ_{2} - φ_{3}}{2} σ_{3}) = (\begin{matrix} exp (i φ_{2}) & 0 \\ 0 & exp (i φ_{3}) \end{matrix}) \end{matrix}

(40)

respectively, where

σ_{i}

,

i = 1, 2, 3

, is the i-th Pauli matrix and

𝟙_{2}

is the

2 \times 2

identity matrix, so that the network

{\hat{U}}_{φ}

is represented by the matrix

U_{φ} = U_{PS} (φ_{2}, φ_{3}) U_{BS} (φ_{1}) = (\begin{matrix} cos φ_{1} exp (i φ_{2}) & sin φ_{1} exp (i φ_{2}) \\ - sin φ_{1} exp (i φ_{3}) & cos φ_{1} exp (i φ_{3}) \end{matrix}) .

(41)

The quantity

Δ α = α_{1} - α_{2}

that we will use to parametrize the family of functions

f (φ; α)

is the relative phase between the arms of the input auxiliary stage

{\hat{V}}_{in} \equiv {\hat{V}}_{in} (α_{1}, α_{2})

, which is overall described by the unitary matrix

V_{in} (α_{1}, α_{2}) = U_{PS} (α_{1}, α_{2}) U_{BS} (ω) .

(42)

This auxiliary network consists of two phase-shifts of arbitrary magnitudes

α_{1}

and

α_{2}

in the first and second channel, and a beam splitter with reflectivity

ω = \frac{1}{2} arctan (\frac{cos (φ_{cl, 1})}{sin (φ_{cl, 1}) cos Δ α}),

(43)

which can be engineered once a classical estimation

φ_{cl, 1}

of the unknown reflectivity

φ_{1}

of the beam-splitter in

{\hat{U}}_{φ}

has been carried out, namely after a coarse prior estimation so that the error

δ φ_{1} = φ_{1} - φ_{cl, 1}

is of order

1 / \sqrt{N}

. The refocusing auxiliary network

{\hat{V}}_{out}

is described, with reference to Figure 4, by the unitary matrix

V_{out} (α_{3}, α_{4}) = U_{BS} (ω - π / 2) U_{PS} (α_{3} - φ_{cl, 2}, α_{4} - φ_{cl, 3}),

(44)

where the quantity

Δ α = α_{4} - α_{3}

enters in the relative phase of the two channels, this time changed in sign. We can see from the expression of

V_{out}

that this auxiliary stage in general depends on the classical estimation

φ_{cl}

of the unknown parameters, namely on the results of a coarse prior estimation for which the committed errors

δ φ = φ - φ_{cl}

are of order

1 / \sqrt{N}

.

We will now see that the Heisenberg limit can be achieved in the estimation of the complex phase

f (φ; α) \equiv arg (V_{out} (α_{3}, α_{4}) U_{φ} V_{in} (α_{1}, α_{2}))

of the network

{\hat{u}}_{φ}

depicted in Figure 4, showing that it satisfies condition (29b). In fact, employing Equations (41)–(44), we can explicitly evaluate the transition amplitude

\begin{matrix} {(u_{φ})}_{11} & = e^{i (α_{1} + α_{3} + \frac{δ φ_{2} + δ φ_{3}}{2})} (cos (\frac{δ φ_{2} - δ φ_{3}}{2}) sin 2 ω cos φ_{1} \\ + cos (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) cos 2 ω sin φ_{1} + i sin (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) sin φ_{1}), \end{matrix}

(45)

where the quantity

ω

is defined in Equation (43). In Appendix F we see that the transition probability

P_{φ} \equiv {| u_{φ} |}_{11}^{2}

actually satisfies the condition (29b) on the network for the Heisenberg limit, which means that the auxiliary networks correctly operate on the probe so that it gets refocused on the only observed channel. In Appendix F the complex phase is evaluated

\begin{matrix} f (φ; α) & = arctan (\frac{Im {(u_{φ})}_{11}}{Re {(u_{φ})}_{11}}) = α_{1} + α_{3} + \frac{δ φ_{2} + δ φ_{3}}{2} + arctan (Φ), \end{matrix}

(46)

with

Φ = \frac{sin φ_{1} sin (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) \sqrt{1 - {sin}^{2} (φ_{cl, 1}) {sin}^{2} Δ α}}{cos φ_{1} cos φ_{cl, 1} cos (\frac{δ φ_{2} - δ φ_{3}}{2}) + sin φ_{1} sin φ_{cl, 1} cos Δ α cos (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2})},

(47)

which is in general a non-linear function of the parameters

φ

. We see from Equations (46) and (47) how the choice of the arbitrary values of the parameters

α

affects the functional dependence of the phase

arg {(u_{φ})}_{11}

on the parameters

φ

, and thus of the quantity which can be estimated at the Heisenberg limit. In particular, beside the term

α_{1} + α_{3}

which simply adds an overall phase, the relative phase

Δ α

affects the functional dependence between the quantity

Φ

in Equation (47) and

φ

.

For example, for

Δ α = π / 2

the beam-splitters in the auxiliary stages

{\hat{V}}_{in}

and

{\hat{V}}_{out}

become balanced—i.e.,

ω = \pm π / 4

from Equation (43)—while the function of

φ

in Equation (46) we can estimate reduces to

f (φ; Δ α = π / 2) = α_{1} + α_{2} - \frac{φ_{cl, 2} + φ_{cl, 3}}{2} + φ_{1} + \frac{φ_{2} + φ_{3}}{2},

(48)

thus becoming a linear function of the parameters. From Equation (45), we can evaluate the transition probability

P_{φ} = {| (u_{φ}) |}_{11}^{2}

for the choice of

Δ α = π / 2

, and for

ω = π / 4

P_{φ} = {cos}^{2} (\frac{δ φ_{2} - δ φ_{3}}{2}) = 1 - \frac{{(δ φ_{2} - δ φ_{3})}^{2}}{4} + O {(δ φ)}^{3},

(49)

which satisfies condition (29b) as expected, since the error in the coarse estimation are assumed to be classical, i.e., so that

δ φ = O (1 / \sqrt{N})

. Comparing Equations (29b) and (49), and assuming that

δ φ_{i} = k_{i} / \sqrt{N}

with

k_{i}

independent of N, it is easy to evaluate the factor

ℓ = \frac{{(k_{2} - k_{3})}^{2}}{4}

(50)

which enters through

ϱ (k, ℓ)

in Equation (7) in the Cramér-Rao bound in Equation (33). Moreover we notice that, if the two phase-shifts are completely known quantities, so that we are able to perfectly balance them with the auxiliary stage

{\hat{V}}_{out}

with

δ φ_{2 / 3} = 0

, the overall network

{\hat{u}}_{φ}

in Figure 4 reduces (up to a global phase

(α_{1} + α_{2}) / 2

) to a setup which transform the reflectivity

φ_{1}

in an optical delay, without any prior information on the parameter. In fact in this case, the overall phase

f (φ; Δ α = π / 2)

in Equation (48) becomes

α_{1} + α_{2} + φ_{1}

. In Section 4.4.2 we will make use of this type of networks specifically for this purpose, and be able to treat the reflectivities of beam-splitter as if they were simple phase-shift.

For

Δ α = 0

, the reflectivity

ω

in Equation (43) becomes

ω = - φ_{cl, 1} / 2 - π / 4

, while the function estimated at the Heisenberg limit in Equation (48) reads

\begin{matrix} f (φ; Δ α = 0) & = α_{1} + α_{2} + \frac{δ φ_{2} + δ φ_{3}}{2} + arctan (- tan (\frac{δ φ_{2} + δ φ_{3}}{2}) \frac{sin φ_{1}}{cos δ φ_{1}}) \\ = α_{1} + α_{2} + \frac{δ φ_{2} + δ φ_{3}}{2} - \frac{δ φ_{2} + δ φ_{3}}{2} sin φ_{1} + O {(δ φ)}^{3}, \end{matrix}

(51)

where the term

O {(δ φ)}^{3}

can be neglected since it is of order

N^{- 3 / 2}

or smaller, i.e., beyond the Heisenberg limit resolution. In particular, we notice from Equation (51) that

f (φ; Δ α = 0)

is a relatively simple non-linear function, in which we can find the products

φ_{2} sin φ_{1}

and

φ_{3} sin φ_{1}

, between each phase-shifts and the transmittivity amplitude of the beam-splitter. If we once again evaluate the transition probability

P_{φ} = {| (u_{φ}) |}_{11}^{2}

from Equation (45) for

Δ α = 0

we obtain

P_{φ} = 1 - δ φ_{1}^{2} - \frac{{(δ φ_{2} - δ φ_{3})}^{2} {cos}^{2} φ_{1}}{4} + O {(δ φ)}^{3},

(52)

which satisfies condition (29b) as expected, since

δ φ

is of order

1 / \sqrt{N}

. In particular, we can easily evaluate the N-independent factor

ℓ = k_{1}^{2} + \frac{{(k_{2} - k_{3})}^{2} {cos}^{2} φ_{1}}{4}

(53)

which enters in the Cramér-Rao bound in Equation (33), if we assume

δ φ_{i} = k_{i} / \sqrt{N}

with

k_{i}

independent of N. If moreover

φ_{1} = 0

, the network in Figure 4 reduces to a balanced Mach-Zehnder, which allows us to estimate the average of the two phase-shifts since the overall phase in Equation (51) reduces, for

φ_{1} = 0

, to

f (φ; Δ α = 0) = α_{1} + α_{2} - \frac{φ_{cl, 2} - φ_{cl, 3}}{2} + \frac{φ_{2} + φ_{3}}{2},

(54)

where the average of the two phase-shifts

φ_{2}

and

φ_{3}

sums a known quantity, which can thus be subtracted during the estimation without affecting the overall precision. In the next example, we will generalize such scheme, considering a similar network, but with an arbitrary number of channels and generally unbalanced beam-splitters.

4.4.2. Linear Combinations of Arbitrary Parameters

We now consider a network

{\hat{U}}_{φ}

which depends on M independent parameters

φ

(see Figure 5), that allows us to estimate with a precision at the Heisenberg limit any linear combination

L (φ) = \sum_{i = 1}^{M} w_{i} φ_{i} \equiv w \cdot φ

(55)

with positive weights

w = (w_{1}, \dots, w_{M})

. As also discussed previously in this chapter, the ability to change arbitrarily the weights stems from the presence of degrees of freedom in the auxiliary stages

{\hat{V}}_{in}

and

{\hat{V}}_{out}

that are not employed to refocus the probe in the only observed channel, i.e., to satisfy condition (29b). At first, we will suppose that the parameters

φ

can either be magnitudes of phase-shifts or reflectivities of beam-splitters. Later, we will show how it is possible to generalize the scheme for an arbitrary set of parameters. We will also suppose, without loss of generality, that the weights

w

sum to one, namely that the linear combination in Equation (55) is a convex sum. To estimate a generic linear combination, it would suffice to rescale the estimated convex sum, causing a rescaling of the error in the estimation which does not affect the Heisenberg limit.

With reference to Figure 5, we now describe the network

{\hat{U}}_{φ}

affected by M unknown parameters

φ

. The parameters

φ

act in parallel, in the sense that the i-th parameter

φ_{i}

only affects the i-th mode of the network, and they can equivalently be the magnitudes of a phase-shift

U_{PS} (φ_{i}) = e^{i φ_{i}}

or the reflectivities of beam-splitters

U_{BS} (φ_{i}) = e^{i φ_{i} σ_{2}}

, with

σ_{2}

the second Pauli matrix. For each unknown beam-splitter, a 2-channel passive and linear network is employed in

{\hat{U}}_{φ}

, whose purpose is to transform the reflectivity into a relative phase between the two ports of the beam-splitter (see the panel (b) in Figure 5). This can be done through the network

V U_{BS} (φ) V^{†} = U_{PS} (φ) \equiv (\begin{matrix} e^{i φ} & 0 \\ 0 & e^{- i φ} \end{matrix}),

(56)

with

V = U_{BS} (\frac{π}{4}) U_{PS} (\frac{π}{4}) .

(57)

We also remark that, despite each mode of

{\hat{U}}_{φ}

with an unknown beam-splitter is practically composed of two separated channels, the overall network

V U_{BS} (φ) V^{†}

in Equation (56) acts as a phase-shift in each of the two channels, namely the two modes are not mixed. Since this local network

V U_{BS} (φ) V^{†}

is only fed through a single input port, it essentially acts as a single-channel phase shift of magnitude

\pm φ

on the probe, depending on which of the two arms are employed. The overall network

{\hat{U}}_{φ}

can be thus described with the unitary matrix

U_{φ} = diag (e^{i φ_{1}}, \dots, e^{i φ_{M}}),

(58)

regardless of the nature of the parameters

φ

, whether they are phase magnitudes or beam-splitter reflectivities.

The input auxiliary stage is a M-channel generalized beam-splitter, which scatters the probe injected in the first input port into each of the M channels of

{\hat{U}}_{φ}

according to the weights

w

. Specifically, the unitary matrix

V_{in}

representing the input network is chosen so that

| {(V_{in})}_{i 1} |^{2} = w_{i} .

(59)

Noticeably, this constraint is

φ

-independent, i.e.,

{\hat{V}}_{in}

does not need to be optimized after a prior coarse estimation of

φ

. The output auxiliary network

{\hat{V}}_{out}

can be though as been composed of three separate stages. First, a phase shift of magnitude

- φ_{cl, i}

is applied to the i-th channel, where

φ_{cl, i}

is a coarse estimate of the prior classical measurement of

φ_{i}

, so that the error committed

δ φ_{i} = φ_{i} - φ_{cl, i}

is of order

1 / \sqrt{N}

. Then, a second generalized beam-splitter which does not depend on

φ

is in place, whose purpose is to invert the action of

{\hat{V}}_{in}

and thus refocusing the probe into the first channel of the network. Finally, a phase-shift of magnitude

L (φ_{cl})

is applied before the homodyne detection at the first output port. The overall action of the network

{\hat{V}}_{out}

is thus described by the unitary matrix

V_{out} = U_{PS} (L (φ_{cl}), 0, \dots, 0) V_{in}^{†} U_{PS} (- φ_{cl}),

(60)

where we denoted with

U_{PS} (λ_{1}, \dots, λ_{l}) = diag (e^{i λ_{1}}, \dots, e^{i λ_{l}})

.

With this setup, the probability amplitude

{(u_{φ})}_{11} = {(V_{out} U_{φ} V_{in})}_{11}

found in Equation (21) can be easily evaluated through Equations (58)–(60), and it reads

\begin{matrix} {(u_{φ})}_{11} & = e^{i L (φ_{cl})} \sum_{i = 1}^{M} w_{i} e^{i δ φ_{i}} = e^{i L (φ_{cl})} (1 + \sum_{i = 1}^{M} i w_{i} δ φ_{i} - \frac{1}{2} \sum_{i = 1}^{M} w_{i} δ φ_{i}^{2}) + O {(δ φ)}^{3}, \end{matrix}

(61)

where we can neglect the term

O {(δ φ)}^{3}

, since it is of order

N^{- 3 / 2}

and thus beyond the Heisenberg limit resolution we can achieve. We can now evaluate the transition probability

P_{φ} = {| u_{φ} |}_{11}^{2}

through Equation (61)

\begin{matrix} P_{φ} & = | e^{i L (φ_{cl})} \sum_{i = 1}^{M} w_{i} e^{i δ φ_{i}} |^{2} = 1 + {(\sum_{i = 1}^{M} w_{i} δ φ_{i})}^{2} - \sum_{i = 1}^{M} w_{i} δ φ_{i}^{2} + O {(δ φ)}^{3} \end{matrix}

(62)

which clearly satisfies condition (29b), so that the Heisenberg limit can be achieved in the estimation of the complex phase

f (φ) = arg {(u_{φ})}_{11}

. We can thus evaluate, comparing Equations (29b) and (62), the factor ℓ which enters in the Fisher information shown in Equation (33),

ℓ = {(\sum_{i} w_{i} k_{i})}^{2} - \sum_{i} w_{i} k_{i}^{2},

(63)

where we supposed that

δ φ_{i} = k_{i} / \sqrt{N}

with

k_{i}

independent of N. From Equation (61) we obtain

\begin{matrix} f (φ) & = L (φ_{cl}) + \sum_{i = 1}^{M} w_{i} δ φ_{i} + O {(δ φ)}^{3} = L (φ) + O {(δ φ)}^{3}, \end{matrix}

(64)

so that it is possible to recover the linear combination (55) at the Heisenberg limit from the estimation of

f (φ)

. We notice in fact that

L (φ)

in (55) and the complex phase

f (φ)

in (64) are equal up to a quantity of order

O (N^{- 3 / 2})

, which is beyond the Heisenberg resolution, since the errors

δ φ

in the classical prior estimation are of order

1 / \sqrt{N}

.

In conclusion, we will discuss about two different features of this protocol. First, we notice from Equation (56) that the network employed to transform the reflectivity

φ

of the beam-splitters into phase-shifts inverts the sign of the parameter in the second channel. This means that, if the portion of the probe which has been scattered into this network is injected in its second port, it acquires a phase

- φ

. It is then possible to exploit this behaviour to estimate a linear combination which admits negative weights for the reflectivities of the beam-splitters, employing the same protocol described in this section with the only precaution to invert the sign of the classical estimation

φ_{cl}

of this parameter. Second, it is possible to further generalize the local networks in each mode of

{\hat{U}}_{φ}

. In fact, we can replace the networks in the panel of Figure 5 with a generic m-channel network

\hat{V}

and, provided that the probe comes out of this local network from a single output channel—i.e., the network acts as a phase-shift on the probe—the same results found in this section still apply. Not only, in Appendix G we show that it is not necessary that all the photons come out from a single output port of each local network: instead, it suffices that a similar refocusing condition to the one in Equation (29b) is satisfied locally by each

\hat{V}

. It is thus possible to conceive, for example, a scheme that allows to reach the Heisenberg limit in the estimation of linear combination of parameters and functions of parameters, if we choose as

\hat{V}

the whole network described in the example in Section 4.4.1.

5. Conclusions

The quantum metrological revolution is, at this moment in time, exciting and thrilling. The recent advances in the field of quantum mechanics have been incessantly stimulating the development of increasingly ingenious and innovative technologies which exploit the laws and rules of the microscopic world. Ranging from computing to biology, medicine, cosmology, imaging, sensing, cryptography and neural networks, quantum technologies appear to consistently outperform their classical counterparts. In particular, the fields of quantum sensing and quantum metrology propose schemes for the estimation of physical properties, such as lengths, time intervals, temperatures, and more, achieving enhanced levels of precision. In particular, the field of distributed Gaussian metrology, which studies strategies employing Gaussian states for the estimation of unknown parameters distributed arbitrarily on a passive and linear network, has recently witnessed some improvements in overcoming the challenge of adaptivity of the network, usually found in Gaussian and, more in general, quantum-enhanced schemes. In fact, as we briefly summarize in Section 2, employing a single squeezed vacuum state and performing homodyne detection at a single output port, it has been shown that it is possible to achieve the elusive Heisenberg-scaling sensitivity by only adding a single auxiliary network, whose purpose is to refocus the probe into the only observed channel. This auxiliary network, which introduced degrees of freedom in the interferometer, can be engineered with only a classical prior knowledge on the parameter.

The main focus of this review was then put on discussing the role and the effect of further degrees of freedom that can be added in the network, for example employing a second, non-optimized auxiliary stage. We have shown in Section 3 that this auxiliary stage leaves essentially unchanged the precision achieved by the setup, especially for networks with a large number of channel, and we have discussed how this result allows us to assume that the constant factor multiplying the Heisenberg scaling of the precision can be controlled and is typically far from zero. In Section 4 we demonstrated that it is possible to exploit this ability to manipulate how the information on multiple parameters encoded in a passive and linear network. We have seen that this results in the possibility to estimate functions of the unknown parameters which we can manipulate by acting on these further degrees of freedom through the auxiliary stages. The advantage of estimating directly functions of parameters lies in the fact that it allows us to save resources which would otherwise be employed to estimate singularly and at a high precision each parameter. In this way, both linear and non-linear functions can be estimated at the Heisenberg limit, and two example have been proposed.

Funding

This work was partially supported by the Office of Naval Research Global (N62909-18-1-2153). P.F. is partially supported by Istituto Nazionale di Fisica Nucleare (INFN) through the project QUANTUM, and by the Italian National Group of Mathematical Physics (GNFM-INdAM).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We thank Giovanni Gramegna and Frank A. Narducci for useful discussions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Typicality of Gaussian Metrology

In this appendix, we will derive the statistical results discussed in Section 3 regarding the pre-factor

f (U, G_{φ})

appearing in the Fisher information in Equation (6). First, we will obtain the average of the pre-factor shown in Equation (18), employing results of computation of averages over the unitary group. Then, we will apply standard results on concentration of measure to derive the result in Equation (20).

Appendix A.1. Derivation of the Average of the Pre-Factor

Denoting with

P

the Haar probability measure defined on the unitary group

U (M)

of the

M \times M

unitary matrices, it is possible to define the average of a given function

f : U (M) \to C

E [f (U)] \equiv E_{P} [f (U)] = \int f (U) d P (U) .

(A1)

In order to derive the average

E [f (U, G_{φ})]

in Equation (13)

f (U, G_{φ}) = {(U^{†} G_{φ} U)}_{11}^{2},

(A2)

we are interested only in the moments of the matrix elements

U_{i j}

up to the fourth orders, i.e., the averages of powers of the matrix elements and their complex conjugates. For random choices of the unitary matrix

U \in U (M)

, the only non-vanishing moments up to the fourth order of the elements

U_{i j}

are given by [61]

\begin{matrix} (A3a) & E [| U_{i j} |^{2}] & = \frac{1}{M} & (1 \leq i, j \leq M), \\ (A3b) & E [| U_{i j} |^{4}] & = \frac{2}{M (M + 1)} & (1 \leq i, j \leq M), \\ (A3c) & E [| U_{i j} |^{2} | U_{k j} |^{2}] & = \frac{1}{M (M + 1)} & (i \neq k), \\ (A3d) & E [| U_{i j} |^{2} | U_{i l} |^{2}] & = \frac{1}{M (M + 1)} & (j \neq l), \\ (A3e) & E [| U_{i j} |^{2} | U_{k l} |^{2}] & = \frac{1}{M^{2} - 1} & (i \neq k, j \neq l), \\ (A3f) & E [U_{i j} U_{k l} U_{i l}^{*} U_{k j}^{*}] & = - \frac{1}{M (M^{2} - 1)} & (i \neq k, j \neq l) . \end{matrix}

The results shown in Equation (A3) can be conveniently expressed in two compact formulas [62] as:

\begin{matrix} E [U_{i j} U_{k l}^{*}] = \frac{δ_{i k} δ_{j l}}{M} \\ E [U_{i j} U_{k l} U_{m n}^{*} U_{p q}^{*}] = \frac{δ_{i m} δ_{j n} δ_{k p} δ_{l q} + δ_{i p} δ_{j q} δ_{k m} δ_{l n}}{M^{2} - 1} \end{matrix}

(A4a)

\begin{matrix} - \frac{δ_{i m} δ_{j q} δ_{k p} δ_{l n} + δ_{i p} δ_{j n} δ_{k m} δ_{l q}}{M (M^{2} - 1)} \end{matrix}

(A4b)

Employing the formulas in Equations (A4), we are able to derive the averages in the main text in Equations (18) and (19). In fact, given a generic

M \times M

complex matrix A, employing the result in Equation (A4a), we have

\begin{matrix} E [{(U^{†} A U)}_{i j}] & = \sum_{k, l} E [U_{i k}^{†} A_{k l} U_{l j}] \\ = \sum_{k, l} A_{k l} E [U_{k i}^{*} U_{l j}] \\ = \sum_{k, l} A_{k l} \frac{δ_{k l} δ_{i j}}{M} \\ = \frac{Tr (A)}{M} δ_{i j}, \end{matrix}

(A5)

while employing the formula in Equation (A4), we have

\begin{matrix} E [{(U^{†} A U)}_{i j}^{2}] & = \sum_{k, l, m, n} E [U_{i k}^{†} A_{k l} U_{l j} U_{i m}^{†} A_{m n} U_{n j}] \\ = \sum_{k, l, m, n} A_{k l} A_{m n} E [U_{l j} U_{n j} U_{m i}^{*} U_{k i}^{*}] \\ = [Tr (A^{2}) + Tr {(A)}^{2}] (\frac{1}{M^{2} - 1} - \frac{1}{M (M^{2} - 1)}) δ_{i j} \\ = \frac{Tr (A^{2}) + Tr {(A)}^{2}}{M (M + 1)} δ_{i j} \end{matrix}

(A6)

The expressions in Equations (A5) and (A6) reduce to the equalities in Equations (18) and (19) for

A = G_{φ}

and

i = j = 1

, respectively.

Appendix A.2. Derivation of the Typicality Results

To show how to derive the result in Equation (20), we start from a standard result on concentration of measure in high-dimensional spaces known as Levy’s Lemma

Theorem A1.

Let

f : S^{n - 1} \to R

be a function defined over the unit euclidean sphere

S^{n - 1} = \{x \in R^{n} | \sum_{k = 1}^{n} x_{k}^{2} = 1\}

(A7)

endowed with the uniform probability measure

P

. Denote with L the Lipschitz constant of the function, such that

| f (x) - f (y) | \leq L {∥x - y∥}_{2},

(A8)

for all

x, y \in S^{n - 1}

, where

{∥x∥}_{2} = \sqrt{\sum_{k = 1}^{n} x_{k}^{2}}

is the Euclidean norm. Then:

P (| f - E [f] | \geq ε) \leq 2 e^{- \frac{n ε^{2}}{C L^{2}}},

(A9)

where C is some positive constant which can be taken to be

C = 9 π^{3}

[63,64].

To prove the concentration result in Equation (20), we want to apply Theorem A1 to our case. Thus, we first need to compute the Lipschitz constant L associated with the pre-factor in Equation (13). To do so, we first notice that

f (U, G_{φ})

can be thought as a function defined on the real unit sphere

S^{n - 1}

. In fact, it can be written as

\begin{matrix} f (U, G_{φ}) & = {(\sum_{j = 1}^{M} {| u_{j} |}^{2} g_{j})}^{2} . \end{matrix}

(A10)

where

u

is the complex M-dimensional vector given by

u = V_{φ}^{†} U e

with

e = {(1, 0, \dots, 0)}^{T} \in C^{M}

, as shown in Equation (15), with

V_{φ}^{†}

being the matrix whose columns are the eigenvectors of

G_{φ}

. Since only the squared moduli

| u_{j} |^{2}

of this complex vector appear in Equation (A10), we can consider the pre-factor

f (U, G_{φ})

as a function of a real vector

x \in R^{2 M}

whose

2 M

components are defined by

x_{2 j - 1} = Re u_{j}, x_{2 j} = Im u_{j}, j = 1, \dots, M .

(A11)

The unitarity constraint

\sum_{j = 1}^{M} {| u_{j} |}^{2} = 1

translates into

\sum_{j = 1}^{2 M} x_{j}^{2} = 1

, so that

x \in S^{2 M - 1}

, the unit sphere sitting inside

R^{2 M}

. We see then that the random factor in Equation (A10) can be thought as a function defined over the unit sphere

S^{2 M - 1}

:

\begin{matrix} f (U, G_{φ}) & = {(\sum_{j = 1}^{M} {| u_{j} |}^{2} g_{j})}^{2} \end{matrix}

\begin{matrix} = {(\sum_{j = 1}^{M} (x_{2 j - 1}^{2} + x_{2 j}^{2}) g_{j})}^{2} \end{matrix}

(A12)

\begin{matrix} = {(x^{T} \tilde{G} x)}^{2} = : f (x), \end{matrix}

(A13)

where we have defined the diagonal matrix

\tilde{G} = diag (\tilde{g})

with

\tilde{g} = (g_{1}, g_{1}, \dots, g_{M}, g_{M}) \in R^{2 M}

. We can now estimate the Lipschitz constant L of the function

f (x)

to apply Theorem A1. To this aim, we evaluate the gradient of f, which is given by:

\begin{matrix} \nabla f (x) & = 4 (x^{T} \tilde{G} x) \tilde{G} x . \end{matrix}

(A14)

The Lipschitz constant for f can be then obtained as

L = max_{x \in S^{2 M - 1}} {∥\nabla f (x)∥}_{2} = 4 {∥G_{φ}∥}^{2},

(A15)

since

\begin{matrix} {∥\nabla f (x)∥}_{2} & = \sqrt{{[\nabla f (x)]}^{T} [\nabla f (x)]} \\ = 4 | x^{T} \tilde{G} x | \sqrt{x^{T} {\tilde{G}}^{2} x} \\ \leq 4 ∥ \tilde{G} ∥^{2} \\ = 4 {∥G_{φ}∥}^{2} \end{matrix}

(A16)

where we used the fact that

| x^{T} \tilde{G} x | \leq ∥ \tilde{G} ∥

and

x^{T} {\tilde{G}}^{2} x = ∥ \tilde{G} {x ∥}^{2} \leq {∥ \tilde{G} ∥}^{2}

, while in the last equality we used the fact that

∥ \tilde{G} ∥ = ∥G_{φ}∥ = {max}_{i} g_{i}

. The value

{∥\nabla f (x)∥}_{2} = 4 {∥G_{φ}∥}^{2}

can be obtained with

x = e_{i}

, supposing that

g_{i}

is the eigenvalue with highest absolute value. The value of

L = 4 {∥G_{φ}∥}^{2}

in Equation (A15) can then be used when applying Theorem A1, with

n = 2 M

, to finally prove the result in Equation (20).

Appendix B. Probability Distributions from Homodyne Measurements

In this appendix, we will obtain the probability density functions which governs the outcomes of homodyne detections performed on a single-mode squeezed state

∣ r 〉 = \hat{S} (r) vac 〉

injected in the first port of a M-channel passive and linear network

\hat{U}

, with

r = (r, 0, \dots, 0)

, so that

N = {sinh}^{2} r

, with

r > 0

, and

N = N_{S}

average number of photons. The state

∣ r 〉

is a Gaussian state which is completely described, through its Wigner distribution [29,30,31,32], by its covariance matrix

Γ = diag (e^{2 r}, 1 \dots, 1, e^{- 2 r}, 1 \dots, 1) / 2

. Moreover, passive and linear networks preserve the Gaussian nature of

∣ r 〉

[29,30,31,32]. For this reason, we will first obtain in generality the final covariance matrix

Γ_{U}

of the state

\hat{U} ∣ r 〉

, which define its Wigner distribution. Then, we will marginalize the Wigner distribution associated with the first channel—since it is the only observed port—rotated by the symplectic and orthogonal matrix

R (θ) = (\begin{matrix} cos θ & sin θ \\ - sin θ & cos θ \end{matrix}),

(A17)

where

θ

is the local oscillator phase of the homodyne. In doing so, we will be able to obtain the expression of the variance

σ_{φ}

in Equation (23).

The covariance matrix

Γ_{U}

of the state

\hat{U} ∣ r 〉

is given by the transformation

Γ_{U} = R Γ R^{T}

, with

R = (\begin{matrix} Re (U) & - Im (U) \\ Im (U) & Re (U) \end{matrix}),

(A18)

orthogonal and symplectic matrix representing the rotation in the phase space generated by the network

\hat{U}

[29,30,31,32]. A straightforward calculation shows that

Γ_{U} = R Γ R^{T} = (\begin{matrix} Δ X^{2} & Δ X P \\ Δ X P^{T} & Δ P^{2} \end{matrix}) .

(A19)

where we have defined the

M \times M

matrices

\begin{matrix} Δ X^{2} & \equiv \frac{1}{2} [Re [U] e^{2 R} Re [U^{†}] - Im [U] e^{- 2 R} Im [U^{†}]] \\ (A20a) & = \frac{1}{2} [Re [U cosh (2 R) U^{†}] + Re [U sinh (2 R) U^{T}]], \\ Δ P^{2} & \equiv \frac{1}{2} [- Im [U] e^{2 R} Im [U^{†}] + Re [U] e^{- 2 R} Re [U^{†}]] \\ (A20b) & = \frac{1}{2} [Re [U cosh (2 R) U^{†}] - Re [U sinh (2 R) U^{T}]], \\ Δ X P & \equiv \frac{1}{2} [- Re [U] e^{2 R} Im [U^{†}] - Im [U] e^{- 2 R} Re [U^{†}]] \\ (A20c) & = \frac{1}{2} [- Im [U cosh (2 R) U^{†}] + Im [U sinh (2 R) U^{T}]] . \end{matrix}

The

2 \times 2

reduced covariance matrix

Γ_{U}^{1}

of the first mode reads

Γ_{U}^{1} = (\begin{matrix} {(Δ X^{2})}_{11} & {(Δ X P)}_{11} \\ {(Δ X P)}_{11} & {(Δ P^{2})}_{11} \end{matrix}) .

(A21)

Our final step is to recover the variance of the quadrature

{\hat{x}}_{θ}

. In order to do that, we employ the orthogonal and symplectic matrix

R (θ)

in Equation (A17), representing the action of a phase-shift

e^{- i θ}

, namely a clock-wise rotation of an angle

θ

in the first mode phase-space. The variance

σ_{φ}^{2}

in (23) is finally obtained by a direct computation, by recalling the parametrization

{(U_{φ})}_{11} = \sqrt{P_{φ}} e^{i f (φ)}

in Equation (21)

\begin{matrix} σ_{φ}^{2} & = {(O_{θ} Γ_{U} O_{θ}^{T})}_{11} \\ = \frac{1}{2} + P_{φ} ({sinh}^{2} (r) + cos (2 f (φ) - 2 θ) cosh (r) sinh (r)) . \end{matrix}

(A22)

recovering the expression in Equation (23).

Appendix C. Asymptotic Analysis of Gaussian Metrology

We here evaluate the asymptotic expressions of the Fisher information matrix in Equation (30), showing that the conditions (29) yield the Heisenberg-scaling sensitivity for the estimation of

f (φ)

. As shown in Equations (26) and (27), the dependence of the variance

σ_{φ}^{2} = \frac{1 - P_{φ}}{2} + \frac{P_{φ}}{2} [cosh (2 r) + cos (2 γ_{φ} - 2 θ) sinh (2 r)],

(A23)

on the parameters

φ

only appears through the transition probability

P_{φ}

and the acquired complex phase

γ_{φ}

\nabla σ_{φ}^{2} = (\partial_{P} σ_{φ}^{2}) \nabla P_{φ} + (\partial_{f} σ_{φ}^{2}) \nabla f (φ),

(A24)

where

\partial_{P}

and

\partial_{f}

represent the differentiation with respect of

P_{φ}

and

f (φ)

, and

\begin{matrix} (A25a) & \partial_{P} σ_{φ}^{2} & = \frac{1}{2} (- 1 + cosh (2 r) + cos (2 f (φ) - 2 θ) sinh (2 r)) \\ (A25b) & \partial_{f} σ_{φ}^{2} & = - P_{φ} sin (2 f (φ) - 2 θ) sinh (2 r) . \end{matrix}

As discussed in Section 4.2, to achieve the HL, some conditions must be imposed so that the variance in Equation (A23) does not grow with N. The only option to do that without ruining the sensitivity of the setup is requesting that

γ_{φ} - θ ≃ π / 2

, as we can see from Equation (A23). In particular, we impose condition (29a), and evaluate the variance in (A23) and its gradient (A24) in the large N limit

\begin{matrix} σ_{φ}^{2} & = \frac{1}{2} + N P_{φ} (1 - cos (\frac{2 k}{N}) \sqrt{1 + \frac{1}{N}}) \\ = \frac{1}{2} + N P_{φ} (1 - (1 - \frac{2 k^{2}}{N^{2}}) (1 + \frac{1}{2 N} - \frac{1}{8 N^{2}})) + O (\frac{1}{N^{2}}) \\ (A26) & = \frac{1 - P_{φ}}{2} + P_{φ} (\frac{2 k^{2}}{N} + \frac{1}{8 N}) + O (\frac{1}{N^{2}}), \\ \nabla σ_{φ}^{2} & = N \nabla P_{φ} (1 - cos (\frac{2 k}{N}) \sqrt{1 + \frac{1}{N}}) + 2 N P_{φ} \nabla f (φ) sin (\frac{2 k}{N}) \sqrt{1 + \frac{1}{N}} \\ = N \nabla P_{φ} (1 - (1 + \frac{1}{2 N})) + 2 N P_{φ} \nabla f (φ) \frac{2 k}{N} + O (\frac{1}{N}) \\ (A27) & = - \frac{1}{2} \nabla P_{φ} + 4 k P_{φ} \nabla f (φ) + O (\frac{1}{N}) . \end{matrix}

Then, by imposing condition (29b) on

P_{φ}

, we get

\begin{matrix} (A28a) & σ_{φ}^{2} & = (2 k^{2} + \frac{1}{8} + \frac{ℓ}{2}) \frac{1}{N} + O (\frac{1}{N^{2}}), \\ (A28b) & \nabla σ_{φ}^{2} & = 4 k \nabla f (φ) + O (\frac{1}{N}) . \end{matrix}

By substituting the expressions in Equations (A28) into the Fisher information matrix in Equation (25), we can finally evaluate its asymptotic expression

F (φ) \sim 8 ϱ (k, ℓ) N^{2} (\nabla f (φ)) {(\nabla f (φ))}^{T}

(A29)

with

ϱ (k, ℓ) = {(\frac{8 k}{16 k^{2} + 1 + 4 ℓ})}^{2}

(A30)

a positive and N-independent pre-factor.

Appendix D. Maximum-Likelihood Estimators for Gaussian Distributions

In this Appendix we will find the solution which maximizes the Likelihood function in Equations (34) and (35).

Due to the monotonicity of the logarithmic function, we can maximize the log-likelihood function, and thus obtain

\begin{matrix} 0 & = \frac{\partial}{\partial f} ln L (φ | \vec{x}) |_{f = {\tilde{f}}_{MLE}} \\ = \frac{\partial}{\partial f} \sum_{j = 1}^{ν} ln p (x_{j} | φ) |_{f = {\tilde{f}}_{MLE}} \\ = \frac{\partial}{\partial f} \sum_{j = 1}^{ν} (- \frac{1}{2} ln σ_{φ}^{2} - \frac{x_{j}^{2}}{2 σ_{φ}^{2}}) |_{f = {\tilde{f}}_{MLE}} \\ = (\frac{\partial σ_{φ}^{2}}{\partial f} \sum_{j = 1}^{ν} (- \frac{1}{2 σ_{φ}^{2}} + \frac{x_{j}^{2}}{2 σ_{φ}^{4}})) |_{f = {\tilde{f}}_{MLE}} . \end{matrix}

(A31)

By assuming that

\partial σ_{φ}^{2} / \partial f \neq 0

the solution is given by the value of

f (φ)

that solves

σ_{φ}^{2} = σ^{2} (\vec{x}) \equiv \frac{1}{ν} \sum_{j = 1}^{ν} x_{j}^{2},

(A32)

where

σ_{φ}^{2}

is given by (23), with

P_{φ}

given by (29b) and

θ

being phase of the local oscillator. Thus the estimator is ultimately given by

\begin{matrix} {\tilde{f}}_{MLE} (x) = θ + \frac{1}{2} (2 n π \pm arccos (\frac{(2 S {(x)}^{2} - 1) - 2 P_{φ} {sinh}^{2} r}{2 P_{φ} sinh r cosh r})), \end{matrix}

(A33)

as shown in Equation (37).

Appendix E. Derivation of the Cramér-Rao Bound for Singular Fisher Information Matrix

In this appendix we will show that the Fisher information matrix in Equation (30)

F (φ) = 8 ϱ (k, ℓ) N^{2} (\nabla f (φ)) {(\nabla f (φ))}^{T},

(A34)

which, as discussed in Section 4.2, is a matrix of rank

(F (φ)) = 1

, whose only non-vanishing eigenvalue is given by

λ = 8 ϱ (k, ℓ) N^{2} {| \nabla f (φ) |}^{2},

(A35)

yields the Cramér-Rao bound found in Equation (33) for the estimation of the function

f (φ)

.

The Cramér-Rao bound associated with the estimation of

f (φ)

for non-invertible matrices can be written in terms of the Moore-Penrose pseudo-inverse

F {(φ)}^{+}

through the inequality [59]

Var [\tilde{f}] ⩾ \frac{1}{ν} H F {(φ)}^{+} H^{T},

(A36)

where

H = {(\nabla f (φ))}^{T} \equiv | \nabla f (φ) | v^{T},

(A37)

is the gradient of the function

f (φ)

which can be estimated with finite variance, which coincides with the eigenvector of

F (φ)

associated with the eigenvalue

λ

in Equation (A35), while

v

is the eigenvector normalized to the unit length. Since

v

is the only (normalized) eigenvector associated with

λ

, the pseudo-inverse

F {(φ)}^{+}

can be written as

F {(φ)}^{+} = \frac{1}{λ} v v^{T} = \frac{1}{{| \nabla f (φ) |}^{4}} \frac{1}{8 ϱ (k, ℓ) N^{2}} (\nabla f (φ)) {(\nabla f (φ))}^{T} .

(A38)

We can thus evaluate the Cramér-Rao bound in Equation (A36)

{Var}_{φ} [\tilde{f}] \geq \frac{1}{ν} | \nabla f (φ) | v^{T} (\frac{1}{λ} v v^{T}) | \nabla f (φ) | v = \frac{1}{ν} \frac{1}{8 ϱ (k, ℓ) N^{2}}

(A39)

as displayed in Equation (33).

Appendix F. Analysis of the Transition Amplitude

We will here show that the transition amplitude

\begin{matrix} {(u_{φ})}_{11} & = e^{i (α_{1} + α_{3} + \frac{δ φ_{2} + δ φ_{3}}{2})} (cos (\frac{δ φ_{2} - δ φ_{3}}{2}) sin 2 ω cos φ_{1} \\ + cos (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) cos 2 ω sin φ_{1} + i sin (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) sin φ_{1}), \end{matrix}

(A40)

shown in Equation (45) satisfies condition (29b) for the choices of

ω

satisfying Equation (43)

ω = \frac{1}{2} arctan (\frac{cos (φ_{cl, 1})}{sin (φ_{cl, 1}) cos Δ α}),

(A41)

and that the complex phase of

{(u_{φ})}_{11}

is the one given in Equations (46) and (47).

First, we notice that if condition (A41) holds, we can write

\begin{matrix} sin 2 ω & = \pm \frac{cos (φ_{cl, 1})}{\sqrt{{cos}^{2} (φ_{cl, 1}) + {sin}^{2} (φ_{cl, 1}) {cos}^{2} Δ α}} \end{matrix}

(A42a)

\begin{matrix} cos 2 ω & = \pm \frac{sin (φ_{cl, 1}) cos Δ α}{\sqrt{{cos}^{2} (φ_{cl, 1}) + {sin}^{2} (φ_{cl, 1}) {cos}^{2} Δ α}}, \end{matrix}

(A42b)

where the signs of both the right-hand expressions must be the same. We will only perform the calculation with both signs being positive, but the same steps can be done for the other case. In order to show that the probability of transition

P_{φ} = {| {(u_{φ})}_{11} |}^{2}

satisfies condition (29b), we will evaluate

P_{φ}

in the case of perfect prior knowledge on the parameters—i.e.,

δ φ_{i} = φ_{i} - φ_{cl, i} = 0

—and show that in this case

P_{φ} = 1

. This is enough to show that condition (29b) is satisfied when the prior knowledge on the parameter is classical—namely

δ φ_{i} = O (N^{- 1 / 2})

—as discussed in Section 4.3, since the first non-vanishing order in

δ φ_{cl}

of

P_{φ} - 1

is

O {(δ φ_{cl})}^{2}

. We thus employ the expressions in Equation (A42) with

φ_{cl, 1} = φ_{1}

to evaluate

(A43)

To evaluate the complex phase

arg {(u_{φ})}_{11}

in Equation (46), and in particular to prove the expression for

Φ

in Equation (47), we simply need to replace the condition on Equation (A42) into

Φ = arctan (\frac{sin (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) sin φ_{1}}{cos (\frac{δ φ_{2} - δ φ_{3}}{2}) sin 2 ω cos φ_{1} + cos (Δ α - \frac{δ φ_{2} - δ φ_{3}}{2}) cos 2 ω sin φ_{1}}),

(A44)

from which we easily verify Equation (47).

Appendix G. Generalized Setup

In this appendix we will show that it is possible to employ more generic multi-channel local networks

{\hat{U}}_{i, φ_{i}}

within the overall network

{\hat{U}}_{φ}

in Figure 5, and still reaching the HL in the estimation of the linear combination

L (φ) = w \cdot φ

(A45)

in Equation (55), if conditions (29) are satisfied. In particular, the parameters

φ

appearing in Equation (A45) will be in this case the phases acquired by the portion of the probe injected in each local network

{\hat{U}}_{i, φ_{i}}

. These phases can be in turn parameters distributed in each local network, or functions of parameters, as in the setup shown in Figure 1 and Figure 4. We will show that the requirement that these local networks

{\hat{U}}_{i, φ_{i}}

must met, in order to satisfy condition (29b), is

P_{i} \sim 1 - \frac{ℓ_{i}}{N}, ℓ_{i} \geq 0, i = 1, \dots, m_{2},

(A46)

similar to the global condition (29b), where

P_{i}

is the transition probability, associated with each local setup

{\hat{U}}_{i, φ_{i}}

, that a photon injected in the first channel of the i-th local network comes out from its upper channel,—i.e.,

P_{i} = {| U_{i, φ_{i}} |}_{11}^{2}

. If conditions (A46) are satisfied, we can generalize the global transition amplitude in Equation (61) to

\begin{matrix} χ_{φ} & = e^{i L (φ_{cl})} \sum_{i = 1}^{M} w_{i} \sqrt{1 - \frac{ℓ_{i}}{N}} e^{i δ φ_{i}} = \\ = e^{i L (φ_{cl})} (1 + \sum_{i = 1}^{M} i w_{i} δ φ_{i} - \frac{1}{2} \sum_{i = 1}^{M} w_{i} (δ φ_{i}^{2} + \frac{ℓ_{i}}{N})) + O (N^{- 3 / 2}), \end{matrix}

(A47)

where we made use of condition (A46) to write the transition amplitudes associated with each local network of

{\hat{U}}_{i, φ_{i}}

. Exploiting once again the requirement that prior estimations are classical, i.e., that

δ φ_{i} = O (N^{- 1 / 2})

, we notice that the probability,

\begin{matrix} P_{φ} & = | e^{i L (φ c l)} \sum_{i = 1}^{M} w_{i} \sqrt{1 - \frac{ℓ_{i}}{N}} e^{i δ φ_{i}} |^{2} = \\ = 1 + {(\sum_{i = 1}^{M} w_{i} δ φ_{i})}^{2} - \sum_{i = 1}^{M} w_{i} (δ φ_{i}^{2} + \frac{ℓ_{i}}{N}) + O (N^{- 3 / 2}) \\ \equiv 1 - \frac{ℓ}{N} + O (N^{- 3 / 2}) \end{matrix}

(A48)

still satisfies condition (29b). Thus, the HL in the estimation of the total acquired phase shown in (A45) is still achieved for generic local networks satisfying the conditions (A46).

References

McConnell, R.; Low, G.H.; Yoder, T.J.; Bruzewicz, C.D.; Chuang, I.L.; Chiaverini, J.; Sage, J.M. Heisenberg scaling of imaging resolution by coherent enhancement. Phys. Rev. A 2017, 96, 051801. [Google Scholar] [CrossRef] [Green Version]
Unternährer, M.; Bessire, B.; Gasparini, L.; Perenzoni, M.; Stefanov, A. Super-resolution quantum imaging at the Heisenberg limit. Optica 2018, 5, 1150–1154. [Google Scholar] [CrossRef] [Green Version]
De Pasquale, A.; Stace, T.M. Quantum Thermometry. In Thermodynamics in the Quantum Regime; Springer: Cham, Switzerland, 2018; Volume 195, pp. 503–527. [Google Scholar] [CrossRef] [Green Version]
Seah, S.; Nimmrichter, S.; Grimmer, D.; Santos, J.P.; Scarani, V.; Landi, G.T. Collisional Quantum Thermometry. Phys. Rev. Lett. 2019, 123, 180602. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Razzoli, L.; Ghirardi, L.; Siloi, I.; Bordone, P.; Paris, M.G.A. Lattice quantum magnetometry. Phys. Rev. A 2019, 99, 062330. [Google Scholar] [CrossRef] [Green Version]
Bhattacharjee, S.; Bhattacharya, U.; Niedenzu, W.; Mukherjee, V.; Dutta, A. Quantum magnetometry using two-stroke thermal machines. New J. Phys. 2020, 22, 013024. [Google Scholar] [CrossRef]
Aasi, J.; Abadie, J.; Abbott, B.P.; Abbott, R.; Abbott, T.D.; Abernathy, M.R.; Adams, C.; Adams, T.; Addesso, P.; Adhikari, R.X.; et al. Enhanced sensitivity of the LIGO gravitational wave detector by using squeezed states of light. Nat. Photonics 2013, 7, 613–619. [Google Scholar] [CrossRef]
Caves, C.M. Quantum-mechanical noise in an interferometer. Phys. Rev. D 1981, 23, 1693–1708. [Google Scholar] [CrossRef]
Bondurant, R.S.; Shapiro, J.H. Squeezed states in phase-sensing interferometers. Phys. Rev. D 1984, 30, 2548–2556. [Google Scholar] [CrossRef]
Wineland, D.J.; Bollinger, J.J.; Itano, W.M.; Moore, F.L.; Heinzen, D.J. Spin squeezing and reduced quantum noise in spectroscopy. Phys. Rev. A 1992, 46, R6797–R6800. [Google Scholar] [CrossRef]
Giovannetti, V.; Lloyd, S.; Maccone, L. Quantum-Enhanced Measurements: Beating the Standard Quantum Limit. Science 2004, 306, 1330–1336. [Google Scholar] [CrossRef] [Green Version]
Giovannetti, V.; Lloyd, S.; Maccone, L. Quantum Metrology. Phys. Rev. Lett. 2006, 96, 010401. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Dowling, J.P. Quantum optical metrology—The lowdown on high-N00N states. Contemp. Phys. 2008, 49, 125–143. [Google Scholar] [CrossRef]
Paris, M.G. Quantum estimation for quantum technology. Int. J. Quantum Inf. 2009, 7, 125–137. [Google Scholar] [CrossRef]
Giovannetti, V.; Lloyd, S.; Maccone, L. Advances in quantum metrology. Nat. Photonics 2011, 5, 222–229. [Google Scholar] [CrossRef]
Lang, M.D.; Caves, C.M. Optimal Quantum-Enhanced Interferometry Using a Laser Power Source. Phys. Rev. Lett. 2013, 111, 173601. [Google Scholar] [CrossRef] [Green Version]
Tóth, G.; Apellaniz, I. Quantum metrology from a quantum information science perspective. J. Phys. A Math. Theor. 2014, 47, 424006. [Google Scholar] [CrossRef] [Green Version]
Erol, V.; Ozaydin, F.; Altintas, A.A. Analysis of Entanglement Measures and LOCC Maximized Quantum Fisher Information of General Two Qubit Systems. Sci. Rep. 2014, 4, 5422. [Google Scholar] [CrossRef] [Green Version]
Dowling, J.P.; Seshadreesan, K.P. Quantum Optical Technologies for Metrology, Sensing, and Imaging. J. Light. Technol. 2015, 33, 2359–2370. [Google Scholar] [CrossRef] [Green Version]
Czekaj, L.; Przysiężna, A.; Horodecki, M.; Horodecki, P. Quantum metrology: Heisenberg limit with bound entanglement. Phys. Rev. A 2015, 92, 062303. [Google Scholar] [CrossRef] [Green Version]
Ozaydin, F.; Altintas, A.A. Quantum Metrology: Surpassing the shot-noise limit with Dzyaloshinskii-Moriya interaction. Sci. Rep. 2015, 5, 16360. [Google Scholar] [CrossRef] [Green Version]
Szczykulska, M.; Baumgratz, T.; Datta, A. Multi-parameter quantum metrology. Adv. Phys. X 2016, 1, 621–639. [Google Scholar] [CrossRef]
Schnabel, R. Squeezed states of light and their applications in laser interferometers. Phys. Rep. 2017, 684, 1–51. [Google Scholar] [CrossRef] [Green Version]
Braun, D.; Adesso, G.; Benatti, F.; Floreanini, R.; Marzolino, U.; Mitchell, M.W.; Pirandola, S. Quantum-enhanced measurements without entanglement. Rev. Mod. Phys. 2018, 90, 035006. [Google Scholar] [CrossRef] [Green Version]
Pirandola, S.; Bardhan, B.R.; Gehring, T.; Weedbrook, C.; Lloyd, S. Advances in photonic quantum sensing. Nat. Photonics 2018, 12, 724–733. [Google Scholar] [CrossRef]
Tóth, G.; Vértesi, T. Quantum States with a Positive Partial Transpose are Useful for Metrology. Phys. Rev. Lett. 2018, 120, 020506. [Google Scholar] [CrossRef] [Green Version]
Polino, E.; Valeri, M.; Spagnolo, N.; Sciarrino, F. Photonic quantum metrology. AVS Quantum Sci. 2020, 2, 024703. [Google Scholar] [CrossRef]
Pál, K.F.; Tóth, G.; Bene, E.; Vértesi, T. Bound entangled singlet-like states for quantum metrology. Phys. Rev. Res. 2021, 3, 023101. [Google Scholar] [CrossRef]
Schleich, W. Quantum Optics in Phase Space; Wiley: Hoboken, NJ, USA, 2011. [Google Scholar] [CrossRef]
Weedbrook, C.; Pirandola, S.; García-Patrón, R.; Cerf, N.J.; Ralph, T.C.; Shapiro, J.H.; Lloyd, S. Gaussian quantum information. Rev. Mod. Phys. 2012, 84, 621–669. [Google Scholar] [CrossRef]
Adesso, G.; Ragy, S.; Lee, A.R. Continuous Variable Quantum Information: Gaussian States and Beyond. Open Syst. Inf. Dyn. 2014, 21, 1440001. [Google Scholar] [CrossRef] [Green Version]
Lvovsky, A.I. Squeezed Light. In Photonics; John Wiley and Sons, Ltd.: Hoboken, NJ, USA, 2015; Chapter 5; pp. 121–163. [Google Scholar] [CrossRef]
Maccone, L.; Riccardi, A. Squeezing metrology: A unified framework. Quantum 2020, 4, 292. [Google Scholar] [CrossRef]
Gatto, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Distributed quantum metrology with a single squeezed-vacuum source. Phys. Rev. Res. 2019, 1, 032024. [Google Scholar] [CrossRef] [Green Version]
Gramegna, G.; Triggiani, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Heisenberg scaling precision in multi-mode distributed quantum metrology. New J. Phys. 2021, 23, 053002. [Google Scholar] [CrossRef]
Gramegna, G.; Triggiani, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Typicality of Heisenberg scaling precision in multimode quantum metrology. Phys. Rev. Res. 2021, 3, 013152. [Google Scholar] [CrossRef]
Triggiani, D.; Facchi, P.; Tamma, V. Heisenberg scaling precision in the estimation of functions of parameters in linear optical networks. Phys. Rev. A 2021, 104, 062603. [Google Scholar] [CrossRef]
Triggiani, D.; Facchi, P.; Tamma, V. Non-adaptive Heisenberg-limited metrology with multi-channel homodyne measurements. Eur. Phys. J. Plus 2022, 137, 125. [Google Scholar] [CrossRef]
Triggiani, D.; Tamma, V. Estimation with Heisenberg-Scaling Sensitivity of a Single Parameter Distributed in an Arbitrary Linear Optical Network. Sensors 2022, 22, 2657. [Google Scholar] [CrossRef]
Gatto, D.; Facchi, P.; Tamma, V. Heisenberg-limited estimation robust to photon losses in a Mach-Zehnder network with squeezed light. Phys. Rev. A 2022, 105, 012607. [Google Scholar] [CrossRef]
Triggiani, D.; Tamma, V. Estimation of the average of arbitrary unknown phase delays with Heisenberg-scaling precision. In Proceedings of the Optical and Quantum Sensing and Precision Metrology II; Scheuer, J., Shahriar, S.M., Eds.; International Society for Optics and Photonics, SPIE: San Francisco, CA, USA, 2022; Volume 12016, pp. 97–101. [Google Scholar] [CrossRef]
Proctor, T.J.; Knott, P.A.; Dunningham, J.A. Multiparameter Estimation in Networked Quantum Sensors. Phys. Rev. Lett. 2018, 120, 080501. [Google Scholar] [CrossRef] [Green Version]
Zhuang, Q.; Zhang, Z.; Shapiro, J.H. Distributed quantum sensing using continuous-variable multipartite entanglement. Phys. Rev. A 2018, 97, 032329. [Google Scholar] [CrossRef] [Green Version]
Matsubara, T.; Facchi, P.; Giovannetti, V.; Yuasa, K. Optimal Gaussian metrology for generic multimode interferometric circuit. New J. Phys. 2019, 21, 033014. [Google Scholar] [CrossRef] [Green Version]
Qian, K.; Eldredge, Z.; Ge, W.; Pagano, G.; Monroe, C.; Porto, J.V.; Gorshkov, A.V. Heisenberg-scaling measurement protocol for analytic functions with quantum sensor networks. Phys. Rev. A 2019, 100, 042304. [Google Scholar] [CrossRef] [Green Version]
Guo, X.; Breum, C.R.; Borregaard, J.; Izumi, S.; Larsen, M.V.; Gehring, T.; Christandl, M.; Neergaard-Nielsen, J.S.; Andersen, U.L. Distributed quantum sensing in a continuous-variable entangled network. Nat. Phys. 2020, 16, 281–284. [Google Scholar] [CrossRef] [Green Version]
Oh, C.; Lee, C.; Lie, S.H.; Jeong, H. Optimal distributed quantum sensing using Gaussian states. Phys. Rev. Res. 2020, 2, 023030. [Google Scholar] [CrossRef] [Green Version]
Grace, M.R.; Gagatsos, C.N.; Guha, S. Entanglement-enhanced estimation of a parameter embedded in multiple phases. Phys. Rev. Res. 2021, 3, 033114. [Google Scholar] [CrossRef]
Armen, M.A.; Au, J.K.; Stockton, J.K.; Doherty, A.C.; Mabuchi, H. Adaptive Homodyne Measurement of Optical Phase. Phys. Rev. Lett. 2002, 89, 133602. [Google Scholar] [CrossRef] [PubMed] [Green Version]
Monras, A. Optimal phase measurements with pure Gaussian states. Phys. Rev. A 2006, 73, 033821. [Google Scholar] [CrossRef] [Green Version]
Aspachs, M.; Calsamiglia, J.; Muñoz Tapia, R.; Bagan, E. Phase estimation for thermal Gaussian states. Phys. Rev. A 2009, 79, 033834. [Google Scholar] [CrossRef] [Green Version]
Berni, A.A.; Gehring, T.; Nielsen, B.M.; Händchen, V.; Paris, M.G.A.; Andersen, U.L. Ab initio quantum-enhanced optical phase estimation using real-time feedback control. Nat. Photonics 2015, 9, 577–581. [Google Scholar] [CrossRef]
Grace, M.R.; Gagatsos, C.N.; Zhuang, Q.; Guha, S. Quantum-Enhanced Fiber-Optic Gyroscopes Using Quadrature Squeezing and Continuous-Variable Entanglement. Phys. Rev. Appl. 2020, 14, 034065. [Google Scholar] [CrossRef]
Cramér, H. Mathematical Methods of Statistics (PMS-9); Princeton University Press: Princeton, NJ, USA, 1946. [Google Scholar] [CrossRef]
Rohatgi, V.K.; Saleh, A.M.E. An Introduction to Probability and Statistics; John Wiley and Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
Haar, A. Der Massbegriff in der Theorie der Kontinuierlichen Gruppen. Ann. Math. 1933, 34, 147–169. [Google Scholar] [CrossRef]
Nichols, R.; Liuzzo-Scorpo, P.; Knott, P.A.; Adesso, G. Multiparameter Gaussian quantum metrology. Phys. Rev. A 2018, 98, 012114. [Google Scholar] [CrossRef] [Green Version]
Demkowicz-Dobrzanski, R.; Górecki, W.; Guţă, M. Multi-parameter estimation beyond quantum Fisher information. J. Phys. A Math. Theor. 2020, 53, 363001. [Google Scholar] [CrossRef]
Stoica, P.; Marzetta, T.L. Parameter estimation problems with singular information matrices. IEEE Trans. Signal Process. 2001, 49, 87–90. [Google Scholar] [CrossRef]
Gross, J.A.; Caves, C.M. One from Many: Estimating a Function of Many Parameters. J. Phys. A Math. Theor. 2021, 54, 014001. [Google Scholar] [CrossRef]
Hiai, F.; Petz, D. The Semicircle Law, Free Random Variables and Entropy; Number 77; American Mathematical Soc.: Providence, RI, USA, 2000. [Google Scholar] [CrossRef]
Puchała, Z.; Miszczak, J.A. Symbolic integration with respect to the Haar measure on the unitary groups. Bull. Pol. Acad. Sci. Tech. Sci. 2017, 65, 21–27. [Google Scholar] [CrossRef] [Green Version]
Facchi, P.; Garnero, G. Quantum thermodynamics and canonical typicality. Int. J. Geom. Methods Mod. Phys. 2017, 14, 1740001. [Google Scholar] [CrossRef] [Green Version]
Popescu, S.; Short, A.J.; Winter, A. Entanglement and the foundations of statistical mechanics. Nat. Phys. 2006, 2, 754. [Google Scholar] [CrossRef] [Green Version]

Figure 1. Example of a passive and linear network

{\hat{U}}_{φ}

which depends on a single global parameter

φ

[35]. The parameter can be thought as a physical property of an external agent (e.g., temperature, electromagnetic field) which affects multiple components, possibly of different nature, of the network [35,38].

Figure 1. Example of a passive and linear network

{\hat{U}}_{φ}

which depends on a single global parameter

φ

[35]. The parameter can be thought as a physical property of an external agent (e.g., temperature, electromagnetic field) which affects multiple components, possibly of different nature, of the network [35,38].

Figure 2. Histograms of the pre-factor

f (U, G_{φ})

in Equation (13) for

M = 2

(left) and

M = 20

(right), obtained numerically with

10^{5}

samplings of U with respect of the unitary Haar measure [36]. The generator

G_{φ}

chosen is a diagonal matrix with half 1s and half 3s as entries. The histograms are normalized to the unity. We can see that in the histogram in the right, the values of the pre-factor are more concentrated around its average

E_{P} [f (U, G_{φ})] ≃ 4

.

Figure 2. Histograms of the pre-factor

f (U, G_{φ})

in Equation (13) for

M = 2

(left) and

M = 20

(right), obtained numerically with

10^{5}

samplings of U with respect of the unitary Haar measure [36]. The generator

G_{φ}

chosen is a diagonal matrix with half 1s and half 3s as entries. The histograms are normalized to the unity. We can see that in the histogram in the right, the values of the pre-factor are more concentrated around its average

E_{P} [f (U, G_{φ})] ≃ 4

.

Figure 3. Diagram of the setup described in Section 4.1 [37]. A squeezed vacuum state is injected in the first input port of a network composed of a first auxiliary stage

{\hat{V}}_{in}

, a linear and passive network

{\hat{U}}_{φ}

which depends on multiple unknown parameters

φ

, and a second auxiliary stage

{\hat{V}}_{out}

. The two auxiliary stages are linear and passive networks whose purpose is to manipulate how the information on

φ

and on the structure of the network is encoded into the probe, and to refocus it in the only output channel observed through homodyne detection. This setup reaches the Heisenberg limit in the estimation of the overall phase acquired by the probe, which is a function of the parameters

φ

that can be manipulated through the choice of

{\hat{V}}_{in}

and

{\hat{V}}_{out}

.

Figure 3. Diagram of the setup described in Section 4.1 [37]. A squeezed vacuum state is injected in the first input port of a network composed of a first auxiliary stage

{\hat{V}}_{in}

, a linear and passive network

{\hat{U}}_{φ}

which depends on multiple unknown parameters

φ

, and a second auxiliary stage

{\hat{V}}_{out}

. The two auxiliary stages are linear and passive networks whose purpose is to manipulate how the information on

φ

and on the structure of the network is encoded into the probe, and to refocus it in the only output channel observed through homodyne detection. This setup reaches the Heisenberg limit in the estimation of the overall phase acquired by the probe, which is a function of the parameters

φ

that can be manipulated through the choice of

{\hat{V}}_{in}

and

{\hat{V}}_{out}

.

Figure 4. A 2-channel example of a network which allows to estimate at the Heisenberg limit certain functions of the reflectivity

φ_{1}

of a beam-splitter and the magnitudes

φ_{2 / 3}

of phase-shifts [37]. The form of the function can be manipulated by changing the values of the arbitrary parameters

α

, with

α_{1} - α_{2} = α_{4} - α_{3} = Δ α

(see Equation (46)), while the value of

ω

is shown in Equation (43). Similarly to the setup for the single-parameter model described in Section 2, a classical knowledge

φ_{cl}

of the unknown parameters suffices to optimize the network.

Figure 4. A 2-channel example of a network which allows to estimate at the Heisenberg limit certain functions of the reflectivity

φ_{1}

of a beam-splitter and the magnitudes

φ_{2 / 3}

of phase-shifts [37]. The form of the function can be manipulated by changing the values of the arbitrary parameters

α

, with

α_{1} - α_{2} = α_{4} - α_{3} = Δ α

(see Equation (46)), while the value of

ω

is shown in Equation (43). Similarly to the setup for the single-parameter model described in Section 2, a classical knowledge

φ_{cl}

of the unknown parameters suffices to optimize the network.

Figure 5. Network for the estimation at the Heisenberg limit of any linear combination of M parameters with positive weights, as shown in Equation (55) [37]. As shown in the lower panel, each parameter

φ_{i}

can either be (a) an optical phase acquired through a single-mode phase-shift, or (b) the reflectivity of a lossless beam-splitter. The network (b) is given in Equation (56), and its purpose is to transform the reflectivity into an optical phase. Two auxiliary stages

{\hat{V}}_{in}

and

{\hat{V}}_{out}

are employed, whose purpose is to distribute the probe according to the weights in the linear combination (see Equation (59)), and to refocus the probe into the first output port of the network (see Equation (60)).

Figure 5. Network for the estimation at the Heisenberg limit of any linear combination of M parameters with positive weights, as shown in Equation (55) [37]. As shown in the lower panel, each parameter

φ_{i}

can either be (a) an optical phase acquired through a single-mode phase-shift, or (b) the reflectivity of a lossless beam-splitter. The network (b) is given in Equation (56), and its purpose is to transform the reflectivity into an optical phase. Two auxiliary stages

{\hat{V}}_{in}

and

{\hat{V}}_{out}

are employed, whose purpose is to distribute the probe according to the weights in the linear combination (see Equation (59)), and to refocus the probe into the first output port of the network (see Equation (60)).

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Triggiani, D.; Facchi, P.; Tamma, V. The Role of Auxiliary Stages in Gaussian Quantum Metrology. Photonics 2022, 9, 345. https://doi.org/10.3390/photonics9050345

AMA Style

Triggiani D, Facchi P, Tamma V. The Role of Auxiliary Stages in Gaussian Quantum Metrology. Photonics. 2022; 9(5):345. https://doi.org/10.3390/photonics9050345

Chicago/Turabian Style

Triggiani, Danilo, Paolo Facchi, and Vincenzo Tamma. 2022. "The Role of Auxiliary Stages in Gaussian Quantum Metrology" Photonics 9, no. 5: 345. https://doi.org/10.3390/photonics9050345

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

The Role of Auxiliary Stages in Gaussian Quantum Metrology

Abstract

1. Introduction

2. Distributed-Parameter Quantum-Enhanced Estimation

3. Typicality of Quantum Enhanced Sensitivity

3.1. The Role of the Generator $G_{φ}$

3.2. Typical Behaviour of the Pre-Factor in the Heisenberg Scaling

4. Estimation of Functions of Parameters

4.1. Setup

4.2. Heisenberg Scaling

4.3. On the Conditions for the Heisenberg Scaling

4.4. Examples of Quantum-Enhanced Estimation of Functions

4.4.1. Non-Linear Functions

4.4.2. Linear Combinations of Arbitrary Parameters

5. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Typicality of Gaussian Metrology

Appendix A.1. Derivation of the Average of the Pre-Factor

Appendix A.2. Derivation of the Typicality Results

Appendix B. Probability Distributions from Homodyne Measurements

Appendix C. Asymptotic Analysis of Gaussian Metrology

Appendix D. Maximum-Likelihood Estimators for Gaussian Distributions

Appendix E. Derivation of the Cramér-Rao Bound for Singular Fisher Information Matrix

Appendix F. Analysis of the Transition Amplitude

Appendix G. Generalized Setup

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

The Role of Auxiliary Stages in Gaussian Quantum Metrology

Abstract

1. Introduction

2. Distributed-Parameter Quantum-Enhanced Estimation

3. Typicality of Quantum Enhanced Sensitivity

3.1. The Role of the Generator G φ

3.2. Typical Behaviour of the Pre-Factor in the Heisenberg Scaling

4. Estimation of Functions of Parameters

4.1. Setup

4.2. Heisenberg Scaling

4.3. On the Conditions for the Heisenberg Scaling

4.4. Examples of Quantum-Enhanced Estimation of Functions

4.4.1. Non-Linear Functions

4.4.2. Linear Combinations of Arbitrary Parameters

5. Conclusions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

Appendix A. Typicality of Gaussian Metrology

Appendix A.1. Derivation of the Average of the Pre-Factor

Appendix A.2. Derivation of the Typicality Results

Appendix B. Probability Distributions from Homodyne Measurements

Appendix C. Asymptotic Analysis of Gaussian Metrology

Appendix D. Maximum-Likelihood Estimators for Gaussian Distributions

Appendix E. Derivation of the Cramér-Rao Bound for Singular Fisher Information Matrix

Appendix F. Analysis of the Transition Amplitude

Appendix G. Generalized Setup

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3.1. The Role of the Generator $G_{φ}$