Review

Estimation with Heisenberg-Scaling Sensitivity of a Single Parameter Distributed in an Arbitrary Linear Optical Network

by
Danilo Triggiani
1 and
Vincenzo Tamma
1,2,*
1
School of Mathematics and Physics, University of Portsmouth, Portsmouth PO1 3QL, UK
2
Institute of Cosmology and Gravitation, University of Portsmouth, Portsmouth PO1 3FX, UK
*
Author to whom correspondence should be addressed.
Sensors 2022, 22(7), 2657; https://doi.org/10.3390/s22072657
Submission received: 28 January 2022 / Revised: 22 March 2022 / Accepted: 27 March 2022 / Published: 30 March 2022
(This article belongs to the Special Issue Novel Sensors and Techniques in Quantum Imaging Applications)

Abstract

Quantum sensing and quantum metrology propose schemes for the estimation of physical properties, such as lengths, time intervals, and temperatures, achieving enhanced levels of precision beyond the possibilities of classical strategies. However, such an enhanced sensitivity usually comes at a price: the use of probes in highly fragile states, the need to adaptively optimise the estimation schemes to the value of the unknown property we want to estimate, and the limited working range, are some examples of challenges which prevent quantum sensing protocols from being practical for applications. This work reviews two feasible estimation schemes which address these challenges, employing easily realisable resources, i.e., squeezed light, and achieving the desired quantum enhancement of the precision, namely the Heisenberg-scaling sensitivity. In more detail, it is shown here how to overcome, in the estimation of any parameter affecting in a distributed manner multiple components of an arbitrary M-channel linear optical network, the need to iteratively optimise the network. In particular, we show that this is possible with a single-step adaptation of the network based only on a prior knowledge of the parameter achievable through a "classical" shot-noise limited estimation strategy. Furthermore, homodyne measurements with only one detector allow us to achieve Heisenberg-limited estimation of the parameter. We further demonstrate that one can avoid the use of any auxiliary network at the price of simultaneously employing multiple detectors.

1. Introduction

Due to the discrete nature of physical phenomena, the error in the estimation of physical properties, such as lengths, delays, temperatures, or refractive indices, when employing N probes (e.g., photons, electrons) is strongly limited by the so-called shot-noise scaling factor of $1/\sqrt{N}$ when a classical estimation strategy, i.e., one in which the probe and the measurement employed can be fully described classically, is performed. However, it has been proven that it is possible, by exploiting quantum features such as entanglement and the squeezing of light, to overcome this classical limitation, and to reach an enhanced scaling in the precision of order $1/N$, called the Heisenberg limit [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17].
A promising path towards this quantum-enhanced sensitivity is the one enabled by the squeezing of light [18,19,20,21]. Squeezed states are particular states of the electromagnetic field characterised by a quadrature field with smaller fluctuations than the field quadratures of the vacuum itself. This useful property, together with the Gaussian nature of these states, which makes them relatively easy to produce, resilient to noise, and mathematically simple to manipulate [20,22], makes these states natural candidates for metrological purposes. The working principle of these protocols is rather straightforward, and can be outlined as follows. The probe, a pure squeezed state, undergoes an optical phase delay of a magnitude which depends on the unknown parameter we are interested in measuring. The phase delay causes a transformation, or more precisely a rotation, of the state of the probe, which needs to be observed to infer the unknown parameter. In order to obtain the quantum enhancement, the measured quadrature field must be 'sufficiently' squeezed, namely it must possess a variance reduced below the vacuum fluctuations.
It is possible to highlight two major approaches undertaken in the literature, according to the type of squeezed state employed and, ultimately, to how the information about the parameter is encoded onto, and then retrieved from, the probe. In 'displacement-encoding' approaches [2,23], the initial overall state possesses a non-vanishing displacement, i.e., a non-zero average of the quadrature fields, so that the value of the parameter is encoded into the variation of the displacement. This approach presents the advantage that the unknown parameter can be retrieved through the measurement of the displacement of the probe, namely the mean value of the signal experimentally observed in a laboratory when performing homodyne detection. In 'squeezing-encoding' approaches [24,25,26], all the resources are accumulated in the squeezing of the probes, which are thus squeezed vacuum states, so that the information on the parameter is encoded into the parameter-dependent rotation of the covariance matrix. This is the optimal approach for squeezing-based estimation protocols in the sense that, for a fixed total average number of photons in a Gaussian state, the strategy that maximises the precision is to concentrate all the photons in the squeezing [24,26,27]. Since the information about the parameter is encoded in the covariance matrix of the probe, with this approach the estimation consists of retrieving the value of the parameter from the modulation of the noise. Another interesting approach involves the use of an active component for the estimation of phases, i.e., anti-squeezing the signal before the detection [28,29,30,31,32]. In this approach, the goal is generally to estimate the value of the unknown phase by detecting whether the final state of the probe differs from the one injected in the network, namely, by performing a projective measurement on the initial state.
Another interesting aspect of quantum metrology that has been frequently investigated is the possibility to estimate a parameter which is not localised in a single node of the network which encodes it, but distributed in an arbitrary manner among multiple components. Although this framework entails a further complication, given by the generally non-trivial encoding of the value of the parameter onto the probe, it allows for the study of estimation schemes with arbitrary structures of the networks. Since, in this approach, the network is generally described as a 'black box', namely the structure of the network is not specified, the applicability of the estimation technique is universal, i.e., guaranteed for any passive and unitary evolution of the probe, covering in this way experimental situations rarely treated in the literature. Moreover, the absence of a defined network structure allows one to analyse general properties, such as the average performance of the estimation over some set of local transformations [33], the best possible Gaussian strategy for a multi-channel network [27], or the typicality of the Heisenberg scaling [34]. Applications range from high-precision biomedical phase imaging, quantum-enhanced pattern recognition, and gravitational-wave sensing, to the mapping of external fields (magnetic and electric fields or temperatures).
However, standard Gaussian estimation protocols still present some challenges. In particular, adaptivity, i.e., the fact that a scheme for the estimation of the parameter depends on the parameter itself, is known to be a typical feature of ab initio Gaussian metrology [24,35,36,37], namely of schemes for the estimation of a parameter on which no prior information is available. Intuitively, the cause of the adaptivity stems from the fact that the phase of the squeezed quadrature which needs to be measured to achieve the quantum-enhanced precision depends on the phase acquired by the probe through the interferometric evolution in arbitrary parameter-encoding networks and, ultimately, on the unknown parameter. Moreover, in distributed quantum metrology, a further experimental challenge arises from the need to adaptively optimize the preparation of the probe and of the measurement [27]. A common approach to avoid adaptivity is to limit the range of values that the unknown parameter is allowed to take, for example, requiring that the parameter is small [38,39,40,41]. In fact, in the regime of small phases, the transformation the probe undergoes is small enough to make the rotation of the squeezed quadrature negligible, so that the squeezed quadrature remains practically unchanged. Nevertheless, in some practical scenarios the experimenters have no control over the value of the phase, e.g., when the system under investigation is unreachable or cannot be manipulated, and in such cases this approach ceases to be feasible.
We will show here how the challenge of adaptivity, and any restriction on the range of possible values of the parameter to be estimated, can be overcome thanks to some recent results in the field of distributed quantum metrology, for the estimation with Heisenberg-scaling sensitivity of a single parameter distributed throughout an arbitrary, multi-channel passive and linear network (see Figure 1). In particular, the unknown parameter can be thought of as a physical property of an external field, such as the temperature of the environment, or the magnitude of an electromagnetic field, which globally affects some or all of the components of the network. This review is organised in two parts, in which we introduce two distinct estimation schemes, discussing the features that differentiate the two approaches. In Section 2, we will discuss a squeezing-encoding scheme achieving Heisenberg-scaling sensitivity by employing a single squeezed vacuum state and homodyne detection at a single output channel [42]. We will assume that no prior knowledge on the value of the parameter is available, and that the structure of the network, as well as the nature of the parameter it encodes, are completely arbitrary. We will present the conditions that need to be satisfied in order to reach the Heisenberg scaling in such a generic model, and we will show that, in general, only a classical prior estimation of the unknown parameter suffices to prepare the single auxiliary linear network required to optimize the setup. These results show that it is possible to conceive feasible two-step estimation strategies, composed of a first classical estimation of the unknown parameter, required to engineer the auxiliary stage, and then the actual quantum estimation. In Section 3, we describe a different scheme achieving the Heisenberg scaling, which makes use of a single-mode squeezed-coherent state as a probe, and homodyne detection at every output port of the linear network [43]. Once again, the model will assume no prior knowledge on the parameter, nor on the structure of the network. We will see that, in this case, no auxiliary stage is required to achieve the Heisenberg scaling, so that the estimation can be carried out in a single step, at the cost of employing multiple homodyne detectors. Moreover, we will show that, due to the introduction of a non-vanishing displacement in the probe, the overall precision becomes the sum of two contributions, one deriving from the information encoded in the sample mean of the outcomes of the homodyne measurements, and the other from the information encoded in the sample covariance matrix.

2. Quantum Estimation Based on Single-Homodyne Measurements

In this section, we will describe a generic model of an M-channel linear network $\hat{U}_\varphi$ that allows the Heisenberg scaling to be reached in the estimation of a parameter $\varphi$ distributed arbitrarily in $\hat{U}_\varphi$, regardless of the structure of the network. Our model makes use of two auxiliary stages, namely two other linear networks $\hat{V}_{\mathrm{in}}$ and $\hat{V}_{\mathrm{out}}$, whose purposes are to distribute the probe, a single-mode squeezed vacuum injected in the first channel, over all the optical modes of $\hat{U}_\varphi$, and then to refocus it into the only channel observed through homodyne detection. We will show that only one of the two auxiliary networks must be optimized, and although the optimal choice of such a network depends on the value of $\varphi$, the required precision for this optimization can be achieved with a classical estimation strategy. This allows us to conceive two-step estimation protocols, where a prior classical estimation used to engineer the optimal auxiliary stage is followed by the quantum estimation achieving the Heisenberg scaling.

2.1. Setup

Let us consider an M-channel passive linear network, whose action on the state of the probe is described by the unitary operator $\hat{U}_\varphi$, $\varphi$ being a single, generally distributed, unknown parameter we are interested in estimating. Due to its passive and linear nature, this network can be represented by a unitary matrix. Let then $U_\varphi$ be the $M\times M$ unitary matrix representing the action of $\hat{U}_\varphi$ on the annihilation operators $\hat{a}_i$, $i = 1, \dots, M$, associated with each channel of the network, satisfying the commutation relations $[\hat{a}_i, \hat{a}_j] = 0$, $[\hat{a}_i, \hat{a}_j^\dagger] = \delta_{ij}$, where we denote with $\delta_{ij}$ the Kronecker delta. The matrix $U_\varphi$ is thus defined by the transformation
$\hat{U}_\varphi^\dagger\, \hat{a}_i\, \hat{U}_\varphi = \sum_j (U_\varphi)_{ij}\, \hat{a}_j .$   (1)
We will consider a single-mode squeezed state
$|\psi_0\rangle = |\mathbf{r}\rangle \equiv \hat{S}(\mathbf{r})\,|\mathrm{vac}\rangle, \qquad \mathbf{r} = (r, 0, \dots, 0),$   (2)
as a probe, with $\hat{S}(\mathbf{r}) = e^{\frac{r}{2}(\hat{a}_1^{\dagger 2} - \hat{a}_1^{2})}$ the squeezing operator and $N = \sinh^2 r$ the average number of photons, all injected in a single channel of the apparatus, i.e., the first one with the choice of squeezing parameters $\mathbf{r}$ in Equation (2). In other words, the state $|\psi_0\rangle$ presents a non-vanishing number of photons only in the first mode. As discussed in Section 1, the approach of squeezing-based estimation strategies is to infer the value of $\varphi$ from the transformation of the covariance matrix $\Gamma = \mathrm{diag}(e^{2r}, 1, \dots, 1, e^{-2r}, 1, \dots, 1)/2$ of the state $|\psi_0\rangle$ after the interferometric evolution $\hat{U}_\varphi$. To do so, we will consider the model where a single output channel, say the first, is measured through homodyne detection. We will denote with $\theta$ the phase of the local oscillator, which coincides with the phase of the measured quadrature $\hat{x}_\theta$. We will assume that $r > 0$ without loss of generality. We notice that, with this assumption, the squeezed quadrature of the state $|\psi_0\rangle$ is $\hat{p}_1 \equiv i(\hat{a}_1^\dagger - \hat{a}_1)/\sqrt{2}$, with $\mathrm{Var}[\hat{p}_1] = e^{-2r}/2$, while the anti-squeezed quadrature field is $\hat{x}_1 \equiv (\hat{a}_1 + \hat{a}_1^\dagger)/\sqrt{2}$, with $\mathrm{Var}[\hat{x}_1] = e^{2r}/2$. In terms of the creation and annihilation operators, the measured quadrature field can be expressed as $\hat{x}_\theta = (e^{i\theta}\hat{a}_1^\dagger + e^{-i\theta}\hat{a}_1)/\sqrt{2}$.
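As a quick numerical illustration of these relations, the following minimal sketch (our own, in Python/NumPy, not part of the original analysis) checks that the two conjugate quadrature variances of a single-mode squeezed vacuum multiply to 1/4 and that the mean photon number is $N = \sinh^2 r$:

```python
import numpy as np

def squeezed_vacuum_moments(r):
    """Quadrature variances and mean photon number of a single-mode
    squeezed vacuum with real squeezing parameter r > 0 (x anti-squeezed)."""
    var_x = np.exp(2 * r) / 2      # anti-squeezed quadrature
    var_p = np.exp(-2 * r) / 2     # squeezed quadrature
    n_mean = np.sinh(r) ** 2       # average photon number N = sinh^2 r
    return var_x, var_p, n_mean

r = 1.5
var_x, var_p, N = squeezed_vacuum_moments(r)
# minimum-uncertainty check: product of conjugate-quadrature variances is 1/4
assert abs(var_x * var_p - 0.25) < 1e-12
# for a zero-mean Gaussian state, <n> = (Var[x] + Var[p] - 1) / 2
assert abs((var_x + var_p - 1) / 2 - N) < 1e-12
print(f"Var[x]={var_x:.4f}, Var[p]={var_p:.4f}, N={N:.4f}")
```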
Since the linear network $\hat{U}_\varphi$ is arbitrary, the average number of photons that can actually be detected after the interferometric evolution ranges between 0 and N. Naturally, if this number were small, or far from N, we would expect a sub-optimal performance of the estimation scheme, since most of the photons would come out of the network $\hat{U}_\varphi$ from channels that are not observed, and information on $\varphi$ would in this way be lost. Moreover, it may happen that the transformation that the network imposes on the probe in the transition to the first output port is trivial, namely that the element $(U_\varphi)_{11}$ of the transition amplitude matrix $U_\varphi$ does not depend on $\varphi$. This occurrence would preclude the probe from acquiring any observable information on the parameter. In order to prevent these conditions from happening, this model includes the presence of two auxiliary linear and passive networks acting on the probe, $\hat{V}_{\mathrm{in}}$ before and $\hat{V}_{\mathrm{out}}$ after the linear network $\hat{U}_\varphi$ (see Figure 2). The first auxiliary stage $\hat{V}_{\mathrm{in}}$ can be understood as a scattering network, which distributes the probe over multiple input channels of the parameter-dependent network $\hat{U}_\varphi$. The purpose of the second stage $\hat{V}_{\mathrm{out}}$ is instead to refocus the probe, after the interaction with $\hat{U}_\varphi$, into the only observed channel. The unitary matrix describing the overall network is thus
$u_\varphi = V_{\mathrm{out}}\, U_\varphi\, V_{\mathrm{in}},$   (3)
given by the matrix product of the three single matrix representations V out , U φ and V in .
Since in this model all the photons are injected in the first input channel, which is also the only channel observed at the output, the only relevant transition amplitude is the element $(u_\varphi)_{11}$ of the overall unitary matrix in Equation (3). We can then rewrite
$(u_\varphi)_{11} = (V_{\mathrm{out}}\, U_\varphi\, V_{\mathrm{in}})_{11} = \sqrt{P_\varphi}\, e^{i\gamma_\varphi},$   (4)
where $P_\varphi = |(V_{\mathrm{out}}\, U_\varphi\, V_{\mathrm{in}})_{11}|^2$ is the probability that a single photon injected in the first port is detected at the first output port of the overall network, and $\gamma_\varphi = \arg\!\left((V_{\mathrm{out}}\, U_\varphi\, V_{\mathrm{in}})_{11}\right)$ is the phase acquired by the probe during the interferometric evolution. The Gaussian nature of the probe and of the homodyne measurements yields a Gaussian probability density function
$p_\varphi(x) = \frac{1}{\sqrt{2\pi\sigma_\varphi^2}}\, e^{-\frac{x^2}{2\sigma_\varphi^2}},$   (5)
which governs the outcomes of the homodyne detection [18,19,20,21]. The univariate Gaussian probability density function in Equation (5) is centred at zero due to the absence of a displacement in the probe, while its variance is given by (see Appendix A)
$\sigma_\varphi^2 = \frac{1 - P_\varphi}{2} + \frac{P_\varphi}{2}\left[\cosh(2r) + \cos(2\gamma_\varphi - 2\theta)\sinh(2r)\right],$   (6)
where θ is the phase of the local oscillator.
Once the probability density function p φ ( x ) is known, it is possible to evaluate the Fisher information [44,45]
$F(\varphi) = \int_{\mathbb{R}} \mathrm{d}x\; p_\varphi(x)\left(\frac{\mathrm{d}}{\mathrm{d}\varphi}\ln p_\varphi(x)\right)^{2}$   (7)
of the estimation scheme, which in turn fixes the ultimate precision $\delta\varphi_{\min}$ achievable in the estimation of $\varphi$ through $\nu$ iterations of the measurement, given by the Cramér–Rao bound [44,45]
$\delta\varphi_{\min} = \frac{1}{\sqrt{\nu\, F(\varphi)}}.$   (8)
For a Gaussian distribution centred on zero, the Fisher information reads (see Appendix B)
$F(\varphi) = \frac{1}{2}\left(\frac{\partial_\varphi\sigma_\varphi^2}{\sigma_\varphi^2}\right)^{2},$   (9)
where $\partial_\varphi := \mathrm{d}/\mathrm{d}\varphi$. We notice from Equation (6) that all the information on the parameter $\varphi$ is encoded in the variance $\sigma_\varphi^2$ of the measured quadrature $\hat{x}_\theta$ through the two quantities $P_\varphi$ and $\gamma_\varphi$. Thus, we can split $\partial_\varphi\sigma_\varphi^2$ into two contributions, one containing the derivative of $P_\varphi$, the other the derivative of $\gamma_\varphi$, namely
$\partial_\varphi\sigma_\varphi^2 = (\partial_\varphi P_\varphi)\,\partial_P\sigma_\varphi^2 + (\partial_\varphi\gamma_\varphi)\,\partial_\gamma\sigma_\varphi^2,$   (10)
where $\partial_P$ and $\partial_\gamma$ denote the derivatives with respect to $P_\varphi$ and $\gamma_\varphi$, respectively, so that
$\partial_P\sigma_\varphi^2 = \frac{1}{2}\left[-1 + \cosh(2r) + \cos(2\gamma_\varphi - 2\theta)\sinh(2r)\right], \qquad \partial_\gamma\sigma_\varphi^2 = -P_\varphi\sin(2\gamma_\varphi - 2\theta)\sinh(2r).$   (11)
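The quantities in Equations (6) and (9)–(11) can be evaluated directly. The following minimal sketch (our own illustration; the choice $\gamma_\varphi = \varphi$ with a constant $P_\varphi$ is purely hypothetical) assembles the Fisher information of Equation (9) from the decomposition of Equations (10) and (11):

```python
import numpy as np

def sigma2(P, gamma, theta, r):
    """Variance of the measured quadrature x_theta, Eq. (6)."""
    return (1 - P) / 2 + (P / 2) * (np.cosh(2 * r)
                                    + np.cos(2 * gamma - 2 * theta) * np.sinh(2 * r))

def fisher_single_homodyne(P, dP, gamma, dgamma, theta, r):
    """Fisher information of Eq. (9), assembled from Eqs. (10)-(11)."""
    s2 = sigma2(P, gamma, theta, r)
    dP_s2 = 0.5 * (-1 + np.cosh(2 * r) + np.cos(2 * gamma - 2 * theta) * np.sinh(2 * r))
    dg_s2 = -P * np.sin(2 * gamma - 2 * theta) * np.sinh(2 * r)
    ds2 = dP * dP_s2 + dgamma * dg_s2
    return 0.5 * (ds2 / s2) ** 2

# hypothetical parametrisation: gamma_phi = phi, P_phi constant and equal to one
phi, r = 0.3, 1.0
N = np.sinh(r) ** 2
theta = phi - np.pi / 2 - 0.25 / N      # gamma - theta = pi/2 + k/N with k = 1/4, cf. condition (14)
print(fisher_single_homodyne(P=1.0, dP=0.0, gamma=phi, dgamma=1.0, theta=theta, r=r))
```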

2.2. Heisenberg Scaling

Generally, without imposing any condition on the setup, this model does not achieve the Heisenberg scaling in the precision for the estimation of $\varphi$. In fact, we can explicitly rewrite the variance $\sigma_\varphi^2$ in Equation (6) in terms of the average number of photons $N = \sinh^2 r$ in the probe
$\sigma_\varphi^2 = \frac{1 - P_\varphi}{2} + \frac{P_\varphi}{2}\left[1 + 2N + 2\cos(2\gamma_\varphi - 2\theta)\sqrt{N(N+1)}\right] = N P_\varphi\left(1 + \cos(2\gamma_\varphi - 2\theta)\right) + O(1),$   (12)
where $O(1)$ is a term of order equal to or smaller than 1, negligible in the asymptotic regime of large N (we will say that, given two functions $f(N)$ and $g(N)$, $f(N) = O(g(N))$ when $\lim_{N\to\infty}|f(N)/g(N)| < +\infty$). We can also rewrite the derivatives $\partial_P\sigma_\varphi^2$ and $\partial_\gamma\sigma_\varphi^2$ in terms of N
$\partial_P\sigma_\varphi^2 = N + \cos(2\gamma_\varphi - 2\theta)\sqrt{N(N+1)} = N\left(1 + \cos(2\gamma_\varphi - 2\theta)\right) + O(1), \qquad \partial_\gamma\sigma_\varphi^2 = -2 P_\varphi\sin(2\gamma_\varphi - 2\theta)\sqrt{N(N+1)} = -2 N P_\varphi\sin(2\gamma_\varphi - 2\theta) + O(1).$   (13)
Plugging the asymptotics shown in Equations (12) and (13) into the expression of the Fisher information in Equation (9), we notice that the numerator of $F(\varphi)$ can be of the order $N^2$ at most. Since the denominator is in general of the order $N^2$ as well, this yields an overall general scaling of the Fisher information of $O(1)$, i.e., even lower than the standard quantum limit (SQL).
In order for this setup to reach the Heisenberg scaling, we thus need to impose some constraints which prevent the denominator of the Fisher information, i.e., the variance $\sigma_\varphi^2$, from growing with N. We show in Appendix C that the asymptotic conditions (given a function $f(N)$ and a finite sum $p(N)$ of powers of N, we will say that $f(N) \simeq p(N)$ when they show the same asymptotic behaviour; in formulas, $f(N) \simeq p(N)$ when $f(N) = p(N) + O(N^{s-\varepsilon})$, $\varepsilon > 0$, with s the exponent of the smallest power of N appearing in the sum $p(N)$)
$\gamma_\varphi - \theta \simeq \pm\frac{\pi}{2} + \frac{k}{N}, \qquad k \neq 0,$   (14)
$P_\varphi \simeq 1 - \frac{\ell}{N}, \qquad 0 \le \ell < N,$   (15)
need to be satisfied for large N, with $\ell$ and $k \neq 0$ arbitrary constants independent of N. We will discuss in more detail the physical meaning of these conditions in Section 2.3, and we will see that Equation (14) is a minimum-resolution requirement on the tuning of the local oscillator, while Equation (15) is the condition on the refocusing of the probe. Intuitively, Equations (14) and (15) ensure that $\sigma_\varphi^2$ in the denominator of $F(\varphi)$ does not grow as fast as $\partial_\varphi\sigma_\varphi^2$ in the numerator: indeed, we can see in Appendix C that, when these conditions hold, the variance $\sigma_\varphi^2$ becomes of the order $1/N$, while its derivative $\partial_\varphi\sigma_\varphi^2$ remains constant for large N. In particular, we show in Appendix C that the Fisher information in Equation (9) asymptotically reads
$F(\varphi) \simeq 8\,\varrho(k,\ell)\,(\partial_\varphi\gamma_\varphi)^{2}\, N^{2},$   (16)
proving the achievement of the Heisenberg scaling, with
$\varrho(k,\ell) = \left(\frac{8k}{1 + 16k^{2} + 4\ell}\right)^{2}$   (17)
a positive factor reaching its maximum value for $k = \pm 1/4$ and $\ell = 0$, namely $\varrho(\pm 1/4, 0) = 1$.
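As a sanity check of this asymptotic statement, the following numerical sketch (our own, assuming a $\varphi$-independent $P_\varphi$ so that only the $\gamma_\varphi$ contribution to Equation (10) survives) compares the exact Fisher information of Equation (9) with the asymptotic expression of Equations (16) and (17) under conditions (14) and (15); the ratio approaches one as N grows:

```python
import numpy as np

def fisher_exact(N, k=0.25, ell=0.0, dgamma=1.0):
    """Exact single-homodyne Fisher information, Eqs. (6) and (9)-(11),
    evaluated under conditions (14)-(15): gamma - theta = pi/2 + k/N, P = 1 - ell/N."""
    r = np.arcsinh(np.sqrt(N))                    # N = sinh^2 r
    P, delta = 1.0 - ell / N, np.pi / 2 + k / N   # delta = gamma - theta
    s2 = (1 - P) / 2 + (P / 2) * (np.cosh(2 * r) + np.cos(2 * delta) * np.sinh(2 * r))
    dg_s2 = -P * np.sin(2 * delta) * np.sinh(2 * r)
    ds2 = dgamma * dg_s2                          # P taken independent of phi here
    return 0.5 * (ds2 / s2) ** 2

def heisenberg_asymptote(N, k=0.25, ell=0.0, dgamma=1.0):
    """Asymptotic expression of Eq. (16) with the factor of Eq. (17)."""
    rho = (8 * k / (1 + 16 * k ** 2 + 4 * ell)) ** 2
    return 8 * rho * dgamma ** 2 * N ** 2

for N in (10, 100, 1000, 10000):
    print(N, fisher_exact(N) / heisenberg_asymptote(N))   # ratio -> 1
```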
Compared with the conditions found in the literature for single-parameter Gaussian estimation schemes based on squeezed-vacuum probes, which, adjusted to the notation employed so far, can be translated into $\gamma_\varphi - \theta = \tan^{-1}(e^{2r}) \simeq \pi/2 - (4N)^{-1}$ and $P_\varphi = 1$ [26,27], we see that Equations (14) and (15) achieve two important further results:
  • It is possible to loosen the optimal conditions found in the literature, while still reaching the Heisenberg scaling, at the price of a multiplying factor $0 < \varrho(k,\ell) \le 1$ which does not depend on N and hence does not ruin the scaling of the precision;
  • These conditions are explicitly expressed in terms of the average number N of photons in the probe and, therefore, in terms of the precision we want to achieve. In Section 2.3, we will discuss how this allows us to assess the precision needed to engineer suitable auxiliary stages V in and V out to reach the Heisenberg scaling, showing that it is possible to avoid an iterative adaptation of the optical network.
Lastly, we recall that it is always possible to asymptotically saturate the Cramér–Rao bound in Equation (8) in the limit $\nu\to+\infty$ of a large number $\nu$ of observations. In particular, the maximum-likelihood estimator $\tilde{\varphi}_{\mathrm{MLE}}$ is an asymptotically efficient and Gaussian estimator which can be obtained through the maximisation of the Likelihood function
$L(\varphi;\mathbf{x}) = \prod_{i=1}^{\nu} p_\varphi(x_i) = \frac{1}{(2\pi\sigma_\varphi^2)^{\nu/2}}\exp\!\left(-\frac{|\mathbf{x}|^2}{2\sigma_\varphi^2}\right),$   (18)
associated with the set { x i } i = 1 , , ν of the ν measurement outcomes of the quadrature field x ^ θ [44,45]. In Appendix D we see that the non-trivial solution which maximises the Likelihood function L ( φ ; x ) in Equation (18) is simply given by the estimator φ ˜ MLE satisfying
$\sigma^2(\tilde{\varphi}_{\mathrm{MLE}}) := \sigma^2_{\tilde{\varphi}_{\mathrm{MLE}}} = S(\mathbf{x})^2,$   (19)
where σ 2 ( φ ) is the variance σ φ 2 in Equation (6) as a function of φ , and S ( x ) 2 is the usual sample variance
$S(\mathbf{x})^2 = \frac{1}{\nu}\sum_{i=1}^{\nu} x_i^{2}.$   (20)
Generally, Equation (19) cannot be solved analytically, so that numerical methods need to be employed to find non-trivial solutions. Nevertheless, it is possible to find some exceptions, particularly for elementary functional dependencies of $P_\varphi$ and $\gamma_\varphi$ on the unknown parameter $\varphi$. For example, in the case for which $P_\varphi \equiv P$ is independent of $\varphi$, and the functional dependence $\gamma(\varphi) := \gamma_\varphi$ of the phase acquired by the probe on $\varphi$ is invertible, the function $\sigma^2(\varphi)$ in Equation (19) can be easily inverted as well, and the maximum-likelihood estimator reads
$\tilde{\varphi}_{\mathrm{MLE}}(\mathbf{x}) = \gamma^{-1}\!\left(\theta + \frac{1}{2}\left[2n\pi \pm \arccos\!\left(\frac{2S(\mathbf{x})^2 - 1 - 2P\sinh^2 r}{2P\sinh r\cosh r}\right)\right]\right).$   (21)
We can notice how, due to the presence of the cosine in $\sigma_\varphi^2$ in Equation (6), some prior knowledge on the parameter $\varphi$ is required in order to correctly choose the invertibility interval for $\cos(2\gamma(\varphi) - 2\theta)$, i.e., to choose the correct value of $n\in\mathbb{N}$ and the sign of the arccos function in Equation (21). In the next section, we will see how a classical prior knowledge of the parameter $\varphi$ is required to satisfy condition (15), achievable with a prior coarse estimation reaching an uncertainty of the order of $1/\sqrt{N}$. Such prior knowledge on the parameter, for a large enough N, can be employed to choose the correct invertibility interval.
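As a toy illustration of this estimator, the following Monte-Carlo sketch (our own, assuming $\gamma(\varphi) = \varphi$, a $\varphi$-independent $P$, and a local-oscillator phase tuned from the true value, which idealises the prior knowledge discussed above) simulates $\nu$ homodyne outcomes from Equation (5) and inverts Equation (19) through Equation (21):

```python
import numpy as np

rng = np.random.default_rng(seed=1)

def sigma2(phi, P, theta, r):
    """Variance of the measured quadrature, Eq. (6), with gamma(phi) = phi."""
    return (1 - P) / 2 + (P / 2) * (np.cosh(2 * r)
                                    + np.cos(2 * phi - 2 * theta) * np.sinh(2 * r))

# hypothetical true parameter and setup
phi_true, P, r, nu = 0.40, 1.0, 1.2, 20000
N = np.sinh(r) ** 2
theta = phi_true - np.pi / 2 + 0.25 / N     # local oscillator tuned as in condition (14)

# simulate nu homodyne outcomes distributed as in Eq. (5)
x = rng.normal(0.0, np.sqrt(sigma2(phi_true, P, theta, r)), size=nu)
S2 = np.mean(x ** 2)                        # sample variance, Eq. (20)

# invert Eq. (19) via Eq. (21); gamma(phi) = phi, so gamma^{-1} is the identity,
# and here n = 0 with the '+' sign selects the correct invertibility interval
arg = (2 * S2 - 1 - 2 * P * np.sinh(r) ** 2) / (2 * P * np.sinh(r) * np.cosh(r))
phi_mle = theta + 0.5 * np.arccos(np.clip(arg, -1.0, 1.0))
print(phi_true, phi_mle)
```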

2.3. Conditions for the Heisenberg Scaling

We can see that both conditions in Equations (14) and (15) are $\varphi$-dependent, suggesting that an adaptive procedure must take place in order to employ the estimation scheme described in Section 2.1, as is customary for ab initio Gaussian estimation strategies [24,27,35,36,37]. However, some considerations can be made in this regard.
Condition (14) fixes the phase of the quadrature $\hat{x}_\theta$ which needs to be measured. The quantity $\gamma_\varphi$ is in fact the phase acquired by the squeezed vacuum during the interferometric evolution from the first input port to the first output port, and for $\gamma_\varphi \simeq \varphi$ Equation (14) resembles the condition $\theta = \varphi + \tan^{-1}(e^{2r})$ found in the literature for single-phase estimation [24]. On the other hand, condition (14) is a looser condition to reach the Heisenberg scaling, and it relates the precision with which we are able to choose the phase $\theta$ of the local oscillator, given by the resolution of the homodyne detection apparatus, to the precision achievable in the estimation of $\varphi$. In particular, it is evident how the minimum resolution of the homodyne detector required to reach an uncertainty $\delta\varphi$ of order $1/N$ must be, in turn, of order $1/N$. This is in agreement with the common notion in metrology for which a sensor cannot detect changes in the measured quantity which are smaller than its resolution.
Interestingly, we notice from Equation (14) that the constant k cannot be equal to zero. Counterintuitively, the value $k = 0$ coincides with the choice of measuring the quadrature $\hat{x}_{\gamma_\varphi+\pi/2}$, namely the minimum-variance quadrature after the squeezed vacuum undergoes a phase-shift of magnitude $\gamma_\varphi$, i.e., after the interferometric evolution given by $\hat{u}_\varphi$ in Equation (3). This apparent incongruity can be explained by observing the expression of $\partial_\varphi\sigma_\varphi^2$ in Equation (10). Differently from displacement-encoding approaches, in which the information on the parameter is obtained from the transformation of the displacement of the probe, and thus minimizing the noise of the signal is always the optimal choice, here the value of $\varphi$ is encoded in the variance of the quadrature itself. For us to be able to extract information on the parameter, the variance of the signal needs to be sensitive to small variations in $\varphi$, i.e., the derivative $\partial_\varphi\sigma_\varphi^2$ must be non-vanishing. Of the two contributions to $\partial_\varphi\sigma_\varphi^2$ in Equation (10), the one originating from the variations in $P_\varphi$ is identically vanishing when condition (15) is satisfied, since $\partial_\varphi P_\varphi \simeq 0$ for $P_\varphi$ close to its maximum. The remaining contribution derives from the variations in the overall phase $\gamma_\varphi$, and the variance of the maximally squeezed quadrature $\hat{x}_{\gamma_\varphi+\pi/2}$ is a stationary point with respect to variations in the phase $\gamma_\varphi$ and is thus insensitive to $\varphi$, namely $\partial_\gamma\sigma_\varphi^2 = 0$ for $\gamma_\varphi - \theta = \pm\pi/2$ in Equation (11) (see Figure 3).
Condition (15) is the requirement that most of the photons injected into the network end up in the observed output channel. In fact, this condition can be rewritten as $N(1-P_\varphi) \simeq \ell$ in terms of the average number of photons $N(1-P_\varphi)$ that are not correctly refocused. Thus, condition (15) tells us that the number of photons which are not observed must be a constant $\ell$, not growing with N. In other words, this condition assures that most of the information on $\varphi$ encoded in the probe is not lost in channels that are not observed. As a matter of fact, we can see from Equation (6) that the variance of the observed quadrature $\hat{x}_\theta$ after the interferometric evolution is the convex combination of the variances of a squeezed vacuum and the pure vacuum, with coefficients $P_\varphi$ and $1-P_\varphi$, respectively. In order for this variance to be 'squeezed', in the sense that it is of order $1/N$, the contribution from the pure vacuum must be of order $1/N$, namely $1-P_\varphi = O(1/N)$.
This condition can also be seen as a requirement on the performance of the refocusing stage $\hat{V}_{\mathrm{out}}$. In fact, in order to satisfy condition (15) for a given choice of $\hat{V}_{\mathrm{in}}$, the auxiliary stage $\hat{V}_{\mathrm{out}}$ must be chosen so that $|(V_{\mathrm{out}}\, U_\varphi\, V_{\mathrm{in}})_{11}|^2 \simeq 1 - \ell/N$. As discussed earlier, this implies that, in general, the auxiliary stage $\hat{V}_{\mathrm{out}}$ which satisfies this condition depends on the value of the parameter itself, requiring an adaptive approach to find an optimal refocusing stage to reach the Heisenberg scaling. We show now that the information on $\varphi$ required to engineer an adequate refocusing stage to reach the Heisenberg scaling can be obtained through a classical estimation strategy, namely one which achieves the scaling $1/\sqrt{N}$ typical of the shot-noise limit. This result is due to the structure of $P_\varphi = |(V_{\mathrm{out}}\, U_\varphi\, V_{\mathrm{in}})_{11}|^2$, which is essentially a transition probability $P_\varphi = |\mathbf{v}_{\mathrm{out}}^\dagger \cdot \mathbf{v}_{\mathrm{in}}|^2$ between the unit vectors $\mathbf{v}_{\mathrm{in}} = U_\varphi V_{\mathrm{in}}\,\mathbf{e}_1$ and $\mathbf{v}_{\mathrm{out}} = V_{\mathrm{out}}^\dagger\,\mathbf{e}_1$, with $\mathbf{e}_1 = (1, 0, \dots, 0)^T$. Hence, a small tilt of order $O(1/\sqrt{N})$ between the unit vectors $\mathbf{v}_{\mathrm{in}}$ and $\mathbf{v}_{\mathrm{out}}$ yields only a quadratic reduction in their transition probability. To prove this, we will call $\varphi_{\mathrm{cl}}$ the rough guess of the value of $\varphi$ that is sufficiently precise to engineer a refocusing stage $\hat{V}_{\mathrm{out}}$ which satisfies condition (15), and we will show that the estimation strategy needed to obtain this rough estimate of $\varphi$ is classical, namely that the error $\delta\varphi = \varphi - \varphi_{\mathrm{cl}}$ associated with the prior rough estimation is allowed to be of order $1/\sqrt{N}$. For a given choice of $\hat{V}_{\mathrm{in}}$ and $\ell$, we will call $\hat{V}_{\mathrm{out}}(\varphi_{\mathrm{cl}})$ a solution of Equation (15). The single-photon transition probability $P_\varphi$ appearing in this condition can be written as the squared complex modulus of the scalar product of two M-dimensional complex vectors $U_\varphi V_{\mathrm{in}}\,\mathbf{e}_1$ and $V_{\mathrm{out}}^\dagger(\varphi_{\mathrm{cl}})\,\mathbf{e}_1$, with $\mathbf{e}_1 = (1, 0, \dots, 0)^T$. We can then write
$P_\varphi = \left|\mathbf{e}_1^T\, V_{\mathrm{out}}(\varphi_{\mathrm{cl}})\, U_\varphi\, V_{\mathrm{in}}\, \mathbf{e}_1\right|^{2} \equiv \eta(\varphi, \varphi_{\mathrm{cl}}),$   (22)
where the transition probability $\eta(\varphi, \varphi_{\mathrm{cl}})$ is a smooth function of $\varphi$ and $\varphi_{\mathrm{cl}}$, with a locus of points of maxima along the condition $\varphi_{\mathrm{cl}} = \varphi$, since, for a perfect knowledge of the parameter, the auxiliary stage $\hat{V}_{\mathrm{out}}(\varphi_{\mathrm{cl}} = \varphi)$ would satisfy $\eta(\varphi,\varphi) = 1$. If the prior estimation $\varphi_{\mathrm{cl}}$ slightly deviates from the real value of the parameter $\varphi = \varphi_{\mathrm{cl}} + \delta\varphi$, we can write the expansion
$P_\varphi = \eta(\varphi, \varphi - \delta\varphi) = 1 - \left.\frac{\partial\eta(\varphi,x)}{\partial x}\right|_{x=\varphi}\delta\varphi + \frac{1}{2}\left.\frac{\partial^2\eta(\varphi,x)}{\partial x^2}\right|_{x=\varphi}\delta\varphi^{2} + O(\delta\varphi^{3}),$   (23)
where the derivative of $\eta(\varphi,\varphi_{\mathrm{cl}})$ is zero along the condition $\varphi = \varphi_{\mathrm{cl}}$. We can see, comparing Equations (23) and (15), that an error $\delta\varphi$ of order $1/\sqrt{N}$ suffices to correctly engineer a refocusing stage $\hat{V}_{\mathrm{out}}$ that allows for the Heisenberg scaling. It is then possible to conceive two-step ab initio protocols exploiting the model presented in this section: a first, coarse, classical estimation of the parameter $\varphi$ is performed and the rough estimate $\varphi_{\mathrm{cl}}$ is obtained, with an error $\delta\varphi = \varphi - \varphi_{\mathrm{cl}} = O(1/\sqrt{N})$ of the same order as the shot-noise limit. Then, the classical information obtained on $\varphi$ can be employed to engineer the refocusing stage $\hat{V}_{\mathrm{out}}$, once $\hat{V}_{\mathrm{in}}$ is fixed, so that the overall network satisfies condition (15), allowing us to reach the Heisenberg scaling through the quantum strategy described in Section 2.1.
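The quadratic suppression at the heart of this argument can be visualised with a toy refocusing probability (a hypothetical $\eta$, used only for illustration): a prior error of order $1/\sqrt{N}$ leaves the number of non-refocused photons $N(1-P_\varphi)$ bounded, as required by condition (15).

```python
import numpy as np

# Toy model (purely illustrative): refocusing probability with a maximum at phi_cl = phi,
# eta(phi, phi_cl) = cos^2(phi - phi_cl), standing in for Eq. (22).
def eta(phi, phi_cl):
    return np.cos(phi - phi_cl) ** 2

phi = 0.7
for N in (10, 100, 1000, 10000):
    dphi = 1.0 / np.sqrt(N)          # classical (shot-noise-limited) prior error
    P = eta(phi, phi - dphi)
    print(N, N * (1 - P))            # stays ~ constant: lost photons do not grow with N, cf. Eq. (15)
```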
Lastly, we notice that, in order to satisfy condition (15), it is also possible to optimize the input auxiliary stage $\hat{V}_{\mathrm{in}}$ while arbitrarily fixing the refocusing stage $\hat{V}_{\mathrm{out}}$. In such a case, identical considerations can be made regarding the possibility of a two-step protocol, since the optimization of $\hat{V}_{\mathrm{in}}$ still requires only a classical coarse estimation $\varphi_{\mathrm{cl}}$ of the parameter. Interestingly, only one of the auxiliary stages needs to be optimized, and thus depends on $\varphi_{\mathrm{cl}}$, whether it is $\hat{V}_{\mathrm{in}}$ or $\hat{V}_{\mathrm{out}}$. This leaves the choice of the second auxiliary stage completely arbitrary, provided that the pre-factor $(\partial_\varphi\gamma_\varphi)^2$ appearing in the Fisher information in Equation (16) is non-vanishing. Indeed, the condition $(\partial_\varphi\gamma_\varphi)^2 = 0$ corresponds to the situation in which the optimized network $\hat{u}_\varphi = \hat{V}_{\mathrm{out}}(\varphi_{\mathrm{cl}})\,\hat{U}_\varphi\,\hat{V}_{\mathrm{in}}$ acts trivially on the probe, namely without imprinting any information about $\varphi$. Remarkably, it has been shown that, for a random choice of the non-optimized auxiliary network, sampled uniformly within the set of all the possible linear networks, the pre-factor $(\partial_\varphi\gamma_\varphi)^2$ multiplying the scaling $N^2$ in the Fisher information is typically different from zero [34]. In other words, within certain non-restrictive regularity conditions and for linear networks with a large enough number of channels, the value of the pre-factor becomes essentially unaffected by the choice of the non-optimised auxiliary network. This important feature can be exploited for experimental applications, for example, employing the arbitrary non-optimised stage to manipulate the information encoded into the probe regarding the structure of a linear network with multiple unknown parameters, ultimately allowing chosen functions of such parameters to be estimated with Heisenberg-scaling sensitivity [46].

2.4. A Two-Channel Network

In this section, we will apply our model for the estimation of distributed parameters to a particular example of a 2-channel network, in which the unknown parameter $\varphi$ influences the reflectivity $\eta_\varphi$ of a beam-splitter and the magnitudes $\lambda_\varphi$ and $\lambda'_\varphi$ of two phase-shifts (see Figure 4). We can think of the global parameter $\varphi$ as an external physical property, such as the temperature or the magnitude of the electromagnetic field, affecting the components of the network $\hat{U}_\varphi$. We will suppose that the functional dependences of the phase-shifts $\lambda_\varphi$, $\lambda'_\varphi$ and of the reflectivity $\eta_\varphi$ on the true value of the parameter $\varphi$ are known and smooth, whether given by some law of nature or opportunely engineered. With reference to Figure 4, we can write the matrices representing the action of the beam-splitter and of the phase-shifts as
$U_{\mathrm{BS}}(\eta_\varphi) = \exp(i\,\eta_\varphi\,\sigma_2) = \begin{pmatrix}\cos\eta_\varphi & \sin\eta_\varphi\\ -\sin\eta_\varphi & \cos\eta_\varphi\end{pmatrix}, \qquad U_{\mathrm{PS}}(\lambda_\varphi, \lambda'_\varphi) = \exp\!\left(i\,\frac{\lambda_\varphi+\lambda'_\varphi}{2}\,\mathbb{I}_2 + i\,\frac{\lambda_\varphi-\lambda'_\varphi}{2}\,\sigma_3\right) = \begin{pmatrix}\exp(i\lambda_\varphi) & 0\\ 0 & \exp(i\lambda'_\varphi)\end{pmatrix},$   (24)
respectively, where σ i , i = 1 , 2 , 3 , is the i-th Pauli matrix and I 2 is the 2 × 2 identity matrix, so that the network U ^ φ is represented by the matrix
$U_\varphi = U_{\mathrm{PS}}(\lambda_\varphi, \lambda'_\varphi)\, U_{\mathrm{BS}}(\eta_\varphi) = \begin{pmatrix}\cos\eta_\varphi\,\exp(i\lambda_\varphi) & \sin\eta_\varphi\,\exp(i\lambda_\varphi)\\ -\sin\eta_\varphi\,\exp(i\lambda'_\varphi) & \cos\eta_\varphi\,\exp(i\lambda'_\varphi)\end{pmatrix}.$   (25)
We easily notice that $|(U_\varphi)_{11}|^2 = \cos^2\eta_\varphi$, which, in general, is different from one and thus does not satisfy condition (15), with the exception of the two values $\eta_\varphi = 0, \pi$ which correspond to the absence of mixing between the two modes. As described in the model earlier, we then add two auxiliary stages $V_{\mathrm{in}}$ and $V_{\mathrm{out}} \equiv V_{\mathrm{out}}(\varphi_{\mathrm{cl}})$, of which only one depends on a prior coarse estimation $\varphi_{\mathrm{cl}}$ of $\varphi$ realised with a classical strategy, so that $\varphi - \varphi_{\mathrm{cl}} = \delta\varphi = O(1/\sqrt{N})$. In particular we choose as input stage
$V_{\mathrm{in}} = U_{\mathrm{PS}}\!\left(\frac{\pi}{4}, -\frac{\pi}{4}\right) U_{\mathrm{BS}}\!\left(\frac{\pi}{4}\right) = \frac{1}{\sqrt{2}}\begin{pmatrix}\exp(i\frac{\pi}{4}) & \exp(i\frac{\pi}{4})\\ -\exp(-i\frac{\pi}{4}) & \exp(-i\frac{\pi}{4})\end{pmatrix},$   (26)
and as output stage
$V_{\mathrm{out}} = U_{\mathrm{BS}}\!\left(\frac{\pi}{4}\right) U_{\mathrm{PS}}\!\left(-\alpha_{\varphi_{\mathrm{cl}}}, \alpha_{\varphi_{\mathrm{cl}}}\right) = \frac{1}{\sqrt{2}}\begin{pmatrix}\exp(-i\alpha_{\varphi_{\mathrm{cl}}}) & \exp(i\alpha_{\varphi_{\mathrm{cl}}})\\ -\exp(-i\alpha_{\varphi_{\mathrm{cl}}}) & \exp(i\alpha_{\varphi_{\mathrm{cl}}})\end{pmatrix},$   (27)
where $\alpha_{\varphi_{\mathrm{cl}}} = (\lambda_{\varphi_{\mathrm{cl}}} - \lambda'_{\varphi_{\mathrm{cl}}})/2 - \pi/4$ is a quantity which can be obtained through a classical estimation $\varphi_{\mathrm{cl}}$ of $\varphi$. A straightforward calculation of $P_\varphi = |(V_{\mathrm{out}}(\varphi_{\mathrm{cl}})\, U_\varphi\, V_{\mathrm{in}})_{11}|^2$ yields
$P_\varphi = \frac{1}{2}\left[1 + \sin\!\left(\lambda_\varphi - \lambda'_\varphi - 2\alpha_{\varphi_{\mathrm{cl}}}\right)\right] = \frac{1}{2}\left[1 + \cos\!\left(\delta\lambda - \delta\lambda'\right)\right],$   (28)
where $\delta\lambda = \lambda_\varphi - \lambda_{\varphi_{\mathrm{cl}}}$ and $\delta\lambda' = \lambda'_\varphi - \lambda'_{\varphi_{\mathrm{cl}}}$ are the errors in the estimates of $\lambda_\varphi$ and $\lambda'_\varphi$ due to the imprecision of the classical estimation $\varphi_{\mathrm{cl}}$. We can then easily see that $P_\varphi$ in Equation (28) satisfies condition (15), since both the errors $\delta\lambda$ and $\delta\lambda'$ are of order $1/\sqrt{N}$, i.e., $\delta\lambda = (\partial_\varphi\lambda_\varphi)\,\delta\varphi = O(1/\sqrt{N})$, and similarly for $\delta\lambda'$, and thus we obtain from Equation (28)
$P_\varphi \simeq 1 - \frac{1}{4}\left(\partial_\varphi\lambda_\varphi - \partial_\varphi\lambda'_\varphi\right)^{2}\delta\varphi^{2}.$   (29)
In order to evaluate the Fisher information in Equation (16), we need to calculate both the phase $\gamma_\varphi$ acquired by the probe throughout the whole interferometric evolution, and the coefficient $\ell$. The phase $\gamma_\varphi$ is easily obtained as the complex phase of $(u_\varphi)_{11} \equiv (V_{\mathrm{out}}(\varphi_{\mathrm{cl}})\, U_\varphi\, V_{\mathrm{in}})_{11}$
$\gamma_\varphi = \frac{\lambda_\varphi + \lambda'_\varphi}{2} + \eta_\varphi + \frac{\pi}{2}.$   (30)
Since $\delta\varphi = O(1/\sqrt{N})$, we call h the finite N-independent constant such that $\delta\varphi \simeq h/\sqrt{N}$. The transition probability $P_\varphi$ can then be written as
$P_\varphi \simeq 1 - \frac{h^{2}}{4}\left(\partial_\varphi\lambda_\varphi - \partial_\varphi\lambda'_\varphi\right)^{2}\frac{1}{N},$   (31)
so that the factor $\ell = h^{2}(\partial_\varphi\lambda_\varphi - \partial_\varphi\lambda'_\varphi)^{2}/4$ appearing in the Fisher information in Equation (16) is easily evaluated by comparing Equations (15) and (31). The Fisher information can then be obtained from Equation (16), with $\partial_\varphi\gamma_\varphi$ given by Equation (30), and $\varrho(k,\ell)$ given by Equation (17), with k given by condition (14) on the local oscillator phase and $\ell = h^{2}(\partial_\varphi\lambda_\varphi - \partial_\varphi\lambda'_\varphi)^{2}/4$.
We notice from the expression of $\alpha_{\varphi_{\mathrm{cl}}} = (\lambda_{\varphi_{\mathrm{cl}}} - \lambda'_{\varphi_{\mathrm{cl}}})/2 - \pi/4$ that the unknown reflectivity $\eta_\varphi$ of the beam splitter does not influence the refocusing stage $\hat{V}_{\mathrm{out}}(\varphi_{\mathrm{cl}})$, but it appears in the phase $\gamma_\varphi$ acquired by the probe in Equation (30). In particular, if the two phases $\lambda_\varphi$ and $\lambda'_\varphi$ are vanishing, the dependence of $\alpha_{\varphi_{\mathrm{cl}}}$, and thus of the refocusing stage $\hat{V}_{\mathrm{out}}(\varphi_{\mathrm{cl}})$, on the classical estimation $\varphi_{\mathrm{cl}}$ of the parameter disappears completely. In other words, for $\lambda_\varphi = \lambda'_\varphi = 0$ this network transforms the reflectivity $\eta_\varphi$ of a beam splitter into the magnitude of a phase shift, independently of $\eta_\varphi$.
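A short numerical sketch (our own; the linear dependences of $\lambda_\varphi$, $\lambda'_\varphi$ and $\eta_\varphi$ on $\varphi$ are purely illustrative) builds the matrices of Equations (24)–(27) and checks the transition probability and the acquired phase against Equations (28) and (30):

```python
import numpy as np

def U_BS(eta):
    # beam splitter, Eq. (24)
    return np.array([[np.cos(eta),  np.sin(eta)],
                     [-np.sin(eta), np.cos(eta)]])

def U_PS(lam, lam_p):
    # pair of phase shifts, Eq. (24)
    return np.diag([np.exp(1j * lam), np.exp(1j * lam_p)])

# hypothetical dependence of the network components on the global parameter phi
lam   = lambda phi: 0.8 * phi
lam_p = lambda phi: 0.3 * phi
eta   = lambda phi: 0.5 * phi

def overall(phi, phi_cl):
    """(V_out U_phi V_in)_{11} for the two-channel example, Eqs. (25)-(27)."""
    U_phi = U_PS(lam(phi), lam_p(phi)) @ U_BS(eta(phi))
    V_in  = U_PS(np.pi / 4, -np.pi / 4) @ U_BS(np.pi / 4)
    alpha = (lam(phi_cl) - lam_p(phi_cl)) / 2 - np.pi / 4
    V_out = U_BS(np.pi / 4) @ U_PS(-alpha, alpha)
    return (V_out @ U_phi @ V_in)[0, 0]

phi_true, N = 1.1, 10000
phi_cl = phi_true - 1.0 / np.sqrt(N)      # coarse classical estimate
u11 = overall(phi_true, phi_cl)
P, gamma = abs(u11) ** 2, np.angle(u11)
dl, dlp = lam(phi_true) - lam(phi_cl), lam_p(phi_true) - lam_p(phi_cl)
print(P, (1 + np.cos(dl - dlp)) / 2)                                   # Eq. (28)
print(gamma % (2 * np.pi),
      ((lam(phi_true) + lam_p(phi_true)) / 2 + eta(phi_true) + np.pi / 2) % (2 * np.pi))  # Eq. (30)
```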

3. Quantum Estimation Based on Multi-Homodyne Measurements

In Section 2 we have presented a scheme for the estimation of a distributed parameter encoded in a multi-channel network, reaching the Heisenberg scaling by employing a squeezed vacuum state and homodyne detection performed at a single output channel. In particular, in Section 2.2, we have discussed in depth the conditions in Equations (14) and (15) which need to be satisfied to reach the Heisenberg scaling: Equation (14) imposes a minimum resolution in tuning the local oscillator phase, in order to infer the value of the parameter from the noise of a sufficiently squeezed quadrature. Equation (15) is a requirement on the refocusing of the probe into the only observed channel and, in order to satisfy it, a classical knowledge of the parameter is generally required to engineer the optimal refocusing network. A natural question that arises is whether it is possible to ease these conditions by carrying out some changes to our scheme. In particular, what would happen if homodyne measurements were performed not only at a single channel, but at all the output ports of the network instead? Would condition (15) become looser, allowing us to engineer the optimal auxiliary stages with even less information on the unknown parameter? Moreover, as we have already discussed in Section 1, a non-vanishing displacement in the probe would make it possible to infer the value of the parameter directly from the average of the quadrature with minimum variance, and not from its noise, which may be a more feasible approach in particular experimental scenarios. In this section, we will investigate a model that implements these changes (see Figure 5). We will see that, with these assumptions, not only is there no need to perform a prior estimation of the parameter to optimize the network, but the Heisenberg scaling can be achieved without employing an auxiliary stage in the first place. Moreover, the presence of displacement in the probe will allow us to perform the estimation by directly measuring the minimum-variance quadrature, relying on the information about the parameter encoded in the average signal of the homodyne, and not in its noise.

3.1. Setup

We will consider a linear and passive network U ^ φ , which depends on a single and generally distributed parameter, for example affecting several components of the network, as shown in Figure 1. Once again, U ^ φ admits a unitary matrix representation U φ obtained through Equation (1). Differently from Section 2 though, we consider as a probe the single-mode squeezed state
$|\psi_0\rangle = |\boldsymbol{\alpha}, \mathbf{r}\rangle \equiv \hat{D}(\boldsymbol{\alpha})\,\hat{S}(\mathbf{r})\,|\mathrm{vac}\rangle, \qquad \boldsymbol{\alpha} = (\alpha, 0, \dots, 0), \quad \mathbf{r} = (r, 0, \dots, 0),$   (32)
with $\hat{D}(\boldsymbol{\alpha}) = e^{\alpha(\hat{a}_1^\dagger - \hat{a}_1)}$, i.e., a squeezed coherent state with a mean number of photons $N = N_D + N_S = \alpha^2 + \sinh^2 r$, $\alpha, r > 0$, where we can introduce the displacement $\mathbf{d}$, given by $\alpha_i = (d_i + i\, d_{i+M})/\sqrt{2}$, $i = 1, \dots, M$. We remark that the choice $\alpha, r > 0$ is a specific ($\varphi$-independent) condition which is required in displacement-encoding approaches: squeezing and displacement can, in general, have different complex phases, and the condition $\alpha, r > 0$ assures that the squeezed quadrature ($\hat{x}_{\pi/2}$ for $r > 0$) is orthogonal, i.e., conjugated, to the displaced quadrature ($\hat{x}_0$ for $\alpha > 0$), and hence it is the most sensitive to the presence of phases. Moreover, in our model, we will consider homodyne detection in all M output channels of the linear network, so that M quadrature fields $\hat{x}_{i,\theta_i}$ are measured, where $\theta_i$ is the phase of the i-th local oscillator, $i = 1, \dots, M$.
Since we are observing all the output ports of the network, but a non-vanishing number of photons is injected only in the first channel, only the first column of the unitary matrix U φ is relevant in our model, consisting of the transition amplitude of single photons from the first to every channel of the network (see Equation (1)). We can thus employ the parametrisation
$(U_\varphi)_{i1} = \sqrt{P_i}\, e^{i\gamma_i},$   (33)
where $P_i$ is the probability that a single photon is transmitted through the linear network from the first to the i-th channel, and $\gamma_i$ is the phase that it would acquire (in order to keep the notation lighter, we are dropping the subscript $\varphi$ in $P_i$, $\gamma_i$, $\boldsymbol{\mu}$ and $\Sigma$; it should nonetheless be kept in mind that these quantities, in general, depend on the unknown parameter). Once again, due to the Gaussian nature of the model, the (joint) probability distribution associated with the outcome $\mathbf{x}$ of the M homodyne detectors is also Gaussian and reads [18,19,20,21]
$p_\varphi(\mathbf{x}) = \frac{1}{\sqrt{(2\pi)^M\det[\Sigma]}}\exp\!\left(-\frac{1}{2}(\mathbf{x}-\boldsymbol{\mu})^T\,\Sigma^{-1}\,(\mathbf{x}-\boldsymbol{\mu})\right).$   (34)
In Equation (34), both the covariance matrix Σ and the mean μ depend on the parameter φ . The elements of the covariance matrix Σ are evaluated in Appendix A, and read
$(\Sigma)_{ij} = \frac{\delta_{ij}}{2} + \sqrt{P_i P_j}\left[\cos(\bar{\gamma}_i - \bar{\gamma}_j)\sinh^2(r) + \cos(\bar{\gamma}_i + \bar{\gamma}_j)\cosh(r)\sinh(r)\right],$   (35)
where $\delta_{ij}$ is the Kronecker delta, $\bar{\gamma}_i = \gamma_i - \theta_i$ is the phase acquired at the output of the i-th channel relative to the corresponding local oscillator, and $\boldsymbol{\mu}$ is the mean vector (see Appendix A)
$\mu_i = d\,\sqrt{P_i}\,\cos\bar{\gamma}_i, \qquad i = 1, \dots, M.$   (36)
The determinant det [ Σ ] can also be written in compact form (see Appendix A)
$\det[\Sigma] = \frac{1}{2^{M}} + \frac{\sinh(r)}{2^{M-1}}\sum_{i=1}^{M} P_i\left(\sinh(r) + \cos(2\bar{\gamma}_i)\cosh(r)\right) - \frac{\sinh^2(r)}{2^{M-2}}\sum_{i=1}^{M}\sum_{j=i+1}^{M} P_i P_j \sin^{2}(\bar{\gamma}_i - \bar{\gamma}_j).$   (37)
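The closed form of Equation (37) can be checked directly against a numerical determinant of the matrix built from Equation (35); the following sketch (our own) does so for random probabilities and phases:

```python
import numpy as np

rng = np.random.default_rng(seed=0)

def Sigma(P, gbar, r):
    """Covariance matrix of the M measured quadratures, Eq. (35)."""
    sp = np.sqrt(P)
    M = len(P)
    S = 0.5 * np.eye(M)
    for i in range(M):
        for j in range(M):
            S[i, j] += sp[i] * sp[j] * (np.cos(gbar[i] - gbar[j]) * np.sinh(r) ** 2
                                        + np.cos(gbar[i] + gbar[j]) * np.cosh(r) * np.sinh(r))
    return S

def det_closed_form(P, gbar, r):
    """Closed form of det[Sigma], Eq. (37)."""
    M = len(P)
    out = 2.0 ** (-M) + np.sinh(r) / 2 ** (M - 1) * np.sum(
        P * (np.sinh(r) + np.cos(2 * gbar) * np.cosh(r)))
    for i in range(M):
        for j in range(i + 1, M):
            out -= np.sinh(r) ** 2 / 2 ** (M - 2) * P[i] * P[j] * np.sin(gbar[i] - gbar[j]) ** 2
    return out

M, r = 4, 0.9
P = rng.uniform(size=M); P /= P.sum()           # single-photon transition probabilities
gbar = rng.uniform(0, 2 * np.pi, size=M)        # phases relative to the local oscillators
print(np.linalg.det(Sigma(P, gbar, r)), det_closed_form(P, gbar, r))   # the two agree
```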
For a multivariate Gaussian distribution of the form shown in Equation (34), the Fisher information can be easily evaluated from its definition in Equation (7), employing the expression of the probability distribution $p_\varphi(\mathbf{x})$ in Equation (34), as (see Appendix B)
$F(\varphi) = \underbrace{\frac{1}{\det[\Sigma]}\,\partial_\varphi\boldsymbol{\mu}^{T}\, C\, \partial_\varphi\boldsymbol{\mu}}_{F_D(\varphi)} + \underbrace{\frac{1}{2}\left(\frac{\partial_\varphi\det[\Sigma]}{\det[\Sigma]}\right)^{2} - \frac{1}{2\det[\Sigma]}\,\mathrm{Tr}\!\left[(\partial_\varphi\Sigma)(\partial_\varphi C)\right]}_{F_S(\varphi)},$   (38)
where $C = \det[\Sigma]\,\Sigma^{-1}$ is the cofactor matrix of $\Sigma$ and $\mathrm{Tr}[A] = \sum_i A_{ii}$ denotes the trace of the matrix A. Compared with the Fisher information shown in Equation (9) for the model described in Section 2, the Fisher information for this setup includes an additional term $F_D(\varphi)$ which depends on the derivative of the average $\boldsymbol{\mu}$ with respect to the parameter $\varphi$. Moreover, the contribution $F_S(\varphi)$, representing the information on $\varphi$ encoded in the covariance matrix $\Sigma$, can be split into two terms, of which the first resembles the Fisher information in Equation (9) once we perform the substitution $\sigma_\varphi^2 \to \det[\Sigma]$: we will show in the following that, in the asymptotic regime of large $N_S$, this is the only contribution of $F_S(\varphi)$ which, besides $F_D(\varphi)$, reaches the Heisenberg scaling.
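The equivalence between Equation (38) and the standard expression $\partial_\varphi\boldsymbol{\mu}^T\Sigma^{-1}\partial_\varphi\boldsymbol{\mu} + \frac{1}{2}\mathrm{Tr}[(\Sigma^{-1}\partial_\varphi\Sigma)^2]$ for the Fisher information of a multivariate Gaussian can be verified numerically; the sketch below (our own, with an arbitrary illustrative parametrisation and finite-difference derivatives, not the optical model itself) evaluates both forms:

```python
import numpy as np

def fisher_gaussian(mu_fn, Sigma_fn, phi, eps=1e-6):
    """Fisher information of a multivariate Gaussian in the standard form
    and in the cofactor form of Eq. (38); derivatives by central finite differences."""
    mu, S = mu_fn(phi), Sigma_fn(phi)
    dmu = (mu_fn(phi + eps) - mu_fn(phi - eps)) / (2 * eps)
    dS = (Sigma_fn(phi + eps) - Sigma_fn(phi - eps)) / (2 * eps)
    Sinv = np.linalg.inv(S)
    standard = dmu @ Sinv @ dmu + 0.5 * np.trace(Sinv @ dS @ Sinv @ dS)

    detS = np.linalg.det(S)
    C = detS * Sinv                                   # cofactor matrix
    cof = lambda p: np.linalg.det(Sigma_fn(p)) * np.linalg.inv(Sigma_fn(p))
    dC = (cof(phi + eps) - cof(phi - eps)) / (2 * eps)
    ddet = (np.linalg.det(Sigma_fn(phi + eps)) - np.linalg.det(Sigma_fn(phi - eps))) / (2 * eps)
    cofactor_form = (dmu @ C @ dmu) / detS + 0.5 * (ddet / detS) ** 2 \
                    - 0.5 / detS * np.trace(dS @ dC)
    return standard, cofactor_form

# arbitrary illustrative parametrisation
mu_fn = lambda phi: np.array([np.cos(phi), np.sin(phi), 0.2 * phi])
Sigma_fn = lambda phi: np.diag([1.0, 2.0, 0.5]) + 0.1 * phi * np.ones((3, 3))

print(fisher_gaussian(mu_fn, Sigma_fn, phi=0.4))   # the two forms agree
```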

3.2. Heisenberg Scaling

Similarly to what happens to the setup described in Section 2, the Fisher information in Equation (38) does not generally reach the Heisenberg scaling unless certain conditions are met. To show this, it is necessary to evaluate all the contributions of F ( φ ) in terms of the number of photons N D and N S in the asymptotic regime. First, it is convenient to express the cofactor matrix C explicitly in terms of the squeezing parameter r. In Appendix B, we see that a closed form for C in such terms exists
$C_{ss} = \frac{1}{2^{M-1}} + \frac{1}{2^{M-2}}\sum_{\substack{i=1\\ i\neq s}}^{M}\left(\Sigma_{ii} - \frac{1}{2}\right) - \frac{1}{2^{M-3}}\sum_{\substack{i=1\\ i\neq s}}^{M}\sum_{\substack{j=i+1\\ j\neq s}}^{M} S_{iij},$   (39)
$C_{st} = -\frac{1}{2^{M-2}}\,\Sigma_{st} + \frac{1}{2^{M-3}}\sum_{\substack{i=1\\ i\neq s,t}}^{M} S_{sti}, \qquad s\neq t,$   (40)
where
$S_{sti} = \sinh^2(r)\,\sqrt{P_s P_t}\,P_i\,\sin(\bar{\gamma}_s - \bar{\gamma}_i)\,\sin(\bar{\gamma}_t - \bar{\gamma}_i),$
and Σ i j are shown in Equation (35). We then notice that, in order to analyse the generic asymptotic behaviour of the Fisher information in Equation (38), it suffices to separately examine the asymptotics of the terms μ , Σ i j , S s t i , det [ Σ ] , and of their derivatives.
From Equations (35)–(37), (39) and (40) we see that $\Sigma$, S and $\det[\Sigma]$ are all of order $N_S$ in general, while $\boldsymbol{\mu}$ is of order $\sqrt{N_D}$. The same asymptotic behaviours also hold for their respective derivatives, since both $\partial_\varphi\bar{\gamma}_i \equiv \partial_\varphi\gamma_i$ and $\partial_\varphi P_i$ are independent of the probe, and thus of $N_D$ and $N_S$. We can thus see from the expression of $F(\varphi)$ in Equation (38) that its numerator grows at most with $N^2$, while its denominator $\det[\Sigma]$ in general grows with $N_S$. Therefore, in order for $F(\varphi)$ to reach the Heisenberg scaling, namely a scaling of order $N^2$, some conditions must be imposed so that the denominators in $F_D(\varphi)$ and $F_S(\varphi)$ do not grow for large $N_S$. In Appendix C, it can be seen that, to achieve the Heisenberg scaling, the determinant $\det[\Sigma]$ must be of the order $N_S^{-1}$, similarly to what happens to $\sigma_\varphi^2$ in the single-homodyne setup in Section 2.2, and the conditions for this to occur are
$\bar{\gamma}_i = \pm\frac{\pi}{2} + O(N_S^{-1}), \qquad i = 1, \dots, M.$   (41)
When these conditions hold, we can introduce the finite (possibly vanishing) quantities $k_i = \lim_{N_S\to\infty} N_S\,(\bar{\gamma}_i \mp \pi/2)$, so that the determinant $\det[\Sigma]$ reduces to (see Appendix C)
$\det[\Sigma] \simeq \frac{1}{2^{M-2}\,N_S}\left[\left(\sum_{i=1}^{M} P_i k_i\right)^{\!2} + \frac{1}{16}\right],$   (42)
while $\partial_\varphi\det[\Sigma]$, $\partial_\varphi\Sigma$, $\partial_\varphi C$ and C tend to constant values, and $\partial_\varphi\boldsymbol{\mu}$ scales as $\sqrt{N_D}$. We can easily see that this makes only the first two terms of $F(\varphi)$ in Equation (38) dominant for large $N_D$ and $N_S$. When conditions (41) are met, we can thus neglect the last term in Equation (38), and the Fisher information
$F(\varphi) \simeq \frac{1}{\det[\Sigma]}\,\partial_\varphi\boldsymbol{\mu}^{T}\, C\, \partial_\varphi\boldsymbol{\mu} + \frac{1}{2}\left(\frac{\partial_\varphi\det[\Sigma]}{\det[\Sigma]}\right)^{2} \simeq 8\,(\gamma')^{2}_{\mathrm{avg}}\left[2\,\zeta(k_{\mathrm{avg}})\,N_D N_S + \varrho(k_{\mathrm{avg}})\,N_S^{2}\right],$   (43)
asymptotically reaches the Heisenberg scaling in N D N S and N S 2 (see Appendix C), where we introduced the quantities
$k_{\mathrm{avg}} \equiv \sum_{i=1}^{M} P_i\,k_i, \qquad (\gamma')_{\mathrm{avg}} \equiv \sum_{i=1}^{M} P_i\,\partial_\varphi\gamma_i,$   (44)
while
$\zeta(x) = \left(16x^{2} + 1\right)^{-1}, \qquad \varrho(x) = (8x)^{2}/\left(16x^{2} + 1\right)^{2}$   (45)
are non-negative, even functions which reach their maxima at $x = 0$ and $x = \pm 1/4$, respectively, namely $\zeta(0) = 1$ and $\varrho(\pm 1/4) = 1$.
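A quick numerical check (our own sketch) of the asymptotic behaviour of the determinant: evaluating the exact expression of Equation (37) under conditions (41) and comparing it with Equation (42), the ratio tends to one as $N_S$ grows.

```python
import numpy as np

def det_Sigma(P, gbar, r):
    """Closed form of det[Sigma], Eq. (37)."""
    M = len(P)
    out = 2.0 ** (-M) + np.sinh(r) / 2 ** (M - 1) * np.sum(
        P * (np.sinh(r) + np.cos(2 * gbar) * np.cosh(r)))
    for i in range(M):
        for j in range(i + 1, M):
            out -= np.sinh(r) ** 2 / 2 ** (M - 2) * P[i] * P[j] * np.sin(gbar[i] - gbar[j]) ** 2
    return out

P = np.array([0.5, 0.3, 0.2])           # illustrative transition probabilities (sum to 1)
k = np.array([0.25, -0.10, 0.05])       # finite offsets k_i as defined above Eq. (42)
M = len(P)
for NS in (10, 100, 1000, 10000):
    r = np.arcsinh(np.sqrt(NS))         # N_S = sinh^2 r
    gbar = np.pi / 2 + k / NS           # conditions (41)
    exact = det_Sigma(P, gbar, r)
    asym = (np.dot(P, k) ** 2 + 1 / 16) / (2 ** (M - 2) * NS)   # Eq. (42)
    print(NS, exact / asym)             # ratio -> 1
```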
It is now possible to compare the Fisher information in Equation (43), achieved with the present scheme, with the Fisher information in Equation (16), obtained with the setup for distributed metrology employing a squeezed vacuum state and homodyne detection on a single channel. Since this setup employs a squeezed probe with a non-vanishing displacement, it lends itself to both displacement-based and squeezing-based estimation approaches. This is reflected by the presence of two separate contributions to the Fisher information in Equation (43): the first is given by the variations in the displacement $\boldsymbol{\mu}$, the second by the variations in the determinant $\det[\Sigma]$, with respect to changes in the value of $\varphi$. Both terms present a pre-factor $(\gamma')^2_{\mathrm{avg}}$, shown in Equation (44), which resembles the pre-factor $(\partial_\varphi\gamma_\varphi)^2$ in Equation (16) for the single-homodyne counterpart: in particular, $(\gamma')_{\mathrm{avg}}$ is a weighted average of the sensitivities of the complex phases $\gamma_i$ to changes in the parameter $\varphi$, each one weighted by the 'fraction' $P_i$ of the probe undergoing the phase shift $\gamma_i$. Indeed, we can see that $(\gamma')^2_{\mathrm{avg}}$ reduces to the single-homodyne counterpart when the whole probe is refocused into a single channel, say the first, so that $P_1 = 1$ and $P_i = 0$ for $i = 2, \dots, M$, i.e., $(\gamma')^2_{\mathrm{avg}} = (\partial_\varphi\gamma_1)^2$.
In the term of the Fisher information associated with the squeezing-encoding in Equation (43), the factor $\varrho(k_{\mathrm{avg}})$ coincides in turn with the function $\varrho(k,\ell)$ in Equation (17) for $\ell = 0$, i.e., the ideal case of perfect refocusing of the probe with all photons being observed, after the substitution $k \to k_{\mathrm{avg}}$, namely the weighted average of the coefficients $k_i$ with the same weights $P_i$. In other words, this term can be thought of as a generalisation of the Fisher information for a single homodyne due to the presence of multiple observed channels. Noticeably, this term can be set to zero only for $N_S = 0$, in which case the whole expression in Equation (43) vanishes, ruining the Heisenberg scaling, or for $k_{\mathrm{avg}} = 0$. In fact, we can see from Equation (37) that the condition $k_{\mathrm{avg}} = 0$ (i.e., $\bar{\gamma}_i = \pm\pi/2$, namely when the quadratures with minimum variances are measured at each channel) minimises $\det[\Sigma]$, which becomes a stationary point with respect to the variations in $\varphi$, and $\partial_\varphi\det[\Sigma]$ in Equation (43) vanishes. On the other hand, the first contribution to Equation (43) is instead a new term, not present in the Fisher information for the single homodyne, deriving from the information on the parameter encoded in the displacement $\boldsymbol{\mu}$. This term achieves the Heisenberg scaling in $N_D N_S$, in the sense that it reaches the Heisenberg scaling in $N = N_D + N_S$ if both $N_D$ and $N_S$ grow with N, i.e., if $N_D = \beta N$ and $N_S = (1-\beta)N$ with $0 < \beta < 1$ independently of N. The function $\zeta(k_{\mathrm{avg}})$ in Equation (45) does not have roots, hence the first contribution to F vanishes only for $N_D = 0$, i.e., for a squeezed vacuum as a probe.
Finally, we will now write the Likelihood function L ( φ ; x ) for the setup described here, and discuss the maximum-likelihood estimator φ ˜ MLE saturating the Cramér-Rao bound shown in Equation (8). After performing ν measurements of the field quadratures x ^ i , θ i , the Likelihood of the outcomes ( x 1 , , x ν ) is given by
$L(\varphi; \mathbf{x}_1, \dots, \mathbf{x}_\nu) = \prod_{j=1}^{\nu} p_\varphi(\mathbf{x}_j),$   (46)
with p φ ( x j ) found in Equation (34). In Appendix D, we show that the maximum-likelihood estimator, a non-trivial solution of the maximisation of the Likelihood function in Equation (46), is implicitly given by the estimator φ ˜ MLE which satisfies the equation
$0 = \frac{1}{2}\,\mathrm{Tr}\!\left[\partial_\varphi\Sigma^{-1}\left(\nu\,\Sigma - \sum_{j=1}^{\nu}(\mathbf{x}_j - \boldsymbol{\mu})(\mathbf{x}_j - \boldsymbol{\mu})^{T}\right)\right]_{\varphi = \tilde{\varphi}_{\mathrm{MLE}}} + \left.(\partial_\varphi\boldsymbol{\mu})^{T}\,\Sigma^{-1}\left(\sum_{j=1}^{\nu}\mathbf{x}_j - \nu\,\boldsymbol{\mu}\right)\right|_{\varphi = \tilde{\varphi}_{\mathrm{MLE}}},$   (47)
where $\Sigma$ is the covariance matrix in Equation (35), and $\boldsymbol{\mu}$ the mean vector in Equation (36). This equation cannot, in general, be solved analytically, hence numerical methods typically need to be employed to find $\tilde{\varphi}_{\mathrm{MLE}}$. On the other hand, Equation (47) simplifies in certain particular cases. For example, when measuring all the minimum-variance quadratures, so that $\bar{\gamma}_i = \pm\pi/2$ in Equation (41), and the probabilities $P_i$ are independent of $\varphi$, we can see from Equation (35) that $\partial_\varphi\Sigma = 0$. In this case, the Likelihood equation becomes
$0 = \left.(\partial_\varphi\boldsymbol{\mu})^{T}\,\Sigma^{-1}\left(\boldsymbol{\mu} - \frac{1}{\nu}\sum_{j=1}^{\nu}\mathbf{x}_j\right)\right|_{\varphi = \tilde{\varphi}_{\mathrm{MLE}}},$   (48)
where the term on the right-hand side can be seen as a linear combination of the quantities $\boldsymbol{\mu} - \tilde{\boldsymbol{\mu}}$, where $\tilde{\boldsymbol{\mu}} = \sum_{j=1}^{\nu}\mathbf{x}_j/\nu$ is an estimator of the mean $\boldsymbol{\mu}$. We see here how the displacement of the probe allows us to perform the estimation of $\varphi$ through the sample mean $\tilde{\boldsymbol{\mu}}$, i.e., the average of the outcomes of the homodyne measurements. On the other hand, for a squeezed vacuum as a probe, $\boldsymbol{\mu} = 0$, so that Equation (47) becomes
$0 = \frac{1}{2}\,\mathrm{Tr}\!\left[\partial_\varphi\Sigma^{-1}\left(\Sigma - \frac{1}{\nu}\sum_{j=1}^{\nu}\mathbf{x}_j\,\mathbf{x}_j^{T}\right)\right]_{\varphi = \tilde{\varphi}_{\mathrm{MLE}}},$   (49)
where it is possible to recognise the sample covariance matrix $\tilde{\Sigma} = \sum_{j=1}^{\nu}\mathbf{x}_j\mathbf{x}_j^{T}/\nu$, an estimator of the covariance matrix $\Sigma$.
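For the displacement-encoding case of Equation (48), the likelihood equation can be solved with elementary numerical methods; the following toy sketch (our own, with a hypothetical two-channel mean vector and a $\varphi$-independent covariance) recovers the parameter from the sample mean by bisection:

```python
import numpy as np

rng = np.random.default_rng(seed=3)

# toy displacement-encoding model (illustrative): fixed covariance, mean rotating with phi
Sigma = np.array([[0.6, 0.1], [0.1, 0.4]])
mu = lambda phi: np.array([np.cos(phi), np.sin(phi)])
dmu = lambda phi: np.array([-np.sin(phi), np.cos(phi)])

phi_true, nu = 0.9, 5000
x = rng.multivariate_normal(mu(phi_true), Sigma, size=nu)
xbar = x.mean(axis=0)                           # sample mean of the homodyne outcomes

Sinv = np.linalg.inv(Sigma)
# likelihood equation (48): (d mu)^T Sigma^{-1} (mu(phi) - xbar) = 0, solved by bisection
f = lambda phi: dmu(phi) @ Sinv @ (mu(phi) - xbar)
lo, hi = 0.0, np.pi / 2
for _ in range(60):
    mid = 0.5 * (lo + hi)
    lo, hi = (mid, hi) if f(lo) * f(mid) > 0 else (lo, mid)
print(phi_true, 0.5 * (lo + hi))
```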

3.3. Conditions for the Heisenberg Scaling

For the model introduced in this section, some considerations regarding the conditions in Equation (41) can also be drawn, especially in light of the feature of the single-homodyne, squeezed-vacuum scheme discussed in Section 2.3.
In fact, these conditions do nothing but fix the phases $\theta_i$ of the quadratures $\hat{x}_{i,\theta_i}$ that need to be measured to achieve the Heisenberg scaling. Regarding the minimum resolution of the homodyne sensors, it appears that there is no evident advantage in this setup compared to the single-homodyne scheme, since the resolution required to tune the local oscillators of each channel is still of order $1/N$. On the other hand, introducing displacement in the probe allows, as previously discussed, the value of the parameter to be inferred from the information inscribed in the average $\boldsymbol{\mu}$ of the measured quadratures. Therefore, it is possible with this scheme to exactly measure the minimum-variance quadratures $\hat{x}_{i,\gamma_i\pm\pi/2}$, a possibility that was prevented in the previous scheme due to the requirement $k \neq 0$ in Equation (14). Although the situation for which $k_i = 0$ for $i = 1, \dots, M$ sets to zero the contribution $F_S(\varphi)$ in Equation (43), associated with the information encoded in the covariance matrix $\Sigma$, the first contribution $F_D(\varphi)$ of the Fisher information still reaches the Heisenberg scaling.
Another interesting feature of this protocol, which differentiates it from its single-homodyne counterpart, is that it does not require any adaptation of the network: since every output channel is observed, no condition on the transition probabilities $P_i$ is required. Therefore, not only is there no requirement for a $\varphi$-dependent auxiliary stage, but there is no need for an auxiliary stage to reach the Heisenberg scaling in the first place. However, the precision of the estimation is still affected by the network through the terms $k_{\mathrm{avg}}$ and $(\gamma')_{\mathrm{avg}}$, which appear in the constant factors multiplying the scaling in the Fisher information in Equation (43). In fact, we can see from Equation (44) that these two quantities depend on the transition probabilities $P_i$ and on the derivatives of the complex phases $\partial_\varphi\gamma_i$. In particular, $F(\varphi)$ can be vanishing for exceptionally poorly conceived networks, for which $(\gamma')_{\mathrm{avg}} = 0$: for example, a network for which $\gamma_i$ is independent of $\varphi$ for all values of i such that $P_i \neq 0$ is associated with a vanishing factor $(\gamma')_{\mathrm{avg}}$, as we can see from its definition in Equation (44). In this case, adding a $\varphi$-independent auxiliary network V, either at the input or at the output of $U_\varphi$, would modify the transition amplitudes of the overall interferometric evolution, and ultimately yield a non-vanishing value of $(\gamma')_{\mathrm{avg}}$.

4. Conclusions

The recent advances in quantum metrology have made possible the realisation of protocols achieving super-sensitivity in the estimation of optical lengths, time delays and space-time distortions due to gravitational waves, refractive indices in given materials, density and thickness of biological samples up to the nanometer scale, temperatures, polarisations of light, magnitudes of fields and their gradients, and more. However, current quantum sensing technologies still present some limitations: impracticalities arise when tackling metrological schemes with quantum mechanics, whether caused by the requirement of adaptively optimising the estimation procedure to the unknown value of the parameter that needs to be measured, by the fragility of the metrological resources which yield super-sensitivity, or by the limited working range of the protocols. Moreover, most standard approaches to distributed quantum metrology suffer from a lack of universal estimation schemes, namely schemes which can operate independently of the nature and value of the unknown parameter, and of the unitary evolution which encodes the value of the parameter into the probe employed for the estimation.
In this work, we have reviewed in detail two schemes which address these limitations [42,43]. By employing analyses based on the Cramér–Rao bound, i.e., the ultimate precision achievable for a given estimation scheme, and on the Fisher information, we were able to assess the super-sensitivity of various feasible metrological setups, always achievable in the regime of large statistical samples through the maximum-likelihood estimator. We showed that, without making any assumptions on the structure of the multi-channel passive and linear network encoding the unknown parameter, which may also be distributed among multiple components of the interferometer (such as a temperature affecting the network in a distributed manner), it is possible to relax the adaptivity of the network to a requirement of classical knowledge of the parameter, i.e., knowledge not at the quantum limit. This prior knowledge serves to engineer an auxiliary stage needed to perform the super-sensitive estimation with a single squeezed-vacuum state and homodyne detection at a single output port. Moreover, the adaptivity of the network can actually be completely circumvented when all the output ports of the network are observed with homodyne detectors. We discussed how this allows us to conceive estimation schemes at the Heisenberg limit which can be performed with a single optimization step, when a classical knowledge of the parameter is needed to engineer the auxiliary stage, or without any optimization at all, when no prior knowledge is required.

Author Contributions

Both authors contributed to the paper and the obtained results. The project was conceived and managed by V.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the Office of Naval Research Global (N62909-18-1-2153).

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Acknowledgments

We acknowledge our collaborators in this research, P. Facchi, G. Gramegna, and F. A. Narducci.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Probability Distributions from Homodyne Measurements

In this appendix, we will obtain the probability density functions which govern the outcomes of homodyne detections performed on a single-mode squeezed state |α, r⟩ = D̂(α)Ŝ(r)|vac⟩ injected in the first port of an M-channel passive and linear network Û, with α = (α, 0, …, 0) = (√N_D, 0, …, 0) and r = (r, 0, …, 0), so that N_S = sinh²r, with α, r > 0, and N = N_D + N_S the total average number of photons in the displacement and in the squeezing of the state |α, r⟩. We refer to refs. [18,19,20,21] for reviews on Gaussian states and on how to describe and manipulate them mathematically. The state |α, r⟩ is a Gaussian state which is completely described, through its Wigner distribution
$$ W(\mathbf{z}) = \frac{1}{\sqrt{(2\pi)^{2M}\det[\Gamma]}}\,\exp\!\left(-\frac{1}{2}(\mathbf{z}-\mathbf{d})^{T}\Gamma^{-1}(\mathbf{z}-\mathbf{d})\right), \qquad \text{(A1)} $$
by its vector of displacement d = √2 (α, 0, …, 0) and its matrix of covariances Γ = diag(e^{2r}, 1, …, 1, e^{−2r}, 1, …, 1)/2. Moreover, passive and linear networks preserve the Gaussian nature of |α, r⟩. For this reason, we will first obtain in full generality the final displacement d_U and covariance matrix Γ_U of the state Û|α, r⟩. Then, we will marginalise the Wigner distribution, rotated by the symplectic and orthogonal matrix
$$ R(\boldsymbol{\theta}) = \begin{pmatrix} \cos(\Theta) & -\sin(\Theta) \\ \sin(\Theta) & \cos(\Theta) \end{pmatrix}, \qquad \text{(A2)} $$
according to the phases Θ = diag(θ) = diag(θ_1, …, θ_M) of the local oscillators of the homodyne detectors. In doing so, we will be able to obtain the expressions in Equations (6) and (35)–(37).
The action of a passive and linear unitary U ^ on a state is described by a symplectic rotation of the state Wigner distribution [18,19,20,21]. The Wigner distribution W U ( z ) of the state U ^ | α , r after the interferometric evolution U ^ is thus obtained, rotating the initial Wigner distribution W ( z ) found in Equation (A1) with the orthogonal and symplectic matrix
$$ R = \begin{pmatrix} \mathrm{Re}(U) & -\mathrm{Im}(U) \\ \mathrm{Im}(U) & \mathrm{Re}(U) \end{pmatrix}, \qquad \text{(A3)} $$
where U is the unitary matrix representing the transition amplitudes which can be found through Equation (1), thus obtaining the transformation
$$ W_U(\mathbf{z}) = W(R^{T}\mathbf{z}). \qquad \text{(A4)} $$
Since the probe in our model is Gaussian, the transformation of the state is completely described by the rotations d_U = R d of the displacement and Γ_U = R Γ R^T of the covariance matrix of the initial state, with Γ = diag(e^{2R}, e^{−2R})/2, R = diag(r, 0, …, 0), and d = √2 (α, 0, …, 0) a 2M-dimensional vector. A straightforward calculation shows that
$$ d_U = R\,d = \sqrt{2}\,\alpha \begin{pmatrix} \mathrm{Re}[U_{11}] \\ \vdots \\ \mathrm{Re}[U_{M1}] \\ \mathrm{Im}[U_{11}] \\ \vdots \\ \mathrm{Im}[U_{M1}] \end{pmatrix} \qquad \text{(A5)} $$
and that
$$ \Gamma_U = R\,\Gamma\,R^{T} = \begin{pmatrix} \Delta X^{2} & \Delta XP \\ \Delta XP^{T} & \Delta P^{2} \end{pmatrix}, \qquad \text{(A6)} $$
where we have defined the M × M matrices
$$ \Delta X^{2} \equiv \frac{1}{2}\left(\mathrm{Re}[U]\,e^{2R}\,\mathrm{Re}[U]^{T} + \mathrm{Im}[U]\,e^{-2R}\,\mathrm{Im}[U]^{T}\right) = \frac{1}{2}\left(\mathrm{Re}[U\cosh(2R)U^{\dagger}] + \mathrm{Re}[U\sinh(2R)U^{T}]\right), \qquad \text{(A7a)} $$
$$ \Delta P^{2} \equiv \frac{1}{2}\left(\mathrm{Im}[U]\,e^{2R}\,\mathrm{Im}[U]^{T} + \mathrm{Re}[U]\,e^{-2R}\,\mathrm{Re}[U]^{T}\right) = \frac{1}{2}\left(\mathrm{Re}[U\cosh(2R)U^{\dagger}] - \mathrm{Re}[U\sinh(2R)U^{T}]\right), \qquad \text{(A7b)} $$
$$ \Delta XP \equiv \frac{1}{2}\left(\mathrm{Re}[U]\,e^{2R}\,\mathrm{Im}[U]^{T} - \mathrm{Im}[U]\,e^{-2R}\,\mathrm{Re}[U]^{T}\right) = \frac{1}{2}\left(\mathrm{Im}[U\sinh(2R)U^{T}] - \mathrm{Im}[U\cosh(2R)U^{\dagger}]\right). \qquad \text{(A7c)} $$
We can see that the only elements of U entering in Equation (A5) are the ones in the first column U i 1 , since these are all the transition amplitudes from the only populated input port (namely the first) to every output port. We can easily see that the same is true for the matrices in Equation (A7). In fact, exploiting the relations
$$ \left[\cosh(2R)\right]_{ij} = \delta_{ij} + \delta_{i1}\delta_{j1}\left(\cosh 2r - 1\right), \qquad \text{(A8a)} $$
$$ \left[\sinh(2R)\right]_{ij} = \delta_{i1}\delta_{j1}\sinh 2r, \qquad \text{(A8b)} $$
where δ i j is the Kronecker delta, we can evaluate
$$ \left(U\cosh(2R)U^{\dagger}\right)_{ij} = \sum_{k,l=1}^{M}U_{ik}\left[\cosh(2R)\right]_{kl}\left(U^{\dagger}\right)_{lj} = \delta_{ij} + U_{i1}U_{j1}^{*}\left(\cosh 2r - 1\right), \qquad \text{(A9a)} $$
$$ \left(U\sinh(2R)U^{T}\right)_{ij} = \sum_{k,l=1}^{M}U_{ik}\left[\sinh(2R)\right]_{kl}\left(U^{T}\right)_{lj} = U_{i1}U_{j1}\sinh 2r. \qquad \text{(A9b)} $$
We can thus parametrise only the first column of U, introducing the quantities P_i = |U_{i1}|² and γ_i = arg U_{i1} for i = 1, …, M, which are, respectively, the transition probabilities from the first input to the i-th output port and the complex phases of the relevant transition amplitudes, so that U_{i1} = √P_i e^{iγ_i}. We can then rewrite the displacement d_U in Equation (A5) and the covariance matrix Γ_U in Equation (A6) in terms of P_i and γ_i
$$ d_U = R\,d = \sqrt{2}\,\alpha \begin{pmatrix} \sqrt{P_1}\cos\gamma_1 \\ \vdots \\ \sqrt{P_M}\cos\gamma_M \\ \sqrt{P_1}\sin\gamma_1 \\ \vdots \\ \sqrt{P_M}\sin\gamma_M \end{pmatrix}, \qquad \text{(A10)} $$
and
$$ \Delta X^{2}_{ij} = \frac{1}{2}\delta_{ij} + \sqrt{P_iP_j}\left[\cos(\gamma_i-\gamma_j)\sinh^{2}r + \cos(\gamma_i+\gamma_j)\sinh r\cosh r\right], \qquad \text{(A11a)} $$
$$ \Delta P^{2}_{ij} = \frac{1}{2}\delta_{ij} + \sqrt{P_iP_j}\left[\cos(\gamma_i-\gamma_j)\sinh^{2}r - \cos(\gamma_i+\gamma_j)\sinh r\cosh r\right], \qquad \text{(A11b)} $$
$$ \Delta XP_{ij} = \sqrt{P_iP_j}\left[\sin(\gamma_i+\gamma_j)\sinh r\cosh r - \sin(\gamma_i-\gamma_j)\sinh^{2}r\right]. \qquad \text{(A11c)} $$
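As a simple numerical illustration of the transformation rules above, the following Python sketch (not part of the original works: the network, squeezing and displacement values are arbitrary choices, and the quadrature convention with vacuum variance 1/2 is the one assumed in this appendix) generates a random passive network, rotates the displacement and the covariance matrix of the input state, and checks that the resulting X-block and displacement agree with the parametrisation in Equations (A10) and (A11a).

```python
import numpy as np

rng = np.random.default_rng(0)
M, r, alpha = 4, 0.8, 1.3          # modes, squeezing, displacement (illustrative values)

# Random passive linear network: a unitary obtained from a QR decomposition
A = rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M))
U, _ = np.linalg.qr(A)

# Input state |alpha, r>: squeezing and displacement only in the first mode,
# x quadrature anti-squeezed, p quadrature squeezed, vacuum variance 1/2 (assumed convention)
d = np.zeros(2 * M); d[0] = np.sqrt(2) * alpha
Gamma = 0.5 * np.diag(np.concatenate(([np.exp(2 * r)], np.ones(M - 1),
                                      [np.exp(-2 * r)], np.ones(M - 1))))

# Orthogonal symplectic rotation associated with U, Eq. (A3)
R = np.block([[U.real, -U.imag], [U.imag, U.real]])
d_U = R @ d
Gamma_U = R @ Gamma @ R.T

# Parametrisation through the first column of U: P_i = |U_i1|^2, gamma_i = arg U_i1
P, gam = np.abs(U[:, 0]) ** 2, np.angle(U[:, 0])
sp = np.sqrt(np.outer(P, P))
DX2 = 0.5 * np.eye(M) + sp * (np.cos(np.subtract.outer(gam, gam)) * np.sinh(r) ** 2
                              + np.cos(np.add.outer(gam, gam)) * np.sinh(r) * np.cosh(r))
mu = np.sqrt(2) * alpha * np.sqrt(P) * np.cos(gam)

print(np.allclose(Gamma_U[:M, :M], DX2))   # X-block matches Eq. (A11a)
print(np.allclose(d_U[:M], mu))            # displacement matches Eq. (A10)
```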
Finally, we notice that performing homodyne detection of the quadratures x̂_{i,θ_i}, where θ = (θ_1, …, θ_M) are the phases of the local oscillators at each output channel, corresponds to applying a further unitary rotation U(θ) = diag(e^{−iθ_1}, …, e^{−iθ_M}) to the state U|α, r⟩. This corresponds to substituting U → U(θ)U in the expression of R in Equation (A3), whose effect is to add a phase −θ_i to every element of the i-th row of U. Thus, if we define γ̄_i = γ_i − θ_i, we can write the displacement d_θ and the covariance matrix Γ_θ of the (auxiliary) state Û(θ)Û|α, r⟩
$$ d_\theta = \sqrt{2}\,\alpha \begin{pmatrix} \sqrt{P_1}\cos\bar\gamma_1 \\ \vdots \\ \sqrt{P_M}\cos\bar\gamma_M \\ \sqrt{P_1}\sin\bar\gamma_1 \\ \vdots \\ \sqrt{P_M}\sin\bar\gamma_M \end{pmatrix}, \qquad \text{(A12)} $$
and
$$ \Gamma_\theta = \begin{pmatrix} \Delta X_\theta^{2} & \Delta XP_\theta \\ \Delta XP_\theta^{T} & \Delta P_\theta^{2} \end{pmatrix}, \qquad \text{(A13)} $$
with
$$ \Delta (X_\theta^{2})_{ij} = \frac{1}{2}\delta_{ij} + \sqrt{P_iP_j}\left[\cos(\bar\gamma_i-\bar\gamma_j)\sinh^{2}r + \cos(\bar\gamma_i+\bar\gamma_j)\sinh r\cosh r\right], \qquad \text{(A14a)} $$
$$ \Delta (P_\theta^{2})_{ij} = \frac{1}{2}\delta_{ij} + \sqrt{P_iP_j}\left[\cos(\bar\gamma_i-\bar\gamma_j)\sinh^{2}r - \cos(\bar\gamma_i+\bar\gamma_j)\sinh r\cosh r\right], \qquad \text{(A14b)} $$
$$ \Delta (XP_\theta)_{ij} = \sqrt{P_iP_j}\left[\sin(\bar\gamma_i+\bar\gamma_j)\sinh r\cosh r - \sin(\bar\gamma_i-\bar\gamma_j)\sinh^{2}r\right]. \qquad \text{(A14c)} $$
If we are performing homodyne detection only on a single channel, say the first, as is done, for example, in Section 2, we first need to trace away the M − 1 channels we are not observing. The probability density function p(x) is thus given by the marginal of the Wigner distribution of the state Û(θ)Û|α, r⟩ associated with the first symplectic variable x_1, i.e., a Gaussian distribution with average μ given by the first element of d_θ in Equation (A12)
$$ \mu = \sqrt{2}\,\alpha\sqrt{P_1}\cos\bar\gamma_1 \qquad \text{(A15)} $$
and variance σ 2 given by the first row, first column element of Γ θ in Equation (A13)
$$ \sigma^{2} = \frac{1}{2} + P_1\left(\sinh^{2}r + \cos(2\bar\gamma_1)\sinh r\cosh r\right). \qquad \text{(A16)} $$
For a squeezed vacuum state, α = 0 and we recognise the expression of the variance in Equation (6) from Equation (A16).
If, instead, we are observing all M output channels of the network U, the probability density function p ( x ) will be a Gaussian distribution with average μ given by the first M elements of d θ
$$ \boldsymbol{\mu} = \sqrt{2}\,\alpha \begin{pmatrix} \sqrt{P_1}\cos\bar\gamma_1 \\ \vdots \\ \sqrt{P_M}\cos\bar\gamma_M \end{pmatrix}, \qquad \text{(A17)} $$
from which we recognise Equation (36), while the covariance matrix Σ is given by the first M rows and columns of Γ θ , i.e., by the matrix in Equation (A14a)
$$ \Sigma_{ij} = \frac{1}{2}\delta_{ij} + \sqrt{P_iP_j}\left[\cos(\bar\gamma_i-\bar\gamma_j)\sinh^{2}r + \cos(\bar\gamma_i+\bar\gamma_j)\sinh r\cosh r\right], \qquad \text{(A18)} $$
from which we obtain the expression in Equation (35).
To evaluate the determinant |Σ| in Equation (37), we first notice that the covariance matrix Γ of the initial state |α, r⟩ can be written as the sum Γ = I_{2M}/2 + K_0, where I_{2M} is the 2M × 2M identity matrix and K_0 = Γ − I_{2M}/2 is a diagonal matrix of rank 2. Since the rank is invariant under orthogonal rotations, the same holds true for Γ_θ = RΓR^T = I_{2M}/2 + K, with rank(K) = 2. By definition of rank, none of the sub-matrices of K can have rank greater than 2, hence we can write
$$ \Sigma = \frac{I_M}{2} + \left(\Sigma - \frac{I_M}{2}\right) \equiv \frac{I_M}{2} + A, \qquad \text{(A19)} $$
with rank(A) ≤ 2, since A is a sub-matrix of K,
$$ A_{ij} = \sqrt{P_iP_j}\left[\cos(\bar\gamma_i-\bar\gamma_j)\sinh^{2}r + \cos(\bar\gamma_i+\bar\gamma_j)\sinh r\cosh r\right]. \qquad \text{(A20)} $$
We can thus apply the results in Appendix E to Σ and write |Σ| = |I_M/2 + A| as a sum of determinants of the matrices obtained by replacing any number of columns of I_M/2 with the respective columns of A
$$ |\Sigma| = \frac{1}{2^{M}} + \frac{1}{2^{M-1}}\sum_{i=1}^{M}A_{ii} + \frac{1}{2^{M-2}}\sum_{i=1}^{M}\sum_{j=i+1}^{M}\left(A_{ii}A_{jj}-A_{ij}^{2}\right), \qquad \text{(A21)} $$
where the first term is the determinant of I_M/2, the terms in the first summation are the determinants of the matrices obtained by substituting the i-th column of I_M/2 with the i-th column of A, and the terms in the last summation come from substituting both the i-th and j-th columns, also exploiting the symmetry of A. Noticeably, since rank(A) ≤ 2, all the contributions involving the replacement of three or more columns of A vanish by the definition of the rank of a matrix. Substituting the expression of A found in Equation (A20) into Equation (A21), the expression in Equation (37) can then be easily obtained after some elementary trigonometry.
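The rank-two structure of A makes Equation (A21) easy to verify numerically. The sketch below is only illustrative (the values of P_i, γ̄_i and r are arbitrary test choices): it builds Σ from Equation (A18) and compares the closed-form determinant with a direct numerical evaluation.

```python
import numpy as np

rng = np.random.default_rng(3)
M, r = 6, 0.9
P = rng.random(M); P /= P.sum()                  # transition probabilities, summing to 1
gbar = rng.uniform(0, 2 * np.pi, M)              # phases gamma-bar_i

sp = np.sqrt(np.outer(P, P))
A = sp * (np.cos(np.subtract.outer(gbar, gbar)) * np.sinh(r) ** 2
          + np.cos(np.add.outer(gbar, gbar)) * np.sinh(r) * np.cosh(r))
Sigma = 0.5 * np.eye(M) + A                      # Eqs. (A18)-(A20)

# Rank-2 expansion of the determinant, Eq. (A21)
det_formula = (2.0 ** -M
               + 2.0 ** -(M - 1) * np.trace(A)
               + 2.0 ** -(M - 2) * sum(A[i, i] * A[j, j] - A[i, j] ** 2
                                       for i in range(M) for j in range(i + 1, M)))
print(np.isclose(det_formula, np.linalg.det(Sigma)))   # True
```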

Appendix B. Fisher Information for Gaussian Probabilities

In this Appendix, we will evaluate the Fisher information F(φ) associated with the estimation of a parameter φ from an M-variate Gaussian distribution
$$ p_\varphi(\mathbf{x}) = \frac{1}{\sqrt{(2\pi)^{M}\det[\Sigma]}}\,\exp\!\left(-\frac{1}{2}(\mathbf{x}-\boldsymbol{\mu})^{T}\Sigma^{-1}(\mathbf{x}-\boldsymbol{\mu})\right), \qquad \text{(A22)} $$
where we assume that both the covariance matrix Σ and the average μ depend on the parameter φ .
Recalling the definition of the Fisher information in Equation (7), we first evaluate the logarithmic derivative
$$ \partial_\varphi \ln p_\varphi(\mathbf{x}) = -\frac{1}{2}\,\partial_\varphi\ln\det[\Sigma] - \frac{1}{2}(\mathbf{x}-\boldsymbol{\mu})^{T}\left(\partial_\varphi\Sigma^{-1}\right)(\mathbf{x}-\boldsymbol{\mu}) + \left(\partial_\varphi\boldsymbol{\mu}\right)^{T}\Sigma^{-1}(\mathbf{x}-\boldsymbol{\mu}). \qquad \text{(A23)} $$
In order to evaluate the Fisher information in Equations (9) and (38), we first notice that the expression φ ln p φ ( x ) 2 contains terms of order up to fourth in ( x μ ) . Obtaining the Fisher information thus reduces to the evaluation of the expectation values of a polynomial in ( x μ ) , for which we can exploit standard results of Gaussian integrals:
$$ \mathrm{E}_{p_\varphi}\left[x_i-\mu_i\right] = 0, \qquad \text{(A24a)} $$
$$ \mathrm{E}_{p_\varphi}\left[(x_i-\mu_i)(x_j-\mu_j)\right] = \Sigma_{ij}, \qquad \text{(A24b)} $$
$$ \mathrm{E}_{p_\varphi}\left[(x_i-\mu_i)(x_j-\mu_j)(x_k-\mu_k)\right] = 0, \qquad \text{(A24c)} $$
$$ \mathrm{E}_{p_\varphi}\left[(x_i-\mu_i)(x_j-\mu_j)(x_k-\mu_k)(x_l-\mu_l)\right] = \Sigma_{ij}\Sigma_{kl} + \Sigma_{ik}\Sigma_{jl} + \Sigma_{il}\Sigma_{jk}, \qquad \text{(A24d)} $$
from which we see that only even powers of the term ( x μ ) have a non-vanishing contribution. We can thus evaluate
$$
\begin{aligned}
F(\varphi) = \mathrm{E}_{p_\varphi}\!\left[\left(\partial_\varphi\ln p_\varphi(\mathbf{x})\right)^{2}\right]
&= \frac{1}{4}\left(\partial_\varphi\ln\det[\Sigma]\right)^{2}
+ \sum_{s,t=1}^{M}\left(\partial_\varphi\boldsymbol{\mu}^{T}\Sigma^{-1}\right)_{s}\left(\partial_\varphi\boldsymbol{\mu}^{T}\Sigma^{-1}\right)_{t}\,\mathrm{E}_{p_\varphi}\!\left[(x_s-\mu_s)(x_t-\mu_t)\right]\\
&\quad + \frac{1}{2}\,\partial_\varphi\ln\det[\Sigma]\sum_{s,t=1}^{M}\partial_\varphi\Sigma^{-1}_{st}\,\mathrm{E}_{p_\varphi}\!\left[(x_s-\mu_s)(x_t-\mu_t)\right]\\
&\quad + \frac{1}{4}\sum_{s,t,u,v=1}^{M}\partial_\varphi\Sigma^{-1}_{st}\,\partial_\varphi\Sigma^{-1}_{uv}\,\mathrm{E}_{p_\varphi}\!\left[(x_s-\mu_s)(x_t-\mu_t)(x_u-\mu_u)(x_v-\mu_v)\right].
\end{aligned} \qquad \text{(A25)}
$$
By substituting the expressions in Equations (A24) into Equation (A25), and exploiting Jacobi's formula for the derivative of the determinant of square matrices, which assures that
$$ \frac{1}{\det[\Sigma]}\,\frac{\partial\det[\Sigma]}{\partial\varphi} = \mathrm{Tr}\!\left[\Sigma^{-1}\,\frac{\partial\Sigma}{\partial\varphi}\right] = -\,\mathrm{Tr}\!\left[\frac{\partial\Sigma^{-1}}{\partial\varphi}\,\Sigma\right], \qquad \text{(A26)} $$
we see that most of the terms in Equation (A25) cancel out, yielding the Fisher information
$$ F(\varphi) = \frac{\partial\boldsymbol{\mu}^{T}}{\partial\varphi}\,\Sigma^{-1}\,\frac{\partial\boldsymbol{\mu}}{\partial\varphi} + \frac{1}{2}\,\mathrm{Tr}\!\left[\frac{\partial\Sigma^{-1}}{\partial\varphi}\,\Sigma\,\frac{\partial\Sigma^{-1}}{\partial\varphi}\,\Sigma\right]. \qquad \text{(A27)} $$
We can easily reduce Equation (A27) to the Fisher information in Equation (9) for the single-homodyne case with μ = 0 .
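Equation (A27) can also be checked against a direct Monte-Carlo estimate of the Fisher information. The following sketch uses a toy φ-dependent mean and covariance, chosen only for illustration and not taken from the schemes of Sections 2 and 3, and compares the closed form with the sampled expectation value of the squared score.

```python
import numpy as np

rng = np.random.default_rng(1)

# Toy phi-dependent mean and covariance (illustrative choices, not from the paper)
def mu(phi):
    return np.array([np.sin(phi), 0.5 * np.cos(phi)])

def Sigma(phi):
    return np.array([[1.0 + 0.3 * np.sin(phi), 0.1],
                     [0.1, 0.8 + 0.2 * np.cos(phi)]])

def logpdf(x, phi):
    m, S = mu(phi), Sigma(phi)
    d = x - m
    return -0.5 * (np.log(np.linalg.det(2 * np.pi * S)) + d @ np.linalg.solve(S, d))

phi0, eps = 0.7, 1e-5

# Closed form, Eq. (A27): F = dmu^T Sigma^{-1} dmu + (1/2) Tr[(Sigma^{-1} dSigma)^2]
dmu = (mu(phi0 + eps) - mu(phi0 - eps)) / (2 * eps)
dS = (Sigma(phi0 + eps) - Sigma(phi0 - eps)) / (2 * eps)
Sinv = np.linalg.inv(Sigma(phi0))
F_closed = dmu @ Sinv @ dmu + 0.5 * np.trace(Sinv @ dS @ Sinv @ dS)

# Monte-Carlo estimate of E[(d/dphi ln p)^2], the definition of the Fisher information
xs = rng.multivariate_normal(mu(phi0), Sigma(phi0), size=100_000)
scores = np.array([(logpdf(x, phi0 + eps) - logpdf(x, phi0 - eps)) / (2 * eps) for x in xs])
print(F_closed, np.mean(scores ** 2))   # the two values agree within sampling error
```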
To obtain the FIM in Equation (38), we first exploit the relation
$$ \Sigma\,\frac{\partial\Sigma^{-1}}{\partial\varphi}\,\Sigma = -\,\frac{\partial\Sigma}{\partial\varphi}, \qquad \text{(A28)} $$
and then we express the inverse of the covariance matrix in terms of its cofactor matrix, i.e., Σ^{-1} = C/det[Σ], where we have exploited the symmetry of Σ, so that C^T = C. We can thus rewrite, in the single-parameter case,
$$ F(\varphi) = \frac{\partial\boldsymbol{\mu}^{T}}{\partial\varphi}\,\Sigma^{-1}\,\frac{\partial\boldsymbol{\mu}}{\partial\varphi} - \frac{1}{2}\,\mathrm{Tr}\!\left[\frac{\partial\Sigma^{-1}}{\partial\varphi}\,\frac{\partial\Sigma}{\partial\varphi}\right] = \frac{\partial\boldsymbol{\mu}^{T}}{\partial\varphi}\,\Sigma^{-1}\,\frac{\partial\boldsymbol{\mu}}{\partial\varphi} + \frac{1}{2\det[\Sigma]^{2}}\,\frac{\partial\det[\Sigma]}{\partial\varphi}\,\mathrm{Tr}\!\left[C\,\frac{\partial\Sigma}{\partial\varphi}\right] - \frac{1}{2\det[\Sigma]}\,\mathrm{Tr}\!\left[\frac{\partial C}{\partial\varphi}\,\frac{\partial\Sigma}{\partial\varphi}\right]. \qquad \text{(A29)} $$
We can recognise in the second term of Equation (A29) Jacobi's formula for the derivative of the determinant in Equation (A26), which allows us to obtain the expression for the Fisher information shown in Equation (38)
$$ F(\varphi) = \frac{\partial\boldsymbol{\mu}^{T}}{\partial\varphi}\,\Sigma^{-1}\,\frac{\partial\boldsymbol{\mu}}{\partial\varphi} + \frac{1}{2\det[\Sigma]^{2}}\left(\frac{\partial\det[\Sigma]}{\partial\varphi}\right)^{2} - \frac{1}{2\det[\Sigma]}\,\mathrm{Tr}\!\left[\frac{\partial C}{\partial\varphi}\,\frac{\partial\Sigma}{\partial\varphi}\right]. \qquad \text{(A30)} $$
In order to express the cofactor matrix C in terms of the elements of Σ and thus obtain Equation (39), we first need to make some observations. First, the (s,t)-cofactor C_st is defined as the determinant of the (L − 1) × (L − 1) sub-matrix of Σ obtained by deleting its s-th row and t-th column, multiplied by (−1)^{s+t}. This can equivalently be thought of as the determinant of the L × L matrix Σ_{[s,t],1}, where we denote by X_{[s,t],n} the matrix obtained from the matrix X by replacing all its elements in the s-th row and in the t-th column with zeros, with the exception of the element (s,t), which is replaced by n, namely
$$
C_{st} = \det\!\begin{pmatrix}
\Sigma_{11} & \cdots & \Sigma_{1\,t-1} & 0 & \Sigma_{1\,t+1} & \cdots & \Sigma_{1L}\\
\vdots & & \vdots & \vdots & \vdots & & \vdots\\
\Sigma_{s-1\,1} & \cdots & \Sigma_{s-1\,t-1} & 0 & \Sigma_{s-1\,t+1} & \cdots & \Sigma_{s-1\,L}\\
0 & \cdots & 0 & 1 & 0 & \cdots & 0\\
\Sigma_{s+1\,1} & \cdots & \Sigma_{s+1\,t-1} & 0 & \Sigma_{s+1\,t+1} & \cdots & \Sigma_{s+1\,L}\\
\vdots & & \vdots & \vdots & \vdots & & \vdots\\
\Sigma_{L1} & \cdots & \Sigma_{L\,t-1} & 0 & \Sigma_{L\,t+1} & \cdots & \Sigma_{LL}
\end{pmatrix}. \qquad \text{(A31)}
$$
For the particular case of the covariance matrix Σ in Equation (35), as discussed in Appendix A, we can write Σ = I_M/2 + A, with A a symmetric matrix such that rank(A) = ρ ≤ 2, given in Equation (A20). Thus, we can write the (s,t)-cofactor as
$$ C_{st} = \det\!\left[\left(\tfrac{I_M}{2}\right)_{[s,t],0} + A_{[s,t],1}\right], \qquad \text{(A32)} $$
and evaluate this determinant with the methods discussed in detail in Appendix E. In particular, C_st is obtained as a sum of determinants of matrices obtained by replacing any possible set of columns of (I_M/2)_{[s,t],0} with the columns in the same positions of A_{[s,t],1}. Noticeably, substituting a row and a column of A with an arbitrary row and column in general increases its rank by one, so that rank(A_{[s,t],1}) ≤ 3. To evaluate the cofactor matrix C, it is thus convenient to consider separately the simpler case s = t first, and then s ≠ t.
For s = t in Equation (A32), the cofactor becomes C_ss = det[(I_M/2)_{[s,s],0} + A_{[s,s],1}]. We notice that the matrix (I_M/2)_{[s,s],0} is a diagonal matrix with a single zero eigenvalue, i.e., the s-th, and thus each contribution to C_ss is non-vanishing only if the s-th column of (I_M/2)_{[s,s],0}, a column of only zeros, is replaced. Moreover, the s-th column of A_{[s,s],1} is also a column of all zeros, with the exception of the s-th element, which is equal to 1. We thus obtain
$$ C_{ss} = \frac{1}{2^{M-1}} + \frac{1}{2^{M-2}}\sum_{\substack{i=1\\ i\neq s}}^{M}A_{ii} + \frac{1}{2^{M-3}}\sum_{\substack{i=1\\ i\neq s}}^{M}\sum_{\substack{j=i+1\\ j\neq s}}^{M}\left(A_{ii}A_{jj}-A_{ij}^{2}\right), \qquad \text{(A33)} $$
which is the sum of the terms obtained by substituting the s-th; the s-th and i-th; and the s-th, i-th and j-th columns, respectively. Noticeably, replacing more than 3 columns yields vanishing contributions, since rank(A_{[s,s],1}) ≤ 3. When s ≠ t, (I_M/2)_{[s,t],0} is still a diagonal matrix, but with two zeros on the diagonal, more precisely the s-th and the t-th elements. Both the s-th and the t-th columns are thus columns of zeros, and all non-vanishing contributions to C_st must replace these two columns. For example, the only contribution obtained by swapping the s-th and t-th columns is of the type
$$ \frac{1}{2^{M-2}}\det\!\begin{pmatrix} 0 & 1 \\ A_{ts} & 0 \end{pmatrix} = -\frac{1}{2^{M-2}}\,A_{st}, \qquad \text{(A34)} $$
exploiting at the same time the symmetry of A. The contributions obtained by swapping the s-th, t-th and i-th columns, with i ≠ s, t, are of the type
$$ \frac{1}{2^{M-3}}\det\!\begin{pmatrix} 0 & 1 & 0 \\ A_{ts} & 0 & A_{ti} \\ A_{is} & 0 & A_{ii} \end{pmatrix} = \frac{1}{2^{M-3}}\left(A_{si}A_{ti} - A_{st}A_{ii}\right), \qquad \text{(A35)} $$
where once again we are exploiting the symmetry of A. Again, substituting more than 3 columns yields no contribution, since rank(A_{[s,t],1}) ≤ 3. The final expression for C_st, with s ≠ t, thus reads
$$ C_{st} = -\frac{1}{2^{M-2}}\,A_{st} + \frac{1}{2^{M-3}}\sum_{\substack{i=1\\ i\neq s,t}}^{M}\left(A_{si}A_{ti} - A_{st}A_{ii}\right). \qquad \text{(A36)} $$
Replacing in Equations (A33) and (A36) the definition A = Σ − I_M/2 of Equation (A19), it is straightforward to obtain Equation (39).
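The cofactor expressions (A33) and (A36) can also be verified numerically: for a symmetric Σ the cofactor matrix satisfies C = det[Σ] Σ^{-1}. The sketch below (with arbitrary test values of P_i, γ̄_i and r, chosen only for the check) builds the rank-two matrix A of Equation (A20) and compares the two evaluations.

```python
import numpy as np

rng = np.random.default_rng(2)
M, r = 5, 0.6
P = rng.random(M); P /= P.sum()
gbar = rng.uniform(0, 2 * np.pi, M)

sp = np.sqrt(np.outer(P, P))
A = sp * (np.cos(np.subtract.outer(gbar, gbar)) * np.sinh(r) ** 2
          + np.cos(np.add.outer(gbar, gbar)) * np.sinh(r) * np.cosh(r))
Sigma = 0.5 * np.eye(M) + A                        # Eqs. (A18)-(A20)

# Cofactors from the rank-based expressions (A33) and (A36)
C = np.zeros((M, M))
for s in range(M):
    idx = [i for i in range(M) if i != s]
    C[s, s] = (2.0 ** -(M - 1)
               + 2.0 ** -(M - 2) * sum(A[i, i] for i in idx)
               + 2.0 ** -(M - 3) * sum(A[i, i] * A[j, j] - A[i, j] ** 2
                                       for i in idx for j in idx if j > i))
    for t in range(M):
        if t == s:
            continue
        other = [i for i in range(M) if i not in (s, t)]
        C[s, t] = (-2.0 ** -(M - 2) * A[s, t]
                   + 2.0 ** -(M - 3) * sum(A[s, i] * A[t, i] - A[s, t] * A[i, i]
                                           for i in other))

# For a symmetric matrix, the cofactor matrix equals det(Sigma) * Sigma^{-1}
print(np.allclose(C, np.linalg.det(Sigma) * np.linalg.inv(Sigma)))   # True
```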

Appendix C. Asymptotic Analyses of Gaussian Metrology

In this appendix, we will perform all the asymptotic analyses for a large number of photons N in the setups presented in Section 2 and Section 3.

Appendix C.1. Single Homodyne

Here, we evaluate the asymptotic expressions of the Fisher information in Equation (16), showing that the conditions in Equations (14) and (15) yield the Heisenberg scaling for the estimation of the respective quantities of interest.
As shown in Equations (10) and (11), the dependence of the variance
$$ \sigma_\varphi^{2} = \frac{1-P_\varphi}{2} + \frac{P_\varphi}{2}\left[\cosh(2r) + \cos(2\gamma_\varphi - 2\theta)\sinh(2r)\right], \qquad \text{(A37)} $$
on the parameter φ only appears through the transition probability P_φ and the acquired complex phase γ_φ
$$ \frac{\partial\sigma_\varphi^{2}}{\partial\varphi} = \left(\partial_P\,\sigma_\varphi^{2}\right)\frac{\partial P_\varphi}{\partial\varphi} + \left(\partial_\gamma\,\sigma_\varphi^{2}\right)\frac{\partial\gamma_\varphi}{\partial\varphi}, \qquad \text{(A38)} $$
where ∂_P and ∂_γ represent the differentiation with respect to P_φ and γ_φ, and
$$ \partial_P\,\sigma_\varphi^{2} = \frac{1}{2}\left[-1 + \cosh(2r) + \cos(2\gamma_\varphi - 2\theta)\sinh(2r)\right], \qquad \text{(A39a)} $$
$$ \partial_\gamma\,\sigma_\varphi^{2} = -P_\varphi\sin(2\gamma_\varphi - 2\theta)\sinh(2r). \qquad \text{(A39b)} $$
To achieve the Heisenberg scaling, some conditions must be imposed so that the variance in Equation (A37) does not grow with N. The only option to do so without ruining the sensitivity of the setup is to require that γ_φ − θ ≃ π/2, as we can see from Equation (A37). In particular, we impose condition (14), and evaluate the variance (A37) and its gradient (A38) in the large-N limit
$$
\begin{aligned}
\sigma_\varphi^{2} &= \frac{1}{2} + N P_\varphi\left[1 - \cos\!\left(\frac{2k}{N}\right)\sqrt{1+\frac{1}{N}}\,\right]
= \frac{1}{2} + N P_\varphi\left[1 - \left(1-\frac{2k^{2}}{N^{2}}\right)\left(1+\frac{1}{2N}-\frac{1}{8N^{2}}\right)\right] + O\!\left(\frac{1}{N^{2}}\right)\\
&= \frac{1-P_\varphi}{2} + P_\varphi\left(\frac{2k^{2}}{N}+\frac{1}{8N}\right) + O\!\left(\frac{1}{N^{2}}\right),
\end{aligned} \qquad \text{(A40)}
$$
$$
\begin{aligned}
\frac{\partial\sigma_\varphi^{2}}{\partial\varphi} &= N\,\frac{\partial P_\varphi}{\partial\varphi}\left[1 - \cos\!\left(\frac{2k}{N}\right)\sqrt{1+\frac{1}{N}}\,\right] + 2NP_\varphi\,\frac{\partial\gamma_\varphi}{\partial\varphi}\,\sin\!\left(\frac{2k}{N}\right)\sqrt{1+\frac{1}{N}}\\
&= N\,\frac{\partial P_\varphi}{\partial\varphi}\left[1 - \left(1+\frac{1}{2N}\right)\right] + 2NP_\varphi\,\frac{\partial\gamma_\varphi}{\partial\varphi}\,\frac{2k}{N} + O\!\left(\frac{1}{N}\right)
= -\frac{1}{2}\frac{\partial P_\varphi}{\partial\varphi} + 4kP_\varphi\,\frac{\partial\gamma_\varphi}{\partial\varphi} + O\!\left(\frac{1}{N}\right).
\end{aligned} \qquad \text{(A41)}
$$
Then, by imposing condition (15) on P φ , we get
$$ \sigma_\varphi^{2} = \left(2k^{2}+\frac{1}{8}+\frac{\ell}{2}\right)\frac{1}{N} + O\!\left(\frac{1}{N^{2}}\right), \qquad \text{(A42a)} $$
$$ \frac{\partial\sigma_\varphi^{2}}{\partial\varphi} = 4k\,\frac{\partial\gamma_\varphi}{\partial\varphi} + O\!\left(\frac{1}{N}\right). \qquad \text{(A42b)} $$
By substituting the expressions in Equation (A42) into the Fisher information in Equation (9), we can finally evaluate its asymptotic expression
$$ F(\varphi) \simeq 8\,\varrho(k,\ell)\,N^{2}\,\frac{\partial\gamma_\varphi}{\partial\varphi}\,\frac{\partial\gamma_\varphi}{\partial\varphi}^{T}, \qquad \text{(A43)} $$
with
$$ \varrho(k,\ell) = \frac{64\,k^{2}}{\left(16k^{2}+1+4\ell\right)^{2}}, \qquad \text{(A44)} $$
a positive and N-independent pre-factor.
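The Heisenberg scaling predicted by Equation (A43) can be visualised with a short numerical experiment. In the sketch below we assume, purely for illustration, a toy model in which γ_φ = φ, P_φ = 1 − ℓ/N, and the local-oscillator phase θ is tuned so that γ_{φ0} − θ = π/2 + k/N at the working point; the printed ratio F/N² approaches a constant, i.e., the N-independent prefactor of Equation (A43).

```python
import numpy as np

# Assumed toy model for the single-homodyne scheme (illustrative, not from the paper)
k, ell, phi0 = 0.7, 0.3, 0.0

def variance(phi, N):
    r = np.arcsinh(np.sqrt(N))                    # squeezed-vacuum probe: N = sinh^2 r
    P = 1.0 - ell / N                             # working point of condition (15), assumed
    theta = phi0 - np.pi / 2 - k / N              # working point of condition (14), assumed
    return 0.5 + P * (np.sinh(r) ** 2
                      + np.cos(2 * (phi - theta)) * np.sinh(r) * np.cosh(r))

eps = 1e-6
for N in [10 ** 2, 10 ** 3, 10 ** 4, 10 ** 5]:
    dvar = (variance(phi0 + eps, N) - variance(phi0 - eps, N)) / (2 * eps)
    F = dvar ** 2 / (2 * variance(phi0, N) ** 2)  # Fisher information of a zero-mean Gaussian
    print(N, F / N ** 2)                          # tends to a constant: Heisenberg scaling
```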

Appendix C.2. Multiple Homodyne

In this appendix, we will study the asymptotic regime of the Fisher information in Equation (38),
$$ F(\varphi) = \underbrace{\frac{1}{\det[\Sigma]}\,\frac{\partial\boldsymbol{\mu}^{T}}{\partial\varphi}\,C\,\frac{\partial\boldsymbol{\mu}}{\partial\varphi}}_{F_D(\varphi)} \;+\; \underbrace{\frac{1}{2}\left(\frac{\partial_\varphi\det[\Sigma]}{\det[\Sigma]}\right)^{2} - \frac{1}{2\det[\Sigma]}\,\mathrm{Tr}\!\left[\left(\partial_\varphi\Sigma\right)\left(\partial_\varphi C\right)\right]}_{F_S(\varphi)}, \qquad \text{(A45)} $$
for large N = N_S + N_D = sinh²r + α². We will show that conditions (41) are necessary to reach the Heisenberg scaling, and that, when they hold, the asymptotic expression for the Fisher information is the one shown in Equation (43).
As discussed in Section 3.2, all terms in the numerators of the Fisher information in Equation (A45) are at most of order N_S² or N_S N_D. We thus need to study the asymptotic behaviour of the determinant det[Σ] in Equation (37), which appears in the denominators of the FI, and find which conditions prevent det[Σ] from growing with N_S as well, which would ruin the Heisenberg scaling. In particular, in order to reach the Heisenberg scaling, namely a scaling of order N² of the FI, the determinant det[Σ] must be at most of order O(1).
Thus, we first focus our attention on det [ Σ ] in Equation (37)
$$ \det[\Sigma] = \frac{1}{2^{M}} + \frac{\sinh r}{2^{M-1}}\sum_{i=1}^{M}P_i\left(\sinh r + \cos(2\bar\gamma_i)\cosh r\right) - \frac{\sinh^{2}r}{2^{M-2}}\sum_{i=1}^{M}\sum_{j=i+1}^{M}P_iP_j\sin^{2}(\bar\gamma_i-\bar\gamma_j). \qquad \text{(A46)} $$
In particular, we will suppose that the asymptotic behaviour of the complex phases γ̄_i for large N_S is given by γ̄_i = γ̄_{0i} + k_i N_S^(−α), with k_i ∈ ℝ independent of N and α > 0 (notice that this condition is not restrictive, since if γ̄_i were to grow with N, i.e., α < 0, it would yield an oscillating asymptotic behaviour of det[Σ] and of every other term appearing in the FI). By expanding in powers of N_S the terms in which the squeezing parameter r appears in (A46), we obtain
$$ \det[\Sigma] = D_1\,N_S + D_2 + D_3\,N_S^{-1} + O\!\left(N_S^{-2}\right), \qquad \text{(A47)} $$
where
$$ D_1 = \frac{1}{2^{M-1}}\left(1 + \sum_{i=1}^{M}P_i\cos(2\bar\gamma_i)\right) - \frac{1}{2^{M-2}}\sum_{i=1}^{M}\sum_{j=i+1}^{M}P_iP_j\sin^{2}(\bar\gamma_i-\bar\gamma_j), \qquad \text{(A48a)} $$
$$ D_2 = \frac{1}{2^{M}}\left(1 + \sum_{i=1}^{M}P_i\cos(2\bar\gamma_i)\right), \qquad \text{(A48b)} $$
$$ D_3 = -\frac{1}{2^{M+2}}\sum_{i=1}^{M}P_i\cos(2\bar\gamma_i). \qquad \text{(A48c)} $$
To prevent the scaling with N_S of the determinant det[Σ], D_1 must be set equal to, or tending to, zero. After some trigonometry, and exploiting the fact that Σ_i P_i = 1 due to the unitarity of the network, we can rewrite
$$ D_1 = \frac{1}{2^{M-2}}\left[\left(\sum_{i=1}^{M}P_i\cos^{2}\bar\gamma_i\right)^{2} + \left(\sum_{i=1}^{M}P_i\sin\bar\gamma_i\cos\bar\gamma_i\right)^{2}\right], \qquad \text{(A49)} $$
which tends to zero only if both terms in the parenthesis tend to zero, i.e., when γ̄_{0i} = π/2 + nπ, with n ∈ ℤ. To analyse the order for large N_S of the determinant when the condition γ̄_{0i} = π/2 + nπ holds (and thus γ̄_i = π/2 + nπ + k_i N_S^(−α)), we need to analyse the behaviour of each term in Equation (A48). Since cos(2γ̄_i) ≃ −1 + 2k_i² N_S^(−2α) and sin²(γ̄_i − γ̄_j) ≃ (k_i − k_j)² N_S^(−2α), both D_1 and D_2 scale with N_S^(−2α), while D_3 is asymptotically constant. Thus, we find the asymptotic behaviour of det[Σ] through Equation (A47): the term D_2 is always negligible with respect to D_1 N_S, while for α ≤ 1 the term D_1 N_S dominates over D_3/N_S, so that det[Σ] is of order N_S^(1−2α); for α > 1, instead, D_3/N_S dominates over D_1 N_S and the determinant scales with N_S^(−1). In equations:
$$ \det[\Sigma] \simeq \begin{cases} D_1\,N_S \sim N_S^{1-2\alpha}, & \text{for } \alpha \le 1, \\[4pt] D_3\,N_S^{-1} \sim N_S^{-1}, & \text{for } \alpha > 1. \end{cases} \qquad \text{(A50)} $$
Noticeably, the determinant stops growing with N_S only for α ≥ 1/2.
We now study the asymptotic behaviour of the numerators appearing in the Fisher information in Equation (A45) when the condition γ̄_i = π/2 + k_i N_S^(−α), with α ≥ 1/2, holds. We will perform the same analysis done for det[Σ] earlier, considering only the dominating term for every value of α. We first obtain the derivative of Σ from Equation (35), substituting γ̄_i = π/2 + k_i/N_S^α, with α ≥ 1/2,
$$ \partial_\varphi\Sigma_{ij} = -\frac{1}{2}\,\partial_\varphi\!\left(\sqrt{P_iP_j}\right) + \sqrt{P_iP_j}\left(\partial_\varphi(\bar\gamma_i+\bar\gamma_j)\,(k_i+k_j) - \partial_\varphi(\bar\gamma_i-\bar\gamma_j)\,(k_i-k_j)\right)N_S^{1-\alpha} + O\!\left(N_S^{1-2\alpha}\right) + O\!\left(N_S^{-1}\right), \qquad \text{(A51)} $$
and we notice that each element of ∂_φΣ scales at most with N_S^(1−α) for 1/2 ≤ α < 1, while it becomes constant for α ≥ 1. We then analyse the auxiliary term (40)
$$ S_{sti} = \sinh^{2}(r)\,\sqrt{P_sP_t}\,P_i\,\sin(\bar\gamma_s-\bar\gamma_i)\sin(\bar\gamma_t-\bar\gamma_i) = O\!\left(N_S^{1-2\alpha}\right), \qquad \text{(A52)} $$
of which we evaluate the derivative, again under the condition γ̄_i = π/2 + k_i/N_S^α, with α ≥ 1/2,
$$ \partial_\varphi S_{sti} = \sqrt{P_sP_t}\,P_i\left(\left(\partial_\varphi(\bar\gamma_s-\bar\gamma_i)\right)(k_t-k_i) + \left(\partial_\varphi(\bar\gamma_t-\bar\gamma_i)\right)(k_s-k_i)\right)N_S^{1-\alpha} + O\!\left(N_S^{1-2\alpha}\right), \qquad \text{(A53)} $$
which scales at most with N_S^(1−α) for large N_S. Moreover, the covariance matrix Σ in (35) asymptotically reads
$$ \Sigma_{ij} = \frac{\delta_{ij} - \sqrt{P_iP_j}}{2} + O\!\left(N_S^{1-2\alpha}\right) + O\!\left(N_S^{-1}\right), \qquad \text{(A54)} $$
which is in general constant asymptotically for large N S . Inserting Equations (A52) and (A54) in the cofactor matrix (39), we obtain
$$ C_{st} = \frac{1}{2^{M-1}}\sqrt{P_sP_t} + O\!\left(N_S^{1-2\alpha}\right) + O\!\left(N_S^{-1}\right). \qquad \text{(A55)} $$
Exploiting the asymptotic behaviours of the derivatives found in Equations (A51) and (A53), we are able to analyse the remaining terms appearing in the FI
$$ \partial_\varphi\det[\Sigma] = \frac{1}{2^{M-1}}\sum_{i=1}^{M}\partial_\varphi\Sigma_{ii} - \frac{1}{2^{M-2}}\sum_{i=1}^{M}\sum_{j=i+1}^{M}\partial_\varphi S_{iij}, \qquad \text{(A56)} $$
$$ \partial_\varphi C_{ss} = \frac{1}{2^{M-2}}\sum_{\substack{i=1\\ i\neq s}}^{M}\partial_\varphi\Sigma_{ii} - \frac{1}{2^{M-3}}\sum_{\substack{i=1\\ i\neq s}}^{M}\sum_{\substack{j=i+1\\ j\neq s}}^{M}\partial_\varphi S_{iij}, \qquad \text{(A57)} $$
$$ \partial_\varphi C_{st} = -\frac{1}{2^{M-2}}\,\partial_\varphi\Sigma_{st} + \frac{1}{2^{M-3}}\sum_{\substack{i=1\\ i\neq s,t}}^{M}\partial_\varphi S_{sti}. \qquad \text{(A58)} $$
It is easy to see that ∂_φ det[Σ] scales at most with N_S^(1−α), since Σ_{i=1}^M ∂_φP_i = 0, while the derivatives of the elements of C scale at most with N_S^(1−α) for 1/2 ≤ α < 1, and become constant for α ≥ 1. The last step is to evaluate the asymptotic behaviour of the derivative ∂_φμ of the average, which can be easily obtained by differentiating Equation (36)
$$ \partial_\varphi\mu_i = \sqrt{2N_D}\left(\frac{\partial_\varphi P_i}{2\sqrt{P_i}}\cos\bar\gamma_i - \sqrt{P_i}\,\left(\partial_\varphi\bar\gamma_i\right)\sin\bar\gamma_i\right) = -\sqrt{2N_D}\,\sqrt{P_i}\,\partial_\varphi\bar\gamma_i + O\!\left(\sqrt{N_D}\,N_S^{-\alpha}\right). \qquad \text{(A59)} $$
Now we can finally draw the conclusions, and obtain the scaling of the Fisher information shown in Equation (43) by putting together all the asymptotic regimes found. First, we notice that, independently of α ≥ 1/2, the third contribution to the Fisher information in Equation (A45) always scales with N_S, hence only reaching shot-noise precision. Thus, it is always dominated by the other two terms in the Heisenberg-scaling regime. The only terms which allow the Heisenberg scaling are then the first two: in fact, the term F_D(φ) scales with N_D N_S^(2α−1) for 1/2 ≤ α ≤ 1, and with N_D N_S for α > 1, thus reaching the Heisenberg scaling for α ≥ 1, while the second term reaches the sub-shot-noise scaling N_S^(2−2|α−1|) for 1/2 < α < 3/2, achieving the Heisenberg scaling for α = 1. It is interesting to notice the major difference between the two contributions to the Heisenberg scaling: the term F_D(φ) reaches the Heisenberg scaling for every value of α ≥ 1, which means that the choice γ̄_i = π/2, corresponding to α = +∞, still yields the Heisenberg scaling. On the other hand, α must be equal to 1 in order for the first term of F_S(φ) to achieve the Heisenberg scaling. The consequences and the physical implications of this difference are discussed in depth in Section 3.2.
The conditions to reach the Heisenberg scaling can thus be condensed into the request on the complex phases
$$ \bar\gamma_i = \frac{\pi}{2} + O\!\left(N_S^{-1}\right), \qquad \text{(A60)} $$
for i = 1, …, M, as shown in Equation (41), or equivalently that γ̄_i ≃ π/2 + k_i/N_S asymptotically, with k_i ∈ ℝ independent of N_S. Moreover, we have seen that the only relevant terms under this condition are the first two in Equation (A45), and that only for α = 1 is the first term of F_S(φ) non-vanishing.
Finally, we can now prove (43). Substituting condition (A60) into Equations (A51) and (A53), we obtain from Equation (A56)
$$ \partial_\varphi\det[\Sigma] \simeq \frac{1}{2^{M-1}}\sum_{i=1}^{M}4P_ik_i\left(\partial_\varphi\bar\gamma_i\right) - \frac{1}{2^{M-2}}\sum_{i=1}^{M}\sum_{j=i+1}^{M}2P_iP_j(k_i-k_j)\left(\partial_\varphi\bar\gamma_i - \partial_\varphi\bar\gamma_j\right) = \frac{1}{2^{M-3}}\left(\sum_{i=1}^{M}P_ik_i\right)\left(\sum_{j=1}^{M}P_j\,\partial_\varphi\bar\gamma_j\right), \qquad \text{(A61)} $$
where we exploited the passivity of the interferometer, which assures Σ_{i=1}^M P_i = 1. We also obtain from (A46)
$$ \det[\Sigma] \simeq \frac{1}{2^{M-2}\,N_S}\left[\left(\sum_{i=1}^{M}P_ik_i\right)^{2} + \frac{1}{16}\right], \qquad \text{(A62)} $$
which coincides with the expression in Equation (42). Lastly, from Equation (A59) when conditions (A60) hold, we obtain the asymptotic behaviour
$$ \partial_\varphi\mu_i \simeq -\sqrt{2N_D}\,\sqrt{P_i}\,\partial_\varphi\bar\gamma_i. \qquad \text{(A63)} $$
Substituting Equations (A55) and (A61)–(A63) into the Fisher information in Equation (38) yields the asymptotic expression shown in Equation (43).
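As a consistency check of the asymptotic expression (A62), the following sketch evaluates det[Σ] exactly from the covariance matrix (A18) with γ̄_i = π/2 + k_i/N_S (the values of P_i and k_i are arbitrary test choices) and compares it with the asymptotic formula for increasing N_S.

```python
import numpy as np

rng = np.random.default_rng(6)
M = 4
P = rng.random(M); P /= P.sum()
k = rng.uniform(-1.0, 1.0, M)

def det_sigma(NS):
    r = np.arcsinh(np.sqrt(NS))                    # N_S = sinh^2 r
    gbar = np.pi / 2 + k / NS                      # condition (A60) with alpha = 1
    sp = np.sqrt(np.outer(P, P))
    Sigma = 0.5 * np.eye(M) + sp * (np.cos(np.subtract.outer(gbar, gbar)) * np.sinh(r) ** 2
                                    + np.cos(np.add.outer(gbar, gbar)) * np.sinh(r) * np.cosh(r))
    return np.linalg.det(Sigma)

for NS in [10 ** 2, 10 ** 3, 10 ** 4]:
    asym = ((P @ k) ** 2 + 1 / 16) / (2 ** (M - 2) * NS)   # Eq. (A62)
    print(NS, det_sigma(NS) / asym)                        # ratio tends to 1
```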

Appendix D. Maximum-Likelihood Estimators for Gaussian Distributions

In this Appendix, we will derive the Likelihood equation associated with the estimation schemes presented in Section 2 and Section 3, i.e., the equation whose solution maximises the Likelihood functions in Equations (18) and (46). In doing so, we will demonstrate the expressions in Equations (19) and (47).
We first suppose that the M-variate Gaussian probability distribution
$$ p_\varphi(\mathbf{x}) = \frac{1}{\sqrt{(2\pi)^{M}\det[\Sigma]}}\,\exp\!\left(-\frac{1}{2}(\mathbf{x}-\boldsymbol{\mu})^{T}\Sigma^{-1}(\mathbf{x}-\boldsymbol{\mu})\right), \qquad \text{(A64)} $$
governing the outcomes x = (x_1, …, x_M) depends on a single parameter φ through its average μ and its covariance matrix Σ. If we perform the observation ν times, obtaining the outcomes x_1, …, x_ν, the likelihood function becomes
$$ L(\varphi;\mathbf{x}_1,\dots,\mathbf{x}_\nu) = \frac{1}{\sqrt{(2\pi)^{M\nu}\det[\Sigma]^{\nu}}}\,\exp\!\left(-\frac{1}{2}\sum_{i=1}^{\nu}(\mathbf{x}_i-\boldsymbol{\mu})^{T}\Sigma^{-1}(\mathbf{x}_i-\boldsymbol{\mu})\right). \qquad \text{(A65)} $$
The maximum-likelihood estimator is obtained maximising the Likelihood function in Equation (A65)—or its logarithm. Applying the maximisation to the likelihood function in Equation (A65), we obtain the equation
$$
\begin{aligned}
0 &= \left.\partial_\varphi\ln L(\varphi;\mathbf{x}_1,\dots,\mathbf{x}_\nu)\right|_{\varphi=\tilde\varphi_{\mathrm{MLE}}} = \left.\partial_\varphi\sum_{j=1}^{\nu}\ln p(\mathbf{x}_j;\varphi)\right|_{\varphi=\tilde\varphi_{\mathrm{MLE}}}\\
&= \left.\left[-\frac{\nu}{2}\,\partial_\varphi\ln\left(\det[\Sigma]\right) - \frac{1}{2}\,\partial_\varphi\sum_{j=1}^{\nu}(\mathbf{x}_j-\boldsymbol{\mu})^{T}\Sigma^{-1}(\mathbf{x}_j-\boldsymbol{\mu})\right]\right|_{\varphi=\tilde\varphi_{\mathrm{MLE}}}\\
&= \left.\left[-\frac{\nu}{2}\,\mathrm{Tr}\!\left[\Sigma^{-1}\partial_\varphi\Sigma\right] - \frac{1}{2}\,\partial_\varphi\sum_{j=1}^{\nu}\mathrm{Tr}\!\left[\Sigma^{-1}(\mathbf{x}_j-\boldsymbol{\mu})(\mathbf{x}_j-\boldsymbol{\mu})^{T}\right]\right]\right|_{\varphi=\tilde\varphi_{\mathrm{MLE}}}\\
&= \left.\frac{1}{2}\,\mathrm{Tr}\!\left[\partial_\varphi\Sigma^{-1}\left(\nu\,\Sigma - \sum_{j=1}^{\nu}(\mathbf{x}_j-\boldsymbol{\mu})(\mathbf{x}_j-\boldsymbol{\mu})^{T}\right)\right]\right|_{\varphi=\tilde\varphi_{\mathrm{MLE}}} + \left.(\partial_\varphi\boldsymbol{\mu})^{T}\,\Sigma^{-1}\left(\sum_{j=1}^{\nu}\mathbf{x}_j - \nu\,\boldsymbol{\mu}\right)\right|_{\varphi=\tilde\varphi_{\mathrm{MLE}}},
\end{aligned} \qquad \text{(A66)}
$$
where we exploited Jacobi's formula for the derivative of the determinant of a matrix shown in Equation (A26), the identity in Equation (A28), and the symmetry of Σ. Equation (A66) is the expression of the likelihood Equation (47). This can be easily simplified to the case of a univariate Gaussian distribution, for M = 1 and in the absence of displacement (μ = 0), for which Σ → σ² and the vectors x_j reduce to the scalar outcomes x_j, obtaining
$$ 0 = \sigma^{2} - \frac{1}{\nu}\sum_{j=1}^{\nu}x_j^{2}, \qquad \text{(A67)} $$
as shown in Equation (19).
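A minimal numerical sketch of the maximum-likelihood estimation of Equation (A67) is reported below. The model is an assumed toy version of the single-homodyne scheme (P_φ = 1, θ = 0, γ_φ = φ), chosen only because its likelihood equation can be inverted analytically.

```python
import numpy as np

rng = np.random.default_rng(5)

# Assumed toy model: sigma^2(phi) = 1/2 + sinh^2 r + cos(2*phi) sinh r cosh r,
# i.e., Eq. (A16) with P_1 = 1 and gamma-bar_1 = phi (illustrative choice)
r, phi_true, nu = 1.0, 0.4, 5000

def sigma2(phi):
    return 0.5 + np.sinh(r) ** 2 + np.cos(2 * phi) * np.sinh(r) * np.cosh(r)

# nu homodyne outcomes drawn from the zero-mean Gaussian with variance sigma^2(phi_true)
x = rng.normal(0.0, np.sqrt(sigma2(phi_true)), size=nu)

# Likelihood equation (A67): sigma^2(phi_MLE) = (1/nu) sum_j x_j^2, inverted analytically
# on (0, pi/2), where sigma^2 is monotonic in phi
m2 = np.mean(x ** 2)
arg = np.clip((m2 - 0.5 - np.sinh(r) ** 2) / (np.sinh(r) * np.cosh(r)), -1.0, 1.0)
phi_mle = 0.5 * np.arccos(arg)
print(phi_true, phi_mle)   # the estimate concentrates around the true value as nu grows
```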

Appendix E. Formulas for the Determinant of a Sum of Two Matrices

Let us consider an L × L matrix Z of the form Z = D + W, where D = diag(d_1, …, d_L) is a diagonal matrix and rank(W) = ρ ≤ L. In this appendix, we will show a convenient method to express the determinant det[Z] in terms of the elements of W, used for the evaluation of the determinant in Equation (37) and of the cofactor matrix in Equation (39).
We exploit the identity [47]
$$ \det[Z] = \det[D+W] = \sum_{\alpha=0}^{L}\Theta_\alpha(D,W), \qquad \text{(A68)} $$
where Θ α ( X , Y ) is the sum of the determinants of the matrices obtained by replacing any set of α columns (rows) of X with the α columns (rows) of Y at the same position. For example, for two 3 × 3 matrices
$$ X = \begin{pmatrix} X_{11} & X_{12} & X_{13} \\ X_{21} & X_{22} & X_{23} \\ X_{31} & X_{32} & X_{33} \end{pmatrix}, \qquad Y = \begin{pmatrix} Y_{11} & Y_{12} & Y_{13} \\ Y_{21} & Y_{22} & Y_{23} \\ Y_{31} & Y_{32} & Y_{33} \end{pmatrix}, \qquad \text{(A69)} $$
the quantity Θ_α(X, Y) is given, for α = 1, by
$$ \Theta_1(X,Y) = \det\!\begin{pmatrix} Y_{11} & X_{12} & X_{13} \\ Y_{21} & X_{22} & X_{23} \\ Y_{31} & X_{32} & X_{33} \end{pmatrix} + \det\!\begin{pmatrix} X_{11} & Y_{12} & X_{13} \\ X_{21} & Y_{22} & X_{23} \\ X_{31} & Y_{32} & X_{33} \end{pmatrix} + \det\!\begin{pmatrix} X_{11} & X_{12} & Y_{13} \\ X_{21} & X_{22} & Y_{23} \\ X_{31} & X_{32} & Y_{33} \end{pmatrix}, \qquad \text{(A70)} $$
while for α = 2
$$ \Theta_2(X,Y) = \det\!\begin{pmatrix} Y_{11} & Y_{12} & X_{13} \\ Y_{21} & Y_{22} & X_{23} \\ Y_{31} & Y_{32} & X_{33} \end{pmatrix} + \det\!\begin{pmatrix} X_{11} & Y_{12} & Y_{13} \\ X_{21} & Y_{22} & Y_{23} \\ X_{31} & Y_{32} & Y_{33} \end{pmatrix} + \det\!\begin{pmatrix} Y_{11} & X_{12} & Y_{13} \\ Y_{21} & X_{22} & Y_{23} \\ Y_{31} & X_{32} & Y_{33} \end{pmatrix}. \qquad \text{(A71)} $$
Since rank ( W ) = ρ , the determinant of any α × α sub-matrix of W is zero if α > ρ , so we can write
$$ \det[Z] = \sum_{\alpha=0}^{\rho}\Theta_\alpha(D,W). \qquad \text{(A72)} $$
Let us now make explicit the first, easier terms of the summation. In particular, with the goal of applying this expression to the covariance matrix Σ in Equation (35) and to the cofactor matrix C in Equation (39), which can be expressed as the sum of a diagonal matrix and a rank-two or rank-three matrix (see Equations (A19), (A20) and (A32)), we will explicitly report the first three terms Θ_0(X, Y), Θ_1(X, Y) and Θ_2(X, Y).
For α = 0, no columns are replaced from D, hence Θ_0(D, W) = det[D] = ∏_k d_k. We also notice that, if at least one of the eigenvalues of D is zero, this term vanishes.
For α = 1 , Θ 1 ( D , W ) is the sum of determinants of matrices of the form
$$
\begin{pmatrix}
d_1 & \cdots & 0 & W_{1i} & 0 & \cdots & 0\\
\vdots & \ddots & \vdots & \vdots & \vdots & & \vdots\\
0 & \cdots & d_{i-1} & W_{i-1\,i} & 0 & \cdots & 0\\
0 & \cdots & 0 & W_{ii} & 0 & \cdots & 0\\
0 & \cdots & 0 & W_{i+1\,i} & d_{i+1} & \cdots & 0\\
\vdots & & \vdots & \vdots & \vdots & \ddots & \vdots\\
0 & \cdots & 0 & W_{Li} & 0 & \cdots & d_L
\end{pmatrix}, \qquad \text{(A73)}
$$
with i = 1, …, L. The determinant of a matrix of the type in Equation (A73) is straightforward, and reduces to W_ii ∏_{k≠i} d_k. Thus, in general, Θ_1(D, W) = Σ_i W_ii ∏_{k≠i} d_k. A key observation to make, in view of the application to the covariance matrix Σ and to the cofactor matrix C, is that if two or more eigenvalues of D are zero, all these determinants are zero, and thus Θ_1(D, W) = 0; if, instead, a single eigenvalue is zero, say d_j = 0, only one of these determinants is non-vanishing, namely the one of the matrix obtained by replacing exactly the j-th column of D. In this case, for d_j = 0, we have Θ_1(D, W) = W_jj ∏_{k≠j} d_k.
Let us now consider lastly the case α = 2 . The matrices contributing to Θ 2 ( D , W ) are of the form
$$
\begin{pmatrix}
d_1 & \cdots & W_{1i} & \cdots & W_{1i'} & \cdots & 0\\
\vdots & \ddots & \vdots & & \vdots & & \vdots\\
0 & \cdots & W_{ii} & \cdots & W_{ii'} & \cdots & 0\\
\vdots & & \vdots & \ddots & \vdots & & \vdots\\
0 & \cdots & W_{i'i} & \cdots & W_{i'i'} & \cdots & 0\\
\vdots & & \vdots & & \vdots & \ddots & \vdots\\
0 & \cdots & W_{Li} & \cdots & W_{Li'} & \cdots & d_L
\end{pmatrix}, \qquad \text{(A74)}
$$
with i < i′ = 1, …, L. Once again, the determinants of the matrices of the type in Equation (A74) are easy to evaluate, and read det[W_(i,i′)] ∏_{k∉{i,i′}} d_k, where
$$ W_{(i,i')} = \begin{pmatrix} W_{ii} & W_{ii'} \\ W_{i'i} & W_{i'i'} \end{pmatrix}. \qquad \text{(A75)} $$
We notice that det[W_(i,i′)] = det[W_(i′,i)]. Thus, we can rewrite the term Θ_2(D, W) as the sum Θ_2(D, W) = Σ_i Σ_{i′>i} det[W_(i,i′)] ∏_{k∉{i,i′}} d_k. Once again, key observations can be made: if the diagonal matrix D has at least three null eigenvalues, then Θ_2(D, W) is vanishing. If only two eigenvalues are zero, e.g., d_j = d_j′ = 0 with j ≠ j′, then there is only one non-vanishing contribution to Θ_2(D, W), given by the matrix obtained by substituting the j-th and j′-th columns of D, and in this case we have Θ_2(D, W) = det[W_(j,j′)] ∏_{k∉{j,j′}} d_k. If only one eigenvalue is zero, namely d_j = 0, then Θ_2(D, W) is given by the sum of all the determinants of the matrices where the j-th column of D has been replaced, namely Θ_2(D, W) = Σ_{i≠j} det[W_(i,j)] ∏_{k∉{i,j}} d_k.
It is then possible to extend the same method, with similar considerations, to every value of α, eventually obtaining the compact expression for the determinant in Equation (A68)
$$ \det[Z] = \sum_{\alpha=0}^{\rho}\;\sum_{\gamma\in\mathcal{C}_\alpha^{L}}\det\!\left[W_{(\gamma)}\right]\prod_{k\notin\gamma}d_k, \qquad \text{(A76)} $$
where C_α^L is the set of all the combinations without repetition of α columns out of L, and W_(γ) denotes the α × α sub-matrix of W obtained by selecting the rows and columns with indices γ_1, …, γ_α.
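Equation (A76) can be verified numerically with a few lines of Python; in the sketch below (illustrative only, with an arbitrary size and a random low-rank W) the sum over column subsets is compared with a direct evaluation of det[D + W].

```python
import numpy as np
from itertools import combinations

rng = np.random.default_rng(4)
L, rho = 6, 2                                     # size and rank (assumed toy values)
D = np.diag(rng.normal(size=L))
B = rng.normal(size=(L, rho))
W = B @ B.T                                       # symmetric matrix of rank rho

# Eq. (A76): sum over all column subsets gamma of size alpha <= rho
det_sum = 0.0
for alpha in range(rho + 1):
    for gamma in combinations(range(L), alpha):
        gamma = list(gamma)
        rest = [k for k in range(L) if k not in gamma]
        det_sum += np.linalg.det(W[np.ix_(gamma, gamma)]) * np.prod(np.diag(D)[rest])

print(np.isclose(det_sum, np.linalg.det(D + W)))   # True
```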

References

  1. Caves, C.M. Quantum-mechanical noise in an interferometer. Phys. Rev. D 1981, 23, 1693–1708. [Google Scholar] [CrossRef]
  2. Bondurant, R.S.; Shapiro, J.H. Squeezed states in phase-sensing interferometers. Phys. Rev. D 1984, 30, 2548–2556. [Google Scholar] [CrossRef]
  3. Wineland, D.J.; Bollinger, J.J.; Itano, W.M.; Moore, F.L.; Heinzen, D.J. Spin squeezing and reduced quantum noise in spectroscopy. Phys. Rev. A 1992, 46, R6797–R6800. [Google Scholar] [CrossRef]
  4. Giovannetti, V.; Lloyd, S.; Maccone, L. Quantum-Enhanced Measurements: Beating the Standard Quantum Limit. Science 2004, 306, 1330–1336. Available online: https://www.science.org/doi/pdf/10.1126/science.1104149 (accessed on 28 January 2022). [CrossRef] [PubMed] [Green Version]
  5. Giovannetti, V.; Lloyd, S.; Maccone, L. Quantum Metrology. Phys. Rev. Lett. 2006, 96, 010401. [Google Scholar] [CrossRef] [Green Version]
  6. Dowling, J.P. Quantum optical metrology—The lowdown on high-N00N states. Contemp. Phys. 2008, 49, 125–143. [Google Scholar] [CrossRef]
  7. Paris, M.G. Quantum estimation for quantum technology. Int. J. Quantum Inf. 2009, 7, 125–137. [Google Scholar] [CrossRef]
  8. Giovannetti, V.; Lloyd, S.; Maccone, L. Advances in quantum metrology. Nat. Photonics 2011, 5, 222–229. [Google Scholar] [CrossRef]
  9. Lang, M.D.; Caves, C.M. Optimal Quantum-Enhanced Interferometry Using a Laser Power Source. Phys. Rev. Lett. 2013, 111, 173601. [Google Scholar] [CrossRef] [Green Version]
  10. Tóth, G.; Apellaniz, I. Quantum metrology from a quantum information science perspective. J. Phys. A Math. Theor. 2014, 47, 424006. [Google Scholar] [CrossRef] [Green Version]
  11. Dowling, J.P.; Seshadreesan, K.P. Quantum Optical Technologies for Metrology, Sensing, and Imaging. J. Light. Technol. 2015, 33, 2359–2370. [Google Scholar] [CrossRef] [Green Version]
  12. Szczykulska, M.; Baumgratz, T.; Datta, A. Multi-parameter quantum metrology. Adv. Phys. X 2016, 1, 621–639. [Google Scholar] [CrossRef]
  13. Schnabel, R. Squeezed states of light and their applications in laser interferometers. Phys. Rep. 2017, 684, 1–51. [Google Scholar] [CrossRef] [Green Version]
  14. Braun, D.; Adesso, G.; Benatti, F.; Floreanini, R.; Marzolino, U.; Mitchell, M.W.; Pirandola, S. Quantum-enhanced measurements without entanglement. Rev. Mod. Phys. 2018, 90, 035006. [Google Scholar] [CrossRef] [Green Version]
  15. Pirandola, S.; Bardhan, B.R.; Gehring, T.; Weedbrook, C.; Lloyd, S. Advances in photonic quantum sensing. Nat. Photonics 2018, 12, 724–733. [Google Scholar] [CrossRef]
  16. Polino, E.; Valeri, M.; Spagnolo, N.; Sciarrino, F. Photonic quantum metrology. AVS Quantum Sci. 2020, 2, 024703. [Google Scholar] [CrossRef]
  17. Maccone, L.; Riccardi, A. Squeezing metrology: A unified framework. Quantum 2020, 4, 292. [Google Scholar] [CrossRef]
  18. Schleich, W. Quantum Optics in Phase Space; Wiley: Hoboken, NJ, USA, 2011. [Google Scholar] [CrossRef]
  19. Weedbrook, C.; Pirandola, S.; García-Patrón, R.; Cerf, N.J.; Ralph, T.C.; Shapiro, J.H.; Lloyd, S. Gaussian quantum information. Rev. Mod. Phys. 2012, 84, 621–669. [Google Scholar] [CrossRef]
  20. Adesso, G.; Ragy, S.; Lee, A.R. Continuous Variable Quantum Information: Gaussian States and Beyond. Open Syst. Inf. Dyn. 2014, 21, 1440001. [Google Scholar] [CrossRef] [Green Version]
  21. Lvovsky, A.I. Squeezed Light. In Photonics; John Wiley and Sons, Ltd.: Hoboken, NJ, USA, 2015; Chapter 5; pp. 121–163. Available online: https://onlinelibrary.wiley.com/doi/pdf/10.1002/9781119009719.ch5 (accessed on 28 January 2022). [CrossRef]
  22. Cerf, N.J.; Leuchs, G.; Polzik, E.S. Quantum Information with Continuous Variables of Atoms and Light; Imperial College Press: London, UK, 2007; Available online: https://www.worldscientific.com/doi/pdf/10.1142/p489 (accessed on 28 January 2022). [CrossRef]
  23. Grangier, P.; Slusher, R.E.; Yurke, B.; LaPorta, A. Squeezed-light—Enhanced polarization interferometer. Phys. Rev. Lett. 1987, 59, 2153–2156. [Google Scholar] [CrossRef]
  24. Monras, A. Optimal phase measurements with pure Gaussian states. Phys. Rev. A 2006, 73, 033821. [Google Scholar] [CrossRef] [Green Version]
  25. Yonezawa, H.; Nakane, D.; Wheatley, T.A.; Iwasawa, K.; Takeda, S.; Arao, H.; Ohki, K.; Tsumura, K.; Berry, D.W.; Ralph, T.C.; et al. Quantum-Enhanced Optical-Phase Tracking. Science 2012, 337, 1514–1517. Available online: https://www.science.org/doi/pdf/10.1126/science.1225258 (accessed on 28 January 2022). [CrossRef] [Green Version]
  26. Oh, C.; Lee, C.; Rockstuhl, C.; Jeong, H.; Kim, J.; Nha, H.; Lee, S.Y. Optimal Gaussian measurements for phase estimation in single-mode Gaussian metrology. npj Quantum Inf. 2019, 5, 10. [Google Scholar] [CrossRef]
  27. Matsubara, T.; Facchi, P.; Giovannetti, V.; Yuasa, K. Optimal Gaussian metrology for generic multimode interferometric circuit. New J. Phys. 2019, 21, 033014. [Google Scholar] [CrossRef] [Green Version]
  28. Yurke, B.; McCall, S.L.; Klauder, J.R. SU(2) and SU(1,1) interferometers. Phys. Rev. A 1986, 33, 4033–4054. [Google Scholar] [CrossRef]
  29. Gatto, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Distributed quantum metrology with a single squeezed-vacuum source. Phys. Rev. Res. 2019, 1, 032024. [Google Scholar] [CrossRef] [Green Version]
  30. Gatto, D.; Facchi, P.; Tamma, V. Phase space Heisenberg-limited estimation of the average phase shift in a Mach–Zehnder interferometer. Int. J. Quantum Inf. 2020, 18, 1941019. [Google Scholar] [CrossRef]
  31. Ou, Z.Y.; Li, X. Quantum SU(1,1) interferometers: Basic principles and applications. APL Photonics 2020, 5, 080902. [Google Scholar] [CrossRef]
  32. Gatto, D.; Facchi, P.; Tamma, V. Heisenberg-limited estimation robust to photon losses in a Mach-Zehnder network with squeezed light. Phys. Rev. A 2022, 105, 012607. [Google Scholar] [CrossRef]
  33. Farace, A.; Pasquale, A.D.; Adesso, G.; Giovannetti, V. Building versatile bipartite probes for quantum metrology. New J. Phys. 2016, 18, 013049. [Google Scholar] [CrossRef]
  34. Gramegna, G.; Triggiani, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Typicality of Heisenberg scaling precision in multimode quantum metrology. Phys. Rev. Res. 2021, 3, 013152. [Google Scholar] [CrossRef]
  35. Armen, M.A.; Au, J.K.; Stockton, J.K.; Doherty, A.C.; Mabuchi, H. Adaptive Homodyne Measurement of Optical Phase. Phys. Rev. Lett. 2002, 89, 133602. [Google Scholar] [CrossRef] [Green Version]
  36. Aspachs, M.; Calsamiglia, J.; Muñoz Tapia, R.; Bagan, E. Phase estimation for thermal Gaussian states. Phys. Rev. A 2009, 79, 033834. [Google Scholar] [CrossRef] [Green Version]
  37. Berni, A.A.; Gehring, T.; Nielsen, B.M.; Händchen, V.; Paris, M.G.A.; Andersen, U.L. Ab initio quantum-enhanced optical phase estimation using real-time feedback control. Nat. Photonics 2015, 9, 577–581. [Google Scholar] [CrossRef]
  38. Zhuang, Q.; Zhang, Z.; Shapiro, J.H. Distributed quantum sensing using continuous-variable multipartite entanglement. Phys. Rev. A 2018, 97, 032329. [Google Scholar] [CrossRef] [Green Version]
  39. Guo, X.; Breum, C.R.; Borregaard, J.; Izumi, S.; Larsen, M.V.; Gehring, T.; Christandl, M.; Neergaard-Nielsen, J.S.; Andersen, U.L. Distributed quantum sensing in a continuous-variable entangled network. Nat. Phys. 2020, 16, 281–284. [Google Scholar] [CrossRef] [Green Version]
  40. Grace, M.R.; Gagatsos, C.N.; Zhuang, Q.; Guha, S. Quantum-Enhanced Fiber-Optic Gyroscopes Using Quadrature Squeezing and Continuous-Variable Entanglement. Phys. Rev. Appl. 2020, 14, 034065. [Google Scholar] [CrossRef]
  41. Grace, M.R.; Gagatsos, C.N.; Guha, S. Entanglement-enhanced estimation of a parameter embedded in multiple phases. Phys. Rev. Res. 2021, 3, 033114. [Google Scholar] [CrossRef]
  42. Gramegna, G.; Triggiani, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Heisenberg scaling precision in multi-mode distributed quantum metrology. New J. Phys. 2021, 23, 053002. [Google Scholar] [CrossRef]
  43. Triggiani, D.; Facchi, P.; Tamma, V. Non-adaptive Heisenberg-limited metrology with multi-channel homodyne measurements. Eur. Phys. J. Plus 2022, 137, 125. [Google Scholar] [CrossRef]
  44. Cramér, H. Mathematical Methods of Statistics (PMS-9); Princeton University Press: Princeton, NJ, USA, 1946. [Google Scholar] [CrossRef]
  45. Rohatgi, V.K.; Saleh, A.M.E. An Introduction to Probability and Statistics; John Wiley and Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
  46. Triggiani, D.; Facchi, P.; Tamma, V. Heisenberg scaling precision in the estimation of functions of parameters in linear optical networks. Phys. Rev. A 2021, 104, 062603. [Google Scholar] [CrossRef]
  47. Xu, S.; Darouach, M.; Schaefers, J. Expansion of det(A+B) and robustness analysis of uncertain state space systems. IEEE Trans. Autom. Control 1993, 38, 1671–1675. [Google Scholar] [CrossRef]
Figure 1. Example of a passive and linear network U ^ φ which depends on a single global parameter φ . The parameter can be thought of as a physical property of an external agent (e.g., temperature, electromagnetic field) which affects multiple components, possibly of different natures, of the network [42,43]. Reprinted with permission from ref. [42], © 2021 The Author(s).
Figure 2. Schematic diagram of the setup described in Section 2.1. The squeezed vacuum state in Equation (2) is injected in the first channel of a network composed of a first auxiliary stage V ^ in , a network U ^ φ which depends on a generally distributed parameter φ we want to estimate, and a second auxiliary stage V ^ out , before being detected through homodyne measurements in the first output port. The role of the two auxiliary stages V ^ in and V ^ out is to respectively distribute the photons of the probe through multiple channels, and then to refocus them into the only observed channel. We will show that only one auxiliary network needs to be optimized to reach the Heisenberg scaling, while, for networks with a large number of channels, the effect of the non-optimized network is typically irrelevant on the overall precision of the estimation [42]. Reprinted with permission from ref. [42], © 2021 The Author(s).
Figure 3. Polar plot of the standard deviation σ_φ (see Equation (6)) in blue, and of the Fisher information F(φ) in Equation (16) in orange, as functions of the phase θ of the measured quadrature x̂_θ, for P_φ = 1. The large values of F(φ) are reached for θ satisfying condition (14). Interestingly, for θ = γ_φ ± π/2, namely when measuring the quadrature with minimum variance, σ_φ reaches its minimum, but the Fisher information drops to zero: as a squeezing-encoding estimation scheme, this model relies on the information about φ inscribed in the variance of the measured quadrature. On the other hand, the minimum variance is a stationary point as a function of φ, and is thus locally insensitive to variations of the parameter. Reprinted with permission from ref. [34], © 2021 The Author(s).
Figure 4. Schematic diagram of the two-channel network described in Section 2.4. The linear network Û_φ is composed of a beam splitter with coefficient η_φ and two phase-shifts of magnitudes λ_φ and λ′_φ. The auxiliary stage V̂_in at the input is φ-independent, while the output stage V̂_out is optimized after a classical prior estimation φ_cl of the parameter. In particular, the quantity α_{φ_cl} = (λ_{φ_cl} − λ′_{φ_cl})/2 − π/4 depends on φ_cl only through the phase-shifts λ_{φ_cl} and λ′_{φ_cl}. Reprinted with permission from ref. [42], © 2021 The Author(s).
Figure 5. Scheme of the setup described in Section 3. A squeezed coherent state is injected in the first input port of a network U ^ φ which depends on a parameter φ that is generally distributed among multiple components of the network. Homodyne detection is performed at each of the output ports. Differently from the setup in Figure 2, no auxiliary stage is required to reach the Heisenberg scaling. Reprinted with permission from ref. [43], © 2022 The Author(s).
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
