Abstract
Quantum sensing and quantum metrology propose schemes for the estimation of physical properties, such as lengths, time intervals, and temperatures, achieving enhanced levels of precision beyond the possibilities of classical strategies. However, such an enhanced sensitivity usually comes at a price: the use of probes in highly fragile states, the need to adaptively optimise the estimation schemes to the value of the unknown property we want to estimate, and the limited working range, are some examples of challenges which prevent quantum sensing protocols to be practical for applications. This work reviews two feasible estimation schemes which address these challenges, employing easily realisable resources, i.e., squeezed light, and achieve the desired quantum enhancement of the precision, namely the Heisenberg-scaling sensitivity. In more detail, it is here shown how to overcome, in the estimation of any parameter affecting in a distributed manner multiple components of an arbitrary M-channel linear optical network, the need to iteratively optimise the network. In particular, we show that this is possible with a single-step adaptation of the network based only on a prior knowledge of the parameter achievable through a “classical” shot-noise limited estimation strategy. Furthermore, homodyne measurements with only one detector allow us to achieve Heisenberg-limited estimation of the parameter. We further demonstrate that one can avoid the use of any auxiliary network at the price of simultaneously employing multiple detectors.
1. Introduction
Due to the discrete nature of physical phenomena, the error in the estimation of physical properties, such as lengths, delays, temperatures, or refractive indexes, when employing N probes (e.g., photons, electrons) is strongly limited by the so-called shot-noise scaling factor of when a classical estimation strategy, i.e., in which the probe and the measurement employed can be fully described classically, is performed. However, it has been proven that it is possible, by exploiting quantum features such as entanglement and the squeezing of light, to overcome this classical limitation, and to reach an enhanced scaling in the precision of order , called the Heisenberg limit [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17].
A promising path towards this quantum-enhanced sensitivity is the one enabled by the squeezing of light [18,19,20,21]. Squeezed states are particular states of the electromagnetic field characterised by a quadrature field with smaller fluctuations than the field quadratures of the vacuum itself. This useful property, together with the Gaussian nature of these states which makes them relatively easy to produce, resilient to noise, and mathematically simple to manipulate [20,22], makes these states evident candidates for metrological purposes. The working principle of these protocols is rather straightforward, and can be outlined as follows. The probe, a pure squeezed state, undergoes an optical phase delay of a magnitude which depends on the unknown parameter we are interested to measure. The phase delay causes a transformation, or more precisely a rotation, of the state of the probe, that needs to be observed to infer the unknown parameter. In order to obtain the quantum enhancement, the measured quadrature field must be ‘sufficiently’ squeezed, namely it must possess a variance reduced below the vacuum fluctuations.
It is possible to highlight two major different approaches undertaken in literature, according to what type of squeezed state is employed and, ultimately, to how the information about the parameter is encoded on, and then retrieved from, the probe. In ‘displacement-encoding’ approaches [2,23], the initial overall state possesses a non-vanishing displacement—i.e., the average of the quadrature fields—so that the value of the parameter is encoded into the variation of the displacement. This approach presents the advantage that the unknown parameter can be retrieved through the measurement of the displacement of the probe, namely the mean value of the signal experimentally observed in a laboratory when performing homodyne detection. In ‘squeezing-encoding’ approaches [24,25,26], all the resources are accumulated in the squeezing of the probes, which are thus squeezed vacuum states, so that the information on the parameter is encoded into the parameter-dependent rotation of the covariance matrix. This is the optimal approach for squeezing-based estimation protocols in the sense that, for a fixed total average number of photons in a Gaussian state, the strategy that maximises the precision is to concentrate all the photons in the squeezing [24,26,27]. Since the information about the parameter is encoded in the covariance matrix of the probe, with this approach the estimation consists of retrieving the value of the parameter from the modulation of the noise. Another interesting approach involves the use of an active component for the estimation of phases, i.e., anti-squeezing the signal before the detection [28,29,30,31,32]. In this approach, the goal is generally to estimate the value of the unknown phase detecting whether the final state of the probe is different from the one injected in the network, namely, performing a projective measurement on the initial state.
Another interesting aspect of quantum metrology that has been frequently investigated is the possibility to estimate a parameter which is not localised into a single node of the network which encodes it, but distributed in an arbitrary manner among multiple components. Although this framework entails a further complication, given by the generally non-trivial encoding of the value of the parameter onto the probe, it allows for the study of estimation schemes with arbitrary structures of the networks. Since, in this approach, the network is generally described as a ‘black-box’, namely the structure of the network is not specified, the applicability of the estimation technique is universal—i.e., guaranteed for any passive and unitary evolution of the probe—covering in this way experimental situations rarely treated in literature. Moreover, the absence of a defined network structure allows to analyse general properties, such as the average performance of the estimation over some set of local transformations [33], the best possible Gaussian strategy for a multi-channel network [27], or the typicality of the Heisenberg scaling [34]. Applications range from high-precision biomedical phase imaging, quantum enhanced pattern recognition, and gravitational-waves sensing, to the mapping of external fields (magnetic and electric fields or temperatures).
However, standard Gaussian estimation protocols still present some challenges. In particular, adaptivity—i.e., the fact that a scheme for the estimation of the parameter depends on the parameter itself—is known to be a typical feature of ab initio Gaussian metrology [24,35,36,37], namely schemes for the estimation of a parameter of which no prior information is known. Intuitively, the cause of the adaptivity stems from the fact that the phase of the squeezed-quadrature which needs to be measured to achieve the quantum-enhanced precision depends on the phase acquired by the probe through the interferometric evolution in arbitrary parameter-encoding networks and, ultimately, on the unknown parameter. Moreover, in distributed quantum metrology, a further experimental challenge arises from the need to adaptively optimize the preparation of the probe and of the measurement [27]. A common approach to avoid adaptivity is limiting the range of values that the unknown parameter is allowed to take, for example, requiring that the parameter is small [38,39,40,41]. In fact, in the regime of small phases, the transformation the probe undergoes is small enough to make the rotation of the squeezed quadrature negligible, which in turn remains practically unchanged. Nevertheless, in some practical scenarios the experimenters have no control on the value of the phase—e.g., the system under investigation is unreachable or cannot be manipulated—and in such cases, this approach ceases to be feasible.
We will show here how the challenge of adaptivity, and any restriction in the range of possible values of the parameter to be estimated, can be overcome thanks to some recent results in the field of distributed quantum metrology, for the estimation at the Heisenberg scaling sensitivity of a single parameter, distributed throughout an arbitrary, multi-channel passive and linear network (see Figure 1). In particular, the unknown parameter can be thought of as a physical property of an external field—such as the temperature of the environment, or the magnitude of an electromagnetic field—which globally affects some or all of the components of the network. This review is organised in two parts, in which we introduce two distinct estimation schemes, discussing the features that differentiate the two approaches. In Section 2, we will discuss a squeezing-encoding scheme achieving Heisenberg-scaling sensitivity by employing a single squeezed vacuum state and homodyne detection at a single output channel [42]. We will assume that no prior knowledge on the value of the parameter is known, and that the structure of the network, as well as the nature of the parameter it encodes, are completely arbitrary. We will present the conditions that need to be satisfied in order to reach the Heisenberg scaling in such a generic model, and we will show that, in general, only a classical prior estimation of the unknown parameter suffices to prepare a single auxiliary linear network required to optimize the setup. These results show that it is possible to conceive feasible two-step estimation strategies, composed of a first classical estimation of the unknown parameter required to engineer the auxiliary stage, and then the actual quantum estimation. In Section 3, we describe a different scheme achieving the Heisenberg scaling, which makes use of a single-mode squeezed-coherent state as a probe, and homodyne detection in every output port of the linear network [43]. Once again, the model will assume no prior knowledge on the parameter, nor on the structure of the network. We will see that, in this case, no auxiliary stage is required to achieve the Heisenberg scaling, so that the estimation can be carried out in a single step, at the cost of employing a multiple homodyne detector. Moreover, we will show that, due to the introduction of a non-vanishing displacement in the probe, the overall precision becomes the sum of two contributions, one deriving from the information encoded in the sample mean of the outcomes of the homodyne measurements, the other in the sample covariance matrix.
Figure 1.
Example of a passive and linear network which depends on a single global parameter . The parameter can be thought of as a physical property of an external agent (e.g., temperature, electromagnetic field) which affects multiple components, possibly of different natures, of the network [42,43]. Reprinted with permission from ref. [42], © 2021 The Author(s).
2. Quantum Estimation Based on Single-Homodyne Measurements
In this section, we will describe a generic model of a M-channel linear network that allows the Heisenberg scaling to be reached in the estimation of a parameter distributed arbitrarily in , regardless of the structure of the network. Our model makes use of two auxiliary stages, namely two other linear networks and , whose purposes are to distribute the probe, a single-mode squeezed vacuum injected in the first channel, in all the optical modes of , and then to refocus it in the only channel observed through homodyne detection. We will show that only one of the two auxiliary networks must be optimized, and although the optimal choice of such network depends on the value of , the required precision for this optimization can be achieved with a classical estimation strategy. This allows us to conceive two-step estimation protocols, where a prior classical estimation to engineer the optimal auxiliary stage is followed by the quantum Heisenberg scaling-achieving estimation.
2.1. Setup
Let us consider a M-channel passive linear network, whose action on the state of the probe is described by the unitary operator , being a single, generally distributed, unknown parameter we are interested to estimate. Due to its passive and linear nature, this network can be represented by a unitary matrix. Let then, be the unitary matrix representing the action of on the annihilation operators , , associated with each channel of the network, satisfying the commutation relations , , where we denote with the Kronecker delta. The matrix is thus defined by the transformation
We will consider a single-mode squeezed state
as a probe, with squeezing operator and average number of photons, all injected in a single channel of the apparatus, i.e., the first one with the choice of squeezing parameters in Equation (2). In other words, the state presents a non-vanishing number of photons only in the first mode. As discussed in Section 1, the approach of squeezing-based estimation strategies is to infer the value of from the transformation of the covariance matrix of the state after the interferometric evolution . To do so, we will consider the model where a single output channel, say the first, is measured through homodyne detection. We will denote with the phase of the local oscillator, which coincides with the phase of the measured quadrature . We will assume that without loss of generality. We notice that, with this assumption, the squeezed quadrature of the state is , with , while the anti-squeezed quadrature field is , with . In terms of the creation and annihilation operators, the measured quadrature field can be expressed as .
Since the linear network is arbitrary, the average number of photons that can be actually detected after the interferometric evolution ranges between 0 and N. Naturally, if this number were to be small, or far from N, we would expect a sub-optimal performance of the estimation scheme, since most of the photons would come out of the network from channels that are not observed, and information on would in this way be lost. Moreover, it may happen that the transformation that it imposes on the probe in the transition to the first output port is trivial, namely that the element of the transition amplitude matrix does not depend on . This occurrence would preclude the probe from acquiring any observable information on the parameter. In order to prevent these conditions from happening, this model includes the presence of two auxiliary linear and passive networks acting on the probe, before and after the linear network (see Figure 2). The first auxiliary stage can be understood as network scattering, which distributes the probe through multiple input channels of the parameter-dependent network . The purpose of the second stage is instead to refocus the probe, after the interaction with , into the only observed channel. The unitary matrix describing the overall network is thus
given by the matrix product of the three single matrix representations , and .
Figure 2.
Schematic diagram of the setup described in Section 2.1. The squeezed vacuum state in Equation (2) is injected in the first channel of a network composed of a first auxiliary stage , a network which depends on a generally distributed parameter we want to estimate, and a second auxiliary stage , before being detected through homodyne measurements in the first output port. The role of the two auxiliary stages and is to respectively distribute the photons of the probe through multiple channels, and then to refocus them into the only observed channel. We will show that only one auxiliary network needs to be optimized to reach the Heisenberg scaling, while, for networks with a large number of channels, the effect of the non-optimized network is typically irrelevant on the overall precision of the estimation [42]. Reprinted with permission from ref. [42], © 2021 The Author(s).
Since in this model all the photons are injected in the first input channel, which is also the only channel observed at the output, the only relevant transition amplitude is the element of the overall unitary matrix in Equation (3). We can then rewrite
where is the probability that a single photon injected in the first port is detected at the first output port of the overall network, and is the phase acquired by the probe during the interferometric evolution. The Gaussian nature of the probe and of the homodyne measurements yield a Gaussian probability density function
which governs the outcomes of the homodyne detection [18,19,20,21]. The univariate Gaussian probability density function in Equation (5) is centred at zero due to the absence of a displacement in the probe, while its variance is given by (see Appendix A)
where is the phase of the local oscillator.
Once the probability density function is known, it is possible to evaluate the Fisher information [44,45]
of the estimation scheme, which in turn fixes the ultimate precision achievable in the estimation of through iteration of the measurement, given by the Cramer-Rao bound [44,45]
For a Gaussian distribution centred on zero, the Fisher information reads (see Appendix B)
where . We notice from Equation (6) that all the information on the parameter is encoded in the variance of the measured quadrature through the two quantities and . Thus, we can split into two contributions, one containing the derivative of , the other the derivative of , namely
where and are derivatives with respect to and , respectively, so that
2.2. Heisenberg Scaling
Generally, without imposing any condition on the setup, this model does not achieve the Heisenberg scaling in the precision for the estimation of . In fact, we can explicitly rewrite the variance in Equation (6) in terms of the average number of photons in the probe
where is a term of order equal to or smaller than 1, negligible in the asymptotic regime of N large (We will say that given two functions and , when ). We can also rewrite the derivatives and in terms of N
Plugging the asymptotics shown in Equations (12) and (13) into the expression of the Fisher information in Equation (9), we notice that the numerator of can be of the order at most. Since the denominator is in general of the order as well, it yields an overall general scaling of the Fisher information of —i.e., even lower than the SQL.
In order for this setup to reach the Heisenberg scaling, we thus need to impose some constraints which prevent the denominator of the Fisher information, i.e., the variance , to grow with N. We show in Appendix C that the asymptotic conditions (Given a function and a finite sum of powers of N, we will say that when they show the same asymptotic behaviour. In formulas, when , , with s exponent of the smallest power of N appearing in the sum )
need to be satisfied for large N, with and arbitrary constant independent of N. We will discuss more in detail the physical meaning of these conditions in Section 2.3, and we will see that Equation (14) is a minimum-resolution requirement on the tuning of the local oscillator, while Equation (15) is the condition on the refocusing of the probe. Intuitively, Equations (14) and (15) assure that at the denominator of does not grow as fast as at the numerator: instead, we can see in Appendix C that, when these conditions hold, the variance becomes of the order , while its derivative remains constant for large N. In particular, we show in Appendix C that the Fisher information in Equation (9) asymptotically reads
proving the achievement of the Heisenberg scaling, with
positive factor reaching its maximum value for and , namely .
Compared with the conditions found in the literature for single-parameter Gaussian estimation schemes based on squeezed-vacuum probes which, adjusted to the notation employed so far, can be translated into and [26,27], we see that Equations (14) and (15) achieve two important further results
- It is possible to loosen the optimal conditions found in literature, which still allow us to reach the Heisenberg scaling, at the price of a multiplying factor which does not depend on N and hence does not ruin the scaling of the precision;
- These conditions are explicitly expressed in terms of the average number N of photons in the probe and, therefore, in terms of the precision we want to achieve. In Section 2.3, we will discuss how this allows us to assess the precision needed to engineer suitable auxiliary stages and to reach the Heisenberg scaling, showing that it is possible to avoid an iterative adaptation of the optical network.
Lastly, we recall that it is always possible to asymptotically saturate the Cramér-Rao bound in Equation (8) in the limit of samples with a large number of observations. In particular, the maximum-likelihood estimator is an asymptotically efficient and Gaussian estimator which can be obtained through the maximisation of the Likelihood function
associated with the set of the measurement outcomes of the quadrature field [44,45]. In Appendix D we see that the non-trivial solution which maximises the Likelihood function in Equation (18) is simply given by the estimator satisfying
where is the variance in Equation (6) as a function of , and is the usual sample variance
Generally, Equation (19) cannot be solved analytically, so that numerical methods need to be employed to find non-trivial solutions. Nevertheless, it is possible to find some exceptions, particularly for elementary functional dependencies of and on the unknown parameter . For example, in the case for which is independent of , and the functional dependence of on of the phase acquired by the probe is invertible, the function in Equation (19) can be easily inverted as well, and the maximum-likelihood estimator reads
We can notice how, due to the presence of the cosine in in Equation (6), some prior knowledge on the parameter is required in order to correctly choose the invertibility interval for —i.e., to choose the correct value of and the sign of the arccos function in Equation (21). In the next section, we will see how a classical prior knowledge of the parameter is required to satisfy condition (15), achievable with a prior coarse estimation reaching an uncertainty of the order of . Such prior knowledge on the parameter, for a large enough N, can be employed to choose the correct invertibility interval.
2.3. Conditions for the Heisenberg Scaling
We can see that both conditions in Equations (14) and (15) are -dependent, suggesting that an adaptive procedure must take place in order to employ the estimation scheme described in Section 2.1, as it is customary for ab-initio Gaussian estimation strategies [24,27,35,36,37]. However, some considerations can be made in this regard.
Condition (14) fixes the phase of the quadrature which needs to be measured. The quantity is in fact the phase acquired by the squeezed vacuum during the interferometric evolution from the first input port to the first output port, and for Equation (14) resembles the condition found in the literature for single-phase estimation [24]. On the other hand, condition (14) is a looser condition to reach the Heisenberg scaling, and it puts in relation the precision with which we are able to choose the phase of the local oscillator—given by the resolution of the homodyne detection apparatus—with the precision achievable in the estimation of . In particular, it is evident how the minimum resolution for the homodyne detector required to reach an uncertainty of order must be, in turn, of order . This is in agreement with the common notion in metrology for which a sensor cannot detect changes in the quantity that is being measured which are smaller than its resolution.
Interestingly, we notice from Equation (14) that the constant k cannot be equal to zero. Counterintuitively, the value coincides with the choice of measuring the quadrature , namely the minimum-variance quadrature after the squeezed vacuum undergoes a phase-shift of magnitude , i.e., after the interferometric evolution given by in Equation (3). This apparent incongruity can be explained by observing the expression of in Equation (10). Differently from displacement-encoding approaches—in which the information on the parameter is obtained from the transformation of the displacement of the probe, and thus minimizing the noise of the signal is always the optimal choice—here the value of is encoded in the variance of the quadrature itself. For us to be able to extract information on the parameter, the variance of the signal needs to be sensitive to small variations in —i.e., the derivative must be non-vanishing. Of the two contributions of in Equation (10), the one originating from the variations in is identically vanishing when condition (15) is satisfied, since for close to its maximum. The remaining contribution derives from the variations in the overall phase , and the variance of the maximally squeezed quadrature is a stationary point with respect to variations in the phase and is thus insensitive to , namely for in Equation (11) (See Figure 3).
Figure 3.
Polar plot of the standard deviation (see Equation (6)) in blue, and of the Fisher information in Equation (16) in orange as functions of the phase of the quadrature measured, for . The large values of are reached for , satisfying condition (14). Interestingly, for , namely when measuring the quadrature with minimum variance, reaches its minimum, but the Fisher information drops to zero: as a squeezing-encoding estimation scheme, this model relies on the information about inscribed in the variance of the quadrature measured. On the other hand, the minimum variance is a stationary point as a function of , and thus is locally insensitive to the variations of the parameter. Reprinted with permission from ref. [34], © 2021 The Author(s).
Condition (15) is the requirement that most of the photons injected into the network end up in the observed output channel. In fact, this condition can be rewritten in terms of the average number of photons that are not correctly refocused . Thus, condition (15) tells us that the number of photons which are not observed must be a constant ℓ, not growing with N. In other words, this condition assures that most of the information on encoded in the probe is not lost in channels that are not observed. As a matter of fact, we can see from Equation (6) that the variance of the observed quadrature after the interferometric evolution is the convex combination of the variances of a squeezed vacuum and the pure vacuum, with coefficients and , respectively. In order for this variance to be ‘squeezed’, in the sense that it is of order , the contribution from the pure vacuum must be of order , namely .
This condition can also be seen as a requirement of the performance of the refocusing stage . In fact, in order to satisfy condition (15) for a given choice of , the auxiliary stage must be chosen so that . As discussed earlier, this implies that, in general, the auxiliary stage which satisfies this condition depends on the value of the parameter itself, requiring an adaptive approach to find an optimal refocusing stage to reach the Heisenberg scaling. We show now that the information on required to engineer an adequate refocusing stage to reach the Heisenberg scaling can be obtained through a classical estimation strategy, namely that which achieves the scaling typical of the shot-noise limit. This result is due to the structure of , which is essentially a transition probability between the unitary vectors and , with . Hence, a small tilt of order between the unit vectors and yields a quadratic reduction in their transition probability. To prove this, we will call the rough guess of the value of that is sufficiently precise to engineer a refocusing stage which satisfy condition (15), and we will show that the estimation strategy to obtain this rough estimate of is classical, namely that the error associated with the prior rough estimation is allowed to be of order . For a given choice of and ℓ, we will call a solution of Equation (15). The single-photon transition probability appearing in this condition can be written as the squared complex modulus of the scalar product of two M-dimensional complex vectors and , with . We can then write
where the transition probability is a smooth function of and , with a locus of points of maxima along the condition , since, for a perfect knowledge of the parameter the auxiliary stage, would satisfy . If the prior estimation slightly deviates from the real value of the parameter , we can write the expansion
where the derivative of is zero along the condition . We can see, comparing Equations (23) and (15), that an error of order suffices to correctly engineer a refocusing stage that allows for the Heisenberg scaling. It is then possible to conceive two-step ab initio protocols exploiting the model presented in this section: a first, coarse, classical estimation of the parameter is performed and the rough estimate is obtained, with an error of the same order of the shot-noise limit. Then, the classical information obtained on can be employed to engineer the refocusing stage , once is fixed, so that the overall network satisfies condition (15), allowing us to reach the Heisenberg scaling through the quantum strategy described in Section 2.1.
Lastly, we notice that, in order to satisfy condition (15), it is also possible to optimize the input auxiliary stage while arbitrarily fixing the refocusing stage . In such a case, identical considerations can be made regarding the possibility of a two-step protocol, since the optimization still requires only a classical coarse estimation of the parameter. Interestingly, only one of the auxiliary stages needs to be optimized, and thus depends on , whether it is or . This leaves the choice of the second auxiliary stage completely arbitrary, notwithstanding that the pre-factor appearing in the Fisher information in Equation (16) is not vanishing. Indeed, the condition corresponds to the situation in which the optimized network acts trivially, namely without imprinting any information about , on the probe. Remarkably, it has been shown that, for a random choice of the non-optimized auxiliary network, sampled uniformly within the set of all the possible linear networks, the pre-factor multiplying the scaling in the Fisher information is typically different from zero [34]. In other words, within certain non-restrictive regularity conditions and for linear networks with a large enough number of channels, the value of the pre-factor becomes essentially unaffected by the choice of the non-optimised auxiliary network. This important feature can be exploited for experimental applications, for example, employing the arbitrary non-optimised stage to manipulate the information encoded into the probe regarding the structure of a linear network with multiple unknown parameters, ultimately allowing the choice of functions of such parameters to be estimated at the Heisenberg scaling sensitivity [46].
2.4. A Two-Channel Network
In this section, we will apply our model for the estimation of distributed parameters to a particular example of a 2-channel network, in which the unknown parameter influences the reflectivity of a beam-splitter and the magnitudes and of two phase-shifts (see Figure 4). We can think of the global parameter as an external physical property, such as the temperature or the magnitude of the electromagnetic field, affecting the components of the network . We will suppose that the functional dependence of the phase-shifts , and of the reflectivity on the true value of the parameter are known and smooth, whether given by some law of nature or opportunely engineered. With reference to Figure 4, we can write the matrices representing the action of the beam-splitter and the phase-shifts as
respectively, where , , is the i-th Pauli matrix and is the identity matrix, so that the network is represented by the matrix
Figure 4.
Schematic diagram of the two-channel network described in Section 2.4. The linear network is composed of a beam splitter with coefficient and two phase-shifts of magnitudes and . The auxiliary stage at the input is -independent, while the output stage is optimized after a classical prior estimation of the parameter. In particular, the quantity depends on only through the phase-shifts and . Reprinted with permission from ref. [42], © 2021 The Author(s).
We easily notice that , which, in general, is different from one and thus does not satisfy the condition (15), with the exception of the two values which correspond to the absence of the mixing between the two modes. As described in the model earlier, we then add two auxiliary stages and , of which only one depends on a prior coarse estimation of realised with a classical strategy, so that . In particular we choose as input stage
and as output stage
where is a quantity which can be obtained through a classical estimation of . A straightforward calculation of yields
where and are the error in the estimates of and due to the imprecision of the classical estimation . We can then easily see that in Equation (28) satisfies condition (15), since both the errors and are of order ,—i.e., , and similarly for —and thus we obtain from Equation (28)
In order to evaluate the Fisher information in Equation (16), we need to calculate both the phase acquired by the probe throughout the whole interferometric evolution , and the coefficient ℓ. The phase is easily obtained as the complex phase of
Since , we call h the finite N-independent constant such that . The transition probability can then be written as
so that the factor appearing in the Fisher information in Equation (16) is easily evaluated comparing Equations (15) and (31). The Fisher information can be obtained from Equation (16), with given by Equation (30), and given by Equation (17), with k given by the condition on the local oscillator phase and .
We notice from the expression of that the unknown reflectivity of the beam splitter does not influence the refocusing stage , but it appears in the phase acquired by the probe in Equation (30). In particular, if the two phases and are vanishing, the dependence of , and thus of the refocusing stage , on the classical estimation of the parameter disappears completely. In other words, this network for transforms the reflectivity of a beam splitter into the magnitude of a phase shift, independently from .
3. Quantum Estimation Based on Multi-Homodyne Measurements
In Section 2 we have presented a scheme for the estimation of a distributed parameter encoded in a multi-channel network, reaching the Heisenberg scaling employing a squeezed vacuum state and homodyne detection performed at a single output channel. In particular, in Section 2.2, we have discussed in depth about the conditions in Equations (14) and (15) which need to be satisfied to reach the Heisenberg scaling: Equation (14) imposes a minimum resolution in tuning the local oscillator phase, in order to infer the value of the parameter from the noise of a sufficiently squeezed quadrature. Equation (15) is a requirement on the refocusing of the probe into the only observed channel and, in order to be satisfied, a classical knowledge of the parameter is generally required to engineer the optimal refocusing network. A natural question that arises is whether it is possible to ease these conditions by carrying out some changes on our scheme. In particular, what would happen if homodyne measurements were performed, not only at a single channel, but at all the output ports of the network instead? Would condition (15) become looser, allowing us to engineer the optimal auxiliary stages with even less information on the unknown parameter? Moreover, as we have already discussed in Section 1, a non-vanishing displacement in the probe would make it possible to infer the value of the parameter directly from the average of the quadrature with minimum variance, and not from its noise, which may be a more feasible approach in particular experimental scenarios. In this section, we will investigate a model that implements these changes (see Figure 5). We will see that, with these assumptions, not only is there no need to perform a prior estimation of the parameter to optimize the network, but the Heisenberg scaling can be achieved without employing an auxiliary stage in the first place. Moreover, the presence of displacement in the probe will allow us to perform the estimation directly measuring the minimum-variance quadrature, relying on the information about the parameter encoded in the average signal of the homodyne, and not in its noise.
Figure 5.
Scheme of the setup described in Section 3. A squeezed coherent state is injected in the first input port of a network which depends on a parameter that is generally distributed among multiple components of the network. Homodyne detection is performed at each of the output ports. Differently from the setup in Figure 2, no auxiliary stage is required to reach the Heisenberg scaling. Reprinted with permission from ref. [43], © 2022 The Author(s).
3.1. Setup
We will consider a linear and passive network , which depends on a single and generally distributed parameter, for example affecting several components of the network, as shown in Figure 1. Once again, admits a unitary matrix representation obtained through Equation (1). Differently from Section 2 though, we consider as a probe the single-mode squeezed state
with , i.e., a squeezed coherent state with a mean number of photons , , where we can introduce the displacement , given by , . We remark that the choice is a specific (-independent) condition which is required in displacement-encoding approaches: squeezing and displacement can, in general, have different complex phases, and the condition assures that the squeezed quadrature ( for ) is orthogonal—i.e., conjugated—to the displaced quadrature ( for ), and hence it is the most sensible to the presence of phases. Moreover, in our model, we will consider homodyne detection in all M output channels of the linear network, so that M quadrature fields are measured, where is the phase of the i-th local oscillator, .
Since we are observing all the output ports of the network, but a non-vanishing number of photons is injected only in the first channel, only the first column of the unitary matrix is relevant in our model, consisting of the transition amplitude of single photons from the first to every channel of the network (see Equation (1)). We can thus employ the parametrisation
where is the probability that a single photon is transmitted through the linear network from the first to the i-th channel, and is the phase that it would acquire (In order to keep the notation lighter, we are dropping the subscript in , , and . Nonetheless, it rests assured that these quantities, in general, depend on the unknown parameter). Once again, due to the Gaussian nature of the model, the (joint) probability distribution associated with the outcome of the M homodyne detectors is also Gaussian and reads [18,19,20,21]
In Equation (34), both the covariance matrix and the mean depend on the parameter . The elements of the covariance matrix are evaluated in Appendix A, and read
where is the Kronecker delta, is the phase acquired at the output of the i-th channel relative to the correspondent local oscillator, and is the mean vector (see Appendix A)
The determinant can also be written in compact form (see Appendix A)
For a multivariate Gaussian distribution of the form shown in Equation (34), the Fisher information can be easily evaluated from its definition in Equation (7) and employ the expression of the probability distribution in Equation (34) as (see Appendix B)
where is the cofactor matrix of and denotes the trace of the matrix A. Compared with the Fisher information shown in Equation (9) for the model described in Section 2, the Fisher information for this setup includes an additional term which depends on the derivative of the average with respect to the parameter . Moreover, the contribution , representing the information on encoded in the covariance matrix , can be split into two terms, of which the first resembles the Fisher information in Equation (9) once we perform the substitution : we will show in the following that, in the asymptotic regime of large , this is the only contribution of which, besides , reaches the Heisenberg scaling.
3.2. Heisenberg Scaling
Similarly to what happens to the setup described in Section 2, the Fisher information in Equation (38) does not generally reach the Heisenberg scaling unless certain conditions are met. To show this, it is necessary to evaluate all the contributions of in terms of the number of photons and in the asymptotic regime. First, it is convenient to express the cofactor matrix C explicitly in terms of the squeezing parameter r. In Appendix B, we see that a closed form for C in such terms exists
where
and are shown in Equation (35). We then notice that, in order to analyse the generic asymptotic behaviour of the Fisher information in Equation (38), it suffices to separately examine the asymptotics of the terms , , , , and of their derivatives.
From Equations (35)–(37), (39) and (40) we see that , S and are all of order in general, while is of order . The same asymptotic behaviours are also kept for their respective derivatives, since both and are independent of the probe, and thus of and . We can thus see from the expression of in Equation (38) that its numerator grows at most with , while the denominator—i.e., the denominator —in general grows with . Therefore, in order for to reach the Heisenberg scaling, namely a scaling of order , some conditions must be imposed so that the denominators in and do not grow for large . In Appendix C, it can be seen that, to achieve the Heisenberg scaling, the determinant must be of the order , similarly to what happens to in the single-homodyne setup in Section 2.2, and the conditions for this to occur are
When these conditions hold, we can introduce the finite (possibly vanishing) quantities , so that the determinant reduces to (see Appendix C)
while , , and C tend to constant values, and scales as . We can easily see that this makes only the first two terms of in Equation (38) dominant for large and . When conditions (41) are met, we can thus neglect the last term in Equation (38), and the Fisher information
asymptotically reaches the Heisenberg scaling in and (see Appendix C), where we introduced the quantities
while
are positive, even functions which reach their maxima at and , respectively, namely and .
It is now possible to compare the Fisher information in Equation (43) achieved with the present scheme, with the Fisher information in Equation (16) obtained with the setup for distributed metrology employing a squeezed vacuum state and homodyne detection on a single channel. Since this setup employs a squeezed probe with a non-vanishing displacement, it lends itself to both displacement-based and squeezing-based estimation approaches. This is reflected by the presence of two separate contributions to the Fisher information in Equation (43): the first is given by the variations in the displacement , the second by the variations in the determinant , with respect to changes in the value of . Both terms present a pre-factor shown in Equation (44) which resembles the pre-factor in Equation (16) for the single-homodyne counterpart: in particular, is a weighted average of the sensitivities of the complex phases to changes in the parameter , each one weighted by the ‘fraction’ of the probe undergoing the phase shift . Indeed, we can see that reduces to the single-homodyne counterpart when the whole probe is refocused into a single channel—say the first—so that and for , i.e., .
In the term of the Fisher information associated with the squeezing-encoding in Equation (43), the factor coincides in turn with the function in Equation (17) for —i.e., the ideal case of perfect refocusing of the probe, and all photons being observed—and after the substitution , namely the weighted average of the coefficients with the same weights . In other words, this term can be thought of as a generalisation of the Fisher information for a single homodyne due to the presence of multiple observed channels. Noticeably, this term can be set as equal to zero only for , in which case, the whole expression in Equation (43) vanishes, ruining the Heisenberg scaling, or for . In fact, we can see from Equation (37) that the condition —i.e., , namely when the quadratures with minimum variances are measured at each channel minimises , which becomes a stationary point with respect to the variations in , and in Equation (43) vanishes. On the other hand, the first contribution to Equation (43) is instead a new term not present in the Fisher information for the single homodyne, deriving from the information on the parameter encoded in the displacement . This term achieves the Heisenberg scaling in , in the sense that it reaches the Heisenberg scaling in if both and grow with N, i.e., if and with independently of N. The function in Equation (45) does not have roots, hence the first contribution to vanishes only for , i.e., for a squeezed vacuum as a probe.
Finally, we will now write the Likelihood function for the setup described here, and discuss the maximum-likelihood estimator saturating the Cramér-Rao bound shown in Equation (8). After performing measurements of the field quadratures , the Likelihood of the outcomes is given by
with found in Equation (34). In Appendix D, we show that the maximum-likelihood estimator, a non-trivial solution of the maximisation of the Likelihood function in Equation (46), is implicitly given by the estimator which satisfies the equation
where is the covariance matrix in Equation (35), and the mean vector in Equation (36). This equation cannot, in general, be solved analytically, hence numerical methods typically need to be in place to find . On the other hand, Equation (47) simplifies for certain particular cases. For example, when measuring all the minimum-variance quadratures so that in Equation (41), and the probabilities are independent of , we can see from Equation (35) that . In this case, the Likelihood Equation becomes
where the term in the right-hand side can be seen as a linear combination of the quantities , where are estimators of the mean . We see here how the displacement of the probe allows us to perform the estimation of through the sample mean , i.e., the average of the outcomes of the homodyne measurements. On the other hand, for a squeezed vacuum such as a probe, , so that Equation (47) becomes
where it is possible to recognise the sample covariance matrix , estimator of the covariance matrix .
3.3. Conditions for the Heisenberg Scaling
For the model introduced in this section, some considerations regarding the conditions in Equation (41) can also be drawn, especially in light of the feature of the single-homodyne, squeezed-vacuum scheme discussed in Section 2.3.
In fact, these conditions do nothing but fix the phases of the quadratures that need to be measured to achieve the Heisenberg scaling. Regarding the minimum resolution of the homodyne sensors, it appears that there is no evident advantage in this setup compared to the single-homodyne scheme, since the resolution required to tune the local oscillators of each channel is still of order . On the other hand, introducing displacement in the probe allows, as previously discussed, the value of the parameter to be inferred from the information inscribed in the average of the measured quadratures . Therefore, it is possible with this scheme to exactly measure the minimum-variance quadratures , a possibility that was prevented in the previous scheme due to the requirement in Equation (14). Although the situation for which for sets to zero the contribution in Equation (43), associated with the information encoded in the covariance matrix , the first contribution of the Fisher information still reaches the Heisenberg scaling.
Another interesting feature of this protocol, which differentiates it from its single-homodyne counterpart, is that it does not require any adaptation of the network: since every output channel is observed, no condition on the transition probabilities is required. Therefore, not only there is no requirement for a -dependent auxiliary stage, but there is no need for an auxiliary stage to reach the Heisenberg scaling in the first place. However, the precision in the estimation is still affected by the network through the terms and , which appear in the constant factors multiplying the scaling in the Fisher information in Equation (38). In fact, we can see from Equation (45) that these two quantities depend on the transition probabilities and on the derivatives of the complex phases . In particular, can be vanishing for exceptionally poorly conceived networks, for which : for example, a network for which is independent of for all values of i such that , is associated with a vanishing factor , as we can see from its definition in Equation (44). In this case, adding a -independent auxiliary network V, either at the input or at the output of , would modify the transition amplitude of the overall interferometric evolution, and ultimately yield a non-vanishing value of .
4. Conclusions
The recent advances in quantum metrology made possible the realisation of protocols achieving super-sensitivity in the estimation of optical lengths, time delays and space-time distortions due to gravitational waves, refractive indices in given materials, density and thickness of biological samples up to the nanometer scale, temperatures, polarisations of light, magnitudes of fields and their gradients, and more. However, current quantum sensing technologies still present some limitations, whether caused by the requirement of adaptively optimising the estimation procedure to the unknown value of the parameter that needs to be measured, by the fragility of the metrological resources which yield super-sensitivity, or by the scarce operating range of the protocols, impracticalities arise when tackling the metrological schemes with quantum mechanics. Moreover, most standard approaches to distributed quantum metrology suffer from a lack of universal estimation schemes, namely those which can operate independently of the unknown parameter nature and value, and of the unitary evolution which encodes the value of the parameter into the probe employed for the estimation.
In this work, we have reviewed in detail two schemes which address these limitations [42,43]. By employing analyses based on the Cramér–Rao bound, i.e., the ultimate precision achievable for a given estimation scheme, and on the Fisher information, we were able to assess the super-sensitivity of various feasible metrological setups, always achievable in the regime of large statistical samples through the maximum-likelihood estimator. We showed that, without making any assumptions on the structure of the multi-channel passive and linear network encoding the unknown parameter, which can as well be distributed among multiple components of the interferometer (such as temperature affecting the network in a distributed manner) it is possible to relax the adaptivity of the network to a requirement of classical knowledge—i.e., not at the quantum limit—on the parameter. This prior knowledge serves to engineer an auxiliary stage needed to perform the super-sensitive estimation with a single squeezed-vacuum state and homodyne detection at a single output port. Moreover, the adaptivity on the network can actually be completely circumvented when all the output ports of the network are observed with homodyne detectors. We discussed how this allows us to conceive estimation schemes at the Heisenberg limit, which can be performed with a single optimization step, when a classical knowledge of the parameter is needed to engineer the auxiliary stage, or without any optimization at all, when no prior knowledge is required.
Author Contributions
Both authors contributed to the paper and the obtained results. The project was concieved and managed by V.T. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported by the Office of Naval Research Global (N62909-18-1-2153).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Not applicable.
Acknowledgments
We acknowledge our collaborators in this research P. Facchi, G. Gramegna and F. A. Narducci.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A. Probability Distributions from Homodyne Measurements
In this appendix, we will obtain the probability density functions which govern the outcomes of homodyne detections performed on a single-mode squeezed state injected in the first port of a M-channel passive and linear network , with , and , so that , with , and sum of the average number of photons in the displacement and in the squeezing of the state . We refer to refs. [18,19,20,21] for reviews on Gaussian states, and how to describe and manipulate them mathematically. The state is a Gaussian state which is completely described, through its Wigner distribution
by its vector of displacement , and its matrix of covariances . Moreover, passive and linear networks preserve the Gaussian nature of . For this reason, we will first obtain in generality the final displacement and covariance matrix of the state . Then, we will marginalise the Wigner distribution, rotated by the symplectic and orthogonal matrix
accordingly to the phases of the local oscillators of the homodyne. In doing so, we will be able to obtain the expressions in Equations (6), and (35)–(37).
The action of a passive and linear unitary on a state is described by a symplectic rotation of the state Wigner distribution [18,19,20,21]. The Wigner distribution of the state after the interferometric evolution is thus obtained, rotating the initial Wigner distribution found in Equation (A1) with the orthogonal and symplectic matrix
where U is the unitary matrix representing the transition amplitudes which can be found through Equation (1), thus obtaining the transformation
Since the probe in our model is Gaussian, the transformation of the state is completely described by the rotations of the displacement and of the covariance matrix of the initial state, with , , and . A straightforward calculation shows that
and that
where we have defined the matrices
We can see that the only elements of U entering in Equation (A5) are the ones in the first column , since these are all the transition amplitudes from the only populated input port (namely the first) to every output port. We can easily see that the same is true for the matrices in Equation (A7). In fact, exploiting the relations
where is the Kronecker delta, we can evaluate
We can thus parametrise only the first column of U, introducing the quantities and for , which are respectively the transition probabilities from the first input to the i-th output port, and the complex phases of the relevant transition amplitudes, so that . We can then rewrite the displacement in Equation (A5) and the covariance matrix in Equation (A6) in terms of and
and
Finally, we notice that if we are performing homodyne to measure the quadratures , where are the phases of the local oscillators at each output channel, this corresponds to apply a further unitary rotation on the state . This corresponds to substituting in the expression of R in Equation (A3), whose effect is to add a phase to every elements of the i-th row of U. Thus, if we define , we can write the displacement and the covariance matrix of the (accessory) state
and
with
If we are performing homodyne detection only on a single channel, say the first, as it is done, for example, in Section 2, we need to first trace away the channels we are not observing. The probability density function is thus given by the marginal of the Wigner distribution of the state associated with the first symplectic variable and, equivalently, a Gaussian distribution with average given by the first element of in Equation (A12)
and variance given by the first row, first column element of in Equation (A13)
For a squeezed vacuum state, and we recognise the expression of the variance in Equation (6) from Equation (A16).
If, instead, we are observing all M output channels of the network U, the probability density function will be a Gaussian distribution with average given by the first M elements of
from which we recognise Equation (36), while the covariance matrix is given by the first M rows and columns of , i.e., by the matrix in Equation (A14a)
from which we obtain the expression in Equation (35).
To evaluate the determinant in Equation (37), we first notice that the covariance matrix of the initial state can be written as the sum , where is the identity matrix and is a diagonal matrix of rank 2. Since the rank is invariant under orthogonal rotations, the same holds true for , with . By definition of rank, none of the sub-matrices of K can have rank greater than 2, hence we can write
with , since A is a submatrix of K
We can thus apply the results in Appendix E to and write as a sum of determinants of the matrices obtained replacing any number of columns of , with the respective columns of A
where the first term is the determinant of , the terms in the first summation are determinants of the matrices obtained by substituting the i-th column of with the i-th column of A, and the terms in the last summation from substituting the i-th and j-th columns, while also exploiting the symmetry of A. Noticeably, since rank , all the contributions involving the replacement of three or more columns of A are vanishing for the definition of rank of a matrix. Replacing the expression of A found in Equation (A20) into Equation (A21), the expression in Equation (37) can then be easily obtained after some elementary trigonometry.
Appendix B. Fisher Information for Gaussian Probabilities
In this Appendix, we will evaluate the Fisher information associated with the estimation of a parameter from a M-variate Gaussian distribution
where we assume that both the covariance matrix and the average depend on the parameter .
Recalling the definition of the Fisher information in Equation (7), we first evaluate the logarithmic derivative
In order to evaluate the Fisher information in Equations (9) and (38), we first notice that the expression contains terms of order up to fourth in . Obtaining the Fisher information thus reduces to the evaluation of the expectation values of a polynomial in , for which we can exploit standard results of Gaussian integrals:
from which we see that only even powers of the term have a non-vanishing contribution. We can thus evaluate
By substituting the expressions in Equations (A24) into Equation (A25), and exploiting the Jacobi’s formula for the derivative of the determinant of square matrices which assures that
we see that most of the terms in Equation (A25) cancel out, yielding the Fisher information
We can easily reduce Equation (A27) to the Fisher information in Equation (9) for the single-homodyne case with .
To obtain the FIM in Equation (38), we first exploit the relation
and thus we express the covariance matrix in terms of its cofactor matrix, i.e., , where we have exploited the symmetry of so that . We can thus rewrite, in the single-parameter case
We can recognise in the second term of Equation (A29) the Jacobi’s formula for the derivative of the determinant in Equation (A26), which allows us to obtain the expression for the Fisher information shown in Equation (38)
In order to elicit the cofactor matrix C in terms of the elements of and thus obtain Equation (39), we first need to make some observations. First, the -cofactor is defined as the determinant of the sub-matrix of obtained by deleting its s-th row and t-th column, then multiplied by . This can be equivalently thought of as the determinant of the matrix , where we denote with the matrix obtained from the matrix X replacing all its elements in the the s-th row and in the p-th column with zeros, with the exception of the element which is replaced by n, namely
For the particular case of the covariance matrix in Equation (35), as discussed in Appendix A, we can write , with A symmetric matrix such that given in Equation (A20). Thus, we can write the -cofactor as
and evaluate this determinant with the methods discussed in detail in Appendix E. In particular, is obtained as a sum of determinants of matrices obtained by replacing any possible set of columns of with the columns in the same positions of . Noticeably, by substituting a row and a column of A with an arbitrary row or column, in general increases its rank by one, so that . To evaluate the cofactor matrix C, it is thus convenient to consider separately the simpler case first, and then .
For in Equation (A32), the cofactor becomes . We notice that the matrix is a diagonal matrix with a single zero eigenvalue—i.e., the s-th—and thus each contribution to is non-vanishing only if the s-th columns of —a column of only 0s—is replaced. Moreover, the s-th column of is also a column of all 0s, with the exception of the s-th element, which is equal to 1. We thus obtain
which is the sum of terms obtained substituting the s-th, the s-th and i-th, and the s-th, i-th and j-th columns, respectively. Noticeably, replacing more than 3 columns yields vanishing contributions, since . When , is still a diagonal matrix but with two zeros in the diagonal, more precisely the s-th an the t-th elements. Both the s-th and the t-th columns are thus columns of zeros, and all non-vanishing contributions to must replace these two columns. For example, the only contribution obtained by swapping the s-th and t-th columns is of the type
exploiting at the same time the symmetry of A. The contributions obtained by swapping the s-th, t-th and i-th columns, with , are of the type
where once again, we are exploiting the symmetry of A. Once again, substituting more than 3 columns yields no contributions, since . The final expression for , thus reads
Replacing in (A33) and (A36) the definition of in Equation (A32), it is straightforward to obtain Equation (39).
Appendix C. Asymptotic Analyses of Gaussian Metrology
In this appendix, we will perform all the asymptotic analyses for a large number of photons N in the setups presented in Section 2 and Section 3.
Appendix C.1. Single Homodyne
Here, we evaluate the asymptotic expressions of the Fisher information in Equation (16), showing that the conditions in Equations (14) and (15) yield the Heisenberg scaling for the estimation of the respective quantities of interest.
As shown in Equations (10) and (11), the dependence of the variance
on the parameters only appears through the transition probability and the acquired complex phase
where and represent the differentiation with respect to and , and
To achieve the Heisenberg scaling, some conditions must be imposed so that the variance in Equation (A37) does not grow with N. The only option to do that without ruining the sensitivity of the setup is requiring that , as we can see from Equation (A37). In particular, we impose condition (14), and evaluate the variance in (A37) and its gradient (A38) in the large N limit
Then, by imposing condition (15) on , we get
By substituting the expressions in Equation (A42) into the Fisher information in Equation (9), we can finally evaluate its asymptotic expression
with
a positive and N-independent pre-factor.
Appendix C.2. Multiple Homodyne
In this appendix, we will study the asymptotic regime of the Fisher information in Equation (38),
for large . We will show that conditions (41) are necessary to reach the Heisenberg scaling, and that, when they hold, the asymptotic expression for the Fisher information is the one shown in Equation (43).
As discussed in Section 3.2, all terms in the numerators of the Fisher information in Equation (A45) are at most of the order or . We thus need to study the asymptotic behaviour of the determinant in Equation (37), which is at the denominators of the FI, and find what conditions prevent from growing with as well, ruining the Heisenberg scaling. In particular, in order to reach the Heisenberg scaling, namely a scaling of the order of in the FI, the determinant must be at most of the order .
Thus, we first focus our attention on in Equation (37)
In particular, we will suppose that the asymptotic behaviour of the complex phases for large is given by , with independent of N and (Notice that this condition is not restrictive, since if were to grow with N (i.e., ), it would yield an oscillating asymptotic behaviour of and every other term appearing in the FI). By expanding in powers of the terms in which the squeezing parameter r appears in (A46) we obtain
where
To prevent the scaling with of the determinant , must be set as equal to, or tending to, zero. After some trigonometry, and exploiting the fact that due to the unitarity of the network, we can rewrite
which tends to zero only if both terms in the parenthesis tend to zero, i.e., when , with . To analyse the order for large of the determinant when the condition holds—and thus —we need to analyse the behaviour of each term in Equation (A48). Since , and , both and scale with , while is asymptotically constant. Thus, we find the asymptotic behaviour of through Equation (A47): the term is always negligible with respect to , while for the term dominates over so that is of order , and for instead dominates and the determinant scales with . In equations:
Noticeably, the determinant stops growing with only for .
We thus now study the asymptotic behaviour of the numerators appearing in the Fisher information in Equation (A45), when the condition , with , is true. We will perform the same analysis done for earlier, considering only the dominating term for every value of . We first obtain the derivative of from Equation (35), substituting , with ,
and we notice that each element of scales at most with for , while it becomes constant for . We then analyse the auxiliary term (40)
of which we evaluate the derivative, always with the conditions , with
which scales at most with for large . Moreover, the covariance matrix in (35) asymptotically reads
which is in general constant asymptotically for large . Inserting Equations (A52) and (A54) in the cofactor matrix (39), we obtain
Exploiting the asymptotic behaviours of the derivatives found in Equations (A51) and (A53), we are able to analyse the remaining terms appearing in the FI
It is easy to see that scales at most with , since , while the derivatives of the elements of C scale at most with for , and become constant for . The last step is to evaluate the asymptotic behaviour for the derivative of the average, which can be easily obtained by differentiating Equation (36)
Now we can finally draw the conclusions, and obtain the scaling of the Fisher information shown in Equation (43) by putting together all the asymptotic regimes found. First, we notice that, independently of , the third contribution to the Fisher information in Equation (A45) always scales with , hence only reaching shot-noise precision. Thus, it is always dominated by the other two terms in the Heisenberg scaling regime. The only terms which allow the Heisenberg scaling are then the first two: in fact, the term scales with for , and with for , thus reaching the Heisenberg scaling for , while the second term reaches sub-shot noise scaling for , achieving the Heisenberg scaling for . It is interesting to notice the major difference between the two contributions to the Heisenberg scaling: the term reaches the Heisenberg scaling for every value of , which means that the choice —corresponding to —still yields the Heisenberg scaling. On the other hand, must be equal to 1 in order for the first term of to achieve the Heisenberg scaling. The consequences and the physical implications of this difference are discussed in depth in Section 3.2.
The conditions to reach the Heisenberg scaling can thus be condensed into the request on the complex phases
for , as shown in Equation (41), or equivalently that asymptotically, with independent of . Moreover, we have seen that the only relevant terms under this condition are the first two in Equation (A45), and that only for the first term of is non-vanishing.
Finally, we can now prove (43). Substituting condition (A60) into Equations (A51) and (A53), we obtain from Equation (A56)
where we exploited the passivity of the interferometer which assures . We also obtain from (A46)
which coincides with the expression in Equation (42). Lastly, from Equation (A59) when conditions (A60) hold, we obtain the asymptotic behaviour
Appendix D. Maximum-Likelihood Estimators for Gaussian Distributions
In this Appendix, we will derive the Likelihood equation associated with the estimation schemes presented in Section 2 and Section 3, i.e., the equation whose solution maximises the Likelihood functions in Equations (18) and (46). In doing so, we will demonstrate the expressions in Equations (19) and (47).
We first suppose that the M-variate Gaussian probability distribution
governing the outcomes depends on a single parameter through its average and its covariance matrix . If we perform the observation times , the Likelihood function becomes
The maximum-likelihood estimator is obtained maximising the Likelihood function in Equation (A65)—or its logarithm. Applying the maximisation to the likelihood function in Equation (A65), we obtain the equation
where we exploited Jacobi’s formula for the derivative of the determinant of a matrix shown in Equation (A26), the identity in Equation (A28), and the symmetry of . Equation (A66) is the expression of the Likelihood Equation (47). This can be easily simplified to the case of univariate Gaussian distribution for and absence of average , for which and , obtaining
as shown in Equation (19).
Appendix E. Formulas for the Determinant of a Sum of Two Matrices
Let us consider an matrix Z of the form , where is a diagonal matrix, and . In this appendix, we will show a convenient method to express the determinant in terms of the elements of W, for the evaluation of the determinant in Equation (37) and of the cofactor matrix in Equations (39).
We exploit the identity [47]
where is the sum of the determinants of the matrices obtained by replacing any set of columns (rows) of X with the columns (rows) of Y at the same position. For example, for two matrices
the quantity is given, for , by
while for
Since , the determinant of any sub-matrix of W is zero if , so we can write
Let us now make explicit the first, easier terms of the summation. In particular, with the goal of applying this expression to the covariance matrix in Equation (35) and to the cofactor matrix C in Equation (39), which can both be expressed as the sum of a diagonal matrix and a rank-two and rank-three matrix (see Equations (A19), (A20) and (A32)), we will explicitly report the first three terms , and .
For , no columns are replaced from D, hence . We also notice that, if at least one of the eigenvalues of D is zero, this term vanishes.
For , is the sum of determinants of matrices of the form
with . The determinant of a matrix of the type in Equation (A73) is straightforward, and reduces to . Thus, in general, . A key observation to make, in view of the application to the covariance matrix and cofactor matrix C, is that if two or more eigenvalues of D are zero, all these determinants are zero, and thus ; if, instead, a single eigenvalue is zero, say , only one of these determinants of is non-vanishing, namely the determinant of the matrix obtained by replacing exactly the j-th column of D. In this case, for , then we have .
Let us now consider lastly the case . The matrices contributing to are of the form
with . Once again, the determinants of the matrices of the type in Equation (A74) are easy to be evaluated, and read , where
We notice that . Thus, we can rewrite the terms as the sum . Once again, key observations can be made: if the diagonal matrix D has at least three null eigenvalues, then is vanishing. If only two eigenvalues are zero, e.g., , with , then there is only one non-vanishing contribution to , given by the matrix obtained by substituting the j-th and -th columns of D, and in this case we have . If only one eigenvalue is zero, namely , then is given by the sum of all the determinants of the matrices where the j-th column of D has been replaced, namely .
It is then possible to extend with similar considerations the same method for every value of , eventually obtaining the compact expression for the determinant in Equation (A68)
where is the set of all the combinations without repetitions of columns out of L, denotes the sub-matrix of W obtained by selecting the rows and columns with indices .
References
- Caves, C.M. Quantum-mechanical noise in an interferometer. Phys. Rev. D 1981, 23, 1693–1708. [Google Scholar] [CrossRef]
- Bondurant, R.S.; Shapiro, J.H. Squeezed states in phase-sensing interferometers. Phys. Rev. D 1984, 30, 2548–2556. [Google Scholar] [CrossRef]
- Wineland, D.J.; Bollinger, J.J.; Itano, W.M.; Moore, F.L.; Heinzen, D.J. Spin squeezing and reduced quantum noise in spectroscopy. Phys. Rev. A 1992, 46, R6797–R6800. [Google Scholar] [CrossRef]
- Giovannetti, V.; Lloyd, S.; Maccone, L. Quantum-Enhanced Measurements: Beating the Standard Quantum Limit. Science 2004, 306, 1330–1336. Available online: https://www.science.org/doi/pdf/10.1126/science.1104149 (accessed on 28 January 2022). [CrossRef] [PubMed] [Green Version]
- Giovannetti, V.; Lloyd, S.; Maccone, L. Quantum Metrology. Phys. Rev. Lett. 2006, 96, 010401. [Google Scholar] [CrossRef] [Green Version]
- Dowling, J.P. Quantum optical metrology—The lowdown on high-N00N states. Contemp. Phys. 2008, 49, 125–143. [Google Scholar] [CrossRef]
- Paris, M.G. Quantum estimation for quantum technology. Int. J. Quantum Inf. 2009, 7, 125–137. [Google Scholar] [CrossRef]
- Giovannetti, V.; Lloyd, S.; Maccone, L. Advances in quantum metrology. Nat. Photonics 2011, 5, 222–229. [Google Scholar] [CrossRef]
- Lang, M.D.; Caves, C.M. Optimal Quantum-Enhanced Interferometry Using a Laser Power Source. Phys. Rev. Lett. 2013, 111, 173601. [Google Scholar] [CrossRef] [Green Version]
- Tóth, G.; Apellaniz, I. Quantum metrology from a quantum information science perspective. J. Phys. A Math. Theor. 2014, 47, 424006. [Google Scholar] [CrossRef] [Green Version]
- Dowling, J.P.; Seshadreesan, K.P. Quantum Optical Technologies for Metrology, Sensing, and Imaging. J. Light. Technol. 2015, 33, 2359–2370. [Google Scholar] [CrossRef] [Green Version]
- Szczykulska, M.; Baumgratz, T.; Datta, A. Multi-parameter quantum metrology. Adv. Phys. X 2016, 1, 621–639. [Google Scholar] [CrossRef]
- Schnabel, R. Squeezed states of light and their applications in laser interferometers. Phys. Rep. 2017, 684, 1–51. [Google Scholar] [CrossRef] [Green Version]
- Braun, D.; Adesso, G.; Benatti, F.; Floreanini, R.; Marzolino, U.; Mitchell, M.W.; Pirandola, S. Quantum-enhanced measurements without entanglement. Rev. Mod. Phys. 2018, 90, 035006. [Google Scholar] [CrossRef] [Green Version]
- Pirandola, S.; Bardhan, B.R.; Gehring, T.; Weedbrook, C.; Lloyd, S. Advances in photonic quantum sensing. Nat. Photonics 2018, 12, 724–733. [Google Scholar] [CrossRef]
- Polino, E.; Valeri, M.; Spagnolo, N.; Sciarrino, F. Photonic quantum metrology. AVS Quantum Sci. 2020, 2, 024703. [Google Scholar] [CrossRef]
- Maccone, L.; Riccardi, A. Squeezing metrology: A unified framework. Quantum 2020, 4, 292. [Google Scholar] [CrossRef]
- Schleich, W. Quantum Optics in Phase Space; Wiley: Hoboken, NJ, USA, 2011. [Google Scholar] [CrossRef]
- Weedbrook, C.; Pirandola, S.; García-Patrón, R.; Cerf, N.J.; Ralph, T.C.; Shapiro, J.H.; Lloyd, S. Gaussian quantum information. Rev. Mod. Phys. 2012, 84, 621–669. [Google Scholar] [CrossRef]
- Adesso, G.; Ragy, S.; Lee, A.R. Continuous Variable Quantum Information: Gaussian States and Beyond. Open Syst. Inf. Dyn. 2014, 21, 1440001. [Google Scholar] [CrossRef] [Green Version]
- Lvovsky, A.I. Squeezed Light. In Photonics; John Wiley and Sons, Ltd.: Hoboken, NJ, USA, 2015; Chapter 5; pp. 121–163. Available online: https://onlinelibrary.wiley.com/doi/pdf/10.1002/9781119009719.ch5 (accessed on 28 January 2022). [CrossRef]
- Cerf, N.J.; Leuchs, G.; Polzik, E.S. Quantum Information with Continuous Variables of Atoms and Light; Imperial College Press: London, UK, 2007; Available online: https://www.worldscientific.com/doi/pdf/10.1142/p489 (accessed on 28 January 2022). [CrossRef]
- Grangier, P.; Slusher, R.E.; Yurke, B.; LaPorta, A. Squeezed-light—Enhanced polarization interferometer. Phys. Rev. Lett. 1987, 59, 2153–2156. [Google Scholar] [CrossRef]
- Monras, A. Optimal phase measurements with pure Gaussian states. Phys. Rev. A 2006, 73, 033821. [Google Scholar] [CrossRef] [Green Version]
- Yonezawa, H.; Nakane, D.; Wheatley, T.A.; Iwasawa, K.; Takeda, S.; Arao, H.; Ohki, K.; Tsumura, K.; Berry, D.W.; Ralph, T.C.; et al. Quantum-Enhanced Optical-Phase Tracking. Science 2012, 337, 1514–1517. Available online: https://www.science.org/doi/pdf/10.1126/science.1225258 (accessed on 28 January 2022). [CrossRef] [Green Version]
- Oh, C.; Lee, C.; Rockstuhl, C.; Jeong, H.; Kim, J.; Nha, H.; Lee, S.Y. Optimal Gaussian measurements for phase estimation in single-mode Gaussian metrology. npj Quantum Inf. 2019, 5, 10. [Google Scholar] [CrossRef]
- Matsubara, T.; Facchi, P.; Giovannetti, V.; Yuasa, K. Optimal Gaussian metrology for generic multimode interferometric circuit. New J. Phys. 2019, 21, 033014. [Google Scholar] [CrossRef] [Green Version]
- Yurke, B.; McCall, S.L.; Klauder, J.R. SU(2) and SU(1,1) interferometers. Phys. Rev. A 1986, 33, 4033–4054. [Google Scholar] [CrossRef]
- Gatto, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Distributed quantum metrology with a single squeezed-vacuum source. Phys. Rev. Res. 2019, 1, 032024. [Google Scholar] [CrossRef] [Green Version]
- Gatto, D.; Facchi, P.; Tamma, V. Phase space Heisenberg-limited estimation of the average phase shift in a Mach–Zehnder interferometer. Int. J. Quantum Inf. 2020, 18, 1941019. [Google Scholar] [CrossRef]
- Ou, Z.Y.; Li, X. Quantum SU(1,1) interferometers: Basic principles and applications. APL Photonics 2020, 5, 080902. [Google Scholar] [CrossRef]
- Gatto, D.; Facchi, P.; Tamma, V. Heisenberg-limited estimation robust to photon losses in a Mach-Zehnder network with squeezed light. Phys. Rev. A 2022, 105, 012607. [Google Scholar] [CrossRef]
- Farace, A.; Pasquale, A.D.; Adesso, G.; Giovannetti, V. Building versatile bipartite probes for quantum metrology. New J. Phys. 2016, 18, 013049. [Google Scholar] [CrossRef]
- Gramegna, G.; Triggiani, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Typicality of Heisenberg scaling precision in multimode quantum metrology. Phys. Rev. Res. 2021, 3, 013152. [Google Scholar] [CrossRef]
- Armen, M.A.; Au, J.K.; Stockton, J.K.; Doherty, A.C.; Mabuchi, H. Adaptive Homodyne Measurement of Optical Phase. Phys. Rev. Lett. 2002, 89, 133602. [Google Scholar] [CrossRef] [Green Version]
- Aspachs, M.; Calsamiglia, J.; Muñoz Tapia, R.; Bagan, E. Phase estimation for thermal Gaussian states. Phys. Rev. A 2009, 79, 033834. [Google Scholar] [CrossRef] [Green Version]
- Berni, A.A.; Gehring, T.; Nielsen, B.M.; Händchen, V.; Paris, M.G.A.; Andersen, U.L. Ab initio quantum-enhanced optical phase estimation using real-time feedback control. Nat. Photonics 2015, 9, 577–581. [Google Scholar] [CrossRef]
- Zhuang, Q.; Zhang, Z.; Shapiro, J.H. Distributed quantum sensing using continuous-variable multipartite entanglement. Phys. Rev. A 2018, 97, 032329. [Google Scholar] [CrossRef] [Green Version]
- Guo, X.; Breum, C.R.; Borregaard, J.; Izumi, S.; Larsen, M.V.; Gehring, T.; Christandl, M.; Neergaard-Nielsen, J.S.; Andersen, U.L. Distributed quantum sensing in a continuous-variable entangled network. Nat. Phys. 2020, 16, 281–284. [Google Scholar] [CrossRef] [Green Version]
- Grace, M.R.; Gagatsos, C.N.; Zhuang, Q.; Guha, S. Quantum-Enhanced Fiber-Optic Gyroscopes Using Quadrature Squeezing and Continuous-Variable Entanglement. Phys. Rev. Appl. 2020, 14, 034065. [Google Scholar] [CrossRef]
- Grace, M.R.; Gagatsos, C.N.; Guha, S. Entanglement-enhanced estimation of a parameter embedded in multiple phases. Phys. Rev. Res. 2021, 3, 033114. [Google Scholar] [CrossRef]
- Gramegna, G.; Triggiani, D.; Facchi, P.; Narducci, F.A.; Tamma, V. Heisenberg scaling precision in multi-mode distributed quantum metrology. New J. Phys. 2021, 23, 053002. [Google Scholar] [CrossRef]
- Triggiani, D.; Facchi, P.; Tamma, V. Non-adaptive Heisenberg-limited metrology with multi-channel homodyne measurements. Eur. Phys. J. Plus 2022, 137, 125. [Google Scholar] [CrossRef]
- Cramér, H. Mathematical Methods of Statistics (PMS-9); Princeton University Press: Princeton, NJ, USA, 1946. [Google Scholar] [CrossRef]
- Rohatgi, V.K.; Saleh, A.M.E. An Introduction to Probability and Statistics; John Wiley and Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
- Triggiani, D.; Facchi, P.; Tamma, V. Heisenberg scaling precision in the estimation of functions of parameters in linear optical networks. Phys. Rev. A 2021, 104, 062603. [Google Scholar] [CrossRef]
- Xu, S.; Darouach, M.; Schaefers, J. Expansion of det(A+B) and robustness analysis of uncertain state space systems. IEEE Trans. Autom. Control 1993, 38, 1671–1675. [Google Scholar] [CrossRef]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).