A Note on Shift Retrieval Problems

Rusu, Cristian

doi:10.3390/math14030532

Open AccessFeature PaperArticle

A Note on Shift Retrieval Problems

by

Cristian Rusu

Faculty of Mathematics and Computer Science, University of Bucharest, 030018 Bucharest, Romania

Mathematics 2026, 14(3), 532; https://doi.org/10.3390/math14030532

Submission received: 27 December 2025 / Revised: 22 January 2026 / Accepted: 27 January 2026 / Published: 2 February 2026

(This article belongs to the Section E1: Mathematics and Computer Science)

Download

Browse Figures

Versions Notes

Abstract

In this paper, we discuss several shift retrieval problems, both classical and compressed, and provide connections between them using the general framework of circulant matrices. We review the properties of circulant matrices necessary for our calculations and then show how circular shifts can be recovered from as few measurements as possible in different scenarios. We treat several cases: circular shifts between two signals and between multiple pairs of signals, and linear combinations of circular shifts. In all these cases, we provide conditions under which the shift recovery is successful, we give explicit formulas, and we state convex optimization problems for the practical recovery of the shifts for both the noiseless and noisy measurement scenarios. Our goal is to accurately and robustly recover shift information from as few linear measurements as possible. Experimental results then validate the findings through simulation, where we compare the classic cross-correlation result with the proposed approaches.

Keywords:

shift retrieval; compressed sensing; cross-correlation; phase correlation; Fourier transform; sparsity

MSC:

62M10; 94A12; 62J07; 15A23; 15A24

1. Introduction

The shift retrieval problem is fundamental to many areas of signal and image processing. Shift retrieval refers to the problem of estimating an unknown translation between two observations or signals. This problem appears wherever alignment or relative displacement must be identified, usually as a preprocessing step before further processing takes place.

The shift retrieval solutions often reveal patterns, delays, or matching features, which are of fundamental importance in pattern recognition, radar processing, or time series analysis.

There are two domains where shift retrieval is essential: time and space. When dealing with signals in time, the problem bears the name Time Delay Estimation (TDE) [1,2], while for space we have image registration [3] and alignment problems [4]. Several applications include synchronization for Global Positioning Systems (GPSs) and wireless communications [5], radar ranging [6] and sonar direction estimation [7], alignment and calibration for speaker localization [8], and augmented reality [9]. Very recent machine learning and multimedia cybersecurity applications include complex alignment and matching tasks in the context of robust feature representations for accurate human parsing in complex scenes [10] and robust watermarking techniques for light-field imaging [11]. Furthermore, modern image retrieval techniques such as the one proposed in [12], which is histogram-based on local neighborhood difference patterns (ELNDP), could benefit from preprocessing with alignment invariance techniques to add robustness to the retrieval process.

Previous work [13,14] has dealt with the problem of recovering the shift between signals given as few measurements as possible. They were the first to show that the shift can be recovered from Fourier measurements using a few samples and less computation compared to the classical cross-correlation setup. They showed that only one Fourier coefficient may suffice to recover the true shift. Subsequently, the work in [15] established a robust shift recovery method based on Bezout’s identity and then also analyzed the recovery of weighted sums of two shifts.

Contribution. The shift retrieval problem is typically solved by maximizing the multiplicative cross-correlation between the two signals. Cross-correlation is a powerful tool for finding similarities between signals by systematically testing their alignment. In this paper, we consider different optimization problems that involve the minimization of quantities and whose solutions reduce to entry-wise division operations. In this setting, several known results can be contextualized together, and new results are naturally obtained. The contributions are summarized as follows:

We give a unified, noiseless/noisy classic, and compressed view of previously known results on the shift retrieval problems (Results 1, 2 and 3; Remark 3);
We make explicit the connections between the proposed approach and the classical circular convolution and phase cross-correlation theorems (Remark 2);
We provide the exact necessary conditions under which scaling and shift information is retrieved from noiseless and noisy measurements (Remarks 1 and 4);
We provide new explicit results that show the performance of the shift recovery accuracy depending on the noise level and the lengths of the signals (Remark 6);
We provide new results regarding the generalization of shift retrieval problems to pairs of multiple signals (Results 4 and 5);
We provide a new result related to the recovery of multiple shifts from measurements that are circular linear combinations (Result 6).

This paper is organized as follows: In Section 2, we provide the basic properties of circulant matrices and provide the classic cross-correlation shift retrieval method. Then, in Section 3, in different sub-sections, we analyze several shift retrieval problems and provide solutions based on optimization problems involving circulant matrices. In Section 4, we provide numerical simulation results for shift retrieval accuracy, and then we conclude the paper in Section 5.

Notation.

Normal text presents scalars (real or complex valued), lowercase bold letters denote vectors, uppercase bold letters denote matrices,

{(\cdot)}^{T}

is the transpose of a vector/matrix,

{(\cdot)}^{H}

is the Hermitian transpose,

{(\cdot)}^{*}

is the complex conjugate of a scalar/vector/matrix, we denote

j = \sqrt{- 1}

, ⊙ and ⊘ denote element-wise multiplication and division, respectively, between matrices of the same size, given a scalar x, then

| x |

is the absolute value, given a vector

x

then

| x |

is the entry-wise absolute value. Given a vector

x

, we denote its Fourier transform as

\tilde{x}

or explicitly

FFT (x)

. For a vector

x

of size n, the

ℓ_{p}

norms are defined as

{∥ x ∥}_{p} = (\sum_{i = 0}^{n - 1} | x_{i} {|^{p})}^{\frac{1}{p}}

, and for a matrix

X

of size

n \times N

, the squared Frobenius norm is defined as

{∥ X ∥}_{F}^{2} = \sum_{i, k} {| X_{i k} |}^{2}

. The matrix

I_{n}

is the identity of size

n \times n

, and

0_{n \times N}

and

1_{n \times N}

are the zero and one matrices of size

n \times N

, respectively. Symbols

N, Z, R

and

C

denote the natural, integer, real and complex numbers, respectively, and

ℜ (\cdot)

denotes the real part of a complex number. Then

E [w]

denotes the expected value of a random variable w. For a vector

x

,

diag (x)

is the diagonal matrix with

x

on the diagonal, and for a square matrix

X

,

diag (X)

returns the diagonal vector of

X

, and

tr (X) = \sum_{i} X_{i i}

is the trace. Finally,

gcd (\cdot, \cdot)

denotes the greatest common divisor of two natural numbers, and

P (\cdot)

is the probability that an event will occur. Further, specific notation is explained in the place where it is used.

2. Classic Shift Retrieval Problems

In this section, we introduce several fundamental concepts to our analysis and provide an overview of the classic shift retrieval problem and solution.

2.1. Circulant Matrices Primer

Consider the square circulant matrices defined as

\begin{matrix} C = circ (c) \overset{def}{=} & [\begin{matrix} c_{0} & c_{n - 1} & \dots & c_{2} & c_{1} \\ c_{1} & c_{0} & \dots & c_{3} & c_{2} \\ ⋮ & ⋱ & ⋱ & ⋱ & ⋮ \\ c_{n - 2} & c_{n - 3} & \dots & c_{0} & c_{n - 1} \\ c_{n - 1} & c_{n - 2} & \dots & c_{1} & c_{0} \end{matrix}] \\ = & [\begin{matrix} c & P c & P^{2} c & \dots & P^{n - 1} c \end{matrix}] \in R^{n \times n} . \end{matrix}

(1)

The matrix

P \in R^{n \times n}

denotes the orthonormal circulant matrix that circularly down-shifts a target vector

c

by one position, i.e.,

P = circ (e_{2})

where

e_{2}

is the second vector of the standard basis of

R^{n}

. Notice that

P^{q - 1} = circ (e_{q})

is also orthonormal circulant and denotes a circular down-shift by

q - 1

. Positive powers of

P

perform the circular down-shift in a vector, while the negative powers perform the circular up-shift in a vector. Observe that

{(P^{q - 1})}^{- 1} = {(P^{q - 1})}^{T} = P^{1 - q}

.

The eigenvalue factorization of circulant matrices reads

C = F^{H} Σ F, Σ = diag (σ) \in C^{n \times n},

(2)

where

F \in C^{n \times n}

is the unitary Fourier matrix (

F^{H} F = F F^{H} = I_{n}

) and the diagonal

σ = \sqrt{n} Fc, σ \in C^{n}

. In some situations the factorization is presented as

C = F^{- 1} diag (Fc) F

. The

\sqrt{n}

factor is missing explicitly, as it is absorbed in the inverse. In this paper, we will omit the

\sqrt{n}

term as it does not qualitatively change any of the results. It all depends on whether the Fourier matrix has its elements normalized by some factor

\frac{1}{\sqrt{n}}

,

\frac{1}{n}

, or none at all. Therefore, the quantities we compute hold up to scaling, in general. Different sources in the literature and software implementations use different conventions.

The multiplication with

F

is equivalent to the application of the Fast Fourier Transform, i.e.,

\tilde{c} = Fc = FFT (c)

, while the multiplication with

F^{H}

is equivalent to the inverse Fourier transform, i.e.,

F^{H} c = F^{- 1} c = IFFT (c)

. Vectors that are Fourier transforms have the same name as their time-domain counterparts, but with an additional tilde to differentiate them. Of course,

F^{H} Fc = IFFT (FFT (c)) = c

and

F F^{H} c = FFT (IFFT (c)) = c

. Both fast transforms are applied in

O (n log n)

time and memory.

Given two real-valued matrices

X

and

Y

, both

n \times N

, an immediate application [16,17] of the eigenvalue factorization with the Fourier matrix is to the solution of the problem

\underset{σ}{minimize} {∥ Y - CX ∥}_{F}^{2},

(3)

whose solution is given by

σ_{0} = \frac{{\tilde{x}}_{0}^{T} {\tilde{y}}_{0}}{∥ {\tilde{x}}_{0} ∥_{2}^{2}}, σ_{i} = \frac{{\tilde{x}}_{i}^{H} {\tilde{y}}_{i}}{∥ {\tilde{x}}_{i} ∥_{2}^{2}}, σ_{n - i} = σ_{i}^{*}, i = 1, \dots, n - 1,

(4)

where

{\tilde{y}}_{i}^{T}

and

{\tilde{x}}_{i}^{T}

are the rows of

\tilde{Y} = FY

and

\tilde{X} = FX

. In the special case when

N = 1

, we have the vectors

x

and

y

and their Fourier transforms

\tilde{x}

and

\tilde{y}

, respectively. In this paper, we assume the working signals are real-valued.

2.2. Classic Shift Retrieval

Given two signals

x, y \in R^{n}

, assuming that

y

is a circular shift in

x

and that

x

is not periodic, in order to find the unique shift quantity, we maximize the inner product

\underset{q}{arg max} | y^{T} P^{q - 1} x |,

(5)

where

P^{q - 1} \in R^{n \times n}

denotes a circular shift. The calculations above explicitly find all inner products between

x

and all possible circular shifts in

y

, or vice versa. Practically, to recover the shift, we use the circular cross-correlation theorem,

arg max | IFFT (FFT {(x)}^{*} ⊙ FFT (y)) |,

(6)

i.e., we take the index of the maximum-magnitude entry of the correlation vector.

The result follows directly from the factorization of Equation (2) by computing correlations between all circular shifts in

x

and the vector

y

as

\begin{matrix} C^{T} y & = circ {(x)}^{T} y \\ = circ {(x)}^{H} y \\ = F^{H} diag {(Fx)}^{H} F y \\ = IFFT (FFT {(x)}^{*} ⊙ FFT (y)) . \end{matrix}

(7)

Therefore, the problem in Equation (6) becomes that of maximizing

∥ C^{T} {y ∥}_{\infty}

. The absolute value removes the distinction between positive and negative correlation of real-valued signals. Our proposed approaches start from a different point of view where the shift quantity q and the circular shift matrix

P

are explicitly used. These methods are detailed next.

3. A Circulant Matrix Perspective on the Shift Retrieval Problems

For the sake of clarity, we express and discuss each shift retrieval problem in separate subsections.

3.1. Noiseless Shift Retrieval

In this section, we provide a different view on the classic shift retrieval problem and give the following main result:

Result 1 (Noiseless shift retrieval).

We are given two signals

x

and

y

, and we assume that there is a unique circular shift q between them like

y = P^{q - 1} x,

(8)

then, assuming that

{\tilde{x}}_{i} \neq 0

, we have that

IFFT (FFT (y) ⊘ FFT (x)) = e_{q} .

(9)

Proof.

Assuming that

y = P^{q - 1} x

, with

P = circ (e_{2})

, we consider the problem

\underset{q}{minimize} {∥ y - P^{q - 1} x ∥}_{2}^{2} .

(10)

Use

P^{q - 1} = F^{H} Σ F

and to develop

∥ y - P^{q - 1} {x ∥}_{2}^{2} = ∥ y - F^{H} {Σ Fx ∥}_{2}^{2} = {∥ Fy - Σ Fx ∥}_{2}^{2} = {∥ \tilde{y} - Σ \tilde{x} ∥}_{2}^{2}

, where

Σ = diag (σ), σ = {Fe}_{q} = f_{q} \in C^{n}

(the

q^{th}

column of the Fourier matrix). If we relax the constraint and allow

P^{q - 1}

to be any circulant matrix, to minimize the Frobenius norm, as the special case of Equation (3) for

N = 1

, we have

σ_{i} = {\tilde{y}}_{i} / {\tilde{x}}_{i}, {\tilde{x}}_{i} \neq 0,

and therefore

{Fe}_{q} = \tilde{y} ⊘ \tilde{x}

. □

The assumption that

{\tilde{x}}_{i} \neq 0

seems restrictive (and is missing in Equation (6)). We do not need to apply the inverse Fourier transform, but instead compute only

σ_{i}

where

{\tilde{x}}_{i} \neq 0

, and by inspection of all columns of

F

on the rows where this quantity was computed, we find the shift q.

If we want to recover the true shift between signals

x

and

y

, then the following conditions need to hold:

Remark 1 (Necessary conditions for the recovery of the true shift).

In order to uniquely recover the true shift q from a single measurement i, the following properties need to hold:

1.: ${\tilde{x}}_{i} \neq 0$ , so that the ratio $σ_{i}$ is well defined;
2.: $i \neq 0$ , which is the DC component;
3.: $gcd (i, n) = 1$ , which excludes the special case $i = \frac{n}{2}$ for n even.

Proof.

Notice that

σ_{0}

and

σ_{\frac{n}{2}}

when n is even are

{\pm 1}

for all columns of the Fourier matrix, and thus they cannot provide an unambiguous answer. In fact,

σ_{0}

provides no information about the shift, while

σ_{\frac{n}{2}}

only establishes the parity of the shift. On the other hand, in the best-case scenario, we need to compute a single

σ_{i}

to recover the true shift q. Given the ratio

σ_{i} = \frac{{\tilde{y}}_{i}}{{\tilde{x}}_{i}} = exp (- j \frac{2 π i}{n} q)

, two different shifts q and

q^{'}

lead to the same quantity

σ_{i}

iff

exp (- j \frac{2 π i}{n} (q - q^{'})) = 1

and therefore

\frac{i}{n} (q - q^{'}) \in Z

. To avoid having the exponential equal one for

q \neq q^{'}

, we require

gcd (i, n) = 1

. □

By this remark, the complexity of the shift retrieval problem is

O (n)

, as also observed for the compressive shift retrieval result [14], which is discussed in the next subsection.

Remark 2 (Connection to the classical circular and phase cross-correlation theorems).

We can rewrite Equation (9) as

IFFT (FFT (y) ⊘ FFT (x)) = IFFT (FFT {(x)}^{*} ⊙ FFT (y) ⊘ | FFT (x) |^{2}),

(11)

where

{| FFT (x) |}^{2}

computes the square absolute values of each element of the Fourier transform of

x

which we assume are all non-zero. Notice that Equation (9) represents a weighted variant of Equation (6), a type of “whitened” cross-correlation quantity.

The expression in Equation (11) naturally relates to the well-known phase correlation formula

{IFFT (FFT (x)}^{*} ⊙ FFT (y) ⊘ {(| FFT (x)}^{*} ⊙ FFT (y) |)),

(12)

The normalization performed in the formula above removes the amplitude information, i.e., this is essentially a phase-only calculation because magnitude is removed by the division operation. In our scenario, we are interested in preserving and recovering the amplitude information as well.

If the two signals

x

and

y

are shifted versions of each other, then Equations (6) and (9) provide the same answer. If this is not the case, or the signals are noisy, then Equation (9) seems a weaker result in general since the minimizer

P^{q - 1}

in Equation (10) might no longer be

P = circ (e_{2})

, but some other circulant matrix that minimizes Equation (10). In this high-noise case, we might not be able to interpret that the signals are shifted versions of each other. The circulant cross-correlation theorem does not have this feature, as it will always provide the maximum correlation between the signals.

We note that the approach to maximize the quadratic form Equation (5) and that of norm minimization Equation (10) are equivalent since

∥ y - P^{q - 1} {x ∥}_{2}^{2} = {∥ y ∥}_{2}^{2} + {∥ x ∥}_{2}^{2} - 2 y^{T} P^{q - 1} x,

(13)

and then also

\begin{matrix} ∥ y - P^{q - 1} {x ∥}_{2}^{2} & = ∥ Fy - {(diag (F e_{2}))}^{q - 1} {Fx ∥}_{2}^{2} \\ = ∥ \tilde{y} - diag (F e_{q}) \tilde{x} ∥_{2}^{2} \\ = ∥ \tilde{y} - diag (f_{q}) \tilde{x} ∥_{2}^{2} \\ = ∥ \tilde{y} ∥_{2}^{2} + {∥ \tilde{x} ∥}_{2}^{2} - 2 ℜ ({\tilde{y}}^{H} diag (f_{q}) \tilde{x}) \\ = ∥ \tilde{y} ∥_{2}^{2} + {∥ \tilde{x} ∥}_{2}^{2} - 2 \sum_{i = 0}^{n - 1} {\tilde{y}}_{i}^{*} {\tilde{x}}_{i} F_{i q}, \end{matrix}

(14)

where

F_{i q}

is an element from the Fourier matrix and the last quantity is real-valued due to the conjugate valued symmetries of the vectors

\tilde{x}

and

\tilde{y}

, and of the columns

f_{q}

. The last summation quantity is equivalent to Equation (6) for a fixed q. The result of Equation (9) is obtained by allowing the unknown to be the overall general circulant matrix denoted

P^{q - 1}

, not just the power q.

Finally, note that for real-valued

x

and

y

we have that

IFFT (FFT {(x)}^{*} ⊙ FFT (y))

is equivalent to

FFT (FFT (x) ⊙ FFT {(y)}^{*})

, up to a normalization factor depending on n.

Remark 3 (Calculation of the circular shift from one measurement).

Notice that Equation (9) is equivalent to

FFT (y) ⊘ FFT (x) = f_{q},

(15)

where

f_{q}

is the

q^{th}

column of the Fourier matrix

F

. We can find the shift by computing a single entry

σ_{i} = {\tilde{y}}_{i} / {\tilde{x}}_{i}

and then inspecting the entries of only the

i^{th}

row of the Fourier matrix. In this case, the recovery of the shift is performed via

q^{★} = - i^{- 1} arg (σ_{i}) \frac{n}{2 π} mod n,

(16)

where

i^{- 1}

is the modular inverse of i modulo n.

This is the result previously developed by the work in [13,14]. Essentially, these results are a consequence of the well-known shift theorem for the Discrete Fourier Transform (DFT), which makes the connection between multiplication by pure phase factors and circular shifts in vectors, i.e., circular shift in a vector corresponds to multiplying its DFT by a linear phase. Treatments of this classic result can be found in fundamental signal processing sources such as ([18], Section 8.6.2).

More generally, when the measurement

y

has a different scale and mean than the signal

x

, we have the following remark:

Remark 4 (Necessary conditions for the recovery of the true shift, scale, and mean).

When the measurements are given by

y = α P^{q - 1} x + β 1

, where

α, β \in R

are scalars such that

α \neq 0

and

\sum_{i = 0}^{n - 1} x_{i} \neq 0

, then

IFFT (\tilde{y} ⊘ \tilde{x}) = α e_{q} + (β / \sum_{i = 0}^{n - 1} x_{i}) 1,

(17)

where

1 \in R^{n}

is the ones vector. In general, we need three measurements to recover all parameters

(q, α, β)

.

Proof.

The mean component is easy to identify by using the previously uninformative DC component. Notice that for a single non-DC component such that

{\tilde{x}}_{i} \neq 0

we have that

σ_{i} = \frac{{\tilde{y}}_{i}}{{\tilde{x}}_{i}} = α exp (- j \frac{2 π i}{n} q) .

(18)

When

α > 0

, there is no change in the phase, and therefore the scale and shift can be recovered simultaneously. For any integer

α \neq 0

, then we have that the phase of

σ_{i}

is not modified (if

α

is positive) or is modified by

π

(if

α

is negative). As a consequence of this phase change we cannot distinguish between the shift-scale pair

(q, α)

and

(q + \frac{n}{2 i}, - α)

when

\frac{n}{2 i}

is a valid shift integer in

{0, 1, \dots, n - 1}

. In general, this ambiguity cannot be resolved from a single non-DC measurement. To uniquely recover all three parameters, we will require two non-DC components and the DC component (for the calculation of the mean

β

). Given two distinct non-DC measurements

k_{1} \neq k_{2}

,

σ_{k_{1}} = \frac{{\tilde{y}}_{k_{1}}}{{\tilde{x}}_{k_{1}}} = α exp (- j \frac{2 π k_{1}}{n} q),

(19)

σ_{k_{2}} = \frac{{\tilde{y}}_{k_{2}}}{{\tilde{x}}_{k_{2}}} = α exp (- j \frac{2 π k_{2}}{n} q) .

(20)

Now take the ratio of the two quantities to eliminate

α

and reach

\frac{σ_{k_{1}}}{σ_{k_{2}}} = exp (- j \frac{2 π (k_{1} - k_{2})}{n} q) .

(21)

Analogously to Remark 1, the necessary condition for the recovery of the true shift is that

gcd (k_{1} - k_{2}, n) = 1

. We call the best estimated shift

q^{★} = - {(k_{1} - k_{2})}^{- 1} round (arg (\frac{σ_{k_{1}}}{σ_{k_{2}}}) \frac{n}{2 π}) mod n .

(22)

Then, we compute the other two parameters,

α^{★} = \frac{σ_{k_{1}}}{exp (- j \frac{2 π k_{1}}{n} q^{★})},

(23)

β^{★} = \frac{{\tilde{y}}_{0} - α^{★} {\tilde{x}}_{0}}{n} .

(24)

□

Just as the classic shift retrieval of Equation (5) is indifferent to the sign of the correlation between signals, Remark 4 provides the result indifferent to the sign of the correlation, allowing for both positive and negative

α

and therefore positive and negative correlations.

The results developed in this section assume no noise is present in the measurements. As already explained in Remark 2, we expect noisy measurements to significantly degrade the performance of the shift retrieval and make impractical the recovery mechanism from a single measurement.

3.2. Noisy Shift Retrieval

We are again given two signals

x

and

y

such that there is a noisy circular shift q between them,

y = P^{q - 1} x + w,

(25)

where

w \sim N (0_{n \times 1}, ζ^{2} I_{n})

is an i.i.d. zero-mean Gaussian noise vector of size n. Unlike the noiseless scenario, here we will assume the number of measurements

m > 1

and try to recover the most likely shift parameter. Then we have the following result.

Result 2 (The noisy recovery of the true shift, scale, and mean).

When the measurements are given by

y = α P^{q - 1} x + β 1 + w,

(26)

where

α, β \in R

are scalars such that

α \neq 0

, then given m measurements with indices in the set

K = {k_{i}}_{i = 1}^{m - 1}

, such that

gcd (k_{1} - k_{2}, n) = 1

for any two distinct

k_{1}, k_{2} \in K

and

m - 1 \geq 2

, plus the DC component, which is treated separately, we have the following estimates:

q^{★} = \underset{q}{arg max} {[ℜ \{\sum_{i = 1}^{m - 1} {| {\tilde{x}}_{k_{i}} |}^{2} σ_{k_{i}} exp (j \frac{2 π k_{i}}{n} q)\}]}^{2},

(27)

α^{★} = \frac{ℜ \{\sum_{i = 1}^{m - 1} {| {\tilde{x}}_{k_{i}} |}^{2} σ_{k_{i}} exp (j \frac{2 π k_{i}}{n} q^{★})\}}{\sum_{i = 1}^{m - 1} {| {\tilde{x}}_{k_{i}} |}^{2}},

(28)

β^{★} = \frac{{\tilde{y}}_{0} - α^{★} {\tilde{x}}_{0}}{n} .

(29)

Proof.

Assuming

{\tilde{x}}_{k_{i}} \neq 0

for

k_{i} \in K

, define

σ_{k_{i}} = {\tilde{y}}_{k_{i}} / {\tilde{x}}_{k_{i}}

and the following measurement vector

{(\tilde{y})}_{K}

, which is of length

m - 1

and contains the Fourier transform of the vector

y

restricted to the indices from the set

K

. Define also the vector

s = {(\tilde{x})}_{K} ⊙ ω

, where

ω

is again a vector of size

m - 1

whose elements are the quantities

ω_{i} = exp (- j \frac{2 π k_{i}}{n} q)

for

k_{i} \in K

. The parameter

β

is estimated again from the DC component by minimizing

{({\tilde{y}}_{0} - α {\tilde{x}}_{0} - β n)}^{2}

for

β

. The scale is estimated by minimizing

∥ {(\tilde{y})}_{K} {- α s ∥}_{2}^{2}

for

α

which leads to

α^{★} = \frac{ℜ {s^{H} {(\tilde{y})}_{K}}}{{∥ s ∥}_{2}^{2}} = \frac{ℜ {\sum_{i = 1}^{m - 1} {\tilde{x}}_{k_{i}}^{*} {\tilde{y}}_{k_{i}} exp (j 2 π k_{i} q / n)}}{∥ {(\tilde{x})}_{K} ∥_{2}^{2}}

, note that the result is a function of q. Then, in terms of the shift, the minimum residual quantity is given by

∥ {(\tilde{y})}_{K} {- α s ∥}_{2}^{2} = {∥ {(\tilde{y})}_{K} ∥}_{2}^{2} - \frac{{[ℜ {s^{H} {(\tilde{y})}_{K}}]}^{2}}{{∥ s ∥}_{2}^{2}}

. Then, the optimal

q^{★}

is given by maximizing the numerator in the residual quantity. □

As we increase the number of measurements m, we expect to improve the accuracy of the recovery. It is of interest to investigate if there are ways to choose the index set

K

to improve accuracy. The following remark speaks to this choice.

Remark 5 (Selection criteria for the Fourier measurements).

Given noisy measurements

y = P^{q - 1} x + w

, in order to recover the shift quantity q while reducing the effects of the noise, one should choose indices

k_{i}

such that

| {\tilde{x}}_{k_{i}} |

have the largest values.

Proof.

The observation stems from the ratio

σ_{k_{i}}

whose variance is proportional to

\frac{1}{| {\tilde{x}}_{k_{i}} |^{2}}

. This follows from

σ_{k_{i}} = \frac{α {\tilde{y}}_{i} + {\tilde{w}}_{k_{i}}}{{\tilde{x}}_{k_{i}}} = α exp (- j \frac{2 π k_{i}}{n} q) + \frac{{\tilde{w}}_{k_{i}}}{{\tilde{x}}_{k_{i}}},

(30)

where

\tilde{w}

is the Fourier transform of the noise vector. Because

F

is unitary and the noise is zero mean and i.i.d., we have that the variance is

E [{|\frac{{\tilde{w}}_{k_{i}}}{{\tilde{x}}_{k_{i}}}|}^{2}] = \frac{E [| {\tilde{w}}_{k_{i}} |^{2}]}{| {\tilde{x}}_{k_{i}} |^{2}} = \frac{ζ^{2}}{| {\tilde{x}}_{k_{i}} |^{2}} .

(31)

To maximally reduce this quantity we take the largest

| {\tilde{x}}_{k_{i}} |

. □

In the simulation results section, we investigate the impact of this index choice on the accuracy performance of recovering the shift from the noisy measurements. While choosing the highest entries

| {\tilde{x}}_{k_{i}} |

is desirable, this might be difficult to achieve in general without computing the whole spectrum. Without any information on the spectrum of the signal, the Goertzel algorithm [19] can be applied to compute a few Fourier coefficients, still obeying

gcd (k_{i}, n) = 1

. Still, the indices of these coefficients have to be given a priori. Keeping

gcd (k_{i}, n) = 1

means that the set of eligible indices has size

φ (n)

, the Euler totient function. If no information is available about the spectrum then selecting the coefficients randomly is a last method of choice. Approaches such as the Fastest Fourier Transform in the West (FFTW) [20] allow for the calculation of pruned FFT, which compute only a subset of outputs of the FFT at the cost of

O (n log s)

for s Fourier coefficients.

Furthermore, in some applications where we expect the spectrum to be very sparse (many Fourier coefficients will be zero, or close to zero), the Sparse Fourier Transform (SFT) [21,22] can be applied to compute the s largest magnitude Fourier coefficients with complexity

O (s log n)

if the signal is exactly s-sparse or

O (s log n log (n / s))

if the signal is s-sparse plus noise. The SFT is probabilistic, so usually, we do not compute just one coefficient, but a few

s > 1

and select the largest in magnitude. An interesting point here is to note that these SFTs work on the principle of intentionally allowing aliasing to occur in order to group Fourier coefficients in the same bins, allowing conflicts that are subsequently resolved. This is relevant in our case because, by carefully choosing the aliasing, we could group the calculations in bins such that Fourier coefficients that are not of interest are grouped in the same bins, and then in separate bins, we group only the coefficients of interest. Therefore, resolving the collisions occurs only in the bins of interest.

The next remark addresses the issue of comparing, in the noisy case, the classic shift recovery method against the one measurement approach as a function of n and the signal-to-noise ratio.

Remark 6 (Expected accuracy performance and comparison between the circular convolution theorem and the one measurement model).

Given Result 2, we expect the shift recovery performance to follow:

1.: The recovery of the shift by the circular convolutional theorem improves as the length of the signals n increases or the variance of the noise $ζ^{2}$ decreases, i.e., it is easier to recover the correct shift under these circumstances;
2.: For shift recovery from a single measurement, as n increases, the noise variance $ζ^{2}$ for which the recovery accuracy approaches 100% decreases, i.e., it is harder to recover the correct shift under these circumstances;
3.: For shift recovery from a single measurement, as the variance decreases to zero, i.e., $ζ^{2} \to 0$ , the probability of correct recovery is bounded by $\frac{1}{gcd (i, n)}$ .

Proof.

Assuming the measurement model in Equation (25), for two different shifts q and

q^{'}

such that

q \neq q^{'}

, and the respective shifted vectors

x_{q}

and

x_{q^{'}}

, the idea is to compute the probability of choosing the wrong shift

q^{'}

over the correct q as

P (∥ y - x_{q^{'}} ∥_{2}^{2} \leq ∥ y - x_{q} ∥_{2}^{2})

. The inequality is reduced from

∥ y - x_{q^{'}} ∥_{2}^{2} {∥ y - x_{q} ∥}_{2}^{2} \leq 0

to the equivalent

∥ x_{q^{'}} - x_{q} ∥_{2}^{2} \leq 2 w^{T} (x_{q^{'}} - x_{q})

. Because

w \sim N (0_{n \times 1}, ζ^{2} I_{n})

we have that

w^{T} (x_{q^{'}} - x_{q}) \sim N (0, ζ^{2} ∥ x_{q^{'}} - x_{q} ∥_{2}^{2})

. Finally, note that

\begin{matrix} ∥ x_{q^{'}} - x_{q} ∥_{2}^{2} & = ∥ P^{q^{'} - 1} x - P^{q - 1} {x ∥}_{2}^{2} \\ = ∥ x - {(P^{q^{'} - 1})}^{T} P^{q - 1} {x ∥}_{2}^{2} \\ = ∥ x - P^{1 - q^{'}} P^{q - 1} {x ∥}_{2}^{2} \\ = ∥ x - P^{q - q^{'}} {x ∥}_{2}^{2} \\ = ∥ x - x_{q - q^{'} + 1} ∥_{2}^{2} . \end{matrix}

(32)

Therefore, the results depend only on the nonzero differences between the two shifts we consider. Then, if follows that for any shift difference

d = q - q^{'} + 1 \neq 0

we have that

∥ x - x_{d} ∥_{2}^{2} = {∥ x ∥}_{2}^{2} + ∥ x_{d} ∥_{2}^{2} - 2 x^{T} x_{d} = 2 {∥ x ∥}_{2}^{2} (1 - ρ_{d}),

(33)

where

ρ_{d}

is the normalized circular autocorrelation coefficient for delay d. We assume

| ρ_{d} | < 1

for all

d \neq 0

, i.e., there is no periodicity in the signal

x

. Then, the probability of error is given by

\begin{matrix} P (∥ y - x_{q^{'}} ∥_{2}^{2} & \leq ∥ y - x_{q} ∥_{2}^{2}) = Q (\frac{∥ x - x_{d} ∥_{2}}{2 ζ}) \\ = Q (\sqrt{\frac{(1 - ρ_{d}) {∥ x ∥}_{2}^{2}}{2 ζ^{2}}}) \\ = Q (\sqrt{\frac{n (1 - ρ_{d}) SNR}{2}}) . \end{matrix}

(34)

If we would allow

ρ_{d} = 1

then

Q (0) = \frac{1}{2}

, describing the inherit uncertainty in the shift recovery between q and

q^{'}

.

We have defined the Q-function as the tail distribution function of the standard normal distribution and

SNR = \frac{1}{n} \frac{{∥ x ∥}_{2}^{2}}{ζ^{2}}

. Finally, for

q^{★}

given by Result 2, by a union bound we have

\begin{matrix} P (q^{★} \neq q) & \leq \sum_{d = 1}^{n - 1} Q (\sqrt{\frac{n (1 - ρ_{d}) SNR}{2}}) \\ = (n - 1) Q (\sqrt{\frac{n (1 - ρ_{max}) SNR}{2}}), \end{matrix}

(35)

where

ρ_{max} = max_{d} ρ_{d}

is the highest autocorrelation coefficient. This shows that increasing n or SNR leads to a lower expected error upper bound. Naturally, high magnitude coefficients in the autocorrelation increase the probability of error. In fact,

P (q^{★} \neq q) \to 0

as

SNR \to \infty

or

n \to \infty

.

In the case of a single measurement, the main difficulty is the angular separation between adjacent phases on the unit circle, which is

2 π / n

. As n increases, this separation decreases, so at any fixed measurement SNR, the success probability of recovery decreases. Assuming that we use the classic approximation

2 sin (π / n) \approx 2 π / n

instead of the arc length distance between neighboring phases, correct identification occurs when the phase error in absolute value stays within half the grid spacing, i.e.,

π / n

. Therefore, by Remark 5, the probability of error for a single measurement whose estimate

q^{★}

is given by Euqation (16) is

P_{1} (q^{★} \neq q) \approx 2 Q (\sqrt{2 \frac{| {\tilde{x}}_{i} |^{2}}{{∥ x ∥}_{2}^{2}} SNR} \frac{π}{n}) .

(36)

Note there that the relationship with SNR is the same as in Equation (35), but the relationship with n is inverse, i.e., larger n decreases the probability of correct shift recovery from a single measurement. According to Equation (36), for signals of length

2 n

we need an SNR approximately

10 {log}_{10} (4) \approx 6

dB larger to achieve the same shift recovery accuracy as for n.

Finally, note that because

i (q - q^{'}) = 0 mod n

has exactly

gcd (i, n)

solutions, and assuming that q is uniformly distributed among all possible shifts, then the maximum possible probability of exact recovery is at most

\frac{1}{gcd (i, n)}

. This is because

gcd (i, n)

shifts the map to the same phase and only

\frac{n}{gcd (i, n)}

distinct phases. Therefore, the probability of Equation (36) asymptotically tends to one as the SNR

\to \infty

iff

gcd (i, n) = 1

. □

Next, we look at the compressed sensing extension of the shift retrieval problem and several other, more general shift retrieval problems.

3.3. The Compressive Shift Retrieval Problem

The compressive shift retrieval problem has been previously introduced [13,14]. In this section, we show how this result can also be described in the overall structure developed in this paper.

Define the sensing matrix

A \in C^{m \times n}, m \leq n,

and the compressed measurement signals

z = Ay \in C^{m}

and

v = Ax \in C^{m}

. Assuming that

y

is a circular shift in

x

, the goal is to determine the shift from

z

and

v

. Similarly to Equation (5), consider the test (Corollary 2 in [13]):

\underset{q}{argmax} ℜ {z^{H} {\bar{P}}^{q - 1} v},

(37)

where

{\bar{P}}^{q - 1} = A P^{q - 1} A^{H}

. It has been shown that when

A

is taken to be a partial Fourier matrix, then ([13], Corollary 4):

\max_{q \in {0, \dots, n - 1}} ℜ \{\sum_{i = 1}^{m} z_{i}^{*} v_{i} e^{\frac{- 2 π j k_{i} q}{n}}\},

(38)

recovers the true shift if there exists

i \in {1, \dots, m}

such that

{\tilde{x}}_{k_{i}} \neq 0

(the

k_{i}^{th}

coefficient of the Fourier transform of

x

) and

{1, \dots, n - 1} \frac{k_{i}}{n}

contain no integers. The set

K = {k_{i}}_{i = 1}^{m}

contains the indices of the rows contained in the partial Fourier matrix

A

. Following ([13], Theorem 1), we assume that the sensing matrix

A

obeys:

A^{H} {AP}^{q - 1} = P^{q - 1} A^{H} A

,

\exists γ \in R

so that

γ {AA}^{H} = I_{m}

and all columns of

A circ (x)

are different so that there is no ambiguity in the shift in the measurements. Without loss of generality, assume

γ = 1

.

The compressive shift retrieval result is partly based on the fact that

A^{H} {AP}^{q - 1} = P^{q - 1} A^{H} A

. Notice that

A^{H} A = F^{H} Σ F

where the diagonal

Σ

contains

{0, 1}

with ones on the positions where the rows of the Fourier matrix are selected (the set

K

). Notice that

A^{H} A

is a circulant and thus it commutes with

P^{q - 1}

– they have the same eigenspace. Also, given a set

K

of indices, we define the operation

{[a]}_{K} = b

for vectors

a \in C^{n}, b \in C^{m}, m \leq n,

as equality between values

b

and positions

K

of

a

, leaving the rest of the values of

a

indexed in

{1, 2, \dots, n} ∖ K

to zero.

Result 3 (Circulant compressive shift retrieval with a proof based on circulant matrices).

Given

z = Ay

and

v = Ax

where

y = P^{q - 1} x

, assuming

v_{i} \neq 0, i = 1, \dots, m

, then

{({Fe}_{q})}_{K} = z ⊘ v .

(39)

Proof.

We start again from the least squares problem,

\underset{q}{minimize} {∥ z - A P^{q - 1} A^{H} v ∥}_{2}^{2} .

(40)

With the assumption that

y - P^{q - 1} x = 0_{n \times 1}

the objective reaches the zero minimum,

\begin{matrix} Ay - {AP}^{q - 1} x & = Ay - {AA}^{H} {AP}^{q - 1} x \\ = Ay - {AP}^{q - 1} A^{H} Ax \\ = z - A P^{q - 1} A^{H} v, \end{matrix}

(41)

where we used the commutativity of circulant matrices and that

A A^{H} = I_{m}

. To develop Equation (40), start again from Equation (2) and the expression of the matrix multiplication as

vec (A F^{H} Σ F A^{H} v) = ({({FA}^{H} v)}^{T} \otimes ({AF}^{H})) vec (Σ)

. We finally obtain

\begin{matrix} ∥ z - A P^{q - 1} A^{H} {v ∥}_{2}^{2} = & ∥ z - A F^{H} Σ F A^{H} {v ∥}_{2}^{2} \\ = & ∥ vec (z) - vec (A F^{H} Σ F A^{H} v) ∥_{2}^{2} \\ = & ∥ z - ({({FA}^{H} v)}^{T} \otimes ({AF}^{H})) {vec (Σ) ∥}_{2}^{2} \\ = & ∥ z - {VFe}_{q} ∥_{2}^{2}, \end{matrix}

(42)

where the matrix

V

of size

m \times n

contains only the columns of the Kronecker product that match the non-zero elements of the diagonal matrix

Σ

. The matrix contains the elements of

v

in positions

(k_{i}, i)

. The second equality holds because the

ℓ_{2}

norm is element-wise, and therefore applying the vec operator does not change the value. It follows that

{VFe}_{q} = z

and

\begin{matrix} {({Fe}_{q})}_{K} = & V^{H} {(V V^{H})}^{- 1} z \\ = & V^{H} {(z ⊘ | v |}^{2}) \\ = & v^{*} ⊙ z ⊘ {| v |}^{2} \\ = & z ⊘ v . \end{matrix}

(43)

The compressive shift retrieval is equivalent to Equation (9), the regular shift retrieval, on the set of Fourier components

K

. This is a unified view of the classic and compressed shift retrieval solutions. □

In relation to Equation (38), we use the circulant structures to reach

\begin{matrix} z^{H} {\bar{P}}^{q - 1} v = & z^{H} A F^{H} Σ F A^{H} v \\ = & vec (z^{H} A F^{H} Σ F A^{H} v) \\ = & ({({FA}^{H} v)}^{T} \otimes (z^{H} {AF}^{H})) vec (Σ) \\ = & r^{T} {Fe}_{q}, \end{matrix}

(44)

where we expressed the matrix multiplications as a linear transformation on

Σ = diag ({Fe}_{q})

and

r \in C^{n}

is the expression in the parenthesis with

{[r]}_{K} = z^{*} ⊙ v .

The matrix

{FA}^{H} \in R^{n \times m}

is a partial permutation matrix—only positions

(k_{i}, i)

are non-zero. The products with

v

and

z

produce extended vectors

{[v]}_{K}, {[z]}_{K} \in C^{n}

. Thus, maximizing

z^{H} {\bar{P}}^{q - 1} v

reduces to the selection of

e_{q}

.

Due to the natural appearance of the Fourier matrix

F

in the factorization of circulant matrices its rows are also the natural choice in the rows of the measurement matrix

A

. Cancelations that occur because of this choice lead to the analytic results found. This shows a simple, but equivalent, alternative way to develop Equation (38) of [13].

3.4. The 1-to-N Shift Retrieval Problem

In the previous sections, we have assumed that the signals to be compared are singletons (we could call this the 1-to-1 shift retrieval problem). In this section, we explore what happens when we want to solve the shift retrieval problem between a signal

x

and a group of signals

Y \in R^{n \times N}

, i.e., find the shift for the signal

x

such that it aligns best with all N signals from

Y

. Just as before, we can approach this problem as maximizing Equation (7) or like a minimization problem (Equation (10)).

In our case, the quantity in Equation (7) generalizes to

\underset{q}{arg max} {∥ Y^{T} P^{q - 1} x ∥}_{1},

(45)

and this is equivalent to the approach:

arg max | circ {(x)}^{T} Y | 1_{N \times 1},

(46)

i.e., we take the index of the maximum entry of the

n \times 1

argument vector. The argument we want is the index where the quantity

{∥ circ (x)}^{T} {Y ∥}_{\infty}

is achieved. This is the matrix ∞-norm, i.e.,

{∥ Z ∥}_{\infty} = max_{i} \sum_{k} | Z_{i k} |

. The next result provides the way to compute the optimum shift.

Result 4 (One-to-many shift retrieval).

We are given a signal

x

and a group of signals

Y

, we aim to find the shift that achieves the highest correlation, in absolute value, between

x

and all the vectors

y_{i}

from

Y

in the sense of Equation (46). The shift q that maximizes this quantity is returned by

arg max | IFFT (diag (FFT {(x)}^{*}) FFT (Y)) | 1_{N \times 1},

(47)

i.e., we take the index of the maximum entry of the

n \times 1

argument vector.

Proof.

We use Equation (2) and expand the quantity in Equation (46),

\begin{matrix} circ {(x)}^{T} Y & = {(F^{H} diag (Fx) F)}^{H} Y = F^{H} diag {(Fx)}^{*} F Y . \end{matrix}

(48)

The matrix-vector product that follows computes the row-wise sums of the absolute value matrix. The computational complexity is dominated by

O (n N log n)

for the Fourier transforms and

O (n N)

for the summations. □

This result establishes the circular shift that, on average, aligns the data points as well as possible.

3.5. The N-to-N Shift Retrieval Problem

In the most general case of pairwise shifts, we are given two sets of signals

X \in R^{n \times N}

and

Y \in R^{n \times N}

; the problem is to find a single shift such that each signal

x_{i}

aligns as best as possible with the corresponding signal

y_{i}

. This can be seen as the generalization of the problem in the previous sections.

In this case, the quantity in Equation (45) further generalizes to

\underset{q}{arg max} trace (| Y^{T} P^{q - 1} X |),

(49)

We state the following result, as a generalization of Result 4.

Result 5 (Many-to-many shift retrieval).

We are given the signals

X

and

Y

, we aim to find the shift that achieves the highest correlation, in absolute value, between all pairs

x_{i}

and

y_{i}

in the sense of Equation (49). The shift q that maximizes this quantity is returned by

arg max | IFFT (FFT {(X)}^{*} ⊙ FFT (Y)) | 1_{N \times 1},

(50)

i.e., we take the index of the maximum entry of the

n \times 1

argument vector.

Proof.

We use Equation (2), Result 4 and expand the quantity

\begin{matrix} diag (Y^{T} P^{q - 1} X) & = diag (Y^{T} F^{H} diag ({Fe}_{q}) F X) \\ = diag ({\tilde{Y}}^{H} diag (f_{q}) \tilde{X}) \\ = ({\tilde{Y}}^{H} ⊙ {\tilde{X}}^{T}) f_{q} . \end{matrix}

(51)

The last equality leads to the expression in the result statement, as the trace is the sum of the diagonal vector entries. The matrix-vector product that follows computes the row-wise sums in absolute value. The computational complexity is dominated by

O (n N log n)

for the Fourier transforms and

O (n N)

for the summations. □

Notice how Results 4 and 5 are generalizations of the multiplicative cross-correlation formula from Equation (6). In these cases, when we do not expect alignment to be performed exactly, the division approach taken in Result 1 is not appropriate. In the context of these results, if indeed signals are circular shifts in each other, then

N = 1

is enough to recover the true shift. Thus, these methods actually recover an average shift that maximally aligns the data point pairs.

3.6. Linear Combinations of a Known Circularly Shifted Signal

In all previous sections, our objective was to recover a single shift that maximally aligns data points, either 1-to-1, 1-to-N, or N-to-N. Now, we consider a scenario where a single signal is circularly shifted in multiple positions, and we take linear combinations of these. The task is to recover all the shifts performed and their weights from the minimum number of measurements. Consider the following result, a generalization of Result 1:

Result 6 (Recovery of linear combinations of circular shifts).

We are given a signal

x

and the measurement

y

, which we assume is a linear combination of an unknown number of weighted circular shifts in

x

such that

y = \sum_{q = 1}^{n} α_{q} P^{q - 1} x,

(52)

then, stacking the real-valued weights

α_{q}

in the vector

α \in R^{n}

, and assuming

{\tilde{x}}_{i} \neq 0

holds for all indices, we have that

IFFT (FFT (y) ⊘ FFT (x)) = α .

(53)

Proof.

We start by solving the optimization problem

\underset{α}{minimize} {∥ y - \sum_{q = 1}^{n} α_{q} P^{q - 1} x ∥}_{2}^{2} .

(54)

Note that the optimization variables are the weights

α_{q}

, not the shifts. If a circular shift is missing in the linear combination, then the corresponding weight is zero. We develop the objective function value

\begin{matrix} ∥ y - \sum_{q = 1}^{n} α_{q} P^{q - 1} {x ∥}_{2}^{2} & = ∥ y - \sum_{q = 1}^{n} α_{q} F^{H} diag ({Fe}_{q}) {Fx ∥}_{2}^{2} \\ = ∥ Fy - \sum_{q = 1}^{n} α_{q} diag (f_{q}) {Fx ∥}_{2}^{2} \\ = ∥ \tilde{y} - \sum_{q = 1}^{n} α_{q} diag (f_{q}) \tilde{x} ∥_{2}^{2} \\ = ∥ \tilde{y} - diag (\tilde{x}) \sum_{q = 1}^{n} α_{q} f_{q} ∥_{2}^{2} \\ = ∥ \tilde{y} - diag (\tilde{x}) {F α ∥}_{2}^{2}, \end{matrix}

(55)

here we have used that

diag (f_{q}) \tilde{x} = f_{q} ⊙ \tilde{x} = diag (\tilde{x}) f_{q}

. Assuming that Equation (52) holds, we have

\tilde{y} = diag (\tilde{x}) F α

and finally

\tilde{y} ⊘ \tilde{x} = F α

, to reach the desired result. □

This result establishes that weighted linear combinations of a single known signal, which is circularly shifted, can be efficiently recovered from noiseless linear measurements. Note that we need not use all possible shifts, but only a subset—equivalent to having a sparse weight vector

α

.

A natural question might be what is the minimum number of measurements needed to recover the weights, and what happens when noise is added to the measurements? In general, we will need all n measurements

\tilde{y} ⊘ \tilde{x}

, but when the weight vector

α

is sparse, then well-known results from the signal processing literature provide better insights. First, note that for

{\tilde{x}}_{i} \neq 0

, the problem can be seen as a linear measurement problem of the type:

\tilde{y} ⊘ \tilde{x} = F α .

(56)

Assuming sparsity

s \in N, 1 \leq s ≪ n

for

α

, this is now a standard problem in Compressed Sensing (CS) [23] where we ask how many Fourier measurements we need, from the total n available, in order to correctly recover the weights

α

. To understand this problem and its solution, we make use of the following well-established results from the literature:

In the noiseless case, we know that in order to recover an exactly s-sparse vector $α$ , we need at least $m = 2 s$ consecutive Fourier measurements. This result is described in ([23], Theorem 2.15) via a Prony-type reconstruction procedure. Note that this is consistent with the findings of Remark 4 for $s = 1$ , where it is established that two non-DC components are required;
In the noisy measurements case, Prony-type methods cannot be used, as they are not robust against noise. Now, the stable recovery of an s-sparse $α$ of length n needs, with high probability, order $m \approx s poly \log (n)$ random Fourier measurements. The recovery of $α$ is performed via the $ℓ_{1}$ optimization problem Basis Pursuit Denoizing (BPD),

$\underset{α \in R^{n}}{minimize} {∥ α ∥}_{1} subject to {∥ {(\tilde{y} ⊘ \tilde{x})}_{K} - {(F)}_{K} α ∥}_{2} \leq ϵ,$

(57)

where positive $ϵ \in R$ is given and depends on the expected noise level. We have denoted here ${(F)}_{K}$ the $m \times n$ sub-matrix of the $n \times n$ Fourier matrix consisting of all the columns and only the rows indexed in the set $K$ . For the technical details on this result, the reader can consult ([23], Chapter 11). In the case of $s = 1$ , note that the solution is computed by finding the largest absolute value correlation between the columns of ${(F)}_{K}$ and ${(\tilde{y} ⊘ \tilde{x})}_{K}$ . Normalizing the columns of ${(F)}_{K}$ and defining the mutual coherence denoted $0 \leq μ ({(F)}_{K}) \leq 1$ , the shift retrieval for $s = 1$ and the recovery of the single weight $α$ is robust against noise whenever $| α | \geq \frac{2 ϵ}{1 - μ ({(F)}_{K})}$ . In general, $μ ({(F)}_{K})$ decreases with increasing m [24].

When the signal

x

is unknown and we try to recover both the shifts and the signal itself, the problem is much more difficult, as it requires some alternating optimization strategy, in general. This is related to the circulant dictionary learning problem [16,17,25].

4. Experimental Results

Some results described in this paper are algebraic in nature, and therefore, beyond their proofs, simulation experiments do not bring any further significant insights. In this section, we check numerically Result 2 and the noisy variant of Result 6, as in these cases, estimation accuracy needs to be computed and insights verified empirically.

4.1. Shift Recovery from Multiple Noisy Measurements

In the first experimental setting, we want to validate the findings in Result 2, Remarks 5 and 6. We will generate random signals

x

of size n, circularly shift them by a uniformly random quantity q, and then try to recover them from the

m \leq n

noisy measurements from

y

, as described in Equation (25). We also want to validate the intuition and findings of Remark 5 by proposing two ways of selecting the Fourier measurements: uniformly at random and then such that the m chosen Fourier coefficients are from the first half of the spectrum (to avoid duplicates due to conjugation) and have maximum

ℓ_{2}

norm, i.e., maximum sum squared magnitude. We expect the latter to perform much better in experiments. Performance is measured in two ways: the percentage of average correct shift recovery and as the Root Mean Squared Error (RMSE) between the true shifts and the estimates obtained, always modulo n.

In Figure 1 and Figure 2, we show the experimental results for several n and as the number of Fourier measurements m increases. In Figure 1 we show the percentage of correct shift recovery by Result 2 over 10,000 realizations for SNR in the range

[- 30, 60]

dB. We recover the shift from signals of size n with an increasing number of Fourier measurements m for various SNR values. Largest magnitude entries are selected, as per Remark 5. As highlighted by Remark 6, notice how the performance of the circular convolutional approach improves as the length of the signals n increases, and also notice how the performance of the one-measurement approach degrades (top to bottom). For the bottom plot, where

n = 8192

, we expected near-perfect shift recovery from one measurement only around 65 dB SNR. Note that all other experiments, which take the number of measurements m to be a percentage of the signal size, also exhibit improved accuracy as n grows, from top to bottom. Also note that for

n = 8192

we have almost perfect recovery by 0 dB SNR for

m = 25

measurements (

0.03 %

of n). For all n used in Figure 1, we have that

φ (n) = n / 2

, since these are perfect powers of two, and therefore we have 50% of measurements that obey

gcd (k_{i}, n) = 1

, i.e., the measurements with odd indices. Notice that when we approach 50% measurements, we reach the accuracy of the full circular convolution method. At the opposite end, for prime n we have

φ (n) = n - 1

, and therefore all indices would obey

gcd (k_{i}, n) = 1

.

In Figure 2, we show RMSE for the shift recovery accuracy when dealing with signals of size

n \in {300, 600, 900, 1200}

with an increasing number of Fourier measurements m. We show the two indices selection rules: uniformly at random and by largest magnitude, as per Remark 5. Results in all plots are averaged over 10.000 realizations. Observe that the index selection rule from Remark 5 significantly outperforms a random selection strategy and quickly reaches a near-zero RMSE. Of course, in order to be able to select the top magnitude m entries in the spectrum, we have to compute the whole spectrum. This might not always be possible, and it comes with the cost of calculating the full Fourier transform at the cost of

O (n log n)

, as opposed to

O (n m)

when only m Fourier coefficients are needed.

4.2. Recovering Multiple Shifts via $ℓ_{1}$ Minimization

In the last experimental setup, we experimentally test the findings of Result 6. We generate a random signal

x

with entries from the standard Gaussian distribution. For a fixed sparsity level s, we generate the weight vector

α

whose support is generated uniformly at random and whose values are from the standard Gaussian distribution. The shifts

α

are also generated uniformly at random in the feasible set. We acquire m noisy Fourier measurements, similarly to Equation (52),

y = \sum_{q = 1}^{n} α_{q} P^{q - 1} x + w, w \sim N (0_{n \times 1}, ζ^{2} I_{n}) .

(58)

Our goal is to recover the vector

α

, as its entries provide the weights, and the positions of the non-zero entries provide the shifts. To solve

ℓ_{1}

optimization problems, we use the publicly available CVXPY library [26]. Recovery results, averaged over 100 realizations, are shown in Figure 3. For

α

, we show the RMSE, which expresses the accuracy of weight estimation, and the successful support recovery rate, which expresses the accuracy of shift retrieval. For the support recovery, we compute the positions of the s largest magnitude entries in the solutions to the

ℓ_{1}

optimization problem, and we check the overlap with the true support. We report the recovery of support as a percentage. As expected, increasing the number of measurements leads to better performance, and of course, increasing the dimension n and the sparsity s degrades the recovery of both the weights and the support.

5. Conclusions

In this letter, we provide an overview of several shift retrieval problems based on optimization problems involving circulant matrices. We demonstrate that while the classic multiplicative cross-correlation method performs a perfectly adequate job in the shift retrieval problem, in many scenarios, the shift can be retrieved naturally from a few measurements by using a weighted correlation-like quantity. Our proposed approach also unifies several previously known results and methods under a single framework, providing natural generalizations. When appropriate, we successfully validate the algebraic results through numerical experimental simulations where the goal is to perform shift retrieval with as few measurements as possible.

Funding

This research was funded by Romanian Hub for Artificial Intelligence—HRIA, Smart Growth, Digitization and Financial Instruments Program, 2021–2027 (MySMIS no. 334906).

Data Availability Statement

The data presented in this study are openly available in GitHub at https://github.com/cristian-rusu-research/shift-invariance (accessed on 20 December 2025).

Conflicts of Interest

The author declare no conflicts of interest.

References

Carter, G.C. Time Delay Estimation. In Adaptive Methods in Underwater Acoustics; Springer: Dordrecht, The Netherlands, 1985; pp. 175–196. [Google Scholar] [CrossRef]
Cao, H.; Chan, Y.T.; So, H.C. Compressive TDOA Estimation: Cramer-Rao Bound and Incoherent Processing. IEEE Trans. Aerosp. Electron. Syst. 2020, 56, 3326–3331. [Google Scholar] [CrossRef]
Tong, X.; Ye, Z.; Xu, Y.; Gao, S.; Xie, H.; Du, Q.; Liu, S.; Xu, X.; Liu, S.; Luan, K.; et al. Image Registration with Fourier-Based Image Correlation: A Comprehensive Review of Developments and Applications. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2019, 12, 4062–4081. [Google Scholar] [CrossRef]
Peng, Y.; Ganesh, A.; Wright, J.; Xu, W.; Ma, Y. RASL: Robust alignment by sparse and low-rank decomposition for linearly correlated images. In Proceedings of the 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, San Francisco, CA, USA, 13–18 June 2010; pp. 763–770. [Google Scholar] [CrossRef]
Chandrakasan, A.P.; Lee, F.S.; Wentzloff, D.D.; Sze, V.; Ginsburg, B.P.; Mercier, P.P.; Daly, D.C.; Blazquez, R. Low-Power Impulse UWB Architectures and Circuits. Proc. IEEE 2009, 97, 332–352. [Google Scholar] [CrossRef]
Amiri, R.; Behnia, F.; Noroozi, A. An Efficient Estimator for TDOA-Based Source Localization with Minimum Number of Sensors. IEEE Commun. Lett. 2018, 22, 2499–2502. [Google Scholar] [CrossRef]
Spiesberger, J.L. Finding the right cross-correlation peak for locating sounds in multipath environments with a fourth-moment function. J. Acoust. Soc. Am. 2000, 108, 1349–1352. [Google Scholar] [CrossRef] [PubMed]
Nishiura, T.; Yamada, T.; Nakamura, S.; Shikano, K. Localization of multiple sound sources based on a CSP analysis with a microphone array. In Proceedings of the 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100), Istanbul, Turkey, 5–9 June 2000; Volume 2, pp. II1053–II1056. [Google Scholar] [CrossRef]
Huber, M.; Schlegel, M.; Klinker, G. Application of time-delay estimation to mixed reality multisensor tracking. J. Virtual Real. Broadcast. 2014, 11. [Google Scholar] [CrossRef]
Liu, Y.; Wang, C.; Lu, M.; Yang, J.; Gui, J.; Zhang, S. From Simple to Complex Scenes: Learning Robust Feature Representations for Accurate Human Parsing. IEEE Trans. Pattern Anal. Mach. Intell. 2024, 46, 5449–5462. [Google Scholar] [CrossRef] [PubMed]
Wang, C.; Zhang, Q.; Wang, X.; Zhou, L.; Li, Q.; Xia, Z.; Ma, B.; Shi, Y.Q. Light-Field Image Multiple Reversible Robust Watermarking Against Geometric Attacks. IEEE Trans. Dependable Secur. Comput. 2025, 22, 5861–5875. [Google Scholar] [CrossRef]
Kelishadrokhi, M.K.; Ghattaei, M.; Fekri-Ershad, S. Innovative local texture descriptor in joint of human-based color features for content-based image retrieval. Signal Image Video Process. 2023, 17, 4009–4017. [Google Scholar] [CrossRef]
Ohlsson, H.; Eldar, Y.C.; Yang, A.Y.; Sastry, S.S. Compressive Shift Retrieval. IEEE Trans. Signal Process. 2014, 62, 4105–4113. [Google Scholar] [CrossRef]
Ohlsson, H.; Eldar, Y.C.; Yang, A.Y.; Sastry, S.S. Compressive shift retrieval. In Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 26–31 May 2013; pp. 6034–6038. [Google Scholar] [CrossRef]
Clausen, M.; Kurth, F. Robust compressive shift retrieval in linear time. In Proceedings of the 2016 24th European Signal Processing Conference (EUSIPCO), Budapest, Hungary, 28 August–2 September 2016; IEEE: New York, NY, USA, 2016; pp. 364–368. [Google Scholar]
Rusu, C.; Dumitrescu, B.; Tsaftaris, S.A. Explicit Shift-Invariant Dictionary Learning. IEEE Signal Process. Lett. 2014, 21, 6–9. [Google Scholar] [CrossRef]
Rusu, C. On learning with shift-invariant structures. Digit. Signal Process. 2020, 99, 102654. [Google Scholar] [CrossRef]
Oppenheim, A.V.; Schafer, R.W.; Buck, J.R. Discrete-Time Signal Processing, 2nd ed.; Prentice Hall: Englewood Cliffs, NJ, USA, 1999. [Google Scholar]
Goertzel, G. An Algorithm for the Evaluation of Finite Trigonometric Series. Am. Math. Mon. 1958, 65, 34–35. [Google Scholar] [CrossRef]
Frigo, M.; Johnson, S.G. The Design and Implementation of FFTW3. Proc. IEEE 2005, 93, 216–231. [Google Scholar] [CrossRef]
Hassanieh, H.; Indyk, P.; Katabi, D.; Price, E. Nearly optimal sparse Fourier transform. In Proceedings of the Forty-Fourth Annual ACM Symposium on Theory of Computing (STOC ’12), New York, NY, USA, 19–22 May 2012; pp. 563–578. [Google Scholar] [CrossRef]
Hassanieh, H.; Indyk, P.; Katabi, D.; Price, E. Simple and practical algorithm for sparse Fourier transform. In Proceedings of the Twenty-Third Annual ACM-SIAM Symposium on Discrete Algorithms (SODA ’12), Kyoto, Japan, 17–19 January 2012; pp. 1183–1194. [Google Scholar]
Foucart, S.; Rauhut, H. A Mathematical Introduction to Compressive Sensing; Applied and Numerical Harmonic Analysis; Birkhäuser: Basel, Switzerland, 2013; pp. I–XVIII, 1–625. [Google Scholar]
Rusu, C.; González-Prelcic, N.; Heath, R.W. Algorithms for the construction of incoherent frames under various design constraints. Signal Process. 2018, 152, 363–372. [Google Scholar] [CrossRef]
Pope, G.; Aubel, C.; Studer, C. Learning phase-invariant dictionaries. In Proceedings of the 2013 IEEE International Conference on Acoustics, Speech and Signal Processing, Vancouver, BC, Canada, 26–31 May 2013; pp. 5979–5983. [Google Scholar] [CrossRef]
Diamond, S.; Boyd, S. CVXPY: A Python-embedded modeling language for convex optimization. J. Mach. Learn. Res. 2016, 17, 2909–2913. [Google Scholar]

Figure 1. Average accuracy of correct shift recovery by Result 2. From top to bottom, the signal sizes are

n \in {1024, 2048, 4096, 8192}

.

Figure 1. Average accuracy of correct shift recovery by Result 2. From top to bottom, the signal sizes are

n \in {1024, 2048, 4096, 8192}

.

Figure 2. RMSE of recovered shift for increasing number of Fourier measurements m. The top four plots have

SNR = 0

dB, while the bottom four plots have

SNR = 15

dB.

Figure 2. RMSE of recovered shift for increasing number of Fourier measurements m. The top four plots have

SNR = 0

dB, while the bottom four plots have

SNR = 15

dB.

Figure 3. RMSE of recovered shifts and their weights according to Result 6. The top two plots have

n = 300

and sparsity

s = 15

, and the bottom two plots have

n = 600

and sparsity

s = 20

.

Figure 3. RMSE of recovered shifts and their weights according to Result 6. The top two plots have

n = 300

and sparsity

s = 15

, and the bottom two plots have

n = 600

and sparsity

s = 20

.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Rusu, C. A Note on Shift Retrieval Problems. Mathematics 2026, 14, 532. https://doi.org/10.3390/math14030532

AMA Style

Rusu C. A Note on Shift Retrieval Problems. Mathematics. 2026; 14(3):532. https://doi.org/10.3390/math14030532

Chicago/Turabian Style

Rusu, Cristian. 2026. "A Note on Shift Retrieval Problems" Mathematics 14, no. 3: 532. https://doi.org/10.3390/math14030532

APA Style

Rusu, C. (2026). A Note on Shift Retrieval Problems. Mathematics, 14(3), 532. https://doi.org/10.3390/math14030532

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Note on Shift Retrieval Problems

Abstract

1. Introduction

2. Classic Shift Retrieval Problems

2.1. Circulant Matrices Primer

2.2. Classic Shift Retrieval

3. A Circulant Matrix Perspective on the Shift Retrieval Problems

3.1. Noiseless Shift Retrieval

3.2. Noisy Shift Retrieval

3.3. The Compressive Shift Retrieval Problem

3.4. The 1-to-N Shift Retrieval Problem

3.5. The N-to-N Shift Retrieval Problem

3.6. Linear Combinations of a Known Circularly Shifted Signal

4. Experimental Results

4.1. Shift Recovery from Multiple Noisy Measurements

4.2. Recovering Multiple Shifts via $ℓ_{1}$ Minimization

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

A Note on Shift Retrieval Problems

Abstract

1. Introduction

2. Classic Shift Retrieval Problems

2.1. Circulant Matrices Primer

2.2. Classic Shift Retrieval

3. A Circulant Matrix Perspective on the Shift Retrieval Problems

3.1. Noiseless Shift Retrieval

3.2. Noisy Shift Retrieval

3.3. The Compressive Shift Retrieval Problem

3.4. The 1-to-N Shift Retrieval Problem

3.5. The N-to-N Shift Retrieval Problem

3.6. Linear Combinations of a Known Circularly Shifted Signal

4. Experimental Results

4.1. Shift Recovery from Multiple Noisy Measurements

4.2. Recovering Multiple Shifts via ℓ 1 Minimization

5. Conclusions

Funding

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.2. Recovering Multiple Shifts via $ℓ_{1}$ Minimization