A Method for Reducing Timing Jitter’s Impact in Through-Wall Human Detection by Ultra-Wideband Impulse Radar

: Ultra-wideband (UWB) impulse radar is widely used for through-wall human respiration detection due to its high range resolution and high penetration capability. UWB impulse radar emits very narrow time pulses, which can directly obtain the impulse response of the target. However, the time interval between successive pulses emitted is not ideally ﬁxed because of timing jitter. This results in the impulse response position of the same target not being ﬁxed, but it is related to slow-time. The clutter scattered by the stationary target becomes non-stationary clutter, which affects the accurate extraction of the human respiration signal. In this paper, we propose a method for reducing timing jitter’s impact in through-wall human detection by UWB impulse radar. After the received signal is processed by the Fast Fourier transform (FFT) in slow-time, we model the range-frequency matrix in the frequency domain as a superposition of the low-rank representation of jitter-induced clutter data and the sparse representation of human respiratory data. By only extracting the sparse component, the impact of timing jitter in human respiration detection can be reduced. Both numerical simulated data and experimental data demonstrate that our proposed method can effectively remove non-stationary clutter induced by timing jitter and improve the accuracy of the human target signal extraction. change of time is equivalent to the change of distance, we simulated the timing jitter of the transmitted signal by randomly varying the height of the antenna model. The distance of the antenna from the upper ﬂoor obeys a Gaussian distribution with a mean value of 15 cm and a standard deviation of 3 mm.

In microwave radar, radar signal processing is usually full of challenges, under non-ideal conditions including lower signal-to-clutter-and-noise ratio (SCNR) [28] and effectiveness of the proposed algorithm is verified by both the numerical simulated data and experimental data.
A detailed description of the proposed method for reducing timing jitter's impact is provided in this paper. This paper is organized as follows. Section 2 analyzes the effect of timing jitter on scattered clutter from the stationary target. Section 3 details the proposed method to remove non-stationary clutter induced by timing jitter. In Section 4, the result from the numerical simulation shows the effectiveness of the proposed method. In Section 5, the experimental validation of the proposed method in realistic through-wall and through-floors situations is presented, together with the description of the radar system developed. Finally, Section 6 concludes this paper.

Timing Jitter Analysis
The Gaussian pulse emitted by the UWB impulse radar can be formulated as [42]: where σ t is related to the pulse width. Consider a simple situation, and assume that there is only one human target and one stationary target in the detection area. The impulse response of the UWB impulse radar can be expressed as [43]: h(t, τ) = a r δ(t − t r (τ)) + a s δ(t − t s ), (2) where t represents fast-time and τ represents slow-time. a s δ(t − t s ) denotes the response of the static target with propagation time t s and amplitude a s . a r δ(t − t r (τ)) is the respiration response of the human target with propagation time t r (τ) and amplitude a r . The propagation time t r (τ) varies approximately as harmonic vibration due to respiration [44]. The received signal can be expressed as: y(t, τ) = x(t) * h(t, τ) = a r x(t − t r (τ)) + a s x(t − t s ) = r(t, τ) + c(t). ( The human respiration signal is r(t, τ), and c(t) represents the stationary target clutter which is considered constant in slow-time.
It is worth noting that the stationary target clutter, c J (t, τ), becomes variable with slow-time, τ, due to timing jitter. Impulse responses recorded by the UWB radar for this scenario are shown in Figure 1, where the vertical axis represents the fast-time dimension of the impulse response and the horizontal axis represents the slow-time dimension. One vertical line indicates one impulse response. Waves scattered from the stationary target are illustrated by the black bipolar monopulse and waves scattered from the human target are depicted by the red bipolar monopulse. In the ideal absence of timing jitter, the clutter scattered by the stationary target creates a horizontal line parallel to the slow-time axis, which can be regarded as a time-invariant background. When timing jitter is present, impulse responses are not aligned to each other, and the time-invariant horizontal line no longer exists. As a result, the position where clutter appears in each impulse response will be random, and the clutter, like the human respiratory signal, will be related to slow-time.
scenario are shown in Figure 1, where the vertical axis represents the fast-time dimension of the impulse response and the horizontal axis represents the slow-time dimension. One vertical line indicates one impulse response. Waves scattered from the stationary target are illustrated by the black bipolar monopulse and waves scattered from the human target are depicted by the red bipolar monopulse. In the ideal absence of timing jitter, the clutter scattered by the stationary target creates a horizontal line parallel to the slow-time axis, which can be regarded as a time-invariant background. When timing jitter is present, impulse responses are not aligned to each other, and the time-invariant horizontal line no longer exists. As a result, the position where clutter appears in each impulse response will be random, and the clutter, like the human respiratory signal, will be related to slow-time. can be approximated as [46]: We denote the first-order derivative of J c m n also obeys the Gaussian random process in the slow-time dimension, which can be expressed as: Furthermore, expanding c J (t, τ) in Equation (5) using Taylor series around t m , and ignoring the high-order items, c J (t, τ) can be approximated as [46]: We denote the first-order derivative of x(t − t s ) by k(t) = ∂x(t − t s )/∂t, then c J (t, τ) can be expressed in discrete form as: where m = 0, 1, . . . , M − 1 is the fast-time sampling point and n = 0, 1, . . . , N − 1 is the slow-time sampling point. δ T is the fast-time sampling interval and T s is the pulse repetition time, which is also the slow-time sampling interval. In general, timing jitter, t J (τ), obeys a Gaussian random process with mean zero and standard deviation σ J [47], as: Thus, c J [m, n] also obeys the Gaussian random process in the slow-time dimension, which can be expressed as: c J [m, n] has mean a s x(mδ T − t s ) and variation |a s k(mδ T )| 2 σ 2 J . It can be obtained that the distortion introduced by timing jitter depends on the change rate of the signal and the jitter level.
The distortion caused by timing jitter is schematically shown in Figure 2. The two solid lines indicate the sloped part of received signals y 1 (t) and y 2 (t), with different slopes. The signals are assumed to be recorded at t m . However, due to the timing jitter, t J (τ), they are recorded at t m + t J (τ). In this example, for the signals y 1 (t) and y 2 (t), the distortions it causes are ∆V 1 and ∆V 2 , respectively.  The same conclusion can be obtained that the distortion introduced by timi depends on the slope of the signal and the level of timing jitter. Since UWB impul transmits pulses that are narrow in time, this results in a large change rate of th mitted signal. At the same time, to improve the SCNR, increasing the transmitted of the radar is a simple and effective method. This in turn makes the change rat transmitted signal and jitter level further increase, worsening the distortion. In quency domain, the distortion occupies the entire frequency band and is quite the human respiration region. Therefore, it is difficult to perform human res signal extraction in the frequency domain without distortion suppression.

Proposed Method for Reducing Timing Jitter's Impact in Through-Wall Hum Detection
From the above analysis, it can be obtained that timing jitter can turn the scattered from stationary targets into dynamic interference. Inspired by the recent in low-rank approximations, we model the range-frequency matrix in the fre domain as a superposition of the low-rank representation of jitter-induced clut and the sparse representation of human respiratory data. The problem of supp non-stationary clutter introduced by timing jitter in human respiration detection converted into an optimization problem of low-rank and sparse representation section, we propose a method to effectively reduce the effect of timing jitt flowchart of the proposed method is illustrated in Figure 3. The proposed meth sists of three main steps: (A) initial compensation method in the time domain, (B enhancement method, and (C) sparse component extraction method in the fre domain. First, in step A, we use cross-correlation to correct the time offset of e pulse response, achieving initial compensation for timing jitter in the time domai The same conclusion can be obtained that the distortion introduced by timing jitter depends on the slope of the signal and the level of timing jitter. Since UWB impulse radar transmits pulses that are narrow in time, this results in a large change rate of the transmitted signal. At the same time, to improve the SCNR, increasing the transmitted power of the radar is a simple and effective method. This in turn makes the change rate of the transmitted signal and jitter level further increase, worsening the distortion. In the frequency domain, the distortion occupies the entire frequency band and is quite large in the human respiration region. Therefore, it is difficult to perform human respiration signal extraction in the frequency domain without distortion suppression.

Proposed Method for Reducing Timing Jitter's Impact in Through-Wall Human Detection
From the above analysis, it can be obtained that timing jitter can turn the clutter scattered from stationary targets into dynamic interference. Inspired by the recent studies in low-rank approximations, we model the range-frequency matrix in the frequency domain as a superposition of the low-rank representation of jitter-induced clutter data and the sparse representation of human respiratory data. The problem of suppressing non-stationary clutter introduced by timing jitter in human respiration detection can be converted into an optimization problem of low-rank and sparse representation. In this section, we propose a method to effectively reduce the effect of timing jitter. The flowchart of the proposed method is illustrated in Figure 3. The proposed method consists of three main steps: (A) initial compensation method in the time domain, (B) signal enhancement method, and (C) sparse component extraction method in the frequency domain. First, in step A, we use cross-correlation to correct the time offset of each impulse response, achieving initial compensation for timing jitter in the time domain. Then, in step B, we suppress background and noise signals, and enhance weak respiration signals to improve SCNR. Finally, in step C, the difference of sparsity in the frequency dimension between the human respiration signal and the jitter-induced clutter is used to accurately extract the human respiration signal in the frequency domain.
Remote Sens. 2021, 13, 3577 6 of 21 in step B, we suppress background and noise signals, and enhance weak respiration signals to improve SCNR. Finally, in step C, the difference of sparsity in the frequency dimension between the human respiration signal and the jitter-induced clutter is used to accurately extract the human respiration signal in the frequency domain.

Step A: Initial Compensation Method in the Time Domain
To achieve initial compensation for timing jitter, a simple idea is to find the time offset of each impulse response and then compensate it to the correct time instant. Since the timing jitter interval is often shorter than the fast time sampling interval, interpolation is performed first to fill the gap between the two fast time sampling points. Next, we find the time offset between the two impulse responses by cross-correlation in the time domain and then align each impulse response, thus achieving initial compensation for timing jitter. Considering the computing time, we only perform the cross-correlation operation on the direct wave part of the impulse response. We choose one impulse response as a reference and denote it by [ ] Align each impulse response with the reference impulse response to achieve initial compensation for timing jitter in the time domain.

Step A: Initial Compensation Method in the Time Domain
To achieve initial compensation for timing jitter, a simple idea is to find the time offset of each impulse response and then compensate it to the correct time instant. Since the timing jitter interval is often shorter than the fast time sampling interval, interpolation is performed first to fill the gap between the two fast time sampling points. Next, we find the time offset between the two impulse responses by cross-correlation in the time domain and then align each impulse response, thus achieving initial compensation for timing jitter. Considering the computing time, we only perform the cross-correlation operation on the direct wave part of the impulse response. We choose one impulse response as a reference and denote it by y r [m]. y n [m] denotes the nth impulse response. Then, the cross-correlation function between the nth impulse response y n [m] and the reference response y r [m] can be defined as: where 0 ≤ m 1 , m 2 ≤ M, and m = m 1 − m 2 . The peak position of the cross-correlation function indicates the time offset of the two impulse responses, Correspondingly, the nth impulse response is shifted by ∆m n in the fast-time dimension so that it is aligned with the reference impulse response, Align each impulse response with the reference impulse response to achieve initial compensation for timing jitter in the time domain.

Adaptive Background Subtraction
The strong time-invariant background usually covers the respiration signal of interest, so the background needs to be removed from the received data. The simplest method is to calculate an average impulse response from all measured impulse responses. However, this method can only be processed after the measurement is completed and cannot be operated in real time. When detecting people based on their micro-movement, respiration or cardiac activity, these reflected signals have the same time delay and are easily misinterpreted as static clutter and removed using background subtraction.
Based on the above analysis, we choose the adaptive background subtraction (ABS) method based on the exponential averaging, which replaces a scalar weighting factor α with a vector of weighting coefficient λ [48].
where p n [m] and q n [m] are one-dimensional vectors with the size [M × 1] and contain 'sampled background estimate' and 'measured impulse response', respectively. Thus, the new background estimate takes a fraction of the previous estimate and a fraction from the measured impulse response. The weighting coefficient has the size [M × 1] and is determined by two thresholds related to motions, and motions are automatically divided into micro-and macro-motions.

Filtering in Fast-Time Dimension
In order to further improve SCNR, a bandpass filter with the same bandwidth as the transmitted signal is added and performed along the fast-time dimension to filter out substantial high-frequency noise introduced by oversampling.

Advance Normalization
It follows from the basic radar equation that the level of signal components scattered by a target and received by the radar depends among others on the distance between transmitting antenna-target-receiving antenna. In addition, the low reflectivity of human body and high signal attenuation of obstacles make the energy of the respiration signal recorded by the radar weak.
The advance normalization (AN) method is used to enhance the weak respiration signal [49]. The basic idea of the method is to search the maximum in the interval (t Lmax , t end ] serially, and normalize the current signal in the interval (t Lmax , t Nmax ], where t Lmax represents the propagation time instant of the last found maximum, t Nmax is the propagation time instant of the new found maximum, and t end is the last time instant of the measured impulse response.

Step C: Sparse Component Extraction Method in the Frequency Domain
To extract the frequency of the respiration signal, the Fast Fourier transform (FFT) is performed in the slow-time dimension. Based on the prior knowledge of human respiration frequency, a selected frequency window ranging from 0.1 to 0.6 Hz is added to remove high-frequency components. Its resultant matrix is M × N w range-frequency matrix Y, where N w represents the number of frequency points after windowing.
In an ideal situation without timing jitter, the matrix Y will only include respiration signal: With timing jitter present, not only respiration signals but also non-stationary clutter will exist in the matrix Y. The non-stationary clutter induced by timing jitter occupies the entire frequency band, and in the range-frequency matrix, it can be considered as a lowrank component. The respiration signal exhibits concentrated characteristics in both range and frequency dimensions, and it can be regarded as a sparse component relative to the whole range-frequency matrix. We model the range-frequency matrix Y as a superposition of the low-rank representation of jitter-induced clutter data and the sparse representation of human respiratory data: Remote Sens. 2021, 13, 3577 8 of 21 Using the difference in the sparsity of the two signals in the frequency dimension, we can separate them well in the frequency domain. Low-rank and sparse representation recovers a low-rank matrix L and a sparse matrix S from a data matrix Y (Y = L + S) through the following process: min However, Equation (16) is a highly nonconvex optimization problem with no efficient solutions. To make it become a tractable optimization problem, many methods have been presented in the literature. Robust principal component analysis (RPCA) [50] optimizes the objective function by replacing the l 0 -norm with the l 1 -norm, and the rank of L with the nuclear norm L * : min Some works have shown that the Frobenius norm based representation is more computationally efficient than nuclear norm based representation [51,52]. The objective function proposed by principal coefficients embedding (PCE) [53] is as follows: Robust non-negative matrix factorization (RNMF) is a novel dimensionality reduction method based on matrix factorization that can separate the low-rank and sparse parts of the data in case the data are grossly corrupted [54]. We choose RNMF to extract the sparse component. Y can be expressed as where Y ∈ R M×N w , W ∈ R M×k + , H ∈ R k×N w + , S ∈ R M×N w , and k is the rank. The sparse respiration signal is approximated by the sparse error matrix S: The low-rank non-stationary clutter is approximated by the product of two nonnegative matrices W and H: We chose the square Euclidean distance between Y and S + WH to measure the approximation error in Equation (19), and added a l 1 -norm constraint on S to enhance the sparsity. Thus, Equation (19) can be formulated as an optimization problem: where · F denotes the matrix Frobenius norm, S ij , and λ > 0. λ is the regularization parameter, which determines the sparsity of S.
Problem (22) is separately convex for W, H, and S, so a local optimal solution can be obtained by solving a series of convex optimization problems. The values of W, H, and S are updated separately when other variables are fixed. For W and H, the optimization problems are in the form of non-negative quadratic programming, which can be solved by multiplicative updates [55]. Consider a non-negative quadratic programming problem of the following form: where A is a symmetric positive semidefinite matrix. It has been proven that the optimal solution of problem (23) can be obtained by multiplicative updates [55]: Both A + and A − represent non-negative matrices, and they are defined as follows: With each iteration of Equation (24), the value of the objective function will decrease. Consider the update of H, when W and S are fixed. Y − WH − S 2 F of Equation (22) can be expressed as: In this way, the convex optimization problem for updating H can be expressed as: Problem (27) can be solved using Equations (23) and (24). W T W in Equation (27) corresponds to the matrix A in problem (23). Since W is a non-negative matrix, we can obtain: The iterative update equation for H is as follows: Similarly, the update rule for W is given by: In order to ensure that the solution to the optimization problem is unique, each basis vector in W is normalized and the norms of the bases are compensated to H. Two normalization steps can be given as For S, the optimization problem can be solved by the soft-thresholding operator [56]. The definition of the soft-thresholding operator, T v (·), is as follows: where x ∈ R and v > 0. Consider a l 1 -minimization problem of the following form: The solution b * of Equation (34) can be given by T v (x) [56]. When W and H are fixed, the optimization problem for updating S is: Thus, the optimal solution to problem (35) is T λ/2 (Y − WH). The iterative update equation for the matrix S can be formulated as: Equation (37) shows that if (λ/2) > max ij (Y − WH) ij , S will become a null matrix. It should be noted that, for RNMF, the regularization parameter λ is a key parameter that controls the trade-off between the low-rank component and the sparse component. When the value of λ is too large, the matrix S represents little sparse information and corrupts the respiration signal of interest. When the value of λ is too small, the matrix S contains clutter and noise, increasing the difficulty of respiration extraction. In our method, we carefully chose the value of λ that maximizes the SCNR for the UWB impulse radar prototype used and applied the same value in all experiments.

Numerical Simulation
Numerical simulation of a realistic through-floors respiration detection scenario has been performed using the gprMax software to validate the effectiveness of the proposed algorithm. Two floors are included in the simulation scenario, as shown in Figure 4. The floor is homogeneous and made up of a material with a relative dielectric constant of 4 and a conductivity of 0.01 S/m, and its thickness is 12 cm. The spacing between the two floors is 80 cm. We used a metal cylinder whose radius is modulated by the simple harmonic motion at 0.18 Hz to simulate the chest motion caused by human breathing. The metal cylinder is placed 15 cm above the lower floor. The antenna model was chosen from its inbuilt library as Geophysical Survey Systems, Inc. (GSSI), working at 400 MHz. Since the change of time is equivalent to the change of distance, we simulated the timing jitter of the transmitted signal by randomly varying the height of the antenna model. The distance of the antenna from the upper floor obeys a Gaussian distribution with a mean value of 15 cm and a standard deviation of 3 mm. and a conductivity of 0.01 S/m, and its thickness is 12 cm. The spacing be floors is 80 cm. We used a metal cylinder whose radius is modulated by t monic motion at 0.18 Hz to simulate the chest motion caused by human metal cylinder is placed 15 cm above the lower floor. The antenna mod from its inbuilt library as Geophysical Survey Systems, Inc. (GSSI), workin Since the change of time is equivalent to the change of distance, we simula jitter of the transmitted signal by randomly varying the height of the anten distance of the antenna from the upper floor obeys a Gaussian distributio value of 15 cm and a standard deviation of 3 mm. The direct FFT results of the raw echo signal without timing jitter an jitter are shown in Figure 5a,b, respectively. From Figure 5a, we can see th tion signal with a Doppler frequency of 0.18 Hz and its harmonic com cluded in the range-frequency matrix Y. Due to the differential effect of th respiration signal exhibits multiple peaks in the range dimension. As can ure 5b, in addition to the respiration signal and its harmonic componen tered by floors is also detected, which should be time-invariant non-stationary clutter due to timing jitter. The strength of the clutter sc upper floor is higher than that of the clutter scattered by the lower floor. fects the extraction of the respiration signal. The non-stationary clutter in ing jitter occupies the entire frequency band, and in the range-frequency m be considered as a low-rank component. The respiration signal exhibit characteristics in both range and frequency dimensions, and it can be sparse component relative to the whole range-frequency matrix Y. The direct FFT results of the raw echo signal without timing jitter and with timing jitter are shown in Figure 5a,b, respectively. From Figure 5a, we can see that the respiration signal with a Doppler frequency of 0.18 Hz and its harmonic component are included in the range-frequency matrix Y. Due to the differential effect of the antenna, the respiration signal exhibits multiple peaks in the range dimension. As can be seen in Figure 5b, in addition to the respiration signal and its harmonic component, clutter scattered by floors is also detected, which should be time-invariant but becomes non-stationary clutter due to timing jitter. The strength of the clutter scattered by the upper floor is higher than that of the clutter scattered by the lower floor. The clutter affects the extraction of the respiration signal. The non-stationary clutter induced by timing jitter occupies the entire frequency band, and in the range-frequency matrix Y, it can be considered as a low-rank component. The respiration signal exhibits concentrated characteristics in both range and frequency dimensions, and it can be regarded as a sparse component relative to the whole range-frequency matrix Y. The proposed algorithm was compared with MS, SVD, and the method in [36]. Figure 6a-d represents the respiration detection results of MS, SVD, the method in [36], and the proposed algorithm, respectively. Unlike the traditional MS algorithm, we averaged the data in the frequency dimension instead of the slow-time dimension. The MS algorithm can suppress non-stationary clutter, but the suppression is limited and does not The proposed algorithm was compared with MS, SVD, and the method in [36]. Figure 6a-d represents the respiration detection results of MS, SVD, the method in [36], and the proposed algorithm, respectively. Unlike the traditional MS algorithm, we averaged the data in the frequency dimension instead of the slow-time dimension. The MS algorithm can suppress non-stationary clutter, but the suppression is limited and does not completely remove the clutter. The SVD algorithm eliminates the clutter by discarding the eigenvectors corresponding high-valued eigenvalues. The result indicates that the SVD algorithm also retains part of the non-stationary clutter. The method in [36] removes the jitter distortion in clutter mitigation by only aligning impulse responses in the time domain. In its result matrix, the clutter scattered by both floors is well-removed, and the respiration signal and its harmonic component are retained. Our proposed algorithm can suppress the non-stationary clutter scattered by both floors, and only the respiration signal is included in the range-frequency matrix Y. Although the extracted respiration signal is corrupted in the range dimension, its most important part is retained, and this corruption is acceptable. The proposed algorithm was compared with MS, SVD, and the method in [36]. Figure 6a-d represents the respiration detection results of MS, SVD, the method in [36], and the proposed algorithm, respectively. Unlike the traditional MS algorithm, we averaged the data in the frequency dimension instead of the slow-time dimension. The MS algorithm can suppress non-stationary clutter, but the suppression is limited and does not completely remove the clutter. The SVD algorithm eliminates the clutter by discarding the eigenvectors corresponding high-valued eigenvalues. The result indicates that the SVD algorithm also retains part of the non-stationary clutter. The method in [36] removes the jitter distortion in clutter mitigation by only aligning impulse responses in the time domain. In its result matrix, the clutter scattered by both floors is well-removed, and the respiration signal and its harmonic component are retained. Our proposed algorithm can suppress the non-stationary clutter scattered by both floors, and only the respiration signal is included in the range-frequency matrix Y. Although the extracted respiration signal is corrupted in the range dimension, its most important part is retained, and this corruption is acceptable. We used improvement factor (IF) for quantitative analysis, which indicates the improvement of the SCNR after suppressing non-stationary clutter. IF in decibels (dB) is given as: 10 IF=10 log (SCNR SCNR ), after before (38) where SCNR before and SCNR after are the SCNRs of the result of respiration detection before and after applying the non-stationary clutter suppression method, respectively. The SCNR of the range-frequency matrix is defined as: We used improvement factor (IF) for quantitative analysis, which indicates the improvement of the SCNR after suppressing non-stationary clutter. IF in decibels (dB) is given as: IF =10 log 10 (SCNR a f ter /SCNR be f ore ), (38) where SCNR be f ore and SCNR a f ter are the SCNRs of the result of respiration detection before and after applying the non-stationary clutter suppression method, respectively. The SCNR of the range-frequency matrix is defined as: where N S and N C are the number of elements in the selected signal region Y S and clutter region Y C , respectively. Y(p) denotes the pth element of the matrix. Quantitative results of different methods are compared in Table 1. The MS algorithm averaging in the frequency dimension is a little better than SVD. Both the method in [36] and our proposed method have higher scores. Although our proposed method partially corrupted the respiration signal, it still obtained the highest IF score of 13.45 dB.

Experimental Verification of The Proposed Method
To validate the effectiveness of the proposed method in realistic scenarios, we used the UWB impulse radar prototype developed by ourselves to perform a series of experiments at the Aerospace Information Research Institute, Chinese Academy of Sciences.

Experiment Radar System
The UWB impulse radar used consists of a pair of bow-tie antennas, a balanced pulse generator, a receiver, and a clock synchronization module, as shown in Figure 7. The balanced pulse generator is based on the avalanche transistor and Marx Bank. Marx Bank is a commonly used method of generating high-voltage impulses, whose main idea is to charge a number of capacitors in parallel and then discharge them in series. Additionally, a Xilinx Artix-7 Field Programmable Gate Array (FPGA) was used to control the entire system and provide trigger signals to the transmitter, ensuring the synchronization of the whole system by locking these signals to the common clock. At the receiver, the signal from the antenna is fed into the low noise amplifier (LNA), and the output is recorded by a 16-bit analog to digital converter (ADC) with a maximal sampling rate of 160 Mbps and a full-power bandwidth of 1.4 GHz. To reduce the system cost and improve the spurious-free dynamic range (SFDR), the equivalent-time sampling technique was adopted. One Micrel's SY89297U programmable delay line, which can achieve a maximal delay of 5 ns with 5 ps fine increments, was used to provide a fine delay to the ADC sampling clock. A pair of folded bow-tie antennas were used for electromagnetic radiation and reception, whose basic geometry is combined by a novel acorn-shaped bent bow-tie patch and a hollowed stepped back cavity [57]. The key parameters of the UWB impulse radar used are shown in Table 2. was adopted. One Micrel's SY89297U programmable delay line, which can achieve a maximal delay of 5 ns with 5 ps fine increments, was used to provide a fine delay to the ADC sampling clock. A pair of folded bow-tie antennas were used for electromagnetic radiation and reception, whose basic geometry is combined by a novel acorn-shaped bent bow-tie patch and a hollowed stepped back cavity [57]. The key parameters of the UWB impulse radar used are shown in Table 2.  The output balanced pulses of the transmitter are showed in Figure 8a. The peak amplitudes of these pulses are −162 and +141 V at 50 Ω load, respectively. The power spectrum of the pulse calculated using the Fourier transform is shown in Figure 8b, normalized to the peak value. The −10 dB pulse bandwidth is 433 MHz. The timing jitter  The output balanced pulses of the transmitter are showed in Figure 8a. The peak amplitudes of these pulses are −162 and +141 V at 50 Ω load, respectively. The power spectrum of the pulse calculated using the Fourier transform is shown in Figure 8b, normalized to the peak value. The −10 dB pulse bandwidth is 433 MHz. The timing jitter was measured using the bench shown in Figure 9a. It includes the mixed signal oscilloscope (MSO) Keysight MSOS804A (with 8 GHz bandwidth and 20 GSa/s sampling rate), the arbitrary function generator (AFG) AFG3252C from Tektronix, and a power divider. The AFG generates a periodic square signal of 32 KHz to be used as the trigger signal. The trigger signal is fed into the power divider and is split into two signals. One is used to trigger the pulse generator and the other is used to trigger the oscilloscope. The pulse generator outputs are connected to the oscilloscope after 40 dB attenuation. The jitter measurement plots for the pulses are shown in Figure 9b. Due to the large amplitude of these pulses, they inevitably have a high level of timing jitter at the same time. During the 30 min observation period, the maximum time base offset was 170 ps.
to trigger the pulse generator and the other is used to trigger the oscilloscope. The pulse generator outputs are connected to the oscilloscope after 40 dB attenuation. The jitter measurement plots for the pulses are shown in Figure 9b. Due to the large amplitude of these pulses, they inevitably have a high level of timing jitter at the same time. During the 30 min observation period, the maximum time base offset was 170 ps.

Detection Performance in Through-Wall Situation
The experiment scenario for through-wall detection is shown in Figure 10. One stationary human target is breathing steadily at 10 m from a brick wall. The thickness of the wall is 40 cm. The UWB impulse radar is placed on the other side of the wall for respiration signal detection. to trigger the pulse generator and the other is used to trigger the oscilloscope. The pulse generator outputs are connected to the oscilloscope after 40 dB attenuation. The jitter measurement plots for the pulses are shown in Figure 9b. Due to the large amplitude of these pulses, they inevitably have a high level of timing jitter at the same time. During the 30 min observation period, the maximum time base offset was 170 ps.

Detection Performance in Through-Wall Situation
The experiment scenario for through-wall detection is shown in Figure 10. One stationary human target is breathing steadily at 10 m from a brick wall. The thickness of the wall is 40 cm. The UWB impulse radar is placed on the other side of the wall for respiration signal detection.

Detection Performance in Through-Wall Situation
The experiment scenario for through-wall detection is shown in Figure 10. One stationary human target is breathing steadily at 10 m from a brick wall. The thickness of the wall is 40 cm. The UWB impulse radar is placed on the other side of the wall for respiration signal detection. Figure 11a shows the two-dimensional raw echo matrix. The vital sign signals are masked by a large amount of strong clutter and cannot be extracted directly. The rangefrequency matrix Y of the received radar signal after the conventional FFT in slow-time is shown in Figure 11b. The respiration signal at 10 m becomes obvious, but there is also very strong non-stationary clutter induced by timing jitter at other distances. The respiration signal of the tested human is difficult to distinguish by automatic detection algorithms. The results processed by MS, SVD, and the method in [36] are shown in Figure 11c-e, respectively. After being processed by MS, the non-stationary clutter from the wall scattering is partially suppressed, but it is still difficult to extract the respiration signal. After being processed by SVD, the non-stationary clutter is also partially attenuated, but the remaining clutter around the respiration signal is still apparent. Different from the good performance in the simulated data, the method in [36] has very limited suppression of clutter in real data. It shows that the time offset of each impulse response is not fixed in a realistic detection scenario. There are two reasons for this result. One is that we have adopted the equivalent-time sampling technique in our developed radar system, which obtains an impulse response through multiple impulse responses, and the other is the presence of noise in the environment. The result obtained by our proposed algorithm is shown in Figure 11f. When λ is set to 0.3, the non-stationary clutter is effectively eliminated, and the human target can be easily extracted. Figure 11f shows that the range of the target is 10 m, which is consistent with the actual situation.  Figure 11a shows the two-dimensional raw echo matrix. The vital sign signals are masked by a large amount of strong clutter and cannot be extracted directly. The range-frequency matrix Y of the received radar signal after the conventional FFT in slow-time is shown in Figure 11b. The respiration signal at 10 m becomes obvious, but there is also very strong non-stationary clutter induced by timing jitter at other distances. The respiration signal of the tested human is difficult to distinguish by automatic detection algorithms. The results processed by MS, SVD, and the method in [36] are shown in Figure 11c-e, respectively. After being processed by MS, the non-stationary clutter from the wall scattering is partially suppressed, but it is still difficult to extract the respiration signal. After being processed by SVD, the non-stationary clutter is also partially attenuated, but the remaining clutter around the respiration signal is still apparent. Different from the good performance in the simulated data, the method in [36] has very limited suppression of clutter in real data. It shows that the time offset of each impulse response is not fixed in a realistic detection scenario. There are two reasons for this result. One is that we have adopted the equivalent-time sampling technique in our developed radar system, which obtains an impulse response through multiple impulse responses, and the other is the presence of noise in the environment. The result obtained by our proposed algorithm is shown in Figure 11f. When λ is set to 0.3, the non-stationary clutter is effectively eliminated, and the human target can be easily extracted. Figure 11f shows that the range of the target is 10 m, which is consistent with the actual situation.   Figure 11a shows the two-dimensional raw echo matrix. The vital sign signals are masked by a large amount of strong clutter and cannot be extracted directly. The range-frequency matrix Y of the received radar signal after the conventional FFT in slow-time is shown in Figure 11b. The respiration signal at 10 m becomes obvious, but there is also very strong non-stationary clutter induced by timing jitter at other distances. The respiration signal of the tested human is difficult to distinguish by automatic detection algorithms. The results processed by MS, SVD, and the method in [36] are shown in Figure 11c-e, respectively. After being processed by MS, the non-stationary clutter from the wall scattering is partially suppressed, but it is still difficult to extract the respiration signal. After being processed by SVD, the non-stationary clutter is also partially attenuated, but the remaining clutter around the respiration signal is still apparent. Different from the good performance in the simulated data, the method in [36] has very limited suppression of clutter in real data. It shows that the time offset of each impulse response is not fixed in a realistic detection scenario. There are two reasons for this result. One is that we have adopted the equivalent-time sampling technique in our developed radar system, which obtains an impulse response through multiple impulse responses, and the other is the presence of noise in the environment. The result obtained by our proposed algorithm is shown in Figure 11f. When λ is set to 0.3, the non-stationary clutter is effectively eliminated, and the human target can be easily extracted. Figure 11f shows that the range of the target is 10 m, which is consistent with the actual situation.  [36], and (f) the result processed by our method, where λ = 0.3. Table 3 lists the SCNR and IF values of the above methods. Compared with the proposed method, the performance of MS, SVD, and the method in [36] is not effective, because the IF score of the proposed non-stationary clutter suppression method is much higher than other methods. After being processed by the proposed method, the SCNR  [36], and (f) the result processed by our method, where λ = 0.3. Table 3 lists the SCNR and IF values of the above methods. Compared with the proposed method, the performance of MS, SVD, and the method in [36] is not effective, because the IF score of the proposed non-stationary clutter suppression method is much higher than other methods. After being processed by the proposed method, the SCNR was improved by more than 20 dB.

Detection Performance in Through-Floors Situation
To explore the robustness of the proposed method in different situations, the human respiration detection experiment in the through-floors situation was carried out on the stairs. The tested human was lying on the first floor and breathing steadily, and the UWB impulse radar was placed on the second or third floor, respectively. The single floor has a thickness of about 45 cm and a complex structure with internal wires. Detection scenarios are shown in Figures 12a and 13a  For the experiment through two floors, the energy of the human signal is very weak due to the attenuation of the two floors. In the direct FFT result, there was more non-stationary clutter interfering with the respiration signal detection, as shown in Figure 13b. Figure 13f shows that our proposed method outperformed MS, SVD, and the method in [36]. The non-stationary clutter was suppressed and the human respiration signal was retained in the range-frequency matrix Y. There are two peaks in Figure 13f, which can be explained by the multiple-path respiration responses caused by the complex structure of the stairs. It should be noted that we have not changed the value of λ , which is fixed at 0.3. ure 13b. Figure 13f shows that our proposed method outperformed MS, SVD, and the method in [36]. The non-stationary clutter was suppressed and the human respiration signal was retained in the range-frequency matrix Y. There are two peaks in Figure 13f, which can be explained by the multiple-path respiration responses caused by the complex structure of the stairs. It should be noted that we have not changed the value of λ , which is fixed at 0.3. The corresponding SCNR and IF values for the two scenarios are provided in Table  4. The performance of SVD with the largest principal component removed was comparable to that of MS. The method in [36] had the worst performance. However, our proposed method also had the highest IF score.

Conclusions
When using the UWB impulse radar for through-wall human respiration detection, the time interval between successive pulses emitted cannot be guaranteed to be ideally fixed. This results in non-stationary clutter scattered by the stationary target, which affects the extraction of the human respiration signal. We proposed a method for reducing the impact of timing jitter. First, we used cross-correlation to correct the time offset of each impulse response, achieving initial compensation for timing jitter in the time domain. Then, we enhanced the weak respiration signal to improve SCNR. Finally, the difference of sparsity in the frequency dimension between the human respiration signal and the jitter-induced clutter was used to accurately extract the human respiration signal in the frequency domain. The results of numerical simulation and a series of experiments, including through-wall detection and through-floor detection, showed that the proposed algorithm can effectively suppress the non-stationary clutter induced by timing jitter. This will improve the performance of the UWB impulse radar in human detection under NLOS conditions.  For the experiment through one floor, as shown in the direct FFT result in Figure 12b, strong clutter with energy comparable to that of the respiration signal was detected at 0-2 and 4-6 m respectively, and covered the whole frequency band of 0.1-0.6 Hz. It can be regarded as the non-stationary clutter introduced by timing jitter of floors' scattering. Due to the complex experimental scenario, more clutter is also detected at long distances. It is difficult to visually distinguish the respiration signal from the non-stationary clutter. Figure 12c-f shows the range-frequency matrices obtained with different algorithms. Visually, MS, SVD, and the method in [36] cannot effectively remove non-stationary clutter, and the human respiration signal is masked by a large amount of clutter. After being processed by our proposed algorithm, the range-frequency matrix Y was 'cleaner', and most of the clutter was suppressed, except for a little clutter at close range. The extraction of the respiration signal with a range of 4 m and a Doppler frequency of 0.16 Hz becomes easier.
For the experiment through two floors, the energy of the human signal is very weak due to the attenuation of the two floors. In the direct FFT result, there was more nonstationary clutter interfering with the respiration signal detection, as shown in Figure 13b. Figure 13f shows that our proposed method outperformed MS, SVD, and the method in [36]. The non-stationary clutter was suppressed and the human respiration signal was retained in the range-frequency matrix Y. There are two peaks in Figure 13f, which can be explained by the multiple-path respiration responses caused by the complex structure of the stairs. It should be noted that we have not changed the value of λ, which is fixed at 0.3.
The corresponding SCNR and IF values for the two scenarios are provided in Table 4. The performance of SVD with the largest principal component removed was comparable to that of MS. The method in [36] had the worst performance. However, our proposed method also had the highest IF score.

Conclusions
When using the UWB impulse radar for through-wall human respiration detection, the time interval between successive pulses emitted cannot be guaranteed to be ideally fixed. This results in non-stationary clutter scattered by the stationary target, which affects the extraction of the human respiration signal. We proposed a method for reducing the impact of timing jitter. First, we used cross-correlation to correct the time offset of each impulse response, achieving initial compensation for timing jitter in the time domain. Then, we enhanced the weak respiration signal to improve SCNR. Finally, the difference of sparsity in the frequency dimension between the human respiration signal and the jitter-induced clutter was used to accurately extract the human respiration signal in the frequency domain. The results of numerical simulation and a series of experiments, including through-wall detection and through-floor detection, showed that the proposed algorithm can effectively suppress the non-stationary clutter induced by timing jitter. This will improve the performance of the UWB impulse radar in human detection under NLOS conditions.