Joint Vital Signs and Position Estimation of Multiple Persons Using SIMO Radar

: Respiration rate monitoring using ultra-wideband (UWB) radar is preferred because it provides contactless measurement without restricting the person’s privacy. This study considers a novel non-contact-based solution using a single-input multiple-output (SIMO) UWB impulse radar. In the proposed system, the collected radar data are converted to several narrow-band signals using the generalized Goertzel algorithm (GGA), which are used as the input of the designed phased arrays for position estimation. In this context, we introduce the incoherent signal subspace methods (ISSM) for the direction of arrivals (DOAs) and distance evaluation. Meanwhile, a beam focusing approach is used to determine each individual and estimate their breathing rate automatically based on a linearly constrained minimum variance (LCMV) beamformer. The experimental results prove that the proposed algorithm can achieve high estimation accuracy in a variety of test environments, with an error of 2%, 5%, and 2% for DOA, distance, and respiration rate, respectively.


Introduction
Non-contact vital sign (VS) detection, such as respiration rate, built upon a radio frequency (RF)-based system has attracted a great deal of interest in recent years. It is used in a growing field of applications, such as healthcare and clinical assistance, sleep and baby monitoring, survivor rescue and research operations, through the wall human detection, and human activity classification [1][2][3][4][5][6][7][8][9]. Recently, the impulse response ultra wide-band (IR-UWB) radar has gained more attention thanks to its high-range resolution, good penetration capability, and low power consumption [10,11].
The use of RF signals for VS waveform extraction lies in the detection of the chest motion associated with the VS activities of a human subject. Specifically, it is based on the emission and reception of electromagnetic (EM) waves. Notably, the phase variation on the received signal is exploited for this purpose [12]. To do so, several radar types have been investigated, including unmodulated continuous wave (CW) [13], frequencymodulated continuous-wave (FMCW) [14], step frequency continuous-wave (SFCW) [15], and IR-UWB [4]. Meanwhile, many signal processing techniques have been proposed, covering clutter rejection, human localization, and VS estimation [16][17][18][19][20][21]. Despite the great advances in the field, most research has focused on a single target for VS measurement. To better deal with complex and critical scenarios, such as people buried under debris, multiple-person VS motioning plays an important role in addressing these scenarios [5,8].
To date, two approaches have been mainly used to achieve the VS measurement of multiple persons. The first solution requires the range information associated with the human targets to separate their VS signals. In this context, a single-input single-output (SISO) radar is used for this purpose, namely FMCW, SFCW, or IR-UWB platforms [22]. Nevertheless, it is still difficult to obtain reliable separation when the persons are located at the same distance from the radar. The second one separates the VS signals based on the incident angle information. To this end, spatially distributed antennas have been exploited in designing the radar system, such as single-input multiple-output (SIMO) [8] and multiple-input multiple-output (MIMO) radars [6]. In particular, digital beamforming (DBF) plays an important role in beam scanning and beam focusing tasks thanks to its low-complexity structure, good control flexibility, and high accuracy [8].
In this work, a SIMO IR-UWB radar prototype is presented. The proposed system is based on a single IR-UWB radar in which eight spatially distributed antennas are connected to its receiving channel through a UWB RF switch. To deal with the array signal processing-based model, we propose a proper pre-processing step to obtain a multivariant narrow-band signal via the generalized Goertzel algorithm (GGA) [23]. Further, we consider an incoherent signal subspace method (ISSM) for direction of arrival (DOA) estimation [24]. In particular, we use the incoherent multiple signal classification (IMUSIC) to estimate the incident angle of each person [25,26]. Then, we perform beam focusing at each estimated angle for VS separation using a linearly constrained minimum variance (LCMV) beamformer [27]. Finally, the respiration rate is estimated for each person by analyzing the spectrum of the separated signals.
In summary, the main contributions of this work are as follows: 1.
The proposed SIMO IR-UWB radar system allows us to simultaneously measure the VSs of spaced persons at the same distance from the radar. It offers a low-cost and good solution for the non-contact Vs measurement of multiple persons in sleep and baby monitoring scenarios; 2.
The chest motions of multiple targets are accurately separated with the combination of the IMUSIC and LMCV algorithms. The respiration rate estimation is significantly enhanced by forming individual narrow beam focusing on the chest regions associated with each person; 3.
The experimental results are presented to investigate the performance of the proposed system, showing that our proposed method outperforms the state of the art.
The remainder of this paper is organized as follows. Section 1 presents the mathematical model of the IR-UWB SIMO radar and the problem setup. In Section 2, we tackle the estimation procedure of position and respiration rate for each person. Experimental results and numerical analyses are described in Section 3. Section 4 concludes this work.

Mathematical Model
UWB impulse radar has been widely used for VS detection thanks to its high-range resolution. In this context, it transmits very short pulses, usually in the order of picoseconds, to the human subject. As the electromagnetic wave returns from the chest's body, the VS motions, which introduce a small displacement (typically in the order of millimeters) on the chest, will produce a phase variation in the received signal [12].

Received Signal for SISO Radar
Consider a finite number of persons L with incident angles θ θ θ = [θ 1 0 , θ 2 0 , ..., θ L 0 ] over a limited region, where the received signal, from L persons, can be simplified as: where t and τ denote the slow and fast time indexes, respectively. p(τ) represents the transmitted pulse. α l and τ l (t) are the attenuations and the time of arrival (TOA) associated with the l th person, respectively. These letters depend on the radar cross-section (RCS) of the human subject and its distance from the radar d l 0 . Assuming that the human chest remains stationary during the coherent processing interval (CPI), usually in the order of milliseconds, the TOA in Equation (1) is expressed as: where d l (t) depicts the chest displacement of the l th person, which can be expressed as [4]: where d l r and f l r represent the maximum displacement of the chest and the respiration frequency associated with the l th person, respectively.

Received Signal for SIMO Radar
In our scheme, we consider a uniform linear array (ULA) composed of M antenna receivers. Depending on the incident angle of each person, the received signal at the m th antenna can be expressed as: where w m (t, τ) represents the measurement noise. Using the first antenna as a reference, τ l m (t) in Equation (4) can be derived as: where d depicts the inter-distance of the antenna array. Figure 1 illustrates the receiving signal model of the SIMO radar.

Position and VS Estimation
In this section, we first establish an appropriate SIMO radar based on a single UWB impulse transceiver P440 for VS and position estimation. Then, we develop a method to transom the collected UWB data into multiple narrow-band signals. Next, we explain how to estimate the respiration rate of each person based on their estimated incident angles. Figure 2 illustrates the basic modules of the SIMO radar system adopted in this work. Specifically, the impulse UWB transceiver P440 is used to transmit and receive data. In particular, it transmits a Gaussian impulse RF signal with a carrier frequency of 4.3 GHz and a bandwidth of 2.2 GHz. It acts as a short-range coherent radar using the monostatic radar module (MRM). On the other hand, a broadband RF switch HMC321A (covering a band from DC to 8 GHz) is used to select one specific antenna receiver at each scan to build an array receiver. Specially, we use a Raspberry Pi 4 to control and automate this process. Finally, the collated data are transferred to the PC via a WiFi connection for signal processing.

Algorithm Scheme
The joint estimation approach exploits the spatial and spectrum, in the fast time domain, information of the received signals for estimating the human positions and separating their VS waveforms. Moreover, the temporal information, in the slow time domain, is used for respiration rate estimation. Figure 3 illustrates the process of the proposed method.

Pre-Processing
In this section, we introduce a pre-processing step to transform the IR-UWB signals to multiple narrow-band signals. Specifically, we use the GGA to evaluate the Fourier transform (FT) at specific frequency indexes. Then, a complex band pass filter is applied to remove the clutter response and negative frequencies and enhance the VS waveform [4].
Applying the FT to Equation (4), in the fast time domain, reads: where P(ω), C m (ω), and W m (t, ω) represent the FT of the transmitted pulse, the clutter response, and the measurement noise, receptively. Inserting Equation (5) into Equation (6) results in: in Equation (7) can be expressed, using the Bessel functions, as: where J n depicts the first kind of Bessel function of order n. By further taking into consideration that d l r is small, in the order of millimeters, and regarding the range of ω, in the order of GHz, the quantity β l r (ω) is close to zero. Hence, Equation (8) can be simplified, retaining only the Bessel functions from up to the second order, as:  Inserting Equation (9) into Equation (7) reads: in which: At this point, we apply the CBPF [28], presented in our previous work [4], to remove C m (ω) and enhance the VS waveform. The result of such a filter can be expressed as: where S l (t, ω) = α l J 1 (β l r (ω))P(ω)e jβ l 0 (ω) e j2π f l r t , which represents the source signal associated with the l th person, and N m (t, ω) depicts the filtered noise of W m (t, ω). Notice that, for each ω, we obtain an array signal processing-based model, which can be written in matrix form as: in which: In view of this, the model presented in Equation (13) is established for several frequency indexes Ω Ω Ω = [ω 1 , ω 2 , ..., ω k , ..., ω K ], which are chosen from the IR-UWB radar frequency band. To do so, we use the GGA instead of FFT because we require a few spectral components. In such cases, the algorithm is significantly faster.

DOA Estimation
Initially, we perform the IMUSIC method on the received signal after the pre-processing step to estimate the DOA of each person with a high angular resolution. It is common to assume that the number of sources is known, or previously estimated by using MDL or AIC methods [29,30]. Firstly, the ideal covariance matrix is estimated using the sample covariance matrix for each frequency bin: where (.) H represents the conjugate transpose. Then, the noise covariance matrix can be estimated, based on the eigenvalue decomposition ofR(ω k ), as: where Q n (ω) is the estimated noise subspace obtained using the SVD onR(ω k ) for each frequency bin. Secondly, the estimatedθ l 0 of the l th person can be obtained by evaluating the peaks of the following spatial spectrum function [25]:

Distance Estimation
In order to estimate the distance of each person accurately, we use the well-known LCMV beamformer, which is widely used in acoustic array processing [31]. First, a weighted vector is designed according to certain criteria. In particular, the LCMV beamformer intends to make the desired directionθ l 0 gain associated with the l th person constant while minimizing the total output power with certain constraint conditions: in which: Based on the work in [8], the weight vector is expressed as: Thus, the final output signal associated with the l th person can be expressed as: Using Equation (12), Equation (20) can be rewritten as: where s l (t) = A l e j2π f l r t , A l = α l J 1 (β l r (ω k ))P(ω k ), and e l (t, ω k ) represents the estimation error. The matrix form of Equation (21) can be expressed as: in which:Ŝ Notice that Equation (23) represents an array signal processing based-model. Thus, the subspace methods can be applied to estimate d l 0 . In our work, we use the MUSIC method in which the number of sources is fixed to one [32]. This procedure is repeated for each l. To this end, the VS signal of the l th person can be extracted by applying a simple beam focusing technique as follows:

Respiration Rate Estimation
In this section, we focus on the estimation of respiration rate from the extracted VS waveformŝ l (t). Based on Equation (23),ŝ l (t) can be expressed as: s l (t) ≈ A l e j2π f l r t (24) Applying the FT to Equation (24) leads to: where δ represents the Dirac function. Thus, f l r can be easily estimated by finding the peak ofŜ l ( f ), which is calculated using FFT.

Experimental Results
In this section, we describe the experiments that were carried out to assess the performance of the proposed method. Three human subjects participated in these experiments, in which an accelerometer was attached to their chest as a ground truth device for breathing rate estimation. Several scenarios were considered to ensure the accuracy of the obtained results. A record of 30 min was achieved for different scenarios; each record was divided into several realizations of 180 s (22.5 s for each channel). The errors were calculated as follows: where N r represents the number of realizations that equal 10 in our experiment. The exact values of θ l 0 and d l 0 were set manually during the experiment. Figure 4 depicts an example of such scenarios.  Once the radar data were successfully recorded, they were transferred to the PC via a WiFi connection for signal processing. Figure 5 depicts the collected data of the first realization associated with each channel after the pre-processing step.

Position and Respiration Rate Estimation of One Person
The first experiment concerned a position and breathing rate estimation scenario for one person. Several assays were performed. The person was exposed directly in front of the radar at different positions with a normal breathing rate. In this context, the incidence angles were set to −45 • , 0 • , and 45 • , the nominal distance to 1 m from the radar, and the exact value of the breathing rate was obtained by the accelerometer attached to the person's chest. Based on the IMUSIC procedure, the angles were estimated with an error of 2%. As an example, Figure 6 shows the estimated angles for the first realization.  After applying the LMCV beamformer, the distance was calculated using the MUSIC algorithm with an error of 5%. Figure 7 depicts an example result of such a process. Using the beam focusing technique, the respiration rate was estimated with an error of 1.73%. Figure 8 shows the result of such a scheme for the first realization.

Position and Respiration Rate Estimation of Multiple Persons
In this section, we evaluate the separation procedure. Two sets of experiments were conducted. Firstly, two human subjects were asked to stand at different positions. Specifically, the angles were set to θ 1 0 = 0 • ; θ 2 0 = 45 • , and the nominal distance to d 1 0 = d 2 0 = 1 m for both of them. Secondly, we increased the number of persons to three, in which the angles were set to θ 1 0 = −30 • ; θ 2 0 = 0 • ; θ 3 0 = 45 • , and the nominal distances to d 1 0 = 1 m; d 2 0 = 2.5 m; d 3 0 = 1.5 m. Figure 9 depicts the estimated angles using the IMUSIC algorithm for both experiments.  Applying the LCMV allowed us to separate the VS signals. First, we estimated the distance of each person using the MUSIC algorithm [32]. The results are shown in Figure 10. Then, the respiration rate could be estimated with high accuracy and approximately 2.63% error, as depicted in Figures 11 and 12. As already mentioned, we used an accelerometer device to measure the respiration rate, which served as a reference signal for the comparison with our method. This allowed us to quantify the performance of the proposed system for the non-contact VS monitoring of multiple persons. Up to now, we have used the error presented in Equation (26) as a criterion, which is good for DOA and distance evaluation, since we manually set up the true values for both of them. However, this is not the case for the respiration rate, in which the true value is extracted from the accelerometer device.
In order to assess the comparability between our method and the accelerometer devicebased method, we used Altman and Bland (B&A) plot analysis, which is an effective method to describe the agreement between measurements. The B&A graph plots the difference in two paired measurements against the mean of the two measurements [33]. Figure 13 depicts the result of such a plot, with a confidence interval limit of −20% (from −10% to 10%), which is an acceptable error for clinical applications. The differences between measurements of the same substance are not significant. This can be represented by the mean difference, which is 0.43% for one person and 0.41% for multiple persons.

Discussions
To further verify the discrimination performance of the proposed scheme, a human subject was asked to stand at several distances from the radar. Two scenarios were considered in our laboratory. In the first scenario, the person was exposed directly in front of the SIMO radar. At each assay, the distance was changed with a step size of 0.5 m. In the second scenario, the person stood behind a wall, and the same assays were conducted with a step size of 0.25 m. The wall was made up of reinforced concrete with a 20 cm width. As can be seen from Figure 14, our system is capable of monitoring human subjects up to 3 m without obstacles and 0.5 m with an obstacle. Note that the maximum distance can be changed depending on the power of the transmitted signal, the material nature, and the size of the obstacle. Furthermore, our system was compared with the state-of-the-art research based on spatially distributed array approaches in terms of angle resolution, maximum distance, and respiration rate error. As shown in Table 1, the proposed system has the minimum respiration rate error. It also has the minimum angle resolution; this is due to its eight receiving antennas.

Conclusions
In this work, we aimed to devise a non-contact-based solution for multiple-person monitoring that can ensure adequate measurement and preserves the person's privacy. To this end, by combining an IR-UWB radar sensor and subspace methods, this study proposes a SIMO scheme that can identify each individual with its corresponding respiration rate. To interpret the collected UWB radar data using the subspace methods, we propose a pre-processing step, based on GGA, that converts the UWB data to multiple narrow-band signals that contain the spectrum and spatiotemporal features of each person. An ISSM is established for the proposed phased array system for DOA and distance estimation. Meanwhile, a separation procedure is introduced, based on LCMV, for individual identification and respiration rate estimation. The experimental results prove that the system can automatically find the direction and the distance of multiple human subjects and effectively detect their respiratory rates.