Open Access
This article is

- freely available
- re-usable

*Remote Sens.*
**2019**,
*11*(21),
2571;
https://doi.org/10.3390/rs11212571

Article

An Accurate Method to Distinguish Between Stationary Human and Dog Targets Under Through-Wall Condition Using UWB Radar

Department of Medical Electronics, School of Biomedical Engineering, Fourth Military Medical University, Xi’an 710032, China

^{*}

Authors to whom correspondence should be addressed.

Received: 27 September 2019 / Accepted: 30 October 2019 / Published: 1 November 2019

## Abstract

**:**

Research work on distinguishing humans from animals can help provide priority orders and optimize the distribution of resources in earthquake- or mining-related rescue missions. However, the existing solutions are few and their stability and accuracy of classification are less. This study proposes an accurate method for distinguishing stationary human targets from dog targets under through-wall condition based on ultra-wideband (UWB) radar. Eight humans and five beagles were used to collect 130 samples of through-wall signals using the UWB radar. Twelve corresponding features belonging to four categories were combined using the support vector machine (SVM) method. A recursive feature elimination (RFE) method determined an optimal feature subset from the twelve features to overcome overfitting and poor generalization. The results after ten-fold cross-validation showed that the area under the receiver operator characteristic (ROC) curve can reach 0.9993, which indicates that the two subjects can be distinguished under through-wall condition. The study also compared the ability of the proposed features of four categories when used independently in a classifier. Comparison results indicated that wavelet entropy-corresponding features among them have the best performance. The method and results are envisioned to be applied in various practical situations, such as post-disaster searching, hostage rescues, and intelligent homecare.

Keywords:

UWB radar; distinguishing human targets from dog targets; SVM; wavelet entropy-corresponding features## 1. Introduction

The rapid developments in ultra-wideband (UWB) radar technology [1,2,3,4,5,6,7] have attracted increasing interest in civilian and military applications, such as earthquake and hostage rescue operations and gesture recognitions. The technologies of radar imaging, localization, and action recognition have been significantly improved over the past decade, and several research groups are focusing on the distinction between humans and animals based on the advancements in radar technologies. The work in [8] developed a feature-based classification method for walking movements in animals versus humans for border security applications. The research in [9] proposed a classification algorithm using micro-Doppler signals obtained from humans and dogs moving in four different directions, and [10] combined the Gaussian mixture model and the hidden Markov model to distinguish between slow moving animal and human targets to detect potential livestock thieves and poachers within reserves and farmland.

However, these research works between humans and animals have concentrated on the moving states of humans and animals, and there have been no reports focusing on the distinction in stationary state and through-wall condition. There is an urgent need to distinguish between stationary humans and animals under through-wall conditions for post-disaster rescuing applications when trapped targets are buried and unable to move. This work will help optimize the distribution of rescue resources and boost the confidence of rescuers in earthquake search, thereby ensuring the safety of hostages by eliminating the impact of pet signals in anti-terrorist rescue operation in the presence of barriers such as walls. Therefore, distinction between stationary humans and other species under through-wall condition is a significant and to be further resolved research work.

UWB radar can detect the vital signs of human and animal targets based on their quasi-periodic respiration or non-periodic body movements. Considering that humans have more powerful psychological and mental control, the signs caused by the micro motions in humans are more regular than those in animals. The work in Otero, M. [11] found that the spectrum of radar echo signals is different between human targets and other detected slow-moving targets; however, it is mainly based on their different walking speed, implying that this method is not applicable in earthquake search and hostage rescue operations because the targets are generally stationary.

Dogs are most likely to appear where it is necessary to distinguish between humans and animals considering the actual situations. In Wang, Y. et al. [12], Wang extracted wavelet entropy and its standard deviation using signals of target position based on 400 MHz UWB radar. The results show that the two features have evident differences for human and dog targets. In [13], the classification accuracy of human target recognition can achieve 86% and the accuracy of dog target recognition can achieve 84% using wavelet entropy alone in through-wall conditions. When only standard deviation of wavelet entropy is utilized, the classification accuracy can achieve 86% for both human and dog targets. In her experiments, a total of ten humans and five dogs were used with five and ten times signal acquisitions per human and dog, respectively.

Meanwhile, Yu, X. et al. [14] and Yin, Y. et al. [15] proposed some other features, whose classification ability performs well when used independently in distinguishing humans from dogs, such as optimal correlation coefficient of micro vibration (OCCMV), standard deviation change rate of micro vibration (StdCRMV), energy ratio of the reference frequency band (ERRFB), etc. However, only one or two of the features above, at most, are combined together to achieve the classification objectives of human targets and dog targets each time in previous studies. Consequently, the robustness and classification accuracy are unguaranteed because of the individual differences, no matter whether for human targets or dog targets. For example, for some individual dogs, the values and ranges of OCCMV or StdCRMV can resemble those of human beings, so much so that it is almost impossible to distinguish between them when only one or two features are combined.

In this study, twelve feature species belonging to four categories are first extracted, including two energy-corresponding features, two correlation coefficient-corresponding features, four wavelet entropy-corresponding features, and four frequency-corresponding features. Then, a support vector machine (SVM)-based feature selection and classification strategy are utilized to choose an optimal feature subset. Finally, after 100 rounds of ten-fold cross-validation, a classifier with reliable generalization and stability is successfully modeled. Meanwhile, the value of the area under the curve (AUC) of the classifier also reaches the highest level, which is adopted to evaluate the overall performance of the classifier. Higher values of AUC indicate better classifier performance. Thus, the combination of these features improves the classification accuracy and enhances the stability of the proposed system in distinguishing between human and dog targets.

The paper is organized as follows. Section 2 introduces the working principles of the UWB radar and signal preprocessing. Section 3 describes the extraction of the abovementioned twelve features belonging to four categories. In Section 4, the classification procedure and feature selection strategy based on SVM are introduced. Experimental results are presented in Section 5. Section 6 discusses the significance and future research considerations. The work is concluded in Section 7.

## 2. UWB Radar System and Signal Preprocessing

#### 2.1. UWB Radar System

The block diagram of the utilized UWB radar system is illustrated in Figure 1. Firstly, the pulse generator generated trigger pulses with a center frequency and pulse repetition frequency of 500 MHz and 128 KHz, respectively. Then, the pulses were sent to the transmitter to excite the transmitter antenna (TA), where they were shaped into bipolar pulse. Next, the bow-tie dipole antenna transmitted the vertically polarized pulses with peak power of approximately 5 W. Meanwhile, the trigger pulses were sent to the delay unit to produce a series of software-controlled range gates of 300 ps in width to turn on the receiver, which was identical with the transmitting one. Finally, through the receiving antenna (RA), return pulse only within the range gates were sampled, integrated, amplified, and then transferred to the computer for further analysis [16]. The detailed parameters of the UWB radar system are listed in Table 1.

After signal acquisition, the raw echo data $D(m,n)$ are stored in the form of waveforms, where m = 1, 2,$\cdots $, M denotes the sampling point in propagation time, and n =1, 2,$\cdots $, N denotes the sampling point in observation time. The two-dimensional (2-D) pseudo-color image of raw echo data is illustrated in Figure 2, when the human target is 2.5 m behind the wall. The time-axis associated with range along each received waveform is termed as the “fast-time” and denoted by τ that is in the order of nanoseconds. The time-axis corresponding to the acquisition time of waveforms along the measurement duration is termed as “slow-time” and denoted by t that is in the order of seconds. Each waveform contains M = 2048 sample points and the recorded profile is ${\tau}_{max}=20$ ns long, corresponding to a detection range of 3 m. The amplitudes along slow-time at each sample point, which stands for a specific range, are defined as point signal. The scanning speed is 64 waveforms per second which is higher than the Nyquist sampling rate for the respiration; thus, the number of recorded waveforms in slow-time can be written as $N=64t$.

#### 2.2. Signal Preprocessing

Signal preprocessing steps, including distance accumulation, normalization along the fast-time dimension, direct current (DC) removal, 2 Hz low-pass (LP) filtering, and adaptive filtering based on least mean square (LMS) are first performed, as illustrated in Figure 3. It should be noted that all subsequent feature extractions are based on the preprocessed signals using the steps shown in Figure 3.

Distance accumulation will help reduce the computational complexity effectively on the premise of guaranteeing detailed information. It can be determined by:
where $DataA(x,n)$ is the echo matrix data after distance accumulation and Q is the distance window width along the fast-time dimension. The value of x is x = 1, 2,$\cdots $, X, and X denotes the distance point number on the fast-time dimension after accumulation.

$$DataA(x,n)=\frac{1}{Q}{\displaystyle \sum}_{m=Q(x-1)+1}^{Qx}D(m,n)$$

Next, normalization along the fast-time dimension is accomplished to guarantee that the waveforms have the comparable amplitudes, and its processing is as follows:

$$DataN(m,x)=2\times \frac{DataA(m,x)-\underset{1\le m\le M}{\mathrm{min}}\left[DataA(m,x)\right]}{\underset{1\le m\le M}{\mathrm{max}}\left[DataA(m,x)\right]-\underset{1\le m\le M}{\mathrm{min}}\left[DataA(m,x)\right]}-1$$

DC removal will remove the DC component and baseline drift. The step of DC removal is given by:

$$DataDC(x,n)=DataN(x,n)-\frac{1}{100}{\displaystyle \sum}_{n=x}^{x+99}DataN(x,n)$$

The cut-off frequency of the low-pass filter is chosen to be 2 Hz to filter out the high-frequency noise and retain respiratory signals. It can be determined by:
where H(t) is the impulse response function of the finite impulse response (FIR) filter [13].

$$DataLP(x,n)=DataDC(x,n)\times H(t)$$

Adaptive filtering by means of the least mean square (LMS) algorithm is used to suppress the strong clutters. It is illustrated and verified in [17].

Matrix data $DataAF(x,n)$ is obtained after the execution of all preprocessing steps. The 2-D pseudo-color image of the preprocessed signal is shown in Figure 4, when the human target is 2.5 m behind the wall. The subsequent feature calculation will proceed with $DataAF(x,n)$.

## 3. Feature Extraction

According to the accumulation of research results in recent years, 12 capable features belonging to four categories, consisting of two energy-corresponding features, two correlation coefficient-corresponding features, four wavelet entropy-corresponding features, and four frequency-corresponding features were utilized after signal preprocessing. The detailed feature descriptions are described below.

#### 3.1. Energy-Corresponding Features

• Standard deviation change rate of micro vibration ($StdCRMV$)

When calculating $StdCRMV$, $Q=1$. Preliminary researches have shown that the deviations in the amplitude values in target position are the greatest in $DataAF(x,n)$, indicating that the standard deviation ($Std$) of the target signal is the greatest. Target signal is defined as the specific point signal which is right at the target position. The calculation of $Std$ is expressed as:
where $y$ is a scalar denoting amplitudes of the point signals. Then, an optimal window (OW), where the target signal is in the middle position, is chosen along the fast-time dimension with fixed width, and the $Std$ of every point signal in the OW is calculated. Accordingly, the $StdCRMV$ is expressed as:
where $St{d}_{max}$ is the max $Std$ value and $St{d}_{min}$ is the min $Std$ value within the OW. According to the work in [18], it is better to choose the width of the OW as $o{w}_{Std}=15$, so that the difference between human targets and dog targets will be largest. Figure 5 illustrates the value of $Std$ closer and further than the target’s position for 15 points of human and dog targets, respectively. Generally, the changing trend of $Std$ in the OW of human targets is much gentler than that of dog targets. Therefore, the $StdCRMV$ of human targets is much lower than that of dog targets.

$$Std=\sqrt[2]{\frac{{{\displaystyle \sum}}_{i=1}^{n}{({y}_{i}-\overline{y})}^{2}}{n}}$$

$$StdCRMV=\frac{St{d}_{max}-St{d}_{min}}{St{d}_{max}}\times 100\%$$

• Energy ratio of the reference frequency band ($ERRFB$)

In the calculation of $ERRFB$, the preprocessing steps are similar to that of $StdCRMV$. The difference is that Q = 10 and the normalization is along the slow-time dimension, as illustrated in Equation (7). Thus, the 2048 points along the fast-time index associated to the range are compressed into 200 points [18].

$$DataN(x,n)=2\times \frac{DataA(x,n)-{\displaystyle \underset{1\le n\le N}{\mathrm{min}}}\left[DataA(x,n)\right]}{{\displaystyle \underset{1\le n\le N}{\mathrm{max}}}\left[DataA(x,n)\right]-{\displaystyle \underset{1\le n\le N}{\mathrm{min}}}\left[DataA(x,n)\right]}-1$$

Then, ensemble empirical mode decomposition (EEMD) is performed on the target signal. EEMD is a noise-eliminating algorithm, and it can decompose the original target signal into a series of intrinsic mode function (IMF) components with different characteristic scales by adding multiple sets of different white noises [19,20,21]. Its procedures are as follows:

- Add a random white noise signal $w{n}_{j}(t)$ to the original target signal $Signa{l}_{o}$(t):$$Signa{l}_{j}(t)=Signa{l}_{o}(t)+w{n}_{j}(t)$$
- Decompose $Signa{l}_{j}(\mathrm{t})$ into a series of IMF components $IM{F}_{i,j}$ as follows:$$Signa{l}_{j}(t)={\displaystyle \sum}_{i=1}^{{V}_{j}}IM{F}_{i,j}+Residu{e}_{j}$$
- If j < TN, then steps 1 and 2 are repeated, and different random white noise signals are added each time.
- Obtain $V=\mathrm{min}({V}_{1},{V}_{2},\dots ,{V}_{TN})$ and calculate the ensemble average of corresponding IMF components of the decompositions as the EEMD result:$$IM{F}_{i}=\frac{1}{TN}{\displaystyle \sum}_{j=1}^{TN}IM{F}_{i,j}$$

After dividing the original target signal into V-order IMF components by EEMD, noise will be eliminated using the discriminant method as expressed in Equations (11) and (12):
where $\beta $ is the energy concentration ratio and ${R}_{IM{F}_{i}}$ is the discrete autocorrelation sequence of the i-order (i = 2,3, …,V) IMF components of the target signal, which is defined in Equation (10). $\left[{n}_{1},{n}_{2}\right]$ denotes the interval with length of three points wherein the symmetric point of ${R}_{IM{F}_{i}}$ is in the middle and $\eta $ denotes energy concentration ratio in the interval. When the decline rate of energy concentration ratio $\beta (v)$ satisfies the condition $\beta (v)>2$, the $v$-order IMF component is considered as denoised signal. The reconstructed signal ${s}_{re}$ is expressed in Equation (13).

$$\beta (i)=\{\begin{array}{cc}0\text{}& i=1\\ \frac{\frac{1}{i-1}{{\displaystyle \sum}}_{j=1}^{i-1}\eta (j)-\eta (i)}{\eta (i)}\text{}& i=2,3,\cdots ,V\end{array}$$

$$\eta (i)=\frac{{{\displaystyle \sum}}_{{n}_{1}}^{{n}_{2}}{R}_{IM{F}_{i}}{}^{2}(n)}{\sum {R}_{IM{F}_{i}}{}^{2}(n)}i=2,3,\dots ,V$$

$${s}_{re}={\displaystyle \sum}_{i=v}^{V}IM{F}_{i}$$

$ERRFB$ is the energy proportion of the reconstructed signal in human respiratory frequency band (0.2–0.4 Hz), expressed in Equation (14).
where ${E}_{re}$ is the total energy of the reconstructed signal in frequency domain, and ${E}_{re\_hb}$ is the energy of reconstructed signal in the reference frequency band, i.e., the human respiratory frequency band. Generally, $ERRFB$ of humans is approximately 40% and that of dogs is approximately 18% [15,19].

$$ERRFB=\frac{{E}_{re\_hb}}{{E}_{re}}\times 100\%$$

#### 3.2. Correlation Coefficient-Corresponding Features

• Optimal Correlation Coefficient of Micro vibration ($OCCMV$)

Because the details are particularly important in calculation with $OCCMV$, the distance window width here is $\mathrm{Q}=1$, and the width of OW here is $o{w}_{OCCMV}=$ 15. In this way, $OCCMV$ is expressed as Equations (15) and (16):
where TS represents the amplitudes of target signal, and Y the amplitudes of other point signals within the OW. The horizontal bar above the TS and Y denotes the calculation of the average value. $CCMV(y)$ is the correlation coefficient between target signal and another point signal in the OW. $OCCMV$ is the mean value of the $CCMV$ of each point signal in the OW. Generally, the $CCMV$ of human targets is very stable in the OW, close to one. Whereas the change of $CCMV$ of dog targets is greater. So the $OCCMV$ of human targets is larger than that of dog targets. The comparison result of $CCMV$ of human targets and dog targets is expressed as in Figure 6.

$$CCMV(y)=\frac{{{\displaystyle \sum}}_{i=1}^{N}(T{S}_{i}-\overline{TS})({Y}_{i}-\overline{Y})}{\sqrt{{{\displaystyle \sum}}_{i=1}^{N}{(T{S}_{i}-\overline{TS})}^{2}}\sqrt{{{\displaystyle \sum}}_{i=1}^{N}{({Y}_{i}-\overline{Y})}^{2}}}$$

$$OCCMV=\frac{{{\displaystyle \sum}}_{y=1}^{2\times o{w}_{OCCMV}+1}CCMV(y)}{2\times o{w}_{OCCMV}+1}$$

• Change rate of correlation coefficient of micro vibration ($CRCCMV$)

As illustrated in Figure 6, the trend of change of human target is stable, whereas the trend of change of dog target is more intense. Therefore, the $CRCCMV$ of human targets is much smaller than that of dog targets. Equation (17) defines the calculation of $CRCCMV$.

$$CRCCMV=\frac{CCM{V}_{max}-CCM{V}_{min}}{CCM{V}_{max}}\times 100\%$$

#### 3.3. Wavelet Entropy-Corresponding Features

In the calculation of wavelet entropy-corresponding features, the parameter of Q in the preprocessing steps is chosen to be Q = 10 and the normalization is along slow-time dimension. Thus, the raw echo data are compressed into 200 points along the fast-time index, similar to the preprocessing of $ERRFB$.

• Mean of wavelet entropy of target signal ($MWE$)

Wavelet analysis can provide optimal time-frequency resolution of the signal and entropy can quantify the signal’s frequency patterns as a relevant measure of order or disorder in a dynamic system. Wavelet entropy (WE) combines the advantages of both. It can accurately characterize the dynamic change information of time-frequency variation of non-stationary signal complexity in time domain. More signal components indicate a more complex and irregular signal, and a larger value of wavelet entropy. Generally, dog signals are more complex than human signals [12,22]. Therefore, the $MWE$ of human targets is much smaller than that of dog targets.

In the calculation of $MWE$, wavelet transform is first performed, followed by the calculation of relative wavelet energy. In Matsui, T. et al. [23], Morlet first considered wavelets as a family of functions generated by translations and dilations of a unique function called the “mother wavelet” $\psi (t)$. The family wavelet is expressed as:
where $a$ is the scaling parameter which measures the degree of compression, $b$ is the translation parameter which determines the time location of the wavelet, and $t$ represents time. Let ${L}^{2}(\Re )$ be the real square integrable function space, the discrete wavelet transform of a signal $S(t)\in {L}^{2}(\Re )$ is defined as [12,13]:
where ${C}_{j}(k)$ is wavelet coefficient of wavelet sequence. For practical signal processing, the signal is assumed to be given by the sampled values $S=\left\{{s}_{0}(n),n=1,\dots ,N\right\}$. Then, the signal reconstructed by wavelet transform is expressed as:
where ${\psi}_{j,k}(t)={2}^{\raisebox{1ex}{$j$}\!\left/ \!\raisebox{-1ex}{$2$}\right.}\psi ({2}^{j}t-k)$ with j, k ∈ Z, and $\psi (t)$ is the mother wavelet. $j=-1,-2,\dots ,-J$ is the number of resolution levels and its maximum value is $J={\mathrm{log}}_{2}N$ if the decomposition is performed over all resolution levels. k is the time index and ${r}_{j}(t)$ is the residual at scale j.

$${\psi}_{a,b}(t)={\left|a\right|}^{-\frac{1}{2}}\psi (\frac{t-b}{a})a,b\in R,a\ne 0$$

$${C}_{j}(k)={{\displaystyle \int}}_{-\infty}^{+\infty}S(t){\psi}_{j,k}(t)dt$$

$$S(t)={\displaystyle \sum}_{j=-J}^{-1}{\displaystyle \sum}_{k}{C}_{j}(k){\psi}_{j,k}(t)={\displaystyle \sum}_{j=-J}^{-1}{r}_{j}(t)$$

In radar signal processing, the signal is divided among non-overlapping temporal windows of length $L$, and the length is chosen to be $L=128$ empirically. Then, appropriate signal values are assigned to the central point of the time window for each interval $i=1,2,\dots ,{N}_{T,}{N}_{T,}=N/L$. Next, by considering the mean wavelet energy instead of the total wavelet energy, the mean energy at each resolution level $j$ for the time window $i$ using the wavelet coefficient is:
where ${N}_{j}$ is the number of wavelet coefficients at resolution $j$ involved in the time widow $i$. Then, the total energy of wavelet coefficient at interval $i$ is expressed as:

$${E}_{j}^{(i)}=\frac{1}{{N}_{j}}{\displaystyle \sum}_{k=(i-1)L+1}^{i\xb7L}{\left|{C}_{j}(k)\right|}^{2}$$

$${E}_{total}^{(i)}={\displaystyle \sum}_{j<0}{E}_{j}^{(i)}$$

Next, the relative wavelet energy that represents the energy’s probability distribution in scales is obtained by:

$${p}_{j}^{(i)}={E}_{j}^{(i)}/{E}_{total}^{(i)}$$

Finally, the average wavelet entropy of the whole time period is given by:

$$MWE=\frac{1}{{N}_{T}}{\displaystyle \sum}_{i=1}^{{N}_{T}}(-{\displaystyle \sum}_{j<0}{p}_{j}^{(i)}\xb7\mathrm{ln}\left[{p}_{j}^{(i)}\right])$$

• Standard deviation of wavelet entropy of target signal ($StdWE$)

$StdWE$ is a feature to quantitatively describe the fluctuation degree of wavelet entropy. $StdWE$ of human targets is much smaller than that of dog targets, since human target signals are more regular [12,13], generally speaking. It is given by:

$$StdWE={\left\{\frac{1}{{N}_{T}}{\displaystyle \sum}_{i=1}^{{N}_{T}}{(-{\displaystyle \sum}_{j<0}{p}_{j}^{(i)}\xb7\mathrm{ln}\left[{p}_{j}^{(i)}\right]-MWE)}^{2}\right\}}^{\frac{1}{2}}$$

• Mean of MWE in the OW window ($MMWEOW$)

In the calculation of $MMWEOW$, an OW, where the target point signal is right in the middle, is also needed and the width is chosen to be $o{w}_{MMWE}=20$. $MMWEOW$ is the mean value of MWE of each point signal closer and further the target position for $o{w}_{MMWE}$ point, i.e., the mean value of MWE in the OW. It is given by:
where $P$ is the position of target point signal.

$$MMWEOW=\frac{1}{2o{w}_{MMWE}+1}{\displaystyle \sum}_{i=P-o{w}_{MMWE}}^{P+o{w}_{MMWE}}MWE(i)$$

• Ratio of wavelet entropy ($RWE$)

$RWE$ is the ratio of the mean value of $MWE$ inside and outside the OW. It is expressed as:

$$MMW{E}_{oow}=\frac{1}{N-2o{w}_{MMWE}-1}{\displaystyle \sum}_{i=1,i\ne (P-o{w}_{MMWE}):(P+o{w}_{MMWE})}^{N}MWE(i)$$

$$RWE=\frac{MMWEOW}{MMW{E}_{oow}}$$

#### 3.4. Frequency-Corresponding Features

In this section, according to the research and accumulation of our group, it was found that the spectrum distribution of human targets in Hilbert marginal spectrum is different from that of dog targets. The curve of Hilbert marginal spectrum of human targets steeps slightly and the shape of the area under the curve is narrow and high. In contrast, the curve of dog targets is slightly gentle and the shape of the area under the curve is wide and low. Therefore, the frequency value corresponding to 1/4 total frequency band area, 3/4 total frequency band area under the Hilbert marginal spectrum curve, and the width between the above two are extracted.

• ${f}_{1/4}$, ${f}_{3/4}$ and width between ${f}_{1/4}$ and ${f}_{3/4}$ ($WOHMS$)

In this step, EEMD is performed first to process the target signal as Equations (8)–(13) illustrate. Then, Hilbert transformation is implemented in each IMF component of the reconstructed signal ${s}_{re}$.

$${\widehat{IMF}}_{i}(t)=\frac{1}{\pi}{{\displaystyle \int}}_{-\infty}^{\infty}\frac{IM{F}_{i}(\tau )}{t-\tau}d\tau $$

Next, the analytical signals are constructed by:

$${z}_{i}(t)=IM{F}_{i}(t)+j{\widehat{IMF}}_{i}(t)={\alpha}_{i}(t){e}^{j{\phi}_{i}(t)}$$

The amplitude of the analytical signals is expressed by ${\alpha}_{i}(t)=\sqrt{IM{F}_{i}{(t)}^{2}+{\widehat{IMF}}_{i}{(t)}^{2}}$ and the instantaneous phase is expressed as ${\phi}_{i}(t)=\mathrm{arctan}[\frac{{\widehat{IMF}}_{i}(t)}{IM{F}_{i}(t)}]$. Then, the Hilbert marginal spectrum can be given by:
where $T$ is the total signal duration and ${V}_{useful}$ is the total IMF component number of the reconstructed signals. $\omega $ is the instantaneous frequency which is defined as $\omega =2\pi \frac{d{\phi}_{i}(t)}{dt}$ [24]. The comparison results of Hilbert marginal spectrum of human targets and dog targets are demonstrated in Figure 7.

$$h(\omega )={{\displaystyle \int}}_{0}^{T}(Re{\displaystyle \sum}_{i=1}^{{V}_{useful}}{\alpha}_{i}(t){e}^{j\int (\frac{dt}{d{\phi}_{i}(t)})dt})dt$$

The ${f}_{1/4}$ and ${f}_{3/4}$ are the frequency values of the points corresponding to 1/4 total frequency band area and 3/4 total frequency band area, respectively, in Figure 7. Accordingly, $WOHMS$ is given by:

$$WOHMS={f}_{3/4}-{f}_{1/4}$$

• Respiratory Frequency ($RF$)

After performing Fourier transform of the target signal, the $RF$ of human targets and dog targets can be obtained.

## 4. SVM-Based Classification Procedure and Feature Selection Strategy

A support vector machine (SVM) is a type of generalized linear classifier which classifies data by supervised learning. Its decision boundary is the maximum margin hyperplane for learning samples. SVM classifier is chosen here because it has good performance for small samples when dividing data into two categories, which fits well with our purpose of distinguishing human beings from dogs.

Although the 12 features are calculated for each sample, these features may not contribute equally to improving the ability of distinguishing human targets from dog targets. There may be overfitting and poor generalization when too many features less capable of distinguishing or highly correlated to each other are adopted in the classifier. To find an optimal feature subset with optimal distinguishing power, recursive feature elimination method on SVM (SVM-RFE) is performed for the overall feature subset. In the selection process of the SVM-RFE, all features are sorted by backward elimination, wherein for each iteration, the feature that has the least contribution in the distinction is removed, i.e., the lowest weight [25,26]. The weights are calculated by the widely used LIBSVM package [27,28]. The higher the weight is, the stronger the distinguishing capability of the feature is. Then, after sorting, different feature subsets would be obtained by choosing the Top-f (1 $\le $ f $\le $ F) features from the ranked features, where F is the total numbers of features (F = 12). Accordingly, there would be F classifiers. Next, the different classifiers involving different feature subsets are evaluated by calculating the receiver operating characteristic (ROC) curve and the area under the curve (AUC). Finally, to validate the stability and generalization of the optimal classifier, 100 rounds of the ten-fold cross-validation method are performed to calculate the average parameters of the classifier. The procedure of the classification and selection are expressed as follows.

- 1.
- Sorting the overall feature set.
- while (the number of features is not zero){
- calculating the weights of each feature using the LIBSVM package;
- removing the feature with the lowest weight;
- }

- 2.
- Modeling classifiers with the Top-f (1 $\le $ f $\le $ F) features from the sorted sequence.
- 3.
- Choose the optimal feature subset with the highest AUC from the classifier in step 2.
- 4.
- Calculating the key parameters of the classifier in step 3 after 100 round ten-fold cross-validation.

In the procedure of ten-fold cross-validation, the sample data are firstly divided randomly into ten copies with equal numbers. Then, nine of them are used as training samples and the remaining copy is used as the test data per time, until each data copy is used as test data.

The ordinate of the ROC curve is the true positive rate which is defined as “Sensitivity” and the abscissas is the false positive rate which is defined as “1 − Specificity”. The calculations of Sensitivity and Specificity are illustrated in Equations (33) and (34), respectively. One of the advantages of the ROC curve is that when the distribution of positive and negative samples changes, the shape of the ROC curve can be basically unchanged. Therefore, the interference caused by different test sets can be reduced and the evaluation of the performance of the model is more objective. The key parameter extracted from the ROC curve is the value of AUC; a larger value denotes a classifier with a better performance.

It is specified that in the calculation of the AUC of each classifier, the Grid-Search method in the LIBSVM package is applied to determine the optimal parameter combinations of (c, g) which are closely related to the distinguishing performance of the classifier. The c is the penalty coefficient corresponding to the generalization ability of the model. g, one of the parameters of the kernel function in the classifier, is related to the number of support vectors and further affects the speed of model training and prediction. Finally, an optimal feature subset for distinguishing task is selected by choosing the subsets with the highest AUC values. The overall steps in the selection of the optimal classifier and the detailed processes of the SVM-RFE are illustrated in Figure 8 and Figure 9, respectively.

In ten-fold cross-validation, sample data are divided into ten copies and implemented ten times for the training test procedure, until each copy is used as test data once. The results of the ten times are accumulated and performance of the classifiers are evaluated using AUC value, Sensitivity, Specificity, and Accuracy (ACC). The human target labels in the classifiers are “+1” as the positive samples, and the dog target labels are “−1” as the negative samples. Sensitivity is the accuracy of judging actual human targets as human targets. Specificity is the accuracy of judging actual dog targets as dog targets. ACC is the overall accuracy of judging the targets correctly, whether human or dog targets. AUC, the area under the ROC curve, is a parameter for evaluating the overall performance of the classifier. The calculations of Sensitivity, Specificity, and Accuracy are expressed as:

$$Sensitivity=\frac{thenumberof\u201c+1\u201d\mathrm{both}inpredictiveandtestlabelssimultaneously}{totalnumberof\u201c+1\u201dinthetestlabels}$$

$$Specificity=\frac{thenumberof\u201c-1\u201d\mathrm{both}inpredictiveandtestlabelssimultaneously}{totalnumberof\u201c-1\u201dinthetestlabels}$$

$$\begin{array}{cc}Accuracy& =\frac{thenumberof\u201c+1\u201d\mathrm{both}inpredictiveandtestlabelssimultaneously+}{totalnumberofthetestlabels}\\ & \frac{thenumberof\u201c-1\u201d\mathrm{both}inpredictiveandtestlabelssimultaneously+}{totalnumberofthetestlabels}\end{array}$$

The test labels are the labels (“+1” or “−1”) of the target samples when the samples are treated as test data. The predictive labels are the corresponding predictive classification results of the test data used in the optimal classifier.

In addition, the method is also tested to distinguish human targets versus no targets and dog targets versus no targets. Let us assume that the accuracies of both classifications are highly validated, then it prompts that any two types of targets within human, dog, and no target can be effectively classified using the method. Simultaneously, it can further prove that the vital information of human or dog is effectively captured. Fifty sets of environmental interference signals without any targets are collected and utilized for classification with signals of human targets and dog targets, respectively. The classification method used here is SVM-RFE, which is identical to that of the method in the classification between human and dog targets. No targets samples are labeled as “−1” in both classifications. The key result parameters of optimal classifiers with optimal subsets are described in Section 5.4.

## 5. Experimental Results

#### 5.1. Experimental Setup and Data acquisition

A total of eight healthy human targets aged between 23 and 42 and five grown-up beagle dog targets aged approximately 1 year were involved. All dogs were provided by the Experimental Animal Center of Fourth Military Medical University. A 28-cm-thick brick wall was present between the targets and the antennas. The photographs and geometries of the experiments are shown in Figure 10. Each target was detected ten times at about 2.5 m away from the wall. Because the time interval of every two acquisitions, distance away from the wall, posture facing radar, and environmental interference are not identical each time, the raw radar signals can be treated from different samples when the acquisition times are small (10 times per target). Each dog target laid prone quietly on the experimental table, whose middle line was 2.5 m away from the wall. Owing to the width of the table being 0.8 m, dog targets could choose a comfortable posture on the table following their wishes, until they remained still for a long time. Thus, dogs could lay facing different directions with random postures in each sample collection. Similarly, different directions of standing facing are set in the human signal collection scenario. What is more, for each individual human target, measuring intervals ranged from 1 minute to 20 minutes between two acquisition signals, and the measuring intervals of each individual dog target ranged from 1 minute to 12 hours. Therefore, the states of targets and the environmental interference were not identical each time. So, there may be large differences in the signal features for each acquisition, even if it is the same target. The detailed information for the experimental subjects is presented in Table 2.

#### 5.2. Classification Between Human Targets and Dog Targets

The results of the feature sorting and the optimal feature subset are illustrated in Table 4. The order of the features in the list of sorting results represents their distinguishing performance. The smaller the sequence number, the higher the feature weights. Among the features, the CRCCMV has the highest weight and contributes the most to the distinguishing task.

The classification performance using different subsets of ranked features is illustrated in Figure 11. The optimal feature subset is the Top-11 ranked features. The feature that contributes the least is the respiratory frequency ($RF$), which was expected since the $RF$ of human and dog targets are about the same, which is approximately 0.2–0.4 Hz. Therefore, the difference in the $RF$ feature between human and dog targets is the smallest. The feature that contributes the most is $CRCCMV$. As shown in Figure 6, the $CCMV$ curve of human targets is much smoother than that of dog targets. In this way, the difference of $CRCCMV$ between human targets and dog targets will be much bigger.

The key parameters of the classifier with the optimal feature subset after 100 rounds of ten-fold cross-validation is presented in Table 5.

#### 5.3. Analysis of Contribution of Different Categories

To compare the ability of different category features in distinguishing human targets from dog targets, the overall features are divided into five groups of features and their performance is evaluated using the ROC curve and key parameters of classifier: (1) Correlation coefficient-corresponding features; (2) wavelet entropy-corresponding features; (3) energy-corresponding features; (4) frequency-corresponding features; and (5) optimal feature subset. The comparison results are expressed in Figure 12 and Table 6. Correlation coefficient-corresponding features independently and wavelet entropy-corresponding features independently have similar performances, viewed from the comparison results of ROC curve, obviously superior to the other two groups when used along. The optimal feature subset has the best performance in any way viewed from the parameters in Table 6. From the perspective of the ability in identifying human targets, correlation coefficient-corresponding features utilized independently have the same performance as the optimal feature subset, but the ability of identifying dog targets has a much poorer comparison.

#### 5.4. Classification Between No Target Signals and Target Signals

The photograph and geometry of no target signals acquisition are demonstrated in Figure 13. The deployments are the same as those of dogs, except that there are no targets.

The collected 50 samples of no targets were used for classification with human target samples and dog target samples, respectively. The key parameter of the corresponding optimal classifiers after SVM-RFE and 100 rounds of ten-fold cross-validation are listed in Table 7.

The AUC of classifier between no targets and human targets can reach 1.0. Considered in conjunction with the classification between human and dog targets, the AUC value substantiates that the collected signals of human targets are veritable and useful, i.e., the 80 samples of humans truly contain the information of human targets. Similarly, the best AUC value between no targets and dog targets is 0.9976, which implies that the dog signals are also true target signals. Meanwhile, the results in Table 7 verify the validity of the method for the classification between target situations and no target situations.

## 6. Discussion

The proposed method provided an outstanding performance in distinguishing stationary human and dog targets under through-wall condition. Twelve features belonging to four categories were combined based on the SVM-RFE method, and the distinguishing accuracy was able to reach 0.9924. The method was significant for some actual applied post-disaster rescuing situations. Especially, it could help optimizing distribution of rescue resources and enhance the rescuing confidence in post-earthquake searching and miner accident rescuing, where the trapped subjects are buried under obstacles and unable to move. Furthermore, there are basically no other research groups that have reported on the classification of stationary humans and animals, since the existing method concentrates mostly on the distinguishing of moving targets. However, there are a few issues to further consider for practical application. Besides dogs, cats, rabbits, and some other common family pets that are likely to cause false alarm in post-disaster rescue applications could be included in further research.

## 7. Conclusions

An accurate algorithm to distinguish stationary humans from dogs under through-wall conditions is proposed, based on a 500 MHz center-frequency UWB radar. The algorithm combines four categories involving 11 feature species to form an optimal feature subset based on the SVM-RFE method. The classifier using the optimal feature subset was found to have excellent performance in the distinguishing task. The AUC average value and ACC value are 0.9993 and 0.9924, respectively, after 100 rounds of ten-fold cross validation, which confirms that the algorithm is efficient and suitable for the classification of stationary human and dog targets under through-wall conditions. In addition, the correlation coefficient-corresponding features have the most capable contribution compared to the other three groups, which are wavelet entropy-corresponding features, energy-corresponding features, and frequency-corresponding features. To be more rigorous, the classification between no target situations and target situations were performed. The results confirm that the collected human and dog target signals truly contain the information of the respective targets. Meanwhile, the AUC values also verify that the method proposed in this paper is valid in classification between no targets situations and targets situations. We envision that this algorithm can be applied to various practical situations such as earthquake and hostage rescue missions and intelligent homes.

## Author Contributions

Conceptualization, Y.M. and Y.Z.; methodology, Y.M., X.Y., and Y.Z.; validation, Y.M., P.W., and Y.Z.; investigation, Y.M. and P.W.; data curation, Y.M., P.W., and Y.Z.; writing—original draft preparation, Y.M.; writing—review and editing, F.L., Y.Z., and H.L.; funding acquisition, F.L., Y.Z., and J.W.

## Funding

This research was funded by the National Key R&D Program of China (No. 2018YFC0810201) and the National Natural Science Foundation of China (No. 31800822).

## Acknowledgments

The authors thank anonymous reviewers and academic editors for their valuable comments and helpful suggestions. The authors would also be grateful to assistant editor for her meticulous work.

## Conflicts of Interest

The authors declare no conflicts of interest.

## Abbreviations

The following abbreviations are used in this manuscript:

UWB | Ultra-wideband |

SVM | Support vector machine |

RFE | Recursive feature elimination |

StdCRMV | Standard deviation change rate of micro vibration |

OW | Optimal Window |

ERRFB | Energy ratio of the reference frequency band |

OCCMV | Optimal correlation coefficient of micro vibration |

CRCCMV | Change rate of correlation coefficient of micro vibration |

MWE | Mean of wavelet entropy of target signal |

StdWE | Standard deviation of wavelet entropy of target signal |

MMWEOW | Mean of MWE in the OW window |

RWE | Ratio of wavelet entropy |

${f}_{1/4}$ | The frequency values of the points corresponding to 1/4 total frequency band area in Hilbert marginal spectrum |

${f}_{3/4}$ | The frequency values of the points corresponding to 3/4 total frequency band area in Hilbert marginal spectrum |

WOHMS | Width between ${f}_{1/4}$ and ${f}_{3/4}$ |

RF | Respiratory frequency |

AUC | Area under the ROC curve |

ACC | Accuracy |

## References

- Li, C.; Lin, J. Microwave Noncontact Motion Sensing and Analysis; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2014. [Google Scholar]
- Zhang, Y.; Qi, F.; Lv, H.; Liang, F.; Wang, J. Bioradar Technology: Recent Research and advancements. IEEE Microw. Mag.
**2019**, 20, 58–73. [Google Scholar] [CrossRef] - Tang, Y.; Wang, J.; Li, C. Short-Range Indoor Localization Using a Hybrid Doppler-UWB System. In Proceedings of the IEEE MTT-S International Microwave Symposium (IMS), Honololu, HI, USA, 4–9 June 2017; pp. 1007–1010. [Google Scholar]
- GarciaFernandez, M.; AlvarezLopez, Y.; LasHeras, F. Autonomous Airborne 3D SAR Imaging System for Subsurface Sensing: UWB-GPR on Board a UAV for Landmine and IED Detection. Remote Sens.
**2019**, 11, 2357. [Google Scholar] [CrossRef] - Ryu, S.J.; Suh, J.S.; Baek, S.H.; Hong, S.; Kim, J.H. Feature-Based Hand Gesture Recognition Using an FMCW Radar and its Temporal Feature Analysis. IEEE Sens. J.
**2018**, 18, 7593–7602. [Google Scholar] [CrossRef] - Wang, Y.; Yu, X.; Zhang, Y.; Lv, H.; Jiao, T.; Lu, G.; Li, Z.; Li, S.; Jing, X.; Wang, J. Detecting and monitoring the micro-motions of trapped people hidden by obstacles based on wavelet entropy with low center-frequency UWB radar. Int. J. Remote Sens.
**2015**, 36, 1349–1366. [Google Scholar] [CrossRef] - Peng, Z.; Li, C. Portable Microwave Radar Systems for Short-Range Localization and Life Tracking: A Review. Sensors
**2019**, 19, 1136. [Google Scholar] [CrossRef] [PubMed] - Tahmoush, D.; Silvious, J. Remote detection of humans and animals. In Proceedings of the IEEE Applied Imagery Pattern Recognition Workshop, Washington, DC, USA, 14–16 October 2009. [Google Scholar]
- Lee, J.; Kwon, J.; Bae, J.H.; Lee, C.H. Classification Algorithms for Human and Dog Movement Based on Micro-Doppler Signals. IEIE Trans. Smart Process. Comput.
**2017**, 6, 10–17. [Google Scholar] [CrossRef] - Van Eeden, W.D.; De Villiers, J.P.; Berndt, R.J.; Nel, W.A.; Blasch, E. Micro-Doppler radar classification of humans and animals in an operational environment. Expert Syst. Appl.
**2018**, 102, 1–11. [Google Scholar] [CrossRef] - Otero, M. Application of a continuous wave radar for human gait recognition. SPIE
**2005**, 5809, 538–548. [Google Scholar] - Wang, Y.; Yu, X.; Zhang, Y.; Lv, H.; Jiao, T.; Lu, G.; Li, W.Z.; Li, Z.; Jing, X.; Wang, J. Using Wavelet Entropy to Distinguish Between Humans And Dogs Detection by UWB Radar. Prog. Electromagn. Res. PIER
**2013**, 139, 335–352. [Google Scholar] [CrossRef] - Wang, Y. Study on the Technology of Distinguishing between Humans and Animals via UWB Bio-Radar. Ph.D. Thesis, the Fourth Military Medical University, Xi’an, Shaanxi, China, 2014. [Google Scholar]
- Yu, X.; Jiao, T.J.; Lv, H.; Zhang, Y.; Li, Z.; Wang, J.Q. A New Use of UWB Radar to Detecting Victims and Discriminating Humans from Animals. In Proceedings of the 16th International Conference on Ground Penetrating Radar (GPR), Hong Kong, China, 13–16 June 2016. [Google Scholar]
- Yin, Y.; Yu, X.; Lv, H.; Liu, M.; Qi, F.; Wang, J. Micro-vibration Distinguishment Between Humans and Animals Based on EEMD Using UWB Radar. In Proceedings of the IET International Radar Conference, Nanjing, China, 17–19 October 2018. [Google Scholar]
- Lv, H.; Lu, G.H.; Jing, X.J.; Wang, J.Q. A new ultra-wideband radar for detecting survivors buried under earthquake rubbles. Microw. Opt. Technol. Lett.
**2010**, 52, 2621–2642. [Google Scholar] [CrossRef] - Li, Z.; Li, W.; Lv, H.; Zhang, Y.; Jing, X.; Wang, J. A Novel Method for Respiration-Like Clutter Cancellation in Life Detection by Dual-Frequency IR-UWB Radar. IEEE Trans. Microw. Theory Tech.
**2013**, 61, 2086–2092. [Google Scholar] [CrossRef] - Yu, X. Study on the Technology of Distinguishing Between Human Beings and Animals by Vital Signs Based on Bio-Radar Detection. Ph.D. Thesis, the Fourth Military Medical University, Xi’an, Shaanxi, China, 2012. [Google Scholar]
- Yin, Y.; Yu, X.; Lv, H.; Liu, M.; Qi, F.; Wang, J. Micro-Vibration Distinguishment of Radar Between Humans and Animals Based on EEMD and Energy Ratio Characteristics. China Med. Devices
**2018**, 33, 27–31. [Google Scholar] - Huang, N.E.; Shen, Z.; Long, S.R.; Wu, M.C.; Shih, H.H.; Zheng, Q.; Yen, N.; Tung, C.C.; Liu, H.H. The Empirical Mode Decomposition and the Hilbert Spectrum for Nonlinear and Non-Stationary Time Series Analysis. Proc. R. Soc. Lond. Ser. A Math. Phys. Eng. Sci.
**1998**, 454, 903–995. [Google Scholar] [CrossRef] - Wu, Z.; Huang, N.E. Ensemble Empirical Mode Decomposition: A Noise Assisted Data Analysis Method. Adv. Adapt. Data Anal.
**2009**, 1, 1–41. [Google Scholar] [CrossRef] - Shannon, C.E. A mathematical theory of communication. Bell Syst. Tech. J.
**1948**, 27, 379–423. [Google Scholar] [CrossRef] - Matsui, T.; Ishizuka, T.; Ishihara, M.; Ishihara, M.; Matsumura, K.; Kikuchi, M.; Kurita, A. The non-contact monitoring of heart and respiratory rates using laser irradiation: An experimental simultaneous monitoring with and without clothes during biochemical hazards. J. Med Eng. Technol.
**2003**, 27, 133–136. [Google Scholar] [CrossRef] [PubMed] - Hu, B.; Zhang, X.; Mu, J.; Wu, M.; Wang, Y. Spasticity assessment based on the Hilbert–Huang transform marginal spectrum entropy and the root mean square of surface electromyography signals: A preliminary study. BioMed. Eng. OnLine
**2018**, 17, 27. [Google Scholar] [CrossRef] - Zhang, X.; Xu, X.; Tian, Q.; Li, B.; Wu, Y.; Yang, Z.; Liang, Z.; Liu, Y.; Cui, G.; Lu, H. Radiomics assessment of bladder cancer grade using texture features from diffusion-weighted imaging. J. Magn. Reson. Imaging
**2017**, 46, 1281–1288. [Google Scholar] [CrossRef] - Xu, X.; Liu, Y.; Zhang, X.; Tian, Q.; Wu, Y.; Zhang, G.; Meng, J.; Yang, Z.; Lu, H. Preoperative prediction of muscular invasiveness of bladder cancer with radiomic features on conventional MRI and its high-order derivative maps. Abdom. Radiol.
**2017**, 42, 1896–1905. [Google Scholar] [CrossRef] - Andrew, A.M. An Introduction to Support Vector Machines and Other Kernel-Based Learning Methods; Cambridge University Press: Cambridge, UK, 2000; Volume 18, pp. 687–689. [Google Scholar]
- Chang, C.C.; Lin, C.J. LIBSVM: A library for support vector machines. ACM Trans. Intell. Syst. Technol.
**2011**, 2, 27. [Google Scholar] [CrossRef]

**Figure 1.**Block diagram of ultra-wideband (UWB) radar. Abbreviations: ADC, Analog-to-Digital Converter; TA, transmitter antenna; RA, receiving antenna.

**Figure 4.**Data after preprocessing $DataAF(x,n)$ when the human target is 2.5 m behind the wall. The scope of the target’s movements is not just fixed in one point along the range dimension, because the obtained reflected signals are due to the target’s skin and internal organ movements or movements in other parts of the body.

**Figure 5.**Comparison results of $StdCRMV$. (

**a**) $Std$ of human target; (

**b**) $Std$ of dog target. Abbreviations: OW, optimal window; StdCRMV, standard deviation change rate of micro vibration; Std, standard deviation.

**Figure 6.**Comparison results of the correlation coefficient of micro vibration ($CCMV$) of human targets and dog targets: (

**a**) $CCMV$ of human targets; (

**b**) $CCMV$ of dog targets.

**Figure 7.**Comparison results of the Hilbert marginal spectrum of human and dog targets: (

**a**) Hilbert marginal spectrum of human targets; (

**b**) Hilbert marginal spectrum of dog targets.

**Figure 8.**Overall steps of the selection of the optimal classifier. In “

**Step 1**: Radar data acquisition”, the raw echo data are collected. In “

**Step 2**: Feature extraction”, twelve feature species belonging to four categories are extracted. In “

**Step 3**: Optimal classifier selection by SVM-RFE”, an optimal classifier with the highest AUC value is selected.

**Figure 9.**Detailed processes of the SVM-RFE. In “

**Step 1**: Feature sorting”, the color of a feature represents its contribution in the distinguishing task. A warmer color denotes more contributions. Abbreviations: AUC, area under the curve; In “

**Step 2**: AUC values calculation”, the optimal parameter combinations of (c, g) are determined and AUC values with Top-f (1 $\le $ f $\le $ F) features from the sorted sequence are calculated. Finally, an optimal classifier with optimal Top-f features will be obtained.

**Figure 10.**Photographs and geometries of experiments: (

**a**) Sketch map of human target measuring; (

**b**) actual measuring photographs of human target; (

**c**) sketch map of dog target measuring; (

**d**) actual measuring photographs of dog target.

**Figure 11.**AUC values using different feature subset. The optimal feature subset is the Top-11 ranked features.

**Figure 12.**Comparison results of different category features in distinguishing human targets from dog targets.

**Figure 13.**Photograph and geometry of no target signals acquisition experiment: (

**a**) Sketch map of no target measuring; (

**b**) actual measuring photographs of no target.

Parameters | Value |
---|---|

Center frequency | 500 MHz |

Bandwidth | 500 MHz |

Pulse repetition frequency | 128 kHz |

Sampling points | 2048 |

Scanning speed | 64 Hz |

Sensitivity of receiver | −78 dBm |

Dynamic range of receiver | 60 dB |

Sampling bit number of ADC | 16 bits |

Human Target | Dog Target | Total | |
---|---|---|---|

Target number | 8 | 5 | 13 |

Data samples | 8 × 10 | 5 × 10 | 13 × 10 |

Male numbers | 8 | 2 | 10 |

Female numbers | 0 | 3 | 3 |

Detailed Species | Key Preprocessing Parameters | |
---|---|---|

Energy-corresponding features | StdCRMV; ERRFB | Q = 1, OW_{w} = 15 for StdCRMV;Q = 10 for ERRFB; |

Correlation coefficient-corresponding features | OCCMV; CRCCMV | Q = 1, OW_{w} = 15 |

Wavelet entropy-corresponding features | MWE; S_{td}MWE; MMWEOW; RWE | Q = 10, OW_{w} = 20 |

Frequency-corresponding features | ${f}_{1/4}$; ${f}_{3/4}$; WOHMS; RF | Q = 10 |

Total number | 12 |

Note: “OW

_{w}” represents the width of the OW.Sorting Results | Feature Species |
---|---|

① | CRCCMV |

② | MWE |

③ | MMWEOW |

④ | StdCRMV |

⑤ | WOHMS |

⑥ | RWE |

⑦ | StdWE |

⑧ | OCCMV |

⑨ | ${f}_{3/4}$ |

⑩ | ERRFB |

⑪ | ${f}_{1/4}$ |

⑫ | RF |

**Table 5.**Key parameters of the classifier with the optimal feature subset after 100 rounds of ten-fold cross-validation.

Parameters | Values |
---|---|

Sensitivity | 0.9877 |

Specificity | 0.9994 |

Accuracy | 0.9924 |

AUC | 0.9993 |

**Table 6.**Comparison results of different category features in distinguishing human targets from dog targets cross-validation.

Sensitivity | Specificity | Accuracy | AUC | |
---|---|---|---|---|

Correlation coefficient-corresponding features alone | 0.9877 | 0.92 | 0.9618 | 0.9941 |

Wavelet entropy-corresponding features alone | 0.9506 | 0.96 | 0.9542 | 0.9956 |

Energy-corresponding features alone | 0.9259 | 0.78 | 0.8702 | 0.9323 |

Frequency-corresponding features | 0.9136 | 0.82 | 0.8779 | 0.9363 |

Optimal feature subset | 0.9877 | 0.9994 | 0.9924 | 0.9993 |

**Table 7.**Key parameters of the corresponding optimal classifiers of no target classification with human targets and dog targets, respectively.

Sensitivity | Specificity | Accuracy | AUC | |
---|---|---|---|---|

No target with human target classification | 1.0000 | 0.9890 | 0.9958 | 1.0000 |

No target with dog target classification | 0.9530 | 0.9936 | 0.9733 | 0.9976 |

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).