Next Article in Journal
Pattern Recognition in Music on the Example of Reconstruction of Chest Organ from Kamień Pomorski
Next Article in Special Issue
Human-Machine Shared Driving Control for Semi-Autonomous Vehicles Using Level of Cooperativeness
Previous Article in Journal
Quantitative and Comparative Analysis of Effectivity and Robustness for Enhanced and Optimized Non-Local Mean Filter Combining Pixel and Patch Information on MR Images of Musculoskeletal System
Previous Article in Special Issue
Optimal H Control for Lateral Dynamics of Autonomous Vehicles
Order Article Reprints
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:

Feasibility of Using a MEMS Microphone Array for Pedestrian Detection in an Autonomous Emergency Braking System

Department of Signal Theory and Communications and Telematics Engineering, University of Valladolid, Paseo de Belén, 15, 47011 Valladolid, Spain
School of Industrial Engineering, University of Valladolid, Paseo del Cauce, 59, 47011 Valladolid, Spain
Authors to whom correspondence should be addressed.
Sensors 2021, 21(12), 4162;
Received: 19 May 2021 / Revised: 9 June 2021 / Accepted: 15 June 2021 / Published: 17 June 2021
(This article belongs to the Special Issue Sensors Fusion for Vehicle Detection and Control)


Pedestrian detection by a car is typically performed using camera, LIDAR, or RADAR-based systems. The first two systems, based on the propagation of light, do not work in foggy or poor visibility environments, and the latter are expensive and the probability associated with their ability to detect people is low. It is necessary to develop systems that are not based on light propagation, with reduced cost and with a high detection probability for pedestrians. This work presents a new sensor that satisfies these three requirements. An active sound system, with a sensor based on a 2D array of MEMS microphones, working in the 14 kHz to 21 kHz band, has been developed. The architecture of the system is based on an FPGA and a multicore processor that allow the system to operate in real time. The algorithms developed are based on a beamformer, range and lane filters, and a CFAR (Constant False Alarm Rate) detector. In this work, tests have been carried out with different people and in different ranges, calculating, in each case and globally, the Detection Probability and the False Alarm Probability of the system. The results obtained verify that the developed system allows the detection and estimation of the position of pedestrians, ensuring that a vehicle travelling at up to 50 km/h can stop and avoid a collision.

1. Introduction

Nowadays, road vehicles are the most widespread transportation system. Current estimations set the mortality due to traffic accidents and collisions at around 1.35 million people per year. Furthermore, 23% of this mortality is associated with pedestrian fatalities, totaling some 310,000 pedestrian deaths per year [1]. To try to reduce this fatal number, the automotive and transport sectors are actively working on different solutions, even more so since the development of autonomous cars began, where their use in urban environments is still an unsolved problem due to the high presence of pedestrians.
On the one hand, among the different actions to reduce the number of pedestrian accidents that are being carried out, some of these actions focus on studies related to the analysis of pedestrian behavior at crosswalks [2,3,4,5], to try to anticipate their movements, and take them into account when making decisions about braking the vehicle or slowing down its speed in areas likely to be at risk, due to the presence of pedestrians, who could become potential victims. This information is very valuable in terms of paying attention to pedestrian detection and tracking [4] and also in terms of designing routes to avoid potential risk areas as much as possible [5].
On the other hand, further actions to reduce the number of pedestrian accidents are focused on the development and improvement of intelligent transport systems. Many of these systems pay special attention to the interaction between vehicles and pedestrians, warning vehicle drivers that pedestrians are in the vicinity [6,7,8]; other systems are centered on information sharing between vehicles and the transport infrastructure itself [9,10].
In addition to these actions, over the last several decades, vehicles have been equipped with Advanced Driver Assistance Systems (ADAS), such as ABS (Anti-Blocking System), ESC (Electronic Stability Controllers), and AEB (Autonomous Emergency Braking), which improve their comfort, efficiency, and safety. All these systems are of particular interest in autonomous vehicles. Among these systems, there are AEB systems specifically focused on pedestrian detection, the so-called AEB-P (AEB for Pedestrians), whose studies are heavily focused on trying to reduce the number of accidents involving pedestrians, either by using monocular cameras [11], analyzing time-to-collision (TTC) [12,13], mapping the positions of detected pedestrians [14], including vacuum emergency braking (VEB) [15], or developing protocols for testing emergency braking systems [16,17].
For all these actions mentioned above, studies and developments are essential to improve pedestrian detection systems and algorithms, either by using statistical models [18], combining the use of different detection systems [19], defining complex scenarios [20], improving algorithms for tracking detected pedestrians [21,22], developing algorithms for detecting and tracking multiple pedestrians [23,24,25], or by using machine learning algorithms [26,27,28].
Most of the systems used for pedestrian detection are based on RGB cameras and image processing algorithms. These systems are very effective when the surrounding visibility conditions are adequate, but when the visibility is reduced, their performance drops considerably. For this reason, several studies are currently focused on trying to solve this problem. Some of the solutions being worked on are based on improving pedestrian detection algorithms in this type of low visibility environment [29,30,31], and others focus on using other types of systems to obtain information, such as the use of LIDAR [32,33] or infrared sensors [34,35], or on fusing images obtained from the classic RGB camera with other detection systems, such as thermal cameras [36], LIDAR [37], or an array of microphones [38].
Under these premises, the idea of analyzing the feasibility of detecting pedestrians by means of an acoustic array installed in a vehicle arose. The system, working in conjunction with AEB systems, would prevent traffic accidents. Using acoustic signals ensures that the system will function properly in environments with reduced visibility. Arrays are ordered sets of identical sensors, whose response pattern is controlled by modifying the amplitude and phase given to each sensor, their spacing, and their distribution in space (linear, planar, etc.) [39]. Beamforming techniques are used with arrays [40] so that the array pattern is electronically steered to different spatial positions.
The acronym MEMS (Micro-Electro-Mechanical System) refers to technology that develops mechanical systems with a dimension of less than 1 mm on integrated circuits (ICs) [41]. MEMS technology applied to acoustic sensors has developed high-quality microphones with high SNR (signal-to-noise ratio), high sensitivity, and low power consumption [42]. In sound source localization systems, MEMS microphones are often combined with FPGA-based architectures [43].
In this paper, based on the idea that the use of MEMS sensors is common in vehicles and the transport sector [44,45], and on the authors’ previous experience in the implementation of acoustic arrays for human detection [46,47,48], the authors study the feasibility of using an array of MEMS microphones embedded on a vehicle to detect pedestrians. Some tests have been carried out in order to detect the position of pedestrians in outdoor environments. The tests have been carried out at different distances, depending on the braking distances to be considered according to different speeds that a vehicle can carry on urban roads, and also according to the maximum deceleration that the AEB system gives to the vehicle. The main objective of this paper is to analyze the feasibility of using a system with the indicated characteristics to accurately detect pedestrians on a road.
Section 2 introduces the description of the system developed in this study, showing the different hardware platforms that compose the system and the system features. Section 3 presents the results obtained on the tests of the system and the corresponding discussion of these results. Finally, Section 4 contains the conclusions that authors have drawn on the basis of the obtained results.

2. Materials and Methods

This section presents the requirements to be met by the acoustic acquisition and processing system, based on a 2D array of MEMS microphones. Then the hardware on which the system is based is presented, as well as the processing algorithms with which the pedestrian presence is detected.

2.1. Requirements

The main objective of the designed system is to allow pedestrian detection using an acoustic array of digital MEMS microphones that will be embedded on a vehicle, in order to provide sensing information to an AEB system. To achieve this objective, a set of requirements has been defined that must be satisfied by the system:
  • The designed system will have to be onboard a vehicle, specifically at its front, in order to be able to detect pedestrians in its path.
  • The system will be based on the principle of RADAR (RAdio Detection and Ranging) but using sound waves instead of radio waves, i.e., the system will be based on a SODAR (SOund Detection and Ranging) system. This type of system is an active one, based on the generation of an acoustic signal. This signal is reflected on a pedestrian that could be in the vehicle’s path. The reflected signal is then received by the MEMS microphone array, for its subsequent analysis and processing.
  • The acoustic pedestrian detection system shall be able to communicate with the vehicle’s central vehicle control system to alert the AEB system, so that the car can act accordingly. The reason of this requirement is that the purpose of this system is to be integrated into a vehicle, together with other ADAS systems with which the vehicle may be equipped, to assist the AEB system in making a more reliable braking decision in a pedestrian detection in the vehicle’s path.
  • The system should provide reliable results taking into account that the vehicle may travel at different speeds. In this respect, since AEB systems are often used in urban environments, it has been defined that the system should be able to detect pedestrians for vehicle speeds below 50 km/h, which is the usual speed limit in urban environments (although in some cities, this limit is already being reduced to 30 km/h).
  • Another requirement that the system must comply with is that it must have a good resolution in the horizontal coordinate (azimuth). We assume that the pedestrian will be standing on the road, at the vertical height of the vehicle, so the resolution of the system should be focused on the horizontal coordinate. So, the position of the pedestrian will be represented in this azimuth coordinate.

2.2. Hardware Setup

2.2.1. MEMS Array

The acoustic images acquisition system used in this paper is based on a Uniform Planar Array (UPA) of MEMS microphones. This array, which has been entirely developed by the authors and is shown in Figure 1, is a rectangular array, of 5 × 30 MEMS microphones, consisting of 6 array modules. Each of these modules is based on a square array of 5 × 5 sensors, which are uniformly spaced every 0.9 cm in a rectangular Printed Circuit Board (PCB).
The selection of the working frequency range of the system responds to multiple factors: the physical size of the array, the sensor spacing, the frequency response of the acoustic MEMS sensors and the emitter, the acoustic reflectivity of people, and finally the required angular resolution of the system. As a result of all these considerations, it is necessary to reach a compromise relationship, since many of these parameters are opposite. Thus, a frequency band between 14 and 21 kHz has been selected. These high frequency values have been selected also to avoid the ambient noises, of much lower frequency, that could interfere with the behavior of the system. Once the working frequency range was defined, the acoustic array was designed in such a way that the sensor spacing provided good resolution at low frequencies, and it also avoided the appearance of grating lobes in the Field of View (FoV) for high frequencies.
SPH0641LU4H-1 digital MEMS microphones of Knowles were chosen for the implementation of the array. These microphones have a PDM (Pulse Density Modulation) interface and a one-bit digital output [49]. The main features of these microphones are: high performance, low-power, omnidirectional response, 64.3 dB SNR, high sensitivity (−25 dBFS), and an almost flat frequency response (±2 dB in the range of 10 kHz to 24 kHz).

2.2.2. Processing System

The hardware used to implement the system was selected taking into account the previous requirements defined in Section 2.1. A search for a commercial solution was done due to the high cost and time of a specific hardware design.
The base unit of the system is an sbRIO 9629 platform [50]. This platform belongs to National Instruments, particularly to the Reconfigurable Input-Output (RIO) family of devices. Specifically, this sbRIO platform is an embedded single-board controller, with an FPGA Artix-7 200T and a Quad-Core Intel Atom processor. The FPGA has 96 digital inputs/outputs, of which 75 are used as the connection interface with the 150 MEMS microphones of the array, so that in each I/O line, two microphones are multiplexed, while the other lines are used to generate the clock and synchronize. The Atom processor is equipped with 2 GB of DDR3 RAM, 4 GB of built-in storage space, USB Host port, and Giga Ethernet port. Finally, it has sixteen 16-bit analog inputs and four 16-bit analog outputs, which are used to generate the transmitted signal. All this hardware is mounted on a single PCB board (155 mm × 102 mm × 35 mm) with a weight of 330 g. These sbRIO devices are oriented to sensors with nonstandard acquisition procedures, allowing low-level programming of the acquisition routines.
The embedded processor included in sbRIO is capable of running all the software algorithms to detect targets, so it can be used as a standalone array module formed by a sbRIO connected to a MEMS array board as shown in Figure 2. The position of the detected target will be sent to the AEB system of a vehicle.

2.3. Software Algorithms

Based on being active, this system must consist of both a receiving subsystem, based on an acoustic array, and a transmitting subsystem, in charge of generating the signal(s) that will be reflected on the possible pedestrians to be detected. Thus, the algorithms implemented in the system, shown in Figure 3, can be divided into four blocks: Transmission Generation, MEMS Acquisition, Signal Processing, and Detection.
The Transmission Generation block synthesizes a pulsed multitone signal to be sent through the DA converter to the signal amplifier and from there to the loudspeaker which will output the transmitted signal.
In the Acquisition block, implemented in the FPGA, each MEMS microphone acquired the reflected signal with a PDM interface. This interface internally incorporates a one-bit sigma-delta converter with a sampling frequency of 2 MHz. In this block, a common clock signal is generated for all 150 MEMS to read signals simultaneously via the digital inputs of the FPGA.
In the Signal Processing block, three routines are implemented:
  • Delay and Sum beamforming: a discrete set of beams that cover the surveillance space in azimuth is generated for a fixed elevation angle. Each of these beams is implemented using a Delay and Sum beamformer, which essentially delays specifically each of the 150 microphone signals and sums them together, so that the array points in a specific direction in the surveillance space. A diagram of a beamformer can be observed in Figure 4a.
  • Decimation and Filtering: applying downsampling techniques, based on decimation and filtering [51], 150 independent signals are obtained and the sampling frequency is reduced from 2 MHz to 50 kHz.
  • Matched filter: A matched filter is then applied to the decimated signal to maximize the SNR at the input of the detection block.
The Detection block implements three processes:
  • First, the relative maxima for each of the beams are identified and a list of potential targets is generated.
  • Then, all targets that are outside the detection lane are eliminated, while those whose distance is within the surveillance range of the system [Rmin, Rmax] are selected, as shown in Figure 4b.
  • Finally, the selected targets are processed by a CFAR (Constant False Alarm Rate) detector [52]. The CFAR detector compares the energy of the potential target with a dynamic threshold that is proportional to the average of the energy received in a set of cells close to the evaluated one, where a number of cells contiguous to the evaluated one, called guard cells, have been excluded. Its scheme can be observed in Figure 4c. The CFAR threshold has been obtained by varying the CFAR gain according to the equation: −n
    T h r e s h o l d [ n ] = k · 1 N { m = n n 2 n n 1 x [ m ] + m = n + n 1 n + n 2 x [ m ] }
    where n1 and n2 determine the indices of the cells where the threshold is evaluated. Specifically, the threshold is evaluated in 2 × (n2 − n1) cells that are n1 cells away from the cell under test. On the other hand, k is the parameter that allows weighting the relationship between the Detection Probability and the False Alarm Probability. The values of n1 and n2 are calculated as a function of the transmitted pulse width.

3. Results

3.1. Test Setup

For the analysis of the system, a compact vehicle has been assumed on a normal road surface and an ABS braking system with a maximum deceleration of 0.8 g ms−2. A 4 m wide road has been used, with street lamps and trees along its edges, as shown in Figure 5a.
Taking into account that the typical vehicle speed limits in urban environments are between 30 km/h and 50 km/h, the minimum braking distance of a vehicle travelling at these speeds was calculated, as shown in Table 1. On the basis of these braking distances, a set of experiments has been designed, placing a person at six distances from the system: 5 m, 7.5 m, 10 m, 12.5 m, 15 m, and 20 m, defining the six detection distances to be considered on the analysis of the system.
It can be observed that for the speed of 30 km/h, all the defined test distances would be valid, i.e., if a pedestrian is detected at these distances, the AEB system would be able to brake the vehicle so that no collision would occur. For the speed of 40 km/h, only the defined test distances of 10 m and above would be adequate to avoid a collision between the vehicle and the pedestrian. For the speed of 50 km/h, only test distances defined from 15 m would be adequate. In the cases for detection distances shorter than the braking distance, it has been confirmed by simulations with CarSim [53], that although the vehicle would collide with the pedestrian, the collision would occur at a sufficiently low speed so that the impact would not cause serious injuries to the pedestrian.
The time required to detect a pedestrian is the time required for the transmitted signal to propagate to the pedestrian and be reflected back to the sensor. In addition, the time needed for signal processing (beamformer, filters, and detector) has to be added. The scenario configuration has a maximum range of 25 m, which, assuming a propagation speed of 343 m/s, takes 145 ms for the acoustic signal to travel. Signal processing is performed on the FPGA in real time simultaneously with the acquisition. At the end of the acquisition, the processed signal is already formed for all 11 beams. Additionally, 25 ms are required for the detection process to be implemented in the embedded multicore processor. Therefore, the system is capable of performing five detections per second, as the total time required for each detection is less than 200 ms.
For the tests carried out, a transmission signal was generated consisting of a 3 ms pulse composed of eight discrete frequencies between 14 kHz and 21 kHz, with a spacing of 1 kHz. Several frequencies have been used to improve the probability of detection, since depending on the physical characteristics and clothing of pedestrians, reflectivity varies with frequency [54]. Frequencies below 14 kHz have not been used because this would result in beam broadening, which would reduce the spatial resolution of the system. The pulse width of 3 ms was chosen as a compromise between the range resolution, which is inversely proportional to the pulse width, and the transmitted energy. The signal was generated with a tweeter loudspeaker with a flat frequency response (±1.5 dB) between 10 kHz and 24 kHz.

3.2. Scenario Analysis

For a person located at 10 m (Figure 5a), an acoustic image was obtained at the operating frequency of 20 kHz, generating 120 beams of 4° of beamwidth with steering azimuth angles from −60° to 60°, with 1° resolution and for a surveillance range between 4 and 25 m. In this case, Figure 5b represents the obtained acoustic image in azimuth/range space where the red crosses represent the detected targets. In the image, it is clear that the system detects both the person, placed at around 10 m, and other objects in front of it, such as lampposts and trees on the roadside. Table 2 shows the targets detected, with their position, expressed in range and azimuth. Comparing Figure 5a,b, it can be observed that in the acoustic image (Figure 5b) two detections appear that are not represented in the test scenario (Figure 5a). These targets actually exist in the scenario but do not appear in Figure 5a because the picture is not taken with a wide-angle camera.
In Figure 5b, it can also be visualized the lane boundaries, represented by a dashed green line. As shown in Figure 5b, most of the detected targets are outside the road lane and should therefore not be taken into consideration by our system. It is therefore justified to perform a selection of the targets that are inside the road in the preset surveillance range, from 5 to 25 m. This operation is performed by the detection algorithm (Figure 4c). After applying the lane filter, only the pedestrian remains as a detected target.

3.3. Detector Performance Analysis

In a real system, it is not necessary to know with such a high resolution the position of a pedestrian and on the other hand fast update rates between 5 and 10 frames per second are required, and it is advisable to work with a smaller number of beams.
First, the angles of excursion for the detection system must be determined. Assuming that the road has a width of d meters and that the car drives in the center, at a distance R, the angle of maximum excursion will be a function of distance R according to the equation:
θ m a x = a t a n 2 ( d / 2 R )
In turn, the array used has a beamwidth (Δθ) of 4° at 3 dB for 20 kHz; therefore, the number of beams required is:
N b e a m s = f l o o r { 2 · θ m a x Δ θ }
Combining Equations (2) and (3) gives the number of beams required for the distances defined in the test scenario. So, since the system has to detect any pedestrian in the range between 5 and 20 m, the minimum value defined for the distance R shall be used, giving a number of working beams of 11. In this case, the steering angles of the defined beams are equispaced between −20° and 20°, with a separation of 4°. Figure 6 shows the radiation patterns for the 11 defined beams, where it can be observed that they cut off at 3 dB and cover the scan angles between −22° and 22°.
The analysis of the CFAR detector has been carried out as a function of the associated threshold gain k. This value has varied between 3 and 10, with increments of 0.01. Several experiments have been carried out to obtain the values of the guard and reference window cells of this CFAR detector, n1 and n2, respectively. The values of these cells’ range have been determined based on the pulse width used. In this case, given that the signal pulse used lasts 3 ms, which corresponds to a value in range of 100 cm, both the guard cells and the reference window cells were calculated for an initial 100 cm value. After the study carried out to find suitable values for these cells, a guard cell value of 200 cm and a reference cell value of 300 cm have been selected.
Once the operating parameters of the different algorithms to be used in the system have been defined, 1000 experiments were carried out for each of the six defined test scenarios, placing the pedestrian on the center of the lane. Figure 7 below shows the overall Detection Probability (Pd) and False Alarm Probability (Pfa) obtained for the system. In Figure 8, the Detection Probability (Figure 8a) and the False Alarm Probability (Figure 8b) are displayed in detail for each scenario, as a function of the k value that controls the threshold of the CFAR detector.
Figure 7 shows that both the Detection Probability and the False Alarm Probability are monotonically decreasing functions with k. Based on RADAR theory, according to the Neyman-Pearson Observer theorem [52], the optimal detection threshold, and hence the value of the Detection Probability of the system, is obtained from a preset False Alarm Probability value. In this case, a False Alarm Probability value of 10−2 has been set, as this is a typical value used for in-vehicle detection systems. Based on this value of the False Alarm Probability, the threshold value obtained corresponds to k = 4.91, which in turn defines a value for the Detection Probability of 0.995. For this type of application, where a person’s life is at risk, the obtained Detection Probability is adequate. Furthermore, it should be taken into account that in these systems a False Alarm confirmation procedure is usually established, which allows the false detections obtained to be discarded, and therefore the value of the False Alarm Probability defined, which may seem high, is adequate in practice.
In addition to the overall behavior of the system, the Detection Probability and the False Alarm Probability have also been analyzed for each of the scenarios defined, shown in Figure 8. Analyzing the behavior of the Detection Probability as a function of distance (Figure 8a), it can be seen that for distances of 5 and 7.5 m, values above the Global Detection Probability are obtained (Figure 7); for distances of 15 and 20 m, the Detection Probability values obtained are around the global value; and finally, for distances of 10 and 12.5 m, their Detection Probability values are below the global value. In principle, this probability should decrease as the distance increases, because the SNR decreases. However, this behavior is not observed in the system detector, due to the existence of reflections from the multiple targets located at the road boundaries, which interfere with the reflections from the pedestrian itself.
Figure 8b shows that False Alarm Probabilities are different depending on the distance of the pedestrian when the detection threshold is low. These false detections are due to the sidelobes of the array beampattern, through which reflections from objects close to the lane are received. Depending on the relative position between the pedestrian and the nearby objects, destructive/constructive interference arises and influences the average energy estimation estimated by the CFAR in the detection environment. By increasing the detection threshold, this uneven behavior disappears as the CFAR detector eliminates these false detections. In this work, the experiments have been evaluated independently, and it has not been taken into account that in a dynamic environment, the detections of each experiment can be validated in subsequent experiments by confirming them as detections or discarding them as False Alarms, reducing the False Alarm Probability significantly.
With the value of the threshold obtained (k = 4.91), the values of the Probability of Detection and the Probability of False Alarm for each of the scenarios have been calculated individually and are shown in Table 3. In view of these, it can be seen that, for certain scenarios, the value of the Probability of False Alarm exceeds the established limit value of 10−2, although for other scenarios, it is lower. To ensure that the limit value for the Probability of False Alarm is met in all scenarios, it has been calculated that the value of k associated with the CFAR threshold to be considered should be 5.07. For this new value of k, it can be seen in Table 3 that all the values of the False Alarm Probability are below it. At the same time, it can also be observed that the average value of the Probability of Detection has decreased by only 0.3%, while the average value of the Probability of False Alarm has decreased by 50%.
In summary, a characterization of the detector has been made, both jointly for the six test scenarios and for each one independently, defining two possible detection thresholds: the first one based on the joint characterization and the second one based on considering the worst case of all the scenarios.

4. Conclusions

In this work, an active acoustic system has been presented that, by means of a 2D array of MEMS microphones and a processing architecture based on an FPGA/microprocessor, can estimate the 2D position of a person up to a distance of 20 m and prevent a car from colliding with a pedestrian up to speeds of 50 km/h. For higher speeds, the vehicle’s AEB system, which will interact with the presented system, can mitigate the impact with the pedestrian, avoiding fatal injuries. By estimating the 2D position of the pedestrian, the system could even warn the vehicle’s steering system to avoid the pedestrian.
The analysis of the system for a set of scenarios with people at different distances has shown the feasibility of the system. A processing scheme based on beamforming, range and lane filters, and finally a powerful CFAR detection algorithm has been developed, where the operating parameters of a False Alarm Probability lower than 0.01 and a Probability of Detection higher than 0.99, acquired by more than 6000 experiments, have been obtained.
This system can be fused with existing camera, LIDAR, and RADAR based pedestrian detection systems, decreasing the False Alarm Probability and increasing the joint Detection Probability. Finally, this system can work in low visibility situations: rain, fog, smoke, night, etc., where cameras/LIDAR are not operational; as well as improve the detection of a person by a RADAR system, since the radar section of people is usually very low compared to their acoustic section.

Author Contributions

Conceptualization, A.I., J.J.V. and L.d.V.; methodology, A.I. and L.d.V.; software, J.J.V.; validation, A.I., L.d.V. and J.J.V.; data curation, A.I.; writing—original draft preparation, A.I. and L.d.V.; writing—review and editing, J.J.V. and L.d.V. All authors have read and agreed to the published version of the manuscript.


This research was funded by Ministerio de Ciencia, Innovación y Universidades, grant number RTI2018-095143-B-C22.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Informed consent was obtained from all subjects involved in the study.

Data Availability Statement

The data used in this work may be requested by sending an e-mail to one of the authors of the article.

Conflicts of Interest

The authors declare no conflict of interest.


  1. World Health Organization. Global Status Report on Road Safety 2018: Summary; Technical Documents; WHO: Geneva, Switzerland, 2018; Available online: (accessed on 2 February 2021).
  2. Aghabayk, K.; Esmailpour, J.; Jafari, A.; Shiwakoti, N. Observational-based study to explore pedestrian crossing behaviors at signalized and unsignalized crosswalks. Accid Anal. Prev. 2021, 151, 105990. [Google Scholar] [CrossRef]
  3. Rasouli, A.; Kotseruba, I.; Tsotsos, J.K. Understanding pedestrian behavior in complex traffic scenes. IEEE Trans Intell. Veh. 2018, 3, 61–70. [Google Scholar] [CrossRef]
  4. Zhang, H.; Liu, Y.; Wang, C.; Fu, R.; Sun, Q.; Li, Z. Research on a pedestrian crossing intention recognition model based on natural observation data. Sensors 2020, 20, 1776. [Google Scholar] [CrossRef][Green Version]
  5. Lozano-Domínguez, J.M.; Mateo-Sanguino, T.J. Walking secure: Safe routing planning algorithm and pedestrian’s crossing intention detector based on fuzzy logic app. Sensors 2021, 21, 529. [Google Scholar] [CrossRef]
  6. Branquinho, J.; Senna, C.; Zúquete, A. An efficient and secure alert system for VANETs to improve crosswalks’ security in smart cities. Sensors 2020, 20, 2473. [Google Scholar] [CrossRef]
  7. Vourgidis, I.; Maglaras, L.; Alfakeeh, A.S.; Al-Bayatti, A.H.; Ferrag, M.A. Use of smartphones for ensuring vulnerable road user safety through path prediction and early warning: An in-depth review of capabilities, limitations and their applications in cooperative intelligent transport systems. Sensors 2020, 20, 997. [Google Scholar] [CrossRef][Green Version]
  8. Wang, P.; Motamedi, S.; Qi, S.; Zhou, X.; Zhang, T.; Chan, C.Y. Pedestrian interaction with automated vehicles at uncontrolled intersections. Transp. Res. Part F Traffic Psychol. Behav. 2021, 77, 10–25. [Google Scholar] [CrossRef]
  9. Shan, M.; Narula, K.; Wong, Y.F.; Worrall, S.; Khan, M.; Alexander, P.; Nebot, E. Demonstrations of cooperative perception: Safety and robustness in connected and automated vehicle operations. Sensors 2021, 21, 200. [Google Scholar] [CrossRef]
  10. Khalifa, A.B.; Alouani, I.; Mahjoub, M.A.; Rivenq, A. A novel multi-view pedestrian detection database for collaborative Intelligent Transportation Systems. Future Gener. Comput. Syst. 2020, 113, 506–527. [Google Scholar] [CrossRef]
  11. Kim, B.J.; Lee, S.B. A Study on the Evaluation Method of Autonomous Emergency Vehicle Braking for Pedestrians Test Using Monocular Cameras. Appl. Sci. 2020, 10, 4683. [Google Scholar] [CrossRef]
  12. Yang, W.; Zhang, X.; Lei, Q.; Cheng, X. Research on longitudinal active collision avoidance of autonomous emergency braking pedestrian system (AEB-P). Sensors 2019, 19, 4671. [Google Scholar] [CrossRef][Green Version]
  13. Lenard, J.; Welsh, R.; Danton, R. Time-to-collision analysis of pedestrian and pedal-cycle accidents for the development of autonomous emergency braking systems. Accid. Anal. Prev. 2018, 115, 128–136. [Google Scholar] [CrossRef][Green Version]
  14. Park, M.; Lee, S.; Kwon, C.; Kim, S. Design of pedestrian target selection with funnel map for pedestrian AEB system. IEEE Trans. Veh. Technol. 2017, 66, 3597–3609. [Google Scholar] [CrossRef]
  15. Jeppsson, H.; Östling, M.; Lubbe, N. Real life safety benefits of increasing brake deceleration in car-to-pedestrian accidents: Simulation of Vacuum Emergency Braking. Accid. Anal. Prev. 2018, 111, 311–320. [Google Scholar] [CrossRef]
  16. Lenard, J.; Badea-Romero, A.; Danton, R. Typical pedestrian accident scenarios for the development of autonomous emergency braking test protocols. Accid Anal. Prev. 2014, 73, 73–80. [Google Scholar] [CrossRef][Green Version]
  17. Song, Z.Q.; Cao, L.B.; Chou, C.C. Development of test equipment for pedestrian-automatic emergency braking based on C-NCAP (2018). Sensors 2020, 20, 6206. [Google Scholar] [CrossRef] [PubMed]
  18. Zhang, S.; Bauckhage, C.; Cremers, A.B. Efficient pedestrian detection via rectangular features based on a statistical shape model. IEEE Trans Intell. Transp. Syst. 2015, 16, 763–775. [Google Scholar] [CrossRef]
  19. Bu, F.; Le, T.; Du, X.; Vasudevan, R.; Johnson-Roberson, M. Pedestrian Planar LiDAR Pose (PPLP) network for oriented pedestrian detection based on planar LiDAR and monocular images. IEEE Rob. Autom. Lett. 2020, 5, 1626–1633. [Google Scholar] [CrossRef]
  20. Cao, J.; Song, C.; Peng, S.; Song, S.; Zhang, X.; Shao, Y.; Xiao, F. Pedestrian detection algorithm for intelligent vehicles in complex scenarios. Sensors 2020, 20, 3646. [Google Scholar] [CrossRef] [PubMed]
  21. Zhang, M.; Fu, R.; Cheng, W.; Wang, L.; Ma, Y. An approach to segment and track-based pedestrian detection from four-layer laser scanner data. Sensors 2019, 19, 5450. [Google Scholar] [CrossRef][Green Version]
  22. Han, T.; Wang, L.; Wen, B. The kernel based multiple instances learning algorithm for object tracking. Electronics 2018, 7, 97. [Google Scholar] [CrossRef][Green Version]
  23. Dolatabadi, M.; Elfring, J.; van de Molengraft, R. Multiple-joint pedestrian tracking using periodic models. Sensors 2020, 20, 6917. [Google Scholar] [CrossRef]
  24. Fan, Y.; Sun, Z.; Zhao, G. A Coarse-to-fine framework for multiple pedestrian crossing detection. Sensors 2020, 20, 4144. [Google Scholar] [CrossRef]
  25. Wang, X.; Hua, X.; Xiao, F.; Li, Y.; Hu, X.; Sun, P. Multi-object detection in traffic scenes based on improved SSD. Electronics 2018, 7, 302. [Google Scholar] [CrossRef][Green Version]
  26. Lin, G.T.; Shivanna, V.M.; Guo, J.I. A Deep-learning model with task-specific bounding box regressors and conditional back-propagation for moving object detection in ADAS applications. Sensors 2020, 20, 5269. [Google Scholar] [CrossRef]
  27. Wu, Q.; Gao, T.; Lai, Z.; Li, D. Hybrid SVM-CNN classification technique for human–vehicle targets in an automotive LFMCW radar. Sensors 2020, 20, 3504. [Google Scholar] [CrossRef] [PubMed]
  28. Wang, R.; Cui, Y.; Song, X.; Chen, K.; Fang, H. Multi-information-based convolutional neural network with attention mechanism for pedestrian trajectory prediction. Image Vis. Comput. 2021, 107, 104110. [Google Scholar] [CrossRef]
  29. Liu, Y.; Ma, J.; Wang, Y.; Zong, C. A Novel algorithm for detecting pedestrians on rainy image. Sensors 2021, 21, 112. [Google Scholar] [CrossRef] [PubMed]
  30. Jeong, M.; Ko, B.C.; Nam, J. Early detection of sudden pedestrian crossing for safe driving during summer nights. IEEE Trans. Circuits Syst. Video Technol. 2017, 27, 1368–1380. [Google Scholar] [CrossRef]
  31. Kim, T.; Kim, S. Pedestrian detection at night time in FIR domain: Comprehensive study about temperature and brightness and new benchmark. Pattern Recognit. 2018, 79, 44–54. [Google Scholar] [CrossRef]
  32. Miclea, R.C.; Dughir, C.; Alexa, F.; Sandru, F.; Silea, I. Laser and LIDAR in a system for visibility distance estimation in fog conditions. Sensors 2020, 20, 6322. [Google Scholar] [CrossRef]
  33. Goodin, C.; Carruth, D.; Doude, M.; Hudson, C. Predicting the influence of rain on LIDAR in ADAS. Electronics 2019, 8, 89. [Google Scholar] [CrossRef][Green Version]
  34. Piniarski, K.; Pawłowski, P.; Dąbrowski, A. Tuning of classifiers to speed-up detection of pedestrians in infrared images. Sensors 2020, 20, 4363. [Google Scholar] [CrossRef] [PubMed]
  35. Kwak, J.; Ko, B.C.; Nam, J.Y. Pedestrian tracking using online boosted random ferns learning in far-infrared imagery for safe driving at night. IEEE Trans. Intell. Transp. Syst. 2017, 18, 69–81. [Google Scholar] [CrossRef]
  36. Shopovska, I.; Jovanov, L.; Philips, W. Deep visible and thermal image fusion for enhanced pedestrian visibility. Sensors 2019, 19, 3727. [Google Scholar] [CrossRef][Green Version]
  37. Wei, P.; Cagle, L.; Reza, T.; Ball, J.; Gafford, J. LiDAR and camera detection fusion in a real-time industrial multi-sensor collision avoidance system. Electronics 2018, 7, 84. [Google Scholar] [CrossRef][Green Version]
  38. King, E.A.; Tatoglu, A.; Iglesias, D.; Matriss, A. Audio-visual based non-line-of-sight sound source localization: A feasibility study. Appl. Acoust. 2021, 171, 107674. [Google Scholar] [CrossRef]
  39. Van Trees, H. Optimum Array Processing: Part IV of Detection, Estimation and Modulation Theory; John Wiley & Sons: New York, NY, USA, 2002. [Google Scholar]
  40. Van Veen, B.D.; Buckley, K.M. Beamforming: A versatile approach to spatial filtering. IEEE ASSP Mag. 1988, 5, 4–24. [Google Scholar] [CrossRef]
  41. Beeby, S.; Ensell, G.; Kraft, K.; White, N. MEMS Mechanical Sensors; Artech House Publishers: Norwood, MA, USA, 2004. [Google Scholar]
  42. Scheeper, P.R.; van der Donk, A.G.H.; Olthuis, W.; Bergveld, P. A review of silicon microphones. Sens. Actuators A Phys. 1994, 44, 1–11. [Google Scholar] [CrossRef][Green Version]
  43. Da Silva, B.; Segers, L.; Braeken, A.; Steenhaut, K.; Touhafi, A. Design exploration and performance strategies towards power-efficient FPGA-based architectures for sound source localization. J. Sens. 2019, 2019, 1–27. [Google Scholar] [CrossRef]
  44. Kournoutos, N.; Cheer, J. Investigation of a directional warning sound system for electric vehicles based on structural vibrations. J. Acoust. Soc. Am. 2020, 148, 588–598. [Google Scholar] [CrossRef] [PubMed]
  45. Zhang, Y.K.X.; Yang, B. Traffic flow detection using thermopile array sensor. IEEE Sens. J. 2020, 20, 5155–5164. [Google Scholar] [CrossRef]
  46. Izquierdo, A.; Del Val, L.; Villacorta, J.J.; Zhen, W.; Scherer, S.; Fang, Z. Feasibility of discriminating UAV propellers noise from distress signals to locate people in enclosed environments using MEMS microphone arrays. Sensors 2020, 20, 597. [Google Scholar] [CrossRef] [PubMed][Green Version]
  47. Izquierdo, A.; Villacorta, J.J.; Del Val, L.; Suárez, L. Design and evaluation of a scalable and reconfigurable multi-platform system for acoustic imaging. Sensors 2016, 16, 1671. [Google Scholar] [CrossRef] [PubMed][Green Version]
  48. Del Val, L.; Izquierdo-Fuente, A.; Villacorta, J.J.; Raboso, M. Acoustic biometric system based on preprocessing techniques and linear support vector machines. Sensors 2015, 15, 14241–14260. [Google Scholar] [CrossRef][Green Version]
  49. SPH0641LU4H-1 MEMS Microphone. Available online: (accessed on 30 April 2021).
  50. sbRIO-9629 Platform. Available online: (accessed on 30 April 2021).
  51. Park, S. Principles of Sigma-Delta Modulation for Analog-to-Digital Converters. Available online: (accessed on 28 April 2021).
  52. Skolnik, M.I. Introduction to RADAR Systems, 3rd ed.; McGraw-Hill Education: New York, NY, USA, 2002. [Google Scholar]
  53. Sistema de Simulación de Prestaciones de Vehículos CarSim. Available online: (accessed on 4 May 2021).
  54. Izquierdo, A.; Del Val, L.; Jiménez, M.I.; Villacorta, J.J. Performance evaluation of a biometric system based on acoustic images. Sensors 2011, 11, 9499–9519. [Google Scholar] [CrossRef][Green Version]
Figure 1. System MEMS array.
Figure 1. System MEMS array.
Sensors 21 04162 g001
Figure 2. Hardware setup diagram.
Figure 2. Hardware setup diagram.
Sensors 21 04162 g002
Figure 3. Processing algorithms.
Figure 3. Processing algorithms.
Sensors 21 04162 g003
Figure 4. (a) Delay and Sum beamformer. (b) Lane window. (c) CFAR detector.
Figure 4. (a) Delay and Sum beamformer. (b) Lane window. (c) CFAR detector.
Sensors 21 04162 g004
Figure 5. (a) Test scenario with a person placed at 10 m. (b) Acoustic image with detected targets.
Figure 5. (a) Test scenario with a person placed at 10 m. (b) Acoustic image with detected targets.
Sensors 21 04162 g005aSensors 21 04162 g005b
Figure 6. Beampatterns of the 11 defined beams.
Figure 6. Beampatterns of the 11 defined beams.
Sensors 21 04162 g006
Figure 7. Global Detection Probability and False Alarm Probability of the system.
Figure 7. Global Detection Probability and False Alarm Probability of the system.
Sensors 21 04162 g007
Figure 8. (a) Detection Probability and (b) False Alarm Probability, for the test scenarios.
Figure 8. (a) Detection Probability and (b) False Alarm Probability, for the test scenarios.
Sensors 21 04162 g008
Table 1. Braking distances for the considered vehicle speeds, with a 0.8 g deceleration.
Table 1. Braking distances for the considered vehicle speeds, with a 0.8 g deceleration.
Speed [km/h]Braking Distance [m]
Table 2. Position (range and azimuth) of the detected targets.
Table 2. Position (range and azimuth) of the detected targets.
IDTargetRange [m]Azimuth [°]
Table 3. Detection Probability and False Alarm Probability for the test scenarios, for k = 4.91 and k = 5.07.
Table 3. Detection Probability and False Alarm Probability for the test scenarios, for k = 4.91 and k = 5.07.
R [m]k = 4.91k = 5.07
Average value0.9950.0100.9920.005
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Izquierdo, A.; Val, L.d.; Villacorta, J.J. Feasibility of Using a MEMS Microphone Array for Pedestrian Detection in an Autonomous Emergency Braking System. Sensors 2021, 21, 4162.

AMA Style

Izquierdo A, Val Ld, Villacorta JJ. Feasibility of Using a MEMS Microphone Array for Pedestrian Detection in an Autonomous Emergency Braking System. Sensors. 2021; 21(12):4162.

Chicago/Turabian Style

Izquierdo, Alberto, Lara del Val, and Juan J. Villacorta. 2021. "Feasibility of Using a MEMS Microphone Array for Pedestrian Detection in an Autonomous Emergency Braking System" Sensors 21, no. 12: 4162.

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop