Experimental Evaluation of Distortion in Amplitude Modulation Techniques for Parametric Loudspeakers

: Parametric loudspeakers can generate a highly directional beam of sound, having applications in targeted audio delivery. Audible sound modulated into an ultrasonic carrier will get self-demodulated along the highly directive beam due to the non-linearity of air. This non-linear demodularization should be compensated to reduce audio distortion, di ﬀ erent amplitude modulation techniques have been developed during the last years. However, some studies are only theoretical whereas others do not analyze the audio distortion in depth. Here, we present a detailed experimental evaluation of the frequency response, harmonic distortion and intermodulation distortion for various amplitude modulation techniques applied with di ﬀ erent indices of modulation. We used a simple method to measure the audible signal that prevents the saturation of the microphones when the high levels of the ultrasonic carrier are present. This work could be useful for selecting predistortion techniques and indices of modulation for regular parametric arrays.


Introduction
Parametric loudspeakers exploit the non-linear behavior of acoustic waves travelling through air to generate audible sound along a highly directive path due to the self-demodulation property of finite-amplitude ultrasonic waves [1]. The audible components are more directional than sounds produced by conventional loudspeakers, hence they can find application in contexts where audio must be targeted precisely in space. The directional nature of parametric speakers has been used for directing users towards specific objects [2], and a hand-held directional speaker was used to provide targeted information about the objects pointed by the user [3]. Additionally, sound landscapes in which the audience receives sound stimuli from specific locations can be created with directional speakers [4]. In general, directional speakers enable the targeted delivery of audio for applications in advertising, dual-language systems or notifications [5].
In 1963, Westervelt [6] theoretically described the generation of difference frequency waves from two high-frequency collimated beams referred to as primary waves. Berktay [7] extended this approach and evaluated some possible applications in underwater acoustic transmission. His analysis was not limited to two primary waves and could be applied to a single self-demodulated primary wave. If the primary wave p 1 is a generic carrier modulated in amplitude such that: where P 0 and ω c are the amplitude and the angular frequency of the carrier, and E(t) is the envelope function, then Berktay's farfield solution predicts a self-demodulated wave p 2 proportional to the second time derivative of the square of the modulation envelope: This dependence implies that the self-demodulated wave (p 2 ) is not linear to E(t) and that it will suffer from high levels of distortion due to the generated harmonics and a strong low-pass equalization. Various preprocessing techniques have been developed to reduce distortion using different modulations of the envelope E(t). Existing experimental measurements [8][9][10] are mostly based on the total harmonic distortion or at certain representative frequencies, which may not completely capture the non-linear response of the speakers. Here, we present an extensive comparison of the different amplitude modulation techniques under various modulation indices in terms of frequency response, harmonic distortion and intermodulation distortion. This study analyzes the intermodulation distortion and the harmonics distortions of the different amplitude modulation techniques for various modulation indices. Furthermore, the analysis is split by order. To avoid the presence of spurious signal in the measurements, our method previously selects a dynamic microphone with limited frequency response.
In the Section 2 we describe: the experimental setup, the measurement procedure, the method employed to measure audible sound in the presence of high-levels of ultrasonic carrier and the evaluated amplitude modulation techniques. In the Section 3, we report and analyze the response and distortion of the different modulation techniques and indices. We conclude by summarizing the results and their potential implications.

Experimental Setup
A PC (Intel Xeon with 16Gb of RAM, Intel Corporation, Santa Clara, California, USA) was connected to the audio output card (Focusrite Scarlett 18i20, Focusrite plc. High Wycombe, Buckinghamshire, England) for general audio input/output. For emitting audible sound, the card output was connected to an auto-amplified monitor (Neumann KH120A, Georg Neumann GmbH, Berlin, Germany). For the ultrasonic output, the card was connected into an amplifier (Akozon DC12-24V 2 × 100 W power amplifier, 14-100 KHz, Akozon, Shenzhen, Guangdong, China) and then into a parametric array shown in Figure 1 (array SSCI-018425 made of 49 transducers, Switch Science Inc., Tokyo, Japan). The largest output voltage amplitude of the amplifier was 24 Vpp, which was sufficient and within the normal operating voltage of the transducers forming the array. 1 envelope function, then Berktay's farfield solution predicts a self-demodulated wave 2 proportional to the second time derivative of the square of the modulation envelope: 4 This dependence implies that the self-demodulated wave ( ) is not linear to ( ) and that it will 5 suffer from high levels of distortion due to the generated harmonics and a strong low-pass 6 equalization.

31
Tokyo, Japan). The largest output voltage amplitude of the amplifier was 24 Vpp, which was 32 sufficient and within the normal operating voltage of the transducers forming the array.

36
For receiving audio, a microphone (different models were employed) was placed 2 meters away 37 from the sound source. Except for the GRAS microphone, which needed the Norsonic type 335 frontend for signal conditioning, the microphones were directly connected to the input of the sound card. For receiving audio, a microphone (different models were employed) was placed 2 meters away from the sound source. Except for the GRAS microphone, which needed the Norsonic type 335 front-end for signal conditioning, the microphones were directly connected to the input of the sound card. The measurements were taken in the listening room of an acoustics laboratory with different absorbent materials: foam, melamine foam and regenerated cotton bonded with thermosetting resin. The reverberation times were less than 0.25 s from the third octave band of 320 Hz.

Measurement Procedure
The frequency response of a loudspeaker describes the range of audible frequencies that it can reproduce. For conventional devices, it is usually measured in an anechoic chamber where the loudspeaker under test is excited by a sweep signal x(t). This sweep typically ranges from 20 Hz to 20 kHz. The signal emitted by the speaker was then recorded with a flat response microphone to obtain the signal y(t). By means of FFT techniques, the impulse response h(t) and its corresponding transfer function were obtained: For parametric loudspeakers, the process encompasses more stages. First, an upsampling of the excitation signal was performed to avoid aliasing when moving to the ultrasonic range. Then, the chosen predistortion technique was applied to the signal and emitted through the speaker. Afterwards, a microphone recorded the signal and a bandpass filter was applied. Subsequent downsampling was used to obtain the signal in the audible range. This signal was compared with the original excitation signal. This process is summarized in Figure 2. The spectral representation of the signal at each step of the process can be found in Supplementary Figure S2.
Appl. Sci. 2020, 10, x FOR PEER REVIEW 3 of 11 The measurements were taken in the listening room of an acoustics laboratory with different 1 absorbent materials: foam, melamine foam and regenerated cotton bonded with thermosetting resin.

2
The reverberation times were less than 0.25 s from the third octave band of 320 Hz.
For parametric loudspeakers, the process encompasses more stages. First, an upsampling of the 12 excitation signal was performed to avoid aliasing when moving to the ultrasonic range. Then, the 13 chosen predistortion technique was applied to the signal and emitted through the speaker.
14 Afterwards, a microphone recorded the signal and a bandpass filter was applied. Subsequent

28
All transfer functions and distortion measurements described in the paper have been performed   The International Standard IEC 60268-21 [11] was approved in 2018 and specifies measurement methods to evaluate the transfer behavior of a device under test (DUT). It can be applied to electroacoustic transducers, active and passive sound systems, amplified speakers, televisions, portable audio devices, car sound systems or professional equipment. Therefore, the DUT can contain components that perform signal processing before the transduction of the electrical signal into an acoustical output signal radiated by the passive actuators. This capability makes this standard suitable for evaluating parametric loudspeakers.
All transfer functions and distortion measurements described in the paper have been performed following this standard. Some of the measurements and post-processing has been carried out with the help of the ITA-Toolbox, an open source toolbox for Matlab [12].

Audio Measurements in the Presence of a High-Level Ultrasonic Carrier
The high level of the ultrasonic primary wave encumbers the measurement of the demodulated audible secondary signal. Carrier levels above 120 dB are common even at distances of 2 meters and generate spurious signals in the receiving system [13]. These spurious signals are not perceived by the human ear and thus it is desirable to filter them out.
Some methods proposed to avoid the appearance of the spurious signal are based on the use of acoustic filters [14][15][16][17][18][19][20][21] made of a thin plastic film or phononic crystals with a bandgap at the carrier frequency. These filters are mounted in front of the receiving transducer and reduce the amplitude of the primary wave. However, the audible frequencies may also be affected by this method, especially the high ones. The response in the audible range can be estimated without using filters by measuring in the secondary lobes, but the position of the lobes varies with the frequency. Recent methods to reduce the spurious signal based on phase-cancellation [22,23] are only suitable for axial measurements.
Other techniques use a condenser microphone orthogonal to the incident wave where its ultrasonic sensitivity is lower [24]. Thereby, the amplitude of the recorded ultrasonic waves is reduced while preserving the sound level in the audible range. This effectively prevents the generation of spurious signals.
Based on the previous principle, we explored the use of a microphone with reduced ultrasonic sensitivity but good audio performance. Two condenser (GRAS AC40 -GRO, G.R.A.S. Sound & Vibration, Holte, Denmark, Behringer ECM8000 -BHO, Behringer International GmbH, Willich, Germany) and three dynamic (Shure SM57 -SH7, Shure Inc. Niles, Illinois, USA, Behringer XM8500 -BHX and BCT MD1 -BCT, Behringer International GmbH, Willich, Germany) microphones were selected for the study. Detailed specification can be found in Supplementary Table SI. The frequency response of the microphones in the audible range was measured by emitting sweeps with a monitor (Neumann KH120A, Georg Neumann GmbH, Berlin, Germany) with flat response (±2 dB between 54 Hz and 20 kHz) and is shown in Figure 3a.
Appl. Sci. 2020, 10, x FOR PEER REVIEW 4 of 11 methods to reduce the spurious signal based on phase-cancellation [22,23] are only suitable for axial 1 measurements.

2
Other techniques use a condenser microphone orthogonal to the incident wave where its 3 ultrasonic sensitivity is lower [24]. Thereby, the amplitude of the recorded ultrasonic waves is 4 reduced while preserving the sound level in the audible range. This effectively prevents the 5 generation of spurious signals.

6
Based on the previous principle, we explored the use of a microphone with reduced ultrasonic

18
The same procedure was repeated in the ultrasonic range using the parametric speaker as an  The dynamic microphones (i.e., SH7, BHX and BCT) registered a carrier level reduction of 30 dB.

27
This means that the recorded audio is less likely to be affected by distortion artifacts generated from 28 the high levels of pressure. This effect is illustrated in Figure 4, which shows the frequency responses The same procedure was repeated in the ultrasonic range using the parametric speaker as an emitter. The responses are shown in Figure 3b. The response of the GRO microphone (i.e., upper curve) can be used as an approximation of the normalized frequency response of the ultrasonic emitter measured at 2 m since this microphone had the flattest ultrasonic response. The response of the parametric speaker peaked at 40.9 kHz, with a −3 dB bandwidth of 1.5 kHz and a −6 dB bandwidth of 3.9 kHz. This reduced bandwidth is the cause of a spurious signal being typically smaller at high frequencies. The progressive reduction of the levels at the sidebands, which are used to emit the modulated audio, leads to a partial reduction of the non-linearity acquired by the microphones.
The dynamic microphones (i.e., SH7, BHX and BCT) registered a carrier level reduction of 30 dB. This means that the recorded audio is less likely to be affected by distortion artifacts generated from the high levels of pressure. This effect is illustrated in Figure 4, which shows the frequency responses obtained with different microphones when the ultrasonic array is emitting with a conventional AM modulation (i.e., DSBAM, described in further sections). For the two condenser microphones, GRO and BHO, the spurious signal was affecting the recordings along most of the frequency range. This was not the case when using the dynamic microphones. Their experimental curves show good agreement with the theoretical curves of a parametric loudspeaker. These curves were proportional to f n , where f is the frequency and n is the ratio between diffraction length and absorption length [16]. Diffraction length is the area of the transmitter divided by the wavelength at the primary frequencies and absorption length is the inverse of the nominal absorption coefficient at the primary frequencies. n = 2 corresponds to the Westervelt solution [6] for a large ratio, n = 1 is a good approximation to the solution obtained by Berktay and Leahy [25] for a small ratio. In practice, the curves lie somewhere between n = 1 and n = 2 [26]. In our experiments, the response of the dynamic microphones was in good agreement with the curve f 1.6 from approximately 500 Hz-4 kHz.
where is the modulation index and has values ranging from 0 to 1. DSBAM is simple to implement 20 and requires low computational resources. In addition, it generates a louder audible signal since it uses both sidebands. However, due to the tone difference between the upper (USB) and lower band (LSB), the second harmonic is also louder.

23
A simple solution to reduce the second harmonic is to transfer the energy to a single band as in 24 the single side band amplitude modulations (SSBAM) [28], which are named LSBAM for the lower  The BHX microphone is considered suitable for evaluating predistortion techniques in view of the results shown in Figures 3 and 4 as well as the usable range of frequencies of the ultrasonic speaker. The rest of the measurements detailed in this article are performed with this microphone.

Preprocessing Techniques
Amplitude modulation (AM) is the most common preprocessing technique in parametric loudspeakers. It consists of changing the amplitude of a relatively high-frequency carrier of angular frequency ω c according to the amplitude of a modulating signal. The envelope of the carrier wave is weighted with the desired audible sound signal s(t).
Double side band amplitude modulation (DSBAM) was the original method used by Yoneyama [27] to produce wideband audio with parametric loudspeakers. The modulated ultrasonic wave p(t) is expressed as where m is the modulation index and has values ranging from 0 to 1. DSBAM is simple to implement and requires low computational resources. In addition, it generates a louder audible signal since it uses both sidebands. However, due to the tone difference between the upper (USB) and lower band (LSB), the second harmonic is also louder. A simple solution to reduce the second harmonic is to transfer the energy to a single band as in the single side band amplitude modulations (SSBAM) [28], which are named LSBAM for the lower or USBAM for the upper sideband. This modulation can be obtained by using high-pass, low-pass filter or adding a quadrature term to the conventional AM. For the latter, a Hilbert filter is needed to convert the modulating signal s(t) into its orthogonal counterpartŝ(t). The modulations are expressed as: p LSBAM (t) = [1 + ms(t)]sin(ω c t) − mŝ(t)cos(ω c t),

p USBAM (t) = [1 + ms(t)]sin(ω c t) + mŝ(t)cos(ω c t). (6)
Inspired by Berktay's solution, a square root operation was added to the AM modulation, leading to the so-called square root amplitude modulation (SRAM): This procedure theoretically removes distortion but introduces infinite harmonics of the original signal s(t). Therefore, the resulting reduction in the distortion is limited by the bandwidth of the ultrasonic emitters [29].
A set of preprocessing techniques based on quadrature amplitude modulation has been proposed to deal with the bandwidth limitation of the emitters [30]. This class of preprocessing techniques referred to as modified amplitude modulation (MAM) can be adapted to the available bandwidth using different orders of the Taylor expansion for the distortion term modulated by the orthogonal carrier. The modulated ultrasonic waves for the first two orders are described as: Due to the low-cost of its implementation, FM-based methods have also been analyzed empirically [31]. However, demodulating FM waves causes higher harmonic distortions at conventional sound pressure levels. There are combined methods that achieve higher sound pressure levels at lower frequencies [32]. We kept the scope of the paper to amplitude modulation techniques, but frequency modulation is a promising technique that could be included in future studies.

Frequency Response
We used an 11-second logarithmic sweep from 18 Hz to 22 kHz as excitation signal for each of the six modulations (using a modulation index of m = 1). Magnitude curves of the transfer functions are shown in Figure 5. The curves were smoothed by applying 1/12th bandwidth spectral averaging. The parametric speaker barely reproduced low frequencies and the response increased by about 12 dB/octave between 300 and 1500 Hz. After 1500 Hz, the transfer functions varied slightly for each modulation. LSBAM showed the largest difference with a marked decrease at 6.5 kHz. This is due to the asymmetric response of the loudspeaker array that also had a marked decrease at approximately 33.5 kHz (which is 6.5 kHz below the carrier frequency of 40 kHz; Figure 2b). This minimum was not present in the USBAM since this modulation only uses the upper sideband. Beyond 10 kHz, all responses dropped abruptly due to the limited frequency response of the recording microphone. Similar curves, with a lower relative level, were obtained for each modulation if the modulation index was reduced.   Using consecutive increments of stationary tones (step-by-step method) is the most common 8 procedure employed to measure distortion of parametric loudspeakers that can be found in the 9 literature [33]. However, in the audio range it is preferable to use the technique based on sweeps proposed by Farina [34] that offers benefits in terms of ease of use, signal-to-noise ratio and immunity 11 against the temporal variation of the DUT. We will use this method.

12
The advantage of measuring impulse responses using sweeps is that the artifacts generated by 13 the harmonic distortion can be eliminated, since these appear at negative times in relation to direct 14 sound, and can be separated from the desired ℎ( ) [35]. In addition, when using a logarithmic sweep 15 as excitation signal, the harmonics will have a constant group delay independent of the frequency 16 when it is deconvoluted with the reference spectrum. They will therefore appear in predictable   29 Figure 6 shows the obtained for all modulations with m = 1. smaller than 5% (i.e.,

Harmonic Distortion
In the audio range, there are different parameters and methods to evaluate the non-linear response of a speaker. The most widespread measure for characterizing the non-linear response of a speaker is the total harmonic distortion (THD) obtained by sinusoidal signals of increasing frequency. Using consecutive increments of stationary tones (step-by-step method) is the most common procedure employed to measure distortion of parametric loudspeakers that can be found in the literature [33]. However, in the audio range it is preferable to use the technique based on sweeps proposed by Farina [34] that offers benefits in terms of ease of use, signal-to-noise ratio and immunity against the temporal variation of the DUT. We will use this method.
The advantage of measuring impulse responses using sweeps is that the artifacts generated by the harmonic distortion can be eliminated, since these appear at negative times in relation to direct sound, and can be separated from the desired h(t) [35]. In addition, when using a logarithmic sweep as excitation signal, the harmonics will have a constant group delay independent of the frequency when it is deconvoluted with the reference spectrum. They will therefore appear in predictable positions of h(t) and can be separated by windowing if the sweep is sufficiently slow. Hence, with a single excitation signal, both the linear transfer function of the system and the harmonic distortion decomposed into several orders can be obtained, eliminating the influence of non-linearities. We measured harmonics up to the 5 th order.
In Supplementary Figure S3, we present the decomposed harmonic distortion for the different preprocessing techniques. It can be seen that beyond the 2 nd order harmonic, the distortion was negligible. Most of the distortion resides in the second harmonic (see Supplementary Figure S3). It is worth noting that some techniques (e.g., MAM1) reduced the 2 nd harmonic considerably at some frequencies, almost to the levels of the 3 rd harmonic.
The level of total harmonic distortion (L THD ) in decibels was determined by the formula: where p n f ( f ) is the RMS value of the nth-order harmonic component and p f ( f ) is the RMS value of the first harmonic of the fundamental component. Figure 6 shows the L THD obtained for all modulations with m = 1. L THD smaller than 5% (i.e., −26 dB) were generally acceptable levels for domestic audio equipment. MAM techniques provided the best results up to 2.6 kHz, and single sidebands (i.e., LSBAM and USBAM) provided the best overall result, staying below the distortion threshold throughout the working frequency of the speaker. For DSBAM, the high level of the second harmonic caused L THD greater than −20 dB in virtually the entire frequency range. MAM and SRAM modulations obtained good results up to 2 kHz but for higher frequencies. Its behavior was similar to the conventional DSBAM modulation. The smallest values for harmonic distortion were obtained with single sideband modulations. This is because these modulations used only one band, thus there was only one difference-signal. The other techniques used a double band and thus produced two difference-signals generating more harmonics and increasing the harmonic distortion.
overall result, staying below the distortion threshold throughout the working frequency of the 1 speaker. For DSBAM, the high level of the second harmonic caused greater than -20 dB in 2 virtually the entire frequency range. MAM and SRAM modulations obtained good results up to 2 3 kHz but for higher frequencies. Its behavior was similar to the conventional DSBAM modulation.

4
The smallest values for harmonic distortion were obtained with single sideband modulations. This is 5 because these modulations used only one band, thus there was only one difference-signal. The other 6 techniques used a double band and thus produced two difference-signals generating more harmonics 7 and increasing the harmonic distortion.

8
In Supplementary Figure 4, we show the for each predistortion technique and modulation 9 index ranging from 0.1 to 1 at 0.1 steps. In general, the larger the modulation index, the more distortion appeared. However, this increment was more pronounced for DSBAM along all 11 frequencies, for SRAM from 1 to 6 kHz, for MAM1 beyond 1 kHz, and for MAM2 beyond 1.5 kHz.

12
Both LSBAM and USBAM were less affected by an increase of total harmonic distortion as the 13 modulation index increased.

17
When a system is excited by a two-tone stimulus, intermodulation distortion may appear at the than the other tone ( 2 ) , the intermodulation components are concentrated around the highest 20 frequency, at frequencies 2 ± 1 (where is a natural number). According to the IEC Standard 21 60268-21 [26], the total intermodulation distortion ( ), in percentage and considering only 22 intermodulation components up to 3rd order, can be expressed as:

24
where ̃( 2 ± ( − 1) 1 ) is the RMS value of the nth-order intermodulation component at the sum

27
To analyze this parameter along the entire frequency range, the calculation has been divided 28 into two parts: variation of the low-frequency tone 1 from 20 Hz to 1 kHz while keeping the high-  In Supplementary Figure S4, we show the L THD for each predistortion technique and modulation index ranging from 0.1 to 1 at 0.1 steps. In general, the larger the modulation index, the more distortion appeared. However, this increment was more pronounced for DSBAM along all frequencies, for SRAM from 1 to 6 kHz, for MAM1 beyond 1 kHz, and for MAM2 beyond 1.5 kHz. Both LSBAM and USBAM were less affected by an increase of total harmonic distortion as the modulation index increased.

Intermodulation Distortion
When a system is excited by a two-tone stimulus, intermodulation distortion may appear at the sum and difference of the two tones and their harmonics. If one of the tones ( f 1 ) is significantly lower than the other tone ( f 2 ), the intermodulation components are concentrated around the highest frequency, at frequencies f 2 ± n f 1 (where n is a natural number). According to the IEC Standard 60268-21 [26], the total intermodulation distortion (IMD T ), in percentage and considering only intermodulation components up to 3rd order, can be expressed as: where p( f 2 ± (n − 1) f 1 ) is the RMS value of the nth-order intermodulation component at the sum and difference of the two tones, and p( f 2 ) is the RMS value of the fundamental component at the excitation frequency f 2 .
To analyze this parameter along the entire frequency range, the calculation has been divided into two parts: variation of the low-frequency tone f 1 from 20 Hz to 1 kHz while keeping the high-frequency tone f 2 at a constant frequency of 7 kHz, and variation of the high-frequency tone f 2 from 500 Hz to 20 kHz while keeping the high-frequency tone f 1 at a constant frequency of 60 Hz.
The intermodulation distortion is shown in Figure 7. The graph shows that the lowest values of the intermodulation distortions were obtained with SRAM and MAM modulations. Single sideband modulations obtained the worst results. It is also observed how the IMD T measured with the lower single lateral band (LSBAM) had a peak around 6.5 kHz caused again by the minimum that the parametric array had in the ultrasonic response.
Appl. Sci. 2020, 10, x FOR PEER REVIEW 9 of 11 The intermodulation distortion is shown in Figure 7. The graph shows that the lowest values of

19
When selecting a modulation technique and modulation index there must be a balance between 20 sound quality and audio level. A large modulation index will lead to louder audible sound but with 21 more distortion. In general, we recommend MAM1 with a modulation index of 0.6. This

27
We hope that this study promotes more experimental measurements with parametric arrays that 28 analyze the distortion of the speakers. The measurement technique that we used to avoid the 29 appearance of spurious signal is based on selecting a microphone with an adequate frequency 30 response, so it can be employed by most researchers. As future work, it would be desirable to measure 31 psychoacoustical parameters and user-subjective measurements. All measurements were repeated systematically for different modulation indices varying from 0.1 to 1 at steps of 0.1. The results are shown in Supplementary Figure S5. In this case, the MAM and SRAM techniques provide the lowest distortion even at large modulation indices. The rest of techniques are more sensitive to changes in the modulation index.

Conclusions
Parametric arrays have multiple applications in targeted audio delivery. However, the self-demodulation of the highly directive ultrasonic carrier is a non-linear process that introduces distortion of the audio signal. Multiple techniques have been proposed based on amplitude modulation. We have experimentally evaluated six different amplitude modulation techniques in terms of a frequency response as well as harmonic and intermodulation distortion. We analyzed the harmonic distortion at different frequencies and decomposed by orders. Additionally, the distortion of different modulation indices is evaluated for all the modulations techniques.
When selecting a modulation technique and modulation index there must be a balance between sound quality and audio level. A large modulation index will lead to louder audible sound but with more distortion. In general, we recommend MAM1 with a modulation index of 0.6. This configuration produces the lowest levels of harmonic and intermodulation distortion while only decreasing the audible signal by 2 dB. MAM1 technique makes a good use of the limited bandwidth of the transducers. We acknowledge that for transducers with wider band (e.g., electrostatic speakers), MAM2 could provide better results. In any case, the ultrasonic transducers that we evaluated are the most commonly used in this type of devices.
We hope that this study promotes more experimental measurements with parametric arrays that analyze the distortion of the speakers. The measurement technique that we used to avoid the appearance of spurious signal is based on selecting a microphone with an adequate frequency response, so it can be employed by most researchers. As future work, it would be desirable to measure psychoacoustical parameters and user-subjective measurements.
Supplementary Materials: The following are available online at http://www.mdpi.com/2076-3417/10/6/2070/s1, Table SI: Main characteristics of the evaluated microphones, Figure S1: Block diagrams of preprocessing amplitude modulation techniques, Figure S2: Spectrogram of the signals corresponding to each stage of the measurement procedure, Figure S3: Harmonics up to the 5th order measured with the predistortion techniques (m = 1), Figure  S4: Level of total harmonic distortion (LTHD) for the predistortion techniques at different modulation indices, and Figure