Next Article in Journal
Carbon Mass Balance in Arthrospira platensis Culture with Medium Recycle and High CO2 Supply
Previous Article in Journal
Evaluation of Performance and Challenges of Use of Waste Materials in Pavement Construction: A Critical Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Even Dispersion Design for a Compact Linear Loudspeaker Array with Adaptive Genetic Algorithm

1
Key Laboratory of Media Audio & Video (Communication University of China), Ministry of Education, Beijing 100024, China
2
School of Information and Communication Engineering, Communication University of China, Beijing 100024, China
*
Author to whom correspondence should be addressed.
Appl. Sci. 2020, 10(1), 227; https://doi.org/10.3390/app10010227
Submission received: 26 November 2019 / Revised: 18 December 2019 / Accepted: 23 December 2019 / Published: 27 December 2019
(This article belongs to the Section Acoustics and Vibrations)

Abstract

:
Even dispersion is important for live sound reinforcement systems; however, it needs to be carefully designed when using a linear loudspeaker array. This is because the audience area is often large, while the loudspeakers are placed centrally in this case for convenience, and thus both the level and the frequency balance may not remain reasonably constant for all audiences. To solve this problem, the adaptive genetic algorithm is firstly introduced in the parameters optimization. Secondly, taking the radiation characteristics at different frequencies into account, a linear-phase non-uniform filter bank is proposed to decompose the broad frequency band into several sub-bands. The audio is re-synthesized with the optimized parameters in each frequency band for a linear loudspeaker array. To show the validity of the proposed method, the simulations and the experiments are conducted to demonstrate that the sound pressure level in the audience area is distributed within ± 1.33 dB, ranging from 200 Hz to 4000 Hz.

1. Introduction

Live sound reinforcement aims to maintain constant listening experience for a large amount of audiences in real listening environments such as theater, cinema, auditoria, and stadium. The idea of the “democracy of sound” is a recent theme within the live sound reinforcement community [1]. Audiences should receive constant content regardless of the seat location. That is to say, for all audiences, the level and the frequency balance should stay nearly constant in the area that the loudspeakers are designed to cover. This paper studies the use of a compact linear loudspeaker array to achieve the even dispersion goal for practical applications.
Sound pressure level (SPL) distribution is one of the most popular objective measurements in studying the even/uneven dispersion of sound fields. To obtain a constant SPL in the audience area, line loudspeaker arrays have been well designed for sound reinforcement by many researchers. There are mainly two alternative groups of approaches in this field [2]. One is to compute the loudspeaker driving signals with a reproduction equation such as Ambisonics [3,4], the least squares (LS)-based solution [5,6], delay-and-sum beamformer [7,8], and so on. The excitation parameters of loudspeakers are calculated for matching the desired and the reproduced SPL distribution with the control points and the loudspeaker locations. However, these approaches are quite unstable because of the ill-conditioned problem [9] and sufficient only for the specific sound field with high accuracy [10]. The other is the well-known wave field synthesis (WFS) [11], which is derived from the Kirchhoff–Helmholtz integral [12]. For the planar and the linear arrays, the formulation is equivalent to Rayleigh’s first integral formula [13]. However, it is a quite complicated system with a large number of loudspeakers which must be set up carefully and driven individually [14,15]. In this paper, an adaptive genetic algorithm (GA)-based approach is proposed to compute the excitation parameters of loudspeakers. GA, firstly proposed by Holland [16], is an adaptive and competent approach for evolutionary search. It has been successfully applied in many fields; array optimization is an example [17,18,19,20,21], and can be treated as a kind of symbolic regression [22,23].
However, for real audio applications, it is necessary to consider the performance in a wide frequency range. Broadband designs are addressed in sound reproduction [24,25] and beamforming techniques [26]. There are some inevitable limitations for broadband sound field control with a uniformly spaced array [27]. With the increase of frequency, difficulties increase dramatically for constant sound radiation characteristics design. Firstly, the number of loudspeakers should be increased with the range of the frequency band. An array, which consists of 300 loudspeakers, is proposed by Nasim [25] for the signals up to 4000 Hz. However, only a limited number of loudspeakers is available for many practical applications. Lasso-based methods [6,27,28,29] are approved for loudspeaker location optimization with a reduced set of loudspeakers. Due to the complicated computation mentioned above, it is valuable to exploit a feasible system in real applications.
In this paper, we focus on providing the broadband constant SPL distribution in the audience area using a linear loudspeaker array ranging from 200 Hz to 4000 Hz. Firstly, an adaptive GA, which can overcome both the low speed and the premature convergence problems [30], is introduced for the even dispersion goal in narrowband. The proposed method shows good performance compared with conventional approaches such as delay and sum beamformer [7,8], contrast maximization [31], and LS-based pressure matching [32,33]. Secondly, non-uniform filter banks (NUFBs) [34,35], which have a linear-phase (LP) property, are proposed to divide the whole frequency band into sub-bands so that the optimized excitation parameters can be applied to each separate sub-band without causing phase variation. The results demonstrate that SPL distribution can be controlled within ± 1.33 dB in the broadband.
The following parts of the paper are organized as follows. In Section 2, the excitation parameters optimization is introduced for the even dispersion goal in the narrowband case. For the broadband, the method is introduced in Section 3. Then, the simulation and the experiment results are, respectively, shown in Section 4 and Section 5. Finally, the conclusions are drawn in Section 6.
Notation
Throughout this paper, we use the following notations: Matrices and vectors are represented by upper- and lowercase boldface, respectively, e.g., X and x . | x | denotes the Euclidean distance where | x | = x · x . X 1 and X + denotes inverse and the generalized inverse of matrix X . The denary logarithm is denoted by log 10 ( · ) . max ( · ) denotes the max function. exp ( · ) denotes the Euler’s number-based exponential function. G H is the Hamiltonian matrix of G. eign ( · ) denotes the eigenvector of the computing matrix. sin ( · ) is the sinusoidal function. cos ( · ) denotes the cosine function.

2. Excitation Parameters Optimization in Narrowband Case

To maintain constant SPL for audiences in a line for all seats, this section introduces the methods on deriving the excitation parameters in an offline manner (the proposed optimization algorithm can be extended for online estimation of the excitation parameters when a microphone array is placed at the line for all seats, which will be studied in future) for a line loudspeaker array. The placement of the loudspeaker array is plotted in Figure 1. The aim is to generate constant SPL for audiences sitting in line A B by using this compact loudspeaker array whose center is the point O. The angle between O A and O B is 45 , which means A B = O B . It is assumed that the sound field propagates under free-field conditions.

2.1. Theoretical Basis

The SPL at an observation position x = [ x , y ] in a line is given as
P ( x , k ) = l 2 l 2 D ( x 0 , k ) G 2 D ( x , x 0 , k ) d x 0 ,
where l is the length of the line array. k is the wave number that relates to the frequency f. x 0 = [ x , 0 ] refers to one position of the line array. D ( x 0 , k ) denotes the sound driving function at this position, which can be given by
D ( x 0 , k ) = A ( x 0 , k ) exp ( j α ( x 0 , k ) ) .
G 2 D ( x , x 0 , k ) is the two dimensional free-field Green’s function defined in [36]
G 2 D ( x , x 0 , k ) = exp ( j k | x x 0 | ) 4 π | x x 0 | .
When the observation position is far away from the center of the line loudspeaker array with a fixed distance r = | x x 0 | , the sound pressure at angle θ can be reduced to [37]
P ˜ ( θ , k ) = 1 r n = 1 N D ( x 0 , n , k ) exp ( j k n d k sin ( θ ) ) ,
where N is the number of loudspeakers. x 0 , n = [ x n , 0 ] is the postion of the nth loudspeaker. d k is the interspace of thetwo adjacent loudspeakers which could be different in different frequencies. θ is illustrated in Figure 1. The parameters that need to be optimized are the magnitudes A ( x 0 , k ) and the phases α ( x 0 , k ) in D ( x 0 , k ) .
The directivity is defined as
R ( θ , k ) = 20 log 10 | P ˜ ( θ , k ) | max ( | P ˜ ( θ , k ) | ) .

2.2. Conventional Approaches

There are several approaches to compute the excitation parameters for the desired sound field.

2.2.1. Delay and Sum Beamformer

Delay and sum beamformer is a simple method to control the sound field by setting control points. The sound driving function can be given by [7]
D D S ( x 0 , k ) = m = 1 M 1 G 2 D ( x m , x 0 , k ) ,
where x m is the control point and x 0 is the speaker position. M is the number of control points.

2.2.2. Contrast Maximization

In contrast maximization approach, we can set control points at target and non-target regions. The spatial correlation between control points x and loudspeaker locations x 0 is defined as [31]
C ( x , x 0 , k ) = 1 M m = 1 M G m H ( x , x 0 , k ) G m ( x , x 0 , k ) ,
where G m ( x , x 0 , k ) = [ G 2 D ( x m , x 0 , 1 , k ) G 2 D ( x m , x 0 , n , k ) ] . The superscript H denotes the Hamiltonian matrix operator.
The sound driving function can be given by
D C M ( x 0 , k ) = eign ( [ C t a r ( x , x 0 , k ) + C n o n t a r ( x , x 0 , k ) ] + ) C t a r ( x , x 0 , k ) ,
where C t a r ( x , x 0 , k ) and C n o n t a r ( x , x 0 , k ) are the spatial correlation matrix between target control points and loudspeaker positions and that between non-target control points and loudspeaker positions, respectively.

2.2.3. LS-Based Pressure Matching

In LS-based pressure matching approach [38], the driving function is given by
D L S P M ( x 0 , k ) = G 1 ( x , x 0 , k ) P ( x , k ) ,
where G 1 ( x , x 0 , k ) is the inverse of free-field Green’s function between control points x and loudspeaker locations x 0 . Two sets of control points are designed with higer and lower values.

2.3. Proposed Approaches

To obtain constant SPL in line AB as shown in Figure 1, the generated SPL should compensate the attenuation in sound propagation because of the varying distance between the array and points in line AB. Since the SPL produced by a point sound source is defined as [39]
L = L W 20 log 10 ( r ) 11 ,
where L W is determined by the power of the source and r is distance from the source to the measured point. If the SPL at point A is chosen as the reference, the SPL at the point C in Figure 1 can be given by
R r e f ( θ , k ) = R r e f ( 45 , k ) + 20 log 10 O C O A = R r e f ( 45 , k ) + 20 log 10 1 2 cos ( θ ) .
To generate such sound field, an adaptive GA-based approach is proposed for array excitation parameters optimization. The GA is a search algorithm motivated by the Darwin’s natural selection theories. Unlike the above mentioned traditional methods, GA can search parallelly for the global optimal solutions among the possible spaces such as the excitation parameters in this paper. Promoted by the crossover and the mutation operators, the individuals that satisfy a predefined goal will survive in the evolution of a population which is similar to the notions of natural evolution. The GA is chosen because the aim is to get the best parameters in the search spaces within a relatively short time. Moreover, it can be handled easily to maintain multiple targets: (1) The target SPL distribution in the audience area. (2) The SPL constraint in unconcerned area (sidelobe of the array pattern). It is a nonlinear problem when the sidelobe is required to be lower than a threshold which can not be solved by the above mentioned traditional methods generally. The sidelobe control is important for practical applications due to that the SPL distribution in sidelobe should be low enough to avoid interference with others. Note should be given that, although the low speed convergence problem is not a big problem for offline excitation parameters optimization, the premature convergence problem could degrade the performance a lot for practical applications. As mentioned above, when extending the adaptive GA to online excitation parameters optimization, the adaptive GA is expected to solve all the two problems of the traditional GA.
To describe the GA procedure, it is necessary to define the chromosome and the fitness function. The chromosomes, also known as individuals, represent the possible solutions in the search spaces. For simplicity, only the phases α ( x 0 , k ) are optimized in the following. The phases for loudspeaker array, range from 0 to 360 , can be encoded and spliced into a binary string. Quantitative accuracy is a parameter for the chromosome binary encoding. For example, if the phases are { α ( x 0 , 0 , k ) α ( x 0 , 1 , k ) α ( x 0 , N 2 , k ) α ( x 0 , N 1 , k ) } , where N is the number of loudspeakers, a chromosome for these parameters can be encoded as {01000 01010 ... 01011 11001}. New individuals can be decoded into phases for fitness calculation in the selection operator.
To obtain a more precise representation of the directivity pattern, the objective function is defined as [40]
O ( θ , k ) = a 1 0 < θ < 45 [ | R ( θ , k ) | R r e f ( θ , k ) ] b 1 + a 2 [ | max 90 < θ < 0 R θ , k | ] b 2 ,
where a 1 , a 2 , b 1 , and b 2 refer to the controlled coefficients. This function requests an optimized individual that should maintain (a) minimum differences between the objective distribution defined in Equation (12) and the optimized distribution defined in Equation (5) with decoded phases, (b) minimum SPL distribution in the sidelobe. Unlike the traditional methods, an exact objective sidelobe distribution is not well considered in the sidelobe control.
The fitness function is the transformation of the objective function [7,41]
J ( θ , k ) = c O ( θ , k ) + d ,
where c and d are controlled coefficients in the experiments.
The adaptive GA procedure is summarized in Algorithm 1. The initial populations are generated randomly. Each individual in the populations is awarded a fitness score computed by the fitness function after decoding its chromosome. After that, three basic genetic operators are conducted: Selection, crossover, and mutation. The selection operator is defined according to the fitness. To achieve diversity and convergence, the higher the fitness individual, the more likely to survive in the selection operator. In the crossover process, new generations are produced by the individuals after selection. A single-point crossover operator is applied in this paper to generate new individuals by dividing parent chromosomes into some parts and inheriting from different parents. The mutation operator adds diversity to the population and explores new parameter search spaces. The generations with optimized phases α ( x 0 , k ) can be produced when some stopping criteria become true.
Algorithm 1: Adaptive GA
  • Step 1. Set population size n p o p and the coefficients in the adaptive fitness function.
  • Step 2. Generate initial population for the phases α ( x 0 , k ) in the possible search space.
  • repeat
  • Step 3. Calculate the fitness for each chromosome by Equation (13). Compute crossover rate and mutaion rate.
  • Step 4. Peform selection based on the fitness.
  • Step 5. Peform crossover on the generation.
  • Step 6. Peform mutation on the generation.
  • until Satisfying the stopping criteria with optimized α ( x 0 , k ) .

3. Constant SPL Distribution Design for Broadband Signals

As mentioned above, the constant SPL distribution design is implemented in a narrowband case. However, due to the different radiation characteristics of high and low frequencies, it needs great efforts to achieve this goal for broadband applications. In this section, according to the radiation characteristics of a loudspeaker array on different frequencies, we divide the broadband into many independent sub-bands to apply the optimized parameters separately.

3.1. Frequency Division

Because of the different radiation characteristics, the frequency band should be divided properly to get better performance in each sub-band. With an equispaced array, the radiation characteristic varies with the interspace of the adjacent loudspeakers, the number of loudspeakers, and the excitation frequency. We take the arrays with 4, 6, and 8 loudspeakers, respectively, as examples to describe the division strategy with a fixed interspace of the two adjacent loudspeakers.
In the specific sub-band, the same delay should be applied since the phase is a frequency dependent parameter. With the phase α ( x 0 , k c t ) computed by adaptive GA at the central frequency of the same sub-band, the phase at the noncentral frequency is
α ( x 0 , k ) = α ( x 0 , k c t ) k k c t ,
where k and k c t refer to the wave number at the noncentral and that at the central frequency, respectively. One can see that the phase at the band edge may be significantly influenced when the bandwidth is too large. The SPL in the target line can be calculated by
R ˜ ( x 0 , k ) = R ( x C , x B , k ) + 20 log 10 ( cos x C , x B ) ,
where R ( x C , x B , k ) is the directivity computed by Equation (5) with the optimized phase α ( x 0 , k ) . x C is the position at the point C as shown in Figure 1. x B denotes the position of point B.
To optimize the proper bandwidth, the rule for the bandwidth decision can be given by
B r e s = max | R ˜ ( x 0 , k ) R ˜ ¯ ( x 0 , k ) | < 1.5 d B ( k k c t ) c / π ,
where c denotes the speed of sound. R ˜ ¯ ( x 0 , k ) is the average value of R ˜ ( x 0 , k ) over x 0 . The SPL distrubution R ˜ ( x 0 , k ) should meet the predefined performance requirement with maximum bandwidth, where the SPL difference over the target line should be less than ± 1.5 dB [42] in each sub-band. Besides, the band division strategy should obey the feasible condition in the NUFBs design [34]: For the lth band, i = 0 l 1 B r e s , i equals to B r e s , l v l with some integer v l . Hence, there are only a finite number of bandwidth choices in each sub-band design which can be solved with the enumeration method.
As a result, for the line loudspeaker array with four loudspeakers, the relative SPL distributions of the lower and the upper frequencies in the first band are shown in Figure 2a,b, respectively. With Equation (16), we can obtain the bandwidth in each band. For the central frequency from 200 Hz to 600 Hz, the bandwidth is 100 Hz. While for 600 Hz–1200 Hz, the bandwidth is 200 Hz. When the central frequency ranges from 1200 Hz to 2000 Hz, the bandwidth is 400 Hz. The remainder frequency band is divided into three parts, which are 2000 Hz–2500 Hz, 2500 Hz–3000 Hz, and 3000 Hz–4000 Hz. The lower and the upper frequencies of each band for 4 loudspeakers are summarized in Table 1.
For the line loudspeaker array with six and eight loudspeakers, the same frequency band division strategy can be applied with Equation (16). Specifically, for six loudspeakers, the relative SPL distributions of the lower and the upper frequencies in the first band are plotted in Figure 2c,d. For eight loudspeakers, the bandwidth of first sub-band is 30 Hz, where its relative SPL distributions of the lower and the upper frequencies are shown in Figure 2e,f. For the all the frequencies ranging from 200 Hz to 4000 Hz, the lower and the upper frequencies of each band are presented in Table 1 for six and eight loudspeakers.
One can see from Table 1 that, with the increase of the number of loudspeakers, the frequency band should become narrower. With a fixed number of loudspeakers, the frequency band becomes wider at high frequencies.
At high frequencies, the interspace of the array should be smaller due to shorter wavelengths than those of low frequencies. For the following experiment and analysis, the used loudspeaker array are shown in Figure 3. The sub-array is named as “Group 1” with the interspace 20 cm for 200 Hz to 1000 Hz. The sub-array “Group 2” with the interspace 10 cm is used for the band 1000 Hz–02000 Hz. For higher frequencies (>2000 Hz), the sub-array “Group 3” with the interspace 5 cm is chosen.

3.2. Digital Filter Bank Design

To handle the different radiation characteristics, a digital filter bank is chosen to perform the frequency division. It is necessary to achieve LP property for accurate phases in each sub-band. The design of LP-NUFBs [35] with the efficient partial modulation technique can generalize the phase modification structure for the LP property. Compared with traditional NUFB designs, it has much lower system delay and computational complexity when using LP-NUFBs, while a promising performance can be achieved.
The band division result should be transformed into the angular frequency domain. The corresponding angular frequency ω can be calculated as
ω = 2 π f f s ,
where f ranging from 200 Hz to 4000 Hz and f s is the sampling rate.
To ensure the frequency response performance in the narrow frequency band of the filter bank, the order of the filter bank is set to 1500. In the band division result of 6 loudspeakers, the sampling factors are [ 1 / 40 , / 160 , 1 / 160 , 1 / 160 , 1 / 160 , 1 / 80 , 1 / 80 , 1 / 80 , 1 / 80 , 1 / 80 , 1 / 80 , 1 / 40 , 1 / 40 , 1 / 40 , 1 / 40 , 1 / 40 , 1 / 16 , 1 / 16 , 1 / 8 , 1 / 2 ] . The designed 20-channel LP-NUFBs is shown in Figure 4. The magnitude responses of the analysis filters are plotted in Figure 4a. The aliasing and the amplitude distortions are E a = 0.013 and E p p = 0.041 , respectively, as shown in Figure 4b,c.

3.3. Magnitude Response Compensation

The magnitude response is one of the most important factors for the perceived sound quality of loudspeaker [43]. A flat magnitude response [44,45] is desired to reproduce the input signals. In this work, we focus on the constant listening experience in the target area. It is also necessary to realize a flat magnitude response system for constant SPL distribution in the whole designed frequency band.
The measured magnitude response curve of a loudspeaker used in the experiment is shown in Figure 5a. The magnitude optimization goal is to compensate the non-flat magnitude response by using an inverse of the magnitude response. The compensating magnitudes are shown in Figure 5b.
The same calibration procedure can be conducted on each loudspeaker to achieve the same flat magnitude response. Note that this paper only considers the case that all the loudspeakers are the same type. If a line loudspeaker array consists of different types of loudspeakers, the optimization problem becomes more difficult, which can be a future research work. In the simulations, we assume that all the loudspeakers have the same flat magnitude response. In the experiments, we use the line loudspeaker array that all the loudspeakers are the same type and their magnitude responses have been compensated as describe in this part.

4. Simulation

With the LP-NUFBs, the whole frequency band can be divided into several sub-bands. Thus, the excitation parameters can be optimized separately in each sub-band by the proposed adaptive GA. In this section, we discuss the performance at some specific frequencies firstly. After that, a sweep signal with its frequency ranging from 200 Hz to 4000 Hz is applied to evaluate the performance for broadband signals.

4.1. Performance Evaluation of Narrowband Signals

The control points are located in a line with y = 3 m and 3 m < x < 3 m. With the delay and sum beamformer and the contrast maximization methods, sixty control points ( M = 60 ) are discretized with Δ x = 0.1 m. The LS-based pressure matching method is applied with six control points in the line with y = 3 m, 3 m < x < 3 m. The frequency is chosen to be 600 Hz.
Figure 6 shows the produced relative SPL in the target line with the proposed adaptive GA and the other conventional methods including delay and sum beamformer, contrast maximization, and LS-based pressure matching. Compared with the other two traditional methods, the LS-based pressure matching shows better performance for x < = 2 m where the SPL difference is within ± 9.9 dB while it reduces rapidly for x > 2 m. The SPL distribution is nearly constant over the whole target line with the proposed adaptive GA. In all, the proposed adaptive GA with the initial population n p o p = 200 shows the best performance, where the SPL difference is within ± 0.72 dB which is much better than the competing methods.
The performances of 500 Hz, 1000 Hz, 2000 Hz, and 4000 Hz with the proposed adaptive GA are shown in Figure 7. One can see that the SPL difference is less than 1.5 dB over the whole target line for all frequencies, which satisfies the requirements.

4.2. Performance Evaluation of Broadband Signals

As described in Section 3, we divide the frequency pattern of the loudspeaker array with 12 loudspeakers. With the 20-channel LP-NUFBs design, the parameters calculated by the adaptive GA in each sub-band are applied in the whole frequency band.
To simulate the constant performance, a 20 s double swept-frequency signal is applied with the superposition of 200 Hz–2000 Hz and 400 Hz–4000 Hz swept-frequency signals which means there are that two frequency components k 1 and k 2 are always presented at the same time. Given that there are two separate sound sources with P ˜ ( θ , k 1 ) and P ˜ ( θ , k 2 ) radiated by the same loudpeaker array, the synthetic relative SPL can be given by [46]
R ˜ ( θ , k 1 , k 2 ) = 20 log 10 P ˜ ( θ , k 1 ) + P ˜ ( θ , k 2 ) max θ P ˜ ( θ , k 1 ) + P ˜ ( θ , k 2 ) .
At a specific time, the signal has two narrowband frequency components. Their sound pressures can be calculated separately with the phases optimized by adaptive GA. After that, the relative SPL at this specific time can be calculated with Equation (18). The SPL reproduced by the testing signals is shown in Figure 8a. It can be seen that the SPL in the target line tends to be consistent in the designed frequency range.

5. Experiment

As described above, the optimized excitation parameters of the loudspeaker array can be calculated by adaptive GA. Next, experiments are conducted to verify the performance of our system design. In these experiments, the wave files are synthesized by the LP-NUFBs. The SPL in the target line is measured when playing the multi-channel sound signals with the loudspeaker array.

5.1. Experiment Preparation

In the experiments, the same double swept-frequency signal in the simulation is applied. To validate the effectiveness of the proposed method, the SPL distributions are measured with the non-optimized and the optimized audio signals. In the non-optimized experiment, the same testing audio signals are played by the loudspeaker array which means the delays and the amplitudes are the same for all the loudspeakers. For the optimized SPL distribution measurements, the excitation parameters including the phases optimized by adaptive GA and the optimized magnitudes for flat magnitude responses are applied by LP-NUFBs for multi-channel audios synthesis. The input signal is divided into several sub-bands by the LP-NUFBs in each frame. After applying the optimized excitation parameters in each sub-band, the multi-channel audio signals can be obtained separately for the loudspeakers. Note that, the overlap-save method [47] is conducted in the frame processing procedure to avoid time aliasing. The overlapped portions in frames are discarded dicrectly which makes it more efficient than the well-known overlap-add method [48]. The frame length is set to one second. The frameshift is 0.7 s. The sampling rate of the wave file is 48 kHz.
The experiment is conducted in the Key Laboratory of Media Audio & Video (KLMAV) Lab, Communication University of China. In these experiments, the line loudspeaker array is placed as shown in Figure 3. The multi-channel signals generating with optimized excitation parameters are played by the loudspeakers. The SPL distribution is measured by sound level meters TES-1353 in the target line, which is about 3 m away from the array. Each test point is spaced 15 cm.

5.2. Experimental Results and Analysis

Before conducting the experiments, the background noise of the target line is measured firstly to figure out the noise distribution. By doing so, the SPL values contributed by the loudspeaker array can be obtained by subtracting the noise SPL values to improve the evaluation accuracy.

5.2.1. Non-Optimized SPL Distribution

The non-optimized experiment is conducted to prove the effectiveness of optimized results. The double swept-frequency signals with same phases and magnitudes are played directly by the loudspeaker array. The time-varying SPL distribution is measured in 20 s at an interval of 15 cm.
The SPL distribution for the non-optimized case is shown in Figure 8b, which indicates that the SPL distribution is much higher in the center of the loudspeaker array and it gradually reduces from the center to the left/right side. With the increase of the frequency, the mainlobe of the line loudspeaker array tends to become narrower. One can also observe that the SPL distribution can range from −5 dB to 20 dB at high frequency.

5.2.2. Optimized SPL Distribution

With the same experiment setup, the proposed method is applied to optimize the double swept-frequency signal for all the loudspeakers. The time-varying SPL distribution of the proposed method shows much better performance than the non-optimized result as shown in Figure 8c. To show the validity of optimization directly, the SPL distributions in different positions within different times are normalized and averaged. In Figure 9, the non-optimized SPL distributions range in ± 5.27 dB. With optimization, the SPL distributions are much smaller, which range in ± 1.33 dB.

6. Conclusions

In this work, we present an efficient and effective method to achieve the even dispersion goal using a compact linear loudspeaker array. To obtain a constant SPL in the audience area, an adaptive GA is introduced for parameters optimization to get much better results compared with conventional approaches in the narrowband. For the real audio application, the broadband is divided into several narrowbands when introducing the linear-phase non-uniform filter banks before parameters optimization. The simulated and the experimental results validate that the SPL distribution is controlled within ± 1.33 dB. Future work should concentrate on optimizing parameters of multiple linear loudspeaker arrays for more complicated live audio reinforcement systems.

Author Contributions

Conceptualization, J.C. and H.W.; Data curation, J.C. and Y.P.; Formal analysis, H.W. and Y.W.; Investigation, J.C. and Y.P.; Methodology, J.C. and Y.P.; Validation, J.C., Y.P. and Y.W.; Writing-original draft preparation, Y.P.; Writing-review & editing, Y.P. and J.C.; Supervision, H.W. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

This research is supported by the National Natural Science Foundation of China (Grant No. 61631016, 61231015), the National Radio and Television Administration Project (Grant No. 2015-53) and the Fundamental Research Funds for the Central Universities (Grant No. 2014XNG1425). The authors would like to thank the anonymous reviewers and the associate editor for their valuable comments that helped to improve the quality of this work. The authors would like to thank Prof. Hao and Zheng in Institute of Acoustics, Chinese Academy of Sciences in discussing and proofreading this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hill, A. Live sound subwoofer system performance quantification. In Audio Engineering Society Convention 144; Audio Engineering Society: Milan, Italy, 2018. [Google Scholar]
  2. Ahrens, J.; Spors, S. Sound field reproduction using planar and linear arrays of loudspeakers. IEEE Trans. Audio Speech Lang. Process. 2010, 18, 2038–2050. [Google Scholar] [CrossRef]
  3. Gerzon, M. Ambisonics in multichannel broadcasting and video. J. Audio Eng. Soc. 1985, 33, 859–871. [Google Scholar]
  4. Yan, J.; Abhayapala, T. Theory and design of soundfield reproduction using continuous loudspeaker concept. IEEE Trans. Audio Speech Lang. Process. 2008, 17, 107–116. [Google Scholar]
  5. Marek, O.; Jackson, P.; Coleman, P.; Pedersen, J. Optimal source placement for sound zone reproduction with first order reflections. J. Acoust. Soc. Am. 2014, 136, 3085–3096. [Google Scholar]
  6. Radmanesh, N.; Burnett, I.; Rao, B. A Lasso-LS optimization with a frequency variable dictionary in a multizone sound system. IEEE/ACM Trans. Audio Speech Lang. Process. (TASLP) 2016, 24, 583–593. [Google Scholar] [CrossRef]
  7. Ping, T.; Gordon, L.; Lal, T. Nonlinear control using evolutionary fitness functions based on scaling transformations. In Proceedings of the 2008 World Automation Congress, Hawaii, HI, USA, 28 September–2 October 2008; pp. 1–6. [Google Scholar]
  8. Tamai, Y.; Kagami, S.; Mizoguchi, H.; Nagashima, K. Simultaneous generation/capture of multiple focuses sound beams. In Proceedings of the 2003 IEEE International Conference on Systems, Man and Cybernetics. Conference Theme—System Security and Assurance (Cat. No.03CH37483), Washington, DC, USA, 8 October 2003; pp. 4613–4618. [Google Scholar]
  9. Fazi, F.; Nelson, P. The ill-conditioning problem in sound field reconstruction. In Audio Engineering Society Convention 123; Audio Engineering Society: New York, NY, USA, 2007; Volume 10. [Google Scholar]
  10. Ueno, N.; Koyama, S.; Saruwatari, H. Listening-area-informed sound field reproduction based on circular harmonic expansion. In Proceedings of the 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), New Orleans, LA, USA, 5–9 March 2017; pp. 111–115. [Google Scholar]
  11. Hahn, N.; Winter, F.; Spors, S. Local wave field synthesis by spatial band-limitation in the circular/spherical harmonics domain. In Audio Engineering Society Convention 140; Audio Engineering Society: New York, NY, USA, 2016. [Google Scholar]
  12. Firtha, G.; Fiala, P.; Schultz, F.; Spors, S. On the general relation of wave field synthesis and spectral division method for linear arrays. IEEE/ACM Trans. Audio Speech Lang. Process. 2018, 26, 2393–2403. [Google Scholar] [CrossRef]
  13. Williams, E. Fourier Acoustics: Sound Radiation and Nearfield Acoustical Holography; Elsevier: London, UK, 1999. [Google Scholar]
  14. Völk, F. Minimum-Audible Angles in Wave-Field Synthesis: A Case Study. In Audio Engineering Society Convention 141; Audio Engineering Society: New York, NY, USA, 2016. [Google Scholar]
  15. Villegas, J. Improving perceived elevation accuracy in sound reproduced via a loudspeaker ring by means of equalizing filters and side loudspeaker grouping. Acoust. Sci. Technol. 2019, 40, 127–137. [Google Scholar] [CrossRef] [Green Version]
  16. Holland, J. Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence; MIT Press: Cambridge, MA, USA, 1992. [Google Scholar]
  17. Holm, S.; Austeng, A.; Iranpour, K.; Hopperstad, J. Sparse sampling in array processing. Nonuniform Sampl. 2001, 2001, 787–833. [Google Scholar]
  18. Ong, J.; King, Y.; Saw, L.; Theng, K. Optimization of the Design Parameter for Standing Wave Thermoacoustic Refrigerator using Genetic Algorithm. In IOP Conference Series: Earth and Environmental Science; IOP Publishing: Bristol, UK, 2019; p. 012021. [Google Scholar]
  19. Haupt, R. Thinned arrays using genetic algorithms. IEEE Trans. Antennas Propag. 1994, 42, 993–999. [Google Scholar] [CrossRef]
  20. Xu, Z.; Xia, X.; Lai, S.; He, Z. Improvement of interior sound quality for passenger car based on optimization of sound pressure distribution in low frequency. Appl. Acoust. 2018, 130, 43–51. [Google Scholar] [CrossRef]
  21. Bai, M.; Tung, C.; Lee, C. Optimal design of loudspeaker arrays for robust cross-talk cancellation using the taguchi method and the genetic algorithm. J. Acoust. Soc. Am. 2005, 117, 2802–2813. [Google Scholar] [CrossRef] [PubMed]
  22. Quade, M.; Gout, J.; Abel, M. Glyph: Symbolic Regression Tools. J. Open Res. Softw. 2019, 7, 19–26. [Google Scholar] [CrossRef] [Green Version]
  23. Lample, G.; Charton, F. Deep learning for symbolic mathematics. Int. Conf. Learn. Represent. 2020. Under review. [Google Scholar]
  24. Campbell, J.; Soloway, S. Generation of a nondiffracting beam with frequency-independent beamwidth. J. Acoust. Soc. Am. 1990, 88, 2467–2477. [Google Scholar] [CrossRef]
  25. Radmanesh, N.; Burnett, I. Reproduction of independent narrowband soundfields in a multizone surround system and its extension to speech signal sources. In Proceedings of the 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Prague, Czech Republic, 22–27 May 2011; pp. 461–464. [Google Scholar]
  26. Kolundzija, M.; Faller, C.; Vetterli, M. Design of a compact cylindrical loudspeaker array for spatial sound reproduction. In Audio Engineering Society Convention 130; Audio Engineering Society: New York, NY, USA, 2011. [Google Scholar]
  27. Radmanesh, N.; Burnett, I. Wideband sound reproduction in a 2D multi-zone system using a combined two-stage Lasso-LS algorithm. In Proceedings of the 2012 IEEE 7th Sensor Array and Multichannel Signal Processing Workshop (SAM), Hoboken, NJ, USA, 17–20 June 2012; pp. 453–456. [Google Scholar]
  28. Ciuperca, G. Adaptive group LASSO selection in quantile models. Stat. Pap. 2019, 60, 173–197. [Google Scholar] [CrossRef] [Green Version]
  29. Feng, Q.; Yang, F.; Yang, J. Time-domain sound field reproduction using the group Lasso. J. Acoust. Soc. Am. 2018, 143, EL55–EL60. [Google Scholar] [CrossRef]
  30. Wang, S.; Lu, Z.; Wei, L.; Ji, G.; Yang, J. Fitness-scaling adaptive genetic algorithm with local search for solving the Multiple Depot Vehicle Routing Problem. Simulation 2016, 92, 601–616. [Google Scholar] [CrossRef]
  31. Choi, J.; Kim, Y. Generation of an acoustically bright zone with an illuminated region using multiple sources. J. Acoust. Soc. Am. 2002, 111, 1695–1700. [Google Scholar] [CrossRef]
  32. Tahereh, A.; Elliot, P.; Miller, P. A Pseudoinverse Technique for the Pressure-Matching Beamforming Method. In Audio Engineering Society Convention 145; Audio Engineering Society: New York, NY, USA, 2018. [Google Scholar]
  33. Takuma, O.; Atsushi, S. Experimental validation of spatial Fourier transform based multiple sound zone generation with a linear loudspeaker array. J. Acoust. Soc. Am. 2017, 141, 1769–1780. [Google Scholar]
  34. Fang, L.; Zhong, W.; Zhang, Q. Design of M-channel linear-phase non-uniform filter banks with arbitrary rational sampling factors. IET Signal Process. 2016, 10, 106–114. [Google Scholar] [CrossRef]
  35. Zhong, W.; Shi, G.; Xie, X.; Chen, X. Design of linear-phase nonuniform filter banks with partial cosine modulation. IEEE Trans. Signal Process. 2010, 58, 3390–3395. [Google Scholar] [CrossRef]
  36. Philip, C.; Jackson, P.; Marek, O.; Martin, M.; Martin, O.; Abildgaard, P. Acoustic contrast, planarity and robustness of sound zone methods using a circular loudspeaker array. J. Acoust. Soc. Am. 2014, 135, 1929–1940. [Google Scholar]
  37. Ureda, M. Line arrays: Theory and applications. In Audio Engineering Society Convention 110; Audio Engineering Society: New York, NY, USA, 2001. [Google Scholar]
  38. Kirkeby, O.; Nelson, P. Reproduction of plane wave sound fields. J. Acoust. Soc. Am. 1993, 94, 2992–3000. [Google Scholar] [CrossRef]
  39. Rathe, E. Note on two common problems of sound propagation. J. Sound Vib. 1969, 10, 472–479. [Google Scholar] [CrossRef]
  40. Huang, Z.; Ma, S.; Wang, H.; Li, C. A uniform SPL distribution method of linear phased loudspeaker array using genetic algorithm. In Proceedings of the 2012 IEEE 14th International Conference on Communication Technology, Chengdu, China, 9–11 November 2012; pp. 976–980. [Google Scholar]
  41. Basilio, P.; Jose, E.; Miguel, R. Precise control of beam direction and beamwidth of linear loudspeaker arrays. In Proceedings of the Workshop Proceedings, 2004 Sensor Array and Multichannel Signal, Barcelona, Spain, 18–21 July 2004; pp. 538–541. [Google Scholar]
  42. Killion, M. Revised estimate of minimum audible pressure: Where is the “missing 6 dB”? J. Acoust. Soc. Am. 1978, 63, 1501–1508. [Google Scholar] [CrossRef] [PubMed]
  43. Gabrielsson, A.; Björn, L.; Ove, T. Loudspeaker frequency response and perceived sound quality. J. Acoust. Soc. Am. 1991, 90, 707–719. [Google Scholar] [CrossRef]
  44. Li, G.; Zheng, C.; Li, X.; Yu, T.; Bleeck, S.; Sang, J. Evaluation of headphone phase equalization on sound reproduction. Appl. Acoust. 2019, 156, 208–216. [Google Scholar] [CrossRef]
  45. Li, X.; Cai, Z.; Zheng, C.; Li, X. Equalization of loudspeaker response using balanced model truncation. J. Acoust. Soc. Am. 2015, 137, EL241–EL247. [Google Scholar] [CrossRef] [Green Version]
  46. Ding, D.; Shui, Y.; Lin, J.; Zhangi, D. A simple calculation approach for the second harmonic sound field generated by an arbitrary axial-symmetric source. J. Acoust. Soc. Am. 1996, 100, EL727–EL733. [Google Scholar] [CrossRef]
  47. Oppenheim, A.; Schafer, R. Linear Convolution Using the DFT. In Discrete-Time Signal Processing; Pearson Education: London, UK, 2014; pp. 671–673. [Google Scholar]
  48. Borgerding, M. Turning overlap-save into a multiband mixing, downsampling filter bank. IEEE Signal Process. Mag. 2006, 23, 158–161. [Google Scholar] [CrossRef]
Figure 1. The displacement of the compact line loudspeaker array and the seats of audiences.
Figure 1. The displacement of the compact line loudspeaker array and the seats of audiences.
Applsci 10 00227 g001
Figure 2. The radiation results comparison with four loudspeakers at (a) 200 Hz and (b) 300 Hz, six loudspeakers at (c) 200 Hz and (d) 250 Hz, and eight loudspeakers at (e) 200 Hz and (f) 220 Hz.
Figure 2. The radiation results comparison with four loudspeakers at (a) 200 Hz and (b) 300 Hz, six loudspeakers at (c) 200 Hz and (d) 250 Hz, and eight loudspeakers at (e) 200 Hz and (f) 220 Hz.
Applsci 10 00227 g002
Figure 3. The designed loudspeaker array.
Figure 3. The designed loudspeaker array.
Applsci 10 00227 g003
Figure 4. The designed 20-channel linear-phase non-uniform filter banks (LP-NUFBs). (a) Magnitude responses of the analysis filters. (b) Aliasing error. (c) Amplitude distortion.
Figure 4. The designed 20-channel linear-phase non-uniform filter banks (LP-NUFBs). (a) Magnitude responses of the analysis filters. (b) Aliasing error. (c) Amplitude distortion.
Applsci 10 00227 g004
Figure 5. (a) The magnitude response curve. (b) The compensation magnitude curve.
Figure 5. (a) The magnitude response curve. (b) The compensation magnitude curve.
Applsci 10 00227 g005
Figure 6. The relative sound pressure level (SPL) distribution in the target line with (a) adaptive genetic algorithm (GA), (b) delay and sum beamformer, (c) contrast maximization, and (d) least squares (LS)-based pressure matching approaches.
Figure 6. The relative sound pressure level (SPL) distribution in the target line with (a) adaptive genetic algorithm (GA), (b) delay and sum beamformer, (c) contrast maximization, and (d) least squares (LS)-based pressure matching approaches.
Applsci 10 00227 g006
Figure 7. The relative SPL distribution of 500 Hz, 1000 Hz, 2000 Hz, and 4000 Hz.
Figure 7. The relative SPL distribution of 500 Hz, 1000 Hz, 2000 Hz, and 4000 Hz.
Applsci 10 00227 g007
Figure 8. The results of simulation and experiments. (a) Simulated distribution of relative SPL. (b) Non- optimized time-varying distribution of SPL. (c) Optimized time-varying distribution of SPL.
Figure 8. The results of simulation and experiments. (a) Simulated distribution of relative SPL. (b) Non- optimized time-varying distribution of SPL. (c) Optimized time-varying distribution of SPL.
Applsci 10 00227 g008
Figure 9. Comparison diagram of relative SPL distribution. “exp1” shows the non-optimized relative SPL distribution. “exp2” shows the optimized relative SPL distribution.
Figure 9. Comparison diagram of relative SPL distribution. “exp1” shows the non-optimized relative SPL distribution. “exp2” shows the optimized relative SPL distribution.
Applsci 10 00227 g009
Table 1. The band and its corresponding lower feaquency f l , upper frequency f u and bandwidth B r e s for the line loudspeaker array with N = 4 , 6 , 8 loudspeakers.
Table 1. The band and its corresponding lower feaquency f l , upper frequency f u and bandwidth B r e s for the line loudspeaker array with N = 4 , 6 , 8 loudspeakers.
NBand f l f u B res NBand f l f u B res
(Hz)(Hz)(Hz) (Hz)(Hz)(Hz)
412003001004710001200200
423004001004812001600400
434005001004916002000400
4450060010041020002500500
4560080020041125003000500
468001000200412300040001000
61200250506109001000100
622503005061110001200200
633003505061212001400200
643504005061314001600200
6540050010061416001800200
6650060010061518002000200
6760070010061620002500500
6870080010061725003000500
69800900100618300040001000
812002202081578084060
822202402081684090060
83240260208179001000100
842602802081810001100100
852803204081911001200100
863203604082012001350150
873604004082113501500150
884004505082215001650150
894505005082316501800150
8105005505082418002100300
8115506005082521002400300
8126006606082624002800400
8136607206082728003200400
8147207806082832004000800

Share and Cite

MDPI and ACS Style

Cai, J.; Pang, Y.; Wang, H.; Wang, Y. Even Dispersion Design for a Compact Linear Loudspeaker Array with Adaptive Genetic Algorithm. Appl. Sci. 2020, 10, 227. https://doi.org/10.3390/app10010227

AMA Style

Cai J, Pang Y, Wang H, Wang Y. Even Dispersion Design for a Compact Linear Loudspeaker Array with Adaptive Genetic Algorithm. Applied Sciences. 2020; 10(1):227. https://doi.org/10.3390/app10010227

Chicago/Turabian Style

Cai, Juanjuan, Yongqiang Pang, Hui Wang, and Yutian Wang. 2020. "Even Dispersion Design for a Compact Linear Loudspeaker Array with Adaptive Genetic Algorithm" Applied Sciences 10, no. 1: 227. https://doi.org/10.3390/app10010227

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop