1. Introduction
The transmission of sound between two points in a room is formally represented by the impulse response of the transmission path. This impulse response is composed of direct sound and the numerous repetitions of the original sound impulse caused by reflections of the sound signal at the boundaries of the room. With respect to the direct sound, each of these reflections is specified by its level and its time delay. Since our ear is also sensitive to the direction of incidence of sound, this description must be completed by indicating the direction from which each reflection reaches the receiving point. There may be differences in the spectrum, since the various components of the impulse response, strictly speaking, are not exact replicas of the original sound signal due to the frequency-dependent reflection coefficients of the wall faces.
Two subjective effects of the reflected components of sound are known: the first occurring, under certain conditions, when said reflection can become an “echo” that is heard as a signal repeated over time that appears to come, subjectively, from another place than where the sound is located. This effect is due to the numerous acoustic signals delayed in time through a reflection on a surface, which reach the listener with sufficient energy compared to other sounds or reflections. This phenomenon can frequently be observed outdoors with sound reflections off the walls of houses or at the edge of forests. In closed spaces, these experiences are less common, since echoes therein are fortunately usually masked by the reverberation of the room. Whether a reflection will become an echo or not depends on its delay with respect to the direct sound, its relative level, the nature of the sound signal, and the presence of other reflections that eventually mask the reflection in question [
1].
The other subjective effect refers to humans’ ability to locate sound sources in closed spaces. Obviously, it is the sound signal that reaches the listener first that subjectively determines the direction from which the sound comes (Cremer, law of the first wave front [
2]).
The enormous amount of information that characterises the impulse response must be condensed into certain parameters that summarise the situation in a subjectively meaningful way. This is carried out primarily by comparing the energies contained in the different parts of the impulse response. This procedure is not only an expedient dictated by practical limitations but is also justified by the limited ability of human hearing to distinguish between all the countless repeated sound signals.
A reflection is not subjectively perceived as something separate from direct sound, as long as its delay and relative strength do not exceed certain limits. Its only effect is to make the sound source appear somewhat larger and increase the apparent loudness of the direct sound. Since these “early reflections” support the sound source, they are considered useful.
Reflections that reach the listener with a longer delay are perceived as echoes in unfavourable cases; in favourable cases, they contribute to the reverberation of the room. In principle, any reverberation harms the intelligibility of speech because it blurs its temporal structure and mixes the spectral characteristics of successive phonemes or syllables. Therefore, “late reflections” are considered harmful from the perspective of speech transmission. From our everyday experience with outdoor echoes, but even more precisely from the results of Haas [
3] and similar findings, it can be concluded that the critical delay time separating useful reflections from harmful ones lies in the range from 50 to 100 ms.
Typically, early reflections arriving within 50 ms relative to the direct sound improve the signal received by the listener. However, any early reflection that arrives after 50 ms can be perceived as a separate sound, with a disturbing effect. This is called an echo and gives the impression that the sound is repeated. To perceive such an echo, the total distance travelled by the reflected sound must be at least 17 m, which is derived by multiplying the speed of sound by the delay of 50 ms.
It is very unlikely for an echo to occur in a closed room, with a greater probability in positions far from the sound source (lower direct sound) and close to a reflective surface, as verified by Vera-Guarinos et al. [
4]. These authors, using a 2D geometric tool (GeoGebra), analyse the reflections of two pairs of parallelepiped walls, where they confirm that the most probable echo risk zone is always the area farthest from the source and closest to the rear wall. As the size of the enclosure increases, and the greatest risk of echo formation occurs in places far from the emitter and close to the reflective rear wall. The same conclusions are obtained when the floor and ceiling are included and analysed with Catt Acoustic. This risk of echo increases when focalisation occurs, such as with concave curved surfaces with cylindrical or elliptical geometry [
5,
6], as found in bullrings and Roman amphitheatres [
7]. Likewise, in an architectural space without a roof, the density of reflections is lower, and Rindel [
8] points out that the existence of echoes is therefore more possible.
Based on the findings of Haas, a criterion was proposed by Bolt and Doak [
9], represented by constant percentage disturbance contours of the amplitude of reflections as a function of delay time relative to the direct sound.
Furthermore, Dietsch and Kraak [
10] have proposed an echo detection criterion based on the time of the centre of gravity of the impulse response. The criteria have been investigated for both speech and music. It is assumed that the two opposing physical magnitudes of the delayed reflection, the sound pressure amplitude
p and the delay time
t, are linked by being a constant product with the acoustic pressure with an exponent
n or weighting factor to be determined. One objective criterion for room acoustics that adequately links both physical magnitudes is the centre of gravity time.
The centre of gravity time of an impulse response (
p(
t)) normalised to its total energy is
where
n = 2 for the traditional centre time formula, and
τ = ∞ in the upper limit of the integral, Kürer [
11]. For this criterion, a suitable value of n was found to be
n = 2/3 for speech and
n = 1 for music.
Lastly, the echo criterion (Echo Criterion EK) is expressed by the relationship between the time of the strong reflection
τ and the temporal duration of the reflection Δ
τ as
For speech, n = 2/3 and Δτ = 9 ms. For music, n = 1 and Δτ = 14 ms.
A disturbing echo for speech appears if EK exceeds a critical value of 1 at a time delay of s > 50 ms, then there is a 50% chance that a listener will detect a disturbing echo for the spoken word. If
EK exceeds a critical value of 1.8 at a time delay of
s > 80 ms, then there is a 50% chance that a listener will detect a disturbing echo for light music, while with “heavy” or “dense” music, this value is definitely higher. The maximum value of
EK(s) after 50 or 80 ms is the Dietsch echo parameter for speech and music, respectively.
Table 1 summarises all the information presented.
Through simulated and auralised impulsive responses to music and speech, listening tests are audited in order to indicate the existence of strong echoes and reflections in concert halls and auditoriums. In this way, Lovstad [
12] prepares suitable sets of impulsive responses generated by Odeon with a simple echo and three successive echoes for the spoken word as well as several pieces with various musical instruments and examines the correlation between the results of the tests and the objective criteria established based on the impulsive responses generated. Said author uses a 50 ms long Hann filter to convolve the Hann window with the energy of the impulse response by analysing its peaks in relation to the reverberation decay, and finds a good correspondence between the calculated values of the new criterion and the results of the listening tests.
Furthermore, Wargert [
13], in order to study strong reflections that can be perceived as echoes, proposes two parameters obtained from the impulse response of a room (RIR): the slope ratio, which is the instantaneous slope of the energy decay curve, normalised with its average slope. This parameter is defined by Jeong et al. [
14] to indicate when an impulse response has become diffuse. In this context the instantaneous slope corresponds to the individual behaviour of the reflection of interest, while the average expected behaviour of the adjacent reflections is represented by the average slope. The other parameter that completes the description of reflection in the impulse response is the total energy drop at the time of strong reflection, ΔEDC. This author employs these two proposed parameters and the EK proposed by Dietsch and Kraak according to the results of listening tests of virtual impulse responses for different volumes, delay times, and reverberation times. The results show that ΔEDC is a good parameter for detecting strong reflections that can be perceived as an echo.
Although the bullfighting spectacle is visual, the dialogue, cheers, applause, silences, and musical interpretations of bugle and timpani, trumpets (Lisbon), and marching bands are essential for each space and its idiosyncrasies to be understood. In this work, the EK parameter is studied in the receivers in the public area in four bullrings and in a Roman amphitheatre, since spaces without roofs run the risk of the presence of echoes [
8]. Another important facet of the problem related to echoes involves the experience of the interpreters themselves. For an orchestra to collaborate successfully, musicians need to hear themselves and their colleagues clearly. Accordingly, this study has also studied positions in which both the source and the receiver are in the ring or in the arena of the Roman amphitheatre. In particular, this acoustic phenomenon has also been studied in a covered bullring with a mobile roof, since several authors point out a case in a covered bullring in Spain where a flutter echo clearly appears in the ring [
4,
15]. Although this parameter, EK, is not found in the ISO standard, its ability to predict echoes is widely accepted, and is included in the best software applications that deal with the measurement of room acoustics.
3. Materials
This section briefly describes the spaces studied and where the receivers and the sound source are located: three open bullrings (Real Maestranza de Caballería de Ronda (Malaga), Real Maestranza de Caballería de Sevilla, and Las Ventas de Madrid, all in Spain); a fourth with a mobile roof measured with the roof closed and again with a segment of the roof open (Campo Pequeno de Lisboa, Portugal); and a Roman amphitheatre (Anfiteatro Romano de Itálica, Santiponce, Seville, Spain). The enclosures are described in the same order in which the acoustic measurements are carried out. The multi-source nature of the bullfighting spectacle makes it advisable to have a good number of source–receiver combinations to understand its sound field. The choice of sound-source positions is based on the locations of natural sound sources in bullfighting. The centre and side of the ring, S1 and S2 respectively, where most of the bullfighting takes place, are associated with the voice of the bullfighter, picador, and the other bullfighter assistants. The source position of a musical nature, S3, is associated with the box of the music band, while that of musical instruments, S4, is located where a change in the stage of the bullfight is announced: with bugle, and timpani calls in Spain, and with trumpet calls in Portugal. Lastly, for the ambient sound of the public, the S5 (voice) is positioned in a significant segment of each bullring. In Ronda, with only 4 sound-source positions, S4 is the ambient sound, and in Lisbon the S5 position has not been studied due to timing problems. More details can be found regarding this issue in Martín-Castizo et al. [
19,
20]. For the Roman amphitheatre of Italica, the three positions of the sound source are for speech and music.
3.1. The Bullring of the Real Maestranza de Caballería de Ronda
The Royal Maestranza de Caballería of Ronda (RMCR), created in the 16th century for the defence of the city and its territory, is today a non-profit charitable organisation dedicated to cultural promotion and the teaching of horsemanship [
22]. Among its assets, the bullring stands out; located within the historical centre of Ronda, it is one of the oldest in Spain and was declared a Site of Cultural Interest in 1993 in the monument category.
Construction on the building began in 1780, and after a temporary suspension, it was completed in 1785. The building was designed by architect Martín de Aldehuela. The nobility of its architectural design, with its double arcade and the absence of open grandstands, has more the spirit of a cloister than a performance venue and is reminiscent of the circular courtyard of the famous Palace of Charles V in the Alhambra in Granada. Its 66-metre-diameter arena is considered the widest in the world and is surrounded by a passageway formed by two stone rings. The continuous galleries have five rows of two-storey stands, with 136 columns forming 68 Tuscan column arches. Except for the Royal Box, covered with a gabled Arabic tile roof, the elegance of its interior remains unmatched by any other bullring worldwide.
The current capacity of the bullring is approximately 4890 spectators (
Figure 1).
Figure 2 depicts the four sound-source positions studied in this bullring.
Table 2 shows the 48 source–receiver combinations that have been recorded, including those combinations that lack direct sound depending on the positions of the sound sources, which often happens in certain stands when the sound sources are out of sight of the spectator.
In this bullring, due to the columns of the arches of the galleries, there are 9 receivers without direct sound, as can be observed in
Table 2. The 48 combinations have only been employed to determine the reverberation times; for the rest of the acoustic parameters, including the EK parameter of Dietsch and Kraak, only the 39 combinations with direct sound are considered.
3.2. The Bullring of the Real Maestranza de Caballería de Sevilla
The architecture of the bullring of Seville describes, both externally and internally, the shape of an irregular polygon, consisting of 30 unequal sides. The external appearance is in the Late Baroque style, pointing to Neoclassicism, in which the Puerta del Príncipe predominates, flanked by two towers, in contrast to the rest of the façade, which is mostly whitewashed. The ovoid-shaped arena, a product of the multiple historical vicissitudes suffered by the architectural complex, is known for its vast surface area, with diameters ranging between 63.2 and 57.9 metres, and for its albero sand, a yellowish-ochre soil that contributes to its luminosity which makes it unique. Its arena is not flat, but slightly conical (
Figure 3).
The bullring of the Real Maestranza de Caballería of Seville stands out precisely for not being perfectly circular (
Figure 3), since, during its prolonged construction from 1740 to 1881, it was forced to adjust to the surrounding dwellings and resembles a large house where the bullring is the central courtyard. With a total capacity of 11,500 seats, the public is accommodated in two areas: 12 segments of stalls, and grandstands. Above this seating area, there is an archway with an irregular layout, like the arena, with marble columns, protected with gabled Arabic tile roofs [
23]. These are made up of 117 numbered arches, plus the richly ornate Royal box.
Figure 4 displays the floor plan of the bullring, which shows the positions of the source and receivers, together with its section. In all the 75 source–receiver combinations that have been utilised to record the impulse responses, there is direct sound.
3.3. Las Ventas Bullring
This bullring was designed with the intention of building a monument on the same scale as that of the city, and hence the volume, the portal, and the antechamber of the bullring take on truly significant dimensions [
24]. Construction began in 1919 (architect José Espelius Anduaga) and was completed in 1931 (architect Manuel Muñoz Monasterio) (
Figure 5a).
This constitutes the largest bullring in Spain, built in the Neo-Mudéjar style and with a capacity for 23,798 spectators. It is the third largest bullring in the world, after those of Mexico City (CDMX, Mexico) and Valencia (Carabobo, Venezuela). It is also the second largest in Spain in terms of ring diameter, at 61.5 m, after the Ronda bullring (Malaga, Spain) [
25].
This bullring has four floors of circular galleries and five projecting turrets on its exterior façade. It is distributed across ten stands (four shaded stands, numbered 1, 2, 9, and 10; two stands in partially direct sunlight and shaded areas, numbered 3 and 8; and four stands in direct sunlight, numbered 4, 5, 6, and 7). The Presidential boxes, those of the royal family, and other authorities, are located to the east, in a central place of the shaded stands. In turn, the segments are divided by height into stalls, boxes, lower grandstands, and upper grandstands (
Figure 5b).
In this bullring, 71 source–receiver combinations have been recorded, all of which have direct sound corresponding to 5 positions of the sound source and 20 receivers (for certain sound sources, not all the receivers have provided measurements in accordance with the authors’ recommendations [
19,
20]), and 6 more combinations with the sound source and receivers both in the ring (
Figure 6).
3.4. Campo Pequeno Bullring
The Campo Pequeno bullring (in Portuguese) in Lisbon began its construction in 1889 in the Neo-Mudejar style by the Portuguese architect António José Dias da Silva and was completed in 1892. It had a capacity of 8500 people before the 2000 renovation; its current capacity is 6698 seated spectators. The architect was inspired by the old (now demolished) Madrid bullring built by Emilio Rodríguez Ayuso (
Figure 7).
This new bullring was built from solid red exposed brick and underwent extensive renovation in 2000, whereby the brick was replaced with reinforced concrete. A shopping arcade was created in the basement, along with other similar spaces at street level (José Bruchy, Pedro Fidalgo, Filomena Vicente, and Lourenço Vicente). The greatest architectural alteration has been the installation of a mobile roof (João Goes Ferreira) that enables it to host various activities throughout the year (
Figure 8).
The diameter of its arena is 36.4 m, and the stalls are distributed into 7 sectors: the North half (1 even, 7, 6, and 5) and the South half (1 odd, 2, 3, 4 odd, and 4 even). The plaza is distributed from bottom to top into 14 rows of stalls (letters C-P), boxes (on two levels), and galleries (on two levels) (
Figure 9).
Figure 9 shows the 108 source–receiver combinations corresponding to 4 positions of the sound source and 18 receivers with the roof oculus closed, and 2 positions of the sound source and 18 receivers with the roof oculus partially open. Of these measured combinations, 24 correspond to both the sound source and receiver located in the ring that have been used. Only in one combination, S3-R5, is there no direct sound.
3.5. The Italica Amphitheatre
The case study of the Roman amphitheatre of Italica (
Figure 10) takes place as a type of precursor to the modern bullring and represents a connection to the theoretical perspective of the acoustics of performances held in the ancient buildings of theatres and circuses, studied by the authors of this work [
26]. The modern conception of the circular bullring in the 18th century retains many influences from the ancient amphitheatres: “the arena becomes a ring”. Regarding the time of its construction, there is a general agreement of opinions that advocate the Hadrianic period, approximately between the years 117 and 138, although of unknown architect; like most Roman constructions, given the magnitude, it is thought that several architects intervened therein. Its original appearance would have been that of a great structure of opus caementicium covered with marble slabs and stone from Tarifa. Its seating would have consisted of a podium and three caveas (ima, media, and summa), with a distribution of the amphitheatre tiers on three equal surfaces (podium and ima, media, and summa), the first two tiers of which are still in existence today.
Likewise, in the centre of the arena is the fossa bestiaria (animal pit), originally covered by a wooden platform supported by the eight brick pillars that still remain. In Hispania, the construction of the stands often took advantage of the characteristics of the rocky hills with regard to seating. In the case of Italica, the perfect example of this type of layout is found, built between two hills upon which the stands rest, with the longitudinal axis located in the intervening valley. In this way, the building’s distribution axes (E-W) are formed, which correspond to the main entrances to the arena: the Porta Triumphalis (to the east), through which the gladiatorial retinue entered; and the Porta Libitinensis (to the west), through which the victims of the combat were carried. The latter is in a better state of preservation and reveals the beginnings of the superposition of classical orders.
Regarding the capacity, this is estimated to have been approximately 35,000 spectators, which supports the hypothesis that this building was designed not only for the enjoyment of the Italians themselves, but also for a military garrison that possibly existed in the city, and for the inhabitants of nearby towns. As for the use of the amphitheatre, it is known that the spectacle buildings of the most prominent cities of Hispania were in use throughout the 4th century AD. In fact, several inscriptions found in various areas of the podium include the names of individuals whose seats were reserved: these inscriptions date back to the 4th and 5th centuries AD. It is therefore legitimate to affirm that this building remained in use throughout the 4th century AD. The Italica amphitheatre is the largest of its kind built in Hispania (dimensions: 156 × 134 m) and constitutes the largest in terms of capacity outside the Italian Peninsula. It is surpassed only by the Colosseum in Rome, the Capua amphitheatre, and the Puteoli amphitheatre (modern name, Pozzuoli). This is one of the most complex and ambitious projects of Roman architecture on the Iberian Peninsula.
Figure 11 and
Table 3 show the 49 source–receiver combinations used in this space. In the Roman amphitheatre of Italica, the 3 positions of the sound source and the 17 receivers took into account that its arena was designed as an oval in which the relationship between its 4 foci forms a Pythagorean triangle with a ratio of 3:4:5. Therefore, these sources were selected near the foci due to the presence of the fossa bestiaria, and to certain alignments on which the curves corresponding to the four sectors of the oval were drawn.
5. Conclusions
Roofless buildings are more likely to experience echoes than are closed venues. To study this acoustic phenomenon, this paper presents the results of the Dietsch–Kraak echo criterion parameter for speech and music in four bullrings on the Iberian Peninsula, each with a unique feature (Ronda without open-air grandstands, Seville with an ovoid-shaped arena, Las Ventas as the largest bullring in Spain, and Campo Pequeno with a movable roof), and a Roman amphitheatre from the time of the Hadrianic Hispania as a precursor to bullrings. The experimental results of the parameter and the association of the times and signals of strong reflections have been obtained from the impulse responses measured in the venues. In addition to processing the echo parameter calculations, those reflections with sufficient acoustic energy and delay relative to the direct sound to be interpreted as echoes are identified. This delay has also been analysed by using energy–time decay curves, which allow the determination of whether the delay is either a true echo or is so large and delayed that it can be associated with background noise. The results show that in neither the bullrings nor the amphitheatre is there a risk of echo for music in any of the public areas: the grandstands, covered galleries, and the cavea. Echoes for music have only been experimentally detected in two receivers in the bullring when the sound source is located in the centre of Las Ventas bullring. Likewise, there are few cases of echo in the public area for speech (of the 307 experimental recordings in the spectator area, only 11 recordings show a risk of echo for speech, see above), particularly in the grandstands of Las Ventas when the sound source is in the centre of the ring, due to the focusing of the sound from the great height of this bullring.
In many cases when the echo parameter indicated the possibility of echo problems, these problems were merely very late noise reflections rather than actual echoes, while in other cases, they were early reflections that reinforced the direct sound. Furthermore, in several cases, although echoes exist for the spoken word signal, they are from sound-source positions of a musical nature that occur in the bullrings: a music band, bugles and timpani, or trumpets.
In cases where both the sound source and the receiving microphone were located in the arena (Las Ventas, Campo Pequeno, and the Roman Amphitheatre according to experimental data, and Ronda and Las Ventas according to simulation data with the bullring empty), the results indicate that the arenas of the performing venues have large areas where echoes occur for speech and even for music, including when the ring is covered with a movable roof (Campo Pequeno, for example). These data are of interest in the case of theatrical and musical performances where both the sound source for speech or music and the audience are located in the arena. In the case when the performers and the public are both in the ring with the grandstands empty, then one echo mitigation strategy would be to delimit the performers and public area with rectangular absorbent walls.
The results of the acoustic simulation by receivers and from the colour maps of the speech echo parameter obtained by acoustic simulation throughout the bullring for each of the sound sources present (case of RMCR and Las Ventas) corroborate these conclusions, and that the areas of the ring and in the grandstands even worsen from the point of view of echo problems with the presence of the public throughout the bullring. This increase in echo areas of the bullrings, with the presence of spectators, in the grandstands and in the ring, suggests that the increased acoustic absorption by spectators results in a lower masking of the strong reflections from the reduced existing reverberation.