1. Introduction
The study of past soundscapes and the acoustics of historical spaces has long fascinated researchers across disciplines, from archaeology and architecture to musicology and digital humanities, leading to what is now commonly identified as “archaeoacoustics”. The core aspect of this research is the acknowledgment of the role of acoustics in the definition of the cultural heritage of the building or, more generally, of the site under investigation, despite its “intangible” nature [
1,
2,
3,
4].
Archaeoacoustics, as an interdisciplinary domain, integrates principles from acoustical engineering, virtual reconstruction, and heritage studies to revive lost sonic environments. The field has evolved significantly in recent decades, benefiting from advancements in digital modeling and immersive technologies that allow researchers to simulate the acoustic behavior of a number of past or vanished structures [
5,
6,
7,
8,
9,
10]. One of the primary goals of such studies is to evaluate how architecture influenced auditory experiences, whether in the grand halls of ancient palaces, the reverberant interiors of sacred temples, or performance spaces designed for specific acoustic effects, as well as pointing out the interplay of other social and cultural factors.
The literature on acoustic reconstruction of spaces reveals a variety of methodologies employed to approximate the sound properties, the major concern being represented by the significant uncertainties that may characterize the spaces that are being reconstructed. The use of textual descriptions, artistic representations, and analogies with existing structures to infer possible acoustic behaviors are among the most frequent strategies. However, given the difficulties often associated with creating and calibrating acoustic models of existing spaces [
11], it has been proposed by several researchers to adopt rigorous practices in order to create and validate virtual acoustic models of non-existing spaces [
12], while collecting the greatest possible amount of acoustic data about materials and objects that could contribute to improved accuracy in simulations [
13,
14,
15].
Thus, even though advances in immersive technologies as virtual and augmented reality hold the potential to further enhance the accuracy and experiential quality of reconstructions [
4], when dealing with archaeoacoustic reconstructions, it is important to take results “cum grano salis”, remembering the limitations related to the absence of precise architectural data, use of different materials (or in a different conservation state), etc. Additionally, the subjective nature of human auditory perception means that reconstructed soundscapes, though grounded in scientific principles, still involve interpretive elements. The role of cultural context further complicates the analysis, as the meaning and perception of sound vary across societies and historical periods.
In recent years, research on the relationship between sound, architecture, and ephemeral installations in the Renaissance has gained increasing attention. Studies such as Sound and Space in Renaissance Venice by Howard and Moretti [
16] paved the way to other studies on the relationship between art, architecture and sound [
17,
18] that highlighted the significance of sound in the perception of artistic and architectural spaces as well as the positive effect of tapestries in large reverberant churches [
19]. Within this context, our study focuses on the role of tapestries in the Sistine Chapel, not only in terms of their visual and decorative impact but also in their acoustic influence, starting from a calibrated acoustic model of the Chapel in its current state and simulating the effects of modifications by means of geometrical acoustic modeling. The aim of this paper is to understand whether such modifications induced audible differences that might have stimulated a discussion on their acoustic role in the space.
2. Historical Outline
The Sistine Chapel is the result of the restoration of the old Cappella Magna (or “Maggiore”) that Pope Sixtus IV (from whom it derived its name) promoted between 1473 and 1481 [
20]. The Chapel was conceived from the outset as a highly decorative space intended for major liturgical celebrations and functionary activities of the Pope and his household (the Papal Chapel or Papal Court), in addition to the Papal Conclave to elect the new Pope. At the time of Sixtus IV the Papal Court comprised about 200 people, including clerics, officials of the Vatican and distinguished laity. The official gatherings of the Court were distributed throughout the year according to the Papal Calendar, and among the 50 occasions prescribed, about half of them were masses that could be held in a smaller, less public space, for which the Cappella Maggiore was used before it was rebuilt as the Sistine Chapel.
The earliest decorative campaign, executed between 1481 and 1482, involved a team of leading Florentine and Umbrian painters of the time, including Sandro Botticelli, Domenico Ghirlandaio, Pietro Perugino, and Cosimo Rosselli. These artists, working in a coordinated yet stylistically diverse manner, created the lateral wall frescoes depicting scenes from the lives of Moses and Christ, thereby establishing a visual parallel between the Old and New Testaments as a legitimizing framework for papal authority.
The chapel’s decorative scheme underwent its most transformative phase in the early 16th century under Pope Julius II, who, seeking to assert the spiritual and cultural supremacy of the papacy, commissioned Michelangelo Buonarroti to repaint the ceiling. From 1508 to 1512, Michelangelo worked largely alone, replacing the original star-studded ceiling (
Figure 1a) with a monumental fresco cycle that redefined the scale, complexity, and theological depth of Renaissance art.
A subsequent commission in the 1530s, initiated by Pope Clement VII and completed under Pope Paul III, saw Michelangelo return to the chapel to paint The Last Judgment on the altar wall. Executed between 1536 and 1541, this colossal composition reflected the tumultuous spiritual climate of the Counter-Reformation and Michelangelo’s own evolving religious sensibility.
The Sistine Chapel also included sculpted architectural elements. The screen or
cancellata made of marble by Mino da Fiesole, Andrea Bregno, and Giovanni Dalmata was intended, just like the jubé typically found in late medieval churches, and like the iconostasis of the orthodox churches, to divide the chapel into two parts, one for the members of the Papal Chapel close to the presbytery area and the other for the worshippers. The position of the
cancellata, originally dividing the space into two halves, was later moved by 5 m to accommodate the increasing number of people belonging to the Papal Court. The sculptors of the
cancellata also provided the
cantoria or projecting choir gallery (
Figure 1b).
In fact, the history of this space is strictly interwoven with that of the Sistine Chapel Choir (Cappella Musicale Pontificia), which has historically played a central role in its liturgical function. Established long before the chapel’s construction, the choir gained particular prominence under Sixtus IV, who reorganized it as a permanent papal institution (making it its personal choir). The choir became a prominent musical institution and some of the most influential composers, like Josquin des Prez (from 1486 to 1494), Giovanni Pierluigi da Palestrina (around 1555), and Gregorio Allegri (from 1629 to 1652), worked and composed for the choir, contributing to the sacred and ceremonial identity of the space. As a consequence, it is likely that special attention was given to the acoustics of the Chapel.
An aspect that has only recently been considered in depth is the use of tapestries, which were hung according to the liturgical calendar to adapt the space to different celebrations. Pope Leo X was one of the principal proponents of this practice, commissioning Raphael to design a cycle of tapestries depicting scenes from the lives of Saint Peter and Saint Paul [
21]. These were woven in Brussels by Pieter van Aelst, one of the most esteemed tapestry makers of the period and first displayed in 1519. Tapestries in the Sistine Chapel served as an ephemeral “skin” for the building, altering not only its visual appearance but also the spatial perception of those within it. The installations were carefully scheduled for specific occasions. During Holy Week, for instance, the chapel was stripped of all textile decorations, emphasizing its stark solemnity. In contrast, on significant feast days such as Christmas Eve in 1513, golden tapestries from Saint Peter’s Basilica were brought in to enhance the space’s magnificence.
Beyond their decorative purpose, tapestries played an essential role in the chapel’s acoustic environment. The architectural features of the Sistine Chapel, characterized by its elongated shape and high vaulted ceiling, caused a long reverberation time that can obscure speech intelligibility and affect polyphonic music performances. The issue was well known in Leo X’s time, and various configurations were explored to improve sound quality within the sacred space, also in consideration of the introduction of polyphonic music in an increasing number of sacred celebrations, as decreed by Leo X soon after his election [
22].
One of the members of the Pope’s court, Evangelista Tarascono, suggested removing the tapestries to allow sound to reflect more clearly off the bare walls, making it “sharper and more pleasing” [
18]. However, as a demonstration that excessively long reverberation negatively affects clarity of singing, tapestries continued to be used, often indulging in richer versions including golden and silver threads.
4. Results
The results showed (
Figure 5) that, as expected, the presence of the tapestries determined a significant variation under unoccupied conditions as mid-frequency reverberation time decreased from 7.4 s to 5.1 s, with a relative decrease of more than 30%. Under fully occupied conditions mid frequency reverberation time dropped from 4.1 s to 3.4 s, with a relative variation of about 20%, after application of the tapestries. Thus, in both cases, considering that the JND for reverberation time is 5%, variations could be potentially perceived by attentive listeners that used to attend ceremonies requiring different wall installations. At low frequencies, given the low sound absorption of the tapestries, variations in reverberation time were not so big (around 10%).
Thus, it can be concluded that the presence of tapestries compared to the unoccupied room introduced a clearly detectable variation in reverberation time and, given the frequency distribution of the added absorption, tapestries made it drier and amplified bass sounds. With a congregation in the room, variations due to tapestries were still audible, but less dramatic than they were in the first case. In terms of reverberation, the presence of the tapestries in the unoccupied room provided acoustical conditions that were more similar to those in the occupied room (as seen in
Figure 5, where values were almost overlapped in the high-frequency range), likely making rehearsals and other more private celebrations more acoustically similar to events with increased occupancy.
In order to understand the effects of the tapestries and audience in different listening positions, the values for individual source–receiver positions are discussed.
Figure 6 shows that for the selected parameters (EDT, G, C
80, J
LF, and STI), whose numerical values largely depend on the intensity and time distribution of the reflections, the variations due to different configurations mostly reflected the change in reverberation time, with a few interesting anomalies. Early decay time (EDT) followed an overall change that reflected the spatially averaged reverberation time variations appearing in the same frequency bands (500 and 1000 Hz), resulting from the different configurations already discussed in
Figure 5. Despite EDT being a more position dependent parameter, in the present case, it showed only small point-by-point variations. Sound strength (G), reflecting the contribution that the room gives to the loudness of the sound, was highest in the unoccupied room across the considered conditions, in agreement with the theoretical behavior that assumes a logarithmic dependance on the ratio of the reverberation time to the room volume. Thus, if room volume remains the same, as in this case, G is expected to increase by 3 dB when reverberation time doubles. The effect of tapestries is evident when compared to the empty configuration, with the important effect of producing conditions that are more similar to the occupied space (as shown in
Figure 6b, where lines tend to overlap). Occupation plus tapestries brings values closer to the optimal range (which is between 4 and 8 dB [
38]) and, differing on average by more than 1 dB from the simply occupied condition, suggests a clearly audible change. Point-by-point variations are observed among receivers, mostly as a consequence of different source–receiver distance (e.g., combinations P_R4 and P_R5) and of screening elements like the
cancellata (e.g., combinations P_R3 and P_R4). When the source is at the
cantoria, the G
mid is much more uniform across receiver positions and the presence of the tapestries (either in unoccupied or occupied conditions) induces an average 2 dB difference that would be clearly audible in most of the positions.
In terms of clarity (C
80), when the source was in the presbytery, receivers 03 and 06 in the “public” area showed the lowest values, possibly because of the
cancellata preventing early reflection to contribute. The presence of tapestries and occupancy improved the values a little, but clarity remained low. The major change was observed when tapestries were added to the empty configuration (blue and orange lines in
Figure 6c), whereas their addition to the occupied configuration produced only a minor effect (gray and yellow lines in
Figure 6c) except at a few specific receiver positions with source C. Conversely, in the “private” area, the values were higher and, when tapestries and occupation were added, were closer to 0 dB, particularly for receivers closer to the source. When the source was at the
cantoria, C
80 values were more uniformly distributed in space and showed differences as significant as for the other source position when tapestries were added to the empty room, with the only exception of receiver positions 4 and 5. Across the considered listening positions, when the source is located at the presbytery, the Pope’s position (R1) benefits from the greatest gain in clarity due to wall tapestries and audience. When the source is placed at the
cantoria, the clarity increase at R1 remains significant, although not markedly higher than at the other listening positions. In all cases the role of the tapestries was significant, allowing us to obtain acoustic conditions less dependent on the occupancy.
Lateral energy fraction (J
LF) showed significant point-by-point variations, mostly due to the relative position of sources and receivers with reference to reflecting walls, with little variations as a function of occupancy, as already observed elsewhere [
11,
31], being the parameter independent of reverberation time variations. Conversely, with few exceptions, the presence of the tapestries caused a predictable reduction in the parameter, with variations that were more audible (being higher than JND) at certain combinations (e.g., P_R2 or C_R1), where strong lateral reflections were attenuated by the increased wall absorption.
Finally, with reference to STI, the trend was very similar to what was observed for C80, with milder point-by-point variations resulting from the nature of the parameter (being less sensitive than clarity to small variations in the reflections sequence). The addition of tapestries caused a clear and audible improvement (corresponding to a 2JND increase) with reference to the empty room, and a smaller but still audible improvement (corresponding to slightly higher increase than 1 JND) under occupied conditions. The receivers that showed the largest improvements due to tapestries were R6 (particularly under empty conditions) and R5 (in both cases), independent of source position.
In order to better understand the spatial distribution of the parameters and the effect due to tapestries, C
80 and STI were mapped with reference to the source located in the presbytery and the source located at the
cantoria. The results, shown in
Figure 7 and
Figure 8, confirmed what was observed previously, providing further elements of discussion. In particular, for C
80, when the source was in the private area, a clearly bimodal distribution of values appeared, with the
cancellata decreasing the early reflections in the area immediately opposite to the source, despite its partial transparency (of the upper part), and the back wall providing extra reflections that contributed to raise the parameter’s values. This distribution appears similarly in all the occupancy/tapestries combinations, with a comparable span of about 25 dB from maximum to minimum in each individual configuration, while showing a significant upward shift in absolute values depending on the surface arrangements, which demonstrated that the presence of the tapestries contributed to improving the listening conditions in all cases.
When the source moved to the raised
cantoria, (above the
cancellata), the overall span of the parameters’ values reduced to about 15 dB with no bimodal behavior, although the distribution was obviously skewed towards the lower values as an obvious consequence of the prevalence of diffuse field conditions as the distance from the source grew. Again, increasing the absorption in the room shifted the values upward, with tapestries contributing to increase the mean values by about 2 dB with reference to the empty conditions, and by about 1.3 dB with reference to the occupied conditions, with most of the listeners laying in the range from −5 dB to 0 dB, which is optimal for choral singing and music [
24].
When the speech transmission index (STI) was considered (
Figure 8), although C
80 and STI maps look similar in general, STI maps still present a more complex distribution—particularly with the source in the presbytery, where speech mattered the most—likely due to its multi-frequency nature (being a weighted average of all frequency bands). In fact, the distribution of the values was far from being “normal” and local peaks appeared, as observed in the histograms of
Figure 8, representing a generally better intelligibility in the private part of the chapel, while in the public part it was generally poorer and only the listeners close to the back wall were favored by early reflections from the back wall. In absolute terms, increasing the absorption STI moves from a range spanning from 0.22 to 0.44 under empty conditions (with most of the listeners perceiving a “bad” intelligibility according to the descriptors of the subjective perception given in
Appendix A), to a range spanning from 0.35 to 0.58 under occupied conditions with tapestries (with all of the listeners in the private area having “fair” intelligibility). The interesting part is that the presence of the tapestries contributed to improving the overall conditions in all the cases, particularly in the empty configuration where an average improvement of 0.07 appeared, suggesting that it was audible (being higher than 2 JNDs for STI), and that, for private ceremonies with a limited audience, it could make a difference. However, a positive variation, although less dramatic, was observed also under occupied conditions, allowing all the receivers in the private area to experience at least a “fair” intelligibility.
The analysis of STI values when the source was moved to the cantoria, although less interesting in practical terms (as the cantoria was not used to deliver speeches or sermons), showed acoustical interest because the combined effect of the raised position (resulting in longer minimum distances) and location at the center of the room (resulting in shorter maximum distances), yielded a narrower distribution of values, although with reduced absolute values. However, the positive contribution of the tapestries was again clearly audible.
5. Discussion
The study of the acoustics of Sistine Chapel at the time of Leo X, with an emphasis on the role of Raphael’s tapestries to modify listening conditions, pointed out that with reference to speech-oriented activities, like gatherings and masses for the Papal Court, their positive effect was certainly appreciated. In fact, the addition of tapestries contributed to reducing reverberation time in the room, significantly narrowing the range of variation as a function of the occupancy (that, at mid-frequencies, dropped from 7.4 s for an unoccupied room to 4.1 s for an occupied room to 5.1 s for an unoccupied room and, finally, to 3.4 s for an occupied room). Given the height of the space, its regular shape, and the fact that most of the absorbing materials were located at floor level, the absorption was slightly less effective than it would have been in ideally diffuse conditions (as predicted by simple formulas like Sabine’s or Eyring’s). Nonetheless, as shown, it was enough to be aurally noticeable by an attentive listener. Thanks to tapestries, in the “private” area reserved for the Papal Court, the listening conditions improved significantly as indicated by the STI analysis. In fact, with the unoccupied chapel, without tapestries, only 18% of the private area benefited of at least a “fair” intelligibility, while with tapestries the percentage increased to 38%. Under fully occupied conditions, the private area fraction with at least “fair” intelligibility raised to 54%, but thanks to the tapestries, 95% of the area was above the 0.45 threshold, and 20% was above 0.6, suggesting that verbal communication could take place, with fair or even good intelligibility, provided that background noise was limited. In fact, taking advantage of simulations, it was demonstrated that an increase by 5 dB in the noise profile induced an average drop of 0.04 in STI, basically nullifying the advantage resulting from tapestries installation unless the speaker raised its voice to compensate. Considering that the natural loudness “gain” that human voice can achieve when moving from a “natural” to a “loud” emission level is approximately 12 dB, Leo X’s request for “stationing officials” to discourage chattering during celebrations, thereby limiting background noise, becomes understandable.
With reference to choral singing, the analysis of results when the source was at the
cantoria demonstrates the beneficial (and clearly audible) effect of tapestries. In fact, thanks to tapestries, EDT under fully occupied conditions was uniformly decreased to around 3 s at mid frequencies, which is perfectly acceptable for choral music. Sound strength under occupied conditions was further decreased by the presence of tapestries to an average of 7.7 dB, against 8.9 dB without them, which is positive to reduce the sensation of being overwhelmed by music and singing as it happens in small and reverberant rooms. In terms of clarity, the presence of tapestries played a relevant role both in unoccupied conditions (contributing to making the sound more similar to the occupied conditions, which is useful for rehearsal purposes), and in occupied conditions (contributing to increasing clarity within a range from −6 dB to 0 dB, which is optimal for choral performances in worship spaces [
24]). The sense of spaciousness, as revealed by the analysis of lateral fraction, is scarcely affected by tapestries (and by occupancy), confirming what was already observed in other studies [
11,
31]. Finally, unlike what happened for speech, given the position of the source, the previously described advantages were equally available for both the private and public part of the audience.
In order to allow a practical comparison of the previously described acoustic differences based on auditory stimuli, auralizations of the different configurations as perceived at the Pope seat (Rec. 01) when sources were in the presbytery and at the
cantoria were created. For auralization purposes, in order to allow a reproduction of audio excerpts without using special equipment, stereophonic impulse responses (in X-Y configuration) generated by the simulation software were used, in combination with anechoic recordings of an excerpt of Nicene Creed read in Latin and Josquin Des Prez’s Missa Pange Lingua sung by faculty members and students from Southern Methodist University’s (SMU) Division of Music. Josquin’s Missa Pange Lingua was selected after consultation with musicologists as being a typical example of the polyphonic Mass performed in Leo X’s times. The auralized material can be found in the
Supplementary Materials.