1. Introduction
Musical tension is widely associated with musical expression, particularly with expectations and musical emotions [
1,
2]. Most listeners are able to perceive musical tension subjectively, yet musical tension is difficult to be measured objectively [
3,
4,
5]. Increasing musical tension is commonly linked to building stress and impending climax, while decreasing tension is linked to relaxation or resolution [
5]. Musical tension is one of the most important and elusive concepts in Western tonal music. The perception of musical tension is speculated to arise from the combined interaction of several musical parameters, such as dynamics, melody, harmony, rhythm, tempo, and timbre, among others [
1,
5,
6,
7,
8,
9,
10,
11,
12]. Existing models acknowledge the multidimensional nature of musical tension, but there is no consensus on the individual contribution of each musical parameter to musical tension or how these parameters interact [
5,
13]. For example, dynamics is known to play an important role in tension with increasing loudness associated with building tension [
4,
6,
13] and certain timbral changes associated with modulation of tension [
11,
14,
15]. Similarly, it is generally agreed that a suspension is tense in relation to its resolution [
2] and so is dissonance in relation to consonance [
8,
10,
16,
17].
The role of harmony in musical tension has long been investigated [
1,
3,
4,
5,
8,
10,
12,
13]. Lerdahl and Krumhansl [
1] refer to 
tonal tension as a specific aspect of musical tension created by melodic and harmonic motion. For example, in the tonal context, it is widely agreed [
1,
4,
8,
10] that motion away from the tonic increases tension, whereas returning to the tonic provides resolution. This article focuses on the 
tonal tension of 
chord progressions, whose fundamental role within Western tonal music has been historically acknowledged [
18,
19,
20]. In this article, 
chord designates a set of three or more musical notes played simultaneously, and 
chord progression designates temporal sequences of chords [
10]. Specifically, we are interested in how the perception of tonal tension changes over time as a chord progression unfolds. We aim to create a computational model that captures the perceived changes in tonal tension associated with a particular chord progression.
Tonal tension can be explained from the perspective of music theory [
18,
20] or music psychology, which includes cognitive [
1,
4,
21,
22] and perceptual approaches [
16,
23,
24,
25]. In music theory, models of tonal tension traditionally result from the interaction of tonal indicators from multiple pitch levels, such as the 
tonal function a chord plays within a particular key, their 
sensory dissonance, and the 
horizontal motion between the voices in the chord sequence [
20,
22,
26,
27]. The cognitive approach emphasizes the importance of expectations resulting from previous exposure to the tonal system [
1,
4,
21,
22]. Thus, changes in tonal tension can be attributed to musical events breaking or fulfilling these expectations [
28]. Finally, musical concepts such as consonance and dissonance are commonly explained by association with perceptual phenomena [
25]. For example, 
roughness has long been used to explain tonal dissonance [
16,
23,
24]. Some authors have postulated that neither music theory nor music psychology alone is enough to satisfactorily explain tonal tension because these perspectives are conceptually complementary [
10,
29,
30]. For example, a chord can be very dissonant and have a stable tonal function. Ultimately, we aim to develop a computationally feasible model of tonal tension that leverages the complementary nature of the music psychology and the music theory perspectives by incorporating perceptual concepts into music theoretical measures [
31].
Tonal tension can be associated with specific musical events (i.e., a particular chord in a progression), short-term contexts (e.g., a preceding chord in a progression) or long-term contexts (e.g., an entire chord progression). Bigand et al. [
10] investigated the perceived musical tension created by the second chord in a three-chord progression and Navarro et al. [
32] furthers the previous design by considering two preceding chords in a three-chord progression. Later, Bigand and Parncutt [
33] investigated tonal tension in longer chord progressions to emphasize the distinction between the tonal functions of the chords and their psychoacoustical features. Lerdahl [
22] proposed a model of tonal tension based on his influential tonal pitch space [
22,
34], in which multiple explicit conditions expressed as distances are adopted as dimensions of his model. Furthermore, Lerdahl emphasized the importance of the long-term hierarchical context representation of functional dependencies across all preceding or succeeding chords in a progression [
22]. In an earlier publication [
35], a tension model which notably includes a tree representation to capture long-term functional dependencies in chord progressions based on Rohrmeier’s [
27] generative syntax model has been proposed  [
35].
To date, Lerdahl’s model of tonal tension is one of the most influential and validated theories of tonal tension [
1,
4,
5,
10,
33], combining music theory with our current understanding of music perception and cognition. Nevertheless, Lerdahl’s model is notoriously difficult to implement computationally [
36]. There is no automated procedure to calculate the hierarchical structure of the chord progressions and the parameters of the model rely mostly on ad hoc tabulated values driven from empirical musical theory experience. For example, the 
surface tension parameter assigns integers between 0 and 4 to musical properties of the chords (i.e., scale degree, inversion) and estimates the final value as the sum of the individual scores. Consequently, the surface tension scale of the model is not constructed to be linear. Ideally, if chord 
 has a surface tension value that is twice the surface tension of 
, 
 should contribute to add twice as much surface tension as 
, for example.
Recently, Herremans and Chew [
12] presented MorpheuS, a tonal tension model based on the 
spiral array [
37], incorporating algorithmic aspects more readily apt for computational implementation. Their model use measures for cloud diameter (dissonance), cloud momentum (tonal movement between chords), and tensile strain (distance to the key). Furthermore, long-term features, such as typical movements of tonal functions or common melodic sequences, are extrapolated from statistical analysis of style-specific music corpora. However, the statistical nature of the models is more appropriate to capture linear changes that happen over time, whereas long-term dependencies such as phrase structures require modeling the intrinsic harmonic hierarchy. In a preliminary investigation, Navarro et al. [
35] presented a computational model of tonal tension associated with each chord in a progression. It adapts Lerdahl’s tonal tension model [
1] into a fully automated framework. To this end, they compute the parameters of Lerdahl’s model in the Tonal Interval Space (TIS) [
38], a tonal pitch space where musical parameters can be captured as geometrical distances. The mathematical basis of the TIS promotes a computationally feasible model from which we can automatically compute musical information across multi-level pitch configurations such as chords and scales. Furthermore, the TIS has proven to encompass properties of musical theoretical value and of perceptual relevancy to create metrics that capture chord similarity [
39,
40,
41,
42]. The musical theoretical value stems from the adoption of the discrete Fourier transform (DFT) of pitch classes to represent tonal pitch in the TIS space. Pitch class is closely related to pitch chroma, which represents the group of all pitches associated with the same musical note in different octaves. For example, the pitch class C represents all the C notes in any octave. The properties derived from the mathematical model of the Fourier transform help to reflect empirical ratings of consonance between two pitches [
31,
39].
This article proposes a computational model of 
tonal tension profile of a chord progression that extends the previous work by Navarro-Cáceres et al. [
35]. This tension profile can be conceptualized as a continuous curve that captures changes in tonal tension over the temporal sequence of a chord progression. The tonal tension profile accounts for dependencies across immediately neighboring chords and the overall tonal hierarchies of the progression and is computed by connecting all the instantaneous tonal tension values given by the measure detailed in our previous work [
35]. The original contributions of our work are four-fold. First, our work advances a computational model that aims to capture the global tension profile of a chord progression beyond state-of-the-art models. Second, we tackle the challenging problem of automating the computation of long-term dependencies across chords in a progression as syntactic tree structures and by adopting the TIS in computing remaining local music properties. Third, the adoption of the TIS promotes a view from music theory and psychology, which has been shown to complementarily contribute to the perception of tonal tension [
10,
29,
30]. Fourth, we unpack the relative importance of each parameter inspired by Lerdahl’s [
1] in better predicting the tonal tension profile of a chord progression.
To evaluate our tonal tension profile model, we conducted two new experiments with contrasting approaches, whose designs were inspired by Farbood [
5]. Both experiments use twelve chord progressions with five to ten chords, six of those from compositions and six others created for this investigation. The first experiment (Experiment 1) used a slider to collect real-time responses of perceived changes of 
instantaneous tonal tension. We asked expert musicians to move the slider upwards or downwards as they listened to the chord progressions to reflect increasing or decreasing tonal tension, respectively. The second experiment (Experiment 2) is an online listening test that aims to assess the 
global perception of changes in tonal tension associated with chord progressions. We proposed five 
prototypical curves that represent 
global tonal tension profiles and then we asked the listeners to select the curve that best represents the overall perceived changes in tonal tension of twelve chord progressions. Finally, we compare the results of the proposed model with two representative state-of-the-art models of tonal tension, namely Lerdahl’s [
22] and MorpheuS [
12]. The comparison with Lerdahl’s [
22] model aims to explicitly assess how the representation space can affect the measurement of the 
tonal tension, whereas the comparison with MorpheuS [
12] highlights the impact of tonal space representations and different musical properties from short-term contexts in capturing tonal tension.
The remainder of this paper is structured as follows. 
Section 2 presents an overview of the proposed approach to model the tonal tension profile of chord progressions. Then, 
Section 3 explains the estimation of the 
instantaneous tonal tension in the TIS [
38], with a particular focus on an algorithmic procedure to (automatically) construct a tree that represents the hierarchical structure of a chord progression. Next, 
Section 4 describes two experiments we performed to evaluate the model. Finally, 
Section 5 presents the conclusions and the future work.
  2. Overview
In this section, we present an overview of the approach we propose to model the tonal tension profile of chord progressions. 
Figure 1 illustrates the four levels of the problem addressed in our work. From top to bottom, the 
global tension profile, the 
hierarchical structure, the 
chord progression, the 
model of tonal tension profile, and the 
instantaneous tonal tension profile. The entry point is the chord progression because the other levels depend on it. The 
global tension profile is represented by a 
prototypical curve that represents the changes in tonal tension over time experienced by the listener after the chord progression is played. In the case of 
Figure 1, the global tension profile captures rising tension followed by relaxation. The hierarchical structure is constructed from the chord progression. One of the contributions of this work is the description (see 
Section 3.6) of a set of 
analytical rules to construct a tree from the chord progression based on the 
generative procedure proposed by Rohrmeier [
27]. Between the chord progression and the instantaneous tonal tension profile, 
Figure 1 also shows a table with the values of the individual indicators of tonal tension, namely, 
tonal distances divided into 
tonal distance between chords, 
tonal distance from the key and the 
tonal distance from the tonal function; 
dissonance, 
voice leading and 
hierarchical tension. The final tonal tension associated with each chord in the progression is obtained as a linear combination of the model components. Finally, our proposed model fits a curve to the time-series of 
instantaneous tonal tension values to estimate the 
instantaneous tonal tension profile associated with the chord progression.
Figure 1 illustrates how to calculate the 
instantaneous tonal tension of the second chord (F Major). The measure of 
consonance only depends on the chord itself (highlighted with an ellipse in green), whereas the calculation of both 
tonal distance between two chords (red rectangle) and 
voice leading (orange rectangle) use a sliding window spanning two chords, the current (F Major), and the previous one. The measure of 
tonal distances from the key and from the tonal function uses the tonal context and 
hierarchical tension uses the tree that represents the 
hierarchical structure of the entire chord progression to compare the current chord with all its parents nodes. Finally, these six tonal indicators are linearly combined into the final 
instantaneous tonal tension. Then, the sliding window shifts forward by one chord and the 
instantaneous tonal tension value is estimated for the next chord. This process is repeated until the end of the chord progression is reached, resulting in a set of discrete 
instantaneous tonal tension estimations that comprise the 
instantaneous tonal tension profile. The aim of this work is to evaluate how well the 
instantaneous tension profile resulting from the model captures both the 
instantaneous tonal tension profile and the 
global tonal tension profile experienced by the listener. For such, we need to evaluate four components of the model, namely, the 
instantaneous tonal tension profile experienced by the listener, how it is associated with the 
instantaneous tonal tension profile predicted by the model, and the 
global tonal tension profile perceived by the listener and its association with the 
instantaneous tonal tension profile predicted by the model.
   Tonal Tension Indicators
We propose to model the instantaneous tonal tension M of a chord  in a progression P as a combination of the following four components:
- Tonal Distances (in  Figure 1- , the first three rows of the table (highlighted in red)); 
- Dissonance (in  Figure 1- , the fourth row of the table (highlighted in green)); 
- Voice Leading (i.e., melodic attraction) (in  Figure 1- , the fifth row of the table (highlighted in orange)), and; 
- Hierarchical tension (in  Figure 1- , the last row of the table (highlighted in blue)). 
Our proposal incorporates all four components into a fully automated computational model that does not require manual calculations due to the representation in the TIS (see 
Section 2). The 
tonal distances are key for the development of a model of tonal tension. In the TIS, we measure three properties: (i) perceptual similarity between two consecutive chords, (ii) perceptual similarity between the chord and the key of the chord progression, and (iii) perceptual similarity between the chord and the tonal function. This last property measures how the chord is related to the most important chords in the tonal music: the Tonic, Subdominant, and Dominant.
The dissonance measures the psychoacoustic dissonance of chords in a progression from the interaction among the vertical component notes of each chord (closely related to the psychoacoustic concept of roughness). In our work, surface dissonance results from measuring the Euclidean distance between the chord encoded in the TIS and the center of the space.
Voice leading captures the melodic attraction between consecutive chords per voice. Voice leading is a horizontal measure that estimates the tension of each note in a voice according to the previous note. In our work, the number of semitones and the relationship between the notes can affect the evaluation of the voices. Therefore, we measure the voice leading in the TIS as the distance of both notes in semitones and the perceptual similarity between the notes of the two consecutive chords.
Finally, 
hierarchical tension captures the tension of the multiple levels of the musical structure by adopting tree-based structures driven from functional harmonic categories (i.e., tonic, subdominant and dominant categories) in a similar fashion to the syntactic trees from early generative linguistics [
43]. We analyze the hierarchical structure by measuring the similarity between the chord we aim to evaluate all the parent chords in the tree. This process is illustrated in 
Figure 1 for the second chord in the progression, where the similarity between this chord and all the parent chords above it is calculated. The chords involved for this particular chord evaluated are indicated with arrows in blue.
Section 3 describes the calculation of 
M in detail. An important contribution of our work that is also explained in detail in 
Section 3 is the fully automated 
analytical rules to automatically extract a tree to represent the hierarchical structure of chord progressions based on Rorhrmeier’s 
generative rules.
   3. Instantaneous Tonal Tension in the Tonal Interval Space
Equation (
1) computes the tonal tension 
M of a chord 
 in a progression 
P as
      
      where 
 is the 
i-th-chord of the progression 
P, and 
 are constants that represent the weights of each tonal indicator. 
M is expressed as the linear combination of the tonal indicators detailed in 
Section 2 tonal distance (between chords) (
), 
tonal distance (from the tonal interval vector (TIV) that represents the key () (
), 
tonal distance (from the tonal function) (
), 
dissonance (
c), 
voice leading (
m), and 
hierarchical tension (
h). As illustrated in 
Figure 1, 
M estimates the 
instantaneous tonal tension of each chord 
 in a progression 
P, resulting in a time series 
, where 
m is the last chord in the progression 
P. 
M results from an adaptation of Lerdahl’s tonal indicators [
1] calculated in the Tonal Interval Space (TIS) [
31]. The next section briefly summarizes the most important features of the TIS. Refer to [
31] for further information. The implementation of the model can be found at 
https://github.com/merismeris/tonal-tension-TIS.
  3.1. Tonal Interval Space
The twelve pitches commonly used in Western tonal music can be represented by whole numbers from 0 to 11. Thus, the C major chord can be written as the row vector  and the C major key as . One inconvenience of this representation is that the number of pitches represented dictates the number of dimensions of the underlying space spanned by these vectors. In the previous example, the C major chord results in three dimensions while the C major scale results in seven. Therefore, this representation does not yield a natural way to compare different pitch configurations, such as chords and scales, for example. The chroma vector C is an alternative representation with twelve binary positions corresponding to the pitches comprising the chromatic scale arranged in ascending frequency. The value 1 in a specific position indicates the presence of that pitch, and 0 indicates absence. For example, the chroma vector representation of the C major chord is  and, similarly, for the C major key, .
An advantage of the chroma vector representation is that all vectors have twelve dimensions independently of the number of pitches present. However, distance measures commonly used in vector spaces such as the Euclidean distance do not capture musical properties of the pitches encoded as chroma vectors. Instead, Fourier analysis can be used to explore the harmonic relations between pitch classes represented as binary chroma vectors [
31,
39,
40,
41,
42,
44,
45,
46]. In particular, the TIS [
31] maps twelve-dimensional chroma vectors onto six-dimensional complex-valued Tonal Interval Vectors (TIVs) 
T with the DFT. The six dimensions are weighted by empirical ratings of dyads’ consonance to enhance the perceptual basis of the space. TIVs [
31] are useful to explore the properties of the tonal system because distances between TIVs capture perceptual proximity among pitch configurations and the magnitude of TIVs reflects the dissonance of the pitch configuration represented. Additionally, pitch configurations that share harmonic relations are mapped close together in the TIS. For example, the chords from the C major scale will be all mapped close to the tonal center of the C major scale. Finally, the magnitude of 
T, which is the distance of each TIV from the origin, can be used as a measure of 
dissonance [
31]. Note that the TIS is capable of comparing pitch configurations with different number of pitches, such as chords with a different number of notes and even scales in the same space and, at the same time, unlike Lerdahl’s tonal pitch space [
22] or Chew’s spiral [
47].
In the TIS, two fundamental measures are used to compare two tonal interval vectors 
 and 
, namely the Euclidean distance and the dot product. The Euclidean distance 
 is computed as
        
        where 
 is the dimension of each TIV and 
k is each component of the vectors 
T. The dot product between 
 and 
 is calculated as
        
        where 
, 
 denotes the magnitude of 
 computed as the Euclidean distance 
 between 
 and the center of the space 
 using Equation (
2). Finally, the angle 
 is calculated from Equation (
3) as
        
Bernardes et al. [
31] showed that Euclidean distances and angles calculated with Equation (
2) and Equation (
4) reflect music theoretical and perceptual properties of the relationship between 
 and 
. Although both metrics can be applied to pitch configurations on the same level (e.g., the distance between two chords) or across levels (e.g., the distance between a chord and a key), there are slight differences between them. The Euclidean distance from the center of the space and across pitch configurations has been shown to capture consonance and perceptual similarity, respectively. The arc-cosine, i.e., angles, across pitch configurations has been shown to capture the “alignment” between two pitch configurations, such as the best TIV translation or transposition of a given note or chord to a key. The latter measure is vital to objectively measure the extent to which a chord fits in the diatonic set (i.e., scale) of a key. The measures are extensively tested and validated in previous articles, such as Bernardes et al. [
38], Bernardes et al. [
31], and Navarro et al. [
48].
Next, 
Section 3.2, 
Section 3.3, 
Section 3.4 and 
Section 3.5 present the details of the calculation of the four indicators adopted in Equation (
1), followed by a description of the hierarchical tree construction in 
Section 3.6, and the description on the linear combination of the four indicators and their respective contribution to the indicator of instantaneous tonal tension in 
M in 
Section 4.
  3.2. Tonal Distances
The tonal distances of chord 
 capture three different distances: the perceptual distance 
 from the previous chord 
, the distance 
 from the key 
, and distance 
 from the chord tonal function 
 respectively as
        
        where 
 uses Equation (
2) and 
 uses Equation (
4), so 
 measures the degree of membership of 
 to the main key of the full chord progression and 
 measures the similarity to the tonal function.
In the TIS, chords perceptually close to the key (e.g., its set of diatonic chords) have smaller distances from the key [
31]. 
 is represented in the TIS by the set of diatonic pitch classes, or, in other words, its corresponding diatonic scale. To measure the proximity of one chord to its tonal function in the TIS, we compute the chord function by finding the smallest angle calculated with Equation (
4) from the chord 
 and three vectors 
 representing the diatonic chords of the tonic I, subdominant IV, or dominant V. 
 is the chord using 
 as reference. 
 is the tonal interval vector resulting from computing the DFT from the diatonic chords that best represent the harmonic functions of tonic (we used I degree), subdominant (we used IV degree), and dominant (we used V degree), also referenced by 
.
  3.3. Dissonance
In the TIS, the magnitude 
 measures the tonal dissonance 
c of the chord 
. If 
c results in small values of 
, the chord is very dissonant. The minimum dissonance 
c value corresponds to a single pitch class and the maximum to a pitch configuration with all classes (i.e., all 12 chromatic notes). Within this range, the dissonance 
c of chords results from the contribution of all intervallic dimensions of the space. Refer to Bernardes et al. [
31] for a comprehensive description of the algorithm.
  3.4. Voice Leading
Equation (
8) computes the voice leading 
m of chord 
 from the previous chord 
. We consider two properties to capture voice leading between chords, namely the stability between the notes and the number of semitones between the notes. Then, 
m is calculated as
        
        where 
V is the number of voices, 
s is the number of semitones between the 
l-note of the inspected chord 
 and its corresponding 
l-note in the previous chord 
. To compute the stability of the notes from a chord in a progression, we measure the perceptual distance 
 between notes 
 and 
.
  3.5. Hierarchical Tension
Equation (
9) computes the hierarchical tension 
h following the conceptual basis of Lerdahl [
22]. It accounts for the distance from neighboring chords in the hierarchical tree structure of the progression 
P as
        
        where 
N is the length of the path between the chord node and the root, 
 is the parent chord of 
, and 
 is the parent chord of 
 in the hierarchical tree structure of the chord progression 
P, and 
 is the Euclidean distance between two chords computed with Equation (
2). The computation of the hierarchical tension depends on the hierarchical structure (the tree) associated with the chord progression with the specific parent chords of the tree, as is shown in 
Figure 1. The next section explains in detail how the tree and the parent nodes are calculated.
  3.6. Computing Hierarchical Trees from Chord Progressions
Rohrmeier [
27] proposed a generative grammar to automatically generate a 
chord progression from a starting node. In this work, we adopt Rohrmeier’s grammatical rules to invert the process and generate a 
hierarchical tree structure given a chord progression 
P. Rohrmeier’s rules can be automated and the resulting tree can be used to compute neighboring hierarchical relations in Equation (
9), unlike the procedure proposed by Lerdahl [
1]. The trees that Lerdahl’s uses to compute the tonal tension of a progression are manually constructed, as they are based on the particularities presented in each chord progression and the musical experience of the authors. Therefore, they do not propose general rules that can be computed to generate trees for every chord progression, unlike Rohrmeier’s proposal.
Rohrmeier’s grammar has been validated [
27] and used in previous literature to construct hierarchies and analyze tonal music [
49,
50,
51]. Lerdahl et al. [
51] compared the trees created manually against the trees generated by Rohrmeier grammar, and the results suggested that Rohrmeier can capture the idiosyncrasies of musical properties quite well. Due to the success of these previous experiments, we considered Rohrmeier’s proposal as a successful approach and construct tree structures following his instructions.
According to Rohrmeier [
27], the rules in Equation (
10a) to Equation (
10g) characterize the core behavior of a harmonic functional grammar in a chord progression. 
Non-terminal symbols TR, SR, DR, and XR represent the tonic, subdominant, dominant regions and any of the previous regions, respectively. Non-terminal symbols can be expanded into different harmonic regions following the rules in Equation (
10a) to Equation (
10d). 
Terminal symbols t, 
d, and 
s represent tonic, dominant, or subdominant chords in the sequence, respectively. Terminal symbols can be instantiated by applying Equation (
10e) to Equation (
10g) and cannot be further expanded.
        
To compute a hierarchical tree from a chord progression 
P, we start by replacing each chord by its respective harmonic function 
t, 
s or 
d, computed as the minimal distance to their respective diatonic triads in the TIS using Equation (
4). Then, we recursively apply the grammar rules in Equation (
10a) to Equation (
10g) according to the three following criteria:
- To avoid unfeasible trees, first select the ‘s’ element that appears in the lowest number of rules. 
- To avoid unfeasible trees, build the tree from the inner leaves and to connect them with the outer ones. 
- To avoid trees with too many nodes, always apply a rule which implies a greater number of elements. If several rules respect this criterion, select one randomly. 
To clarify this process, a simple hierarchical tree construction is illustrated in 
Figure 2. In the first stage, the chords are replaced by ‘t’, ‘s’ or ‘d’, according to their alignment with the harmonic functions in the TIS. Secondly, the ‘s’ element can be connected to the ‘d’ element by application of Rule (
10c) (
Figure 2c). Next, either the first or the last ‘t’ element can be connected. The last ‘t’ element is selected at random and Rule (
10b) is inversely applied. Finally, the first ‘t’ is connected with Rule (
10e) and Rule (
10d) (
Figure 2d).
At this stage, we have constructed a tree where the leaves are the specific tonal functions of each chord in the progression and the rest represent tonal regions (
Figure 2e). However, to compute the hierarchical tension of a chord, we need to calculate the distances between the chord and all the chords in the progression that can hierarchically influence the tension of this chord. Therefore, all the chords should be ordered and placed accordingly in the tree structure, as illustrated in 
Figure 1.
According to Lerdahl’s proposal, in a tree structure, a chord can become a parent chord when it is the most stable chord among the children. This stability is related to the perceptual distance of the chord and the tonal function, and the perceptual distance between the children chord involved. Following this concept proposed by Lerdahl, we based the stability on two premises:
We iteratively replace each parent node of the tree with the least tense chord selected from its children. Thus, starting from the deepest levels of the tree (i.e., the root), each parent node is replaced and we move up to the the next level until the top level is reached (i.e., the level with no nodes, only leaves). The parent nodes with only one child are automatically replaced by the leaf chord. The parents whose children represent different tonal regions are automatically replaced by the child with the same tonal function or region as the parent to maintain the same tonal region. Finally, the node with children of the same tonal regions is replaced by the chord that gets a lower value according to its tonal distance.
A simple example considering the tree of 
Figure 2 is illustrated in 
Figure 3. 
Figure 3a represents the chord progression and the tree associated. 
Figure 3b replaces the terminal by the specific chords (G, Am, D7, and Em). 
Figure 3c replaces the 
 and 
 nodes with the only child they have. 
Figure 3d replaces the 
 (the dominant region) with the dominant chord D7, as Am represents a subdominant function. Similarly, 
Figure 3e replaces 
 with the tonic chord Em and not the dominant chord D7. To replace the last node, we might select either Em or G, as both are tonic chords. However, as 
Figure 3f shows, G is selected as the parent node because it is the least tense chord according to the tonal distance proposed in 
Section 3.2.
  5. Conclusions and Future Work
In this work, we proposed a model that captures the 
tonal tension profile of a chord progression by encoding the chords in the Tonal Interval Space (TIS) [
31]. We measure the 
instantaneous tonal tension of each chord considering factors such as dissonance, the tonal distances to measure the similarity with the previous chord, with the tonal function and with the key, voice leading, and the hierarchical tension of the chord with respect to the rest of the progression. To create the tonal tension profile, we connect the values associated with each individual chord and create a curve that represents the tonal tension profile of the chord progression. We considered the TIS as an appropriate representation for multiple reasons. Firstly, multi-level pitch configurations can be represented, such as chord with different number of notes, or keys. Therefore, we can handle chord progressions written in different keys or different modes. Additionally, topological properties of the space reflect musical properties of the tonal system. Consequently, we have constructed a flexible computational analysis framework which allows for excluding annotations on the analyzed musical data by encoding pitch configurations and capturing their musical properties systematically.
The evaluation of a chord progression depends on the musical context. This is influenced by both linear properties, related to how the progression is changing along the time and hierarchical properties, related to the structure of the whole progression. Our measure is appropriate, as it captures linear properties of the chord progressions, namely, dissonance, the tonal distances and the voice leading; and the hierarchical properties, such as the hierarchical tension of the chord with respect to the rest of the progression.
In order to evaluate how our measure predicts the tonal tension profile of a chord progression, we performed two experiments. For both, we used 12 chord progressions in different keys, with different trees (i.e., hierarchical structures), and different tonal tension profiles. Experiment 1 aimed to evaluate if our model captures the instantaneous tonal tension profile while the participants are listening to a chord progression, and which musical properties are essential to predict the tension of a progression. Experiment 2 was intended to evaluate if our measure captures the general perception of the tonal tension profile after the participants listened to a chord progression. In both cases, statistical analysis showed that the subjective ratings correlate with the objective values, validating the objective model as a proxy for the subjective measure of 
instantaneous and 
global tonal tension profile of the chord progressions proposed in the experiments. In both experiments, we also compared the results obtained with present model and with Lerdahl’s [
22] and Herremans’ models [
12]. Despite the positive results for all the tonal tension models, in most cases, the model 
M proposed was a better indicator of tonal tension profiles.
Finally, it is important to highlight that both global and instantaneous tonal tension are high-level properties of music, so tonal tension can be musically perceived and subjectively rated differently. Despite the limitations, one of the contributions of this model is the beginning of a generalization function that can study a long-term chord progression based on both hierarchical and linear structures in different tonalities. The adaptation of Lerdahl’s space to the TIS to explain the tonal properties of a chord besides tonal tension and voice leading can also be considered a contribution for the community. The positive results are encouraging to apply this measure to real analysis problems and computational generative systems.
Future work will study how to extend this analysis and extrapolate the model to any chord progression that has been composed following the tonal music standards. We will also analyze properties like pitch height and rhythm, which also influences the tension perception. Additionally, we will also investigate how to improve a system to generate chord progressions using an artificial immune system described in a previous work [
32] with this new measure to create chord progressions that follow a given pattern, like a tension curve or a given structural tree.