Abstract
The entropy of the partition generated by an n-tone music scale is proposed to quantify its regularity. The normalized entropy relative to a regular partition and its complementary, here referred to as the bias, allow us to analyze various conditions of similarity between an arbitrary scale and a regular scale. Interesting particular cases are scales with limited bias because their tones are distributed along specific interval fractions of a regular partition. The most typical case in music concerns partitions associated with well-formed scales generated by a single tone h. These scales are maximal even sets that combine two elementary intervals. Then, the normalized entropy depends on each number of intervals as well as their relative size. When well-formed scales are refined, several nested families stand out with increasing regularity. It is proven that a scale of minimal bias, i.e., with less bias than those with fewer tones, is always a best rational approximation of .
Keywords:
metric entropy; convex sets; convex functions; continued fractions; best rational approximation; maximal even sets; well-formed scales; Pythagorean tuning MSC:
94A17; 11A55; 39B62; 52A20; 11A07
1. Introduction
An n-tone music scale determines a partition of the octave in n intervals. Regarding their regularity, scales can be of equal divisions of the octave, i.e., with n tones of equal temperament (n-TET scale), where discretization precision increases as the number of tones grows, or that of different divisions, where precision increases in two ways: as the size of the intervals decreases and as their regularity increases. In particular, regarding well-formed scales of one generator, hereinafter referred to as cyclic scales, if they are non-degenerate (of equal temperament), they are formed from two elementary intervals [1,2], one longer than the interval of an n-TET scale and the other of shorter width. These scales also fulfill the condition of maximum evenness [3,4]; that is, they present the most even distribution, which does not depend on the relative size of the intervals. In many cases, as the number of tones of a cyclic scale increases, the regularity of the intervals decreases, still satisfying the condition of a maximal even set. Therefore, for cyclic scales, it is not obvious how to quantify precision or the regularity of the partition as the number of tones increases. The present work had the purpose of analyzing this using partition entropy, as well as studying the bias from regular temperament in more general cases.
The regularity of the intervals of can be quantified in several ways. A fairly common way is from their standard deviation. In the octave, for the intervals between consecutive tones, when they are considered a discrete random variable of equal probability such that , have expected value , i.e., the size of the elementary interval of the n-TET scale. A measure of dispersion of these values about the mean is the standard deviation , defined from its square, the variance, . In this way, it is possible to compare different n-tone scales in terms of the average dispersion of their intervals relative to the n-TET scale. However, this procedure does not present interesting properties to deal with, for instance, successive refinements of cyclic scales.
Another way to measure the regularity of may be based on the quadratic sum of its intervals. We consider this case in a more general way. Let us assume that is a Lebesgue-measurable space such that and is a finite measurable partition of ; that is, and if . (Equalities that involve measures are understood to be true except in a null set; that is, we should strictly write , but we will not write this to simplify the notation.) Then, the sum of squares for the partition relative to the measure is . The function is a convex function of , which, when constrained to , has an absolute minimum for , . In this case, is a decreasing function of n. When the partition is refined, that is, if is another partition of the octave, then its sum is a refinement of them. In order to make successive refinements, it would be desirable to be able to express with a function F that is a linear combination of squares of , i.e., also a sum of squares of the refinement. But this is not possible because in F there will necessarily appear cross products.
Fortunately, the notions of regularity and fineness of a partition have, in several fields, a well-known way of being quantified, which is the partition entropy. The concept of entropy was introduced by Clasius in thermodynamics (in 1850), Boltzmann applied it to statistical mechanics (1877), Planck related it to probability theory (1906), Shannon applied it to information theory (1948), Jaynes used it as a measure of uncertainty (1957), and Kolmogorov extended it to deterministic systems (1958).
Certainly, entropy has been used in music almost from the beginning of information theory [5,6,7] by considering the musical language as a source that produces a sequence of symbols representing musical tones and by associating them with certain probabilities according to their frequency of appearance. In particular, a wide range of works have used entropy to identify music styles (e.g., [8,9,10,11]) although results may vary depending on the preanalytical assumptions made in order to treat the data (scale degrees, pitch class, octave equivalence, weighting by duration, key-signature dependency, modal bias, etc.). With the same purpose, cross-entropy (e.g., [12]) has been used to quantify stylistic similarity between two sequences [13]. An up-to-date review on entropy and other physical parameters applied to music is provided by Gündüz [14].
Nevertheless, it seems that entropy as a measure of regularity of music scales has been neglected. There is a significant difference between discrete state spaces and continuous state spaces. In most cases, entropy is considered in terms of information theory, where the natural application is to symbolic frequency analysis. Although Shannon did suggest a generalization to the continuum through infinitesimal equal measure states, becoming an integral form of his discrete theory, partition entropy measures an inverse density rather than a frequency over equal measure.
Therefore, in the current paper, it is proposed to use the normalized entropy as a measure of the regularity. Its application is illustrated in two cases: first, to study the similarity between an arbitrary scale and an n-TET scale; and second, to study the regularity of cyclic scales generated by a tone h and the relationship between their bias and the rational approximation of that they provide.
2. Partition Entropy
Following the notation and terminology of Arnold and Avez [15], the entropy of the partition is defined from a concave function , as shown below. (The entropy of the partition is also called metric entropy, and it consists of changing the metric of the theory of probabilities, i.e., the probability, by a generic metric, where the concepts relative to the partitions remain.)
The following relationship is fulfilled,
Let us remember that if the function of a random variable is concave, the expected value satisfies (Jensen’s inequality).
Therefore, for a partition of n elements, the value is the maximum value that the entropy can reach, and it is reached when the elements of the partition are of equal measure.
Thus, is the weighted average of the values (in probabilities, it would be the information associated with the measure of the random variable , i.e., the expected value of the information content or self-information of the variable ). The smaller , the greater the above logarithm; so, this average gives an idea of how refined the partition is.
Shannon (1948) showed that the entropy defined in this way is the only function that, except for a multiplicative constant, satisfies the following postulates:
- 1.
- is a continuous function of .
- 2.
- If , then is an increasing function of n.
Let us consider partitions and and a refinement of both, . The conditional entropy of relative to is defined as
where is the conditional measure of relative to .
- 3.
- The following sub-additivity property holds,(In general, for three measurable partitions, and , then .)
- 4.
- If is an automorfism of , then and .
Therefore, the partition entropy seems like an appropriate parameter to measure how much a scale of n arbitrary tones differs from an n-TET scale. It is also the parameter that provides us with an estimate of how refined a scale is, because the entropy of a scale of n tones will always be greater than that of a subscale of tones, and less than or equal to that of an n-TET scale. This will be useful when dealing with cyclic scales. In addition, as the intervals of a partition are subdivided, the entropy is additive with regard to the intervals being refined, which saves calculations.
3. Cyclic Scales
The computation of entropy for cyclic scales requires a brief review of their properties, which hereinafter are summarized by following Cubarsi [16,17].
A ratio of 2 between frequencies corresponds to the range of one octave. For any frequency ratio , the values , , define one equivalence class. The set is a commutative group for multiplication. The set of all the octaves of the fundamental frequency ratio () is a monogenous subgroup of of an infinite cardinal, . The frequency classes are the elements of the quotient group , also commutative for multiplication. For each equivalence class, we choose a representative in (with identified extremes) as the reference octave, which we identify with . A finite set of these representatives will be referred to as scale tones.
An n-tone cyclic scale is a scale of one generator, a real positive value h. The scale tones satisfy the symmetry condition, consisting of displaying several degrees of rotational symmetry that are equivalent to the closure condition [18,19,20], although such an equivalence does not hold for scales with more than two generators [21,22]. The partition of the octave induced by the scale notes has exactly two sizes of scale steps, and each number of generic intervals occurs in two different sizes, which is known as Myhill’s property [1,2]. For , we get the particular case of the 12-tone Pythagorean scale, generated by fifths, as well as those listed in Table 1. In general, for h other than a rational power of 2 (which would lead to degenerate cases of equal temperament scales) we would obtain generalized Pythagorean scales.
Table 1.
Properties of cyclic scales: GRA = Good rational approximation (accurate scale); RID = Regular interval distributed; BRA = Best rational approximation (optimal scale); MB = Minimal bias; I|Xn = maximum distance to a note of the n-TET scale.
The scale tones are ; ; with (floor function). When the scale tones are ordered from lowest to highest pitch in (say, in cyclic order or by ordinal), we find two extreme tones—the minimum tone and the maximum tone —which determine the two elementary factors (up the fundamental) and (down the fundamental) associated with the generic widths of the step interval such that . The indices satisfy , which are all coprime.
The tone , which does not belong to the scale , provides the closure condition (either or ) determining the n-order comma (in the frequency space), i.e., the error in closing the scale near the fundamental with no other scale tones between them. The comma itself does not provide information about whether closes above or below the fundamental. In using the index , two parameters provide this information: on the one hand, the scale closure, , which is a value close to 1 satisfying ; and on the other hand, the scale digit , taking values 0 or 1. Then, () or (). The value measures the distance from to 1. (The distance between two scale tones , is measured as .)
Figure 1 (left and central panels) displays how the 12-tone cyclic scale for (Pythagorean scale) is formed. The h iterates of indices and are the extreme tones, while provides the closure, since there is no other tone with a frequency that is between and the fundamental. The ratio between consecutive iterates is either or , except for the last iterate (the wolf fifth), which compensates for the comma. When the scale tones are arranged in pitch order, i.e., by ordinals, their ratios are either or . Such a distinction is much clearer under the following alternative approach.
Figure 1.
(Left) Tones (frequencies ) of the cyclic scale for h = 3 and n = 12 in in order of iterates (blue) and in pitch order (green). The red dot is assumed as closing the scale. (Center) Ratios between iterates (blue) and consecutive scale tones (green). (Right) Scale intervals in cents () along the circle of the octave clockwise direction (number in white for the first one) with two interval sizes.
Each tone in the frequency space is associated with a note or pitch class in the octave , so that the above quantities have the corresponding ones in . Thus, the elementary intervals and generate the partition of the octave in n intervals, with , satisfying , with interval closure and interval comma . Sometimes, it is more useful to work in the multiplicative space of tones, and sometimes, in the additive space of notes. In the latter case, a frequency is usually expressed by musicians as in cents (¢), so that each semitone of the 12-TET scale is divided into 100 parts. Figure 1 (right panel) displays how the intervals between the 12 notes of the Pythagorean scale are distributed along the circle of the octave (clockwise direction). All of the intervals, even the last one, are either or .
The fraction is associated with the convergent and semi-convergent continued fraction expansions of [23,24,25]. Among cyclic scales, two categories may be pointed out. The first category is formed by optimal scales, associated with the best closure , corresponding to the best rational approximations, i.e., the convergents of its canonical continued fraction expansions from both sides. The second category, which we shall name accurate scales, is associated with the best estimations of the generator tone , corresponding to the good rational approximations of . Apart from accurate scales, which include optimal scales, there are cyclic scales not associated with good or best rational approximations, still corresponding to semiconvergents.
Notice that the terms “good” and “best” rational approximations [26] are equivalent to the best approximation “of the first kind” and “of the second kind”, respectively [27]. The conditions mean the following.
A one-sided best approximation of occurs when and .
A one-sided best approximation of occurs when and .
A best rational approximation satisfies and .
A one-sided good approximation of occurs when and .
A one-sided good approximation of occurs when and .
A good rational approximation satisfies and .
For all these scales, the values , and have bounds according to Appendix A. In particular, quantifies the error of the rational approximation of . These bounds determine the interval between a note of the cyclic scale and the one with the same ordinal in the n-TET scale, which is analyzed in Appendix B.
Finally, the family of cyclic scales follow a chain, , so that in starting from the indices of , the same values for the next scale are (i) and (ii) (see Table 1).
4. Entropy of a Cyclic Scale
For a cyclic scale , with the measure defined for tone intervals as for , consider the partition of the octave composed of M intervals of width u and m intervals of width d, regardless of the order in which the intervals follow each other. The partition entropy is
We will explicitly write it in terms of the indices of the minimum and maximum tones m and M. In order to do so, we define the values
where is the mantissa of , that is, , such that the elemental intervals u and d can be expressed in terms of the indices of the minimum and maximum tones as
In this way, Equation (4) becomes
Let us see how to express the entropy of the cyclic scale , the one following in the chain of cyclic scales. The scale refinement process is as follows. The partition breaks, so that the major interval splits into two, one of the same size as the minor one plus a remainder. This residual is only smaller than the size of the smaller interval when the scale is optimal. This process is iterated.
As explained in Section 3, we must distinguish two cases, depending on whether is greater or less than 1, or equivalently, if equals 0 or 1:
- (i)
- , , , . In this case, and .
The refinement is performed in the M intervals of size u, and the m intervals of size d are maintained:
Since and , it holds that . Thus, we express the entropy according to Equation (3), with the new partition :
In terms of the indices of the extreme tones, in noting , it can be written as
- (ii)
- , , , . In this case, and .
The refinement is performed in the m intervals of size d, and the M intervals of size u are maintained:
The entropy of the new partition is
In terms of the indices of the extreme tones, they are equivalent to
Figure 2 displays how entropy increases in terms of the number of tones of a Pythagorean scale. The left panel refers to the scales with a lower number of tones, while the right panel shows the larger trend (in logarithmic scale), by making explicit the values of .
Figure 2.
Small and large scale trends of the entropy for cyclic scales with h = 3.
5. Partition Modulation
According to Equation (A2), we write the elementary intervals of a cyclic scale as
We have referred to one degenerate case of a cyclic scale, the limiting case when the two intervals are one, . In this case, and , the n-TET scale. This is equivalent to in the expressions of Equation (9). However, we should consider two more degenerate cases. The larger is one interval, the smaller is the other, always filling a full octave, . Therefore, if , then the cyclic scale becomes an equal temperament scale of M tones, and if , the scale becomes an equal temperament scale of m tones. In other words, one of the following cases applies:
- (i)
- According to Appendix A, for , by Equation (A5), . If , then . In this case, the intervals of the cyclic scale satisfy . Since , then when , the scale is not non-degenerate anymore and becomes an M-TET scale, with entropyObviously, an optimal cyclic scale is far from this situation, because it satisfies .
- (ii)
- Also, according to Appendix A, for , owing to Equation (A8), . If , then . In this case, the intervals of the cyclic scale satisfy . Then, , so that the scale becomes a degenerate m-TET scale with entropyAn optimal cyclic scale case is also far from this situation, since it satisfies .
Lemma 1.
The scales formed from the two elementary intervals and with values generate an infinite and continuous family of n-tone cyclic scales , which are neighbors of , such that and . In addition, if , the scales are optimal.
These results are immediate consequence of the bounds obtained in Appendix A.
5.1. Modulating Temperament Scales
The most usual case of cyclic scale is the one associated with a generator corresponding to a harmonic of the fundamental tone. Then, the tonal class of the generator is , which can be written as , where is the ordinal of the scale tone that better approximates g [17], known as the chromatic length of the pitch class of the generator. The value also determines the indices m and M of the extreme tones, corresponding to the ordinals 1 and of the scale notes. Since as , m is the positive integer, so that .
For fixed n (and also m and M), the family of cyclic scales has generators that, in general, are not the tonal class of any harmonic; that is, they are real scales. In this case, we write them as so that, for , we obtain a partition modulation, which is a continuum of irregular temperaments [28,29] close to and , also corresponding to cyclic scales. Hence, the n-tone cyclic scales are a family of modulating temperament scales around the generator , namely, .
For example, for and , the value gives rise to the quarter-comma meantone temperament, which fits the class of the fifth harmonic well, in exchange for decreasing by 5¢ in the accuracy of the third one. Since the value of is lower than , it results in an optimal scale.
5.2. Entropy in Terms of the Closure
Let us consider an n-tone modulating the temperament scale . According to Equations (4) and (9), we write the entropy of as
In using the relationship , it is immediate to see that
that is,
By deriving Equation (10) with respect to , we obtain
If , the two terms of the previous equation are positive and the derivative is negative; therefore, in this interval, the entropy is a decreasing function. If , both terms in the above equation are negative and the derivative is positive, so in this interval, the entropy is an increasing function. At , there is a local maximum, since if and only if . Thus, in terms of is a concave function, since . Therefore, the following apply:
- (a)
- If , then .
- (b)
- If , then .
In addition, it is straightforward to see the following properties:
- (c)
- If , for and for .
- (d)
- If , for and for .
- (e)
- for .
Properties (c) and (d) refer to the slight asymmetry of the entropy far from , which, as we shall see in the last section, is negligible around . Property (e) means that a cyclic scale generated by h and its inverse scale, generated counter-clockwise with swapped indices of the extreme tones, although they have different notes that differ in one comma, have the same entropy.
Therefore, the entropy of any n-tone modulating temperament scale in the family only depends on the closure.
6. Normalized Entropy
Since the entropy increases as the partition is refined, to be able to compare qualities of different scales, whether they are cyclic or not, we will consider the normalized entropy, i.e., relative to the maximum value that it can reach, as that of the equally tempered scale. For an n-tone scale , it is defined as
such that tends to 1 as approaches an n-TET scale. This ratio has also been called efficiency and relative entropy [30]. For a cyclic scale , we will write .
Let us see in which case the normalized entropy increases when a scale is being refined. Let us assume two scales, not necessarily cyclic: with (i.e., is a refinement of ).
Lemma 2.
The normalized entropy increases if and only if the relative increment of entropy in refining the partition is greater than the relative increment of the entropy of the corresponding equal temperament scales.
Proof.
By the sub-additivity property, . It will be if and only if , so . Hence,
□
The deviation of an n-tone scale , not necessarily cyclic, relative to the regular n-TET scale will be measured from the complementary of the normalized entropy, which we will call bias:
7. Bias of a Cyclic Scale
For a non-degenerate n-tone cyclic scale , the bias depends on the divergence of its elementary intervals u and d with regard to the elementary interval of the n-TET scale. Thus, in bearing in mind Equation (10),
The graph of the function is the mirror image up-to-down of , scaled by the factor ; hence, it is convex (Figure 3). The value is its minimum in the interval , and at the extremes (corresponding to degenerate scales), it takes the values and . Therefore, in the interval , is a decreasing function, and in , it is an increasing function of .
Figure 3.
Graphs of normalized entropy and bias.
Note that this behavior of in terms of holds when n is fixed. In other words, . Then, although for another -tone cyclic scale, , the bias has a similar behavior with respect to , we cannot assure that if , then , since this also depends on the values of and , such that .
Figure 4 shows the trends of the bias and the interval comma for Pythagorean scales (generated by ) in terms of n. In general, optimal scales have low bias, but the bias not always decreases as a scale is refined. For example, the first optimal scales are those of , and the bias decreases for , but the scale of has greater bias than that of , and the one of has greater bias than those of . Also notice that there are intervals where decreases but increases.
Figure 4.
Behavior of (red) and (blue) in terms of the number of tones n (bilogarithmic scales). In gray is maximum bias for several similarity levels to n-TET scales.
We will say that a cyclic scale is of minimal bias (MB) if for any cyclic scale with , . Obviously, it is tantamount to say that ; hence, the scale has greater normalized entropy than the previous ones. Not all optimal cyclic scales are MB.
Before a deeper analysis, in an approximate way, we may estimate how close an MB scale and an equal temperament scale of the same number of tones are. We may use the criterion for which a cyclic scale (not necessarily optimal) unambiguously approximates an n-TET scale if every note of the former is at a distance equal or less than half an elementary interval from the latter. According to Equation (A12) with , this leads to the condition (condition I|2n).
This condition can be satisfied by optimal and non-optimal cyclic scales. For example, for , condition I|2n is holds although the scale is not optimal. On the contrary, are optimal scales but the condition I|2n is not met.
Let us see what happens with the MB condition. In this case, for up to , all scales that satisfy MB also satisfy I|2n, but there are scales that are I|2n and not MB scales, for example, for . Hence, the condition that a note of the cyclic scale is closer than half an interval of a note of a tempered scale is weaker than the MB condition. On the contrary, if we further restrict the above condition, for example, the respective notes are at most at a third interval, i.e., (condition I|3n), then for up to , all I|3n scales are MB scales. In Table 1, these and other properties are displayed.
8. Scales with Limited Bias
8.1. Scale Distributed within Regular Intervals
We study several families of scales for which it is possible to estimate a priori the lower limit of the entropy.
We say that two n-tone scales and sharing the fundamental tone alternate if their notes in cyclic order— and , where —fulfill one of the following conditions:
If is the equal temperament scale , then we write and say that is regular interval distributed (RID). In this case, the alternation can also be defined from the following relations involving intervals between tones,
(The interval between two tones in is .)
Note that these conditions are generally more restrictive than the condition
Obviously, Equation (16) implies the condition of Equation (17), although, as seen in Appendix B, for cyclic scales, they are equivalent.
For example, suppose case (a). If , then the interval between two consecutive notes of satisfies , so that . Instead, if , then . In a similar way, we would reason case (b).
Lemma 3.
The interval between two consecutive notes of an n-tone RID scale is lower than .
8.2. Scale r-Similar to n-TET
A criterion for measuring the proximity between scales is similarity [31]. Two n-tone scales T and are similar at level (r-similar) if for each tone , such that and .
For example, a cyclic scale and the n-TET scale have a level of similarity if and only if any pair of notes with ordinal j, i.e., and , satisfy . This condition is equivalent to the condition of Equation (A12) with , .
However, for the current purpose of evaluating and comparing entropy partitions, we will slightly modify such a concept by assuming that the fundamental is shared by both scales. Then, we say a scale is similar to the n-TET scale at level r, with , if their tones satisfy
8.3. Scale One-Side r-Similar to n-TET
We introduce a concept that will mix the one of similarity with the one of distribution within regular intervals, which will be appropriate for studying the bounds of the entropy for cyclic scales.
Let be the tones of the n-TET scale. We say that a scale is one-side r-similar at level r (r-OSS) to the n-TET scale, with , if
It is OSS by the right (a) or by the left (b), respectively. Therefore, when it matches the definition of an RID scale.
9. Entropy of a Scale -OSS to -TET
Consider an n-tone scale , not necessarily cyclic, which is -OSS to n-TET for . We already know that the maximum entropy is . Let us see the minimum entropy it can reach. We assume case (a) of Equation (19) and consider the n notes () of in cyclic order in , since the entropy does not change if we add a null set. We also extend the range of the possible variation in the intervals between notes at the extremes to include the limiting degenerate cases. The points determining the division of the octave are
By writing and assuming , the respective intervals are
Then, the scale generates the partition , , with entropy
The entropy is a concave and differentiable function , , with , a hypercube, which is a convex and compact space. Then, f has a local and global maximum at , corresponding to an n-TET scale with and , and the global minimum is reached at one or some of the vertices of V. These vertices are determined using the possible values , , and the resulting intervals can only take the following values: interval , values and ; interval , values and ; and the intermediate intervals, values , and .
The octave is covered by a number of different intervals satisfying
of which A in number have width , B width , and C width , so that . Therefore,
(Some of these intervals may have width zero, giving rise to a degenerate scale with less than n non-null intervals.)
The entropy at one of these vertices can be written in terms of the respective number of intervals as
where z is the concave function defined in Equation (1), also extended for values , where . Among possible configurations, we look for the minimum value that f can take, which will correspond to the minimum value of g. Notice that this quantity is added to the entropy of an n-TET scale, as it is consistent with the fact that the B intervals of width are not involved in the expression. However, the intervals in number C, of greater width than , contribute to decreasing the entropy, while those in number A, of width lower than , contribute to increasing it, since the corresponding function z evaluated in values less than 1 is positive.
If n is even, the maximum value for C, by assuming , is . Hence, the number of intervals of null-size is also .
The function with is defined for , where it satisfies , always increasing from until when . Then, , which correspond to a degenerate scale of non-null intervals. Therefore, if , the entropy satisfies .
If n is odd, the maximum value for C and A is , with . Then, , which correspond to a degenerate scale of non-null intervals. Therefore, if , the entropy satisfies .
Case (b) is similar. It would be reasoned by considering that , i.e., it is equivalent to case (a), but following the intervals from right to left starting at 1.
Theorem 1.
The entropy and bias of a scale -OSS to n-TET satisfy
We explicitly write two cases. For , since , the entropy of an RID scale satisfies, if , , and if , it satisfies .
For , since , the entropy of a scale -OSS to n-TET satisfies, if , and, if , .
Levels for are displayed in Figure 4.
10. Entropy of a Scale -Similar to -TET
Let us calculate the minimum entropy that can reach an n-tone scale , not necessarily cyclic, with a similarity level with the n-TET scale for . According to the previous notation and considerations, the points that determine the division of the octave are
The respective intervals are , by assuming . As before, we write the entropy of the partition , , as , where , with , a convex and compact space. The local and global maximum of f takes place at , corresponding to the scale of n-TET, with and . The global minimum is reached at one or some of the vertices of V. These vertices are determined using the possible values , , and the resulting intervals can only take the following values: the values and of the extreme intervals and , while the intermediate intervals can have values , , and . The octave is covered by a number of different intervals satisfying
of which A in number have width , B width , C width , D width , and E width , so that . Therefore,
The entropy at one of these vertices can be written in terms of the respective number of intervals:
Once again, the intervals of width are not involved in this expression. The intervals in numbers D and E, of greater width than , contribute to decreasing the entropy, while those who are there in numbers A and B, of width lower than , contribute to increasing it.
The minimum value of g is obtained using the highest possible values of E and D (in that order) and minimum of A and B.
If n is even, at most there can be intermediate intervals of width . Through the substitution of this value into Equation (28), we obtain , so that and . Hence, the minimum g is , corresponding to a degenerate scale of non-null intervals. Therefore, if , the entropy satisfies .
If n is odd, we should examine two alternatives. Firstly, if we assume that there are intermediate intervals of width , through substitution into Equation (28), we obtain . Possible interval values are . In this case, the minimum value of g is provided by the first triad, i.e., , , , which yields , corresponding to a degenerate scale of non-null intervals. In this case,
As the second alternative, if there are intermediate intervals of width , holds. Possible interval values are . The minimum of g is provided by the first triad, so that , , , and , corresponding to a degenerate scale of non-null intervals. Hence,
For , the factor multiplying is a positive value between and . Since the value is negative, the above factor increases it. Hence, the entropy of Equation (29) is lower than the entropy of Equation (30).
Theorem 2.
The entropy and bias of a scale -similar to n-TET satisfy
We explicitly evaluate the case for . Since , if , then , and if , then , which are the same bounds as for an RID scale.
11. Cyclic RID Scales
We may reformulate Theorem A1 according to the above definitions.
Theorem 3.
Optimal cyclic scales are RID scales.
Nevertheless, there are many non-optimal cyclic scales that are also RID scales. Cyclic RID scales correspond to partial convergents of continued fractions where the comma is not as low as for optimal scales, i.e., their closure is not a best approximation, although they belong to a family of relatively good partial convergents. However, although in most cases, a convergent that gives a good approximation (i.e., an accurate scale) generates a RID scale, there are exceptions, such as for . Between the notes of the respective scales and , there is a distance of nearly 1.5 elementary intervals of the equal temperament scale, and between two consecutive notes of the cyclic scale, there may be a distance equivalent to 2.1 regular intervals, which means that within some regular intervals, there are two notes of the cyclic scale.
Therefore, there exist accurate scales that are not RID, and RID scales that are not accurate.
11.1. Comma and Elementary Intervals of an RID Scale
It is immediate to identify an RID scale from its interval comma . As seen in Appendix B, the distance between the tones and for is . So, for all if and only if , that is, the interval comma is limited as .
Lemma 4.
A cyclic scale is RID if and only if .
This condition, which is satisfied by all optimal scales, assures us that the value of is small enough to be far from the previously seen degenerate cases of less than n notes, since for every RID scale, is fulfilled.
From another point of view, while considering RID scales, we are excluding “bad approximations” of that satisfy any of the following: if , then , or, if , then . This fact also has implications on the bounds of the two elementary intervals of cyclic RID scales, We distinguish the following cases:
- (i)
- If , according to Equation (9), . Hence,(We should distinguish two cases: (a) : and ; since , we have and , which does not add any new limitation. (b) : and . Therefore, the extremes still correspond to n-tone scales, by avoiding degenerate cases.)
- (ii)
- If , according to Equation (9), . Hence,(We should distinguish two cases: (a) : and ; hence, and . Once again, the extremes correspond to n-tone scales not degenerating toward scales of fewer tones. (b) : and ; hence, and , which does not add any new limitation.)
In both cases, the size of the greatest elementary interval of a cyclic RID scale is lower than , as stated in Lemma 3.
11.2. Bias of RID Scales
For large values of , that is, or , Equation (14) can have a quite arbitrary and non-symmetrical behavior, but for values , and in particular, for optimal scales with or , depending on the value , the bias behaves as proportional to .
With and , we can approximate the logarithms in the following expressions as
and through substitution into Equation (14), we obtain
Notice that with and with . The same happens with the higher-order terms. Therefore, for enough small , we can use the approximation derived from the following result.
Lemma 5.
The bias of a cyclic RID scale satisfies
11.3. Cyclic Scales of Minimal Bias
As explained in Section 4, while refining a cyclic scale, in each iteration, one of the indices of the extreme tones remains fixed, while the other increases by a value equal to the one that remains fixed. Let us see that in each refinement, the following function appearing in Equation (34) always increases:
Lemma 6.
For , with , and with .
Proof.
Indeed, it suffices to check that the function satisfies and .
Notice that for , the above inequalities hold, and if , then , so that the above results are also valid. □
Corollary 1.
Let us write . If , then .
With this notation, the bias of a cyclic RID scale can be estimated from
Theorem 4.
Every cyclic RID scale of minimal bias is optimal.
Proof.
We prove this by denying the consequent. Assume two cyclic RID scales and such that with interval commas satisfying , i.e., is not optimal. Then, applying the previous corollary to Equation (36), we have
Therefore, if a cyclic RID scale is not optimal, it cannot be MB. □
12. Conclusions
In the current paper, it was proposed to measure the regularity of the intervals of a music scale from their partition entropy. Among other properties, the fact of being a continuous increasing function of n for an n-TET scale, which is always the maximum value that the entropy of any n-tone scale can reach, together with the sub-additivity property, which guarantees that while refining the partition, the entropy always increases, make this parameter very suitable for our purpose. In order to compare scales with different numbers of tones, the entropy relative to the corresponding regular scale is used, which is the normalized entropy, so that their complementary to 1 quantifies the bias relative to the n-TET scale.
The main application of these concepts has been to cyclic scales, and their properties were reviewed and further investigated in the Appendices. Since non-degenerate cyclic scales are maximal even sets [3,4] and their intervals come in two possible sizes [1,2], the remaining properties allowing us to distinguish between scale distributions are the number of intervals of each size and the ratio between them.
Two situations have been analyzed. First, cyclic scales with a fixed number of tones, which is a family of modulating temperament scales around one generator. In this case, the bias only depends on the closure, i.e., the relative size of both elementary intervals. Second, as cyclic scales are refined, the bias also depends on how many intervals of each size there are.
In order to study such a dependency, it was necessary to restrict the scales in two ways. We centered our attention to scales with a lower limit of the entropy, determined using the condition that their notes are distributed along each of the intervals of a regular scale (RID scales). Such a study was conducted in a general way, by calculating the maximum bias of several scales, not necessarily cyclic, with different levels of similarity with an n-TET scale, either from one side or from both sides. Figure 4 displays the similarity levels for cyclic scales. In addition, we considered scales with the comma not exceeding that of an RID scale, since in this case, the dependency between bias and closure is well defined.
We proved that any cyclic scale of minimal bias (MB scale), i.e., with a bias that is lower than that of the cyclic scales of fewer tones, is necessarily optimal, i.e., corresponds to a best rational approximation of .
Notice that the bias, , according to Equations (35) and (36), is proportional to the product of , depending on each number of elementary intervals, and , depending on their relative size, so that accounts for the degrees of freedom allowed for cyclic scales.
Among the optimal scales, it was possible to select the ones for which intervals are distributed along the octave as regularly as possible relative to an equal temperament scale of the same number of tones. Therefore, in relation to the closure, scales can be ordered in nested families, from worst to best, as cyclic, accurate, and optimal scales, whilst in relation to their regularity, they can be ordered in nested families as cyclic, RID, optimal, and MB scales.
Although the current entropy-based measure has been particularly used to deepen the study of cyclic scales, the present work clearly suggests future applications to more general cases, also through using alternative metrics, either based on the distribution of the scale notes or the intervals, such as the recent Boltzmann–Shannon Interaction Entropy [32], allowing us to estimate a normalized entropy from a finite sample of points on a bounded interval.
Funding
This research received no external funding.
Data Availability Statement
Data are contained within the article.
Conflicts of Interest
The author declares no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| BRA | Best rational approximation (optimal scale); |
| GRA | Good rational approximation (accurate scale); |
| I|Xn | Maximum distance to n-TET; |
| MB | Minimal bias; |
| n-TET | n-tones of equal temperament; |
| OSS | One-side similar; |
| RID | Regular interval distributed. |
Appendix A. Bounds for the Elementary Intervals and Closure of Cyclic Scales
The notes of a cyclic scale divide the circle of the octave into n intervals of width or , beginning and ending in the pitch class of the fundamental so that
According to the definitions of U, D, and , these intervals can be referred to the elementary interval of the equal temperament scale as follows:
We may determine their bounds by distinguishing between the cases of being greater or less than 1:
- (i)
- If , then . Depending on the scale, we have the following:
- (a)
- The closure of an optimal cyclic scale satisfies ; hence, . By taking logarithms and taking into account Equation (A2), we obtainThe elementary intervals satisfy
- (b)
- The closure of a non-optimal cyclic scale satisfies ; hence, . Then, in taking logarithms,Therefore,
- (c)
- For an accurate cyclic scale for the above condition, , we must add that of a good rational approximation, which can be written asThen, ; hence, , and, since Equation (A4) is still valid, we obtain the following bounds:
Thus, in case (i), either for optimal or non-optimal scales, the following is satisfied: - (ii)
- If , then . Depending on the scale, we have the following:
- (a)
- The closure of an optimal cyclic scale satisfies ; hence, . Then, in taking logarithms,The elementary intervals satisfy
- (b)
- The closure of a non-optimal cyclic scale satisfies ; hence, . Then, in taking logarithms,Then, the elementary intervals satisfy
- (c)
- For an accurate cyclic scale for the above condition, , we must add that of a good rational approximation, which can be written asThen, ; hence, , and, since Equation (A7) is still valid, we obtain the following bounds:
Thus, in case (ii), either for optimal or non-optimal scales, the following is satisfied:
Table A1 summarizes the above results over the octave . Notice that, for optimal scales, Equations (A3) and (A6) give the following bounds, similarly to the theory of continued fractions:
Table A1.
Bounds for , , and , depending on the type of cyclic scale.
Table A1.
Bounds for , , and , depending on the type of cyclic scale.
| Cyclic Scale | ||
|---|---|---|
| optimal | ||
| non-optimal | ||
| accurate | ||
| in general | ||
Appendix B. Deviation of a Cyclic Scale from n-TET
We calculate the interval between the notes with ordinal j of the respective scales and , corresponding to the tones and . According to Equations (A2) and (A1), the deviations of each note of with regard to compensate each other, and in the end, they close the octave exactly. If the tone corresponds to the iteration , then (Equation (28) [17]). Therefore, . In bearing in mind that , the scale note is
Lemma A1.
The interval that separates the two notes is
Therefore, all of these intervals have the same sign, which is positive if and negative if . In taking absolute value, the interval between these close notes becomes a distance:
As expected, this distances increases with the iterations, so that the maximum is reached by the last iterate, , i.e., . Then, a condition such as , for , becomes , which in cents is
For an optimal cyclic scale, according to Equations (A3) and (A6), , so that and in Equation (A11). Therefore, we conclude that .
Theorem A1.
The notes , , of an optimal cyclic scale and the ones of the corresponding n-TET scale alternate, i.e., for , either for (i) or (ii) .
Such a situation is also possible for non-optimal scales, although there are exceptions where the tones do not alternate. For instance, in a non-optimal scale with , Equation (A7) implies that . But in a cyclic scale where , the condition , , stated in case (ii), is not fulfilled. An example of such a situation is the 29-tone scale, with and , where . The 28-th tone (454.74 cents) is below (496.55 cents) more than a factor of (41.38 cents), and between and , there are two consecutive tones, and , of the 29-TET scale.
References
- Clough, J.; Myerson, G. Variety and multiplicity in diatonic systems. J. Music. Theory 1985, 29, 249–270. [Google Scholar] [CrossRef]
- Clough, J.; Myerson, G. Musical scales and the generalized circle of fifths. Am. Math. Mon. 1986, 93, 695–701. [Google Scholar] [CrossRef]
- Clough, J.; Douthett, J. Maximally even sets. J. Music. Theory 1991, 35, 93–173. [Google Scholar] [CrossRef]
- Douthett, J.; Krantz, R. Maximally even sets and configurations: Common threads in mathematics, physics, and music. J. Comb. Optim. 2007, 14, 385–410. [Google Scholar] [CrossRef]
- Pinkerton, R. Information theory and melody. Sci. Am. 1956, 194, 77–87. [Google Scholar] [CrossRef]
- Youngblood, J. Style as information. J. Music. Theory 1958, 2, 24–35. [Google Scholar] [CrossRef]
- Lewin, D. Some applications of communication theory to the study of twelve-tone music. J. Music. Theory 1968, 12, 50–84. [Google Scholar] [CrossRef]
- Cohen, J. Information Theory and Music. Behav. Sci. 1962, 7, 137–163. [Google Scholar] [CrossRef]
- Knopoff, L.; Hutchinson, W. Information theory for musical continua. J. Music. Theory 1981, 25, 17–44. [Google Scholar] [CrossRef]
- Knopoff, L.; Hutchinson, W. Entropy as a measure of style: The influence of sample length. J. Music. Theory 1983, 27, 75–79. [Google Scholar] [CrossRef]
- Snyder, J. Entropy as a Measure of Musical Style: The Influence of a Priori Assumptions. Music. Theory Spectr. 1990, 12, 121–160. [Google Scholar] [CrossRef]
- Temperley, D. Music and Probability; The MIT Press: Cambridge, MA, USA, 2006. [Google Scholar] [CrossRef]
- Sakellariou, J.; Tria, F.; Loreto, V.; Pachet, F. Maximum entropy models capture melodic styles. Sci. Rep. 2017, 7, 9172. [Google Scholar] [CrossRef] [PubMed]
- Gündüz, G. Entropy, energy, and instability in music. Phys. A Stat. Mech. Its Appl. 2023, 609, 128365. [Google Scholar] [CrossRef]
- Arnold, V.; Avez, A. Ergodic Problems of Classical Mechanics; Mathematical Physics Monograph Series; Benjamin: New York, NY, USA, 1968. [Google Scholar]
- Cubarsi, R. An alternative approach to generalized Pythagorean scales. Generation and properties derived in the frequency domain. J. Math. Music. 2020, 14, 266–291. [Google Scholar] [CrossRef]
- Cubarsi, R. On the divisions of the octave in generalized Pythagorean scales and their bidimensional representation. J. Math. Music. 2024, 18, 1–19. [Google Scholar] [CrossRef]
- Carey, N.; Clampitt, D. Aspects of Well-Formed Scales. Music. Theory Spectr. 1989, 11, 187–206. [Google Scholar] [CrossRef]
- Carey, N.; Clampitt, D. Two Theorems Concerning Rational Approximations. J. Math. Music. 2012, 6, 61–66. [Google Scholar] [CrossRef]
- Carey, N.; Clampitt, D. Addendum to “Two Theorems Concerning Rational Approximations”. J. Math. Music. 2017, 11, 61–63. [Google Scholar] [CrossRef]
- Žabka, M. Generalized Tonnetz and well-formed GTS: A scale theory inspired by the Neo-Riemannians. In Mathematics and Computation in Music; Springer: Berlin, Germany, 2009; pp. 286–298. [Google Scholar]
- Žabka, M. Well-Formedness in Two Dimensions: A Generalization of Carey and Clampitt’s Theorem. J. Math. Music. 2010, 4, 1–30. [Google Scholar] [CrossRef]
- Hellegouarch, Y. Gammes naturelles I. Gaz. Des Mat. 1999, 81, 25–39. [Google Scholar]
- Hellegouarch, Y. Gammes naturelles II. Gaz. Des Mat. 1999, 82, 13–26. [Google Scholar]
- Kassel, A.; Kassel, C. On Hellegouarch’s definition of musical scales. J. Math. Music. 2010, 4, 31–43. [Google Scholar] [CrossRef]
- Loya, P. Amazing and Aesthetic Aspects of Analysis; Springer: New York, NY, USA, 2017. [Google Scholar]
- Khinchin, A. Continued Fractions; University of Chicago: Chicago, IL, USA, 1964. [Google Scholar]
- Milne, A.; Sethares, W.; Plamondon, J. Tuning continua and keyboard layouts. J. Math. Music. 2008, 2, 1–19. [Google Scholar] [CrossRef]
- Sethares, W.; Milne, A.; Tiedje, S.; Prechtl, A.; Plamondon, J. Spectral Tools for Dynamic Tonality and Audio Morphing. Comput. Music. J. 2009, 33, 71–84. [Google Scholar] [CrossRef]
- Wilcox, A. Indices of Qualitative Variation; Oak Ridge National Lab.: Oak Ridge, TN, USA, 1967.
- Liern, V. On the construction, comparison, and exchangeability of tuning systems. J. Math. Music. 2015, 9, 197–213. [Google Scholar] [CrossRef]
- Diggans, C.T.; AlMomani, A. Boltzmann–Shannon interaction entropy: A normalized measure for continuous variables with an application as a subsample quality metric. Chaos Interdiscip. J. Nonlinear Sci. 2023, 33, 123131. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).