1. Introduction
The cellist Pau Casals is often credited with the coining of the term expressive intonation. A former pupil of Casals [
1] explains that “when asked what was the secret of his playing, he would say:
It is my intonation: I know where each note is to go”.
This paper extends [
2] where a lineage of cyclic scales is defined by studying an application of the convergents and semiconvergents of the continuous fraction expansions generating a cyclic scale, i.e., a generalized non-degenerate Pythagorean scale. From a mathematical viewpoint, it is explained how they can be used to obtain a scale with specific regular microtonal subdivisions, i.e., of less than a semitone of the usual 12-tone scale.
After each convergent, a lineage ends and another begins, meaning that in each successive scale of the chain of cyclic scales, the same number of new tones (generic accidentals) are incorporated to the tones of the previous scale (generic diatones). Along a lineage, the new tones become regularly interspersed between the tones of the first scale of the lineage, making it easy for a music player (using a fretless instrument) to know where each note is to go.
Based on the algebraic structure of cyclic scales, Hellegouarch [
3] gave a mathematical interpretation of expressive intonation. Here, an alternative interpretation based on the elementary interval distribution is proposed. It has three novelties. First, it generalizes the concept of expressive intonation to scales of
tones, which is referred to as extended expressive intonation. Second, in the current approach, it is possible to choose where to place the enharmonics, that is, to choose which notes are we interested in allowing a microtonal variation. Third, along a lineage, the step and co-step indices associated with both elementary intervals become a coordinate system to refer the scale tones. The generic diatones remain fixed to one coordinate; likewise, they have a fixed position on the staff, and the generic accidentals share one coordinate with the diatones and differ in the other, like the accidental symbols that modify the diatones in the staff.
Section 2,
Section 3 and
Section 4 are necessary to introduce and review some basic concepts. A new degree of freedom, the starting index, has been introduced in the definition of a concrete scale. This is a key concept that allows to select specific microtones.
Section 5 explains how expressive intonation is understood in music.
Section 6 proposes several measures accounting for the degree of expressivity and transportability of the scale.
Section 7 and
Section 8 determine how, along a lineage, the step and co-step indices (accounting for the cyclic order of the scale tones) are related to the number of iterates of the generator, and vice versa. Such relationships relay on the starting index of the concrete scale and are used to represent the scale tones in a two-dimensional graph to describe their evolution along a lineage, as well as the position of the new accidentals.
Section 9 provides examples for lineages
and
, depending on the selection of the microtones. The last section summarizes the main results.
2. Preliminaries
2.1. Frequency and Pitch
In music, two frequencies (or tones) and are identified in one category, although saying that the latter is one octave higher than the former. This implies an equivalence relation between frequencies by considering all the frequencies , as one equivalence class. Since with the product in is a commutative group, by assuming the fundamental frequency as , the set of all its octaves, denoted as , is a cyclic subgroup of of infinite cardinal. The frequency classes (FCs) are the elements of the factor group , also commutative for multiplication. Thus, each frequency has its FC in .
By taking the binary logarithm of , we obtain its pitch (or note) . This is what the ear hears, since the sense of hearing detects relative variations of frequency. The corresponding pitch class (PC) is noted as . is the octave, usually represented as a circle of unit length. Distances between two FCs, , are measured according to the metric given by the circle distance in terms of their PCs, .
2.2. Abstract Scale
A cyclic scale (also known as a well-formed scale of one generator) is a scale generated by a single tone g other than a power of 2. In Pythagorean tuning, the generator is the third harmonic, (or equivalently, its PC, the fifth ). The infinitely many tones generated by g can be described as follows.
Given
, a
g-projection function [
4] is defined as
This family of projection functions is an Abelian group for composition, which is isomorphic to . We say that and belong to complementary FCs if they satisfy , i.e., . For , it holds that .
The family is an infinite cyclic subgroup of and its FCs are the elements of the factor group , therefore isomorphic to the family of projection functions generated by . Thus, we write .
In general, subfamilies of , which are referred to a set of indices A, namely , are not a subgroup of , unless A is a subgroup of , i.e., with , in which cases is of infinite cardinal. The only case where is a finite cyclic group is for . This corresponds to an n-tone equal temperament (n-TET) scale, which is considered as a degenerate cyclic scale.
In order to obtain a finite and closed set of FCs
, we should consider that a particular FC
plays the same role as the fundamental (for instance, when hearing is not able to differentiate both tones). This can be achieved if the iterates
are chosen so that in
the interval between
and the last iterate
contains no previous iterates. This is known as the closure condition [
5,
6,
7]. Then, we may assume
. Therefore, in the family of projection functions
, an equivalence relation identifying two projection functions
can be defined if there exists an integer
r such that
. Given the subgroup
, the factor group
determines an
n-tone
abstract scale.
2.3. Concrete Scale
The abstract scale is a finite cyclic group of classes of projection functions. For each class , we choose a representative in . In order to include the fundamental , the representatives must satisfy the condition of being n consecutive iterates with . Any set of n consecutive tones within the above set is a concrete scale.
The obvious case is to choose the representatives from the fundamental onward,
(forward iterates). This forms a musical section [
3]. Another musical section is formed by the representatives
(backward iterates). The equivalent representatives of both sections differ in the factor system
. Therefore, by choosing the
starting index , with
, it is possible to select FCs of a unique section (
), or FCs belonging to both sections, allowing to choose tones differing in a small quantity corresponding to the factor system, which in music is referred to as comma. Thus, depending on the starting index, a concrete scale is formed by the FCs (referred to as scale tones),
For example, in the case of the 7-tone Pythagorean scale, by assuming the fundamental as the note C, the iterates would generate the FCs of , which correspond to the notes C-G-D-A-E-B-F♯. Instead, the iterates would generate the FCs of , with notes F-C-G-D-A-E-B. The ratio between the two differing tones, (F♯) in the former case and (F) in the latter, is , which generates the factor group.
Several criteria to select the starting index of a concrete scale are discussed in [
3,
8,
9,
10,
11].
3. Properties of the Cyclic Scales
3.1. Parameters Determining the Scale
Among those studied in [
2], to follow the current paper, some important properties need to be pointed out. According to the notation in [
4,
12], the scale tones
are written as
, with
(
meaning the integer part, i.e., the floor function
if
and the ceiling function
if
).
Forward iterations ordered from lowest to highest pitch (in cyclic order, or by ordinal) have two extreme tones, the minimum tone and the maximum tone , which determine the two elementary factors in the multiplicative space of frequencies, namely (up the fundamental) and (down the fundamental), so that . For , the indices satisfy , all of them coprime. These factors have the corresponding elementary intervals in the additive logarithmic space of notes, and , with . The value is the interval closure.
The tone , which does not belong to the scale , provides the closure condition, either , above the fundamental, or , below the fundamental.
Two parameters inform us whether closes above or below the fundamental. For , and by using the index (also ), one parameter is the scale closure , which is a value close to 1 ().
The distance from the fundamental to the frequency class of is known as the (Pythagorean) comma . Then, either or . That is, . Then, the perfect closure of the scale would take place for or, by taking logarithms, for . Therefore, as we explain in the following section, the best closures can be determined from the best approximations , which connects the current geometrical approach with the theory of continued fractions and best rational approximations of an irrational number.
The other parameter is the scale digit , which takes values 0 or 1. Thus, () and (). The index N is known as chromatic length of the generator.
3.2. Pythagorean Tuning
In Pythagorean tuning, the PCs corresponding to the n scale tones can be written as (the iterate minus its integer part), with . In the circle of the octave , they can be placed clockwise, with 0 representing the fundamental. Multiplied by 1200, we obtain the value of the PC in cents ().
Figure 1 provides an example of how the 12-tone Pythagorean scale generated by the fifth (the class of the third harmonic) is formed. The fifths iterates of indices
and
are the extreme tones, while
provides the closure, since there is no other tone with a frequency that is between
and the fundamental. The ratio between consecutive iterates is either
or
, except for the last iterate (the wolf fifth), which compensates for the comma. When the scale tones are arranged in pitch order, i.e., by ordinals, their ratios are either
(the corresponding interval is
) or
(the corresponding interval is
).
In terms of the
kth iteration (fifth), the notation generally used for these notes is as follows (the diatones, i.e., the notes of the heptatonic Pythagorean scale, are in boldface):
… | B♭♭♭ | F𝄫 | C𝄫 | … | B𝄫 | F♭ | C♭ | … | B♭ | F | C | G | D | A | E | B | F♯ | C♯ | … | B♯ | F𝄪 | C𝄪 | … | B𝄪 | F𝄪♯ | C𝄪♯ | … |
… | −16 | −15 | −14 | … | −9 | −8 | −7 | … | −2 | −1 | 0 | 1 | 2 | 3 | 4 | 5 | 6 | 7 | … | 12 | 13 | 14 | … | 19 | 20 | 21 | … |
The accidentals added to a note X mean the following: X♯ is X plus , X♭ is X plus , and X𝄪 is equivalent to X♯♯.
3.3. Convergents and Semiconvergents
Determining a cyclic scale from the closure condition is equivalent to being a convergent or semiconvergent of the canonic continued fraction expansions of .
The scales corresponding to the best rational approximations (best approximation of the second kind [
13,
14])
of
are the convergents of its canonic continued fraction expansions (Best Approximation Theorem, e.g, [
15]). However, while the convergents are the best double-sided approximations, the semiconvergents are only best one-sided approximations.
Appendix A provides extensive information about the theory of continued fractions and its relation with one-sided best approximations, as well as the main properties of convergents and semiconvergents.
The convergents are associated with the best closure . They provide optimal scales. Then, by considering the interval closure in , is optimal for .
3.4. Chain of Cyclic Scales
Cyclic scales (whether optimal or not) form a chain,
We are assuming the same starting index for all of them. In each link, the scale
is composed of the tones of
, the generic diatones, in addition to the new non-adjacent tones, the generic accidentals [
12].
Cyclic scales can be obtained from the following iterative process. Starting from the indices of the extreme tones
m and
M of
, with
(
), for the next cyclic scale in the chain
, these values are obtained as
which, in matrix form, can be written as
Therefore, according to (
1), one of the indices of the extreme tones always repeats. This happens while approximations are improving by one side until reaching an optimal scale. Then, the approximations begin to improve by the other side.
3.5. Similar Algorithms
In addition to the approaches described above to build cyclic scales, there are other equivalent and related algorithms.
For the cyclic scale
, with extreme tones of indices
m and
M, the value
n provides a convergent or semiconvergent
of
according to one of these cases:
or
, depending on whether
or
. Then, we meet a situation such as
, together with the Bézout’s identity corresponding to pairs of coprime numbers,
, which leads to a new improvement
of the approximation. This situation is common to the continued fractions approach, the Farey sums, the structure of the Stern–Brocot tree, and its dual, the Raney tree [
16,
17,
18], also known as Calkin–Wilf tree [
19,
20], where for two consecutive approximations, by using the extended Euclidean algorithm, it is possible from one to determine the other.
In the case of the Farey sums [
21], if two rational approximations satisfy
then we can lessen the size of the interval from a new approximation
that substitutes the appropriate extreme. This algorithm basically reproduces the Stern–Brocot tree.
The Stern–Brocot tree is a full binary tree where the nodes are labeled in such a way that each positive rational number occurs exactly once. Vertically, it provides the usual ordering of the rationals. For and , the median is the fraction of which they are the parents. Every row consists of the fractions that are medians of elements of previous rows. Positive irrational numbers can be associated with a unique infinite pathway down the tree and the nodes which are passed by on such a finite or infinite path are the semiconvergents of the corresponding rational or irrational number.
Depending on the value of
in (
2), the matrix
is equivalent to the
matrix representing one branche of the Stern–Brocot tree [
22,
23,
24,
25] that encode the subsequent nodes of the fractions
. If
, the subsequent indices in the tree are obtained by multiplying by
. If
, the subsequent indices are obtained by multiplying by
.
4. Regular Subdivisions
4.1. Lineages
If R is the index of one of the extreme tones, all the scales where this index is repeated form the R-lineage (or simply lineage). Each scale represents an stage of the lineage. From the first repetition of R (second stage of a lineage) until the end of the lineage, R is the ruling index.
Table 1 of [
2] is reproduced here to facilitate understanding the process. It displays the lineages for Pythagorean scales (
) in terms of their
ith ordinal in the chain (indicated by the subindex of
). The beginning (and end) of a lineage is marked with a bold symbol. The symbol ↷ indicates an optimal scale, i.e., when the number of tones is the denominator of a continued fraction convergent, while the others are semiconvergents. The first lineages are
,
,
, etc. Some facts to remark:
In ends and starts, a lineage ⇔ is optimal.
The ruling index of a lineage is the number of tones of the optimal scale just before the first scale of the lineage.
The scale digit is the same along a lineage, except in the last stage, where a new lineage begins.
If the first scale of the lineage is optimal, it is also its penultimate. Then, the lineage is short.
Extended expressive intonation refers to the regular subdivision of the scales within a lineage. A lineage includes a series of consecutive semiconvergents and ends after the next convergent. During a lineage, the generator is approximated by one side by a convergent, say , and by the other side by a series of semiconvergents , , which do not improve the approximation of the convergent. Just after reaching a new convergent (on the opposite side of ), the lineage ends and a new one begins. In the next sections, we see that what happens during this process is that one of the elementary intervals remains constant, the one corresponding to the ruling index, and the other becomes successively subdivided in a regular manner. This facilitates a precise tuning while using scales with extended tones.
4.2. Mechanical Words
Cyclic scales can also be built from mechanical words [
12,
26,
27,
28]. It can be defined as a Christoffel word of the alphabet
, associated with both elementary factors, with slope
and length
n. The full octave is then represented by a word with
M letters
U and
m letters
D, since
. For forward iterates starting at the fundamental, from left to right, the first step must be
U and the last step of the octave must be
D. For backward iterates starting at the fundamental, the sequence is symmetric. Each value
n is associated with one word
, for instance,
,
,
.
Depending on the relative size of the elementary factors, the scale can be refined to form the next cyclic scale in the following way. Using forward iterates, if , by factorizing , otherwise by factorizing , and so on.
For instance, for the successive cyclic scales (with starting index 0), the word associated with is , where . In the next cyclic scale, , the smallest factor D is maintained (and so is the index M of the maximum tone). Therefore, in the 7-tone scale, the interval factorized is the one associated with the minimum tone, . Then, the generic accidentals of the 12-tone scale come just at the end of a factor (which come alone), and the generic diatones (the tones belonging to ) come at the end of a factor D. Thus, . Since the 7-tone scale is not optimal, in the next 12-tone scale, the relationship is still maintained. Hence, the next letter to be split is also U.
4.3. Scale Partition
The partition of the octave induced by the generator iterates has exactly two sizes of scale steps, and each number of generic intervals occurs in two different sizes (Myhill’s property) [
29,
30,
31]. Both elementary intervals are associated with the two families of tones, the generic diatones and accidentals [
12].
The semiconvergents of the canonic continued fraction expansions of the generator, although they do not provide optimal scales, induce a regular subdivision of one of the elementary intervals until reaching a convergent, which marks the scale lineages along the chain of cyclic scales.
The general properties of the scale lineages were studied in [
2] and are not reproduced here. Instead, we focus on the subdivisions within a lineage, which follow the same procedure as mechanical words. It is illustrated in the scheme of
Figure 2, corresponding to the lineage
.
The scale is optimal (in boldface). Subindex 2 indicates that this is the second scale in the chain (the first only contains the fundamental, and the second contains the fundamental and the class of the generator). The next scale, , is the beginning of the lineage (and the end of the previous lineage). The scale is optimal. Therefore, the next scale, , is the end of the lineage (and the beginning of the next lineage). In each stage of the lineage , the greatest interval of the scale () is subdivided into two intervals, one with the same size as the smaller interval () and the other with the remaining size ( on the left and on the right) until the size of the remaining interval becomes the smallest one.
In each stage, 2 tones (the ruling index), as many as the tones of the previous optimal scale, are added. The old notes are the generic diatones, and the new ones are the generic accidentals. Along the lineage, one interval is maintained (which becomes the diatonic interval in the next scale) by producing a regular subdivision of the intervals of the previous optimal scale .
The generic diatones of the first scale of the lineage are the subdiatones of the lineage, i.e., the tones of the previous optimal scale. The generic diatones of the last scale of the lineage are the superdiatones of the lineage. The sum of the number of subdiatones and superdiatones matches the number of tones of the last scale of the lineage.
5. Extended Expressive Intonation
We pay attention to the lineage , with ruling index . In each stage, 5 new tones are regularly added. In the last 17-tone scale, the elementary interval of the 7-tone scale is split twice into 2 intervals, one of them with the same size as .
In music, this lineage has the following interpretation. First, we may have the freedom of choosing a set of notes to which refer the added ones (it is not necessary to choose the first scale of the lineage). Traditionally, it has been the heptatonic scale C, D, E, F, G, A, B, corresponding to the iterates . We call them structural diatones of the lineage. Choosing the starting index , for we obtain five new tones, namely D♭, E♭, G♭, A♭, and B♭, which are added to the left of some structural diatones. For , we obtain another set of five accidentals, namely C♯, D♯, F♯, G♯, and A♯, which are added to the other side.
Usually, expressive intonation is understood [
32] as that sharps are played slightly higher and flats slightly flatter than they are on a keyboard tuned in equal temperament, that is, played as in Pythagorean tuning [
3]. Such pairs of notes that in the Pythagorean tuning do not produce an identical pitch, but are still called enharmonic, providing a (fretless) string player with more degrees of freedom in order to tune specific frequency ratios, such as closer to just intonation, to meantone temperament or to equal temperament [
11].
For example, in the 12-tone cyclic scale, seven fifth iterations forward starting at the note X lead to the note X♯ = X (modulo one octave), where is the chromatic semitone. Seven fifth iterations backward yield X♭ = X. The chromatic semitone is the interval between a diatone X and its homonymous accidentals X♭ and X♯. Instead, the semitone between a diatone and its enharmonic equivalent accidentals is the diatonic semitone , e.g., D and E♭ (instead of D♯), and D and C♯ (instead of D♭).
Notice that d is the smallest semitone of the heptatonic scale, e.g., the interval B-C. Their difference, , is the interval closure (the interval closure is related to the Pythagorean comma as ), resulting from 12 consecutive fifth iterations. Similarly, the interval between 7 iterations forward and 5 iterations backward (e.g., D♭←C→C♯) is . Therefore, in expressive intonation, it is possible to apply microtonal corrections of .
Between the tones of the 7-tone cyclic scale one tone apart (the intervals E-F and B-C are distant one diatonic semitone d) we can have the following accidentals: on the one hand, C♯ = C = D, D♯ = D = E, F♯
= F = G, G♯ = G = A, and A♯ = A = B, which form the typical 12-tone Pythagorean scale, and on the other hand, D♭ = C = D, E♭ = D = E, G♭ = F = G, A♭ = G = A, and B♭ = A = B, which complete a 17-tone cyclic scale. That is, each accidental is a structural diatone .
Figure 3 shows how the elementary intervals can be subdivided, including forward and backward iterates. Notice that the notation generally used in Pythagorean tuning is
and
, which are two different tones. But
and
have no specific symbol referred either to
X or Y. They are usually referred to the tone before
X or to the tone after
Y, respectively. For instance, if
, then
; if
, then
. Instead, the Turkish notation (p. 46 in [
33]) has specific symbols for all of them, although it is not consistent with the Pythagorean convention, since it actually assumes equal temperament and considers that the flat and sharp match.
Such a relationship between the 7-, 12-, and 17-tone consecutive cyclic scales in which expressive intonation is based (the lineage ) can be reproduced for other lineages. The tones of the first scale of the lineage are the tones to which the successively added accidentals can be referred with precision. In this way, in non-fretted instruments, such as stringed instruments and some brass instruments (natural horn, trombone, etc.), it is possible to know with precision “where each note is to go”.
For families beyond the lineage , extended expressive intonation may seem of difficult application. Nevertheless, the current work is also useful to highlight some important properties of cyclic scales and may help to understand their structure.
Some Middle-East music systems, such as the Ottoman, use a similar approach [
34], where a selected row of tones out of 53 are allowed to vary one Pythagorean comma up or down to producing melodies with emotional intent or to tuning to precise frequency ratios. As mentioned in [
2], according to [
34] (and references therein), Al-Kindi (ca. 800–873) was the first to make use of the Abjad (Arabic shorthand for “ABCD”) pitch notation to denote finger positions on the ud for his 12-note approach, which was purely Pythagorean. It was the precursor to Urmavi’s 17-tone scale (1216–1294), which added five additional fifth iterations. The resulting scale was formed according to a starting index
. This is an example of the lineage
.
On the other hand, several extensions of the 17-tone scale, by adding fifths backward and forward, generate the 24-tone Arel–Ezgi–Uzdilek and Yekta variants, with starting index −12 for the former and −14 for the latter.
By completing the above scales, the 53-tone Pythagorean scale is reached. This is an example of the lineage . It embodies Urmavi, Arel–Ezgi–Uzdilek, and Yekta-24 systems.
The 53-tone Pythagorean is very close to the 53-TET scale. We see in
Section 9.2 that the difference between the two elementary intervals is only
. For this reason, in practice, the 53-TET scale has replaced the Pythagorean (similarly to what happened with the usual 12-tone scale), although the procedure of adding generic accidentals to a set of generic diatones is governed by the Pythagorean approach. In the end, the 53-TET scale has 9 commas per whole tone and 53 commas per octave. This comma is Holdrian, i.e.,
wide, which is less than one cent error to the Pythagorean comma.
6. Expressivity and Transportability
In addition to the greater flexibility gained with expressive intonation to approximate simple frequency ratios, it also increases the possibilities of interval translations to match an existing scale note. We give an estimation of how transportable the intervals ranging from the fundamental to each note of an n-tone cyclic scale are.
Once the iterates of the scale are ordered by pitch along the octave, from C onward and according to the cardinal index , their elementary factors U and D form a word with two letters, such as (for the starting index ). In addition, this word can be broken into syllables, so that each syllable is the interval between two consecutive generic diatones, such as , or, in general, between two structural diatones. The syllabicated word of the scale intervals is noted as .
Given
, we select an interval
composed of the first
I elementary intervals. Then, we check whether this interval translated to the
jth tone (
) ends matching any note of the scale. If yes, it is transportable or transferable. The number of transportable intervals (labeled according to the starting note) is noted as
(the first interval, starting at the fundamental
, is not included). This is performed for all possible intervals from
to
(the full octave P8 is not included). For an equal temperament scale, all the intervals are transferable, so that the number of matches is
. Hence, the ratio
estimates the transportability of the intervals relative to an equal temperament scale.
Nevertheless, from a tonal perspective, what is more important is whether the intervals can be translated matching any diatone of the scale, since this can be used as a measure of how expressive the intonation is. In such a case, we let
be the number of diatones and
be the number of transferable intervals to the diatones. Since for an equal temperament scale the number of matches is
, the transportability over the diatones relative to an equal temperament scale is given through
which we call expressivity. Hence, the following ratio expresses the contribution of the diatones to the overall transportability:
Table 2 illustrates these qualities for the 12-tone scale, referred to the structural diatones of the 7-tone scale with starting index −1, notes C, D, E, F, G, A, B. For the 12-tone scale, we obtain the following values:
,
,
. It is worth remarking that the transportability of the diatonic intervals (i.e., M2, M3, P4, P5, M6, and M7) over the diatones is nearly full, with the exception of P4 that is not transferable to F. Thus, a high value of
r indicates that most transportability is due to the diatones; in other words, there is low transportability to non-diatones.
Table 3 displays the same values for the 17-tone scale, with starting index −6 and the same structural diatones as before. We obtain
,
,
. Therefore, the scale has gained transportability and expressivity relatively to the 12-tone scale, mostly for non-diatonic intervals.
7. Step and Co-Step Indices
The ratios corresponding to the tones of the scale
in
can be expressed in terms of the elementary factors
and
from the following relationship:
for
,
,
. Then, it is straightforward to obtain
Since
[
4], then
, so that the determinant of the foregoing matrix is
. Hence, there is an isomorphism between the two pairs of indices describing the scale. Therefore, the two elementary factors generate the scale tones and introduce the non-equal temperament.
Now, we translate this relationship to the logarithmic space of notes in terms of the elementary intervals and refer to the ith cyclic scale of the chain.
In a cyclic scale of tones, after iterations of the generator, the octave remains composed of intervals, which are a combination of the two elementary intervals of different sizes, and , so that . The scale tones can be expressed in terms of the step and co-step indices as , so that the note with cardinal is given by .
As mentioned before, the next cyclic scale to , with tones, is obtained by dividing the greatest of both intervals into two intervals, one with the same size as the smaller interval and the other with the remaining size.
The scale tone corresponding to the
kth iterate (fifth, in Pythagorean scales) and
th cardinal have a unique interval decomposition [
12] (for
, it holds that
,
, for other values of
k these values can also be negative) in terms of
and
, so that, by inverting (
3), we obtain
where
and
. Equation (
5) describes an automorphism of
.
Depending on the scale digit , we distinguish two cases. To a lineage, one of them applies.
- (i)
If (), then , , .
Since , (with the same size as ) is the diatonic elementary factor.
According to (
4), since
is maintained,
Then,
for
. Therefore, the isomorphism
redistributes the
tones along the octave, so that
is maintained (while
, the size of the
-intervals becomes smaller as
i increases) and the number
of
-intervals increases progressively (their size is maintained). In each step, the new accidentals associated with the
-interval generate a family of microtones with a regular intervalic space between them.
For all indices such that
, with
, since
, the
-tone scales in the lineage of
can be represented as a lattice in the coordinate system with axes accounting for the
u- and
d-intervals, according to
The number of steps, either horizontally or vertically, between notes is the note cardinal.
The generic diatones are determined by the index associated with the interval . In the last scale of the lineage, say of tones, the elementary intervals satisfy . The last added accidentals share their coordinate with a diatone and are at this smaller distance from it. Therefore, they play the role of the enharmonics.
- (ii)
If (), then , , .
Since , (with the same size as ) is the diatonic elementary factor.
According to (
4), as
is maintained,
Then,
for
. Hence,
In this case, is maintained while the size of the -intervals becomes smaller as i increases (on condition that ), and the number of -intervals progressively increases while their size is maintained. The new accidentals associated with the -interval generate a family of regular microtones.
For all the indices such that
for
, since
, the
-tone scales in the lineage of
have coordinates
The generic diatones are determined by the index associated with the interval . In the last scale of the lineage, say of tones, the last added accidentals share the coordinate with a diatone and are separated by the elementary interval .
The matrices in (
6) and (
8) are equivalent to the matrix
of (
2), representing the branches of the Stern–Brocot tree. Therefore, from this viewpoint, a lineage is composed of consecutive cyclic scales in the same branche, which are connected by the same matrix.
8. Step Intervals
Let us modify (Equation (38) in [
12]) in order to work explicitly in terms of the starting index, i.e., with tone iterates satisfying
, for
such that
. The
kth fifth iterate, with cardinal
, is the tone notated as
. The tone with cardinal cardinal
is given by
In other words, we may think of backward and forward iterates from to gain such a degree of freedom.
The inversion of (
5) yields
or, alternatively,
In the former equality, each iterate of the
-tone scale is described from a direct product
and, in the latter equality, as an automorphism of
.
In this way, from the values
, Equation (
11) determines
k. According to (
10), depending on the value
, we know which interval, either
u or
d, leads to the following tone.
Figure 4 shows two examples, corresponding to the case (i) of the previous section, for the scales
and
in relation to the structural diatones of
, with iterates
(F, C, G, D, A, E, B, blue dots).
Tonal Western music has a diatonic structure, which is reflected on the musical staff and key signature. Its historical evolution until serial dodecaphonism could be viewed as a process of gaining resources and flexibility about this diatonic skeleton.
On the top of
Figure 4, such a diatonic structure for the 12-tone scale becomes clear. The diatones, which in the pentagram occupy a precise position, either in a ledger line or in a space, in the above graph are assigned to a unique
q value (since
d is the interval associated with the diatones). The accidentals, which in the staff need to be notated with an accidental symbol since they occupy the position of a diatone, in the graph have the same coordinate
q as a diatone but a different coordinate
p.
In the next cyclic scale, on the bottom of
Figure 4, the 17-tone scale, the previous accidentals now occupy an exclusive coordinate
q, i.e., all the tones of the 12-tone scale are now the generic diatones of the 17-tone scale, each one in a different
q position. The new generic accidentals share their
q coordinate with a generic diatone, although with a different coordinate
p.
10. Conclusions
A non-degenerate cyclic scale is a well-formed scale of one generator, other than a rational power of two. It excludes equal temperament scales and is formed by a set of notes that fulfills the closure condition. The notes divide the octave into the same number of intervals of exactly two different sizes.
Its algebraic structure is given by the abstract scale, which is a quotient group of classes of projection functions. However, such an approach does not account for the size of the intervals between notes. For this purpose, it is necessary to choose a representative of each class in order to define a concrete scale. The n tones of a concrete scale must meet two basic conditions. First, the set must include the fundamental. Second, the scale tones in the octave must fulfill , for . Therefore, there are n different ways to choose the scale.
From a mathematical viewpoint, this means either to fix a tone as the origin of the iterations or to choose which tone takes the role of fundamental. From a musical viewpoint, this is important for two reasons. One is that depending on the chosen origin, the tones are more or less close to specific frequency ratios, such as
,
,
, etc., which is of interest in order to approximate just intonation scales [
11]. The other reason, which is a main point in the present article, is that depending on where the series of iterates starts, it is possible to select the microtones generated along a lineage.
Scale lineages are associated with the convergents and semiconvergents of the continued fraction expansions of the generator. According to [
2], after a scale corresponding to a convergent, a lineage of scales begins and the previous one ends. A lineage includes a series of consecutive semiconvergents and ends after the next convergent. During a lineage, one of the elementary intervals remains constant, and the other is successively subdivided in a regular manner. At each stage, the intervals that are subdivided are those of generalized diatones. In addition, it is possible to select a set of structural diatones to which the accidentals are referred.
Western musical notation follows such a similar process, and it is this process that the current work has tried to reproduce. The seven notes of the diatonic scale occupy a precise position in the pentagram, either in a ledger line or in a space. Over the centuries, the set of scale notes has become larger, by gaining in harmonic and melodic resources. For instance, old pentatonic and heptatonic scales have been extended with scales such as the usual 12-tone chromatic scale, scales based on quarter-tone temperament such as Arabic 24-tone, Vicentino’s 31-tone system, Arabic 17- and 53-tone systems, or a manifold of just intonation microtonal scales [
36].
New tones have been added to the heptatonic scale, which constitute the basic structure of the staff, although they need to be notated with an accidental symbol, since they occupy the same position of a diatone.
When the seven notes of the diatonic scale are notated with sharps and flats to modify their pitch, we speak of expressive intonation. This corresponds to the lineage . If the same is performed in one of the following lineages, we are speaking of extended expressive intonation.
This way of considering the scale notes, as a set of structural diatones in addition to forward and backward accidentals, has a two-dimensional representation to which we also pay attention.
The way of describing the scale notes in cyclic order from the fundamental is by giving two coordinates indicating how many generic diatonic and accidental intervals there are between each note and the fundamental. This is the representation of the scale in terms of the step and co-step indices, which is given by an automorphism, either (
7) or (
9), depending on the lineage, i.e., depending on the side where the semiconvergents are improving the approximations of the generator tone.
When the notes are represented in a two-dimensional graph, the generic diatones are anchored to fixed values on one of the coordinate axes. In this axis, the generic diatones take consecutive values (just like in the position of a staff). The generic accidentals added to a diatone maintain the coordinate linked to the diatone, but vary in the other coordinate (also similar to the accidentals symbols added to the notes on the staff). By changing the starting index of the scale, the microtones that are generated vary and their distribution changes, so it is possible to choose the distribution that is most interesting from a musical point of view.