Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

Liu, Hui; Jiang, Kun; Gamboa, Hugo; Xue, Tingting; Schultz, Tanja

doi:10.3390/app12168343

Open AccessArticle

Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

by

Hui Liu

^1,2,*

,

Kun Jiang

²,

Hugo Gamboa

³

,

Tingting Xue

¹

and

Tanja Schultz

¹

Cognitive Systems Lab, University of Bremen, Enrique-Schmidt-Str. 5, 28359 Bremen, Germany

²

China Quyi Artists Association, Beijing 100026, China

³

LIBPhys (Laboratory for Instrumentation, Biomedical Engineering and Radiation Physics), NOVA School of Science and Technology, Campus de Caparica, 2829-516 Caparica, Portugal

^*

Author to whom correspondence should be addressed.

Appl. Sci. 2022, 12(16), 8343; https://doi.org/10.3390/app12168343

Submission received: 28 July 2022 / Revised: 17 August 2022 / Accepted: 18 August 2022 / Published: 20 August 2022

(This article belongs to the Special Issue Advances in Music Informatics)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

As an essential subset of Chinese music, traditional Chinese folk songs frequently apply the anhemitonic pentatonic scale. In music education and demonstration, the Chinese anhemitonic pentatonic mode is usually introduced theoretically, supplemented by music appreciation, and a non-Chinese-speaking audience often lacks a perceptual understanding. We discovered that traditional Chinese anhemitonic pentatonic folk songs could be identified intuitively according to their distinctive bell-shaped pitch distribution in different types of pitch histograms, reflecting the Chinese characteristics of Zhongyong (the doctrine of the mean). Applying pitch distribution to the demonstration of the Chinese anhemitonic pentatonic folk songs, exemplified by a considerable number of instances, allows the audience to understand the culture behind the music from a new perspective by creating an auditory and visual association. We have also made preliminary attempts to feature and model the observations and implemented pilot classifiers to provide references for machine learning in music information retrieval (MIR). To the best of our knowledge, this article is the first MIR study to use various pitch histograms on traditional Chinese anhemitonic pentatonic folk songs, demonstrating that, based on cultural understanding, lightweight statistical approaches can progress cultural diversity in music education, computational musicology, and MIR.

Keywords:

pitch histogram; merged pitch histogram; pitch distribution; anhemitonic pentatonic; pentatonic; Chinese music; music information retrieval; music education; computational musicology; machine learning

1. Background

The traditional folk songs and music of various cultural groups worldwide are important vehicles for reflecting cultural characteristics. However, the rich diversity of the world’s music traditions has not been sufficiently appreciated in contemporary music information retrieval (MIR) studies. In contrast to the vast resources and literature that mainstream Western music culture possesses, the systematic presentation of traditional Chinese folk songs in international academic and educational circles is fragmented, lagging behind the overall development of the music informatics community, which has also created difficulties for the education in this area. We take a step forward in the computational musicology of the traditional Chinese anhemitonic pentatonic folk songs, intending to provide a reference for the cultural variety of the MIR study.

Most scholars use the general word “Chinese” to refer to the Han Chinese people, China’s largest cultural group. However, there is considerable linguistic, custom, and social diversity among the subgroups of the Han, primarily due to historical events, geographical conditions, and assimilation of various regional ethnicities and tribes [1]. Traditional Chinese folk songs, represented by the Han Chinese songs, were created by local people’s improvisation and passed on from one generation to the next orally [2], having a significant influence on the development of other forms of traditional music, including the traditional dance music, opera, instrumental music, and Quyi (story-telling and story-singing) art [3].

1.1. Movable Do Solfège and the Chinese Anhemitonic Pentatonic Scales/Modes

In order to make the traditional Chinese music notation correspond to the Western music notation, the numbered musical notation became the most common for transcribing Chinese folk music in modern times. The melody is sung with do, re, mi, fa, sol, la, and si (In some regions, such as North America, it is ti.). Traditional Chinese folk songs were usually recorded with the movable do solfège [4] or tonic sol-fa, invented in the nineteenth century by Sarah Ann Glover. Regardless of the piece’s key, each note’s solfège is the same as when the song was in C major/A minor. For a more intuitive example, the melody “G–A–B–C–D” in a G major song is sung alphabetically or as “sol–la–si/ti–do–re” in the fixed do solfège, but in the movable do solfège, it is sung as “do–re–mi–fa–sol” because that is how the phrase is transposed to C major.

As one of the central regions where the anhemitonic pentatonic scale is used, traditional Chinese music, especially Han Chinese folk songs, frequently uses do, re, mi, sol, and la, known in ancient Chinese as Gong, Shang, Jue, Zhi, and Yu. On the piano, they can be performed entirely in the black keys in F# major. Chinese cultural tradition has always considered pentatonic scales as modes, complete with a modal tonic and a classification of songs and instrumental pieces by way of the five types [5]. By altering which note is considered the tonic within the same collection of pitches (do, re, mi, sol, and la), one can rotate through all possible five modes of the anhemitonic pentatonic collection, the Gong/Shang/Jue/Zhi/Yu mode, which corresponds to the Raga Bhupali/Megh/Malkauns/Durga/Dhani Carnatic in the Indian pentatonic scale from the perspective of modal pitch orders. Modes in Chinese and Indian music theory, incorporating more than pitch orders, are also associated with certain daytimes, performance practices, emotional situations, and even weather conditions, which is beyond this work’s research.

Fa and si in the movable do solfège are occasionally used in traditional Chinese anhemitonic pentatonic folk songs, only as ornaments or transitions, called Qing Jue/Bian Jue and Bian Gong. Other semitones with # or ♭ in the movable do solfège are extremely rare in most traditional Chinese anhemitonic pentatonic folk songs, among which fa# and si♭ are called Bian Zhi and Run, respectively.

1.2. Pitch Histograms, Merged Pitch Histograms, and Their Anhemitonic Pentatonic Variants

Pitch distributions have been used in computational music study since the 1960s and into the 1980s in various fields such as mathematical music structure analysis and computer-assisted generative composition [6,7]. Subsequently, the concept of the pitch histogram, describing the pitch content of notes in music pieces, was systematically introduced in [8]. In [9], the authors summarize various variants of the pitch histogram and an extensive list of features extracted from them. MIR studies using pitch histograms emerge from time to time [10,11,12,13].

Each note has not only pitch information but also duration characteristics. Therefore, rhythmic value histograms [9], beat histograms [14,15,16,17,18], and (normalized) duration histograms [10] were created and play roles in MIR to describe the temporal information on a song’s melody.

Our study combines temporal information (pitch continuity) with pitch information. Therefore, in addition to applying the basic pitch histogram [8], we also utilized the recently proposed merged pitch histogram [19] to visualize songs’ main melody statistics. As a variant of the basic pitch histogram, the merged pitch histogram joins successive identical pitches to be counted only once, one of the alternatives regarding how to “count” pitches, which has been debated and often noted in some computational musicology works but was systematically summarized and named for the first time in [19]. For our research context, firstly, the merged pitch histogram downplays the continuous repetition of pitches due to splitting words in songs of polysyllabic languages, making them more applicable to be analyzed together with songs of monosyllabic languages (e.g., Mandarin). It is helpful as we will conduct in Section 4 a classification study of traditional Chinese anhemitonic pentatonic folk songs together with songs of other languages around the world. Secondly, some interesting phenomena in Chinese folk songs, such as the characteristic onomatopoeia, use successive identical pitches, for example, the “dong dong dong” simulating a rattle-drum. Merging the successive identical pitches improves the statistical pitch analysis, which will be explained in Section 2.2.

This study uses notation-based symbolic pitch histograms, and all histograms plotted are not normalized to facilitate the comparison between each pitch’s merged and original bin heights. The non-normalized visualization has no impact on the histograms’ shapes, the observation analysis, and the feature design in this study.

Because of the characteristics of Chinese anhemitonic pentatonic music introduced above, we will generate movable do solfège-based basic and merged pitch histograms by replacing the pitch letters (A–G) or frequencies in the traditional pitch histograms with the movable do solfège (do–si), as shown on the horizontal axis ticks in Figure 1. In addition, we will also apply the anhemitonic pentatonic pitch histograms that retain only the pitches fitting the pentatonic scale, which also has its merged version, the merged anhemitonic pentatonic pitch histogram (see Section 2.2).

1.3. Related Research on the Statistical Visualization of Western Folk Songs and Comparative Study

Prior literature displayed folk song melodies’ properties through statistical visualization, represented by [20], where, interestingly, the curves depicted by the authors bear similarities to our study, referred to as “melodic arch.” However, the two studies’ objects (Western versus Chinese folk songs) and mathematical models (aggregate phase contours versus pitch histograms) are entirely different.

In terms of promoting cultural diversity in computational musicology, ref. [21] compared the interval size and phrase position of German and Chinese folk songs, in which the author of [20] also participated. Still, our study investigates a different object (traditional Chinese anhemitonic pentatonic folk songs) through a different approach (pitch distribution).

1.4. Bell Shapes and the Degree of Bell Shape (DoBell)

In this work, a monotonic rise to a peak followed by a monotonic fall on a histogram (see the envelopes in Figure 1) is depicted by the term bell shape. We do not name such curves as “normal distributions” because of their asymmetry and non-smoothness. Being more appropriate than other names that simulate this shape visually, such as pyramid, triangle, and arch, “bell” has the symbolism of sound, suggesting musicological research.

Monotonic trends can be tested by applying the Mann–Kendall test [22,23,24], which is, however, unnecessarily complicated for this study. We propose a novel feature with a simple implementation, Degree of Bell Shape (

D o B e l l

), manifesting the degree that a histogram approximates a bell shape with a value between 0 and 1.

Let

P_{m a x}

denote the pitch of a song s’s highest bin in its histogram with zero bins removed,

b i n (p)

corresponds to each pitch p’s bin height, and

m i n_{pitch} (s)

and

m a x_{pitch} (s)

represent s’s lowest and highest pitches. We define p’s monotonic trend (MT) as

M T (p) = \{\begin{matrix} \{\begin{matrix} 1, if b i n (p) \leq b i n (p + 1) \\ 0, if b i n (p) > b i n (p + 1) \end{matrix} if p < P_{m a x} \\ \{\begin{matrix} 1, if b i n (p) \geq b i n (p + 1) \\ 0, if b i n (p) < b i n (p + 1) \end{matrix} if p \geq P_{m a x} \end{matrix}

(1)

with zero bins removed and the Degree of Bell Shape of song s as

D o B e l l (s) = \frac{\sum_{p in [m i n_{pitch} (s), m a x_{pitch} (s) - 1]} M T (p)}{m a x_{pitch} (s) - m i n_{pitch} (s)} .

(2)

The

D o B e l l

of a V-shaped distribution generates a value near 0.5, while 1 indicates a perfect bell shape, where adjacent equivalent values do not destroy the monotonicity. In this article, a histogram is announced conforming to the bell shape only in the perfect case that its DoBell is precisely equal to 1.

The pronunciation of the abbreviation

D o B e l l

is reminiscent of the bell-shaped pattern drawn on a doorbell, and the “Do” comes with a prompt for the movable do Solfège.

2. The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

Jasmine Flower (Mo Li Hua) should be recognized as the most representative and popular traditional Chinese folk song globally. Both versions of Jasmine Flower from the Jiangsu province are lyrical and gentle [2]. They have different melodies, lyrics, and characteristics but are both in the blues minor scale/Zhi mode (ending in sol). The version starting with the lyrics “Hao yi duo mei li de mo li hua” (what a beautiful jasmine flower) was adapted in Puccini ’s opera Turandot and thus became widely well known.

Figure 1 visualizes the basic and merged pitch histograms of Jasmine Flower ’s both versions, revealing their bell-shaped envelopes significantly (

D o B e l l = 1

). Inspired by these two histograms of Jasmine Flower, we analyzed more Chinese anhemitonic pentatonic folk songs.

2.1. Data Preparation

We emphasize that the songs of interest for this study need to meet the following prerequisites:

1.: (Truly) traditional. The traditional Chinese folk songs in this study are created by local people’s improvisation and handed down orally among the people, for which the original composers and lyricists are generally unidentifiable. Many folk songs and music composed in modern times, even if they conform to the pentatonic modes, apply more musicological techniques and compositional means during the creation, contrary to traditionality.
2.: Chinese anhemitonic pentatonic. It does not mean that non-anhemitonic-pentatonic notes cannot be used. Researchers with relevant knowledge can quickly distinguish aurally or humming-wise whether a piece is Chinese pentatonic, even if it uses the infrequent fa, si, and other semitones.

Obtaining more songs’ data is a challenge. As mentioned in Section 1, traditional Chinese music has not been extensively studied in music informatics, let alone a systematic compilation of traditional, anhemitonic pentatonic folk songs.

The Essen Folksong Collection [25], developed in the early 1980s for folk music research, contains 1161 songs in its “Han” category. However, a large number could not be used directly in this study. A brief look at the title and an explorative humming and listening experience reveals that the "Han" category in EFC contains:

Many folk songs composed in the modern era instead of the improvisation and oral transmission, contradicting the first prerequisite above. There are hundreds of songs with obvious time backgrounds and compositional intent, and they are generally regarded as “revolutionary songs" nowadays and labeled “new folk songs” in the EFC. For instance, over 50 songs with Hongjun, Gongnong, Balujun, Canjun, Jiefang, Suqu, Mao Zedong, or Zhouzongli in their titles, apparently formed in a specific temporal context, are unlikely to come from oral folkloric transmission from ancient times.
Many non-pentatonic songs reflecting the characteristics of other cultural groups or a cultural mixture, contradicting the second prerequisite above.
Some score errors, such as the 19th note of the first song in alphabetical order, Ai Erwa (an octave higher).

Those songs involved in the first two cases above can undoubtedly be widely used in general studies of Chinese music but are inapplicable to this study.

A typical example is Caoyuan Qingge (The Grassland Love Song) in EFC’s “Han” category. Besides this original title, the song is widely known nowadays as In That Place Wholly Faraway (Zai Na Yaoyuan De Difang). The song’s creation background is debated, but there is no doubt that it was composed/adapted by Wang, Luobin in 1939/1940 based mainly on Kazakh melodies. One prevailing theory is that it also draws on Tibetan and Uyghur melodies. In short, it should not be in the "Han" category, not to mention the traditional and the Chinese anhemitonic pentatonic. Besides, this song was created by the composer with extensive musical knowledge, again departing from the subject of our study.

In an initial confirmation of the first 26 songs/2.24% in EFC’s alphabetical “Han” category, we found that:

Three songs/11.54% contain obvious score errors: Ai Erwa (the 19th note being an octave higher), Baogumiao Xiangbadao (missing ♮s), and Bawang Haozi (wrongly using the Gong mode);
Two songs/7.69% belong to other or mixed cultures: Alima (Salar) and Suwugeng (Hua’er);
Eight songs/30.77% are not truly traditional (“revolutionary”), composed, adapted, or reworked;
Three songs/11.54% have linking errors.

We are working on a comprehensive errata and information replenishment to the EFC for the benefit of other users.

In short, even if the EFC is filtered for the two prerequisites of the songs we studied (truly traditional and Chinese anhemitonic pentatonic), only a few dozen songs are expected to be sifted out. In addition, surprisingly, many traditional Chinese folk songs familiar to Chinese people, such as those included in [26], do not appear in the EFC. The EFC’s collection, which Western scholars dominated, was probably more about the fieldwork of digging. Therefore, it is not convincing to use the songs in the EFC as representatives of most favorite traditional Chinese folk songs for general music education and introduction.

Moving from the assumption of the “decremental” approach to the “incremental” collection, we built up a song collection straightforwardly and reliably. We referred to a collection of 100 most well-known classic Chinese folk song scores [26] and digitally documented the scores of all 42 songs from it which meet the prerequisites 1 and 2, in a movable do solfège-based

(p i t c h, d u r a t i o n)

sequence. The other 58 songs not included in our collection are:

Originated from other cultural groups that do not use anhemitonic pentatonic scales, such as the Uyghurs; or
Revolutionary songs and new folk songs with obvious compositional backgrounds; or
From Northern Shaanxi, where the songs, usually also labeled as Chinese/Han Chinese, are subject to mixed culture and often carry the impact of a revolutionary adaptation [27].

In addition, we have added 11 songs based on senior folk musicians’ input, including second, widely sung versions of the same titles for 4 of the 42 songs above mentioned and 7 folk songs often performed in concerts, audio/video products, and on folk stages. We plan to open our collection of the 53 songs’ movable do solfège-based

(p i t c h, d u r a t i o n)

sequences and metadata as a public library after refinement since, to our best knowledge, it should be the only comprehensive digital symbolic collection to date containing the most representative traditional Chinese anhemitonic pentatonic folk songs that Chinese people from all regions well recognize.

2.2. Illustrative and Quantitative Analysis

Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6 demonstrate different types of pitch histograms of all 53 songs from our collection of most representative Chinese anhemitonic pentatonic folk songs. Being from 23 provinces of China (Anhui, Fujian, Gansu, Guangdong, Guangxi, Guizhou, Hebei, Heilongjiang, Henan, Hubei, Hunan, Inner Mongolia, Jiangsu, Jiangxi, Jilin, Liaoning, Qinghai, Shandong, Shanghai, Shanxi, Sichuan, Yunnan, and Zhejiang), the 53 pieces in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6 cover all five Chinese pentatonic modes:

The Gong mode/major pentatonic scale: Figure 2(1–5), Figure 3(1), Figure 4(1,2), and Figure 5(1).
The Shang mode/suspended pentatonic scale: Figure 2(6–8), Figure 3(2), and Figure 6(1).
The Jue mode/blues minor scale: Figure 2(9) and Figure 6(2).
The Zhi mode/blues major scale: Figure 1(1,2), Figure 2(10–19), Figure 4(3,4), Figure 3(3–9), and Figure 6(3).
The Yu mode/minor pentatonic scale: Figure 2(20–23), Figure 3(10,11), Figure 4(5–11), Figure 5(2), and Figure 6(4).

In Figure 1 and Figure 2, all 25 songs’ both basic and merged pitch histograms originally fit the bell shape perfectly (

D o B e l l = 1

).

In Figure 3, the histograms of the 11 songs are attributed to the bell (

D o B e l l = 1

) shape after the bins of the pitches si

_{3}

, fa

_{4}

, and si

_{4}

, which infrequently occur in the Chinese pentatonic songs, are removed. We call histograms that retain only anhemitonic pentatonic pitches anhemitonic pentatonic pitch histograms, which also have their merged versions, merged anhemitonic pentatonic pitch histograms.

In Figure 4, each of the 11 songs’ basic pitch histograms contains individual bins that violate the bell shape. Therefore, their

D o B e l l

values are less than 1. Applying merged pitch histograms to them will yield perfect bell curves (

D o B e l l = 1

), apparently. For example, the lowest pitch of Figure 4(2) is a significant exception, as it is interesting to note that the “do do do” is frequently repeated at phrase ends of this song to express an upbeat rhythm, causing this pitch to have a higher count number. Employing the merged pitch histogram, Figure 4(2)’s pitch distribution corresponds well to the bell shape.

In Figure 5, evidently, the two songs’ perfect bell shape (

D o B e l l = 1

) appears on their merged anhemitonic pentatonic pitch histograms.

The pitch histograms of four songs in the 53-song collection deviate from the bell shape (

D o B e l l < 1

), and applying various histogram variants to them does not improve or reduce the bell-shapedness (see Figure 6). Notably, they are all from Shanxi or Sichuan.

The basic pitch histogram of 25/53 (47.17%) songs innately fits the bell shape (

D o B e l l = 1

). The ideal bell-shapedness is reflected in 49/53 (92.45%) songs after retaining only anhemitonic pentatonic pitches or using the merged pitch histogram to circumvent some high counts due to the unique characteristics of the songs. Only 4/53 (7.55%) of the songs have a pitch distribution differing from the perfect bell shape (

D o B e l l < 1

).

2.3. Pearson’s $χ^{2}$ Test

Another reference corpus of 53 songs that are not traditional Chinese anhemitonic pentatonic (see Section 4.2 and Table A2) is added to the 53 songs presented in Section 2.2 for a Pearson

χ^{2}

test.

We hypothesized whether a song is a traditional Chinese anhemitonic pentatonic folk song or not is independent of whether its pitch histogram is in a perfect bell shape (

D o B e l l = 1

) and set a significance level of 5% for the Pearson’s

χ^{2}

test [28]. Using the statistics in Table A1 and Table A2, we calculated a

χ^{2}

value of 24.41 and a p-value of 0.0000037. Therefore, it is reasonable to reject the original hypothesis, clarifying whether a song is a traditional Chinese anhemitonic pentatonic folk song is related to whether it has a perfect bell-shaped pitch histogram (

D o B e l l = 1

). They are not independent of each other.

The test applied to the merged pitch histogram results in a

χ^{2}

value of 26.10 and a p-value of 0.00000032, inferring whether a song is a traditional Chinese anhemitonic pentatonic folk song is related to whether it has a perfect bell-shaped merged pitch histogram (

D o B e l l = 1

).

3. Explanation, Comparison, Discussion, and Extension

Pitch histograms allow a visually resonant introduction and demonstration of traditional Chinese anhemitonic pentatonic folk songs. Although this study is grounded in a music informatics perspective, we attempt to provide some brief cultural clues and references that may be relevant to the observations.

From an cultural perspective, the bell shape described in Section 2 reflects the Chinese cultural way of Zhongyong [29] (the doctrine of the mean). In [30], the author described Zhongyong as a value of “mean is beautiful” for the purpose of harmony. The basic meaning and spirit of Zhongyong thinking is mastering the extremes but deploying the mean [31]. The bell pattern of “the closer to the middle the pitch, the more it is sung” is an excellent expression of the cultural background of Zhongyong in folk songs. This phenomenon is also in line with the fact that traditional Chinese folk songs’ folkloric transmission from one generation to the next makes each song more and more “moderate” (Zhongyong), and in turn, the moderate melodies promote the broader dissemination of the songs because of the enhanced ease of singability, a two-way process of mutual improvement. The movable do Solfège for oral or simple written recording also facilitates such a transmission process.

Naturally, a song’s relatively low and high pitches should infrequently appear during the whole song, and the middle part pitches are the vast majority. However, folk songs of cultural groups other than Chinese rarely display perfect bell shapes (

D o B e l l = 1

), concluded by the visualization of dozens of well-known Gregorian, chromatic, or pentatonic traditional folk songs from more than 30 different countries, cultural groups, and languages, some examples of which are conveyed in Figure 7. Only three traditional Gregorian songs among the research materials we grasped inherently have a histogram of bell-shaped patterns, i.e.,

D o B e l l = 1

(see Figure 8). It is particularly worth pointing out that the pitch histogram of Figure 7(1), a famous Scottish song in the anhemitonic pentatonic scale, appears not to be bell-shaped (

D o B e l l = 0.86

), and becomes even less bell-shaped (

D o B e l l = 0.71

) after applying the merged pitch histogram, quite different from the Chinese anhemitonic pentatonic folk songs, where the merged pitch histogram never worsens the bell-shapedness.

It is worth mentioning that the three folk songs from Asia, Figure 7(4–6), show a bell shape (

D o B e l l = 1

) in their merged pitch distributions after the fa and si bins are removed, which is merely coincidental. Considering that these songs are not anhemitonic pentatonic songs per se, there is no reason to remove the fa and si bins for analysis. For example, Figure 7(6) is one of the most representative Japanese hemitonic in scale (Sakura pentatonic scale) songs, where fa and si are two of the essential pitches. Such observation reminds us that it may exist misjudgment to predict whether a song is a traditional Chinese anhemitonic pentatonic folk song simply by removing fa, si, and other semitones to construct an anhemitonic pentatonic histogram. In other song types, frequently-used pitches may be removed during creating the anhemitonic pentatonic histogram, which makes the pitch distribution more bell-like. Therefore, “a traditional Chinese anhemitonic pentatonic folk song” is an approximate sufficient rather than a necessary condition for “a bell-shaped anhemitonic pentatonic pitch histogram”. In short, in the context of this study, we do not conclude that “traditional folk songs worldwide that do not use the Chinese anhemitonic pentatonic scale are improbable to have bell-shaped pitch histograms”. The unique characteristics of each country’s folk songs need to be explored by researchers who grasp the cultures and materials involved.

Another interesting observation worth discussing is that anhemitonic pentatonic folk songs from cultural groups other than Chinese (more precisely, Han Chinese) may also have a perfect or approximate bell-shaped pitch histogram. For example, some cultural groups in East Asia, such as Korean and Japanese, have their own unique folk song systems [32,33], like the Japanese hemitonic in scale. There are also Korean and Japanese folk songs using the anhemitonic pentatonic scale (called yo scale in Japan), some of which fit the bell shape ideally or to a great extent. Historical interactions between the neighboring countries and peoples might be a reason; however, it is highly controversial whether the pentatonic scales present in different cultures originated independently or through cultural exchange, which is not the topic of this work. Several examples are provided in Figure 9, including Arirang, Doraji Taryeong, and Soran Bushi, the representative pieces of traditional Korean and Japanese folk songs. The repetition of the pitch la

_{4}

in Figure 9(3), exhibiting the rhythmic sense of the fishermen’s net-pulling work songs (“Dokkoisho”), is eliminated in the merged pitch histogram to produce a bell shape, similar to Figure 4(2). Some folk songs of not-Han cultural groups within China are also anhemitonic pentatonic. For instance, the basic pitch histogram of the Oroqen folk song in Figure 9(4) conforms to the bell shape, i.e.,

D o B e l l = 1

. Some traditional Japanese and Korean anhemitonic pentatonic songs do not perfectly fit the bell-shaped pattern, as Figure 10 exemplifies. However, as stated at the end of the previous paragraph, a comprehensive analysis of Japanese and Korean pentatonic songs is not the subject of this paper.

4. Initial Attempts at Lightweight Machine Learning

We discovered that many traditional Chinese anhemitonic pentatonic songs widely recognized by Chinese people in all regions today have a bell-shaped pitch distribution in different types of pitch histograms, which is the major contribution of this study and can provide material for the visualization of music education and demonstration. Taking it a step forward, we try to feature and model the findings preliminary to provide references and promote cultural diversity for the MIR study.

4.1. Features

The bell shape is close to the normal distribution in terms of appearance. Unfortunately, the features used to portray the state of a normal distribution, such as skewness and kurtosis, are not applicable here since they are not used to determine whether a distribution is normal-like/bell-shaped in appearance or not.

Besides our newly proposed feature for inscribing bell shape, Degree of Bell Shape (DoBell) (see Section 1.4), another appropriate classic feature we found and validated is negative turning (

N T

), which we utilized the Time Series Feature Extraction Library (TSFEL) [34] to compute. Treating the pitch histogram as a time series,

N T

counts its number of violations during monotonic rise and fall, where adjacent equivalent values do not break the monotonicity. We found that

N T

is ideally equal to the occurrence number of the local peaks throughout the histogram besides the highest peak. For example, in Figure 7(8),

N T_{pitch histogram}

equals 2, while

N T_{merged pitch histogram}

equals 1. A perfect bell shape’s

N T

equals 0. The

N T

s for both basic and merged pitch histograms are usually small integers, most possibly between 0 and 3. Some songs with wide pitch ranges have relatively large

N T

s, such as You Raise Me Up (both

N T

s are 7).

It is important to note that before calculating the

N T

and the

D o B e l l

features, the bins with zero height in all pitch histograms are removed.

Applying the

N T

and the

D o B e l l

features to the pitch histogram and the merged pitch histogram produces four different features. It can be learned from Section 2.2 that for a Chinese anhemitonic pentatonic folk song where non-anhemitonic-pentatonic pitches occur, the anhemitonic pentatonic pitch histogram may allow better generation of the bell shape. Therefore, we also apply the

N T

and the

D o B e l l

to the anhemitonic pentatonic pitch histogram and the merged anhemitonic pentatonic pitch histogram to generate another four features. For instance, in Figure 3(1), after removing the bins of si

_{3}

and si

_{4}

, the

N T

s of the basic/merged anhemitonic pentatonic pitch histograms drop from 2 to 0, and the

D o B e l l

s rise from 0.78 to 1.

In the subsequent classification experiments, we generate eight types of features by taking various combinations of one from each of the three groups (

N T

,

D o B e l l

), (all pitches, anhemitonic pentatonic pitches), and (pitch histogram, merged pitch histogram):

$N T_{pitch histogram}$ ;
$N T_{merged pitch histogram}$ ;
$N T_{anhemitonic pentatonic pitch histogram}$ ;
$N T_{merged anhemitonic pentatonic pitch histogram}$ ;
$D o B e l l_{pitch histogram}$ ;
$D o B e l l_{merged pitch histogram}$ ;
$D o B e l l_{anhemitonic pentatonic pitch histogram}$ ;
$D o B e l l_{merged anhemitonic pentatonic pitch histogram}$ .

4.2. Dataset for Pilot Classification Experiments

We used our 53-song collection mentioned in Section 2.1, which includes almost all the most representative traditional Chinese anhemitonic pentatonic folk songs well recognized by Chinese People from all regions nowadays.

Considering the data balance of binary classification, we applied a whole set of 106 songs in total. In addition to the 53 songs mentioned above, the other half consists of songs from the following types:

Representative traditional fork songs from various cultures of 30 countries/languages.
Folk songs adapted from major works, like symphonies, operas, musicals, and films.
Modern, popular songs in Mandarin.
Chinese new folk songs composed by musicians.

Table A1 and Table A2 in Appendix A exhibit all 106 songs’ information. We believe that the amount of data, together with cross-validation, is sufficient for preliminary experimental validation to confirm whether the

N T

and the

D o B e l l

features are practical and whether the non-deep machine learning using only succinct pitch statistics performs well in identifying traditional Chinese anhemitonic pentatonic folk songs.

To generate correct basic/merged anhemitonic pentatonic pitch histograms, the songs in the “not traditional Chinese anhemitonic pentatonic folk songs” set were also histogrammed using movable do Solfège, sixteen of which are illustrated in Figure 7 and Figure 8. Many of them do not have an original fixed key per se.

4.3. Classifiers

In this work, we explored machine learning preliminarily on the basis of the facts discovered, and more applications to MIR will be future work. Given this, we trained three classifiers, the Gaussian Naive Bayes (GaussianNB), the k-Nearest Neighbors (k-NN), and the Support Vector Machines (SVM), with various combinations of the eight features introduced in Section 4.1, to binarily classify whether the input is a traditional Chinese anhemitonic pentatonic folk song. The validity of these classifiers was confirmed in the previous work orienting European folk songs [35].

We applied the scikit-learn packages [36] of the classifiers and all default settings:

k-NN: the standard Euclidean distance metric.
SVM: the gamma of $1 / (σ^{2} (samples) * # features$ ) and the regularization of 1.

D o B e l l

-values are naturally in the

[0, 1]

interval due to the denominator in Equation (2), while

N T

-values were normalized beforehand to be applicable in conjunction with

D o B e l l

, which is particularly important for the k-NN classifier.

4.4. Experimental Results

For each classifier, we performed ten-fold cross-validation with pseudo-random, where each fold was trained using 90% of the data and tested with the rest 10%. Table 1 lists the average cross-validation accuracy of the binary classification for whether a song is traditional Chinese anhemitonic pentatonic folk, obtained by applying different combinations of features to the three classifiers, where the last two rows give the best leave-one-out cross-validation results with their corresponding feature combinations as references.

Overall, all the classifiers show their ability to discriminate the traditional Chinese anhemitonic pentatonic folk songs. Note that the k-NN results are significantly affected by k, and the value of 49 shows superiority in the ten-fold cross-validation, while the value of 3 leads the results in the leave-one-out cross-validation.

From the feature combination perspective, applying the complete feature set produced satisfactory results but did not reach the optimal performance on the k-NN classifier. The

N T

features work viably on their own, even the top on the SVM classifier. The

D o B e l l

feature set outperforms other combinations on the k-NN classifier.

Features extracted for both types of merged pitch histograms generally perform better than those of the not-merged histograms. However, the way merged pitch histograms help Chinese pentatonic folk songs enclose better bell-shaped envelopes incidentally benefits other songs’ shapes, as Table A2 implies. For example, the merged pitch histogram in Figure 7(12) promotes the distribution to a bell shape.

Anhemitonic pentatonic pitch histograms are not necessarily better than the original histograms and even work worse in the merged case. We witnessed that the anhemitonic pentatonic pitch histogram, like the merged histogram, may unexpectedly improve other songs’ bell-shapedness while advantaging the bell shape of the Chinese pentatonic folk songs, as exemplified in Figure 7(4–6) and explained in Section 3.

The fact that the merged operation and the anhemitonic-pentatonic operation make some non-anhemitonic-pentatonic songs challenging to distinguish is also reflected by the confusion matrices in Figure 11, where true positives outperform true negatives overall. The uncomplicated classifiers trained with our designed features perform well in correctly labeling the traditional Chinese anhemitonic pentatonic songs but are likely to misidentify some other types of songs.

5. Conclusions

We discovered that the pitch histogram of many representative traditional Chinese anhemitonic pentatonic folk songs known well by Chinese people from all regions nowadays naturally obeys a bell-like shape, which well reflects the Chinese characteristics of Zhongyong (the doctrine of the mean) in oral transmission.

Subsequently, we suggested that the recently proposed merged pitch histogram and the anhemitonic pentatonic pitch histograms put forward in this paper can reasonably eliminate the exceptions in the pitch histograms to endow many traditional Chinese anhemitonic pentatonic folk songs with perfect bell shapes, confirmed by the statistical visualization of numerous examples. Pearson’s

χ^{2}

test validated whether a song is a traditional Chinese anhemitonic pentatonic folk song is related to whether it has a perfect bell-shaped basic/merged pitch histogram.

As the main contribution of this article, our findings about the bell-shaped pitch distribution of traditional Chinese anhemitonic pentatonic folk songs can provide visual material for music education and demonstration. Since we did not extend to more extensive, less widely-circulated folk songs, we do not make a broad inference on entire traditional Chinese folk songs. Furthermore, in order to provide a reference for machine learning studies in the field of MIR, we also made preliminary attempts to feature and model the observation. We apply the classic negative turning feature and devise a new feature, Degree of Bell Shape (

D o B e l l

), to measure whether the histogram approximates bell-shaped.

Finally, we implemented three classifiers based on the different histograms and features. Preliminary experiments revealed that the classifiers have a discriminative capability on traditional Chinese anhemitonic pentatonic folk songs, which states that lightweight machine learning applying only pitch statistics can progress cultural diversity in MIR.

This study of the pentatonic scale and the traditional songs can evoke more research on the folk songs of other cultures and the unique scales they use. The different types of histograms and features we proposed or applied can serve as a reference for MIR studies related to cultural variety and ferment the design of more variants. More features that can be applied to pitch histograms, such as those summarized in [9], should be examined to find whether they are applicable to judging bell-shapedness or directly identifying traditional Chinese anhemitonic pentatonic folk songs. Moving from overall pitch statistics to segment-based features, many topics, such as subsequence search [37], deserve investigation.

It would also be promising to examine the peculiarities of other cultures’ anhemitonic pentatonic folk songs to distinguish them from Chinese. For example, traditional Korean folk songs often use the triple meter, which is rare in traditional Chinese folk songs. Moreover, in other regions that systematically use the anhemitonic pentatonic scale, such as India and Scotland, whether their folk songs have similar or unique pitch characteristics needs to be studied by researchers who grasp the cultures and materials (e.g., [38]) involved.

Author Contributions

Conceptualization, H.L.; methodology, H.L.; software, H.L. and T.X.; validation, H.L., K.J. and H.G.; data curation, H.L.; writing—original draft preparation, H.L. and T.X.; writing—review and editing, K.J. and H.G.; visualization, H.L. and T.X.; supervision, T.S.; funding acquisition, H.L. All authors have read and agreed to the published version of the manuscript.

Funding

The APC was funded by Open Access Initiative of the University of Bremen and the DFG via SuUB Bremen.

Conflicts of Interest

The authors declare no conflict of interest.

Abbreviations

The following abbreviations are used in this manuscript:

CAPS	Chinese Anhemitonic Pentatonic Scale
DoBell	Degree of Bell Shape
EFC	Essen Folksong Collection
GaussianNB	Gaussian Naive Bayes
k-NN	k-Nearest Neighbors
MIR	Music Information Retrieval
MT	Monotonic Trend
NT	Negative Turning
SVM	Support Vector Machines
TSFEL	Time Series Feature Extraction Library
TSSEARCH	Time Series Subsequence Search Library

Appendix A. The List of Songs Involved in This Article

Table A1 summarizes our 53-Song collection of the most representative traditional Chinese anhemitonic pentatonic folk songs well known by Chinese People from all regions, while Table A2 lists other 53 songs applied in the classification experiments.

Table A1. Information (titles, pentatonic modes, and provinces of origin) on all the 53 traditional Chinese anhemitonic pentatonic songs we collected and whether their pitch distribution show a bell shape in various histograms. Hanyu Pinyin is used for the titles to make it easier for readers with Mandarin knowledge to identify the songs. The English title corresponding to each song’s Hanyu Pinyin can be found in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6. PH: pitch histogram; MP: merged pitch histogram; AP: anhemitonic pentatonic pitch histogram; MAP: merged anhemitonic pentatonic pitch histogram; ✓: comforming to a perfect bell shape (

D o B e l l = 1

).

Table A1. Information (titles, pentatonic modes, and provinces of origin) on all the 53 traditional Chinese anhemitonic pentatonic songs we collected and whether their pitch distribution show a bell shape in various histograms. Hanyu Pinyin is used for the titles to make it easier for readers with Mandarin knowledge to identify the songs. The English title corresponding to each song’s Hanyu Pinyin can be found in Figure 1, Figure 2, Figure 3, Figure 4, Figure 5 and Figure 6. PH: pitch histogram; MP: merged pitch histogram; AP: anhemitonic pentatonic pitch histogram; MAP: merged anhemitonic pentatonic pitch histogram; ✓: comforming to a perfect bell shape (

D o B e l l = 1

).

No.	Song Title	Mode	Province	PH	MP	AP	MAP
1	Mo Li Hua (Version 1)	Zhi	Jiangsu	✓	✓	✓	✓
2	Mo Li Hua (Version 1)	Zhi	Jiangsu	✓	✓	✓	✓
3	Ba Gen Lu Chai Hua	Gong	Jiangsu	✓	✓	✓	✓
4	Ba Duan Jin	Gong	Henan	✓	✓	✓	✓
5	Wu Xi Jing	Gong	Jiangsu	✓	✓	✓	✓
6	Bian Hua Lan	Gong	Henan	✓	✓	✓	✓
7	Feng Yang Hua Gu	Gong	Anhui	✓	✓	✓	✓
8	Feng Yang Hua Gu	Shang	Anhui	✓	✓	✓	✓
9	Tai Yang Chu Lai Xi Yang Yang	Shang	Sichuan	✓	✓	✓	✓
10	Ban Jin You Mian Tui Wo Wo	Shang	Hebei	✓	✓	✓	✓
11	Jie Zai Tian Li Hao Dou Ke	Jue	Anhui	✓	✓	✓	✓
12	Ku Qi Qi/Si Ji Ge	Zhi	Jiangsu	✓	✓	✓	✓
13	Ban Jiu Diao	Zhi	Jiangxi	✓	✓	✓	✓
14	Long Chuan Diao	Zhi	Hubei	✓	✓	✓	✓
15	Xiao Bai Nian	Zhi	Hunan	✓	✓	✓	✓
16	Xiao Bai Cai	Zhi	Hebei	✓	✓	✓	✓
17	Xiao Fang Niu	Zhi	Hebei	✓	✓	✓	✓
18	Gua Di Feng	Zhi	Gansu	✓	✓	✓	✓
19	Gui Zhou Shan Ge	Zhi	Guizhou	✓	✓	✓	✓
20	Zhi Shou Jin	Zhi	Gansu	✓	✓	✓	✓
21	Zhi Shou Jin	Zhi	Shanxi	✓	✓	✓	✓
22	Yi Gen Bian Dan Liang Ge Luo	Yu	Hunan	✓	✓	✓	✓
23	Jiu Lian Huan	Yu	Jiangsu	✓	✓	✓	✓
24	Fang Ma Shan Ge	Yu	Yunnan	✓	✓	✓	✓
25	Cai Cha Deng	Yu	Fujian	✓	✓	✓	✓
26	Xiao Bai Nian	Gong	Liaoning		✓		✓
27	Yang Liu Qing	Gong	Jiangsu		✓		✓
28	Wang Da Niang Ding Gang	Zhi	Henan		✓		✓
29	Dong Bei Feng Diao	Zhi	Heilongjiang		✓		✓
30	Xiao He Tang Shui	Yu	Yunnan		✓		✓
31	Kang Ding Qing Ge	Yu	Sichuan		✓		✓
32	Huang Yang Bian Dan	Yu	Sichuan		✓		✓
33	Da Zhi Shan Ge Guo Heng Pai	Yu	Jiangxi		✓		✓
34	Cai Hua	Yu	Sichuan		✓		✓
35	Xiu He Bao	Yu	Yunnan		✓		✓
36	Luo Shui Tian	Yu	Guangdong		✓		✓
37	Cai Yun Zhui Yue	Gong	Guangdong			✓	✓
38	Cheng Qiang Shang Pao Ma	Shang	Inner Mongolia			✓	✓
39	Yao Lan Qu	Zhi	Liaoning			✓	✓
40	Yi Meng Shan Xiao Diao	Zhi	Shangdong			✓	✓
41	Meng Jiang Nü	Zhi	Jiangsu			✓	✓
42	Yue Ya Wu Geng	Zhi	Jilin			✓	✓
43	Jiao Cheng Shan	Zhi	Shanxi			✓	✓
44	Liu Zhou You Ge Li Yu Yan	Zhi	Guangxi			✓	✓
45	Cai Diao	Zhi	Yunnan			✓	✓
46	Zi Zhu Diao	Yu	Shanghai			✓	✓
47	Zuo Lou Ge	Yu	Zhejiang			✓	✓
48	Xiao Kan Xi	Gong	Jilin				✓
49	Si Ji Ge	Yu	Qinghai				✓
50	Xiu He Bao	Shang	Shanxi
51	Yi Gen Zhu Gan Rong Yi Wan	Jue	Sichuan
52	Kai Hua	Zhi	Shanxi
53	Huai Hua Ji Shi Kai	Yu	Sichuan

Table A2. Information (titles and languages) on the 53 songs which are not traditional Chinese anhemitonic pentatonic folk songs and whether their pitch distribution show a bell shape in various histograms. PH: pitch histogram; MP: merged pitch histogram; AP: anhemitonic pentatonic pitch histogram; MAP: merged anhemitonic pentatonic pitch histogram; ✓: conforming to a perfect bell shape (

D o B e l l = 1

). Note that the patterns of AP and MAP are judged only when the song is anhemitonic pentatonic.

Table A2. Information (titles and languages) on the 53 songs which are not traditional Chinese anhemitonic pentatonic folk songs and whether their pitch distribution show a bell shape in various histograms. PH: pitch histogram; MP: merged pitch histogram; AP: anhemitonic pentatonic pitch histogram; MAP: merged anhemitonic pentatonic pitch histogram; ✓: conforming to a perfect bell shape (

D o B e l l = 1

). Note that the patterns of AP and MAP are judged only when the song is anhemitonic pentatonic.

No.	Song Title	Language	PH	MP
1	Anile Anile Vaa Vaa Vaa	Tamil (India)
2	Auld Lang Syne	Scottish (United Kingdom)
3	Ayo Mama	Indonesian
4	Bella Ciao	Italian
5	Bengawan Solo	Indonesian
6	Bésame Mucho	Spanish (Mexico)
7	Click Go the Shears	English (Australia)
8	Dance of the Yao People	Mandarin
9	Du Jiao Xi	Mandarin
10	Edelweiss	English
11	El Cóndor Pasa	Quechuan (Peru)
12	Furusato	Japanese	✓	✓
13	Gada Meiren	Mongolian
14	Going Home	English (Czechia)
15	Greensleeves	English		✓
16	Home on the Range	English		✓
17	Home Sweet Home	English	✓
18	It’s a Small World	English
19	In That Place Wholly Faraway	Mandarin
20	Jigder Nana	Mongolian		✓
21	Korobeiniki	Russian		✓
22	La Donna È Mobile	Italian		✓
23	La Paloma	Spanish (Cuba)
24	La Vie En Rose	French
25	Marken Er Mejet	Danish
26	Moonlight on the Colorado	English
27	Natsu No Owari	Japanese
28	Nesta Rua	Portuguese (Brazil)
29	Oh, Susanna	English
30	Old Folks at Home	English
31	Olha a laranja	Portuguese	✓	✓
32	O More Duboko	Dalmatian (Croatia)
33	Orchid Grass (Lan Hua Cao)	Mandarin
34	O Sole Mio	Napolian (Italy)
35	Polish Waltz	Polish
36	Red Valley	English (Canada)
37	Reflection	English
38	Sakura Sakura	Japanese
39	Santa Lucia	Napolitan (Italy)
40	Shiki No Uta	Japanese		✓
41	Tancuj, Tancuj, Vykrúcaj	Slovak		✓
42	Teleat Ya Mahla Nourha	Arabic (Egypt)
43	That’s Why (You Go Away)	English (Denmark)
44	Think of Me	English
45	Troika Pochtovaya	Russian
46	Valurile Dunării	Romanian
47	Vo Lozärn uf Wäggis zue	German (Switzerland)
48	Wiegenlied (Brahms)	German
49	Wiegenlied (Schubert)	German
50	Without You	English
51	Yankee Doodle	English
52	Yi Qian Nian Yi Hou	Mandarin
53	You Raise Me Up	English (Ireland)

References

Khoo, S.S. The Single Hidden Layer Neural Network Based Classifiers for Han Chinese Folk Songs. Ph.D. Thesis, Swinburne University of Technology, Melbourne, Australia, 2013. [Google Scholar]
Li, J.; Luo, J.; Ding, J.; Zhao, X.; Yang, X. Regional classification of Chinese folk songs based on CRF model. Multimed. Tools Appl. 2019, 78, 11563–11584. [Google Scholar] [CrossRef]
Jin, J. Chinese Music; Cambridge University Press: Cambridge, UK, 2011. [Google Scholar]
Bentley, A. Fixed or Movable Do? J. Res. Music. Educ. 1959, 7, 163–168. [Google Scholar] [CrossRef]
Caporaletti, V. Concerning a Scottish Melody. The Issue of Tonal Centres in Anhemitonic Pentatonic Scales. J. Jazz Audiotactile Music. Stud. 2020. Available online: https://api.nakala.fr/data/10.34847/nkl.88achsk1/7c40960a11401619aebc9b50fd49f95d07d989b3 (accessed on 27 July 2022).
Fucks, W. Mathematical analysis of formal structure of music. IRE Trans. Inf. Theory 1962, 8, 225–228. [Google Scholar] [CrossRef]
Patrick, P.H.; Strickler, K. A computer-assisted study of dissonance in the masses of Josquin Desprez. Comput. Humanit. 1978, 12, 341–364. [Google Scholar] [CrossRef]
Tzanetakis, G.; Ermolinskyi, A.; Cook, P. Pitch histograms in audio and symbolic music information retrieval. J. New Music. Res. 2003, 32, 143–152. [Google Scholar] [CrossRef] [Green Version]
McKay, C. Automatic Music Classification with jMIR. Ph.D. Thesis, McGill University, Montreal, QC, Canada, 2010. [Google Scholar]
Karydis, I. Symbolic music genre classification based on note pitch and duration. In Proceedings of the East European Conference on Advances in Databases and Information Systems, Thessaloniki, Greece, 3–7 September 2006; pp. 329–338. [Google Scholar]
Gedik, A.C.; Bozkurt, B. Pitch-frequency histogram-based music information retrieval for Turkish music. Signal Process. 2010, 90, 1049–1063. [Google Scholar] [CrossRef] [Green Version]
Bozkurt, B. Pitch Histogram Based Analysis of Makam Music in Turkey; Les Corpus de l’oralité: Strasbourg, France, 2011. [Google Scholar]
Koduri, G.K.; Serrà Julià, J.; Serra, X. Characterization of intonation in carnatic music by parametrizing pitch histograms. In Proceedings of the 13th International Society for Music Information Retrieval Conference, Porto, Portugal, 8–12 October 2012; Gouyon, F., Herrera, P., Martins, L.G., Müller, M., Eds.; FEUP Ediçoes: Porto, Portugal, 2012. [Google Scholar]
Brown, J.C. Determination of the meter of musical scores by autocorrelation. J. Acoust. Soc. Am. 1993, 94, 1953–1957. [Google Scholar] [CrossRef]
Tzanetakis, G.; Essl, G.; Cook, P. Automatic musical genre classification of audio signals. In Proceedings of the 2nd International Symposium on Music Information Retrieval, Bloomington, IN, USA, 15–17 October 2001; Volume 144. [Google Scholar]
Tzanetakis, G.; Cook, P. Musical genre classification of audio signals. IEEE Trans. Speech Audio Process. 2002, 10, 293–302. [Google Scholar] [CrossRef]
Tzanetakis, G. Manipulation, Analysis and Retrieval Systems for Audio Signals; Princeton University: Princeton, NJ, USA, 2002. [Google Scholar]
Lykartsis, A.; Lerch, A. Beat histogram features for rhythm-based musical genre classification using multiple novelty functions. In Proceedings of the DAFx15—18th International Conference on Digital Audio Effects, Trondheim, Norway, 30 November–3 December 2015. [Google Scholar]
Liu, H.; Xue, T.; Schultz, T. Merged Pitch Histograms and Pitch-Duration Histograms. In Proceedings of the 19th International Conference on Signal Processing and Multimedia Applications, Copenhagen, Denmark, 25–26 June 2022; pp. 32–39. [Google Scholar]
Huron, D. The melodic arch in Western folksongs. Comput. Musicol. 1996, 10, 3–23. [Google Scholar]
Shanahan, D.; Huron, D. Interval Size and Phrase Position: A Comparison between German and Chinese Folksongs. Empir. Musicol. Rev. 2011, 6, 187–197. [Google Scholar] [CrossRef] [Green Version]
Mann, H.B. Nonparametric Tests Against Trend. Econometrica 1945, 13, 245–259. [Google Scholar] [CrossRef]
Kendall, M.; Gibbons, J.D. Rank Correlation Methods, 5th ed.; Charles Griffin & Co., Limited: London, UK, 1990. [Google Scholar]
Gilbert, R.O. Statistical Methods for Environmental Pollution Monitoring; John Wiley & Sons: Hoboken, NJ, USA, 1987. [Google Scholar]
Schaffrath, H. The Essen folksong collection. Database Contain. 1995, 6. Available online: http://essen.themefinder.org/ (accessed on 27 July 2022).
Yang, R. (Ed.) 100 Classic Chinese Folk Songs (Jing Dian Zhong Guo Min Ge 100 Shou); Anhui Literature and Art Publishing House: Hefei, China, 2011. [Google Scholar]
Lü, Z.; Sun, J. From Traditional Folk Songs to New Folk Songs: A Revolutionary Transformation of Folk Songs in Northern Shaanxi. J. Yulin Coll. 2012, 22, 16–22. [Google Scholar]
Pearson, K.X. On the criterion that a given system of deviations from the probable in the case of a correlated system of variables is such that it can be reasonably supposed to have arisen from random sampling. Lond. Edinb. Dublin Philos. Mag. J. Sci. 1900, 50, 157–175. [Google Scholar] [CrossRef] [Green Version]
Johnston, I.; Ping, W. Daxue and Zhongyong (A Bilingual Edition); The Chinese University of Hong Kong Press: Hong Kong, China, 2012. [Google Scholar]
Yang, Z. A Case of Attempt to Combine the Chinese Traditional Culture with the Social Science: The Social Psychological Research of. J. Renmin Univ. China 2009, 3, 53–60. [Google Scholar]
Yang, Z.; Zhao, Z. The primary exploration of zhongyong practical thinking style. In Proceedings of the Fourth International Symposium on Chinese Psychology and Behavior Interdisciplinary (CPBI), Taibei, China, 29–31 May 1997; pp. 29–31. [Google Scholar]
Provine, R.C. Korean music in historical perspective. World Music 1985, 27, 3–15. [Google Scholar]
Malm, W.P. Traditional Japanese Music and Musical Instruments; Kodansha International: Tokyo, Japan, 2000. [Google Scholar]
Barandas, M.; Folgado, D.; Fernandes, L.; Santos, S.; Abreu, M.; Bota, P.; Liu, H.; Schultz, T.; Gamboa, H. TSFEL: Time series feature extraction library. SoftwareX 2020, 11, 100456. [Google Scholar] [CrossRef]
Hillewaere, R.; Manderick, B.; Conklin, D. Global Feature Versus Event Models for Folk Song Classification. In Proceedings of the 10th International Society for Music Information Retrieval Conference (ISMIR 2009), Kobe, Japan, 26–30 October 2009. [Google Scholar]
Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
Folgado, D.; Barandas, M.; Antunes, M.; Nunes, M.L.; Liu, H.; Hartmann, Y.; Schultz, T.; Gamboa, H. TSSEARCH: Time Series Subsequence Search Library. SoftwareX 2022, 18, 101049. [Google Scholar] [CrossRef]
Patel, A.; Shah, A.; Gor, K.; Mankad, S.H. IFSC: A Database for Indian Folk Songs Classification. In Advances in Speech and Music Technology; Springer: Berlin/Heidelberg, Germany, 2021; pp. 171–186. [Google Scholar]

Figure 1. The movable do solfège-based basic pitch histograms and merged pitch histograms for the two versions of Jasmine Flower. (1) “Hao yi duo mo li hua”; (2) “Hao yi duo mei li de mo li hua”.

Figure 2. The movable do solfège-based basic pitch histograms and merged pitch histograms for 23 representative traditional Chinese anhemitonic pentatonic folk songs, which naturally conform to the bell shape (

D o B e l l = 1

), with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets.

Figure 2. The movable do solfège-based basic pitch histograms and merged pitch histograms for 23 representative traditional Chinese anhemitonic pentatonic folk songs, which naturally conform to the bell shape (

D o B e l l = 1

), with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets.

Figure 3. The movable do solfège-based basic pitch histograms (left column), merged pitch histograms (left column), anhemitonic pentatonic pitch histograms (right column), and merged anhemitonic pentatonic pitch histograms (right column) for 11 representative traditional Chinese anhemitonic pentatonic folk songs, with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. After removing the fa and si bins not belonging to the Chinese anhemitonic pentatonic from the basic and merged pitch histogram (left column,

D o B e l l < 1

), the basic and merged anhemitonic pentatonic pitch histograms (right column) conform to the bell shape (

D o B e l l = 1

).

Figure 3. The movable do solfège-based basic pitch histograms (left column), merged pitch histograms (left column), anhemitonic pentatonic pitch histograms (right column), and merged anhemitonic pentatonic pitch histograms (right column) for 11 representative traditional Chinese anhemitonic pentatonic folk songs, with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. After removing the fa and si bins not belonging to the Chinese anhemitonic pentatonic from the basic and merged pitch histogram (left column,

D o B e l l < 1

), the basic and merged anhemitonic pentatonic pitch histograms (right column) conform to the bell shape (

D o B e l l = 1

).

Figure 4. The movable do solfège-based basic pitch histograms and merged pitch histograms for 11 representative traditional Chinese anhemitonic pentatonic folk songs, with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. The basic pitch histograms (

D o B e l l < 1

) contain individual bins that violate the bell shape, and the merged pitch histograms are all perfect bell-shaped (

D o B e l l = 1

).

Figure 4. The movable do solfège-based basic pitch histograms and merged pitch histograms for 11 representative traditional Chinese anhemitonic pentatonic folk songs, with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. The basic pitch histograms (

D o B e l l < 1

) contain individual bins that violate the bell shape, and the merged pitch histograms are all perfect bell-shaped (

D o B e l l = 1

).

Figure 5. The movable do solfège-based basic pitch histograms (left column), merged pitch histograms (left column), anhemitonic pentatonic pitch histograms (right column), and merged anhemitonic pentatonic pitch histograms (right column) for two representative traditional Chinese anhemitonic pentatonic folk songs, with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. After removing the fa and si bins not belonging to the Chinese anhemitonic pentatonic scale for the basic and merged pitch histogram (left column,

D o B e l l < 1

), the anhemitonic pentatonic pitch histograms (right column) still have individual bins that violate the bell shape (

D o B e l l < 1

). However, the merged anhemitonic pentatonic pitch histograms (right column) conform to the bell shape (

D o B e l l = 1

).

Figure 5. The movable do solfège-based basic pitch histograms (left column), merged pitch histograms (left column), anhemitonic pentatonic pitch histograms (right column), and merged anhemitonic pentatonic pitch histograms (right column) for two representative traditional Chinese anhemitonic pentatonic folk songs, with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. After removing the fa and si bins not belonging to the Chinese anhemitonic pentatonic scale for the basic and merged pitch histogram (left column,

D o B e l l < 1

), the anhemitonic pentatonic pitch histograms (right column) still have individual bins that violate the bell shape (

D o B e l l < 1

). However, the merged anhemitonic pentatonic pitch histograms (right column) conform to the bell shape (

D o B e l l = 1

).

Figure 6. The movable do solfège-based basic and merged pitch histograms for four representative traditional Chinese anhemitonic pentatonic folk songs that do not present a perfect bell shape (

D o B e l l < 1

), with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. Preserving only the anhemitonic pentatonic bins does not improve the bell-shapedness of the histograms.

Figure 6. The movable do solfège-based basic and merged pitch histograms for four representative traditional Chinese anhemitonic pentatonic folk songs that do not present a perfect bell shape (

D o B e l l < 1

), with the original names (Hanyu Pinyin) in curved brackets and the provinces of the origin in square brackets. Preserving only the anhemitonic pentatonic bins does not improve the bell-shapedness of the histograms.

Figure 7. The movable do solfège-based basic and merged pitch histograms for fourteen well-known traditional folk songs from different countries, languages, and cultures, with the languages of the original version in curved brackets and the countries of the origin in square brackets.

Figure 8. The movable do solfège-based basic and merged pitch histograms for a Portuguese, a Japanese, and an English well-known traditional folk song, respectively, using the Gregorian scale where the basic pitch histograms conform to the bell shape (

D o B e l l = 1

), with the languages of the original version in curved brackets and the countries of the origin in square brackets.

Figure 8. The movable do solfège-based basic and merged pitch histograms for a Portuguese, a Japanese, and an English well-known traditional folk song, respectively, using the Gregorian scale where the basic pitch histograms conform to the bell shape (

D o B e l l = 1

), with the languages of the original version in curved brackets and the countries of the origin in square brackets.

Figure 9. The movable do solfège-based basic and merged pitch histograms for four well-known East Asian traditional folk songs from cultural groups other than (Han) Chinese, with the languages/cultural groups of the original version in curved brackets.

Figure 10. The movable do solfège-based basic and merged pitch histograms for two well-known traditional Korean and Japanese folk songs using the anhemitonic pentatonic scale without a perfect bell-shaped pitch distribution (

D o B e l l < 1

), with the languages/cultural groups of the original version in curved brackets.

Figure 10. The movable do solfège-based basic and merged pitch histograms for two well-known traditional Korean and Japanese folk songs using the anhemitonic pentatonic scale without a perfect bell-shaped pitch distribution (

D o B e l l < 1

), with the languages/cultural groups of the original version in curved brackets.

Figure 11. Confusion matrices of the GaussianNB (left), the k-NN (middle), and the SVM (right) classifiers accumulating ten-fold results with the local best feature combinations. CAPS: traditional Chinese folk songs in the Chinese Anhemitonic Pentatonic Scale.

Table 1. The mean and standard deviation values of the three classifiers’ ten-fold cross-validation accuracies applying different combinations of features. The bolded results indicate the optimal performance of each classifier. The last two rows give the best leave-one-out cross-validation accuracies and their corresponding feature combinations as references. PH: pitch histogram; AP: anhemitonic pentatonic.

Feature Combination	GaussianNB	k-NN (k = 49)	SVM
4 $N T$ features	0.81 ± 0.11	0.83 ± 0.13	0.82 ± 0.09
4 $D o B e l l$ features	0.80 ± 0.11	0.87 ± 0.16	0.79 ± 0.10
4 features extracted from basic PH	0.66 ± 0.10	0.75 ± 0.12	0.73 ± 0.12
4 features extracted from merged PH	0.81 ± 0.08	0.82 ± 0.11	0.80 ± 0.08
4 features extracted from basic AP PH	0.77 ± 0.10	0.76 ± 0.11	0.73 ± 0.14
4 features extracted from merged AP PH	0.80 ± 0.07	0.78 ± 0.13	0.77 ± 0.09
All 8 features	0.82 ± 0.10	0.85 ± 0.12	0.82 ± 0.09
Best leave-one-out cross-validation result	0.80	0.81	0.80
Corresponding feature combination	All 8 features	4 $D o B e l l$ , $k = 3$	4 $N T$ features

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, H.; Jiang, K.; Gamboa, H.; Xue, T.; Schultz, T. Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs. Appl. Sci. 2022, 12, 8343. https://doi.org/10.3390/app12168343

AMA Style

Liu H, Jiang K, Gamboa H, Xue T, Schultz T. Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs. Applied Sciences. 2022; 12(16):8343. https://doi.org/10.3390/app12168343

Chicago/Turabian Style

Liu, Hui, Kun Jiang, Hugo Gamboa, Tingting Xue, and Tanja Schultz. 2022. "Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs" Applied Sciences 12, no. 16: 8343. https://doi.org/10.3390/app12168343

APA Style

Liu, H., Jiang, K., Gamboa, H., Xue, T., & Schultz, T. (2022). Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs. Applied Sciences, 12(16), 8343. https://doi.org/10.3390/app12168343

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

Abstract

1. Background

1.1. Movable Do Solfège and the Chinese Anhemitonic Pentatonic Scales/Modes

1.2. Pitch Histograms, Merged Pitch Histograms, and Their Anhemitonic Pentatonic Variants

1.3. Related Research on the Statistical Visualization of Western Folk Songs and Comparative Study

1.4. Bell Shapes and the Degree of Bell Shape (DoBell)

2. The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

2.1. Data Preparation

2.2. Illustrative and Quantitative Analysis

2.3. Pearson’s $χ^{2}$ Test

3. Explanation, Comparison, Discussion, and Extension

4. Initial Attempts at Lightweight Machine Learning

4.1. Features

4.2. Dataset for Pilot Classification Experiments

4.3. Classifiers

4.4. Experimental Results

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

Appendix A. The List of Songs Involved in This Article

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Bell Shape Embodying Zhongyong: The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

Abstract

1. Background

1.1. Movable Do Solfège and the Chinese Anhemitonic Pentatonic Scales/Modes

1.2. Pitch Histograms, Merged Pitch Histograms, and Their Anhemitonic Pentatonic Variants

1.3. Related Research on the Statistical Visualization of Western Folk Songs and Comparative Study

1.4. Bell Shapes and the Degree of Bell Shape (DoBell)

2. The Pitch Histogram of Traditional Chinese Anhemitonic Pentatonic Folk Songs

2.1. Data Preparation

2.2. Illustrative and Quantitative Analysis

2.3. Pearson’s χ 2 Test

3. Explanation, Comparison, Discussion, and Extension

4. Initial Attempts at Lightweight Machine Learning

4.1. Features

4.2. Dataset for Pilot Classification Experiments

4.3. Classifiers

4.4. Experimental Results

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

Abbreviations

Appendix A. The List of Songs Involved in This Article

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.3. Pearson’s $χ^{2}$ Test