1. Introduction
The Japanese plum (
Prunus salicina L.) is the most widely produced fresh plum. Due to its long cultivation history of over 3000 years, the genetic diversity is vast [
1]. This is evident in the range of colors of the fruit, tree architecture, chilling requirements, ripening timing and disease susceptibility differences among the cultivars. Proper characterization of these traits will greatly aid in generating the data required for genetic improvements.
Phenotyping, the process of observing and measuring an organism’s physical traits, plays a critical role in understanding and managing plant diseases [
2]. In the context of plant pathology, phenotyping involves the systematic assessment of plants to identify disease symptoms and quantify disease severity to understand the relationship between host plants and pathogens [
3]. This practice is fundamental for breeding disease-resistant plant varieties, developing effective disease management strategies, and ensuring food security. This study sets out to quantify the disease severity of the fungus black knot (BK)
Apiosporina morbosa (Schwein.) v. Arx on Japanese plum.
The stages of BK development have been described in detail earlier [
4,
5]. The life cycle of the BK fungus begins with the release of ascospores during the spring. The peak of ascospore release occurs at 10–27 °C when wetting periods exceed 30 mm of rain [
6]. These ascospores are forcibly ejected from mature knots during wet periods and are spread by wind and rain splash to land on susceptible young shoots or wounded areas of
Prunus species, particularly plums and tart cherries (
Prunus cerasus L.) [
7]. Once an ascospore lands on a suitable host, it germinates and penetrates the epidermis, growing internally for up to a year. A light brown swelling at the infection site develops the following spring, growing into a gall covered with the olive-green anamorphic stage (
Dibotryon morbosum (Schwein.) Theiss. & Syd.) by the early summer. The gall continues to grow and darken, until it becomes a black mass in the late summer. The fungus overwinters until the following spring, when the pseudothecia matures, and ejects ascospores to repeat the life cycle. The knot will be in a continual cycle of growth and dormancy as it is perennial and eventually impairs the vascular system of the tree, causing blockages.
Human activities, notably the movement of plant materials and inadequate horticultural practices, play a role in the dissemination of BK disease. These activities include improper sanitation of equipment, failure to effectively dispose of infected pruned materials, and use of infected but asymptomatic stock, which all lead to the introduction or spread of the disease [
8]. Orchard maintenance is compulsory; however, spores and mycelium can potentially spread via pruners. New cuts on trees also expose the interior of the branch to spores present in the air. If pruned knots or infected branches are not burned or buried, they remain a threat to the rest of the orchard [
9]. It was reported that ascospores could still be released four months after removal from the host [
6]. Using asymptomatic but infected nursery material which harbor the fungus will also lead to the spread of the disease. Currently, there is no reliable assay that can detect the presence of BK. For this reason, regions without BK have strict import restrictions on
Prunus spp. as
A. morbosa is on multiple watchlists listed as A1, according to the EPPO Global Database [
10]. Currently the distribution of BK remains in North America, covering Canada, the United States of America, and Mexico [
11].
Phenotyping diseases on trees presents a unique set of challenges that differ significantly from those encountered with smaller plants or crops. The complexities arise from the size, structure, and longevity of trees, as well as the multifaceted nature of tree–pathogen interactions. The sheer size of the trees poses the first obstacle to overcome. It can be difficult to achieve a comprehensive view of the organism in its entirety, leading to missed sections. Their size also influences the amount of space they require, as they are outdoor crops, which introduces many environmental variabilities in the research. Being perennial organisms, the length of time in which a pathogen completes its life cycle must also be taken into account. Seasonal variations will also affect the growth of both the tree and the pathogen. BK, being a systemic infection, compounds all these difficulties. Random sampling of tree parts is not effective since knots are distributed unevenly over the entire tree. The tree must be viewed holistically, in 360°, and from top to bottom to ensure each knot has been accounted for. This also means that there are currently no technological applications that can aid in quantifying BK.
The only visible symptoms of BK infection are the knots themselves. There is a strong correlation between the number and size of knots, and the severity of infection of each tree, making knot observation the key factor in phenotyping. As the fungus does not directly affect the leaves, fruit, or other visible regions besides the shoots, there are no other physiological markers from which to draw parallels [
12]. Therefore, a revised systematic rating system was developed, continuing from previous work [
5]. The empirically established route is through germination of an ascospore on the epidermis of a new tender shoot or wounded tissue [
7]. Furthermore, hyphae and conidia have been detected through the use of a scanning electron microscope (SEM) in areas of branches distal to knots [
4]. This is hypothesized to be another mode of transmission, but infection studies have yet to be carried out.
As the number or size of knots on a branch heightens, other symptoms begin to appear, such as girdling and leaf wilt [
8]. This is prevalent in canker-causing diseases, since the combination of the fungal invasion with the plant’s immune system causes abnormal growth of the tissues [
13]. Once the fungus has established itself in the host’s tissues upon germination, the plant’s immune system is activated due to its presence. It tries to compartmentalize the fungus by growing callus tissue [
14]. However, the fungus is able to colonize the callus tissue over time, turning the protective barrier into an amalgamation of host and fungal cells [
15]. The callus tissue is what leads to the girdling, as the xylem and phloem soon become overtaken by dysfunctional cambial cells [
15]. They gradually become obstructed so that water and carbon can no longer be transported [
4]. This is evident by the periphery of the branch turning yellow and wilting.
BK understanding up to this point has been limited by the lack of a systematic method to characterize the level of infection. It is a necessity for studying the disease and analyzing the relationships between genotypes and their inherent disease resistance. Only after this has been accomplished can BK research branch into new possibilities for genetic improvements.
4. Discussion
Low variability among the ratings is an important indicator of whether the raters could consistently follow the rating system. It was unsurprising there would be some degree of variability, due to the raters’ experience with BK, visual acuity, and general knowledge of
P. salicina tree health. Taking the median of all four ratings eliminated outliers and provided a rating that was based on a consensus. When combining the number of trees that had a range of 0 and 1 in their total counts (
Figure 3), this made up 69.5% of all the samples. This meant that of the 200 trees, 139 had a maximum difference of one between all four ratings. Therefore, the majority of counts fell within the low range, indicting low variability.
4.1. Improvements in the Revised BK Rating System
The main difference between the previous rating system by [
5] and the current one is the removal of the formula to determine the overall rating of a specific tree. In brief, each individual knot was given a rating based on its size from 1 to 5 and summed, then the total was divided by the number of knots on the tree. It was flawed because it was unable to differentiate between a tree with few but larger knots, and a tree with many smaller knots. The formula attempted to scale the number of knots per tree, causing misleading conclusions. For example, if a tree had 10 small knots rated as 1, the sum would be 10, and when divided by the number of knots, 10, the final score would be 1. However, if another tree had only 1 small knot rated as 1, the sum would also equate to 1, and dividing by the number of knots, 1, would yield the same final score of 1. In both cases, the final score would be identical, even though one tree had 10 knots, while the other had only 1. This meant the system could not differentiate between a heavily infected tree with many small knots and a minimally infected tree with just one small knot. As a result, the true severity of the infection was not reflected in the final rating, making it difficult to accurately assess and compare tree health based on the number and size of knots.
Additionally, the revised system began at 0, to indicate the absence of knots instead of 1. The tree’s health was also taken into account since the subjectivity of the rating depended on considering the tree as a whole, rather than solely the scaled sum of the knot values. Considering that the previous rating system used a different scale, comparisons being made between the past results and the current work must take this into account. The lower limit of both rating systems representing a knot-free tree served as the first constant. In the previous study, in 2015 and 2018, 20% and 16% of the trees were asymptomatic, respectively. Five years later in 2023, this number dropped significantly to 4%. The most likely explanation was that the environmental conditions were more favorable to
A. morbosa during this period. Due to the yearly increase in knots in the orchard, the abundance of
A. morbosa ascospores was likely on the rise. This would cause more of the pathogen to spread amongst the trees, since more inoculum was present. On the other hand, due to the atypical climate conditions in the previous years, this could have caused more favorable conditions for the fungus to spread and grow, which the tree’s defenses were not equipped to manage. Variations in temperature, humidity, and precipitation could have negatively influenced the tree’s overall health while also providing more favorable conditions to the pathogen. This included harsher winters, wetter springs, and dryer summers, all contributing to larger numbers of infections. The total annual precipitation and average temperatures increased from 2015 and 2018 to 2023 in the area, which correlated to amplified disease prevalence [
20]. A rating of 5, which was the upper limit of both scales, could also be reasonably compared. Contrasting to the counts of the lower limit, the number of trees given the highest severity increased from 2015 to 2018, from 0% to 6% but still remained at 6% in 2023. This implied that since there had not been a significant change in the number of highly susceptible trees in the population, the numbers had stabilized. The stabilization of the number of highly susceptible trees suggests that the overall population’s susceptibility had become consistent over the years, rather than suggesting individual trees were becoming more resistant.
4.2. Assessing Agreement with Kappa Statistic
Originally, Kappa scores were developed for the field of psychology. Since many diagnoses cannot be measured in empirical terms and often times there would be observations made by more than one professional, this statistic was developed to generate a way to report the reproducibility/reliability of a judgement. It was specifically designed for nominal scales; however, it was later improved to include scaled ratings, i.e., ordinal data. The Kappa statistic is superior to taking a simple calculation on percent agreement as it takes into the account the possibility of the consensus occurring by chance [
21]. Kappa scores are expressed as −1 to 1. A negative score signifies that the raters are in high disagreement with each other. A score of 1 indicates complete agreement, while 0 indicates agreement is only based on chance [
22].
4.3. Challenges with Visual Rating Systems
As noted, all eight of the trees with a difference of three across all four ratings consistently had higher ratings from both raters on the first day. Although this high level of variance accounted for the smallest proportion of all trees, it could be deduced that trees with a medium overall rating, were more prone to rating fluctuations, as seen from the majority of these trees having a median rating between 2.5 and 3.5. This pattern suggests that external factors, including environmental conditions and observer biases, may have influenced the assessments. The fact that the first-day ratings were higher could indicate that the raters were initially less familiar with both the disease symptoms and rating system, leading to more cautious and severe scoring. Additionally, the weather conditions such as lack of cloud cover and direct sunlight may have also played a role in making the knots more pronounced and obvious, as opposed to having them more easily mistaken as injury or the previous season’s dried and darkened plums.
Visual rating systems do not have the highest precision; however, for diseases like BK, options are scarce. With the help of machine learning and sensor-based image systems, we attempted to ascertain whether we could train a computer model to quantify the knots on a tree to improve the accuracy of the ratings. This included using a high-definition red, green, and blue (RGB) camera and a depth camera to generate a three-dimensional (3D) image of the tree. The plan was to manually select the knots on a sample set of data and then train an algorithm on detecting knots. With the depth camera, a 3D point cloud could have been generated to calculate the percentage of affected areas. Unfortunately, even with multiple viewpoints of the tree, the resolution was not high enough to differentiate between knots, shadows, or dried fruits. An even more sophisticated system must be implemented to phenotype this difficult disease as the entire tree has to be taken into account. A single missed knot could alter the rating greatly, from 0 to 1.
It is well understood that visual assessment in horticulture poses many limitations. There can be significantly different conclusions due to the results being subject to observer biases. As defined by Johnson et al., bias is the under- or over-estimation of disease due to errors in visual accuracy and diagnosis [
23]. When the disease is diagnosed by visual symptoms alone, anything that can affect the perception of the perceivable manifestations of the illness has the potential to alter the prognosis. Regardless of the limitations, visual assessments have been the longstanding method in breeding and cultivation due to their practicality and cost-effectiveness. Factors that may affect the precision of the ratings are the raters’ experience, amount of training, depth of instruction, visual acuity, and sensitivity to outdoor conditions such as light [
3]. One obvious drawback is that visual assessments lack the ability to detect asymptomatic plants. This can be especially problematic for diseases such as BK, which have a long inoculation period, in which there could be high disease prevalence in an orchard, but can go undetected for up to two years [
6]. Then, a sudden spike of infected trees can erupt, and heavy pruning and fungicides must be implemented to prevent the spread of the disease.
Standard area diagrams (SAD) have been used for over 50 years and have been shown to improve the accuracy of disease phenotyping [
24]. SADs are commonly used in plant pathology to provide visual references for quantifying disease severity on leaves or small plant parts. However, they are not applicable for assessing BK disease on whole trees due to several inherent limitations. First, BK disease manifests as galls of varying sizes and numbers across different parts of the tree, including the branches and the trunk, making it challenging to capture this complexity in a two-dimensional diagram. Additionally, the 3D nature of trees and the distribution of knots across a large and often irregular surface area cannot be adequately represented by SADs, which are designed for relatively uniform and flat surfaces like leaves, fruits, or plantlets [
25]. Moreover, the severity of BK disease impacts the overall structure and health of the tree, aspects that require a holistic view rather than isolated assessments of discrete areas. As such, subjective phenotyping systems that factor in the total number, size, and distribution of knots, as well as the overall health impact on the tree, are more appropriate for evaluating BK disease severity in whole trees.
Visual inspection of phony peach disease (
Xylella fastidiosa subsp.
multiplex Schaad) showed high accuracy, 0.914 and 0.816 (experienced vs. inexperienced raters), after confirmation with quantitative polymerase chain reaction (qPCR). Their visual assessment also involved the entire tree; however, their rating system was markedly different, comprising of only the presence or absence of the disease. In this sense, their rating system was much simpler, but the detection of the disease itself was far less obvious compared to BK. The symptoms were difficult to differentiate between stressed trees, so there were both false negatives and false positives [
23]. False negatives can be common in BK rating if knots have yet to mature, while false positives are rarer since few other fungi species have a similar presentation of BK in our region.
4.4. Potential to Improve Phenotyping Accuracy
Another approach that could be beneficial in the study of BK is artificial inoculation and assessment. Although infection with BK of plum shoots has yet to be reported in vitro, this could be an important milestone to accomplish in BK research. Many more genotypes could be tested in a small amount of space, while eliminating all other variables such that the growing conditions are virtually indistinguishable. Analysis of the growing lesions could be more easily assessed as they would not have to be conducted outdoors under variable weather and lighting conditions. Growth of the fungus would also not be dependent on the seasons, allowing for ongoing assessment. Lastly, the use of an automated image software would be more easily facilitated. A similar approach was taken by Li et al., (2015) studying bacterial cankers (
Pseudomonas syringae Van Hall) on a few
Prunus species [
26]. They coupled this approach with automated image analysis using neural networks to recognize the patterns. However, the application of this in BK research may pose certain problems as the susceptibility in vitro may not reflect the true phenotype in the field.