1. Introduction
Stripe rust caused by Puccinia striiformis f. sp. tritici (Pst) and leaf rust caused by Puccinia triticina (Pt), two important airborne fungal diseases on wheat (Triticum aestivum), occur widely in wheat-growing areas worldwide and can cause severe yield losses of wheat [1,2,3]. Both Pst and Pt are heteroecious and can produce five kinds of spores, namely pycniospores, aeciospores, urediospores, teliospores, and basidiospores, throughout their life cycles [
4,
5]. Stripe rust and leaf rust can occur during all growth stages of wheat. In the seedling growth stage, the symptoms of stripe rust are easily confused with those of leaf rust because both diseases induce scattered uredia on the wheat leaves [
6]. In the adult plant stage, the uredia produced by 
Pst infection are usually distributed in intermittent, long, and narrow stripes on the wheat leaves [
6,
7], while the uredia produced by 
Pt infection are irregularly arranged on the leaves. The disease resistance level of wheat generally varies among growth stages, and the disease symptoms differ accordingly to a certain extent. In the field, wheat stripe rust and wheat leaf rust are traditionally identified by manual visual observation. However, experienced plant protection personnel and technicians often do not have enough time to reach the field to diagnose wheat diseases and provide services for farmers, so manual observation cannot meet the requirements for real-time disease diagnosis and monitoring. Rapid and accurate diagnosis of wheat stripe rust and wheat leaf rust is the key to the prevention and control of the diseases.
With the rapid development of machine learning, the identification and monitoring of plant diseases based on information technology have received extensive attention [
8]. Techniques for image acquisition and image processing have rapidly developed, and the corresponding technological progress has turned plant disease image processing issues (including segmentation, feature extraction, and identification of plant disease images) into hot research topics [
9,
10,
11,
12,
13,
14]. Many studies on plant disease image identification using traditional image processing technology have been reported [
9,
10,
11,
12,
14,
15]. Using traditional image processing technology, the disease lesions are segmented from plant disease images, then the useful features are extracted from the disease lesion images and are selected for modeling. Finally, machine learning methods are used to implement the identification of the plant disease images. Studies on image identification of wheat stripe rust and wheat leaf rust using traditional image processing technology have also been reported [
16,
17,
18]. The performance of traditional image processing technology in identifying plant disease images is affected by various factors, including the number, quality, and representativeness of the acquired disease images; the image processing techniques and methods used; the extracted disease image features and feature selection methods; and the machine learning methods used. In the traditional identification methods of plant disease images, the feature values, position distribution, and area sizes of the target regions in the disease images can change with the different development stages of plant diseases [
9,
11,
18]. Furthermore, if the disease images are acquired from the same kind of plant or if the number of the acquired disease images is not enough, the identification performance and generalization ability of plant disease identification models can be affected [
11,
19]. Deep learning can extract image features and perform pattern recognition automatically, and it shows outstanding performance in image recognition, image segmentation, target detection, and other fields. Consequently, a growing number of studies apply deep learning to plant disease identification and plant disease segmentation [
13,
14,
20,
21,
22,
23,
24,
25,
26,
27]. At present, some deep learning models have been applied to image identification of wheat stripe rust and wheat leaf rust [
28,
29,
30]. Lu et al. [
28] implemented the identification and localization of wheat diseases (including powdery mildew, smut, black chaff, stripe rust, leaf blotch, and leaf rust) based on an in-field wheat disease image data set using deep multiple instance learning (DMIL) under two fully convolutional network (FCN) architectures (VGG-FCN-VD16 and VGG-FCN-S), achieving mean recognition accuracies of 97.95% and 95.12%, respectively. Based on the images of wheat powdery mildew, wheat stripe rust, and wheat leaf rust acquired in in-field environments, Feng et al. [
29] developed a wheat leaf disease image recognition model using the lightweight convolutional neural network MobileNetV2 and transfer learning under the deep learning framework TensorFlow 2.0, achieving an average accuracy of 99.96% for image recognition of the three wheat diseases. Jiang et al. [
30] conducted a comparison of seven convolutional neural networks (CNNs) (including VGG-16, Inception-v3, ResNet-50, DenseNet-121, EfficientNet-B6, ShuffleNet-v2, and MobileNetV3) for the identification of field images of powdery mildew, leaf rust, and stripe rust on wheat.
Redmon et al. [
31] proposed a target detection model named YOLO (You Only Look Once). YOLO treats target detection in an image as a regression problem, taking the image as input and outputting the locations and classes of the targets. Compared with traditional target detection models, YOLO runs fast, reasons over the whole image when making predictions, and learns generalizable feature representations of the targets. YOLOv5, one of the YOLO series models, includes four network models: YOLOv5s, YOLOv5m, YOLOv5l, and YOLOv5x. Among them, YOLOv5s has the smallest depth and the smallest feature map width. YOLOv5 has been applied to image-based plant disease identification [
32,
33,
34,
35]. Mathew and Mahesh [
32] trained a YOLOv5-based model with collected images of bell peppers and achieved a good performance of the trained model in detecting bacterial spot in leaf images of bell peppers. Qiu et al. [
33] developed a YOLOv5l-based method and an application (app) named ‘HLBdetector’ for image-based detection of citrus Huanglongbing (HLB). Xu et al. [
34] developed a lightweight ShuffleNetv2-based YOLOv5s network model for the detection of wheat stripe rust, achieving better detection performance and faster detection speed. A lightweight YOLO-V5s-based model for apple leaf disease detection (ALAD-YOLO) was developed by Xu and Wang [
35], and a detection accuracy of 90.2% was achieved in the apple leaf disease detection experiment using ALAD-YOLO. As far as we know, however, YOLOv5 has not yet been applied to identifying both wheat stripe rust and wheat leaf rust together.
In recent years, some advances have been made in research on image identification of wheat stripe rust and wheat leaf rust, but there are still various challenges and difficulties to be faced. At the present time, public image data sets of wheat stripe rust and wheat leaf rust are still scarce. In most of the existing studies, the investigations of image identification of the two wheat diseases were carried out based on the disease images with simple backgrounds from self-built image data sets, rather than based on ones acquired in complex field environments. Image features of plant diseases can be affected by many factors, such as plant variety, disease development stage, and image acquisition environment [
9,
11,
18,
24]. During image identification of wheat stripe rust and wheat leaf rust, identification performance can be affected by changes in disease image features that may result from changes in disease development stages. Most of the existing studies on image-based plant disease identification are based on images acquired in a single plant growth stage or on images for which the plant growth stage is not specified, and there is a lack of research on algorithms or models for identifying disease images acquired during different plant growth stages. To the best of our knowledge, no research has been reported on how the performance of the built models differs among disease images acquired in different growth stages of plants. Building image identification models of stripe rust and leaf rust on wheat in different growth stages therefore needs further research. In this study, data sets of disease images of stripe rust and leaf rust on wheat with different backgrounds acquired at the different growth stages were constructed, and then effective methods for image identification of the two diseases on wheat in different growth stages were investigated.
To overcome the influence of wheat growth stages on disease image identification, image-based identification of stripe rust and leaf rust on wheat in the different growth stages was investigated using deep learning in this study. The images of wheat stripe rust and wheat leaf rust with complex backgrounds and single backgrounds were acquired at the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, and milk development stage of wheat (in this study, the six growth stages refer to the stages of Z12–Z14, Z30–Z39, Z40–Z49, Z50–Z59, Z60–Z69, and Z70–Z77, respectively, according to the decimal code for growth stages of cereals determined by Zadoks et al. [
36]) in an indoor environment and in the field using a digital camera and a smartphone. Then, a total of seven disease image data sets of the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, milk development stage, and all the growth stages were constructed. Based on the YOLOv5s model and the constructed image data sets, the image-based identification models of wheat stripe rust and wheat leaf rust at the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, milk development stage, and all the growth stages were trained, validated, and tested. The main aims of this study are to investigate whether identification performance based on the YOLOv5s model differs among images of stripe rust and leaf rust on wheat acquired at the different growth stages, and to overcome the unsatisfactory identification performance caused by differences in disease image features among growth stages. The results can provide a reference for further extraction of disease image features, for building disease image identification models, and for accurately identifying the two wheat diseases, which is conducive to timely and scientific disease control measures that reduce yield losses in wheat.
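The six growth stages defined above can be written down as a small lookup. The following Python sketch is illustrative only: the names `STAGE_RANGES` and `stage_for_zadoks` are hypothetical and are not part of the original workflow. The sketch simply encodes the six Zadoks code ranges listed in this study and reports codes outside those ranges (for example, Z25) as not covered.

```python
from typing import Optional

# Zadoks decimal-code ranges (inclusive) for the six growth stages
# as listed in this study; the dictionary name is hypothetical.
STAGE_RANGES = {
    "seedling growth": (12, 14),
    "stem elongation": (30, 39),
    "booting": (40, 49),
    "inflorescence emergence": (50, 59),
    "anthesis": (60, 69),
    "milk development": (70, 77),
}


def stage_for_zadoks(code: int) -> Optional[str]:
    """Return the growth stage name for a Zadoks code, or None if not covered."""
    for stage, (low, high) in STAGE_RANGES.items():
        if low <= code <= high:
            return stage
    return None


if __name__ == "__main__":
    for code in (13, 45, 65, 25):
        print(code, stage_for_zadoks(code))
```

Such a lookup could be used to assign an image to the data set of its growth stage when the Zadoks code is recorded at acquisition time.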
  3. Discussion
Image acquisition is the basis of plant disease image identification. With the growth of wheat plants and the change in the growth stages of wheat, the leaf cuticle will gradually thicken, the leaves may be chlorotic and yellowing in the later stages, and the color and shapes of uredia of wheat stripe rust and wheat leaf rust may be different at different periods, which will affect the extracted disease image features, thus further affecting the disease image identification. The changes in illumination conditions may lead to the difference between the color obtained from a disease image and the color obtained from human visual perception, thus affecting the extracted disease image features. At present, the images in the data sets used for image identification of wheat stripe rust and wheat leaf rust were acquired at an individual growth stage or not classified according to the growth stages [
16,
17,
18,
28,
29,
30]. Moreover, most of the disease images used had simple backgrounds and a single disease location, and they were acquired under similar illumination conditions. However, in actual agricultural production, wheat can be infected with 
Pst and 
Pt in the different growth stages, and different severity levels of wheat stripe rust and wheat leaf rust can occur at diverse disease locations under different environments. Therefore, to adapt to the actual agricultural production, the acquired disease images should contain complex backgrounds in the field, and factors such as image acquisition equipment and illumination conditions should be taken into account when acquiring disease images. Thus, the constructed disease image data sets may contain more abundant images, providing a basis for obtaining trained disease image identification models with greater robustness. Generally, there is a process of continuous expansion and change in the lesions during disease lesion formation, so disease image identification in the different development stages of plant diseases still needs further research [
37]. Until now, there have been no reported studies on image identification of plant diseases in the different growth stages. Therefore, it is necessary to carry out related research on the image identification of plant diseases in the different growth stages to investigate whether the disease image features extracted at an individual growth stage are also applicable to the identification of the same kind of disease in other growth stages, which provides a new idea for the construction of image data sets for disease identification using image processing technology. The results of this study showed that, during the process of building a disease image identification model, factors such as the differences between disease image features in the different growth stages should be taken into account. When selecting disease images for model training, disease images in the different growth stages should be included, rather than only disease images acquired in an individual growth stage or a minority of individual growth stages. Disease image identification modeling should be conducted on the basis of fully considering the different characteristics of disease images in the different growth stages, which may provide a solution to improve the identification performance of the disease identification models.
When using target detection models for plant disease image identification, image labeling and model optimization have a great influence on model building and identification performance of the built models. Manual labeling of disease images is time-consuming and labor-intensive, and the subjectivity of labeling affects the results of labeling [
24]. CNN-based semi-supervised learning algorithms and unsupervised learning algorithms can reduce the cost and time of data labeling, and they have been used in many fields of plant phenotype research [
38]. In this study, the uredia of wheat leaf rust were scattered on the wheat leaves, and a single leaf carried multiple uredia, which made image labeling very challenging. This may have affected the training of the YOLOv5s-based image identification models and resulted in the relatively low values of the evaluation metrics for identification of wheat leaf rust using the built models. In further studies, the labeling results can be checked or the images re-labeled, the hyperparameters of the YOLOv5s model can be adjusted, and the model can be re-trained to build better-optimized models for image identification of wheat stripe rust and wheat leaf rust in the different growth stages.
In this study, the disease image identification model built on the training set composed of the images acquired in all the growth stages produced acceptable identification results, but its identification performance was not particularly good. Increasing the number of disease images acquired at each individual growth stage would allow the data set to contain more images acquired under complex environments, which may improve the identification performance and generalization ability of the built model. In addition, the model can be further optimized by changing its structure. Transfer learning with an unmodified deep learning model may not achieve good identification performance. The original deep learning models can be improved by incorporating other network structures and mechanisms, and disease image identification models based on such modified architectures may be more suitable for target detection and disease identification. For instance, Ma et al. [
14] used YOLOv8 as the base model, replaced the backbone network of YOLOv8 with a PP-LCNet (PP-lightweight CPU network), introduced the depthwise separable convolution (DepthSepConv) structure into the backbone layer, and then added a global attention mechanism (GAM) module and a lightweight content-aware reassembly of features (CARAFE) module to the neck section. They subsequently replaced the original loss function with the Wise-IoU (weighted interpolation of sequential evidence for intersection over union) boundary loss function, and finally, they built a lightweight detection model PGCW-YOLOv8 for wheat diseases. Using this PGCW-YOLOv8 model to identify images of wheat leaf rust, wheat stripe rust, wheat scab, wheat glume blotch, wheat powdery mildew, healthy wheat ears, and healthy wheat leaves, much better identification performance was achieved.
To facilitate practical applications, a number of PC-based computer systems and apps for plant disease image identification have been developed [
13,
28,
33,
39,
40]. Based on the color and shape features extracted from the images of common diseases of wheat, Wang et al. [
39] developed an identification system of common wheat leaf diseases using the VC++ platform in combination with image processing technology, achieving an identification accuracy of more than 96% in the image identification of wheat powdery mildew, wheat leaf rust, wheat stripe rust, and wheat stem rust using this system. Johannes et al. [
40] proposed an image processing algorithm combining candidate hot-spot detection with statistical inference methods and then developed a smartphone app for the identification of disease images of wheat septoria, rust, and tan spot. Based on DMIL and FCN, Lu et al. [
28] proposed an in-field wheat disease identification and localization model called the Multiple Instance Learning-Based Wheat Disease Diagnosis System (DMIL-WDDS), and then they packaged this model into a mobile app. In further studies, the parameters of the YOLOv5s-based disease identification model trained on the disease images acquired at all the growth stages can be optimized, or other network models can be trained, to improve image identification accuracy and generalization ability; an easy-to-use computer system or app can then be developed based on the resulting model.
  4. Materials and Methods
In this study, YOLOv5s-based image identification models of wheat stripe rust and wheat leaf rust in the different growth stages were developed according to the workflow shown in Figure 1.
  4.1. Planting Wheat and Artificial Spray Inoculation of Urediospores of Pst and Pt
Wheat was planted at three sites: a controlled climate chamber (12–15 °C, 50–70% relative humidity, and 12 h/12 h light/dark cycle) in the Laboratory of Macro-Phytopathology, China Agricultural University, Beijing, China; a field at the Shangzhuang Experimental Station of China Agricultural University, Haidian District, Beijing, China; and a field at the Gangu Testing Station at the Institute of Plant Protection, Gansu Academy of Agricultural Sciences, Gangu, Gansu, China. Planting followed the methods previously described by Wang et al. [
18]. In the controlled climate chamber, a wheat variety Mingxian 169 (highly susceptible to Pst and Pt) was used; at the Shangzhuang Experimental Station, wheat varieties Zhongmai 175 (highly resistant to Pst and moderately susceptible to Pt), Beijing 0045 (moderately resistant to Pst and Pt), Nongda 211 (moderately resistant to Pst and Pt), and Mingxian 169 were used; and at the Gangu Testing Station, wheat varieties Longjian 9825 (highly resistant to Pst), Tianxuan 66 (moderately resistant to Pst), and Longjian 9822 (moderately susceptible to Pst) were used. The inoculation operations of Pst and Pt were carried out using artificial spray inoculation methods. The pathogens (Pst and Pt) used for inoculation were multiplied under controlled climate chamber environments.
  4.1.1. Planting Wheat and Artificial Spray Inoculation of Urediospores of Pst and Pt in the Controlled Climate Chamber
In each pot with a diameter of 10 cm and a height of 10 cm, nursery soil containing organic matter was used as the substrate, and 15 seeds of Mingxian 169 were sown. After sowing, the pots were placed into the controlled climate chamber under the conditions described above. When the first leaves of the wheat seedlings of Mingxian 169 had fully expanded, pathogen inoculation operations were carried out using the artificial spray inoculation method described by Cheng et al. [41]. Before inoculation, the inoculation operation environment, inoculation operation table, and inoculation tools were disinfected with a 75% ethanol solution. The emerging spear leaves of the wheat seedlings were cut off to retain only the first fully expanded leaves. The treated wheat seedlings were placed into an inoculation box with a small amount of water. To facilitate pathogen infection, the seedlings were sprayed evenly with clear water using a sprayer and were then gently washed by hand to remove the wax and pubescence on the surface of the leaves. The urediospore suspensions of Pst and Pt were made with 0.05% Tween 20 solution. The urediospore suspension of Pst or Pt was evenly sprayed onto the leaves of wheat seedlings in the inoculation box, and then the inoculation box was covered with its lid after the inside of the lid was sprayed with an appropriate amount of clear water. Subsequently, the inoculation box with the inoculated wheat seedlings was gently moved into the controlled climate chamber. After 24 h, each pot with the inoculated wheat seedlings was taken out of the inoculation box and was then covered with a clean, transparent glass cylinder whose top was wrapped with two layers of sterile cotton gauze. Finally, all the inoculated wheat seedlings were moved into the controlled climate chamber for incubation; the seedlings were watered as needed, and their emerging spear leaves were cut off.
  4.1.2. Planting Wheat and Artificial Spray Inoculation of Urediospores of Pst and Pt in the Field
At the Shangzhuang Experimental Station, wheat seeds were sown in October 2020 and October 2021. In each year, 21 experimental plots (3 × 4 m per plot) were set up. In 2020, Beijing 0045, Mingxian 169, and Zhongmai 175 were planted in the same order in each plot, and Nongda 211 was planted in protective belts between the plots. In 2021, Beijing 0045, Mingxian 169, and Nongda 211 were planted in the same order in each plot, and Zhongmai 175 was planted in protective belts between the plots. In April 2021 and April 2022, inoculation operations of Pst and Pt were carried out in the field using the artificial spray inoculation method described by Wang et al. [42]. In each year, Pst and Pt were each inoculated in nine plots, and the remaining three plots were used as controls. In 2021, urediospore suspensions at three concentrations of 300 mg/L, 200 mg/L, and 100 mg/L were used for the artificial spray inoculation of Pst and Pt. In 2022, urediospore suspensions at three concentrations of 600 mg/L, 400 mg/L, and 200 mg/L were used for the artificial spray inoculation of Pst and Pt. For each pathogen, the urediospore suspension at each concentration was used to inoculate the wheat seedlings in three plots. Before inoculation, the wheat seedlings of a plot to be inoculated were sprayed evenly with clear water using a sprayer and were gently washed by hand to remove the wax and pubescence on the surface of the leaves. Subsequently, the treated wheat seedlings were evenly sprayed with the prepared urediospore suspension and were immediately covered with a plastic film sprayed with water droplets. To keep the inoculated wheat seedlings in the plot moist, the edges of the plastic film were covered with earth. Between 8:00 and 9:00 (Beijing Time) the next day, the plastic film was removed from all of the inoculated plots.
At the Gangu Testing Station, nine experimental plots (4.5 × 4.6 m per plot) were set up, and wheat seeds were sown with one wheat variety per plot in October 2020. Each of the wheat varieties Longjian 9825, Tianxuan 66, and Longjian 9822 was planted in three plots, with a cluster of Mingxian 169 planted in the center of each plot and Longjian 9825 planted in protective belts between the plots. In March 2021, the wheat seedlings of Mingxian 169 planted in the center of each plot were inoculated with a Pst urediospore suspension at a concentration of 1500 mg/L. After spraying the urediospore suspension, the wheat seedlings of Mingxian 169 were covered with a plastic bag, and then the edges of the plastic bag were covered with earth to keep the inoculated wheat seedlings moist. Between 8:00 and 9:00 (Beijing Time) the next day, the plastic bags were removed from all the seedlings.
  4.2. Acquisition of Disease Images and Construction of Disease Image Data Sets
The images of wheat stripe rust and wheat leaf rust were acquired in six growth stages of wheat: the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, and milk development stage. A Nikon D700 digital camera (Nikon Corp., Tokyo, Japan) and a HUAWEI P30 smartphone were used to acquire the disease images, and the sizes of the corresponding acquired disease images in JPEG format were 4256 × 2832 pixels and 3648 × 2736 pixels, respectively.
In the seedling stage, images of wheat stripe rust and wheat leaf rust with complex backgrounds and single backgrounds were taken using a multi-angle image shooting method under different light intensity conditions in the indoor environment. To acquire the disease images with complex backgrounds, two large sheets of white paper with the size of 1 × 1 m were used as the wall background and the desktop background; a 10 × 10 cm white foam board was placed on the white paper covering the desktop, and the diseased wheat seedlings in the pot to be photographed were placed on the foam board; two table lamps were placed on the left and right sides of the seedlings to illuminate the diseased wheat leaves; and different light intensities were obtained by adjusting the positions and heights of the lamps when taking the disease images. In the stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, and milk development stage, the images of wheat stripe rust and wheat leaf rust with complex backgrounds and single backgrounds were taken from multiple angles in multiple time periods in the field environment. To acquire the disease images with single backgrounds in the indoor environment and in the field, single leaves with typical symptoms of wheat stripe rust or wheat leaf rust at different severity levels were collected from the diseased wheat seedlings, then each diseased single leaf was fully expanded as flat as possible on a sheet of A4 white paper. Subsequently, the images of the diseased single leaves were taken. The specific acquisition information on the images of wheat stripe rust and wheat leaf rust is shown in 
Table 8. The examples of the images of wheat stripe rust and wheat leaf rust with complex backgrounds and single backgrounds, acquired in the indoor environment and in the field, are shown in 
Figure 2.
In this study, a total of 15,722 images of wheat stripe rust and wheat leaf rust were acquired. Based on disease images with typical disease symptoms and different backgrounds acquired in an indoor environment and in the field, seven image data sets were constructed for the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, milk development stage, and all the growth stages, named Data Set 1, Data Set 2, Data Set 3, Data Set 4, Data Set 5, Data Set 6, and Data Set 7, respectively. Data Set 7, the image data set for all the growth stages, was composed of all the disease images acquired in the different growth stages. Because the period available for disease image acquisition in the stem elongation stage of the wheat was short, fewer disease images were acquired in this stage than in the other growth stages. The numbers of disease images in the data sets for the seedling growth stage, booting stage, inflorescence emergence stage, anthesis stage, and milk development stage differed little. The information on the seven constructed image data sets of wheat stripe rust and wheat leaf rust is shown in
Table 9.
  4.3. Configuration of the Data Analysis Operating Environment
Anaconda software (version 4.9.2) suitable for the system requirements of the computer used for data analysis was downloaded from the Anaconda official website (
https://www.anaconda.com/download) and was then installed. After starting Anaconda, a new Python (version 3.8.12) virtual environment named yolov5 was created for installing PyTorch. The “Win” and “R” keys on the keyboard were pressed simultaneously to launch the Run command window, “cmd” was entered, and the “OK” button was clicked to open the command line window. In the command line, the command “nvidia-smi” was entered, and the “Enter” key on the keyboard was pressed. The output showed the graphics card information, including the driver version and the supported CUDA (compute unified device architecture) version. The corresponding supported CUDA version (version 10.2) was downloaded from the CUDA official website (
https://developer.nvidia.com/cuda-toolkit-archive) and installed. Subsequently, the cuDNN version (version 8.1.0) corresponding to the CUDA was downloaded from the cuDNN official website (
https://developer.nvidia.com/rdp/cudnn-archive) and installed. The created virtual environment yolov5 was activated, and then the corresponding version of PyTorch (version 1.10.0), i.e., the GPU version, was downloaded from the PyTorch official website (
https://pytorch.org/) and installed.
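After installation, it may be worth confirming that the installed PyTorch build can actually see the GPU. The following Python sketch shows one way such a check could be done; it is an assumption about a reasonable verification step and is not part of the workflow reported in this study. The helper name `describe_environment` is hypothetical, and the sketch returns placeholder values when PyTorch is not installed.

```python
import platform


def describe_environment() -> dict:
    """Collect version information relevant to a PyTorch/CUDA setup."""
    info = {
        "python": platform.python_version(),
        "torch": None,
        "cuda_build": None,
        "cudnn": None,
        "gpu_available": False,
    }
    try:
        import torch  # imported lazily so the check also runs without PyTorch
    except ImportError:
        return info

    info["torch"] = torch.__version__
    info["cuda_build"] = torch.version.cuda  # None for CPU-only builds
    info["gpu_available"] = bool(torch.cuda.is_available())
    if info["gpu_available"]:
        # cuDNN version as reported by the PyTorch build in use
        info["cudnn"] = torch.backends.cudnn.version()
    return info


if __name__ == "__main__":
    for key, value in describe_environment().items():
        print(f"{key}: {value}")
```

If `gpu_available` is reported as `False` in the yolov5 environment, the installed PyTorch build, the driver, and the CUDA version are probably not compatible with each other and should be re-checked before training.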
From the GitHub website (
https://github.com/ultralytics/yolov5), the source code of YOLOv5 was downloaded. In the Terminal window of PyCharm (version 2021.2.1), a Python integrated development environment, the command “pip install -r requirements.txt” was entered to install the dependencies (packages) listed in the file requirements.txt in the root directory of the source code. The weight file of YOLOv5 was obtained from the Releases page of the repository (
https://github.com/ultralytics/yolov5/releases). On this page, the YOLOv5 section was found, the weight file yolov5s.pt was downloaded from the Assets panel, and this weight file was placed into the root directory of the YOLOv5 source code.
The building, training, and testing of the YOLOv5s-based image identification models of stripe rust and leaf rust on wheat in different growth stages were conducted under the PyTorch framework using the CUDA parallel computing architecture. The configurations of the data analysis operating environment in detail are shown in 
Table 10.
  4.4. Disease Image Labeling
The LabelImg software (version 1.8.6), an open-source data labeling tool, was used to label the images of wheat stripe rust and wheat leaf rust. The virtual environment yolov5 was launched in the PyCharm software, and LabelImg was installed by entering the command “pip install labelimg” in the Terminal window. The command “labelimg” was then entered in the Terminal window to open the LabelImg user interface. In the user interface, the folder containing the wheat disease images to be labeled was selected by clicking the “Open Dir” button, and the folder for saving the class labels of the labeled images (an existing folder or a newly created one) was selected by clicking the “Change Save Dir” button. Then, the seventh option from top to bottom on the left side of the user interface, which sets the annotation format, was switched to YOLO. Subsequently, the following options were selected in the “View” menu: Auto Save mode (used to automatically save the class label of the labeled image when switching to the next unlabeled image), Display Labels (used to display the bounding boxes and class labels), and Advanced Mode (used to set the labeling crosshair to always appear on the user interface). The disease images were then manually labeled one by one: the left mouse button was pressed and dragged to draw a bounding box around the target to be labeled, and the class of the target was manually set to “TX” (referring to wheat stripe rust) or “YX” (referring to wheat leaf rust). The following hotkeys were used during labeling: “W” (create a bounding box), “A” (switch to the previous image), “D” (switch to the next image), and “Delete” (delete the selected bounding box). Each hotkey was pressed on its own, without any other key being pressed simultaneously.
To delete a bounding box, the “Edit RectBox” button (the ninth option from top to bottom on the left side of the LabelImg user interface) was clicked, the bounding box to be deleted was selected, and the “Delete” hotkey was pressed. After the wheat disease images were labeled, the corresponding .txt label files were generated. For each target, a label file contains the class, x_center (the normalized X coordinate of the center point of the bounding box), y_center (the normalized Y coordinate of the center point of the bounding box), width (the normalized width of the bounding box), and height (the normalized height of the bounding box).
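As an illustration of the label format described above, the following minimal sketch parses one line of a YOLO-format .txt label file and converts the normalized values to pixel coordinates. In YOLO-format files the class is stored as an integer index. The mapping of index 0 to “TX” and index 1 to “YX” is an assumption made here for illustration (the actual order depends on the class list used during labeling), and the helper names `parse_yolo_line` and `to_pixel_corners` are hypothetical.

```python
from typing import NamedTuple, Tuple

# Assumed index-to-name mapping; the real order depends on the labeling session.
CLASS_NAMES = {0: "TX", 1: "YX"}


class YoloBox(NamedTuple):
    class_id: int
    x_center: float  # normalized to [0, 1] by image width
    y_center: float  # normalized to [0, 1] by image height
    width: float     # normalized to [0, 1] by image width
    height: float    # normalized to [0, 1] by image height


def parse_yolo_line(line: str) -> YoloBox:
    """Parse one 'class x_center y_center width height' line."""
    parts = line.split()
    if len(parts) != 5:
        raise ValueError(f"expected 5 fields, got {len(parts)}: {line!r}")
    return YoloBox(int(parts[0]), *(float(p) for p in parts[1:]))


def to_pixel_corners(box: YoloBox, img_w: int, img_h: int) -> Tuple[float, float, float, float]:
    """Return (x_min, y_min, x_max, y_max) of the box in pixels."""
    x_min = (box.x_center - box.width / 2) * img_w
    y_min = (box.y_center - box.height / 2) * img_h
    x_max = (box.x_center + box.width / 2) * img_w
    y_max = (box.y_center + box.height / 2) * img_h
    return x_min, y_min, x_max, y_max


if __name__ == "__main__":
    box = parse_yolo_line("0 0.5 0.5 0.2 0.4")
    print(CLASS_NAMES[box.class_id], to_pixel_corners(box, img_w=100, img_h=200))
```

Because the coordinates are normalized, the same label file remains valid if the image is resized without cropping.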
  4.5. Construction of the Training, Validation, and Testing Sets of the Disease Images
Each disease image in the image data sets for the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, and milk development stage was labeled using the LabelImg software, and the generated .txt label files were saved in separate folders corresponding to the growth stages. All the disease images in the image data sets were original images without any preprocessing. Using the os, shutil, random, and tqdm libraries in the Python programming language, the disease images for each growth stage and their corresponding .txt label files were divided into training, validation, and testing sets at a ratio of 8:1:1. The training, validation, and testing sets for all the growth stages were formed by pooling the corresponding sets (images and .txt label files) of the six individual growth stages. The numbers of disease images in the training, validation, and testing sets constructed using the images acquired during the different growth stages of wheat are shown in Table 11.
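The 8:1:1 division described above could be sketched as follows. This is not the authors' script: the folder layout (`images/<subset>` and `labels/<subset>`), the image extensions, the fixed random seed, and the function names `split_names` and `copy_split` are all assumptions for illustration, and the tqdm progress bar is omitted so that only the standard library is needed.

```python
import os
import random
import shutil


def split_names(names, ratios=(8, 1, 1), seed=0):
    """Shuffle file names reproducibly and divide them by the given ratios."""
    names = sorted(names)
    random.Random(seed).shuffle(names)
    total = sum(ratios)
    n_train = len(names) * ratios[0] // total
    n_val = len(names) * ratios[1] // total
    return {
        "train": names[:n_train],
        "val": names[n_train:n_train + n_val],
        "test": names[n_train + n_val:],  # the remainder goes to the testing set
    }


def copy_split(image_dir, label_dir, out_dir, seed=0):
    """Copy each image and its same-named .txt label file into subset folders."""
    images = [f for f in os.listdir(image_dir)
              if f.lower().endswith((".jpg", ".jpeg", ".png"))]
    split = split_names(images, seed=seed)
    for subset, files in split.items():
        img_out = os.path.join(out_dir, "images", subset)
        lbl_out = os.path.join(out_dir, "labels", subset)
        os.makedirs(img_out, exist_ok=True)
        os.makedirs(lbl_out, exist_ok=True)
        for name in files:
            label = os.path.splitext(name)[0] + ".txt"
            shutil.copy2(os.path.join(image_dir, name), img_out)
            shutil.copy2(os.path.join(label_dir, label), lbl_out)
    return split
```

Splitting by image name and deriving the label name from it keeps every image together with its label file in the same subset.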
  4.6. Disease Image Identification Model Training and Testing
A file named YOLO.yaml was created in the folder named data in the root directory of the YOLOv5 source code. In YOLO.yaml, “path” was set to the directory where the disease images for an individual growth stage or for all the growth stages were saved; “train”, “val”, and “test” were set to the paths of the training, validation, and testing sets of the disease images, respectively; “nc” was set to the number of classes, which was 2 in this study; and “names” was set to the class names, which were “TX” and “YX” in this study, referring to wheat stripe rust and wheat leaf rust, respectively. When training the YOLOv5s-based models for image identification of stripe rust and leaf rust on wheat in the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, milk development stage, and all the growth stages, the values of “path”, “train”, “val”, and “test” in YOLO.yaml were changed to match the saved paths of the disease images and the corresponding .txt label files of the respective image data sets.
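For illustration, the content of such a data configuration file could be generated as in the sketch below. The keys follow the description above, but the directory names passed in (for example `datasets/wheat_rust_seedling` and the `images/<subset>` subfolders) are hypothetical placeholders, not the paths used in this study, and `build_data_yaml` is a hypothetical helper.

```python
def build_data_yaml(root, train="images/train", val="images/val",
                    test="images/test", names=("TX", "YX")):
    """Return the text of a YOLOv5-style data configuration file."""
    lines = [
        f"path: {root}",    # saved path of the image data set
        f"train: {train}",  # training set path, relative to 'path'
        f"val: {val}",      # validation set path, relative to 'path'
        f"test: {test}",    # testing set path, relative to 'path'
        f"nc: {len(names)}",  # number of classes
        "names: [" + ", ".join(f"'{n}'" for n in names) + "]",
    ]
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # Hypothetical root folder for one growth stage.
    print(build_data_yaml("datasets/wheat_rust_seedling"))
```

Switching between growth stages then only requires changing the `root` argument, which mirrors the path changes described above.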
The file yolov5s.yaml, located in the folder named models in the root directory of the YOLOv5 source code, was opened, and the number following “nc” in this file was changed to “2”. The file hyp.scratch.yaml, located in the subfolder hyps of the folder data, contains various hyperparameters, including the learning rate, momentum, and weight decay. The hyperparameter values can be modified according to the model training requirements to optimize the model.
In this study, the YOLOv5s network models were trained on the training sets of wheat stripe rust and wheat leaf rust images for the seedling growth stage, stem elongation stage, booting stage, inflorescence emergence stage, anthesis stage, milk development stage, and all the growth stages by running the file train.py located in the root directory of the YOLOv5 source code, yielding one image identification model for each individual growth stage and one for all the growth stages. During each training run, the corresponding validation set was used, and the model weight file best.pt of the best-performing model was saved. The file best.pt was then used to test the built image identification model on the testing sets by running the file test.py in the root directory of the YOLOv5 source code. Because of the limitations of the hardware configuration, the parameters for training the YOLOv5s-based image identification models of wheat stripe rust and wheat leaf rust in the different growth stages were set as shown in Table 12.
  4.7. Evaluation Metrics of the Disease Image Identification Models
To evaluate the performance of the built YOLOv5s-based image identification models in identifying the images of each individual disease in the testing sets, Precision, Recall, F1 Score, and AP were used as the evaluation metrics. To evaluate the overall performance of the models in identifying the images of wheat stripe rust and wheat leaf rust in the testing sets, mean F1 Score and mAP were used as the evaluation metrics.
Precision is the proportion of actual positive samples among the samples identified as positive in the final identification results, and it can be calculated using Formula (1).
$$\mathrm{Precision} = \frac{TP}{TP + FP} \tag{1}$$
Recall is the proportion of correctly identified positive samples among the samples that are actually positive, and it can be calculated using Formula (2).
$$\mathrm{Recall} = \frac{TP}{TP + FN} \tag{2}$$
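F1 Score is the harmonic mean of Precision and Recall, and it can be calculated according to Formula (3).
$$\mathrm{F1\ Score} = \frac{2 \times \mathrm{Precision} \times \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}} \tag{3}$$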
AP is the area under the Precision–Recall curve, with Recall as the abscissa and Precision as the ordinate, and it can be calculated according to Formula (4), where P(R) denotes Precision as a function of Recall R.
$$AP = \int_{0}^{1} P(R)\,\mathrm{d}R \tag{4}$$
Mean F1 Score is the mean of the F1 Score values of all the classes, and it can be calculated using Formula (5).
$$\mathrm{Mean\ F1\ Score} = \frac{1}{N} \sum_{i=1}^{N} \mathrm{F1\ Score}_i \tag{5}$$
mAP is the mean of the AP values of all the classes, and it can be calculated using Formula (6).
$$mAP = \frac{1}{N} \sum_{i=1}^{N} AP_i \tag{6}$$
In the above formulas, TP (true positive) represents the number of positive samples identified as positive samples; FP (false positive) represents the number of negative samples identified as positive samples; FN (false negative) represents the number of positive samples identified as negative samples; N denotes the number of the target classes; and i denotes the ith class.
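As a worked illustration of the metric definitions in this section, the sketch below computes them from raw counts and from sampled points on a Precision–Recall curve. It is a simplified sketch: the area under the curve is approximated with the trapezoidal rule, whereas the YOLOv5 code applies its own interpolation of the curve, so values can differ slightly. The function names and the numbers in the usage example are made up for illustration and are not results from this study.

```python
def precision(tp, fp):
    """Proportion of identified positives that are actually positive."""
    return tp / (tp + fp) if (tp + fp) else 0.0


def recall(tp, fn):
    """Proportion of actual positives that are identified as positive."""
    return tp / (tp + fn) if (tp + fn) else 0.0


def f1_score(p, r):
    """Harmonic mean of Precision and Recall."""
    return 2 * p * r / (p + r) if (p + r) else 0.0


def average_precision(recalls, precisions):
    """Trapezoidal approximation of the area under the Precision-Recall curve."""
    points = sorted(zip(recalls, precisions))
    area = 0.0
    for (r0, p0), (r1, p1) in zip(points, points[1:]):
        area += (r1 - r0) * (p0 + p1) / 2.0
    return area


def mean(values):
    """Mean over the N classes, used for both mean F1 Score and mAP."""
    values = list(values)
    return sum(values) / len(values)


if __name__ == "__main__":
    # Made-up counts for two classes (e.g., TX and YX).
    counts = {"TX": dict(tp=6, fp=2, fn=4), "YX": dict(tp=9, fp=1, fn=1)}
    f1_values = []
    for name, c in counts.items():
        p, r = precision(c["tp"], c["fp"]), recall(c["tp"], c["fn"])
        f1_values.append(f1_score(p, r))
        print(name, round(p, 3), round(r, 3), round(f1_values[-1], 3))
    print("mean F1 Score:", round(mean(f1_values), 3))
```

The same `mean` helper applied to the per-class AP values gives mAP.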