Intelligent Monitoring System to Assess Plant Development State Based on Computer Vision in Viticulture

: Plant health plays an important role in inﬂuencing agricultural yields and poor plant health can lead to signiﬁcant economic losses. Grapes are an important and widely cultivated plant, especially in the southern regions of Russia. Grapes are subject to a number of diseases that require timely diagnosis and treatment. Incorrect identiﬁcation of diseases can lead to large crop losses. A neural network deep learning dataset of 4845 grape disease images was created. Eight categories of common grape diseases typical of the Black Sea region were studied: Mildew, Oidium, Anthracnose, Esca, Gray rot, Black rot, White rot, and bacterial cancer of grapes. In addition, a set of healthy plants was included. In this paper, a new selective search algorithm for monitoring the state of plant development based on computer vision in viticulture, based on YOLOv5, was considered. The most difﬁcult part of object detection is object localization. As a result, the fast and accurate detection of grape health status was realized. The test results showed that the accuracy was 97.5%, with a model size of 14.85 MB. An analysis of existing publications and patents found using the search “Computer vision in viticulture” showed that this technology is original and promising. The developed software package implements the best approaches to the control system in viticulture using computer vision technologies. A mobile application was developed for practical use by the farmer. The developed software and hardware complex can be installed in any vehicle. Such a mobile system will allow for real-time monitoring of the state of the vineyards and will display it on a map. The novelty of this study lies in the integration of software and hardware. Decision support system software can be adapted to solve other similar problems. The software product commercialization plan is focused on the automation and robotization of agriculture, and will form the basis for adding the next set of similar software.


Introduction
Another area of application of computer vision in agriculture is the automation of the process of harvesting and processing crops. Computer vision systems can automatically determine the maturity and size of fruits and vegetables, identify damage and disease, and classify and sort products according to various parameters. This allows one to increase the productivity and quality of agricultural products, as well as reduce the cost of their collection and processing [1][2][3]. Computer vision can also be used to control the quality of planting material and seeds, control the area under crop, determine the level of fertilizers and pesticides, and automatically control and monitor watering and irrigation systems [4][5][6][7]. In general, the use of computer vision in agriculture can improve the productivity, efficiency, and economic efficiency of agricultural production [8][9][10][11][12].
Diseases of grapes are a serious problem for their cultivation in the southern regions of Russia. Grapes are one of the most important agricultural crops, as they represent a

Diseases of Grapes Typical of the Black Sea Region
Two of the most common grape diseases in the southern regions of Russia are Grape Powdery Mildew (Peronospora viticola) and Oidium (Uncinula necator).
The peak of development falls in the period from mid-June to mid-August, depend- The developed software and hardware complex can be installed in any vehicle. Such a mobile system will allow for real-time monitoring of the state of the vineyards and will display it on a map. The novelty of this study lies in the integration of software and hardware.
The main objectives of this project were as follows: -Develop architecture, algorithms, and an original software and hardware complex based on the computer vision system for detecting and predicting the development of grape diseases. -Create a web service and a mobile application for the provision of an intelligent grape disease recognition system service for agricultural enterprises and winegrowers in southern Russia.

Diseases of Grapes Typical of the Black Sea Region
Two of the most common grape diseases in the southern regions of Russia are Grape Powdery Mildew (Peronospora viticola) and Oidium (Uncinula necator).
The peak of development falls in the period from mid-June to mid-August, depending on the ripening period of a particular variety. Also, this disease of grapes is characterized by outbreaks in early autumn, since treatments are stopped in the post-harvest period. This fungal disease is characterized by massive damage, affecting immature berries, chlorophyllcontaining tissues (leaves and young shoots), and the ridges of bunches, which leads to rapid drying of the latter. Oidium actively develops in both wet and dry hot conditions. This fungus does not require drip-liquid moisture (dew, precipitation) for spore germination.
The long-term effects of Oidium in the vineyards of the southern coast of Crimea indicate the continuity and relative constancy of the intensity of its development over the years, as well as a less-significant dependence on weather conditions than Mildew. At the same time, the average dependence of the intensity of Oidium development on leaves and relative air humidity in May was found. The established patterns indicate the importance of mandatory monitoring and short-term forecasting of the development of Oidium ( Figure 2).  Mildew (Peronospora viticola). Requires a large amount of drip-liquid moisture.
In recent years, September has been considered the most favorable period for the development of Mildew, when the heat subsides and humidity increases. During this time, the fungus infects the apical stepson leaves. In the temperature range of +10-+30 °C and at a humidity of 70% after the first prolonged precipitation, spores germinate, and this period lasts for 2-3 months. At the peak of growth, the fungus infects almost all green parts of the bush and penetrates through stomata into buds and flowers and then into berries, which is expressed in the wrinkling of the fruit at the stalk, followed by drying.
The seasonal dynamics of the epiphytotic process of Mildew in the Southwestern Mildew (Peronospora viticola). Requires a large amount of drip-liquid moisture.
In recent years, September has been considered the most favorable period for the development of Mildew, when the heat subsides and humidity increases. During this time, the fungus infects the apical stepson leaves. In the temperature range of +10-+30 • C and at a humidity of 70% after the first prolonged precipitation, spores germinate, and this period lasts for 2-3 months. At the peak of growth, the fungus infects almost all green parts of the bush and penetrates through stomata into buds and flowers and then into berries, which is expressed in the wrinkling of the fruit at the stalk, followed by drying.
The seasonal dynamics of the epiphytotic process of Mildew in the Southwestern Crimea were determined via hydrothermal conditions. In this regard, the time of manifestation of the first visual signs of the disease varied greatly throughout the years of the study (2014-2022).
The highest level of disease intensity was recorded in 2015. On average, during this period, Mildew developed moderately on leaves and to a lesser extent on bunches.
Thus, in the ampelocenoses of the Southwestern zone of Crimean viticulture, the development of Mildew from the year was continuous and uneven. The long-term dynamics of the disease indicate a high dependence of the development of the disease on the leaves and on the amount of precipitation in the period from May to August, which indicates the significant importance of the short-term forecast for the development of Mildew grapes ( Figure 3).  An analysis of the seasonal dynamics of the epiphytotic process of Oidium and Mildew in the vine plantations of the southern coast of Crimea allows us to state their relative constancy.
Also, in the Black Sea region, there are the following diseases of grapes: Anthracnose (Gloeosporium ampelophagum). This grape disease develops in spring during cool, damp weather. All young green organs of the plant suffer, as well as unripe berries. A point lesion turns into ulcers, which can ring the shoots and inflorescences, and as a result of which the latter break or dry out.
Alternariosis (Alternaria). The fungus develops in a wide temperature range and at any humidity. Overwintering seen in buds and on plant debris in the soil. In spring, with air currents and rain splashes, spores spread through the leaves. On the green parts and berries, characteristic bulges appear, quickly taking the form of clearly defined brown, red, and dark brown spots, which tend to merge and lead to a large-scale lesion.
Esca is a fungal disease that can affect all parts of the bush and often leads to its death. The disease is caused by several types of fungi. The main ones are Fomitiporia mediterranea, Phaeomoniella chlamydospore, and Phaeoacremonium aleophilum.
Black rot (Guignardia bidwellii). Requires a large amount of drip-liquid moisture. It develops most actively in a hot climate rich in precipitation. It affects vegetative and chlorophyll-containing organs: leaves, young shoots, and ridges, as well as unripe berries at the stage of cluster formation.
White rot (Coniella diplodiella). The disease affects the above-ground parts of plants and causes maximum damage when it covers berries and shoots on mother liquors of rootstock vines in the absence of support. Infects the peduncle pads.
Gray rot (Botrytis cinerea). It affects all green organs and berries in any climate, re- An analysis of the seasonal dynamics of the epiphytotic process of Oidium and Mildew in the vine plantations of the southern coast of Crimea allows us to state their relative constancy.
Also, in the Black Sea region, there are the following diseases of grapes: Anthracnose (Gloeosporium ampelophagum). This grape disease develops in spring during cool, damp weather. All young green organs of the plant suffer, as well as unripe berries. A point lesion turns into ulcers, which can ring the shoots and inflorescences, and as a result of which the latter break or dry out.
Alternariosis (Alternaria). The fungus develops in a wide temperature range and at any humidity. Overwintering seen in buds and on plant debris in the soil. In spring, with air currents and rain splashes, spores spread through the leaves. On the green parts and berries, characteristic bulges appear, quickly taking the form of clearly defined brown, red, and dark brown spots, which tend to merge and lead to a large-scale lesion.
Esca is a fungal disease that can affect all parts of the bush and often leads to its death. The disease is caused by several types of fungi. The main ones are Fomitiporia mediterranea, Phaeomoniella chlamydospore, and Phaeoacremonium aleophilum.
Black rot (Guignardia bidwellii). Requires a large amount of drip-liquid moisture. It develops most actively in a hot climate rich in precipitation. It affects vegetative and chlorophyll-containing organs: leaves, young shoots, and ridges, as well as unripe berries at the stage of cluster formation. White rot (Coniella diplodiella). The disease affects the above-ground parts of plants and causes maximum damage when it covers berries and shoots on mother liquors of rootstock vines in the absence of support. Infects the peduncle pads.
Gray rot (Botrytis cinerea). It affects all green organs and berries in any climate, resistant to adverse conditions. Bacterial cancer (Agrobacterium tumefaciens) is an incurable disease of cultivated grapes, leading to the death of the bush. There are ground and root forms of the disease. The disease is caused by the aerobic rod-shaped bacterium Agrobacterium tumefaciens, which is found everywhere in all types of soil.

Comparison of the Characteristics of Neural Networks of Different Architectures Using Computer Vision
In general, an artificial neural network is a mathematical model, and a type of software implementation, created on the principles of organization and functioning of biological neural networks-networks of nerve cells of a living organism. The ANN includes grouped neurons called layers. The classification of types of neural networks is given in Table 1 [24,25]. A convolutional neural network (ConvNet / CNN) is a deep learning algorithm that is able to receive an image as an input, set digestible weights and biases to different areas in the image, and distinguish between these areas. A feature of the network is less pre-processing, unlike other classification algorithms. Also, a CNN, with proper training, is able to independently learn filters and characteristics, unlike primitive methods (see Figure 4) [26][27][28][29].
Computation 2023, 11, x FOR PEER REVIEW 6 of 20 grouped neurons called layers. The classification of types of neural networks is given in Table 1 [24,25]. A convolutional neural network (ConvNet / CNN) is a deep learning algorithm that is able to receive an image as an input, set digestible weights and biases to different areas in the image, and distinguish between these areas. A feature of the network is less preprocessing, unlike other classification algorithms. Also, a CNN, with proper training, is able to independently learn filters and characteristics, unlike primitive methods (see  The architecture of the CNN is based on the way the visual cortex is organized. Individual neurons are only able to respond to stimuli in a narrow region of the visual field known as the receptive field. Unlike feedforward networks, which operate on data in the  The architecture of the CNN is based on the way the visual cortex is organized. Individual neurons are only able to respond to stimuli in a narrow region of the visual field known as the receptive field. Unlike feedforward networks, which operate on data in the form of vectors, convolutional networks operate on images in the form of tensors. Tensors are 3D arrays of numbers. Images on the computer are displayed in the video image, each pixel is the value corresponding to the corresponding parameters. In this case, each of the channels means an integer from 0 to 255. Most often, color images are used, which are based on RGB material-particles containing volumes in three channels: red, green and blue. The coverage of the entire zone of vision is achieved by the combination of such fields. Figure 5 shows an RGB image divided into three color channels-red, green, and blue. In addition to the RGB, there are other color spaces: grayscale, HSV, RGB, CMYK, etc. Given this, you can imagine an 8K image (7680 × 4320) and what will becom puting resources. A convolutional neural network is needed to transform the in in such a way that it is easier to process later without losing quality and chara which play a significant role in obtaining a reliable forecast. This is also importan of scalability for massive datasets.
Convolutional neural networks are based on filters that recognize certain im acteristics (straight lines, geometric shapes). A filter is one or a collection of ke kernel is a regular matrix of numbers, which are weight coefficients. These w adjusted to search for certain characteristics in the image [30][31][32]. The kernel mo the image and determines the presence or absence of the desired characteristic in part of it. To obtain an answer, the sum of the products of the filter element matrix of the input signals is calculated. Such a process is called a convolution o see Figure 6. Given this, you can imagine an 8K image (7680 × 4320) and what will become of computing resources. A convolutional neural network is needed to transform the input image in such a way that it is easier to process later without losing quality and characteristics, which play a significant role in obtaining a reliable forecast. This is also important in terms of scalability for massive datasets.
Convolutional neural networks are based on filters that recognize certain image characteristics (straight lines, geometric shapes). A filter is one or a collection of kernels. The kernel is a regular matrix of numbers, which are weight coefficients. These weights are adjusted to search for certain characteristics in the image [30][31][32]. The kernel moves along the image and determines the presence or absence of the desired characteristic in a specific part of it. To obtain an answer, the sum of the products of the filter elements and the matrix of the input signals is calculated. Such a process is called a convolution operation; see Figure 6.
If there is a characteristic in the image fragment, the convolution operation produces a number with a relatively large value as an output. If there is no characteristic, the output number will be small [32,33].
Let us analyze the convolution process on a two-dimensional convolution; see The 2D convolution operation, or 2D convolution, is the foundation of convolutional neural networks. The whole algorithm of work is concentrated in the following way: the kernel, which is a matrix of weights (weight matrix), iterates through the image and iteratively performs the multiplication operation on the input values that the matrix is currently over, and then all of the values are summed into one output pixel.
kernel is a regular matrix of numbers, which are weight coefficients. These weights are adjusted to search for certain characteristics in the image [30][31][32]. The kernel moves along the image and determines the presence or absence of the desired characteristic in a specific part of it. To obtain an answer, the sum of the products of the filter elements and the matrix of the input signals is calculated. Such a process is called a convolution operation; see Figure 6. If there is a characteristic in the image fragment, the convolution operation produces a number with a relatively large value as an output. If there is no characteristic, the output number will be small [32,33].  The 2D convolution operation, or 2D convolution, is the foundation of convolutiona neural networks. The whole algorithm of work is concentrated in the following way: th kernel, which is a matrix of weights (weight matrix), iterates through the image and iter atively performs the multiplication operation on the input values that the matrix is cur rently over, and then all of the values are summed into one output pixel.
All of this work of multiplication and summation is repeated with each place that th kernel is above, while at the same time transforming the input matrix into another featur matrix. The output feature matrix is a matrix of weighted sums of the input features where the weight values are determined by the kernel. All of this work of multiplication and summation is repeated with each place that the kernel is above, while at the same time transforming the input matrix into another feature matrix. The output feature matrix is a matrix of weighted sums of the input features, where the weight values are determined by the kernel.
In order to determine what kernel size is needed for a convolutional network, it is necessary to find out the number of features combined to obtain a new feature at the output. This will determine the size of the kernel [33][34][35][36][37][38]. Figure 8 shows the situation in which the input is 5 × 5 = 25, and the output is 3 × 3 = 9 signs. Considering a situation where the convolution process is not applied, but the standard layer is used (standard fully connected layer), then with this option, the weight matrix would consist of 25 × 9 = 225 parameters.
As you can see from the example, using the convolution process, subsequent operations with only 9 parameters can be carried out, since each feature in the output is the result of analyzing only one input, which is located in "about the same place", and not each feature in the input [39,40].
Top 1 and Top 5 accuracy refers to the performance of the model on the ImageNet validation dataset. Depth refers to the topological depth of the network. It includes activation layers, batch normalization layers, etc. The convolution operation can produce two types of results: the first, in which the final feature has a lower dimension than the input, while in the other, the dimension either increases or does not change (Table 2). kernel, which is a matrix of weights (weight matrix), iterates through the image and iteratively performs the multiplication operation on the input values that the matrix is currently over, and then all of the values are summed into one output pixel.
All of this work of multiplication and summation is repeated with each place that the kernel is above, while at the same time transforming the input matrix into another feature matrix. The output feature matrix is a matrix of weighted sums of the input features, where the weight values are determined by the kernel. In order to determine what kernel size is needed for a convolutional network, it is necessary to find out the number of features combined to obtain a new feature at the output. This will determine the size of the kernel [33][34][35][36][37][38].  Depth refers to the topological depth of the network. It includes activation layers, batch normalization layers, etc. [28,29].

Development of a Technique for Preparing and Marking an Image for Training Neural Networks
A neural network deep learning dataset of 4845 grape disease images was created by the authors. Eight categories of common grape diseases typical of the Black Sea region were studied: Mildew, Oidium, Anthracnose, Esca, Gray rot, Black rot, White rot, Grape rubella, Grape chlorosis, and bacterial cancer of grapes (Table 3). LabelIMG was used to mark up images for the dataset. This software is often used in the field of machine learning, especially in computer vision tasks where a dataset needs to be labeled to train a model. Figure 9 shows the markup of one of the images for training the model in the first stage.

Development of a Technique for Preparing and Marking an Image for Training Neural Networks
A neural network deep learning dataset of 4845 grape disease images was created by the authors. Eight categories of common grape diseases typical of the Black Sea region were studied: Mildew, Oidium, Anthracnose, Esca, Gray rot, Black rot, White rot, Grape rubella, Grape chlorosis, and bacterial cancer of grapes (Table 3). LabelIMG was used to mark up images for the dataset. This software is often used in the field of machine learning, especially in computer vision tasks where a dataset needs to be labeled to train a model. Figure 9 shows the markup of one of the images for training the model in the first stage. The dataset markup data for primary training is shown in Figure 10. The dataset markup data for primary training is shown in Figure 10. The graph of the algorithm efficiency for the last training of the model is shown in Figure 11 and it can be seen that the efficiency is at a fairly high level. The graph of the algorithm efficiency for the last training of the model is shown in Figure 11 and it can be seen that the efficiency is at a fairly high level. To separately assess the quality of the algorithm on each of the classes, there are precision (accuracy) and recall (completeness) metrics. Precision can be interpreted as the  The graph of the algorithm efficiency for the last training of the model is shown in Figure 11 and it can be seen that the efficiency is at a fairly high level. To separately assess the quality of the algorithm on each of the classes, there are precision (accuracy) and recall (completeness) metrics. Precision can be interpreted as the To separately assess the quality of the algorithm on each of the classes, there are precision (accuracy) and recall (completeness) metrics. Precision can be interpreted as the proportion of objects called positive by the classifier and at the same time are really positive, and recall shows what proportion of objects of a positive class out of all objects of a positive class the algorithm found. Confidence is the probability with which the found object will correspond to the class assigned to it. The F1 parameter provides a general estimate of the trade-off relationship between the accuracy and completeness of the model. It is used in problems where both the degree of true positive predictions and the degree of prediction of all true positive events are important.
Graphs showing the ratios of these quantities for the last training of the model are presented in Figures 12-14.
The graphs show that the model has high accuracy and a small number of unrecognized lesions, which indicates the correct training, as well as high key indicators and no retraining of the model.       The graphs show that the model has high accuracy and a small number of unrecognized lesions, which indicates the correct training, as well as high key indicators and no retraining of the model.

Development of the Interface and Program Module of the System
To automate the process of the primary processing of the results, a desktop version of the system was developed, in which a set of images (dataset) are loaded and then detected using the chosen neural network architecture. To upload a set of images, one must click on the "Upload Images" button in the menu bar or sidebar. After the user clicks on the button, the program will open the "Image Folder" dialog box to select the folder where the images are stored. To start the detection and recognition of objects in images, one must press the "Detection" button, which is located on the side menu. After pressing it, the search for affected areas of grapes in the images will start and the "Detection" tab will open. The result of the object detection is shown in Figures 15 and 16.
To view information about the disease found in the "Detection" tab, one must click the "Information" button; after which, the "Disease Information" window will open with a description of the disease, instructions for treatment, and information about the nearest point that one can go to for help. The result of displaying information about the disease for one of the pictures is shown in the figure.

Development of the Interface and Program Module of the System
To automate the process of the primary processing of the results, a desktop version of the system was developed, in which a set of images (dataset) are loaded and then detected using the chosen neural network architecture. To upload a set of images, one must click on the "Upload Images" button in the menu bar or sidebar. After the user clicks on the button, the program will open the "Image Folder" dialog box to select the folder where the images are stored. To start the detection and recognition of objects in images, one must press the "Detection" button, which is located on the side menu. After pressing it, the search for affected areas of grapes in the images will start and the "Detection" tab will open. The result of the object detection is shown in Figures 15 and 16.
To view information about the disease found in the "Detection" tab, one must click the "Information" button; after which, the "Disease Information" window will open with a description of the disease, instructions for treatment, and information about the nearest point that one can go to for help. The result of displaying information about the disease for one of the pictures is shown in the figure.
Also in the window is a tab "Objects", which displays similar images where the disease was found in the form of a list of images. The page with the output results is shown in Figure 16.
At the moment, a mobile application for the detection of grape diseases is being developed. In the future, the applications will be integrated into a software and hardware complex that will allow for real-time monitoring of the state of fields with grapes. Also in the window is a tab "Objects", which displays similar images where the disease was found in the form of a list of images. The page with the output results is shown in Figure 16.  Also in the window is a tab "Objects", which displays similar images where the disease was found in the form of a list of images. The page with the output results is shown in Figure 16.

Discussion
The yield of agricultural crops depends directly on the quality of field work, land cultivation, and timely receipt of information from sown areas through operational monitoring. Using large aircraft for these purposes is expensive and not always effective. Traditional methods of sampling plants and assessing their physical and chemical state in laboratories are time-consuming. And, besides, manual collection carries the risk of damage to cultures and the destruction of the studied specimens during laboratory work [40,41].
Traditional methods of controlling crop growth and development are based entirely on manual labor, and less often on the use of expensive aircraft. The monitoring of leaf blades, stem, and soil profiles carried out in this way for the presence of various problems has a number of disadvantages. This is significantly long and expensive, which in turn leads to a slowdown in the development of the agricultural sector in specific areas. It is also unable to determine as accurately as possible the whole picture of the state of crops as a whole. These are also factors that are directly determined by the work of people. In particular, we are talking about labor productivity and compliance with the technologies of cultivated crops [42][43][44].
Machine vision systems are becoming increasingly common onboard agricultural vehicles (autonomous and non-autonomous) for different tasks. The use of robotic systems, equipped with vision-based sensors, for site-specific treatments in Precision Agriculture (PA) is seeing continuous growth [45][46][47][48]. A common practice consists of image processing for deceases or crop identification. However, complex systems are rare. The developed application is part of the hardware and software complex shown in Figures 17 and 18.     The software and hardware complex can be installed in any vehicle. Such a mobile system will allow for real-time monitoring of the state of the vineyards and will display it on an interactive map ( Figure 19). The novelty of this study lies in the integration of software and hardware. So far, a dataset of images of grape diseases typical of the Black Sea region has been gathered, a neural network for the recognition of grape diseases has been constructed, and a desktop application and a mobile application have been developed. In the experimental version, a hardware complex for automated data collection was implemented. Detailed economic calculations may be considered in future studies. Accuracy scores and metrics will be compared to the existing ones. The software package implemented the best approaches to a control system in viticulture using computer vision technologies.

Conclusions
Grapes are very susceptible to damage caused by disease, which can be caused by insects or fungi. The effect of this damage is further exacerbated by now-more-frequent events caused by global warming, such as unseasonable temperatures and extreme weather. Given these factors, combined with global instability and rising prices, the need So far, a dataset of images of grape diseases typical of the Black Sea region has been gathered, a neural network for the recognition of grape diseases has been constructed, and a desktop application and a mobile application have been developed. In the experimental version, a hardware complex for automated data collection was implemented. Detailed economic calculations may be considered in future studies. Accuracy scores and metrics will be compared to the existing ones. The software package implemented the best approaches to a control system in viticulture using computer vision technologies.

Conclusions
Grapes are very susceptible to damage caused by disease, which can be caused by insects or fungi. The effect of this damage is further exacerbated by now-more-frequent events caused by global warming, such as unseasonable temperatures and extreme weather. Given these factors, combined with global instability and rising prices, the need to protect plants from disease cannot be overemphasized. To this end, this article evaluated the adaptation of deep learning convolutional neural networks to classify grape diseases according to the images of infected leaves.
Technology has penetrated every aspect of everyday life with varying degrees of success and influence. In this regard, artificial intelligence is one of the most promising and widespread areas for innovation. Deep learning algorithms can be used to solve many research problems. Convolutional neural networks are best suited for image-based applications. The results in this article demonstrate that the transfer learning approach has the potential to achieve robust performance. These results show that it is possible to classify grape diseases in such a way that the system satisfies performance requirements when deployed in the field. Moreover, the approach in this paper works with little overhead, no preprocessing or image processing, and no explicit feature extraction.
LabelIMG was used to prepare the dataset. LabelIMG is a data preparation software tool used for training computer models based on deep learning. It is designed to simplify and speed up the process of creating an image markup, which is a necessary step in the field of computer vision and object recognition.
The use of intelligent solutions using machine vision and video analytics allows companies to achieve benefits that positively affect the overall economic effect: -Saving time. A fully automated system not only works much faster, but can also work 24/7 if required. -Accuracy. Computer-vision-based decision making allows manufacturing companies to achieve higher levels of accuracy within acceptable tolerances. The combination of special equipment and advanced machine vision algorithms achieves a near-perfect level of precision in production and quality control.
In this study, the authors considered a new selective search algorithm for monitoring the state of plant development based on computer vision in viticulture, based on YOLOv5. The most difficult part of object detection is object localization. There are many ways to search for objects in an image; one of the methods is to use a sliding window of different sizes. As a result, the fast and accurate detection of grape health status was realized. The test results show that the accuracy was 97.5%, with a model size of 14.85 MB.
The software product commercialization plan is focused on the automation and robotization of agriculture, and will become the basis for adding the next set of similar software.
A dataset of images of grape diseases typical of the Black Sea region was created. The developed software and hardware complex can be installed in any vehicle. Such a mobile system will allow for real-time monitoring of the state of the vineyards and will display it on a map. The novelty of this study lies in the integration of software and hardware.

Conflicts of Interest:
The authors declare no conflict of interest.