Object-Based Wetland Vegetation Classiﬁcation Using Multi-Feature Selection of Unoccupied Aerial Vehicle RGB Imagery

: Wetland vegetation is an important component of wetland ecosystems and plays a crucial role in the ecological functions of wetland environments. Accurate distribution mapping and dynamic change monitoring of vegetation are essential for wetland conservation and restoration. The development of unoccupied aerial vehicles (UAVs) provides an efﬁcient and economic platform for wetland vegetation classiﬁcation. In this study, we evaluated the feasibility of RGB imagery obtained from the DJI Mavic Pro for wetland vegetation classiﬁcation at the species level, with a speciﬁc application to Honghu, which is listed as a wetland of international importance. A total of ten object-based image analysis (OBIA) scenarios were designed to assess the contribution of ﬁve machine learning algorithms to the classiﬁcation accuracy, including Bayes, K-nearest neighbor (KNN), support vector machine (SVM), decision tree (DT), and random forest (RF), multi-feature combinations and feature selection implemented by the recursive feature elimination algorithm (RFE). The overall accuracy and kappa coefﬁcient were compared to determine the optimal classiﬁcation method. The main results are as follows: (1) RF showed the best performance among the ﬁve machine learning algorithms, with an overall accuracy of 89.76% and kappa coefﬁcient of 0.88 when using 53 features (including spectral features (RGB bands), height information, vegetation indices, texture features, and geometric features) for wetland vegetation classiﬁcation. (2) The RF model constructed by only spectral features showed poor classiﬁcation results, with an overall accuracy of 73.66% and kappa coefﬁcient of 0.70. By adding height information, VIs, texture features, and geometric features to construct the RF model layer by layer, the overall accuracy was improved by 8.78%, 3.41%, 2.93%, and 0.98%, respectively, demonstrating the importance of multi-feature combinations. (3) The contribution of different types of features to the RF model was not equal, and the height information was the most important for wetland vegetation classiﬁcation, followed by the vegetation indices. (4) The RFE algorithm effectively reduced the number of original features from 53 to 36, generating an optimal feature subset for wetland vegetation classiﬁcation. The RF based on the feature selection result of RFE (RF-RFE) had the best performance in ten scenarios, and provided an overall accuracy of 90.73%, which was 0.97% higher than the RF without feature selection. The results illustrate that the combination of UAV-based RGB imagery and the OBIA approach provides a straightforward, yet powerful, approach for high-precision wetland vegetation classiﬁcation at the species level, in spite of limited spectral information. Compared with satellite data or UAVs equipped with other types of sensors, UAVs with RGB cameras are more cost efﬁcient and convenient for wetland vegetation monitoring and mapping.

and support vector machine (SVM) [32]. According to previous studies, RF and SVM are the most widely used models in classification research [39][40][41]. In addition, Bayes, Knearest neighbor (KNN), and decision tree (DT) are also widely used in OBIA [42], such as the mapping of asbestos cement roofs being implemented by Bayes and KNN [43], and land cover classification being realized by DT [44]. However, massive features acquired through OBIA may reduce the classification performance of machine learning algorithms when there is a large number of features over a finite subset [45]. Feature selection is a crucial step to optimize classification accuracy by eliminating irrelevant or redundant variables [46]. Cao et al. [47] used OBIA to classify mangrove species based on UAV hyperspectral images, and the results showed that feature selection effectively improved the classification accuracy of the SVM model from 88.66% to 89.55%. Zuo et al. [48] also pointed out that feature selection improved the accuracy of the RF model in object-based marsh vegetation classification, with an overall accuracy of 87.12% and kappa coefficient of 0.850, and the overall accuracy of the RF model without feature selection was less than 80%. Abeysinghe et al. [33] emphasized the influence of feature selection on machine learning classifiers when mapping Phragmites australis based on UAV multispectral images.
Taking the cost of multispectral and hyperspectral sensors into consideration, UAVs with RGB cameras are preferable for vegetation monitoring and mapping [25]. Digital orthophoto maps (DOM), digital surface models (DSM), and point clouds derived from RGB imagery provide multi-features for wetland vegetation classification. Therefore, machine learning algorithms based on feature selection enable UAV-based RGB imagery to achieve high-precision wetland vegetation classification, despite the limited spectral information. Bhatnagar et al. [26] used RF and the convolutional neural network (CNN) for raised bog vegetation mapping based on UAV RGB imagery, and the results showed that the classification accuracy of the CNN, based on multi-features, was over 90%. Both the studies of Boon et al. [49] and Corti Meneses et al. [27] highlighted that the combination of point clouds, DSM, and DOM can provide a more accurate description of wetland vegetation. Although good classification accuracy was achieved, few scholars have comprehensively evaluated the importance of multi-features and the performance of different machine learning algorithms based on UAV RGB imagery for wetland vegetation classification.
We aim to identify a cost-efficient and accurate method for wetland vegetation classification by evaluating the contribution of multi-features and machine learning algorithms, and exploring the applicability of UAV-based RGB imagery in wetland vegetation classification at the species level. We chose a vegetation-rich area in the south of the Honghu wetland as the research area, and used DOM and DSM, derived from the UAV-based RGB imagery, as the data source. The spectral information (RGB), vegetation indices (VIs), texture features, and geometric features were fully extracted from DOM, and the relative height information of vegetation was obtained from DSM. Specifically, we designed ten object-based classification scenarios to achieve the following objectives: (1) comprehensively compare the classification performance of Bayes, KNN, SVM, DT, and RF, to identify the best machine learning algorithm; (2) assess the contribution of different features to the best machine learning algorithm; (3) evaluate the effectiveness of feature selection to improve the classification accuracy of the best machine learning algorithm. This paper looks forward to realizing object-based classification of wetland vegetation in an economical, efficient and high-precision way, by using UAV-based RGB imagery.

Study Area
Honghu wetland (29 • 41 -29 • 58 N, 113 • 12 -113 • 28 E; Figure 1) is located in Southern Hubei Province and has a surface area of approximately 414 km 2 . As the seventh largest freshwater lake in China, Honghu was listed as an international important wetland under the Ramsar Convention in 2008 [50]. The study area belongs to a subtropical humid monsoon climate, with mean air temperatures ranging from +3.8 • C in January to +28.9 • C in July and annual mean precipitation ranging from 1000 mm to 1300 mm. Honghu Lake is a typical shallow lake formed by sediment silt with flat terrain, showing a zonal distribution of plants. More than 100 species of vegetation have been found in Honghu wetland during the field survey after 2010 [51], such as Humulus scandens, Alternanthera philoxeroides, Setaria viridis, Acalypha australis, Polygonum lapathifolium, Polygonum perfoliatum, Nelumbo nucifera, Zizania latifolia, Phragmites communis, Eichhornia crassipes, Potamogeton crispus, etc. The high plant biodiversity in Honghu wetland makes it a suitable and representative place for wetland vegetation classification.
Remote Sens. 2021, 13, x FOR PEER REVIEW 4 of 21 freshwater lake in China, Honghu was listed as an international important wetland under the Ramsar Convention in 2008 [50]. The study area belongs to a subtropical humid monsoon climate, with mean air temperatures ranging from +3.8 °C in January to +28.9 °C in July and annual mean precipitation ranging from 1000 mm to 1300 mm. Honghu Lake is a typical shallow lake formed by sediment silt with flat terrain, showing a zonal distribution of plants. More

Data Acquisition and Pre-Processing
We conducted a wetland vegetation field survey on 12 May 2021, and collected UAVbased RGB imagery from 11:00 a.m. to 12:00 p.m. using a DJI Mavic Pro with an L1D-20c camera. The UAV task was deployed at an altitude of 50 m, forward overlap of 80% and lateral overlap of 70% with clear and windless weather conditions. A total of 314 RGB images were obtained with a ground resolution of 1.16 cm. Pix4Dmapper was used to preprocess RGB images in the following three steps: (1) import 314 images into the software; (2) point cloud encryption and matching; (3) generation of DOM and DSM. We clipped image data and selected areas containing abundant aquatic vegetation as the study area according to the research requirement.
A ground verification sample collection was carried out simultaneously, and a total of 205 samples were obtained to assess the accuracy of ten classification scenarios. A handheld GPS was used to record the exact location of the samples in the accessible area. A DJI Mini2 was deployed to take photos in hovering mode at a height of 5-10 m to

Data Acquisition and Pre-Processing
We conducted a wetland vegetation field survey on 12 May 2021, and collected UAVbased RGB imagery from 11:00 a.m. to 12:00 p.m. using a DJI Mavic Pro with an L1D-20c camera. The UAV task was deployed at an altitude of 50 m, forward overlap of 80% and lateral overlap of 70% with clear and windless weather conditions. A total of 314 RGB images were obtained with a ground resolution of 1.16 cm. Pix4Dmapper was used to preprocess RGB images in the following three steps: (1) import 314 images into the software; (2) point cloud encryption and matching; (3) generation of DOM and DSM. We clipped image data and selected areas containing abundant aquatic vegetation as the study area according to the research requirement.
A ground verification sample collection was carried out simultaneously, and a total of 205 samples were obtained to assess the accuracy of ten classification scenarios. A handheld GPS was used to record the exact location of the samples in the accessible area. A DJI Mini2 was deployed to take photos in hovering mode at a height of 5-10 m to acquire verification samples from inaccessible areas. The vegetation type of these photographic verification samples was identified with the assistance of aquatic vegetation experts. Based on the official website, the horizontal positioning accuracy of DJI Mini2 is within ±0.1 m when the weather is cloudless and windless. Therefore, the location information of the verification samples was obtained from the photos, and it met the research requirements for wetland vegetation classification. According to the field survey, the following eight aquatic vegetation classes and two non-vegetation classes were identified in the study area: Zizania latifolia, Phragmites australis, Nelumbo nucifera, Alternanthera philoxeroides, Sambucus chinensis, Polygonum perfoliatum, Triarrhena lutarioriparia, water, tree shrubs (including Melia Azedarach, Pterocarya stenoptera, Broussonetia Papyrifera, Populus Euramevicana, and Sapium Sebiferum) and other (non-vegetation and non-water bodies).

Study Workflow
In this study, UAV-based RGB imagery was used to classify wetland vegetation with OBIA and feature selection. The methodology included the following six main phases: (1) optimal segmentation parameter determination and image segmentation; (2) 53 features derived from DOM and DSM; (3) comparison of machine learning algorithms based on 53 features; (4) evaluation of the contribution of different types of features; (5) effectiveness evaluation of feature selection; (6) accuracy assessment of ten object-based classification scenarios. Figure 2 shows the workflow of this study.

Image Segmentation
In our study, Trimble eCognition Developer 9.0, an object-based image analysis software, was used to implement efficient image segmentation through the multi-resolution segmentation algorithm [52]. The image segmentation process adopts the bottom-up region merging technique to merge adjacent pixels together based on the principle of minimum heterogeneity [53] to generate objects with similar or identical features. The multi-resolution segmentation algorithm contains the following four important parameters: layer weights, scale, shape index and compactness [54]. The setting of segmentation parameters affects the classification accuracy. The determination of segmentation parameters is generally performed by identifying the optimal parameter combination screened by repeated experiments and manual interpretation. In order to reduce the influence of subjective factors, we applied the estimation of scale parameter 2 (ESP2), a fast optimal segmentation scale selection tool. ESP2 generates the curves of local variance (LV) and rate of change (ROC) as a function of scale parameters, and the peak of the ROC curve represents the possible optimal segmentation scale [29].

Features Derived from DOM and DSM
Wetland vegetation distribution is greatly affected by factors such as water, soil and weather. It is challenging to distinguish wetland vegetation with homogenous spectral information from only optical information. Up to date, many scholars have explored the integration of multi-types of features to enhance the separability of vegetation [55][56][57]. In addition to spectral features (RGB), this paper also extracted height information, vegetation indices, texture and geometric features of the image objects generated by image segmentation for a total of 53 features for the next step of wetland vegetation classification and interpretation. The specific features are listed in Table 1

Image Segmentation
In our study, Trimble eCognition Developer 9.0, an object-based image analysis software, was used to implement efficient image segmentation through the multi-resolution segmentation algorithm [52]. The image segmentation process adopts the bottom-up region merging technique to merge adjacent pixels together based on the principle of minimum heterogeneity [53] to generate objects with similar or identical features. The multiresolution segmentation algorithm contains the following four important parameters: layer weights, scale, shape index and compactness [54]. The setting of segmentation parameters affects the classification accuracy. The determination of segmentation   Table 2  14 Texture feature GLCM_Mean r, GLCM_Mean g, GLCM_Mean b, GLCM_ StdDev r, GLCM_ StdDev g, GLCM_ StdDev b, GLCM_ Ent r, GLCM_ Ent g, GLCM_ Ent b, GLCM_ASM r, GLCM_ASM g, GLCM_ASM b, GLCM_ Cor r, GLCM_ Cor g, GLCM_ Cor b, GLCM_ Dis r, GLCM_ Dis g, GLCM_ Dis b, GLCM_Con r, GLCM_Con g, GLCM_Con b, GLCM_Hom r, GLCM_Hom g, GLCM_Hom b Standard deviation (StdDev), entropy (Ent), angular second moment (ASM), correlation (Cor), dissimilarity (Dis), contrast (Con), homogeneity (Hom), r: red band, g: green band, b: blue band Texture features are derived from red, green and blue bands by using gray-level co-occurrence matrix (GLCM).

24
Geometric feature Area, compactness, roundness, shape index, density, asymmetry Geometric features are calculated from the geometry information of the object.

6
(1) Spectral features are the basic features of wetland vegetation identification. Mean value and standard deviation of the red, green, blue bands and brightness were extracted. (2) DSM was used to represent the relative height of wetland vegetation. Height information can increase the separability between vegetation types, especially in areas with dense vegetation [58]. Therefore, the mean value and standard deviation of DSM were also used for wetland vegetation classification. (3) Vegetation indices, serving as a supplement of vegetation spectral information, were constructed using the available red, green and blue spectral bands [59]. They effectively distinguish vegetation from the surrounding terrain background and greatly improve the application potential of UAV-based RGB imagery. A total of 14 frequently used vegetation indices were calculated ( Table 2). (4) Texture features have been frequently applied for terrain classification [57] and successfully improved the accuracy of image classification and information extraction. The gray-level co-occurrence matrix (GLCM) was used to extract 24 texture features, including mean, standard deviation (StdDev), entropy (Ent), angular second moment (ASM), correlation (Cor), dissimilarity (Dis), contrast (Con) and homogeneity (Hom) of red, green and blue bands [60]. (5) Geometric features have been shown to have a positive effect on vegetation classification [38,61], and the following six frequent geometric features were utilized in this study: area, compactness, roundness, shape index, density and asymmetry.

Vegetation Indices Full Name Formulation Reference
ExG Excess green minus excess red index Red-green-blue ratio index

Machine Learning Algorithm Comparison
Previous studies have determined that different machine learning algorithms may show different classification performances [44,69,70]. Comparing machine learning algorithms often helps increase classification accuracy. In this study, five commonly used classifiers were applied to conduct object-based supervised classification in eCognition 9.0, for Bayes, KNN, DT, SVM and RF, respectively. In order to compare the performance of these classifiers in the application of wetland vegetation classification, all the 53 features were applied in the classifier training procedure, and the best classifier was selected for the follow-up research. The classifiers are introduced below.

• Bayes
Bayes is a simple probabilistic classification model based on the Bayesian theorem and assumes that features are unrelated to each other [43]. This algorithm uses training samples to estimate the mean vectors and covariance matrices for each class and then applies them for classification [44]. Bayes does not require many parameters, but may be subjective.
KNN is a commonly used nonlinear classifier and assumes similarities of neighboring objects. This algorithm seeks the K nearest neighbors in the feature space for the unclassified objects and identifies the unclassified objects through the features of its K nearest neighbors [43,44]. The parameter K is essential for the performance of the KNN classifier.
DT is a typical non-parametric rule-based classifier and has great capability to handle data sets with large numbers of attributes. The algorithm is constructed from the training sample subset [71] and is composed of a root node with all the training samples, a set of internal nodes with splits, and a set of leaf nodes [72,73]. The class label of the unclassified objects is determined according to the leaf node to which the object belongs. • Support vector machine (SVM) SVM is a non-parametric algorithm based on the statistical learning theory that was first proposed by Vapnik and Cortes in 1995 [74]. It attempts to define the optimal linear separating hyperplane from the training samples based on the maximum gap and then identifies any two classes [70]. Polynomial and radial basis function kernels are often used to project non-linear classes into separable linear ones in a higher dimension [44]. SVM does not rely on the distribution of the data and is robust to train small training sample sets [42,75].
• Random forest (RF) RF, first proposed by Breiman in 2001, is a novel machine learning method based on the statistical learning theory and composed of multiple decision trees [76]. Unlike traditional classification models with poor generalization ability, RF can significantly improve the accuracy of classification results, reduce the influence of outliers and avoid over-fitting [39]. The realization steps of RF are briefly summarized as follows: (1)  bootstrap resampling method to select k training sample sets randomly with replacement; (2) build k decision trees for k sample sets; (3) select m features randomly from the original features and set m as per tree splitting node; (4) form the random forest with k constructed decision trees and use the random forest to predict the classification result. The RF model shows stable robustness in processing complex data and has been shown to have excellent predictive capability in ecology [77].

Contribution of Different Types of Features Evaluation
Multi-features derived from UAV-based RGB imagery are of great significance for wetland vegetation classification. Scholars are concerned about the contribution of different types of features to the performance of the classifier. Therefore, many studies gradually add different types of features into the same classifier layer by layer, analyzing the contribution of the features through the overall classification accuracy [38,48,78,79]. From the results of Section 3.2, RF provided higher accuracy than the other four machine learning algorithms. Therefore, spectral features, height information, VIs, texture features and geometric features are added layer by layer to construct the RF classifier to evaluate the contribution of different types of features.

Effectiveness Evaluation of Feature Selection
Feature selection plays a critical role in object-based classification processes [45]. Studies have shown that features that were generally chosen by experience have high subjectivity and unequal contributions to classification accuracy. Furthermore, redundant features will bring negative effects to the performance and stability of a classification model [56]. Therefore, it is necessary to optimize and reduce the number of features by eliminating those that are not relevant and achieve, simultaneously, the highest intracluster similarity and the lowest inter-cluster similarity [53]. There are three popular feature selection strategies, which are as follows: filter, wrapper and embedded methods. Filter methods rely on variable ranking techniques to select the feature subset with no irrelevance to the modeling algorithm, but generate a large scale of feature subsets [80]. Wrapper methods use the performance of the learning algorithm as the feature selection criterion. It has been confirmed that wrapper methods can generate more appropriate feature subsets than filter methods, but are more time consuming [81]. Feature selection procedures and learning algorithms are achieved synchronously in embedded methods with higher treatment efficiency, but often cause over-fitting and poor robustness [80,82].
Recursive feature elimination (RFE) is a typical wrapper method based on a greedy algorithm, first proposed by Guyon et al. [83] in 2002. The essence of RFE is removing one of the least relevant features through cyclic iteration until all features are traversed, and outputting the feature ranking list [41]. In this study, the RFE algorithm was implemented using the "caret" package within statistical software R. A 10-fold cross-validation was applied to evaluate the optimal feature subset [84] in the RFE algorithm. According to the feature selection result, the last RF classifier was constructed by the optimal feature subset (RF-RFE).

Ten Object-Based Classification Scenarios Design and Accuracy Assessment
In order to explore the performance of different machine learning algorithms, the contribution of different types of features and the effectiveness of feature selection, ten scenarios were designed for object-based wetland vegetation classification (Table 3). We interpreted and classified a total of 1722 objects to serve as the training samples of ten classification scenarios, including 42 Triarrhena lutarioriparia objects, 24 Polygonum perfoliatum objects, 387 Zizania latifolia objects, 73 Sambucus chinensis objects, 228 Nelumbo nucifera objects, 296 Phragmites australis objects, 135 other objects, 232 tree shrub objects, 72 Alternanthera philoxeroides objects and 233 water objects. Firstly, scenarios 1-5 were designed to compare the performance of Bayes, KNN, SVM, DT and RF based on 53 features. Then, scenarios 5-9 were designed to evaluate the contribution of different types of features by adding spectral features, height information, vegetation indices, texture features and geometric features, layer by layer, to the RF classifier. Finally, scenarios 5 and 10 were used to verify the effectiveness of the feature selection on the classification accuracy of the RF classifier.
In order to identify the optimal scenario for object-based wetland vegetation classification using UAV-based RGB imagery, the confusion matrix provided the overall accuracy, the kappa coefficient, producer's accuracy (PA) and user's accuracy (UA) for each scenario.

Analysis of Image Segmentation Results
Three critical segmentation parameters (scale, shape index, and compactness) were selected through an iterative "trial and error" approach ( Figure 3). Firstly, shape index and compactness were determined. With a fixed scale (100) and default compactness (0.5), the shape index was set to 0.1, 0.3, 0.5, and 0.7 in separate tests (Figure 3a). When the shape index was set to 0.3, 0.5, and 0.7, the number of generated objects increased significantly, resulting in the segmentation of wetland vegetation being excessively fragmented. Then, with the shape index defined as 0.1 and the scale fixed at 100, the compactness was set as 0.1, 0.3, 0.5, and 0.7, respectively ( Figure 3b). When the compactness was 0.5, the different vegetation boundaries were well separated. As a result, equal weight was assigned to each layer, and the shape index and compactness were defined as 0.1 and 0.5, respectively. Combined with the defined parameters and the results generated by ESP2, four possible optimal segmentation scales were tested, which were 60, 106, 146, and 196 ( Figure 3c). The results show that when the scale parameter was set to 60, Nelumbo nucifera, Phragmites australis, and Zizania latifolia had visible "over-segmentation", and this phenomenon reduced the processing efficiency of the subsequent image classifications. When the segmentation scale was set to 146 and 196, the vegetation with small leaves and relatively independent distribution was "under-segmented" with the water body, and there was also visible confusion in the segmentation between Zizania latifolia and Alternanthera philoxeroides. Different vegetation was divided into the same object, and the segmentation effect was non-ideal. By contrast, the boundary of various vegetation types was visually best distinguished when the segmentation scale was set to 106, and "under-segmentation" or "over-segmentation" were not common. After comparison, it was determined that a scale of 106 had the best segmentation effect. Therefore, the scale, shape index, and compactness were set to 106, 0.1, and 0.5, respectively. Under the optimal parameter combination, wetland vegetation was effectively distinguished, and the size of the generated object was suitable for subsequent classification processing.
best distinguished when the segmentation scale was set to 106, and "under-segmentation" or "over-segmentation" were not common. After comparison, it was determined that a scale of 106 had the best segmentation effect. Therefore, the scale, shape index, and compactness were set to 106, 0.1, and 0.5, respectively. Under the optimal parameter combination, wetland vegetation was effectively distinguished, and the size of the generated object was suitable for subsequent classification processing.

Performance Comparison of Machine Learning Algorithms on Wetland Vegetation Classification
As shown in Table 4, the performance of five machine learning algorithms, based on 53 original features, was compared through the overall accuracy and the kappa coefficient. It was found that the performance of RF provided a higher classification accuracy compared to the other corresponding models, which had an overall accuracy of 89.76% and a kappa coefficient of 0.88. DT, SVM, and Bayes provided an overall accuracy greater than 84%, and KNN had the worst classification effect, with an overall accuracy lower than 60%. The PA and UA were calculated to evaluate the capability to discriminate single vegetation species of different classifiers. Non-vegetation (water and others) and vegetation were effectively distinguished, and misclassification and omission rarely occurred in the five scenarios. Compared with other classifiers, RF reduced the misclassification and omission of Zizania latifolia, and the PA and UA were, respectively, 95.45% and 72.41%. RF also showed a slight advantage in distinguishing Alternanthera philoxeroides. Although RF could not improve the PA and UA of all the species, most the misclassifications and omissions of wetland vegetation were reduced. In addition, DT had a significant advantage in distinguishing Nelumbo nucifera, and the PA and UA were 91.43% and 100.00%, respectively.
Compared with other species, Phragmites australis was most prone to misclassification and omission. Comparatively, SVM had a better effect on the classification of Phragmites australis, with a PA and UA of about 85%. The five types of machine learning algorithms produced different classifications of wetland vegetation [28]. These results are consistent with previous studies [39] showing that RF had the best performance in classifying invasive plant species, compared with the other four classifiers.

Contribution of Different Types of Features to Wetland Vegetation Classification
RF has been shown to have better performance than other machine learning algorithms in this study. Therefore, the contribution of multi-features in wetland vegetation classification was evaluated based on RF. Scenarios 5-9 showed that the combination of different features improved the classification accuracy by varying degrees when spectral features, height information, vegetation indices, texture features, and geometric features were gradually introduced into the RF layer by layer. As shown in Table 5, the initial overall accuracy of the RF constructed only by spectral features (scenario 6) was only 73.66%, and the kappa coefficient was 0.70. When the classification features added height information (scenario 7), the overall accuracy rapidly increased to 82.44%, which was 8.78% higher than in scenario 6, indicating that the DSM data were important for wetland vegetation classification. By adding vegetation indices (scenario 8) and texture features (scenario 9) to the RF classifier, the overall accuracy was 85.85% and 88.78%, respectively. When geometric features were also added to the classification features (scenario 5), the overall accuracy increased by 16.1% compared with scenario 6. Misclassification and omission were significantly reduced when different types of features were introduced into the RF layer by layer. The UA and PA of all the species were limited, except in the non-vegetation class, in which only spectral features were applied for classification. With the introduction of multi-types of features, the classification accuracy of Zizania latifolia was improved the most. When only spectral features were involved in the classification, the PA and UA of Zizania latifolia were only 36.36% and 66.67%, respectively; after adding height information and VIs, they increased to 54.55% and 75%, respectively, and they significantly increased to 95.45% and 72.41% with the addition of texture and geometric features. However, the continuous increase in features does not necessarily mean that the PA and UA of all species will be improved equally. The PA and UA of tree shrubs were 82.35% and 100%, respectively, when classified by only spectral features and height information, which were higher than other scenarios. This result illustrates that all types of features had positive effects on the vegetation classification procedure. Due to the complexity of wetland vegetation, the classification accuracy based only on spectral information is limited; the effective combination of different features can provide additional classification evidence for wetland vegetation mapping [55,57,85].

Analysis of Recursive Feature Elimination Results
In this study, a total of 53 features were extracted, and each of them contributed differently to the vegetation classification results. The RFE algorithm was performed to sort the importance of each feature. As shown in Figure 4, the overall accuracy and the kappa coefficient improved significantly with the increase in feature variables. When the number of feature variables was 20, the overall accuracy was 89.84% (85.02%-94.66%, 95% confidence interval) and the kappa coefficient was 0.88. When the number of feature variables exceeded 20, the curve showed a slight fluctuation trend first and then tended to be stable. When the number of feature variables reached 36, the overall accuracy was 91.06% (85.59%-96.53%, 95% confidence interval) and the kappa coefficient was 0.90. After that, the continuous increase in feature variables could not improve the overall accuracy significantly. This indicates that an excessive number of features causes data redundancy and over-fitting, thus affecting the accuracy of the classifier.  According to the result of RFE, the top 36 important features were determined as the optimal feature subset for wetland vegetation classification. As shown in Table 6, thirteen VIs, fifteen texture features, four spectral features, two geometric features, and two height features were selected from the original 53 features. The importance of height information (DSM) ranked first, and the homogeneity derived from green, red and blue bands were second, third and sixth, respectively. The results indicate that height information is of great significance and becomes the primary feature to discriminate wetland vegetation at the species level, due to the relative similarity of the spectral characteristics. Al-Najjar et al. [58] demonstrated that height information can ameliorate differences between vegetation, especially in dense vegetation areas. Cao et al. [47] also demonstrated the effectiveness of height information in the classification of mangrove species. Most vegetation indices also played an important role in classification, and most of them ranked within the top ten, such as COM2, NGBDI, VEG, BGRI, VDVI, and WI. Vegetation indices are used as predictors in many studies that use UAVs equipped with RGB cameras, such as Phragmites australis and mangrove mapping [27,47], wetland vegetation classification [38,48], crop mapping [86], mangrove and marsh vegetation biomass retrieval [87,88], coastal wetland restoration [89], and estimating the potassium (K) nutritional status of rice [90]. These studies have highlighted the high application value of vegetation indices. At the same time, texture features effectively compensate for the lack of spectral information [56], and also play an auxiliary role in wetland vegetation classification. In our study, 15 texture According to the result of RFE, the top 36 important features were determined as the optimal feature subset for wetland vegetation classification. As shown in Table 6, thirteen VIs, fifteen texture features, four spectral features, two geometric features, and two height features were selected from the original 53 features. The importance of height information (DSM) ranked first, and the homogeneity derived from green, red and blue bands were second, third and sixth, respectively. The results indicate that height information is of great significance and becomes the primary feature to discriminate wetland vegetation at the species level, due to the relative similarity of the spectral characteristics. Al-Najjar et al. [58] demonstrated that height information can ameliorate differences between vegetation, especially in dense vegetation areas. Cao et al. [47] also demonstrated the effectiveness of height information in the classification of mangrove species. Most vegetation indices also played an important role in classification, and most of them ranked within the top ten, such as COM2, NGBDI, VEG, BGRI, VDVI, and WI. Vegetation indices are used as predictors in many studies that use UAVs equipped with RGB cameras, such as Phragmites australis and mangrove mapping [27,47], wetland vegetation classification [38,48], crop mapping [86], mangrove and marsh vegetation biomass retrieval [87,88], coastal wetland restoration [89], and estimating the potassium (K) nutritional status of rice [90]. These studies have highlighted the high application value of vegetation indices. At the same time, texture features effectively compensate for the lack of spectral information [56], and also play an auxiliary role in wetland vegetation classification. In our study, 15 texture features were chosen by the RFE algorithm, with only five ranking among the top twenty. Pena et al. [53] also stated that the contribution of vegetation indices to the crop classification model was much higher than that of texture features, although texture features were necessary in crop classification. Compared with other features, the contribution of geometric features was significantly slight, and only area and shape index were selected by the RFE algorithm, ranking 29 and 35, respectively. Yu et al. [61] pointed out that the gaps and textures in high-resolution images caused the objects to have no obvious geometric pattern, so geometric features could not become the dominant feature to identify wetland vegetation. Specifically, wetland vegetation is affected by water levels, climate, soil, and other factors. The existing forms and growth conditions of vegetation in the same region are diverse; therefore, it is difficult for geometric features to discriminate wetland vegetation independently. In our study, some Zizania latifolia were affected by Alternanthera philoxeroides, and they were mixed in other vegetations in the form of lodging, showing morphological differences to the normal growth of Zizania latifolia. Therefore, the pattern of the generated objects showed a large difference in the same type of vegetation.

Analysis of the Effectiveness of Feature Selection
Comparing scenario 5 and 10 (Table 5), the overall accuracy of RF-RFE was 90.73%, which was 0.97% higher than that of RF based on the 53 original features. The confusion matrix showed that RF-RFE successfully reduced the misclassification and omission between vegetation species. The PA and UA of Sambucus chinensis were 100% and 96.97%, respectively, which were 3.12% and 0.09% higher than those in scenario 5. Similarly, feature selection also improved the UA of Zizania latifolia, Phragmites australis, Alternanthera philoxeroides, and Sambucus chinensis. Although RF-RFE provided a lower PA of 71.43% for tree shrubs than that in scenario 5, with a PA of 73.68%, RF-RFE was an optimal classification model for the differentiation of all vegetation species.
The results showed that the combination of multi-features can greatly improve the classification accuracy, but high-dimensional features will reduce the classification model efficiency. Feature selection is capable of screening out the best feature subset, which is an indispensable step to classify wetland vegetation by multi-features. Zuo et al. [48] designed 12 groups of comparative experiments using different feature combinations during marsh vegetation mapping, and they verified that the classification accuracy after RFE feature selection was higher than without feature selection. Zhou et al. [79] also showed that the RFE feature selection algorithm can select the most favorable features for forest stand classification, successfully improving the classification accuracy from 80.57% to 81.05%. Previous studies have also demonstrated the advantages of the RFE feature selection method in many research areas [41,[91][92][93], such as agricultural pattern recognition, mangrove species classification, and landslide susceptibility mapping. Using high-resolution remote sensing images to monitor and map wetland vegetation is becoming more common, and the RF-RFE model is a promising method for wetland vegetation classification.

Comparison of Object-Based Classification Results
The object-based classification results of ten object-based classification scenarios are presented in Figure 5. The classification maps of scenario 2 and scenario 6 had the worst classification effect, with many misclassifications. Based on the field survey, the vegetation distribution of the other eight scenarios is nearly consistent with the original image. The RF-RFE model has the best classification effect, and is effective at distinguishing Nelumbo nucifera, Zizania latifolia, and dead plants in the southwestern portion of the study area.
Our study shows that UAV-based RGB imagery has great potential in wetland vegetation classification. One key advantage of high-resolution UAV imagery is that the data collection is much more flexible and cost efficient. Additionally, it can provide massive shape, geometry and texture features to make up for the lack of spectral information, allowing high-accuracy classification to be achieved in this complex wetland environment. Combining the OBIA method with machine learning algorithms is a promising way to achieve high-precision wetland vegetation classification at the species level, and the RFE feature selection method is an important step to improve the accuracy of the classifier. Our results provide a basis for monitoring and mapping wetland vegetation in an efficient and economical way.
Although object-based wetland vegetation classification produces a good result using RF-RFE, some limitations remain. Firstly, previous studies pointed out that the accuracy of the vegetation classification varied seasonally [94]; therefore, the feasibility of the method applied in this paper remains to be explored for other seasons. Secondly, emerging machine learning algorithms still have some issues [95]. The classification model parameters were fixed in this study. Optimizing the model parameters, or combining different classifiers together, is a practicable way to achieve improved classification results in a complex wetland environment. Zhang et al. [40] produced both community-and species-level classifications of object-based marsh vegetation, combining SVM, RF, and minimum noise fraction, with an overall accuracy of over 90%. Zhang et al. [96] showed that there are no perfect image classifiers and different classifiers may complement each other, and their results showed that the integration of classifiers provided more accurate vegetation identification than a single classifier. Hao et al. [97] also found that single classifiers and hybrid classifiers performed differently under different conditions. Specifically, hybrid classifiers performed better with a small training sample, and a single classifier achieved good performance with abundant training samples. Furthermore, high-resolution images result in a huge amount of data with longer computing times. The development of an efficient and universal classification model based on UAV-based RGB imagery, which could be implemented with less professional expertise, is the focus of our future research, as it would contribute to vegetation monitoring and mapping of inaccessible wetlands. The contribution of additional features and feature selection methods can also be evaluated and assessed in future work.
The object-based classification results of ten object-based classification scenarios are presented in Figure 5. The classification maps of scenario 2 and scenario 6 had the worst classification effect, with many misclassifications. Based on the field survey, the vegetation distribution of the other eight scenarios is nearly consistent with the original image. The RF-RFE model has the best classification effect, and is effective at distinguishing Nelumbo nucifera, Zizania latifolia, and dead plants in the southwestern portion of the study area.

Conclusions
In this study, we evaluated the feasibility of using UAV-based RGB imagery for wetland vegetation classification at the species level. OBIA technology was employed to overcome the limitation of traditional pixel-based classification methods. After multifeature extraction and selection, we designed ten scenarios to compare the contributions of five machine learning algorithms, multi-feature combinations, and feature selections to the classification results. The optimal configuration for wetland vegetation classification was selected based on the overall accuracy, kappa coefficient, PA, and UA. Based on the analysis of ten scenarios, the following conclusions could be drawn: (1) When object-based wetland vegetation classification was carried out with all the original features, RF performed better than Bayes, KNN, SVM, and DT. (2) Multi-feature combination can help UAV-based RGB imagery realize wetland vegetation classification. Different types of features contributed unequally to the classifier.
Owing to the similarity of the spectral information in wetland vegetation, height information became the primary feature discriminating wetland vegetation. The contribution of vegetation indices to wetland vegetation classification was irreplaceable, and texture features were less important than vegetation indices, but still indispensable. (3) The classification effect of RF-RFE was the best among the ten scenarios. Feature selection is an effective way to improve the performance of the classifier. A large number of features of redundant or irrelevant information negatively affects the classification. The RFE feature selection algorithm can effectively select the best feature subset to improve the classification accuracy.
This paper demonstrates that UAVs with RGB cameras have the advantages of stronger operability, higher universality, and lower cost, which make them a powerful platform for wetland vegetation monitoring and mapping. The UAV images with high temporal and spatial resolution provide rich image features that can be used to produce wetland vegetation classification at the species level, overcoming the difficulties of traditional wetland vegetation surveys. The method proposed in this paper is more convenient to implement from data acquisition to data processing, simultaneously achieving rapid and high-precision wetland vegetation mapping without the sufficient professional background knowledge of relevant personnel. The approach presented in this paper can be applied to execute long-term and accurate dynamic monitoring of wetland vegetation.