Single-Class Data Descriptors for Mapping Panax notoginseng through P-Learning

Machine learning-based remote-sensing techniques have been widely used for the production of specific land cover maps at a fine scale. P-learning is a collection of machine learning techniques for training the class descriptors on the positive samples only. Panax notoginseng is a rare medicinal plant, which also has been a highly regarded traditional Chinese medicine resource in China for hundreds of years. Until now, Panax notoginseng has scarcely been observed and monitored from space. Remote sensing of natural resources provides us new insights into the resource inventory of Chinese materia medica resources, particularly of Panax notoginseng. Generally, land-cover mapping involves focusing on a number of landscape classes. However, sometimes a subset or one of the classes will be the only part of interest. In term of this study, the Panax notoginseng field is the right unit class. Such a situation makes single-class data descriptors (SCDDs) especially significant for specific land-cover interpretation. In this paper, we delineated the application such that a stack of SCDDs were trained for remote-sensing mapping of Panax notoginseng fields through P-learning. We employed and compared SCDDs, i.e., the simple Gaussian target distribution, the robust Gaussian target distribution, the minimum covariance determinant Gaussian, the mixture of Gaussian, the auto-encoder neural network, the k-means clustering, the self-organizing map, the minimum spanning tree, the k-nearest neighbor, the incremental support vector data description, the Parzen density estimator, and the principal component analysis; as well as three ensemble classifiers, i.e., the mean, median, and voting combiners. Experiments demonstrate that most SCDDs could achieve promising classification performance. Furthermore, this work utilized a set of the elaborate samples manually collected at a pixel-level by experts, which was intended to be a benchmark dataset for the future work. The measuring performance of SCDDs gives us challenging insights to define the selection criteria and scoring proof for choosing a fine SCDD in mapping a specific landscape class. With the increment of remotely sensed satellite data of the study area, the spatial distribution of Panax notoginseng could be continuously derived in the local area on the basis of SCDDs.


Introduction
Traditional Chinese medicine (TCM) [1] originated in ancient China and has evolved over thousands of years as the only health care and disease healing [2].A long time before the birth of modern Western medicine, traditional medicinal recipes were handed down orally generation by generation in many parts of the world [3].Given that TCM is a practical medicine built on experience, and has been mainly practiced and researched in China [4,5], the essence of TCM has always been the most advanced and experienced medicine in the world [6].Moreover, scientists proved that TCM can coexist with Western medicine [3,7].Geoherb is a type of Chinese herb with a geographical indication corresponding to a specific geographical location or origin, which has a certification that the product Figure 1.Panax notoginseng is a well-known geoherb, which has acquired a very favorable reputation for the treatment of blood disorders, including blood stasis, bleeding, and blood deficiency.Its root can be turned to powder as a medicinal material, and the shadenet cover can be observed from space by means of satellite imagery.
Remote-sensing techniques have been applied to monitor land cover at a range of spatial and temporal scales, in order to satisfy a range of scientific and practical requirements [19][20][21].In particular, remote-sensing mapping is an efficient technique to acquire spatial and temporal cropland information repeatedly and consistently [22,23].In many cases, remotely-sensed data are utilized to derive landscape information on a specific land-cover class of interest [24].The ability to map and monitor land-cover types and their dynamics for diverse applications has been enhanced by the availability and constantly increased coverage, of satellite images [25,26].Many problems are Panax notoginseng is a well-known geoherb, which has acquired a very favorable reputation for the treatment of blood disorders, including blood stasis, bleeding, and blood deficiency.Its root can be turned to powder as a medicinal material, and the shadenet cover can be observed from space by means of satellite imagery.
Remote-sensing techniques have been applied to monitor land cover at a range of spatial and temporal scales, in order to satisfy a range of scientific and practical requirements [19][20][21].In particular, remote-sensing mapping is an efficient technique to acquire spatial and temporal cropland information repeatedly and consistently [22,23].In many cases, remotely-sensed data are utilized to derive landscape information on a specific land-cover class of interest [24].The ability to map and monitor land-cover types and their dynamics for diverse applications has been enhanced by the availability and constantly increased coverage, of satellite images [25,26].Many problems are encountered in mapping land cover from remotely sensed data by a classification analysis or landscape class description [27] in order to quantify the relationships among all the pixels in an image, such as similarities or differences in spectra signature or spatial texture [22], and extract land cover classes from remote-sensing images [28][29][30].The application of remote-sensing techniques plays a significant role in the quantitative resource survey of TCM, particularly in exploring monitoring abilities for the sustainable utilization and bio-diversity protection of Chinese materia medica resources in macrocosm.Under the prerequisite that remote-sensing techniques provide up-to-date landscape surveying at a fine scale [31], one of the motivations for this work comes from the fact that only a single class of interest is involved in a mapping task.
Recently, there has been a number of applications [32], i.e., geological products [33], vegetation indices [34], aerosol products [35], ocean data products [36], dust source identification [37], and crops identification [38] in remote-sensing based on machine learning techniques.In this study, we present an original innovation to apply most of the available single-class data descriptors (SCDDs) through P-learning to conduct mapping of Panax notoginseng fields.After that, the work of measuring performance will give us profound insights into defining the selection criteria and scoring proof for choosing a fine SCDD.As such, the introduction of SCDDs for remote-sensing mapping of Panax notoginseng fields will be helpful to promoting the development of a Panax notoginseng resource inventory and dynamic monitoring towards the quantitative direction.Attributing to the standardized cultivation technique of good agricultural practice (GAP) [39], or so-called controlled-environment agriculture using shade houses provide a distinct image texture to interpret Panax notoginseng fields visually [40].Additionally, due to only Panax notoginseng fields being the target class, the task of mapping Panax notoginseng fields becomes a specific typology of land-cover classification, which could be regarded as a problem of single-class data description or a special type of one-class classification [41].SCDDs are the appealing alternatives to the conventional supervised classifiers because they can be trained with only the target training samples [42].These kinds of algorithms have emerged to only require training samples from the target class, which are referred to as P-learning [43].Notice that the single class meant no more than one landscape class, and the P-learning based class description may depict the sight that no negative samples are used for training.Such a classification approach aims to identify only one landscape class of interest regardless of the other classes presented in the study area [44].In the case of single-class data description, we always face an imbalanced binary classification [45,46] including (1) the positive class (i.e., Panax notoginseng fields); and (2) the negative class (i.e., the other classes).In this case, the positive class is assumed to be sampled well, while the other classes may be sampled very sparsely or totally absent.When no samples of the other classes are available, most classification errors (e.g., false negative) cannot be estimated [47].In addition, the procedure that trains an accurate SCDD is challenging, particularly in the face of a large number of unlabeled samples; or say, only a small class or relatively few training samples are available [42].Therefore, it might be very expensive to collect the negative samples which are so abundant that a good sampling seems elusive.Although this was an extreme case, we carefully designed the training and test sets, which were composed of the qualified positive and negative samples.Note that if we want to improve the overall performance of the numerous classifiers which may differ in complexity, a combination of these classifiers will always be a viable solution [48].
Regarding another motivation of this work with respect to TCM, the quality control of TCM remains a significant issue that affects medicinal herbs, formulations, and even TCM practice.Due more to the lax enforcement of standards [49], resulting in the diminishing popularity of TCM rather than a failure of remedies, particularly, the patchy regulation has led to inconsistent herb quality, unqualified practitioners, unsubstantiated claims for secret formulas, and both deliberate and inadvertent mislabeling and adulteration, sometimes with fatal consequences.Considering Panax notoginseng is a vulnerable crop which has a serious succession cropping obstacle [50], consequently the continuous planting of the same crop in the same field will lead to the decrease of yield and quality.In order to promote the quality of production, it is crucial to monitor the spatial planting patterns of Panax notoginseng fields, such as crop rotation [51,52].The planting pattern implies standardized planting with the specific crop structure and spatio-temporal configuration in the same field for a specific region under the particular natural resource and socio-economic conditions [53,54] so as to realize the sustainable utilization of agricultural resources and crop yield.Until now, the concrete planting pattern changes of Panax notoginseng are still poorly known.To the best of our knowledge, until now no such work has been done which, on the one hand, enriches the approaches to monitor spatial planting pattern changes of the perennial ginseng from space; on the other hand, employs SCDDs for mapping Panax notoginseng.
Our studies on mapping Panax notoginseng aim to provide fruitful information for studies on the quality assurance of TCM production, precision farming, the construction of agro-ecosystems, sustainable development, and the protection of the biodiversity of Panax notoginseng.Furthermore, determining the planting area of Panax notoginseng is an important part of obtaining more accurate information about annual yield and natural storage, except for mapping the spatial distribution.The current study, which involves mapping the planting parcels of Panax notoginseng at a 30 m spatial resolution, has three aims: (1) mapping Panax notoginseng fields through a stack of SCDDs as the future technical milestone for planting pattern analysis; (2) evaluating the abilities of SCDDs in identifying small Panax notoginseng fields in the complex agricultural landscapes; and (3) providing the potential possibilities for monitoring the planting pattern changes of Panax notoginseng fields, further giving us new insights into the planting pattern transitions of the perennial ginseng in macrocosm.The case study area is located in Wenshan City of China, which is characterized by a distinctive crop rotation agricultural system.The highlights of this study include: (1) striving for the research of the landscape-scale remote-sensing interpretation of TCM resources for the first time; (2) employing a stack of SCDDs with a comparative perspective to conduct mapping of Panax notoginseng fields; (3) defining the selection criteria and scoring proof for choosing a most appropriate SCDD; and (4) evaluating the abilities of SCDDs in identifying the fragmented parcels of Panax notoginseng in the complex agricultural landscapes.
The rest of this paper is structured as follows.The description of materials and methods is introduced in Section 2, and the experiments and analysis are presented and discussed in Sections 3 and 4, respectively.Finally, the conclusions of this work are summarized in Section 5.

Study Area and Data
Our study area comprises two independent blocks which are situated in Wenshan City, Wenshan Prefecture, Yunnan Province, in the south-west region of China (see Figure 2).These places mainly lie between longitude 103.71 • E-104.46 • E and latitude 23.07 • N-23.73 • N. Wenshan Prefecture is on a plateau, where the temperatures are quite constant throughout the year, with more precipitation during the summer months.Due to its low latitude and tempered by its high elevation, Wenshan Prefecture has a mild, humid, and subtropical climate, particularly suitable for planting Panax notoginseng.This is the reason why Wenshan Prefecture is the specific geographical location and origin of Panax notoginseng (i.e., why it is called a geoherb).
The "sa" and "sb" are two typical square regions.There are 151 × 151 pixels in the image space, respectively, corresponding to an area 20.5209 km 2 or equivalent to 2052.09 ha for both.The two blocks have a typical representation of the dense Panax notoginseng fields, upon which mapping Panax notoginseng fields will be carried out using multiple SCDDs.Multi-spectral cloud-free images acquired by the Landsat-8 Operational Land Imager (OLI) at a 30 m spatial resolution were utilized in this study.Their acquisition date was 18 March 2015.Since only one scene (path/row: 128/044) was utilized for the analysis in this study, the atmosphere can be considered to be homogeneous, and therefore the atmospheric correction may be not necessary [41,55].Note that the planting fields of Panax notoginseng are rather sparse in most cases, and the cloud-free satellite images are not easy to collect because of the special geographical environment (i.e., a mountainous area is often in heavy clouds, refer to Table 1

Shadenet Structures
Plastic sheets are used as materials to build the shadenet structure which can be regarded as a micro-scale planting environment and are relatively common [56], having unique characteristics [57], i.e., optical transparency, shade percentages, gas-tightness, and high-reflectivity.Agriculturalists have long known the importance of the planting environment for crop growth, always by manipulating the growing environment to provide a more conducive environment for crop growth.As for Panax notoginseng, sunlight is often modified by shading to provide the more optimal growing environments so as to enhance their production.Therefore, the production of Panax notoginseng takes place within an enclosed growing structure called a shade house.The shade house (see Figure 3) provides the protection and maintains the optimal growing conditions throughout the development of Panax notoginseng.The shade house of Panax notoginseng is a framed structure often covered with the black plastic nets which are made of the polyethylene thread with different shade percentages.It provides a partially-controlled atmosphere and environment by reducing the light intensity and effective heat during daytime to Panax notoginseng grown under the very large plastic sheets.

Shadenet Structures
Plastic sheets are used as materials to build the shadenet structure which can be regarded as a micro-scale planting environment and are relatively common [56], having unique characteristics [57], i.e., optical transparency, shade percentages, gas-tightness, and high-reflectivity.Agriculturalists have long known the importance of the planting environment for crop growth, always by manipulating the growing environment to provide a more conducive environment for crop growth.As for Panax notoginseng, sunlight is often modified by shading to provide the more optimal growing environments so as to enhance their production.Therefore, the production of Panax notoginseng takes place within an enclosed growing structure called a shade house.The shade house (see Figure 3) provides the protection and maintains the optimal growing conditions throughout the development of Panax notoginseng.The shade house of Panax notoginseng is a framed structure often covered with the black plastic nets which are made of the polyethylene thread with different shade percentages.It provides a partially-controlled atmosphere and environment by reducing the light intensity and effective heat during daytime to Panax notoginseng grown under the very large plastic sheets.

Design Sets
Good classification depends not only on the factors associated with classifiers, but fine design sets also play a significant role in assessing the classification results objectively [58].There is a nonnegligible truth that, if only given positive samples, we cannot estimate most of the classification errors.Therefore, both positive and negative samples are supposed to be prepared for this study.Silva, et al. [41] utilized a manually-collected set of samples of the target class and a random sampling of samples of all classes, to keep the training effort low.In that case, they used the non-pure negative samples under the assumption of which few samples of the target class would be submerged.
In this study, we manually collected the positive (i.e., 1211) and negative (i.e., 8522) samples with a class-wise separability of 1.9918.Subsequently, we prepared three kinds of design sets, i.e., the training set (i.e., 727 positive samples, and 5114 negative samples at a 60% split); test set (484 positive samples, and 3408 negative samples, the remaining 40% split); and validation set (only 51 positive samples for the "sa", and 94 positive samples for the "sb"); as well as additional reference results (see Figure 5) obtained by the expert processing software.The training and test sets are random subsets of the raw collected samples by splitting operation.Note that once determined, they should be fixed so that all SCDDs could be fairly compared.For a good estimate, the test set should be labeled, randomly drawn from the class of interest, independent from the training set, and as large as possible.The validation set (i.e., only comprising true labels of the positive class) is used for validation and a nominal set.Note that the raw samples covering the whole of Wenshan City are specially designed for training and testing SCDDs.Meanwhile, the true labels for validation set are applied to calculate a representative accuracy (i.e., a correct rate) of the classification results as a final validation.

Design Sets
Good classification depends not only on the factors associated with classifiers, but fine design sets also play a significant role in assessing the classification results objectively [58].There is a non-negligible truth that, if only given positive samples, we cannot estimate most of the classification errors.Therefore, both positive and negative samples are supposed to be prepared for this study.Silva, et al. [41] utilized a manually-collected set of samples of the target class and a random sampling of samples of all classes, to keep the training effort low.In that case, they used the non-pure negative samples under the assumption of which few samples of the target class would be submerged.
In this study, we manually collected the positive (i.e., 1211) and negative (i.e., 8522) samples with a class-wise separability of 1.9918.Subsequently, we prepared three kinds of design sets, i.e., the training set (i.e., 727 positive samples, and 5114 negative samples at a 60% split); test set (484 positive samples, and 3408 negative samples, the remaining 40% split); and validation set (only 51 positive samples for the "sa", and 94 positive samples for the "sb"); as well as additional reference results (see Figure 5) obtained by the expert processing software.The training and test sets are random subsets of the raw collected samples by splitting operation.Note that once determined, they should be fixed so that all SCDDs could be fairly compared.For a good estimate, the test set should be labeled, randomly drawn from the class of interest, independent from the training set, and as large as possible.The validation set (i.e., only comprising true labels of the positive class) is used for validation and a nominal set.Note that the raw samples covering the whole of Wenshan City are specially designed for training and testing SCDDs.Meanwhile, the true labels for validation set are applied to calculate a representative accuracy (i.e., a correct rate) of the classification results as a final validation.

Design Sets
Good classification depends not only on the factors associated with classifiers, but fine design sets also play a significant role in assessing the classification results objectively [58].There is a nonnegligible truth that, if only given positive samples, we cannot estimate most of the classification errors.Therefore, both positive and negative samples are supposed to be prepared for this study.Silva, et al. [41] utilized a manually-collected set of samples of the target class and a random sampling of samples of all classes, to keep the training effort low.In that case, they used the non-pure negative samples under the assumption of which few samples of the target class would be submerged.
In this study, we manually collected the positive (i.e., 1211) and negative (i.e., 8522) samples with a class-wise separability of 1.9918.Subsequently, we prepared three kinds of design sets, i.e., the training set (i.e., 727 positive samples, and 5114 negative samples at a 60% split); test set (484 positive samples, and 3408 negative samples, the remaining 40% split); and validation set (only 51 positive samples for the "sa", and 94 positive samples for the "sb"); as well as additional reference results (see Figure 5) obtained by the expert processing software.The training and test sets are random subsets of the raw collected samples by splitting operation.Note that once determined, they should be fixed so that all SCDDs could be fairly compared.For a good estimate, the test set should be labeled, randomly drawn from the class of interest, independent from the training set, and as large as possible.The validation set (i.e., only comprising true labels of the positive class) is used for validation and a nominal set.Note that the raw samples covering the whole of Wenshan City are specially designed for training and testing SCDDs.Meanwhile, the true labels for validation set are applied to calculate a representative accuracy (i.e., a correct rate) of the classification results as a final validation.

Single-Class Data Descriptors (SCDDs)
The SCDD [47] is a trained mapping to predict classes.Additionally, they can be divided into several categories depending on the type of the training data and the discrimination function [59].For example, the positive samples only (i.e., P-learning) or the positive and unlabeled samples (i.e., PU-learning) [31,59].In general, for certain SCDDs, the corresponding model can be defined as or vice versa in the opposite conditions.The threshold θ is set according to the target error.Formally, each instance is mapped to one element of the set of the positive and negative class labels.In this study, due to only the target class (i.e., Panax notoginseng fields) is the class of interest, and the task of mapping Panax notoginseng fields can be regarded as a specific single-class data description problem.Hence, 10 SCDDs [47], i.e., the simple Gaussian target distribution data description (SGTD coded as c1) [60]; the robust Gaussian target distribution data description (RGTD coded as c2) [60]; the minimum covariance determinant Gaussian data description (MCDG coded as c3) [60]; the mixture of Gaussian data description (MoG coded as c4) [60]; the auto-encoder neural network data description (AENN coded as c6) [61]; the k-means clustering data description (k-means coded as c7) [62]; the self-organizing map data description (SOM coded as c10) [63]; the minimum spanning tree data description (MST coded as c11) [64]; the k-nearest neighbor data description (K-NN coded as c13) [65]; the incremental support vector data description (IncSVDD coded as c17) [66]; the Parzen density estimator data description (PDE coded as c5, which is a known underestimated descriptor) [67]; and the principal component analysis data description (PCA coded as c9, which is known as an overestimated descriptor) [68].
In addition to the SCDDs mentioned above, three ensemble classifiers, i.e., the mean combiner (meanc coded as cmea), the median combiner (medianc coded as cmed), and the voting combiner (votec coded as cvot), taking the mean, median, and vote strategies, respectively.The ensemble-based approach refers to the multiple-classifier system in which the outputs of all member classifiers are combined to derive an accurate classification.We combine the SCDDs which should be accurate but different in an ensemble strategy because each of them can focus on a specific feature or characteristic in the feature space [24].Thus, a much more flexible and outstanding classifier can be obtained by combining all the strong points of the different SCDDs.There are three strategies of combining the different SCDDs, which are referred to as (1) sequential, (2) parallel, and (3) stacked [47].Here, the above-enumerated SCDDs (i.e., c1-c17) are computed in the same feature space, and which are typically combined in a stacked way.Notice that, the combining procedure is computationally intensive in the face of many different base classifiers.Thus, the action that prunes the base classifiers according to their performance (i.e., underestimated or overestimated) is inevitable sometimes.In this study, ten SCDDs except for the underestimated and overestimated ones, which are regarded as the qualified member classifiers.The abovementioned SCDDs and combiners have been well implemented with a Matlab toolbox for data description developed and distributed by Dr. David Tax.

Performance Evaluation
In this study, the error computation and performance evaluation involve several accuracy metrics, i.e., the basic errors (see Table 2), F 1 measure, receiver operating characteristics (ROC) curve, area under the ROC curve (AUC) error, cost curve, confusion matrix, and correct rate, which are employed to evaluate the SCDDs in a more comprehensive way.In order to find a good SCDD, four basic errors can be calculated, and two of them have to be minimized, namely the false positives (FP) and false negatives (FN).Hence, we put forward and discuss several representative measures which can reflect the probability that the prediction is informed versus chance.The FN can be estimated on the positive set.In fact, the FN is much harder to estimate when no negative samples are available [47].If only minimizing the FN will lead to the SCDD which may wrongly label a number of the negative samples as the positive class.In order to avoid such a degenerate solution, the negative samples have to be collected as well.This is the reason why we elaborately prepared the design sets and the reference results for training and testing.Moreover, two other measures, such as precision and recall (i.e., the true positive (TP) rate), are often used in performance evaluation.Finally, a derived performance indicator F 1 score can be computed.Note that a good SCDD should have both small rates of the FN and FP.Due to the error on the positive class can be estimated relatively well, assuming that, a threshold can be set beforehand on the target error for the SCDDs, and then a ROC curve can be obtained by varying this threshold and measuring the error on the negative samples.The ROC graph [69,70] shows how the FP varies for varying the FN, which is a technique for visualizing and selecting the best classifiers based on their performance.The smaller these fractions are, the more the SCDD is to be preferred.Although the ROC graph shows an intuitive metric of the performance of the SCDD, as one side, it is a bit difficult to compare the ROC curves; for another, we want to reduce the ROC curve to a single scalar value representing the expected performance.Thus, the AUC error [71,72] often is taken, and which is computed from the ROC curve, which integrates the fraction TP over varying thresholds (or by varying fraction FP equivalently).The larger the AUC value, the better the SCDD's performance.That is, the higher values may indicate a better separation between the positive and negative samples.As for the ROC graph [70], there are many thresholds that may be suboptimal.That is, there is another operating point for which at least one of the errors is lower.Nevertheless, this concern can be indicated by a cost curve [73], which will be another method for visualizing the performance of the SCDD.For a varying cost-ratio between both classes, the expected cost is computed.Additionally, once a trained mapping has been determined, we can obtain the sufficient site-specific metrics derived from a confusion matrix (i.e., a binary contingency table for the SCDD).The binary confusion matrix (see Table 2) is a specific table layout that allows the visualization of the performance of the SCDD [74].Such an error matrix is constructed via classifying a predefined test set or comparing two sets of labels, and which combines the spatial position and quantitative information of the classification results to implement a performance evaluation.What is important is that the very small changes of labels are well reflected in a confusion matrix.

Resultant Maps
After the SCDDs have been performed, and then the final outputs associated with mapping Panax notoginseng fields can be derived.For one side, we conduct a statistical analysis from a perspective of quantitative evaluation.Besides, we want an intuitive comparison by showing the paired maps which are overlapped in false-color style between the classification and reference maps regarding the spatial distribution of Panax notoginseng fields.
As such, the classification map of each SCDD in comparison with the reference map that can be simultaneously graphed for the "sa" and "sb" (see Figures 6 and 7).By visual inspection, the obvious differences have been observed with respect to the underestimated PDE (see Figures 6 and 7) and the overestimated PCA (see Figures 6 and 7).The heavy magenta patches mean seriously underestimated, while a large number of green patches mean severely overestimated.The majority of the rest of the SCDDs arise from the predominately magenta and slightly green patches, which mean underestimated.Meanwhile, AENN is different (see Figures 6 and 7).Notice that the error threshold on the positive class is set by a default float value of 0.1 for all SCDDs.In a number of clusters a default integer value of 2 will be acceptable because only two classes are available (i.e., the positive and negative class).Additionally, the number of components is set to 5 in terms of PCA.default integer value of 2 will be acceptable because only two classes are available (i.e., the positive and negative class).Additionally, the number of components is set to 5 in terms of PCA.

Measuring Performance
Performance evaluation is crucial in the assessment of a set of SCDDs.For Table 3, the false negative rate (FNR) gives the error on the positive class, while the false positive rate (FPR) shows the error on the negative class.Meanwhile, P is precision, and R denotes recall equivalent with TPR.The statistical metrics can be grouped into (1) a group that the smaller is the better, i.e., the FNR and FPR, and (2) another group that the greater is the better, i.e., P, R, F1, and AUC.Although it is difficult to describe all SCDDs together, we attempt to make it possible by rating them on a rank table later.Table 3 illustrates that the FNR (e.g., the c5) and FPR (e.g., the c2 and c9) are prominently higher than two known inferior SCDDs (e.g., the c5 and c9), and a slightly inferior one (e.g., c2) is marked as well.Additionally, these unqualified SCDDs are again attention-catching in the second group.For all SCDDs, the precision (e.g., the c2 and c9), recall (e.g., the c5), and F1 score (e.g., the c2, c5, and c9) support the analysis drawn from the first group, while the AUC error always appears mediocre.However, there are some differences owing to the measuring ability of statistical metrics and the intrinsic characteristics of the diverse SCDDs.The ROC curve (see Figure 8) gives a two-dimensional depiction of the performance of the SCDD.This is due to too many SCDDs with the approximate accuracies so that discriminating the individual ROC curve seems difficult.Thus, we plot them one by one, and the aforementioned inferior SCDDs still can be well reflected.Note that the c5 has the lowest operation point, while the c2 and c9 are found in terms of the ROC curve.In addition, the c6 deserves attention.Here, the ROC graphs are not going to be read significantly, as the purpose is

Measuring Performance
Performance evaluation is crucial in the assessment of a set of SCDDs.For Table 3, the false negative rate (FNR) gives the error on the positive class, while the false positive rate (FPR) shows the error on the negative class.Meanwhile, P is precision, and R denotes recall equivalent with TPR.The statistical metrics can be grouped into (1) a group that the smaller is the better, i.e., the FNR and FPR, and (2) another group that the greater is the better, i.e., P, R, F 1 , and AUC.Although it is difficult to describe all SCDDs together, we attempt to make it possible by rating them on a rank table later.Table 3 illustrates that the FNR (e.g., the c5) and FPR (e.g., the c2 and c9) are prominently higher than two known inferior SCDDs (e.g., the c5 and c9), and a slightly inferior one (e.g., c2) is marked as well.Additionally, these unqualified SCDDs are again attention-catching in the second group.For all SCDDs, the precision (e.g., the c2 and c9), recall (e.g., the c5), and F 1 score (e.g., the c2, c5, and c9) support the analysis drawn from the first group, while the AUC error always appears mediocre.However, there are some differences owing to the measuring ability of statistical metrics and the intrinsic characteristics of the diverse SCDDs.The ROC curve (see Figure 8) gives a two-dimensional depiction of the performance of the SCDD.This is due to too many SCDDs with the approximate accuracies so that discriminating the individual ROC curve seems difficult.Thus, we plot them one by one, and the aforementioned inferior SCDDs still can be well reflected.Note that the c5 has the lowest operation point, while the c2 and c9 are found in terms of the ROC curve.In addition, the c6 deserves attention.Here, the ROC graphs are not going to be read significantly, as the purpose is achieved.The cost curve (see Figure 9) is a specific performance visualizer using the expected cost, another technique to measure the performance of SCDDs.Each operating point appears as a line in this plot, while the certain one of them is indicated by the dotted line.The combination of operating points that forms the lower hull is indicated by the thick curve and shows the best operating points over the range of costs.Here, the dotted line of c5, and the thick arcs of c2 and c9, again support the performance analysis drawn from the Figure 8.In particular, c6 has a representation that is the same as the ROC graph.
Appl.Sci.2018, 8, x FOR PEER REVIEW 12 of 23 achieved.The cost curve (see Figure 9) is a specific performance visualizer using the expected cost, another technique to measure the performance of SCDDs.Each operating point appears as a line in this plot, while the certain one of them is indicated by the dotted line.The combination of operating points that forms the lower hull is indicated by the thick curve and shows the best operating points over the range of costs.Here, the dotted line of c5, and the thick arcs of c2 and c9, again support the performance analysis drawn from the Figure 8.In particular, c6 has a representation that is the same as the ROC graph.The confusion matrix is often applied to visualize the performance of the SCDD by a specific table layout.The OA means overall accuracy, K is the Kappa coefficient, PA denotes the producer's accuracy, and UA represents the user's accuracy.It is somewhat intricate that we structure two kinds of confusion matrices together (see Figure 10), which look similar but not identical.That is the reason why we design two types of confusion matrix (1) classifier-dependent (i.e., OAt, Kt, PAt, and UAt), which is generated using the test set; and (2) classifier-independent (i.e., OAa, Ka, PAa, UAa, OAb,  The confusion matrix is often applied to visualize the performance of the SCDD by a specific table layout.The OA means overall accuracy, K is the Kappa coefficient, PA denotes the producer's accuracy, and UA represents the user's accuracy.It is somewhat intricate that we structure two kinds of confusion matrices together (see Figure 10), which look similar but not identical.That is the reason why we design two types of confusion matrix (1) classifier-dependent (i.e., OAt, Kt, PAt, and UAt), which is generated using the test set; and (2) classifier-independent (i.e., OAa, Ka, PAa, UAa, OAb, Kb, PAb, and UAb), which is produced by utilizing the reference result for "sa" and "sb".For the two classifier-independent error matrices, which give an analogous presentation and a comforting result, though with a slight discrepancy in the amplitude, the inferior SCDDs, i.e., the c5 has a smaller K and PA while the c9 has a smaller K and UA.As for the classifier-dependent confusion matrix, all SCDDs have good performance except for the c5 has a lower K and PA while the c9 has a lower K and UA.
The summary can be drawn as ( 1) overall accuracy appears mediocre or fails in the face of the SCDD for the imbalanced data; and (2) two classifier-independent error matrices demonstrate that the SCDDs are, indeed, fixed by unchangeable splitting samples.Meanwhile, they seem worse and more unstable compared to the classifier-dependent matrices, which may be disturbed in the presence of more uncertainty.The correct rate (see Figure 10, and Table 4) is a custom performance measure, which is used for validation.Here, the true labels used are a subset of the positive samples drawn from a merged set for the "sa" and "sb".This measure is a simple attempt to repeat and verify the previous analysis.Kb, PAb, and UAb), which is produced by utilizing the reference result for "sa" and "sb".For the two classifier-independent error matrices, which give an analogous presentation and a comforting result, though with a slight discrepancy in the amplitude, the inferior SCDDs, i.e., the c5 has a smaller K and PA while the c9 has a smaller K and UA.As for the classifier-dependent confusion matrix, all SCDDs have good performance except for the c5 has a lower K and PA while the c9 has a lower K and UA.The summary can be drawn as ( 1) overall accuracy appears mediocre or fails in the face of the SCDD for the imbalanced data; and (2) two classifier-independent error matrices demonstrate that the SCDDs are, indeed, fixed by unchangeable splitting samples.Meanwhile, they seem worse and more unstable compared to the classifier-dependent matrices, which may be disturbed in the presence of more uncertainty.The correct rate (see Figure 10, and Table 4) is a custom performance measure, which is used for validation.Here, the true labels used are a subset of the positive samples drawn from a merged set for the "sa" and "sb".This measure is a simple attempt to repeat and verify the previous analysis.
Figure 10.Accuracy metrics derived from the confusion matrix.Here, the postfix "t" means taking the test set as the true labels, while the "a" and "b" mean taking the reference results as the true labels, respectively.The CRa and CRb are the correct rates of the classification results in comparison with the true labels for the "sa" and "sb".Here, the postfix "t" means taking the test set as the true labels, while the "a" and "b" mean taking the reference results as the true labels, respectively.The CRa and CRb are the correct rates of the classification results in comparison with the true labels for the "sa" and "sb".Here, the FN and FP are the false negative rate and false positive rate, respectively.Signs "+" and "-" are the rating directions for judging the ascending or descending order.Here, the bold codes mean the inferior single-class data descriptors (SCDDs).Note that the accuracy metrics derived from confusion matrix for the "sb" have not been included herein.

Selection Criteria
Although sufficient statistical analyses have been conducted, it is not known how to recognize which SCDD looks good.In fact, it is not easy to determine which is a fine, or even the best in the face of so many SCDDs with multiple performance measures.Therefore, we want to set up a handful of naive selection criteria to achieve such a goal by means of a rank board (see Table 4).For this work, more empirical selection criteria are adopted.Intrinsically, most of the statistical metrics are derived from the basic errors (i.e., the true positive, the true negative, the false positive, and the false negative).The derived measures (i.e., the precision, recall, F 1 , AUC, OA, KC, PA, and UA) could be quantitatively analyzed with actions such as rating and scoring.Note that the AUC measure appears mediocre herein.The OA may not be a reliable metric for the real performance of the SCDD in this study, because it yields misleading results supposing the training data are imbalanced when the numbers of observations in different classes vary greatly.The ROC curve and cost graph are used for supporting numerical indicators, which are especially suitable for classification problems in which there are only two classes (i.e., the positive and negative classes).The limitation [70] of both ROC analysis and cost curve is the lack of any effective method to show the performance results obtained from several different data sets in a single plot.This difficulty follows the fact that only two dimensions are used to present the performance of a single data set.
It is important to realize the optimal selection criteria for the hybrid classifiers, such as comparing the performance of an ensemble classifier with a member classifier, which is also presented in this study.The fixed combination strategies or so-called rules (i.e., the mean rule, median rule, and voting rule), are more likely to obtain better classification results, just as the inferior classifiers will reduce the whole performance.In particular, the time taken will be an insufferable problem.It is crucial to address the question of under what criteria does one classifier outperform another.Additionally, a decision needs to be made to determine which classifier should be selected over others.That is if, given the current operating conditions, a set of selection criteria can be derived.It is often easy, by varying the parameter setting, such as a threshold or the variables of the mathematical model, or by varying the class distribution in the training set, to create a whole set of SCDDs.One commonly used selection criterion is to select the SCDD whose parameter settings and training conditions most closely agree with the current operating conditions, which is called the performance-independent criterion [75].This is the reason why we try to fix all irrelevant conditions prior to developing the performance-dependent selection criteria.A plain criterion is to choose the qualified SCDDs regardless of their training conditions or parameter settings.

Scoring Model
For SCDDs with multiple performance measures, it would be expected for them to be scored.Then, there is always one possibility that all SCDDs can be quantitatively evaluated and scored.Consequently, a score-oriented method is presented here to clarify this concern.In this study, we put forward a kind of scoreboard on the basis of the rank board to give each SCDD an explicit score so that we can determine which SCDD is optimal.
According to Table 5, we use the rows (M.) to denote the measures and the columns (C.) to represent the different SCDDs.Signs are used to identify the metric belonging to which group: "-" denotes the error metric (i.e., the smaller the better), while "+" is the accuracy metric (i.e., the greater the better).The score variable Sc j is calculated by the following equation: or and where x i represents the measures in the ith row, x j represents SCDDs in the jth column, and x ij denotes the measured value of the jth SCDD with ith metric.The s i represents the scores in the ith row, the s j represents the scores in the jth column, and the s ij denotes the score value of the jth SCDD with the ith metric.Sc j represents the total score of the jth SCDD.For simplicity, we assume that there are five SCDDs and five measures herein to facilitate the illustration of the scoreboard and the derivation of Equations ( 2)-( 4).The n is a key scale to slice a certain metric for all SCDDs so that each SCDD can be assigned a normalized float value (ranging from 0 to n) associated with this metric.In the end, the gross score of every SCDD can be obtained and plotted by performing the summation by column.Figure 11 illustrates that two inferior SCDDs, i.e., the c5 is underestimated, and the c9 is overestimated, which are prominently identified.Meanwhile, two slightly inferior SCDDs, i.e., the c2 and c6, are apt to be observed again. and where xi represents the measures in the ith row, xj represents SCDDs in the jth column, and xij denotes the measured value of the jth SCDD with ith metric.The si represents the scores in the ith row, the sj represents the scores in the jth column, and the sij denotes the score value of the jth SCDD with the ith metric.Scj represents the total score of the jth SCDD.For simplicity, we assume that there are five SCDDs and five measures herein to facilitate the illustration of the scoreboard and the derivation of Equations ( 2)-( 4).The n is a key scale to slice a certain metric for all SCDDs so that each SCDD can be assigned a normalized float value (ranging from 0 to n) associated with this metric.In the end, the gross score of every SCDD can be obtained and plotted by performing the summation by column.Figure 11 illustrates that two inferior SCDDs, i.e., the c5 is underestimated, and the c9 is overestimated, which are prominently identified.Meanwhile, two slightly inferior SCDDs, i.e., the c2 and c6, are apt to be observed again.contributions and provide the fruitful information for studies on the quality assurance of the production of TCM, precision farming, the construction of agro-ecosystems, sustainable development, and the protection of biodiversity of Panax notoginseng.Special concerns and limitations of this study can be summarized as follows: This work utilizes a manually-collected set of samples of the target class and grid-constraint uniformly-collected negative samples.The uncertainty exists that a few possible land-cover classes may be left out, even though the classification results look rather good.
Thirteen SCDDs are employed and compared, however, there may be many other algorithms and their variants.Anyhow, the available ones have been used in this study.
The comparison with the different SCDDs does not judge them to be good or not.Actually, we wish to extend the ability of SCDDs to achieve the expected experiences in a straightforward way to find the optimal approach to monitoring the plant pattern changes of Panax notoginseng.
The class imbalance is a non-negligible problem in terms of a real specific land-cover classification using SCDDs.We strive for trying to observe what influences it would cause, and find two mediocre-appearing measures, i.e., the OA and AUC.
The selection criteria and scoring model are presented to determine the optimal SCDD which is outstanding and deserves attention.
The division of the site-specific error matrices by discriminating if the SCDD is dependent on the training set or not provides a more comprehensive approach to assess the final results.
The combination of SCDDs which are taken as the base classifiers can reduce the error or improve the accuracy.However, lower computational efficiency would be an annoying problem.Additionally, the pruned ensembles can give better performance.Some classification accuracies may not be the reliable indicators, particularly if the training data are imbalanced [41,77].As single-class data description is a special type of one-class classification, there are difficulties that may exist when trying to fit a single-class learner using the positive samples only.If SCDDs are trained with the samples of the single target class, then only the sensitivity can be estimated.There is a possibility that using only the sensitivity to fine-tune an algorithm may result in a class descriptor with high sensitivity but low specificity and overestimating the true extension of the class of interest [41].In terms of the scope of this study, the introduction of single-class data description regarding remote-sensing mapping of Panax notoginseng fields based on P-learning, which provides us the new insights to promote the development of the resource inventory and dynamic monitoring of Panax notoginseng.

Figure 1 .
Figure 1.Panax notoginseng is a well-known geoherb, which has acquired a very favorable reputation for the treatment of blood disorders, including blood stasis, bleeding, and blood deficiency.Its root can be turned to powder as a medicinal material, and the shadenet cover can be observed from space by means of satellite imagery.

Figure 3 .
Figure 3. Shade house.Sub-figures are the snapshots and photos in the different phases of the construction of a shade house of Panax notoginseng including the materials for building, i.e., sticks(5 cm × 2 m), upright rods, and black plastic net over the poles; viewpoints for the observation, i.e., parallel to bed, perpendicular to bed, at close range, over a long distance, and from a satellite image.

Figure 3 .
Figure 3. Shade house.Sub-figures are the snapshots and photos in the different phases of the construction of a shade house of Panax notoginseng including the materials for building, i.e., sticks (5 cm × 2 m), upright rods, and black plastic net over the poles; viewpoints for the observation, i.e., parallel to bed, perpendicular to bed, at close range, over a long distance, and from a satellite image.

Figure 5 .
Figure 5. Reference results.(a) a regular subset, (b) the overlap, (c) an irregular subset of Landsat-8 OLI image, and (d) the reference result for the "sa"; (e) a regular subset, (f) the overlap, (g) an irregular subset of Landsat-8 OLI image, and (h) the reference result for the "sb".

Figure 5 .
Figure 5. Reference results.(a) a regular subset, (b) the overlap, (c) an irregular subset of Landsat-8 OLI image, and (d) the reference result for the "sa"; (e) a regular subset, (f) the overlap, (g) an irregular subset of Landsat-8 OLI image, and (h) the reference result for the "sb".

Figure 5 .
Figure 5. Reference results.(a) a regular subset, (b) the overlap, (c) an irregular subset of Landsat-8 OLI image, and (d) the reference result for the "sa"; (e) a regular subset, (f) the overlap, (g) an irregular subset of Landsat-8 OLI image, and (h) the reference result for the "sb".

Figure 10 .
Figure10.Accuracy metrics derived from the confusion matrix.Here, the postfix "t" means taking the test set as the true labels, while the "a" and "b" mean taking the reference results as the true labels, respectively.The CRa and CRb are the correct rates of the classification results in comparison with the true labels for the "sa" and "sb".

Table 5 .
Score table.Here, we assume five SCDDs with five metrics.

Table 5 .
Score table.Here, we assume five SCDDs with five metrics.