This article is- freely available
- re-usable
Article
A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest
1
School of Chemical Engineering, Dalian University of Technology, Dalian, Liaoning 116012, China
2
Center of Bioinformatics, Northwest A&F University, Yangling, Shaanxi 712100, China
* Author to whom correspondence should be addressed.
Received: 13 December 2010; in revised form: 10 February 2011 / Accepted: 11 February 2011 / Published: 21 February 2011
Abstract: Experimental pEC50s for 216 selective respiratory syncytial virus (RSV) inhibitors are used to develop classification models as a potential screening tool for a large library of target compounds. Variable selection algorithm coupled with random forests (VS-RF) is used to extract the physicochemical features most relevant to the RSV inhibition. Based on the selected small set of descriptors, four other widely used approaches, i.e., support vector machine (SVM), Gaussian process (GP), linear discriminant analysis (LDA) and k nearest neighbors (kNN) routines are also employed and compared with the VS-RF method in terms of several of rigorous evaluation criteria. The obtained results indicate that the VS-RF model is a powerful tool for classification of RSV inhibitors, producing the highest overall accuracy of 94.34% for the external prediction set, which significantly outperforms the other four methods with the average accuracy of 80.66%. The proposed model with excellent prediction capacity from internal to external quality should be important for screening and optimization of potential RSV inhibitors prior to chemical synthesis in drug development.
Keywords: RSV; variable selection; Mold2 descriptors; random forest
Supplementary Files
- Supplementary File 1:
Table S1
(DOC, 406 KB)
- Supplementary File 2:
Table S2
(XLS, 46 KB)
- Supplementary File 3:
Table S3
(DOC, 351 KB)
Article Statistics
Click here to load and display the download statistics.
Notes: Multiple requests from the same IP address are counted as one view.
Cite This Article
MDPI and ACS Style
Hao, M.; Li, Y.; Wang, Y.; Zhang, S. A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest. Int. J. Mol. Sci. 2011, 12, 1259-1280.
AMA Style
Hao M, Li Y, Wang Y, Zhang S. A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest. International Journal of Molecular Sciences. 2011; 12(2):1259-1280.
Chicago/Turabian Style
Hao, Ming; Li, Yan; Wang, Yonghua; Zhang, Shuwei. 2011. "A Classification Study of Respiratory Syncytial Virus (RSV) Inhibitors by Variable Selection with Random Forest." Int. J. Mol. Sci. 12, no. 2: 1259-1280.