This article is
- freely available
Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays
Research Center for Drug Discovery, School of Pharmaceutical Sciences, Sun Yat-Sen University, 132 East Circle at University City, Guangzhou, 510006, China
* Authors to whom correspondence should be addressed.
Received: 2 July 2010; in revised form: 14 July 2010 / Accepted: 19 July 2010 / Published: 23 July 2010
Abstract: The quality of diverse compound selection mainly depends on cluster algorithms, descriptors, the combinations of the descriptors, and similarity metrics. The Jarvis-Patrick algorithm, MDL search keys, and Daylight fingerprints are a well accepted algorithm and structure descriptors for compound library diversity analysis. Based upon our 288 experiments on selecting compounds from various descriptor combinations, we have found (1) hybrid Daylight and MDL structural descriptors for diversity analyses can produce worse results; (2) selections based purely on 2,048-bit Daylight fingerprints yield better results than the ones based purely on MDL 166-bit search keys; (3) when Daylight fingerprints and MDL search keys are combined, it is better to compute the similarities independently, then to take the smaller value for the outcome. This will yield better average separation of clusters; (4) regarding the consistency of different clustering approaches, the Daylight fingerprints based clustering is more consistent with the SCA approach than it does with the MDL search keys based approach; (5) The MDL search keys based selection approach tends to select a greater number of compounds from larger clusters. As the Daylight fingerprint is folded two and three times, respectively, information is lost, and this approach tends to select a greater number of compounds from larger clusters as well. These results have not been reported before to our knowledge.
Keywords: diversity; cheminformatics; chemical library; compound selection
Article StatisticsClick here to load and display the download statistics.
Notes: Multiple requests from the same IP address are counted as one view.
Cite This Article
MDPI and ACS Style
Gu, Q.; Xu, J.; Gu, L. Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays. Molecules 2010, 15, 5031-5044.
Gu Q, Xu J, Gu L. Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays. Molecules. 2010; 15(7):5031-5044.
Gu, Qiong; Xu, Jun; Gu, Lianquan. 2010. "Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays." Molecules 15, no. 7: 5031-5044.