Open Access Article
Molecules 2010, 15(7), 5031-5044; doi:10.3390/molecules15075031

Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays

Research Center for Drug Discovery, School of Pharmaceutical Sciences, Sun Yat-Sen University, 132 East Circle at University City, Guangzhou, 510006, China
Authors to whom correspondence should be addressed.
Received: 2 July 2010 / Revised: 14 July 2010 / Accepted: 19 July 2010 / Published: 23 July 2010


The quality of diverse compound selection depends mainly on the clustering algorithm, the descriptors, the way the descriptors are combined, and the similarity metric. For compound library diversity analysis, the Jarvis-Patrick algorithm is a well-accepted clustering algorithm, and MDL search keys and Daylight fingerprints are well-accepted structure descriptors. Based on 288 experiments selecting compounds with various descriptor combinations, we have found that: (1) hybrid Daylight and MDL structural descriptors can produce worse results in diversity analyses; (2) selections based purely on 2,048-bit Daylight fingerprints yield better results than those based purely on 166-bit MDL search keys; (3) when Daylight fingerprints and MDL search keys are combined, it is better to compute the two similarities independently and then take the smaller value as the outcome, which yields better average separation of clusters; (4) regarding the consistency of different clustering approaches, clustering based on Daylight fingerprints is more consistent with the SCA approach than with the approach based on MDL search keys; (5) selection based on MDL search keys tends to pick a greater number of compounds from larger clusters. When the Daylight fingerprint is folded two and three times, respectively, information is lost, and selection likewise tends to pick a greater number of compounds from larger clusters. To our knowledge, these results have not been reported before.
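Findings (3) and (5) can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the Tanimoto coefficient as the similarity metric (the abstract does not name one), represents a fingerprint as the set of indices of its "on" bits, and assumes that folding ORs the upper half of the fingerprint onto the lower half. The function names and the toy bit patterns are hypothetical.

```python
from typing import FrozenSet

# A fingerprint is modelled as the set of indices of its "on" bits
# (an illustrative representation, not a Daylight or MDL file format).
Fingerprint = FrozenSet[int]


def tanimoto(a: Fingerprint, b: Fingerprint) -> float:
    """Tanimoto coefficient: shared on-bits / on-bits in either fingerprint."""
    if not a and not b:
        return 1.0  # assumed convention for two empty fingerprints
    common = len(a & b)
    return common / (len(a) + len(b) - common)


def combined_similarity(daylight_a: Fingerprint, daylight_b: Fingerprint,
                        mdl_a: Fingerprint, mdl_b: Fingerprint) -> float:
    """Compute the two similarities independently and keep the smaller one."""
    return min(tanimoto(daylight_a, daylight_b), tanimoto(mdl_a, mdl_b))


def fold(fp: Fingerprint, n_bits: int) -> Fingerprint:
    """Fold once: OR the upper half of an n_bits fingerprint onto the lower half."""
    half = n_bits // 2
    return frozenset(i % half for i in fp)


# Toy data: two molecules, each with a 2,048-bit and a 166-bit fingerprint.
daylight_a = frozenset({1, 5, 9, 1030})
daylight_b = frozenset({1, 5, 9, 6})
mdl_a = frozenset({2, 3})
mdl_b = frozenset({2, 3, 4, 7})

sim_daylight = tanimoto(daylight_a, daylight_b)   # 3 shared / 5 total = 0.6
sim_mdl = tanimoto(mdl_a, mdl_b)                  # 2 shared / 4 total = 0.5
sim_combined = combined_similarity(daylight_a, daylight_b, mdl_a, mdl_b)  # 0.5

# After one fold to 1,024 bits, bit 1030 collides with bit 6, so the two
# fingerprints become identical and the molecules look more alike than they are.
sim_folded = tanimoto(fold(daylight_a, 2048), fold(daylight_b, 2048))     # 1.0
```

Taking the minimum means two compounds count as similar only when both descriptors agree, which is one plausible reading of why the separation of clusters improves. The folded case shows how bit collisions can inflate similarity and merge compounds into larger clusters.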
Keywords: diversity; cheminformatics; chemical library; compound selection
This is an open access article distributed under the Creative Commons Attribution License (CC BY 3.0).


Gu, Q.; Xu, J.; Gu, L. Selecting Diversified Compounds to Build a Tangible Library for Biological and Biochemical Assays. Molecules 2010, 15, 5031-5044.


Molecules EISSN 1420-3049 Published by MDPI AG, Basel, Switzerland