Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)
AbstractRediscovery of known natural products hinders the discovery of new, unique scaffolds. Efforts have mostly focused on streamlining the determination of what compounds are known vs. unknown (dereplication), but an alternative strategy is to focus on what is different. Utilizing statistics and assuming that common actinobacterial metabolites are likely known, focus can be shifted away from dereplication and towards discovery. LC-MS-based principal component analysis (PCA) provides a perfect tool to distinguish unique vs. common metabolites, but the variability inherent within natural products leads to datasets that do not fit ideal standards. To simplify the analysis of PCA models, we developed a script that identifies only those masses or molecules that are unique to each strain within a group, thereby greatly reducing the number of data points to be inspected manually. Since the script is written in R, it facilitates integration with other metabolomics workflows and supports automated mass matching to databases such as Antibase. View Full-Text
- Supplementary File 1:
Supplementary (PDF, 391 KB)
Share & Cite This Article
Chanana, S.; Thomas, C.S.; Braun, D.R.; Hou, Y.; Wyche, T.P.; Bugni, T.S. Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR). Metabolites 2017, 7, 34.
Chanana S, Thomas CS, Braun DR, Hou Y, Wyche TP, Bugni TS. Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR). Metabolites. 2017; 7(3):34.Chicago/Turabian Style
Chanana, Shaurya; Thomas, Chris S.; Braun, Doug R.; Hou, Yanpeng; Wyche, Thomas P.; Bugni, Tim S. 2017. "Natural Product Discovery Using Planes of Principal Component Analysis in R (PoPCAR)." Metabolites 7, no. 3: 34.
Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.