Next Article in Journal
Datasets for Aspect-Based Sentiment Analysis in Bangla and Its Baseline Evaluation
Previous Article in Journal
Sigfox and LoRaWAN Datasets for Fingerprint Localization in Large Urban and Rural Areas
Article Menu

Export Article

Open AccessData Descriptor

RetroTransformDB: A Dataset of Generic Transforms for Retrosynthetic Analysis

Faculty of Chemistry, University of Plovidv “P. Hilendarski”, 24 Tsar Assen Str., 4000 Plovdiv, Bulgaria
Author to whom correspondence should be addressed.
Received: 28 March 2018 / Revised: 16 April 2018 / Accepted: 19 April 2018 / Published: 21 April 2018
Full-Text   |   PDF [666 KB, uploaded 3 May 2018]   |  


Presently, software tools for retrosynthetic analysis are widely used by organic, medicinal, and computational chemists. Rule-based systems extensively use collections of retro-reactions (transforms). While there are many public datasets with reactions in synthetic direction (usually non-generic reactions), there are no publicly-available databases with generic reactions in computer-readable format which can be used for the purposes of retrosynthetic analysis. Here we present RetroTransformDB—a dataset of transforms, compiled and coded in SMIRKS line notation by us. The collection is comprised of more than 100 records, with each one including the reaction name, SMIRKS linear notation, the functional group to be obtained, and the transform type classification. All SMIRKS transforms were tested syntactically, semantically, and from a chemical point of view in different software platforms. The overall dataset design and the retrosynthetic fitness were analyzed and curated by organic chemistry experts. The RetroTransformDB dataset may be used by open-source and commercial software packages, as well as chemoinformatics tools. View Full-Text
Keywords: transforms; retrosynthesis; SMIRKS transforms; retrosynthesis; SMIRKS

Graphical abstract

This is an open access article distributed under the Creative Commons Attribution License which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited (CC BY 4.0).

Share & Cite This Article

MDPI and ACS Style

Avramova, S.; Kochev, N.; Angelov, P. RetroTransformDB: A Dataset of Generic Transforms for Retrosynthetic Analysis. Data 2018, 3, 14.

Show more citation formats Show less citations formats

Note that from the first issue of 2016, MDPI journals use article numbers instead of page numbers. See further details here.

Article Metrics

Article Access Statistics



[Return to top]
Data EISSN 2306-5729 Published by MDPI AG, Basel, Switzerland RSS E-Mail Table of Contents Alert
Back to Top