Synthesis, Crystal Structure, and Biological Evaluation of Novel 5-Hydroxymethylpyrimidines

Pyrimidine displays a wide array of bioactivities and is therefore still considered a potent unit in new drug research. Its derivative, 5-hydroxymethylpyrimidine, can be found as a scaffold of nontypical nitrogen bases in DNA and as a core of some natural bioactive compounds. In this study, we obtained a series of 5-hydroxymethylpyrimidines that vary in the 4-position by the reduction of the corresponding esters. All compounds were characterized by spectroscopic analysis, and single-crystal X-ray diffraction was performed for some of them. Biological investigations estimated cytotoxic properties against normal (RPTEC) and cancer (HeLa, HepaRG, Caco-2, AGS, A172) cell lines. It was found that the derivatives with an aliphatic amino group at the 4-position are generally less toxic to normal cells than those with a benzylsulfanyl group. Moreover, compounds with bulky constituents exhibit better anticancer properties, though at a moderate level. The compounds with the most promising IC50 values were chosen for an in silico study. Furthermore, antimicrobial activity tests were performed against six strains of bacteria and one fungus. They demonstrated that only derivatives whose amino group at the 4-position carries a chain of at least three carbon atoms have weak antibacterial properties, and only the derivative with a 4-benzylsulfanyl constituent exhibits any antifungal action.


Introduction
The pyrimidine ring is a well-known and established constituent of many synthetic drugs commonly used in medicine that demonstrate a variety of pharmacological activities, especially antimicrobial (e.g., trimethoprim, pyrimethamine, zidovudine, flucytosine) and antineoplastic (e.g., fluorouracil, gemcitabine, cytarabine) [1]. It is still among the leading compounds investigated by medicinal chemists for practical clinical applications of its derivatives [2]. It is also ubiquitous in nature due to the presence of nitrogenous bases in DNA and RNA.
The vast majority of pyrimidines present in nucleic acids are cytosine, thymine, and uracil. Other pyrimidine bases occur rarely and constitute only a small fraction of nucleobases [3]. Modifications in the structures of nucleobases include the process of methylation at the 5-position of the pyrimidine ring and further oxidation of this group by TET (ten-eleven translocation) family proteins [4] to form 5-hydroxymethylcytosine (hmC) (Figure 1a) [5] or 5-hydroxymethyluracil (5-hmU) (Figure 1b) [6]. The function of the mentioned nucleobases is still unclear, but they are thought to regulate gene expression or prompt DNA demethylation [7]. The role of TET proteins in cancer formation was [...]
The 5-hydroxymethylpyrimidine scaffold can also be found in a naturally occurring antibiotic, bacimethrin (Figure 1c), isolated from Bacillus megaterium [10]. The structural resemblance between bacimethrin and toxopyrimidine (Figure 1d), which is the pyrimidine part of vitamin B1 (thiamine), lies behind its mechanism of action that involves its conversion to 2′-methoxy-thiamin pyrophosphate. It may act as a thiamin pyrophosphate-dependent enzyme or gene inhibitor [11]. The antibiotic and its analogs exhibit antibacterial and anticancer properties [12].
In the continuation of our foregoing research, we present the results of the cytotoxic and antimicrobial study of a series of 5-hydroxymethylpyrimidines that vary in the 4-position and a comparison of their structural properties and biological activity.
All melting points were uncorrected and determined by the open capillary method with an Electrothermal IA9100 melting point apparatus. NMR spectra were recorded using a Bruker ARX 300 MHz NMR spectrometer. The abbreviations used in NMR spectra are: s-singlet, d-doublet, t-triplet, q-quartet, and m-multiplet. IR spectra were recorded with a Thermo Scientific USA Nicolet iS50 FTIR using the ATR technique. MS spectra were recorded with a Bruker Daltonic Compact using the ESI technique.

General Procedure for the Preparation of 2a and 2b
Ethyl 4-methyl-2-phenyl-6-sulfanylpyrimidine-5-carboxylate (1a) (2.74 g, 10 mmol) [15] was dissolved in a mixture of methanol (20 mL) and NaOH (0.40 g, 10 mmol), and the appropriate benzyl chloride (11 mmol) was added. The reaction mixture was stirred with a magnetic stirrer for 3 h at ambient temperature. The white solid was then filtered off, washed with methanol (10 mL) and water (10 mL), and dried at atmospheric pressure. Its purity was monitored by TLC using chloroform as eluent.

Preparation of Compounds 2c and 2d-2j
Ethyl 4-amino-6-methyl-2-phenylpyrimidine-5-carboxylate (2c)
Ethyl 4-hydroxy-6-methyl-2-phenylpyrimidine-5-carboxylate (1b) (2.58 g, 10 mmol) [16] was placed in a round bottom flask, and 10 mL of POCl3 was added. The mixture was refluxed for 3 h, poured slowly into 100 mL of icy water, and extracted three times with 25 mL of CHCl3. The extracts were combined, dried with anhydrous MgSO4 for 30 min, and concentrated with a rotary evaporator. Then the solution of the crude product in 10 mL of THF was saturated with ammonia for 3 h, poured into 100 mL of 2% HCl, and extracted three times with 25 mL of CHCl3. The extracts were combined and dried with 5 g of anhydrous MgSO4 for 30 min. The drying agent was filtered off, and the solvent was removed with a rotary evaporator. The crude product was purified by column chromatography on silica gel using CHCl3 as eluent and dried at atmospheric pressure. The purity of the product was verified by TLC using chloroform as eluent. Other methods for obtaining the compound were also reported [17].

General Procedure for the Preparation of 2d-2j
Ethyl 4-hydroxy-6-methyl-2-phenylpyrimidine-5-carboxylate (1b) (2.58 g, 10 mmol) [16] was placed in a round bottom flask, and 10 mL of POCl3 was added. The mixture was refluxed for 3 h, poured slowly into 100 mL of icy water, and extracted three times with 25 mL of CHCl3. The extracts were combined, dried with anhydrous MgSO4 for 30 min, and concentrated with a rotary evaporator. Then the crude product was dissolved in a mixture of 10 mL of methanol and 5 mL of triethylamine, and the appropriate primary alkyl amine (12 mmol) was added. The reaction mixture was stirred at room temperature for 24 h. After this time, the precipitate was filtered off, washed with 5 mL of cold methanol, and dried at atmospheric pressure. The crude product was purified by column chromatography on silica gel using chloroform as eluent. The purity of the product was verified by TLC using chloroform as eluent.

General Procedure for the Preparation of 3a-3h
First, 2 mmol of the appropriate ester 2 was dissolved in 20 mL of THF. Then, the mixture was cooled to 0 °C, and LiAlH4 (0.19 g, 5 mmol) was added gradually in small quantities. After 1 h, 25 mL of CHCl3 was added; then the mixture was poured into icy water (100 mL) and extracted three times with CHCl3 (50 mL). The extracts were combined and dried with 2 g of anhydrous MgSO4 for 30 min. The drying agent was filtered off, and the solvent was removed with a rotary evaporator. The crude product was crystallized from methanol. The purity of the product was monitored by TLC using a mixture of chloroform and ethyl ether (3:1; v/v) as eluent.
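The reagent quantities quoted in the procedures above pair a mass with a molar amount. As a quick consistency check, the conversion can be sketched as follows. The elemental compositions are assumptions written out by hand from the compound names, and the atomic masses are rounded standard values; the function names are illustrative only.

```python
# Rounded standard atomic masses in g/mol (sufficient for bench-scale checks).
ATOMIC_MASS = {
    "H": 1.008, "Li": 6.94, "C": 12.011, "N": 14.007,
    "O": 15.999, "Na": 22.990, "Al": 26.982, "S": 32.06,
}

def molar_mass(composition):
    """Molar mass (g/mol) from a dict mapping element symbol to atom count."""
    return sum(ATOMIC_MASS[element] * count for element, count in composition.items())

def millimoles(mass_g, composition):
    """Amount of substance in mmol for a weighed mass in grams."""
    return 1000.0 * mass_g / molar_mass(composition)

# Compositions derived by hand from the names in the text (assumption).
LIALH4 = {"Li": 1, "Al": 1, "H": 4}
NAOH = {"Na": 1, "O": 1, "H": 1}
ESTER_1A = {"C": 14, "H": 14, "N": 2, "O": 2, "S": 1}  # ethyl 4-methyl-2-phenyl-6-sulfanylpyrimidine-5-carboxylate
ESTER_1B = {"C": 14, "H": 14, "N": 2, "O": 3}          # ethyl 4-hydroxy-6-methyl-2-phenylpyrimidine-5-carboxylate

# Quantities as quoted in the procedures: (mass in g, stated amount in mmol).
QUOTED = {
    "LiAlH4": (0.19, 5, LIALH4),
    "NaOH": (0.40, 10, NAOH),
    "1a": (2.74, 10, ESTER_1A),
    "1b": (2.58, 10, ESTER_1B),
}
```

Calling `millimoles(mass, composition)` for each entry of `QUOTED` should return a value close to the stated amount, within the rounding of the quoted masses.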

X-ray Structural Studies
Compounds 3c and 3e-3h were dissolved in hot methanol or acetone, and their crystals were obtained by slow evaporation of the solvent at room temperature. Crystals suitable for X-ray diffraction analysis appeared after 1-5 days. Single-crystal X-ray diffraction data were collected with Mo-Kα (λ = 0.71073 Å) (for 3c, 3e, and 3g-3h) or Cu-Kα (λ = 1.5418 Å) (for 3f) radiation and ω-scan modes using automated four-circle diffractometers with CCD detectors: Kuma KM4 (for 3c and 3h), Xcalibur R (for 3e and 3g), and Xcalibur PX (for 3f). The data were measured at 200(2) K (for 3c) or 100(2) K (for 3e-3h) using an Oxford Cryosystems open-flow nitrogen cryostat. Selected crystallographic data and structure refinement details for compounds 3c and 3e-3h are provided in Table S1 in the Supplementary Materials. The CrysAlisPro software package [18] was used for data collection, cell refinement, data reduction, and analysis. The data of 3f were corrected for absorption by the analytical method. The crystal structures were solved by direct methods using SHELXS-97 [19] and refined by a full-matrix least-squares technique on F² with SHELXL-2013 (and later also with SHELXL-2018) [20]. All non-H atoms were refined with anisotropic displacement parameters. During the refinement of 3h, one of the two crystallographically independent molecules exhibited disorder over two positions related by a pseudo mirror plane (denoted as B and C), which were refined with equal site occupation factors of 0.5. The SAME instruction was used in the refinement procedure to restrain the geometrical parameters of the disordered molecule of 3h (equivalent bond distances and angles). Additionally, the two atoms C41C and C43C were constrained with the EADP instruction and refined with the same fractional coordinates. All H atoms were initially found in the difference Fourier maps and refined isotropically.
In the final refinement cycles, they were placed in geometrically calculated positions and refined isotropically using a riding model, with C-H = 0.95-0.99 Å and Uiso(H) = 1.2Ueq(C) for CH and CH2 or Uiso(H) = 1.5Ueq(C) for CH3, N-H = 0.88 Å and O-H = 0.84 Å, and Uiso(H) = 1.2Ueq(N) and 1.5Ueq(O), respectively. All figures were prepared using the DIAMOND program [21]. Analysis of the intra- and intermolecular interactions was performed using the PLATON program [22].

Chemicals
All reagents for cell culture, unless otherwise stated, were purchased from Merck (Darmstadt, Germany). Stock solutions were made 100x concentrated in pure DMSO (dimethyl sulfoxide) (Chempur, Poland) and freshly diluted in appropriate complete cell culture medium on the experiment day. Cell cultures were acquired from the European Collection of Authenticated Cell Cultures (ECACC). Reagents for microbiology assays were purchased from Oxoid (Basingstoke, UK).

Neutral Red Uptake Assay
The cells were seeded in 96-well plates (1 × 10⁵ cells per mL) in the appropriate culture medium (100 µL per well). Before each experiment, cells were allowed to attach for 24 h and then treated with various concentrations (500, 250, 100, and 10 µM) of pyrimidine derivatives and subsequently grown in 200 µL medium volume. Controls with 1% DMSO, culture medium, and 1 µM staurosporine were run simultaneously with the cultures treated with pyrimidine derivatives. The cells were incubated for 24, 48, and 72 h at 37 °C in a humidified 5% CO2/95% air incubator. The neutral red uptake assay was performed according to Repetto et al. [23]. This method uses the ability of living cells to accumulate the dye in their lysosomes. Briefly, at the end of each incubation time the cells were treated with neutral red solution (40 µg mL⁻¹) for 2 h at 37 °C. Afterwards, the cells were washed with PBS, and the incorporated dye was solubilized with the extraction solution (50% ethanol, 49% H2O, 1% glacial acetic acid). The plates were gently shaken for 30 min at 37 °C. Finally, the absorbance was measured at 540 nm using a Thermo Scientific™ Multiskan™ GO Microplate Spectrophotometer (Thermo Fisher).

Microbiology
Antimicrobial activity assays were performed on seven reference strains obtained from the ATCC collection (Acinetobacter baumannii, Candida albicans, Enterococcus faecalis, Escherichia coli, Klebsiella pneumoniae, Pseudomonas aeruginosa, methicillin-resistant Staphylococcus aureus (MRSA)). Microorganisms were propagated on Tryptone Soy Agar (TSA) plates. Bacteria were incubated at 37 °C, and Candida albicans was incubated at 25 °C. After 24 h incubation, microorganisms were diluted in Tryptone Soy Broth (TSB) (bacteria to 0.005 MF and C. albicans to 0.025 MF) and seeded on 96-well plates, which already contained the tested compounds at final concentrations from 256 µg mL⁻¹ to 0.5 µg mL⁻¹. Every plate included controls: TSB with strain, TSB with strain and solvent (1% DMSO), and TSB with strain and appropriate antibiotic. Microplates were incubated on a horizontal shaker at 500 rpm at 37 ± 1 °C for 24 h, and absorbance was measured at 580 nm using the Thermo Scientific™ Multiskan™ GO Microplate Spectrophotometer (Thermo Fisher). Subsequently, 50 µL aliquots of 5 µL of 1% aqueous solution of 2,3,5-triphenyltetrazolium chloride (TTC) were added to each well to check the viability of microorganisms. Plates were inspected visually for color change after 24 h incubation.
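The text gives only the end points of the tested concentration range (256 to 0.5 µg mL⁻¹). A common way to span such a range is a twofold serial dilution; the sketch below assumes that scheme, which is not stated in the text, and simply lists the resulting concentrations.

```python
def twofold_series(top, bottom):
    """Concentrations from `top` down to `bottom`, halving at each step.

    The twofold step is an assumption; the text states only the end points.
    """
    if top <= 0 or bottom <= 0 or bottom > top:
        raise ValueError("expected 0 < bottom <= top")
    series = []
    concentration = float(top)
    while concentration >= bottom:
        series.append(concentration)
        concentration /= 2.0
    return series

# Final well concentrations in µg/mL under the twofold assumption.
CONCENTRATIONS = twofold_series(256, 0.5)
```

Under this assumption the range corresponds to ten wells per compound, which fits within one row of a 96-well plate together with the controls.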

Statistical Analysis
The viability tests were conducted in quadruple wells (n = 4) for each condition. Values are presented as mean ± standard deviation (SD). Significant differences were calculated using one-way ANOVA, followed by post hoc comparison, and the half-maximal inhibitory concentration (IC50) was obtained by nonlinear regression using GraphPad Prism 9.1 for Windows, GraphPad Software, La Jolla, California, USA, www.graphpad.com. A p-value < 0.05 was considered statistically significant. All results were compared with the control with 1% DMSO, which was considered to be 100%.
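The normalization to the 1% DMSO control and a rough IC50 estimate can be sketched as below. This is not the nonlinear regression performed in GraphPad Prism: the IC50 helper uses a plain interpolation on a log-concentration axis between the two doses that bracket 50% viability, as a simpler stand-in. Function names and any input values are hypothetical.

```python
import math
from statistics import mean, stdev

def percent_viability(treated_abs, control_abs):
    """Mean and SD of treated wells, expressed as % of the mean solvent control."""
    reference = mean(control_abs)
    values = [100.0 * a / reference for a in treated_abs]
    return mean(values), stdev(values)

def ic50_log_interpolation(concentrations, viabilities):
    """Estimate the concentration giving 50% viability.

    Interpolates linearly in log10(concentration) between the two adjacent
    doses whose viabilities bracket 50%. Returns None when 50% is never
    crossed, i.e. when an IC50 cannot be estimated from the tested range.
    """
    points = sorted(zip(concentrations, viabilities))
    for (c_lo, v_lo), (c_hi, v_hi) in zip(points, points[1:]):
        if v_lo != v_hi and (v_lo - 50.0) * (v_hi - 50.0) <= 0:
            fraction = (v_lo - 50.0) / (v_lo - v_hi)
            log_ic50 = math.log10(c_lo) + fraction * (math.log10(c_hi) - math.log10(c_lo))
            return 10.0 ** log_ic50
    return None
```

Returning `None` mirrors the situation described for compounds that did not reduce viability enough for an IC50 to be calculated.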

In Silico Analysis
For the analysis, compounds 3a, 3h, and 3g were chosen due to their half-maximal inhibitory concentration (IC50) measured after 72 h incubation with cancerous cell lines.
2.4.1. ADME Analysis
ADME analysis of compounds 3a, 3g, and 3h was performed via SwissADME [24], a freely available tool provided by the Swiss Institute of Bioinformatics. The software provides estimated predictors such as basic physicochemical properties, lipophilicity, water solubility, pharmacokinetics, druglikeness, and medicinal chemistry (Table 4). Physicochemical properties are relevant to the crossing of biological barriers. Lipophilicity is very important for pharmacokinetics in drug discovery. Orally active drugs should have no more than 5 hydrogen bond donors, no more than 10 hydrogen bond acceptors, a molecular weight of less than 500 daltons, and an n-octanol/water partition coefficient (log P) of less than 5 [25]. Water solubility is connected to the oral administration of the drug and is a major determinant of absorption. Being a substrate of the permeability glycoprotein is a predictor of pharmacokinetics. Knowledge about the interaction of the compound with cytochromes P450 gives information about effective drug elimination through metabolic biotransformation. The last predictor, medicinal chemistry, supports structural drug discovery. It informs about problematic structural fragments inside the investigated compound.
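The oral-drug criteria listed above amount to four threshold checks. A minimal sketch is given below, using the commonly quoted limit of at most 10 hydrogen bond acceptors; the descriptor values would come from a tool such as SwissADME, and the class and function names here are hypothetical.

```python
from dataclasses import dataclass

@dataclass
class Descriptors:
    mol_weight: float   # g/mol
    log_p: float        # n-octanol/water partition coefficient
    h_donors: int       # hydrogen bond donors
    h_acceptors: int    # hydrogen bond acceptors

def rule_of_five_violations(d):
    """List which of the four criteria a compound fails.

    Limits follow the wording of the text: molecular weight and log P must
    be below their limits, donor and acceptor counts must not exceed theirs.
    Some formulations count a value exactly at the limit as acceptable.
    """
    violations = []
    if d.h_donors > 5:
        violations.append("more than 5 hydrogen bond donors")
    if d.h_acceptors > 10:
        violations.append("more than 10 hydrogen bond acceptors")
    if d.mol_weight >= 500.0:
        violations.append("molecular weight not below 500 Da")
    if d.log_p >= 5.0:
        violations.append("log P not below 5")
    return violations
```

An empty list means that the compound passes all four checks; the rule is usually read as tolerating at most one violation.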

Molecular Docking Analysis
Estimation of probable macromolecular targets was performed using SwissTargetPrediction [26], choosing Homo sapiens as the target species. SwissTargetPrediction is a freely available web tool provided by the Swiss Institute of Bioinformatics.
The structures of the predicted protein targets were downloaded from the Protein Data Bank (PDB). The inhibitors and water molecules were removed from the complexes. The pH value for the protonation of both the ligand (compounds 3a, 3g, and 3h) and the protein was set to 7.4 using PDB2PQR [27]. The input files were prepared using AutoDockTools [28]. The search space of each protein was defined with the "center on hetero" option, that is, a box was placed on the geometric center of an existing ligand. The molecular docking was performed using AutoDock Vina [29] for each target listed in Table 5, presented in Section 3.4.
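Placing the search box on the geometric center of an existing ligand can be sketched as follows. The parser reads the fixed-width coordinate columns of PDB HETATM records; `center_*` and `size_*` are AutoDock Vina configuration keys. The 20 Å box edge and the residue name `LIG` are placeholders, since the text gives neither.

```python
def ligand_center(pdb_lines, resname):
    """Geometric center (unweighted mean) of all HETATM atoms of one residue name."""
    coords = []
    for line in pdb_lines:
        # PDB fixed columns: residue name 18-20, x 31-38, y 39-46, z 47-54.
        if line.startswith("HETATM") and line[17:20].strip() == resname:
            coords.append((float(line[30:38]), float(line[38:46]), float(line[46:54])))
    if not coords:
        raise ValueError(f"no HETATM records found for residue {resname!r}")
    n = len(coords)
    return tuple(sum(c[i] for c in coords) / n for i in range(3))

def vina_box_lines(center, size=(20.0, 20.0, 20.0)):
    """Config lines for the Vina search box; the default size is a placeholder."""
    axes = ("x", "y", "z")
    lines = [f"center_{a} = {c:.3f}" for a, c in zip(axes, center)]
    lines += [f"size_{a} = {s:.1f}" for a, s in zip(axes, size)]
    return lines

def hetatm_line(serial, name, resname, x, y, z):
    """Build a minimal fixed-width HETATM record (for demonstration only)."""
    return f"HETATM{serial:5d} {name:<4} {resname:>3} A{1:4d}    {x:8.3f}{y:8.3f}{z:8.3f}"

DEMO = [
    hetatm_line(1, "C1", "LIG", 0.0, 0.0, 0.0),
    hetatm_line(2, "C2", "LIG", 2.0, 4.0, 6.0),
    hetatm_line(3, "O", "HOH", 9.0, 9.0, 9.0),  # water: ignored by the filter
]
```

The center must be computed before the co-crystallized ligand is stripped from the structure, because the box position depends on its coordinates.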

Chemistry
The substrates 1a and 1b were prepared as described previously [15,16]. The benzylsulfanyl esters 2a and 2b were obtained using the method from our previous publication (Figure 2) [14]. In short, substrate 1a was dissolved in a methanolic solution of NaOH and coupled with the appropriate benzyl chloride. The precipitate of ester was washed and dried at atmospheric pressure. Aminoesters were obtained by the reaction of the chlorination product of substrate 1b with gaseous ammonia in THF (2c) or with the appropriate alkyl amines in a mixture of triethylamine and methanol (2d-2j) (Figure 3) and purified via column chromatography using CHCl3 as eluent. Final products 3a-3j were obtained by reduction of esters 2a-2j with LiAlH4 in THF (Figure 4) and purified by column chromatography (3a and 3b) or crystallization (3c-3j). Using the presented set of reactions, we obtained a series of 5-hydroxymethylpyrimidines that vary at the 4-position. Structural variations of the compounds were applied to draw conclusions about the structure-activity relationship.

Crystal Structures of Compounds 3c and 3e-3h
Compounds 3c and 3e-3g crystallize in the common P2₁/c space group of the monoclinic system with Z = 4 and one crystallographically independent molecule in the asymmetric unit (Figure 5a-d). In contrast, compound 3h crystallizes in the noncentrosymmetric orthorhombic Pca2₁ space group, with Z = 8, and comprises two crystallographically independent molecules in the asymmetric unit: one ordered molecule (denoted as A) (Figure 5e) and a second molecule disordered over two positions related by a pseudo mirror plane (hereafter referred to as B and C). Selected crystallographic data and structure refinements are presented in Table S1, and a comparison of selected geometrical parameters for compounds 3c and 3e-3h is given in Table S2 in the Supplementary Materials of this paper.

All molecules of compounds 3c and 3e-3h contain the same 5-hydroxymethyl-2-phenylpyrimidine core and vary at the 4-position, containing amino (3c), propylamino (3e), allylamino (3f), isopropylamino (3g), and tert-butylamino (3h) groups. All molecules adopt a conformation with a planar pyrimidine ring (r.m.s. deviation range of 0.008-0.020 Å) and the substituent in the 2-position twisted relative to it by 26.4(2)° in 3c, 9.5(2)° in 3e, 10.9(2)° in 3f, 15.7(2)° in 3g, and 18.5-20.6(2)° in 3h. In the secondary amines, the heterocyclic N3 atom and the C41 atom of the amino group are located nearly on the same side of the molecule, as shown by the N3-C4-N4-C41 torsion angle (−6.8(2)°, 17.1(2)°, 0.4(2)°, and −12(3)° in 3e, 3f, 3g, and 3h; see Table S2) [...] 3c, 3e, 3f, 3g, and molecule A (or positions B and C of the second molecule) of 3h, respectively. Except for 3f, the orientation of the amino and hydroxyl groups relative to each other is stabilized by the intramolecular N4−H···O1 hydrogen bond (see Table 1), which closes a six-membered ring with the S(6) motif [30].
In the crystal structures, except for 3f, each molecule is connected via O-H···N hydrogen bonds (Table 1), between the hydroxyl O1 (as donors) and the pyrimidine N1 atoms (as acceptors), to form a C(6) chain along the c-axis. Representative one-dimensional chains formed through intermolecular O-H···N interactions in the crystal structures of 3c and 3g are presented in Figure 6. In the case of 3f, such O-H···N hydrogen bond interactions lead to the formation of a dimer. These dimers interact further via intermolecular N-H···O hydrogen bonds, between the amine N4 (as donors) and the hydroxyl O1 atoms (as acceptors), resulting in a ribbon of molecules running along the c-axis direction (Figure 7). In 3c, a 2D hydrogen-bonded network is created by the connection of neighboring chains through N−H···N hydrogen-bonding interactions, involving the amine N4 (as donors) and the pyrimidine N3 atoms (as acceptors) (see Figure S61). In all cases, the crystal structures are stabilized by weak C-H···π and aromatic π-π stacking interactions. For 3g, additional stabilization is provided by weak C-H···O hydrogen bonding interactions, involving the isopropyl (as donors) and hydroxyl (as acceptors) groups (Table 1).
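Torsion angles such as N3-C4-N4-C41 are computed from the positions of four consecutive atoms. A minimal sketch of that calculation is shown below; it expects Cartesian coordinates in Å (fractional crystal coordinates would first have to be converted with the unit cell), and the function name is illustrative.

```python
import math

def _sub(a, b):
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])

def _dot(a, b):
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]

def _cross(a, b):
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])

def torsion_angle(p0, p1, p2, p3):
    """Signed torsion angle p0-p1-p2-p3 in degrees, in the range (-180, 180]."""
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)   # normal of the plane p0-p1-p2
    n2 = _cross(b2, b3)   # normal of the plane p1-p2-p3
    y = math.sqrt(_dot(b2, b2)) * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))
```

A value near 0° places the two outer atoms on the same side of the central bond, which is the situation described for N3 and C41, while a value near ±60° corresponds to a gauche arrangement.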

Cytotoxic and Antiproliferative Effect
Ten pyrimidine derivatives, 3a-3j, were tested for cytotoxic properties on the RPTEC cell line with the neutral red uptake assay. Results are shown in Figure 8. Compounds 3b (500 µM = 34%, 250 µM = 46%, 100 µM = 44%, 10 µM = 91%) and 3j (500 µM = 16%, 250 µM = 70%, 100 µM = 28%, 10 µM = 89%) decreased the cell viability at lower concentrations (100 and 10 µM) and, therefore, were excluded from further analysis on cancerous cell lines and microorganisms.
In further research, pyrimidine derivatives were tested for their anticancer activity. Compounds in a wide concentration range (10-250 µM) were applied to the cells. Viability and proliferative abilities were tested with the neutral red uptake assay and assessed by comparison with the solvent control (1% DMSO). Calculated IC50 values for all tested compounds are presented in Table 2. The highest cytotoxicity was found after HepaRG treatment with 3h (132.3 µM). This cell line was found to be the most susceptible to the tested compounds, as three of the eight pyrimidine derivatives decreased cell viability to a level sufficient to calculate IC50.

ADME Analysis
For the in silico analysis, compounds 3a, 3h, and 3g were chosen due to their favorable IC50 values measured after 72 h incubation with cancer cell lines. ADME analysis was performed with regard to the six predictors described in Section 2.4.1 [24]. In Table 4, specific details for the chosen compounds are presented.
The consensus value of the partition coefficient between n-octanol and water (log Po/w) is 2.68.
The consensus value of the partition coefficient between n-octanol and water (log Po/w) is 2.92.

Water solubility
Water solubility was estimated by three predictors. The value of log S (ESOL) [38] is −4.47, which classifies the compound as moderately soluble. The predicted value of solubility is 1.03 × 10⁻² mg/mL. The value of log S (Ali) [39] is −4.97, which also classifies the compound as moderately soluble. The predicted value of solubility is 3.30 × 10⁻³ mg/mL. The value of log S (SILICOS-IT) [25] is −7.00, which classifies the compound as poorly soluble. The predicted value of solubility is 3.09 × 10⁻⁵ mg/mL.
The value of log S (ESOL) is −3.56, which classifies the compound as soluble. The predicted value of solubility is 7.08 × 10⁻² mg/mL. The value of log S (Ali) is −3.94, which also classifies the compound as soluble. The predicted value of solubility is 2.99 × 10⁻² mg/mL. The value of log S (SILICOS-IT) is −5.26, which classifies the compound as moderately soluble. The predicted value of solubility is 1.40 × 10⁻³ mg/mL.
The value of log S (ESOL) is −3.74, which classifies the compound as soluble. The predicted value of solubility is 4.97 × 10⁻² mg/mL. The value of log S (Ali) is −4.12, which classifies the compound as moderately soluble. The predicted value of solubility is 2.05 × 10⁻² mg/mL. The value of log S (SILICOS-IT) is −5.65, which also classifies the compound as moderately soluble. The predicted value of solubility is 6.09 × 10⁻⁴ mg/mL.
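The relation between log S and the quoted mass solubilities can be sketched as follows, taking log S as the base-10 logarithm of the molar solubility in mol/L. The class boundaries follow the qualitative scale used by SwissADME as far as it can be inferred from the classifications quoted above. The molar mass in the usage note is an assumption calculated by hand for an isopropylamino derivative of formula C15H19N3O, not a value given in the text.

```python
def solubility_mg_per_ml(log_s, molar_mass):
    """Convert log S (log10 of mol/L) to mg/mL; mol/L x g/mol = g/L = mg/mL."""
    return (10.0 ** log_s) * molar_mass

def solubility_class(log_s):
    """Qualitative class on the log S scale (boundaries at -10, -6, -4, -2, 0)."""
    if log_s < -10:
        return "insoluble"
    if log_s < -6:
        return "poorly soluble"
    if log_s < -4:
        return "moderately soluble"
    if log_s < -2:
        return "soluble"
    if log_s < 0:
        return "very soluble"
    return "highly soluble"
```

For example, `solubility_mg_per_ml(-3.56, 257.33)` gives roughly 7.1 × 10⁻² mg/mL, in line with the ESOL figure quoted for the second compound; small differences arise because the printed log S values are rounded.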

Pharmacokinetics
One of the estimated predictors relates to the skin permeability coefficient (Kp) [40]. The more negative Kp is, the less permeant a molecule is. The predicted Kp value of compound 3a is −5.50 cm/s. Another predictor is the interaction of a molecule with cytochromes P450 [41,42]. The inhibition of five isoforms, CYP1A2, CYP2C19, CYP2C9, CYP2D6, and CYP3A4, may cause pharmacokinetics-related drug-drug interactions. Compound 3a is predicted to be an inhibitor of all five enzyme isoforms.
The predicted Kp value of compound 3g is −5.70 cm/s. Compound 3g is predicted to be an inhibitor of CYP1A2 and CYP2D6.
The predicted Kp value of compound 3h is −5.66 cm/s. Compound 3h is predicted to be an inhibitor of CYP1A2, CYP2D6, and CYP3A4.

Druglikeness
This predictor estimates the chance of a compound being an oral drug. The SwissADME software relies on five different predictors, originally used by major pharmaceutical companies aiming to improve the quality of their chemical substances: the Lipinski (Pfizer) rule of five [43], Ghose (Amgen) [44], Veber (GSK) [45], Egan (Pharmacia) [46], and Muegge (Bayer) [47]. According to all of the predictors, compound 3a is predicted to have a chance to be an oral drug.
According to all of the five predictors, compound 3g is predicted to have a chance to be an oral drug.
According to all of the five predictors, compound 3h is predicted to have a chance to be an oral drug.

Medicinal chemistry
Two complementary pattern recognition methods allow for the identification of potentially problematic fragments: pan-assay interference compounds (PAINS) [48] and Brenk structural alerts [49]. Zero predicted alerts assist in creating a good druglike molecule.
Compound 3g has zero predicted problematic structural fragments.
Compound 3h has zero predicted problematic structural fragments.

Molecular Docking Analysis
The binding pose and affinity between a ligand and an enzyme are very important pieces of information for computer-aided drug design. In the initial stage of a drug discovery project, this information is often obtained by using molecular docking methods. To conduct an effective docking procedure, some conditions have to be fulfilled. The structure of a target protein extracted from X-ray crystallography gives the possibility to omit protein structural changes during the process of binding the same ligand. According to Jones et al. [50], the resolution of the protein-ligand complex should be below 2.5 Å. Poor resolution structures result in more incorrect conformations of the ligand generated. Moreover, the complex should have only one ligand in the active site. The results of the target prediction are shown in Table 5. The docking was performed for each target from Table 5. The results are presented for one target-the one for which the docking results were better. The results of docking present a prediction of the ligand binding modes, classified by the scoring function as the best. The proteins are presented as "surface", and each ligand is presented as lines (Figures 9-11).
Table 5 (recovered rows; columns: target class, target, PDB ID).
Family A G protein-coupled receptor; Adenosine A1 receptor; 5UEN
3h
Family A G protein-coupled receptor; Adenosine A1 receptor; 5UEN
Family A G protein-coupled receptor; Adenosine A2a receptor; 5IU4

Figure 10. Compound 3g in a phosphodiesterase 4B binding site. Yellow dashes indicate hydrogen bonds with amino acids in the binding site (colored red). The presented ligand pose was top-scored by the scoring function and obtained the lowest RMSD value (1.015 Å) after docking in AutoDock Vina.
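The RMSD quoted in the caption measures how far a docked pose lies from a reference pose. The source does not state the reference pose or the exact procedure, so the following is only a minimal sketch of the standard definition, assuming both poses list the same atoms in the same order and share one coordinate frame (no superposition is applied). The function name is hypothetical.

```python
import math
from typing import Sequence, Tuple

Point = Tuple[float, float, float]


def rmsd(pose_a: Sequence[Point], pose_b: Sequence[Point]) -> float:
    """Root-mean-square deviation between two poses with matched atom order.

    Assumption: coordinates are in the same frame and in the same units
    (e.g., angstroms); no alignment step is performed.
    """
    if len(pose_a) != len(pose_b) or len(pose_a) == 0:
        raise ValueError("poses must be non-empty and have the same number of atoms")
    squared_sum = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(pose_a, pose_b):
        squared_sum += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(squared_sum / len(pose_a))
```

If every atom of a pose is displaced by exactly 1 Å relative to the reference, the function returns 1.0; symmetry-corrected variants, which matter for ligands with equivalent atoms, are not covered by this sketch.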

Discussion
The starting esters were obtained by the reaction of 4-sulfanylpyrimidine (1a) with the appropriate benzyl chloride in a methanolic solution of NaOH (2a and 2b) or by coupling the product of chlorination of 1b with gaseous ammonia (2c) or aliphatic amines (2d-2j). Their reduction yielded a series of 10 new 5-hydroxymethylpyrimidines (3a-3j), which were the subject of further research. The structures of the obtained compounds were established by spectroscopic techniques, and for several of them they agreed with the solid-state structures obtained from X-ray measurements.
The molecular structures of 3c and 3e-3h were studied by single-crystal X-ray diffraction. Compounds 3c and 3e-3g crystallize in the space group P2₁/c with one molecule in the asymmetric unit, whereas compound 3h crystallizes in the space group Pca2₁ with two molecules in the asymmetric unit. The preferred gauche orientation of the hydroxyl O1 and the pyrimidine C4 atoms was confirmed by the values of the O1-C51-C5-C4 torsion angle of 49.0-64.5(2)°. Such a conformation is stabilized by an intramolecular N-H···O interaction, which closes a six-membered ring with an S(6) motif, with the exception of 3f. It seems that the smaller the absolute value of the O1-C51-C5-C4 torsion angle is, the stronger the intramolecular N-H···O bond becomes. A similar synthon (S(6)), which is formed by an intramolecular interaction of another type (N-H···N), can be found in previously studied crystal structures of 2-phenylpyrimidine-4-amine derivatives [13,51]. Generally, in all structures, a similar hydrogen-bonded network can be observed, which involves mainly -OH moieties as donors (in intermolecular interactions) and acceptors (in intramolecular interactions, except for 3f). Despite the similarity, it should also be mentioned that the structure of 3g is subtly different. In the case of 3g, the amino group is nearly coplanar with the pyrimidine ring, as shown by the N3-C4-N4-C41 torsion angle of 0.4(2)°, and additional stabilization of the crystal packing is also provided by weak C-H···O hydrogen bonding interactions involving the isopropyl and hydroxyl groups.
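A torsion angle such as O1-C51-C5-C4 is computed from the Cartesian coordinates of the four atoms. The sketch below uses the standard atan2 formulation of the dihedral angle; it is not the crystallographic software used in this work, the function names are hypothetical, and the coordinates in the usage note are artificial test points rather than atomic positions from the reported structures.

```python
import math
from typing import Tuple

Vec = Tuple[float, float, float]


def _sub(a: Vec, b: Vec) -> Vec:
    return (a[0] - b[0], a[1] - b[1], a[2] - b[2])


def _dot(a: Vec, b: Vec) -> float:
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def _cross(a: Vec, b: Vec) -> Vec:
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def torsion_angle_deg(p0: Vec, p1: Vec, p2: Vec, p3: Vec) -> float:
    """Signed torsion angle p0-p1-p2-p3 in degrees, in the range [-180, 180]."""
    b1 = _sub(p1, p0)
    b2 = _sub(p2, p1)
    b3 = _sub(p3, p2)
    n1 = _cross(b1, b2)          # normal to the plane p0-p1-p2
    n2 = _cross(b2, b3)          # normal to the plane p1-p2-p3
    b2_len = math.sqrt(_dot(b2, b2))
    y = b2_len * _dot(b1, n2)
    x = _dot(n1, n2)
    return math.degrees(math.atan2(y, x))
```

With the test points (1, 0, 0), (0, 0, 0), (0, 0, 1), and (0.5, √3/2, 1), the function returns approximately 60°, i.e., a gauche arrangement comparable in magnitude to the 49.0-64.5° range reported above; values near 0° and 180° would correspond to cis and trans arrangements, respectively.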
All the obtained 5-hydroxymethylpyrimidines were tested for their cytotoxicity against normal cells on the RPTEC cell line with a neutral red uptake assay. Generally, pyrimidines possessing a 4-benzylsulfanyl group (3a and 3b) exhibit stronger toxicity than their amino analogues (3c-3j). Considering our previous observation [14], it appears that the substitution in the phenyl ring of the benzyl group, with both electron-withdrawing (4-Cl) and electron-donating groups (2-CH3, 3b), enhances their toxic properties. Compound 3b decreased the cell viability at low concentrations and, for this reason, was excluded from further biological investigations.
Pyrimidines with primary or secondary amino groups at the 4-position (3c-3j) show relatively low toxicity in a wide range of concentrations, with the exception of compound 3j. It seems that the substitution of the primary amino group of compound 3c with n-alkyls (ethyl-3d, n-propyl-3e), unsaturated alkyl (allyl-3f), branched alkyls (isopropyl-3g, t-butyl-3h), or hydroxyalkyl groups (2-hydroxyethyl-3i) exerts an insignificant impact on their cytotoxicity. A substantial increase in toxicity can be seen in the case of compound 3j, which possesses a sizable cyclohexyl substituent at the amino group and, because of that, was excluded from further tests.
The anticancer investigation showed that only three out of eight tested compounds demonstrated antineoplastic properties. Derivative 3a with a 4-benzylsulfanyl group exhibited low toxicity to HepaRG and AGS cell lines. Among derivatives with 4-amino groups, only two exhibited moderate toxicity, namely, those with bulky substituents such as isopropyl (3g) and t-butyl (3h) groups. At the same time, 3h was found to reach the lowest IC50 among tested compounds (132.2 µM). On the other hand, 3g demonstrated the broadest range of toxicity against cancer cell lines (HeLa, HepaRG, A172). As mentioned above, HepaRG was the line most susceptible to the tested compounds. It is also noteworthy that the substitution of the primary amino group of 3c with various alkyl groups does not affect their anticancer properties, and the enhancement is evident only in the case of branched substituents.
The compounds selected for ADME analysis are predicted to have similar physicochemical properties, with comparable molecular weights, numbers of rotatable bonds and hydrogen bond donors/acceptors, water solubility, druglikeness, and medicinal chemistry properties. Regarding lipophilicity, the partition coefficient of all the investigated compounds was less than 5, and 3g has the lowest partition coefficient between n-octanol and water. Moreover, 3a has the best predicted pharmacokinetic properties.
The docking of compound 3a to the phosphodiesterase 10A binding site (…
The tested compounds exhibited weak antimicrobial properties. Only 3a with a 4-benzylsulfanyl group demonstrated any antifungal activity (MIC against C. albicans = 256 µg/mL), but it had no antibacterial features. Among the derivatives with a 4-amino group, compounds with chains shorter than three carbons exhibited no antimicrobial properties (3c with a primary amino group, 3d with ethyl, and 3i with 2-hydroxyethyl groups). The extension of the carbon chain slightly enhances the antibacterial potency of the remainder. The lowest MIC was observed in the case of 3g with an isopropyl substituent against E. coli (MIC = 128 µg/mL). A. baumannii was the most sensitive bacterium to the tested compounds.

Conclusions
Based on our previous research, we obtained a series of novel 5-hydroxymethylpyrimidines, 3a-3j, which vary in the 4-position, by the reduction of the appropriate esters, 2a-2j, with LiAlH4.
The compounds were characterized by spectroscopic methods, and some of them (3c and 3e-3h) also by X-ray crystallography.
Except for 3h, which crystallized in the Pca2₁ space group, the other studied compounds crystallized in the P2₁/c space group. The preferred gauche orientation of the -OH group with respect to the pyrimidine C4 atom is stabilized by an intramolecular N-H···O hydrogen bond, with the exception of 3f. On the whole, the hydrogen-bonded network involves primarily -OH moieties as donors (in intermolecular interactions) and acceptors (in intramolecular interactions, except for 3f). Furthermore, in the crystal packing of 3g, weak C-H···O hydrogen bonding interactions involving the isopropyl and hydroxyl groups are also observed.
The cytotoxicity test against normal cells on the RPTEC cell line showed that compounds with benzylsulfanyl groups at the 4-position are generally more toxic than those with amino groups at the 4-position, except for 3j, which possesses a sizable cyclohexyl group. Thus, 2 out of 10 compounds were excluded from further investigation: 3b, with a 2-methylbenzylsulfanyl group at the 4-position, and 3j, mentioned above.
We examined the anticancer activity of the compounds on various cancer cell lines, including HeLa, HepaRG, Caco-2, AGS, and A172, and their antimicrobial properties on six bacterial strains and one strain of fungi. On the whole, the compounds showed weak biological activity. Compound 3g exhibited the broadest range of toxicity against cancer cell lines (HeLa, HepaRG, A172), while 3h reached the lowest IC50 among tested compounds (132.2 µM against HepaRG).
Furthermore, in silico analysis indicated that the ligands, which interact with the selected enzymes used as docking targets, might have moderate therapeutic activity. The tested compounds have similar binding affinities; nevertheless, the docking of 3g in the enzyme binding site showed the lowest RMSD value.
It is worth mentioning that 5-hydroxymethylpyrimidines that possess bulky amino constituents at the 4-position (3g-isopropyl and 3h-t-butyl) showed the best anticancer and antibacterial properties among tested derivatives, which can be a valuable foothold for further research.