<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
<front>
<journal-meta>
<journal-id journal-id-type="publisher-id">ijms</journal-id>
<journal-title>International Journal of Molecular Sciences</journal-title>
<abbrev-journal-title>Int. J. Mol. Sci.</abbrev-journal-title>
<issn pub-type="epub">1422-0067</issn>
<publisher>
<publisher-name>Molecular Diversity Preservation International (MDPI)</publisher-name></publisher></journal-meta>
<article-meta>
<article-id pub-id-type="doi">10.3390/ijms13045207</article-id>
<article-id pub-id-type="publisher-id">ijms-13-05207</article-id>
<article-categories>
<subj-group>
<subject>Article</subject></subj-group></article-categories>
<title-group>
<article-title>Poisson Parameters of Antimicrobial Activity: A Quantitative Structure-Activity Approach</article-title></title-group>
<contrib-group>
<contrib contrib-type="author">
<name><surname>Sestraş</surname><given-names>Radu E.</given-names></name><xref ref-type="aff" rid="af1-ijms-13-05207">1</xref></contrib>
<contrib contrib-type="author">
<name><surname>Jäntschi</surname><given-names>Lorentz</given-names></name><xref ref-type="aff" rid="af1-ijms-13-05207">1</xref><xref ref-type="aff" rid="af2-ijms-13-05207">2</xref><xref ref-type="corresp" rid="c1-ijms-13-05207">*</xref></contrib>
<contrib contrib-type="author">
<name><surname>Bolboacă</surname><given-names>Sorana D.</given-names></name><xref ref-type="aff" rid="af3-ijms-13-05207">3</xref></contrib></contrib-group>
<aff id="af1-ijms-13-05207">
<label>1</label>University of Agricultural Science and Veterinary Medicine Cluj-Napoca, 3-5 Mănăştur, Cluj-Napoca 400372, Romania; E-Mail: <email>rsestras@usamvcluj.ro</email></aff>
<aff id="af2-ijms-13-05207">
<label>2</label>Technical University of Cluj-Napoca, 28 Memorandumului, Cluj-Napoca 400114, Romania</aff>
<aff id="af3-ijms-13-05207">
<label>3</label>Department of Medical Informatics and Biostatistics, “Iuliu Haţieganu” University of Medicine and Pharmacy Cluj-Napoca, 6 Louis Pasteur, Cluj-Napoca 400349, Cluj, Romania; E-Mail: <email>sbolboaca@umfcluj.ro</email></aff>
<author-notes>
<corresp id="c1-ijms-13-05207">
<label>*</label>Author to whom correspondence should be addressed; E-Mail: <email>lorentz.jantschi@gmail.com</email>; Tel.: +4-0264-401775; Fax: +4-0264-401768.</corresp></author-notes>
<pub-date pub-type="collection">
<year>2012</year></pub-date>
<pub-date pub-type="epub">
<day>24</day>
<month>4</month>
<year>2012</year></pub-date>
<volume>13</volume>
<issue>4</issue>
<fpage>5207</fpage>
<lpage>5229</lpage>
<history>
<date date-type="received">
<day>22</day>
<month>3</month>
<year>2012</year></date>
<date date-type="rev-recd">
<day>17</day>
<month>4</month>
<year>2012</year></date>
<date date-type="accepted">
<day>19</day>
<month>4</month>
<year>2012</year></date></history>
<permissions>
<copyright-statement>© 2012 by the authors; licensee Molecular Diversity Preservation International, Basel, Switzerland.</copyright-statement>
<copyright-year>2012</copyright-year>
<license license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0">
<p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p></license></permissions>
<abstract>
<p>A contingency of observed antimicrobial activities measured for several compounds <italic>vs</italic>. a series of bacteria was analyzed. A factor analysis revealed the existence of a certain probability distribution function of the antimicrobial activity. A quantitative structure-activity relationship analysis for the overall antimicrobial ability was conducted using the population statistics associated with identified probability distribution function. The antimicrobial activity proved to follow the Poisson distribution if just one factor varies (such as chemical compound or bacteria). The Poisson parameter estimating antimicrobial effect, giving both mean and variance of the antimicrobial activity, was used to develop structure-activity models describing the effect of compounds on bacteria and fungi species. Two approaches were employed to obtain the models, and for every approach, a model was selected, further investigated and found to be statistically significant. The best predictive model for antimicrobial effect on bacteria and fungi species was identified using graphical representation of observed <italic>vs</italic>. calculated values as well as several predictive power parameters.</p></abstract>
<kwd-group>
<kwd>oils compounds</kwd>
<kwd>antimicrobial effect</kwd>
<kwd>bacteria and fungi species</kwd>
<kwd>probability distribution function</kwd>
<kwd>quantitative structure-activity relationship (QSAR)</kwd>
<kwd>multiple linear regression (MLR)</kwd></kwd-group></article-meta></front>
<body>
<sec sec-type="intro">
<title>1. Introduction</title>
<p>Plant extracts, including oils, have been used as therapeutics from ancient times and have been reinvented more often in the last years. Important medical effects of plant extracts have been identified during the time (antioxidant, antimicrobial [<xref ref-type="bibr" rid="b1-ijms-13-05207">1</xref>–<xref ref-type="bibr" rid="b4-ijms-13-05207">4</xref>]) and some mechanisms of actions were investigated [<xref ref-type="bibr" rid="b5-ijms-13-05207">5</xref>–<xref ref-type="bibr" rid="b8-ijms-13-05207">8</xref>]. Research on plant extracts on specific symptoms and diseases is carried out all over the world [<xref ref-type="bibr" rid="b9-ijms-13-05207">9</xref>–<xref ref-type="bibr" rid="b11-ijms-13-05207">11</xref>]. New approaches are applied in drug industry in order to identify promising medicinal plant as source of new drugs and drug leads [<xref ref-type="bibr" rid="b12-ijms-13-05207">12</xref>] even if pharmaceutical companies significantly decreased their activities in natural product discovery during the past few decades [<xref ref-type="bibr" rid="b13-ijms-13-05207">13</xref>].</p>
<p>Quantitative Structure-Activity Relationships (QSARs) are mathematical models resulting from the application of different statistical approaches in correlation analyses of biologic activity and/or physical or chemical properties of active compounds with descriptors derived from structure and/or properties [<xref ref-type="bibr" rid="b14-ijms-13-05207">14</xref>]. Traditional strategies based on animal models are nowadays replaced by <italic>in silico</italic> approaches by moving the experiments into virtual laboratories [<xref ref-type="bibr" rid="b15-ijms-13-05207">15</xref>,<xref ref-type="bibr" rid="b16-ijms-13-05207">16</xref>]. These <italic>in silico</italic> approaches are sustained by the increased power of computers and are widely used due to low costs (no costs for compounds synthesize), possibility to investigate not synthesized compounds as well as possibility to investigate huge amount of promising chemicals. Different QSAR approaches demonstrated their effectiveness in drug design [<xref ref-type="bibr" rid="b17-ijms-13-05207">17</xref>,<xref ref-type="bibr" rid="b18-ijms-13-05207">18</xref>] and in screening of active compounds [<xref ref-type="bibr" rid="b19-ijms-13-05207">19</xref>,<xref ref-type="bibr" rid="b20-ijms-13-05207">20</xref>], also with regards to natural products [<xref ref-type="bibr" rid="b21-ijms-13-05207">21</xref>,<xref ref-type="bibr" rid="b22-ijms-13-05207">22</xref>]. Several methods like MARCH-INSIDE [<xref ref-type="bibr" rid="b23-ijms-13-05207">23</xref>,<xref ref-type="bibr" rid="b24-ijms-13-05207">24</xref>], TOPS-MODE [<xref ref-type="bibr" rid="b25-ijms-13-05207">25</xref>], and TOMO-COMD [<xref ref-type="bibr" rid="b26-ijms-13-05207">26</xref>] have been used in QSAR investigation of anti-bacterial drugs [<xref ref-type="bibr" rid="b27-ijms-13-05207">27</xref>,<xref ref-type="bibr" rid="b28-ijms-13-05207">28</xref>] (including anti-fungi [<xref ref-type="bibr" rid="b29-ijms-13-05207">29</xref>], anti-parasite [<xref ref-type="bibr" rid="b30-ijms-13-05207">30</xref>], and anti-viral drugs [<xref ref-type="bibr" rid="b31-ijms-13-05207">31</xref>]). The MARCH-INSIDE method was further integrated in the Bio-AIMS online platform and can be used as a prediction tool for new anti-microbial drugs or their protein targets [<xref ref-type="bibr" rid="b32-ijms-13-05207">32</xref>].</p>
<p>Jirovetz <italic>et al</italic>. investigated the antimicrobial effects of a series of oils components, oils and mixtures on gram-positive and -negative bacteria (<italic>Staphylococcus aureus</italic>, <italic>Enterococcus faecalis</italic>, <italic>Escherichia coli</italic>, <italic>Pseudomonas aeruginosa</italic>, <italic>Klebsiella pneumoniae</italic>, <italic>Proteus vulgaris</italic>, <italic>Salmonella</italic> sp.) and <italic>Candida albicans</italic> [<xref ref-type="bibr" rid="b33-ijms-13-05207">33</xref>]. In the present research we focused on two major objectives based on the experimental observations of Jirovetz <italic>et al</italic>. [<xref ref-type="bibr" rid="b33-ijms-13-05207">33</xref>]. The first objective was to identify the probability distribution function of the antimicrobial effects of compounds, oils and mixtures on above-presented bacteria and fungus species. Identification of the probability distribution function allows us to compute the population parameters, an overall estimator of the antimicrobial effect that comprises the antimicrobial potencies on different species in a single value. The second objective was to find the appropriate predictivity measures of quantitative structure-activity relationship using the context of the overall antimicrobial activity of 22 active compounds.</p></sec>
<sec sec-type="results">
<title>2. Results</title>
<sec sec-type="methods">
<title>2.1. Probability Distribution Analysis</title>
<p>The antimicrobial effects at contingency of compounds, oils and mixtures on bacteria were investigated to identify the probability distribution function along bacteria series. The Uniform distribution was rejected at the beginning of the analysis due to unreasonable estimates of the population parameters. The remained three discrete distributions were compared based on several agreements. The percentage of rejection according to Fisher’s Chi-Square global statistics for each identified probability distribution function according to the class (as compounds, oils, mixtures) is shown in <xref ref-type="fig" rid="f1-ijms-13-05207">Figure 1</xref> (detailed data can be found in <xref ref-type="supplementary-material" rid="s1-ijms-13-05207">Supplementary material</xref>). The following null hypothesis was tested using F-C-S statistic (F-C-S values in <xref ref-type="fig" rid="f1-ijms-13-05207">Figure 1</xref>): “The parameters of the identified distribution follow for each series of compound/oil/mixture the Binomial/NegBinomial/ Poisson distribution”.</p>
<p>Statistical parameters and estimates of the population properties under assumption of Poisson distribution are presented in <xref ref-type="table" rid="t1-ijms-13-05207">Table 1</xref>.</p>
<p>Assuming the Poisson distribution (as the F-C-S value from <xref ref-type="fig" rid="f1-ijms-13-05207">Figure 1</xref> allowed us to do), statistical parameter (λ) and population properties were computed for Citronellol (CID = 8842, with less than 5 observations, not included in verification of the Poisson distribution assumption-see <xref ref-type="supplementary-material" rid="s1-ijms-13-05207">Supplementary material</xref>) and the following results were obtained: λ = 14.5, Mode = 14, Mean = 14.500, Variance = 4.500, Standard Deviation = 3.808, Skewness = 0.263, Excess Kurtosis = 0.069, Median = 13.832.</p></sec>
<sec>
<title>2.2. QSAR Models</title>
<p>Two requirements were imposed in identification of the proper transformation of Poisson parameter λ: the absence of outliers and the presence of normality at a significance level of 5%. The global F-C-S distribution statistic indicated that the Poisson parameter more likely follows a Log-normal distribution (statistics: K–S = 0.1315; p<sub>K–S</sub> = 0.7948; A–D = 0.3874; Crit<sub>A–D5%</sub> = 2.5018 (critical values associated for Anderson-Darling test); C–S<sub>df = 2</sub> = 0.9403; p<sub>C–S</sub> = 0.6249).</p>
<p>The Eugenol compound was identified as outlier with Grubbs’ test (Z = 3.178, Z<sub>critical–5%</sub> = 2.7338). After natural logarithm transformation of the Poisson parameters, seen as an overall antimicrobial activity of investigated compounds, no other outlier was identified (the highest Z value was of 2.528; Z<sub>critical–5%</sub> = 2.758) and the normality hypothesis of the ln(λ) values could not be rejected (<italic>p</italic> &gt; 0.05). Further testing on ln(λ) under the normal distribution assumption gave no reason to reject the normality of the data in the training test (K–S = 0.14351, p<sub>K–S</sub> = 0.917; A–D = 0.37751, p<sub>A–D</sub> = 0.686; C–S = 0.62246, p<sub>C–S</sub> = 0.430; F–C–S = 1.307; p<sub>F–C–S</sub> = 0.727) nor in test set (K–S = 0.2301, p<sub>K–S</sub> = 0.779; A–D = 0.3860, p<sub>A–D</sub> = 0.679; F–C–S = 0.637; p<sub>F–C–S</sub> = 0.727).</p>
<sec>
<title>2.2.1. Based on DRAGON Descriptors</title>
<p>Sulfametrole (CID = 64939) proved to be influential in the model obtained based on Dragon descriptors (training set, <xref ref-type="fig" rid="f2-ijms-13-05207">Figure 2</xref>). Both Dragon descriptors proved to be higher than expected (h<sub>i–piID</sub> = 0.5643, h<sub>i–R3m+</sub> = 0.7602, where piID and R3m+ are Dragon descriptors) for Sulfametrole compound.</p>
<p>The overall correlation between Dragon descriptors obtained for whole data set (<italic>n</italic> = 21 compounds) was of 0.8461 (<italic>p</italic> &lt; 0.0001). Moreover, a statistically significant correlation was obtained between ln(λ) and R3m+ descriptor (<italic>r</italic> = 0.4800, <italic>p</italic> = 0.0220).</p>
<p>The results of regression analysis with Dragon descriptors provided the equation presented in <xref rid="FD1" ref-type="disp-formula">Equation(1)</xref> relating ln(λ) with compounds structure, after the withdrawal of Sulfametrole from the training set.</p>
<disp-formula id="FD1">
<label>(1)</label>
<mml:math id="mm1" display="block">
<mml:semantics id="sm1">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mover accent="true">
<mml:mtext>Y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>3.626</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>±</mml:mo>
<mml:mn>0.496</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.045</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>±</mml:mo>
<mml:mn>0.012</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>·</mml:mo>
<mml:mtext>piID</mml:mtext>
<mml:mo>+</mml:mo>
<mml:mn>18.569</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>±</mml:mo>
<mml:mn>19.404</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>·</mml:mo>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>12</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.8970</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>Adj</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.8741</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>39</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>3.62</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>5</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>se</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.1037</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>intercept</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>4.86</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>8</mml:mn></mml:mrow></mml:msup>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>piID</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1.28</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>5</mml:mn></mml:mrow></mml:msup>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.058</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>piID</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.776</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>VIF</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>piID</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>VIF</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1.305</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>piID</mml:mtext>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.9183</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>2.50</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>5</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.2410</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.4505</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>piID</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.4833</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.1114</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>loo</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.8452</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>loo</mml:mtext></mml:mrow></mml:msub>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>24</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2.35</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>4</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>se</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>loo</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.1276</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.6518</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>11</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2.16</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>2</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>piID</mml:mtext>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.0869</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.8241</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.2410</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.0024</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>piID</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>R</mml:mtext>
<mml:mn>3</mml:mn>
<mml:mtext>m</mml:mtext>
<mml:mo>+</mml:mo>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.3469</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.3604</mml:mn>
<mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:semantics></mml:math></disp-formula>
<p>where Ŷ = ln(λ) estimated by <xref rid="FD1" ref-type="disp-formula">Equation(1)</xref>; <italic>R</italic><sup>2</sup> = determination coefficient; TR = training set; loo = leave-one-out analysis; TS = test set; <italic>Ext</italic> = external set; <italic>R</italic><sup>2</sup><sub>Adj</sub> = adjusted determination coefficient; <italic>F</italic> = <italic>F</italic>-value (from ANOVA table); <italic>p</italic> = <italic>p</italic>-value associated to <italic>F</italic>-value; <italic>se</italic> = standard error of estimate; Dragon descriptors: piID = conventional bond order ID number-walk and path counts; R3m+ = R maximal autocorrelation of lag 3/weighted by mass GETAWAY descriptors; <italic>T</italic> = Tolerance; VIF = Variance Inflation Factor; <italic>R</italic> = correlation coefficient.</p>
<p>The abilities in estimation (training set) and prediction (test set) of the model from <xref rid="FD1" ref-type="disp-formula">Equation(1)</xref> are presented in <xref ref-type="fig" rid="f3-ijms-13-05207">Figure 3</xref>. No statistically significant difference could be identified when the goodness-of-fit was compared in training set and test set for the model presented in <xref rid="FD1" ref-type="disp-formula">Equation (1)</xref> (<italic>Z</italic> = 0.3590, <italic>p</italic> = 0.3598).</p></sec>
<sec>
<title>2.2.2. Based on SAPF Descriptors</title>
<p>No leverage was identified when the SAPF descriptors were investigated (<xref ref-type="fig" rid="f4-ijms-13-05207">Figure 4</xref>).</p>
<p>The overall correlation between SAPF descriptors obtained for whole data set (<italic>n</italic> = 22 compounds) was of 0.4800 (<italic>p</italic> = 0.0238). Moreover, a statistically significant correlation was obtained between ln(λ) and LSSIIETD descriptor (<italic>r</italic> = −0.5249, <italic>p</italic> = 0.0122).</p>
<p>The results of regression analysis with SAPF descriptors relating ln(λ) with compounds structure by using the entire training set is presented in <xref rid="FD2" ref-type="disp-formula">Equation(2)</xref>.</p>
<disp-formula id="FD2">
<label>(2)</label>
<mml:math id="mm2" display="block">
<mml:semantics id="sm2">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mover accent="true">
<mml:mtext>Y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>=</mml:mo>
<mml:mn>3.858</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>±</mml:mo>
<mml:mn>0.502</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>+</mml:mo>
<mml:mn>0.398</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>±</mml:mo>
<mml:mn>0.189</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>·</mml:mo>
<mml:mrow>
<mml:mtext>QSMHIMGP</mml:mtext>
<mml:mo>-</mml:mo></mml:mrow>
<mml:mn>0.149</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mo>±</mml:mo>
<mml:mn>0.048</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>·</mml:mo>
<mml:mtext>LSSIIETD</mml:mtext></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>13</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.8286</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>Adj</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.7944</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>24</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>1.48</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>4</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>se</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.1419</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>intercept</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>9.66</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>9</mml:mn></mml:mrow></mml:msup>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>QSMHIMGP</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>8.37</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>4</mml:mn></mml:mrow></mml:msup>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>p</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>LSSIIETD</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>3.93</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>5</mml:mn></mml:mrow></mml:msup>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mtext>Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>QSMHIMGP</mml:mtext></mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.0122</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.9684</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mtext>Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>LSSIIETD</mml:mtext></mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.6705</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.0121</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mtext>QSMHIMGP</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>LSSIIETD</mml:mtext></mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>06862</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.0096</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>QSMHIMGP</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>T</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>LSSIIETD</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.529</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>VIF</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>QSMHIMGP</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>VIF</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>LSSIIETD</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>1.890</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>loo</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.6998</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>loo</mml:mtext></mml:mrow></mml:msub>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>11</mml:mn>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>2.90</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>3</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>se</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>loo</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.1910</mml:mn>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>n</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>7</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.8624</mml:mn>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>F</mml:mi></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>=</mml:mo>
<mml:mn>24</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mn>4.41</mml:mn>
<mml:mo>×</mml:mo>
<mml:msup>
<mml:mrow>
<mml:mn>10</mml:mn></mml:mrow>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mn>3</mml:mn></mml:mrow></mml:msup>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mtext>(Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>QSMHIMGP)TS</mml:mtext></mml:mrow></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.7511</mml:mn>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.0516</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mtext>(Y</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>LSSIIETD)TS</mml:mtext></mml:mrow></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mo>-</mml:mo>
<mml:mn>0.3725</mml:mn></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.4106</mml:mn>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>;</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mi>R</mml:mi></mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:mrow>
<mml:mtext>QSMHIMGP</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mtext>LSSIIETD</mml:mtext></mml:mrow>
<mml:mo stretchy="false">)</mml:mo>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>=</mml:mo>
<mml:mn>0.2250</mml:mn>
<mml:mi> </mml:mi>
<mml:mo stretchy="false">(</mml:mo>
<mml:mi>p</mml:mi>
<mml:mrow>
<mml:mo>-</mml:mo>
<mml:mtext>value</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mn>0.6276</mml:mn>
<mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:semantics></mml:math></disp-formula>
<p>where Ŷ = ln(λ) estimated/predicted by <xref rid="FD2" ref-type="disp-formula">Equation (2)</xref>; <italic>R</italic><sup>2</sup> = determination coefficient; <italic>R</italic> = correlation coefficient; TR = training set; loo = leave-one-out analysis; TS = test set; <italic>R</italic><sup>2</sup><sub>Adj</sub> = adjusted determination coefficient; <italic>F</italic> = <italic>F</italic>-value (from ANOVA table); <italic>p</italic> = <italic>p</italic>-value associated to <italic>F</italic>-value; se = standard error of estimate; QSMHIMGP and LSSIIETD = SAPF descriptors; <italic>T</italic> = tolerance; VIP = Variance Inflation Factor. The abilities in estimation (training set) and prediction (test set) of the model from <xref rid="FD2" ref-type="disp-formula">Equation (2)</xref> are presented in <xref ref-type="fig" rid="f5-ijms-13-05207">Figure 5</xref>.</p>
<p>No statistically significant difference was identified when the goodness-of-fit in training and test sets were compared for the model presented in <xref rid="FD2" ref-type="disp-formula">Equation (2)</xref> (Z-statistics = 0.3590, p = 0.3598).</p>
<p>The search for the best fit between observed and linear regression model with two descriptors when the joined pool of SAPF and Dragon descriptors retrieved the same model as the one from <xref rid="FD2" ref-type="disp-formula">Equation (2)</xref>.</p></sec>
<sec>
<title>2.2.3. Models Comparison</title>
<p>Parameters defined in Material and Method section were used to compare the QSAR-Dragon model with QSAR-SAPF model. The residuals, defined as the difference between observed value and calculated value based on identified models, are presented in <xref ref-type="table" rid="t2-ijms-13-05207">Table 2</xref>. The values of the parameters used in models assessment analysis were presented in <xref ref-type="table" rid="t3-ijms-13-05207">Table 3</xref>.</p>
<p>Two compounds were randomly chosen as external set. The predictions that were closest to the observed values were obtained by QSAR-SAPF model (<xref rid="FD2" ref-type="disp-formula">Equation (2)</xref>; <xref ref-type="table" rid="t2-ijms-13-05207">Table 2</xref>).</p>
<p>Steiger’s test was used to identify if there are any statistically significant differences in terms of correlation coefficient between the models from <xref rid="FD1" ref-type="disp-formula">Equation (1)</xref> and the model from <xref rid="FD2" ref-type="disp-formula">Equation (2)</xref>. The lowest <italic>p</italic>-value was obtained when the correlation coefficient in training sets was compared (<italic>Z</italic>-statistics = −1.4511, <italic>p</italic> = 0.0734). This suggests that the models are close to being statistically different.</p></sec></sec></sec>
<sec sec-type="discussion">
<title>3. Discussion</title>
<p>The antimicrobial effects of chemical compounds on bacteria and fungi species were analyzed with regards to probability distribution function. In addition, a structure-activity relationship analysis able to describe the effect of chemical compounds on the entire population of bacteria and fungi species was successfully conducted.</p>
<p>The analysis of <xref ref-type="fig" rid="f1-ijms-13-05207">Figure 1</xref> revealed that for compounds series there is at least one sample with no fit (0.00 probability of agreement) for both Binomial and Negative Binomial distributions. Poisson distribution always had the probability of agreement above 0.05 (the hypothesis of Poisson distribution cannot be rejected at 5% significance level), being the only discrete distribution from investigated ones that showed this behavior. Furthermore, the p<sub>F-C-S</sub> value provided a global agreement of 12% for “Is Poisson the distribution of any compound on bacteria and fungi species?”, enough to assure us that the Poisson distribution is the true distribution of compounds’ antimicrobial activities on the studied bacteria and fungi species. The situation is somehow reversed for oils and mixtures; if the Poisson distribution is the only one not rejected for compounds, then the Negative Binomial distribution also cannot be rejected for oils and mixtures. A deeper investigation on factors influencing antimicrobial activities may reveal that the negative binomial distribution should be rejected for the whole data presented in <xref ref-type="table" rid="t4-ijms-13-05207">Table 4</xref>. The reason for this fact should be foundd in the distribution of the compounds series activities on a given bacteria (columns data in <xref ref-type="table" rid="t4-ijms-13-05207">Table 4</xref>).</p>
<p>Thus, it was already proven [<xref ref-type="bibr" rid="b34-ijms-13-05207">34</xref>] that Negative Binomial distribution occurs when both column and row data are shaped by Poisson distribution, which is not our case since only rows (a compound activity) are shaped by Poisson distribution (see <xref ref-type="fig" rid="f1-ijms-13-05207">Figure 1</xref>). Moreover, rows data from <xref ref-type="table" rid="t4-ijms-13-05207">Table 4</xref> are more likely to be Negative Binomial distributed, suggesting that at least two factors coexist in the compounds’ structure and influence their activity.</p>
<p>The analysis of distribution on bacteria and fungi species revealed the following:</p>
<list list-type="bullet">
<list-item>
<p>Compounds series:</p>
<list list-type="simple">
<list-item>
<p>○ Without any exception, the antimicrobial effects of all investigated compounds proved to follow Poisson distribution. Moreover, the hypothesis that any compound has a Poisson distribution of antimicrobial activity on bacteria population could not be rejected by F-C-S statistics (F-C-S statistics = 28.79, <italic>p</italic> = 0.12, <xref ref-type="fig" rid="f1-ijms-13-05207">Figure 1</xref>). Starting with this result, the Poisson λ parameter has been obtained to reflect what happen in the population, this parameter being an estimate for both central tendency and variability of antibacterial effects. The analysis of the obtained Poisson parameters showed to follow more likely a log-normal distribution and a logarithm transformation was applied on these values before quantitative structure-activity relationship search. This transformation was applied to avoid the presence of outliers and to assure the normality assumption needed for linear regression analysis [<xref ref-type="bibr" rid="b35-ijms-13-05207">35</xref>,<xref ref-type="bibr" rid="b36-ijms-13-05207">36</xref>].</p></list-item>
<list-item>
<p>○ Negative binomial distribution was rejected by 55% of compounds while Binomial distribution was rejected in 70% of cases. Negative binomial distribution, also known as the Pascal distribution or Pólya distribution, is a twin of Poisson distribution [<xref ref-type="bibr" rid="b37-ijms-13-05207">37</xref>,<xref ref-type="bibr" rid="b38-ijms-13-05207">38</xref>] widely used in analysis of count data [<xref ref-type="bibr" rid="b39-ijms-13-05207">39</xref>,<xref ref-type="bibr" rid="b40-ijms-13-05207">40</xref>]. The negative binomial distribution could be obtained by superposition of a continuous distribution over Poisson distribution (Fisher showed the convolution between Chi-Square and Poisson distribution [<xref ref-type="bibr" rid="b41-ijms-13-05207">41</xref>]). Other authors showed that the negative binomial distribution might derive from a convolution between the Gamma distribution (Chi-Square distribution is a particular case of Gamma distribution) and Poisson distribution [<xref ref-type="bibr" rid="b42-ijms-13-05207">42</xref>,<xref ref-type="bibr" rid="b43-ijms-13-05207">43</xref>]. Whenever the separation of factors is possible, it is also possible to separate the convolutions of distributions [<xref ref-type="bibr" rid="b44-ijms-13-05207">44</xref>], and this separation give the possibility to analyze separately the factors. The results presented by Jäntschi <italic>et al</italic>. [<xref ref-type="bibr" rid="b44-ijms-13-05207">44</xref>] sustained and/or are sustained by convolution of Poisson distribution with a continuous distribution in regards of both factors (bacteria and chemical compounds) in the expression of antimicrobial activity. The results showed that antimicrobial activity follow a negative binomial distribution under the influence of both factors (bacteria and chemical compound) and Poison distribution under the influence of the bacteria factor [<xref ref-type="bibr" rid="b44-ijms-13-05207">44</xref>]. Furthermore, the negative binomial distribution might be obtained by convolution of log-normal with Gamma distribution; although a high number of observations are needed (<italic>n</italic> &gt; 250) in order to statistically assure the difference between Log-normal and Gamma distributions [<xref ref-type="bibr" rid="b45-ijms-13-05207">45</xref>].</p></list-item></list></list-item>
<list-item>
<p>Oils and mixture series:</p>
<list list-type="simple">
<list-item>
<p>○ Negative Binomial distribution cannot be rejected for oils. Moreover, Negative Binomial distribution for oils had a higher likelihood than Poisson distribution (<italic>p</italic><sub>F-C-S</sub> for Negative Binomial: 0.56; <italic>p</italic><sub>F-C-S</sub> for Poisson: 0.23) while the Binomial distribution was rejected.</p></list-item>
<list-item>
<p>○ Negative Binomial distribution cannot be rejected for mixtures either. Moreover, Negative Binomial distribution for mixtures had also higher likelihood than Poisson distribution (<italic>p</italic><sub>F-C-S</sub> for Negative Binomial = 0.66; <italic>p</italic><sub>F-C-S</sub> for Poisson = 0.44) while the Binomial distribution was rejected.</p></list-item>
<list-item>
<p>○ The above-presented facts suggest that in the case of oils and mixtures, the factors of the antibacterial activity are not completely separated when oil/mixture name are taken as factor; this appears to be because the Negative Binomial distribution often occurs when a convolution/superposition of Poisson distributions characterize the observed data [<xref ref-type="bibr" rid="b46-ijms-13-05207">46</xref>].</p></list-item></list></list-item></list>
<p>Overall, any investigated compound, oil and mixture proved to have an antimicrobial effect that follows the Poisson distribution on studied bacteria and fungi species. The λ Poisson parameter, varied from 7.286 (Nerol acetate) to 28.250 (Eugenol) and represents the mean and variance of inhibition zone of compound/oil/mixture on investigated species. The obtained parameter of Poisson distribution proved able to characterize the overall antimicrobial activity (both mean and variance equals to Poisson parameter λ, <xref ref-type="table" rid="t1-ijms-13-05207">Table 1</xref>) of the compounds on the investigated bacteria population.</p>
<p>The structure-activity relationships between compounds’ structure and the overall antimicrobial effect on bacteria population, as well as the suitability of a pool of descriptors (SAPF and Dragon approaches) for the overall antimicrobial activity estimation and prediction were furthermore investigated.</p>
<p>QSAR model with two descriptors that proved abilities in estimation and prediction was identified for each approach after the split of compounds in training (13 compounds), test (7 compounds) and external (2 compounds) sets. Normal distribution of the observations was assured through natural logarithm transformation (<italic>p</italic> &gt; 0.05) to allow investigation of structure (of compounds)-activity (overall antimicrobial activity) relationships using multiple linear regression.</p>
<p>The analysis of QSAR-Dragon model revealed the following:</p>
<list list-type="bullet">
<list-item>
<p>One compound proved to be influential in the model (CID = 64939, <xref ref-type="fig" rid="f2-ijms-13-05207">Figure 2</xref>). This compound obtained the value of leverage for both Dragon descriptors higher than the accepted threshold (0.41). This compound, which belongs to the training set, was withdrawn, and a model based on 12 compounds in training set was obtained, <xref rid="FD1" ref-type="disp-formula">Equation(1)</xref>.</p></list-item>
<list-item>
<p>Two descriptors were able to describe the linear relation between overall antimicrobial activities of investigated compounds. One descriptor belongs to the walk and path counts and relates the conventional bond order ID number while the second descriptor relates the maximal autocorrelation of lag 3 divided by mass (R3m+). According with associated coefficients, the R3m+ had a higher contribution in the model compared with piID descriptor, but its contribution is to the significance level threshold (5.8% compared to imposed 5% significance level).</p></list-item>
<list-item>
<p>QSAR-Dragon model proved to be statistically significant (<italic>F</italic> = 39, <italic>p</italic> = 3.62 × 10<sup>−5</sup>). A low value of root mean square error was obtained in leave-one-out analysis (0.1276). The contribution of R3m+ descriptor to the model is questionable since the significance associated to its coefficient is very close to 0.05 but since it has a real contribution in the <italic>r</italic><sup>2</sup> value its significance of 5.8% was accepted. Moreover, the R3m+ proved not significantly correlate with Poisson parameter (<italic>r</italic> = −0.2410).</p></list-item>
<list-item>
<p>Multicollianearity is not present in the model since the tolerance value 0.1 &lt; <italic>T</italic> &lt; 1 and the variance inflation factors (VIF) &lt; 10 even if a significant correlation coefficient was obtained between Dragon descriptors.</p></list-item>
<list-item>
<p>The model proved its abilities in estimation (<italic>R</italic><sup>2</sup><sub>TR</sub> = 0.897) as well as in prediction (internal validity of the model in leave-one-out analysis, <italic>R</italic><sup>2</sup><sub>loo</sub> = 0.845 and external validation in test set <italic>R</italic><sup>2</sup><sub>TS</sub> = 0.652) with a difference in the goodness-of-fit from 0.052 (training <italic>vs</italic>. interval validation - leave one out analysis) to 0.245 (training <italic>vs</italic>. external validation-test set). However, the difference of 0.245 proved not statistically significant (<italic>p</italic> &gt; 0.05).</p></list-item>
<list-item>
<p>Unfortunately, external abilities in prediction were away from the expected abilities. The trend is significant far from the expected line-<xref ref-type="fig" rid="f3-ijms-13-05207">Figure 3</xref>.</p></list-item>
<list-item>
<p>The abilities in estimation (training set) proved not statistically significant from the abilities in prediction (test set) since a probability of 0.3598 was obtained in comparison.</p></list-item></list>
<p>The analysis of QSAR-SAPF model revealed the following:</p>
<list list-type="bullet">
<list-item>
<p>The values of SAPF descriptors associated to compounds proved that no compound had significant influence on the model (all leverage values where lower than threshold −0.41, <xref ref-type="fig" rid="f4-ijms-13-05207">Figure 4</xref>).</p></list-item>
<list-item>
<p>SAPF model proved statistically significant (<italic>F</italic> = 24, <italic>p</italic> = 1.48 × 10<sup>−4</sup>). The contribution of both descriptors to the model proved statistically significant (<italic>p</italic>-values associated to coefficients &lt;0.05).</p></list-item>
<list-item>
<p>According to descriptors from <xref rid="FD2" ref-type="disp-formula">Equation(2)</xref>, the global model of antibacterial activity is related to both molecular geometry and topology: one descriptor identified a relation between the geometry of compounds and the overall antimicrobial activity while the second descriptor identified a relation with compounds’ topology. Moreover, the atomic mass and electronegativity proved to be related to the overall antimicrobial activity by the same split ratio in the expression of the model descriptors.</p></list-item>
<list-item>
<p>Multicollianearity was not identified in the QSAR-SAPF model, even if a statistically significant correlation coefficient between descriptors exists (the tolerance values were higher than 0.1 and smaller than 1 and the variance inflation factors (VIF) had values smaller than 10).</p></list-item>
<list-item>
<p>The model proved its abilities in estimation (<italic>R</italic><sup>2</sup><sub>TR</sub> = 0.829) as well as in prediction (internal validity of the model in leave-one-out analysis, <italic>R</italic><sup>2</sup><sub>loo</sub> = 0.700 and external validation in test set <italic>R</italic><sup>2</sup><sub>TS</sub> = 0.862) with a difference in the goodness-of-fit from −0.034 (training <italic>vs</italic>. external validation - test set) to 0.129 (training <italic>vs</italic>. interval validation-leave one out analysis). Moreover, none of these differences were statistically significant (<italic>p</italic> &gt; 0.05).</p></list-item>
<list-item>
<p>External abilities in prediction proved to be close to expected abilities for QSAR-SAPF model (<xref ref-type="fig" rid="f5-ijms-13-05207">Figure 5</xref>).</p></list-item></list>
<p>The comparison of the identified models revealed the following:</p>
<list list-type="bullet">
<list-item>
<p>Dragon model has slightly better abilities in estimation compared to SAPF model, but these abilities proved not statistically significant. The determination coefficient obtained both in training set and in leave-one-out analysis was higher compared to SAPF model with 0.068 and respectively 0.145. Moreover, the abilities of prediction seem to be better for SAPF model compared to Dragon model (a difference of 0.211, not statistically significant <italic>p</italic> &lt; 0.05). This observation is also sustained by the lowest value of residuals in training set for Dragon model and in two compounds from training set and all compounds from test set for SAPF model (<xref ref-type="table" rid="t2-ijms-13-05207">Table 2</xref>).</p></list-item>
<list-item>
<p>The SAPF model systematically obtained smallest values of parameters presented in <xref ref-type="table" rid="t3-ijms-13-05207">Table 3</xref>: best explaining the variability in the observation; smallest typical errors; smallest standard error of prediction as well as smallest relative error of prediction. The highest difference is observed with regards to standard error of prediction that is almost 4 times higher for Dragon model compared to SAPF model.</p></list-item>
<list-item>
<p>The analysis of predictive power of the models demonstrated that SAPF model had significantly higher power of prediction (<xref ref-type="table" rid="t3-ijms-13-05207">Table 3</xref>). According to the obtained results, the <italic>Q</italic><sup>2</sup> values for Dragon model are smaller than 0.6, being considered unacceptable while all <italic>Q</italic><sup>2</sup> values for SAPF model are higher than 0.77. These results show that the Dragon model can be rejected from a statistical point of view, taking also into consideration that the relative error of prediction is almost 2 times higher compared to SAPF model.</p></list-item>
<list-item>
<p>Furthermore, the mean of residuals for training, external and external + test set proved not statistically different by zero when the SAPF model was analyzed. The Fisher’s predictive power identified statistically difference by zero of the residuals obtained by Dragon model in both training and test sets (9 compounds) (<italic>p</italic> &lt; 0.05, <xref ref-type="table" rid="t3-ijms-13-05207">Table 3</xref>).</p></list-item>
<list-item>
<p>The model with a higher concordance between observed and estimated/predicted could be considered the best model. The analysis of concordance correlation coefficient revealed a substantial strength of agreement for training set but a very poor agreement in test set for Dragon model. A moderate strength of agreement was obtained by SAPF model in both training and test sets (<xref ref-type="table" rid="t3-ijms-13-05207">Table 3</xref>).</p></list-item>
<list-item>
<p>Steiger’s test was not able to identify any statistically significant differences between Dragon and SAPF model regarding goodness-of-fit neither in training set nor in external set.</p></list-item></list>
<p>It can be concluded based on the facts presented above that the SAPF model is a reliable, valid (internally as well as externally) and stable model useful in characterization of overall antimicrobial activity on investigated compounds, both in terms of estimation and prediction.</p>
<p>The aim and objectives of the research have been achieved. The antimicrobial effect proved to follow the Poisson distribution and its parameter was furthermore used to identify those descriptors from Dragon and SAPF pools able to characterize the link between compounds and overall antimicrobial activity. Two newly developed models were found statistically valid. However, which of these QSAR models is better? The analysis of applicability domain of the models obtained in training sets was able to identify based on the values of descriptors one structurally influential compound in training set for Dragon model. According to the obtained results, one compound was withdrawn from further analysis in Dragon modeling. Dragon model was created based on 12 compounds in training set while the SAPF model was created based on 13 compounds in training set. Graphical representation of observed <italic>vs</italic>. calculated values based on identified models as well as the predictive power parameters showed that the best model to be applied on new chemicals is the SAPF model.</p></sec>
<sec>
<title>4. Experimental Section</title>
<sec>
<title>4.1. Compounds, Oils and Mixtures</title>
<p>The antimicrobial effects of twenty-two compounds, eight oils and two mixtures on gram-positive and -negative bacteria (<italic>Staphylococcus aureus</italic>, <italic>Enterococcus faecalis</italic>, <italic>Escherichia coli</italic>, <italic>Pseudomonas aeruginosa</italic>, <italic>Klebsiella pneumoniae</italic>, <italic>Proteus vulgaris</italic>, <italic>Salmonella</italic> sp.) and on one fungus (<italic>Candida albicans</italic>), expressed as inhibition zone (mm, Agar diffusion disc method [<xref ref-type="bibr" rid="b33-ijms-13-05207">33</xref>]), were included in the analysis (<xref ref-type="table" rid="t4-ijms-13-05207">Table 4</xref>). The PubChem database was used to retrieve the compounds structure and associated CIDs (Compound IDentification numbers); the data are presented in <xref ref-type="table" rid="t4-ijms-13-05207">Table 4</xref>.</p></sec>
<sec sec-type="methods">
<title>4.2. Distribution Analysis</title>
<p>Since all inhibition zones expressed in mm are integer numbers, a search for a discrete distribution was conducted having as alternatives Uniform, Binomial, Negative Binomial and Poisson distributions (other alternatives were excluded due to lack of fit with observed data). Kolmogorov-Smirnov (K-S) [<xref ref-type="bibr" rid="b47-ijms-13-05207">47</xref>] and Anderson-Darling (A-D) [<xref ref-type="bibr" rid="b48-ijms-13-05207">48</xref>] statistics were used to measure the departure between observations and a certain probability distribution function (PDF). Fisher’s method combining independent tests for significance (Fisher’s Chi-Square, abbreviated as F-C-S [<xref ref-type="bibr" rid="b49-ijms-13-05207">49</xref>]) was used to obtain a global probability of agreement between the distribution and the observed samples.</p>
<p>The whole pool (matrix) of data was prior analyzed and none of the above distribution functions give an acceptable (higher than 5%) agreement with the observations. This fact could be explained by the heterogeneity of the chemicals/oils/mixtures.</p>
<p>In order to obtain the PDF of antimicrobial effects of compounds, oils and mixtures on bacteria and fungus population, rows of experimental values were analyzed as independent samples. A number of five observations in sample qualified the sample for estimation of the distribution parameters, and the analysis was conducted using maximum likelihood estimation (MLE) [<xref ref-type="bibr" rid="b34-ijms-13-05207">34</xref>] procedure. The measure of the agreement was expressed using the probability of F-C-S test. Also the following hypothesis was tested: a certain PDF can be accepted for populations of all samples regardless of PDF parameters values. The identified PDF was further used to estimate the population parameter(s) for sample(s) without enough data (e.g., Citronellol, see <xref ref-type="table" rid="t4-ijms-13-05207">Table 4</xref>).</p>
<p>Population statistics of the identified PDF can be seen as an estimator of overall antimicrobial activity of the investigated compound on the bacteria and fungi population. The series of the population statistics for all investigated compounds was furthermore subject of a structure-activity relationship search intended to relate the overall antimicrobial effect with compounds’ structure.</p></sec>
<sec>
<title>4.3. Molecular Descriptors Calculation</title>
<p>The molecular modeling study was conducted at PM3 semi-empirical level of theory [<xref ref-type="bibr" rid="b50-ijms-13-05207">50</xref>] on chemical compounds series.</p>
<p>A series of home-made programs were used to perform the following tasks: ■ automate transformation the *.sdf or *.mol files as *.hin files; ■ prepare the compounds for modeling (run HyperChem v.8.0 [<xref ref-type="bibr" rid="b51-ijms-13-05207">51</xref>] with HyperChem scripts in order to obtain molecular models) [<xref ref-type="bibr" rid="b52-ijms-13-05207">52</xref>]; ■ calculate the molecular descriptors (SAPF approach) for all compounds (calculate all descriptors; select a relevant subset of descriptors); ■ split the set randomly in training (for model development, ~2/3 compounds in training set) and two test sets (for model validation); ■ search for multiple linear regression (search for two descriptors linear models) in training set; ■ validate the model obtained in training set on test sets.</p>
<p>The molecular descriptors for the chemical compounds were calculated using a home-made software that implemented Structural Atomic Property Family [<xref ref-type="bibr" rid="b53-ijms-13-05207">53</xref>,<xref ref-type="bibr" rid="b54-ijms-13-05207">54</xref>] (SAPF approach, methodology of calculation depicted in <xref ref-type="fig" rid="f6-ijms-13-05207">Figure 6</xref>) and the Dragon software [<xref ref-type="bibr" rid="b55-ijms-13-05207">55</xref>] (all Dragon descriptors).</p>
<p>The SAPF approach is a method that cumulates atomic properties at the molecular level. The approach used a localization of the molecular center using a metric, an atomic property (C = cardinality (number of heavy atoms), H = Hydrogen bonds (number of Hydrogen atoms), M = atomic mass (relative units), E = electronegativity (on Pauling scale [<xref ref-type="bibr" rid="b56-ijms-13-05207">56</xref>]), and A = electron affinity), a power of a distance as well as of an atomic property in the expression of descriptor in regard to atomic effect, a modality of accumulation of atomic properties at the molecular level, and a linearization operation (see <xref ref-type="fig" rid="f6-ijms-13-05207">Figure 6</xref>).</p></sec>
<sec>
<title>4.4. Identification and Characterization of Linear Regression Models</title>
<p>Linear regression models (additive models) were used for search of structure-activity relationship between overall antimicrobial effects as dependent variable and structural descriptors (from SAPF approach and Dragon software) as independent variables.</p>
<p>Kolmogorov-Smirnov, Anderson-Darling, and Chi-Square statistics [<xref ref-type="bibr" rid="b57-ijms-13-05207">57</xref>] as well as Grubbs test for outliers [<xref ref-type="bibr" rid="b58-ijms-13-05207">58</xref>] were used to decide which transformation should be applied to assure the normality of observations (in our case the parameter of the probability distribution function) [<xref ref-type="bibr" rid="b50-ijms-13-05207">50</xref>,<xref ref-type="bibr" rid="b51-ijms-13-05207">51</xref>].</p>
<p>Regression analysis was employed to select the candidate models and the following criteria were used: highest goodness-of-fit, smallest number of descriptors and absence of collinearity between descriptors [<xref ref-type="bibr" rid="b37-ijms-13-05207">37</xref>,<xref ref-type="bibr" rid="b38-ijms-13-05207">38</xref>].</p>
<p>A complete randomization approach was applied to split of compounds in training (~2/3 compounds, 13 compounds), test (7 compounds: geranyl acetate, geranyl butyrate, geranyl tiglate, neral, neryl butyrate, neryl propanoate, citronellyl acetate, citronellyl propionate, and eugenol) and external (2 compounds: citronellyl acetate and neryl propanoate) sets.</p>
<p>Training set was used to identify the model, test set to validate the model and external set to assess the model external predictive power. The predictive power of identified models is sustained by an applied strategy; the models were not obtained on measured data which are subject of measurements errors. Instead, the QSAR models were constructed with population estimates (represented by Poisson parameter) that are less affected by errors. Thus, the QSAR models reflect the behavior of the compound on bacteria and fungi not the behavior of compound on a certain bacteria/fungus.</p>
<p>In order to assess the applicability domain of the obtained models, two approaches were involved on the full model with identified descriptors in the training sets [<xref ref-type="bibr" rid="b59-ijms-13-05207">59</xref>]: leverage and identification of response outliers. A standardized measure of the distance between the descriptor values for the i<sup>th</sup> observation and the means of the descriptor-values for all observations was computed to identify the leverage in descriptors (leverage value, <italic>h</italic><sub>i</sub>). Whenever <italic>h</italic><sub>i</sub> &gt; 3·(<italic>k</italic> + 1)/<italic>n</italic> (where <italic>k</italic> = number of independent variables in the model, <italic>n</italic> = sample size) compound was considered influential in the model [<xref ref-type="bibr" rid="b60-ijms-13-05207">60</xref>] and was excluded from further analysis of the model. The response outliers were defined as compounds with absolute standardized residuals higher than 2.5. Leverage values (<italic>h</italic><sub>i</sub>) <italic>vs</italic>. standardized residuals for compounds in training set was plotted to identify response outliers as well as independent variables with leverage values higher than threshold value (see <xref ref-type="fig" rid="f2-ijms-13-05207">Figures 2</xref> and <xref ref-type="fig" rid="f3-ijms-13-05207">3</xref>).</p>
<p>The model diagnostics was carried out using statistical parameters presented in <xref ref-type="table" rid="t5-ijms-13-05207">Table 5</xref>.</p>
<p>The comparison of the models was performed using Steiger’s <italic>Z</italic> (association assumption between data) and Fisher’s <italic>Z</italic> (independence assumption of the data) statistics [<xref ref-type="bibr" rid="b68-ijms-13-05207">68</xref>].</p></sec></sec>
<sec sec-type="conclusions">
<title>5. Conclusions</title>
<p>Antimicrobial activity of investigated oils, compounds and mixtures on the series of bacteria and fungi were shown to follow the Poisson distribution.</p>
<p>Two newly developed QSAR models, with Dragon and with SAPF descriptors, were found to be statistically significant internally. Even if the Dragon model proved to have higher goodness-of-fit, the model proved unacceptable in terms of prediction power. The SAPF model proved acceptable, with its prediction power being reliable, valid and stable in external validation analysis, with good overall performances in test set and test and external sets.</p></sec>
<sec sec-type="supplementary-material">
<title>Supplementary Information</title>
<supplementary-material id="s1-ijms-13-05207" content-type="local-data">
<media xlink:href="ijms-13-05207-s001.pdf" mimetype="application" mime-subtype="pdf"/></supplementary-material></sec></body>
<back>
<ack>
<title>Acknowledgments</title>
<p>The study was supported by European Social Fund, Human Resources Development Operational Program, project number 89/1.5/62371 through a fellowship for L. Jäntschi. The funder had no role in study design, data collection, analysis and interpretation of data, in the writing of the report or in the decision to submit the article for publication.</p></ack>
<ref-list>
<title>References</title>
<ref id="b1-ijms-13-05207"><label>1</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sengul</surname><given-names>M.</given-names></name><name><surname>Ercisli</surname><given-names>S.</given-names></name><name><surname>Yildiz</surname><given-names>H.</given-names></name><name><surname>Gungor</surname><given-names>N.</given-names></name><name><surname>Kavaz</surname><given-names>A.</given-names></name><name><surname>Cetin</surname><given-names>B.</given-names></name></person-group><article-title>Antioxidant, antimicrobial activity and total phenolic content within the aerial parts of <italic>Artemisia absinthum</italic>, <italic>Artemisia santonicum</italic> and <italic>Saponaria officinalis</italic></article-title><source>Iran. J. Pharm. Res</source><year>2011</year><volume>10</volume><fpage>49</fpage><lpage>55</lpage></citation></ref>
<ref id="b2-ijms-13-05207"><label>2</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Martini</surname><given-names>M.G.</given-names></name><name><surname>Bizzo</surname><given-names>H.R.</given-names></name><name><surname>Moreira</surname><given-names>D.D.</given-names></name><name><surname>Neufeld</surname><given-names>P.M.</given-names></name><name><surname>Miranda</surname><given-names>S.N.</given-names></name><name><surname>Alviano</surname><given-names>C.S.</given-names></name><name><surname>Alviano</surname><given-names>D.S.</given-names></name><name><surname>Leitao</surname><given-names>S.G.</given-names></name></person-group><article-title>Chemical composition and antimicrobial activities of the essential oils from <italic>Ocimum selloi</italic> and hesperozygis myrtoides</article-title><source>Nat. Prod. Commun</source><year>2011</year><volume>6</volume><fpage>1027</fpage><lpage>1030</lpage><pub-id pub-id-type="pmid">21834250</pub-id></citation></ref>
<ref id="b3-ijms-13-05207"><label>3</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Serrano</surname><given-names>C.</given-names></name><name><surname>Matos</surname><given-names>O.</given-names></name><name><surname>Teixeira</surname><given-names>B.</given-names></name><name><surname>Ramos</surname><given-names>C.</given-names></name><name><surname>Neng</surname><given-names>N.</given-names></name><name><surname>Nogueira</surname><given-names>J.</given-names></name><name><surname>Nunes</surname><given-names>M.L.</given-names></name><name><surname>Marques</surname><given-names>A.</given-names></name></person-group><article-title>Antioxidant and antimicrobial activity of <italic>Satureja montana</italic> L. extracts</article-title><source>J. Sci. Food Agric</source><year>2011</year><volume>91</volume><fpage>1554</fpage><lpage>1560</lpage><pub-id pub-id-type="doi">10.1002/jsfa.4347</pub-id><pub-id pub-id-type="pmid">21445865</pub-id></citation></ref>
<ref id="b4-ijms-13-05207"><label>4</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Mothana</surname><given-names>R.A.</given-names></name><name><surname>Alsaid</surname><given-names>M.S.</given-names></name><name><surname>Al-Musayeib</surname><given-names>N.M.</given-names></name></person-group><article-title>Phytochemical analysis and <italic>in vitro</italic> antimicrobial and free-radical-scavenging activities of the essential oils from <italic>Euryops arabicus</italic> and <italic>Laggera decurrens.</italic></article-title><source>Molecules</source><year>2011</year><volume>16</volume><fpage>5149</fpage><lpage>5158</lpage><pub-id pub-id-type="doi">10.3390/molecules16065149</pub-id><pub-id pub-id-type="pmid">21694678</pub-id></citation></ref>
<ref id="b5-ijms-13-05207"><label>5</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Quintans</surname><given-names>L.</given-names></name><name><surname>da Rocha</surname><given-names>R.F.</given-names></name><name><surname>Caregnato</surname><given-names>F.F.</given-names></name><name><surname>Moreira</surname><given-names>J.C.F.</given-names></name><name><surname>da Silva</surname><given-names>F.A.</given-names></name><name><surname>Araujo</surname><given-names>A.A.D.</given-names></name><name><surname>dos Santos</surname><given-names>J.P.A.</given-names></name><name><surname>Melo</surname><given-names>M.S.</given-names></name><name><surname>de Sousa</surname><given-names>D.P.</given-names></name><name><surname>Bonjardim</surname><given-names>L.R.</given-names></name><name><surname>Gelain</surname><given-names>D.P.</given-names></name></person-group><article-title>Antinociceptive action and redox properties of citronellal, an essential oil present in lemongrass</article-title><source>J. Med. Food</source><year>2011</year><volume>14</volume><fpage>630</fpage><lpage>639</lpage><pub-id pub-id-type="doi">10.1089/jmf.2010.0125</pub-id><pub-id pub-id-type="pmid">21480794</pub-id></citation></ref>
<ref id="b6-ijms-13-05207"><label>6</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Ito</surname><given-names>K.</given-names></name><name><surname>Ito</surname><given-names>M.</given-names></name></person-group><article-title>Sedative effects of vapor inhalation of the essential oil of <italic>Microtoena patchoulii</italic> and its related compounds</article-title><source>J. Nat. Med</source><year>2011</year><volume>65</volume><fpage>336</fpage><lpage>343</lpage><pub-id pub-id-type="doi">10.1007/s11418-010-0502-x</pub-id><pub-id pub-id-type="pmid">21287406</pub-id></citation></ref>
<ref id="b7-ijms-13-05207"><label>7</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Garozzo</surname><given-names>A.</given-names></name><name><surname>Timpanaro</surname><given-names>R.</given-names></name><name><surname>Stivala</surname><given-names>A.</given-names></name><name><surname>Bisignano</surname><given-names>G.</given-names></name><name><surname>Castro</surname><given-names>A.</given-names></name></person-group><article-title>Activity of <italic>Melaleuca alternifolia</italic> (tea tree) oil on influenza virus A/PR/8: Study on the mechanism of action</article-title><source>Antivir. Res</source><year>2011</year><volume>89</volume><fpage>83</fpage><lpage>88</lpage><pub-id pub-id-type="doi">10.1016/j.antiviral.2010.11.010</pub-id><pub-id pub-id-type="pmid">21095205</pub-id></citation></ref>
<ref id="b8-ijms-13-05207"><label>8</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pauli</surname><given-names>A.</given-names></name></person-group><article-title>Anticandidal low molecular compounds from higher plants with special reference to compounds from essential oils</article-title><source>Med. Res. Rev</source><year>2011</year><volume>26</volume><fpage>223</fpage><lpage>268</lpage></citation></ref>
<ref id="b9-ijms-13-05207"><label>9</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jaffri</surname><given-names>J.M.</given-names></name><name><surname>Mohamed</surname><given-names>S.</given-names></name><name><surname>Ahmad</surname><given-names>I.N.</given-names></name><name><surname>Mustapha</surname><given-names>N.M.</given-names></name><name><surname>Manap</surname><given-names>Y.A.</given-names></name><name><surname>Rohimi</surname><given-names>N.</given-names></name></person-group><article-title>Effects of catechin-rich oil palm leaf extract on normal and hypertensive rats’ kidney and liver</article-title><source>Food Chem</source><year>2011</year><volume>128</volume><fpage>433</fpage><lpage>441</lpage><pub-id pub-id-type="doi">10.1016/j.foodchem.2011.03.050</pub-id></citation></ref>
<ref id="b10-ijms-13-05207"><label>10</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yu</surname><given-names>F.</given-names></name><name><surname>Gao</surname><given-names>J.</given-names></name><name><surname>Zeng</surname><given-names>Y.</given-names></name><name><surname>Liu</surname><given-names>C.X.</given-names></name></person-group><article-title>Effects of adlay seed oil on blood lipids and antioxidant capacity in hyperlipidemic rats</article-title><source>J. Sci. Food Agric</source><year>2011</year><volume>91</volume><fpage>1843</fpage><lpage>1848</lpage><pub-id pub-id-type="doi">10.1002/jsfa.4393</pub-id><pub-id pub-id-type="pmid">21452173</pub-id></citation></ref>
<ref id="b11-ijms-13-05207"><label>11</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Zhang</surname><given-names>Y.B.</given-names></name><name><surname>Guo</surname><given-names>J.</given-names></name><name><surname>Dong</surname><given-names>H.Y.</given-names></name><name><surname>Zhao</surname><given-names>X.M.</given-names></name><name><surname>Zhou</surname><given-names>L.</given-names></name><name><surname>Li</surname><given-names>X.Y.</given-names></name><name><surname>Liu</surname><given-names>J.C.</given-names></name><name><surname>Niu</surname><given-names>Y.C.</given-names></name></person-group><article-title>Hydroxysafflor yellow a protects against chronic carbon tetrachloride-induced liver fibrosis</article-title><source>Eur. J. Pharmacol</source><year>2011</year><volume>660</volume><fpage>438</fpage><lpage>444</lpage><pub-id pub-id-type="doi">10.1016/j.ejphar.2011.04.015</pub-id><pub-id pub-id-type="pmid">21536026</pub-id></citation></ref>
<ref id="b12-ijms-13-05207"><label>12</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Yordi</surname><given-names>E.G.</given-names></name><name><surname>Molina Pérez</surname><given-names>E.</given-names></name><name><surname>Joao Matos</surname><given-names>M.</given-names></name><name><surname>Uriarte Villares</surname><given-names>E.</given-names></name></person-group><article-title>Structural alerts for predicting clastogenic activity of pro-oxidant flavonoid compounds: Quantitative structure-activity relationship study</article-title><source>J. Biomol. Screen</source><year>2012</year><volume>17</volume><fpage>216</fpage><lpage>224</lpage><pub-id pub-id-type="doi">10.1177/1087057111421623</pub-id><pub-id pub-id-type="pmid">21940715</pub-id></citation></ref>
<ref id="b13-ijms-13-05207"><label>13</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Rishton</surname><given-names>G.M.</given-names></name></person-group><article-title>Natural products as a robust source of new drugs and drug leads: Past successes and present day issues</article-title><source>Am. J. Cardiol</source><year>2008</year><volume>101</volume><fpage>43D</fpage><lpage>49D</lpage><pub-id pub-id-type="pmid">18474274</pub-id></citation></ref>
<ref id="b14-ijms-13-05207"><label>14</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Dunn</surname><given-names>W.J.</given-names><suffix>III</suffix></name></person-group><article-title>Quantitative structure-activity relationships (QSAR)</article-title><source>Chemom. Intell. Lab</source><year>1989</year><volume>6</volume><fpage>181</fpage><lpage>190</lpage><pub-id pub-id-type="doi">10.1016/0169-7439(89)80083-8</pub-id></citation></ref>
<ref id="b15-ijms-13-05207"><label>15</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Khan</surname><given-names>F.</given-names></name><name><surname>Yadav</surname><given-names>D.K.</given-names></name><name><surname>Maurya</surname><given-names>A.</given-names></name><name><surname>Srivastava</surname><given-names>S.K.</given-names></name></person-group><article-title>Modern methods &amp; web resources in drug design &amp; discovery</article-title><source>Lett. Drug Des. Discov</source><year>2011</year><volume>8</volume><fpage>469</fpage><lpage>490</lpage><pub-id pub-id-type="doi">10.2174/157018011795514249</pub-id></citation></ref>
<ref id="b16-ijms-13-05207"><label>16</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Vedani</surname><given-names>A.</given-names></name><name><surname>Dobler</surname><given-names>M.</given-names></name><name><surname>Spreafico</surname><given-names>M.</given-names></name><name><surname>Peristera</surname><given-names>O.</given-names></name><name><surname>Smiesko</surname><given-names>M.</given-names></name></person-group><article-title>VirtualToxLab—<italic>in silico</italic> prediction of the toxic potential of drugs and environmental chemicals: Evaluation status and internet access protocol</article-title><source>Altex</source><year>2007</year><volume>24</volume><fpage>153</fpage><lpage>161</lpage><pub-id pub-id-type="pmid">17891320</pub-id></citation></ref>
<ref id="b17-ijms-13-05207"><label>17</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Castro</surname><given-names>E.A.</given-names></name></person-group><source>QSPR-QSAR Studies on Desired Properties for Drug Design</source><publisher-name>Research Signpost</publisher-name><publisher-loc>Kerala, India</publisher-loc><year>2010</year></citation></ref>
<ref id="b18-ijms-13-05207"><label>18</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Gasteiger</surname><given-names>J.</given-names></name><name><surname>Engel</surname><given-names>T</given-names></name></person-group><source>Chemoinformatics: A Textbook</source><edition>1st ed</edition><publisher-name>Wiley-VCH</publisher-name><publisher-loc>Weinheim, Germany</publisher-loc><year>2003</year></citation></ref>
<ref id="b19-ijms-13-05207"><label>19</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Alvarez</surname><given-names>J.</given-names></name><name><surname>Shoichet</surname><given-names>B</given-names></name></person-group><source>Virtual Screening in Drug Discovery</source><edition>1st ed</edition><publisher-name>CRC Press</publisher-name><publisher-loc>Boca Raton, FL, USA</publisher-loc><year>2005</year></citation></ref>
<ref id="b20-ijms-13-05207"><label>20</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schuster</surname><given-names>D.</given-names></name><name><surname>Wolber</surname><given-names>G.</given-names></name></person-group><article-title>Identification of bioactive natural products by pharmacophore-based virtual screening</article-title><source>Curr. Pharm. Des</source><year>2010</year><volume>16</volume><fpage>1666</fpage><lpage>1681</lpage><pub-id pub-id-type="doi">10.2174/138161210791164072</pub-id><pub-id pub-id-type="pmid">20222852</pub-id></citation></ref>
<ref id="b21-ijms-13-05207"><label>21</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bartalis</surname><given-names>J.</given-names></name><name><surname>Halaweish</surname><given-names>F.T.</given-names></name></person-group><article-title><italic>In vitro</italic> and QSAR studies of cucurbitacins on HepG2 and HSC-T6 liver cell lines</article-title><source>Bioorg. Med. Chem</source><year>2011</year><volume>19</volume><fpage>2757</fpage><lpage>2766</lpage><pub-id pub-id-type="doi">10.1016/j.bmc.2011.01.037</pub-id><pub-id pub-id-type="pmid">21459003</pub-id></citation></ref>
<ref id="b22-ijms-13-05207"><label>22</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolboacă</surname><given-names>S.D.</given-names></name><name><surname>Pică</surname><given-names>E.M.</given-names></name><name><surname>Cimpoiu</surname><given-names>C.V.</given-names></name><name><surname>Jäntschi</surname><given-names>L.</given-names></name></person-group><article-title>Statistical assessment of solvent mixture models used for separation of biological active compounds</article-title><source>Molecules</source><year>2008</year><volume>13</volume><fpage>1617</fpage><lpage>1639</lpage><pub-id pub-id-type="doi">10.3390/molecules13081617</pub-id><pub-id pub-id-type="pmid">18794776</pub-id></citation></ref>
<ref id="b23-ijms-13-05207"><label>23</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>González-Díaz</surname><given-names>H.</given-names></name><name><surname>Torres-Gomez</surname><given-names>L.A.</given-names></name><name><surname>Guevara</surname><given-names>Y.</given-names></name><name><surname>Almeida</surname><given-names>M.S.</given-names></name><name><surname>Molina</surname><given-names>R.</given-names></name><name><surname>Castanedo</surname><given-names>N.</given-names></name><name><surname>Castañedo</surname><given-names>N.</given-names></name><name><surname>Santana</surname><given-names>L.</given-names></name><name><surname>Uriarte</surname><given-names>E.</given-names></name></person-group><article-title>Markovian chemicals “<italic>in silico</italic>” design (MARCH-INSIDE), a promising approach for computer-aided molecular design III: 2.5D indices for the discovery of antibacterials</article-title><source>J. Mol. Model</source><year>2005</year><volume>11</volume><fpage>116</fpage><lpage>123</lpage><pub-id pub-id-type="doi">10.1007/s00894-004-0228-3</pub-id><pub-id pub-id-type="pmid">15723208</pub-id></citation></ref>
<ref id="b24-ijms-13-05207"><label>24</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Diaz</surname><given-names>H.</given-names></name><name><surname>Prado-Prado</surname><given-names>F.</given-names></name><name><surname>Ubeira</surname><given-names>F.M.</given-names></name></person-group><article-title>Predicting antimicrobial drugs and targets with the MARCH-INSIDE approach</article-title><source>Curr. Top. Med. Chem</source><year>2008</year><volume>8</volume><fpage>1676</fpage><lpage>90</lpage><pub-id pub-id-type="doi">10.2174/156802608786786543</pub-id><pub-id pub-id-type="pmid">19075774</pub-id></citation></ref>
<ref id="b25-ijms-13-05207"><label>25</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Molina</surname><given-names>E.</given-names></name><name><surname>Díaz</surname><given-names>H.G.</given-names></name><name><surname>González</surname><given-names>M.P.</given-names></name><name><surname>Rodríguez</surname><given-names>E.</given-names></name><name><surname>Uriarte</surname><given-names>E.</given-names></name></person-group><article-title>Designing antibacterial compounds through a topological substructural approach</article-title><source>J. Chem. Inf. Comput. Sci</source><year>2004</year><volume>44</volume><fpage>515</fpage><lpage>521</lpage><pub-id pub-id-type="doi">10.1021/ci0342019</pub-id><pub-id pub-id-type="pmid">15032531</pub-id></citation></ref>
<ref id="b26-ijms-13-05207"><label>26</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>González-Díaz</surname><given-names>H.</given-names></name><name><surname>Romaris</surname><given-names>F.</given-names></name><name><surname>Duardo-Sanchez</surname><given-names>A.</given-names></name><name><surname>Pérez-Montoto</surname><given-names>L.G.</given-names></name><name><surname>Prado-Prado</surname><given-names>F.</given-names></name><name><surname>Patlewicz</surname><given-names>G.</given-names></name><name><surname>Ubeira</surname><given-names>F.M.</given-names></name></person-group><article-title>Predicting drugs and proteins in parasite infections with topological indices of complex networks: Theoretical backgrounds, applications and legal issues</article-title><source>Curr. Pharm. Des</source><year>2010</year><volume>16</volume><fpage>2737</fpage><lpage>2764</lpage><pub-id pub-id-type="doi">10.2174/138161210792389234</pub-id><pub-id pub-id-type="pmid">20642428</pub-id></citation></ref>
<ref id="b27-ijms-13-05207"><label>27</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prado-Prado</surname><given-names>F.J.</given-names></name><name><surname>Gonzalez-Diaz</surname><given-names>H.</given-names></name><name><surname>Santana</surname><given-names>L.</given-names></name><name><surname>Uriarte</surname><given-names>E.</given-names></name></person-group><article-title>Unified QSAR approach to antimicrobials. Part 2: Predicting activity against more than 90 different species in order to halt antibacterial resistance</article-title><source>Bioorg. Med. Chem</source><year>2007</year><volume>15</volume><fpage>897</fpage><lpage>902</lpage><pub-id pub-id-type="doi">10.1016/j.bmc.2006.10.039</pub-id><pub-id pub-id-type="pmid">17084086</pub-id></citation></ref>
<ref id="b28-ijms-13-05207"><label>28</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prado-Prado</surname><given-names>F.J.</given-names></name><name><surname>Uriarte</surname><given-names>E.</given-names></name><name><surname>Borges</surname><given-names>F.</given-names></name><name><surname>González-Díaz</surname><given-names>H.</given-names></name></person-group><article-title>Multi-target spectral moments for QSAR and complex networks study of antibacterial drugs</article-title><source>Eur. J. Med. Chem</source><year>2009</year><volume>44</volume><fpage>4516</fpage><lpage>4521</lpage><pub-id pub-id-type="doi">10.1016/j.ejmech.2009.06.018</pub-id><pub-id pub-id-type="pmid">19631422</pub-id></citation></ref>
<ref id="b29-ijms-13-05207"><label>29</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Diaz</surname><given-names>H.</given-names></name><name><surname>Prado-Prado</surname><given-names>F.J.</given-names></name></person-group><article-title>Unified QSAR and network-based computational chemistry approach to antimicrobials, part 1: Multispecies activity models for antifungals</article-title><source>J. Comput. Chem</source><year>2008</year><volume>29</volume><fpage>656</fpage><lpage>667</lpage><pub-id pub-id-type="doi">10.1002/jcc.20826</pub-id><pub-id pub-id-type="pmid">17999385</pub-id></citation></ref>
<ref id="b30-ijms-13-05207"><label>30</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prado-Prado</surname><given-names>F.J.</given-names></name><name><surname>Ubeira</surname><given-names>F.M.</given-names></name><name><surname>Borges</surname><given-names>F.</given-names></name><name><surname>Gonzalez-Diaz</surname><given-names>H.</given-names></name></person-group><article-title>Unified QSAR &amp; network-based computational chemistry approach to antimicrobials. II. Multiple distance and triadic census analysis of antiparasitic drugs complex networks</article-title><source>J. Comput. Chem</source><year>2010</year><volume>31</volume><fpage>164</fpage><lpage>173</lpage><pub-id pub-id-type="doi">10.1002/jcc.21292</pub-id><pub-id pub-id-type="pmid">19421992</pub-id></citation></ref>
<ref id="b31-ijms-13-05207"><label>31</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Prado-Prado</surname><given-names>F.J.</given-names></name><name><surname>Martinez de la Vega</surname><given-names>O.</given-names></name><name><surname>Uriarte</surname><given-names>E.</given-names></name><name><surname>Ubeira</surname><given-names>F.M.</given-names></name><name><surname>Chou</surname><given-names>K.C.</given-names></name><name><surname>Gonzalez-Diaz</surname><given-names>H.</given-names></name></person-group><article-title>Unified QSAR approach to antimicrobials. 4. Multi-target QSAR modeling and comparative multi-distance study of the giant components of antiviral drug-drug complex networks</article-title><source>Bioorg. Med. Chem</source><year>2009</year><volume>17</volume><fpage>569</fpage><lpage>575</lpage><pub-id pub-id-type="doi">10.1016/j.bmc.2008.11.075</pub-id><pub-id pub-id-type="pmid">19112024</pub-id></citation></ref>
<ref id="b32-ijms-13-05207"><label>32</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Gonzalez-Diaz</surname><given-names>H.</given-names></name><name><surname>Prado-Prado</surname><given-names>F.</given-names></name><name><surname>Sobarzo-Sanchez</surname><given-names>E.</given-names></name><name><surname>Haddad</surname><given-names>M.</given-names></name><name><surname>Maurel Chevalley</surname><given-names>S.</given-names></name><name><surname>Valentin</surname><given-names>A.</given-names></name><name><surname>Quetin-Leclercq</surname><given-names>J.</given-names></name><name><surname>Dea-Ayuela</surname><given-names>M.A.</given-names></name><name><surname>Teresa Gomez-Muños</surname><given-names>M.</given-names></name><name><surname>Munteanu</surname><given-names>C.R.</given-names></name></person-group><article-title>NL MIND-BEST: A web server for ligands and proteins discovery-theoretic-experimental study of proteins of Giardia lamblia and new compounds active against <italic>Plasmodium falciparum</italic></article-title><source>J. Theor. Biol</source><year>2011</year><volume>276</volume><fpage>229</fpage><lpage>249</lpage><pub-id pub-id-type="doi">10.1016/j.jtbi.2011.01.010</pub-id><pub-id pub-id-type="pmid">21277861</pub-id></citation></ref>
<ref id="b33-ijms-13-05207"><label>33</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jirovetz</surname><given-names>L.</given-names></name><name><surname>Eller</surname><given-names>G.</given-names></name><name><surname>Buchbauer</surname><given-names>G.</given-names></name><name><surname>Schmidt</surname><given-names>E.</given-names></name><name><surname>Denkova</surname><given-names>Z.</given-names></name><name><surname>Stoyanova</surname><given-names>A.S.</given-names></name><name><surname>Nikolova</surname><given-names>R.</given-names></name><name><surname>Geissler</surname><given-names>M.</given-names></name></person-group><article-title>Chemical composition, antimicrobial activities and odor descriptions of some essential oils with characteristic</article-title><source>Recent Res. Dev. Agron. Hortic</source><year>2006</year><volume>2</volume><fpage>1</fpage><lpage>12</lpage></citation></ref>
<ref id="b34-ijms-13-05207"><label>34</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname><given-names>R.A.</given-names></name></person-group><article-title>On an absolute criterion for fitting frequency curves</article-title><source>Messenger Math</source><year>1912</year><volume>41</volume><fpage>155</fpage><lpage>160</lpage></citation></ref>
<ref id="b35-ijms-13-05207"><label>35</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Sacks</surname><given-names>J.</given-names></name><name><surname>Ylvisaker</surname><given-names>D.</given-names></name></person-group><article-title>Designs for regression problems with correlated errors III</article-title><source>Ann. Math. Stat</source><year>1970</year><volume>41</volume><fpage>2057</fpage><lpage>2074</lpage><pub-id pub-id-type="doi">10.1214/aoms/1177696705</pub-id></citation></ref>
<ref id="b36-ijms-13-05207"><label>36</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jarque</surname><given-names>C.M.</given-names></name><name><surname>Bera</surname><given-names>A.K.</given-names></name></person-group><article-title>A test for normality of observations and regression residuals</article-title><source>Int. Stat. Rev</source><year>1987</year><volume>55</volume><fpage>163</fpage><lpage>172</lpage><pub-id pub-id-type="doi">10.2307/1403192</pub-id></citation></ref>
<ref id="b37-ijms-13-05207"><label>37</label><citation citation-type="web"><person-group person-group-type="author"><name><surname>LeRoy</surname><given-names>J.S.</given-names></name></person-group><article-title>Negative Binomial and Poisson Distributions Compared</article-title><source>Proceedings of the Casualty Actuarial Society</source><publisher-name>Casualty Actuarial Society</publisher-name><publisher-loc>Arlington, VA, USA</publisher-loc><year>1960</year><volume>XLVII</volume><issue>87 &amp; 88</issue><fpage>20</fpage><lpage>24</lpage><comment>Available online: <ext-link xlink:href="http://www.casact.org/pubs/proceed/proceed60/60020.pdf" ext-link-type="uri">http://www.casact.org/pubs/proceed/proceed60/60020.pdf</ext-link></comment><access-date>accessed on 6 August 2011</access-date></citation></ref>
<ref id="b38-ijms-13-05207"><label>38</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Furman</surname><given-names>E.</given-names></name></person-group><article-title>On the convolution of the negative binomial random variables</article-title><source>Stat. Probab. Lett</source><year>2007</year><volume>77</volume><fpage>169</fpage><lpage>172</lpage><pub-id pub-id-type="doi">10.1016/j.spl.2006.06.007</pub-id></citation></ref>
<ref id="b39-ijms-13-05207"><label>39</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Jones</surname><given-names>A</given-names></name></person-group><article-title>Health Econometrics</article-title><source>Handbook of Health Economics</source><person-group person-group-type="editor"><name><surname>Culyer</surname><given-names>A.</given-names></name><name><surname>Newhouse</surname><given-names>J.</given-names></name></person-group><publisher-name>Elsevier</publisher-name><publisher-loc>Amsterdam, The Netherland</publisher-loc><year>2000</year></citation></ref>
<ref id="b40-ijms-13-05207"><label>40</label><citation citation-type="book"><person-group person-group-type="author"><name><surname>Cameron</surname><given-names>A.C.</given-names></name><name><surname>Trivedi</surname><given-names>P.K.</given-names></name></person-group><source>Regression Analysis of Count Data</source><publisher-name>Cambridge University Press</publisher-name><publisher-loc>London, UK</publisher-loc><year>1998</year></citation></ref>
<ref id="b41-ijms-13-05207"><label>41</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname><given-names>R.A.</given-names></name></person-group><article-title>A theoretical distribution for the apparent abundance of different species</article-title><source>J. Anim. Ecol.</source><year>1943</year><volume>12</volume><fpage>54</fpage><lpage>58</lpage></citation></ref>
<ref id="b42-ijms-13-05207"><label>42</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shaked</surname><given-names>M.</given-names></name></person-group><article-title>A family of concepts of dependence for bivariate distributions</article-title><source>J. Am. Stat. Assoc</source><year>1977</year><volume>72</volume><fpage>642</fpage><lpage>650</lpage><pub-id pub-id-type="doi">10.1080/01621459.1977.10480628</pub-id></citation></ref>
<ref id="b43-ijms-13-05207"><label>43</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Marshall</surname><given-names>A.W.</given-names></name><name><surname>Olkin</surname><given-names>I.</given-names></name></person-group><article-title>Multivariate distributions generated from mixtures of convolution and product families, lecture notes-monograph series</article-title><source>Top. Stat. Depend</source><year>1990</year><volume>16</volume><fpage>371</fpage><lpage>393</lpage></citation></ref>
<ref id="b44-ijms-13-05207"><label>44</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jäntschi</surname><given-names>L.</given-names></name><name><surname>Bolboacă</surname><given-names>S.D.</given-names></name><name><surname>Bălan</surname><given-names>M.C.</given-names></name><name><surname>Sestraş</surname><given-names>R.E.</given-names></name></person-group><article-title>Distribution fitting 13. Analysis of independent, multiplicative effect of factors. Application to effect of essential oils extracts from plant species on bacterial species. Application to factors of antibacterial activity of plant species</article-title><source>Bull. Univ. Agric. Sci. Vet. Med. Cluj-Napoca. Anim. Sci. Biotechnol</source><year>2011</year><volume>68</volume><fpage>323</fpage><lpage>331</lpage></citation></ref>
<ref id="b45-ijms-13-05207"><label>45</label><citation citation-type="web"><person-group person-group-type="author"><name><surname>Kundu</surname><given-names>D.</given-names></name><name><surname>Manglick</surname><given-names>A</given-names></name></person-group><source>Discriminating between the log-normal and gamma distributions</source><comment>Available online: <ext-link xlink:href="http://home.iitk.ac.in/~kundu/paper93.pdf" ext-link-type="uri">http://home.iitk.ac.in/~kundu/paper93.pdf</ext-link></comment><access-date>accessed on 1 August 2011</access-date></citation></ref>
<ref id="b46-ijms-13-05207"><label>46</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Bolboacă</surname><given-names>S.D.</given-names></name><name><surname>Jäntschi</surname><given-names>L.</given-names></name></person-group><article-title>Modelling the property of compounds from structure: Statistical methods for models validation</article-title><source>Environ. Chem. Lett</source><year>2008</year><volume>6</volume><fpage>175</fpage><lpage>181</lpage><pub-id pub-id-type="doi">10.1007/s10311-007-0119-9</pub-id></citation></ref>
<ref id="b47-ijms-13-05207"><label>47</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Kolmogorov</surname><given-names>A.</given-names></name></person-group><article-title>Confidence limits for an unknown distribution function</article-title><source>Ann. Math. Stat</source><year>1941</year><volume>12</volume><fpage>461</fpage><lpage>463</lpage><pub-id pub-id-type="doi">10.1214/aoms/1177731684</pub-id></citation></ref>
<ref id="b48-ijms-13-05207"><label>48</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Anderson</surname><given-names>T.W.</given-names></name><name><surname>Darling</surname><given-names>D.A.</given-names></name></person-group><article-title>Asymptotic theory of certain “goodness-of-fit” criteria based on stochastic processes</article-title><source>Ann. Math. Stat</source><year>1952</year><volume>23</volume><fpage>193</fpage><lpage>212</lpage><pub-id pub-id-type="doi">10.1214/aoms/1177729437</pub-id></citation></ref>
<ref id="b49-ijms-13-05207"><label>49</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname><given-names>R.A.</given-names></name></person-group><article-title>Combining independent tests of significance</article-title><source>Am. Stat</source><year>1948</year><volume>2</volume><fpage>30</fpage></citation></ref>
<ref id="b50-ijms-13-05207"><label>50</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Hobza</surname><given-names>P.</given-names></name><name><surname>Kabeláč</surname><given-names>M.</given-names></name><name><surname>Šponer</surname><given-names>J.</given-names></name><name><surname>Mejzlík</surname><given-names>P.</given-names></name><name><surname>Vondrášek</surname><given-names>J.</given-names></name></person-group><article-title>Performance of empirical potentials (AMBER, CFF95, CVFF, CHARMM, OPLS, POLTEV), semiempirical quantum chemical methods (AM1, MNDO/M, PM3), and Ab initio Hartree-Fock method for interaction of DNA bases: Comparison with nonempirical beyond Hartree-Fock results</article-title><source>J. Comput. Chem</source><year>1997</year><volume>18</volume><fpage>1136</fpage><lpage>1150</lpage><pub-id pub-id-type="doi">10.1002/(SICI)1096-987X(19970715)18:9&lt;1136::AID-JCC3&gt;3.0.CO;2-S</pub-id></citation></ref>
<ref id="b51-ijms-13-05207"><label>51</label><citation citation-type="book"><source>HyperChem, version 8.0</source><publisher-name>Hypercube Inc</publisher-name><publisher-loc>Gainesville, FL, USA</publisher-loc><year>2007</year></citation></ref>
<ref id="b52-ijms-13-05207"><label>52</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jäntschi</surname><given-names>L.</given-names></name></person-group><article-title>Computer assisted geometry optimization for <italic>in silico</italic> modeling</article-title><source>Appl. Med. Inform</source><year>2011</year><volume>29</volume><fpage>11</fpage><lpage>18</lpage></citation></ref>
<ref id="b53-ijms-13-05207"><label>53</label><citation citation-type="thesis"><person-group person-group-type="author"><name><surname>Jäntschi</surname><given-names>L</given-names></name></person-group><article-title>Genetic Algorithms and Their Applications (in Romanian)</article-title><source>Ph.D. Dissertation</source><publisher-name>University of Agricultural Sciences and Veterinary Medicine</publisher-name><publisher-loc>Cluj-Napoca, Romania</publisher-loc><year>2010</year></citation></ref>
<ref id="b54-ijms-13-05207"><label>54</label><citation citation-type="confproc"><person-group person-group-type="author"><name><surname>Jäntschi</surname><given-names>L.</given-names></name><name><surname>Bolboacă</surname><given-names>S.D.</given-names></name><name><surname>Sestraş</surname><given-names>R.E.</given-names></name></person-group><article-title>Quantum Mechanics Study on a Series of Steroids Relating Separation with Structure</article-title><conf-name>Proceedings of 17th International Symposium on Separation Sciences: Book of Abstracts</conf-name><conf-loc>Cluj-Napoca, Romania</conf-loc><conf-date>September 5–9, 2011</conf-date><publisher-name>Casa Cărţii de Ştiinţă</publisher-name><publisher-loc>Cluj-Napoca, Romania</publisher-loc><year>2011</year><fpage>59</fpage></citation></ref>
<ref id="b55-ijms-13-05207"><label>55</label><citation citation-type="book"><source>DRAGON, version 5.5</source><publisher-name>Talete srl</publisher-name><publisher-loc>Milano, Italy</publisher-loc><year>2007</year></citation></ref>
<ref id="b56-ijms-13-05207"><label>56</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Pauling</surname><given-names>L.</given-names></name></person-group><article-title>The nature of the chemical bond. IV. The energy of single bonds and the relative electronegativity of atoms</article-title><source>J. Am. Chem. Soc</source><year>1932</year><volume>54</volume><fpage>3570</fpage><lpage>3582</lpage><pub-id pub-id-type="doi">10.1021/ja01348a011</pub-id></citation></ref>
<ref id="b57-ijms-13-05207"><label>57</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Jäntschi</surname><given-names>L.</given-names></name><name><surname>Bolboacă</surname><given-names>S.D.</given-names></name></person-group><article-title>Distribution Fitting 2. Pearson-Fisher, Kolmogorov-Smirnov, Anderson-Darling, Wilks-Shapiro, Kramer-von-Misses and Jarque-Bera statistics</article-title><source>Bull. Univ. Agric. Sci. Vet. Med. Cluj-Napoca. Hortic</source><year>2009</year><volume>66</volume><fpage>691</fpage><lpage>697</lpage></citation></ref>
<ref id="b58-ijms-13-05207"><label>58</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Grubbs</surname><given-names>F.</given-names></name></person-group><article-title>Procedures for detecting outlying observations in samples</article-title><source>Technometrics</source><year>1969</year><volume>11</volume><fpage>1</fpage><lpage>21</lpage><pub-id pub-id-type="doi">10.1080/00401706.1969.10490657</pub-id></citation></ref>
<ref id="b59-ijms-13-05207"><label>59</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chatterjee</surname><given-names>S.</given-names></name><name><surname>Hadi</surname><given-names>A.S.</given-names></name></person-group><article-title>Influential observations, high leverage points, and outliers in linear regression (with discussion)</article-title><source>Stat. Sci</source><year>1986</year><volume>1</volume><fpage>379</fpage><lpage>416</lpage><pub-id pub-id-type="doi">10.1214/ss/1177013622</pub-id></citation></ref>
<ref id="b60-ijms-13-05207"><label>60</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Eriksson</surname><given-names>L.</given-names></name><name><surname>Jaworska</surname><given-names>J.</given-names></name><name><surname>Worth</surname><given-names>A.P.</given-names></name><name><surname>Cronin</surname><given-names>M.T.D.</given-names></name><name><surname>McDowell</surname><given-names>R.M.</given-names></name><name><surname>Gramatica</surname><given-names>P.</given-names></name></person-group><article-title>Methods for reliability and uncertainty assessment and for applicability evaluations of classification and regression-based QSARs</article-title><source>Environ. Health Perspect</source><year>2003</year><volume>111</volume><fpage>1361</fpage><lpage>1375</lpage><pub-id pub-id-type="doi">10.1289/ehp.5758</pub-id><pub-id pub-id-type="pmid">12896860</pub-id></citation></ref>
<ref id="b61-ijms-13-05207"><label>61</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Chirico</surname><given-names>N.</given-names></name><name><surname>Gramatica</surname><given-names>P.</given-names></name></person-group><article-title>Real external predictivity of QSAR models: How to evaluate it? Comparison of different validation criteria and proposal of using the concordance correlation coefficient</article-title><source>J. Chem. Inf. Model</source><year>2011</year><volume>51</volume><fpage>2320</fpage><lpage>2335</lpage><pub-id pub-id-type="doi">10.1021/ci200211n</pub-id><pub-id pub-id-type="pmid">21800825</pub-id></citation></ref>
<ref id="b62-ijms-13-05207"><label>62</label><citation citation-type="web"><person-group person-group-type="author"><name><surname>McBride</surname><given-names>G.B.</given-names></name></person-group><article-title>A Proposal for Strength-of-Agreement Criteria for Lin’S Concordance Correlation Coefficient</article-title><source>NIWA Client Report: HAM2005-062</source><publisher-name>National Institute of Water &amp; Atmospheric Research</publisher-name><publisher-loc>Hamilton, New Zeeland</publisher-loc><month>May</month><year>2005</year><comment>Available online: <ext-link xlink:href="http://www.medcalc.org/download/pdf/McBride2005.pdf" ext-link-type="uri">http://www.medcalc.org/download/pdf/McBride2005.pdf</ext-link></comment><access-date>accessed on 14 March 2012</access-date></citation></ref>
<ref id="b63-ijms-13-05207"><label>63</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Shi</surname><given-names>L.M.</given-names></name><name><surname>Fang</surname><given-names>H.</given-names></name><name><surname>Tong</surname><given-names>W.</given-names></name><name><surname>Wu</surname><given-names>J.</given-names></name><name><surname>Perkins</surname><given-names>R.</given-names></name><name><surname>Blair</surname><given-names>R.M.</given-names></name><name><surname>Branham</surname><given-names>W.S.</given-names></name><name><surname>Dial</surname><given-names>S.L.</given-names></name><name><surname>Moland</surname><given-names>C.L.</given-names></name><name><surname>Sheehan</surname><given-names>D.M.</given-names></name></person-group><article-title>QSAR models using a large diverse set of estrogens</article-title><source>J. Chem. Inf. Comput. Sci</source><year>2001</year><volume>41</volume><fpage>186</fpage><lpage>195</lpage><pub-id pub-id-type="doi">10.1021/ci000066d</pub-id><pub-id pub-id-type="pmid">11206373</pub-id></citation></ref>
<ref id="b64-ijms-13-05207"><label>64</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Schüürmann</surname><given-names>G.</given-names></name><name><surname>Ebert</surname><given-names>R.U.</given-names></name><name><surname>Chen</surname><given-names>J.</given-names></name><name><surname>Wang</surname><given-names>B.</given-names></name><name><surname>Kühne</surname><given-names>R.</given-names></name></person-group><article-title>External validation and prediction employing the predictive squared correlation coefficient test set activity mean <italic>vs</italic>. training set activity mean</article-title><source>J. Chem. Inf. Model</source><year>2008</year><volume>48</volume><fpage>2140</fpage><lpage>2145</lpage><pub-id pub-id-type="doi">10.1021/ci800253u</pub-id><pub-id pub-id-type="pmid">18954136</pub-id></citation></ref>
<ref id="b65-ijms-13-05207"><label>65</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Consonni</surname><given-names>V.</given-names></name><name><surname>Ballabio</surname><given-names>D.</given-names></name><name><surname>Todeschini</surname><given-names>R.</given-names></name></person-group><article-title>Comments on the definition of the Q<sup>2</sup> parameter for QSAR validation</article-title><source>J. Chem. Inf. Model</source><year>2009</year><volume>49</volume><fpage>1669</fpage><lpage>1678</lpage><pub-id pub-id-type="doi">10.1021/ci900115y</pub-id><pub-id pub-id-type="pmid">19527034</pub-id></citation></ref>
<ref id="b66-ijms-13-05207"><label>66</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Golbraikh</surname><given-names>A.</given-names></name><name><surname>Tropsha</surname><given-names>A.</given-names></name></person-group><article-title>Beware of <italic>q</italic><sup>2</sup>!</article-title><source>J. Mol. Gr. Mod</source><year>2002</year><volume>20</volume><fpage>269</fpage><lpage>276</lpage><pub-id pub-id-type="doi">10.1016/S1093-3263(01)00123-1</pub-id></citation></ref>
<ref id="b67-ijms-13-05207"><label>67</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Fisher</surname><given-names>R.A.</given-names></name></person-group><article-title>The goodness of fit of regression formulae, and the distribution of regression coefficients</article-title><source>J. Royal Stat. Soc</source><year>1922</year><volume>85</volume><fpage>597</fpage><lpage>612</lpage><pub-id pub-id-type="doi">10.2307/2341124</pub-id></citation></ref>
<ref id="b68-ijms-13-05207"><label>68</label><citation citation-type="journal"><person-group person-group-type="author"><name><surname>Steiger</surname><given-names>J.H.</given-names></name></person-group><article-title>Tests for comparing elements of a correlation matrix</article-title><source>Psychol. Bull</source><year>1980</year><volume>87</volume><fpage>245</fpage><lpage>251</lpage><pub-id pub-id-type="doi">10.1037/0033-2909.87.2.245</pub-id></citation></ref></ref-list>
<sec sec-type="display-objects">
<title>Figures and Tables</title>
<fig id="f1-ijms-13-05207" position="float">
<label>Figure 1</label>
<caption>
<p>Results of probability distribution functions analysis. X: Compounds (<bold>1</bold>–<bold>21; 1</bold> = Citral, <bold>2</bold> = Geraniol, <bold>3</bold> = Geranyl formate, <bold>4</bold> = Geranyl acetate, <bold>5</bold> = Geranyl butyrate, <bold>6</bold> = Geranyl tiglate, <bold>7</bold> = Neral, <bold>8</bold> = Nerol, <bold>9</bold> = Nerol acetate, <bold>10</bold> = Neryl butyrate, <bold>11</bold> = Neryl propanoate, <bold>12</bold> = Citronellal, <bold>13</bold> = Citronellyl formate, <bold>14</bold> = Citronellyl acetate, <bold>15</bold> = Citronellyl butyrate, <bold>16</bold> = Citronellyl isobutyrate, <bold>17</bold> = Citronellyl propionate, <bold>18</bold> = Hydroxycitronellal, <bold>19</bold> = Rose oxide, <bold>20</bold> = Eugenol, <bold>21</bold> = Sulfametrole, <bold>32</bold> = Citronellol), Oils (<bold>22</bold>–<bold>29; 22</bold> = Citronella, <bold>23</bold> = Geranium Africa, <bold>24</bold> = Geranium Bourbon, <bold>25</bold> = Geranium China, <bold>26</bold> = Helichrysum, <bold>27</bold> = Palmarosa, <bold>28</bold> = Rose, <bold>29</bold> = Verbena), Mixtures (<bold>30</bold>–<bold>31; 30</bold> = Tetracycline hydrochloride, <bold>31</bold> = Ciproxin); Y: Binomial (◆), NegBino (■), Poisson (▴); “Is Y the distribution of any X on bacteria and fungi species?”.</p></caption>
<graphic xlink:href="ijms-13-05207f1.gif"/></fig>
<fig id="f2-ijms-13-05207" position="float">
<label>Figure 2</label>
<caption>
<p>Williams plot (training set): Dragon descriptors.</p></caption>
<graphic xlink:href="ijms-13-05207f2.gif"/></fig>
<fig id="f3-ijms-13-05207" position="float">
<label>Figure 3</label>
<caption>
<p>Observed <italic>vs</italic>. calculated parameter: QSAR-Dragon (<xref rid="FD1" ref-type="disp-formula">Equation (1)</xref><italic>R</italic><sup>2</sup><sub>TS</sub> = determination coefficient in test set).</p></caption>
<graphic xlink:href="ijms-13-05207f3.gif"/></fig>
<fig id="f4-ijms-13-05207" position="float">
<label>Figure 4</label>
<caption>
<p>Williams plots (training set): SAPF descriptors.</p></caption>
<graphic xlink:href="ijms-13-05207f4.gif"/></fig>
<fig id="f5-ijms-13-05207" position="float">
<label>Figure 5</label>
<caption>
<p>Observed <italic>vs</italic>. calculated parameter: QSAR-SAPF (<xref rid="FD2" ref-type="disp-formula">Equation (2)</xref><italic>R</italic><sup>2</sup><sub>TS</sub> = determination coefficient in test set).</p></caption>
<graphic xlink:href="ijms-13-05207f5.gif"/></fig>
<fig id="f6-ijms-13-05207" position="float">
<label>Figure 6</label>
<caption>
<p>SAPF descriptors (v = value, ln = natural logarithm, V = vector, T = topology, G = geometry, x, y, z = geometric atomic coordinates, i = atom, refD = modality to calculate coordinates—from average, refP = modality to calculate coordinates—from property center formula, t = topological atomic coordinate.</p></caption>
<graphic xlink:href="ijms-13-05207f6.gif"/></fig>
<table-wrap id="t1-ijms-13-05207" position="float">
<label>Table 1</label>
<caption>
<p>Statistical parameters and population properties.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="left" valign="bottom"/>
<th align="center" valign="bottom"><xref ref-type="table-fn" rid="tfn1-ijms-13-05207">λ</xref></th>
<th align="center" valign="bottom">Mode</th>
<th align="center" valign="bottom">Mean</th>
<th align="center" valign="bottom">Var</th>
<th align="center" valign="bottom">StDev</th>
<th align="center" valign="bottom">Skew</th>
<th align="center" valign="bottom">EKurt</th>
<th align="center" valign="bottom">Median</th></tr></thead>
<tbody>
<tr>
<td colspan="9" align="left" valign="top"><bold>Compound (CID)</bold></td></tr>
<tr>
<td align="left" valign="top">Citral (638011)</td>
<td align="center" valign="top">14.125</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">14.125</td>
<td align="center" valign="top">14.125</td>
<td align="center" valign="top">3.758</td>
<td align="center" valign="top">0.266</td>
<td align="center" valign="top">0.071</td>
<td align="center" valign="top">13.457</td></tr>
<tr>
<td align="left" valign="top">Geraniol (637566)</td>
<td align="center" valign="top">13.750</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">13.750</td>
<td align="center" valign="top">13.750</td>
<td align="center" valign="top">3.708</td>
<td align="center" valign="top">0.270</td>
<td align="center" valign="top">0.073</td>
<td align="center" valign="top">13.082</td></tr>
<tr>
<td align="left" valign="top">Geranyl formate (5282109)</td>
<td align="center" valign="top">8.875</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8.875</td>
<td align="center" valign="top">8.875</td>
<td align="center" valign="top">2.979</td>
<td align="center" valign="top">0.336</td>
<td align="center" valign="top">0.113</td>
<td align="center" valign="top">8.207</td></tr>
<tr>
<td align="left" valign="top">Geranyl acetate (1549026)</td>
<td align="center" valign="top">8.200</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8.200</td>
<td align="center" valign="top">8.200</td>
<td align="center" valign="top">2.864</td>
<td align="center" valign="top">0.349</td>
<td align="center" valign="top">0.122</td>
<td align="center" valign="top">7.531</td></tr>
<tr>
<td align="left" valign="top">Geranyl butyrate (5355856)</td>
<td align="center" valign="top">8.714</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8.714</td>
<td align="center" valign="top">8.714</td>
<td align="center" valign="top">2.952</td>
<td align="center" valign="top">0.339</td>
<td align="center" valign="top">0.115</td>
<td align="center" valign="top">8.046</td></tr>
<tr>
<td align="left" valign="top">Geranyl tiglate (5367785)</td>
<td align="center" valign="top">11.625</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">11.625</td>
<td align="center" valign="top">11.625</td>
<td align="center" valign="top">3.410</td>
<td align="center" valign="top">0.293</td>
<td align="center" valign="top">0.086</td>
<td align="center" valign="top">10.957</td></tr>
<tr>
<td align="left" valign="top">Neral (643779)</td>
<td align="center" valign="top">13.500</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">13.500</td>
<td align="center" valign="top">13.500</td>
<td align="center" valign="top">3.674</td>
<td align="center" valign="top">0.272</td>
<td align="center" valign="top">0.074</td>
<td align="center" valign="top">12.932</td></tr>
<tr>
<td align="left" valign="top">Nerol (643820)</td>
<td align="center" valign="top">11.250</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">11.250</td>
<td align="center" valign="top">11.250</td>
<td align="center" valign="top">3.354</td>
<td align="center" valign="top">0.298</td>
<td align="center" valign="top">0.089</td>
<td align="center" valign="top">10.582</td></tr>
<tr>
<td align="left" valign="top">Nerol acetate (1549025)</td>
<td align="center" valign="top">7.333</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7.333</td>
<td align="center" valign="top">7.333</td>
<td align="center" valign="top">2.708</td>
<td align="center" valign="top">0.369</td>
<td align="center" valign="top">0.136</td>
<td align="center" valign="top">6.664</td></tr>
<tr>
<td align="left" valign="top">Neryl butyrate (5352162)</td>
<td align="center" valign="top">10.714</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10.714</td>
<td align="center" valign="top">10.714</td>
<td align="center" valign="top">3.273</td>
<td align="center" valign="top">0.306</td>
<td align="center" valign="top">0.093</td>
<td align="center" valign="top">10.046</td></tr>
<tr>
<td align="left" valign="top">Neryl propanoate (5365982)</td>
<td align="center" valign="top">10.714</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10.714</td>
<td align="center" valign="top">10.714</td>
<td align="center" valign="top">3.273</td>
<td align="center" valign="top">0.306</td>
<td align="center" valign="top">0.093</td>
<td align="center" valign="top">10.046</td></tr>
<tr>
<td align="left" valign="top">Citronellal (7794)</td>
<td align="center" valign="top">14.600</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">14.600</td>
<td align="center" valign="top">14.600</td>
<td align="center" valign="top">3.821</td>
<td align="center" valign="top">0.262</td>
<td align="center" valign="top">0.068</td>
<td align="center" valign="top">13.932</td></tr>
<tr>
<td align="left" valign="top">Citronellyl formate (7778)</td>
<td align="center" valign="top">12.143</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">12.143</td>
<td align="center" valign="top">12.143</td>
<td align="center" valign="top">3.485</td>
<td align="center" valign="top">0.287</td>
<td align="center" valign="top">0.082</td>
<td align="center" valign="top">11.475</td></tr>
<tr>
<td align="left" valign="top">Citronellyl acetate (9017)</td>
<td align="center" valign="top">7.286</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7.286</td>
<td align="center" valign="top">7.286</td>
<td align="center" valign="top">2.699</td>
<td align="center" valign="top">0.370</td>
<td align="center" valign="top">0.137</td>
<td align="center" valign="top">6.617</td></tr>
<tr>
<td align="left" valign="top">Citronellyl butyrate (8835)</td>
<td align="center" valign="top">8.167</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8.167</td>
<td align="center" valign="top">8.167</td>
<td align="center" valign="top">2.858</td>
<td align="center" valign="top">0.350</td>
<td align="center" valign="top">0.122</td>
<td align="center" valign="top">7.498</td></tr>
<tr>
<td align="left" valign="top">Citronellyl isobutyrate (60985)</td>
<td align="center" valign="top">8.200</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8.200</td>
<td align="center" valign="top">8.200</td>
<td align="center" valign="top">2.864</td>
<td align="center" valign="top">0.349</td>
<td align="center" valign="top">0.122</td>
<td align="center" valign="top">7.531</td></tr>
<tr>
<td align="left" valign="top">Citronellyl propionate (8834)</td>
<td align="center" valign="top">14.333</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">14.333</td>
<td align="center" valign="top">14.333</td>
<td align="center" valign="top">3.786</td>
<td align="center" valign="top">0.264</td>
<td align="center" valign="top">0.070</td>
<td align="center" valign="top">13.665</td></tr>
<tr>
<td align="left" valign="top">Hydroxycitronellal (7888)</td>
<td align="center" valign="top">18.750</td>
<td align="center" valign="top">18</td>
<td align="center" valign="top">18.750</td>
<td align="center" valign="top">18.750</td>
<td align="center" valign="top">4.330</td>
<td align="center" valign="top">0.231</td>
<td align="center" valign="top">0.053</td>
<td align="center" valign="top">18.083</td></tr>
<tr>
<td align="left" valign="top">Rose oxide (27866)</td>
<td align="center" valign="top">12.800</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">12.800</td>
<td align="center" valign="top">12.800</td>
<td align="center" valign="top">3.578</td>
<td align="center" valign="top">0.280</td>
<td align="center" valign="top">0.078</td>
<td align="center" valign="top">12.132</td></tr>
<tr>
<td align="left" valign="top">Eugenol (3314)</td>
<td align="center" valign="top">28.250</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">28.250</td>
<td align="center" valign="top">28.250</td>
<td align="center" valign="top">5.315</td>
<td align="center" valign="top">0.188</td>
<td align="center" valign="top">0.035</td>
<td align="center" valign="top">27.583</td></tr>
<tr>
<td align="left" valign="top">Sulfametrole (64939)</td>
<td align="center" valign="top">19.200</td>
<td align="center" valign="top">19</td>
<td align="center" valign="top">19.200</td>
<td align="center" valign="top">19.200</td>
<td align="center" valign="top">4.382</td>
<td align="center" valign="top">0.228</td>
<td align="center" valign="top">0.052</td>
<td align="center" valign="top">18.533</td></tr>
<tr>
<td colspan="9" align="left" valign="top"><bold>Oil</bold></td></tr>
<tr>
<td align="left" valign="top">Citronella</td>
<td align="center" valign="top">9.750</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">9.750</td>
<td align="center" valign="top">9.750</td>
<td align="center" valign="top">3.122</td>
<td align="center" valign="top">0.320</td>
<td align="center" valign="top">0.103</td>
<td align="center" valign="top">9.082</td></tr>
<tr>
<td align="left" valign="top">Geranium Africa</td>
<td align="center" valign="top">13.250</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">13.250</td>
<td align="center" valign="top">13.250</td>
<td align="center" valign="top">3.640</td>
<td align="center" valign="top">0.275</td>
<td align="center" valign="top">0.075</td>
<td align="center" valign="top">12.582</td></tr>
<tr>
<td align="left" valign="top">Geranium Bourbon</td>
<td align="center" valign="top">12.500</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">12.500</td>
<td align="center" valign="top">12.500</td>
<td align="center" valign="top">3.536</td>
<td align="center" valign="top">0.283</td>
<td align="center" valign="top">0.080</td>
<td align="center" valign="top">11.832</td></tr>
<tr>
<td align="left" valign="top">Geranium China</td>
<td align="center" valign="top">13.625</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">13.625</td>
<td align="center" valign="top">13.625</td>
<td align="center" valign="top">3.691</td>
<td align="center" valign="top">0.271</td>
<td align="center" valign="top">0.073</td>
<td align="center" valign="top">12.957</td></tr>
<tr>
<td align="left" valign="top">Helichrysum</td>
<td align="center" valign="top">10.667</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10.667</td>
<td align="center" valign="top">10.667</td>
<td align="center" valign="top">3.266</td>
<td align="center" valign="top">0.306</td>
<td align="center" valign="top">0.094</td>
<td align="center" valign="top">9.999</td></tr>
<tr>
<td align="left" valign="top">Palmarosa</td>
<td align="center" valign="top">11.625</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">11.625</td>
<td align="center" valign="top">11.625</td>
<td align="center" valign="top">3.410</td>
<td align="center" valign="top">0.293</td>
<td align="center" valign="top">0.086</td>
<td align="center" valign="top">10.957</td></tr>
<tr>
<td align="left" valign="top">Rose</td>
<td align="center" valign="top">12.750</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">12.750</td>
<td align="center" valign="top">12.750</td>
<td align="center" valign="top">3.571</td>
<td align="center" valign="top">0.280</td>
<td align="center" valign="top">0.078</td>
<td align="center" valign="top">12.082</td></tr>
<tr>
<td align="left" valign="top">Verbena</td>
<td align="center" valign="top">16.500</td>
<td align="center" valign="top">16</td>
<td align="center" valign="top">16.500</td>
<td align="center" valign="top">16.500</td>
<td align="center" valign="top">4.062</td>
<td align="center" valign="top">0.246</td>
<td align="center" valign="top">0.061</td>
<td align="center" valign="top">15.833</td></tr>
<tr>
<td colspan="9" align="left" valign="top"><bold>Mixture</bold></td></tr>
<tr>
<td align="left" valign="top">Tetracycline hydrochloride</td>
<td align="center" valign="top">15.143</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">15.143</td>
<td align="center" valign="top">15.143</td>
<td align="center" valign="top">3.891</td>
<td align="center" valign="top">0.257</td>
<td align="center" valign="top">0.066</td>
<td align="center" valign="top">14.476</td></tr>
<tr>
<td align="left" valign="top">Ciproxin</td>
<td align="center" valign="top">26.000</td>
<td align="center" valign="top">26</td>
<td align="center" valign="top">26.000</td>
<td align="center" valign="top">26.000</td>
<td align="center" valign="top">5.099</td>
<td align="center" valign="top">0.196</td>
<td align="center" valign="top">0.038</td>
<td align="center" valign="top">25.333</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn1-ijms-13-05207">
<p>λ = Parameter of Poisson distribution; Var = variance; StDev = standard deviation; Skew = skewness; EKurt = Excess Kurtosis.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t2-ijms-13-05207" position="float">
<label>Table 2</label>
<caption>
<p>QSAR Residuals: Dragon <italic>vs</italic>. SAPF.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Set</th>
<th align="center" valign="bottom">CID</th>
<th align="center" valign="bottom">Y</th>
<th align="center" valign="bottom">Ŷ<sub>Dragon</sub></th>
<th align="center" valign="bottom">Res<sub>Dragon</sub></th>
<th align="center" valign="bottom">Ŷ<sub>SAPF</sub></th>
<th align="center" valign="bottom">Res<sub>SAPF</sub></th></tr></thead>
<tbody>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">1549025</td>
<td align="center" valign="top">1.9924</td>
<td align="center" valign="top">2.0070</td>
<td align="center" valign="top">−0.0146</td>
<td align="center" valign="top">2.0761</td>
<td align="center" valign="top">−0.0836</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">8835</td>
<td align="center" valign="top">2.1001</td>
<td align="center" valign="top">2.0564</td>
<td align="center" valign="top">0.0437</td>
<td align="center" valign="top">2.1461</td>
<td align="center" valign="top">−0.0460</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">60985</td>
<td align="center" valign="top">2.1041</td>
<td align="center" valign="top">2.0768</td>
<td align="center" valign="top">0.0273</td>
<td align="center" valign="top">2.0553</td>
<td align="center" valign="top">0.0488</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">5282109</td>
<td align="center" valign="top">2.1832</td>
<td align="center" valign="top">2.2596</td>
<td align="center" valign="top">−0.0764</td>
<td align="center" valign="top">2.3267</td>
<td align="center" valign="top">−0.1435</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">643820</td>
<td align="center" valign="top">2.4204</td>
<td align="center" valign="top">2.6106</td>
<td align="center" valign="top">−0.1902</td>
<td align="center" valign="top">2.7127</td>
<td align="center" valign="top">−0.2923</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">7778</td>
<td align="center" valign="top">2.4968</td>
<td align="center" valign="top">2.4132</td>
<td align="center" valign="top">0.0835</td>
<td align="center" valign="top">2.2816</td>
<td align="center" valign="top">0.2151</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">27866</td>
<td align="center" valign="top">2.5494</td>
<td align="center" valign="top">2.5905</td>
<td align="center" valign="top">−0.0411</td>
<td align="center" valign="top">2.4957</td>
<td align="center" valign="top">0.0538</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">637566</td>
<td align="center" valign="top">2.6210</td>
<td align="center" valign="top">2.6106</td>
<td align="center" valign="top">0.0104</td>
<td align="center" valign="top">2.7127</td>
<td align="center" valign="top">−0.0917</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">638011</td>
<td align="center" valign="top">2.6479</td>
<td align="center" valign="top">2.7061</td>
<td align="center" valign="top">−0.0582</td>
<td align="center" valign="top">2.6042</td>
<td align="center" valign="top">0.0437</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">8842</td>
<td align="center" valign="top">2.6741</td>
<td align="center" valign="top">2.6435</td>
<td align="center" valign="top">0.0307</td>
<td align="center" valign="top">2.5713</td>
<td align="center" valign="top">0.1029</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">7794</td>
<td align="center" valign="top">2.6810</td>
<td align="center" valign="top">2.6929</td>
<td align="center" valign="top">−0.0118</td>
<td align="center" valign="top">2.6430</td>
<td align="center" valign="top">0.0380</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">7888</td>
<td align="center" valign="top">2.9312</td>
<td align="center" valign="top">2.7346</td>
<td align="center" valign="top">0.1966</td>
<td align="center" valign="top">2.8638</td>
<td align="center" valign="top">0.0674</td></tr>
<tr>
<td align="center" valign="top">Training</td>
<td align="center" valign="top">64939</td>
<td align="center" valign="top">2.9549</td>
<td align="center" valign="top"/>
<td align="center" valign="top"/>
<td align="center" valign="top">2.8674</td>
<td align="center" valign="top">0.0875</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">1549026</td>
<td align="center" valign="top">2.1041</td>
<td align="center" valign="top">2.0070</td>
<td align="center" valign="top">0.0971</td>
<td align="center" valign="top">2.2012</td>
<td align="center" valign="top">−0.0971</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">5355856</td>
<td align="center" valign="top">2.1650</td>
<td align="center" valign="top">1.9271</td>
<td align="center" valign="top">0.2379</td>
<td align="center" valign="top">2.2830</td>
<td align="center" valign="top">−0.1180</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">5352162</td>
<td align="center" valign="top">2.3716</td>
<td align="center" valign="top">1.9271</td>
<td align="center" valign="top">0.4445</td>
<td align="center" valign="top">2.7847</td>
<td align="center" valign="top">−0.4132</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">5367785</td>
<td align="center" valign="top">2.4532</td>
<td align="center" valign="top">1.8661</td>
<td align="center" valign="top">0.5870</td>
<td align="center" valign="top">2.4642</td>
<td align="center" valign="top">−0.0111</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">643779</td>
<td align="center" valign="top">2.6027</td>
<td align="center" valign="top">2.7061</td>
<td align="center" valign="top">−0.1034</td>
<td align="center" valign="top">2.6006</td>
<td align="center" valign="top">0.0021</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">8834</td>
<td align="center" valign="top">2.6626</td>
<td align="center" valign="top">2.4108</td>
<td align="center" valign="top">0.2518</td>
<td align="center" valign="top">2.6207</td>
<td align="center" valign="top">0.0418</td></tr>
<tr>
<td align="center" valign="top">Test</td>
<td align="center" valign="top">3314</td>
<td align="center" valign="top">3.3411</td>
<td align="center" valign="top">2.7843</td>
<td align="center" valign="top">0.5568</td>
<td align="center" valign="top">3.3685</td>
<td align="center" valign="top">−0.0274</td></tr>
<tr>
<td align="center" valign="top">External</td>
<td align="center" valign="top">9017</td>
<td align="center" valign="top">1.9859</td>
<td align="center" valign="top">2.1432</td>
<td align="center" valign="top">−0.1572</td>
<td align="center" valign="top">2.0053</td>
<td align="center" valign="top">−0.0194</td></tr>
<tr>
<td align="center" valign="top">External</td>
<td align="center" valign="top">5365982</td>
<td align="center" valign="top">2.3716</td>
<td align="center" valign="top">2.2688</td>
<td align="center" valign="top">0.1028</td>
<td align="center" valign="top">2.2889</td>
<td align="center" valign="top">0.0827</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn2-ijms-13-05207">
<p>CID = compound identification number; Y = observed ln(λ) value; Ŷ = estimated/predicted value; Res = residuals; Dragon = model from <xref rid="FD1" ref-type="disp-formula">Equation(1)</xref>; SAPF = model from <xref rid="FD2" ref-type="disp-formula">Equation(2)</xref>.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t3-ijms-13-05207" position="float">
<label>Table 3</label>
<caption>
<p>Results of comparison: QSAR-Dragon model <italic>vs</italic>. QSAR-SAPF model.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Parameter (Abbreviation)</th>
<th colspan="3" align="center" valign="bottom">Dragon–<xref rid="FD1" ref-type="disp-formula">Equation(1)</xref>–n = 21</th>
<th colspan="3" align="center" valign="bottom">SAPF–<xref rid="FD2" ref-type="disp-formula">Equation(2)</xref>–n = 22</th></tr></thead>
<tbody>
<tr>
<td align="left" valign="top">Root-mean-square error (RMSE)</td>
<td colspan="3" align="center" valign="top">0.2314</td>
<td colspan="3" align="center" valign="top">0.1357</td></tr>
<tr>
<td align="left" valign="top">Mean absolute error (MAE)</td>
<td colspan="3" align="center" valign="top">0.1582</td>
<td colspan="3" align="center" valign="top">0.0967</td></tr>
<tr>
<td align="left" valign="top">Mean Absolute Percentage Error (MAPE)</td>
<td colspan="3" align="center" valign="top">0.0628</td>
<td colspan="3" align="center" valign="top">0.0403</td></tr>
<tr>
<td align="left" valign="top">Standard error of prediction (SEP)</td>
<td colspan="3" align="center" valign="top">0.2371</td>
<td colspan="3" align="center" valign="top">0.0628</td></tr>
<tr>
<td align="left" valign="top">Relative error of prediction (REP%)</td>
<td colspan="3" align="center" valign="top">9.2964</td>
<td colspan="3" align="center" valign="top">5.4523</td></tr>
<tr>
<td colspan="7" align="center" valign="top">Predictive Power of the Model</td></tr>
<tr>
<td align="left" valign="top"> Q<sup>2</sup><sub>F<sub>1</sub></sub></td>
<td colspan="3" align="center" valign="top">0.2121 <xref ref-type="table-fn" rid="tfn3-ijms-13-05207">*</xref></td>
<td colspan="3" align="center" valign="top">0.8436 <xref ref-type="table-fn" rid="tfn3-ijms-13-05207">*</xref></td></tr>
<tr>
<td align="left" valign="top"> Q<sup>2</sup><sub>F<sub>2</sub></sub></td>
<td colspan="3" align="center" valign="top">0.2041 <xref ref-type="table-fn" rid="tfn3-ijms-13-05207">*</xref></td>
<td colspan="3" align="center" valign="top">0.8421 <xref ref-type="table-fn" rid="tfn3-ijms-13-05207">*</xref></td></tr>
<tr>
<td align="left" valign="top"> Q<sup>2</sup><sub>F<sub>3</sub></sub></td>
<td colspan="3" align="center" valign="top">n.a.</td>
<td colspan="3" align="center" valign="top">0.7742 <xref ref-type="table-fn" rid="tfn3-ijms-13-05207">*</xref></td></tr>
<tr>
<td align="left" valign="top"> ρ<sub>c-TR</sub></td>
<td colspan="3" align="center" valign="top">0.9457 <xref ref-type="table-fn" rid="tfn4-ijms-13-05207">a</xref></td>
<td colspan="3" align="center" valign="top">0.9063 <xref ref-type="table-fn" rid="tfn6-ijms-13-05207">c</xref></td></tr>
<tr>
<td align="left" valign="top"> ρ<sub>c-TS</sub></td>
<td colspan="3" align="center" valign="top">0.4885 <xref ref-type="table-fn" rid="tfn5-ijms-13-05207">b</xref></td>
<td colspan="3" align="center" valign="top">0.9219 <xref ref-type="table-fn" rid="tfn7-ijms-13-05207">d</xref></td></tr>
<tr>
<td align="left" valign="top">Fisher’s Predictive Power</td>
<td align="center" valign="top">TS</td>
<td align="center" valign="top">EX <xref ref-type="table-fn" rid="tfn8-ijms-13-05207">e</xref></td>
<td align="center" valign="top">TS + EX <xref ref-type="table-fn" rid="tfn9-ijms-13-05207">f</xref></td>
<td align="center" valign="top">TS</td>
<td align="center" valign="top">EX</td>
<td align="center" valign="top">TS + EX</td></tr>
<tr>
<td align="left" valign="top"> <italic>n</italic></td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">2</td>
<td align="center" valign="top">9</td></tr>
<tr>
<td align="left" valign="top"> <italic>t</italic>-value</td>
<td align="center" valign="top">3.1148</td>
<td align="center" valign="top">−0.2095</td>
<td align="center" valign="top">2.5071</td>
<td align="center" valign="top">−1.5344</td>
<td align="center" valign="top">0.6198</td>
<td align="center" valign="top">−1.2830</td></tr>
<tr>
<td align="left" valign="top"> <italic>p</italic>-value</td>
<td align="center" valign="top">0.0104</td>
<td align="center" valign="top">0.4343</td>
<td align="center" valign="top">0.0230</td>
<td align="center" valign="top">0.0879</td>
<td align="center" valign="top">0.3234</td>
<td align="center" valign="top">0.1234</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn3-ijms-13-05207">
<label>*</label>
<p>= test set include also external compounds; ρ<sub>c</sub> = concordance correlation coefficient; TR = training set; TS = test set;</p></fn><fn id="tfn4-ijms-13-05207">
<label>a</label>
<p>accuracy = 0.9985, precision = 0.9471;</p></fn><fn id="tfn5-ijms-13-05207">
<label>b</label>
<p>accuracy = 0.7357, precision = 0.6639;s</p></fn><fn id="tfn6-ijms-13-05207">
<label>c</label>
<p>accuracy = 0.9956, precision = 0.9103;</p></fn><fn id="tfn7-ijms-13-05207">
<label>d</label>
<p>accuracy = 0.9867, precision = 0.9344;</p></fn><fn id="tfn8-ijms-13-05207">
<label>e</label>
<p>= external set (two compounds);</p></fn><fn id="tfn9-ijms-13-05207">
<label>f</label>
<p>= training and external sets.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t4-ijms-13-05207" position="float">
<label>Table 4</label>
<caption>
<p>Compounds, oils and mixtures: inhibition zones (mm).</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom"/>
<th align="center" valign="bottom"/>
<th align="center" valign="bottom"><italic>SA</italic></th>
<th align="center" valign="bottom"><italic>EF</italic></th>
<th align="center" valign="bottom"><italic>EC</italic></th>
<th align="center" valign="bottom"><italic>PV</italic></th>
<th align="center" valign="bottom"><italic>PA</italic></th>
<th align="center" valign="bottom"><italic>S</italic>s</th>
<th align="center" valign="bottom"><italic>KP</italic></th>
<th align="center" valign="bottom"><italic>CA</italic></th>
<th align="center" valign="bottom">n</th></tr></thead>
<tbody>
<tr>
<td colspan="11" align="left" valign="top"><bold>Compound (CID)</bold></td></tr>
<tr>
<td align="left" valign="top"> 1</td>
<td align="left" valign="top">Citral (638011)</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 2</td>
<td align="left" valign="top">Geraniol (637566)</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 3</td>
<td align="left" valign="top">Geranyl formate (5282109)</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 4</td>
<td align="left" valign="top">Geranyl acetate (1549026)</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="left" valign="top"> 5</td>
<td align="left" valign="top">Geranyl butyrate (5355856)</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top"> 6</td>
<td align="left" valign="top">Geranyl tiglate (5367785)</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 7</td>
<td align="left" valign="top">Neral (643779)</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 8</td>
<td align="left" valign="top">Nerol (643820)</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">27</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 9</td>
<td align="left" valign="top">Nerol acetate (1549025)</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">6</td></tr>
<tr>
<td align="left" valign="top"> 10</td>
<td align="left" valign="top">Neryl butyrate (5352162)</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top"> 11</td>
<td align="left" valign="top">Neryl propanoate (5365982)</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top"> 12</td>
<td align="left" valign="top">Citronellal (7794)</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">18</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="left" valign="top"> 13</td>
<td align="left" valign="top">Citronellyl formate (7778)</td>
<td align="center" valign="top">18</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top"> 14</td>
<td align="left" valign="top">Citronellyl acetate (9017)</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">6</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top"> 15</td>
<td align="left" valign="top">Citronellyl butyrate (8835)</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">6</td></tr>
<tr>
<td align="left" valign="top"> 16</td>
<td align="left" valign="top">Citronellyl isobutyrate (60985)</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="left" valign="top"> 17</td>
<td align="left" valign="top">Citronellyl propionate (8834)</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">6</td></tr>
<tr>
<td align="left" valign="top"> 18</td>
<td align="left" valign="top">Hydroxycitronellal (7888)</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">16</td>
<td align="center" valign="top">17</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 19</td>
<td align="left" valign="top">Rose oxide (27866)</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="left" valign="top"> 20</td>
<td align="left" valign="top">Eugenol (3314)</td>
<td align="center" valign="top">30</td>
<td align="center" valign="top">30</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">32</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 21</td>
<td align="left" valign="top">Sulfametrole (64939)</td>
<td align="center" valign="top">27</td>
<td align="center" valign="top">27</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">23</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">5</td></tr>
<tr>
<td align="left" valign="top"> 32</td>
<td align="left" valign="top">Citronellol (8842)</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">18</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">4</td></tr>
<tr>
<td colspan="11" align="left" valign="top"><bold>Oil</bold></td></tr>
<tr>
<td align="left" valign="top"> 22</td>
<td align="left" valign="top">Citronella</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 23</td>
<td align="left" valign="top">Geranium Africa</td>
<td align="center" valign="top">16</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">28</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 24</td>
<td align="left" valign="top">Geranium Bourbon</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 25</td>
<td align="left" valign="top">Geranium China</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">14</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 26</td>
<td align="left" valign="top">Helichrysum</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">7</td>
<td align="center" valign="top">6</td></tr>
<tr>
<td align="left" valign="top"> 27</td>
<td align="left" valign="top">Palmarosa</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 28</td>
<td align="left" valign="top">Rose</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">8</td>
<td align="center" valign="top">9</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td align="left" valign="top"> 29</td>
<td align="left" valign="top">Verbena</td>
<td align="center" valign="top">27</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">12</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">8</td></tr>
<tr>
<td colspan="11" align="left" valign="top"><bold>Mixture</bold></td></tr>
<tr>
<td align="left" valign="top"> 30</td>
<td align="left" valign="top">Tetracycline hydrochloride</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">11</td>
<td align="center" valign="top">13</td>
<td align="center" valign="top">15</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">20</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td></tr>
<tr>
<td align="left" valign="top"> 31</td>
<td align="left" valign="top">Ciproxin</td>
<td align="center" valign="top">35</td>
<td align="center" valign="top">33</td>
<td align="center" valign="top">22</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">32</td>
<td align="center" valign="top">10</td>
<td align="center" valign="top">25</td>
<td align="center" valign="top">NIO</td>
<td align="center" valign="top">7</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn10-ijms-13-05207">
<p>SA = <italic>Staphylococcus aureus</italic>; EF = <italic>Enterococcus faecalis</italic>; EC = <italic>Escherichia coli</italic>; PV = <italic>Proteus vulgaris</italic>; PA = <italic>Pseudomonas aeruginosa</italic>; SS = <italic>Salmonella</italic> sp.; KP = <italic>Klebsiella pneumoniae</italic>; CA = <italic>Candida albicans; n</italic> = sample size; NIO = No Inhibition Observed.</p></fn></table-wrap-foot></table-wrap>
<table-wrap id="t5-ijms-13-05207" position="float">
<label>Table 5</label>
<caption>
<p>Statistical parameters used to assess QSAR models.</p></caption>
<table frame="hsides" rules="groups">
<thead>
<tr>
<th align="center" valign="bottom">Parameter (Abbreviation)</th>
<th align="center" valign="bottom">Formula [ref]</th>
<th align="center" valign="bottom">Remarks</th></tr></thead>
<tbody>
<tr>
<td align="center" valign="middle">Root-mean-square error (RMSE)</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm3">
<mml:semantics id="sm3">
<mml:mrow>
<mml:mtext>RMSE</mml:mtext>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mtext>n</mml:mtext></mml:mfrac></mml:mrow></mml:msqrt></mml:mrow></mml:semantics></mml:math></inline-formula></td>
<td align="left" valign="middle" rowspan="2">RMSE &gt; MAE → variation in the errors exist</td></tr>
<tr>
<td align="center" valign="middle">Mean absolute error (MAE)</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm4">
<mml:semantics id="sm4">
<mml:mrow>
<mml:mtext>MAE</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:mo>∣</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>∣</mml:mo></mml:mrow></mml:mrow>
<mml:mtext>n</mml:mtext></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula></td></tr>
<tr>
<td align="center" valign="middle">Mean Absolute Percentage Error (MAPE) n</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm5">
<mml:semantics id="sm5">
<mml:mrow>
<mml:mtext>MAPE</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:mo>∣</mml:mo>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>∣</mml:mo></mml:mrow></mml:mrow>
<mml:mtext>n</mml:mtext></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula></td>
<td align="left" valign="middle">MAPE ~ 0 → perfect fit</td></tr>
<tr>
<td align="center" valign="middle">Standard error of prediction (SEP)</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm6">
<mml:semantics id="sm6">
<mml:mrow>
<mml:mtext>SEP</mml:mtext>
<mml:mo>=</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mtext>n</mml:mtext>
<mml:mo>-</mml:mo>
<mml:mn>1</mml:mn></mml:mrow></mml:mfrac></mml:mrow></mml:msqrt></mml:mrow></mml:semantics></mml:math></inline-formula></td>
<td align="left" valign="middle">Lower value indicate a good model</td></tr>
<tr>
<td align="center" valign="middle">Relative error of prediction (REP%)</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm7">
<mml:semantics id="sm7">
<mml:mrow>
<mml:mrow>
<mml:mtext>REP(</mml:mtext>
<mml:mo>%</mml:mo>
<mml:mtext>)</mml:mtext></mml:mrow>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>100</mml:mn></mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mfrac>
<mml:msqrt>
<mml:mrow>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mtext>n</mml:mtext></mml:mfrac></mml:mrow></mml:msqrt></mml:mrow></mml:semantics></mml:math></inline-formula></td>
<td align="left" valign="middle">Lower value indicate a good model</td></tr>
<tr>
<td align="center" valign="middle">Concordance analysis (ρ<sub>c</sub>)</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm8">
<mml:semantics id="sm8">
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mi>ρ</mml:mi></mml:mrow>
<mml:mtext>c</mml:mtext></mml:msub>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mn>2</mml:mn>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo>+</mml:mo>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mtext>n</mml:mtext></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup>
<mml:mo>+</mml:mo>
<mml:mtext>n</mml:mtext>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>(</mml:mo>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow>
<mml:mo>-</mml:mo>
<mml:mrow>
<mml:mover accent="true">
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mo>)</mml:mo></mml:mrow></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula> [<xref ref-type="bibr" rid="b61-ijms-13-05207">61</xref>]</td>
<td align="left" valign="middle">Strength of agreement [<xref ref-type="bibr" rid="b62-ijms-13-05207">62</xref>]: &gt;0.99 almost perfect; (0.95; 0.99) substantial; (0.90; 0.95) moderate; &lt;0.90 poor</td></tr>
<tr>
<td align="center" valign="middle" rowspan="3">Predictive Power of the Model Prediction is considered accurate if the predictive power of the model is &gt; 0.6 [<xref ref-type="bibr" rid="b66-ijms-13-05207">66</xref>]</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm9">
<mml:semantics id="sm9">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mtext>Q</mml:mtext></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>F</mml:mtext></mml:mrow>
<mml:mn>1</mml:mn></mml:msub></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula> [<xref ref-type="bibr" rid="b63-ijms-13-05207">63</xref>]</td>
<td align="left" valign="middle">Prediction power relative to mean value of observable in training set</td></tr>
<tr>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm10">
<mml:semantics id="sm10">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mtext>Q</mml:mtext></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>F</mml:mtext></mml:mrow>
<mml:mn>2</mml:mn></mml:msub></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula> [<xref ref-type="bibr" rid="b64-ijms-13-05207">64</xref>]</td>
<td align="left" valign="middle">Prediction power relative to mean value of observable in test set</td></tr>
<tr>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm11">
<mml:semantics id="sm11">
<mml:mrow>
<mml:msubsup>
<mml:mrow>
<mml:mtext>Q</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>F</mml:mtext>
<mml:mn>3</mml:mn></mml:mrow>
<mml:mn>2</mml:mn></mml:msubsup>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo>-</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow>
<mml:mrow>
<mml:mrow>
<mml:mo>[</mml:mo>
<mml:mrow>
<mml:msubsup>
<mml:mo>∑</mml:mo>
<mml:mrow>
<mml:mtext>i</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mn>1</mml:mn></mml:mrow>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msubsup>
<mml:mrow>
<mml:msup>
<mml:mrow>
<mml:mrow>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>y</mml:mtext></mml:mrow>
<mml:mtext>i</mml:mtext></mml:msub>
<mml:mo>-</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo></mml:mrow></mml:mrow>
<mml:mn>2</mml:mn></mml:msup></mml:mrow></mml:mrow>
<mml:mo>]</mml:mo></mml:mrow>
<mml:mo>/</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TR</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:mfrac></mml:mrow></mml:semantics></mml:math></inline-formula> [<xref ref-type="bibr" rid="b65-ijms-13-05207">65</xref>]</td>
<td align="left" valign="middle">Overall prediction weighted by test set sample size relative to observable weighted by mean of observed value in training set weighted by sample size in training set</td></tr>
<tr>
<td align="center" valign="middle">Predictive Power: Fisher’s approach</td>
<td align="left" valign="top">
<inline-formula>
<mml:math id="mm12">
<mml:semantics id="sm12">
<mml:mtable columnalign="left">
<mml:mtr>
<mml:mtd>
<mml:mtext>t</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mfrac>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mtext>res</mml:mtext></mml:mrow>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo>-</mml:mo>
<mml:mn>0</mml:mn></mml:mrow>
<mml:mrow>
<mml:mtext>StDev</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:msub>
<mml:mrow>
<mml:mrow>
<mml:mtext>res</mml:mtext></mml:mrow></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>/</mml:mo>
<mml:msqrt>
<mml:mrow>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub></mml:mrow></mml:msqrt></mml:mrow></mml:mfrac></mml:mtd></mml:mtr>
<mml:mtr>
<mml:mtd>
<mml:mtext>p</mml:mtext>
<mml:mo>=</mml:mo>
<mml:mtext>TDIST</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>abs</mml:mtext>
<mml:mo stretchy="false">(</mml:mo>
<mml:mtext>t</mml:mtext>
<mml:mo stretchy="false">)</mml:mo>
<mml:mo>,</mml:mo>
<mml:mi> </mml:mi>
<mml:msub>
<mml:mrow>
<mml:mtext>n</mml:mtext></mml:mrow>
<mml:mrow>
<mml:mtext>TS</mml:mtext></mml:mrow></mml:msub>
<mml:mrow>
<mml:mo>-</mml:mo></mml:mrow>
<mml:mn>1</mml:mn>
<mml:mo>,</mml:mo>
<mml:mn>1</mml:mn>
<mml:mo stretchy="false">)</mml:mo></mml:mtd></mml:mtr></mml:mtable></mml:semantics></mml:math></inline-formula> [<xref ref-type="bibr" rid="b67-ijms-13-05207">67</xref>]</td>
<td align="left" valign="middle">Evaluate if the mean of residual is statistically different by the expected value (0)</td></tr></tbody></table>
<table-wrap-foot><fn id="tfn11-ijms-13-05207">
<p>y<sub>i</sub> = observed ln(λ) for i<sup>th</sup> compound; ŷ<sub>i</sub> = estimated/predicted ln(λ) by model from <xref rid="FD1" ref-type="disp-formula">Equation(1)</xref>, respectively <xref rid="FD2" ref-type="disp-formula">Equation(2)</xref>; <italic>n</italic> = sample size; ȳ = arithmetic mean of the observed ln(λ); 
<inline-formula>
<mml:math id="mm13">
<mml:semantics id="sm13">
<mml:mrow>
<mml:mover accent="true">
<mml:mover accent="true">
<mml:mtext>y</mml:mtext>
<mml:mo>^</mml:mo></mml:mover>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:semantics></mml:math></inline-formula> = arithmetic mean of estimated/predicted ln(λ); ρ<sub>c</sub> = concordance correlation coefficient; TR = training set; TS = test set; 
<inline-formula>
<mml:math id="mm14">
<mml:semantics id="sm14">
<mml:mrow>
<mml:mover accent="true">
<mml:mrow>
<mml:mtext>res</mml:mtext></mml:mrow>
<mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:semantics></mml:math></inline-formula> = arithmetic mean of residuals; res = residuals; StDev = standard deviation; abs = absolute value.</p></fn></table-wrap-foot></table-wrap></sec></back></article>
