<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD Journal Publishing DTD v2.3 20070202//EN" "journalpublishing.dtd">
<article xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xml:lang="en" article-type="research-article">
  <front>
    <journal-meta>
      <journal-id journal-id-type="publisher-id">molecules</journal-id>
      <journal-title>Molecules</journal-title>
      <abbrev-journal-title abbrev-type="publisher">Molecules</abbrev-journal-title>
      <abbrev-journal-title abbrev-type="pubmed">Molecules</abbrev-journal-title>
      <issn pub-type="epub">1420-3049</issn>
      <publisher>
        <publisher-name>MDPI</publisher-name>
      </publisher>
    </journal-meta>
    <article-meta>
      <article-id pub-id-type="doi">10.3390/molecules17088982</article-id>
      <article-id pub-id-type="publisher-id">molecules-17-08982</article-id>
      <article-categories>
        <subj-group>
          <subject>Article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title>Prediction of Acute Mammalian Toxicity Using QSAR Methods: A Case Study of Sulfur Mustard and Its Breakdown Products</article-title>
      </title-group>
      
      <contrib-group>
        <contrib contrib-type="author">
          <name>
            <surname>Ruiz</surname>
            <given-names>Patricia</given-names>
          </name>
          <xref rid="af1-molecules-17-08982" ref-type="aff">1</xref>
          <xref rid="c1-molecules-17-08982" ref-type="corresp">*</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Begluitti</surname>
            <given-names>Gino</given-names>
          </name>
          <xref rid="af2-molecules-17-08982" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Tincher</surname>
            <given-names>Terry</given-names>
          </name>
          <xref rid="af2-molecules-17-08982" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Wheeler</surname>
            <given-names>John</given-names>
          </name>
          <xref rid="af1-molecules-17-08982" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <name>
            <surname>Mumtaz</surname>
            <given-names>Moiz</given-names>
          </name>
          <xref rid="af1-molecules-17-08982" ref-type="aff">1</xref>
        </contrib>
      </contrib-group>
      <aff id="af1-molecules-17-08982"><label>1 </label>Computational Toxicology and Methods Development Lab, Division of Toxicology and Human Health Sciences, Agency for Toxic Substances and Disease Registry, Atlanta, GA 30333, USA</aff>
      <aff id="af2-molecules-17-08982"><label>2 </label>Environmental Public Health Readiness Branch, National Center for Environmental Health, Atlanta, GA 30333, USA</aff>
      <author-notes>
        <corresp id="c1-molecules-17-08982"><label>*</label> Author  to whom correspondence should be addressed; Email: <email>pruiz@cdc.gov</email>; Tel.: +1-770-488-3348; Fax: +1-770-488-3470.</corresp>
      </author-notes>
      <pub-date pub-type="epub">
        <day>27</day>
        <month>07</month>
        <year>2012</year>
      </pub-date>
      <pub-date pub-type="collection"><month>08</month>
        <year>2012</year>
      </pub-date>
      <volume>17</volume>
      <issue>8</issue>
      <fpage>8982</fpage>
      <lpage>9001</lpage>
      <history>
        <date date-type="received">
          <day>06</day>
          <month>06</month>
          <year>2012</year>
        </date>
        <date date-type="rev-recd">
          <day>19</day>
          <month>07</month>
          <year>2012</year>
        </date>
        <date date-type="accepted">
          <day>23</day>
          <month>07</month>
          <year>2012</year>
        </date>
      </history>
      <permissions>
        <copyright-statement>©  2012 by the authors; licensee MDPI, Basel, Switzerland.</copyright-statement>
        <copyright-year>2012</copyright-year>
        <license xmlns:xlink="http://www.w3.org/1999/xlink" license-type="open-access" xlink:href="http://creativecommons.org/licenses/by/3.0/">
          <p>This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).</p>
        </license>
      </permissions>
      <abstract>
        <p>Predicting toxicity quantitatively, using Quantitative Structure Activity Relationships (QSAR), has matured over recent years to the point that the predictions can be used to help identify missing comparison values in a substance’s database. In this manuscript we investigate using the lethal dose that kills fifty percent of a test population (the LD<sub>50</sub>) for determining relative toxicity of a number of substances. In general, the smaller the LD<sub>50</sub> value, the more toxic the chemical, and the larger the LD<sub>50</sub> value, the lower the toxicity. When systemic toxicity and other specific toxicity data are unavailable for the chemical(s) of interest, during emergency responses, LD<sub>50</sub> values may be employed to determine the relative toxicity of a series of chemicals. In the present study, a group of chemical warfare agents and their breakdown products have been evaluated using four available rat oral QSAR LD<sub>50</sub> models<italic>. </italic>The QSAR analysis shows that the breakdown products of Sulfur Mustard (HD) are predicted to be less toxic than the parent compound as well as other known breakdown products that have known toxicities. The QSAR estimated break down products LD<sub>50</sub> values ranged from 299 mg/kg to 5,764 mg/kg. This evaluation allows for the ranking and toxicity estimation of compounds for which little toxicity information existed; thus leading to better risk decision making in the field. </p>
      </abstract>
      <kwd-group>
        <kwd>sulfur mustard</kwd>
        <kwd>QSAR</kwd>
        <kwd>SAR</kwd>
        <kwd>mammalian oral</kwd>
        <kwd>LD<sub>50</sub></kwd>
        <kwd>quantitative structure-activity relationship</kwd>
        <kwd><italic>in silico</italic> modeling</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec sec-type="intro">
      <title>1. Introduction</title>
      <p>From World War I to 1968, the United States produced chemical weapons and warfare agents as a deterrent against use of similar weapons by other countries. Sulfur mustard (HD, <xref ref-type="fig" rid="molecules-17-08982-f001">Figure 1</xref>), a vesicant and alkylating chemical, is one such agent [<xref ref-type="bibr" rid="B1-molecules-17-08982">1</xref>,<xref ref-type="bibr" rid="B2-molecules-17-08982">2</xref>] Although never used during warfare by the United States, these chemicals are still abundant and are now deteriorating with age. In 1985, the U.S. Congress mandated that the Department of Defense be responsible for establishing a Chemical and Biological Defense (CBD) program, U.S. Code Title 50, Sections 1521 through 153, and provide for chemical weapons disposal and destruction. In accordance with this congressional mandate the Center for Disease Control and Prevention’s (CDC), Environmental Public Health Readiness Branch (EPHRB), Chemical Weapons Elimination section has been tasked with overseeing the Army’s destruction of chemical weapons to ensure that the general population, worker population and environment are protected.</p>
      <fig id="molecules-17-08982-f001" position="anchor">
        <label>Figure 1</label>
        <caption>
          <p>Bis-(2-chloroethyl) sulfide.</p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g001.tif"/>
        
      </fig>
      <p>The Pueblo Chemical Agent Pilot Plant (PCAPP) plans to utilize neutralization processes to destroy the HD stored at the Pueblo Chemical Army Depot (PCAD). This process creates several breakdown products during and after the neutralization. The majority of these breakdown products do not have sufficient toxicity data to properly assess the human health impacts related to their exposure or to select appropriate personnel protective equipment (PPE) which will ensure the safety of the personnel. Thus, various stakeholders, viz., state, local, territorial, and tribal public health departments have stated concerns that only are the toxicity databases weak even indicators of overt mammalian toxicity like the lethal dose values that kill fifty percent of test population (LD<sub>50</sub>) are not available for such chemicals. The LD<sub>50</sub> value is the lethal dose of a substance that will kill 50% of the test animals/organisms within 24 hours of exposure to a chemical [<xref ref-type="bibr" rid="B3-molecules-17-08982">3</xref>,<xref ref-type="bibr" rid="B4-molecules-17-08982">4</xref>]. LD<sub>50</sub> values have been used to express the relative hazards associated with the acute toxicity of chemicals and are often used for the initial evaluation of toxicity. Computer-assisted Structure Activity Relationship (SAR) and Quantitative Structure Activity Relationship (QSAR) models, are being increasingly used to fill the data gaps in chemical (pharmaceutical, agrochemical, food additives and other industrial) toxicity databases. These <italic>in silico </italic>tools thus provide a means of assessing toxicity of chemicals that lack appropriate experimental test data [<xref ref-type="bibr" rid="B5-molecules-17-08982">5</xref>,<xref ref-type="bibr" rid="B6-molecules-17-08982">6</xref>,<xref ref-type="bibr" rid="B7-molecules-17-08982">7</xref>,<xref ref-type="bibr" rid="B8-molecules-17-08982">8</xref>,<xref ref-type="bibr" rid="B9-molecules-17-08982">9</xref>,<xref ref-type="bibr" rid="B10-molecules-17-08982">10</xref>,<xref ref-type="bibr" rid="B11-molecules-17-08982">11</xref>,<xref ref-type="bibr" rid="B12-molecules-17-08982">12</xref>,<xref ref-type="bibr" rid="B13-molecules-17-08982">13</xref>].</p>
      <p>Most <italic>in silico</italic> prediction systems used to estimate qualitative and quantitative toxicity are either SAR or QSAR models [<xref ref-type="bibr" rid="B14-molecules-17-08982">14</xref>,<xref ref-type="bibr" rid="B15-molecules-17-08982">15</xref>,<xref ref-type="bibr" rid="B16-molecules-17-08982">16</xref>,<xref ref-type="bibr" rid="B17-molecules-17-08982">17</xref>,<xref ref-type="bibr" rid="B18-molecules-17-08982">18</xref>]. An SAR model, and/or an expert system, establishes qualitative association between a chemical’s substructures and its potential toxicity [<xref ref-type="bibr" rid="B19-molecules-17-08982">19</xref>,<xref ref-type="bibr" rid="B20-molecules-17-08982">20</xref>]. The confidence in the predictions of a new chemical is based on whether such identified structural alerts are present in the structure of a chemical of interest. Because expert systems are based on qualitative SARs, they usually do not make a quantitative prediction of toxic effect; predictions are expressed in a binary fashion <italic>i.e.</italic>, toxic, indeterminate, or non-toxic. A QSAR model, on the other hand, is a mathematical relationship between the chemical’s quantitative molecular descriptors and its toxicological, biological, and physicochemical activities [<xref ref-type="bibr" rid="B9-molecules-17-08982">9</xref>,<xref ref-type="bibr" rid="B13-molecules-17-08982">13</xref>,<xref ref-type="bibr" rid="B21-molecules-17-08982">21</xref>,<xref ref-type="bibr" rid="B22-molecules-17-08982">22</xref>,<xref ref-type="bibr" rid="B23-molecules-17-08982">23</xref>]. Molecular descriptors derived from atomic or molecular properties that encode physicochemical, topological, and surface properties of molecules are typically used as the backbone of a predictive QSAR model [<xref ref-type="bibr" rid="B21-molecules-17-08982">21</xref>,<xref ref-type="bibr" rid="B24-molecules-17-08982">24</xref>,<xref ref-type="bibr" rid="B25-molecules-17-08982">25</xref>]. These descriptors are then correlated with a toxicological response of interest through a suitable statistical approach such as linear multiple regression, nearest neighbor, clustering, random forest, discriminant analysis, recursive partitioning, artificial neural networks, <italic>etc. </italic>[<xref ref-type="bibr" rid="B26-molecules-17-08982">26</xref>,<xref ref-type="bibr" rid="B27-molecules-17-08982">27</xref>,<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>,<xref ref-type="bibr" rid="B29-molecules-17-08982">29</xref>,<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>,<xref ref-type="bibr" rid="B31-molecules-17-08982">31</xref>,<xref ref-type="bibr" rid="B32-molecules-17-08982">32</xref>,<xref ref-type="bibr" rid="B33-molecules-17-08982">33</xref>,<xref ref-type="bibr" rid="B34-molecules-17-08982">34</xref>,<xref ref-type="bibr" rid="B35-molecules-17-08982">35</xref>]. Unlike an expert system, a QSAR model provides discrete quantitative predicted values given just the values of the molecular descriptors finally selected in the QSAR model. Choosing the appropriate SAR or QSAR model(s) is a challenge for federal agencies because several such models are available that predict a wide array of endpoints including the LD<sub>50</sub> values. Additionally, acute oral mammalian toxicity (LD<sub>50</sub>) is one of the more complex toxicological endpoints to predict using QSAR methods. Presence of a gamut of biological and molecular events that lead to variable biological mechanisms and a lack of accurate and true understanding of the mechanism of toxicity contribute to this complexity [<xref ref-type="bibr" rid="B8-molecules-17-08982">8</xref>,<xref ref-type="bibr" rid="B36-molecules-17-08982">36</xref>].</p>
    </sec>
    <sec sec-type="results">
      <title>2. Results</title>
      <p>A total of 22 chemicals including the parent chemical HD, and its potential breakdown products, experimentally tested or untested, were evaluated using TOPKAT [<xref ref-type="bibr" rid="B37-molecules-17-08982">37</xref>], ADMET Predictor [<xref ref-type="bibr" rid="B38-molecules-17-08982">38</xref>], and T.E.S.T. [<xref ref-type="bibr" rid="B39-molecules-17-08982">39</xref>] (<xref ref-type="table" rid="molecules-17-08982-t001">Table 1</xref>). Experimental rat oral LD<sub>50</sub> data for this study was available from two sources, ChemIDplus and Registry of Toxic Effects of Chemical Substances databases.</p>
      <table-wrap id="molecules-17-08982-t001" position="anchor">
        <object-id pub-id-type="pii">molecules-17-08982-t001_Table 1</object-id>
        <label>Table 1</label>
        <caption>
          <p>Chemical name, structures and experimental LD<sub>50</sub> of HD and its breakdown products.</p>
        </caption>
        <table rules="cols" style="border: solid thin">
          <thead>
            <tr>
              <th align="left" valign="middle">Chemical Name</th>
              <th align="center" valign="middle">Chemical Structure</th>
              <th align="center" valign="middle">Experimental LD<sub>50</sub> (mg/kg)</th>
            </tr>
          </thead>
          <tbody>
            <tr style="border-top: solid thin">
              <td align="left" valign="middle">Bis (2-chloroethyl) sulfide (HD)</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i001.tif"/></td>
              <td align="center" valign="middle">17</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Bis [2-(2-chloroethylthioethyl) ether] (T)</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i002.tif"/></td>
              <td align="center" valign="middle">na</td>
            </tr>
            <tr>
              <td align="left" valign="middle">1,2-bis(2-chloroethylthio)ethane (Q)</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i003.tif"/></td>
              <td align="center" valign="middle">na</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Bis (2-chloroethyl) disulfide </td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i004.tif"/></td>
              <td align="center" valign="middle">na</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Thiodiglycol (TDG)</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i005.tif"/></td>
              <td align="center" valign="middle">6,610</td>
            </tr>
            <tr>
              <td align="left" valign="top">Thiodiglycol sulfoxide</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i006.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">1,4-Dithiane</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i007.tif"/></td>
              <td align="center" valign="top">2,768</td>
            </tr>
            <tr>
              <td align="left" valign="top">1,4-Oxathiane</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i008.tif"/></td>
              <td align="center" valign="top">2,830</td>
            </tr>
            <tr>
              <td align="left" valign="top">Bis-[2-(2-hydroxyethylthioethyl)]ether (TOH)</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i009.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">Ethanol, 2,2'-[1,2-ethanediylbis(thio)]-bis- (QOH)</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i010.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">1-(2-hydroxyethylthio)-2-(2-vinylthioethoxy)ethane</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i011.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">1-(2-hydroxyethylthio)-2-(2-vinylthio)ethane</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i012.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">2-Hydroxyethyl vinyl sulfide</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i013.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">2-Methylnaphthalene</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i014.tif"/></td>
              <td align="center" valign="top">1,630</td>
            </tr>
            <tr>
              <td align="left" valign="top">Bis-(2-chloroethyl) ether</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i015.tif"/></td>
              <td align="center" valign="top">75</td>
            </tr>
            <tr>
              <td align="left" valign="top">Bis-(2-ethylexyl)phthalate</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i016.tif"/></td>
              <td align="center" valign="bottom">30,600</td>
            </tr>
            <tr>
              <td align="left" valign="top">Propanal</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i017.tif"/></td>
              <td align="center" valign="top">1,410</td>
            </tr>
            <tr>
              <td align="left" valign="top">Ethylene Glycol</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i018.tif"/></td>
              <td align="center" valign="top">4,700</td>
            </tr>
            <tr>
              <td align="left" valign="top">Ethylene dichloride</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i019.tif"/></td>
              <td align="center" valign="top">670</td>
            </tr>
            <tr>
              <td align="left" valign="top">Bis-(2-Chloroethyl) sulfoxide</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i020.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">Bis-(2-Chloroethyl) sulfone</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i021.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
            <tr>
              <td align="left" valign="top">Thiodiglycol sulfone</td>
              <td align="center" valign="middle"><inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i022.tif"/></td>
              <td align="center" valign="top">na</td>
            </tr>
          </tbody>
        </table>
        <table-wrap-foot>
        <fn>
        <p>na: not available.</p>
        </fn>
        </table-wrap-foot>
      </table-wrap>
      
      <sec>
        <title>2.1. TOPKAT Predictions of LD<sub>50</sub> Values</title>
        <p>Of the 22 chemicals queried, eight chemicals [thiodiglycol (TDG), 1,4-oxathiane, 2-methylnaphthalene, bis-(2-chloroethyl) ether, bis-(2-ethylhexyl)phthalate, propanal, ethylene glycol, and ethylene dichloride] were found in the database associated with the TOPKAT LD<sub>50</sub> model. Seven of the 22 chemicals were outside OPS of the model. Three of these seven were deemed acceptable estimates because the chemicals were outside the OPS in only one dimension. The remaining four were unacceptable estimates because they were outside the OPS in at least two dimensions.</p>
        <p>The 15 remaining chemicals were very well represented in the LD<sub>50</sub> model database as assessed by the univariate analysis. The multivariate analysis also showed that these chemicals were within the OPS of the model. The similarity analysis showed that there are several chemicals in the database that have a very close similarity distance. Hence, the confidence in the assessment of LD<sub>50</sub> values for these chemicals is high. The estimated LD<sub>50</sub> value of the break down products were within a factor of 4 (<xref ref-type="fig" rid="molecules-17-08982-f002">Figure 2</xref>) of the experimental values as shown in <xref ref-type="table" rid="molecules-17-08982-t002">Table 2</xref>. This is well within a default factor of 10 often used in traditional risk assessment of environmental chemicals to compensate for uncertainties [<xref ref-type="bibr" rid="B40-molecules-17-08982">40</xref>]. </p>
        <fig id="molecules-17-08982-f002" position="anchor">
          <label>Figure 2</label>
          <caption>
            <p>Comparison of Experimental pLD<sub>50</sub> values with TOPKAT QSAR pLD<sub>50</sub> model predictions.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g002.tif"/>
        </fig>
        <table-wrap id="molecules-17-08982-t002" position="anchor">
          <object-id pub-id-type="pii">molecules-17-08982-t002_Table 2</object-id>
          <label>Table 2</label>
          <caption>
            <p>Comparison of the estimated LD<sub>50</sub> values using TOPKAT, 2D and 3D ADMET predictor, and the T.E.S.T model for chemicals with available experimental data.</p>
          </caption>
          <table rules="cols" style="border: solid thin">
            <thead>
              <tr>
                <th align="left" valign="middle">Chemical Name</th>
                <th align="center" valign="middle">Experimental LD<sub>50</sub>*</th>
                <th align="center" valign="middle">TOPKAT</th>
                <th align="center" valign="middle">ADMET 2D</th>
                <th align="center" valign="middle">ADMET 3D</th>
                <th align="center" valign="middle">T.E.S.T Consensus</th>
              </tr>
            </thead>
            <tbody>
              <tr style="border-top: solid thin">
                <td align="left" valign="middle">Bis (2-chloroethyl) sulfide </td>
                <td align="center" valign="middle">17</td>
                <td align="center" valign="middle">125.9</td>
                <td align="center" valign="middle">536.84</td>
                <td align="center" valign="middle">349.32</td>
                <td align="center" valign="middle">38.69</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Thiodiglycol (TDG)</td>
                <td align="center" valign="middle">6,608.56</td>
                <td align="center" valign="middle">2,700</td>
                <td align="center" valign="middle">1,651.93</td>
                <td align="center" valign="middle">1,407.24</td>
                <td align="center" valign="middle">5,426.53</td>
              </tr>
              <tr>
                <td align="left" valign="middle">1,4-Dithiane</td>
                <td align="center" valign="middle">2,767.71</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">502.47</td>
                <td align="center" valign="middle">762.79</td>
                <td align="center" valign="middle">1,602.59</td>
              </tr>
              <tr>
                <td align="left" valign="middle">1,4-Oxathiane</td>
                <td align="center" valign="middle">2,830.26</td>
                <td align="center" valign="middle">1,900</td>
                <td align="center" valign="middle">791.98</td>
                <td align="center" valign="middle">1,185.75</td>
                <td align="center" valign="middle">1,834.75</td>
              </tr>
              <tr>
                <td align="left" valign="middle">2-Methylnaphthalene</td>
                <td align="center" valign="middle">1,629.3</td>
                <td align="center" valign="middle">1,400</td>
                <td align="center" valign="middle">1,884.9</td>
                <td align="center" valign="middle">1,864.48</td>
                <td align="center" valign="middle">1,391.74</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Bis-(2-chloroethyl) ether</td>
                <td align="center" valign="middle">75</td>
                <td align="center" valign="middle">100.8</td>
                <td align="center" valign="middle">215.35</td>
                <td align="center" valign="middle">130.57</td>
                <td align="center" valign="middle">44.72</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Bis-(2-ethylhexyl)phthalate</td>
                <td align="center" valign="middle">30,600</td>
                <td align="center" valign="middle">&gt;10,000</td>
                <td align="center" valign="middle">8,781.91</td>
                <td align="center" valign="middle">6,791.82</td>
                <td align="center" valign="middle">37,293.36</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Propanal</td>
                <td align="center" valign="middle">1,409.62</td>
                <td align="center" valign="middle">1,800</td>
                <td align="center" valign="middle">404.01</td>
                <td align="center" valign="middle">427.17</td>
                <td align="center" valign="middle">353.02</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Ethylene glycol</td>
                <td align="center" valign="middle">4,700</td>
                <td align="center" valign="middle">6,500</td>
                <td align="center" valign="middle">2,705.28</td>
                <td align="center" valign="middle">2,034.17</td>
                <td align="center" valign="middle">2,587.11</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Ethylene dichloride</td>
                <td align="center" valign="middle">670</td>
                <td align="center" valign="middle">537.5</td>
                <td align="center" valign="middle">298.37</td>
                <td align="center" valign="middle">297.55</td>
                <td align="center" valign="middle">811.49</td>
              </tr>
            </tbody>
          </table>
          <table-wrap-foot>
          <fn>
           <p>LD<sub>50</sub>* = mg/kg unit; OPS: outside of optimum prediction space.</p>
          </fn>
          </table-wrap-foot>
        </table-wrap>
      </sec>
      <sec>
        <title>2.2. ADMET Predictor LD<sub>50</sub> Values</title>
        <p>Of the 22 chemicals queried, 10 chemicals [bis(2-chloroethyl) sulfide, thiodiglycol, 1,4-dithiane, 1,4-oxathiane, 2-methylnaphthalene, bis-(2-chloroethyl) ether, bis-(2-ethylhexyl)phthalate, propanal, ethylene glycol and ethylene dichloride) were found in the database associated with the ADMET 2D and 3D <italic>Predictor </italic>LD<sub>50</sub> model as shown in <xref ref-type="table" rid="molecules-17-08982-t002">Table 2</xref>. All the chemicals were within the applicability domain of the models, and met the requirements of constraints, hence the confidence in all the predicted LD<sub>50</sub> values is high <xref ref-type="fig" rid="molecules-17-08982-f003">Figure 3</xref>a,b. The predicted LD<sub>50</sub> values of HD and its breakdown products were within a factor of five, much smaller than a default factor of 10 often used in chemical risk assessments to compensate for uncertainties (<xref ref-type="table" rid="molecules-17-08982-t002">Table 2</xref>).</p>
        <fig id="molecules-17-08982-f003" position="anchor">
          <label>Figure 3</label>
          <caption>
            <p>(<bold>a</bold>) Comparison of Experimental pLD50 values with 2D ADMET predictor QSAR pLD<sub>50</sub> model predictions; (<bold>b</bold>) Comparison of Experimental pLD50 values with 3D ADMET predictor QSAR pLD<sub>50</sub> model predictions.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g003.tif"/>
        </fig>
      </sec>
      <sec>
        <title>2.3. T.E.S.T. Predictions of LD<sub>50</sub> Values</title>
        <p>Of the 22 chemicals queried, seven chemicals [bis(2-chloroethyl) sulfide, 1,4-oxathiane, 2-methyl-naphthalene, bis-(2-chloroethyl) ether, ethylene glycol, propanal and ethylene dichloride) were found in the model database associated with the consensus T.E.S.T LD<sub>50</sub> model, and two (1,4-dithiane and thiodiglycol) were used in the external test set. All 22 chemicals were inside the applicability domain of the model. The 13 remaining chemicals were very well represented in the model database as assessed by the statistical analysis. The similarity analysis showed that there are several chemicals in the database that have very close similarity distance. Hence, the confidence in the assessment of LD50 values for these chemicals is high (<xref ref-type="fig" rid="molecules-17-08982-f004">Figure 4</xref>).</p>
        <fig id="molecules-17-08982-f004" position="anchor">
          <label>Figure 4</label>
          <caption>
            <p>Comparison of Experimental pLD50 values with T.E.S.T QSAR pLD<sub>50</sub> consensus model predictions.</p>
          </caption>
          <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g004.tif"/>
        </fig>
        <p>The estimated LD<sub>50</sub> value of HD and its breakdown products were within a factor of less than two (<xref ref-type="table" rid="molecules-17-08982-t002">Table 2</xref>). This is well within a default factor of 10 often used in risk assessment of environmental chemicals to compensate for uncertainties. The estimated LD<sub>50</sub> values for the breakdown products that lack experimental data are shown on <xref ref-type="table" rid="molecules-17-08982-t003">Table 3</xref>. Six chemicals [1,2-bis(2-chloroethylthio)ethane, bis-[2-(2-hydroxyethylthioethyl)]ether, ethanol, 2,2'-[1,2-ethanediylbis(thio)]-bis-, 1-(2-hydroxyethylthio)-2-(2-vinylthioethoxy)ethane, 1-(2-hydroxyethylthio)-2-(2-vinylthio)ethane, 2-hydroxyethyl vinyl sulfide) were found outside of the optimum prediction space for the TOPKAT model. However, ADMET Predictor and T.E.S.T. (consensus) LD<sub>50</sub> model could estimate the LD<sub>50</sub> values of these chemicals.</p>
        <table-wrap id="molecules-17-08982-t003" position="anchor">
          <object-id pub-id-type="pii">molecules-17-08982-t003_Table 3</object-id>
          <label>Table 3</label>
          <caption>
            <p>Comparison of the estimated LD<sub>50</sub> values using TOPKAT, 2D, 3D ADMET predictor and the T.E.S.T. model for chemicals that lack experimental data.</p>
          </caption>
          <table rules="cols" style="border: solid thin">
            <thead>
              <tr>
                <th align="left" valign="middle">Chemical Name</th>
                <th align="center" valign="middle">TOPKAT</th>
                <th align="center" valign="middle">ADMET 2D</th>
                <th align="center" valign="middle">ADMET 3D</th>
                <th align="center" valign="middle">T.E.S.T (consensus)</th>
              </tr>
            </thead>
            <tbody>
              <tr style="border-top: solid thin">
                <td align="left" valign="middle">Bis [2-(2-chloroethylthioethyl) ether] (T)</td>
                <td align="center" valign="middle">816.1</td>
                <td align="center" valign="middle">197.18</td>
                <td align="center" valign="middle">110.7</td>
                <td align="center" valign="middle">166.96</td>
              </tr>
              <tr>
                <td align="left" valign="middle">1,2-Bis(2-chloroethylthio)ethane (Q)</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">630.39</td>
                <td align="center" valign="middle">369.39</td>
                <td align="center" valign="middle">93.02</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Bis (2-chloroethyl) disulfide </td>
                <td align="center" valign="middle">153.6</td>
                <td align="center" valign="middle">512.3</td>
                <td align="center" valign="middle">443.37</td>
                <td align="center" valign="middle">351.90</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Thiodiglycol sulfoxide</td>
                <td align="center" valign="middle">1,400</td>
                <td align="center" valign="middle">3,524.73</td>
                <td align="center" valign="middle">2,912.39</td>
                <td align="center" valign="middle">4,094.75</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Bis-[2-(2-hydroxyethylthioethyl)]ether (TOH)</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">1,355.55</td>
                <td align="center" valign="middle">1,665.45</td>
                <td align="center" valign="middle">14,272.34</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Ethanol, 2,2'-[1,2-ethanediylbis(thio)]-bis- (QOH)</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">1,014.26</td>
                <td align="center" valign="middle">1,045.92</td>
                <td align="center" valign="middle">3,506.17</td>
              </tr>
              <tr>
                <td align="left" valign="middle">1-(2-Hydroxyethylthio)-2-(2-vinylthioethoxy)-ethane</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">574.96</td>
                <td align="center" valign="middle">649.36</td>
                <td align="center" valign="middle">2,730.21</td>
              </tr>
              <tr>
                <td align="left" valign="middle">1-(2-Hydroxyethylthio)-2-(2-vinylthio)ethane</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">391.58</td>
                <td align="center" valign="middle">399</td>
                <td align="center" valign="middle">1,339.93</td>
              </tr>
              <tr>
                <td align="left" valign="middle">2-Hydroxyethyl vinyl sulfide</td>
                <td align="center" valign="middle">OPS</td>
                <td align="center" valign="middle">474.79</td>
                <td align="center" valign="middle">448.1</td>
                <td align="center" valign="middle">979.11</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Bis-(2-chloroethyl) sulfoxide</td>
                <td align="center" valign="middle">67.9</td>
                <td align="center" valign="middle">449.01</td>
                <td align="center" valign="middle">265.64</td>
                <td align="center" valign="middle">577.05</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Bis-(2-chloroethyl) sulfone</td>
                <td align="center" valign="middle">227</td>
                <td align="center" valign="middle">462.07</td>
                <td align="center" valign="middle">260.21</td>
                <td align="center" valign="middle">249.89</td>
              </tr>
              <tr>
                <td align="left" valign="middle">Thiodiglycol sulfone</td>
                <td align="center" valign="middle">2,800</td>
                <td align="center" valign="middle">2,715.91</td>
                <td align="center" valign="middle">1,805.24</td>
                <td align="center" valign="middle">12,190.02</td>
              </tr>
            </tbody>
          </table>
          <table-wrap-foot>
          <fn>
          <p>OPS: outside of l optimum prediction space.</p>
          </fn>
          </table-wrap-foot>
        </table-wrap>
        
      </sec>
    </sec>
    <sec sec-type="discussion">
      <title>3. Discussion</title>
      <p>Sulfur mustard neutralization breakdown products are a diverse group of chemicals that either have no or minimal test data available regarding their toxicological effects [<xref ref-type="bibr" rid="B1-molecules-17-08982">1</xref>]. Some members of this group and their breakdown products even lack LD<sub>50</sub> values. Hence a 2-step <italic>in silico</italic> approach was adopted wherein a QSAR analysis was performed. In the first step we estimated LD<sub>50</sub> values using four different QSAR LD<sub>50</sub> models. Each one of the models is internally and externally validated, robust, and has good applicability domains. The LD<sub>50</sub> predicted values were compared with the available experimental values when available, and were found to be within acceptable ranges (10 fold) used in chemical risk assessment. Analyses were performed to evaluate the use of these chemicals in the training set used for the model development, if they were part of the external validation set, or they were not used in the model development. Through this process we could evaluate which of the chemicals in this assessment were used to develop each of the models, thus we can identify the applicability domain of each model.</p>
      <p>In the second step, using the above models, we determined the LD<sub>50</sub> values for breakdown products that lack experimental data (<xref ref-type="table" rid="molecules-17-08982-t003">Table 3</xref>). The key application of QSAR is to fill the data gaps for chemical toxicity. We achieved this through considerations of the strengths and weaknesses of each model, knowing that none of them is a perfect model. If the strength and advantage of a model is clearly known, the output of such a model can be used easily. TOPKAT model is well known and used by regulatory community, particularly federal agencies. The applicability domain of the model for this particular data set of chemicals was poor, <italic>i.e.</italic>, seven of the twenty two chemical were outside the optimum prediction space (OPS). On the contrary, ADMET predictor and T.E.S.T. had high chemical applicability domain for this data set. These models use both internal and external validated compared with TOPKAT that uses only one validation method. These models not only showed a high chemical applicability domain but also broad activity ranges in this applicability domain for this data set. However, the T.E.S.T model had superior performance compared to ADMET predictor.</p>
      <p>Taking an average of estimates when using multiple models could compensate the limitations of the individual models that use different descriptors and statistical methods to model different aspects of the toxicological affects. Thus, the use of a consensus model is always more beneficial than an individual.</p>
      <p>When using multiple models with varying modeling techniques (molecular descriptors, statistical methods and validation), it is a bigger challenge to judge the model output if their performances are comparable but slightly different [<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>,<xref ref-type="bibr" rid="B41-molecules-17-08982">41</xref>,<xref ref-type="bibr" rid="B42-molecules-17-08982">42</xref>,<xref ref-type="bibr" rid="B43-molecules-17-08982">43</xref>,<xref ref-type="bibr" rid="B44-molecules-17-08982">44</xref>]. In such cases, the conventional wisdom is to use an arithmetic averaging scheme which was also used in our study (<xref ref-type="table" rid="molecules-17-08982-t004">Table 4</xref>), or use the most conservative values estimated by the models. For example, as shown in <xref ref-type="table" rid="molecules-17-08982-t003">Table 3</xref>, the estimated LD<sub>50</sub> values of bis-[2-(2-hydroxyethylthioethyl)] ether (TOH) and thiodiglycol sulfone are less toxic by the T.E.S.T. model (much higher LD<sub>50</sub> values) compared to the other models, in such case, the T.ES.T model estimate could be excluded using a more safe approach.</p>
      <table-wrap id="molecules-17-08982-t004" position="anchor">
        <object-id pub-id-type="pii">molecules-17-08982-t004_Table 4</object-id>
        <label>Table 4</label>
        <caption>
          <p>Average estimated LD<sub>50</sub> values and the classification of HD breakdown products that lack of experimental LD<sub>50</sub> values.</p>
        </caption>
        <table rules="cols" style="border: solid thin">
          <thead>
            <tr>
              <th align="left" valign="middle">Chemical Name</th>
              <th align="center" valign="middle">QSAR estimated LD<sub>50 </sub>(average of TOPKAT, 2D, 3D ADMET predictor and T.E.S.T.)</th>
              <th align="center" valign="middle">Toxicity Class *</th>
            </tr>
          </thead>
          <tbody>
            <tr style="border-top: solid thin">
              <td align="left" valign="middle">Bis [2-(2-chloroethylthioethyl) ether] (T)</td>
              <td align="center" valign="middle">322.74</td>
              <td align="center" valign="middle">Very Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">1,2-Bis(2-chloroethylthio)ethane (Q)</td>
              <td align="center" valign="middle">364.27</td>
              <td align="center" valign="middle">Very Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Bis (2-chloroethyl) disulfide </td>
              <td align="center" valign="middle">365.29</td>
              <td align="center" valign="middle">Very Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Thiodiglycol sulfoxide</td>
              <td align="center" valign="middle">2,982.97</td>
              <td align="center" valign="middle">Moderately Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Bis-[2-(2-hydroxyethylthioethyl)]ether (TOH)</td>
              <td align="center" valign="middle">5,764.45</td>
              <td align="center" valign="middle">Slightly Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Ethanol, 2,2'-[1,2-ethanediylbis(thio)]-bis- (QOH)</td>
              <td align="center" valign="middle">1,855.45</td>
              <td align="center" valign="middle">Moderately Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">1-(2-hydroxyethylthio)-2-(2-vinylthioethoxy)-ethane</td>
              <td align="center" valign="middle">1,318.18</td>
              <td align="center" valign="middle">Moderately Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">1-(2-Hydroxyethylthio)-2-(2-vinylthio)ethane</td>
              <td align="center" valign="middle">710.17</td>
              <td align="center" valign="middle">Moderately Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">2-Hydroxyethyl vinyl sulfide</td>
              <td align="center" valign="middle">634.00</td>
              <td align="center" valign="middle">Moderately Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Bis-(2-Chloroethyl) sulfoxide</td>
              <td align="center" valign="middle">339.90</td>
              <td align="center" valign="middle">Very Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Bis-(2-Chloroethyl) sulfone</td>
              <td align="center" valign="middle">299.79</td>
              <td align="center" valign="middle">Very Toxic</td>
            </tr>
            <tr>
              <td align="left" valign="middle">Thiodiglycol sulfone</td>
              <td align="center" valign="middle">4,877.79</td>
              <td align="center" valign="middle">Moderately Toxic</td>
            </tr>
          </tbody>
        </table>
        <table-wrap-foot>
        <fn>
        <p>* Super toxic (&lt;5 mg/kg), extremely toxic (5–50 mg/kg), very toxic (50–500 mg/kg), moderately toxic (500–5,000 mg/kg), slightly toxic (5,000–15,000 mg/kg) and practically non-toxic (&gt;15,000 mg/kg).</p>
        </fn>
        </table-wrap-foot>
      </table-wrap>
      
      <p>With an increase in the number of QSAR models available, commercially or open source, that are developed using a variety of approaches, the final interpretation and application will depend on the user confidence and the transparency of the model [<xref ref-type="bibr" rid="B44-molecules-17-08982">44</xref>,<xref ref-type="bibr" rid="B45-molecules-17-08982">45</xref>,<xref ref-type="bibr" rid="B46-molecules-17-08982">46</xref>,<xref ref-type="bibr" rid="B47-molecules-17-08982">47</xref>] The LD<sub>50</sub> values, experimental or QSAR, are used to determine the relative toxicity of a series of chemicals in which the LD<sub>50</sub> value of a given chemical is compared with the LD<sub>50</sub> values of other chemicals [<xref ref-type="bibr" rid="B4-molecules-17-08982">4</xref>]. Using acceptable toxicity scales, the chemicals are assigned to various groups. One of the most common scales used is the Gosselin, Smith and Hodge scale. Super toxic (&lt;5 mg/kg), extremely toxic (5–50 mg/kg), very toxic (50–500 mg/kg), moderately toxic (500–5,000 mg/kg), slightly toxic (5,000–15,000 mg/kg) and practically non-toxic (&gt;15,000 mg/kg) [<xref ref-type="bibr" rid="B48-molecules-17-08982">48</xref>]. </p>
      <p>Using this scale, the QSAR estimates enabled the ranking of these chemicals on the basis of potential toxicities, in a rapid manner. The breakdown products of sulfur mustard (HD) hydrolysis reaction, thiodiglycol (TDG), 1,4-dithiane, 1,4-oxathiane and 2-hydroxyethyl vinyl sulfide (<xref ref-type="fig" rid="molecules-17-08982-f006">Scheme 1</xref>) were estimated to have moderate LD<sub>50</sub> values (&gt;500 mg/kg), when compared with HD, the parent compound.</p>
      <fig id="molecules-17-08982-f006" position="anchor">
        <object-id pub-id-type="pii">molecules-17-08982-scheme1_Scheme 1</object-id>
        <label>Scheme 1</label>
        <caption>
          <p>Hydrolysis of bis-(2-chloroethyl) sulfide.</p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g006.tif"/>
      </fig>
      <p>The dissolution of HD in presence of water leads to the formation of the intermediate sulfonium ion which in turn reacts with another molecule of HD to form 1,2-bis-(2-chloroethylthio) ethane (Q) and 1,2-dichloroethane (<xref ref-type="fig" rid="molecules-17-08982-f007">Scheme 2</xref>). The estimated LD<sub>50</sub> values of 1, 2-bis-(2-chloroethylthio) ethane is &lt;500 mg/kg which is very toxic, but less than HD.</p>
      <fig id="molecules-17-08982-f007" position="anchor">
        <object-id pub-id-type="pii">molecules-17-08982-scheme2_Scheme 2</object-id>
        <label>Scheme 2</label>
        <caption>
          <p>Dissolution of Bis-(2-Chloroethyl) sulfide.</p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g007.tif"/>
      </fig>
      <p>HD and its hydrolysis product TDG are oxidized to give the sulfoxide and sulfone analogs of HD and TDG (<xref ref-type="fig" rid="molecules-17-08982-f008">Scheme 3</xref>). The sulfoxide and sulfone analogs of HD are less toxic than HD; the estimated LD<sub>50</sub> values were between 339.90 and 299.79 mg/kg, respectively. TDG and its sulfoxide and sulfone analogs are slightly and moderately toxic, respectively, when compared to the extremely toxic parent compound, HD.</p>
      <p>The estimated LD<sub>50</sub> values of HD, O-mustard (T) and sesquimustard (Q) and their hydroxylated analogs TDG, TOH and QOH show that these analogs are much less toxic (Figure 8). However, when compared with their vinyl analogs, TDG, TOH and QOH are shown to have a higher potential toxicity, but less than HD, T and Q (<xref ref-type="fig" rid="molecules-17-08982-f005">Figure 5</xref>).</p>
      <fig id="molecules-17-08982-f008" position="anchor">
        <object-id pub-id-type="pii">molecules-17-08982-scheme3_Scheme 3</object-id>
        <label>Scheme 3</label>
        <caption>
          <p>Oxidation of Bis-(2-Chloroethyl) sulfide.</p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g008.tif"/>
      </fig>
      
      <fig id="molecules-17-08982-f005" position="anchor">
        <label>Figure 5</label>
        <caption>
          <p>Sulfur mustard (HD), O-mustard (T) and sesquimustard (Q), and their hydroxylated and vinyl analogs.</p>
        </caption>
        <graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-g005.tif"/>
      </fig>
      <p>The QSAR estimated LD<sub>50</sub> values for the breakdown products ranged from 299.79 mg/kg to 5,764.45 mg/kg. These data indicate that five breakdown products fall under very toxic class (50–500 mg/kg) while seven fall under moderately toxic (LD<sub>50</sub> values 0.5–5 g/kg) class. None of the chemicals fall under the extremely toxic or super toxic categories (<xref ref-type="table" rid="molecules-17-08982-t004">Table 4</xref>). Thus, the experimentally untested breakdown products are potentially less toxic than those chemicals that were experimentally tested (range from 17 mg/kg to 30,600 mg/kg). The experimentally tested chemicals including the parent chemical fall under the extremely toxic class (5–50 mg/kg) and a breakdown product falls under the very toxic (50–500 mg/kg) class. Six of the breakdown products fall under moderately toxic (LD<sub>50</sub> values 500–5,000 mg/kg), while one each falls under slightly toxic (5,000–15,000 mg/kg) and practically non toxic (&gt;15,000 mg/kg) class. None fall under super toxic class (less than 5 mg/Kg). Thus, overall the potential toxicity of QSAR estimated breakdown products is potentially lower than the parent compound. The QSAR LD<sub>50</sub> models estimates not only are within a factor of 10 of the experimental available data [except for bis(2-chloroethyl) sulfide], but also have shown an overall degree of conservatism, except for the HD, which was predicted as very toxic by TOPKAT, ADMET 2D and 3D models, compared to extremely toxic experimental value. This information will be useful to stakeholders involved in HD neutralization processes to assess risk associated with these breakdown products. </p>
      <p>Mammalian toxicity, particularly LD<sub>50</sub> values are much more challenging because the mechanism of acute lethal toxicity is complex, and is not fully understood because of complex interactions between the organism and pharmacokinetic and pharmaco-dynamics of the chemicals. However, a growing number of <italic>in silico</italic> tools and QSAR models continue to be developed because of demand for such estimates due to resource limitations and timely needs, apart from ethical reasons [<xref ref-type="bibr" rid="B49-molecules-17-08982">49</xref>]. Gaining knowledge from these kinds of activities should help <italic>in vitro</italic> to <italic>in vivo</italic> extrapolations, especially since large <italic>in vitro</italic> data generating efforts are ongoing at the National Academy of Science and the EPA [<xref ref-type="bibr" rid="B50-molecules-17-08982">50</xref>,<xref ref-type="bibr" rid="B51-molecules-17-08982">51</xref>,<xref ref-type="bibr" rid="B52-molecules-17-08982">52</xref>]. Through these efforts, meaningful structural features can be identified that will lead to the development of robust predictive models, and specific selection criteria for use of such models. A thoughtful application and prudent use of such models can help estimate the toxicity of chemicals that lack experimental data, and prioritize chemicals for screening and subsequent toxicity testing while saving cost and time, thus minimizing experimental animal testing while optimizing overall use of resources. Recent laws are pushing the acceptance of these methods and their use by the regulatory and public health communities in the mitigation of potential hazardous exposures that could compromise the quality of human health and environment [<xref ref-type="bibr" rid="B53-molecules-17-08982">53</xref>,<xref ref-type="bibr" rid="B54-molecules-17-08982">54</xref>,<xref ref-type="bibr" rid="B55-molecules-17-08982">55</xref>]. <italic>In silico</italic> tools can play a pivotal role for assessing the health risk of environmental and pharmaceutical chemicals, especially when the only characteristic known is the structure of a chemical.</p>
    </sec>
    <sec>
      <title>4. Materials and Methods</title>
      <p>The chemical and biological defense program of the Department of Defense uses multiple chemical reactions such as dissolution, hydrolysis, oxidation, and neutralization (see discussion section) to dispose, decontaminate and destroy chemicals weapons such as sulfur mustard (HD) that have been stored for a long time with varies type of stabilizers.</p>
      <p>For the present work, we used the following three QSAR modeling packages to estimate the LD<sub>50</sub> values of HD [bis-(2-chloroethyl) sulfide (<xref ref-type="fig" rid="molecules-17-08982-f001">Figure 1</xref>)] and its breakdown products: TOPKAT<sup>®</sup> (Toxicity Prediction by Komputer Assisted Technology, Version 6.2, Accelrys, Inc.) and ADMET Predictor 5.0 (Simulation Plus Inc.) and Toxicity Estimation Software Tool (T.E.S.T. US EPA) [<xref ref-type="bibr" rid="B37-molecules-17-08982">37</xref>,<xref ref-type="bibr" rid="B38-molecules-17-08982">38</xref>,<xref ref-type="bibr" rid="B39-molecules-17-08982">39</xref>].</p>
      <sec>
        <title>4.1. Toxicity Prediction by Komputer Assisted Technology (TOPKAT)</title>
        <p>QSAR software package TOPKAT<sup>®</sup> 6.2, is a tool for structure-based activity assessment, which correlates activity with structural attributes (descriptors). The software encompasses a set of QSAR models, each concerning a different type of activity. TOPKAT QSAR models have been used for the estimation of potential toxicity such as carcinogenicity, mutagenicity, developmental toxicity, LD<sub>50</sub>, LOAELs and skin sensitization. The TOPKAT QSARs models utilize a computer-based method to assess the activity of a chemical solely based on its molecular attributes [<xref ref-type="bibr" rid="B37-molecules-17-08982">37</xref>].</p>
        <sec>
          <title>TOPKAT Rat Oral LD<sub>50</sub> Model</title>
          <p>The LD<sub>50</sub> QSAR model of the TOPKAT package comprises 19 statistically significant and cross-validated QSAR sub-models, and the data from which these models are derived. These models are derived from experimental LD<sub>50</sub> values of approximately 4,000 chemicals from open literature. Each QSAR model assesses the rat oral acute median lethal dose (LD<sub>50</sub>) for a specific class of chemicals. Molecular structure is the only input required to conduct an LD<sub>50</sub> assessment. This model automatically determines whether the submitted structure belongs in the Optimum Prediction Space (OPS) of the model, and (ii) computes QSAR similarity distance from chemicals with experimental LD<sub>50</sub> data in order to evaluate the reliability of the QSAR-based assessment [<xref ref-type="bibr" rid="B56-molecules-17-08982">56</xref>,<xref ref-type="bibr" rid="B57-molecules-17-08982">57</xref>].</p>
        </sec>
      </sec>
      <sec>
        <title>4.2. ADMET Predictor Rat Oral LD<sub>50</sub> Model</title>
        <p>ADMET Predictor™ is a state-of-the-art computer program designed to estimate certain ADMET (Absorption, Distribution, Metabolism, Elimination, and Toxicity) properties of a chemical from its 2 and 3 dimensional (2D and 3D) molecular structures (Simulations Plus Inc.). The program uses molecular descriptor values as inputs to independent mathematical models (generally, nonlinear machine learning techniques) in order to generate estimates for each of the ADMET properties. Qualitative and quantitative models of several measures of toxicity are estimated by ADMET Predictor including maximum recommended therapeutic dose, fathead minnow lethal toxicity, <italic>Daphnia magna</italic> lethal toxicity, acute rat toxicity, Ames mutagenicity in <italic>Salmonella typhimurium,</italic> carcinogenicity in rats, <italic>etc. </italic>The rat oral LD<sub>50</sub> model is supported by data from two sources, CDC’s Registry of Toxic Effects of Chemical Substances (RTECS), and the ChemIDplus database. The LD<sub>50</sub> data was converted to the negative logarithm of LD<sub>50</sub> (p<italic>LD<sub>50</sub></italic>) for the model development, 7150 unique identifiable compounds were selected and used. Two models are available that are based on 2 and 3 dimensional (2D and 3D) descriptors of the chemical structure. Greater than (or equal to) 20% of the data were set aside for the external test sets prior to training the models. It is noteworthy that the models had complete coverage and were able to make predictions having root mean square error (RMSE) of approximately 0.63 log units for both 2D and 3D test sets. </p>
      </sec>
      <sec>
        <title>4.3. Toxicity Estimation Software Tool (T.E.S.T.)</title>
        <p>T.E.S.T. estimates toxicity using a variety of QSAR methodologies, such as hierarchical clustering, the Food and Drug Administration (FDA) MDL, nearest neighbor, and a consensus model which is simply the average of the predicted toxicities from other QSAR methodologies (taking into account the applicability domain of each method) [<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>]. The required descriptors are calculated without requiring any external programs. The structure of a chemical can be simply entered through the use of multiple tools including a chemical sketcher window, a text file containing SMILES notations, or importing it from a database of structures. After entering the structure, a chemical’s toxicity can be estimated using one of several advanced methodologies. T.E.S.T. version 4.0 contains LD<sub>50</sub> values from 7,420 chemicals [<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>].</p>
        <sec>
          <title>4.3.1. The Hierarchical Clustering Method</title>
          <p>The hierarchical clustering method utilizes a variation of the Ward’s Minimum Variance Clustering Method [<xref ref-type="bibr" rid="B58-molecules-17-08982">58</xref>] to produce a series of clusters from the initial training set. For a training set of <italic>n</italic> chemicals, initially there will be <italic>n</italic> clusters formed. At each step in the clustering process, two clusters are combined so that the increase in variance over all of the clusters in the system is minimized. The change in variance caused by combining clusters <italic>j</italic> and <italic>k</italic> is as follows:</p>
          <disp-formula id="molecules-17-08982-i023">
		<inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i023.tif"/>
		<label>(1)</label>
		</disp-formula>

           <p>where <italic>n<sub>j</sub></italic> = number of chemicals in cluster <italic>j</italic>, <italic>C<sub>j</sub></italic><sub>,<italic>i</italic></sub> is the centroid (or average value) for descriptor <italic>i</italic> for cluster <italic>j</italic>, and <italic>d</italic> is the number of descriptors (~800) [<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>]. The process of combining clusters while minimizing variance continues until all of the chemicals are lumped into a single cluster. After the clustering is complete, each cluster is analyzed to determine if an acceptable QSAR model can be developed. A genetic algorithm technique is used to select descriptors to build a multilinear regression model for each cluster [<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>], and each model must achieve a leave-one-out cross-validation (LOO-CV) accuracy of 0.5 to be used in making predictions. The predicted value for a given test chemical is calculated using the equally weighted average of the model predictions from the closest cluster from each step in the hierarchical clustering. </p>
        </sec>
        <sec>
          <title>4.3.2. The FDA MDL QSAR Method</title>
          <p>The FDA MDL method is based on the work of Contrera and coworkers [<xref ref-type="bibr" rid="B59-molecules-17-08982">59</xref>]. In this method, predictions for each test chemical are made using a unique cluster (constructed at runtime) which contains structurally similar chemicals selected from the overall training set. This is in contrast to the Hierarchical method, where the predictions are made using one or more clusters that are constructed <italic>a priori</italic> using Ward’s method. For each test chemical, a cluster is constructed using the 30 most similar chemicals from the training set as defined by the cosine similarity coefficient, <italic>SC<sub>i</sub></italic><sub>,<italic>k</italic></sub>, which is calculated as follows: </p>
          <disp-formula id="molecules-17-08982-i024">
		<inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i024.tif"/>
		<label>(2)</label>
		</disp-formula>
          <p>where <italic>x<sub>ij</sub></italic> is the value of the <italic>j</italic>-th normalized descriptor for chemical <italic>i</italic> (normalized with respect to all of the chemicals in the original training set) and <italic>x<sub>kj</sub></italic> is the value of the <italic>j</italic>-th descriptor for chemical <italic>k</italic>. The entire pool of approximately 800 descriptors is used to calculate the similarity coefficient in equation 2. A multiple linear regression model is then built for the new cluster using a genetic algorithm-based method, and the toxicity is predicted [<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>].</p>
        </sec>
        <sec>
          <title>4.3.3. The Nearest Neighbor Method</title>
          <p>The nearest neighbor method is a simplification of the variable selection <italic>k</italic>NN approach. In the nearest neighbor method, the toxicity is simply predicted as the average of the toxicity of the three most similar chemicals from the training set. The similarity is defined in terms of the cosine similarity coefficient (Equation 2). In this method, the entire available descriptor pool is used to characterize molecular similarity (as opposed to a subset of the descriptor pool as in the descriptor selection <italic>k</italic>NN method). To make a prediction, each of the neighbors in the training set must exceed a minimum cosine similarity coefficient of 0.5 [<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>]. </p>
        </sec>
      </sec>
      <sec>
        <title>4.4. Applicability Domains (ADs)</title>
        <p>Every QSAR model can predict the potential toxicity of any chemical but the confidence in such predictions can vary. However, because each model is developed using a training set of chemicals that cover only a small fraction of the entire chemical universe its prediction capability is restricted to its applicability domain (AD), <italic>i.e.</italic>, its descriptor space. As a consequence of this only a certain fraction of chemicals of an external data set can be reasonably predicted. So it is important to determine if a chemical of interest falls within the AD of a model. If it falls outside the AD, varying degrees of uncertainties could be associated with such a prediction. Model ADs, their characteristics and limitations need to be understood thoroughly for the appropriate interpretation of results [<xref ref-type="bibr" rid="B29-molecules-17-08982">29</xref>,<xref ref-type="bibr" rid="B31-molecules-17-08982">31</xref>,<xref ref-type="bibr" rid="B32-molecules-17-08982">32</xref>,<xref ref-type="bibr" rid="B60-molecules-17-08982">60</xref>,<xref ref-type="bibr" rid="B61-molecules-17-08982">61</xref>,<xref ref-type="bibr" rid="B62-molecules-17-08982">62</xref>,<xref ref-type="bibr" rid="B63-molecules-17-08982">63</xref>]. Described below are the AD restrictions or requirements of the models we used in this study.</p>
        <sec>
          <title>4.4.1. AD of the TOPKAT Model</title>
          <p>In the TOPKAT model two types of analysis, the univariate analysis and multivariate analysis, are automatically performed to determine if a chemical is within the AD of a model. The univariate analysis or coverage examination checks whether all of the structural fragments of the query structure are covered by the chemical database associated with the model. The multivariate analysis or Optimum Prediction Space (OPS) examination checks to see whether the submitted structure fits within or near the periphery of OPS of the equation. If the query compound is deemed outside the OPS a warning about acceptability of the assessment is issued [<xref ref-type="bibr" rid="B37-molecules-17-08982">37</xref>]. The details of the assessment process using the TOPKAT software have been published previously [<xref ref-type="bibr" rid="B13-molecules-17-08982">13</xref>,<xref ref-type="bibr" rid="B37-molecules-17-08982">37</xref>,<xref ref-type="bibr" rid="B56-molecules-17-08982">56</xref>,<xref ref-type="bibr" rid="B57-molecules-17-08982">57</xref>]. Another feature of TOPKAT, the “QSAR similarity” analysis, enhances the confidence in a prediction [<xref ref-type="bibr" rid="B13-molecules-17-08982">13</xref>,<xref ref-type="bibr" rid="B57-molecules-17-08982">57</xref>,<xref ref-type="bibr" rid="B64-molecules-17-08982">64</xref>]. To assign high, moderate, or low confidence in a prediction the nearest 4 neighbors in the data base with a similarity distance of &lt;0.25 are considered [<xref ref-type="bibr" rid="B13-molecules-17-08982">13</xref>].</p>
        </sec>
        <sec>
          <title>4.4.2. AD of the ADMET Predictor Model</title>
          <p>The ADMET Predictor automatically determines whether a given compound is within the AD of the model. The AD is defined in terms of molecular descriptor space, not in terms of the relative value of an ADMET property. Let <italic>X</italic> denote an ADMET model with known training set and let <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i025.tif"/> be a descriptor set of model <italic>X</italic>. For each descriptor, <italic>x<sub>i</sub></italic>(<italic>i</italic>= 1,…, <italic>N</italic>), its minimum and maximum values, <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i026.tif"/> and <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i027.tif"/>, are determined over the training set of <italic>X</italic>. A new compound C is said to be within the scope of <italic>X</italic> if the value of each relevant descriptor <italic>c<sub>i</sub></italic>(<italic>i</italic>= 1,…, <italic>N</italic>) calculated for C is contained within the corresponding interval <inline-graphic xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="molecules-17-08982-i028.tif"/> with tolerance equal to 10% of the interval length. Such a compound has its <italic>X</italic> value typed in black bold font. Otherwise, the compound in question is outside the scope of <italic>X</italic> and its <italic>X</italic> value is marked by magenta font (Simulations Plus Inc.).</p>
        </sec>
        <sec>
          <title>4.4.3. AD of the Hierarchical Clustering Method</title>
          <p>The first restriction, the model ellipsoid constraint, checks if the test chemical is within the multidimensional ellipsoid defined by the ranges of descriptor values for the chemicals in its database. It is satisfied if the leverage of the test chemical (<italic>h</italic><sub>00</sub>) is less than the maximum leverage value for all of the chemicals used in the model. The second restriction, the <italic>R</italic><sub>max</sub> constraint, checks if the distance from the test chemical to the centroid of the database is less than the maximum distance for any chemical in the database to the database centroid. The final constraint, the fragment constraint, stipulates that the chemicals in the database must contain at least one example of each of the fragments that are present in the test chemical [<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>,<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>].</p>
        </sec>
        <sec>
          <title>4.4.4. AD of the FDA MDL QSAR Method</title>
          <p>For the prediction to be valid, three restrictions must be met. The first two, the model ellipsoid and fragment, are same as described above. Third restriction is that the predicted toxicity value must be within the range of experimental toxicity values for the chemicals in the database used to build the model [<xref ref-type="bibr" rid="B28-molecules-17-08982">28</xref>,<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>].</p>
        </sec>
        <sec>
          <title>4.4.5. AD of the Nearest Neighbor Method</title>
          <p>Predictions using this model require at least three chemicals in the training set that are sufficiently similar to the test chemical. That is, the similarity coefficient between each of the three chemicals and the test chemical in equation 1 must exceed 0.5 [<xref ref-type="bibr" rid="B30-molecules-17-08982">30</xref>].</p>
        </sec>
      </sec>
    </sec>
    <sec sec-type="conclusions">
      <title>5. Conclusions</title>
      <p>This study has integrated commercial and open source robust QSAR LD<sub>50</sub> models to develop an approach for evaluation of the toxicity of chemicals that lack experimental toxicity data. This project was undertaken to determine the potential toxicity of sulfur mustard neutralization breakdown products toxicity using QSAR analysis since there is no experimental toxicity data available for such breakdown products. The QSAR estimates obtained show that the breakdown products will have less potential toxicity than some of the better known breakdown products of sulfur mustard. This evaluation can provide stakeholders with the potential toxicity values to make risk based decisions related to these breakdown products. </p>
    </sec>
   
  </body>
  <back>
    <ack>
      <title>Acknowledgments</title>
      <p>The findings and conclusions in this report are those of the author(s) and do not necessarily represent the views of the Centers for Disease Control and Prevention/the Agency for Toxic Substances and Disease Registry.</p>
    </ack>
    <notes>
      <title>Conflict of Interest</title>
      <p>The authors declare no conflict of interest.</p>
    </notes>
    <ref-list>
      <title>References</title>
      <ref id="B1-molecules-17-08982">
        <label>1.</label>
        <citation citation-type="gov">
         <collab>Agency for Toxic Substances and Disease Registry (ATSDR)</collab>
           <source>Toxicological Profile for Sulfur Mustard</source>
          <publisher-name>ATSDR</publisher-name>
          <publisher-loc>Atlanta, GA, USA</publisher-loc>
          <year>2003</year>
        </citation>
      </ref>
      <ref id="B2-molecules-17-08982">
        <label>2.</label>
        <citation citation-type="gov">
          <collab>International Agency for Research on Cancer (IARC)</collab>
           
          <source>Monographs</source>
          <publisher-name>IARC</publisher-name>
          <publisher-loc>Lyon, France</publisher-loc>
          <year>1987</year>
          <fpage>403</fpage>
          <lpage>405</lpage>
        </citation>
      </ref>
      <ref id="B3-molecules-17-08982">
        <label>3.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Lu</surname>
              <given-names>F.</given-names>
            </name>
            <name>
              <surname>Kacew</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>Basic toxicology</article-title>
          <source>Fundamentals, Target Organs, and Risk Assessment</source>
          <publisher-name>Taylor &amp; Francis</publisher-name>
          <publisher-loc>New York, NY, USA</publisher-loc>
          <year>2002</year>
        </citation>
      </ref>
      <ref id="B4-molecules-17-08982">
        <label>4.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Balls</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Why modification of the LD50 test will not be enough</article-title>
          <source>Lab. Anim.</source>
          <year>1991</year>
          <volume>25</volume>
          <fpage>198</fpage>
          <lpage>206</lpage>
          <pub-id pub-id-type="doi">10.1258/002367791780808310</pub-id>
        </citation>
      </ref>
      <ref id="B5-molecules-17-08982">
        <label>5.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fiedler</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Hutzinger</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Giesy</surname>
              <given-names>J.P.</given-names>
            </name>
          </person-group>
          <article-title>Utility of the QSAR modeling system for predicting the toxicity of substances on the European inventory of existing commercial chemicals</article-title>
          <source>Toxicol. Environ. Chem.</source>
          <year>1990</year>
          <volume>28</volume>
          <fpage>167</fpage>
          <lpage>188</lpage>
          <pub-id pub-id-type="doi">10.1080/02772249009357600</pub-id>
        </citation>
      </ref>
      <ref id="B6-molecules-17-08982">
        <label>6.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cronin</surname>
              <given-names>M.T.</given-names>
            </name>
            <name>
              <surname>Dearden</surname>
              <given-names>J.C.</given-names>
            </name>
          </person-group>
          <article-title>QSAR in toxicology. 2. Prediction of acute mammalian toxicity and interspecies correlations</article-title>
          <source>QSAR</source>
          <year>1995</year>
          <volume>14</volume>
          <fpage>117</fpage>
          <lpage>120</lpage>
        </citation>
      </ref>
      <ref id="B7-molecules-17-08982">
        <label>7.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Greene</surname>
              <given-names>N.</given-names>
            </name>
          </person-group>
          <article-title>Computer systems for the prediction of toxicity: An update</article-title>
          <source>Adv. Drug Deliv. Rev.</source>
          <year>2002</year>
          <volume>54</volume>
          <fpage>417</fpage>
          <lpage>431</lpage>
          <pub-id pub-id-type="doi">10.1016/S0169-409X(02)00012-1</pub-id>
        </citation>
      </ref>
      <ref id="B8-molecules-17-08982">
        <label>8.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tsakovska</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Lessigiarska</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Netzeva</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Worth</surname>
              <given-names>A.P.</given-names>
            </name>
          </person-group>
          <article-title>A mini review of mammalian toxicity (Q)SAR models</article-title>
          <source>QSAR Comb. Sci.</source>
          <year>2008</year>
          <volume>27</volume>
          <fpage>41</fpage>
          <lpage>48</lpage>
          <pub-id pub-id-type="doi">10.1002/qsar.200710107</pub-id>
        </citation>
      </ref>
      <ref id="B9-molecules-17-08982">
        <label>9.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Demchuk</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Ruiz</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Chou</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Fowler</surname>
              <given-names>B.A.</given-names>
            </name>
          </person-group>
          <article-title>SAR/QSAR methods in public health practice</article-title>
          <source>Toxicol. Appl. Pharmacol.</source>
          <year>2011</year>
          <volume>254</volume>
          <fpage>192</fpage>
          <lpage>197</lpage>
          <pub-id pub-id-type="doi">10.1016/j.taap.2010.10.017</pub-id>
        </citation>
      </ref>
      <ref id="B10-molecules-17-08982">
        <label>10.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schultz</surname>
              <given-names>T.W.</given-names>
            </name>
            <name>
              <surname>Seward</surname>
              <given-names>J.R.</given-names>
            </name>
          </person-group>
          <article-title>Health-effects related structure-toxicity relationships: A paradigm for the first decade of the new millennium</article-title>
          <source>Sci. Total Environ.</source>
          <year>2000</year>
          <volume>249</volume>
          <fpage>73</fpage>
          <lpage>84</lpage>
          <pub-id pub-id-type="doi">10.1016/S0048-9697(99)00512-4</pub-id>
        </citation>
      </ref>
      <ref id="B11-molecules-17-08982">
        <label>11.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Simon-Hettich</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Rothfuss</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Steger-Hartmann</surname>
              <given-names>T.</given-names>
            </name>
          </person-group>
          <article-title>Use of computer-assisted prediction of toxic effects of chemical substances</article-title>
          <source>Toxicology</source>
          <year>2006</year>
          <volume>224</volume>
          <fpage>156</fpage>
          <lpage>162</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tox.2006.04.032</pub-id>
        </citation>
      </ref>
      <ref id="B12-molecules-17-08982">
        <label>12.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Jaworska</surname>
              <given-names>J.S.</given-names>
            </name>
            <name>
              <surname>Comber</surname>
              <given-names>M.A.C.</given-names>
            </name>
            <name>
              <surname>van Leeuwen</surname>
              <given-names>C.J.</given-names>
            </name>
          </person-group>
          <article-title>Summary of a workshop on regulatory acceptance of (Q)SARs for human health and environmental endpoints</article-title>
          <source>Environ. Health Perspect.</source>
          <year>2003</year>
          <volume>111</volume>
          <fpage>1358</fpage>
          <lpage>1360</lpage>
          <pub-id pub-id-type="doi">10.1289/ehp.5757</pub-id>
        </citation>
      </ref>
      <ref id="B13-molecules-17-08982">
        <label>13.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ruiz</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Mumtaz</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Gombar</surname>
              <given-names>V.</given-names>
            </name>
          </person-group>
          <article-title>Assessing the toxic effects of ethylene glycol ethers using Quantitative Structure Toxicity Relationship models</article-title>
          <source>Toxicol. Appl. Pharmacol.</source>
          <year>2011</year>
          <volume>254</volume>
          <fpage>198</fpage>
          <lpage>205</lpage>
          <pub-id pub-id-type="doi">10.1016/j.taap.2010.10.024</pub-id>
        </citation>
      </ref>
      <ref id="B14-molecules-17-08982">
        <label>14.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gombar</surname>
              <given-names>V.K.</given-names>
            </name>
          </person-group>
          <article-title>Quantiative structure-activity relationships in toxicology: From fundamentals to applications</article-title>
          <source>Adv. Mol. Toxicol.</source>
          <year>1997</year>
          <fpage>125</fpage>
          <lpage>139</lpage>
        </citation>
      </ref>
      <ref id="B15-molecules-17-08982">
        <label>15.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Klopman</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Fuller</surname>
              <given-names>M.A.</given-names>
            </name>
            <name>
              <surname>Saiakhov</surname>
              <given-names>R.D.</given-names>
            </name>
          </person-group>
          <article-title>Searching for an enhanced predictive tool for mutagenicity</article-title>
          <source>SAR QSAR Environ. Res.</source>
          <year>2004</year>
          <volume>15</volume>
          <fpage>251</fpage>
          <lpage>263</lpage>
          <pub-id pub-id-type="doi">10.1080/10629360410001724897</pub-id>
        </citation>
      </ref>
      <ref id="B16-molecules-17-08982">
        <label>16.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Richard</surname>
              <given-names>A.M.</given-names>
            </name>
            <name>
              <surname>Benigni</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>AI and SAR approaches for predicting chemical carcinogenicity: Survey and status report</article-title>
          <source>SAR QSAR Environ. Res.</source>
          <year>2002</year>
          <volume>13</volume>
          <fpage>1</fpage>
          <lpage>19</lpage>
          <pub-id pub-id-type="doi">10.1080/10629360290002055</pub-id>
        </citation>
      </ref>
      <ref id="B17-molecules-17-08982">
        <label>17.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Enslein</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>QSTR applications in acute, chronic, and developmental toxicity, and carcinogenicity</article-title>
          <source>Adv. Mol. Toxicol.</source>
          <year>1998</year>
          <fpage>141</fpage>
          <lpage>164</lpage>
        </citation>
      </ref>
      <ref id="B18-molecules-17-08982">
        <label>18.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Cronin</surname>
              <given-names>M.T.</given-names>
            </name>
            <name>
              <surname>Walker</surname>
              <given-names>J.D.</given-names>
            </name>
            <name>
              <surname>Jaworska</surname>
              <given-names>J.S.</given-names>
            </name>
            <name>
              <surname>Comber</surname>
              <given-names>M.H.</given-names>
            </name>
            <name>
              <surname>Watts</surname>
              <given-names>C.D.</given-names>
            </name>
            <name>
              <surname>Worth</surname>
              <given-names>A.P.</given-names>
            </name>
          </person-group>
          <article-title>Use of QSARs in international decision-making frameworks to predict ecologic effects and environmental fate of chemical substances</article-title>
          <source>Environ. Health Perspect.</source>
          <year>2003</year>
          <volume>111</volume>
          <fpage>1376</fpage>
          <lpage>1390</lpage>
          <pub-id pub-id-type="doi">10.1289/ehp.5759</pub-id>
        </citation>
      </ref>
      <ref id="B19-molecules-17-08982">
        <label>19.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>McKinney</surname>
              <given-names>J.D.</given-names>
            </name>
            <name>
              <surname>Richard</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Waller</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Newman</surname>
              <given-names>M.C.</given-names>
            </name>
            <name>
              <surname>Gerberick</surname>
              <given-names>F.</given-names>
            </name>
          </person-group>
          <article-title>The Practice of Structure Activity Relationships (SAR) in Toxicology</article-title>
          <source>Toxicol. Sci.</source>
          <year>2000</year>
          <volume>56</volume>
          <fpage>8</fpage>
          <lpage>17</lpage>
          <pub-id pub-id-type="doi">10.1093/toxsci/56.1.8</pub-id>
        </citation>
      </ref>
      <ref id="B20-molecules-17-08982">
        <label>20.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Richard</surname>
              <given-names>A.M.</given-names>
            </name>
          </person-group>
          <article-title>Commercial toxicology prediction systems: A regulatory perspective</article-title>
          <source>Toxicol. Lett.</source>
          <year>1998</year>
          <volume>102–103</volume>
          <fpage>611</fpage>
          <lpage>616</lpage>
          <pub-id pub-id-type="doi">10.1016/S0378-4274(98)00257-4</pub-id>
        </citation>
      </ref>
      <ref id="B21-molecules-17-08982">
        <label>21.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gombar</surname>
              <given-names>V.K.</given-names>
            </name>
            <name>
              <surname>Jain</surname>
              <given-names>D.V.S.</given-names>
            </name>
          </person-group>
          <article-title>Quantification of molecular shape and its correlation with physico-chemical properties</article-title>
          <source>Indian J. Chem.</source>
          <year>1987</year>
          <volume>24A</volume>
          <fpage>554</fpage>
          <lpage>555</lpage>
        </citation>
      </ref>
      <ref id="B22-molecules-17-08982">
        <label>22.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Enslein</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>The future of toxicity prediction with QSAR</article-title>
          <source>In Vitro Toxicol.</source>
          <year>1993</year>
          <volume>6</volume>
          <fpage>163</fpage>
          <lpage>169</lpage>
        </citation>
      </ref>
      <ref id="B23-molecules-17-08982">
        <label>23.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ruiz</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Faroon</surname>
              <given-names>O.</given-names>
            </name>
            <name>
              <surname>Moudgal</surname>
              <given-names>C.J.</given-names>
            </name>
            <name>
              <surname>Hansen</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>De Rosa</surname>
              <given-names>C.T.</given-names>
            </name>
            <name>
              <surname>Mumtaz</surname>
              <given-names>M.</given-names>
            </name>
          </person-group>
          <article-title>Prediction of the health effects of polychlorinated biphenyls (PCBs) and their metabolites using quantitative structure-activity relationship (QSAR)</article-title>
          <source>Toxicol. Lett.</source>
          <year>2008</year>
          <volume>181</volume>
          <fpage>53</fpage>
          <lpage>65</lpage>
          <pub-id pub-id-type="doi">10.1016/j.toxlet.2008.06.870</pub-id>
        </citation>
      </ref>
      <ref id="B24-molecules-17-08982">
        <label>24.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gombar</surname>
              <given-names>V.K.</given-names>
            </name>
            <name>
              <surname>Enslein</surname>
              <given-names>K.</given-names>
            </name>
          </person-group>
          <article-title>Quantitative Structure-Activity Relationship (QSAR) studies using electronic descriptors calculated from topological and Molecular Orbital (MO) methods</article-title>
          <source>QSAR</source>
          <year>1990</year>
          <volume>9</volume>
          <fpage>321</fpage>
          <lpage>325</lpage>
        </citation>
      </ref>
      <ref id="B25-molecules-17-08982">
        <label>25.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hall</surname>
              <given-names>L.H.</given-names>
            </name>
            <name>
              <surname>Mohney</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Kier</surname>
              <given-names>L.B.</given-names>
            </name>
          </person-group>
          <article-title>The electrotopological state: Structure information at the atomic level for molecular graphs</article-title>
          <source>J. Chem. Inf. Comput. Sci.</source>
          <year>1991</year>
          <volume>31</volume>
          <fpage>76</fpage>
          <lpage>82</lpage>
          <pub-id pub-id-type="doi">10.1021/ci00001a012</pub-id>
        </citation>
      </ref>
      <ref id="B26-molecules-17-08982">
        <label>26.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zheng</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Novel Variable Selection Quantitative Structure−Property relationship approach based on the k-Nearest-Neighbor principle</article-title>
          <source>J. Chem. Inf. Comput. Sci.</source>
          <year>1999</year>
          <volume>40</volume>
          <fpage>185</fpage>
          <lpage>194</lpage>
        </citation>
      </ref>
      <ref id="B27-molecules-17-08982">
        <label>27.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Eriksson</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Jaworska</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Worth</surname>
              <given-names>A.P.</given-names>
            </name>
            <name>
              <surname>Cronin</surname>
              <given-names>M.T.</given-names>
            </name>
            <name>
              <surname>McDowell</surname>
              <given-names>R.M.</given-names>
            </name>
            <name>
              <surname>Gramatica</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Methods for reliability, uncertainty assessment, and applicability evaluations of classifications and regression-based QSARs</article-title>
          <source>Environ. Health Perspect.</source>
          <year>2003</year>
          <volume>111</volume>
          <fpage>1361</fpage>
          <lpage>1375</lpage>
          <pub-id pub-id-type="doi">10.1289/ehp.5758</pub-id>
        </citation>
      </ref>
      <ref id="B28-molecules-17-08982">
        <label>28.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Martin</surname>
              <given-names>T.M.</given-names>
            </name>
            <name>
              <surname>Harten</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Venkatapathy</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Das</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>D.M.</given-names>
            </name>
          </person-group>
          <article-title>A hierarchical clustering methodology for the estimation of toxicity</article-title>
          <source>Toxicol. Mech. Methods</source>
          <year>2008</year>
          <volume>18</volume>
          <fpage>251</fpage>
          <lpage>266</lpage>
          <pub-id pub-id-type="doi">10.1080/15376510701857353</pub-id>
        </citation>
      </ref>
      <ref id="B29-molecules-17-08982">
        <label>29.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Golbraikh</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Predictive QSAR modeling workflow, model applicability domains, and virtual screening</article-title>
          <source>Curr. Pharm. Des.</source>
          <year>2007</year>
          <volume>13</volume>
          <fpage>3494</fpage>
          <lpage>3504</lpage>
          <pub-id pub-id-type="doi">10.2174/138161207782794257</pub-id>
        </citation>
      </ref>
      <ref id="B30-molecules-17-08982">
        <label>30.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zhu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Martin</surname>
              <given-names>T.M.</given-names>
            </name>
            <name>
              <surname>Ye</surname>
              <given-names>L.</given-names>
            </name>
            <name>
              <surname>Sedykh</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Young</surname>
              <given-names>D.M.</given-names>
            </name>
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Quantitative structure-activity relationship modeling of rat acute toxicity by oral exposure</article-title>
          <source>Chem. Res. Toxicol.</source>
          <year>2009</year>
          <volume>22</volume>
          <fpage>1913</fpage>
          <lpage>1921</lpage>
          <pub-id pub-id-type="doi">10.1021/tx900189p</pub-id>
        </citation>
      </ref>
      <ref id="B31-molecules-17-08982">
        <label>31.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Golbraikh</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Shen</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Xiao</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Xiao</surname>
              <given-names>Y.-D.</given-names>
            </name>
            <name>
              <surname>Lee</surname>
              <given-names>K.-H.</given-names>
            </name>
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Rational selection of training and test sets for the development of validated QSAR models</article-title>
          <source>J. Comput. Aided Mol. Des.</source>
          <year>2003</year>
          <volume>17</volume>
          <fpage>241</fpage>
          <lpage>253</lpage>
          <pub-id pub-id-type="doi">10.1023/A:1025386326946</pub-id>
        </citation>
      </ref>
      <ref id="B32-molecules-17-08982">
        <label>32.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Golbraikh</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Beware of q2!</article-title>
          <source>J. Mol. Graph. Model.</source>
          <year>2002</year>
          <volume>20</volume>
          <fpage>269</fpage>
          <lpage>276</lpage>
          <pub-id pub-id-type="doi">10.1016/S1093-3263(01)00123-1</pub-id>
        </citation>
      </ref>
      <ref id="B33-molecules-17-08982">
        <label>33.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Breiman</surname>
              <given-names>L.</given-names>
            </name>
          </person-group>
          <article-title>Random Forests</article-title>
          <source>Mach. Learn.</source>
          <year>2001</year>
          <volume>45</volume>
          <fpage>5</fpage>
          <lpage>32</lpage>
          <pub-id pub-id-type="doi">10.1023/A:1010933404324</pub-id>
        </citation>
      </ref>
      <ref id="B34-molecules-17-08982">
        <label>34.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Papa</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Kovarich</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Gramatica</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>On the use of local and global QSPRs for the prediction of physico-chemical properties of polybrominated diphenyl ethers</article-title>
          <source>Mol. Inform.</source>
          <year>2011</year>
          <volume>30</volume>
          <fpage>232</fpage>
          <lpage>240</lpage>
          <pub-id pub-id-type="doi">10.1002/minf.201000148</pub-id>
        </citation>
      </ref>
      <ref id="B35-molecules-17-08982">
        <label>35.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Li</surname>
              <given-names>J.Z.</given-names>
            </name>
            <name>
              <surname>Gramatica</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Classification and virtual screening of androgen receptor antagonists</article-title>
          <source>J. Chem. Inf. Model.</source>
          <year>2010</year>
          <volume>50</volume>
          <fpage>861</fpage>
          <lpage>874</lpage>
          <pub-id pub-id-type="doi">10.1021/ci100078u</pub-id>
        </citation>
      </ref>
      <ref id="B36-molecules-17-08982">
        <label>36.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Sazonovas</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Japertas</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Didziapetris</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Estimation of reliability of predictions and model applicability domain evaluation in the analysis of acute toxicity (LD 50)</article-title>
          <source>SAR QSAR Environ. Res.</source>
          <year>2010</year>
          <volume>21</volume>
          <fpage>127</fpage>
          <lpage>148</lpage>
          <pub-id pub-id-type="doi">10.1080/10629360903568671</pub-id>
        </citation>
      </ref>
      <ref id="B37-molecules-17-08982">
        <label>37.</label>
        <citation citation-type="book">
          <source>TOPKAT User Guide Version 6.2</source>
          <publisher-name>Accelrys</publisher-name>
          <publisher-loc>San Diego, CA, USA</publisher-loc>
          <year>2004</year>
        </citation>
      </ref>
      <ref id="B38-molecules-17-08982">
        <label>38.</label>
        <citation citation-type="book">
          <source>T. E. S.T Tool, User’s Guide for T.E.S.T, Version 4.0.</source>
          <publisher-name>US EPA</publisher-name>
          <publisher-loc>Cincinatti, OH, USA</publisher-loc>
          <year>2011</year>
        </citation>
      </ref>
      <ref id="B39-molecules-17-08982">
        <label>39.</label>
        <citation citation-type="book">
          <source>User Manual Version 5.5</source>
          <publisher-name>Simulation Plus Inc, S. P.</publisher-name>
          <publisher-loc>Lancaster, CA, USA</publisher-loc>
          <year>2011</year>
        </citation>
      </ref>
      <ref id="B40-molecules-17-08982">
        <label>40.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Stedeford</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Zhao</surname>
              <given-names>Q.J.</given-names>
            </name>
            <name>
              <surname>Dourson</surname>
              <given-names>M.L.</given-names>
            </name>
            <name>
              <surname>Banasik</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hsu</surname>
              <given-names>C.H.</given-names>
            </name>
          </person-group>
          <article-title>The application of non-default uncertainty factors in the US EPA’s Integrated Risk Information System (IRIS). Part I: UFL, UFS, and “Other uncertainty factors”</article-title>
          <source>J. Environ. Sci. Heal. C</source>
          <year>2007</year>
          <volume>25</volume>
          <fpage>245</fpage>
          <lpage>279</lpage>
          <pub-id pub-id-type="doi">10.1080/10590500701569430</pub-id>
        </citation>
      </ref>
      <ref id="B41-molecules-17-08982">
        <label>41.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Votano</surname>
              <given-names>J.R.</given-names>
            </name>
            <name>
              <surname>Parham</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Hall</surname>
              <given-names>L.H.</given-names>
            </name>
            <name>
              <surname>Kier</surname>
              <given-names>L.B.</given-names>
            </name>
            <name>
              <surname>Oloff</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Tong</surname>
              <given-names>W.</given-names>
            </name>
          </person-group>
          <article-title>Three new consensus QSAR models for the prediction of Ames genotoxicity</article-title>
          <source>Mutagenesis</source>
          <year>2004</year>
          <volume>19</volume>
          <fpage>365</fpage>
          <lpage>377</lpage>
          <pub-id pub-id-type="doi">10.1093/mutage/geh043</pub-id>
        </citation>
      </ref>
      <ref id="B42-molecules-17-08982">
        <label>42.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Moore</surname>
              <given-names>D.R.J.</given-names>
            </name>
            <name>
              <surname>Breton</surname>
              <given-names>R.L.</given-names>
            </name>
            <name>
              <surname>MacDonald</surname>
              <given-names>D.B.</given-names>
            </name>
          </person-group>
          <article-title>A comparison of model performance for six quantitative structure-activity relationship packages that predict acute toxicity to fish</article-title>
          <source>Environ. Toxicol. Chem.</source>
          <year>2003</year>
          <volume>22</volume>
          <fpage>1799</fpage>
          <lpage>1809</lpage>
          <pub-id pub-id-type="doi">10.1897/00-361</pub-id>
        </citation>
      </ref>
      <ref id="B43-molecules-17-08982">
        <label>43.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tong</surname>
              <given-names>W.</given-names>
            </name>
            <name>
              <surname>Hong</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Fang</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Xie</surname>
              <given-names>Q.</given-names>
            </name>
            <name>
              <surname>Perkins</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Decision forest: Combining the predictions of multiple independent decision tree models</article-title>
          <source>J. Chem. Inf. Comput. Sci.</source>
          <year>2003</year>
          <volume>43</volume>
          <fpage>525</fpage>
          <lpage>531</lpage>
          <pub-id pub-id-type="doi">10.1021/ci020058s</pub-id>
        </citation>
      </ref>
      <ref id="B44-molecules-17-08982">
        <label>44.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tunkel</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Mayo</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Austin</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Hickerson</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Howard</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>Practical considerations on the use of predictive models for regulatory purposes</article-title>
          <source>Environ. Sci. Technol.</source>
          <year>2005</year>
          <volume>39</volume>
          <fpage>2188</fpage>
          <lpage>2199</lpage>
          <pub-id pub-id-type="doi">10.1021/es049220t</pub-id>
        </citation>
      </ref>
      <ref id="B45-molecules-17-08982">
        <label>45.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Devillers</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Mombelli</surname>
              <given-names>E.</given-names>
            </name>
          </person-group>
          <article-title>Evaluation of the OECD QSAR Application Toolbox and Toxtree for estimating the mutagenicity of chemicals. Part 2. alpha-beta unsaturated aliphatic aldehydes</article-title>
          <source>SAR QSAR Environ. Res.</source>
          <year>2010</year>
          <volume>21</volume>
          <fpage>771</fpage>
          <lpage>783</lpage>
          <pub-id pub-id-type="doi">10.1080/1062936X.2010.528961</pub-id>
        </citation>
      </ref>
      <ref id="B46-molecules-17-08982">
        <label>46.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Fjodorova</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Novich</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Vrachko</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Smirnov</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Kharchevnikova</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Zholdakova</surname>
              <given-names>Z.</given-names>
            </name>
            <name>
              <surname>Novikov</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Skvortsova</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Filimonov</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Poroikov</surname>
              <given-names>V.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Directions in QSAR modeling for regulatory uses in OECD member countries, EU and in Russia</article-title>
          <source>J. Environ. Sci. Heal. C</source>
          <year>2008</year>
          <volume>26</volume>
          <fpage>201</fpage>
          <lpage>236</lpage>
          <pub-id pub-id-type="doi">10.1080/10590500802135578</pub-id>
        </citation>
      </ref>
      <ref id="B47-molecules-17-08982">
        <label>47.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Liu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Papa</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Gramatica</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>QSAR prediction of estrogen activity for a large set of diverse chemicals under the guidance of OECD principles</article-title>
          <source>Chem. Res. Toxicol.</source>
          <year>2006</year>
          <volume>19</volume>
          <fpage>1540</fpage>
          <lpage>1548</lpage>
          <pub-id pub-id-type="doi">10.1021/tx0601509</pub-id>
        </citation>
      </ref>
      <ref id="B48-molecules-17-08982">
        <label>48.</label>
        <citation citation-type="web">
          <collab>Canadian Center for Occupational Health &amp; Safety</collab>
           
          <article-title>What is an LD<sub>50</sub> and LC<sub>50</sub></article-title>
          <access-date>(accessed on 26 July 2012)</access-date>
          <comment>Available online:<ext-link xmlns:xlink="http://www.w3.org/1999/xlink" xlink:href="http://www.ccohs.ca/oshanswers/chemicals/LD50.html#_1_6" ext-link-type="uri">http://www.ccohs.ca/oshanswers/chemicals/LD50.html#_1_6</ext-link></comment>
        </citation>
      </ref>
      <ref id="B49-molecules-17-08982">
        <label>49.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Barratt</surname>
              <given-names>M.D.</given-names>
            </name>
          </person-group>
          <article-title>Integrating computer prediction systems with in vitro methods towards a better understanding of toxicology</article-title>
          <source>Toxicol. Lett.</source>
          <year>1998</year>
          <volume>102–103</volume>
          <fpage>617</fpage>
          <lpage>621</lpage>
          <pub-id pub-id-type="doi">10.1016/S0378-4274(98)00266-5</pub-id>
        </citation>
      </ref>
      <ref id="B50-molecules-17-08982">
        <label>50.</label>
        <citation citation-type="gov">
          <collab>National Research Council (U.S.)</collab>
           
          <article-title>Committee on Toxicity Testing and Assessment of Environmental Agents</article-title>
          <source>Toxicity Testing in the 21st Century: A Vision and a Strategy</source>
          <publisher-name>National Academies Press</publisher-name>
          <publisher-loc>Washington, DC, USA</publisher-loc>
          <year>2007</year>
          <fpage>xvii</fpage>
          <lpage>196</lpage>
          <page-range>xvii, 196</page-range>
         </citation>
      </ref>
      <ref id="B51-molecules-17-08982">
        <label>51.</label>
        <citation citation-type="book">
          <source>Guidance Document on the Validation and International Acceptance of New or Updated Test Methods for Hazard Assessment, Series on Testing and Assessment</source>
          <publisher-name>OECD</publisher-name>
          <publisher-loc>Paris, France</publisher-loc>
          <year>2005</year>
          <fpage>96</fpage>
        </citation>
      </ref>
      <ref id="B52-molecules-17-08982">
        <label>52.</label>
        <citation citation-type="gov">
          <collab>National Research Council (U.S.)</collab>
           
          <article-title>Committee on Applications of Toxicogenomic Technologies to Predictive Toxicology</article-title>
          <source>Applications of Toxicogenomic Technologies to Predictive Toxicology and Risk Assessment</source>
          <publisher-name>National Academies Press</publisher-name>
          <publisher-loc>Washington, DC, USA</publisher-loc>
          <year>2007</year>
          <fpage>xxii</fpage>
          <lpage>275</lpage>
          <page-range>xxii, 275</page-range>
         </citation>
      </ref>
      <ref id="B53-molecules-17-08982">
        <label>53.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schrage</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Hempel</surname>
              <given-names>K.</given-names>
            </name>
            <name>
              <surname>Schulz</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Kolle</surname>
              <given-names>S.N.</given-names>
            </name>
            <name>
              <surname>van Ravenzwaay</surname>
              <given-names>B.</given-names>
            </name>
            <name>
              <surname>Landsiedel</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Refinement and reduction of acute oral toxicity testing: A critical review of the use of cytotoxicity data</article-title>
          <source>Atla.-Altern. Lab. Anim.</source>
          <year>2011</year>
          <volume>39</volume>
          <fpage>273</fpage>
          <lpage>295</lpage>
        </citation>
      </ref>
      <ref id="B54-molecules-17-08982">
        <label>54.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Worth</surname>
              <given-names>A.P.</given-names>
            </name>
            <name>
              <surname>Bassan</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>De Bruijn</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Gallegos Saliner</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Netzeva</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Pavan</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Patlewicz</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Tsakovska</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Eisenreich</surname>
              <given-names>S.</given-names>
            </name>
          </person-group>
          <article-title>The role of the European Chemicals Bureau in promoting the regulatory use of (Q)SAR methods</article-title>
          <source>SAR QSAR Environ. Res.</source>
          <year>2007</year>
          <volume>18</volume>
          <fpage>111</fpage>
          <lpage>125</lpage>
        <pub-id pub-id-type="doi">10.1080/10629360601054255</pub-id><pub-id pub-id-type="pmid">17365963</pub-id></citation>
      </ref>
      <ref id="B55-molecules-17-08982">
        <label>55.</label>
        <citation citation-type="book">
          <collab>Organisation for Economic Co-operation and Development (OECD)</collab>
          
          <source>Report on the Regulatory Uses and Applications in OECD Member Countries of (Quantitative) Structure-Activity Relationship [(Q)SAR] Models in the Assessment of New and Existing Chemicals, Series on Testing and Assessment</source>
          <publisher-name>OECD</publisher-name>
          <publisher-loc>Paris, France</publisher-loc>
          <year>2006</year>
          <fpage>79</fpage>
        </citation>
      </ref>
      <ref id="B56-molecules-17-08982">
        <label>56.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Moudgal</surname>
              <given-names>C.J.</given-names>
            </name>
            <name>
              <surname>Lipscomb</surname>
              <given-names>J.C.</given-names>
            </name>
            <name>
              <surname>Bruce</surname>
              <given-names>R.M.</given-names>
            </name>
          </person-group>
          <article-title>Potential health effects of drinking water disinfection by-products using quantitative structure toxicity relationship</article-title>
          <source>Toxicology</source>
          <year>2000</year>
          <volume>147</volume>
          <fpage>109</fpage>
          <lpage>131</lpage>
          <pub-id pub-id-type="doi">10.1016/S0300-483X(00)00188-8</pub-id>
        </citation>
      </ref>
      <ref id="B57-molecules-17-08982">
        <label>57.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Venkatapathy</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Wang</surname>
              <given-names>C.Y.</given-names>
            </name>
            <name>
              <surname>Bruce</surname>
              <given-names>R.M.</given-names>
            </name>
            <name>
              <surname>Moudgal</surname>
              <given-names>C.</given-names>
            </name>
          </person-group>
          <article-title>Development of quantitative structure-activity relationship (QSAR) models to predict the carcinogenic potency of chemicals: I. Alternative toxicity measures as an estimator of carcinogenic potency</article-title>
          <source>Toxicol. Appl. Pharmacol.</source>
          <year>2009</year>
          <volume>234</volume>
          <fpage>209</fpage>
          <lpage>221</lpage>
          <pub-id pub-id-type="doi">10.1016/j.taap.2008.09.028</pub-id>
        </citation>
      </ref>
      <ref id="B58-molecules-17-08982">
        <label>58.</label>
        <citation citation-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Romesburg</surname>
              <given-names>H.C.</given-names>
            </name>
          </person-group>
          <source>Cluster Analysis for Researchers</source>
          <publisher-name>LULU Press</publisher-name>
          <publisher-loc>North Carolina, NC, USA</publisher-loc>
          <year>1984</year>
        </citation>
      </ref>
      <ref id="B59-molecules-17-08982">
        <label>59.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Contrera</surname>
              <given-names>J.F.</given-names>
            </name>
            <name>
              <surname>Matthews</surname>
              <given-names>E.J.</given-names>
            </name>
            <name>
              <surname>Daniel Benz</surname>
              <given-names>R.</given-names>
            </name>
          </person-group>
          <article-title>Predicting the carcinogenic potential of pharmaceuticals in rodents using molecular structural similarity and E-state indices</article-title>
          <source>Regul. Toxicol. Pharmacol.</source>
          <year>2003</year>
          <volume>38</volume>
          <fpage>243</fpage>
          <lpage>259</lpage>
          <pub-id pub-id-type="doi">10.1016/S0273-2300(03)00071-0</pub-id>
        </citation>
      </ref>
      <ref id="B60-molecules-17-08982">
        <label>60.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Netzeva</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Worth</surname>
              <given-names>A.P.</given-names>
            </name>
            <name>
              <surname>Aldenberg</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Benigni</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Cronin</surname>
              <given-names>M.T.</given-names>
            </name>
            <name>
              <surname>Gramatica</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Jaworska</surname>
              <given-names>J.</given-names>
            </name>
            <name>
              <surname>Kahn</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Klopman</surname>
              <given-names>G.</given-names>
            </name>
            <name>
              <surname>Marchant</surname>
              <given-names>C.A.</given-names>
            </name>
            <etal/>
          </person-group>
          <article-title>Current status of methods for defining the applicability domain of (quantitative) structure-activity relationships. The report and recommendations of ECVAM Workshop 52</article-title>
          <source>Altern. Lab. Anim.</source>
          <year>2005</year>
          <volume>33</volume>
          <fpage>1</fpage>
          <lpage>19</lpage>
        <pub-id pub-id-type="pmid">15804212</pub-id></citation>
      </ref>
      <ref id="B61-molecules-17-08982">
        <label>61.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Schultz</surname>
              <given-names>T.W.</given-names>
            </name>
            <name>
              <surname>Hewitt</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Netzeva</surname>
              <given-names>T.I.</given-names>
            </name>
            <name>
              <surname>Cronin</surname>
              <given-names>M.T.D.</given-names>
            </name>
          </person-group>
          <article-title>Assessing Applicability Domains of Toxicological QSARs: Definition, Confidence in Predicted Values, and the Role of Mechanisms of Action</article-title>
          <source>QSAR Comb. Sci.</source>
          <year>2007</year>
          <volume>26</volume>
          <fpage>238</fpage>
          <lpage>254</lpage>
          <pub-id pub-id-type="doi">10.1002/qsar.200630020</pub-id>
        </citation>
      </ref>
      <ref id="B62-molecules-17-08982">
        <label>62.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Tetko</surname>
              <given-names>I.V.</given-names>
            </name>
            <name>
              <surname>Sushko</surname>
              <given-names>I.</given-names>
            </name>
            <name>
              <surname>Pandey</surname>
              <given-names>A.K.</given-names>
            </name>
            <name>
              <surname>Zhu</surname>
              <given-names>H.</given-names>
            </name>
            <name>
              <surname>Tropsha</surname>
              <given-names>A.</given-names>
            </name>
            <name>
              <surname>Papa</surname>
              <given-names>E.</given-names>
            </name>
            <name>
              <surname>Oberg</surname>
              <given-names>T.</given-names>
            </name>
            <name>
              <surname>Todeschini</surname>
              <given-names>R.</given-names>
            </name>
            <name>
              <surname>Fourches</surname>
              <given-names>D.</given-names>
            </name>
            <name>
              <surname>Varnek</surname>
              <given-names>A.</given-names>
            </name>
          </person-group>
          <article-title>Critical Assessment of QSAR Models of Environmental Toxicity against Tetrahymenapyriformis: Focusing on Applicability Domain and Overfitting by Variable Selection</article-title>
          <source>J. Chem. Inf. Model.</source>
          <year>2008</year>
          <volume>48</volume>
          <fpage>1733</fpage>
          <lpage>1746</lpage>
          <pub-id pub-id-type="doi">10.1021/ci800151m</pub-id>
        </citation>
      </ref>
      <ref id="B63-molecules-17-08982">
        <label>63.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Roy</surname>
              <given-names>P.P.</given-names>
            </name>
            <name>
              <surname>Kovarich</surname>
              <given-names>S.</given-names>
            </name>
            <name>
              <surname>Gramatica</surname>
              <given-names>P.</given-names>
            </name>
          </person-group>
          <article-title>QSAR Model Reproducibility and Applicability: A Case Study of Rate Constants of Hydroxyl Radical Reaction Models Applied to PolybrominatedDiphenyl Ethers and (Benzo-)Triazoles</article-title>
          <source>J. Comput. Chem.</source>
          <year>2011</year>
          <volume>32</volume>
          <fpage>2386</fpage>
          <lpage>2396</lpage>
          <pub-id pub-id-type="doi">10.1002/jcc.21820</pub-id>
        </citation>
      </ref>
      <ref id="B64-molecules-17-08982">
        <label>64.</label>
        <citation citation-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Pohl</surname>
              <given-names>H.R.</given-names>
            </name>
            <name>
              <surname>Chou</surname>
              <given-names>C.H.</given-names>
            </name>
            <name>
              <surname>Ruiz</surname>
              <given-names>P.</given-names>
            </name>
            <name>
              <surname>Holler</surname>
              <given-names>J.S.</given-names>
            </name>
          </person-group>
          <article-title>Chemical risk assessment and uncertainty associated with extrapolation across exposure duration</article-title>
          <source>Regul.Toxicol. Pharmacol.</source>
          <year>2010</year>
          <volume>57</volume>
          <fpage>18</fpage>
          <lpage>23</lpage>
          <pub-id pub-id-type="doi">10.1016/j.yrtph.2009.11.007</pub-id>
        </citation>
      </ref>
    </ref-list>
    <fn-group>
    <fn>
    <p><italic>Sample Availability: </italic>Not available.</p>
    </fn>
    </fn-group>
  </back>
</article>
