Next Article in Journal
Magnetic Nanoemulsions for the Intra-Articular Delivery of Ascorbic Acid and Dexamethasone
Next Article in Special Issue
Molecular Modelling of Polychlorinated Dibenzo-p-Dioxins Non-Covalent Interactions with β and γ-Cyclodextrins
Previous Article in Journal
Structural Insight into the Amino Acid Environment of the Two-Domain Laccase’s Trinuclear Copper Cluster
Previous Article in Special Issue
Iridium Complexes with BIAN-Type Ligands: Synthesis, Structure and Redox Chemistry
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Comparison of Intermolecular Halogen...Halogen Distances in Organic and Organometallic Crystals

Chemistry Department, Moscow M. V. Lomonosov State University, 119991 Moscow, Russia
Int. J. Mol. Sci. 2023, 24(15), 11911; https://doi.org/10.3390/ijms241511911
Submission received: 1 June 2023 / Revised: 15 July 2023 / Accepted: 21 July 2023 / Published: 25 July 2023

Abstract

:
Statistical analysis of halogen...halogen intermolecular distances was performed for three sets of homomolecular crystals under normal conditions: C–Hal1...Hal2–C distances in crystals consisting of: (i) organic compounds (set Org); (ii) organometallic compounds (set Orgmet); and (iii) distances M1–Hal1...Hal2–M2 (set MHal) (in all cases Hal1 = Hal2, and in MHal M1 = M2, M is any metal). When analyzing C–Hal...Hal–C distances, a new method for estimating the values of van der Waals radii is proposed, based on the use of two subsets of distances: (i) the shortest distances from each substance less than a threshold; and (ii) all C–Hal...Hal–C distances less than the same threshold. As initial approximations for these thresholds for different Hal, the Ragg values previously introduced in investigations with the participation of the author were used (Ragg values make it possible to perform a statistical assessment of the presence of halogen aggregates in crystals). The following values are recommended in this work to be used as universal values for crystals of organic and organometallic compounds: RF = 1.57, RCl = 1.90, RBr = 1.99, and RI = 2.15 Å. They are in excellent agreement with the results of some other works but significantly (by 0.10–0.17 Å) greater than the commonly used values. For the Orgmet set, slightly lower values for RI (2.11–2.09 Å) were obtained, but number of the C–I...I–C distances available for analysis was significantly smaller than in the other subgroups, which may be the reason for the discrepancy with value for the Org set (2.15 Å). Statistical analysis of the M–Hal...Hal–M distances was performed for the first time. A Hal-aggregation coefficient for M–Hal bonds is proposed, which allows one to estimate the propensity of M–Hal groups with certain M and Hal to participate in Hal-aggregates formed by M–Hal...Hal–M contacts. In particular, it was found that, for the Hg–Hal groups (Hal = Cl, Br, I), there is a high probability that the crystals have Hg–Hal...Hal–Hg distances with length ≤ Ragg.

1. Introduction

The role played by intermolecular contacts between halogen atoms (halogen...halogen contacts or Hal...Hal for short) in the formation of crystal structures of both organic [1,2] and organometallic [3,4] compounds is well known. This role is often reduced to considering only the shortest (shorter than the sum of van der Waals radii, i.e., 2RHal for identical Hal) contacts that can be described as a halogen bond [5]. Meanwhile, it was noted for the first time in [6] that the grouping of several halogen atoms can play an important role in the formation of crystal structures, even if not all Hal...Hal distances between neighboring atoms are shortened compared with 2RHal. This phenomenon, for which we later proposed the name “aggregation of halogen atoms” or “Hal-aggregation”, was considered in a series of our works [7,8,9,10,11,12] with a number of examples.
Halogen atoms are not among the most common in protein or nucleic acid molecules; nevertheless, even almost 20 years ago in the PDB [13] (July 2004 release), 964 C–Hal bonds were found [14] in proteins and 321 in nucleic acids (for both sets Hal = Cl, Br, I). At the same time, halogenation of drugs often increases membrane binding and permeation [15], leading to the desired steric and conformational effects when interacting with the target site [16], so it is not surprising that the proportion of halogenated compounds among drugs is quite large (about 25% in DrugBank [17] according to [18]). In addition, intermolecular contacts involving halogen atoms contribute to the stabilization of protein-ligand complexes; therefore, plenty of works have been devoted to their study (e.g., [19,20,21,22]).
However, regardless of whether researchers consider only short Hal...Hal contacts or Hal-aggregates or halogen bonds in different type of substances, the starting point in the analysis is the values of the van der Waals radii of halogen atoms (RHal). There have been a large number of works devoted to both the actual determination of RHal [23,24,25,26,27,28,29,30] and the discussion of values obtained by various methods [31]. In 2020, to determine the van der Waals radii of atoms in crystals, the authors [30] created arrays of distances, the longest of which obviously greatly exceeded the sum of van der Waals radii, and then they described them using Gaussian functions. An approach based on the use of several Gaussian functions to describe distance distributions in crystals was proposed by me a little later [32], independently of the authors [30].
In [30], the authors considered distances in crystals with molecules containing only H(D), B, C, N, O, F, Si, P, S, Cl, As, Se, Br, and I atoms. If the values of the van der Waals radii are constant in any environment, then the indicated restriction should not have a significant effect on the result. At the same time, without additional research, it cannot be ruled out that some differences in the structure of crystals consisting of molecules of only organic compounds or of molecules of organometallic compounds [33] may also affect the average values of certain intermolecular contacts.
Therefore, one of the purposes of this work was to compare the distributions of intermolecular distances Hal...Hal in two sets of crystals: those consisting of molecules of organic compounds (Org) and those of molecules of organometallic compounds (Orgmet). In addition, the statistical analysis of intermolecular metal–halogen...halogen–metal distances was of significant interest because of very scarce information about these distances in the literature.

2. Results

The procedure for the formation of different Hal...Hal distance sets that were used to perform statistical analysis is described in detail in Section 4. Table 1 provides information about the number of substances, the molecules of which have C–Hal bonds, for the sets Org and Orgmet. The proportions of these substances making up of the total number of crystals in the Org (138,916) and Orgmet (76,439) sets that satisfy all search conditions were also determined, except for the presence of a certain C–Hal bond, i.e., how common the substances with such bonds are. It can be seen that this fraction is the maximum for crystals with a C–Cl bond in the Org set (10.6%) and the minimum for crystals with a C–I bond in the Orgmet set (0.3%).
Next, in the formed sets, the C–Hal...Hal–C intermolecular distances were calculated for the same Hal atoms (d). In previous works [9], it was shown that the length of the Hal...Hal contacts involved in halogen aggregation usually does not exceed Ragg = 2RHal + 0.5 Å; i.e., in statistical studies (without a detailed analysis of aggregates in each substance), this quantity may be used to estimate the number of structures containing Hal-aggregates. In [9], RF = 1.40 [27], RCl = 1.90 [26], RBr = 1.97 [28], and RI = 2.14 Å [28] were used; therefore, the upper limit d (dmax) was initially determined in this work as 3.300, 4.300, 4.440, and 4.780 Å for the F...F, Cl...Cl, Br...Br, and I...I distances, respectively. The number of substances containing such distances is also indicated in Table 1.
It turned out that, according to this criterion, the fraction of substances containing Hal-aggregates is systematically higher in the Orgmet set than in Org, with the greatest difference (by 22.6%) being observed for F-aggregates and the smallest difference for I-aggregates (3.8%).

3. Discussion

3.1. Analysis of the Shortest C–Hal...Hal–C Distances

The shortest C–Hal...Hal–C distances of various types for crystals of organic compounds are reported in Table 2. The analysis of the adequacy of such distances is usually not carried out, although the almost twofold reduction in distances compared to 2RHal apparently indicates errors in the determination of structures. In this work, the following approach was used to assess the adequacy of the shortest distances. All distances of each type were sorted by increase, and if the difference between the previous and next contact was more than 0.1 Å, then the previous contact was classified as unrealistically short. The presence of too short Hal...Hal distances may be associated with general errors in the determination of the structure; therefore, not only these short distances but also all distances in the corresponding substances (records) were excluded from further analysis. It turned out that there was no need to remove substances only from subsets of contacts I...I. In the remaining subsets, the entries PUCREL, FFMXZP, JODVAZ, VAWNUE, MIWHOP01, and XACXEE in the Org set and TUKNOC, KUSMOD, and CUCNUM in the Orgmet set were deleted. The number and range of distances used in further analysis are shown in Table 3. The results obtained below indicate that the previously used value RF = 1.40 Å is underestimated; therefore, the sample for C–F...F–C distances was also extended for this type of distance in Table 3 and has two rows.
It can be seen that, in the corrected arrays, the reduction in the shortest distances compared to 2RHal (proposed in [26,27,28]) does not exceed ~20%, which seems reasonable.

3.2. Analysis of Distance Distributions

Often, when determining van der Waals radii, only the shortest distances are considered. However, van der Waals radii are most valuable when they allow for estimation of not the shortest but the most probable non-valence distances. In this paper, for each type of distances C–Hal...Hal–C in each of the two main sets Org and Orgmet, two variants of distance arrays were analyzed. The first one included all distances of the corresponding type within dmax in the selected substances, while the second included the one shortest distance of the corresponding type from each selected substance; thus, the longest distances in the second array also did not exceed dmax. For these arrays of distances, histograms were constructed, examples of which for the Org set are shown in Figure 1 and Figure 2.
The maxima on all histograms were described by Gaussian function (red lines in Figure 1 and Figure 2). It should be noted that, when the histogram step changes, the view of histogram changes to some extent (for example, Figure 1a,b shows the distributions of the F...F distances with steps of 0.2 and 0.1 Å), but the position of the maximum when described by the Gaussian function remains almost constant (see Table 4 and Table 5). In Figure 1a, the position and the very presence of the maximum do not seem obvious, which, as noted earlier, is most likely a consequence of the initially chosen Ragg value for the F...F distances being underestimated. Therefore, additional arrays with dmax = 3.50 Å were analyzed for this type of distance. Table 4 shows that, in this case, as for other types of distances, the distribution parameters depend very little on the histogram step.
Table 6 shows the differences between the positions of the maxima of the Gaussian functions for different options used for histograms.
The results for I…I contacts in the Orgmet set do not quite match the trends for other types of contacts in several cases. Perhaps this outcome is due to the significantly smaller number of I…I contacts in this set, especially for the sample of the shortest contacts.
In the Org set, the differences in the positions of the maxima for histograms with steps of 0.2 and 0.1 Å do not exceed 0.011 Å. The same maximum difference appears in the Orgmet set if the results for contacts I…I are not considered. Thus, the differences in the positions of the maxima between Org and Orgmet can be considered significant if they exceed 0.01 Å.
It turns out that the positions of the maxima for the shortest contacts of all types in Orgmet correspond to significantly shorter distances than in Org: the contraction is in the range of 0.023–0.055 Å for contacts F…F, Cl…Cl, and Br…Br. For the same contacts from the All arrays, a different pattern is observed: for F…F, on average, shorter (by 0.050–0.058 Å) contacts exist in the Org set, Cl…Cl contacts are also shorter in Org but only by 0.013–0.014 Å, and the Br…Br contacts are on average longer (by 0.017–0.022 Å) in Org than in Orgmet.
As expected, the maxima for First contacts correspond to shorter distances than the maxima for All for all types of distances in both sets (Org and Orgmet), while in its meaning, the sum of van der Waals radii should be greater than the maximum values for First and less than the maximum values for All.
The expression (xall + xfirst)/4 was used to estimate the value of the van der Waals radii. It turns out (Table 7) that the values obtained in this way are in excellent (within 0.01 Å) agreement with each other, both for different histogram spacing and for the Org and Orgmet sets. An exception is the discrepancy in the RI estimates for Org and Orgmet, which, as noted above, may be due to an insufficient number of contacts in the C–I… I–C sample for Orgmet.
Table 8 compares the results of determining the van der Waals radii in this work with some data from the literature.
The values obtained in this work are in good agreement with the data [30] obtained using a similar technique. At the same time, it is important to note the good agreement between the values for Cl, Br, and I with the results [26,27,28], which were obtained by another method and which were previously used to estimate statistically the values of Ragg. Thus, the previously obtained data on halogen aggregation involving these atoms remain relevant, while the data on F-aggregation can be revised considering the new value of RF.
Thus, the obtained results indicate that the following van der Waals radii for halogen atoms bonded to a carbon atom can be recommended as unified values for crystals of organic and organometallic compounds under normal conditions: RF = 1.57, RCl = 1.90, RBr = 1.99, and RI = 2.15 Å. It makes sense to clarify the value of RI in crystals of organometallic compounds when data for a larger number of structures become available.

3.3. The M–Hal…Hal–M Distances

Halogen bonds involving M–Hal groups have been the subject of many studies [3,4,34,35,36]. However, as a rule, contacts of halogen atoms from such groups have been considered either with other elements or with halogen atoms that do not form M–Hal bonds. Contacts M–Hal…Hal–M have rarely been noted by researchers [37]. Therefore, one of the goals of this work was to statistically analyze the M1–Hal1…Hal2–M2 distances. In this case, as in the analysis of the C–Hal1…Hal2–C distances, only the distances between the same atoms (Hal1 = Hal2, M1 = M2) under normal conditions were considered. The rules for selecting substances for the MHal set are described in more detail in Section 4.
Data on the number of different metal elements (M) with M–Hal bonds, according to the CSD [38] search, are provided in Table 9. Detailed information about the number of symmetrically independent bonds of each M–Hal type is given in Table 10. It can be seen that, among the studied substances, some types of M–Hal bonds are very rare. Therefore, in addition to the number of metal elements that have at least one particular M–Hal bond in all crystals, the numbers of M with more than 20 and more than 40 symmetrically independent M–Hal bonds in total are presented in Table 9.
Table 10 shows the statistical data characterizing all the M–Hal groups found in CSD (under normal conditions of crystal study): NM–Hal is the number of symmetrically independent bonds; NHal…Hal is the number of distances M–Hal…Hal–M ≤ Ragg, as well as the Hal-aggregation coefficient for M–Hal bonds, which is proposed in this work to estimate the propensity of M–Hal groups of a certain type to participate in aggregates formed by M–Hal…Hal–M contacts, kMHal-agg = NHal…Hal/NM–Hal. The higher the kMHal-agg value is, the more likely it is that the M–Hal group with specific M and Hal forms at least one M–Hal…Hal–M contact with length ≤ Ragg. In this part of the work, when calculating Ragg = 2RHal + 0.5 Å, the RHal values obtained in the previous stage as a result of the analysis of the C–Hal…Hal–C distances (Section 3.2) were used, namely: RF = 1.57, RCl = 1.90, RBr = 1.99, and RI = 2.15 Å.
Obviously, the kMHal-agg coefficient has a predictive value if the number of symmetrically independent M–Hal bonds in the training set is sufficiently large. The smallest numbers of such bonds were found for the M–F groups; therefore, the NM–Hal boundaries in the analysis of kMHal-agg were chosen, considering the available NM–F. The graphs illustrating the changes in kMHal-agg depending on M and Hal show values for metals having more than 20 symmetrically independent M–Hal bonds in the set (Figure 3a) and more than 40 bonds (Figure 3b) (for convenience, the abscissa scales on Figure 3a,b are made identical; elements M are listed in alphabetical order).
In the set under consideration, when the condition NM–Hal > 20 is fulfilled (Figure 3a), the maximum and minimum values of kMHal-agg correspond to bonds involving fluorine. The largest value of kMHal-agg (2.17) is for the Sb–F group and the smallest (0) for the Ti–F group; i.e., the groups of Sb–F participate on average in more than two symmetrically independent distances of Sb–F...F–Sb, while the Ti–F groups are not at all inclined to form Ti–F...F–Ti contacts. It should be noted that the number of symmetrically independent groups of M–F in both cases is not too large (35 each), and as their number increases, the values of kMHal-agg can notably change.
In the subset with NM–Hal > 40 (Figure 3b), the largest value of kMHal-agg (1.17) is for the Ge–Cl group, while the value of Hg–I is close to it (1.09). In general, for the Hg–Hal groups (Hal = Cl, Br, I), the kMHal-agg values are high (0.80–1.09), while the numbers of such groups in the set is 190 or more (Table 10); i.e., with high probability, one can expect the presence of substances of distances of Hg–Hal...Hal–Hg with length ≤ Ragg for these Hal.
To estimate the parameters of the distributions of the M–Hal...Hal–M distances, the same approach was used as for the C–Hal...Hal–C distances. It should be noted that the smallest number of C–Hal...Hal–C distances is in the First subset of the Orgmet set for C–I...I–C (140). In the MHal set, there is about the same number of values (124) in the All subset for M–F...F–M distances, and in the First subset for distances of the same type, it is several times less (33). Accordingly, the error in parameters describing these distances can be high, especially in the First subset, as evidenced in particular by the low correlation coefficient for it (r2 = 0.702). The distances of M–Hal1...Hal2–M (Hal1 = Hal2 = Cl, Br, I) were described using Gaussian functions with two step sizes (0.1 and 0.2 Å). In this case, as in the analysis of the C–Hal...Hal–C distances, the position of the maxima of the functions changed very little; therefore, Table 11 lists only the parameters of the distributions obtained with a step of 0.2 Å.
Considering that the Hal atoms in the M–Hal groups can coexist with large ligands around M, which hinder the approach of the same groups from neighboring molecules, it is not surprising that the positions of the maximum exceed 2RHal, even in the First subsets. Somewhat surprisingly, for M–Cl...Cl–M, the xAll value is almost insignificant (by 0.026 Å) but is less than xFirst. Formally, for the distances of M–F...F–M, the same effect is observed and is much larger in magnitude; however, it could be due to the small number of these distances in the set, and, accordingly, the inaccuracy of the obtained parameters. For the M–Br...Br–M and M–I...I–M distances, the values of xm in the All subset are expectedly greater than in the First subset, but the difference of xallxfirst for M–Br...Br–M is notably smaller (0.053 Å) on average than for the C–Hal...Hal–C distances (Table 6).

4. Materials and Methods

4.1. General Procedure for the Formation of Hal...Hal Distance Sets

For the selection of crystalline substances, the Cambridge Structural Database (CSD) [38], version 5.43 (November 2021), +3 updates was used. The search was carried out with the ConQuest program [39], using combinations of several conditions.
Small halogen-containing molecules (CCl4, CHCl3, etc.) often occur in crystals as solvate molecules, and their position and/or orientation is often disordered, which is not always accurately described when determining the structure. Therefore, to increase the accuracy of the results, not only all records with disorder noted in CSD were excluded from consideration but also the structures of only homomolecular crystals (crystals consisting of identical molecules; the search condition in ConQuest NRes = 1) were considered.
Since the goal of this work was to analyze distances involving only terminal halogen atoms, when drawing the C–Hal or M–Hal fragments, the presence of only one bond was specified for Hal.
The specificity of the formation of crystals of polymeric compounds can lead to a difference in the distributions of the lengths of interatomic distances in them compared to crystals of nonpolymeric compounds; therefore, the corresponding entries were also excluded from consideration.
Temperature and pressure can affect the parameters of intermolecular contacts, so room temperature and normal pressure were other search conditions (records with the “pressure” field were excluded).
The Best room temperature list [40] was used to determine the best studies within the same family (records with the same letter part of the reference code).
The conditions listed above, supplemented by the presence of 3D coordinates (which was necessary for further distance calculations) and the absence of CSD error warnings, were used for searches.

4.2. Additional Terms for Particular Sets

To form a set of distances of C–Hal...Hal–C existing in crystals of organic compounds (Org), a search was carried out among records classified in CSD as Organics. Accordingly, to form a set of distances observed in crystals of organometallic compounds (Orgmet), the search was performed among records classified in the CSD as Organometallic. To form a set of distances of M–Hal...Hal–M, when drawing, M–Hal M was taken as ‘any metal’, and the search was performed on all records (i.e., without choosing Organics or Organometallic).

5. Conclusions

The values of the van der Waals radii of halogen atoms obtained in this work are in excellent agreement with the results of Chernyshov et al. [30] but significantly (by 0.10–0.17 Å) greater than the commonly used values of Bondi [25].
The positions of the maxima of the Gaussian functions, describing the shortest distances of C–Hal...Hal–C in each substance, are shifted to shorter distances in the set of homomolecular crystals of organometallic compounds compared to crystals of organic compounds. At the same time, the values of the van der Waals radii proposed considering the characteristics of the two distributions of contacts C–Hal... Hal–C (the shortest in each substance and all less than the threshold while somewhat greater than RHal) are almost the same in these two sets of substances, with the exception of radius I. The difference in values of radius I may be attributed to a significantly smaller number of C–I...I–C contacts in the set of crystals of organometallic compounds.
An extensive statistical analysis of the M–Hal...Hal–M distances has been performed for the first time. The Hal-aggregation coefficient for M–Hal bonds was proposed and calculated for all M–Hal groups presented in the MHal set, making it possible to estimate the propensity of M–Hal groups with certain M and Hal to participate in Hal-aggregates formed by M–Hal...Hal–M contacts.

Funding

This research received no external funding.

Acknowledgments

This work was performed as part of topic no. 121031300090-2.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Sarma, J.A.R.P.; Desiraju, G.R. The Role of Cl...Cl and C–H...O Interactions in the Crystal Engineering of 4-Å Short-Axis Structures. Acc. Chem. Res. 1986, 19, 222–228. [Google Scholar] [CrossRef]
  2. Desiraju, G.R. Crystal Engineering; Elsevier: Amsterdam, The Netherlands, 1989; 307p. [Google Scholar]
  3. Johnson, M.T.; Džolić, Z.; Cetina, M.; Wendt, O.F.; Öhrström, L.; Rissanen, K. Neutral Organometallic Halogen Bond Acceptors: Halogen Bonding in Complexes of PCPPdX (X = Cl, Br, I) with Iodine (I2), 1,4-Diiodotetrafluorobenzene (F4DIBz), and 1,4-Diiodooctafluorobutane (F8DIBu). Cryst. Growth Des. 2012, 12, 362–368. [Google Scholar] [CrossRef] [Green Version]
  4. Nemec, V.; Lisac, K.; Bedeković, N.; Fotović, L.; Stilinović, V.; Cinčić, D. Crystal engineering strategies towards halogen-bonded metal–organic multi-component solids: Salts, cocrystals and salt cocrystals. CrystEngComm 2021, 23, 3063–3083. [Google Scholar] [CrossRef]
  5. Cavallo, G.; Metrangolo, P.; Milani, R.; Pilati, T.; Priimagi, A.; Resnati, G.; Terraneo, G. The Halogen Bond. Chem. Rev. 2016, 116, 2478–2601. [Google Scholar] [CrossRef] [Green Version]
  6. Wheeler, G.L.; Colson, S.D. Intermolecular interactions in polymorphic p-dichlorobenzene crystals: The α, β, and γ phases at 100 °K. J. Chem. Phys. 1976, 65, 1227–1235. [Google Scholar] [CrossRef]
  7. Grineva, O.V.; Zorkii, P.M. Intermolecular interaction energy in crystals of some chlorine-containing organic compounds. Russ. J. Phys. Chem. A 1998, 72, 714–720. [Google Scholar]
  8. Grineva, O.V.; Zorky, P.M. Analysis of molecular Cl...Cl interactions in pentachlorobezene and hexachlorobenzene crystals. Cryst. Rep. 2000, 45, 633–639. [Google Scholar] [CrossRef]
  9. Grineva, O.V.; Zorkii, P.M. Aggregation of halogen atoms in haloorganic crystals with low halogen contents. Russ. J. Phys. Chem. A 2000, 74, 1758–1764. [Google Scholar]
  10. Grineva, O.V.; Zorkii, P.M. Isostructural and nonisostructural compounds in series of halogenated organic crystal substances. Structure of Hal-aggregates. J. Struct. Chem. 2001, 42, 16–23. [Google Scholar] [CrossRef]
  11. Grineva, O.V.; Zorkii, P.M. Aggregation of halogen atoms in crystalline isomers. J. Struct. Chem. 2002, 43, 995–1005. [Google Scholar] [CrossRef]
  12. Grineva, O.V.; Zorky, P.M.; Rostov, E.S. Crystal structures of monohalogenated benzoic acids. Struct. Chem. 2007, 18, 443–448. [Google Scholar] [CrossRef]
  13. Berman, H.M.; Westbrook, J.; Feng, Z.; Gilliland, G.; Bhat, T.N.; Weissig, H.; Shindyalov, I.N.; Bourne, P.E. The Protein Data Bank. Nucleic Acids Res. 2000, 28, 235–242. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  14. Auffinger, P.; Hays, F.A.; Westhof, E.; Ho, P.S. Halogen bonds in biological molecules. Proc. Natl Acad. Sci. USA 2004, 101, 16789–16794. [Google Scholar] [CrossRef] [PubMed]
  15. Gerebtzoff, G.; Li-Blatter, X.; Fischer, H.; Frentzel, A.; Seelig, A. Halogenation of Drugs Enhances Membrane Binding and Permeation. ChemBioChem 2004, 5, 676–684. [Google Scholar] [CrossRef]
  16. Hernandes, M.; Cavalcanti, S.M.; Moreira, D.R.; de Azevedo Junior, W.; Leite, A.C. Halogen Atoms in the Modern Medicinal Chemistry: Hints for the Drug Design. Curr. Drug Targets 2010, 11, 303–314. [Google Scholar] [CrossRef]
  17. Wishart, D.S.; Knox, C.; Guo, A.C.; Shrivastava, S.; Hassanali, M.; Stothard, P.; Chang, Z.; Woolsey, J. Drugbank: A comprehensive resource for in silico drug discovery and exploration. Nucleic Acids Res. 2006, 34, D668–D672. [Google Scholar] [CrossRef]
  18. Liu, N.; Lai, J.; Lyu, C.; Qiang, B.; Wang, H.; Jin, H.; Zhang, L.; Liu, Z. Chemical space, scaffolds, and halogenated compounds of CMNPD: A comprehensive chemoinformatic analysis. J. Chem. Inf. Model. 2021, 61, 3323–3336. [Google Scholar] [CrossRef] [PubMed]
  19. Hardegger, L.A.; Kuhn, B.; Spinnler, B.; Anselm, L.; Ecabert, R.; Stihle, M.; Gsell, B.; Thoma, R.; Diez, J.; Benz, J.; et al. Systematic Investigation of Halogen Bonding in Protein-Ligand Interactions. Angew. Chem. Int. Ed. 2011, 50, 314–318. [Google Scholar] [CrossRef]
  20. Shinada, N.K.; de Brevern, A.G.; Schmidtke, P. Halogens in Protein–Ligand Binding Mechanism: A Structural Perspective. J. Med. Chem. 2019, 62, 9341–9356. [Google Scholar] [CrossRef]
  21. Nunes, R.S.; Vila-Vicosa, D.; Costa, P.J. Halogen Bonding: An Underestimated Player in Membrane-Ligand Interactions. J. Am. Chem. Soc. 2021, 143, 4253–4267. [Google Scholar] [CrossRef]
  22. Mu, K.; Zhu, Z.; Abula, A.; Peng, C.; Zhu, W.; Xu, Z. Halogen Bonds Exist between Noncovalent Ligands and Natural Nucleic Acids. J. Med. Chem. 2022, 65, 4424–4435. [Google Scholar] [CrossRef]
  23. Pauling, L. The Nature of the Chemical Bond and the Structure of Molecules and Crystals, 3rd ed.; Cornell University Press: Ithaca, NY, USA, 1939; 429p. [Google Scholar]
  24. Kitaigorodskii, A.I. Organic Chemical Crystallography; Consultants Bureau: New York, NY, USA, 1961; 541p. [Google Scholar]
  25. Bondi, A. van der Waals Volumes and Radii. J. Phys. Chem. 1964, 68, 441–451. [Google Scholar] [CrossRef]
  26. Zefirov, Y.V.; Zorkii, P.M. Statistical mean values of the van der Waals radii of organogenic elements. J. Struct. Chem. 1974, 15, 102–105. [Google Scholar] [CrossRef]
  27. Zefirov, Y.V.; Porai-Koshits, M.A. Mean-statistical value of the van der Waals radius of the fluorine atom. J. Struct. Chem. 1981, 21, 526–530. [Google Scholar] [CrossRef]
  28. Zefirov, Y.V.; Porai-Koshits, M.A. Geometry of halogen-halogen specific interactions in organic crystals. J. Struct. Chem. 1986, 27, 239–244. [Google Scholar] [CrossRef]
  29. Nyburg, S.C.; Faerman, C.H. A Revision of van der Waals Atomic Radii for Molecular Crystals: N, O, F, S, Cl, Se, Br and I Bonded to Carbon. Acta Cryst. B 1985, 41, 274–279. [Google Scholar] [CrossRef]
  30. Chernyshov, I.Y.; Ananyev, I.V.; Pidko, E.A. Revisiting van der Waals Radii: From Comprehensive Structural Analysis to Knowledge-Based Classification of Interatomic Contacts. ChemPhysChem 2020, 21, 370–376. [Google Scholar] [CrossRef]
  31. Zefirov, Y.V.; Zorkii, P.M. Van der Waals radii and their application in chemistry. Russ. Chem. Rev. 1989, 58, 421–440. [Google Scholar] [CrossRef]
  32. Grineva, O.V. Organic homomolecular crystals with the shortest period of about 4 Å: The role of halogen atoms according to the CSD statistics. Acta Cryst. A 2021, 77, C801. [Google Scholar] [CrossRef]
  33. Grineva, O.V. Space group distributions differ considerably among subsets of organic crystal compounds: The impact of metal content and residue number. CrystEngComm 2023, 25, 1467–1470. [Google Scholar] [CrossRef]
  34. Brammer, L.; Espallargas, G.M.; Libri, S. Combining metals with halogen bonds. CrystEngComm 2008, 10, 1712–1727. [Google Scholar] [CrossRef]
  35. Troff, R.W.; Mäkelä, T.; Topić, F.; Valkonen, A.; Raatikainen, K.; Rissanen, K. Alternative Motifs for Halogen Bonding. Eur. J. Org. Chem. 2013, 2013, 1617–1637. [Google Scholar] [CrossRef]
  36. Thangavadivale, V.; Aguiar, P.M.; Jasim, N.A.; Pike, S.J.; Smith, D.A.; Whitwood, A.C.; Brammer, L.; Perutz, R.N. Self-complementary nickel halides enable multifaceted comparisons of intermolecular halogen bonds: Fluoride ligands vs. other halides. Chem. Sci. 2018, 9, 3767–3781. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  37. Alizadeh, V.; Mahmoudi, G.; Vinokurova, M.A.; Pokazeev, K.M.; Alekseeva, K.A.; Miroslaw, B.; Khandar, A.A.; Frontera, A.; Safin, D.A. Spodium bonds and metal–halogen···halogen–metal interactions in propagation of monomeric units to dimeric or polymeric architectures. J. Mol. Struct. 2022, 1252, 132144. [Google Scholar] [CrossRef]
  38. Groom, C.R.; Bruno, I.J.; Lightfoot, M.P.; Ward, S.C. The Cambridge Structural Database. Acta Cryst. B 2016, 72, 171–179. [Google Scholar] [CrossRef] [PubMed]
  39. Bruno, I.J.; Cole, J.C.; Edgington, P.R.; Kessler, M.; Macrae, C.F.; McCabe, P.; Pearson, J.; Taylor, R. New software for searching the Cambridge Structural Database and visualizing crystal structures. Acta Cryst. B 2002, 58, 389–397. [Google Scholar] [CrossRef]
  40. Streek, J.V.D. Searching the Cambridge Structural Database for the ‘best’ representative of each unique polymorph. Acta Cryst. B 2006, 62, 567–579. [Google Scholar] [CrossRef]
Figure 1. Distributions of the lengths (d, Å) of intermolecular distances C–Hal...Hal–C in crystals of the Org set, not exceeding 3.300, 4.300, 4.440, and 4.780 Å for F...F ((a) has a step of 0.2 Å, (b) has a step 0.1 Å)), Cl...Cl (c), Br...Br (d), and I...I (e), respectively. The red lines are the approximations by the Gaussian function.
Figure 1. Distributions of the lengths (d, Å) of intermolecular distances C–Hal...Hal–C in crystals of the Org set, not exceeding 3.300, 4.300, 4.440, and 4.780 Å for F...F ((a) has a step of 0.2 Å, (b) has a step 0.1 Å)), Cl...Cl (c), Br...Br (d), and I...I (e), respectively. The red lines are the approximations by the Gaussian function.
Ijms 24 11911 g001aIjms 24 11911 g001b
Figure 2. Distributions of the lengths (d, Å) of the shortest intermolecular distances C–Hal...Hal–C (one of each substance) in crystals of the Org set: F...F (a), Cl...Cl (b), Br...Br (c), I...I (d). The red lines are the approximations by the Gaussian function.
Figure 2. Distributions of the lengths (d, Å) of the shortest intermolecular distances C–Hal...Hal–C (one of each substance) in crystals of the Org set: F...F (a), Cl...Cl (b), Br...Br (c), I...I (d). The red lines are the approximations by the Gaussian function.
Ijms 24 11911 g002
Figure 3. Values of Hal-aggregation coefficients for M–Hal bonds (kMHal-agg) for metals with more than 20 M–Hal bonds in the set (a) and more than 40 bonds (b).
Figure 3. Values of Hal-aggregation coefficients for M–Hal bonds (kMHal-agg) for metals with more than 20 M–Hal bonds in the set (a) and more than 40 bonds (b).
Ijms 24 11911 g003aIjms 24 11911 g003b
Table 1. Results of search in the CSD for substances with C–Hal bonds and C–Hal...Hal–C distances satisfying the Hal-aggregation statistical criterion proposed in [9] (N is the number of substances with C–Hal bonds, Ntot is the total number of substances in the sets Org and Orgmet, and Nagg is the number of substances with distances satisfying the Hal-aggregation criterion with RHal from [26,27,28]).
Table 1. Results of search in the CSD for substances with C–Hal bonds and C–Hal...Hal–C distances satisfying the Hal-aggregation statistical criterion proposed in [9] (N is the number of substances with C–Hal bonds, Ntot is the total number of substances in the sets Org and Orgmet, and Nagg is the number of substances with distances satisfying the Hal-aggregation criterion with RHal from [26,27,28]).
C–Hal BondsC–Hal...Hal–C Distances
TypeOrgOrgmetTypeOrgOrgmet
NN/Ntot, %NN/Ntot, %NaggNagg/N, %NaggNagg/N, %
C–F96827.038415.0C–F...F–C406942.0248164.6
C–Cl14,70310.626853.5C–Cl...Cl–C772952.6159259.3
C–Br99607.213591.8C–Br...Br–C454445.680459.2
C–I17921.32260.3C–I...I–C104258.114061.9
Table 2. The CSD refcodes for the five shortest C–Hal...Hal–C distances in homomolecular crystals of organic compounds at normal pressure and room temperature (d is the contact length in Å and in % with respect to 2.80, 3.80, 3.94, and 4.28 Å (2RHal [26,27,28]) for F...F, Cl...Cl, Br...Br, and I...I, respectively).
Table 2. The CSD refcodes for the five shortest C–Hal...Hal–C distances in homomolecular crystals of organic compounds at normal pressure and room temperature (d is the contact length in Å and in % with respect to 2.80, 3.80, 3.94, and 4.28 Å (2RHal [26,27,28]) for F...F, Cl...Cl, Br...Br, and I...I, respectively).
F...FdCl...CldBr...BrdI...Id
Å%Å%Å%Å%
PUCREL1.49854MIWHOP012.83575XACXEE2.99276WEWGAJ3.56783
PUCREL1.55656PORWIC3.05080DBRDOX3.14880QASXOA3.60284
FFMXZP1.77463ZIVKIY3.06481KOVTEX3.24482NUFCIC3.61684
JODVAZ2.18178CLMPMO3.07981HEGCII3.25683DEQKUI3.61884
VAWNUE2.18778TIJVEN3.09481ADOPOB3.26783VALRAD3.64485
Table 3. Characteristics of the C–Hal...Hal–C intermolecular distances in initial arrays (Initial) and used in statistical analysis (In analysis).
Table 3. Characteristics of the C–Hal...Hal–C intermolecular distances in initial arrays (Initial) and used in statistical analysis (In analysis).
Ragg, ÅOrgOrgmet
InitialIn AnalysisInitialIn Analysis
NumberShortest, Å NumberShortest, Å /2RHal, %NumberShortest, Å NumberShortest, Å /2RHal, %
C–F...F–C3.30014,5331.49814,5142.50389.410,7971.96010,7932.35384.0
3.50021,01320,98516,45616,451
C–Cl... Cl–C4.30022,3612.83522,3593.05080.348392.09648283.02879.7
C–Br... Br–C4.44095012.99294723.14879.918331.95418213.17480.6
C–I... I–C4.78021773.56721773.56783.33963.6483963.64885.2
Table 4. The main parameters for describing the distribution of the lengths of all C–Hal...Hal–C intermolecular distances less than the dmax values in crystals of the Org and Orgmet sets using the Gaussian function with different histogram steps (xm is the position of the maximum, r is the correlation coefficient).
Table 4. The main parameters for describing the distribution of the lengths of all C–Hal...Hal–C intermolecular distances less than the dmax values in crystals of the Org and Orgmet sets using the Gaussian function with different histogram steps (xm is the position of the maximum, r is the correlation coefficient).
DistancesF...FCl...Cl
≤4.300 Å
Br…Br
≤ 4.440 Å
I…I
≤ 4.780 Å
≤3.300 Å≤3.500 Å
Step, Å0.10.20.10.20.10.20.10.20.10.2
Org
Total14,51420,98522,35994722177
xm3.1563.1413.2423.2393.8783.8794.0674.0664.3844.388
r20.99710.9790.9870.9800.9870.9800.9900.9670.983
Orgmet
Total10,79316,45148281821396
xm3.1993.1863.3003.2893.8913.8934.0504.0444.4094.262
r20.9990.99990.9890.9960.9780.9930.9770.9970.9070.980
Table 5. The main parameters for describing the distribution of the lengths of the shortest intermolecular distances C–Hal…Hal–C (one of each substance) less than the dmax values in crystals of the Org and Orgmet sets using the Gaussian function with different histogram steps (xm is the position of the maximum, r is the coefficient correlations).
Table 5. The main parameters for describing the distribution of the lengths of the shortest intermolecular distances C–Hal…Hal–C (one of each substance) less than the dmax values in crystals of the Org and Orgmet sets using the Gaussian function with different histogram steps (xm is the position of the maximum, r is the coefficient correlations).
DistancesF…F ≤ 3.500 ÅCl…Cl ≤ 4.300 ÅBr…Br ≤ 4.440 ÅI…I ≤ 4.780 Å
Org
Total4744772845431042
Step, Å0.10.20.10.20.10.20.10.2
xm3.0283.0393.7303.7293.9263.9274.2324.228
r20.9110.9220.9120.9380.9370.9640.9050.952
Orgmet
Total27511591803140
Step, Å0.10.20.10.20.10.20.10.2
x2.9772.9843.7023.7063.8933.8914.0494.102
r20.9490.9660.9510.9780.9260.9500.3730.765
Table 6. Differences between the positions of the maxima (x) of the Gaussian functions for different variants of histograms for the C–Hal… Hal–C distances in the Org and Orgmet sets (All—all contacts less or equal dmax, (Å); First—one shortest contact from each substance, 0.1 is a histogram with a step of 0.1 Å, and 0.2 is a histogram with a step of 0.2 Å).
Table 6. Differences between the positions of the maxima (x) of the Gaussian functions for different variants of histograms for the C–Hal… Hal–C distances in the Org and Orgmet sets (All—all contacts less or equal dmax, (Å); First—one shortest contact from each substance, 0.1 is a histogram with a step of 0.1 Å, and 0.2 is a histogram with a step of 0.2 Å).
Type of Hal…Haldmaxx0.2x0.1xOrgxOrgmetxallxfirst
OrgOrgmet0.10.20.2
AllFirstAllFirstAllFirstAllFirstOrgOrgmet
F…F 3.50−0.0030.011−0.0110.007−0.0580.051−0.0500.0550.2000.305
Cl…Cl4.300.001−0.0010.0020.004−0.0130.028−0.0140.0230.1500.187
Br…Br4.44−0.0010.001−0.006−0.0020.0170.0330.0220.0360.1390.153
I…I 4.780.004−0.004−0.1470.053−0.0250.1830.1260.1260.1600.160
Table 7. Estimation of the van der Waals radii of halogen atoms (Å) by the formula (xall + xfirst)/4, where x is a position of maximum of Gaussian function used for description of the subset of the C–Hal…Hal–C distances (All—all contacts with ≤dmax; First—one shortest contact from each substance ≤ dmax; Org—homomolecular crystals of compounds; Orgmet—homomolecular crystals of organometallic compounds).
Table 7. Estimation of the van der Waals radii of halogen atoms (Å) by the formula (xall + xfirst)/4, where x is a position of maximum of Gaussian function used for description of the subset of the C–Hal…Hal–C distances (All—all contacts with ≤dmax; First—one shortest contact from each substance ≤ dmax; Org—homomolecular crystals of compounds; Orgmet—homomolecular crystals of organometallic compounds).
HalOrgOrgmet
Step, Å0.10.20.10.2
F 1.571.571.571.57
Cl1.901.901.901.90
Br2.002.001.981.98
I2.152.152.112.09
Table 8. Comparison of the van der Waals radii of halogen atoms (Å) obtained in this work with some literature data.
Table 8. Comparison of the van der Waals radii of halogen atoms (Å) obtained in this work with some literature data.
HalThis WorkPrevious Values
Pauling [23]Kitaigorodskii [24]Bondi [25]Zefirov et al.Chernyshov et al. [30]
F 1.571.35 1.471.40 [27]1.55
Cl1.901.801.781.751.90 [26]1.91
Br2.00 (Org), 1.98 (Orgmet) 1.951.951.851.97 [28]2.00
I2.15 (Org), 2.10 (Orgmet)2.152.11.982.145 [28]2.17
Table 9. Number of different metal atoms (M) forming M–Hal bonds in homomolecular crystals, including those with >20 and >40 symmetrically independent bonds with a specific Hal.
Table 9. Number of different metal atoms (M) forming M–Hal bonds in homomolecular crystals, including those with >20 and >40 symmetrically independent bonds with a specific Hal.
HalFClBrI
At least one M–Hal29645357
>20 M–Hal4412721
>40 M–Hal0351916
Table 10. Statistical data for the M–Hal groups (NM–Hal is the number of symmetrically independent bonds; NHal…Hal is the number of distances M1–Hal1…Hal2–M2 ≤ Ragg; kMHal-agg is the statistical Hal-aggregation coefficient for M–Hal bonds, M1 = M2, Hal1 = Hal2; RHal values determined in this work were used to calculate Ragg, kMHal-agg = NHal…Hal/NM–Hal).
Table 10. Statistical data for the M–Hal groups (NM–Hal is the number of symmetrically independent bonds; NHal…Hal is the number of distances M1–Hal1…Hal2–M2 ≤ Ragg; kMHal-agg is the statistical Hal-aggregation coefficient for M–Hal bonds, M1 = M2, Hal1 = Hal2; RHal values determined in this work were used to calculate Ragg, kMHal-agg = NHal…Hal/NM–Hal).
MFClBrI
NM–HalNHal…HalkMHal-aggNM–HalNHal…HalkMHal-aggNM–HalNHal…HalkMHal-aggNM–HalNHal…HalkMHal-agg
1Ag0083170.203870.181400
2Al310.332442290.9438340.8918271.50
3Au005003610.7291820.9070230.33
4Ba0010000300
5Be00830.38200320.67
6Bi30096360.3847200.4220140.70
7Ca00510.20400500
8Cd00263500.1978250.321951190.61
9Ce005000000
10Co4005831210.21102210.2087290.33
11Cr00148140.0915001200
12Cs002000000
13Cu20014654370.304601290.28229260.11
14Dy004050.1260000
15Er00910.1100100
16Eu001130.2700100
17Fe200409860.215290.1789370.42
18Ga133.001651170.7126411.5827260.96
19Gd001530.2010000
20Ge8001541801.1721130.6213110.85
21Hf100097170.18840.50520.40
22Hg004613670.801901820.962452671.09
23Ho001030.300000
24In221.0099380.3831220.7128260.93
25Ir00461320.0716006590.14
26K0000200500
27La001330.23300200
28Li00700700500
29Lu00800100200
30Mg001230.252520.08500
31Mn111.00242390.1699100.102230.14
32Mo620.33310750.24101120.1297270.28
33Na00100200600
34Nb3002091490.719171.89400
35Nd002530.12100100
36Ni200389380.10257360.145390.17
37Np003000000
38Os100153340.223990.232040.20
39Pb003260.1912001960.32
40Pd720.2821332820.13281250.09227550.24
41Pr001630.190000
42Pt420.5015305330.35132450.342481640.66
43Pu0010000122.00
44Rb0010010000
45Re2006701530.23271630.2340160.40
46Rh300619510.084560.1379110.14
47Ru20015331740.1174150.2081200.25
48Sb35762.172052261.1065430.6628160.57
49Sc00820.2500100
50Sm002430.122001910.05
51Sn2120.1010804730.44172900.5290880.98
52Sr0000221.00400
53Ta310.33208910.441040.40300
54Tb00610.170000
55Tc0069120.17810.12100
56Th002620.08100111.00
57Ti35008453480.4139160.41610.17
58Tl1001480.578131.62441.00
59Tm0020000100
60U441.0096120.121810.061110.09
61V410.25135690.51410.25810.12
62W23271.173561740.494970.1486330.38
63Y00200000210.50
64Yb002610.044001000
65Zn2009533220.342621090.421881150.61
66Zr5005791080.191380.621450.36
Table 11. The main parameters for describing the length distribution of all intermolecular M–Hal...Hal–M distances (All) and the shortest ones in each substance (First) less than the Ragg values in crystals of the MHal set by the Gaussian function with a histogram step of 0.2 Å (xm is the maximum position; r is the correlation coefficient; when calculating Ragg, the RHal values determined in this work were used).
Table 11. The main parameters for describing the length distribution of all intermolecular M–Hal...Hal–M distances (All) and the shortest ones in each substance (First) less than the Ragg values in crystals of the MHal set by the Gaussian function with a histogram step of 0.2 Å (xm is the maximum position; r is the correlation coefficient; when calculating Ragg, the RHal values determined in this work were used).
DistancesF...F ≤ 3.640 ÅCl...Cl ≤ 4.300 ÅBr...Br ≤ 4.480 ÅI...I ≤ 4.800 Å
SubsetAllFirstAllFirstAllFirstAllFirst
Total124335554315611276531203686
xm3.3003.5004.0834.1094.2184.1654.5934.449
r20.9980.7020.9960.9950.99970.99970.9980.994
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Grineva, O.V. Comparison of Intermolecular Halogen...Halogen Distances in Organic and Organometallic Crystals. Int. J. Mol. Sci. 2023, 24, 11911. https://doi.org/10.3390/ijms241511911

AMA Style

Grineva OV. Comparison of Intermolecular Halogen...Halogen Distances in Organic and Organometallic Crystals. International Journal of Molecular Sciences. 2023; 24(15):11911. https://doi.org/10.3390/ijms241511911

Chicago/Turabian Style

Grineva, Olga V. 2023. "Comparison of Intermolecular Halogen...Halogen Distances in Organic and Organometallic Crystals" International Journal of Molecular Sciences 24, no. 15: 11911. https://doi.org/10.3390/ijms241511911

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop