Next Article in Journal
Germanium-Based Temperonic Crystal
Previous Article in Journal
Metal–Organic Framework for Plastic Depolymerization and Upcycling
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Incomplete Absorption Correction Results in an Increased Positive Mean Value of Weighted Residuals

DataQ Intelligence UG, Fichtelgebirgsstrasse 66, 95447 Bayreuth, Germany
Crystals 2025, 15(10), 898; https://doi.org/10.3390/cryst15100898
Submission received: 14 August 2025 / Revised: 6 October 2025 / Accepted: 8 October 2025 / Published: 16 October 2025
(This article belongs to the Section Inorganic Crystalline Materials)

Abstract

Incomplete absorption correction procedures in single-crystal diffraction experiments leave a characteristic trace—a “fingerprint”—in the residuals. Specifically, weak intensities are systematically overestimated, contributing disproportionately and sometimes even dominantly to the chi-square sum in least squares refinements. An analysis of six published crystal structures spanning a wide range of absorption coefficients reveals a consistent positive shift of the weighted residuals, which were significant for crystals with > 5.02 mm−1. This shift is all the stronger the greater the absorption coefficient and is accompanied by a proportionally increasing fraction of positive excess residuals. The simultaneous increase in the mean value of the residuals and the fraction of positive excess residuals proves that the shift is not caused by strong reflections or isolated outliers, but rather by the systematic overestimation of many weak intensities. Diagnostic plots and statistical metrics are presented for additional published data sets, supporting the generality of the findings. These findings can support the development of improved methods for absorption correction, which lead to physically meaningful thermal motion parameters even with strong absorption.

1. Introduction

In modern single-crystal X-ray diffraction experiments, the raw intensities I r a w and corresponding standard uncertainties are obtained by integration of the diffraction images from area detectors. These are subjected in a second step to data correction and data reduction steps in order to obtain the Bragg intensity. Many things have to be considered in this second step, including possible changes in the irradiated crystal volume, Lorentz, polarization corrections, crystal decay, beam intensity changes, and absorption effects. The main basic methods for absorption correction procedures are (i) exact numerical corrections, like analytical [1] and Gaussian quadrature [2] methods or—as an example for new developments—those involving ray tracing [3] based on tomographic reconstructions (ii) and semi-empirical corrections like Ψ scans [4] or multiscan [5] methods. Semi empirical methods need additional intensity measurements. Traditional analytical methods may not be applicable when the crystal is irregularly shaped or too small to index the faces, a problem that may be solved with the method of tomographic reconstruction. Empirical methods do not require knowledge of the crystal geometry; they minimize the differences between incident and diffracted beams based on Fourier series [6,7] or spherical harmonics [8]. The latter is implemented also in SADABS [9] with the difference that [8] applied spherical harmonics to both the incident and the diffracted beam, whereas in SADABS, the spherical harmonics are only applied to the diffracted beam and the incident beam is modeled with the help of a scale factor S ( n ) that changes with the frame number n:
I c o r r e c t e d = I r a w S ( n ) P ( u , v , w ) Q ( μ r , 2 θ ) ,
with corrected intensity I c o r r e c t e d ; raw intensity I r a w ; incident beam scale factor S ( n ) with the frame number n; diffracted beam factor P ( u , v , w ) based on spherical harmonics, where u, v, and w are the direction cosines of the diffracted beam relative to crystal-fixed axes and a spherical crystal factor Q ( μ r , 2 θ ) , where r is the effective radius of the crystal and μ the linear absorption coefficient; and Q ( μ r , 2 θ ) is a correction factor to take into account the 2 θ dependence, as equivalent reflections with different paths through the crystal still share the same 2 θ Bragg angle. The correction procedure implemented in SADABS is of great importance due to the wide distribution of this software.
Generally known consequences of insufficient absorption corrections are, for examplem non-positive definite anisotropic displacement parameters (ADPs), and cigar-shaped or oblate thermal motion ellipsoids, in short, non-physical thermal motion. But, accurate description of thermal motion plays a crucial role, for example, to clearly discriminate between disorder and thermal motion or when extracting physical properties like thermodynamic properties from the ADPs [10]. In macromolecular crystallography, insufficient absorption correction methods at long wavelengths limit sulfur SAD phasing experiments [3].
In the present work, error profiling is employed in order to characterize weaknesses of absorption correction procedures. The idea of error profiling in small molecule crystallography—where residuals usually do not receive much attention—is to relate the source of a systematic error with the appearance in the weighted residuals. In small molecule crystallography, application of a weighting scheme is very popular. Typically, a SHELXL [11]-like weighting scheme is applied with two parameters a and b. The weight is then defined as w = 1 s . u . 2 ( I o b s ) + ( a P ) 2 + b P , where s . u . ( I o b s ) are the standard uncertainties of the observed intensities and P is a parameter that represents the intensity and is obtained from averaging observed and calculated intensities according to a procedure suggested by Wilson [12]. Please note that this type of weighting scheme is not able to handle the simple error of too small s . u . ( I o b s ) for weak intensities [13]. Back to how the source of a systematic error is connected to the appearance in the weighted residuals for error profiling: This is achieved by using as many and as different descriptors of the residuals as possible to describe the appearance of the error in the residuals in such great detail that a highly specific and recognizable fingerprint of the systematic error under investigation is obtained. This approach hinges crucially on the availability of metrics describing the weighted residuals. Important recent metrics developed for this purpose like the mean value of the weighted residuals, ζ , the mean value of the weighted residuals separately for only positive ( ζ + ) and for only negative residuals ( | ζ | ), and so forth are described in greater detail further down. The implicit reference in error profiling is always the error-free case in which the residuals are symmetrically distributed (with respect to zero) random variables that do not change with resolution, intensity, and other entities. For example, when systematic errors are absent is the weighted residuals distributed such that the mean value is close to zero, ζ 0 , and such that positive and negative fluctuations in the residuals appear equally often and with approximately equal amplitude. From this requirement follows then, for example, that the mean value from positive residuals and the absolute mean value from negative residuals are equal ζ + | ζ | within the limits given by stochastic fluctuations and that the integer number of positive residuals # ζ + is equal to the integer number of negative residuals # ζ within the limits given by statistical fluctuations. The second important condition for successful error profiling is to obtain data sets with the error in question—but with no other severe errors. Other errors may distort, obstruct, or even reverse some or even all of the details of the fingerprint traces to the point where it is not specific or not even meaningful any more. This can be achieved either by using simulations or by using data from important “proof of principle” studies from well-known crystallographers with a reputation for high quality. The second approach will be used in this work, when analyzing published data that is intended to show that “the IUCr publication requirement that a numerical absorption correction must be applied for strongly absorbing crystals is in need of revision.” [9]. The weighted residuals are treated in the error profiling Ansatz like a very important and central entity that keeps all the relevant information about possible improvements in general and for absorption correction procedures in particular. Error profiling with the help of the weighted residuals has been used successfully earlier for developing new metrics for data sets contaminated by low energy radiation [14] and for characterizing a specific error which appears due to the ineptitude of a SHELXL-like weighting scheme to handle too small standard uncertainties for weak reflections [13]. Error profiling does not only give characteristic fingerprint traces for the identification of systematic errors in diffraction data, it also helps organize types of errors by providing the categories and metrics that are used or invented to describe systematic errors. As an example for such a new metric, the mean value of the weighted residuals will be discussed further down and applied in the context of absorption correction procedures. But, error profiling with the weighted residuals as central entity achieves even more: it facilitates the comprehension of the connection between the sources of systematic errors and the appearance of systematic errors in the weighted residuals. This is of great importance in order to make progress in the field of better understanding systematic errors. Existing tools like the normal probability plot (npp) are also very helpful in proving the existence of systematic errors in a given data set, but the npp alone does not give a clue to the origin of the error, which leaves the experimenter in the embarrassing situation of knowing about the existence of systematic errors but not about the cause or the steps necessary to remove it. A better comprehension of systematic errors may help close this gap. And finally, error profiling guides the way to methodological improvements—this is not limited to absorption correction procedures but includes all data reduction and data correction methods. For example, future protocols for improved absorption correction procedures may use the (missing) fingerprint traces developed in this work for proof of progress.
One additional and very important aspect of error profiling is that it necessarily leads to a critical review of the existing and accepted data quality evaluation protocols and associated metrics. Weaknesses and dangers of the existing protocols may be identified in this way. This is more important as it appears at first glance: We will see in this work that insufficient absorption correction procedures lead one-sidedly more often to the overestimation of observed intensities.
Overestimated intensities, in turn, lead to (i) artificially low agreement factors, which is a separate topic and cannot be discussed fully in the present work but is briefly touched upon. Furthermore, (ii) the anisotropic displacement parameters are also artificially low in the case of overestimated intensities, which result from underestimated absorption in combination with scaling. That underestimation of absorption leads to artificially low ADPs has been known for a long time; however, the question is whether this knowledge is considered in the state of the art of current absorption procedures. Low agreement factors and small thermal motion parameters are both often perceived as signs of high-quality structure refinement and, to a certain extent, they really are; however, certain errors may lead to unrealistic and artificially lowered values. In these cases, the results only appear to be of high quality but need not be of high quality in reality.
The present work aims to establish precise and specific traces of incomplete absorption in the residuals which are easy to recognize even for laymen. Moreover, experts can use these traces to develop even better absorption correction procedures. And finally, it is intended to raise awareness of applying metrics critically in diffraction experiments.
In Section 2, the methodology and the crystallographic structures from the literature are described. A minimal amount of background knowledge about how the mean value of the weighted residuals can be used for data evaluation is provided in Section 3.1 in the main part. How the mean value of the weighted residuals is connected to specific systematic errors is briefly discussed in Section 3.2. In Section 3.3, the mean value of the weighted residuals ζ is shown to tend to increase with the linear absorption coefficient μ by analyzing the published data sets from a study with a set of structures covering an extremely large range of absorption coefficients and including highly absorbing structures. In Section 3.4, a rationale is developed to understand the systematic connection empirically found in the preceding paragraph. From this rationale, expected characteristic traces of incomplete absorption correction in the residuals are derived in Section 3.5. Precisely, these specific traces are found in the above mentioned published data sets in the case of strongly absorbing crystals but they are not found in the weakly absorbing crystal of the same study as shown in Section 3.6, which is taken as evidence for a correct assignment of specific traces of incomplete absorption correction. These specific traces are searched for and found in published data sets with significant absorption from other authors, in order to not be limited to a small number of data sets from a single study (Section 3.7). Section 3.8 focuses on aspects of quantification of these traces. The rationale developed in Section 3.4 is put to another test in Section 3.9. The results are discussed in Section 4. Finally, a summary follows in Section 5.

2. Methods

The main method is to apply error profiling methods to analyze published data sets with the help of residual descriptors in order to identify and quantify traces in the residuals and connect these to incomplete or insufficient absorption correction procedures, for example, by showing that these traces are more pronounced for strong absorbers and missing for weak absorbers. The most important descriptors used are the mean value of the weighted residuals ζ ; the significance of the mean value of the weighted residuals ζ / σ ( ζ ) , which indicates whether a deviation ζ 0 is within the limits of statistical fluctuations; the mean value of positive residuals ζ + and the absolute mean value of negative residuals | ζ | , with the comparison of these latter two indicating the (missing) existence of the symmetry of the distribution of residuals with respect to zero; and the fraction of positive excess residuals ( # ζ + # ζ ) / N o b s and its significance ( # ζ + # ζ ) / N o b s . When the fraction of positive excess residuals and ζ simultaneously show a large positive or a large negative value, this indicates a shift in the distribution of the residuals from zero rather than individual large outliers, causing ζ 0 . This may be confirmed by ζ + | ζ | . These descriptors are introduced in greater detail in the main part. They are applied to six crystals with a large range of absorption coefficients taken from one publication [9], where the data were collected on Bruker Smart Apex II systems (Tokyo, Japan) with a D8 three-circle goniometer at 100 K. Data reduction was accomplished with SAINT (version 7.68A, [15]) from the APEX2 software (version 2.2012.2-0; [15], with refined box sizes for data integration. Data scaling and absorption correction were achieved with SADABS (version 2014/04). The importance of [9] lies in applying multiscan absorption methods to strongly absorbing crystals, where data were collected with microfocus technology, which is known for having a anisotropic beam profile that makes a uniform homogeneous sample illumination impossible. The claim is that multiscan methods work generally well with microfocus technology even in the case of crystals being too large for the highly focused beam. The six crystals used were (1) scandium platinate (spacegroup P b a m ), (2) murdochite ( F m 3 ¯ m ), (3) sodium tungstate ( P b c a ), (4) scandium cobalt carbide ( I m m m ), (5) a dibromoacridine derivative ( P 2 1 / n ), and (6) an inorganic cobalt complex ( P 2 1 / n ) (Table 1). The traces of insufficient absorption correction in the weighted residuals are established with these six crystals and searched for in other published structures (Crystal 7–13) with μ > 6 mm−1. From those, Crystal 7 is the same structure as Crystal 3, but in contrast to Crystal 3, absorption correction was performed with face indexing methods in Crystal 7. In Crystals 8, 9, 12, and 13, multiscan methods were used with SADABS; in Crystal 10, Gaussian absorption correction methods were used with Agilent Software Crysalis PRO [16]; and in Crystal 11, multiscan methods were used with Rigaku Software Crysalis PRO [17]. For the critical reader, diagnostic plots are shown for some additional structures [18,19,20,21] with small absorption coefficients μ < 0.16 mm−1 to demonstrate absence of the typical signs of insufficient absorption correction like in Crystal 6. These examples show that even face indexing absorption correction methods are affected by signs of insufficient absorption correction and that signs of insufficient absorption correction appear with different software.

3. Main Part

3.1. The Mean Value of Residuals Is Generally Connected to Systematic Errors

The mean value of the weighted residuals, ζ = 1 N o b s i = 1 N o b s ζ i , is centered around zero when the weighted residuals ζ i = I o b s , i I c a l c , i σ ( I o b s , i ) , where i = h k l abbreviates the Miller triples, are unbiased random variables. σ ( I o b s ) is the assigned weight. The summation is over the N o b s reflections participating in the least squares refinement. The angle brackets thus just indicate the arithmetic mean. The distribution of the weighted residuals is often associated with a Gaussian distribution. However, statistical fluctuations lead naturally to deviations of the mean value of the residuals from zero. Whether or not such a deviation is still in accordance with the assumption of random errors can easily be quantified by the significance of the deviation of the mean value of the residuals from zero:
ζ σ ( ζ ) = 1 N o b s i = 1 N o b s ζ i v a r ( ζ ) / N o b s
v a r ( ζ ) = 1 N o b s 1 i = 1 N o b s ζ i ζ 2
which is obtained by dividing the mean value of the residuals ζ by the square root of the population variance v a r ( ζ ) of the residuals over N o b s [14,32].
Significant deviations of the mean value of the residuals ( ζ σ ( ζ ) > 3 ) are widespread: from 127 highlighted data sets published with IUCrData, 52% did show significant deviations. The mean significance was 4.30 [32]. Significant positive values of the mean values of the residuals were found to appear more often then negative ones. These results give an overriding importance to the mean value of the weighted residuals as a data quality metric as it has the following pleasing characteristics: it is relevant, easy to understand, easy to apply, and easy to interpret. Significant deviations from zero indicate the presence of systematic errors—regardless of the results from G o F , w R ( F 2 ) , or other metrics. The present work shows how the mean value of the weighted residuals is connected to absorption correction procedures and which specific traces in the residuals characterize incomplete absorption correction.

3.2. More Specific Relationship Between the Mean Value of Residuals and Systematic Errors

It has been demonstrated earlier that, e.g., modeling of disorder and of twinning may reduce the significance of the shift in the mean value of the residuals, ζ σ ( ζ ) , and associated metrics like the significance of the number of positive excess residuals as given by # ζ + # ζ N o b s , where # ζ + denotes the integer number of residuals with positive signs and # ζ denotes the integer number of residuals with negative signs in a set of N o b s = # ζ + + # ζ residuals [14].
Factors that increase the mean value of the residuals were found to be not modeled disorder, missed twinning, low energy contamination [14], application of a significance or an intensity cut off, as well as missing electrons (missing scattering mass). Factors that decrease the mean value of the residuals may occur self evidently from missed extinction, beam shadowing, and detector saturation. Not so evidently, underestimated s . u . ( I o b s ) for weak intensities in combination with the SHELX default “OMIT -2” command can lead to distinct negative shifts [13] as well as excess scattering mass. (The results regarding significance and intensity cut offs, and missing and excess scattering mass are not yet published. The manuscript is still in preparation.)
The strategy in previous publications was to develop specific criteria, describing how exactly certain systematic errors affect the weighted residuals (fingerprint and “smoking gun”) and to quantify these expected traces by corresponding metrics or at least to visualize these in appropriately chosen graphics. In the present work, the same strategy is adopted to investigate absorption correction procedures.

3.3. Absorption Correction: Application to Strongly and Weakly Absorbing Crystals from the Literature

In [9], six crystals of different sizes and with absorption coefficients varying between 1.53 and 121.02 mm−1 were measured at 100 K on a Bruker Apex II diffractometer with microfocus X-ray source and mirror optics. Data sets were taken with Ag Kα and Mo Kα radiation for the same crystals and processed with the same software. Some of the crystals were slightly or even clearly (Crystal 5) too big for being fully immersed in the highly focused microsource beam. The beam diameter at the focal point is given in [9] as 110 μ m for the Mo Kα and 90 μ m for Ag Kα. These values are taken from the literature [33]. The aim of the work was obviously to demonstrate the applicability and validity of an empirical absorption correction with microsource technology. The empirical correction procedure works by refining an incident beam scale factor and spherical harmonic coefficients in one step of the process. In another separate step, a spherical crystal term Q ( μ r , 2 θ ) with absorption coefficient μ and effective radius r is refined to take the θ dependence of the absorption process into account. The empirical absorption correction is much easier to apply compared to methods that require face indexing like analytical methods [1] and Gaussian quadrature [34]. Crystal 1 has the largest and Crystal 6 the lowest absorption coefficients. The absorption coefficients are given for Ag Kα and Mo Kα radiation in Table 1.
Figure 1 shows that (i) the mean value of the residuals is positive in almost all cases ζ > 0 ; (ii) the larger the absorption coefficient, the larger the mean value of the residuals, ζ μ ; and (iii) ζ ( # ζ + # ζ ) N o b s , where (iii) holds in all cases, and (i) and (ii) only with the exception of Crystal 2. ζ scatters stochastically around zero with positive and negative deviations in data sets without systematic errors.
Table 2. Residual descriptors of Crystals 1–13. Column two: anode material; column three: fraction of positive excess residuals (FOPERs) with # ζ + indicating the integer number of positive residuals and # ζ indicating the integer number of negative residuals, negative numbers indicating that negative residuals appear more often than positive ones, and data sets without systematic errors showing positive and negative values with the same probability; column four: significance of FOPER as given by a random walk approach with equal probabilities for positive and for negative steps; columns five and six: weighting scheme parameters a and b; column seven: total mean value of weighted residuals, with data sets without systematic errors showing insignificant small positive and negative values with the same probability; column eight: significance of deviation of mean value of weighted residuals from zero; column nine: bin scale factor K = I o b s I c a l c for bin 1 (lowest 10%) from in total 10 bins of intensity sorted reflections similar to, but not identical with, the numbers given in SHELXL output files due to differing bin widths; column ten: mean value of only positive residuals; column eleven: standard deviation of mean value of only positive residuals; column twelve: absolute mean value of only negative residuals; column eleven: standard deviation of absolute mean value of only negative residuals. For more information about the metrics ( # ζ + # ζ ) N o b s , ( # ζ + # ζ ) N o b s , ζ , ζ σ ( ζ ) , ζ + , σ ( ζ + ) , | ζ | , and σ ( | ζ | ) , see also [14,32].
Table 2. Residual descriptors of Crystals 1–13. Column two: anode material; column three: fraction of positive excess residuals (FOPERs) with # ζ + indicating the integer number of positive residuals and # ζ indicating the integer number of negative residuals, negative numbers indicating that negative residuals appear more often than positive ones, and data sets without systematic errors showing positive and negative values with the same probability; column four: significance of FOPER as given by a random walk approach with equal probabilities for positive and for negative steps; columns five and six: weighting scheme parameters a and b; column seven: total mean value of weighted residuals, with data sets without systematic errors showing insignificant small positive and negative values with the same probability; column eight: significance of deviation of mean value of weighted residuals from zero; column nine: bin scale factor K = I o b s I c a l c for bin 1 (lowest 10%) from in total 10 bins of intensity sorted reflections similar to, but not identical with, the numbers given in SHELXL output files due to differing bin widths; column ten: mean value of only positive residuals; column eleven: standard deviation of mean value of only positive residuals; column twelve: absolute mean value of only negative residuals; column eleven: standard deviation of absolute mean value of only negative residuals. For more information about the metrics ( # ζ + # ζ ) N o b s , ( # ζ + # ζ ) N o b s , ζ , ζ σ ( ζ ) , ζ + , σ ( ζ + ) , | ζ | , and σ ( | ζ | ) , see also [14,32].
Data SetAnode ( # ζ + # ζ ) N obs ( # ζ + # ζ ) N obs ab ζ ζ σ ( ζ ) K 1 [ I calc ] ζ + σ ( ζ + ) | ζ | σ ( | ζ | )
Crystal 1Mo0.305116.64440.02656.50300.395117.752.471.03680.01930.81010.0253
Crystal 2Mo−0.0351−0.62180.03191.83220.04570.541.051.17880.09221.01040.0706
Crystal 3Mo0.226519.72760.01031.13880.324725.131.880.98850.01240.72790.0106
Crystal 4Mo0.08522.26130.01740.05280.20194.311.240.98250.05420.72410.0384
Crystal 5Mo0.02331.25880.02601.62750.09234.781.370.82060.02200.67070.0154
Crystal 6Mo−0.0238−2.11100.02702.43920.01731.431.030.80630.01420.73500.0094
Crystal 1Ag0.298523.51980.00960.98790.372724.876.140.95510.01500.70520.0154
Crystal 2Ag−0.0597−1.65770.03810.00000.03690.771.061.05820.05460.86930.0395
Crystal 3Ag0.169116.95020.00460.85120.263323.681.400.97810.01040.74230.0093
Crystal 4Ag0.04781.90600.01560.02640.12164.481.020.86940.02880.70130.0232
Crystal 5Ag0.01020.55250.02331.40710.04752.491.110.80480.01950.72540.0166
Crystal 6Ag−0.0424−3.74520.02251.9900−0.0147−1.320.640.77200.01110.73750.0093
Crystal 7Mo0.11765.00520.02302.00630.19256.791.490.92580.03120.73630.0253
Crystal 8Mo0.09306.78600.04636.28380.11208.191.900.79080.01330.70600.0120
Crystal 9Mo0.092510.27150.01767.29770.134414.392.280.84110.00900.71640.0082
Crystal 10Cu0.148210.81230.01781.27620.165711.461.250.85450.01270.76270.0141
Crystal 11Cu0.125412.57300.03757.13060.154215.311.520.84340.00890.73250.0091
Crystal 12Mo0.323232.48120.0431226.79330.346641.4211.150.80210.00720.54400.0079
Crystal 13Cu0.242122.71270.000084.15550.353530.084.000.96240.01160.64440.0110

3.4. An Attempt to Rationalize ζ > 0 for Absorbing Crystals

The diffracted beams are weakened by absorption in proportion to the absorption coefficient μ and the path length x h k l of the diffracted beam through the crystal
I m e a s u r e d = I B r a g g exp μ x h k l .
In order to obtain a “Bragg” intensity not affected by absorption, the individual diffracted beams need to be multiplied by an individual factor exp μ x h k l > 1 :
I B r a g g = I m e a s u r e d exp + μ x h k l .
Absorption correction procedures therefore increase the measured intensities. The difficulty is that every individual diffracted beam may have another path length and that these path lengths may be very different for example for crystals with plate or needle shapes. When the path length is underestimated, x h k l < x t r u e , the assumed factor exp μ x h k l > exp μ x t r u e is too large (as the reduction from one is too low), and therefore, the correction factor exp + μ x h k l > 1 is too small. The Bragg intensity is therefore too small. When the path length is overestimated, the Bragg intensity is correspondingly too large.
Now suppose, for a slightly elongated crystal with dimensions, say, d m i n = d m i d 0.75 d m a x , a sphere is assigned with radius r e f f = 0.5 d m i n for absorption correction. The absorption correction would work well for a part of the diffracted beams, namely those that do not propagate along the extended direction. This subset of reflections is called the subset with adequate absorption correction. For the majority of reflections with a component propagating along the extended direction, the path length would be underestimated by assigning an effective radius r e f f = 0.5 d m i n . For these, the Bragg intensity is underestimated, as discussed above, when only Equation (5) is taken into account and scaling procedures are ignored. This subset of reflections is called the subset, with inadequate absorption correction leading to underestimated I o b s . As a result, a part of the Bragg intensities are adequate and a part is underestimated. The situation is sketched schematically in Figure 2a. The overall scale factor would compensate for the loss by the inadequate correction by magnifying all observed intensities. Let us assume that the reflections with adequate correction are the minority. From the many reflections with inadequate correction, which are all too small, multiplication with the overall scale factor will on average restore larger values that are closer to the unbiased, true value as in Figure 2b. However, the reflections with adequate correction will all tend to have slightly too large observed intensities.

3.5. Characteristic Traces of Incomplete/Insufficient Absorption Correction

The above thoughts result in the expectation of overestimated intensities in the case of incomplete absorption correction. The individual overestimation of I o b s may well be much smaller than the corresponding individual standard uncertainties in many or even all cases; however, it will be systematic and will, therefore, in particular, manifest as K = I o b s I c a l c > 1 in the lowest bin of the intensity dependent scale factor K. In other words, partial underestimation of absorption may lead via the overall scale factor to the following characteristic traces, which together may be used as a fingerprint of traces in the data characteristic of this specific error:
  • Many weak observed intensities, that are too large, which may, in turn, lead to invoking the weighting scheme parameter b > 0 ;
  • More positive residuals than negative residuals, # ζ + > # ζ , in particular for weak reflections, that are relatively the most affected by overestimation;
  • The mean value of the positive residuals will tend to be larger than the absolute mean value of the negative residuals, ζ + > ζ ;
  • Connected to all of the aforementioned effects, the weighted residuals will in total tend toward positive values, ζ 0 ;
  • And, the intensity-dependent scale factor K = I o b s I c a l c > 1 tends toward values larger than one in particular for the weakest intensities.
By analogy, partial overestimation of absorption leads more often to negative residuals, absolute larger negative residuals, and a shift in the weighted residuals to negative values and to a small value in K < 1 for the lowest intensity bin. If the absorption correction error is the dominant systematic error, the list can be completed by the following
6.
Low-intensity reflections contribute most to the sum of squared residuals (chi-square sum);
Point (6) is particular convincing, as usually the strong reflections are expected to contribute most to the chi square sum, but here, the weak intensities are the main contributors. Additional traces in the data could be
7.
A broad scatter plot I o b s  vs.  I c a l c , as sketched in Figure 2b
The most important observation is that (partial) underestimation of absorption leads to increased observed intensities. As a consequence, a systematic shift in the weighted residuals to positive values may indicate underestimation of absorption and a systematic shift of the weighted residuals to negative values may indicate overestimation of absorption.
The list (1)–(7) gives a fingerprint of expected effects on the fit data for the case that the anisotropy of the crystal has not been compensated for by other means, like frame scale factors and spherical harmonics.
But, before we report on the application to real data, a general note on systematic errors that lead to overestimated observed intensities is appropriate here, as it might be objected that this error reveals itself by increasing the agreement factors. But, that is—withing certain limits—not the case; it is quite the opposite: overestimation of observed intensities can even lead to artificially reduced agreement factors. This is shown and discussed elsewhere in greater detail, but some arguments are briefly discussed in Appendix A to give the reader at least some evidence for this assertion. Errors that lead to seemingly better results are unlikely to be found easily as good results are less often and less hard-questioned compared to results that are disappointing. This effect is known in the literature as ”confirmation bias”. A confirmation bias may result in methodological errors that are not found for a long time.

3.6. Application to a Strongly and a Weakly Absorbing Crystal from the Literature

Coming back to the study discussed in Section 3.3, it is reasonable to expect visible traces of absorption correction for the strongly absorbing Crystal 1, but not for the weak absorber Crystal 6. Figure 3 shows a variety of diagnostic plots, which were chosen in order to visualize some of the fingerprint criteria 1–7 from the above list, for the Mo data sets of the strongly absorbing Crystal 1 (first row) and of the weakly absorbing Crystal 6 (second row).
A tendency toward larger observed weak intensities as compared to the corresponding calculated intensities, I o b s > I c a l c , is particularly good, illustrated by plotting the observed versus calculated intensities on a logarithmic scale. Figure 3a does show a systematic difference for weak observed and calculated intensities for Crystal 1, whereas Figure 3e does not show this feature for Crystal 6.
More positive than negative residuals particularly for the weak intensities (point (1) from the above list), stronger positive residuals than absolute negative residuals for the weak intensities ((2) and (3)), an overall shift to positive residuals (4), and K > 1 for the weakest intensities (5) is illustrated by Figure 3b, which displays a moving average of the weighted residuals for the data sorted by increasing I c a l c . Different windows of size 50 (blue), 100 (orange), and 500 (green), respectively, were used for the calculation of the moving average. For a data set with no systematic errors, all of these curves fluctuate around zero, similar to Figure 3f for Crystal 6; however, the residuals are, on average larger than two for the lowest intensities of Crystal 1 (Figure 3b). This is an impressive deviation on average from zero.
Figure 3c,g show the moving averages of the squared weighted residuals—the entities constituting the chi-square sum that is minimized in the least squares model refinement—sorted in increasing order of the calculated intensities for Crystal 1, Figure 3c, and Crystal 6, Figure 3g. It is seen that the chi-square sum is clearly dominated by contributions from weak intensities for Crystal 1. Finally, the bent normal probability plots in Figure 3d,h confirm the existence of systematic errors in both data sets [35].
The normal probability plot with one-sidedly large positive residuals like in Figure 3h is a typical sign for contamination with low energy photons, which may occur in micro-sources with a mirror monochromator [14,36,37].
The plots in the top row of Figure 3 for the strongly absorbing Crystal 1 clearly display the expected signs of insufficient absorption correction, while those in the bottom row for the weakly absorbing Crystal 6 from the same study do not exhibit these typical features. This is taken as evidence for the validity of the rationale discussed in Section 3.4 and the characteristic traces derived from it as discussed in Section 3.5.

3.7. Application to Other Data

In order to further substantiate these findings, additional published data sets were analyzed from other authors. The structures of Crystal 8—Crystal 13 were published in the time period 2018–2024 and comprise Cu and Mo radiation sources; absorption coefficients between 8.42 mm−1 (Crystal 13) and 24.53 mm−1 (Crystal 8); and different diffractometer and detector types, including hybrid pixel detectors (Crystal 11). Mirror optics were used in most cases, and all structures were taken at 100 K, but Crystal 12, which was measured at 296 K and is the only one without a mirror monochromator. Crystal 12 is in many ways exceptional as it was also refined as a two-component twin and shows an extremely large weighting scheme parameter b = 226.7933 .
The additional structures are Crystal 7, sodium tungsate again [24], the same structure as Crystal 3; two further tungstate compounds with paratungstate B decorated by iron (II) or copper (II) from [27] (Crystals 8 and 9); a Bismuth compound, taken with copper radiation, Crystal 10 [28]; an Osmium compound again taken with copper radiation, Crystal 11 [29]; the somewhat exotic Djurleite Crystal 12 [30]; and an anti-cancer agent gold compound taken again with copper radiation, Crystal 13 [31]. More information to these data sets is also found in Table 1 and Table 2. The diagnostic plots for all these crystals are found in the Supplementary Materials. In this section, only three of the crystals are very briefly discussed: The diagnostic plots for Crystal 7 (sodium tungstate); Crystal 9, iron (II) paratungstate B; and Crystal 13, the anticancer gold agent STG-2, are shown in Figure 4. Crystal 7 is the same structure as Crystal 3 and was also taken with Mo radiation but with the more traditional analytical absorption correction that requires indexing of the crystal faces. The structure with the chemical formula Na 2 WO 4 · 2 H 2 O crystallizes in the orthorhombic space group P b c a and was measured on an Enraf Nonius diffractometer with graphite monochromator at 291 K and with Mo Kα radiation up to θ m a x = 30 . 0 .
A direct comparison with Crystal 3 is not possible as several variables are different, like temperature and resolution (Crystal 3: T = 100 K, θ m a x = 54 . 1 ); however, it is interesting to see that Crystal 7 also shows traces of insufficient absorption correction but to a smaller degree than Crystal 3. The mean value of the weighted residuals is with ζ C r y s t a l 7 = 0.1925 less than two thirds of ζ C r y s t a l 3 = 0.3247 , and ( # ζ + # ζ ) N o b s Crystal 7 = 0.1176 is approximately only one half ( # ζ + # ζ ) N o b s Crystal 3 = 0.2265 according to Table 2; both values are highly significant in both cases. Crystal 3 was measured up to θ m a x = 54 . 1 and Crystal 7 only to θ m a x = 30 . 0 ; however, when only data up to θ m a x = 30 . 0 are used in a refinement of Crystal 3, the mean of the weighted residuals increases from the previous 0.3247 (Crystal 3, θ m a x = 54 . 1 ) to 0.3996 (Crystal 3, θ m a x = 30 . 0 )! This increase may surprisingly indicate that specifically the weak data from low-resolution batches contribute particularly strong to a positive shift of the residuals. This is in line with the results by [38], who found that weak data tend to be overestimated for short exposure times.
Crystal 9, the new paratungstate B compound decorated with iron (II), chemical formula Na 5 Fe 2.5 [ W 12 O 40 ( OH ) 2 ] · 36 H 2 O , crystallizes in space group P 1 ¯ . Measurements were carried out according to the literature with Mo Kα radiation at 100 K with a Bruker diffractometer equipped with a sealed X-ray tube and Incoatec I μ S mirror optics. Disordered water molecules in the coordination sphere of atoms Na 4 and Na 5 were modeled. The [ W 12 O 40 ( OH ) 2 ] 10 anion links transition metal and sodium cations.
Crystal 13, a new bisphosphine gold (I) complex with anticancer activity, chemical formula C 48 H 38 AuCl 2 NOP 2 , crystallizes in space group C 2 / c and was measured at 100 K with a Bruker diffractometer equipped with a microsource and Cu Kα radiation. The absorption coefficient is with μ = 8.42 mm−1 smaller than those of Crystal 7 ( μ = 18.66 mm−1) and 9 ( μ = 21.09 mm−1). The data are used in an excellent tutorial at the wepage https://xray.uky.edu/Tutorials/shelxl-wght/4-WGHT-example (accessed on 6 October 2025) where they can be downloaded from, too. The SQUEEZE [39] procedure was used. Despite the comparably small absorption coefficient, distinct signs of insufficient absorption correction are visible as seen from the plots in Figure 4i–l.

3.8. Discussion of Table 2

The fingerprint traces 1–5 are found in Table 2:
  • Weak observed intensities too strong. This is reflected in the overall shift to positive residuals as given by ζ > 0 , which applies to all cases but to Ag data of the weakly absorbing Crystal 6. The mean value of the weighted residuals is significantly different from zero for all of the Mo data sets, with the exception of Crystal 2 and Crystal 6. Even for Crystal 5 with a comparatively low absorption coefficient of μ = 5.90 , mm−1 is the shift ζ = 0.0923 with ζ σ ( ζ ) = 4.78 still significant. The mean value of the residuals tends to be positive and significant approximately for crystals with μ 5 mm−1
  • More positive residuals than negative residuals in particular for weak reflections, which are relatively the most affected by overestimation: # ζ + > # ζ . This is seen in the fraction of positive excess residuals, which is positive for all data sets but again the Ag data of the weakly absorbing Crystal 6 and additionally the Ag and Mo data of the outlier Crystal 2. The number of positive excess residuals is significant for all data sets with approximately μ 10 mm−1
  • The mean value of the positive residuals tends to be larger than the absolute mean value of the negative residuals, ζ + > ζ ; this applies to all data sets in the table.
  • A shift in the weighted residuals to slightly positive regions in particular for the weak intensities, ζ 0 . This is more visible from the diagnostic plots, but it is also reflected in the next point (5):
  • The intensity-dependent scale factor tends toward values larger than one, K = I o b s I c a l c > 1 . K ranges between 0.64 ( Ag -data of Crystal 6)—the only case with K < 1 — and stunning 11.15 (Crystal 12, N o b s = 7077 ), followed by still large K = 6.14 ( Ag -data of Crystal 1, N o b s = 2976 ) and K = 4.00 (Crystal 13, N o b s = 8803 ). To verbalize this for Crystal 12 as an example, for the weakest 707 reflections (in order of I c a l c ), the observed intensity is on average 11.15 times as large as the calculated intensity: I o b s = 11.15 I c a l c . The affected number of reflections is in all cases much too large to be accidental. Moreover, for data sets not affected by systematic errors, one would expect numbers of K above and below one to appear approximately equally often. This is evidently not the case here.
All data sets but the Ag data set of Crystal 2 show a weighting scheme parameter b > 0 . This parameter indicates problems in the analysis of variance with weak reflections.

3.9. Testing the Hypothesis ζ exp ( d m a x μ )

In Section 3.4, it was discussed that the crystal dimensions play a role in the absorption procedure: The effective radius is chosen to be close to half of the smallest crystal dimension. As a consequence, when the fraction of the subset with incomplete correction is more pronounced, the larger the absorption for the maximum crystal dimension. In short, ζ exp ( d m a x μ ) . Figure 4b displays ζ  vs.  exp ( d m a x μ ) for all discussed data sets. A fit of the Mo and Ag data sets for Crystals 1, 3, 4, 5, and 6 (Crystal 2 was the outlier and was therefore omitted) results in the equation
ζ = 0.4429 exp ( d m a x μ ) + 0.3649
with R 2 = 0.8370 and standard errors for the parameter estimates ± 0.069 (for the parameter value, −0.4429) and ± 0.035 for the parameter value 0.3649. Thus, for a hypothetically not absorbing crystal with d m a x μ = 0 , the mean value results at ζ m i n = 0.078 , which is only slightly larger than the expected ideal value zero and for increasing values of d m a x μ , ζ becomes increasingly larger with a maximum value of about ζ m a x = 0.3649 . In short, the stronger the absorption, the larger the mean value of the weighted residuals. Figure 5 shows that Crystal 2 is a clear outlier and that Crystal 5 also tends toward too small values on the x-axis (the absorption along the maximum dimension). From Crystals 1 to 6, Crystal 5 was the one with by far the largest maximum dimension, that is with d m a x = 0.2 mm much larger than the diameter of the focused X-ray beam that is given in the literature with 110 and 90 μ m for Mo Kα and Ag Kα, respectively [33]. Taking these diameters of the beam focus as d m a x in the respective equation leads to the positions marked with a dagger, “†”, for Crystal 5—very close to the fit line. This might be taken again as additional evidence for the validity of the fit. The remaining data sets from other authors, however, do not fit in. But, what is very consistent across all data sets is that the mean value of the weighted residuals and the fraction of positive excess residuals increase together, as depicted in Figure 5a. This is evidence for a shift in (a part of) the residual distribution (mainly for the weak intensities) in all cases, rather than large positive outliers being responsible for ζ > 0 . This is an important point as it conforms to the fingerprint criteria from above and it excludes the possibility that other systematic errors with very large positive outliers, e.g., from strong intensities, are the main driving force behind large positive values of ζ .

4. Discussion

The results show that for Crystals 1–6, the residuals of structures with notable absorption coefficient (i) tend to show positive mean values accompanied by (ii) positive excess residuals. The larger the mean value of the residuals, the larger the absorption coefficient. Positive residuals origin mainly from weak intensities, which contribute most to the χ 2 sum of the least squares refinement. The results also show that other factors are important, too, as only the set of structures Crystals 1–6 measured on the same diffractometers to similar resolution with the same temperature fit to Equation (6), whereas Crystal 7–13 do not fit in. However, what almost all data sets have in common is (i) a positive mean value of the residuals (exception: Ag -data of the weakest absorbing Crystal 6) and (ii) an increase in the number of positive excess residuals for increasing mean values of the residuals (no exception, not even for Crystal 2). The rate of increase is similar for Crystals 1–6 (see Figure 5a). Note that Crystals 1–6 were measured to higher resolution compared to Crystals 7–13.

4.1. Other Errors Found in the Data Sets for Crystals 1–6

There seems to be an additional slight contamination with low energy radiation, which also increases the mean value of the residuals [14]. The application of the python script described in [37] yields correction factors k 3 λ = 0.0010 ,   0.0015 ,   0.0011 ,   0.0004 ,   0.0038 ,   0.0067 for the Mo data sets of Crystals 1–6, respectively, some of which are comparable to the correction factors k 3 λ = 0.0123 ,   0.0086 ,   0.0080 ,   0.0061 ,   0.0016 for compounds 1– 5 reported in [37]. This leads to another important open question: how does an insufficient absorption correction affect those low-energy contamination signals and corresponding correction factors and vice versa? Would a more accurate absorption correction increase or decrease k 3 λ and similar metrics? At least it is known that low-energy contamination affects the mean value of the residuals likely only by a little: the significances of the deviation of the mean value of the weighted residuals from zero reported in [14] about those five data sets, each in three “flavors” ((i) contaminated by low-energy radiation; (ii) not contaminated as a thin foil was used to physically block the low-energy radiation; and (iii) the corrected data set, where a correction procedure was applied to the data set contaminated by low-energy radiation), from [37] and all with low absorption coefficients ( 0.081 μ 0.111 mm−1) are in the range of 1.10 ζ σ ( ζ ) 5.74 . The absolute values (these are not given in the cited literature but were extracted from the personal archive of the author) are in the range of 0.0296 ζ 0.0996 for the 15 data sets from five structures. So, the absolute values are much smaller compared to most of those from the present study (see Table 2). This rules out low-energy contamination as a strong contributor to the large mean values of the residuals and leaves the larger resolution and stronger absorption as the likely main cause. The results found by [38] may give a clue in combination with the observation from above that cutting high-resolution data down to standard resolution increases the mean value of the residuals: maybe it is not the high resolution per se but an indirect result caused by those low-intensity reflections from low-resolution batches taken with shorter exposure times which are particularly strong overestimated according to [38]. The very large ζ values for Crystals 12 and 13 are most likely not explained by an insufficient absorption correction alone, but additional sources of error, but these were not yet identified. It was found earlier that not modeled twinning and not modeled disorder may increase the mean value of the weighted residuals [14]. It is, however, an open question of how procedures modeling (or squeezing out) solvent affect the mean value of the residuals, if at all.

4.2. A Remark on Elecron Density Peaks and Holes

It is often seen that the largest residual electron density peaks and holes are close to the heaviest atom(s) of the structure. Sometimes, these peaks and holes are really connected to the heavy atom, like in the case of anharmonic nuclear motion, which may be visible in high-resolution experiments (see, for example, [40]). But, sometimes, errors that may originate elsewhere tend to “condensate” in the residual density at the heaviest atoms in the structure. Insufficient absorption correction with I o b s > I c a l c for many weak reflections would result one-sidedly in a pronounced contribution to the residual electron density peaks, contributing in this way to an often observed imbalance in the residual electron density, where the residual density peaks tend to be larger than the modulus of the residual density holes.

4.3. A Remark on Anisotropic Displacement Parameters, ADPs

When it comes to ADPs and strongly absorbing crystals, one may first think about non positive-definite ADPs, cigar-shaped and oblate thermal motion ellipsoids which are known to indicate problems with absorption correction. However, before thermal motion ellipsoids look pronouncedly misshaped and raise alerts, they may appear first only slightly elongated or flattened, oriented in non-physical ways, strangely aligned or just unusual strong or weak. Although this is of course still part of the progression to cigar-shaped and oblate thermal motion ellipsoids, it is clear that the next important step would be to obtain physically meaningful ADPs that are only affected by absorption corrections and other sources of systematic error by a little and that are similar for same-temperature data sets of identical crystals measured with different radiation. A basic step forward to this goal would be to obtain similar U e q values for same-temperature data of same crystals. For Crystals 1–4, however, the values U e q from Mo data are often several standard deviations larger compared to those from Ag data. Does this mean that U e q from Ag data are more accurate? Not necessarily; it may as well indicate that there are correction performance discrepancies for the Mo and Ag data sets and both may underestimate the physical ADPs, although to different degrees. If the correction procedure itself is affected by systematic errors—which appears to be likely in this case in view of the results of the present work—the inherent main assumption that the absorption correction procedure is equally effective across the data sets breaks down. The information given about “Absorption correction in WinGX” (https://www.chem.gla.ac.uk/~louis/software/wingx/absorb.html (accessed on 6 Octeber 2025)), indicates, in view of the numerical results from Table 2 on that site, that
“The accepted wisdom is that absorption errors cause the adp’s to be somewhat smaller and more anisotropic than the “true” values (in this context the “true” value is assumed to be that obtained from the analytical correction.). Examination of the values obtained with no absorption correction bear this out”.
This could be rephrased as follows: underestimation of absorption effects leads to too small ADPs—which might even be the case for the results from [24] (see again Figure 4a–d). Signs of underestimated absorption are apparent in almost all data sets discussed here. In the present study, the focus was, however, on the traces of absorption correction in the residuals. Identifying these traces might help to reach the next milestone in obtaining even better absorption correction procedures: these would show consistent ADPs among data sets from different radiation sources collected at the same temperature and from the same crystal. Consistent does not mean that they are identical, but they would be similar within just a few standard deviations. The step after this milestone is having ADPs physically meaningful and describing mainly and realistically thermal motion even in strongly absorbing crystals.

5. Summary

In total, it was convincingly shown that absorption correction leads to specific traces in the residuals. If absorption correction procedures were complete, these specific traces were not observable. They can be rationalized and are described by the fingerprint criteria 1–7 as discussed above. Furthermore, they are consistent for all crystals analyzed in this work (apart from Crystal 2) and are even found for crystals with comparatively low absorption coefficients like Crystal 4 (Mo data, μ = 9.78 mm−1), Crystal 11 ( μ = 9.33 mm−1), and Crystal 13 ( μ = 8.42 mm−1). Other sources of systematic errors like low energy contamination and an unknown systematic error in Crystal 2 with an error compensation effect on the mean value of the weighted residuals are present in the analyzed data sets as well. These results could help guide the way to improving absorption correction procedures in the future.

Supplementary Materials

Contain diagnostic plots for Crystals 1—17. The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/cryst15100898/s1, Table S1: Mo data of Crystals 1—6; Table S2: Ag data of Crystals 1—6; Table S3: Crystals 7–13; Table S4: Additional examples.

Funding

This research received no external funding.

Data Availability Statement

The original contributions presented in this study are included in the article/Supplementary Material. Further inquiries can be directed to the corresponding author.

Conflicts of Interest

Author Julian Henn is CEO and founder of the company DataQ Intelligence UG. The remaining authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Appendix A. The Transformational Behavior of Merging Factors Rmerge, Rp.i.m, and Rr.i.m Under Changes Iobs → Iobs + δIobs with δIobs > 0

It was mentioned in the main text that overestimation of I o b s may not easily be detected with the help of agreement factors, as these are not increasing but may even decrease with increasing overestimation of I o b s . A thorough discussion of this topic is out of the scope of the present work, but it is briefly sketched for some of the agreement factors here.
The merging R-factor [41,42,43] is defined according to
R m e r g e = h k l i I i ( h k l ) I ¯ ( h k l ) h k l i I i ( h k l ) ,
with redundant observations I i ( h k l ) and mean value over groups of redundant observations I ¯ ( h k l ) . The precision-indicating merging factor [44] is defined according to
R p . i . m = h k l ( 1 / ( N 1 ) ) 1 / 2 i I i ( h k l ) I ¯ ( h k l ) h k l i I i ( h k l ) ,
and the redundancy-independent merging factor [44] is defined according to
R r . i . m = h k l ( N / ( N 1 ) ) 1 / 2 i I i ( h k l ) I ¯ ( h k l ) h k l i I i ( h k l ) ,
where N indicates the redundancy of equivalent reflections.
When the set I o b s , i ( h k l ) (abbreviated by “A”) is replaced by I o b s , i ( h k l ) + δ I o b s (abbreviated by “B”) with the same constant value δ I o b s > 0 for all reflections to simplify the discussion, the changes induced by this transformation in the respective enumerator of Equations (A1)–(A3) cancel out exactly, whereas the denominator is increased. This leads in all three cases to a reduction: R m e r g e ( B ) R m e r g e ( A ) , R p . i . m ( B ) R p . i . m ( A ) , and R r . i . m ( B ) R r . i . m ( A ) . All three merging agreement factors follow the same trend: the smaller, the more I o b s are overestimated. This common property of the metrics Equations (A1)–(A3) makes overestimation of I o b s rewarding, as a seemingly higher data quality is obtained (and additionally lower ADPs).
When δ I o b s is not a constant for all reflections but only roughly constant for subsets of redundantly measured reflections but may be changing with resolution or for low- and high-intensity data, cancellation in the enumerator still works, although it will depend on the exact values; however, the denominator will still increase by the full amount. The more uniform δ I o b s is distributed within subsets of redundant measured reflections and the larger it is, the larger the reduction in the agreement factors, even when δ I o b s is vastly different for different subsets of redundantly measured reflections.

References

  1. de Meulenaer, J.; Tompa, H. The absorption correction in crystal structure analysis. Acta Crystallogr. 1965, 19, 1014–1018. [Google Scholar] [CrossRef]
  2. Coppens, P. Crystallographic Computing; Munksgaard: Copenhagen, Denmark, 1970; pp. 255–270. [Google Scholar]
  3. Lu, Y.; Duman, R.; Beilsten-Edmands, J.; Winter, G.; Basham, M.; Evans, G.; Kamps, J.J.A.G.; Orville, A.M.; Kwong, H.S.; Beis, K.; et al. Ray-tracing analytical absorption correction for X-ray crystallography based on tomographic reconstructions. J. Appl. Crystallogr. 2024, 57, 649–658. [Google Scholar] [CrossRef] [PubMed]
  4. North, A.C.T.; Phillips, D.C.; Mathews, F.S. A semi-empirical method of absorption correction. Acta Crystallogr. Sect. A 1968, 24, 351–359. [Google Scholar] [CrossRef]
  5. Blessing, R.H. Data Reduction and Error Analysis for Accurate Single Crystal Diffraction Intensities. Crystallogr. Rev. 1987, 1, 3–58. [Google Scholar] [CrossRef]
  6. Katayama, C.; Sakabe, N.; Sakabe, K. A statistical evaluation of absorption. Acta Crystallogr. Sect. A 1972, 28, 293–295. [Google Scholar] [CrossRef]
  7. Walker, N.; Stuart, D. An empirical method for correcting diffractometer data for absorption effects. Acta Crystallogr. Sect. A 1983, 39, 158–166. [Google Scholar] [CrossRef]
  8. Blessing, R.H. An empirical correction for absorption anisotropy. Acta Crystallogr. Sect. A 1995, 51, 33–38. [Google Scholar] [CrossRef]
  9. Krause, L.; Herbst-Irmer, R.; Sheldrick, G.M.; Stalke, D. Comparison of silver and molybdenum microfocus X-ray sources for single-crystal structure determination. J. Appl. Crystallogr. 2015, 48, 3–10. [Google Scholar] [CrossRef]
  10. Hoser, A.; Madsen, A.Ø. Models of thermal motion in small-molecule crystallography. IUCrJ 2025, 12, 421–434. [Google Scholar] [CrossRef]
  11. Sheldrick, G.M. Crystal structure refinement with SHELXL. Acta Crystallogr. Sect. C 2015, 71, 3–8. [Google Scholar] [CrossRef]
  12. Wilson, A.J.C. Statistical bias in least-squares refinement. Acta Crystallogr. Sect. A 1976, 32, 994–996. [Google Scholar] [CrossRef]
  13. Henn, J. Bias caused by a popular weighting scheme. J. Appl. Crystallogr. 2025, 58, 283–289. [Google Scholar] [CrossRef]
  14. Domagala, S.; Nourd, P.; Diederichs, K.; Henn, J. Progress in detection of and correction for low-energy contamination. J. Appl. Crystallogr. 2023, 56, 1200–1220. [Google Scholar] [CrossRef] [PubMed]
  15. Bruker AXS Inc. SAINT, Version 7.68 A; Bruker AXS Inc.: Tokyo, Japan, 2009. [Google Scholar]
  16. Agilent. CrysAlis PRO; Agilent Technologies Ltd.: Yarnton, UK, 2011. [Google Scholar]
  17. Rigaku Software. Crysalis Pro; Rigaku Oxford Diffraction: Yarnton, UK, 2015. [Google Scholar]
  18. Wang, Z. N-(2-Hydroxy-3-methoxybenzylidene)pyrazine-2-carbohydrazide monohydrate. IUCrData 2020, 5, x191731. [Google Scholar] [CrossRef] [PubMed]
  19. Lough, A.J.; Ho, A.; Tam, W. [(1R*,3S*,4S*)-3-(2-Hydroxybenzoyl)-1,2,3,4-tetrahydro-1,4-epoxynaphthalen-1-yl]methyl 4-nitrobenzoate. IUCrData 2020, 5, x200265. [Google Scholar] [CrossRef] [PubMed]
  20. Doboszewski, B.; Nazarenko, A.Y.; Soares, F.d.P. 1,3-Bis(2-oxopropyl)thymine. IUCrData 2020, 5, x200257. [Google Scholar] [CrossRef]
  21. Weil, M.; Kremsmayr, T.; Mihovilovic, M.D. 6-[(tertButyldimethylsilyl)oxy]-3-ethenyl-7-methoxy-4-[(trimethylsilyl)ethyn- 662 yl]naphtho[2,3-c]furan-1(3H)-one. IUCrData 2020, 5, x200224. [Google Scholar] [CrossRef]
  22. Harmening, T.; van Wüllen, L.; Eckert, H.; Rodewald, U.C.; Pöttgen, R. Sc4Pt7Si2—An Intergrowth Structure of ScPtSi and ScPt Related Slabs; WILEY: New York, NY, USA, 2010. [Google Scholar]
  23. Dubler, E.; Vedani, A.; Oswald, H.R. New structure determination of murdochite, Cu6PbO8. Acta Crystallogr. Sect. C Cryst. Struct. Commun. 1983, 39, 1143–1146. [Google Scholar] [CrossRef]
  24. Farrugia, L.J. Sodium tungstate dihydrate: A redetermination. Acta Crystallogr. Sect. E Struct. Rep. Online 2007, 63, i142. [Google Scholar] [CrossRef]
  25. Rohrmoser, B.; Eickerling, G.; Presnitz, M.; Scherer, W.; Eyert, V.; Hoffmann, R.D.; Rodewald, U.C.; Vogt, C.; Poettgen, R. Experimental electron density of the complex carbides Sc3[Fe (C2)2] and Sc3[Co (C2)2]. J. Am. Chem. Soc. 2007, 129, 9356–9365. [Google Scholar] [CrossRef]
  26. Azhakar, R.; Ghadwal, R.S.; Roesky, H.W.; Hey, J.; Krause, L.; Stalke, D. Mixed valence η 6-arene cobalt (i) and cobalt (ii) compound. Dalton Trans. 2013, 42, 10277–10281. [Google Scholar] [CrossRef]
  27. Nadiia, I. Gumerova, a Anatolie Dobrov, a.A.R.; Rompela, A. Iron(II) and copper(II) paratungstates B: A single-crystal X-ray diffraction study. Acta Crystallogr. Sect. C 2018, 74, 1252–1259. [Google Scholar]
  28. Stammler, H.G.; Imran, M. Bis(4-phenyl-2-sulfanylidene-2,3-dihydro-1,3-thiazol-3-ido-κ2S2,N)(4-phenyl-1,3-thiazole-2-thiolato-κS2)bismuth. IUCrData 2020, 5, x200067. [Google Scholar] [CrossRef]
  29. Zometa Paniagua, D.F.; Powell, G.L.; Powell, C.B.; Reinheimer, E.W. Bis(μ 4-adamantane-1,3-dicarboxylato-1κO 1:2κO1′:3κO 3:4κO3′)octacarbonyl-1κ 2C,2κ2C,3κ2C, 4κ2C-tetrakis[tris(4-methylphenyl)phosphane]-1κP, 2κP,3κP,4κP-tetraosmium(I)(2 OsOs). IUCrData 2020, 5, x201204. [Google Scholar] [CrossRef]
  30. Zhou, Y.; Fan, C.; Wen, B.; Zhang, L. Natural djurleite with refined composition Cu61.39S32 revealing disorder of some Cu sites. IUCrData 2022, 7, x220694. [Google Scholar] [CrossRef]
  31. Gilpatrick, S.T.; Obisesan, O.A.; Parkin, S.; Awuah, S.G. Carbon–phosphorus stapled Au(i) anticancer agents via bisphosphine induced reductive elimination. Dalton Trans. 2024, 53, 18974–18982. [Google Scholar] [CrossRef] [PubMed]
  32. Henn, J. Metrics for crystallographic diffraction- and fit-data: A review of existing ones and the need for new ones. Crystallogr. Rev. 2019, 25, 83–156. [Google Scholar] [CrossRef]
  33. Hasse, B.; Graf, J.; Heidorn, U.; Kroth, S.; Hertlein, F.; Wiesmann, J.; Michaelsen, C. State-of-the-Art Multilayer Optics for X-ray Diffractometry; WILEY: New York, NY, USA, 2010. [Google Scholar]
  34. Coppens, P. Crystallographic Computing: Proceedings of an International Summer School Organized by The Commission on Crystallographic Computing of the International Union of Crystallography and Held in Ottawa, 4–11 August 1969; Scandinavian University Books; International Booksellers and Publishers: Geneva, Switzerland, 1970; pp. 255–270. [Google Scholar]
  35. Abrahams, S.C.; Keve, E.T. Normal probability plot analysis of error in measured and derived quantities and standard deviations. Acta Crystallogr. Sect. A 1971, 27, 157–165. [Google Scholar] [CrossRef]
  36. Macchi, P.; Bürgi, H.B.; Chimpri, A.S.; Hauser, J.; Gál, Z. Low-energy contamination of Mo microsource X-ray radiation: Analysis and solution of the problem. J. Appl. Crystallogr. 2011, 44, 763–771. [Google Scholar] [CrossRef]
  37. Krause, L.; Herbst-Irmer, R.; Stalke, D. An empirical correction for the influence of low-energy contamination. J. Appl. Crystallogr. 2015, 48, 1907–1913. [Google Scholar] [CrossRef]
  38. Williams, A.E.; Thompson, A.L.; Watkin, D.J. The role of multiple observations in small-molecule single-crystal service X-ray structure determination. Acta Crystallogr. Sect. B 2019, 75, 657–673. [Google Scholar] [CrossRef]
  39. Spek, A.L. PLATON SQUEEZE: A tool for the calculation of the disordered solvent contribution to the calculated structure factors. Acta Crystallogr. Sect. C 2015, 71, 9–18. [Google Scholar] [CrossRef]
  40. Herbst-Irmer, R.; Henn, J.; Holstein, J.J.; Hübschle, C.B.; Dittrich, B.; Stern, D.; Kratzert, D.; Stalke, D. Anharmonic Motion in Experimental Charge Density Investigations. J. Phys. Chem. A 2013, 117, 633–641. [Google Scholar] [CrossRef]
  41. Stout, G.H.; Jensen, L.H. X-ray Structure Determination: A Practical Guide; John Wiley & Sons: Hoboken, NJ, USA, 1989. [Google Scholar]
  42. Blundell, T.L.; Johnson, L.N. Protein Crystallography; Academic Press: New York, NY, USA, 1976. [Google Scholar]
  43. Drenth, J. Principles of Protein X-Ray Crystallography; Springer Science & Business Media: Berlin/Heidelberg, Germany, 2007. [Google Scholar]
  44. Weiss, M.S. Global indicators of X-ray data quality. J. Appl. Crystallogr. 2001, 34, 130–135. [Google Scholar] [CrossRef]
Figure 1. Descriptors of residuals for 6 different crystals from published data sets [9] with decreasing absorption coefficient ranging from μ Mo = 121.02 mm−1 (Crystal 1), to μ Ag = 1.53 mm−1 (Crystal 6). The exact numbers are found in Table 2. Comparison of the mean value of the weighted residuals ζ (a) and of the fraction of positive excess residuals ( # ζ + # ζ ) / N o b s (b) for Crystals 1–6 from this study. The stronger the absorption coefficient, the larger the mean value of the residuals, with the exception of Crystal 2. The larger the mean value of residuals, the larger the fraction of positive excess residuals, again with the exception of Crystal 2.
Figure 1. Descriptors of residuals for 6 different crystals from published data sets [9] with decreasing absorption coefficient ranging from μ Mo = 121.02 mm−1 (Crystal 1), to μ Ag = 1.53 mm−1 (Crystal 6). The exact numbers are found in Table 2. Comparison of the mean value of the weighted residuals ζ (a) and of the fraction of positive excess residuals ( # ζ + # ζ ) / N o b s (b) for Crystals 1–6 from this study. The stronger the absorption coefficient, the larger the mean value of the residuals, with the exception of Crystal 2. The larger the mean value of residuals, the larger the fraction of positive excess residuals, again with the exception of Crystal 2.
Crystals 15 00898 g001
Figure 2. Schematic representation of how underestimated absorption affects the observed intensities I o b s prior to (a) and after (b) application of an overall scale factor. The intensities I o b s , I t r u e are in arbitrary units. The initial values were generated with the help of a random number generator. Blue dots mark the simulation of the smaller subset of adequately corrected observed intensities, orange dots mark the simulation of the larger subset of reflections with underestimated absorption. (a): Underestimated absorption leads to many too weakly observed intensities (orange circles). After application of the scale factor, the many too weakly observed intensities are much closer to the true value; however, the reflections with formerly adequate absorption correction (blue circles) are now too large, leading to positive residuals. Scaling is the mechanism that leads to increased observed intensities although the real problem is that I o b s have been underestimated. For more information, see the text.
Figure 2. Schematic representation of how underestimated absorption affects the observed intensities I o b s prior to (a) and after (b) application of an overall scale factor. The intensities I o b s , I t r u e are in arbitrary units. The initial values were generated with the help of a random number generator. Blue dots mark the simulation of the smaller subset of adequately corrected observed intensities, orange dots mark the simulation of the larger subset of reflections with underestimated absorption. (a): Underestimated absorption leads to many too weakly observed intensities (orange circles). After application of the scale factor, the many too weakly observed intensities are much closer to the true value; however, the reflections with formerly adequate absorption correction (blue circles) are now too large, leading to positive residuals. Scaling is the mechanism that leads to increased observed intensities although the real problem is that I o b s have been underestimated. For more information, see the text.
Crystals 15 00898 g002
Figure 3. Diagnostic plots for diffraction data taken from study [9]. Top row: Strongly absorbing Crystal 1 ( μ Mo = 121.02 mm−1), bottom row: weakly absorbing Crystal 6 ( μ Mo = 2.87 mm−1). Scatter plot log I o b s vs. log log I c a l c (a,e); Moving averages of weighted residuals ζ (from Mo Kα data) sorted by increasing I c a l c (b,f). The dashed red lines are drawn at +2 and −2 as a guide for the eye. Moving averages of squared weighted residuals ζ 2 sorted by increasing I c a l c (c,g). The red dashed line is drawn at +2 as a guide for the eye. ζ 2 fluctuates with constant amplitude around the solid black line for a data set without systematic errors. Normal probability plot (d,h). The strongly absorbing Crystal 1 shows the fingerprint criteria as discussed in the text: Low-intensity observed reflections tend to be too large (a) and lead to, on average, large positive residuals (b) specifically for weak intensities—please note the scale! The main contribution to the chi-square sum is from low-intensity reflections (c). Thus, the strongly absorbing Crystal 1 shows the fingerprint traces of insufficient absorption correction while the weakly absorbing Crystal 6 does not show these traces. Systematic errors, however, remain in both data sets (d,h). Overestimated weak intensities lead in the model refinement for Crystal 1 to an overall large positive shift in the residuals, ζ = 0.40 with high significance ζ / σ ( ζ = 17.75 , to a large fraction of positive excess residuals ( # ζ + # ζ ) / N o b s = 0.30 , which is highly significant again, ( # ζ + # ζ ) / N o b s = 16.64 , and to larger positive residuals ζ + = 1.04 ± 0.02 , compared to absolute negative residuals | ζ | = 0.81 ± 0.03 (see also Table 2).
Figure 3. Diagnostic plots for diffraction data taken from study [9]. Top row: Strongly absorbing Crystal 1 ( μ Mo = 121.02 mm−1), bottom row: weakly absorbing Crystal 6 ( μ Mo = 2.87 mm−1). Scatter plot log I o b s vs. log log I c a l c (a,e); Moving averages of weighted residuals ζ (from Mo Kα data) sorted by increasing I c a l c (b,f). The dashed red lines are drawn at +2 and −2 as a guide for the eye. Moving averages of squared weighted residuals ζ 2 sorted by increasing I c a l c (c,g). The red dashed line is drawn at +2 as a guide for the eye. ζ 2 fluctuates with constant amplitude around the solid black line for a data set without systematic errors. Normal probability plot (d,h). The strongly absorbing Crystal 1 shows the fingerprint criteria as discussed in the text: Low-intensity observed reflections tend to be too large (a) and lead to, on average, large positive residuals (b) specifically for weak intensities—please note the scale! The main contribution to the chi-square sum is from low-intensity reflections (c). Thus, the strongly absorbing Crystal 1 shows the fingerprint traces of insufficient absorption correction while the weakly absorbing Crystal 6 does not show these traces. Systematic errors, however, remain in both data sets (d,h). Overestimated weak intensities lead in the model refinement for Crystal 1 to an overall large positive shift in the residuals, ζ = 0.40 with high significance ζ / σ ( ζ = 17.75 , to a large fraction of positive excess residuals ( # ζ + # ζ ) / N o b s = 0.30 , which is highly significant again, ( # ζ + # ζ ) / N o b s = 16.64 , and to larger positive residuals ζ + = 1.04 ± 0.02 , compared to absolute negative residuals | ζ | = 0.81 ± 0.03 (see also Table 2).
Crystals 15 00898 g003
Figure 4. Diagnostic plots for diffraction data from the literature, but from differing authors: Crystal 7, sodium tungstate [24], first row; Crystal 9, the iron (II) paratungstate B [27], second row; and Crystal 13, anticancer gold agent STG-2 [31], third row. All data sets show increased values of I o b s for the weak data as depicted in the scatter plots (a,e,i), a mean value of the residuals biased towards positive values for weak intensities (b,f,j), a dominant contribution to the chi-square sum from weak reflections (c,g,k), and the presence of systematic errors as indicated by a bent normal probability plots (d,h,l).
Figure 4. Diagnostic plots for diffraction data from the literature, but from differing authors: Crystal 7, sodium tungstate [24], first row; Crystal 9, the iron (II) paratungstate B [27], second row; and Crystal 13, anticancer gold agent STG-2 [31], third row. All data sets show increased values of I o b s for the weak data as depicted in the scatter plots (a,e,i), a mean value of the residuals biased towards positive values for weak intensities (b,f,j), a dominant contribution to the chi-square sum from weak reflections (c,g,k), and the presence of systematic errors as indicated by a bent normal probability plots (d,h,l).
Crystals 15 00898 g004
Figure 5. Crystals 1–13. (a) Plot of mean weighted residuals vs. fraction of positive excess residuals. A linear relation is quite obvious. (b) A fit between the mean weighted residuals ζ and the absorption along the longest crystal dimension for Crystals 1, 3, 4, 5, and 6 results in an astonishingly consistent plot but cannot be generalized to the other data sets [9].
Figure 5. Crystals 1–13. (a) Plot of mean weighted residuals vs. fraction of positive excess residuals. A linear relation is quite obvious. (b) A fit between the mean weighted residuals ζ and the absorption along the longest crystal dimension for Crystals 1, 3, 4, 5, and 6 results in an astonishingly consistent plot but cannot be generalized to the other data sets [9].
Crystals 15 00898 g005
Table 1. Description of crystals from the literature used in this work. Crystals 1–6 from study [9], listed in decreasing order of absorption coefficient for Mo Kα and Ag Kα radiation, respectively. Crystals 7–13 from various other authors. Column one: crystal number; column two: compound type; column three: anode material; column four: absorption coefficient in [mm−1]; column five: reflections in the least square refinement; column six: reference.
Table 1. Description of crystals from the literature used in this work. Crystals 1–6 from study [9], listed in decreasing order of absorption coefficient for Mo Kα and Ag Kα radiation, respectively. Crystals 7–13 from various other authors. Column one: crystal number; column two: compound type; column three: anode material; column four: absorption coefficient in [mm−1]; column five: reflections in the least square refinement; column six: reference.
Compound TypeAnode μ
[mm−1]
N obs Original
Publication
Crystal 1Scandium platinateMo
Ag
121.02
65.26
2976
6207
[22]
Crystal 2MurdochiteMo
Ag
38.25
20.48
313
770
[23]
Crystal 3Sodium tungstateMo
Ag
18.84
10.16
7584
10047
[24]
Crystal 4Scandium cobalt carbideMo
Ag
9.78
5.02
704
1590
[25]
Crystal 5Dibromoacridine derivativeMo
Ag
5.90
3.16
2918 *
2948
Crystal 6Inorganic cobalt complexMo
Ag
2.87
1.53
7847 +
7811
[26]
Crystal 7Sodium tungstateMo18.661811[24]
Crystal 8Cu(II) paratungstate B
Na 4 Cu 3 [ W 12 O 40 ( OH ) 2 ] 28 H 2 O
Mo24.534720[27]
Crystal 9Iron (II) paratungstate B
Na 5 Fe 2.5 [ W 12 O 40 ( OH ) 2 ] 36 H 2 O
Mo21.0210876[27]
Crystal 10Bismuth compound Bi ( C 9 H 6 NS 2 ) 3 Cu17.345322[28]
Crystal 11Osmium compound
[ { Os 2 ( CO ) 4 ( C 21 H 21 P ) 2 } 2 ( C 12 H 14 O 4 ) 2 ]
Cu9.339674[29]
Crystal 12Djurleite Cu 61.39 S 32 Mo23.547077[30]
Crystal 13Anticancer gold agent STG-2Cu8.428803[31]
Numbers given in [9]: 3183; 8341; * 3130; + 8338. Paratungstate B, [ W 12 O 40 ( OH ) 2 ] 10 .
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Henn, J. Incomplete Absorption Correction Results in an Increased Positive Mean Value of Weighted Residuals. Crystals 2025, 15, 898. https://doi.org/10.3390/cryst15100898

AMA Style

Henn J. Incomplete Absorption Correction Results in an Increased Positive Mean Value of Weighted Residuals. Crystals. 2025; 15(10):898. https://doi.org/10.3390/cryst15100898

Chicago/Turabian Style

Henn, Julian. 2025. "Incomplete Absorption Correction Results in an Increased Positive Mean Value of Weighted Residuals" Crystals 15, no. 10: 898. https://doi.org/10.3390/cryst15100898

APA Style

Henn, J. (2025). Incomplete Absorption Correction Results in an Increased Positive Mean Value of Weighted Residuals. Crystals, 15(10), 898. https://doi.org/10.3390/cryst15100898

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop