1. Introduction
Liquid argon (LAr) technology is widely used by many previous and current neutrino experiments, such as MicroBooNE [
1], ArgoNeuT [
2], and ICARUS [
3], and it is also planned to be employed by future experiments such as SBND [
4] as well as one of the next-generation large-scale neutrino experiments DUNE [
5]. Neutrinos are mostly detected by their interactions with argon nuclei, in which many types of hadrons are involved, including both the knockout of nucleons and the production of mesons. These neutrino-induced hadrons can also interact with nucleons before they escape the nucleus. This may change the kinematics and types of final state particles that are detected, which complicates the reconstruction of neutrino interactions. These are known as final state interaction (FSI) effects. In addition, the propagation and further interactions of these final state hadrons also need to be simulated properly. Therefore, knowledge of hadron-argon cross sections is required, which is useful for informing FSI and improve simulations as well as their associated uncertainties.
However, there are few experimental data available on argon, and the predictions are mostly derived by interpolating cross section results from lighter and heavier nuclei [
6,
7], such as carbon, sulfur, and iron, which have more data available [
8,
9,
10,
11]. In those experiments, the common set-up was to implement a beam of a certain type of hadron of interest and shoot the beam toward a thin target of the material of interest. The survival rate of the hadron beam after the thin target can be measured and used to calculate the cross section. The increasing popularity of LAr-based detectors has motivated efforts toward making cross section measurements of LAr. The LArIAT collaboration proposed the thin slice method to measure hadron-argon cross sections using an LAr time projection chamber (LArTPC) [
12], which itself can no longer be considered a thin target of LAr for hadrons. The precise track reconstruction capability of LArTPCs enables researchers to hypothetically divide the detector into several thin slices, and each slice can be considered an individual thin target experiment. The measurements from all these slices fill the corresponding energy bins.
The original method treats the measured cross section in each bin independently and performs an effective correction in each bin to account for inefficiency and bin migration caused by the detector’s resolution. We keep the essential idea of the thin slice method and further develop the method with more rigorous statistical procedures, including using multi-dimensional unfolding to consider the full correlations of different measurements. In this paper,
Section 2 shows the derivation of the cross section formula, and
Section 3 describes the slicing method in more detail.
Section 4 describes the measurement procedures for a simplified simulation sample, where all results are derived using an IPython notebook, referred to as
hadron-Ar_XS [
13]. Some further discussions on the results as well as a summary are given in
Section 5.
2. Cross Section Formula
The total cross section
as a function of the incident particle’s kinetic energy
E (For convenience,
and
are used interchangeably throughout the text). is defined according to
where
denotes the particle beam flux,
is the infinitesimal reduction of flux, and
n is the number density of the target material. By moving
, the infinitesimal path length of the particle inside the material shown on the right-hand side of Equation (
1), and then integrating both sides, we obtain
where
is the path length integral. This assumes that the cross section
remains constant within the variation of
E during its passage of
. (In reality,
indicates an effective mean value for the cross section within the variation of
E, since there will always be energy loss during the particle’s passage inside the material when we measure the cross section. This also applies to a finite passage length, as discussed in the last paragraph of this section). For a certain area and a certain period of time, the number of surviving particles detected is proportional to the outgoing particle flux, and thus we have
We can also define the number of interacting particles as
. Therefore, after measuring the number of incident particles and the number of surviving particles, the total cross section can be calculated as follows:
When it comes to the exclusive cross section, (Even for the inclusive cross section, there may be a reduction in flux due to particle decay. In this case,
a denotes the total inelastic interaction,
b denotes the particle decay, and thus
is considered an effective cross section. For convenience, we will also refer to particle decay as an “interaction”). we denote
a as the signal interaction and
b as all the other interactions, and thus we have
, where
is the reduction in flux due to the signal interaction. Also, we have
. By separating
based on the type of interactions into
in Equation (
4), we obtain
where
and
in Equation (
5) are not separable given the logarithm function on the right-hand side. Only when
, which implies that
is quite small, and the thin target approximation holds can we use the approximation
and find
Therefore, we have
which is in fact a direct implication from the definition of the exclusive cross section
However, in the slicing method described in
Section 3,
for each slice, which we used to calculate
, is not necessarily small. Therefore, we wish to obtain an unbiased cross section formula without the thin target approximation. From Equations (
1) and (
8), we have
For a finite
, we can estimate this relationship as
where
is the effective mean value for the cross section within the variation of
E during the passage of
. (According to Equations (
1) and (
8), assuming
n is constant,
can be expressed as
). Therefore, combined with Equation (
4), we have the expression for any channel
a:
Because we can never measure
in an infinitely small
E bin, we will express
as
in the following sections. In the thin target approximation, where
, Equation (
11) can be approximated to Equation (
7).
3. Slicing Method
An LArTPC cannot be seen as a thin target in terms of hadrons, whose mean free path in LAr is to the order of 10–100 cm. However, thanks to its high-resolution track reconstruction ability, the LArIAT collaboration proposed the thin slice method [
12], where the detector is hypothetically divided into several slices along the hadron beam direction. Each slice is viewed as a thin target with a width of several millimeters, based on the spacing of the sensing wires. When detecting tracks in the TPC, each slice serves as an individual thin target experiment. By detecting where the track ends, we know where the interaction happens and thus fill in the corresponding energy bins of
and
, which are used to calculate the cross section. The final results are rebinned to wider energy bins, such as 50 MeV, in order to obtain the results.
Based on the thin slice method, Stocker et al. first proposed the idea of energy slicing [
14] based on a study of the ProtoDUNE-SP experiment [
15]. In the energy slicing method, each energy bin is directly considered a slice, which is natural since the cross section is measured as a function of the kinetic energy of the incident particle. (The authors of [
16] also had a similar description of the slicing method, while the cross section formula used in these papers is proven to be an approximation of Equation (
11), according to
Section 2).
Figure 1 shows an illustration of an LArTPC. A beam hadron is incident from the left side of the detector and leaves a track inside the detector. The beam hadron track ends at the end vertex, where either an interaction occurs or the hadron comes to rest, potentially producing some daughter particles, which can be used to determine the type of interaction or the decay. The kinetic energy of the beam hadron when it enters the detector is denoted by
, which is known from the beam, and it approximately follows a Gaussian distribution, given the momentum spread. The kinetic energy of the beam hadron at the end vertex is denoted by
. Given these two energies, the track can be divided into several slices based on the predefined energy bins. (From this point onward, the slice can be used interchangeably with the energy bin). The bin edges are indicated by dark red bars in
Figure 1, where the last bar is dashed because the beam hadron did not reach that energy. As shown in
Figure 1, the first complete slice is referred to as the initial slice, and the slice which has the end vertex is referred to as the end slice. If the interaction occurring at the end vertex is a signal interaction, then the end slice is also referred to as the interaction slice.
The piece of track prior to the initial slice is referred to as an incomplete slice, which will not be used. In contrast, is inside the end slice.
For convenience, we define the slice index
from 1 to the number of energy bins
N, starting with the highest energy bin. Therefore, for each beam hadron track, there is an initial slice index
, an end slice index
, as well as an interaction slice index
, which is designated as null if the interaction occurring at the end vertex is not the signal interaction. In addition, if the end vertex is inside the incomplete slice, then the whole track is not usable, and thus the indices for all three slices will be designated as null. For a sample of events with a beam hadron track in the detector, the distribution of
forms the initial histogram
, and similarly, we have the end histogram
and the interaction histogram
. We also define the incident histogram
, which will later appear in the cross section formula in Equation (
14). Each bin of
counts the number of tracks which reach the energy corresponding to the slice index
, and thus we say the tracks are incident to that slice. Note that for
, one event is likely to contribute to multiple bins, since a track can be incident to a sequence of slices until it interacts. In the thin slice method, for each track,
fills
, and
fills
, while
should be filled from the value of
to
. Equivalently,
can also be calculated by the derived
and
as shown below. This calculation enables us to derive
using the unfolded histogram given in
Section 4.4. This is because after unfolding, the event-wise information is lost, and
cannot be derived by counting events. In addition, for
, it is no longer one entry per track, and thus it would be problematic to unfold the counted
directly: [
14]
The two expressions are equivalent given that
which equals the total number of beam hadron events. Given the relationship between the slice index
and the energy
E by definition, all of these histograms can also be given as energy histograms.
Compared with Equation (
11), by replacing
with
,
with
,
with
, and also
with
, we derive the cross section for the signal interaction in each energy bin, given by
where
is the energy bin width and
is the stopping power of the hadron in LAr. Therefore, for each beam hadron event, three properties are needed, which are
,
, and whether or not there is signal interaction in order to derive the slice indices for
,
, and
. This allows us to treat the three indices as a combined 3D variable, thus enabling the multi-dimensional unfolding discussed in
Section 4.3 and
Section 4.4.
5. Discussion and Summary
In the previous section, we described how to extract the true cross section of the simulation sample as well as how to measure the cross section of a data sample. Here,
was calculated in both cases, which was used to quantify the consistency against the simulation curve. In order to further test the results, we performed toy studies. Four hundred simulation samples, referred to as toys, each with a sample of 10,000 events, were generated in the same way as what was described in
Section 4. The true cross section as well as its covariance matrix was calculated in each toy simulation sample. For each cross section bin, we calculated the pull value of each toy, which is defined as
where
is the uncertainty for
. In each bin, the pull values were expected to follow a normal distribution.
Figure 14a shows the test results, where a Gaussian distribution was fitted onto the pull histograms in each cross section bin, as shown by the blue error bars. By visually comparison with the reference lines, we can see that they were generally consistent with the expectation that each of them centered at zero and had a bar length of one, corresponding to the two parameters in the Gaussian fit. As another test, which took into account the covariance among cross section bins, we show in
Figure 14b the histogram of
, calculated according to Equation (
15), for each toy. A
distribution was fitted onto the histogram, whose degree of freedom
, as shown in the legend, was consistent with the expectation, with 18 being the number of cross section bins. These tests served as a validation of the slicing method. They also suggest that given the current statistics of the events for each toy, the bias caused by the approximations of the method (For example, we considered the cross section calculated to be at the middle point in each energy bin, and we evaluated
at the middle energy value as well. Further corrections are needed if the statistical uncertainty becomes smaller when the sample size is much larger.) was insignificant.
Similarly, we generated 400 toy fake data samples, each with a sample of 10,000 events before selection, in order to study the performance of the procedures to measure the cross section. The 400 toy simulation samples used above were combined into a total of 4,000,000 events in order to model the response matrix as well as the efficiency plot for each toy fake data sample, and thus we could ignore the statistical uncertainty of the simulation sample. The cross section was measured for each toy fake data sample, and we could also derive the pull distributions in each cross section bin as well as the histogram of
, as shown in
Figure 15. In subplot (a), we can see that the lengths of the blue error bars were generally consistent with one, but some of their central points showed a small bias away from zero. This bias is considered the unfolding error. The general unfolding result effectively applies a re-smearing matrix on the true information [
24]. Treating the re-smeared truth as the truth introduces an unfolding error, and thus publishing the re-smearing matrix is suggested in order for others to consider this error when comparing the results. In addition, the unfolding error tends to be smaller when the regularization becomes weaker with a greater number of iterations. Since we did not include this bias, the derived
was supposed to be larger, whose distribution is shown in
Figure 15b.
In the fake data toy study, the simulation sample used to model the response matrix and the efficiency plot was consistent with the toy fake data samples because they were generated in the same way. When it comes to real data, we need to consider the uncertainties caused by the differences between the data and simulation, which can be estimated by fluctuating the relevant parameters of the simulation sample. Additional model validation procedures are essential to examine the compatibility between the data and simulation and ensure the differences were within the quoted simulation uncertainties.
In summary, a method as well as the corresponding procedures for the hadron-argon cross section measurement in an LArTPC detector were provided in this paper. The method requires the inputs of the initial energy and the energy at the end vertex of the track, as well as whether it is signal interaction occurring at the end vertex. The method showed good statistical performance, with no obvious bias except for that caused by unfolding, and good estimation of statistical uncertainties, as suggested by the toy studies. To apply it to real data, the systematic uncertainties due to the difference between the data and simulation should be considered, and the parameters of the unfolding algorithm used should be optimized with further investigations into the trade-off between bias and variance. These features can be added to the IPython notebook
hadron-Ar_XS [
13], which also has the potential to be extended to more cross section studies.