1. Introduction
Liquid argon time projection chamber (LArTPC) technology is promising, and is used in many modern neutrino experiments, including the future Deep Underground Neutrino Experiment (DUNE) [
1]. Knowledge of the pion–argon cross-section is important for us to understand and explain the data collected in the LArTPC. For one thing, neutrinos are detected by their interaction with argon nuclei, which can produce hadronic particles, and these features will be used to identify the neutrino. For another, when a neutrino interacts with an argon nucleus, the initially produced pions can also interact within the nucleus, which will change the kinematics of particles emitted from the nucleus. This is called the hadronic final state interaction (FSI) effect. These effects serve as corrections for data and account for a major source of systematic uncertainty in neutrino oscillation analyses, which need constraints from the pion–argon cross-section.
Currently, the pion–argon cross-section is predicted mainly by interpolating data from lighter and heavier nuclei, while the experimental measurements are scarce. Only the LADS collaboration [
2] and the LArIAT collaboration [
3] have performed such measurements. ProtoDUNE-SP [
4], which contains 770 tons of LAr, is a prototype for one of DUNE’s far detectors. Charged particles, including pions with various momentum modes ranging from 300 MeV to 7 GeV, are delivered into the TPC through a beam plug, making it a good place to measure the pion–argon cross-section.
In this paper, we first introduce the energy slicing method in
Section 2, which is used to extract the cross-section. After that, we describe the analysis procedures in
Section 3, including selections, background subtraction, and unfolding. Finally, in
Section 4, we use a subset of the 1 GeV MC as a fake data sample to exemplify using the method and the procedures to measure the cross-section and compare it to the input cross-sections as validation. A few discussions towards measuring real data are given in
Section 5.
2. Energy Slicing Method
Based on the idea of the thin slice method proposed by the LArIAT collaboration [
3], we develop the energy slicing method [
5].
Figure 1 shows one TPC of ProtoDUNE-SP. A beam pion is shown as the dashed orange track with some possible daughter particles originating from the end vertex. The initial kinetic energy of the beam pion at the front-face of the TPC is denoted as
, and the kinetic energy at the end vertex is denoted as
. Given these two energies, the track can be divided into several energy slices by pre-defined energy bins, determined according to the energy resolution and the statistics on a case-by-case basis. In this analysis, the energy bins are chosen to be [0, 350, 450, 500, 550, 600, 650, 700, 750, 800, 850, 950,
∞) MeV. The bin edges are indicated by red bars in
Figure 1, where the last bar is dashed because the beam pion stops before it reaches that energy. The first complete energy slice is referred to as the
nitial slice; the slice where the track stops is called the
end slice. If the signal interaction occurs at the end vertex, then the end slice is also called an
interaction slice.
It is worth noting that is larger than the energy in the initial slice, and the piece of track before the initial slice is referred to as an incomplete slice, which is not usable. On the contrary, is inside the end slice. For convenience, we define the slice ID from 1 to the number of energy bins, N, starting with the bin with the highest energy. For example, energy bin [950, ∞) has the slice ID 1, and [0, 350] has the slice ID 12. Therefore, for each event which has a beam pion track in the TPC, there is an initial slice ID, an end slice ID, as well as an interaction slice ID, which is assigned as null if the interaction occurring at the end vertex is not the signal interaction. However, if the beam pion stops inside the incomplete slice, the whole track is not usable, so all three slice IDs will be assigned as null.
For all events with a beam pion track in the TPC, the distribution of initial slice ID is called the
initial histogram , and similarly we have the
end histogram and the
interaction histogram . The number of incident events in each energy bin, where the energy of the beam pion reaches the upper boundary of the energy bin, forms the
incident histogram , and it can be calculated as
Finally, from the equation in which the cross-section
is defined,
the cross-section in each energy bin can be given by
where
is the molar mass of argon,
is the density of LAr,
is the Avogadro constant.
is the energy bin width, while
is the spatial bin width. The stopping power of the pion in LAr,
, is evaluated from theory at the midpoint of each bin using the Bethe–Bloch formula [
6] and is assumed to be a constant within each energy bin.
3. Analysis Procedures
The whole MC sample (the nominal test beam simulation of the ProtoDUNE-SP experiment [
4] is employed) is divided into two halves: one is used as
fake data, and the other is referred to as
true MC. Using slice ID histograms derived from its true energy information, we can extract the true cross-section of the fake data sample, which can then be used to validate the slicing method. On the other hand, we can also treat fake data the same as real data and perform the same procedures to measure the cross-section using reconstruction information. The results can be used to validate the procedures, which will later be applied to measuring real data. The reconstructed
is derived from the measured beamline instrumented kinetic energy minus a constant upstream energy loss, the value of which is estimated using MC.
is calculated from
and the reconstructed track length based on the theoretical Bethe–Bloch formula as
To conduct the energy slicing method, we need to select beam pion tracks. Based on machine learning, Pandora [
7] selects one track in each event which is most likely to be the beam track. The information of this track and its daughters will then be used for further selections. In addition to technical cuts ensuring useful information from the track, we also perform some specific cuts to veto certain types of backgrounds. The beam particle enters the TPC through the beam plug, and we know approximately where the particle enters the TPC as well as its direction. Thus, we set cuts on the start position as well as the angle of the detected beam track in order to reduce non-beam tracks caused by misidentification. Proton tracks have a very different
curve compared to pions and muons [
4], so this feature can be used to reduce proton background. In order to mitigate muon background, a CNN-based Michel electron identifier is built [
8] and gives the detected beam track a daughter Michel score that can be used to distinguish pions from stopping muons. After full selection, we have 80.1% of pion inelastic events, which are the signal events for the inclusive measurement. The remaining backgrounds, estimated using the true MC, are given in
Table 1.
Next, we subtract the background histograms, estimated by the true MC, from the reconstructed histogram after selections,
, which represents
,
, or
. However, to account for the difference in background fractions
f between the (fake) data sample and the true MC, we perform a data-driven method by introducing a scale factor
for each background type
i. Thus, the reconstructed signal histogram is
In practice, we only consider for the three major backgrounds, which are muon, non-beam proton, and non-beam pion. is fitted for the best agreement between data and the true MC in the sideband of a variable distribution used in data selection, where background i dominates. For the fake data sample, the fitted results of the three are all consistent with unity, which is reasonable since the fake data are statistically the same as the true MC. However, this may not be the case for real data.
Finally, we perform unfolding [
9] to transform reconstructed histograms into true histograms, expressed as
The response matrix
is estimated using true beam pion tracks in the true MC sample. With
as input, we use the d’Agostini method [
10] implemented in the RooUnfold package [
11] to model the unfolding matrix
. To take into account the correlations among the three histograms,
,
, and
, we combine them as one variable
and perform 3-D unfolding. The index of the combined variable is
(the definition of the combined variable can be understood as flattening a 3D array into a 1D array), where
N indicates the number of bins in each histogram. To be conservative, the number of iterations in the d’Agostini method is chosen to be as large as 20 to obtain the results in
Section 4, but this parameter should be optimized based on further studies.
5. Conclusions and Outlook
In this paper, we describe the energy slicing method and use a 1 GeV fake data sample to validate the procedures of measuring pion–argon-inclusive cross-sections. When applying these procedures to measuring real data, we first need to study the differences between data and MC. We will reweight MC accordingly in order to make it more consistent with real data, since both the background estimations and the response matrix rely on MC to model. The differences between real data and MC will also be included in studies of systematic uncertainties.
The energy slicing method can be applied to other momentum modes in ProtoDUNE-SP as well. It can also be used to measure the exclusive cross-section with further cuts on daughter particles. Moreover, ProtoDUNE Run 2, with a slightly different detector configuration, is currently being assembled and is scheduled to start collecting data next year. It will offer opportunities to measure the pion–argon cross-section in a wider momentum range with higher statistics.