4.1. ToF-MS and Multivariate Methods for Key Gas Species Selection in Mixed Plasma
Figure 2 shows an example of data collected over time using ToF-MS. To collect the data, a
mixed gas was injected into the chamber, and a pressure of 2 mTorr and a target temperature were maintained throughout the entire process. At the same time, the gas species in the chamber were monitored by ToF-MS. After a sufficient amount of steady-state ToF-MS data had been obtained, an ICP plasma power of 500 W was applied for 600 s. Some molecular weight data as a function of time are shown in
Figure 2a. The raw data contain considerable noise and insignificant signals, so it is necessary to distinguish significant signals.
To accomplish this, PCA and NMF were applied to analyze the importance of each gas species by molecular weight, as shown in
Figure 2b. Gases involved in the plasma reaction show considerable variation in quantity as the plasma is turned on and off, resulting in increased variance. PCA and NMF can be used to emphasize these significant gas species while suppressing irrelevant noise signals with less variance. Some gas species that were difficult to detect in the cumulative time data (green) were identified using PCA importance (blue) and NMF importance (orange).
In ToF-MS, various factors can influence signal accuracy, such as the dependence of flight time on the starting position of ions, spatial gradients in the accelerating field, and potential timing errors caused by rapid changes in the accelerating pulse [
28,
29]. These factors can introduce inaccuracies that propagate through subsequent analyses, leading to erroneous interpretations. To mitigate these inaccuracies and focus on the most relevant gas species, we selected only the species corresponding to molecular weights that exhibit local maxima in importance.
Table 1 presents the relative importance values obtained from PCA and NMF for various molecular weights (
m/
z). This table lists gas species with local maximum importance values—those having higher importance compared to their neighboring species—and orders them by descending relative importance. Although the relative importance values obtained from PCA and NMF are generally similar, some differences in the rankings and magnitudes are observed due to the distinct characteristics of the two methods.
PCA captures the global variance structure of the data and allows for both positive and negative loading values, enabling the representation of opposing relationships between variables. This characteristic is particularly valuable in plasma chemistry, as it allows PCA to capture conflicting relationships between variables, such as the production and consumption of chemical species due to secondary reactions within the plasma chamber. In contrast, NMF emphasizes non-negative, additive parts-based representations, focusing on capturing dominant features present in the data [
30], which can be especially effective for the non-negative ToF-MS measurements. However, because the components in NMF are not orthogonal, there can be redundancy or overlap among variables, potentially leading to certain chemical species being unnecessarily emphasized multiple times across different components. Despite these methodological differences, the top eight chemical species identified by both methods were identical, and the ninth and tenth most important species in PCA were ranked tenth and fourteenth in NMF, respectively. This similarity indicates that both methods consistently identify the key gas species involved in the plasma processes.
In selecting the most significant gas species, those with relative importance values exceeding 0.1% in both PCA and NMF analyses were considered. Specifically, the mass-to-charge ratio (m/z) of 85 corresponds to , a fluorinated compound containing fluorine, which is crucial in plasma chemistry. Notably, showed a high correlation with and COF at elevated temperatures, indicating that participates in reactions at high temperatures. Similarly, the m/z of 119 likely represents . This species exhibited a weak negative correlation with the input gases (, ) and a positive correlation with other reaction products, suggesting that is formed within the plasma chamber through reactions of the input gases, indicating active plasma chemistry generating new fluorinated species. On the other hand, m/z 78 (estimated as ) showed relative importance values above 0.1% in both PCA and NMF, but did not show a clear correlation with the other species, so we excluded it from the analysis. By combining the relative importance values from both methods with chemical relevance and correlation analysis, the ten gas species discussed in the manuscript were selected as the most significant for the study.
Based on these findings,
Figure 2c illustrates the ion count over time for the ten most significant gas species, as identified by PCA. In
mixed plasma, fluorocarbons and fluorinated compounds, such as
,
, and
(molecular weights 69, 50, and 119, respectively), arise from
dissociation. Along with this dissociation,
is not detected because it breaks down into
during the electron impact ionization process in mass spectrometry [
31,
32]. The oxidation of carbon results in the formation of carbon oxides, including
and CO (molecular weights 44 and 28, respectively), result from oxidation processes. Carbonyl fluorides, such as COF,
, and
(molecular weights 47, 66, and 85, respectively), are intermediates formed through interactions between carbon, oxygen, and fluorine species in the plasma. Ar (molecular weight 40) serves as an inert carrier gas. Conversely, most signals that remained consistent upon ICP plasma power application were diminished.
4.2. Calculation of F-Radical Production and FOPDT Model Analysis
In semiconductor etching processes, particularly those involving silicon-based materials, fluorine radicals (F radicals) are crucial due to their high reactivity. F radicals efficiently break silicon bonds and form volatile silicon fluoride compounds, enabling precise etching. In the context of advanced 3D-NAND and DRAM technologies, maintaining the integrity of the ACL masks and managing carbon particle contamination in ACL deposition chambers are critical [
7,
33]. This necessitates the use of high-temperature gas plasma processes and fluorine-based plasma cleaning.
To optimize these processes, it is essential to understand the production and behavior of F radicals under high-temperature conditions. However, F radicals have a limited lifetime in the gaseous molecular state due to their high reactivity, high electronegativity, and high ionization potential, making them difficult to collect and analyze. To address this issue, we propose a method that utilizes the signals of other species participating in reactions involving F to perform calculations. This is feasible due to the fact that ToF-MS gathers all gaseous species in parallel. The resulting data matrix, denoted as
, is structured based on the species selected using
. This matrix captures the time evolution of the aforementioned species and serves as the foundation for calculating the total F radicals produced during the process. The selected species are
,
,
, Ar,
,
, CO, COF,
, and
. The reduced data matrix
is as follows:
The contribution of each species to the production of F radicals is represented by the vector
, which quantifies the number of F radicals produced per molecule of each species. The vector
is defined as
In this mixed plasma, a reduction in the concentration of results in the generation of three times the quantity of F radicals. Therefore, contributes −3 F radicals, contributes −2 F radicals, and contributes −5 F radicals. Ar, , , and CO do not contribute any F radicals, as they are either inert or do not contain fluorine. COF, , and contribute −1, −2, and −3 F radicals, respectively.
The total F radicals produced (
) can be calculated by multiplying the matrices
and
:
Species that were not selected for
either did not participate significantly in the reactions or had negligible detectable concentrations; thus, they were excluded from the calculations.
4.3. Dynamic Analysis of Gas Species Transitions Using FOPDT Modeling
Figure 3 illustrates the time-dependent changes in the signals of selected gas species (
,
, and
) measured using ToF-MS during the plasma process. Due to the presence of a dead volume in the connection interface between the process chamber and the monitoring equipment, delays may occur, potentially leading to signal distortion. The FOPDT model corrects these delays, thereby ensuring that the observed concentration changes reflect the plasma process itself and not artifacts caused by the physical structure of the chamber.
The FOPDT model divides the dynamic behavior of the system into two components: dead time and first-order response. The dead time is used to describe delays that are due to external factors, such as the physical configuration of the system and the measurement equipment being used. The first-order response, defined by the process gain and time constant, uses differential equations to model the intrinsic kinetics of the process. Despite the rapidity of plasma chemical reactions, the migration delay of gas between the main chamber and the monitoring equipment can result in concentration discrepancies. Given that gas concentrations fluctuate due to diffusion and drift, which are represented by differential equations, it is appropriate to utilize a model based on differential equations. By applying an FOPDT model, this complex gas transfer process can be simplified, and the process gain reflects the dynamics of the plasma process in the main chamber, independent of various delays [
24]. This approach ensures that the observed concentration changes are representative of the plasma process without distortions due to the physical configuration of the system.
The regression analysis results obtained using the FOPDT model, as shown in
Figure 3, provide specific values of the process gain for each gas species. For
and
, the process gains (
and
) are determined to be −27,060.9 and −7188.5, respectively, indicating that their concentrations decrease rapidly as the plasma voltage is applied. Conversely, the process gain (
) for
is
, indicating an increase in its concentration with the application of plasma voltage. These results suggest that fluorocarbons are rapidly oxidized in plasma environments.
The ToF-MS data were collected by controlling the process environment using the system described in
Figure 1. To simulate reactions at both room temperature and the conditions under which ACL deposition and etching occur, eight repeated experiments were conducted at four temperature conditions: 20 °C, 200 °C, 400 °C, and 650 °C. During these experiments, the ICP plasma power was varied from 100 W to 900 W. A total of 288 datasets were obtained, which were used to calculate the process gain for F radicals, as well as for each gas species selected based on PCA importance. This calculation was performed using the FOPDT model.
Figure 4a–k present these process gains as two-dimensional color maps, illustrating their dependence on process conditions. Additionally, the correlation between different gas species is visualized in
Figure 4l.
A detailed analysis of the
mixed plasma was conducted, covering the following elements:
, introduced into the process;
and
, produced from the decomposition of
[
31]; the inert gas Ar; and the species F radicals—CO,
, COF,
, and
—formed by the plasma reactions. The concentration of F radicals, which play a crucial role in the etching process, generally increased with rising plasma power and gas temperature within the chamber. However, at a high temperature of 650 °C, the F-radical concentration peaked around 300 W and slightly decreased as plasma power increased further (
Figure 4a). This decrease in F-radical concentration at higher power levels is likely due to the increased formation of carbonyl fluoride compounds (
Figure 4i–k), which consume F radicals as they are generated.
At temperatures around 400 °C, with high plasma power applied, the formation of
(
Figure 4d) can be explained by the decomposition of
into lower fluorocarbon ions such as
, followed by their recombination to form larger fluorocarbon species [
34,
35]. This is consistent with our observation that the decrease in
concentration is relatively small in the regions where
concentration increases, suggesting that
is formed when sufficient
ions are available in the high-power plasma environment.
Additionally,
Figure 4e shows that the concentration of Ar remained relatively unchanged under various plasma conditions, indicating that Ar does not participate in the plasma reactions. As the temperature rises, the concentration of
decreases steadily (
Figure 4f), while the concentrations of
and CO show an upward trend (
Figure 4g,h). This increase in CO concentration can be attributed to the behavior of COF intermediates within the
/
plasma.
radicals react with
to form COF intermediates [
36,
37], which can either dissociate into CO or react further with
to produce
. With increasing plasma power, the dissociation of COF into CO becomes more dominant compared to the formation of
, leading to a higher production of CO. This shift in reaction pathways, driven by elevated plasma power, results in a preferential generation of CO over
as power increases, thereby explaining the observed rise in CO concentration [
38].
However, at 650 °C and plasma power levels exceeding approximately 400 W, the concentrations of
and CO begin to decrease again. This behavior suggests that at higher power levels, these species may be recombining to form other carbonyl fluoride compounds. The data revealed that as the energy input increased at higher temperatures and plasma power levels, the production of
became more thermodynamically favorable [
39].
Moreover, under high-temperature conditions (at 650 °C) and with plasma powers above 600 W, the concentration of increased significantly. This finding implies that the elevated energy levels facilitate the further fluorination of , resulting in the generation of . The additional energy supplied by the increased plasma power likely provides the necessary activation energy for these fluorination reactions, suggesting that is a secondary product formed when excess F radicals react with under these high-energy conditions.
Figure 4i visually represents the complex interactions within the entire process to illustrate the interdependencies between chemical species in the plasma. The left side shows the correlation matrix, where each cell represents the correlation coefficient between two chemical species. The color gradient indicates the correlation strength, with blue representing negative correlations and orange representing positive correlations.
The right side of
Figure 4i displays a network map constructed from the correlation matrix. In this map, gas species that increase in concentration when the plasma is turned on (K > 0) are labeled in red, while those that decrease (K < 0) are labeled in blue. The font size of each label corresponds to the magnitude of concentration change. Connections between the species are depicted as lines, with the color and thickness of the lines indicating the strength and type of correlation: purple lines represent positive correlations, while orange lines denote negative correlations. In the network, the input gases
and
are fixed at the top-left corner, while the main product F is positioned at the bottom-right corner, providing a clear overview of the dominant interactions throughout the entire set of experimental conditions. This network visualization not only highlights the interdependencies and reaction pathways between the various chemical species in the plasma environment but also allows for a straightforward comparison of which species are predominantly involved in reactions under different conditions. In particular, the next section will provide insights into process control by clustering similar processes and analyzing each cluster comparatively.
4.4. Dimensionality Reduction and Clustering Analysis of Plasma Process Data
PCA is a powerful tool for reducing the dimensionality of complex datasets, enabling the identification of key patterns and relationships between variables that may not be immediately apparent. By transforming a large set of variables into a smaller set of principal components, PCA allows for the visualization of high-dimensional data in a lower-dimensional space, facilitating the interpretation of intricate interactions between process parameters and chemical species. In this study, PCA was employed to effectively visualize the complex interactions between process conditions and the resulting gas-phase species. The analysis was conducted using the process gain (K) values for each chemical species under various process conditions, and the resulting principal components were mapped onto a two-dimensional plane, as shown in
Figure 5.
In
Figure 5a, the process gains of gas species collected under varying temperature and plasma power conditions are visualized in a reduced-dimensional space defined by principal component 1 (PC1) and principal component 2 (PC2) through PCA. The color of each data point corresponds to the specific process condition, while its coordinates indicate the position within this reduced-dimensional space.
To better interpret this reduced-dimensional space, the original process gains (K) of each chemical species are represented as arrows in the lower-right corner of
Figure 5a. These arrows are loading vectors representing the contribution of each chemical species to the principal component axes within the two-dimensional space. Because the scales of the
x- and
y-axes differ, dashed lines were used to represent vectors of equal length. The direction and length of each arrow show how the process gain for each gas species influences the position of data points in the reduced-dimensional space. Specifically, the direction of each arrow signifies the axis along which variations in a particular chemical species are most pronounced, while the length of the arrow reflects the extent of the species’ contribution to variations in the corresponding principal component. This enables a straightforward interpretation of the relationship between high-dimensional process variables and the low-dimensional representation highlighting how variations in the chemical species shape the distribution of experimental data.
By leveraging these loading vectors,
Figure 5a effectively conveys the relationship between process conditions and the behavior of individual chemical species, providing a comprehensive overview of their interdependencies. This visualization underscores the dominant species influencing variance in the principal component space, thereby offering valuable insights into interaction pathways and reaction dynamics within the plasma process under different experimental conditions.
Figure 5b,c illustrate the loadings of each chemical species on the first and second principal components (PC1 and PC2), respectively. Positive or negative values indicate the direction and magnitude of the influence of each species on the principal components, thereby providing insight into which species drive variations along these key axes. The PC1 is defined as the linear combination of
K that captures the maximum variance in the data. The PC2 is the linear combination that captures the second highest variance in the data, subject to being orthogonal to PC1. This orthogonality constraint ensures that PC2 captures a new, independent direction of variance that was not accounted for by PC1. As a result, this approach preserves as much information as possible while reducing its dimensionality, allowing for a clear visualization of complex relationships between variables in a simplified two-dimensional space.
Building on these PCA results, clustering was applied to the process data to further analyze patterns and relationships. PCA effectively reduces the dimensionality of high-dimensional data while preserving the most critical variance, thereby revealing inherent patterns and relationships among the variables. The k-means algorithm was used to cluster experimental conditions with similar process gains in the reduced-dimensional space, identifying four distinct groups based on the Elbow Method. As shown in
Figure 6a, the PCA scatter plot displays these clusters, indicating underlying groupings in the chemical species’ behavior or process conditions. These groupings suggest different operational regimes or reaction pathways that are not easily observable in the original high-dimensional space.
Figure 6b–e provide a detailed analysis of each cluster, displaying the correlations among process variables. For each subfigure (b, c, d, and e), the left panel shows the correlation matrix for the gas species within the cluster, while the right panel presents a network map based on these correlations. The network map highlights strong correlations, making it easier to identify which species interact closely in each cluster.
In Cluster 1 (
Figure 6b), the production of carbonyl fluoride compounds (e.g., COF,
) is lower compared to other clusters, with limited
breakdown and reduced F-radical generation, suggesting fewer reactions leading to F radicals. In Cluster 2 (
Figure 6c), COF,
,
, and
show high correlations, with an increased
breakdown and F-radical generation relative to Cluster 1, indicating that
is promoting F-radical formation through its interaction with
, resulting in higher production of fluorinated compounds. In Cluster 3 (
Figure 6d), the correlations between gas species become stronger, suggesting that the reactions observed in Cluster 2 are more pronounced here, with CO playing a more active role in forming COF and
[
36,
37,
39], leading to an increased production of carbonyl fluoride compounds. In Cluster 4 (
Figure 6e), the correlation between carbonyl fluoride compounds and other species decreases relative to Cluster 3, with a significant drop in
concentration accompanied by a notable increase in F-radical generation, indicating extensive
decomposition and a corresponding rise in F radicals.
The methodology presented in this study effectively captures the complex interactions of plasma processes by combining PCA and clustering techniques. Through dimensionality reduction, we were able to highlight key patterns and visualize relationships between chemical species more clearly. Clustering in this reduced space allowed for the identification of distinct process behaviors, revealing variations in chemical reactions under different conditions. Additionally, there is the versatility to use different clustering methods to focus on specific aspects of the process, allowing for a more customized analysis of plasma chemistry.
Furthermore, the clustering results provide valuable insights for process control and optimization. By identifying how different process conditions influence reaction pathways within the plasma, we can determine which gas species are produced or consumed under specific conditions. This understanding enables us to adjust process parameters, such as temperature and plasma power, to achieve the desired plasma behaviors associated with specific clusters. For example, if the goal is to enhance the formation of beneficial chemical species like F radicals or to suppress the production of undesirable byproducts, we can modify the temperature and plasma power to operate within the conditions corresponding to the cluster that exhibits these characteristics. This approach offers concrete guidance on controlling process variables to achieve specific plasma behaviors, thereby improving the efficiency and outcomes of the plasma process.