2. The Literature
In the continuous casting of steel, quality issues can also be pinpointed by observing mold signals, such as readings from level sensors, thermocouples embedded in mold walls, friction measurements, and more. Based on these observations, corrective measures can be taken, like reducing the casting speed or conducting subsequent visual inspections of the surface to determine potential downgrades or the need for scarfing. Distinctive thermocouple patterns associated with various defects have been recognized [
1], encompassing sticker breakouts, transverse depressions, pronounced oscillation marks, narrow-face bleeds, transverse corner cracks, longitudinal cracks, and mold level variation defects, among others. This study narrows its focus specifically to the issue of longitudinal cracks.
In order to continuously monitor the quality, non-destructive testing methods have been widely developed, such as image analysis [
3], eddy currents [
6], magnetic powder [
7], ultrasound [
8], or sulfur prints [
9]. Most of these systems require the installation of sensitive sensors in the continuous caster line, which is a harsh environment. Longitudinal cracks are less visible on certain grades of steel (usually slabs casted with low carbon content—also known as soft grades), which renders visual inspection systems ineffective [
10].
The authors of [
11] present a precision inspection system tailored to autonomously identify cracks in “as-cast” steel slabs. The task of identifying cracks is further complicated by the oxidized layer on the slab’s exterior. A custom-designed laser triangulation mechanism has been introduced. Cracks are detected using a combination of morphological detection and an SVM classifier. A different technique for forecasting visual breakouts during mold monitoring is detailed in [
12]. This method utilizes temperatures captured by thermocouples inside the mold and combines them with computer vision methodologies. Dubbed as mold temperature rate thermography, this strategy aids in pinpointing and extracting distinct characteristics of temperature zones, including the pace of temperature fluctuations over time, their geometric shape, and propagation speed.
Another non-destructive approach, as proposed in this paper, is to work with data-driven models based on process parameters already captured for process steering. Such data-driven models have proven to be efficient in several sectors of steel production, like end-point prediction at the basic oxygen furnace [
13] and fault detection at the hot strip mill [
14]. Currently, the available literature on longitudinal crack detection lacks contributions related to big-data applications and is limited to process knowledge systems.
The impact of chemical components, primarily carbon and sulfur, on longitudinal surface cracks has been comprehensively studied. As demonstrated by the researchers in [
15], the carbon concentration at which the longitudinal surface cracking peaks diminishes as the sulfur content rises. Conversely, for a fixed carbon concentration, the likelihood of surface cracking escalates with an increase in sulfur content. Separate research [
16] delves into the influence of the mold powder utilized, particularly its SiO
2 concentration, on the surface quality of slabs with a high aluminum composition. This study also reveals that, under identical processing conditions, wider slabs require a more extended period for the powder to achieve equilibrium. This pinpoints the importance of aggregating both chemical and process information into the input space.
Utilizing the thermocouple information, a temperature differential thermograph has been developed to identify slab cracks [
5]. To pinpoint the anomalous temperature zones resulting from longitudinal cracks, potentially problematic regions are isolated and segmented using computer image processing techniques.
In the study presented in [
17], a model is presented for longitudinal crack detection, leveraging principal component analysis (PCA) and support vector machine (SVM). The temperature patterns associated with the longitudinal crack defect are isolated. These encompass the standard casting temperature exhibiting minor and major oscillations, as well as the specific temperature of the longitudinal crack. PCA is employed to diminish the dimensionality of these features and an SVM is utilized as a classifier.
Novel machine learning techniques are typically introduced to industrial applications only after they have gathered substantial validation in academic research. Time-series data find utility across various industrial domains, including quality assurance, soft sensing, and predictive maintenance. Although numerous advanced methodologies have been employed for longitudinal crack detection, the primary focus has been on temperature and rolling attributes, without incorporating chemical components into the analysis. Moreover, no study was found where several years of historical data were used for analysis.
2.2. The CNN-SE Network
In this section, we provide an overview of the Convolutional Neural Network (CNN) framework. Additionally, we discuss the Squeeze and Excitation (SE) blocks, as detailed in [
18], adjusted for one-dimensional convolutions to align with our time-series dataset.
The CNN block is represented by the convolution transformation
. This transformation maps an input
to feature maps
. Here,
T and
denote the time dimensions, while
C and
represent the number of channels,
In this context, is derived from the convolution of the input vector with spatial kernels . Here, denotes a spatial kernel that represents an individual channel of , interacting with the corresponding channel of .
The SE architecture is illustrated in
Figure 4; the operation takes as input the channels originating from the convolutional transformation
. Overlaid on this signal are two distinct processes: Squeeze, where information is globally aggregated to distil essential features, and Excitation, which shows the dynamic modulation of these features, emphasizing certain channels of the input for enhanced representation and understanding. This representation is then used to scale the original input.
After the convolutional unit, the SE block starts with the Squeeze function, which employs global average pooling to produce channel-wise statistics. Statistic information
is derived by squeezing
across its temporal dimension
T. The
c-th element of
is computed as:
Following the Squeeze computation, the aggregated information undergoes an Excite operation. This step is designed to discern and capture dependencies specific to each channel.
where weights
and
are parameters that can be trained. Here,
denotes the sigmoid activation function, while
represents the gating function ReLU. The factor
r is introduced to reduce dimensionality, serving to curb the model’s complexity. This factor can be fine-tuned as a hyper-parameter. Research presented in [
18] indicates that setting
strikes a balance between model complexity and accuracy, especially when layers consist of 128 to 512 filters. The output block’s
c-th channel undergoes a rescaling process, defined as:
where
, and
refers to channel-wise multiplication between the scalar
and feature map
.
The SE block can be incorporated into conventional convolutional frameworks, enabling adaptive adjustments to the input feature maps. This mechanism is similar to a self-attention module, where the input values are used to determine their own significance. Studies have demonstrated that by integrating SE blocks into ResNet-50 [
18], a performance approaching that of ResNet-101 can be achieved. This is impressive for a model requiring only half of the computational costs. The quantity of additional parameters
P needed to learn these SE maps can be determined as follows:
where
S represents the total number of stages, with a stage being defined as a sequence of consecutive layers sharing the same kernel size.
indicates the count of blocks that are repeated within stage
s, while
indicates the number of feature maps associated with stage
s.
3. Methodology
Our approach comprises three stages: Firstly, static and temporal data from multiple databases are joined and indexed for each slab (sample), then corrupted samples are filtered out. In the second stage, an extensive number of features from each signal are extracted and the most relevant are selected through model selection. In the last step, classification models are trained and evaluated on different levels of imbalance, shuffling or maintaining time order. The impact of adding the process information and chemical composition in the dataset is also evaluated. In what follows, we describe the model architectures, dataset, and evaluation metrics used.
Two baselines are considered, namely, (1) an analytical model based on the physical behavior of the process which is currently being used in the factory and (2) a GBRT method trained on the static data and features extracted from the multivariate time series. The first one is purely the detections from a differential model similar to the one presented in [
5], based solely on historical data. The second is an ensemble model following the methodology proposed in [
4], where a GBRT model was trained using automated time series feature extraction and selection along with chemical features. The GBRT model combines multiple decision trees to make predictions. It leverages the strengths of decision trees, such as non-linearity and ability to handle complex relationships, while mitigating their weaknesses by boosting the performance of individual trees through iterative training.
Our proposed CNN-SE-FCN model is depicted in
Figure 5, where a fully convolutional block processes the temporal data and an FCN handles the static features. The outputs of both blocks are then concatenated and passed to the last activation layer.
Regarding the FCN model, two input tensors are defined: one for the time series and another for the static data. The time series tensor adopts the shape of . Here, N represents the maximum count of samples (batches) within the dataset, T is the number of time steps, and M corresponds to the count of time signals in our MTS dataset. Conversely, the tensor for static data is structured as , with K denoting the cumulative number of static features employed.
In our architectural design, both the time series and static data are processed concurrently by the CNN and FCN blocks, respectively. The convolutional block comprises three convolutional layers, serving as feature extractors. These layers have kernel sizes of 8, 5, and 3, with the corresponding number of filters set at 128, 256, and 128. Each of these layers is succeeded by batch normalization and a ReLU activation function. Additionally, the initial two blocks end with SE blocks, where the reduction ratio is designated as
, in line with the recommendations from the original paper [
18]. This inclusion augments the model’s complexity by
parameters, which translates to a roughly 5% relative increase in the trainable parameters of our model. The SE mechanism bolsters performance on multivariate datasets, given that each feature map can influence the outcome to varying extents. This autonomously acquired form of channel-specific attention integrates the inter-correlation data among multiple variables.
The FCN block consists of a hidden layer featuring 64 neurons, accompanied by a dropout rate of 50% to mitigate overfitting. This is succeeded by a ReLU activation layer. Outputs from both the CNN and FCN blocks are subsequently concatenated and fed to the concluding activation layer, which employs a sigmoid activation function for our classification task.
3.1. Data Preparation
The continuous casting process, also known as strand casting, is an essential phase in steel production, where liquid steel is solidified into a semi-finished slab for subsequent rolling in the hot strip mill. The quality of the final product is strongly dependent on numerous variables, such as the temperature, composition of the liquid steel, and casting speed, among others.
Predictive models for parameters such as temperature and solidification rates are utilized to determine these specifications. These models enable precise control and management of the continuous casting process, thus ensuring that the semi-finished product aligns with the desired quality standards. Based on these predictions, further adjustments can be made during the casting process to achieve the desired properties in the steel slab. The casting phase duration can vary, but is a relatively continuous process, hence the name.
One key aspect of managing the continuous casting process is the collection and analysis of time-series signals, which provide invaluable data about the casting process. In this project, we will focus on the temperature data collected from thermocouple sensors situated around the mold. In addition to this, chemical composition and other process features will be used to further improve the precision of our predictive models.
Time-series signals can be conceptualized as univariate time series, symbolized by
, which is a one-dimensional signal, sampled across a time domain. To be more specific,
is a sequentially arranged set of real values
, where
denotes the comprehensive length of the signal. This sequence typically arises from the output of a sensor that is sampling while overseeing a process. When a process is under the surveillance of multiple sensors, it is characterized as a multivariate time series (MTS) because it encompasses multiple time-dependent variables. An MTS, represented by
, is composed of various univariate time series
, with
indicating the total number of signals (or channels). The casting procedure is extensively instrumented, with several sensor readings and measurements available throughout its duration. In terms of data preparation, the time signals undergo rescaling to fit within the range of
, and the static features are normalized to achieve a zero mean and a standard deviation of one (it is noteworthy that both the training and test sets are rescaled and normalized based solely on the training set’s values).
Figure 3 displays the signals for a specific slab before being rescaled.
The accessible recorded data consist of the latest 85,000 slabs produced. After filtering out corrupted or incomplete entries, the usable dataset comprises approximately 80,000 samples. Each of these samples is equipped with static features and thermocouple time signals, each spanning roughly timesteps, contingent on the casting speed. This dataset was partitioned, with 70% allocated for training and validation, and the remaining 30% reserved for testing. To maintain the chronological sequence, the samples were not randomized prior to this division. This approach mirrors a realistic scenario where a model would be integrated into a production environment to forecast outcomes for subsequent heat batches. It is crucial to highlight that, during training, the training set undergoes shuffling at the start of each epoch.
Regarding the static data, they are composed of process variables and chemical compositions. The chemical composition pertains to the concentrations of chemicals identified prior to the pouring of steel into the tundish, with 18 features such as and , among others. On the other hand, the remaining five static features from the process variables consist of categorical features like the type of powder, nozzle, and stopper, as well as numerical features such as casting width and the average speed recorded over the last 3 min.
Time stamp labels were created at the time-of-crack. These labels can be of two different origins; if detected at the continuous caster by the current alarm system, a time stamp is recorded, similar to how they are marked in
Figure 3a,b. The slab will later be visually verified as to whether if the defect exists or not. The defects that are not detected by the system will be automatically identified after the slab is hot-rolled. The longitudinal cracks are then measured by an automated system and the time-of-crack for each defect is estimated from these measurements.
4. Results
In this section, we evaluate the described classification models with five different training configurations in order to measure the impact of different training techniques and the influence of different features. The evaluation metric is the
score as discussed in the previous section. Precision–recall and Receiver Operating Characteristic (ROC) [
22] curves are also shown for some experiments.
Table 1 presents a comparison of different models on four metrics: precision, recall, F1 score, and F-beta score (with beta = 2.7). The models compared are rule-based, GBRT, CNN-SE-FCN without chemical features (−chem), CNN-FCN without Squeeze and Excitation (−SE), and CNN-SE-FCN with chemical features (+chem) models. The last one is the model with our proposed architecture.
Looking at the precision, the highest score is achieved by the CNN-SE-FCN model without SE, with a score of 0.158. This suggests that this model has the highest proportion of true positive results among all the models but has a lower recall when compared to others. In terms of recall, the GBRT model performs the best with a score of 0.351, but shows very low precision.
The F-beta score, our main metric since it reflects production costs, is also highest for the CNN-SE-FCN model with chemical features (+chem), with a score of 0.265. This indicates that when giving more importance to recall, this model outperforms the others. Furthermore, we see a large drop in performance when no chemical features are used (−chem), highlighting the importance of including chemical and process features into time series models for industrial applications.
Figure 7 shows the Receiver Operating Curve; the area under the curve for feature extraction was 0.66, 0.70 for CNN-SE-FCN without chemical features (−chem), 0.72 for CNN-SE-FCN, and 0.71 for CNN-FCN. For lower false positive rates (<0.15), which are desired in our case given the high imbalance, GBRT, CNN, and FCN-CNN show similar performances, while FCN-CNN-SE provides a higher true positive rate. Although the ROC can be quite informative for classification tasks, in cases with a high class imbalance, it can be quite misleading [
22]. In our case, it enables a cost analysis, since at the factory the real cost of false positives and false negatives is known; the optimal threshold for a model can be selected using a cost function based on these values.
Figure 8 shows the precision–recall curve, as it better visualizes the highly imbalanced nature of this classification task. Here, we can verify that the GBRT has a higher precision when the recall is low (<0.1), but fails to maintain the performance at higher recall rates, its performance dropping. Meanwhile, CNN-SE-FCN shows a better performance with a higher recall.
5. Summary and Discussion
A data-driven approach for multivariate time series combined with numerical static data was presented in this paper; the goal was to detect longitudinal cracks in steel slabs at the continuous caster. Thermocouple data from 68 sensors were combined with chemical and process information. We evaluated the performance of our models compared to a more classic industrial approach, and we also evaluated the performance gain when adding process information alongside the multivariate time series data.
Our two baseline models for comparison were the one currently in use at ArcelorMittal Belgium and a rule-based differential model that was inspired by the current literature. GBRT was used as the classifier because it meets several criteria required by industry, mainly in its ability to handle large input spaces and highly imbalanced data while being a gray box model, where feature importance can be inferred after the training process for a better understanding of process behavior.
Our choices of the baseline model, to not shuffle the data, and the resampling techniques for the training set were based on our previous work [
4], where, based only on feature extraction methods, we demonstrated the potential pitfalls of shuffling data when creating training/testing splits. Shuffling can inadvertently lead to data leakage, where information from the test set leaks into the training set, thereby overestimating the model’s performance. This is particularly problematic in time-series data, where shuffling can disrupt the inherent temporal order of observations. Furthermore, we explored the impact of data imbalance on model training. Imbalanced datasets, where one class significantly outnumbers the other, can bias the model towards the majority class, resulting in poor predictive performance for the minority class. We highlighted the importance of using appropriate techniques, such as resampling or using class weights, to mitigate the effects of data imbalance and ensure robust model training.
The experiments conducted here have shown that more complex models, such as CNN-SE-FCN, can outperform classic machine learning techniques, while the addition of the Squeeze and Excitation blocks has slightly improved model performance via an additional 5% more trainable parameters, and the addition of SE blocks also improved training times. Our model showed a 15% performance gain (higher ) over our baseline models. We also showed that removing the chemical features had the largest negative impact.
We have shown that our model is robust in an industrial setting, showing little decay in detection numbers over time. The recommended model updating period would be every 6 months; this process involves retraining the model with new data, either by including new data in the existing dataset or by adjusting the model based on the new data alone.
Optimizing the detection of defects early in the production chain is essential to enhance steel quality and guarantee the supply chain within the factory, as the cost of a defect significantly grows the longer it stays in production, wasting time and material. Evidence of which variables are more significant for detection is valuable as it compels the search for more complex approaches, which can further improve prediction. Additionally, our work can serve as a stepping stone for future research that may explore adaptable metric frameworks or tackle the challenges posed by class imbalance and cost considerations in predictive modeling in industrial domains. Based on the outcome of this research, the factory aims to further enhance the currently used model with the most important features. Feature engineering is also a topic to be discussed, as several signals have a larger influence, e.g., sensors at the center of the mold are more relevant since longitudinal cracks are more common in that region.
Author Contributions
Conceptualization, D.A.S. and A.V.Y.-D.D.; methodology, D.A.S.; software, D.A.S.; validation, D.A.S., A.V.Y.-D.D. and A.J.; formal analysis, D.A.S.; investigation, D.A.S.; resources, D.A.S. and A.V.Y.-D.D.; data curation, D.A.S. and A.V.Y.-D.D.; writing—original draft preparation, D.A.S.; writing—review and editing, A.V.Y.-D.D. and A.J.; visualization, D.A.S.; supervision, A.J. and E.M.; project administration, A.J. and E.M.; funding acquisition, D.A.S. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by Flanders Innovation & Entrepreneurship (VLAIO), in cooperation with ArcelorMittal and Ghent University–imec under grant HBC.2019.2173.
Data Availability Statement
The data are not publicly available due to legal or ethical reasons.
Acknowledgments
We would like to thank Bart Soenen from ArcelorMittal Belgium for their valuable and constructive suggestions during the planning and development of this research work.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Thomas, B.G. Review on Modeling and Simulation of Continuous Casting. Steel Res. Int. 2018, 89, 1700312. [Google Scholar] [CrossRef]
- Laha, D.; Ren, Y.; Suganthan, P.N. Modeling of steelmaking process with effective machine learning techniques. Expert Syst. Appl. 2015, 42, 4687–4696. [Google Scholar] [CrossRef]
- Landstrom, A.; Thurley, M.J. Morphology-based crack detection for steel slabs. IEEE J. Sel. Top. Signal Process. 2012, 6, 866–875. [Google Scholar] [CrossRef]
- Sala, D.A.; Jalalvand, A.; Van Yperen-De Deyne, A.; Mannens, E. Combining Time Series and Chemical Composition Data for Early Longitudinal Crack Detection at the Continuous Caster. In Proceedings of the 2022 Iron and Steel Technology Conference, Pittsburgh, PA, USA, 16–18 May 2022; pp. 711–718. [Google Scholar]
- Gao, Y.; Wang, X.; Liu, Y.; Sun, Y.; Du, F.; Wang, F.; Wang, J. A Visual Detection Method of Longitudinal Crack Based on Computer Image Processing During Slab Continuous Casting. Arch. Metall. Mater. 2018, 63, 673–680. [Google Scholar]
- Meilland, P.; Metz, M. Novel multiplexed eddy-current array for surface crack detection on rough steel surface. In Proceedings of the 9th ECNDT, Berlin, Germany, 25–29 September 2006; pp. 1–9. [Google Scholar]
- Nishimine, T.; Tsuyama, O.; Tanaka, T.; Fujiwara, H. Automatic magnetic particle testing system for square billets. In Proceedings of the IAS’95, Conference Record of the 1995 IEEE Industry Applications Conference Thirtieth IAS Annual Meeting, Orlando, FL, USA, 8–12 October 1995; Volume 2, pp. 1585–1590. [Google Scholar]
- Allazadeh, M.; Garcia, C.; Alderson, K.; Deardo, A. Ultrasonic Image Analysis of Steel Slabs. Adv. Mater. Process. 2008, 166, 26–27. [Google Scholar]
- Sirgo, J.A.; Campo, R.; Lopez, A.; Diaz, A.; Sancho, L. Measurement of centerline segregation in steel slabs. In Proceedings of the Conference Record of the 2006 IEEE Industry Applications Conference Forty-First IAS Annual Meeting, Tampa, FL, USA, 8–12 October 2006; Volume 1, pp. 516–520. [Google Scholar]
- Jin, C.; Kong, X.; Chang, J.; Cheng, H.; Liu, X. Internal crack detection of castings: A study based on relief algorithm and Adaboost-SVM. Int. J. Adv. Manuf. Technol. 2020, 108, 3313–3322. [Google Scholar] [CrossRef]
- Veitch-Michaelis, J.; Tao, Y.; Walton, D.; Muller, J.; Crutchley, B.; Storey, J.; Paterson, C.; Chown, A. Crack detection in “as-cast” steel using laser triangulation and machine learning. In Proceedings of the 2016 13th Conference on Computer and Robot Vision (CRV), Victoria, BC, Canada, 1–3 June 2016; pp. 342–349. [Google Scholar]
- Liu, Y.; Wang, X.; Du, F.; Yao, M.; Gao, Y.; Wang, F.; Wang, J. Computer vision detection of mold breakout in slab continuous casting using an optimized neural network. Int. J. Adv. Manuf. Technol. 2017, 88, 557–564. [Google Scholar] [CrossRef]
- Sala, D.A.; Jalalvand, A.; Van Yperen-De Deyne, A.; Mannens, E. Multivariate time series for data-driven endpoint prediction in the basic oxygen furnace. In Proceedings of the 2018 17th IEEE International Conference on Machine Learning and Applications (ICMLA), Orlando, FL, USA, 17–20 December 2018; pp. 1419–1426. [Google Scholar]
- Ma, L.; Dong, J.; Peng, K.; Zhang, K. A novel data-based quality-related fault diagnosis scheme for fault detection and root cause diagnosis with application to hot strip mill process. Control Eng. Pract. 2017, 67, 43–51. [Google Scholar] [CrossRef]
- Kim, K.H.; Yeo, T.J.; Oh, K.H.; Lee, D.N. Effect of carbon and sulfur in continuously cast strand on longitudinal surface cracks. ISIJ Int. 1996, 36, 284–289. [Google Scholar] [CrossRef]
- Ji, C.X.; Cui, Y.; Zeng, Z.; Tian, Z.H.; Zhao, C.L.; Zhu, G.S. Continuous casting of high-Al steel in Shougang Jingtang steel works. J. Iron Steel Res. Int. 2015, 22, 53–56. [Google Scholar] [CrossRef]
- Duan, H.; Wei, J.; Qi, L.; Wang, X.; Liu, Y.; Yao, M. Longitudinal Crack Detection Approach Based on Principal Component Analysis and Support Vector Machine for Slab Continuous Casting. Steel Res. Int. 2021, 92, 2100168. [Google Scholar] [CrossRef]
- Hu, J.; Shen, L.; Sun, G. Squeeze-and-Excitation Networks. In Proceedings of the 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2017; pp. 7132–7141. [Google Scholar]
- Hota, H.; Handa, R.; Shrivas, A. Time series data prediction using sliding window based RBF neural network. Int. J. Comput. Intell. Res. 2017, 13, 1145–1156. [Google Scholar]
- Christ, M.; Braun, N.; Neuffer, J.; Kempa-Liehr, A.W. Time Series FeatuRe Extraction on basis of Scalable Hypothesis tests (tsfresh—A Python package). Neurocomputing 2018, 307, 72–77. [Google Scholar] [CrossRef]
- Flach, P.; Kull, M. Precision-Recall-Gain Curves: PR Analysis Done Right. In Proceedings of the Advances in Neural Information Processing Systems; Cortes, C., Lawrence, N., Lee, D., Sugiyama, M., Garnett, R., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2015; Volume 28. [Google Scholar]
- Branco, P.; Torgo, L.; Ribeiro, R.P. A survey of predictive modeling on imbalanced domains. ACM Comput. Surv. (CSUR) 2016, 49, 1–50. [Google Scholar] [CrossRef]
Figure 1.
Example of a visible longitudinal crack.
Figure 2.
Simplified diagram of the casting process and thermocouple location. The molten steel is poured from the ladle (1) into the tundish (2), and through a nozzle (3) it enters the mold (4). The mold walls are equipped with thermocouples (5).
Figure 3.
Time series of 10 thermocouple signals. Regarding longitudinal cracks on these samples: (a) shows a clear true positive; (b) shows an undetected defect; (c) a false positive where an alarm was given but no defect was present; and (d) normal behavior.
Figure 4.
Computation of the Squeeze and Excitation block following a CNN.
Figure 5.
In the FCN-CNN-SE model, both static and time-series data are processed in parallel. Once processed, the outputs from both blocks are concatenated. This combined output then passes through a final activation layer.
Figure 6.
Using the sliding window method to segment the original Multivariate Time Series into fixed-length segments. Each data segment corresponds to the time span highlighted by the solid black box, with intervals between sampled time steps represented by the dotted black boxes.
Figure 7.
Receiver Operating Characteristic (ROC) curves for models: feature extraction (GBRT), CNN-SE-FCN without chemical features (−chem), CNN-SE-FCN with chemical features, and CNN-FCN (no Squeeze and Excitation block). The curves illustrate the trade-off between the true positive rate and the false positive rate for each model.
Figure 8.
Precision–recall curves for various models: feature extraction, GBRT, CNN-SE-FCN without chemical features (−chem), CNN-SE-FCN with chemical features, and CNN-FCN (no Squeeze and Excitation block). The curves depict the trade-off between precision and recall for each model. The standard lines for are also shown.
Figure 9.
Relative indication of true positive (TP) and false negative (FN) detection over time, represented by full lines. Each data point represents 100 steel slabs. The dotted lines indicate the trend of TPs and FNs over the same period, providing a visual representation of the overall direction of the detections.
Table 1.
Comparison of precision, recall, F1, and scores across rule-based, GBRT, and CNN-SE-FCN models with the following variations: no chemical data (−chem), no Squeeze and Excitation (−SE), and the complete model (+chem).
| Baseline | CNN-SE-FCN |
---|
Rule-Based | GBRT | −chem | −SE | +chem |
---|
Precision | 0.118 | 0.074 | 0.070 | 0.158 | 0.123 |
Recall | 0.271 | 0.351 | 0.303 | 0.291 | 0.331 |
F1 | 0.165 | 0.122 | 0.113 | 0.205 | 0.179 |
Fβ=2.4 | 0.228 | 0.226 | 0.203 | 0.259 | 0.265 |
| Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2023 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).