1. Introduction
Microstructures on the surface of 40Cr13 are widely used in the die industry owing to their properties, such as being aesthetically appealing, tribological, and easy to detach from injection points [
1,
2]. However, it is difficult to precisely process the ideal microstructure on the surface of 40Cr13 because of its hardness and strength. Laser ablation is a non-contact processing method that uses a focused spot of high energy density to instantly vaporize surface materials and is becoming an effective means of processing surface microstructures on hard-to-machine material [
3,
4]. The processing quality of laser ablation depends on the selection of process parameters, such as laser power, pulse frequency, and scanning speed [
5]. Based on theoretical analysis and experimental results, choosing the right combination of process parameters can significantly improve the quality of laser ablation [
6]. However, owing to the numerous combinations of process parameters in the laser ablation process, optimizing the process parameters through a trial-and-error approach can be time consuming. Therefore, finding a suitable method of improving optimal laser processing parameter selection to ultimately achieve precision laser ablation has become an urgent issue.
Machine learning has developed rapidly since it was proposed and has been successfully applied in the prediction of laser processing results [
7]. Chen et al. [
8] proposed a supervised machine learning approach to detect trajectory defects in selective laser melting, four metrics were taken as input variables to the model to quantitatively assess the quality of the trajectory. They believe that the machine learning model can significantly improve the search efficiency of process parameter windows and has great application potential in future unmanned factories. Yousef et al. [
9] proposed a neural network that can be used to model a nonlinear laser micromachining process, which can help to predict the pulse energy level required to produce microstructures with the desired depth and diameter. Teixidor et al. [
10] used three types of machine learning models to predict the process results of surface features. The inputs of these models are the scan speed, pulse intensity, and pulse frequency while the outputs are depth, width, surface roughness, and material removal rate (MRR). Jimin et al. [
11] used a multi-layer neural network to predict the processing results of laser cutting. The inputs of the model included not only the typical parameters but also the slant angle of the laser head. Further, they applied the radial basis function (RBF) neural network to predict the processing results of stainless-steel sheets [
12]. Dhupal et al. [
13] applied artificial neural network (ANN) and response surface methodology to optimize the machining features of the pulsed Nd:YAG laser for microgrooving on aluminum titanate. The experimental results show that the prediction results of the two methods are consistent. Later, the genetic algorithm was applied to further adjust the model, which improved the prediction accuracy [
14]. However, the calculation time of the algorithm was inevitably increased. Ciurana et al. [
15] applied the ANN for predicting the quality of 3D geometrical features on AISI H13 steel and used the particle swarm optimization algorithm to improve the accuracy. The back propagation (BP) neural network, which is another typical prediction model, was successfully applied to optimize the process parameters of laser micro-machining [
16,
17].
Despite the successful application of the above machine learning models to the prediction of laser ablation surface features, a few limitations still exist. First, the above-mentioned models require a large amount of experimental data for training to achieve high prediction accuracy. In addition, the computational efficiency of these algorithms is relatively low and the training time of the models is on the second level, making them unsuitable for real-time applications.
The technological test of laser ablated 40Cr13 surfaces is typically time-consuming and laborious, and it is difficult to obtain a large amount of data to train a prediction model. Therefore, an accurate and efficient method that can be used to predict surface features with limited experimental data is required. To the best of our knowledge, there are only a few scientific studies on the laser ablation of 40Cr13 to obtain target surface features. Our objective is to bridge this knowledge gap and improve the laser ablation quality of superhard materials such as 40Cr13 stainless steel.
This paper presents an extreme learning machine of single-track scanning (ELMSS) and plane scanning (ELMPS) for the prediction of the surface features, which improve prediction accuracies and significantly reduce the training time.
The novelties of this study are that the extreme learning machine is applied to the prediction of the laser processing surface features for the first time, all the unknown parameters in the ELMSS and ELMPS are determined, and the accurate prediction of the surface feature of nanosecond laser processing 40Cr13 material is realized. The practical significance of the presented methodology is that researchers can use the proposed models to accurately predict the dimension of surface features according to the input laser processing process parameters. If the dimension cannot meet the tolerance requirements, the process parameters must be modified. In this way, In this way, trial and error experiments do not need to be conducted, which reduces the time and labor costs.
The rest of this paper is organized as follows:
Section 2 introduces the experimental details.
Section 3 presents the establishment of the prediction models.
Section 4 compares the prediction accuracies and algorithm efficiency of presented models and those of two other ANN models.
Section 5 utilizes the genetic algorithm to further improve the prediction accuracies of ELMSS and ELMPS. Finally,
Section 6 presents the concluding remarks of the study.
2. Experiment Details
2.1. Material
40Cr13 stainless steel was used in this study, and its main components are listed in
Table 1. The chemical composition analysis was conducted by suppliers (Yuzhixiang Material Co., Suzhou, China). 40Cr13 has high hardness, abrasion resistance, and corrosion resistance and is widely used to manufacture high-end precision molds. The surfaces of the samples were milled and ground before the technological tests to eliminate potential influences of the initial surface roughness on the machining results [
17].
2.2. Experimental Platform
The experimental platform is shown in
Figure 1. This study used a nanosecond pulse laser generator (FORMULA-355, Advanced Optowave Co., New York, NY, USA) with a wavelength of 355 nm, and its main parameters are listed in
Table 2. The output spot of the laser generator enters a 3D galvanometer scanner (G3-3D, JCZ Technology Co., Beijing, China) with a focal length of 290 mm through a beam expander (BEX-355-6X, Wavelength OE Co., Nanjing, China) and uses an F-theta field lens (SL-355, Wavelength OE Co., Nanjing, China) to focus the laser spot on the surface of the samples. The diameter of focus spot is 19.38 μm. The laser power irradiating on the surfaces was measured with a power meter (Ophir 30A-SV-17, Thorlabs Co., Newton, MA, USA) [
17].
2.3. Technological Test Methods
To obtain excellent properties, micro-grooves and planes are taken as the two typical microstructures on the surface of the dies. These two structures are usually obtained by single-track and plane scanning, respectively. The details of these methods are given below.
2.3.1. Single-Track Scanning
Single-track scanning is realized to ensure that the laser spot linearly scans the surface of a sample, producing a micro-groove structure by laser ablation. Its process parameters include laser power (LP), pulse frequency (PF), and scanning speed (SS). The laser pulse accumulation of single-pass machining is shown in
Figure 2. Since the output power of a laser with a wavelength of 355 nm is significantly affected by the variation of the PF, the selection of the variation range of the process parameters must be reasonable. The process parameters of single-track scanning in this study are listed in
Table 3. The value of a single process parameter is changed each time during processing, and other parameters remain unchanged. A total of 48 micro-grooves are processed [
17].
2.3.2. Plane Scanning
Plane scanning is realized using the laser spot for reciprocating the scanning motion on the surface of a sample with a certain line spacing (LS). Its process parameters add the LS based on single-track scanning parameters. The laser pulse accumulation of plane scanning is shown in
Figure 3. The variation range of the process parameters is listed in
Table 4. Similarly, the value of a single process parameter is changed during each processing while others remain fixed. A total of 81 planes are formed.
2.4. Features Measurement
The cross-section of the microgroove structure produced by single-track laser ablation is approximately V-shaped owing to the Gaussian distribution of the laser beam [
18]. The micro-groove of single-track scanning was measured by laser confocal microscopy (LEXT OLS5100, Olympus Co., Tokyo, Japan), and a typical micro-groove is shown in
Figure 4a where the cross-section of the micro-groove is observed along the A-A direction. As is shown in
Figure 4b, three features are defined on the cross-section: depth (D), width (W), and sectional area (SA). The material removal rate (MRR) is defined as
where SA denotes the sectional area of the micro-groove, SS denotes the scanning speed, and
t denotes the scanning time.
During the measurement of D, W, and MRR, three sections with a length of 1 mm were cut along each micro-groove, and the average value was taken as the measurement result of the three features.
The physical image of the surface obtained from the plane scanning is shown in
Figure 4c, and a typical surface morphology observed using LEXT OLS5100 is exemplified in
Figure 4d. A contact roughness meter (SJ-310, Mitutoyo Co., Kawasaki, Japan) was used to measure the surface roughness (Ra) along the direction shown in
Figure 4d. Ra was calculated as the average value of three measurements.
Measurement results of the four surface features—D, W, MRR, and Ra—obtained using the above mentioned methods are shown in
Table A1 of
Appendix A, which contains 48 samples of single-track scanning and 81 samples of plane scanning.
3. Establishment of the Prediction Model
ELM was proposed by Guang-Bin Huang et al. [
19] in 2004 and was soon applied to function fitting and classification problems [
20,
21,
22]. It is part of a single-hidden-layer feedforward neural network. A typical structure of ELM is shown in
Figure 5, consisting of input, hidden, and output layers. Assuming that
were the neuron numbers of the input, hidden, and output layers, respectively, the connection weight matrix between the input and hidden layers is
where
w represents the connection weight between the
i neuron in the input layer and
j neuron in the hidden layer.
The connection threshold matrix of the hidden layer is
where
b represents the connection threshold of the
i neuron in the hidden layer.
The connection weight matrix between the hidden and output layers is
where
represents the connection weight between the
j neuron in the hidden layer and
k neuron in the output layer.
Assuming that there are
P samples in the training set data, the input and output matrices of ELM can be expressed, respectively, as
If
g(
x) denotes the activation function, the output matrix of the hidden layer then becomes
Guang-Bin Huang et al. [
19] pointed out that, when the activation function
g(
x) is infinitely differentiable, the connection weight between the hidden and output layers can be obtained using:
where
H is the Moore–Penrose inverse of the hidden layer output matrix.
The connection weight (
W) and threshold (
b) between the input and hidden layers can be randomly generated before training and remain unchanged during the training process. After the activation function
g(
x) is determined before training and the connection weight (
) is solved using Equation (
8), all the parameters of ELM are determined. The entire training process does not require error back-propagation and iterative calculation. Hence, the training time of ELM can be greatly reduced on the premise of ensuring prediction accuracy.
Assuming that there are
Q samples in the test set data, the prediction result of ELM is:
where
w and
b are the preset connection weights and thresholds, respectively,
is the calculation result of training, and
x is the input data of the test set.
The ELM model for predicting the surface features was developed using MATLAB R2018a software. The single-track scanning (ELMSS) model proposed for feature prediction is shown in
Figure 6a. LP, PF, and SS were set as the input layer parameters, and D, W, and MRR as the output layer parameters. The number of neurons in the hidden layer was 33, and the activation function was “sigmoid.” The plane scanning (ELMPS) model is shown in
Figure 6b. LS was added to the input layer parameters, and the output layer parameter was changed to Ra. The number of neurons in the hidden layer was 60, and the activation function was again “sigmoid”.
Groups 1–45 of data in
Table A1 of
Appendix A were treated as the training set and groups 46–48 as the test set. Based on the ELM model, the surface features were predicted through the following process.
First, the technological test was carried out according to the designed parameters, and the machining result data of the surface features were obtained.
Second, the data were categorized into training and test sets after they were normalized, and the number of neurons in the hidden layer was determined. The connection weights (W) and thresholds (b) between the input layer and the hidden layer were randomly set.
Third, an infinitely differentiable activation function was selected, and the hidden layer output matrix (H) combining the data of the training sets was calculated.
Fourth, the connection weights () between the hidden and output layers were calculated to complete the training of the ELM model.
Finally, the test set data were inputted into the trained model, the predicted results were compared with the measured results, and the prediction accuracy of the ELMSS and ELMPS was analyzed.
The ELMSS and ELMPS are based on a typical perceptron with one hidden layer. The innovation of this paper is to find the most suitable number of neurons in the hidden layer used to predict the surface feature of laser processing and use the genetic algorithm to optimize the connection weight and threshold to obtain the highest prediction accuracy.
4. Prediction Results of ELMSS and ELMPS
To verify the accuracy of the ELMSS, the prediction results were compared with that of BP [
16] and RBF [
12] neural network machine learning models for the same training and test data sets. For the prediction models of single-track scanning, the network structure of the BP neural network is 3-6-3, which is the network structure with the highest prediction accuracy. The activation function of BP neural network is “tansig,” which is infinitely differentiable. The expression of the function is:
The spread value of the RBF neural network is set to 1.0. The measured values and predicted results of D, W, and MRR after training with the same data are shown in
Table 5.
The average prediction accuracy is defined as
From the calculation for prediction accuracy, ELMSS has the highest accuracy for D, reaching 92.6%, followed by that of the BP and RBF neural networks, which are 89.8% and 89.6%, respectively. The prediction accuracies of the three models for W are relatively high. Among them, the ELMSS has the highest prediction accuracy of 98.3%, followed by the BP and RBF neural networks, with prediction accuracies of 97.4% and 96.1%, respectively. The prediction accuracy ranking for MRR is ELMSS (92.4%), BP neural network (91.1%), and RBF neural network (89.1%), respectively.
Similarly, to verify the accuracy of the ELMPS, the prediction results were compared with two other models, the network structure of the BP neural network is 4-3-1, the activation function is “tansig,” and the spread value of the RBF is still set to 1.0. The measured values and prediction results of Ra by the three models are shown in
Table 6.
The average prediction accuracy of plane scanning Ra was also calculated. The average prediction accuracies of the three models for Ra were relatively low. ELMPS had the highest average accuracy of 86.8%, and the average prediction accuracies of the BP and RBF neural networks were 80.7% and 79.5%, respectively.
To test the generalization performance of ELMSS and ELMPS, 45 samples of the training set data in single-track scanning were treated as the input of the models, and 45 predicted values of D, W, and MRR were obtained. Further, the absolute error between the predicted and actual values was plotted in
Figure 7a–c. Then, 72 samples of the training set data in plane scanning were used as the input of the models, and 72 predicted values of Ra were obtained. The absolute error between the predicted and actual values is plotted in
Figure 7d.
Figure 7 shows that the absolute error of ELMSS and ELMPS is minimal compared to the other two models.
The training period of the models were used to compare the computing efficiency of the three models. The CPU performance of the computing platform was unified as Intel(R) Core(TM) i7-10750H CPU @ 2.60GHz. The values of the training time are listed in
Table 7. The BP neural network requires the longest calculation time, because the training process of the model needs error back propagation and iteration. The RBF neural network belongs to the forward network, but the calculation time is longer than that of ELMSS and ELMPS. In summary, the computational efficiency of the prediction models proposed in this study have been improved from tens to hundreds of times compared with the other two types of machine learning models, providing the possibility of using the ELMSS and ELMPS in real-time applications.
5. Optimization of ELMSS and ELMPS Based on Genetic Algorithm
The connection weights and thresholds from the input layer to the hidden layer of the ELMSS and ELMPS models are randomly determined, which leads to unstable prediction results. This section applied genetic algorithm to optimize the parameters of prediction models, in order to obtain the optimal connection weights and thresholds, thereby improving the prediction accuracy of the models.
When optimizing the ELMSS and ELMPS models, the genetic algorithm first encodes the connection weights and thresholds of the model to form a certain number of individuals, and then sets the solutions that may have optimal solution individuals to form the population to be optimized. After the population is formed, individuals are screened through selection, crossover, and mutation operations according to the selected fitness function to complete a generation of evolution. Each generation retains the individual with the best fitness value, and generation-by-generation evolution produces approximate solutions with smaller and smaller errors. The flowchart of genetic-algorithm-optimized prediction models is shown in
Figure 8.
When performing a selection operation, the fitness function of the individual should be calculated first, and the calculation method is
Among them, X is the predicted value of the sample, is the actual value of the sample, and N is the number of samples in the test set.
Then, the probability that an individual is selected to remain in the next generation is
The crossover operation is to select a part of variables in two individuals to exchange at the same chromosome position. The
k individual
a and the
l individual
a crossover at position
j as follows:
Among them, b is a random number between 0 and 1, and the probability of crossover between each generation is 0.7.
The mutation operation is that an individual itself mutates at a certain chromosome position. The mutation operation of the
i individual at position
j is as follows
Among them, g is the current evolutionary generation, G is the maximum evolutionary generation, a and a are the maximum and minimum values of variables in the individual, respectively, and r is a random number between 0 and 1, and the probability of mutation between each generation is 0.01.
In ELMSS optimized by genetic algorithm (GA-ELMSS), the maximum number of genetic iterations is set to 100, and the change process of prediction errors are shown in
Figure 9.
The connection weights and thresholds of GA-ELMSS are shown in
Table A2 of
Appendix A. At this point, all of the parameters of the ELMSS have been determined, which can be directly applied to the prediction of single-pass machining. Groups 1–45 of data in
Table A1 of
Appendix A are treated as the training set, and groups 46–48 as the test set. The comparison of the prediction results of the ELMSS before and after the optimization of the genetic algorithm on D, W and MRR are shown in
Figure 10.
The prediction accuracies are calculated according to Equation (
12), and the results are shown in
Table 8. It was found that the genetic algorithm can further improve the prediction accuracies of the GA-ELMSS.
In ELMPS optimized by genetic algorithm (GA-ELMPS), the maximum number of genetic iterations is also set to 100, and the change process of prediction errors are shown in
Figure 11.
The connection weights and thresholds of GA-ELMSPS are shown in
Table A3 of
Appendix A. Similarly, groups 1–45 of data in
Table A1 of
Appendix A are treated as the training set, and groups 46–48 as the test set. The comparison of the prediction results of the ELMPS before and after the optimization of the genetic algorithm on Ra are shown in
Figure 12.
The prediction accuracy is also calculated according to Equation (
12), and the results are shown in in
Table 9. It was found that the genetic algorithm improves the prediction accuracy of the ELMPS model, with greater improvement than that of ELMSS.
6. Conclusions
Surface feature prediction models for laser ablated 40Cr13 stainless steel based on ELM are proposed in this paper. The results of D, W, and MRR processed by single-track scanning were predicted by the ELMSS model, and the results of Ra processed by plane scanning were predicted by the ELMPS model. The prediction accuracy and algorithm efficiency of the proposed models are superior to the BP [
16] and RBF [
12] neural networks. The main conclusions are summarized as follows:
The ELMSS model has the highest prediction accuracy for the W, with an average prediction accuracy of 98.3%, followed by D and MRR, which are 92.6% and 92.4%, respectively. Meanwhile, the prediction accuracy for Ra of ELMPS is relatively low, with an average prediction accuracy of 86.8%. Therefore, the models proposed in this paper are relatively more suitable for predicting surface feature of single-pass scanning.
The prediction accuracies of the ELMSS and ELMPS are higher than that of the BP and RBF neural networks. The prediction accuracies for D were 92.6%, 89.8%, and 89.6% for the ELMSS, BP, and RBF neural networks, respectively; they were 98.3%, 97.4%, and 96.1% for W, respectively; they were 92.4%, 91.1%, and 89.1% for MRR, respectively. The prediction accuracies for Ra were 86.8%, 80.7%, and 79.5% for the ELMPS, BP, and RBF neural networks, respectively. In general, the prediction accuracies of the models proposed in this paper are higher than those of the above two traditional ANN models.
The computation time of the ELMSS and ELMPS model is significantly lower than that of the other two machine learning methods. In terms of computational efficiency, the models can be ordered as ELMPS, ELMSS, RBF, and BP neural network. Moreover, the computation time of the model proposed in this paper is at the millisecond level, which can be used for the real-time control of laser processing scenarios.
Through genetic algorithm optimization, the prediction accuracies of ELMSS and ELMPS are further improved. The prediction accuracies of D, W, and MRR are improved from 92.6%, 98.3%, and 92.4% to 94.0%, 99.0%, and 93.2%, respectively. The prediction accuracy of Ra is improved from 86.8% to 91.2%.
In this study, ELMSS and ELMPS were used for the first time in the field of laser processing to accurately predict the features of the ablated surface. With the support of these two models, the results of surface features can be predicted in advance before processing, allowing appropriate process parameters to be selected. Operators do not need much experience or time for experimentation to produce a desirable product. Further, the algorithms of ELMSS and ELMPS are extremely efficient, and determining how to apply them to real-time control will be studied in the future.