Author Contributions
Conceptualization, H.K.; methodology, H.K. and K.-S.M.; software, H.W.L.; validation, H.K., K.-S.M. and H.W.L.; formal analysis, H.K. and K.-S.M.; investigation, H.K.; resources, H.K.; data curation, H.K.; writing—original draft preparation, H.K.; writing—review and editing, H.K. and K.-S.M.; visualization, H.W.L.; supervision, H.K.; project administration, H.K.; funding acquisition, H.K. and K.-S.M. All authors have read and agreed to the published version of the manuscript.
Figure 1.
Schematic of (a) DNN and (b) RNN.
Figure 1.
Schematic of (a) DNN and (b) RNN.
Figure 2.
(a) All data X without clustering, (b) partitioning of X by k-means clustering, and (c) covering of X with ADS clustering. Each color represents the data with the same characteristics.
Figure 2.
(a) All data X without clustering, (b) partitioning of X by k-means clustering, and (c) covering of X with ADS clustering. Each color represents the data with the same characteristics.
Figure 3.
Schematic of the proposed ML-based prediction method using unsupervised clustering methods.
Figure 3.
Schematic of the proposed ML-based prediction method using unsupervised clustering methods.
Figure 4.
Feature importance from (a) DT, (b) RF, and (c) GB.
Figure 4.
Feature importance from (a) DT, (b) RF, and (c) GB.
Figure 5.
The blue solid line represents the accuracy,
in (
6), of the DT method for different
k values of
k-means for Zener diodes. The yellow dotted line represents the
, and the intersecting red dot represents the optimal value
.
Figure 5.
The blue solid line represents the accuracy,
in (
6), of the DT method for different
k values of
k-means for Zener diodes. The yellow dotted line represents the
, and the intersecting red dot represents the optimal value
.
Figure 6.
The blue solid line represents the accuracy (MRE) of DT for different radii of ADS for Zener diodes. The yellow dotted line represents the minimum value of MRE, and the intersecting red dot represents the optimal value .
Figure 6.
The blue solid line represents the accuracy (MRE) of DT for different radii of ADS for Zener diodes. The yellow dotted line represents the minimum value of MRE, and the intersecting red dot represents the optimal value .
Figure 7.
The values of the hyperparameters of DT without clustering and with k-means.
Figure 7.
The values of the hyperparameters of DT without clustering and with k-means.
Figure 8.
The histogram of predicted obsolescence dates for non-discontinued data using the proposed method with ADS and DT.
Figure 8.
The histogram of predicted obsolescence dates for non-discontinued data using the proposed method with ADS and DT.
Figure 9.
Distributions of MRE values of predictions using the DT method and naive statistic without clustering, with k-means clustering, and with ADS clustering.
Figure 9.
Distributions of MRE values of predictions using the DT method and naive statistic without clustering, with k-means clustering, and with ADS clustering.
Figure 10.
Distributions of RMSRE values of predictions using DT method and naive statistic without clustering, with k-means clustering, and ADS clustering.
Figure 10.
Distributions of RMSRE values of predictions using DT method and naive statistic without clustering, with k-means clustering, and ADS clustering.
Figure 11.
Distributions of (a) MRE and (b) RMSRE values of predictions using the RF method and naive statistic without clustering, with k-means clustering, and with ADS clustering.
Figure 11.
Distributions of (a) MRE and (b) RMSRE values of predictions using the RF method and naive statistic without clustering, with k-means clustering, and with ADS clustering.
Figure 12.
Distributions of MRE values of predictions using DT, RF, GB, DNN, and RNN methods (a) without clustering, (b) with k-means clustering, and (c) with ADS clustering.
Figure 12.
Distributions of MRE values of predictions using DT, RF, GB, DNN, and RNN methods (a) without clustering, (b) with k-means clustering, and (c) with ADS clustering.
Figure 13.
Distributions of RMSRE values of predictions using DT, RF, GB, DNN, and RNN methods (a) without clustering, (b) with k-means clustering, and (c) with ADS clustering.
Figure 13.
Distributions of RMSRE values of predictions using DT, RF, GB, DNN, and RNN methods (a) without clustering, (b) with k-means clustering, and (c) with ADS clustering.
Figure 14.
Comparison of 95% confidence intervals of predicted values using various methods. (a) Without clustering, (b) with k-means clustering, and (c) with ADS clustering.
Figure 14.
Comparison of 95% confidence intervals of predicted values using various methods. (a) Without clustering, (b) with k-means clustering, and (c) with ADS clustering.
Table 2.
Features of data of Zener diodes.
Table 2.
Features of data of Zener diodes.
Type | Features |
---|
Numeric | Power Dissipation, Reverse Zener Voltage (Min), Reverse Zener Voltage (Max), Test Current, Zener Impedance (Max), Zener Impedance at IZK, Maximum Zener Current, Reverse Leakage Current at VR, Reverse Voltage, Forward Voltage, Voltage Tolerance, Forward Current, Diode Capacitance, Operating Temperature (Min), Operating Temperature (Max), Number of Terminals |
Categorical | Part Number, Mfr Name, Description, Polarity, ESD Protection, Temperature Coefficient, EU RoHS, Halogen Free, Package Code, Soldering Surface Treatment, Mounting Type, JESD-30 Code, Package Body Material, Package Shape, Package Style, Terminal Form, Terminal Position, Temperature Grade, Part Status, Part Introduction, and Obsolete Date (LTB Date) |
Table 3.
Example of Zener diode data.
Table 3.
Example of Zener diode data.
Feature | Value |
---|
Zener Impedance at IZK () | 2000.0 |
Forward Voltage (V) | 1.2 |
Part Number | 1N4761A |
Description | ZENER DIODE, 1 W, 75 V@3 MA, 5% |
ESD Protection | Unknown |
Operating Temperature (Max) () | 200 |
Mounting Type | Through Hole |
Mfr Name | CENTRAL SEMICONDUCTOR CORP. |
Power Dissipation (Max) (W) | 1.0 |
Package Code | DO-41 |
Table 4.
Statistics of certain numeric features of discontinued Zener diodes.
Table 4.
Statistics of certain numeric features of discontinued Zener diodes.
| Count | Mean | Std | Min | 25% | 50% | 75% | Max |
---|
Power Dissipation (Max) (W) | 2366 | 2.52 | 2.49 | 0.12 | 0.50 | 1.0 | 5.00 | 10.0 |
Reverse Zener Voltage (Min) (V) | 1051 | 26.70 | 36.66 | 1.80 | 5.60 | 12.4 | 28.50 | 190.0 |
Reverse Zener Voltage (Max) (V) | 2366 | 42.23 | 52.44 | 1.80 | 8.53 | 20.0 | 53.16 | 270.0 |
Test Current (mA) | 2366 | 32.61 | 61.26 | 0.25 | 5.00 | 10.0 | 30.00 | 640.0 |
Zener Impedance (Max) () | 2366 | 110.81 | 240.05 | 1.00 | 9.00 | 30.0 | 100.00 | 2500.0 |
Zener Impedance at IZK (Max) () | 1936 | 1129.16 | 1361.70 | 60.00 | 400.00 | 700.0 | 1300.00 | 8000.0 |
Maximum Zener Current (mA) | 1244 | 200.44 | 288.66 | 1.54 | 31.60 | 85.0 | 264.00 | 2380.0 |
Reverse Leakage Current @ VR (uA) | 2366 | 8.73 | 22.78 | 0.05 | 1.00 | 2.0 | 5.00 | 300.0 |
Reverse Voltage (V) | 2366 | 31.16 | 39.76 | 0.50 | 6.00 | 15.0 | 38.80 | 206.0 |
Forward Voltage (V) | 1811 | 1.29 | 0.21 | 0.90 | 1.20 | 1.2 | 1.50 | 1.5 |
Voltage Tolerance (Max) (%) | 2366 | 4.62 | 2.56 | 1.00 | 2.38 | 5.0 | 5.00 | 20.0 |
Forward Current (Max) (mA) | 1811 | 437.88 | 395.93 | 2.00 | 200.00 | 200.0 | 1000.00 | 1000.0 |
Diode Capacitance @ f = 1 MHz, VR = 0 V (pF) | 157 | 178.23 | 148.24 | 19.00 | 70.00 | 130.0 | 225.00 | 450.0 |
Operating Temperature (Min) () | 2366 | −62.52 | 4.32 | −65.00 | −65.00 | −65.0 | −65.00 | −55.0 |
Operating Temperature (Max) () | 2366 | 171.56 | 16.36 | 125.00 | 150.00 | 175.0 | 175.00 | 200.0 |
Number of Terminals | 2366 | 1.98 | 0.49 | 0.00 | 2.00 | 2.0 | 2.00 | 4.0 |
Table 5.
Hyperparameters for unsupervised learning algorithms.
Table 5.
Hyperparameters for unsupervised learning algorithms.
Method | Hyperparameter | Value |
---|
k-means | Number of clusters (k) | 5 |
ADS | Radius of cover (r) | |
Table 6.
Hyperparameters for supervised learning algorithms.
Table 6.
Hyperparameters for supervised learning algorithms.
DT | Definition | Values |
---|
max_depth | Maximum depth of tree | None, 2, 4, 6, 8 |
min_samples_split | Minimum number of samples required to split internal node | 2, 4, 6, 8 |
min_samples_leaf | Minimum number of samples required to be at leaf node | 2, 3, 4, 5 |
max_leaf_nodes | Maximum number of terminal nodes | 20, 40, 60, unlimited |
RF | Definition | Values |
min_samples_split | Minimum number of samples required to split internal node | 2, 3, 4, 5 |
n_estimators | Number of trees in forest | 100, 150, 200 |
max_features | Number of features to consider when determining best split | auto, sqrt, log2 |
GB | Definition | Values |
learning_rate | Learning rate reduces contribution of each tree | 0.01, 0.1, 0.2 |
subsample | Fraction of samples to be used for fitting individual base learners | 0.5, 0.6, 0.7, 0.8, 0.9, 1 |
n_estimators | Number of boosting stages to perform | 100, 200, 300, 400, 500 |
max_depth | Maximum depth of individual regression estimators | 2, 4, 6, 8, 10 |
DNN | Definition | Values |
unit | Processors between input and output units in connectionist system | 32, 64 |
optimizer | Adjusts model weights to maximize loss function | Adam, Nadam, RMSprop |
dropout | Technique by which randomly selected neurons are ignored during training | 0, 0.1, 0.01 |
RNN | Definition | Values |
unit | Processors between input and output units in connectionist system | 32, 64 |
optimizer | Adjusts model weights to maximize loss function | Adam, Nadam, RMSprop |
dropout | Technique by which randomly selected neurons are ignored during training | 0, 0.1, 0.01 |
Table 7.
Comparison of widths of 95% confidence intervals of predicted values using various methods.
Table 7.
Comparison of widths of 95% confidence intervals of predicted values using various methods.
| Without_Clustering | k-Means | ADS |
---|
DT | 1.249144 | 0.716651 | 0.285660 |
RF | 1.384982 | 1.157487 | 0.216856 |
GB | 1.253258 | 1.046753 | 0.190099 |
DNN | 1.630955 | 1.753372 | 0.699891 |
RNN | 1.946344 | 1.614736 | 0.266980 |