Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning
Abstract
:1. Introduction
1.1. Literature Review
1.1.1. Traditional Transfer Learning for ECG Classification
1.1.2. Distant Transfer Learning Applications
1.2. Limitations of the Existing Works
- There was a lack of research on distant transfer learning for ECG classifications.
- The details of the design and the formulation of multiple-source datasets on the distant transfer learning process [18] were insufficient.
- There were limited discussions on negative transfer avoidance between source and target domains in the aspects of domain, instance, and feature.
1.3. Research Contributions of the Article
- Distant transfer learning was newly applied for ECG classifications. Six benchmark ECG datasets were selected for the research studies.
- With the unrestricted discipline of the source domain in distant transfer learning, generative-adversarial-network-based auxiliary domains were designed using both the source and target datasets.
- To minimize the risk of negative transfer from the source model to the target model, a domain-feature-classifier negative-transfer-avoidance algorithm was proposed to minimize loss for domain reconstruction, feature extraction, and the classifier.
- The GANAD-DFCNTA algorithm improved the accuracy by 0.303–5.19% compared with existing works.
- An investigation was carried out on the extension of the GANAD-DFCNTA algorithm with multiple-source datasets. An evaluation showed that the target model can enhance the accuracy by 3.67 to 4.89% with multiple-source datasets.
- Ablation studies of the GANAD-DFCNTA algorithm revealed the improvement of the accuracy of the target model by 2.42–3.58%, 1.35–2.73%, 0.767–2.37%, and 1.72–2.90%, compared with DFCNTA, GANAD-FCNTA, GANAD-DCNTA, and GANAD-DFNTA algorithms, respectively.
1.4. Organization of the Article
2. Methodology
2.1. Overview of the Proposed Distant Transfer Learning Algorithm for ECG Classification
2.2. Generative-Adversarial-Network-Based Auxiliary Domains (GANAD) Algorithm
Algorithm 1: GAN-based auxiliary domain stage 1 |
for epoch i = 1:L do |
for D, steps j = 1:M do |
Sample minibatch of size p from the ground truth sample of the distant source dataset |
Sample minibatch of size p from the latent space |
Applying gradient ascent to D to solve the maximization problem: |
end |
for G, steps k = 1:N do |
Sample minibatch of size p from the latent space |
Applying gradient descent to G to solve the minimization problem: |
end |
end |
Algorithm 2: GAN-based auxiliary domain stage 2 |
for epoch i = 1:L do |
for D, steps j = 1:M do |
Sample minibatch of size p from the ground truth sample of the distant target dataset |
Sample minibatch of size p from the latent space |
Applying gradient ascent to D to solve the maximization problem: |
end |
for G, steps k = 1:N do |
Sample minibatch of size p from the latent space |
Applying gradient descent to G to solve the minimization problem: |
end |
end |
2.3. Domain-Feature-Classifier Negative-Transfer-Avoidance (DFCNTA) Algorithm
3. Benchmark Datasets and Performance Evaluation
3.1. Benchmark Dataset
3.2. Performance Evaluation and Analysis of the GANAD-DFCNTA Algorithm
3.2.1. Two Auxiliary Domains
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 92.8/93.7/93.3 | 93.8/94.5/94.2 with COCO | 94.6/95.5/95.1 with COCO and ImageNet | 95.7/96.4/96.1 with COCO, ImageNet, and Sentiment140 | 96.6/97.2/96.9 |
MIT-BIH Arrhythmia Database [33] | 94.4/95.1/94.8 | 96.3/96.8/96.6 with COCO | 97.5/98.1/97.8 with COCO and ImageNet | 98.4/99.1/98.8 with COCO, ImageNet, and Sentiment140 | 99.1/99.7/99.4 |
European ST-T Database [34] | 92.6/93.5/93.0 | 93.7/94.7/94.2 with ImageNet | 94.8/95.5/95.2 with COCO and ImageNet | 95.6/96.4/96.0 with COCO, ImageNet, and Sentiment140 | 96.4/97.1/96.8 |
Long-Term ST Database [35] | 94.4/93.6/94.1 | 95.5/94.5/95.1 with COCO | 96.3/95.4/95.9 with COCO and ImageNet | 97.1/96.3/96.8 with COCO, ImageNet, and WordNet | 97.9/97.0/97.5 |
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 91.4/92.2/91.9 | 92.0/92.5/92.3 with COCO | 93.1/93.8/93.5 with COCO and ImageNet | 94.5/95.1/94.9 with COCO, ImageNet, and Sentiment140 | 95.6/96.1/95.9 |
MIT-BIH Arrhythmia Database [33] | 92.6/93.4/93.0 | 94.5/95.2/94.8 with ImageNet | 95.8/96.5/96.1 with COCO and ImageNet | 96.7/97.5/97.1 with COCO, ImageNet, and Sentiment140 | 97.5/98.4/97.9 |
European ST-T Database [34] | 90.6/91.7/91.1 | 91.9/93.0/92.4 with ImageNet | 93.5/94.1/93.8 with COCO and ImageNet | 93.8/94.7/94.3 with COCO, ImageNet, and Sentiment140 | 94.8/95.4/95.1 |
Long-Term ST Database [35] | 92.3/91.7/92.0 | 93.5/92.7/93.1 with COCO | 94.7/93.8/94.3 with COCO and ImageNet | 95.5/94.3/95.0 with COCO, ImageNet, and WordNet | 96.3/95.2/95.8 |
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 90.7/91.6/91.2 | 91.3/92.1/91.8 with COCO | 92.4/93.0/92.7 with COCO and ImageNet | 93.9/94.4/94.2 with COCO, ImageNet, and Sentiment140 | 94.5/95.2/94.9 |
MIT-BIH Arrhythmia Database [33] | 92.0/92.7/92.3 | 93.6/94.4/94.0 with ImageNet | 96.1/94.9/96.4 with COCO and ImageNet | 96.2/97.0/96.5 with COCO, ImageNet, and Sentiment140 | 96.6/97.3/96.9 |
European ST-T Database [34] | 89.9/91.1/90.5 | 91.1/92.3/91.6 with COCO | 92.8/93.3/93.0 with COCO and ImageNet | 93.1/93.7/93.4 with COCO, ImageNet, and Sentiment140 | 93.6/94.3/93.9 |
Long-Term ST Database [35] | 91.8/91.3/91.5 | 92.1/92.5/92.3 with COCO | 93.9/93.1/93.5 with COCO and ImageNet | 93.3/94.2/93.7 with COCO, ImageNet, and WordNet | 95.4/94.4/94.9 |
- Distant transfer learning via the GANAD-DFCNTA algorithm improved the performance (specificity, sensitivity, and accuracy) of the baseline ECG classification model. With more source datasets, the performance of the model can further be enhanced. It is worth noting that the saturation of model performance may be reached at some point, depending on the similarities between the source and target datasets.
- The percentage improvement of the specificity, sensitivity, and accuracy in PTB-XL was: 1.08, 0.854, and 0.965% for one dataset; 1.94, 1.92, and 1.93% for two datasets; 3.13, 2.88, and 3.00% for three datasets; 4.09, 3.74, and 3.86% for four datasets; and 1.02, 0.935, and 0.965% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the MIT-BIH arrhythmia database was: 2.01, 1.79, and 1.90% for one dataset; 3.28, 3.15, and 3.19% for two datasets; 4.24, 4.21, and 4.22% for three datasets; 4.98, 4.84, and 4.89% for four datasets; and 1.25, 1.21, and 1.22% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the European ST-T database was: 1.19, 1.28, and 1.25% for one dataset; 2.38, 2.14, and 2.22% for two datasets; 3.24, 3.10, and 3.18% for three datasets; 4.10, 3.85, and 4.02% for four datasets; and 1.03, 0.963, and 1.00% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the long-term ST database was: 1.17, 0.962, and 1.07% for one dataset; 2.01, 1.92, and 1.96% for two datasets; 2.86, 2.88, and 2.87% for three datasets; 3.71, 3.63, and 3.67% for four datasets; and 0.928, 0.908, and 0.918% on average.
- The deviations between overall specificity and sensitivity with a varying number of datasets were 0.763% in PTB-XL, 0.613% in the MIT-BIH arrhythmia database, 0.842% in the European ST-T database, and 0.940% in long-term ST database.
- To better investigate the individual classes of highly imbalanced datasets [33], the overall deviations of the top five classes of the highest imbalanced ratios were 1.81% in Class 14, 1.45% in Class 13, 1.27% in Class 12, 1.13% in Class 11, and 1.03% in Class 10.
- As a remark, the baseline CNN algorithm serves as a common architecture that was adopted in many existing works. The main theme is the distant transfer learning process between distant multiple-source domains and target domains.
3.2.2. One Auxiliary Domain
- Sensitivity and specificity: The differences between sensitivity and specificity were 3.22% for [11] which suggests a slightly biased classification towards the majority class. The differences in our work were ranged from 0.621 to 0.928%. Other works [12,13,14,16] did not report the sensitivity and specificity.
3.3. Performance Comparison between GANAD-DFCNTA Algorithm and Existing Works
- Method: The basic architecture for ECG classification was typically the CNN, except [14] when using XGBoost. The CNN was a useful architecture that could automatically extract a feature and serve as a classifier.
- Source domain: In related works, the source domain was similar to the target domain in the field of ECG datasets. To the best of our knowledge, this work was the first work to consider distant transfer learning for ECG classifications with multiple distant source domains and target domains.
Work | Method | Source Domain | Target Domain | Cross- Validation | Ablation Study | Specificity (%) | Sensitivity (%) | Accuracy (%) |
---|---|---|---|---|---|---|---|---|
[11] | Continuous wavelet transform and CNN | ImageNet | MIT-BIH Arrhythmia Database | No | No | 96.2 | 99.3 | 99.1 |
[12] | RMSprop, Adam, SGDM optimizers, and CNN | ImageNet and COCO | MIT-BIH Arrhythmia Database | No | No | N/A | N/A | 97.0 to 99.5 |
[13] | Short-time Fourier transform and CNN | ImageNet | MIT-BIH Arrhythmia Database | No | Yes | N/A | N/A | 97.0 |
[14] | Continuous wavelet transform and XGBoost | COCO | MIT-BIH Arrhythmia Database and Long-Term ST Database | No | No | N/A | N/A | 98.3 |
[16] | Autoencoder and CNN | Dataset from different hospitals | MIT-BIH Arrhythmia Database and | 5-fold | Yes | N/A | N/A | 94.5% to 98.9% |
Our Work | GANAD-DFCNTA and CNN | ImageNet, COCO, WordNet, and Sentiment140 | PTB-XL | 5-fold | Yes | 96.6 | 97.2 | 96.9 |
MIT-BIH Arrhythmia Database, | 99.1 | 99.7 | 99.4 | |||||
European ST-T Database, and | 96.4 | 97.1 | 96.8 | |||||
Long-Term ST Database | 97.9 | 97.0 | 97.5 |
4. Ablation Studies
4.1. DFCNTA
- The percentage improvement of the specificity, sensitivity, and accuracy in PTB-XL was: 0.323, 0.213, and 0.276% for one dataset; 0.647, 0.640, and 0.643% for two datasets; 1.08, 0.961, and 0.989% for three datasets; 1.29, 1.07, and 1.18% for four datasets; and 0.323, 0.268, and 0.295% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the MIT-BIH arrhythmia database was: 0.424, 0.421, and 0.422% for one dataset; 0.742, 0.736, and 0.738% for two datasets; 0.953, 1.05, and 0.988% for three datasets; 1.27, 1.25, and 1.26% for four datasets; and 0.318, 0.313, and 0.315% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the European ST-T database was: 0.324, 0.428, and 0.382% for one dataset; 0.756, 0.749, and 0.752% for two datasets; 0.972, 1.07, and 0.995% for three datasets; 1.30, 0.962, and 1.09% for four datasets; and 0.325, 0.241, and 0.273% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the long-term ST database was: 0.424, 0.321, and 0.379% for one dataset; 0.742, 0.641, and 0.690% for two datasets; 0.953, 0.962, and 0.956% for three datasets; 1.27, 1.18, and 1.22% for four datasets; and 0.318, 0.295, and 0.305% on average.
- Consider four dataset-based scenarios: the GANAD-DFCNTA algorithm outperforms the DFCNTA algorithm by 2.77, 2.64, and 2.69% for PTB-XL; 3.66, 3.53, and 3.58% for the MIT-BIH arrhythmia database; 2.77, 2.53, and 2.71% for the European ST-T database; and 2.41, 2.43, and 2.42% for the long-term ST database.
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 92.8/93.7/93.3 | 93.1/93.9/93.5 | 93.4/94.3/93.9 | 93.8/94.6/94.2 | 94.0/94.7/94.4 |
MIT-BIH Arrhythmia Database [33] | 94.4/95.1/94.8 | 94.8/95.5/95.2 | 95.1/95.8/95.5 | 95.3/96.1/95.7 | 95.6/96.3/96.0 |
European ST-T Database [34] | 92.6/93.5/93.0 | 92.9/93.9/93.4 | 93.3/94.2/93.7 | 93.5/94.5/94.0 | 93.8/94.7/94.2 |
Long-Term ST Database [35] | 94.4/93.6/94.1 | 94.8/93.9/94.4 | 95.1/94.2/94.7 | 95.3/94.5/94.9 | 95.6/94.7/95.2 |
- The percentage improvement of the accuracy in Class 10 was: 29.9% for one dataset, 25.7% for two datasets, 25.6% for three datasets, 21.0% for four datasets, and 25.6% on average.
- The percentage improvement of the accuracy in Class 11 was: 28.5% for one dataset, 32.1% for two datasets, 31.6% for three datasets, 28.6% for four datasets, and 30.2% on average.
- The percentage improvement of the accuracy in Class 12 was: 40.3% for one dataset, 33.6% for two datasets, 34.7% for three datasets, 36.3% for four datasets, and 34.7% on average.
- The percentage improvement of the accuracy in Class 13 was: 44.8% for one dataset, 41.7% for two datasets, 43.8% for three datasets, 37.6% for four datasets, and 42.0% on average.
- The percentage improvement of the accuracy in Class 14 was: 153% for one dataset, 120% for two datasets, 56.8% for three datasets, 66.7% for four datasets, and 99.1% on average.
Accuracy (%) | |||||
---|---|---|---|---|---|
Algorithm | Class | One Dataset | Two Datasets | Three Datasets | Four Datasets |
GANAD-DFCNTA | Class 10 | 74.7 | 78.2 | 83.9 | 87.1 |
Class 11 | 70.3 | 75.7 | 81.6 | 84.0 | |
Class 12 | 68.9 | 73.1 | 78.8 | 82.3 | |
Class 13 | 62.7 | 68.3 | 74.5 | 77.9 | |
Class 14 | 47.5 | 55 | 58.8 | 62.5 | |
DFCNTA | Class 10 | 57.5 | 62.2 | 66.8 | 72.0 |
Class 11 | 54.7 | 57.3 | 62 | 65.3 | |
Class 12 | 49.1 | 54.7 | 58.5 | 60.4 | |
Class 13 | 43.3 | 48.2 | 51.8 | 56.6 | |
Class 14 | 18.8 | 25 | 37.5 | 37.5 |
4.2. GANAD-FCNTA
- The percentage improvement of the specificity, sensitivity, and accuracy in PTB-XL was: 0.539, 0.467, and 0.502% for one dataset; 0.970, 0.961, and 0.965% for two datasets; 1.40, 1.28, and 1.34% for three datasets; 1.51, 1.49, and 1.50% for four datasets; and 0.378, 0.373, and 0.375% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the MIT-BIH arrhythmia database was: 0.742, 0.736, and 0.739% for one dataset; 1.17, 1.16, and 1.16% for two datasets; 1.59, 1.79, and 1.69% for three datasets; 2.12, 2.10, and 2.11% for four datasets; and 0.53, 0.525, and 0.528% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the European ST-T database was: 0.540, 0.642, and 0.610% for one dataset; 1.19, 1.18, and 1.18% for two datasets; 1.84, 1.93, and 1.89% for three datasets; 2.59, 2.57, and 2.58% for four datasets; and 0.648, 0.643, and 0.645% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the long-term ST database was: 0.742, 0.641, and 0.689% for one dataset; 1.17, 1.07, and 1.11% for two datasets; 1.69, 1.71, and 1.70% for three datasets; 2.33, 2.24, and 2.27% for four datasets; and 0.583, 0.56, and 0.568% on average.
- Consider four dataset-based scenarios: the GANAD-DFCNTA algorithm outperformed the GANAD-FCNTA algorithm by 2.55, 2.21, and 2.32% for PTB-XL; 2.80, 2.68, and 2.73% for the MIT-BIH arrhythmia database; 1.47, 1.25, and 1.36% for the European ST-T database; and 1.35, 1.36, and 1.35% for the long-term ST database.
4.3. GANAD-DCNTA
- The percentage improvement of the specificity, sensitivity, and accuracy in PTB-XL was: 0.754, 0.534, and 0.643% for one dataset; 1.40, 1.39, and 1.39% for two datasets; 1.94, 1.81, and 1.86% for three datasets; 2.48, 2.13, and 2.25% for four datasets; and 0.62, 0.533, and 0.563% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the MIT-BIH arrhythmia database was: 0.847, 0.841, and 0.844% for one dataset; 1.48, 1.37, and 1.42% for two datasets; 2.12, 2.21, and 2.17% for three datasets; 2.54, 2.42, and 2.47% for four datasets; and 0.635, 0.605, and 0.618% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the European ST-T database was: 0.756, 0.856, and 0.802% for one dataset; 1.62, 1.93, and 1.81% for two datasets; 2.27, 2.46, and 2.35% for three datasets; 2.81, 2.99, and 2.88% for four datasets; and 0.703, 0.748, and 0.72% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the long-term ST database was: 0.953, 0.855, and 0.899% for one dataset; 1.69, 1.60, and 1.65% for two datasets; 2.33, 2.35, and 2.34% for three datasets; 2.86, 2.88, and 2.87% for four datasets; and 0.715, 0.72, and 0.718% on average.
- Consider four dataset-based scenarios: the GANAD-DFCNTA algorithm outperformed the GANAD-FCNTA algorithm by 1.58, 1.57, and 1.57% for PTB-XL; 2.38, 2.36, and 2.37% for the MIT-BIH arrhythmia database; 1.26, 0.831, and 1.11% for the European ST-T database; and 0.824, 0.727, and 0.767% for the long-term ST database.
4.4. GANAD-DFNTA
- The percentage improvement of the specificity, sensitivity, and accuracy in PTB-XL was: 0.431, 0.320, and 0.375% for one dataset; 0.862, 0.747, and 0.785% for two datasets; 1.19, 1.07, and 1.13% for three datasets; 1.51, 1.39, and 1.45% for four datasets; and 0.378, 0.348, and 0.363% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the MIT-BIH arrhythmia database was: 0.636, 0.631, and 0.633% for one dataset; 1.06, 1.05, and 1.05% for two datasets; 1.48, 1.68, and 1.58% for three datasets; 1.80, 2.00, and 1.90% for four datasets; and 0.45, 0.5, and 0.475% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the European ST-T database was: 0.432, 0.535, and 0.538% for one dataset; 0.972, 1.07, and 1.02% for two datasets; 1.51, 1.60, and 1.55% for three datasets; 1.94, 2.03, and 1.98% for four datasets; and 0.485, 0.508, and 0.495% on average.
- The percentage improvement of the specificity, sensitivity, and accuracy in the long-term ST database was: 0.530, 0.534, and 0.532% for one dataset; 1.06, 1.07, and 1.06% for two datasets; 1.48, 1.50, and 1.49% for three datasets; 1.91, 1.92, and 1.91% for four datasets; and 0.478, 0.48, and 0.479% on average.
- Consider four dataset-based scenarios: the GANAD-DFCNTA algorithm outperformed the GANAD-DFNTA algorithm by 2.55, 2.32, and 2.43% for PTB-XL; 3.12, 2.78, and 2.90% for the MIT-BIH arrhythmia database; 2.12, 1.78, and 2.00% for the European ST-T database; and 1.77, 1.68, and 1.72% for the long-term ST database.
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Liu, Y.; Chen, J.; Bao, N.; Gupta, B.B.; Lv, Z. Survey on atrial fibrillation detection from a single-lead ECG wave for Internet of Medical Things. Comp. Commun. 2021, 178, 245–258. [Google Scholar] [CrossRef]
- Lih, O.S.; Jahmunah, V.; San, T.R.; Ciaccio, E.J.; Yamakawa, T.; Tanabe, M.; Kobayashi, M.; Faust, O.; Acharya, U.R. Comprehensive electrocardiographic diagnosis based on deep learning. Art. Intell. Med. 2020, 103, 101789. [Google Scholar] [CrossRef] [PubMed]
- Chui, K.T. Driver stress recognition for smart transportation: Applying multiobjective genetic algorithm for improving fuzzy c-means clustering with reduced time and model complexity. Sustain. Comput. Inform. Syst. 2022, 35, 100668. [Google Scholar] [CrossRef]
- Hasan, M.M.; Watling, C.N.; Larue, G.S. Physiological signal-based drowsiness detection using machine learning: Singular and hybrid signal approaches. J. Saf. Res. 2022, 80, 215–225. [Google Scholar] [CrossRef] [PubMed]
- Peng, J.; Yang, B.; Gupta, B.B.; El-Latif, A.; Ahmed, A. A biometric cryptosystem scheme based on random projection and neural network. Soft Comput. 2021, 25, 7657–7670. [Google Scholar] [CrossRef]
- Sarrab, M.; Alshohoumi, F. Assisted-fog-based framework for IoT-based healthcare data preservation. Int. J. Cloud Appl. Comput. 2021, 11, 1–16. [Google Scholar] [CrossRef]
- Cvitić, I.; Perakovic, D.; Gupta, B.B.; Choo, K.K.R. Boosting-based DDoS detection in internet of things systems. IEEE Internet Things J. 2022, 9, 2109–2123. [Google Scholar] [CrossRef]
- Nguyen, G.; Dlugolinsky, S.; Bobák, M.; Tran, V.; López García, Á.; Heredia, I.; Malík, P.; Hluchý, L. Machine learning and deep learning frameworks and libraries for large-scale data mining: A survey. Art. Intell. Rev. 2019, 52, 77–124. [Google Scholar] [CrossRef] [Green Version]
- Dargan, S.; Kumar, M.; Ayyagari, M.R.; Kumar, G. A survey of deep learning and its applications: A new paradigm to machine learning. Arch. Comput. Methods Eng. 2020, 27, 1071–1092. [Google Scholar] [CrossRef]
- Chui, K.T.; Liu, R.W.; Zhao, M.; De Pablos, P.O. Predicting students’ performance with school and family tutoring using generative adversarial network-based deep support vector machine. IEEE Access 2020, 8, 86745–86752. [Google Scholar] [CrossRef]
- Sun, X.; Liu, P.; He, Z.; Han, Y.; Su, B. Automatic classification of electrocardiogram signals based on transfer learning and continuous wavelet transform. Ecol. Inform. 2022, 69, 101628. [Google Scholar] [CrossRef]
- Kumar, L.V.; Sai, Y.P. A new transfer learning approach to detect cardiac arrhythmia from ECG signals. Signal Image Video Process. 2022, 16, 1945–1953. [Google Scholar]
- Liu, T.; Yang, Y.; Fan, W.; Wu, C. Reprint of: Few-shot learning for cardiac arrhythmia detection based on electrocardiogram data from wearable devices. Digit. Signal Process. 2022, 125, 103574. [Google Scholar] [CrossRef]
- Bassiouni, M.M.; Hegazy, I.; Rizk, N.; El-Dahshan, E.S.A.; Salem, A.M. Deep learning approach based on transfer learning with different classifiers for ECG diagnosis. Int. J. Intell. Comput. Inf. Sci. 2022, 22, 44–62. [Google Scholar] [CrossRef]
- Liu, C.-M.; Liu, C.-L.; Hu, K.-W.; Tseng, V.S.; Chang, S.-L.; Lin, Y.-J.; Lo, L.-W.; Chung, F.-P.; Chao, T.-F.; Tuan, T.-C.; et al. A deep learning–enabled electrocardiogram model for the identification of a rare inherited arrhythmia: Brugada syndrome. Can. J. Cardiol. 2022, 38, 152–159. [Google Scholar] [CrossRef] [PubMed]
- Raza, A.; Tran, K.P.; Koehl, L.; Li, S. Designing ecg monitoring healthcare system with federated transfer learning and explainable ai. Knowl.-Based Syst. 2022, 236, 107763. [Google Scholar] [CrossRef]
- Tang, F.; Ding, J.; Wang, L.; Ning, C. A Novel Distant Domain Transfer Learning Framework for Thyroid Image Classification. Neural Process. Lett. 2022, 1–17. [Google Scholar] [CrossRef]
- Niu, S.; Liu, M.; Liu, Y.; Wang, J.; Song, H. Distant domain transfer learning for medical imaging. IEEE J. Biomed. Health Inform. 2021, 25, 3784–3793. [Google Scholar] [CrossRef]
- Ren, Y.; Liu, J.; Zhang, H.; Wang, W. TDDA-Net: A Transitive Distant Domain Adaptation Network for Industrial Sample Enhancement. Inf. Sci. 2022, 606, 927–944. [Google Scholar] [CrossRef]
- Wu, X.; Chen, C.; Li, P.; Zhong, M.; Wang, J.; Qian, Q.; Ding, P.; Yao, J.; Guo, Y. FTAP: Feature transferring autonomous machine learning pipeline. Inf. Sci. 2022, 593, 385–397. [Google Scholar] [CrossRef]
- Chen, Y.; Yang, X.H.; Wei, Z.; Heidari, A.A.; Zheng, N.; Li, Z.; Chen, H.; Hu, H.; Zhou, Q.; Guan, Q. Generative adversarial networks in medical image augmentation: A review. Comput. Biol. Med. 2022, 144, 105382. [Google Scholar] [CrossRef] [PubMed]
- Salman, O.; Elhajj, I.H.; Kayssi, A.; Chehab, A. Mutated traffic detection and recovery: An adversarial generative deep learning approach. Ann. Telecommun. 2022, 77, 395–406. [Google Scholar] [CrossRef]
- Gao, N.; Xue, H.; Shao, W.; Zhao, S.; Qin, K.K.; Prabowo, A.; Rahaman, M.S.; Salim, F.D. Generative adversarial networks for spatio-temporal data: A survey. ACM Trans. Intell. Syst. Technol. 2022, 13, 22. [Google Scholar] [CrossRef]
- Gurumurthy, S.; Kiran Sarvadevabhatla, R.; Venkatesh Babu, R. Deligan: Generative Adversarial Networks for Diverse and Limited Data. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Honolulu, HI, USA, 21–26 July 2016. [Google Scholar]
- Wang, Z.; Dai, Z.; Póczos, B.; Carbonell, J. Characterizing and Avoiding Negative Transfer. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 16–20 June 2019. [Google Scholar]
- Uehara, M.; Sato, I.; Suzuki, M.; Nakayama, K.; Matsuo, Y. Generative adversarial nets from a density ratio estimation perspective. arXiv 2016, arXiv:1610.02920. [Google Scholar]
- Yu, Y.; Szepesvári, C. Analysis of kernel mean matching under covariate shift. arXiv 2012, arXiv:1206.4650. [Google Scholar]
- Deng, J.; Dong, W.; Socher, R.; Li, L.J.; Li, K.; Li, F.-F. Imagenet: A Large-Scale Hierarchical Image Database. In Proceedings of the 2009 IEEE Conference on Computer Vision and Pattern Recognition, Miami, FL, USA, 20–25 June 2009. [Google Scholar]
- Lin, T.Y.; Maire, M.; Belongie, S.; Hays, J.; Perona, P.; Ramanan, D.; Zitnick, C.L. Microsoft coco: Common Objects in Context. In Proceedings of the European Conference on Computer Vision, Zurich, Switzerland, 6–12 September 2014. [Google Scholar]
- Fellbaum, C. WordNet. In Theory and Applications of Ontology: Computer Applications, 1st ed.; Poli, R., Healy, M., Kameas, A., Eds.; Springer: Dordrecht, The Netherlands, 2010; pp. 231–243. [Google Scholar]
- Go, A.; Bhayani, R.; Huang, L. Twitter Sentiment Classification using Distant Supervision. In CS224N Project Report; Stanford: Redwood City, CA, USA, 2019. [Google Scholar]
- Wagner, P.; Strodthoff, N.; Bousseljot, R.D.; Kreiseler, D.; Lunze, F.I.; Samek, W.; Schaeffter, T. PTB-XL, a large publicly available electrocardiography dataset. Sci. Data 2020, 7, 154. [Google Scholar] [CrossRef]
- Moody, G.B.; Mark, R.G. The impact of the MIT-BIH arrhythmia database. IEEE Eng. Med. Biol. Mag. 2001, 20, 45–50. [Google Scholar] [CrossRef]
- Taddei, A.; Distante, G.; Emdin, M.; Pisani, P.; Moody, G.B.; Zeelenberg, C.; Marchesi, C. The European ST-T database: Standard for evaluating systems for the analysis of ST-T changes in ambulatory electrocardiography. Eur. Heart J. 1992, 13, 1164–1172. [Google Scholar] [CrossRef]
- Jager, F.; Taddei, A.; Moody, G.B.; Emdin, M.; Antolič, G.; Dorn, R.; Mark, R.G. Long-term ST database: A reference for the development and evaluation of automated ischaemia detectors and for the study of the dynamics of myocardial ischaemia. Med. Biol. Eng. Comput. 2003, 41, 172–182. [Google Scholar] [CrossRef]
- Pan, J.; Tompkins, W.J. A real-time QRS detection algorithm. IEEE Trans. Biomed. Eng. 1985, 3, 230–236. [Google Scholar] [CrossRef]
- Kohler, B.U.; Hennig, C.; Orglmeister, R. The principles of software QRS detection. IEEE Eng. Med. Biol. Mag. 2002, 21, 42–57. [Google Scholar] [CrossRef] [PubMed]
- Cvitić, I.; Peraković, D.; Periša, M.; Gupta, B. Ensemble machine learning approach for classification of IoT devices in smart home. Int. J. Mach. Learn. Cybern. 2021, 12, 3179–3202. [Google Scholar] [CrossRef]
- Prathiba, S.B.; Raja, G.; Bashir, A.K.; AlZubi, A.A.; Gupta, B. SDN-assisted safety message dissemination framework for vehicular critical energy infrastructure. IEEE Trans. Ind. Inform. 2022, 18, 3510–3518. [Google Scholar] [CrossRef]
- Elgendy, I.A.; Zhang, W.Z.; He, H.; Gupta, B.B.; El-Latif, A.; Ahmed, A. Joint computation offloading and task caching for multi-user and multi-task MEC systems: Reinforcement learning-based algorithms. Wirel. Netw. 2021, 27, 2023–2038. [Google Scholar] [CrossRef]
- Pałczyński, K.; Śmigiel, S.; Ledziński, D.; Bujnowski, S. Study of the Few-Shot Learning for ECG Classification Based on the PTB-XL Dataset. Sensors 2022, 22, 904. [Google Scholar] [CrossRef]
- Chumachenko, D.; Butkevych, M.; Lode, D.; Frohme, M.; Schmailzl, K.J.; Nechyporenko, A. Machine Learning Methods in Predicting Patients with Suspected Myocardial Infarction Based on Short-Time HRV Data. Sensors 2022, 22, 7033. [Google Scholar] [CrossRef]
- Ketu, S.; Mishra, P.K. Empirical analysis of machine learning algorithms on imbalance electrocardiogram based arrhythmia dataset for heart disease detection. Arab. J. Sci. Eng. 2022, 47, 1447–1469. [Google Scholar] [CrossRef]
- Kumari, L.V.; Sai, Y.P. Classification of ECG beats using optimized decision tree and adaptive boosted optimized decision tree. Signal Image Video Process. 2022, 16, 695–703. [Google Scholar]
- Jannah, N.; Hadjiloucas, S. Detection of ECG Arrhythmia Conditions using CSVM and MSVM Classifiers. In Proceedings of the 2015 IEEE Signal Processing in Medicine and Biology Symposium (SPMB), Philadelphia, PA, USA, 12–15 December 2015. [Google Scholar]
- Kayikcioglu, I.; Akdeniz, F.; Köse, C.; Kayikcioglu, T. Time-frequency approach to ECG classification of myocardial infarction. Comput. Electr. Eng. 2020, 84, 106621. [Google Scholar] [CrossRef]
- Revathi, J.; Anitha, J. Automatic Diagnosis of Myocardial Infarction with Left Bundle Branch Block. In Proceedings of the International Conference on Recent Trends in Image Processing and Pattern Recognition, Solapur, India, 21–22 December 2018; Springer: Singapore, 2018; pp. 235–244. [Google Scholar]
- Liu, R.W.; Guo, Y.; Lu, Y.; Chui, K.T.; Gupta, B.B. Deep Network-Enabled Haze Visibility Enhancement for Visual IoT-Driven Intelligent Transportation Systems. IEEE Trans. Ind. Inform. 2022. early access. [Google Scholar] [CrossRef]
- Aggarwal, A.; Mittal, M.; Battineni, G. Generative adversarial network: An overview of theory and applications. Int. J. Inf. Manag. Data Insights 2021, 1, 100004. [Google Scholar] [CrossRef]
Dataset | Domain | Data Type | Description |
---|---|---|---|
ImageNet [28] | Source | Image | It contains about 14.2 million annotated images which are divided into more than 21,000 categories. |
COCO [29] | Source | Image | It comprises about 330,000 images (>200,000 labeled images). The images involve 250,000 people and 1.5 million objects. |
WordNet [30] | Source | Text | It has more than 150,000 index words which are categorized into adverbs, adjectives, verbs, and nouns. Every word can be attached to multiple-synonym sets (representing a semantic concept). |
Sentiment140 [31] | Source | Text | It has about 1.6 million annotated tweets with positive, neutral, and negative polarities. |
PTB-XL [32] | Target | ECG | It recruits 18,885 patients for the data collection of 21,837 clinical 12-lead ECGs. |
MIT-BIH Arrhythmia Database [33] | Target | ECG | It comprises 48 30-min 2-channel ECGs from 47 volunteers. |
European ST-T Database [34] | Target | ECG | It has 90 annotated ECGs from 79 subjects. |
Long-Term ST Database [35] | Target | ECG | It collects 86 long-term ECG recordings of at least 21 h from 80 participants. |
Dataset | Class | Sample Size |
---|---|---|
PTB-XL [32] | Class 0: Normal | 118962 |
Class 1: Myocardial infarction | 68410 | |
Class 2: ST/T Change | 65463 | |
Class 3: Conduction disturbance | 61259 | |
Class 4: Hypertrophy | 33108 | |
MIT-BIH Arrhythmia Database [33] | Class 0: Normal | 75052 |
Class 1: Left bundle branch block | 8075 | |
Class 2: Right bundle branch block | 7259 | |
Class 3: Premature ventricular contraction | 7130 | |
Class 4: Paced beat | 7028 | |
Class 5: Atrial premature contraction | 2546 | |
Class 6: Fusion of paced and normal beat | 982 | |
Class 7: Fusion of ventricular and normal beat | 803 | |
Class 8: Ventricular flutter wave | 472 | |
Class 9: Nodal escape beat | 229 | |
Class 10: Non-conducted P-wave | 193 | |
Class 11: Aberrated atrial premature beat | 150 | |
Class 12: Ventricular escape beat | 106 | |
Class 13: Nodal premature beat | 83 | |
Class 14: Atrial escape beat | 16 | |
European ST-T Database [34] | Class 0: Tachycardia | 780606 |
Class 1: Normal | 431376 | |
Class 2: Bradycardia | 32760 | |
Long-Term ST Database [35] | Class 0: Normal | 8832788 |
Class 1: Myocardial ischemia | 727956 |
Work | Method | Dataset | Cross- Validation | Ablation Study | Specificity (%) | Sensitivity (%) | Accuracy (%) |
---|---|---|---|---|---|---|---|
[41] | Few-shot learning with random forest | PTB-XL | 5-fold | No | 67.8 | 68.4 | N/A |
[42] | K-nearest neighbor, | PTB-XL | No | No | N/A | N/A | 71.1 |
Our Work | GANAD-DFCNTA, and CNN | PTB-XL | 5-fold | Yes | 96.6 | 97.2 | 96.9 |
[43] | Support vector machine with SMOTE | MIT-BIH Arrhythmia Database | 5-fold | Yes | N/A | N/A | 93 |
[44] | Decision tree | MIT-BIH Arrhythmia Database | 10-fold | No | 98.6 | 88.6 | N/A |
Our Work | GANAD-DFCNTA and CNN | MIT-BIH Arrhythmia Database | 5-fold | Yes | 99.1 | 99.7 | 99.4 |
[45] | Complex support vector machine | European ST-T Database | No | No | N/A | N/A | 94 |
[46] | Subspace k-nearest neighbor | European ST-T Database | 10-fold | No | N/A | N/A | 94.4 |
Our Work | GANAD-DFCNTA and CNN | European ST-T Database | 5-fold | Yes | 96.4 | 97.1 | 96.8 |
[47] | Support vector machine | Long-Term ST Database | 5-fold | No | N/A | N/A | 94.7 |
[47] | Neural network | Long-Term ST Database | 5-fold | No | N/A | N/A | 89.5 |
Our Work | GANAD-DFCNTA and CNN | Long-Term ST Database | 5-fold | Yes | 97.9 | 97.0 | 97.5 |
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 92.8/93.7/93.3 | 93.3/94.1/93.7 | 93.7/94.6/94.2 | 94.1/94.9/94.5 | 94.2/95.1/94.7 |
MIT-BIH Arrhythmia Database [33] | 94.4/95.1/94.8 | 95.1/95.8/95.5 | 95.5/96.2/95.9 | 95.9/96.8/96.4 | 96.4/97.1/96.8 |
European ST-T Database [34] | 92.6/93.5/93.0 | 93.1/94.1/93.6 | 93.7/94.6/94.1 | 94.3/95.3/94.8 | 95.0/95.9/95.4 |
Long-Term ST Database [35] | 94.4/93.6/94.1 | 95.1/94.2/94.7 | 95.5/94.6/95.1 | 96.0/95.2/95.7 | 96.6/95.7/96.2 |
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 92.8/93.7/93.3 | 92.8/93.7/93.3 | 93.5/94.2/93.9 | 94.1/95.0/94.6 | 94.6/95.4/95.0 |
MIT-BIH Arrhythmia Database [33] | 94.4/95.1/94.8 | 94.4/95.1/94.8 | 95.2/95.9/95.6 | 95.8/96.4/96.1 | 96.4/97.2/96.8 |
European ST-T Database [34] | 92.6/93.5/93.0 | 92.6/93.5/93.0 | 93.3/94.3/93.8 | 94.1/95.3/94.7 | 94.7/95.8/95.2 |
Long-Term ST Database [35] | 94.4/93.6/94.1 | 94.4/93.6/94.1 | 95.3/94.4/94.9 | 96.0/95.1/95.7 | 96.6/95.8/96.3 |
Specificity/Sensitivity/Accuracy (%) | |||||
---|---|---|---|---|---|
Target Dataset | Baseline | One Dataset | Two Datasets | Three Datasets | Four Datasets |
PTB-XL [32] | 92.8/93.7/93.3 | 93.2/94.0/93.6 | 93.6/94.4/94.0 | 93.9/94.7/94.4 | 94.2/95.0/94.7 |
MIT-BIH Arrhythmia Database [33] | 94.4/95.1/94.8 | 95.0/95.7/95.4 | 95.4/96.1/95.8 | 95.8/96.7/96.3 | 96.1/97.0/96.6 |
European ST-T Database [34] | 92.6/93.5/93.0 | 93.0/94.0/93.5 | 93.5/94.5/94.0 | 94.0/95.0/94.5 | 94.4/95.4/94.9 |
Long Term ST Database [35] | 94.4/93.6/94.1 | 94.9/94.1/94.6 | 95.4/94.6/95.1 | 95.8/95.0/95.5 | 96.2/95.4/95.9 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chui, K.T.; Gupta, B.B.; Zhao, M.; Malibari, A.; Arya, V.; Alhalabi, W.; Ruiz, M.T. Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning. Bioengineering 2022, 9, 683. https://doi.org/10.3390/bioengineering9110683
Chui KT, Gupta BB, Zhao M, Malibari A, Arya V, Alhalabi W, Ruiz MT. Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning. Bioengineering. 2022; 9(11):683. https://doi.org/10.3390/bioengineering9110683
Chicago/Turabian StyleChui, Kwok Tai, Brij B. Gupta, Mingbo Zhao, Areej Malibari, Varsha Arya, Wadee Alhalabi, and Miguel Torres Ruiz. 2022. "Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning" Bioengineering 9, no. 11: 683. https://doi.org/10.3390/bioengineering9110683
APA StyleChui, K. T., Gupta, B. B., Zhao, M., Malibari, A., Arya, V., Alhalabi, W., & Ruiz, M. T. (2022). Enhancing Electrocardiogram Classification with Multiple Datasets and Distant Transfer Learning. Bioengineering, 9(11), 683. https://doi.org/10.3390/bioengineering9110683