Next Article in Journal
How Does Rural Resilience Affect Return Migration: Evidence from Frontier Regions in China
Previous Article in Journal
Enhancing Invoice Processing Automation Through the Integration of DevOps Methodologies and Machine Learning
Previous Article in Special Issue
Recent Trends in Information and Cyber Security Maturity Assessment: A Systematic Literature Review
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
This is an early access version, the complete PDF, HTML, and XML versions will be available soon.
Article

A Generalized Framework for Adversarial Attack Detection and Prevention Using Grad-CAM and Clustering Techniques

by
Jeong-Hyun Sim
and
Hyun-Min Song
*
Department of Industrial Security, Dankook University, Jukjeon-ro 152, Yongin-si 16890, Republic of Korea
*
Author to whom correspondence should be addressed.
Systems 2025, 13(2), 88; https://doi.org/10.3390/systems13020088
Submission received: 6 January 2025 / Revised: 26 January 2025 / Accepted: 30 January 2025 / Published: 31 January 2025

Abstract

Through advances in AI-based computer vision technology, the performance of modern image classification models has surpassed human perception, making them valuable in various fields. However, adversarial attacks, which involve small changes to images that are hard for humans to perceive, can cause classification models to misclassify images. Considering the availability of classification models that use neural networks, it is crucial to prevent adversarial attacks. Recent detection methods are only effective for specific attacks or cannot be applied to various models. Therefore, in this paper, we proposed an attention mechanism-based method for detecting adversarial attacks. We utilized a framework using an ensemble model, Grad-CAM and calculated the silhouette coefficient for detection. We applied this method to Resnet18, Mobilenetv2, and VGG16 classification models that were fine-tuned on the CIFAR-10 dataset. The average performance demonstrated that Mobilenetv2 achieved an F1-Score of 0.9022 and an accuracy of 0.9103, Resnet18 achieved an F1-Score of 0.9124 and an accuracy of 0.9302, and VGG16 achieved an F1-Score of 0.9185 and an accuracy of 0.9252. The results demonstrated that our method not only detects but also prevents adversarial attacks by mitigating their effects and effectively restoring labels.
Keywords: adversarial attack; AI security; computer vision; explainable AI adversarial attack; AI security; computer vision; explainable AI

Share and Cite

MDPI and ACS Style

Sim, J.-H.; Song, H.-M. A Generalized Framework for Adversarial Attack Detection and Prevention Using Grad-CAM and Clustering Techniques. Systems 2025, 13, 88. https://doi.org/10.3390/systems13020088

AMA Style

Sim J-H, Song H-M. A Generalized Framework for Adversarial Attack Detection and Prevention Using Grad-CAM and Clustering Techniques. Systems. 2025; 13(2):88. https://doi.org/10.3390/systems13020088

Chicago/Turabian Style

Sim, Jeong-Hyun, and Hyun-Min Song. 2025. "A Generalized Framework for Adversarial Attack Detection and Prevention Using Grad-CAM and Clustering Techniques" Systems 13, no. 2: 88. https://doi.org/10.3390/systems13020088

APA Style

Sim, J.-H., & Song, H.-M. (2025). A Generalized Framework for Adversarial Attack Detection and Prevention Using Grad-CAM and Clustering Techniques. Systems, 13(2), 88. https://doi.org/10.3390/systems13020088

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop