Beyond Black-Box AI: A Quantitative Grad-CAM Analysis of Convolutional Neural Network Interpretability in COVID-19 Chest X-Ray Classification

Authors

  • Aiman Abd Saeed, Department of Computer Science and Information Technology, College of Science, Salahaddin University-Erbil, Erbil, Kurdistan Region, Iraq.
  • Rasber Dhahir Rashid, Department of Computer Science and Information Technology, College of Science, Salahaddin University-Erbil, Erbil, Kurdistan Region, Iraq.

DOI:

https://doi.org/10.21271/ZJPAS.38.2.13

Keywords:

Deep learning, Convolutional neural networks (CNNs), Transfer learning, Gradient-weighted class activation mapping (Grad-CAM), COVID-19

Abstract

Modern AI models use deep architectures that obscure how predictions are made. Without that understanding, it is difficult to verify a model's reasoning, identify biases, or trust its reliability in high-stakes domains such as healthcare. Many COVID-19 chest X-ray (CXR) studies report high accuracy and present qualitative gradient-weighted class activation mapping (Grad-CAM) heatmaps, but they provide no quantitative evidence that the highlighted regions align with lung anatomy and instead rely on manual, subjective inspection. We introduce an automated quantitative pipeline that converts interpretability into objective, anatomy-grounded metrics computed between Grad-CAM heatmaps and lung masks. We evaluate six convolutional neural networks (CNNs), namely VGG16, VGG19, ResNet-101, NASNet-Mobile, NASNet-Large, and Xception, for both classification performance and anatomical interpretability in COVID-19 CXR detection. Classification accuracies ranged from 90% to 96%, with Xception achieving the highest accuracy (95.90%) and balanced precision, recall, and F1-score of 95.92%. NASNet-Large and VGG19 followed at 94.87%, with VGG19 reaching the highest precision (98.89%). To assess model transparency, we automated interpretability analysis by thresholding the Grad-CAM outputs and comparing them to radiologist-annotated lung masks using the Intersection-over-Union (IoU) and Dice score metrics.
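The comparison step described in the abstract (threshold a Grad-CAM heatmap, then score its overlap with a lung mask) can be illustrated with a minimal sketch. This is not the authors' implementation: the function names `binarize_heatmap` and `iou_and_dice`, the min-max normalization, and the 0.5 threshold are all assumptions made for illustration, since the abstract does not state them.

```python
import numpy as np


def binarize_heatmap(heatmap, threshold=0.5):
    """Min-max normalize a heatmap to [0, 1] and threshold it.

    The normalization and the default threshold of 0.5 are assumptions;
    the abstract does not specify either.
    """
    heatmap = np.asarray(heatmap, dtype=np.float64)
    span = heatmap.max() - heatmap.min()
    if span == 0:
        # A constant heatmap highlights nothing in particular.
        return np.zeros(heatmap.shape, dtype=bool)
    normalized = (heatmap - heatmap.min()) / span
    return normalized >= threshold


def iou_and_dice(pred_mask, lung_mask):
    """Return (IoU, Dice) between two binary masks of the same shape."""
    pred = np.asarray(pred_mask, dtype=bool)
    lung = np.asarray(lung_mask, dtype=bool)
    intersection = np.logical_and(pred, lung).sum()
    union = np.logical_or(pred, lung).sum()
    total = pred.sum() + lung.sum()
    # Define both scores as 0.0 when both masks are empty.
    iou = intersection / union if union else 0.0
    dice = 2 * intersection / total if total else 0.0
    return float(iou), float(dice)


# Toy 4x4 example: the heatmap fires on the top-left 2x2 block,
# while the (hypothetical) lung mask covers the top two rows.
heat = np.zeros((4, 4))
heat[:2, :2] = 1.0
lung = np.zeros((4, 4), dtype=bool)
lung[:2, :] = True

iou, dice = iou_and_dice(binarize_heatmap(heat), lung)
print(f"IoU={iou:.3f} Dice={dice:.3f}")  # IoU=0.500 Dice=0.667
```

In the toy case the thresholded region covers 4 pixels, all inside the 8-pixel mask, so IoU is 4/8 and Dice is 8/12. The two scores are monotonically related (Dice = 2·IoU / (1 + IoU)), so reporting both mainly aids comparison with other studies.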

Published

2026-04-30

How to Cite

Aiman Abd Saeed, & Rasber Dhahir Rashid. (2026). Beyond Black-Box AI: A Quantitative Grad-CAM Analysis of Convolutional Neural Network Interpretability in COVID-19 Chest X-Ray Classification. Zanco Journal of Pure and Applied Sciences, 38(2), 183–196. https://doi.org/10.21271/ZJPAS.38.2.13

Section

Engineering and Computer Sciences