Go to:
Logótipo
Você está em: Start > Publications > View > Grad-CAM: The impact of large receptive fields and other caveats
Publication

Grad-CAM: The impact of large receptive fields and other caveats

Title
Grad-CAM: The impact of large receptive fields and other caveats
Type
Article in International Scientific Journal
Year
2025
Authors
Santos, R
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. Without AUTHENTICUS Without ORCID
Pedrosa, J
(Author)
Other
The person does not belong to the institution. The person does not belong to the institution. The person does not belong to the institution. View Authenticus page Without ORCID
Ana Maria Mendonça
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page View ORCID page
Aurélio Campilho
(Author)
FEUP
View Personal Page You do not have permissions to view the institutional email. Search for Participant Publications View Authenticus page Without ORCID
Journal
Vol. 258
ISSN: 1077-3142
Publisher: Elsevier
Other information
Authenticus ID: P-018-RQB
Abstract (EN): The increase in complexity of deep learning models demands explanations that can be obtained with methods like Grad-CAM. This method computes an importance map for the last convolutional layer relative to a specific class, which is then upsampled to match the size of the input. However, this final step assumes that there is a spatial correspondence between the last feature map and the input, which may not be the case. We hypothesize that, for models with large receptive fields, the feature spatial organization is not kept during the forward pass, which may render the explanations devoid of meaning. To test this hypothesis, common architectures were applied to a medical scenario on the public VinDr-CXR dataset, to a subset of ImageNet and to datasets derived from MNIST. The results show a significant dispersion of the spatial information, which goes against the assumption of Grad-CAM, and that explainability maps are affected by this dispersion. Furthermore, we discuss several other caveats regarding Grad-CAM, such as feature map rectification, empty maps and the impact of global average pooling or flatten layers. Altogether, this work addresses some key limitations of Grad-CAM which may go unnoticed for common users, taking one step further in the pursuit for more reliable explainability methods.
Language: English
Type (Professor's evaluation): Scientific
No. of pages: 10
Documents
We could not find any documents associated to the publication.
Related Publications

Of the same authors

Automatic Eye-Tracking-Assisted Chest Radiography Pathology Screening (2023)
Article in International Conference Proceedings Book
Santos, R; Pedrosa, J; Ana Maria Mendonça; Aurélio Campilho

Of the same journal

Texture collinearity foreground segmentation for night videos (2020)
Article in International Scientific Journal
Martins, I; Pedro Carvalho; Luís Corte-Real; Luis Alba Castro, JL
Partition-distance methods for assessing spatial segmentations of images and videos (2009)
Article in International Scientific Journal
Jaime S Cardoso; Pedro Carvalho; Luis F Teixeira; Luis Corte Real
Recommend this page Top
Copyright 1996-2026 © Faculdade de Desporto da Universidade do Porto  I Terms and Conditions  I Acessibility  I Index A-Z
Page created on: 2026-03-14 at 08:08:47 | Privacy Policy | Personal Data Protection Policy | Whistleblowing | Electronic Yellow Book