Você está em: Início > Publicações > Visualização > Grad-CAM: The impact of large receptive fields and other caveats

Mapa das Instalações

Publicação

Pesquisa de Publicações

Grad-CAM: The impact of large receptive fields and other caveats

Título

Grad-CAM: The impact of large receptive fields and other caveatsExportar publicação no formato APA Exportar publicação no formato EXCEL Exportar publicação no formato RIS

Tipo

Artigo em Revista Científica Internacional

Data

2025

Título

Grad-CAM: The impact of large receptive fields and other caveats

Tipo

Artigo em Revista Científica Internacional

Ano

2025

Autores

Santos, R

(Autor)

Outra

A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Sem AUTHENTICUS Sem ORCID

Pedrosa, J

(Autor)

Outra

A pessoa não pertence à instituição. A pessoa não pertence à instituição. A pessoa não pertence à instituição. Ver página do Authenticus Sem ORCID

Ana Maria Mendonça

(Autor)

FEUP

Ver página pessoal Sem permissões para visualizar e-mail institucional Pesquisar Publicações do Participante Ver página do Authenticus Ver página ORCID

Aurélio Campilho

(Autor)

FEUP

Ver página pessoal Sem permissões para visualizar e-mail institucional Pesquisar Publicações do Participante Ver página do Authenticus Sem ORCID

Revista

Título: Computer Vision and Image UnderstandingImportada do Authenticus Pesquisar Publicações da Revista

Vol. 258

ISSN: 1077-3142

Editora: Elsevier

Indexação

ISI Web of Knowledge - 6 Citações

Scopus - 7 Citações

Outras Informações

ID Authenticus: P-018-RQB

DOI: 10.1016/j.cviu.2025.104383

Abstract (EN): The increase in complexity of deep learning models demands explanations that can be obtained with methods like Grad-CAM. This method computes an importance map for the last convolutional layer relative to a specific class, which is then upsampled to match the size of the input. However, this final step assumes that there is a spatial correspondence between the last feature map and the input, which may not be the case. We hypothesize that, for models with large receptive fields, the feature spatial organization is not kept during the forward pass, which may render the explanations devoid of meaning. To test this hypothesis, common architectures were applied to a medical scenario on the public VinDr-CXR dataset, to a subset of ImageNet and to datasets derived from MNIST. The results show a significant dispersion of the spatial information, which goes against the assumption of Grad-CAM, and that explainability maps are affected by this dispersion. Furthermore, we discuss several other caveats regarding Grad-CAM, such as feature map rectification, empty maps and the impact of global average pooling or flatten layers. Altogether, this work addresses some key limitations of Grad-CAM which may go unnoticed for common users, taking one step further in the pursuit for more reliable explainability methods.

Idioma: Inglês

Tipo (Avaliação Docente): Científica

Nº de páginas: 10

Documentos

Não foi encontrado nenhum documento associado à publicação.

Publicações Relacionadas

Dos mesmos autores

Automatic Eye-Tracking-Assisted Chest Radiography Pathology Screening (2023)
Artigo em Livro de Atas de Conferência Internacional
Santos, R; Pedrosa, J; Ana Maria Mendonça; Aurélio Campilho

Da mesma revista

Texture collinearity foreground segmentation for night videos (2020)
Artigo em Revista Científica Internacional
Martins, I; Pedro Carvalho; Luís Corte-Real; Luis Alba Castro, JL

Partition-distance methods for assessing spatial segmentations of images and videos (2009)
Artigo em Revista Científica Internacional
Jaime S Cardoso; Pedro Carvalho; Luis F Teixeira; Luis Corte Real

Recomendar Página Voltar ao Topo

Copyright 1996-2026 © Faculdade de Desporto da Universidade do Porto I Termos e Condições I Acessibilidade I Índice A-Z
Página gerada em: 2026-03-14 às 23:50:48 | Política de Privacidade | Política de Proteção de Dados Pessoais | Denúncias | Livro Amarelo Eletrónico