Final Project for the DD2412 Course (Deep Learning, Advanced) at KTH
The scope of this project was to reproduce the findings of Grad-CAM, a deep visualization technique applicable to any CNN. We performed the following tasks:
- Evaluated Grad-CAM on the Weakly Supervised Localization task (ILSVRC15 validation set), in which the model aims to localize the object (via a bounding box) based solely on the visualization, without being explicitly trained to do so.
- Computed Pointing Game accuracy and recall (ILSVRC15 validation set).
- Compared Grad-CAM, Guided Grad-CAM, and Guided Backpropagation with occlusion maps.
- Reproduced and analyzed a user study comparing the trustworthiness of Guided Grad-CAM and Guided Backpropagation on VGG-16 and AlexNet, leveraging the fact that the former network is known to be more accurate.
- Compared Grad-CAM with Grad-CAM++, Integrated Gradients, and SHAP on medical data.
- Proposed a novel experiment for evaluating Grad-CAM's sensitivity.
- Compared Grad-CAM with Integrated Gradients and SHAP with regard to contrastivity and fidelity.

For more information, please refer to our report.
In the first task, we successfully reproduced the results of the original paper. Grad-CAM achieves noteworthy localization results on a task for which it was not explicitly trained.
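For reference, a minimal Grad-CAM sketch in PyTorch, assuming a torchvision VGG-16; the hooked layer, preprocessing, and helper names are our own choices and not necessarily identical to the original experiments:

```python
import torch
import torch.nn.functional as F
from torchvision import models

model = models.vgg16(pretrained=True).eval()
target_layer = model.features[28]  # last conv layer of VGG-16 (our choice)

activations, gradients = {}, {}
target_layer.register_forward_hook(lambda m, i, o: activations.update(a=o))
target_layer.register_full_backward_hook(lambda m, gi, go: gradients.update(g=go[0]))

def grad_cam(x, class_idx=None):
    """x: (1, 3, 224, 224) normalized image tensor. Returns an (H, W) heatmap in [0, 1]."""
    logits = model(x)
    if class_idx is None:
        class_idx = logits.argmax(dim=1).item()
    model.zero_grad()
    logits[0, class_idx].backward()
    # Global-average-pool the gradients into per-channel weights, then take a
    # weighted sum of the activation maps followed by a ReLU.
    weights = gradients["g"].mean(dim=(2, 3), keepdim=True)    # (1, C, 1, 1)
    cam = F.relu((weights * activations["a"]).sum(dim=1))      # (1, h, w)
    cam = F.interpolate(cam.unsqueeze(1), size=x.shape[2:],
                        mode="bilinear", align_corners=False)[0, 0]
    return cam / (cam.max() + 1e-8)
```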
In the Pointing Game, we take the maximally activated point of the heatmap and check whether it lies inside the ground-truth label's bounding box (accuracy). We also measure recall by allowing the model to abstain from any top-5 visualization whose maximum activation falls below a given threshold.
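A hedged sketch of this metric as described above; the box format (x_min, y_min, x_max, y_max) and the abstention convention are our own:

```python
import numpy as np

def pointing_game(cam, box, threshold=None):
    """cam: (H, W) heatmap in [0, 1]; box: (x_min, y_min, x_max, y_max).
    Returns 'hit', 'miss', or 'abstain' (max activation below threshold)."""
    if threshold is not None and cam.max() < threshold:
        return "abstain"
    y, x = np.unravel_index(np.argmax(cam), cam.shape)  # maximally activated point
    x_min, y_min, x_max, y_max = box
    return "hit" if (x_min <= x <= x_max and y_min <= y <= y_max) else "miss"

# Accuracy = hits / (hits + misses); for recall, abstentions count against
# the total: recall = hits / number of examples.
```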
We measured the rank correlation of Grad-CAM, Guided Grad-CAM, and Guided Backpropagation with occlusion maps. Relative to occlusion maps, Guided Grad-CAM is slightly more similar than Grad-CAM, which in turn is significantly more similar than Guided Backpropagation.
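A minimal sketch of how such a rank correlation can be computed, assuming both maps are available as equally sized 2-D arrays; we use SciPy's Spearman correlation on the flattened maps:

```python
import numpy as np
from scipy.stats import spearmanr

def rank_correlation(saliency_map, occlusion_map):
    """Spearman rank correlation between two (H, W) maps."""
    rho, _ = spearmanr(saliency_map.ravel(), occlusion_map.ravel())
    return rho
```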
In this user study, users were tasked with choosing between the two agents and grading them on a scale from -2 to 2 (-2: A is substantially better ... 2: B is substantially better). Our results indicate that the user study conducted in the original paper is not robust enough, as evidenced by the high variance.
In this task, we compared the efficacy of Grad-CAM and Grad-CAM++ against Integrated Gradients and SHAP, using a DenseNet-121 architecture pretrained on ChestX-ray14. We then measured the fraction of activated pixels (beyond the 85% threshold) that lie within the target bounding boxes.
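A sketch of this localization score under our own mask and box conventions (normalized heatmap, inclusive box corners):

```python
import numpy as np

def activated_fraction_in_box(cam, box, threshold=0.85):
    """cam: (H, W) heatmap in [0, 1]; box: (x_min, y_min, x_max, y_max).
    Fraction of pixels activated beyond the threshold that fall inside the box."""
    activated = cam >= threshold
    if activated.sum() == 0:
        return 0.0
    box_mask = np.zeros_like(activated)
    x_min, y_min, x_max, y_max = box
    box_mask[y_min:y_max + 1, x_min:x_max + 1] = True
    return (activated & box_mask).sum() / activated.sum()
```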
A visualization method is sensitive if it assigns non-zero significance to every feature that can singlehandedly change the classifier's prediction. For this task, we generated single-pixel attacks and analyzed Grad-CAM with VGG-16 and GoogLeNet. Empirical results indicate that Grad-CAM with GoogLeNet exhibits sensitivity.
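A minimal sketch of this check, reusing the `grad_cam()` helper sketched earlier; setting one pixel to an extreme value is our simplification of a proper single-pixel attack, and the tolerance `eps` is our own convention:

```python
import torch

def sensitivity_check(model, x, pixel_yx, value=3.0, eps=1e-3):
    """If perturbing the pixel at (y, x) alone flips the prediction, check
    whether the heatmap assigns it non-zero significance."""
    y_, x_ = pixel_yx
    pred = model(x).argmax(dim=1).item()
    x_adv = x.clone()
    x_adv[0, :, y_, x_] = value                  # single-pixel perturbation
    if model(x_adv).argmax(dim=1).item() == pred:
        return None                              # this pixel cannot flip the prediction
    cam = grad_cam(x_adv)                        # heatmap on the attacked image
    return cam[y_, x_].item() > eps              # non-zero significance at the pixel?
```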
We measure fidelity (how relevant the highlighted features are to the prediction) and contrastivity (the overlap between visualizations of different classes). Grad-CAM showcases the highest contrastivity.
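One way contrastivity could be scored, assuming heatmaps for two different classes of the same image: binarize both and measure their overlap, so that low overlap means the explanations discriminate between classes. The threshold and the use of IoU here are our own choices:

```python
import numpy as np

def contrastivity(cam_a, cam_b, threshold=0.5):
    """1 - IoU of the binarized heatmaps; higher = more contrastive."""
    a, b = cam_a >= threshold, cam_b >= threshold
    union = (a | b).sum()
    iou = (a & b).sum() / union if union else 0.0
    return 1.0 - iou
```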
Grad-CAM exhibits robustness to adversarial attacks: even when the network is tricked into misclassifying an image, the visualization remains focused and virtually unchanged.
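A sketch of how such a robustness check can be run, again assuming the `grad_cam()` helper from the first snippet; the FGSM attack, the epsilon, and the comparison via rank correlation are our own choices:

```python
import torch
import torch.nn.functional as F

def fgsm(model, x, label, epsilon=0.01):
    """One-step FGSM: label is a (1,)-shaped tensor with the true class index."""
    x = x.clone().requires_grad_(True)
    loss = F.cross_entropy(model(x), label)
    loss.backward()
    return (x + epsilon * x.grad.sign()).detach()

# Usage, given a normalized input x and its label:
#   x_adv = fgsm(model, x, label)
#   cam_clean, cam_adv = grad_cam(x), grad_cam(x_adv)
# Robustness shows up as cam_adv staying close to cam_clean (e.g. a high
# rank correlation, as above) even when model(x_adv) is misclassified.
```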