LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem

This repository is the implementation of "LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem" [paper] on CIFAR-100, CIFAR-10, STL10 and ImageNet datasets. Our paper has been accepted for presentation at ICME 2023.

Introduction

In computer vision, the performance of deep neural networks (DNNs) is highly related to the feature extraction ability, i.e., the ability to recognize and focus on key pixel regions in an image. However, in this paper, we quantitatively and statistically illustrate that DNNs have a serious attention bias problem on many samples from some popular datasets: (1) Position bias: DNNs fully focus on label-independent regions; (2) Range bias: The focused regions from DNN are not completely contained in the ideal region. Moreover, we find that the existing self-attention modules can alleviate these biases to a certain extent, but the biases are still non-negligible. To further mitigate them, we propose a lightweight sub-attention strategy (LSAS), which utilizes high-order sub-attention modules to improve the original self-attention modules. The effectiveness of LSAS is demonstrated by extensive experiments on widely-used benchmark datasets and popular attention networks.

Requirement

Python and PyTorch.

pip install -r requirements.txt

Usage

# run ResNet164-SENet on cifar10, 1 GPU
CUDA_VISIBLE_DEVICES=0 python run.py --arch senet --dataset cifar10 --block-name bottleneck --depth 164 --epochs 164 --schedule 81 122 --gamma 0.1 --wd 1e-4

# run ResNet164-LSAS-SENet on cifar10, 1 GPU
CUDA_VISIBLE_DEVICES=0 python run.py --arch lsas_senet --dataset cifar10 --block-name bottleneck --depth 164 --epochs 164 --schedule 81 122 --gamma 0.1 --wd 1e-4

# run ResNet50-SENet on ImageNet, 8 GPUs
python -u -W ignore -m torch.distributed.launch --nproc_per_node=8 --master_port='29503' run_imagenet.py -a senet_resnet50 --info normal --data /data1/ZSS/datasets/ILSVRC2012_Data --epochs 100 --schedule 30 60 90 --wd 1e-4 --gamma 0.1 --train-batch 32 --opt-level O0 --wd-all --label-smoothing 0. --warmup 0

# run ResNet50-LSAS-SENet on ImageNet, 8 GPUs
python -u -W ignore -m torch.distributed.launch --nproc_per_node=8 --master_port='29503' run_imagenet.py -a lsas_senet_resnet50 --info normal --data /data1/ZSS/datasets/ILSVRC2012_Data --epochs 100 --schedule 30 60 90 --wd 1e-4 --gamma 0.1 --train-batch 32 --opt-level O0 --wd-all --label-smoothing 0. --warmup 0

Results

	Dataset	SENet	LSAS-SENet
ResNet164	CIFAR10	94.57	95.01
ResNet164	CIFAR100	75.30	76.47
ResNet164	STL10	83.81	85.71
ResNet50	ImageNet	76.63	77.28

Citation

@inproceedings{Zhong2023LSASLS,
  title={LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem},
  author={Shan Zhong and Wushao Wen and Jinghui Qin and Qiangpu Chen and Zhongzhan Huang},
  year={2023}
}

Acknowledgments

Many thanks to bearpaw for his simple and clean Pytorch framework for image classification task.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
cifar		cifar
imagenet		imagenet
images		images
utils		utils
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
flops_counter.py		flops_counter.py
requirements.txt		requirements.txt
run.py		run.py
run_imagenet.py		run_imagenet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem

Introduction

Requirement

Usage

Results

Citation

Acknowledgments

About

Releases

Packages

Languages

License

Qrange-group/LSAS

Folders and files

Latest commit

History

Repository files navigation

LSAS: Lightweight Sub-attention Strategy for Alleviating Attention Bias Problem

Introduction

Requirement

Usage

Results

Citation

Acknowledgments

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages