Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
metafile.yml		metafile.yml
vgg11_8xb32_in1k.py		vgg11_8xb32_in1k.py
vgg11bn_8xb32_in1k.py		vgg11bn_8xb32_in1k.py
vgg13_8xb32_in1k.py		vgg13_8xb32_in1k.py
vgg13bn_8xb32_in1k.py		vgg13bn_8xb32_in1k.py
vgg16_8xb16_voc.py		vgg16_8xb16_voc.py
vgg16_8xb32_in1k.py		vgg16_8xb32_in1k.py
vgg16bn_8xb32_in1k.py		vgg16bn_8xb32_in1k.py
vgg19_8xb32_in1k.py		vgg19_8xb32_in1k.py
vgg19bn_8xb32_in1k.py		vgg19bn_8xb32_in1k.py

README.md

VGG

Very Deep Convolutional Networks for Large-Scale Image Recognition

Abstract

In this work we investigate the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting. Our main contribution is a thorough evaluation of networks of increasing depth using an architecture with very small (3x3) convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers. These findings were the basis of our ImageNet Challenge 2014 submission, where our team secured the first and the second places in the localisation and classification tracks respectively. We also show that our representations generalise well to other datasets, where they achieve state-of-the-art results. We have made our two best-performing ConvNet models publicly available to facilitate further research on the use of deep visual representations in computer vision.

How to use it?

Predict image

from mmpretrain import inference_model

predict = inference_model('vgg11_8xb32_in1k', 'demo/bird.JPEG')
print(predict['pred_class'])
print(predict['pred_score'])

Use the model

import torch
from mmpretrain import get_model

model = get_model('vgg11_8xb32_in1k', pretrained=True)
inputs = torch.rand(1, 3, 224, 224)
out = model(inputs)
print(type(out))
# To extract features.
feats = model.extract_feat(inputs)
print(type(feats))

Train/Test Command

Prepare your dataset according to the docs.

Train:

python tools/train.py configs/vgg/vgg11_8xb32_in1k.py

Test:

python tools/test.py configs/vgg/vgg11_8xb32_in1k.py https://download.openmmlab.com/mmclassification/v0/vgg/vgg11_batch256_imagenet_20210208-4271cd6c.pth

Models and results

Image Classification on ImageNet-1k

Model	Pretrain	Params (M)	Flops (G)	Top-1 (%)	Top-5 (%)	Config	Download
`vgg11_8xb32_in1k`	From scratch	132.86	7.63	68.75	88.87	config	model \| log
`vgg13_8xb32_in1k`	From scratch	133.05	11.34	70.02	89.46	config	model \| log
`vgg16_8xb32_in1k`	From scratch	138.36	15.50	71.62	90.49	config	model \| log
`vgg19_8xb32_in1k`	From scratch	143.67	19.67	72.41	90.80	config	model \| log
`vgg11bn_8xb32_in1k`	From scratch	132.87	7.64	70.67	90.16	config	model \| log
`vgg13bn_8xb32_in1k`	From scratch	133.05	11.36	72.12	90.66	config	model \| log
`vgg16bn_8xb32_in1k`	From scratch	138.37	15.53	73.74	91.66	config	model \| log
`vgg19bn_8xb32_in1k`	From scratch	143.68	19.70	74.68	92.27	config	model \| log

Citation

@article{simonyan2014very,
  title={Very deep convolutional networks for large-scale image recognition},
  author={Simonyan, Karen and Zisserman, Andrew},
  journal={arXiv preprint arXiv:1409.1556},
  year={2014}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vgg

vgg

README.md

VGG

Abstract

How to use it?

Models and results

Image Classification on ImageNet-1k

Citation

Files

vgg

Directory actions

More options

Directory actions

More options

Latest commit

History

vgg

Folders and files

parent directory

README.md

VGG

Abstract

How to use it?

Models and results

Image Classification on ImageNet-1k

Citation