Skip to content

Latest commit

 

History

History
913 lines (874 loc) · 134 KB

README.md

File metadata and controls

913 lines (874 loc) · 134 KB

Computer vision models on MXNet/Gluon

PyPI Downloads

This is a collection of image classification, segmentation, detection, and pose estimation models. Many of them are pretrained on ImageNet-1K, CIFAR-10/100, SVHN, CUB-200-2011, Pascal VOC2012, ADE20K, Cityscapes, and COCO datasets and loaded automatically during use. All pretrained models require the same ordinary normalization. Scripts for training/evaluating/converting models are in the imgclsmob repo.

List of implemented models

Installation

To use the models in your project, simply install the gluoncv2 package with mxnet:

pip install gluoncv2 mxnet>=1.2.1

To enable different hardware supports such as GPUs, check out MXNet variants. For example, you can install with CUDA-9.2 supported MXNet:

pip install gluoncv2 mxnet-cu92>=1.2.1

Usage

Example of using a pretrained ResNet-18 model:

from gluoncv2.model_provider import get_model as glcv2_get_model
import mxnet as mx

net = glcv2_get_model("resnet18", pretrained=True)
x = mx.nd.zeros((1, 3, 224, 224), ctx=mx.cpu())
y = net(x)

Pretrained models

ImageNet-1K

Some remarks:

  • Top1/Top5 are the standard 1-crop Top-1/Top-5 errors (in percents) on the validation subset of the ImageNet-1K dataset.
  • FLOPs/2 is the number of FLOPs divided by two to be similar to the number of MACs.
  • ResNet/PreResNet with b-suffix is a version of the networks with the stride in the second convolution of the bottleneck block. Respectively a network without b-suffix has the stride in the first convolution.
  • ResNet/PreResNet models do not use biases in convolutions at all.
  • CondenseNet models are only so-called converted versions.
  • ShuffleNetV2 and ShuffleNetV2b are different implementations of the same architecture.
  • ResNet(A) is an average downsampled ResNet intended for use as an feature extractor in some pose estimation networks.
  • ResNet(D) is a dilated ResNet intended for use as an feature extractor in some segmentation networks.
  • Models with *-suffix use non-standard preprocessing (see the training log).
Model Top1 Top5 Params FLOPs/2 Remarks
AlexNet 38.07 16.10 62,378,344 1,132.33M Training (log)
AlexNet-b 39.30 17.05 61,100,840 714.83M Training (log)
ZFNet 39.21 16.78 62,357,608 1,170.33M Training (log)
ZFNet-b 35.81 14.59 107,627,624 2,479.13M Training (log)
VGG-11 29.59 10.16 132,863,336 7,615.87M Training (log)
VGG-13 28.37 9.50 133,047,848 11,317.65M Training (log)
VGG-16 26.61 8.32 138,357,544 15,480.10M Training (log)
VGG-19 25.58 7.67 143,667,240 19,642.55M Training (log)
BN-VGG-11 28.56 9.34 132,866,088 7,630.21M Training (log)
BN-VGG-13 27.68 8.87 133,050,792 11,341.62M Training (log)
BN-VGG-16 25.50 7.57 138,361,768 15,506.38M Training (log)
BN-VGG-19 23.91 6.89 143,672,744 19,671.15M Training (log)
BN-VGG-11b 29.24 9.75 132,868,840 7,630.72M Training (log)
BN-VGG-13b 28.23 9.12 133,053,736 11,342.14M Training (log)
BN-VGG-16b 25.83 7.75 138,365,992 15,507.20M Training (log)
BN-VGG-19b 24.79 7.35 143,678,248 19,672.26M Training (log)
BN-Inception 25.12 7.54 11,295,240 2,048.06M Training (log)
ResNet-10 32.54 12.53 5,418,792 894.04M Training (log)
ResNet-12 31.68 12.03 5,492,776 1,126.25M Training (log)
ResNet-14 30.38 10.86 5,788,200 1,357.94M Training (log)
ResNet-BC-14b 29.22 10.33 10,064,936 1,479.12M Training (log)
ResNet-16 28.53 9.78 6,968,872 1,589.34M Training (log)
ResNet-18 x0.25 39.31 17.40 3,937,400 270.94M Training (log)
ResNet-18 x0.5 33.41 12.84 5,804,296 608.70M Training (log)
ResNet-18 x0.75 30.00 10.66 8,476,056 1,129.45M Training (log)
ResNet-18 26.79 8.67 11,689,512 1,820.41M Training (log)
ResNet-26 25.96 8.23 17,960,232 2,746.79M Training (log)
ResNet-BC-26b 24.86 7.58 15,995,176 2,356.67M Training (log)
ResNet-34 24.53 7.43 21,797,672 3,672.68M Training (log)
ResNet-BC-38b 23.50 6.72 21,925,416 3,234.21M Training (log)
ResNet-50 22.15 6.04 25,557,032 3,877.95M Training (log)
ResNet-50b 22.06 6.11 25,557,032 4,110.48M Training (log)
ResNet-101 20.52 5.16 44,549,160 7,597.95M Training (log)
ResNet-101b 20.26 5.12 44,549,160 7,830.48M Training (log)
ResNet-152 19.20 4.44 60,192,808 11,321.85M Training (log)
ResNet-152b 18.84 4.29 60,192,808 11,554.38M Training (log)
PreResNet-10 34.65 14.01 5,417,128 894.19M Training (log)
PreResNet-12 33.57 13.21 5,491,112 1,126.40M Training (log)
PreResNet-14 32.29 12.18 5,786,536 1,358.09M Training (log)
PreResNet-BC-14b 30.67 11.51 10,057,384 1,476.62M Training (log)
PreResNet-16 30.21 10.81 6,967,208 1,589.49M Training (log)
PreResNet-18 x0.25 39.62 17.78 3,935,960 270.93M Training (log)
PreResNet-18 x0.5 33.67 13.19 5,802,440 608.73M Training (log)
PreResNet-18 x0.75 29.96 10.68 8,473,784 1,129.51M Training (log)
PreResNet-18 28.16 9.51 11,687,848 1,820.56M Training (log)
PreResNet-26 26.03 8.34 17,958,568 2,746.94M Training (log)
PreResNet-BC-26b 25.21 7.86 15,987,624 2,354.16M Training (log)
PreResNet-34 24.55 7.51 21,796,008 3,672.83M Training (log)
PreResNet-BC-38b 22.67 6.33 21,917,864 3,231.70M Training (log)
PreResNet-50 22.27 6.20 25,549,480 3,875.44M Training (log)
PreResNet-50b 22.36 6.32 25,549,480 4,107.97M Training (log)
PreResNet-101 20.60 5.34 44,541,608 7,595.44M Training (log)
PreResNet-101b 20.85 5.40 44,541,608 7,827.97M Training (log)
PreResNet-152 19.17 4.46 60,185,256 11,319.34M Training (log)
PreResNet-152b 19.01 4.38 60,185,256 11,551.87M Training (log)
PreResNet-200b 18.96 4.46 64,666,280 15,068.63M Training (log)
PreResNet-269b 20.17 5.01 102,065,832 20,101.11M Training (log)
ResNeXt-14 (16x4d) 31.66 12.23 7,127,336 1,045.77M Training (log)
ResNeXt-14 (32x2d) 32.16 12.47 7,029,416 1,031.32M Training (log)
ResNeXt-14 (32x4d) 29.95 11.10 9,411,880 1,603.46M Training (log)
ResNeXt-26 (32x2d) 26.34 8.50 9,924,136 1,461.06M Training (log)
ResNeXt-26 (32x4d) 23.93 7.21 15,389,480 2,488.07M Training (log)
ResNeXt-50 (32x4d) 20.84 5.45 25,028,904 4,255.86M Training (log)
ResNeXt-101 (32x4d) 18.46 4.18 44,177,704 8,003.45M Training (log)
ResNeXt-101 (64x4d) 18.80 4.39 83,455,272 15,500.27M Training (log)
SE-ResNet-10 31.36 11.69 5,463,332 894.08M Training (log)
SE-ResNet-12 31.64 11.76 5,537,896 1,126.29M Training (log)
SE-ResNet-14 30.34 10.95 5,835,504 1,357.99M Training (log)
SE-ResNet-16 28.41 9.72 7,024,640 1,589.40M Training (log)
SE-ResNet-18 27.95 9.20 11,778,592 1,820.51M Training (log)
SE-ResNet-26 25.42 8.03 18,093,852 2,746.93M Training (log)
SE-ResNet-BC-26b 23.44 6.82 17,395,976 2,358.07M Training (log)
SE-ResNet-BC-38b 21.44 5.75 24,026,616 3,236.32M Training (log)
SE-ResNet-50 21.07 5.60 28,088,024 3,880.49M Training (log)
SE-ResNet-50b 20.58 5.33 28,088,024 4,113.02M Training (log)
SE-ResNet-101 19.00 4.41 49,326,872 7,602.76M Training (log)
SE-ResNet-101b 19.46 4.62 49,326,872 7,835.29M Training (log)
SE-ResNet-152 18.59 4.30 66,821,848 11,328.52M Training (log)
SE-PreResNet-10 32.37 12.21 5,461,668 894.23M Training (log)
SE-PreResNet-12 31.58 11.80 5,536,232 1,126.44M Training (log)
SE-PreResNet-16 28.39 9.56 7,022,976 1,589.55M Training (log)
SE-PreResNet-18 27.16 8.81 11,776,928 1,820.66M Training (log)
SE-PreResNet-26 25.95 8.04 18,092,188 2,747.08M Training (log)
SE-PreResNet-BC-26b 22.95 6.36 17,388,424 2,355.57M Training (log)
SE-PreResNet-BC-38b 21.42 5.63 24,019,064 3,233.81M Training (log)
SE-PreResNet-50b 20.67 5.32 28,080,472 4,110.51M Training (log)
SE-ResNeXt-50 (32x4d) 18.74 4.33 27,559,896 4,258.40M Training (log)
SE-ResNeXt-101 (32x4d) 19.06 4.44 48,955,416 8,008.26M Training (log)
SE-ResNeXt-101 (64x4d) 18.98 4.66 88,232,984 15,505.08M From dmlc/gluon-cv (log)
SENet-16 25.34 8.06 31,366,168 5,080.55M Training (log)
SENet-28 21.68 5.91 36,453,768 5,731.20M Training (log)
SENet-154 18.84 4.40 115,088,984 20,745.78M Training (log)
ResNeSt(A)-BC-14 22.27 6.34 10,611,688 2,766.86M Training (log)
ResNeSt(A)-18 23.43 6.89 12,763,784 2,587.11M Training (log)
ResNeSt(A)-BC-26 19.57 4.70 17,069,448 3,645.87M Training (log)
ResNeSt(A)-50 18.92 4.38 27,483,240 5,402.09M Training (log)
ResNeSt(A)-101 17.74 3.99 48,275,016 10,246.42M From dmlc/gluon-cv (log)
ResNeSt(A)-152 18.72 4.51 65,316,040 13,974.33M Training (log)
ResNeSt(A)-200 16.78 3.40 70,201,544 22,854.22M From dmlc/gluon-cv (log)
ResNeSt(A)-269 16.38 3.36 110,929,480 46,005.88M From dmlc/gluon-cv (log)
IBN-ResNet-50 21.46 5.59 25,557,032 4,110.48M Training (log)
IBN-ResNet-101 21.89 5.87 44,549,160 7,830.48M From XingangPan/IBN-Net (log)
IBN(b)-ResNet-50 23.91 6.97 25,558,568 4,112.89M From XingangPan/IBN-Net (log)
IBN-ResNeXt-101 (32x4d) 21.43 5.62 44,177,704 8,003.45M From XingangPan/IBN-Net (log)
IBN-DenseNet-121 23.33 6.46 7,978,856 2,872.13M Training (log)
IBN-DenseNet-169 22.14 6.08 14,149,480 3,403.89M Training (log)
AirNet50-1x64d (r=2) 20.45 5.23 27,425,864 4,772.11M Training (log)
AirNet50-1x64d (r=16) 21.11 5.44 25,714,952 4,399.97M Training (log)
AirNeXt50-32x4d (r=2) 19.84 5.04 27,604,296 5,339.58M Training (log)
BAM-ResNet-50 20.59 5.38 25,915,099 4,196.09M Training (log)
CBAM-ResNet-50 19.94 4.88 28,089,624 4,116.97M Training (log)
SCNet-50 20.53 5.11 25,564,584 3,951.01M Training (log)
SCNet-101 19.21 4.46 44,565,416 7,204.19M Training (log)
SCNet(A)-50 19.01 4.59 25,583,816 4,715.79M Training (log)
RegNetX-200MF 29.91 10.38 2,684,792 203.32M Training (log)
RegNetX-400MF 26.25 8.55 5,157,512 403.44M Training (log)
RegNetX-600MF 24.71 7.56 6,196,040 608.36M Training (log)
RegNetX-800MF 24.09 7.24 7,259,656 809.47M Training (log)
RegNetX-1.6GF 22.12 6.13 9,190,136 1,618.97M Training (log)
RegNetX-3.2GF 21.28 5.68 15,296,552 3,199.52M Training (log)
RegNetX-4.0GF 19.51 4.69 22,118,248 3,986.26M Training (log)
RegNetX-6.4GF 19.22 4.58 26,209,256 6,490.97M Training (log)
RegNetX-8.0GF 19.62 4.66 39,572,648 8,017.90M Training (log)
RegNetX-12GF 19.99 5.18 46,106,056 12,124.16M Training (log)
RegNetX-16GF 19.12 4.56 54,278,536 15,986.59M Training (log)
RegNetX-32GF 19.57 4.89 107,811,560 31,790.18M From rwightman/pyt...models (log)
RegNetY-200MF 28.50 9.53 3,162,996 203.80M Training (log)
RegNetY-400MF 24.85 7.47 4,344,144 409.95M Training (log)
RegNetY-600MF 23.59 6.97 6,055,160 609.91M Training (log)
RegNetY-800MF 22.54 6.45 6,263,168 808.07M Training (log)
RegNetY-1.6GF 21.23 5.68 11,202,430 1,628.43M Training (log)
RegNetY-3.2GF 18.32 4.13 19,436,338 3,197.70M From rwightman/pyt...models (log)
RegNetY-4.0GF 19.56 4.67 20,646,656 3,997.63M Training (log)
RegNetY-6.4GF 18.95 4.45 30,583,252 6,386.79M Training (log)
RegNetY-8.0GF 18.78 4.36 39,180,068 7,994.33M Training (log)
RegNetY-12GF 18.51 4.31 51,822,544 12,129.89M Training (log)
RegNetY-16GF 18.63 4.30 83,590,140 15,941.65M Training (log)
RegNetY-32GF 19.10 4.58 145,046,770 32,313.76M From rwightman/pyt...models (log)
PyramidNet-101 (a=360) 20.41 5.20 42,455,070 8,743.54M Training (log)
DiracNetV2-18 30.61 11.17 11,511,784 1,796.62M From szagoruyko/diracnets (log)
DiracNetV2-34 27.93 9.46 21,616,232 3,646.93M From szagoruyko/diracnets (log)
CRU-Net-56 25.72 8.25 25,609,384 5,660.66M From cypw/CRU-Net (log)
DenseNet-121 23.25 6.85 7,978,856 2,872.13M Training (log)
DenseNet-161 21.82 5.92 28,681,000 7,793.16M Training (log)
DenseNet-169 22.10 6.05 14,149,480 3,403.89M Training (log)
DenseNet-201 21.56 5.90 20,013,928 4,347.15M Training (log)
CondenseNet-74 (C=G=4) 26.82 8.64 4,773,944 546.06M From ShichenLiu/CondenseNet (log)
CondenseNet-74 (C=G=8) 29.76 10.49 2,935,416 291.52M From ShichenLiu/CondenseNet (log)
PeleeNet 29.38 9.79 2,802,248 514.87M Training (log)
WRN-50-2 22.02 6.06 68,849,128 11,405.42M Training (log)
DRN-C-26 24.32 7.11 21,126,584 16,993.90M Training (log)
DRN-C-42 23.80 6.92 31,234,744 25,093.75M From fyu/drn (log)
DRN-C-58 22.35 6.27 40,542,008 32,489.94M From fyu/drn (log)
DRN-D-22 24.67 7.44 16,393,752 13,051.33M Training (log)
DRN-D-38 24.51 7.36 26,501,912 21,151.19M From fyu/drn (log)
DRN-D-54 22.05 6.27 35,809,176 28,547.38M From fyu/drn (log)
DRN-D-105 21.31 5.81 54,801,304 43,442.43M From fyu/drn (log)
DPN-68 22.87 6.58 12,611,602 2,351.84M Training (log)
DPN-98 18.29 4.22 61,570,728 11,716.51M Training (log)
DPN-131 19.42 4.77 79,254,504 16,076.15M Training (log)
DarkNet Tiny 40.31 17.46 1,042,104 500.85M Training (log)
DarkNet Ref 38.00 16.68 7,319,416 367.59M Training (log)
DarkNet-53 21.27 5.50 41,609,928 7,133.86M Training (log)
i-RevNet-301 26.97 8.97 125,120,356 14,453.87M From jhjacobsen/pytorch-i-revnet (log)
BagNet-9 59.57 35.44 15,688,744 16,049.19M From wielandbrendel/bag...models (log)
BagNet-17 44.76 21.52 16,213,032 15,768.77M From wielandbrendel/bag...models (log)
BagNet-33 36.43 14.95 18,310,184 16,371.52M From wielandbrendel/bag...models (log)
DLA-34 24.31 7.05 15,742,104 3,071.37M Training (log)
DLA-46-C 33.84 12.86 1,301,400 585.45M Training (log)
DLA-X-46-C 32.96 12.25 1,068,440 546.72M Training (log)
DLA-60 21.27 5.54 22,036,632 4,255.49M Training (log)
DLA-X-60 20.70 5.53 17,352,344 3,543.68M Training (log)
DLA-X-60-C 30.67 10.74 1,319,832 596.06M Training (log)
DLA-102 20.58 5.17 33,268,888 7,190.95M Training (log)
DLA-X-102 19.59 4.70 26,309,272 5,884.94M Training (log)
DLA-X2-102 18.66 4.23 41,282,200 9,340.61M Training (log)
DLA-169 19.28 4.60 53,389,720 11,593.20M Training (log)
FishNet-150 19.15 4.66 24,959,400 6,435.02M Training (log)
ESPNetv2 x0.5 43.61 21.07 1,241,332 35.36M From sacmehta/ESPNetv2 (log)
ESPNetv2 x1.0 35.33 14.27 1,670,072 98.09M From sacmehta/ESPNetv2 (log)
ESPNetv2 x1.25 33.14 12.73 1,965,440 138.18M From sacmehta/ESPNetv2 (log)
ESPNetv2 x1.5 31.05 11.35 2,314,856 185.77M Training (log)
ESPNetv2 x2.0 28.91 9.94 3,498,136 306.93M From sacmehta/ESPNetv2 (log)
DiCENet x0.2 55.15 30.67 1,130,704 18.70M From sacmehta/EdgeNets (log)
DiCENet x0.5 47.59 23.70 1,214,120 30.39M From sacmehta/EdgeNets (log)
DiCENet x0.75 38.25 16.47 1,495,676 55.64M From sacmehta/EdgeNets (log)
DiCENet x1.0 35.02 14.11 1,805,604 81.96M Training (log)
DiCENet x1.25 33ю11 12.51 2,162,888 111.60M Training (log)
DiCENet x1.5 31.00 11.44 2,652,200 151.48M Training (log)
DiCENet x1.75 30.08 10.81 3,264,932 200.87M Training (log)
DiCENet x2.0 29.93 10.58 3,979,044 257.49M From sacmehta/EdgeNets (log)
HRNet-W18 Small V1 26.20 8.73 13,187,464 1,614.87M Training (log)
HRNet-W18 Small V2 21.71 6.02 15,597,464 2,618.54M Training (log)
HRNetV2-W18 20.15 5.00 21,299,004 4,322.66M Training (log)
HRNetV2-W30 20.30 5.08 37,712,220 8,156.14M Training (log)
HRNetV2-W32 19.94 4.96 41,232,680 8,973.31M Training (log)
HRNetV2-W40 19.65 4.81 57,557,160 12,751.34M Training (log)
HRNetV2-W44 19.67 4.86 67,064,984 14,945.95M Training (log)
HRNetV2-W48 19.46 4.84 77,469,864 17,344.29M Training (log)
HRNetV2-W64 19.50 4.78 128,059,944 28,974.95M Training (log)
VoVNet-39 21.54 5.48 22,600,296 7,086.16M Training (log)
VoVNet-57 20.14 5.10 36,640,296 8,943.09M Training (log)
SelecSLS-42b 21.72 5.96 32,458,248 2,980.62M Training (log)
SelecSLS-60 20.20 5.11 30,670,768 3,591.78M Training (log)
SelecSLS-60b 20.62 5.37 32,774,064 3,629.14M Training (log)
HarDNet-39DS 26.52 8.64 3,488,228 437.52M Training (log)
HarDNet-68DS 24.25 7.38 4,180,602 788.86M Training (log)
HarDNet-68 24.10 7.10 17,565,348 4,256.32M Training (log)
HarDNet-85 21.87 5.72 36,670,212 9,088.58M Training (log)
SqueezeNet v1.0 38.73 17.34 1,248,424 823.67M Training (log)
SqueezeNet v1.1 39.09 17.39 1,235,496 352.02M Training (log)
SqueezeResNet v1.0 39.32 17.67 1,248,424 823.67M Training (log)
SqueezeResNet v1.1 39.83 17.84 1,235,496 352.02M Training (log)
1.0-SqNxt-23 42.25 18.66 724,056 287.28M Training (log)
1.0-SqNxt-23v5 40.43 17.43 921,816 285.82M Training (log)
1.5-SqNxt-23 34.46 13.21 1,511,824 552.39M Training (log)
1.5-SqNxt-23v5 33.48 12.68 1,953,616 550.97M Training (log)
2.0-SqNxt-23 30.24 10.63 2,583,752 898.48M Training (log)
2.0-SqNxt-23v5 29.27 10.24 3,366,344 897.60M Training (log)
ShuffleNet x0.25 (g=1) 62.00 36.77 209,746 12.35M Training (log)
ShuffleNet x0.25 (g=3) 61.34 36.17 305,902 13.09M Training (log)
ShuffleNet x0.5 (g=1) 46.22 22.38 534,484 41.16M Training (log)
ShuffleNet x0.5 (g=3) 43.83 20.60 718,324 41.70M Training (log)
ShuffleNet x0.75 (g=1) 39.25 16.75 975,214 86.42M Training (log)
ShuffleNet x0.75 (g=3) 37.81 16.09 1,238,266 85.82M Training (log)
ShuffleNet x1.0 (g=1) 34.41 13.50 1,531,936 148.13M Training (log)
ShuffleNet x1.0 (g=2) 33.98 13.32 1,733,848 147.60M Training (log)
ShuffleNet x1.0 (g=3) 33.96 13.29 1,865,728 145.46M Training (log)
ShuffleNet x1.0 (g=4) 33.84 13.10 1,968,344 143.33M Training (log)
ShuffleNet x1.0 (g=8) 33.65 13.19 2,434,768 150.76M Training (log)
ShuffleNetV2 x0.5 40.61 18.30 1,366,792 43.31M Training (log)
ShuffleNetV2 x1.0 30.94 11.23 2,278,604 149.72M Training (log)
ShuffleNetV2 x1.5 27.17 9.13 4,406,098 320.77M Training (log)
ShuffleNetV2 x2.0 25.80 8.23 7,601,686 595.84M Training (log)
ShuffleNetV2b x0.5 39.81 17.82 1,366,792 43.31M Training (log)
ShuffleNetV2b x1.0 30.39 11.01 2,279,760 150.62M Training (log)
ShuffleNetV2b x1.5 26.90 8.79 4,410,194 323.98M Training (log)
ShuffleNetV2b x2.0 25.20 8.10 7,611,290 603.37M Training (log)
108-MENet-8x1 (g=3) 43.62 20.30 654,516 42.68M Training (log)
128-MENet-8x1 (g=4) 42.10 19.13 750,796 45.98M Training (log)
128-MENet-8x1 (g=4) 42.10 19.13 750,796 45.98M Training (log)
160-MENet-8x1 (g=8) 43.47 20.28 850,120 45.63M Training (log)
256-MENet-12x1 (g=4) 32.23 12.16 1,888,240 150.65M Training (log)
348-MENet-12x1 (g=3) 27.85 9.36 3,368,128 312.00M Training (log)
352-MENet-12x1 (g=8) 31.30 11.67 2,272,872 157.35M Training (log)
456-MENet-24x1 (g=3) 25.02 7.80 5,304,784 567.90M Training (log)
MobileNet x0.25 45.78 22.18 470,072 44.09M Training (log)
MobileNet x0.5 33.94 13.30 1,331,592 155.42M Training (log)
MobileNet x0.75 29.85 10.51 2,585,560 333.99M Training (log)
MobileNet x1.0 26.43 8.65 4,231,976 579.80M Training (log)
MobileNetb x0.25 45.25 21.65 467,592 42.88M Training (log)
MobileNetb x0.5 32.89 12.71 1,326,632 153.00M Training (log)
MobileNetb x0.75 29.08 10.20 2,578,120 330.37M Training (log)
MobileNetb x1.0 25.06 7.88 4,222,056 574.97M Training (log)
FD-MobileNet x0.25 55.44 30.53 383,160 12.95M Training (log)
FD-MobileNet x0.5 42.62 19.69 993,928 41.84M Training (log)
FD-MobileNet x0.75 37.91 16.01 1,833,304 86.68M Training (log)
FD-MobileNet x1.0 33.80 13.12 2,901,288 147.46M Training (log)
MobileNetV2 x0.25 48.08 24.12 1,516,392 34.24M Training (log)
MobileNetV2 x0.5 35.63 14.42 1,964,736 100.13M Training (log)
MobileNetV2 x0.75 29.78 10.44 2,627,592 198.50M Training (log)
MobileNetV2 x1.0 26.77 8.64 3,504,960 329.36M Training (log)
MobileNetV2b x0.25 46.72 23.38 1,516,312 33.18M Training (log)
MobileNetV2b x0.5 34.26 13.73 1,964,448 96.42M Training (log)
MobileNetV2b x0.75 30.19 10.64 2,626,968 190.52M Training (log)
MobileNetV2b x1.0 27.16 8.84 3,503,872 315.51M Training (log)
MobileNetV3 L/224/1.0 24.36 7.29 5,481,752 226.80M Training (log)
IGCV3 x0.25 53.43 28.30 1,534,020 41.29M Training (log)
IGCV3 x0.5 39.41 17.03 1,985,528 111.12M Training (log)
IGCV3 x0.75 30.71 10.96 2,638,084 210.95M Training (log)
IGCV3 x1.0 27.73 9.00 3,491,688 340.79M Training (log)
MnasNet-B1 24.67 7.23 4,383,312 326.30M Training (log)
MnasNet-A1 24.04 7.05 3,887,038 325.77M Training (log)
DARTS 24.91 7.56 4,718,752 539.86M Training (log)
ProxylessNAS CPU 24.78 7.50 4,361,648 459.96M Training (log)
ProxylessNAS GPU 24.67 7.24 7,119,848 476.08M Training (log)
ProxylessNAS Mobile 25.31 7.80 4,080,512 332.46M Training (log)
ProxylessNAS Mob-14 22.96 6.51 6,857,568 597.10M Training (log)
FBNet-Cb 24.86 7.61 5,572,200 399.26M Training (log)
Xception 20.43 5.11 22,855,952 8,403.63M Training (log)
InceptionV3 21.22 5.59 23,834,568 5,743.06M From dmlc/gluon-cv (log)
InceptionV4 19.83 4.88 42,679,816 12,304.93M Training (log)
InceptionResNetV2 19.48 4.70 55,843,464 13,188.64M Training (log)
PolyNet 19.09 4.53 95,366,600 34,821.34M From Cadene/pretrained...pytorch (log)
NASNet-A 4@1056 25.16 7.90 5,289,978 584.90M Training (log)
NASNet-A 6@4032 18.17 4.24 88,753,150 23,976.44M From Cadene/pretrained...pytorch (log)
PNASNet-5-Large 17.90 4.28 86,057,668 25,140.77M From Cadene/pretrained...pytorch (log)
SPNASNet 25.10 7.76 4,421,616 346.73M Training (log)
EfficientNet-B0 24.50 7.22 5,288,548 413.13M Training (log)
EfficientNet-B1 22.89 6.26 7,794,184 730.44M Training (log)
EfficientNet-B0b 22.96 6.70 5,288,548 413.13M From rwightman/pyt...models (log)
EfficientNet-B1b 20.98 5.65 7,794,184 730.44M From rwightman/pyt...models (log)
EfficientNet-B2b 19.94 5.16 9,109,994 1,049.29M From rwightman/pyt...models (log)
EfficientNet-B3b 18.60 4.31 12,233,232 1,923.98M From rwightman/pyt...models (log)
EfficientNet-B4b 17.25 3.76 19,341,616 4,597.56M From rwightman/pyt...models (log)
EfficientNet-B5b 16.39 3.34 30,389,784 10,674.67M From rwightman/pyt...models (log)
EfficientNet-B6b 15.96 3.12 43,040,704 19,761.35M From rwightman/pyt...models (log)
EfficientNet-B7b 15.70 3.11 66,347,960 38,949.07M From rwightman/pyt...models (log)
EfficientNet-B0c* 22.52 6.46 5,288,548 413.13M From rwightman/pyt...models (log)
EfficientNet-B1c* 20.50 5.55 7,794,184 730.44M From rwightman/pyt...models (log)
EfficientNet-B2c* 19.60 4.89 9,109,994 1,049.29M From rwightman/pyt...models (log)
EfficientNet-B3c* 18.19 4.34 12,233,232 1,923.98M From rwightman/pyt...models (log)
EfficientNet-B4c* 16.74 3.59 19,341,616 4,597.56M From rwightman/pyt...models (log)
EfficientNet-B5c* 15.79 3.02 30,389,784 10,674.67M From rwightman/pyt...models (log)
EfficientNet-B6c* 15.29 2.85 43,040,704 19,761.35M From rwightman/pyt...models (log)
EfficientNet-B7c* 14.87 2.77 66,347,960 38,949.07M From rwightman/pyt...models (log)
EfficientNet-B8c* 14.61 2.70 87,413,142 64,446.06M From rwightman/pyt...models (log)
EfficientNet-Edge-Small-b* 22.48 6.29 5,438,392 2,378.09M From rwightman/pyt...models (log)
EfficientNet-Edge-Medium-b* 21.08 5.53 6,899,496 3,700.08M From rwightman/pyt...models (log)
EfficientNet-Edge-Large-b* 19.50 4.77 10,589,712 9,747.58M From rwightman/pyt...models (log)
MixNet-S 23.83 7.03 4,134,606 260.26M Training (log)
MixNet-M 22.37 6.31 5,014,382 366.05M Training (log)
MixNet-L 21.48 5.57 7,329,252 590.45M Training (log)
ResNet(A)-10 30.89 11.59 5,438,024 1,135.85M Training (log)
ResNet(A)-BC-14 27.75 9.56 10,084,168 1,721.52M Training (log)
ResNet(A)-18 25.38 8.02 11,708,744 2,062.22M Training (log)
ResNet(A)-50b 20.78 5.34 25,576,264 4,352.88M Training (log)
ResNet(A)-101b 18.98 4.42 44,568,392 8,072.88M Training (log)
ResNet(A)-152b 18.58 4.24 60,212,040 11,796.78M Training (log)
ResNet(D)-50b 20.79 5.49 25,680,808 20,496.80M From dmlc/gluon-cv (log)
ResNet(D)-101b 19.49 4.61 44,672,936 35,391.85M From dmlc/gluon-cv (log)
ResNet(D)-152b 19.39 4.67 60,316,584 47,661.38M From dmlc/gluon-cv (log)

CIFAR-10

Some remarks:

  • Testing subset is used for validation purpose.
  • Features means feature extractor output size.
Model Error, % Features Params FLOPs/2 Remarks
NIN 7.43 192 966,986 222.97M Training (log)
ResNet-20 5.97 64 272,474 41.29M Training (log)
ResNet-56 4.52 64 855,770 127.06M Training (log)
ResNet-110 3.69 64 1,730,714 255.70M Training (log)
ResNet-164(BN) 3.68 256 1,704,154 255.31M Training (log)
ResNet-272(BN) 3.33 256 2,816,986 420.61M Training (log)
ResNet-542(BN) 3.43 256 5,599,066 833.87M Training (log)
ResNet-1001 3.28 256 10,328,602 1,536.40M Training (log)
ResNet-1202 3.53 64 19,424,026 2,857.17M Training (log)
PreResNet-20 6.51 64 272,282 41.27M Training (log)
PreResNet-56 4.49 64 855,578 127.03M Training (log)
PreResNet-110 3.86 64 1,730,522 255.68M Training (log)
PreResNet-164(BN) 3.64 256 1,703,258 255.08M Training (log)
PreResNet-272(BN) 3.25 256 2,816,090 420.38M Training (log)
PreResNet-542(BN) 3.14 256 5,598,170 833.64M Training (log)
PreResNet-1001 2.65 256 10,327,706 1,536.18M Training (log)
PreResNet-1202 3.39 64 19,423,834 2,857.14M Training (log)
ResNeXt-20 (1x64d) 4.33 1024 3,446,602 538.36M Training (log)
ResNeXt-20 (2x32d) 4.53 1024 2,672,458 425.15M Training (log)
ResNeXt-20 (4x16d) 4.70 1024 2,285,386 368.55M Training (log)
ResNeXt-20 (8x8d) 4.66 1024 2,091,850 340.25M Training (log)
ResNeXt-20 (16x4d) 4.04 1024 1,995,082 326.10M Training (log)
ResNeXt-20 (32x2d) 4.61 1024 1,946,698 319.03M Training (log)
ResNeXt-20 (64x1d) 4.93 1024 1,922,506 315.49M Training (log)
ResNeXt-20 (2x64d) 4.03 1024 6,198,602 987.98M Training (log)
ResNeXt-20 (4x32d) 3.73 1024 4,650,314 761.57M Training (log)
ResNeXt-20 (8x16d) 4.04 1024 3,876,170 648.37M Training (log)
ResNeXt-20 (16x8d) 3.94 1024 3,489,098 591.77M Training (log)
ResNeXt-20 (32x4d) 4.20 1024 3,295,562 563.47M Training (log)
ResNeXt-20 (64x2d) 4.38 1024 3,198,794 549.32M Training (log)
ResNeXt-56 (1x64d) 2.87 1024 9,317,194 1,399.33M Training (log)
ResNeXt-56 (2x32d) 3.01 1024 6,994,762 1,059.72M Training (log)
ResNeXt-56 (4x16d) 3.11 1024 5,833,546 889.91M Training (log)
ResNeXt-56 (8x8d) 3.07 1024 5,252,938 805.01M Training (log)
ResNeXt-56 (16x4d) 3.12 1024 4,962,634 762.56M Training (log)
ResNeXt-56 (32x2d) 3.14 1024 4,817,482 741.34M Training (log)
ResNeXt-56 (64x1d) 3.41 1024 4,744,906 730.72M Training (log)
ResNeXt-29 (32x4d) 3.15 1024 4,775,754 780.55M Training (log)
ResNeXt-29 (16x64d) 2.41 1024 68,155,210 10,709.34M Training (log)
ResNeXt-272 (1x64d) 2.55 1024 44,540,746 6,565.15M Training (log)
ResNeXt-272 (2x32d) 2.74 1024 32,928,586 4,867.11M Training (log)
SE-ResNet-20 6.01 64 274,847 41.30M Training (log)
SE-ResNet-56 4.13 64 862,889 127.07M Training (log)
SE-ResNet-110 3.63 64 1,744,952 255.72M Training (log)
SE-ResNet-164(BN) 3.39 256 1,906,258 255.52M Training (log)
SE-ResNet-272(BN) 3.39 256 3,153,826 420.96M Training (log)
SE-ResNet-542(BN) 3.47 256 6,272,746 834.57M Training (log)
SE-PreResNet-20 6.18 64 274,559 41.30M Training (log)
SE-PreResNet-56 4.51 64 862,601 127.07M Training (log)
SE-PreResNet-110 4.54 64 1,744,664 255.72M Training (log)
SE-PreResNet-164(BN) 3.73 256 1,904,882 255.29M Training (log)
SE-PreResNet-272(BN) 3.39 256 3,152,450 420.73M Training (log)
SE-PreResNet-542(BN) 3.08 256 6,271,370 834.34M Training (log)
PyramidNet-110 (a=48) 3.72 64 1,772,706 408.37M Training (log)
PyramidNet-110 (a=84) 2.98 100 3,904,446 778.15M Training (log)
PyramidNet-110 (a=270) 2.51 286 28,485,477 4,730.60M Training (log)
PyramidNet-164 (a=270, BN) 2.42 1144 27,216,021 4,608.81M Training (log)
PyramidNet-200 (a=240, BN) 2.44 1024 26,752,702 4,563.40M Training (log)
PyramidNet-236 (a=220, BN) 2.47 944 26,969,046 4,631.32M Training (log)
PyramidNet-272 (a=200, BN) 2.39 864 26,210,842 4,541.36M Training (log)
DenseNet-40 (k=12) 5.61 258 599,050 210.80M Training (log)
DenseNet-BC-40 (k=12) 6.43 132 176,122 74.89M Training (log)
DenseNet-BC-40 (k=24) 4.52 264 690,346 293.09M Training (log)
DenseNet-BC-40 (k=36) 4.04 396 1,542,682 654.60M Training (log)
DenseNet-100 (k=12) 3.66 678 4,068,490 1,353.55M Training (log)
DenseNet-100 (k=24) 3.13 1356 16,114,138 5,354.19M Training (log)
DenseNet-BC-100 (k=12) 4.16 342 769,162 298.45M Training (log)
DenseNet-BC-190 (k=40) 2.52 2190 25,624,430 9,400.45M Training (log)
DenseNet-BC-250 (k=24) 2.67 1734 15,324,406 5,519.54M Training (log)
X-DenseNet-BC-40-2 (k=24) 5.31 264 690,346 293.09M Training (log)
X-DenseNet-BC-40-2 (k=36) 4.37 396 1,542,682 654.60M Training (log)
WRN-16-10 2.93 640 17,116,634 2,414.04M Training (log)
WRN-28-10 2.39 640 36,479,194 5,246.98M Training (log)
WRN-40-8 2.37 512 35,748,314 5,176.90M Training (log)
WRN-20-10-1bit 3.26 640 26,737,140 4,019.14M Training (log)
WRN-20-10-32bit 3.14 640 26,737,140 4,019.14M Training (log)
RoR-3-56 5.43 64 762,746 113.43M Training (log)
RoR-3-110 4.35 64 1,637,690 242.07M Training (log)
RoR-3-164 3.93 64 2,512,634 370.72M Training (log)
RiR 3.28 384 9,492,980 1,281.08M Training (log)
Shake-Shake-ResNet-20-2x16d 5.15 64 541,082 81.78M Training (log)
Shake-Shake-ResNet-26-2x32d 3.17 64 2,923,162 428.89M Training (log)
DIA-ResNet-20 6.22 64 286,866 41.34M Training (log)
DIA-ResNet-56 5.05 64 870,162 127.18M Training (log)
DIA-ResNet-110 4.10 64 1,745,106 255.94M Training (log)
DIA-ResNet-164(BN) 3.50 256 1,923,002 259.18M Training (log)
DIA-PreResNet-20 6.42 64 286,674 41.31M Training (log)
DIA-PreResNet-56 4.83 64 869,970 127.15M Training (log)
DIA-PreResNet-110 4.25 64 1,744,914 255.92M Training (log)
DIA-PreResNet-164(BN) 3.56 256 1,922,106 258.95M Training (log)

CIFAR-100

Some remarks:

  • Testing subset is used for validation purpose.
Model Error, % Params FLOPs/2 Remarks
NIN 28.39 984,356 224.08M Training (log)
ResNet-20 29.64 278,324 41.30M Training (log)
ResNet-56 24.88 861,620 127.06M Training (log)
ResNet-110 22.80 1,736,564 255.71M Training (log)
ResNet-164(BN) 20.44 1,727,284 255.33M Training (log)
ResNet-272(BN) 20.07 2,840,116 420.63M Training (log)
ResNet-542(BN) 19.32 5,622,196 833.89M Training (log)
ResNet-1001 19.79 10,351,732 1,536.43M Training (log)
ResNet-1202 21.56 19,429,876 2,857.17M Training (log)
PreResNet-20 30.22 278,132 41.28M Training (log)
PreResNet-56 25.05 861,428 127.04M Training (log)
PreResNet-110 22.67 1,736,372 255.68M Training (log)
PreResNet-164(BN) 20.18 1,726,388 255.10M Training (log)
PreResNet-272(BN) 19.63 2,839,220 420.40M Training (log)
PreResNet-542(BN) 18.71 5,621,300 833.66M Training (log)
PreResNet-1001 18.41 10,350,836 1,536.20M Training (log)
ResNeXt-20 (1x64d) 21.97 3,538,852 538.45M Training (log)
ResNeXt-20 (2x32d) 22.55 2,764,708 425.25M Training (log)
ResNeXt-20 (4x16d) 23.04 2,377,636 368.65M Training (log)
ResNeXt-20 (8x8d) 22.82 2,184,100 340.34M Training (log)
ResNeXt-20 (16x4d) 22.82 2,087,332 326.19M Training (log)
ResNeXt-20 (32x2d) 21.73 2,038,948 319.12M Training (log)
ResNeXt-20 (64x1d) 23.53 2,014,756 315.58M Training (log)
ResNeXt-20 (2x64d) 20.60 6,290,852 988.07M Training (log)
ResNeXt-20 (4x32d) 21.31 4,742,564 761.66M Training (log)
ResNeXt-20 (8x16d) 21.72 3,968,420 648.46M Training (log)
ResNeXt-20 (16x8d) 21.73 3,581,348 591.86M Training (log)
ResNeXt-20 (32x4d) 22.13 3,387,812 563.56M Training (log)
ResNeXt-20 (64x2d) 22.35 3,291,044 549.41M Training (log)
ResNeXt-56 (1x64d) 18.25 9,409,444 1,399.42M Training (log)
ResNeXt-56 (2x32d) 17.86 7,087,012 1,059.81M Training (log)
ResNeXt-56 (4x16d) 18.09 5,925,796 890.01M Training (log)
ResNeXt-56 (8x8d) 18.06 5,345,188 805.10M Training (log)
ResNeXt-56 (16x4d) 18.24 5,054,884 762.65M Training (log)
ResNeXt-56 (32x2d) 18.60 4,909,732 741.43M Training (log)
ResNeXt-56 (64x1d) 18.16 4,837,156 730.81M Training (log)
ResNeXt-29 (32x4d) 19.50 4,868,004 780.64M Training (log)
ResNeXt-29 (16x64d) 16.93 68,247,460 10,709.43M Training (log)
ResNeXt-272 (1x64d) 19.11 44,632,996 6,565.25M Training (log)
ResNeXt-272 (2x32d) 18.34 33,020,836 4,867.20M Training (log)
SE-ResNet-20 28.54 280,697 41.30M Training (log)
SE-ResNet-56 22.94 868,739 127.07M Training (log)
SE-ResNet-110 20.86 1,750,802 255.72M Training (log)
SE-ResNet-164(BN) 19.95 1,929,388 255.54M Training (log)
SE-ResNet-272(BN) 19.07 3,176,956 420.98M Training (log)
SE-ResNet-542(BN) 18.87 6,295,876 834.59M Training (log)
SE-PreResNet-20 28.31 280,409 41.31M Training (log)
SE-PreResNet-56 23.05 868,451 127.08M Training (log)
SE-PreResNet-110 22.61 1,750,514 255.73M Training (log)
SE-PreResNet-164(BN) 20.05 1,928,012 255.31M Training (log)
SE-PreResNet-272(BN) 19.13 3,175,580 420.75M Training (log)
SE-PreResNet-542(BN) 19.45 6,294,500 834.36M Training (log)
PyramidNet-110 (a=48) 20.95 1,778,556 408.38M Training (log)
PyramidNet-110 (a=84) 18.87 3,913,536 778.16M Training (log)
PyramidNet-110 (a=270) 17.10 28,511,307 4,730.62M Training (log)
PyramidNet-164 (a=270, BN) 16.70 27,319,071 4,608.91M Training (log)
PyramidNet-200 (a=240, BN) 16.09 26,844,952 4,563.49M Training (log)
PyramidNet-236 (a=220, BN) 16.34 27,054,096 4,631.41M Training (log)
PyramidNet-272 (a=200, BN) 16.19 26,288,692 4,541.43M Training (log)
DenseNet-40 (k=12) 24.90 622,360 210.82M Training (log)
DenseNet-BC-40 (k=12) 28.41 188,092 74.90M Training (log)
DenseNet-BC-40 (k=24) 22.67 714,196 293.11M Training (log)
DenseNet-BC-40 (k=36) 20.50 1,578,412 654.64M Training (log)
DenseNet-100 (k=12) 19.64 4,129,600 1,353.62M Training (log)
DenseNet-100 (k=24) 18.08 16,236,268 5,354.32M Training (log)
DenseNet-BC-100 (k=12) 21.19 800,032 298.48M Training (log)
DenseNet-BC-250 (k=24) 17.39 15,480,556 5,519.69M Training (log)
X-DenseNet-BC-40-2 (k=24) 23.96 714,196 293.11M Training (log)
X-DenseNet-BC-40-2 (k=36) 21.65 1,578,412 654.64M Training (log)
WRN-16-10 18.95 17,174,324 2,414.09M Training (log)
WRN-28-10 17.88 36,536,884 5,247.04M Training (log)
WRN-40-8 18.03 35,794,484 5,176.95M Training (log)
WRN-20-10-1bit 19.04 26,794,920 4,022.81M Training (log)
WRN-20-10-32bit 18.12 26,794,920 4,022.81M Training (log)
RoR-3-56 25.49 768,596 113.43M Training (log)
RoR-3-110 23.64 1,643,540 242.08M Training (log)
RoR-3-164 22.34 2,518,484 370.72M Training (log)
RiR 19.23 9,527,720 1,283.29M Training (log)
Shake-Shake-ResNet-20-2x16d 29.22 546,932 81.79M Training (log)
Shake-Shake-ResNet-26-2x32d 18.80 2,934,772 428.90M Training (log)
DIA-ResNet-20 27.71 292,716 41.34M Training (log)
DIA-ResNet-56 24.35 876,012 127.18M Training (log)
DIA-ResNet-110 22.11 1,750,956 255.95M Training (log)
DIA-ResNet-164(BN) 19.53 1,946,132 259.20M Training (log)
DIA-PreResNet-20 28.37 292,524 41.32M Training (log)
DIA-PreResNet-56 25.05 875,820 127.16M Training (log)
DIA-PreResNet-110 22.69 1,750,764 255.92M Training (log)
DIA-PreResNet-164(BN) 19.99 1,945,236 258.97M Training (log)

SVHN

Model Error, % Params FLOPs/2 Remarks
NIN 3.76 966,986 222.97M Training (log)
ResNet-20 3.43 272,474 41.29M Training (log)
ResNet-56 2.75 855,770 127.06M Training (log)
ResNet-110 2.45 1,730,714 255.70M Training (log)
ResNet-164(BN) 2.42 1,704,154 255.31M Training (log)
ResNet-272(BN) 2.43 2,816,986 420.61M Training (log)
ResNet-542(BN) 2.34 5,599,066 833.87M Training (log)
ResNet-1001 2.41 10,328,602 1,536.40M Training (log)
PreResNet-20 3.22 272,282 41.27M Training (log)
PreResNet-56 2.80 855,578 127.03M Training (log)
PreResNet-110 2.79 1,730,522 255.68M Training (log)
PreResNet-164(BN) 2.58 1,703,258 255.08M Training (log)
PreResNet-272(BN) 2.34 2,816,090 420.38M Training (log)
PreResNet-542(BN) 2.36 5,598,170 833.64M Training (log)
ResNeXt-20 (1x64d) 2.98 3,446,602 538.36M Training (log)
ResNeXt-20 (2x32d) 2.96 2,672,458 425.15M Training (log)
ResNeXt-20 (4x16d) 3.17 2,285,386 368.55M Training (log)
ResNeXt-20 (8x8d) 3.18 2,091,850 340.25M Training (log)
ResNeXt-20 (16x4d) 3.21 1,995,082 326.10M Training (log)
ResNeXt-20 (32x2d) 3.27 1,946,698 319.03M Training (log)
ResNeXt-20 (64x1d) 3.42 1,922,506 315.49M Training (log)
ResNeXt-20 (2x64d) 2.83 6,198,602 987.98M Training (log)
ResNeXt-20 (4x32d) 2.98 4,650,314 761.57M Training (log)
ResNeXt-20 (8x16d) 3.01 3,876,170 648.37M Training (log)
ResNeXt-20 (16x8d) 2.93 3,489,098 591.77M Training (log)
ResNeXt-20 (32x4d) 3.09 3,295,562 563.47M Training (log)
ResNeXt-20 (64x2d) 3.14 3,198,794 549.32M Training (log)
ResNeXt-56 (1x64d) 2.42 9,317,194 1,399.33M Training (log)
ResNeXt-56 (2x32d) 2.46 6,994,762 1,059.72M Training (log)
ResNeXt-56 (4x16d) 2.44 5,833,546 889.91M Training (log)
ResNeXt-56 (8x8d) 2.47 5,252,938 805.01M Training (log)
ResNeXt-56 (16x4d) 2.56 4,962,634 762.56M Training (log)
ResNeXt-56 (32x2d) 2.53 4,817,482 741.34M Training (log)
ResNeXt-56 (64x1d) 2.55 4,744,906 730.72M Training (log)
ResNeXt-29 (32x4d) 2.80 4,775,754 780.55M Training (log)
ResNeXt-29 (16x64d) 2.68 68,155,210 10,709.34M Training (log)
ResNeXt-272 (1x64d) 2.35 44,540,746 6,565.15M Training (log)
ResNeXt-272 (2x32d) 2.44 32,928,586 4,867.11M Training (log)
SE-ResNet-20 3.23 274,847 41.30M Training (log)
SE-ResNet-56 2.64 862,889 127.07M Training (log)
SE-ResNet-110 2.35 1,744,952 255.72M Training (log)
SE-ResNet-164(BN) 2.45 1,906,258 255.52M Training (log)
SE-ResNet-272(BN) 2.38 3,153,826 420.96M Training (log)
SE-ResNet-542(BN) 2.26 6,272,746 834.57M Training (log)
SE-PreResNet-20 3.24 274,559 41.30M Training (log)
SE-PreResNet-56 2.71 862,601 127.07M Training (log)
SE-PreResNet-110 2.59 1,744,664 255.72M Training (log)
SE-PreResNet-164(BN) 2.56 1,904,882 255.29M Training (log)
SE-PreResNet-272(BN) 2.49 3,152,450 420.73M Training (log)
SE-PreResNet-542(BN) 2.47 6,271,370 834.34M Training (log)
PyramidNet-110 (a=48) 2.47 1,772,706 408.37M Training (log)
PyramidNet-110 (a=84) 2.43 3,904,446 778.15M Training (log)
PyramidNet-110 (a=270) 2.38 28,485,477 4,730.60M Training (log)
PyramidNet-164 (a=270, BN) 2.33 27,216,021 4,608.81M Training (log)
PyramidNet-200 (a=240, BN) 2.32 26,752,702 4,563.40M Training (log)
PyramidNet-236 (a=220, BN) 2.35 26,969,046 4,631.32M Training (log)
PyramidNet-272 (a=200, BN) 2.40 26,210,842 4,541.36M Training (log)
DenseNet-40 (k=12) 3.05 599,050 210.80M Training (log)
DenseNet-BC-40 (k=12) 3.20 176,122 74.89M Training (log)
DenseNet-BC-40 (k=24) 2.90 690,346 293.09M Training (log)
DenseNet-BC-40 (k=36) 2.60 1,542,682 654.60M Training (log)
DenseNet-100 (k=12) 2.60 4,068,490 1,353.55M Training (log)
X-DenseNet-BC-40-2 (k=24) 2.87 690,346 293.09M Training (log)
X-DenseNet-BC-40-2 (k=36) 2.74 1,542,682 654.60M Training (log)
WRN-16-10 2.78 17,116,634 2,414.04M Training (log)
WRN-28-10 2.71 36,479,194 5,246.98M Training (log)
WRN-40-8 2.54 35,748,314 5,176.90M Training (log)
WRN-20-10-1bit 2.73 26,737,140 4,019.14M Training (log)
WRN-20-10-32bit 2.59 26,737,140 4,019.14M Training (log)
RoR-3-56 2.69 762,746 113.43M Training (log)
RoR-3-110 2.57 1,637,690 242.07M Training (log)
RoR-3-164 2.73 2,512,634 370.72M Training (log)
RiR 2.68 9,492,980 1,281.08M Training (log)
Shake-Shake-ResNet-20-2x16d 3.17 541,082 81.78M Training (log)
Shake-Shake-ResNet-26-2x32d 2.62 2,923,162 428.89M Training (log)
DIA-ResNet-20 3.23 286,866 41.34M Training (log)
DIA-ResNet-56 2.68 870,162 127.18M Training (log)
DIA-ResNet-110 2.47 1,745,106 255.94M Training (log)
DIA-ResNet-164(BN) 2.44 1,923,002 259.18M Training (log)
DIA-PreResNet-20 3.03 286,674 41.31M Training (log)
DIA-PreResNet-56 2.80 869,970 127.15M Training (log)
DIA-PreResNet-110 2.42 1,744,914 255.92M Training (log)
DIA-PreResNet-164(BN) 2.56 1,922,106 258.95M Training (log)

CUB-200-2011

Model Error, % Params FLOPs/2 Remarks
ResNet-10 27.65 5,008,392 893.63M Training (log)
ResNet-12 26.58 5,082,376 1,125.84M Training (log)
ResNet-14 24.35 5,377,800 1,357.53M Training (log)
ResNet-16 23.21 6,558,472 1,588.93M Training (log)
ResNet-18 23.30 11,279,112 1,820.00M Training (log)
ResNet-26 22.52 17,549,832 2,746.38M Training (log)
SE-ResNet-10 27.39 5,052,932 893.67M Training (log)
SE-ResNet-12 26.04 5,127,496 1,125.88M Training (log)
SE-ResNet-14 23.63 5,425,104 1,357.58M Training (log)
SE-ResNet-16 23.21 6,614,240 1,588.99M Training (log)
SE-ResNet-18 23.08 11,368,192 1,820.10M Training (log)
SE-ResNet-26 22.51 17,683,452 2,746.52M Training (log)
MobileNet x1.0 23.46 3,411,976 578.98M Training (log)
ProxylessNAS Mobile 21.88 3,055,712 331.44M Training (log)
NTS-Net 13.26 28,623,333 33,361.39M From yangze0930/NTS-Net (log)

Pascal VOC20102

Model Extractor Pix.Acc.,% mIoU,% Params FLOPs/2 Remarks
PSPNet ResNet(D)-101b 98.09 81.44 65,708,501 230,586.69M From dmlc/gluon-cv (log)
DeepLabv3 ResNet(D)-101b 97.95 80.24 58,754,773 47,624.54M From dmlc/gluon-cv (log)
DeepLabv3 ResNet(D)-152b 98.11 81.20 74,398,421 59,894.06M From dmlc/gluon-cv (log)
FCN-8s(d) ResNet(D)-101b 97.80 80.40 52,072,917 196,562.96M From dmlc/gluon-cv (log)

ADE20K

Model Extractor Pix.Acc.,% mIoU,% Params FLOPs/2 Remarks
PSPNet ResNet(D)-50b 79.37 36.87 46,782,550 162,410.82M From dmlc/gluon-cv (log)
PSPNet ResNet(D)-101b 79.93 37.97 65,774,678 230,824.47M From dmlc/gluon-cv (log)
DeepLabv3 ResNet(D)-50b 79.72 37.13 39,795,798 32,755.38M From dmlc/gluon-cv (log)
DeepLabv3 ResNet(D)-101b 80.21 37.84 58,787,926 47,650.43M From dmlc/gluon-cv (log)
FCN-8s(d) ResNet(D)-50b 76.92 33.39 33,146,966 128,387.08M From dmlc/gluon-cv (log)
FCN-8s(d) ResNet(D)-101b 79.01 35.88 52,139,094 196,800.73M From dmlc/gluon-cv (log)

Cityscapes

Model Extractor Pix.Acc.,% mIoU,% Params FLOPs/2 Remarks
PSPNet ResNet(D)-101b 96.17 71.72 65,707,475 230,583.01M From dmlc/gluon-cv (log)
ICNet ResNet(D)-50b 95.50 64.02 47,489,184 14,241.91M From dmlc/gluon-cv (log)
Fast-SCNN - 95.14 65.76 1,138,051 3,490.05M From dmlc/gluon-cv (log)
SINet - 93.66 60.31 119,418 1,411.97M From clovaai/c3_sinet (log)
DANet ResNet(D)-50b 95.91 67.99 47,586,427 180,370.99M From dmlc/gluon-cv (log)
DANet ResNet(D)-101b 96.03 68.10 66,578,555 248,784.64M From dmlc/gluon-cv (log)

COCO Semantic Segmentation

Model Extractor Pix.Acc.,% mIoU,% Params FLOPs/2 Remarks
PSPNet ResNet(D)-101b 92.05 67.41 65,708,501 230,586.69M From dmlc/gluon-cv (log)
DeepLabv3 ResNet(D)-101b 92.19 67.73 58,754,773 47,624.54M From dmlc/gluon-cv (log)
DeepLabv3 ResNet(D)-152b 92.24 68.99 74,398,421 275,084.22M From dmlc/gluon-cv (log)
FCN-8s(d) ResNet(D)-101b 91.44 60.11 52,072,917 196,562.96M From dmlc/gluon-cv (log)

CelebAMask-HQ

Model Extractor Params FLOPs/2 Remarks
BiSeNet ResNet-18 13,300,416 - From zllrunning/face...Torch (log)

COCO Keypoints Detection

Model Extractor OKS AP, % Params FLOPs/2 Remarks
AlphaPose Fast-SE-ResNet-101b 74.15/91.59/80.68 59,569,873 9,553.15M From dmlc/gluon-cv (log)
SimplePose ResNet-18 66.31/89.20/73.41 15,376,721 1,799.25M From dmlc/gluon-cv (log)
SimplePose ResNet-50b 71.02/91.23/78.57 33,999,697 4,041.06M From dmlc/gluon-cv (log)
SimplePose ResNet-101b 72.44/92.18/79.76 52,991,825 7,685.04M From dmlc/gluon-cv (log)
SimplePose ResNet-152b 72.53/92.14/79.61 68,635,473 11,332.86M From dmlc/gluon-cv (log)
SimplePose ResNet(A)-50b 71.70/91.31/78.66 34,018,929 4,278.56M From dmlc/gluon-cv (log)
SimplePose ResNet(A)-101b 72.97/92.24/80.81 53,011,057 7,922.54M From dmlc/gluon-cv (log)
SimplePose ResNet(A)-152b 73.44/92.27/80.72 68,654,705 11,570.36M From dmlc/gluon-cv (log)
SimplePose(Mobile) ResNet-18 66.25/89.17/74.32 12,858,208 1,960.96M From dmlc/gluon-cv (log)
SimplePose(Mobile) ResNet-50b 71.10/91.28/78.67 25,582,944 4,221.30M From dmlc/gluon-cv (log)
SimplePose(Mobile) 1.0 MobileNet-224 64.10/88.06/71.23 5,019,744 751.36M From dmlc/gluon-cv (log)
SimplePose(Mobile) 1.0 MobileNetV2b-224 63.74/88.12/71.06 4,102,176 495.95M From dmlc/gluon-cv (log)
SimplePose(Mobile) MobileNetV3 Small 224/1.0 54.34/83.67/59.35 2,625,088 236.51M From dmlc/gluon-cv (log)
SimplePose(Mobile) MobileNetV3 Large 224/1.0 63.67/88.91/70.82 4,768,336 403.97M From dmlc/gluon-cv (log)
Lightweight OpenPose 2D MobileNet 39.99/65.95/40.70 4,091,698 8,948.96M From Daniil-Osokin/lighw...ch (log)
Lightweight OpenPose 3D MobileNet 39.99/65.95/40.70 5,085,983 11,049.43M From Daniil-Osokin/li...3d...ch (log)
IBPPose - 64.86/83.62/70.12 95,827,784 57,193.82M From jialee93/Improved...Parts (log)

Mozilla Common Voice (Corpus 6.1, dev subset)

Some remarks:

  • NR means Noise Reduction.
  • LS means trained on LibriSpeech dataset.
Model Lang WER, % Params FLOPs/2 Remarks
Jasper DR 10x5 En 21.89 332,632,349 16,617.80M From NVIDIA/NeMo (log)
Jasper DR 10x5 NR En 17.89 332,632,349 16,617.80M From NVIDIA/NeMo (log)
QuartzNet 5x5 LS En 44.68 6,713,181 332.93M From NVIDIA/NeMo (log)
QuartzNet 15x5 En 16.76 18,924,381 938.49M From NVIDIA/NeMo (log)
QuartzNet 15x5 NR En 17.73 18,924,381 938.49M From NVIDIA/NeMo (log)
QuartzNet 15x5 De 11.66 18,927,456 938.64M From NVIDIA/NeMo (log)
QuartzNet 15x5 Fr 13.88 18,938,731 939.20M From NVIDIA/NeMo (log)
QuartzNet 15x5 It 15.02 18,934,631 939.00M From NVIDIA/NeMo (log)
QuartzNet 15x5 Es 12.95 18,931,556 938.85M From NVIDIA/NeMo (log)
QuartzNet 15x5 Ca 8.42 18,934,631 939.00M From NVIDIA/NeMo (log)
QuartzNet 15x5 Pl 13.59 18,929,506 938.74M From NVIDIA/NeMo (log)
QuartzNet 15x5 Ru 16.47 18,930,531 938.79M From NVIDIA/NeMo (log)