rwightman / pytorch-image-models

TOP 1 ACCURACY TOP 5 ACCURACY
SPEED
MODEL CODE PAPER
ε-REPR
CODE PAPER
ε-REPR
PAPER
GLOBAL RANK
Adversarial Inception V3
77.6% -- 93.7% -- 269.1 #203
DLA-102
78.0% -- 94.0% -- 384.0 #188
DLA-169
78.7% -- 94.3% -- 368.5 #153
DLA-34
74.6% -- 92.1% -- 404.7 #280
DLA-46-C
64.9% -- 86.3% -- 407.5 #387
DLA-60
77.0% -- 93.3% -- 391.0 #231
DLA-X-102
78.5% -- 94.2% -- 381.6 #165
DLA-X-102 64
79.5% -- 94.6% -- 348.3 #108
DLA-X-46-C
66.0% -- 87.0% -- 405.5 #376
DLA-X-60
78.2% -- 94.0% -- 387.5 #181
DLA-X-60-C
67.9% -- 88.4% -- 404.8 #357
DPN-107
(224x224)
80.2% -- 94.9% -- 215.4 #83
DPN-107
(320x320, Mean-Max Pooling)
81.8% -- 95.9% -- 106.6 #50
DPN-131
(224x224)
79.8% 80.1% 94.7% 94.9% 240.0 #95
DPN-131
(320x320, Mean-Max Pooling)
81.4% 81.5% 95.8% 95.8% 118.6 #53
DPN-68
(224x224)
76.3% 76.4% 93.0% 93.1% 357.0 #244
DPN-68
(320x320, Mean-Max Pooling)
78.5% 78.5% 94.4% 94.5% 292.3 #140
DPN-68b
(224x224)
77.5% -- 93.8% -- 363.5 #207
DPN-68b
(320x320, Mean-Max Pooling)
79.4% -- 95.0% -- 295.3 #82
DPN-92
(224x224)
80.0% 79.3%
94.8% 94.6% 338.0 #95
DPN-92
(320x320, Mean-Max Pooling)
81.3% 81.0%
95.7% 95.5% 244.4 #56
DPN-98
(224x224)
79.6% 80.0% 94.6% 94.8% 317.9 #113
DPN-98
(320x320, Mean-Max Pooling)
81.2% 81.3% 95.7% 95.6% 164.0 #59
EfficientNet-B0
76.9% 76.3%
93.2% 93.2% 363.4 #236
EfficientNet-B0
(AdvProp)
77.1% -- 93.3% -- 366.9 #233
EfficientNet-B0
(AutoAugment)
76.8% -- 93.2% -- 358.6 #231
EfficientNet-B1
78.7% 78.8% 94.2% 94.4% 309.3 #155
EfficientNet-B1
(AdvProp)
79.3% -- 94.3% -- 349.3 #119
EfficientNet-B1
(AutoAugment)
78.8% -- 94.2% -- 339.9 #162
EfficientNet-B2
80.4% 79.8%
95.1% 94.9% 336.5 #76
EfficientNet-B2
(AdvProp)
80.3% -- 95.0% -- 333.1 #78
EfficientNet-B2
(AutoAugment)
80.1% -- 94.9% -- 325.8 #91
EfficientNet-B3
(AdvProp)
81.8% -- 95.6% -- 303.7 #58
EfficientNet-B3
(AutoAugment)
81.6% -- 95.7% -- 300.5 #55
EfficientNet-B4
(AdvProp)
83.2% -- 96.4% -- 240.2 #31
EfficientNet-B4
(AutoAugment)
83.0% -- 96.3% -- 234.5 #39
EfficientNet-B5
(AdvProp)
84.3% -- 97.0% -- 146.8 #20
EfficientNet-B5
(RandAugment)
83.8% -- 96.8% -- 126.4 #29
EfficientNet-B6
(AdvProp)
84.8% -- 97.1% -- 86.1 #18
EfficientNet-B6
(AutoAugment)
84.1% -- 96.9% -- 73.6 #23
EfficientNet-B7
(AdvProp)
85.1% -- 97.3% -- 50.1 #8
EfficientNet-B7
(RandAugment)
84.9% -- 97.2% -- 43.1 #14
EfficientNet-B8
(AdvProp)
85.4% -- 97.3% -- 32.8 #11
EfficientNet-CondConv-B0 4 experts
77.3% -- 93.3% -- 364.5 #229
EfficientNet-CondConv-B0 8 experts
77.9% -- 93.7% -- 360.9 #195
EfficientNet-CondConv-B1 8 experts
79.3% -- 94.4% -- 348.2 #144
EfficientNet-EdgeTPU-L
80.4% -- 95.2% -- 282.0 #73
EfficientNet-EdgeTPU-M
78.7% -- 94.3% -- 336.5 #150
EfficientNet-EdgeTPU-S
77.3% -- 93.6% -- 358.9 #207
Ensemble Adversarial Inception V3
80.0% -- 94.9% -- 289.2 #86
FBNet-C
75.1% 74.9% 92.4% -- 422.1 #266
HRNet-W18-C
76.8% -- 93.4% -- 389.9 #219
HRNet-W18-C-Small-V1
72.3% -- 90.7% -- 409.3 #314
HRNet-W18-C-Small-V2
75.1% -- 92.4% -- 402.9 #266
HRNet-W30-C
78.2% -- 94.2% -- 387.4 #159
HRNet-W32-C
78.4% -- 94.2% -- 378.9 #164
HRNet-W40-C
78.9% -- 94.5% -- 355.8 #125
HRNet-W44-C
78.9% -- 94.4% -- 336.9 #142
HRNet-W48-C
79.3% -- 94.5% -- 315.7 #115
HRNet-W64-C
79.5% -- 94.7% -- 265.4 #107
Inception ResNet V2
80.5% 80.1%
95.3% 95.1% 287.1 #72
Inception V3
77.9% 78.8% 94.4% 94.4% 303.2 #196
Inception V4
80.2% -- 95.0% -- 288.0 #84
MixNet-L
79.0% 78.9% 94.0% 94.2% 356.0 #184
MixNet-M
77.3% 77.0% 93.2% 93.3% 357.4 #239
MixNet-S
76.0% 75.8% 92.8% 92.8% 360.8 #251
MixNet-XL
80.1% -- 95.0% -- 347.4 #80
MnasNet-A1
75.5% 75.2% 92.6% 92.5% 356.2 #257
MnasNet-B1
74.7% -- 92.1% -- 356.9 #276
MobileNet V3-Large 0.75
73.4% -- 91.4% -- 410.5 #300
MobileNet V3-Large 1.0
75.5% 75.2%
92.6% -- 417.6 #256
MobileNet V3-Large Minimal 1.0
72.2% -- 90.6% -- 410.5 #318
MobileNet V3-Small 0.75
65.7% -- 86.1% -- 422.9 #384
MobileNet V3-Small 1.0
67.9% -- 87.7% -- 414.6 #363
MobileNet V3-Small Minimal 1.0
62.9% -- 84.2% -- 414.4 #394
Modified Aligned Xception
79.6% 79.8% 94.7% 94.8% 260.0 #100
NASNet-A Large
82.6% -- 96.0% -- 116.7 #40
PNASNet-5
82.7% 82.9% 96.0% 96.2% 118.1 #36
Res2Net-50 14x8s
78.2% -- 93.8% -- 386.0 #193
Res2Net-50 26x4s
77.9% -- 93.9% -- 394.0 #194
Res2Net-50 26x6s
78.6% -- 94.1% -- 382.9 #161
Res2Net-50 26x8s
79.2% -- 94.4% -- 377.1 #121
Res2Net-50 48x2s
77.5% -- 93.5% -- 386.9 #214
Res2Net-DLA-60
78.5% 79.5% 94.2% -- 391.6 #161
Res2NeXt-101 26x4s
79.2% -- 94.4% -- 377.8 #128
Res2NeXt-50
78.2% -- 93.9% -- 377.0 #181
Res2NeXt-DLA-60
78.4% -- 94.1% -- 377.1 #169
ResNet-101
79.3% -- 94.5% -- 346.2 #117
ResNet-101-C
79.5% -- 94.6% -- 349.5 #114
ResNet-101-D
80.4% -- 95.0% -- 346.8 #74
ResNet-101-S
80.3% -- 95.2% -- 341.9 #75
ResNet-152
79.7% -- 94.7% -- 339.9 #100
ResNet-152-C
79.9% -- 94.8% -- 333.1 #93
ResNet-152-D
80.5% -- 95.2% -- 337.3 #71
ResNet-152-S
81.0% -- 95.4% -- 335.2 #63
ResNet-18
69.8% -- 89.8% -- 415.7 #336
ResNet-18
73.3% -- 91.4% -- 423.6 #304
ResNet-26
75.3% -- 92.6% -- 354.9 #259
ResNet-26-D
76.7% -- 93.2% -- 352.7 #238
ResNet-34
74.6% -- 92.3% -- 372.3 #268
ResNet-50
77.6% -- 93.7% -- 355.0 #204
ResNet-50
79.2% -- 96.0% -- 397.4 #120
ResNet-50
(288x288 Mean-Max Pooling)
80.3% -- 95.6% -- 346.0 #60
ResNet-50-C
78.0% -- 94.0% -- 362.6 #186
ResNet-50-D
79.1% 77.2%
94.5% 93.5%
357.3 #130
ResNet-50-S
78.7% -- 94.2% -- 352.4 #151
ResNeXt-101 32x16d
84.2% -- 97.2% -- 150.9 #18
ResNeXt-101 32x16d
83.3% -- 96.9% -- 150.9 #24
ResNeXt-101 32x16d
(288x288 Mean-Max Pooling)
84.3% -- 97.3% -- 81.6 #10
ResNeXt-101 32x16d
(288x288 Mean-Max Pooling)
85.0% -- 97.6% -- 84.1 #11
ResNeXt-101 32x32d
85.1% 85.1% 97.4% 97.5% 60.5 #9
ResNeXt-101 32x32d
(288x288 Mean-Max Pooling)
85.9% -- 97.8% -- 37.6 #3
ResNeXt-101 32x48d
85.4% 85.4% 97.6% 97.6% 31.6 #4
ResNeXt-101 32x48d
(288x288 Mean-Max Pooling)
86.1% -- 97.9% -- 19.1 #2
ResNeXt-101 32x4d
80.3% -- 94.9% -- 342.0 #87
ResNeXt-101 32x4d
80.9% -- 96.8% -- 372.3 #27
ResNeXt-101 32x4d
(288x288 Mean-Max Pooling)
84.0% -- 97.2% -- 283.3 #23
ResNeXt-101 32x8d
82.7% 82.2%
96.6% 96.4% 286.2 #37
ResNeXt-101 32x8d
81.6% -- 96.0% -- 286.7 #52
ResNeXt-101 32x8d
(288x288 Mean-Max Pooling)
85.1% -- 96.5% -- 175.3 #36
ResNeXt-101 32x8d
(288x288 Mean-Max Pooling)
83.5% -- 97.1% -- 174.6 #19
ResNeXt-101 64x4d
80.6% -- 95.0% -- 277.5 #84
ResNeXt-50 32x4d
82.2% -- 96.2% -- 393.7 #43
ResNeXt-50 32x4d
78.5% -- 94.1% -- 343.3 #164
ResNeXt-50 32x4d
(288x288 Mean-Max Pooling)
83.1% -- 96.8% -- 337.8 #25
ResNeXt-50-D 32x4d
79.7% -- 94.9% -- 348.5 #92
SENet-154
81.2% -- 95.4% -- 214.8 #66
SENet-154
81.3% 82.7% 95.5% 96.2% 215.6 #61
SE-ResNet-101
78.4% -- 94.3% -- 391.5 #153
SE-ResNet-152
78.7% -- 94.4% -- 371.8 #141
SE-ResNet-18
71.8% -- 90.3% -- 354.7 #326
SE-ResNet-34
74.8% -- 92.1% -- 410.7 #277
SE-ResNet-50
77.6% -- 93.8% -- 399.1 #198
SE-ResNeXt-101 32x4d
80.9% -- 95.3% -- 335.2 #69
SE-ResNeXt-101 32x4d
80.2% -- 95.0% -- 375.5 #79
SE-ResNeXt-101 64x4d
80.9% -- 95.3% -- 266.0 #68
SE-ResNeXt-26 32x4d
77.1% -- 93.3% -- 353.9 #219
SE-ResNeXt-50 32x4d
79.9% -- 94.8% -- 347.1 #97
SE-ResNeXt-50 32x4d
79.1% -- 94.4% -- 393.5 #129
Single-Path NAS
74.1% 75.0% 91.8% 92.2% 417.0 #287
Xception
79.0% 79.0% 94.4% 94.5% 295.6 #136
See Full Build Details +get badge code
[![SotaBench](https://img.shields.io/endpoint.svg?url=https://sotabench.com/api/v0/badge/gh/rwightman/pytorch-image-models)](https://sotabench.com/user/rwightman/repos/rwightman/pytorch-image-models)

How the Repository is Evaluated

The full sotabench.py file - source
import torch
from torchbench.image_classification import ImageNet
from timm import create_model
from timm.data import resolve_data_config, create_transform
from timm.models import TestTimePoolHead
import os

NUM_GPU = 1
BATCH_SIZE = 256 * NUM_GPU


def _entry(model_name, paper_model_name, paper_arxiv_id, batch_size=BATCH_SIZE,
           ttp=False, args=dict(), model_desc=None):
    return dict(
        model=model_name,
        model_description=model_desc,
        paper_model_name=paper_model_name,
        paper_arxiv_id=paper_arxiv_id,
        batch_size=batch_size,
        ttp=ttp,
        args=args)

# NOTE For any original PyTorch models, I'll remove from this list when you add to sotabench to
# avoid overlap and confusion. Please contact me.
model_list = [
    ## Weights ported by myself from other frameworks or trained myself in PyTorch
    _entry('adv_inception_v3', 'Adversarial Inception V3', '1611.01236',
           model_desc='Ported from official Tensorflow weights'),
    _entry('ens_adv_inception_resnet_v2', 'Ensemble Adversarial Inception V3', '1705.07204',
           model_desc='Ported from official Tensorflow weights'),
    _entry('dpn68', 'DPN-68 (224x224)', '1707.01629'),
    _entry('dpn68b', 'DPN-68b (224x224)', '1707.01629'),
    _entry('dpn92', 'DPN-92 (224x224)', '1707.01629'),
    _entry('dpn98', 'DPN-98 (224x224)', '1707.01629'),
    _entry('dpn107', 'DPN-107 (224x224)', '1707.01629'),
    _entry('dpn131', 'DPN-131 (224x224)', '1707.01629'),
    _entry('dpn68', 'DPN-68 (320x320, Mean-Max Pooling)', '1707.01629', ttp=True, args=dict(img_size=320)),
    _entry('dpn68b', 'DPN-68b (320x320, Mean-Max Pooling)', '1707.01629', ttp=True, args=dict(img_size=320)),
    _entry('dpn92', 'DPN-92 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//2),
    _entry('dpn98', 'DPN-98 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//2),
    _entry('dpn107', 'DPN-107 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//4),
    _entry('dpn131', 'DPN-131 (320x320, Mean-Max Pooling)', '1707.01629',
           ttp=True, args=dict(img_size=320), batch_size=BATCH_SIZE//4),
    _entry('efficientnet_b0', 'EfficientNet-B0', '1905.11946'),
    _entry('efficientnet_b1', 'EfficientNet-B1', '1905.11946'),
    _entry('efficientnet_b2', 'EfficientNet-B2', '1905.11946'),
    _entry('fbnetc_100', 'FBNet-C', '1812.03443',
           model_desc='Trained in PyTorch with RMSProp, exponential LR decay'),
    _entry('gluon_inception_v3', 'Inception V3', '1512.00567', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet18_v1b', 'ResNet-18', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet34_v1b', 'ResNet-34', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1b', 'ResNet-50', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1c', 'ResNet-50-C', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1d', 'ResNet-50-D', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet50_v1s', 'ResNet-50-S', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1b', 'ResNet-101', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1c', 'ResNet-101-C', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1d', 'ResNet-101-D', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet101_v1s', 'ResNet-101-S', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1b', 'ResNet-152', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1c', 'ResNet-152-C', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1d', 'ResNet-152-D', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnet152_v1s', 'ResNet-152-S', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnext50_32x4d', 'ResNeXt-50 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnext101_32x4d', 'ResNeXt-101 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_resnext101_64x4d', 'ResNeXt-101 64x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_senet154', 'SENet-154', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_seresnext50_32x4d', 'SE-ResNeXt-50 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_seresnext101_32x4d', 'SE-ResNeXt-101 32x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_seresnext101_64x4d', 'SE-ResNeXt-101 64x4d', '1812.01187', model_desc='Ported from GluonCV Model Zoo'),
    _entry('gluon_xception65', 'Modified Aligned Xception', '1802.02611', batch_size=BATCH_SIZE//2,
           model_desc='Ported from GluonCV Model Zoo'),
    _entry('mixnet_xl', 'MixNet-XL', '1907.09595', model_desc="My own scaling beyond paper's MixNet Large"),
    _entry('mixnet_l', 'MixNet-L', '1907.09595'),
    _entry('mixnet_m', 'MixNet-M', '1907.09595'),
    _entry('mixnet_s', 'MixNet-S', '1907.09595'),
    _entry('mnasnet_100', 'MnasNet-B1', '1807.11626'),
    _entry('mobilenetv3_rw', 'MobileNet V3-Large 1.0', '1905.02244',
           model_desc='Trained in PyTorch with RMSProp, exponential LR decay, and hyper-params matching '
                      'paper as closely as possible.'),
    _entry('resnet18', 'ResNet-18', '1812.01187'),
    _entry('resnet26', 'ResNet-26', '1812.01187', model_desc='Block cfg of ResNet-34 w/ Bottleneck'),
    _entry('resnet26d', 'ResNet-26-D', '1812.01187',
           model_desc='Block cfg of ResNet-34 w/ Bottleneck, deep stem, and avg-pool in downsample layers.'),
    _entry('resnet34', 'ResNet-34', '1812.01187'),
    _entry('resnet50', 'ResNet-50', '1812.01187'),
    _entry('resnext50_32x4d', 'ResNeXt-50 32x4d', '1812.01187'),
    _entry('resnext50d_32x4d', 'ResNeXt-50-D 32x4d', '1812.01187',
           model_desc="'D' variant (3x3 deep stem w/ avg-pool downscale). Trained with "
                      "SGD w/ cosine LR decay, random-erasing (gaussian per-pixel noise) and label-smoothing"),
    _entry('semnasnet_100', 'MnasNet-A1', '1807.11626'),
    _entry('seresnet18', 'SE-ResNet-18', '1709.01507'),
    _entry('seresnet34', 'SE-ResNet-34', '1709.01507'),
    _entry('seresnext26_32x4d', 'SE-ResNeXt-26 32x4d', '1709.01507',
           model_desc='Block cfg of SE-ResNeXt-34 w/ Bottleneck, deep stem, and avg-pool in downsample layers.'),
    _entry('spnasnet_100', 'Single-Path NAS', '1904.02877',
           model_desc='Trained in PyTorch with SGD, cosine LR decay'),
    _entry('tf_efficientnet_b0', 'EfficientNet-B0 (AutoAugment)', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b1', 'EfficientNet-B1 (AutoAugment)', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b2', 'EfficientNet-B2 (AutoAugment)', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b3', 'EfficientNet-B3 (AutoAugment)', '1905.11946', batch_size=BATCH_SIZE//2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b4', 'EfficientNet-B4 (AutoAugment)', '1905.11946', batch_size=BATCH_SIZE//2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b5', 'EfficientNet-B5 (RandAugment)', '1905.11946', batch_size=BATCH_SIZE//4,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b6', 'EfficientNet-B6 (AutoAugment)', '1905.11946', batch_size=BATCH_SIZE//8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b7', 'EfficientNet-B7 (RandAugment)', '1905.11946', batch_size=BATCH_SIZE//8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b0_ap', 'EfficientNet-B0 (AdvProp)', '1911.09665',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b1_ap', 'EfficientNet-B1 (AdvProp)', '1911.09665',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b2_ap', 'EfficientNet-B2 (AdvProp)', '1911.09665',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b3_ap', 'EfficientNet-B3 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b4_ap', 'EfficientNet-B4 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b5_ap', 'EfficientNet-B5 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 4,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b6_ap', 'EfficientNet-B6 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b7_ap', 'EfficientNet-B7 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_b8_ap', 'EfficientNet-B8 (AdvProp)', '1911.09665', batch_size=BATCH_SIZE // 8,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_cc_b0_4e', 'EfficientNet-CondConv-B0 4 experts', '1904.04971',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_cc_b0_8e', 'EfficientNet-CondConv-B0 8 experts', '1904.04971',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_cc_b1_8e', 'EfficientNet-CondConv-B1 8 experts', '1904.04971',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_es', 'EfficientNet-EdgeTPU-S', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_em', 'EfficientNet-EdgeTPU-M', '1905.11946',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_efficientnet_el', 'EfficientNet-EdgeTPU-L', '1905.11946', batch_size=BATCH_SIZE//2,
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_inception_v3', 'Inception V3', '1512.00567', model_desc='Ported from official Tensorflow weights'),
    _entry('tf_mixnet_l', 'MixNet-L', '1907.09595', model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mixnet_m', 'MixNet-M', '1907.09595', model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mixnet_s', 'MixNet-S', '1907.09595', model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_large_100', 'MobileNet V3-Large 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_large_075', 'MobileNet V3-Large 0.75', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_large_minimal_100', 'MobileNet V3-Large Minimal 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_small_100', 'MobileNet V3-Small 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_small_075', 'MobileNet V3-Small 0.75', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),
    _entry('tf_mobilenetv3_small_minimal_100', 'MobileNet V3-Small Minimal 1.0', '1905.02244',
           model_desc='Ported from official Google AI Tensorflow weights'),

    ## Cadene ported weights (to remove if Cadene adds sotabench)
    _entry('inception_resnet_v2', 'Inception ResNet V2', '1602.07261'),
    _entry('inception_v4', 'Inception V4', '1602.07261'),
    _entry('nasnetalarge', 'NASNet-A Large', '1707.07012', batch_size=BATCH_SIZE // 4),
    _entry('pnasnet5large', 'PNASNet-5', '1712.00559', batch_size=BATCH_SIZE // 4),
    _entry('seresnet50', 'SE-ResNet-50', '1709.01507'),
    _entry('seresnet101', 'SE-ResNet-101', '1709.01507'),
    _entry('seresnet152', 'SE-ResNet-152', '1709.01507'),
    _entry('seresnext50_32x4d', 'SE-ResNeXt-50 32x4d', '1709.01507'),
    _entry('seresnext101_32x4d', 'SE-ResNeXt-101 32x4d', '1709.01507'),
    _entry('senet154', 'SENet-154', '1709.01507'),
    _entry('xception', 'Xception', '1610.02357',  batch_size=BATCH_SIZE//2),

    ## Torchvision weights
    # _entry('densenet121'),
    # _entry('densenet161'),
    # _entry('densenet169'),
    # _entry('densenet201'),
    # _entry('inception_v3', paper_model_name='Inception V3', ),
    # _entry('tv_resnet34', , ),
    # _entry('tv_resnet50', , ),
    # _entry('resnet101', , ),
    # _entry('resnet152', , ),
    # _entry('tv_resnext50_32x4d', , ),
    # _entry('resnext101_32x8d', ),
    # _entry('wide_resnet50_2' , ),
    # _entry('wide_resnet101_2', , ),

    ## Facebook WSL weights
    _entry('ig_resnext101_32x8d', 'ResNeXt-101 32x8d', '1805.00932',
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x16d', 'ResNeXt-101 32x16d', '1805.00932',
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x32d', 'ResNeXt-101 32x32d', '1805.00932', batch_size=BATCH_SIZE // 2,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x48d', 'ResNeXt-101 32x48d', '1805.00932', batch_size=BATCH_SIZE // 4,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),

    _entry('ig_resnext101_32x8d', 'ResNeXt-101 32x8d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288),
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x16d', 'ResNeXt-101 32x16d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 2,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x32d', 'ResNeXt-101 32x32d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 4,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),
    _entry('ig_resnext101_32x48d', 'ResNeXt-101 32x48d (288x288 Mean-Max Pooling)', '1805.00932',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 8,
           model_desc='Weakly-Supervised pre-training on 1B Instagram hashtag dataset by Facebook Research'),

    ## Facebook SSL weights
    _entry('ssl_resnet18', 'ResNet-18', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnet50', 'ResNet-50', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext50_32x4d', 'ResNeXt-50 32x4d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x4d', 'ResNeXt-101 32x4d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x8d', 'ResNeXt-101 32x8d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x16d', 'ResNeXt-101 32x16d', '1905.00546',
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),

    _entry('ssl_resnet50', 'ResNet-50 (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext50_32x4d', 'ResNeXt-50 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x4d', 'ResNeXt-101 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x8d', 'ResNeXt-101 32x8d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),
    _entry('ssl_resnext101_32x16d', 'ResNeXt-101 32x16d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 2,
           model_desc='Semi-Supervised pre-training on YFCC100M dataset by Facebook Research'),

    ## Facebook SWSL weights
    _entry('swsl_resnet18', 'ResNet-18', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnet50', 'ResNet-50', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext50_32x4d', 'ResNeXt-50 32x4d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x4d', 'ResNeXt-101 32x4d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x8d', 'ResNeXt-101 32x8d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x16d', 'ResNeXt-101 32x16d', '1905.00546',
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),

    _entry('swsl_resnet50', 'ResNet-50 (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext50_32x4d', 'ResNeXt-50 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x4d', 'ResNeXt-101 32x4d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x8d', 'ResNeXt-101 32x8d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288),
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),
    _entry('swsl_resnext101_32x16d', 'ResNeXt-101 32x16d (288x288 Mean-Max Pooling)', '1905.00546',
           ttp=True, args=dict(img_size=288), batch_size=BATCH_SIZE // 2,
           model_desc='Semi-Weakly-Supervised pre-training on 1 billion unlabelled dataset by Facebook Research'),

    ## DLA official impl weights (to remove if sotabench added to source)
    _entry('dla34', 'DLA-34', '1707.06484'),
    _entry('dla46_c', 'DLA-46-C', '1707.06484'),
    _entry('dla46x_c', 'DLA-X-46-C', '1707.06484'),
    _entry('dla60x_c', 'DLA-X-60-C', '1707.06484'),
    _entry('dla60', 'DLA-60', '1707.06484'),
    _entry('dla60x', 'DLA-X-60', '1707.06484'),
    _entry('dla102', 'DLA-102', '1707.06484'),
    _entry('dla102x', 'DLA-X-102', '1707.06484'),
    _entry('dla102x2', 'DLA-X-102 64', '1707.06484'),
    _entry('dla169', 'DLA-169', '1707.06484'),

    ## Res2Net official impl weights (to remove if sotabench added to source)
    _entry('res2net50_26w_4s', 'Res2Net-50 26x4s', '1904.01169'),
    _entry('res2net50_14w_8s', 'Res2Net-50 14x8s', '1904.01169'),
    _entry('res2net50_26w_6s', 'Res2Net-50 26x6s', '1904.01169'),
    _entry('res2net50_26w_8s', 'Res2Net-50 26x8s', '1904.01169'),
    _entry('res2net50_48w_2s', 'Res2Net-50 48x2s', '1904.01169'),
    _entry('res2net101_26w_4s', 'Res2NeXt-101 26x4s', '1904.01169'),
    _entry('res2next50', 'Res2NeXt-50', '1904.01169'),
    _entry('dla60_res2net', 'Res2Net-DLA-60', '1904.01169'),
    _entry('dla60_res2next', 'Res2NeXt-DLA-60', '1904.01169'),

    ## HRNet official impl weights
    _entry('hrnet_w18_small', 'HRNet-W18-C-Small-V1', '1908.07919'),
    _entry('hrnet_w18_small_v2', 'HRNet-W18-C-Small-V2', '1908.07919'),
    _entry('hrnet_w18', 'HRNet-W18-C', '1908.07919'),
    _entry('hrnet_w30', 'HRNet-W30-C', '1908.07919'),
    _entry('hrnet_w32', 'HRNet-W32-C', '1908.07919'),
    _entry('hrnet_w40', 'HRNet-W40-C', '1908.07919'),
    _entry('hrnet_w44', 'HRNet-W44-C', '1908.07919'),
    _entry('hrnet_w48', 'HRNet-W48-C', '1908.07919'),
    _entry('hrnet_w64', 'HRNet-W64-C', '1908.07919'),
]

for m in model_list:
    model_name = m['model']
    # create model from name
    model = create_model(model_name, pretrained=True)
    param_count = sum([m.numel() for m in model.parameters()])
    print('Model %s, %s created. Param count: %d' % (model_name, m['paper_model_name'], param_count))

    # get appropriate transform for model's default pretrained config
    data_config = resolve_data_config(m['args'], model=model, verbose=True)
    if m['ttp']:
        model = TestTimePoolHead(model, model.default_cfg['pool_size'])
        data_config['crop_pct'] = 1.0
    input_transform = create_transform(**data_config)

    # Run the benchmark
    ImageNet.benchmark(
        model=model,
        model_description=m.get('model_description', None),
        paper_model_name=m['paper_model_name'],
        paper_arxiv_id=m['paper_arxiv_id'],
        input_transform=input_transform,
        batch_size=m['batch_size'],
        num_gpu=NUM_GPU,
        data_root=os.environ.get('IMAGENET_DIR', './imagenet')
    )

    torch.cuda.empty_cache()


STATUS
BUILD
COMMIT MESSAGE
RUN TIME
Update README.md Update latest training hparam/command line wit…
rwightman   5d7af97  ·  3 days ago
0h:33m:56s
Update README.md
rwightman   3129bdb  ·  4 days ago
0h:40m:02s
New PyTorch trained EfficientNet-B2 weights with my RandAugment …
rwightman   ff421e5  ·  4 days ago
0h:39m:42s
Update results-all.csv with latest models/weights
rwightman   00b9340  ·  Nov 29 2019
0h:34m:00s
Update README.md with latest changes
rwightman   5259dbc  ·  Nov 29 2019
0h:35m:42s
Finish with HRNet, weights and models updated. Improve consisten…
rwightman   3bef524  ·  Nov 29 2019
0h:55m:33s
Merge pull request #53 from rwightman/condconvs_and_features Ma…
rwightman   3ceeedc  (+12 commits )  ·  Nov 28 2019
2h:01m:52s
Merge pull request #52 from rwightman/randaugment RandAugment a…
rwightman   db04677  (+5 commits )  ·  Nov 22 2019
0h:31m:20s
Fix non-prefetch variant of Mixup. Fixes #50
rwightman   4748c6d  ·  Nov 02 2019
0h:32m:07s
Add TF RandAug weights for B5/B7 EfficientNet models.
rwightman   0d58c50  ·  Oct 30 2019
0h:57m:20s
Better differentiate sotabench WSL, SSL, and SWSL models via mod…
rwightman   62105ed  ·  Oct 20 2019
0h:31m:09s
Map pretrained checkpoint to cpu to avoid issue with some pretra…
rwightman   c099374  ·  Oct 19 2019
0h:29m:28s
Add Facebook Research Semi-Supervised and Semi-Weakly Supervised…
rwightman a9eb484    b93fcf0  ·  Oct 19 2019
7h:41m:03s
6h:58m:34s
Add support for loading args from yaml file (and saving them wit…
rwightman   187ecba  ·  Sep 09 2019
0h:27m:02s
Fix Mobilenet V3 model name for sotabench. Minor res2net cleanup.
rwightman   d3ba34e  ·  Sep 05 2019
2h:26m:03s
Silly typos.
rwightman   b5a8bb5  ·  Sep 04 2019
3h:23m:37s
sotabench fail
rwightman   7dc5d7a  ·  Sep 04 2019
1h:11m:43s
Merge pull request #35 from rwightman/res2net_dla Add Res2net a…
rwightman   96364fc  (+2 commits )  ·  Sep 04 2019
0h:58m:37s
1h:36m:10s
0h:58m:34s