Can you release the training config for Mobilenext-0.75? fail to reproduce. Here is my training settings:
1、SGDwM(m = 0.9), cosine lr, init lr = 0.1 and batchsize = 256 on a single V100 GPU, weight_decay = 1e-4, epochs = 240
2、label smooth(eps = 0.1)
3、Common data augmentation settings(RandomResizedCrop+ColorJitter+RandomFlip)
4、lr warmup is also used exactly following the settings you mentioned in the paper, and also cool-down epochs
I tried dozens of times, and the top-1 acc just hang around 69.5%,never break 70%, let alone the 72% reported in the paper.
Actually, I am currently research in light-weight architecture desgin, I don't know if I should report the performance of mobilenext-0.75 according to my own experiment or the statistics reported in the paper.
Many thanks
Can you release the training config for Mobilenext-0.75? fail to reproduce. Here is my training settings:
1、SGDwM(m = 0.9), cosine lr, init lr = 0.1 and batchsize = 256 on a single V100 GPU, weight_decay = 1e-4, epochs = 240
2、label smooth(eps = 0.1)
3、Common data augmentation settings(RandomResizedCrop+ColorJitter+RandomFlip)
4、lr warmup is also used exactly following the settings you mentioned in the paper, and also cool-down epochs
I tried dozens of times, and the top-1 acc just hang around 69.5%,never break 70%, let alone the 72% reported in the paper.
Actually, I am currently research in light-weight architecture desgin, I don't know if I should report the performance of mobilenext-0.75 according to my own experiment or the statistics reported in the paper.
Many thanks