ResNet-50 ImageNet-1k Classifier
Upload an image to classify it into one of 1000 ImageNet categories.
This model is a ResNet-50 trained on the ImageNet-1k dataset with modern optimization techniques:
- Architecture: ResNet-50 with Bottleneck blocks [3, 4, 6, 3]
- Parameters: ~25.6M trainable parameters
- Training Optimizations:
- Progressive resizing (128→160→192→224px)
- CutMix and MixUp augmentation
- Label smoothing (0.1)
- Exponential Moving Average (EMA)
- Automatic Mixed Precision (AMP)
- PyTorch 2.0 compilation
- FFCV high-performance data loading
- Target Accuracy: 78%+ (Top-1), 94%+ (Top-5)
- Training Time: ~90 minutes on 8x A100 GPUs
Class labels are from the official HuggingFace ImageNet-1k dataset.
The model works best with natural images containing objects, animals, or scenes from the ImageNet categories.
Training code: github.com/arghyaiitb/assignment_9
Examples