license: apache-2.0
datasets:
- ILSVRC/imagenet-1k
language:
- en
pipeline_tag: image-classification
tags:
- Robust SSL
- DINO
- SimCLR
- Perspective Distortion
- MPD
- ImageNet-PD
- Self-supervised Learning
Self-Supervised Pretrained Models with MPD Integration
Publication- Möbius Transform for Mitigating Perspective Distortions in Representation Learning, European Conference on Computer Vision (ECCV 2024)
Model Description
This release includes two self-supervised pretrained models integrated with the Mitigating Perspective Distortion (MPD) method. The models are:
- ResNet50 pretrained using SimCLR- https://huggingface.co/prakashchhipa/MPD_SSL/blob/main/SimCLR_resnet50_with_MPD.pth.tar
- ViT-small pretrained using DINO- https://huggingface.co/prakashchhipa/MPD_SSL/blob/main/DINO_vit-small_with_MPD.pth
Both models were trained with a batch size of 512 over 100 epochs. The MPD method enhances the robustness of these models by simulating real-world perspective distortions, making them more robust in various computer vision tasks.
Training Details
- Algorithms- SimCLR for ResNet50, DINO for ViT-small
- Batch Size- 512
- Epochs- 100
- Method- Mitigating Perspective Distortion (MPD)
Performance
The integration of MPD in both SimCLR and DINO frameworks significantly improves the models' performance on tasks affected by perspective distortion. The models can be used directly for downstream tasks or further fine-tuned for specific applications. Refer results in MPD paper.
Source Code
Two minutes summary on MPD and links to access source code repository and ImageNet-PD bacnhmark are available at https://prakashchhipa.github.io/projects/mpd/
Chhipa, P. C., Chippa, M. S., De, K., Saini, R., Liwicki, M., & Shah, M. (2024). M" obius Transform for Mitigating Perspective Distortions in Representation Learning. arXiv preprint arXiv:2405.02296.