File size: 692 Bytes
e5d167a e19024b e5d167a |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 |
---
license: bsd-3-clause
datasets:
- ILSVRC/imagenet-1k
pipeline_tag: image-classification
---
# Model Card
<!-- Provide a quick summary of what the model is/does. -->
ImageNet-1k Swin-Transformer pre-trained model with Rotary Position Embedding
## Rotary Position Embedding for Vision Transformer [ECCV 2024]
- **Repository:** https://github.com/naver-ai/rope-vit
- **Paper:** https://arxiv.org/abs/2403.13298
## Citation
```
@inproceedings{heo2024ropevit,
title={Rotary Position Embedding for Vision Transformer},
author={Heo, Byeongho and Park, Song and Han, Dongyoon and Yun, Sangdoo},
year={2024},
booktitle={European Conference on Computer Vision (ECCV)},
}
```
|