Update README.md
README.md
CHANGED
@@ -1,3 +1,30 @@
----
-license: apache-2.0
-
+---
+license: apache-2.0
+base_model:
+- google/siglip-so400m-patch14-384
+pipeline_tag: image-classification
+---
+# Oryx-ViT
+
+## Model Summary
+
+The Oryx-ViT model is trained on 200M data and can seamlessly and efficiently process visual inputs with arbitrary spatial sizes and temporal lengths.
+
+- **Repository:** https://github.com/liuzuyan/oryx
+- **Languages:** English, Chinese
+- **Paper:** Coming Soon
+
+
+### Model Architecture
+
+- **Architecture:** SigLIP
+- **Data:** a mixture of 200M data, 2 epochs
+- **Precision:** BFloat16
+
+#### Hardware & Software
+
+- **Hardware:** 64 * NVIDIA Tesla A100
+- **Orchestration:** HuggingFace Trainer
+- **Code:** PyTorch
+
+## Citation