Upload 4 files
- .gitattributes +1 -0
- README.md +53 -3
- card.gif +3 -0
- gitattributes +36 -0
- vista.safetensors +3 -0
.gitattributes
CHANGED
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
card.gif filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -1,3 +1,53 @@
|
|
1 |
-
---
|
2 |
-
license: apache-2.0
|
3 |
-
|
1 |
+
---
|
2 |
+
license: apache-2.0
|
3 |
+
pipeline_tag: image-to-video
|
4 |
+
tags:
|
5 |
+
- autonomous driving
|
6 |
+
- video generation
|
7 |
+
- world model
|
8 |
+
---
|
9 |
+
|
10 |
+
# Model Card for Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability
|
11 |
+
|
12 |
+
![](card.gif)
|
13 |
+
|
14 |
+
## Brief Introduction
|
15 |
+
|
16 |
+
**Vista** is a generalizable driving world model that is capable of:
|
17 |
+
- **High-Fidelity Future Prediction:** *Predict high-fidelity futures in various scenarios*.
|
18 |
+
- **Coherent Long-Horizon Rollout:** *Extend its predictions to continuous and long horizons*.
|
19 |
+
- **Versatile Action Controllability:** *Execute multi-modal actions (steering angles, speeds, commands, trajectories, goal points)*.
|
20 |
+
- **Generalizable Reward Function:** *Provide rewards for different actions without accessing ground truth actions*.
|
21 |
+
|
22 |
+
## Related Links
|
23 |
+
|
24 |
+
For more technical details and discussions, please refer to:
|
25 |
+
- **Paper:** https://arxiv.org/abs/2405.17398
|
26 |
+
- **Code:** https://github.com/OpenDriveLab/Vista
|
27 |
+
- **Demo:** https://vista-demo.github.io
|
28 |
+
|
29 |
+
## How to Use
|
30 |
+
|
31 |
+
Check out https://github.com/OpenDriveLab/Vista
|
32 |
+
|
33 |
+
## Citation
|
34 |
+
|
35 |
+
```bibtex
|
36 |
+
@article{gao2024vista,
|
37 |
+
title={Vista: A Generalizable Driving World Model with High Fidelity and Versatile Controllability},
|
38 |
+
author={Shenyuan Gao and Jiazhi Yang and Li Chen and Kashyap Chitta and Yihang Qiu and Andreas Geiger and Jun Zhang and Hongyang Li},
|
39 |
+
journal={arXiv preprint arXiv:2405.17398},
|
40 |
+
year={2024}
|
41 |
+
}
|
42 |
+
|
43 |
+
@inproceedings{yang2024genad,
|
44 |
+
title={Generalized Predictive Model for Autonomous Driving},
|
45 |
+
author={Jiazhi Yang and Shenyuan Gao and Yihang Qiu and Li Chen and Tianyu Li and Bo Dai and Kashyap Chitta and Penghao Wu and Jia Zeng and Ping Luo and Jun Zhang and Andreas Geiger and Yu Qiao and Hongyang Li},
|
46 |
+
booktitle={Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
|
47 |
+
year={2024}
|
48 |
+
}
|
49 |
+
```
|
50 |
+
|
51 |
+
## Contact
|
52 |
+
|
53 |
+
If you have any questions or comments, feel free to leave a message to [email protected]
|
card.gif
ADDED
Git LFS Details
|
gitattributes
ADDED
@@ -0,0 +1,36 @@
|
1 |
+
*.7z filter=lfs diff=lfs merge=lfs -text
|
2 |
+
*.arrow filter=lfs diff=lfs merge=lfs -text
|
3 |
+
*.bin filter=lfs diff=lfs merge=lfs -text
|
4 |
+
*.bz2 filter=lfs diff=lfs merge=lfs -text
|
5 |
+
*.ckpt filter=lfs diff=lfs merge=lfs -text
|
6 |
+
*.ftz filter=lfs diff=lfs merge=lfs -text
|
7 |
+
*.gz filter=lfs diff=lfs merge=lfs -text
|
8 |
+
*.h5 filter=lfs diff=lfs merge=lfs -text
|
9 |
+
*.joblib filter=lfs diff=lfs merge=lfs -text
|
10 |
+
*.lfs.* filter=lfs diff=lfs merge=lfs -text
|
11 |
+
*.mlmodel filter=lfs diff=lfs merge=lfs -text
|
12 |
+
*.model filter=lfs diff=lfs merge=lfs -text
|
13 |
+
*.msgpack filter=lfs diff=lfs merge=lfs -text
|
14 |
+
*.npy filter=lfs diff=lfs merge=lfs -text
|
15 |
+
*.npz filter=lfs diff=lfs merge=lfs -text
|
16 |
+
*.onnx filter=lfs diff=lfs merge=lfs -text
|
17 |
+
*.ot filter=lfs diff=lfs merge=lfs -text
|
18 |
+
*.parquet filter=lfs diff=lfs merge=lfs -text
|
19 |
+
*.pb filter=lfs diff=lfs merge=lfs -text
|
20 |
+
*.pickle filter=lfs diff=lfs merge=lfs -text
|
21 |
+
*.pkl filter=lfs diff=lfs merge=lfs -text
|
22 |
+
*.pt filter=lfs diff=lfs merge=lfs -text
|
23 |
+
*.pth filter=lfs diff=lfs merge=lfs -text
|
24 |
+
*.rar filter=lfs diff=lfs merge=lfs -text
|
25 |
+
*.safetensors filter=lfs diff=lfs merge=lfs -text
|
26 |
+
saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
27 |
+
*.tar.* filter=lfs diff=lfs merge=lfs -text
|
28 |
+
*.tar filter=lfs diff=lfs merge=lfs -text
|
29 |
+
*.tflite filter=lfs diff=lfs merge=lfs -text
|
30 |
+
*.tgz filter=lfs diff=lfs merge=lfs -text
|
31 |
+
*.wasm filter=lfs diff=lfs merge=lfs -text
|
32 |
+
*.xz filter=lfs diff=lfs merge=lfs -text
|
33 |
+
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
+
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
+
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
card.gif filter=lfs diff=lfs merge=lfs -text
|
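The rules above route matching files through Git LFS. As a rough sketch of how such rules select files, the Python below parses a few of the listed lines and checks file names from this commit against them. The helper names `lfs_patterns` and `is_lfs_tracked` are hypothetical, and `fnmatchcase` only approximates Git's own pattern matching (it does not handle directory patterns such as `saved_model/**/*` the way Git does).

```python
from fnmatch import fnmatchcase

# A few rules copied from the gitattributes listing above (not the full set).
RULES = """\
*.safetensors filter=lfs diff=lfs merge=lfs -text
*.zip filter=lfs diff=lfs merge=lfs -text
*tfevents* filter=lfs diff=lfs merge=lfs -text
card.gif filter=lfs diff=lfs merge=lfs -text
"""


def lfs_patterns(text):
    """Return the pattern of every rule that sets filter=lfs (hypothetical helper)."""
    patterns = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) >= 2 and "filter=lfs" in parts[1:]:
            patterns.append(parts[0])
    return patterns


def is_lfs_tracked(filename, patterns):
    """Approximate check: match a plain file name against each pattern."""
    return any(fnmatchcase(filename, pattern) for pattern in patterns)


patterns = lfs_patterns(RULES)
for name in ("vista.safetensors", "card.gif", "README.md"):
    print(name, is_lfs_tracked(name, patterns))
```

Under these assumptions, `vista.safetensors` and `card.gif` match an LFS rule while `README.md` does not, which is consistent with only the first two being stored as LFS objects in this commit.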
vista.safetensors
ADDED
@@ -0,0 +1,3 @@
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1b267bb753aa5c5e4d363f6e4c864dd4e343996dedc18b5ab20af6d20be34b98
|
3 |
+
size 10053461572
|
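The three added lines are a Git LFS pointer: the repository stores this small text file, while the actual weights live in LFS storage. A minimal Python sketch of reading such a pointer and checking a downloaded file against it follows. The function names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the local path passed to `matches_pointer` is whatever location the file was downloaded to.

```python
import hashlib

# Pointer contents copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:1b267bb753aa5c5e4d363f6e4c864dd4e343996dedc18b5ab20af6d20be34b98
size 10053461572
"""


def parse_lfs_pointer(text):
    """Split each 'key value' line and decode the oid and size fields."""
    fields = dict(line.split(" ", 1) for line in text.splitlines() if line)
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Hash the file in chunks and compare its size and digest to the pointer."""
    hasher = hashlib.new(pointer["algo"])
    total = 0
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
            total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


pointer = parse_lfs_pointer(POINTER)
print(f"{pointer['size'] / 1e9:.2f} GB, {pointer['algo']} digest {pointer['digest'][:12]}...")
```

Calling `matches_pointer("vista.safetensors", pointer)` on a completed download would return `True` only if both the byte count and the SHA-256 digest agree with the pointer, which is a cheap way to catch a truncated transfer of a file this large (about 10 GB).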