File size: 604 Bytes
8771c62 85567f4 825286e 85567f4 26e6d18 ce2ccd2 f6d725d 85567f4 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 |
---
license: mit
---
### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models
### Official Model Repo
#### Model Include:
- Stage1-CAVP Pretrained Model.
- Stage2-LDM Pretrained Model.
- Double Guidance Classifier.
<p align="center">
<img src="teaser.png">
</p>
## BibTeX
```bibtex
@misc{luo2023difffoley,
title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models},
author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao},
year={2023},
eprint={2306.17203},
archivePrefix={arXiv},
primaryClass={cs.SD}
}
```
|