---
license: mit
---

### (NeurIPS 2023) Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models

### Official Model Repo

#### Models Included:
- Stage1-CAVP Pretrained Model.
- Stage2-LDM Pretrained Model.
- Double Guidance Classifier.

## BibTeX

```bibtex
@misc{luo2023difffoley,
  title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models},
  author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao},
  year={2023},
  eprint={2306.17203},
  archivePrefix={arXiv},
  primaryClass={cs.SD}
}
```