Offical implementation of the NeurIPS 2023 paper: Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models.
Project Page: https://diff-foley.github.io
diff_foley_inference.ipynb
in inference
folder.diff_foley_ckpt
from Hugging Face 🤗 here and place it under inference
folder.diff_foley_inference.ipynb
.
@misc{luo2023difffoley,
title={Diff-Foley: Synchronized Video-to-Audio Synthesis with Latent Diffusion Models},
author={Simian Luo and Chuanhao Yan and Chenxu Hu and Hang Zhao},
year={2023},
eprint={2306.17203},
archivePrefix={arXiv},
primaryClass={cs.SD}
}