dmarx / video-killed-the-radio-star

Notebook and tools for end-to-end automation of music video production with generative AI
https://colab.research.google.com/github/dmarx/video-killed-the-radio-star/blob/main/Video_Killed_The_Radio_Star_Defusion.ipynb#scrollTo=oPbeyWtesAoh
MIT License
196 stars, 35 forks

handle dummy prompts #100

Open dmarx opened 1 year ago

dmarx commented 1 year ago

Particularly the case where the first scene has no audio.

mattpetters commented 1 year ago

https://github.com/mattpetters/video-killed-the-radio-star/commit/991d0547bb56b4038e2cde019e6ae186a5a52d2b

@dmarx I did a quick-and-dirty fix for this on my fork: just don't write captions if there aren't any words.
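A minimal sketch of the "skip captions when there are no words" idea, not the code from the linked commit. The scene layout (dicts with `start`, `end`, `text`) and the helper names `has_words` and `captions_for` are assumptions made for illustration only:

```python
def has_words(text):
    """Return True if the text contains at least one letter or digit."""
    return any(ch.isalnum() for ch in (text or ""))


def captions_for(scenes):
    """Build (start, end, text) caption tuples, skipping scenes without words.

    `scenes` is a hypothetical list of dicts; the real storyboard
    structure in the repo may differ.
    """
    return [
        (scene["start"], scene["end"], scene["text"].strip())
        for scene in scenes
        if has_words(scene.get("text"))
    ]


scenes = [
    {"start": 0.0, "end": 12.5, "text": ""},  # instrumental intro, no caption
    {"start": 12.5, "end": 18.0, "text": " first sung line "},
]
print(captions_for(scenes))
```

Filtering at caption-writing time leaves the empty scene in the storyboard, so it can still get a dummy or manually written prompt later.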

I did have a question though: in my testing I found the Whisper API results to be a little unstable / unpredictable (sometimes I got 1 segment, sometimes 17, for the same song). Getting something really usable has required going back and cleaning up the results, or just adding segments manually in the storyboard.yaml.

Is that something we can tune or improve?
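For reference, the kind of post-hoc cleanup I mean could look roughly like this. It's only a sketch: it assumes each segment is a dict with `start`, `end`, and `text` keys, and `clean_segments` and the `min_duration` threshold are made-up names, not anything in the repo:

```python
def clean_segments(segments, min_duration=1.0):
    """Drop empty segments and fold very short ones into the previous one.

    Assumes each segment is a dict with "start", "end" (seconds) and "text".
    `min_duration` is an arbitrary illustrative threshold.
    """
    cleaned = []
    for seg in segments:
        text = seg["text"].strip()
        if not text:
            continue  # nothing was transcribed for this span
        duration = seg["end"] - seg["start"]
        if cleaned and duration < min_duration:
            # Too short to stand alone: extend the previous segment.
            prev = cleaned[-1]
            prev["end"] = seg["end"]
            prev["text"] = f'{prev["text"]} {text}'
        else:
            cleaned.append({"start": seg["start"], "end": seg["end"], "text": text})
    return cleaned


raw = [
    {"start": 0.0, "end": 4.0, "text": " first line"},
    {"start": 4.0, "end": 4.3, "text": " oh"},
    {"start": 4.3, "end": 6.0, "text": "  "},
    {"start": 6.0, "end": 9.0, "text": " second line"},
]
for seg in clean_segments(raw):
    print(seg)
```

This doesn't help with the 1-segment case, which would still need manual splitting in the storyboard.yaml.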