For the training, I think we can just train on 1 video and overfit it to begin with. No need for the 40 GB of videos just yet.
So, given a model trained (overfit) on 1 video with its layers saved: give it 1 frame plus the sounds, and have the model generate video matching the region + speeds.
I'll have a crack at this.
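Rough sketch of the overfit-on-one-clip idea in plain Python, just to pin down the shape of it. Everything here is an assumption for illustration: the synthetic clip from `make_toy_clip`, the tiny `FRAME_DIM` / `AUDIO_DIM` sizes, and the linear next-frame model are stand-ins for the real video, real audio features, and whatever network actually gets used. Saving the layers is left out.

```python
import math

FRAME_DIM, AUDIO_DIM = 4, 2  # toy sizes, purely illustrative


def make_toy_clip(num_frames=12):
    """Synthetic stand-in for one video: a frame vector and an audio vector per step."""
    frames = [[math.sin(0.3 * t + i) for i in range(FRAME_DIM)] for t in range(num_frames)]
    audio = [[math.cos(0.5 * t + j) for j in range(AUDIO_DIM)] for t in range(num_frames)]
    return frames, audio


def predict(weights, frame, sound):
    """Linear next-frame prediction from the current frame, its audio, and a bias term."""
    x = frame + sound + [1.0]
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


def overfit(frames, audio, steps=500, lr=0.1):
    """Full-batch gradient descent on a single clip; returns weights and loss history."""
    in_dim = FRAME_DIM + AUDIO_DIM + 1
    weights = [[0.0] * in_dim for _ in range(FRAME_DIM)]
    pairs = len(frames) - 1  # (frame t, frame t+1) training pairs
    scale = 1.0 / (pairs * FRAME_DIM)
    losses = []
    for _ in range(steps):
        grads = [[0.0] * in_dim for _ in range(FRAME_DIM)]
        loss = 0.0
        for t in range(pairs):
            x = frames[t] + audio[t] + [1.0]
            pred = predict(weights, frames[t], audio[t])
            for i in range(FRAME_DIM):
                err = pred[i] - frames[t + 1][i]
                loss += err * err
                for k in range(in_dim):
                    grads[i][k] += 2.0 * err * x[k]
        losses.append(loss * scale)  # mean squared error before this update
        for i in range(FRAME_DIM):
            for k in range(in_dim):
                weights[i][k] -= lr * scale * grads[i][k]
    return weights, losses


def generate(weights, first_frame, audio):
    """Roll the model forward from one frame, conditioned on the audio track."""
    video = [list(first_frame)]
    for t in range(len(audio) - 1):
        video.append(predict(weights, video[-1], audio[t]))
    return video


frames, audio = make_toy_clip()
weights, losses = overfit(frames, audio)
video = generate(weights, frames[0], audio)  # 1 frame + sounds in, full clip out
```

If the loss in `losses` keeps dropping on the one clip, the pipeline is wired up correctly and it makes sense to swap in a real model and real data.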