Open momobobe opened 1 year ago
@momobobe Could you please clarify more ? I didn't quite understand!
Sure. The use case is, I have a video (or audio) of, for example, a lecture. First I want to generate the caption/subtitle, okay, done easily now. Next I want to focus on certain interesting parts of the video, and I just want to watch those clips based on the subtitle I have now. Then I want to extract some of those clips and maybe assemble them again to get a refined lecture video, so I can store and review it when necessary (like for exam preparation). This is a typical use case for college students, which involves basic video editing. If it's still not clear, you can simply try the app autocut-client and basically it just does this job (but unfortunately its upstream project hasn't been developed for months).
Yeah I understand now, so basically you mean basic video editing features like: cut, merge etc. I think it can be added, but it won't be as powerful as existing video editing softwares, because of the limitations of the UI library. I will add this feature to the TO-DO list. Thanks @momobobe.
https://github.com/zcf0508/autocut-client/ (package for https://github.com/mli/autocut) does a job to extract and assemble video clips based on subtitle generated by whisper, which is very useful. However, unfortunately that project looks lacking sustainable development and emerging modules like fast-whisper are not in their vision. So I wonder if such a feature can be implemented here to help more people with this use case.