Closed mvoodarla closed 5 months ago
Hey folks! We just built a cost-effective, lightweight way to generate audiovisual summaries for videos.
The goal here is not to build a single E2E model but something that could actually be used in production while preserving relatively high quality.
You can try it out yourself here: https://www.sievedata.com/functions/sieve/describe
How we built it: https://www.sievedata.com/blog/describe-video-summary-beta-launch
The code: https://github.com/sieve-community/describe
https://github.com/InternLM/InternLM-XComposer/assets/11367688/286b0045-e2fe-43b5-90ae-9a9e35b840d6
Thanks for sharing this project! Thanks for your contribution to the video captioning community!
Hey folks! We just built a cost-effective, lightweight way to generate audiovisual summaries for videos.
The goal here is not to build a single E2E model but something that could actually be used in production while preserving relatively high quality.
You can try it out yourself here: https://www.sievedata.com/functions/sieve/describe
How we built it: https://www.sievedata.com/blog/describe-video-summary-beta-launch
The code: https://github.com/sieve-community/describe
https://github.com/InternLM/InternLM-XComposer/assets/11367688/286b0045-e2fe-43b5-90ae-9a9e35b840d6