-
Hello, thank you for your work. I would like to ask why you think the task of synchronized subtitles is important. How can it help in action generation and action understanding?
-
Currently we do a character/word based chunking that is very simple. We should enhance our chunking strategies to possibly include:
* Recursive Character Chunking
* Token Based Chunking
* Documen…
-
I want to train on Chinese speeches. But I don't know how to convert the speech videos to the type that used for training. Could u public the processing codes for raw video?
Another question confuse…
-
Hi!
This is a great work with amazing results, good job!
I was hoping you could provide some guidance on the following issue. I'm trying to condition the video generation by providing the first …
-
Generating videos takes time, minutes to tens of minutes, even with parallelism and async patterns. I would love to see even a sketch y low fidelity version of my video, without music, without sound, …
-
```
llm_cfg = {
# Use the model service provided by DashScope:
'model': 'qwen-vl-max-0809',
#'api_key': 'YOUR_DASHSCOPE_API_KEY',
# It will use the `DASHSCOPE_API_KEY' environment…
-
Thanks for your greak work.
I tried to use an AnimateDiff model, but i failed to generate meaningful contents. Have you ever tried to apply your method to other models? (The model i use do not have …
-
Azure Open AI is services from Auzure platform for Generative AI
Here we can perform search
I has APIs using REST, we can
Dense Captions. : For every Item detected in the image, it can genera…
-
Hello, can I ask Stability.ai team to share samples from each bucket of dataset that was used for training SVD?
I guess you clastered videos by motion and sort them by magnitude. So each motion_bu…
-
Chromium have a new VAAPI decoder for OpenGL renderer, it supports Wayland and can run great with new intel generations using media driver and onevpl driver, but for the i965 is the contrary have some…