-
NeRF-Insert: 3D Local Editing with Multimodal Control Signals
https://arxiv.org/abs/2404.19204
-
### Describe the bug
I installed text generation webui and downloaded the model(TheBloke_Yarn-Mistral-7B-128k-AWQ) and I can't run it. I chose Transofmer as Model loader. I tried installing autoawq b…
-
When using a DEM (such as [any of the DEMs in gz-sim](https://github.com/gazebosim/gz-sim/blob/gz-sim8/examples/worlds)) and the recommended collision detector (bullet) in conjunction with models that…
-
Hi, thanks for your great work, I m trying with the open-sora-plan code following the Readme doc, but after installing everything, when i run the script opensora_fifo_65.sh, it shows me unable to load…
-
How about the quality when using more than 100 frames for training?
-
Thank you for your excellent work! How is the model for video-level forgery detection tested? Do you test all the frames of the test video? I see a lot of papers that do video forgery detection where …
-
# Bug description
When I try to run the Kalman filter (Predict > run inference> flow, enable filter after 10 frames, connect single track breaks), I get an error in my terminal saying that the Kalm…
-
**Is your feature request related to a problem? Please describe.**
I have been actively using this repository for multimodal training involving images and text. It has been incredibly helpful for my …
-
Downloaded windows-20240811 package and ran app.bat, as soon as I click animate (with is_animal checked) I get the following error:
```
process source:C:\Users\Real Marshal\AppData\Local\Temp\grad…
-
### Question
Dear LLaVA Developer Team,
I must say the LMM is truly brilliant! 😊 I have a question: is LLaVA capable of performing video-QA? In other words, can the model accept a video or a set o…