-
Thanks for your awesome work!
In your paper, you mentioned that the data obtained was 12Hz, but during training, the input data was also 12Hz or 4Hz, because in the later implementation details, it …
-
### Your current environment
from PIL import Image
from transformers import AutoProcessor
from vllm import LLM, SamplingParams
from qwen_vl_utils import process_vision_info
MODEL_PATH = '/w…
-
I would like to create a BEP to store the audio and/or video recordings of behaving subjects.
While this would obviously be problematic for sharing human data, it would be useful to internal human …
-
Instantaneous groups scans is a common form of data collection for wild life biologists. Often performed during focal follows, groups scans allow for data to be collected on the position and activity …
-
From my understanding, right now calling `ns-process-data video` just randomly samples `--num-frames-target` images from the video. This is suboptimal because of two reasons:
1. The random strate…
-
### Reviewed guidelines
- [X] I have read and understand the suggestion guidelines
### Checked for duplicate suggestions
- [X] I checked for existing similar suggestions
### Summary
It would be g…
-
Using the inbuilt save/load latent with vae tiling enabled results in the error:
`Could not run 'aten::slow_conv3d_forward' with arguments from the 'CUDA' backend. This could be because the operato…
-
1.PARE: Part Attention Regressor for 3D Human Body Estimation(2021)
img-->volumetric features(before the global average pooling)-->part branch: estimates attention weights +feature branch: performs S…
-
Hi, I'm having issues sampling from the latest checkpoint [29x480p](https://huggingface.co/LanguageBind/Open-Sora-Plan-v1.2.0/tree/main/29x480p). The videos are overly saturated and with no texture.
…
-