-
### System Info
```Shell
colab
t4
https://huggingface.co/docs/accelerate/concept_guides/
https://huggingface.co/docs/accelerate/concept_guides/big_model_inference
If I have a single 16 G…
-
### Prerequisite
- [X] I have searched [Issues](https://github.com/open-mmlab/mmrotate/issues) and [Discussions](https://github.com/open-mmlab/mmrotate/discussions) but cannot get the expected help.
…
-
-
I'm finding that training a 1-expert dMoE (brown) has worse training loss than an otherwise equivalent dense model (green). Is there some reason why this difference is expected or can I expect them to…
-
The inference code provided does not work
```
import sys
sys.path.append('./')
from videollama2 import model_init, mm_infer
from videollama2.utils import disable_torch_init
def inference():…
-
Hello!
I´m trying to reproduce the results of your paper as a baseline for my thesis. However, I´m not able to reach the same results for pretraining on UCF101 as indicated in tables 1 & 3 (81.2% t…
-
Hello, I find that theres no lm_head weight in model checkpoints(.safetensors).
How does model load weight for the Linear Layer of lm_head ?
-
Loading checkpoint shards: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:42
-
I think it would be very useful to add more discriminators, from the tests I have done with conditional GANs, it seems that having several discriminators with different levels of reception fields incr…
-
Trying to deploy and run demo on a 4 A6000 cluster but it seemed that the runtime froze without any exceptions... Could there be any possible problems? Sorry for asking a naive question and thanks for…