-
https://github.com/ParadoxZW/LLaVA-UHD-Better/blob/main/llava_uhd/adapt_llava.py#L136-L138
这里由于The first token is for CLS,是不是需要把
```python
m[:w * h] = True
```
改成
```python
m[:w * h+1] = …
-
Hi there.
I well read uploaded source code. I found you set k-shot masked sample with k-'image' not 'instance'.
It might be okay that there is binary mask for input of PRN.
But the same k-'image' i…
-
Is it possible to use the braingen model for transfer learning for a segmentation problem, similar to how brainy was used for transfer learning in the AMS paper?
-
### Question
Hello,
I have trained a LlavaMistralForCausalLM model based on openchat (**not moe version**), but when I use predict.py
I get the following error:
```
File ~/scripts/MoE-LLaVA/…
-
Hey @zucchini-nlp and @NielsRogge👋,
I created a [notebook](https://colab.research.google.com/drive/1DEne3yuCmHKMgvtV3sMxJZQRRkDiLXYB?usp=sharing) for fine-tuning [Llava-OneVision-0.5b-ov-hf](https:…
-
We need to convert keras.io examples to work with Keras 3.
This involves two stages:
## Stage 1: tf.keras backwards compatibility check
Keras 3 is intended as a drop-in replacement for tf.ker…
-
Hi!
Let's bring the documentation to all the Korean-speaking community 🌏 (currently 9 out of 77 complete)
Would you want to translate? Please follow the 🤗 [TRANSLATING guide](https://github.com…
-
Thanks for the excellent work.
I tried to use a non-squared input image size since my data contains people only.
While from `patchify()` it seems like the input is limited to squared ones?
-
Solution over on vikhyat/moondream in the closed issues section: https://github.com/vikhyat/moondream/issues/50
Original issue was an outdated transformers package. Once updated to the most current…
-
![image](https://user-images.githubusercontent.com/52986798/234012132-d7f660c1-ba5e-4c23-b4af-6187fd80dab1.png)
# Abstract
We mask random patches of the input image and reconstruct the missing pix…