-
### Question
## motivation:
I try to use chinese-clip replace clip.
## environment
```bash
$ uname -a
Linux localhost.localdomain 3.10.0-1160.80.1.el7.x86_64 #1 SMP Tue Nov 8 15:48:59 UTC…
-
## Problem statement
1. performance bottleneck in knowledge-based VQA due to two-phase architecture consists of knowledge retrieval from external soruces and training question answering task in super…
-
Hi, I am trying to finetune LLaVA-NeXT with my custom dataset, using "finetune_clip.sh" shell file.
I gave some edits to the shell for my convenience and to satisfy my task so far, like this:
```
…
-
Hey all!
The video models are all supported in Transformers now and will be part of the v4.42 release. Feel free to check out the model checkpoints [here](https://huggingface.co/collections/llava-h…
-
UserWarning: Failed to initialize NumPy: _ARRAY_API not found (Triggered internally at ../torch/csrc/utils/tensor_numpy.cpp:84.)
device: torch.device = torch.device("cpu"),
Models: ['llavamed']
-
Post a reading of your own that uses deep learning for social science analysis and understanding, with a focus on Solving Problems & Creating Digital Doubles - in this case, we want you to look for ex…
-
Iterate on the SFT-8 dataset mixes to create pretraining and final SFT mixes for SFT-9. This requires investigating the quality and usefulness of the datasets. Community input welcome below. See the `…
-
Hi authors,
Great work! Can you please share more details about the pointcloud alignment mentioned in your paper below:
> To ensure cohesion across various sources, we conduct preprocessing ste…
-
In this issue you can either:
- Add papers that you think are interesting to read and discuss (please stick to the format).
- vote: should be done using :+1: on comments
Example: https://githu…
-
In this issue you can either:
- **Add papers** that you think are interesting to read and discuss (please stick to the format).
- **vote**: should be done using :+1: on comments