-
How to reproduce the behaviour
---------
I am experimenting with sequence labelling using Doccano and found that when I upload a dataset and start the annotation, there is no way to access the origi…
-
### Environment
* **Tesseract Version**:
tesseract 4.0.0
leptonica-1.77.0
libjpeg 9c : libpng 1.6.36 : libtiff 4.0.10 : zlib 1.2.11
Found AVX2
Found AVX
Found SSE
* **Platform**:
…
-
Hello,
I would like to help. I've already cloned all repository. How do I start?
-
First thanks for your great job!
Now We're trying to replace the vision encoder in llava, i.e., clip-l-336, with RADIO. Under the default LLaVA 1.5 settings, we pretrain a multimodal projection MLP a…
-
### 🐛 Describe the bug
When attempting to export the UDOP model to ONNX from the transformers library, the torch.onnx.export() command fails with a RuntimeError. Below is a minimal example to repro…
-
> [!TIP]
> ## Want to get involved?
> We'd love it if you did! Please get in contact with the people assigned to this issue, or leave a comment. See general contributing advice [here](https://micros…
-
I've been attempting to install openrecall on my M2 MacBook Air for a couple of weeks, and keep running into issues. I read both https://github.com/openrecall/openrecall?tab=readme-ov-file#get-started…
-
### Question
Hi, i create a llava model with only 12 layers (instead of 32 layers). However, the evaluation on TextVQA two times slower than the larger llava 7 billion parameters
Code to create …
-
I did some tests when using both detection+recognition with a set of 30 images and I've seen that there is no speed improvements when using batches.
So I checked the code and if I got it right in yo…
-
for a fine tuned model
https://github.com/PaddlePaddle/PaddleOCR/blob/release/2.7/doc/doc_ch/finetune.md#22-%E6%A8%A1%E5%9E%8B%E9%80%89%E6%8B%A9
> 注意:在使用上述预训练模型的时候,需要使用文件夹中的student.pdparams文件作为预训练…
mlfrd updated
2 months ago