-
Hi I am using LayoutReader and I can see that there is a layout only model but the code provided needs the textual data from ReadingBank. Is there a way to get/use the layout only version of LayoutRea…
-
Thanks for providing such a concise and clean code for beit3. There may be a typo/error in the `datasets.py` here: https://github.com/microsoft/unilm/blob/9102ed91f8e56baa31d7ae7e09e0ec98e77d779c/bei…
-
**Describe**
Model I am using (UniLM, MiniLM, LayoutLM ...):
I'm confused about the dimension of max_2d_position_embedding which is **1024** given in the code. I think the x,y or w,h is only **one**…
-
zcuuu updated
2 years ago
-
**Describe**
Thanks for the amazing work [kosmos-2].
I deploy the web-demo in my local host following the Readme, and the running script like this:
```
#!/bin/bash
partition=
model_path=mo…
-
When I use layoutlmv3 for object detection tasks, and after training, I infer the image, the code reference is unilm/dit/object_ Inference. py under detection, but an error occurred, OSError: Couldn't…
-
Hi @NielsRogge ! I want to create my own pretrained raw model on Bangla language (like trocr-small-stage1)and further fine tune it with bangla dataset. I have gone through the official implementation …
-
**Describe**
Model I am using (UniLM, MiniLM, LayoutLM ...): LayoutXML
I am trying to fine-tune LayoutXML model on DocVQA task. I am wondering if a document in a foreign language say in German, wo…
-
**Describe**
Model I am using (UniLM, MiniLM, LayoutLM ...):
Hello! Personally, I like this model very much. I used it to fine-tune the downstream tasks and easily achieved good results. I'm curious…
-
When I run "gunicorn -k uvicorn.workers.UvicornWorker --chdir /app/src app:app --bind 0.0.0.0:5060 --timeout 10000" to start, there appears an issue to read "doclaynet_VGT_model.pth".
It turns out tha…