-
**Is your feature request related to a problem? Please describe.**
NAPS2 is great - it's very useful to me and many others, and a big part of the utility it offers is the integrated OCR function, sin…
-
Hi @NielsRogge,
I plan to finetune a LayoutXLM large like model. Why "like model"? Because until now, Microsoft did not release LayoutXLM large mas only a version base.
As I want to train a vers…
piegu updated
7 months ago
-
**Describe**
Model I am using (TextDiffuser) on windows machine with GPU:
I'm wondering if it's possible to run the inference.py for the "text_to_image" model without training??? I have already do…
-
The provided model cannot correctly categorize some "vaguely" plotted Figures and Tables. In this case, the word in the Table region will be considered as normal Text, thus hinder the normal reading o…
-
### 请提出你的问题 Please ask your question
[2024-06-04 16:22:14,518] [ INFO] - Already cached e:\users\WangK16\.paddlenlp\models\vi-layoutxlm-base-uncased\model_state.pdparams
[2024/06/04 16:22:29] ppo…
-
ModuleNotFoundError: No module named 'fused_layer_norm_cuda'
-
# 1. Clone and push in github repository
1. Fork the Repository: Go to the repository https://github.com/NME-rahul/AI-AGS on GitHub and click on the "Fork" button in the upper right corner. This cr…
-
- https://arxiv.org/abs/2012.14740
- 2020
テキストとレイアウトの事前学習は、効果的なモデルアーキテクチャと、大規模な非ラベルのスキャン/デジタル生文書の利点により、視覚的に豊かな文書理解タスクの様々な分野で有効であることが証明されている。
本論文では、マルチモーダルなフレームワークを用いて、テキスト、レイアウト、画像の事前学習を行うLayoutL…
e4exp updated
3 years ago
-
### System Info
- `transformers` version: 4.38.1
- Platform: Linux-5.15.146.1-microsoft-standard-WSL2-x86_64-with-glibc2.31
- Python version: 3.10.13
- Huggingface_hub version: 0.20.3
- Safeten…
-
Hello, can you please provide some information on how the dicts and keys pth files were created. I am trying to use the model on my own data but am failing to do so (I already have the other box, img …