-
### Ticket Contents
## Goal
Create a bot capable of answering user questions based on RAG framework using government data extracted from PDFs.
## Description
The project aims to develop a chatbo…
-
Hello, thank you for your published paper and the open model. I am preparing to use your method to train on LaTeX-type data, such as [im2latex](https://zenodo.org/records/56198#.YkniL25Bx_S). I would…
-
### System Info
Hello, we are using the TR-OCR model exported to Onnx. We notice a problem with the large checkpoints for both printed and handwritten; when we run inference using the onnxruntime j…
-
> 很感谢你的想法,关于合成公章我这边生成很多,目前缺少的是真实公章。
>
> 还是很感谢你的分享,不知道你这边是否收集了这个[ICDAR 2023 Competition on Reading the Seal Title](https://rrc.cvc.uab.es/?ch=20&com=downloads)比赛的数据集。
>
> 至于生成公章,不知道…
-
### System Info
platform: Windows 10
optimum version 1.19.2
transformers version 4.40.2
onnx version 1.16.0
onnxruntime version 1.17.3
### Who can help?
@amyeroberts
@pacman100
### Informatio…
feff2 updated
5 months ago
-
**Document Understanding**
Some example models:
1. DiT: https://huggingface.co/microsoft/dit-large
2. LayoutLMv3: https://huggingface.co/microsoft/layoutlmv3-large
3. Donut: https://huggingfac…
jlia0 updated
2 weeks ago
-
I encountered an issue when trying to export the facebook/m2m100_418M model using the optimum-cli tool. The error message indicates that the m2m-100-encoder is not supported, despite m2m-100 being lis…
-
您好,请教一下,我在您提供的数据集上训练了100个epochs,训练后的模型在训练集上效果很好,但是在新的印章图像上推理,却完全没有效果。训练参数没有做变动,数据差不多15000张。
这是在训练集上的推理效果
![下载](https://github.com/user-attachments/assets/4b5c1063-c541-4fd8-8e12-b67dc6b3a665)
这是在…
-
### System Info
```shell
- `optimum` version: 1.6.4.dev0
- `transformers` version: 4.26.0
- Platform: Linux-5.4.0-125-generic-x86_64-with-glibc2.17
- Python version: 3.8.15
- Huggingface_hub vers…
-
The goal of this project is to get a pipeline which is able to extract desired fields from an image practicing LLM fine-tuning/prompt engineering.
Approach: Use image OCR (optical character recogn…