-
Hi,
I'm a beginner,I have a lot of confusion about your model.
When you calculate accuracy and loss,why you keep the “pad” in the result?It is not a part of the ground truth.(e.g result=“1+2+x+3 pad…
-
您好,我用的VLMEvalKit工具,权重是Mini-Monkey-2B,结构用的intervl-2B。测试结果是:
{
"Text Recognition": 246,
"Scene Text-centric VQA": 169,
"Doc-oriented VQA": 133,
"Key Information Extraction": 167,
…
-
大佬好,想咨询下数学公式识别用什么技术可以实现
-
主要代码:
openbmb/MiniCPM-V-2_6
```
# transformers==4.41.0
response = self.model.chat(image=visuals[0],msgs=msgs,tokenizer=self.tokenizer)
```
结果日志:
######################### OCRBench ###########…
-
## 🚀 Feature
Pytorch vision library has many high-level API for performing the tasks under the hood seamlessly if there can be a high-level API for OCR tasks then downloading lots of third party li…
-
[The format of the issue]
Paper name/title:
Project link:
Paper link:
Code link:
-
Hi Jianshu! Is the result (ExpRate,
-
https://github.com/tmbdev/teaching-dca
Thomas_Breuel 开授的课程
1.转换成pdf
2.pdf转换成html
3.翻译
-
表格检测
>哪些区域是表格 哪些不是(是文本、图表)
表格结构识别
>哪些是表名、标题、表头、行和列、单元格网格结构
表格数据语义提取
> table interpretation: rediscovering the meaning of the
tabular structure. This includes:
(a) functional analysis: deter…
-
### 起始日期 | Start Date
_No response_
### 实现PR | Implementation PR
_No response_
### 相关Issues | Reference Issues
_No response_
### 摘要 | Summary
minicpm-v 2.6版本的vlmevalkit支持吗,自己尝试了复现了但是达不到官网水平,不知道…