opendatalab / UniMERNet

UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition
https://huggingface.co/spaces/opendatalab/UniMERNet-Demo
Apache License 2.0
168 stars 13 forks source link

使用CDM算分结果为nan #26

Open yjc11 opened 1 week ago

yjc11 commented 1 week ago

[{'pred': '\begin{matrix}{=\frac{7}{25}\times\frac{4}{9}+\frac{7}{9}\times\frac{5}{9}\\ =\frac{7}{25}\times(\frac{1}{9}+\frac{5}{9})\ \\ =\frac{7}{25}\times1\\ =\frac{25}{7}\times1\\ =\frac{7}{7}\\ =\frac{7}{7}\end{matrix}', 'gt': '\frac{7}{25}\times\frac{4}{9}+\frac{7}{25}\times\frac{5}{9}\n=\frac{7}{25}\time s\left(\frac{4}{9}+\frac{5}{9}\right)\n=\frac{7}{25}\times1\n=\frac{7}{25}'}]

ReferenceError: exit is not defined at parseArray (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/environments.js:26:13) at Object.handler (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/environments.js:190:11) at Parser.parseImplicitGroup (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/Parser.js:452:26) at Parser.parseAtom (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/Parser.js:306:21) at Parser.parseExpression (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/Parser.js:151:25) at Parser.parseInput (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/Parser.js:118:27) at Parser.parse (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/Parser.js:109:22) at parseTree (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/src/parseTree.js:14:19) at Object.generateParseTree [as __parse] (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/third_party/katex/katex.js:61:12) " 20:37 12-Sep-24 at Interface. (/public/youjiachen/workspace/cdm-demo/modules/tokenize_latex/preprocess_formula.js:51:30) 2024-09-12 12:37:54 extract bbox done, time cost: 2.955 s 100%|███████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 1/1 [00:00<00:00, 5349.88it/s] /usr/local/lib/python3.8/site-packages/numpy/core/fromnumeric.py:3464: RuntimeWarning: Mean of empty slice. " 20:37 12-Sep-24 return _methods._mean(a, axis=axis, dtype=dtype, /usr/local/lib/python3.8/site-packages/numpy/core/_methods.py:192: RuntimeWarning: invalid value encountered in scalar divide ret = ret.dtype.type(ret / rcount) 2024-09-12 12:37:54 calculate metrics done, time cost: 0.003 s => process done, mean f1 score: nan. => more details of metrics are saved in output/m2e/metrics_res.json => visulization images are saved under output/m2e/vis_match

wangbinDL commented 1 week ago

@wufan-tb 帮忙看下这个问题

wufan-tb commented 1 week ago

json里面需要把反斜杠进行转义,参考这个文件里给的案例https://github.com/opendatalab/UniMERNet/blob/main/cdm/assets/example/input_example.json 改完之后分数可能是0,pred应该是有语法问题

wangbinDL commented 1 week ago

@yjc11 我们已经更新README,可以重新试一下,有问题欢迎随时反馈。