这个模型好厉害，在Int4精度下都能做到准确无误，只能说是最强OCR神器，期待第二版

THUDM / CogVLM2

GPT4V-level open-source multi-modal model based on Llama3-8B

Apache License 2.0

2.02k stars 134 forks source link

这个模型好厉害，在Int4精度下都能做到准确无误，只能说是最强OCR神器，期待第二版 #103

Closed whysirier closed 3 months ago

whysirier commented 3 months ago

System Info / 系統信息

后续会有更新不

Who can help? / 谁可以帮助到您？

No response

Information / 问题信息

[ ] The official example scripts / 官方的示例脚本
[ ] My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

1张V100就能跑

Expected behavior / 期待表现

期待后续更新

zRzRzRzRzRzRzR commented 3 months ago

感谢鼓励和支持！

chuangzhidan commented 2 months ago

System Info / 系統信息

后续会有更新不

Who can help? / 谁可以帮助到您？

No response

Information / 问题信息

[ ] The official example scripts / 官方的示例脚本

[ ] My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

1张V100就能跑

Expected behavior / 期待表现

期待后续更新

你测试图片是怎样的，有难度吗