About dataset processing

InsaneOnion / ConCLR

A code repository that attempts to implement Context-Based Contrastive Learning for Scene Text Recognition (AAAI 2022) (unofficial)

Closed by SuperGGB0 3 months ago

SuperGGB0 commented 3 months ago

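Hi, I'm also preparing to reproduce this paper, but I've run into a problem with the dataset processing. In the SynthText dataset a single image can be quite complex, not as simple as the ones shown in the paper. So when concatenating images, do I first need to cut each image into small crops, one per text string? ![Uploading 屏幕截图 2024-07-01 203037.jpg…]()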

InsaneOnion commented 3 months ago

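What do you mean by an image being complex? My approach is to concatenate the images directly and then resize the result; the resulting samples are shown in the README.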
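For reference, a minimal sketch of the "concatenate, then resize" step described above. This is not the repository's code: the function names, the 32x128 output size, and the nearest-neighbour resize (used only to keep the example NumPy-only) are all assumptions for illustration.

```python
import numpy as np

def resize_nearest(img, out_h, out_w):
    """Nearest-neighbour resize of an H x W (x C) array.

    A dependency-free stand-in for a real image-library resize.
    """
    in_h, in_w = img.shape[:2]
    rows = np.arange(out_h) * in_h // out_h
    cols = np.arange(out_w) * in_w // out_w
    return img[rows[:, None], cols[None, :]]

def concat_and_resize(img_a, label_a, img_b, label_b, out_h=32, out_w=128):
    """Join two word images side by side and resize to a fixed size.

    out_h / out_w are assumed values, not taken from the thread.
    """
    parts = []
    for img in (img_a, img_b):
        # Bring each crop to the common height, keeping its aspect ratio.
        new_w = max(1, round(img.shape[1] * out_h / img.shape[0]))
        parts.append(resize_nearest(img, out_h, new_w))
    merged = np.concatenate(parts, axis=1)  # horizontal concatenation
    # The label of the merged image is the two labels joined in the same order.
    return resize_nearest(merged, out_h, out_w), label_a + label_b
```

This only makes sense when each input is already a single-word crop, which is the point raised in the question.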

InsaneOnion commented 3 months ago

The screenshot you posted isn't displaying properly.

SuperGGB0 commented 3 months ago

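Sorry, here is the image. It looks like this: the text in the image is scattered all over the place, not as uniform and regular as in the paper.

Dataset:

In the paper: [screenshot 2024-07-01 212324]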

InsaneOnion commented 3 months ago

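Sorry, my previous answer wasn't quite right. You do need to crop the original dataset to get the dataset used in the paper. See FangShancheng/ABINet/blob/main/tools/crop_by_word_bb_syn90k.py, or simply download the LMDB datasets provided in the FangShancheng/ABINet repository.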
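A rough sketch of what cropping by word bounding box can look like. It is not a reproduction of the ABINet script, which may handle margins or rotated boxes differently. It assumes the word boxes come as a 2 x 4 x N array (x/y, four corners, N words), which is how I understand the SynthText `wordBB` annotation to be laid out; `crop_words` and its parameters are hypothetical names.

```python
import numpy as np

def crop_words(image, word_bb, words):
    """Cut one axis-aligned crop per word out of a scene image.

    image:   H x W (x C) array
    word_bb: 2 x 4 x N array of corner coordinates (assumed layout);
             a single word may arrive as 2 x 4
    words:   list of N transcriptions, in the same order as the boxes
    """
    h, w = image.shape[:2]
    bb = np.asarray(word_bb, dtype=float)
    if bb.ndim == 2:
        bb = bb[:, :, None]  # promote a single box to 2 x 4 x 1
    crops = []
    for i, word in enumerate(words):
        xs, ys = bb[0, :, i], bb[1, :, i]
        # Enclosing rectangle of the quadrilateral, clipped to the image.
        x0 = int(max(0, np.floor(xs.min())))
        x1 = int(min(w, np.ceil(xs.max())))
        y0 = int(max(0, np.floor(ys.min())))
        y1 = int(min(h, np.ceil(ys.max())))
        if x1 <= x0 or y1 <= y0:
            continue  # skip boxes that fall entirely outside the image
        crops.append((image[y0:y1, x0:x1].copy(), word))
    return crops
```

Each returned pair is a single-word image with its label, which is the form the concatenation step expects.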

SuperGGB0 commented 3 months ago

Haha, I just found the ABINet datasets as well. Thanks!

InsaneOnion commented 3 months ago

Sure, feel free to bring up any other questions~