Jyouhou / UnrealText

Synthetic Scene Text from 3D Engines
MIT License
244 stars 39 forks source link

Can you share the word crop code #23

Closed wushilian closed 2 years ago

wushilian commented 3 years ago

In the paper : "We crop from the proposed multilingual dataset. We discard images with widths shorter than 32 pixels as they are too blurry, and obtain 4.1M word images in total." But I ended up with more than 7 million text line images.

Jyouhou commented 3 years ago

How did you crop the text regions? Did you use axis-aligned boxes or quadrilaterals?

wushilian commented 3 years ago

@Jyouhou I use axis-aligned boxes,and only the rectangle with width and height greater than 32 is reserved

Jyouhou commented 3 years ago

Thanks for the reply.

Most text are highly oriented in the dataset. I filtered by the shortest edge of the quadrilaterals (not the axis-aligned boxes).

wushilian commented 3 years ago

@Jyouhou Can you share your wechat? It's more convenient to communicate

Jyouhou commented 3 years ago

Sure. You can send your wechat account to my cmu email: shangbal@cs.cmu.edu