TencentARC / BTS

BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild
Other
24 stars 0 forks source link

BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild

This is the repo to host the dataset BTS from the following paper:

[Xixi Xu](), [Zhongang Qi](), Jianqi Ma, [Honglun Zhang](), [Ying Shan](), [Xiaohu Qie](), BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild

Summary of license permissions:

Our dataset is now fully released for academic use. The researcher shall use the BTS dataset only for non-commercial algorithm research and educational purposes. Except for the above purposes, the researcher may not use the BTS dataset for any other purposes, including but not limited to distribution, commercial use, advertising, etc.

You can download the dataset from the following link, only if you agree to the above permissions.

https://drive.weixin.qq.com/s?k=AJEAIQdfAAofh5N4rQ

Selection of scenes.

The key motivation of the selection of scenes is to ensure the representation and generalization of the dataset.

We believe that varieties in these three perspectives can ensure the segmentation model to be well-trained with better generalization.

Dataset annotation.

BTS eliminate algorithms and out-of-the-box models for the labeling process to prevent some bad labeling cases. The annotation workflow is as follows.

The designed workflow ensures all annotations to be made in relatively high quality and benchmark to be highly-reliable.

Dataset statistics.

BTS contains 14250 images.

The distribution is nearly balanced, which is consistent with real-world distribution.

Download

A full download should contain these files:

In each zip packages, there are three folds.

In this table, we compare BTS with a variety of representative datasets.

Dataset Text Type Images Words Chars Masks Char Classes Language
ICDAR13 FST Scene 462 1944 6620 Word,Char 36 English
COCO_TS Scene 14690 139034 - Word 36 English
MLT_S Scene 6896 30691 - Word 36 English
Total-Text Scene 1555 9330 - Word 36 English
TextSeg Scene+Design 4024 15691 73790 Word,Word-Effect,Char 36 English
BTS(Ours) Scene 14250 44280 209090 Word,Char 3985 Bi-lingual

Acknowledgements