annotation (corresponding to one line in .jsonl):
{
image_id: str,
file_name: str,
width: int,
height: int,
annotations: [sentance_0, sentance_1, sentance_2, ...], # MUST NOT be empty
ignore: [ignore_0, ignore_1, ignore_2, ...], # MAY be an empty list
}
sentance:
[instance_0, instance_1, instance_2, ...] # MUST NOT be empty
instance:
{
polygon: [[x0, y0], [x1, y1], [x2, y2], [x3, y3]], # x, y are floating-point numbers
text: str, # the length of the text MUST be exactly 1
is_chinese: bool,
attributes: [attr_0, attr_1, attr_2, ...], # MAY be an empty list
adjusted_bbox: [xmin, ymin, w, h], # x, y, w, h are floating-point numbers
}
attr:
"occluded" | "bgcomplex" | "distorted" | "raised" | "wordart" | "handwritten"
ignore:
{
polygon: [[x0, y0], [x1, y1], [x2, y2], [x3, y3]],
bbox: [xmin, ymin, w, h],
]
https://github.com/yuantailing/ctw-baseline/blob/master/tutorial/1-basics.ipynb said:
Maybe the
sentance
should besentence
, right?