pHaeusler closed this 10 months ago
RLE is mentioned here: https://opencv.github.io/cvat/docs/manual/advanced/xml_format/
Basically, the bitmap:
[0 0 1] [1 1 1] [0 0 1]
is encoded as: 2 4 2 1 (two zeros, four ones, two zeros, one one)
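As a rough illustration of the encoding described above, here is a minimal pure-Python sketch. The function name `rle_encode` is hypothetical (it is not part of cvat_sdk), and it assumes the bitmap is flattened row by row and that the first count always refers to zeros, so a bitmap starting with a one would begin with a count of 0.

```python
def rle_encode(bitmap):
    """Run-length encode a 2D 0/1 bitmap, read row by row.

    Assumption: counts alternate zeros, ones, zeros, ... and the
    first count is the number of leading zeros (possibly 0).
    """
    counts = []
    current = 0  # value of the run being counted; runs start with zeros
    run = 0      # length of the current run
    for row in bitmap:
        for pixel in row:
            pixel = 1 if pixel else 0
            if pixel == current:
                run += 1
            else:
                counts.append(run)
                current = pixel
                run = 1
    counts.append(run)  # close the final run
    return counts


bitmap = [[0, 0, 1], [1, 1, 1], [0, 0, 1]]
print(rle_encode(bitmap))  # [2, 4, 2, 1]
```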
Is there documentation for the mask format used in the cvat_sdk?
Not sure I understand this question. cvat_sdk can take different formats and pass them to the CVAT server, where they are parsed in different ways.
@bsekachev - the RLE in the XML format doesn't contain the bounding box dimensions suffix, rather the width and height are added as attributes to the
When using the cvat_sdk, you must concatenate the RLE and the bounding box coordinates to make a valid points array.
The ask is to add documentation on this.
Is there documentation for the mask format used in the cvat_sdk?
I reverse engineered it by looking at: https://github.com/opencv/cvat/blob/develop/cvat-core/src/annotations-collection.ts#L849
Essentially, you need to provide a points array containing the RLE values of the mask cropped to its bounding box, followed by the [left, top, right, bottom] corners of the bounding box.
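For anyone else landing here, a minimal sketch of building such a points array from a full-image binary mask, in plain Python. The helper name `mask_to_points` is hypothetical and not part of cvat_sdk. It assumes that the RLE counts start with zeros, that the mask is read row by row inside the bounding box, and that `right` and `bottom` are inclusive pixel coordinates; these assumptions come from reading the linked code, not from official documentation, so verify them against your CVAT version.

```python
def mask_to_points(mask):
    """Build a points list: RLE of the cropped mask + [left, top, right, bottom].

    Assumptions (not confirmed by official docs): the first RLE count is
    for zeros, and right/bottom are inclusive pixel coordinates.
    """
    rows = [y for y, row in enumerate(mask) if any(row)]
    cols = [x for row in mask for x, pixel in enumerate(row) if pixel]
    if not rows:
        raise ValueError("mask has no foreground pixels")

    top, bottom = rows[0], rows[-1]
    left, right = min(cols), max(cols)

    # Run-length encode only the pixels inside the bounding box.
    counts = []
    current, run = 0, 0
    for row in mask[top:bottom + 1]:
        for pixel in row[left:right + 1]:
            pixel = 1 if pixel else 0
            if pixel == current:
                run += 1
            else:
                counts.append(run)
                current, run = pixel, 1
    counts.append(run)

    return counts + [left, top, right, bottom]


mask = [
    [0, 0, 0, 0, 0],
    [0, 0, 1, 1, 0],
    [0, 1, 1, 1, 0],
    [0, 0, 0, 0, 0],
]
print(mask_to_points(mask))  # [1, 5, 1, 1, 3, 2]
```

The resulting list is what would go into the `points` field of a mask shape. Note that the RLE covers only the cropped region, not the whole image.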