Open chullhwan-song opened 5 years ago
Given predictions on pixels and links, two different thresholds can be applied on them separately.
Positive pixels are then grouped together using positive links, resulting in a collection of CCs,
each representing a detected text instance.
Thus instance segmentation is achieved. It is worth noting that, given two neighboring
positive pixels, their link are predicted by both of them, and they should be connected
when one or both of the two link predictions are positive.
This linking process can be implemented using disjoint-set data structure.
https://arxiv.org/abs/1801.01315