Open afiaka87 opened 3 years ago
@lucidrains How easy (or even possible) would it be to use a custom mask "structure"? Perhaps the target format could be whatever typical COCO-style segmentation data looks like, and then you could build an abstraction on top of that which generates a segment from, e.g., a white background. Or even better, the inverse of the white background, so that the shape of the "mannequin" or whatnot is all that is left unmasked.
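To make the white-background idea concrete, here's a rough sketch of what I mean. It is not code from this repo: the function names, the `threshold` default, and the plain H x W boolean array as the mask format are all my own assumptions, and converting to or from COCO-style annotations isn't shown.

```python
import numpy as np


def white_background_mask(image, threshold=250):
    """Return an H x W bool array that is True where a pixel is near-white.

    `image` is assumed to be an H x W x 3 uint8 RGB array. `threshold` is a
    hypothetical cutoff: a pixel counts as background only if all three
    channels are >= threshold.
    """
    image = np.asarray(image)
    return (image >= threshold).all(axis=-1)


def subject_mask(image, threshold=250):
    """Inverse of the background mask: True on the subject (e.g. the mannequin)."""
    return ~white_background_mask(image, threshold)


# Toy example: a 4x4 white image with a dark 2x2 "subject" in the middle.
image = np.full((4, 4, 3), 255, dtype=np.uint8)
image[1:3, 1:3] = (40, 60, 80)

background = white_background_mask(image)  # region to mask out
subject = subject_mask(image)              # region left unmasked
```

Whether `True` should mean "masked" or "kept" depends on whatever convention the model code expects, so treat that as something to check rather than a given. A hard threshold will also misfire on white areas inside the subject.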
I think "image mask engineering" is going to be about as important as people are finding prompt engineering to be. Masking is used widely in the OpenAI blog post. Anyway, as usual, my scatterbrain keeps editing code and forgetting to upstream it when it's worthwhile, so here's a partial diff so I don't forget to merge this. If anyone else wants to merge it, I don't mind.