Matting Anything Model (MAM), an efficient and versatile framework for estimating the alpha matte of any instance in an image with flexible and interactive visual or linguistic user prompt guidance.
As mentioned in the paper, we trained MAM with SAM ViT-B checkpoints frozen on 8 RTX A6000 GPUs, with
a batch size of 10 images per GPU. The total 20,000 iterations take ~15h.
As mentioned in the paper, we trained MAM with SAM ViT-B checkpoints frozen on 8 RTX A6000 GPUs, with a batch size of 10 images per GPU. The total 20,000 iterations take ~15h.