Open cuixing158 opened 3 years ago
I don't believe we currently have any CV examples with transformers that are public, but it's something we're actively looking into.
Note that we now have the built-in function visionTransformer
to load a pre-trained vision transformer, and an example fine-tuning a vision transformer.
In addition to the NLP applications, what about the CV applications? Are there any examples of transformer being used for "object detection", "behaviour recognition" or even "image classification"? It would be nice to have