We have a working implementation of BLIP and 3 of its variants in huggingface transformers (image captioning, visual question answering, image text retrieval): https://github.com/huggingface/transformers/pull/20716 that is not merged yet
The license of the repository and model states that:
3. Neither the name of [Salesforce.com](http://salesforce.com/) nor the names of its contributors may be used to endorse or promote products derived from this software without specific prior written permission.
We would like to promote the addition of this architecture to transformers library. Therefore I would like to ask you the permission for promoting this contribution
Dear authors,
We have a working implementation of
BLIP
and 3 of its variants in huggingfacetransformers
(image captioning, visual question answering, image text retrieval): https://github.com/huggingface/transformers/pull/20716 that is not merged yetThe license of the repository and model states that:
We would like to promote the addition of this architecture to
transformers
library. Therefore I would like to ask you the permission for promoting this contributionThank you very much in advance