Open rom1504 opened 1 year ago
Hi @rom1504, I am very interested in this, particularly text/text/image using a single tower for text. If you have started anything I would love to see it, otherwise I will probably try and get something going in the next couple of weeks.
Check #323
Amazing! Thank you!
Can we adapt openclip to be able to train text/text contrastive models?
And beyond that maybe, text/test/image models ?
use case:
options:
It would be nice to find a way to do this without making the code overly complicated.
It goes in a direction of supporting more modalities combination in openclip
A motivation is there are few good models for text,text surprisingly, even though the community on this is quite active
a related idea is image/image as inspired by https://arxiv.org/abs/2212.08045
reference of private models to beat: