Closed lh-13 closed 6 months ago
What should I use instead of clip image encoder? Because clip image encoder will reduce the reference image to 224*224, and some details may be last. so I wonder if there is a feature extrator with a higher resolution ?
Higher resolution results in higher computational overhead...... I believe one way to enhance the performance of an image encoder is to train its image restoration capabilities. (Maybe you can develop a better feature extractor.)
What should I use instead of clip image encoder? Because clip image encoder will reduce the reference image to 224*224, and some details may be last. so I wonder if there is a feature extrator with a higher resolution ?