rhysdg / vision-at-a-clip

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
16 stars, 1 fork

Bug/feat - SigLIP full model handling #4

Closed rhysdg closed 2 months ago

rhysdg commented 2 months ago

Environment

Incoming Changes: