rhysdg / vision-at-a-clip

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
16 stars 1 forks source link

Feat - Siglip onnx, clip surgery onnx and multiple context #1

Closed rhysdg closed 3 months ago

rhysdg commented 3 months ago

Environment

Incoming Changes :