rhysdg / vision-at-a-clip

Low-latency ONNX and TensorRT based zero-shot classification and detection with contrastive language-image pre-training based prompts
16 stars 1 forks source link

Feat- add a Whisper (onnxruntime-genai) option for realtime prompts, robotics etc #24

Open rhysdg opened 1 month ago

rhysdg commented 1 month ago