TheR1D / shell_gpt

A command-line productivity tool powered by AI large language models like GPT-4, will help you accomplish your tasks faster and more efficiently.
MIT License
8.86k stars 697 forks source link

Added GPT-4 Vision Support & Token Counter #418

Open duysqubix opened 6 months ago

duysqubix commented 6 months ago

Great project, not sure if its dead or not.. but with the new vision preview models. I have modified the original codebase to support vision by supplying a --image option. Only works with the gpt-4-vision-preview model, supports both local images and URLs.

Also added an extra dependency that estimates the number of tokens.

luandro commented 5 months ago

Would be great to have vision available.