ztjhz / BetterChatGPT

An amazing UI for OpenAI's ChatGPT (Website + Windows + MacOS + Linux)
http://bettergpt.chat/
Creative Commons Zero v1.0 Universal
7.87k stars 2.65k forks source link

BetterChatGPT compatibility with GPT-4 Turbo Vision API for image and text processing #488

Open SpeederSpeederSpeder opened 8 months ago

SpeederSpeederSpeder commented 8 months ago

At present, BetterChatGPT offers an enriched interaction experience compared to the standard version of ChatGPT. Users can write messages in text form and receive text responses. However, the current version does not support multimodal capabilities, such as analyzing and understanding images in conjunction with text.

I would like BetterChatGPT to integrate the latest version of the GPT-4 API, GPT-4 Turbo Vision. This advanced functionality would enable BetterChatGPT not only to process the text entered by users, but also to analyze the images provided to generate more contextual and accurate responses.

The idea would be to endow BetterChatGPT with the ability to read images attached by the user in the dialog box. In addition to the question or text command, the user could add an image. BetterChatGPT, drawing on the power of GPT-4 Turbo Vision, could then examine the image to provide an answer that takes into account the visual content. For example, the user could ask a question about the historical or cultural content of a photograph, request analyses or summaries based on graphics, and much more.

animalnots commented 2 weeks ago

https://github.com/animalnots/BetterChatGPT-VISION/releases