ChatGPTNextWeb / ChatGPT-Next-Web

A cross-platform ChatGPT/Gemini UI (Web / PWA / Linux / Win / MacOS). 一键拥有你自己的跨平台 ChatGPT/Gemini 应用。
https://app.nextchat.dev/
MIT License
74.33k stars 58.68k forks source link

[Feature] Add google gemini pro vision support with image uploading from local and url support #3841

Open HakaishinShwet opened 7 months ago

HakaishinShwet commented 7 months ago

Please add google gemini pro vision support with local image uploading and if possible add url support with it so that we can directly give online image link and can retrieve image and scan and give us data according to our prompt :-))

tangze-Asuka commented 7 months ago

Hello, how did you deploy gemini, I can't access the app after creating it from Cloudflare Workers, can you tell me the detailed steps, thanks

vual commented 7 months ago

我这项目支持了geimi-pro-vision上传图片,https://github.com/vual/ChatGPT-Next-Web-Pro

Issues-translate-bot commented 7 months ago

Bot detected the issue body's language is not English, translate it automatically.


My project supports geimi-pro-vision to upload pictures

HakaishinShwet commented 7 months ago

@tangze-Asuka i deployed with docker and did reverse proxy for https support with my domain and didnt got any issue tbh

HakaishinShwet commented 7 months ago

@tangze-Asuka you can selfhost better and advanced project like dify which is also open source but more advanced simply just my suggestion

SunsetMkt commented 6 months ago

https://github.com/blacksev/Gemini-Next-Web/commit/20ff4e3e5023827e4c42f74bc647b522ac2d4ff4

fred-bf commented 6 months ago

@HakaishinShwet I tested out Google Gemini Vision, there is a significant issue that prevents conversations containing images from continuing after sending them. It is expected that after Google fixes this problem, it will invest efforts in support

HakaishinShwet commented 6 months ago

@fred-bf thanks for checking dev, because i have seen that particular thing working fine in one foss project called dify so i thought maybe you could implement in that way and take idea from there codebase if something is missing that might guide, thats all i can say arigatou gozaimasu dev :-))