szczyglis-dev / py-gpt

Desktop AI Assistant powered by GPT-4, GPT-4 Vision, GPT-3.5, DALL-E 3, Langchain, Llama-index, chat, vision, voice control, image generation and analysis, autonomous agents, code and command execution, file upload and download, speech synthesis and recognition, access to Web, memory, prompt presets, plugins, assistants & more. Linux, Windows, Mac.
https://pygpt.net
MIT License
449 stars 92 forks source link

this app vs. GPT browser version #56

Open tpl opened 1 month ago

tpl commented 1 month ago

What can this application actually do more than the browser-based version of GPT. Where there is an advantage ?

Pointfit commented 3 weeks ago

I am thinking the same thing

gfsysa commented 1 day ago

ah, check the read.me

gfsysa commented 1 day ago

Even easier:

Desktop AI Assistant for Linux, Windows and Mac, written in Python. Works similarly to ChatGPT, but locally (on a desktop computer). 9 modes of operation: Chat, Vision, Completion, Assistant, Image generation, Langchain, Chat with files, Experts and Agent (autonomous). Supports multiple models: GPT-4, GPT-3.5, and any model accessible through Langchain. Included support features for individuals with disabilities: customizable keyboard shortcuts, voice control, and translation of on-screen actions into audio via speech synthesis. Handles and stores the full context of conversations (short-term memory). Real-time video camera capture in Vision mode. Internet access via Google and Microsoft Bing. Speech synthesis via Microsoft Azure, Google, Eleven Labs and OpenAI Text-To-Speech services. Speech recognition via OpenAI Whisper, Google, Google Cloud and Microsoft Bing. Image analysis via GPT-4 Vision. Crontab / Task scheduler included. Integrated Langchain support (you can connect to any LLM, e.g., on HuggingFace). Integrated Llama-index support: chat with txt, pdf, csv, html, md, docx, json, epub, xlsx, xml, webpages, Google, GitHub, video/audio, images and other data types, or use conversation history as additional context provided to the model. Integrated calendar, day notes and search in contexts by selected date. Commands execution (via plugins: access to the local filesystem, Python code interpreter, system commands execution). Custom commands creation and execution. Manages files and attachments with options to upload, download, and organize. Context history with the capability to revert to previous contexts (long-term memory). Allows you to easily manage prompts with handy editable presets. Provides an intuitive operation and interface. Includes a notepad. Includes simple painter / drawing tool. Includes optional Autonomous Mode (Agents). Supports multiple languages. Enables the use of all the powerful features of GPT-4, GPT-4V, and GPT-3.5. Requires no previous knowledge of using AI models. Simplifies image generation using DALL-E 3 and DALL-E 2. Possesses the potential to support future OpenAI models. Fully configurable. Themes support. Real-time code syntax highlighting. Plugins support. Built-in token usage calculation. It's open source; source code is available on GitHub. Utilizes the user's own API key.