ahsan3219 closed this issue 6 months ago
That is the plan. Thank you, @ahsan3219.
Hi @ahsan3219 @Real-Experienes, there is an open-source model available to use in the project. Check out the LLaVA section in the README.md: https://github.com/OthersideAI/self-operating-computer?tab=readme-ov-file#locally-hosted-llava-through-ollama
Let me know if you have any questions about it!
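For anyone wondering what talking to a locally hosted LLaVA looks like, here is a minimal sketch. It is not the project's own code. It assumes Ollama is running on its default local port (11434) and that the `llava` model has already been pulled; the function names `build_llava_payload` and `ask_llava` and the file name `screenshot.png` are hypothetical, chosen only for illustration.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is serving on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_llava_payload(image_bytes: bytes, prompt: str, model: str = "llava") -> dict:
    """Build the JSON body for Ollama's /api/generate endpoint.

    Images are passed as base64-encoded strings in the "images" list.
    """
    return {
        "model": model,
        "prompt": prompt,
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,  # ask for one JSON object instead of a stream
    }


def ask_llava(image_bytes: bytes, prompt: str, url: str = OLLAMA_URL) -> str:
    """Send an image and a prompt to the local model and return its text reply."""
    body = json.dumps(build_llava_payload(image_bytes, prompt)).encode("utf-8")
    request = urllib.request.Request(
        url, data=body, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]


if __name__ == "__main__":
    # Hypothetical screenshot file; replace with a real image path.
    with open("screenshot.png", "rb") as f:
        print(ask_llava(f.read(), "Describe what is on this screen."))
```

Keeping the payload builder separate from the network call makes it easy to check the request shape without a running server.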
@joshbickett
It is only for Linux and macOS. There is no option available for Windows.
@ahsan3219, oh ok. We can reopen this issue in that case. I noticed the original issue didn't mention Windows. It may be helpful to create a new issue that calls out Windows specifically, so the open-source community can quickly understand and address it.
Thanks @joshbickett !! I'm on it!
Is your feature request related to a problem? Please describe.
The current reliance on OpenAI for vision models poses limitations and concerns regarding accessibility and transparency due to its proprietary nature.

Describe the solution you'd like
I propose integrating an open-source vision model into the project to replace the dependency on OpenAI. This solution would ensure greater accessibility, transparency, and long-term sustainability.

Describe alternatives you've considered
Alternative solutions include continuing to use OpenAI, but this maintains the issues of proprietary technology and potential barriers to access for users. Another option is developing a custom vision model, but this could be time-consuming and resource-intensive.

Additional context
Incorporating an open-source vision model aligns with the principles of open access and transparency, fostering community engagement and innovation. It offers benefits such as greater customization, flexibility, and trust within the user community.