Adding vision through a web cam

jackwuwei / gptspeaker

The ChatGPT Voice Assistant uses a Raspberry Pi (or desktop) to enable spoken conversation with OpenAI large language models. This implementation listens to speech, processes the conversation through the OpenAI service, and responds back. Like Apple Siri, Amazon Alex, Google Nest Home, Mi XiaoAi etc.

BSD 2-Clause "Simplified" License

41 stars 6 forks source link

Adding vision through a web cam #7

Open tarunaroraonline opened 1 month ago

tarunaroraonline commented 1 month ago

What are your thoughts on adding vision capability by bolting on a web cam?

jackwuwei commented 1 month ago

This is a good idea and can be used to achieve more interesting things with the gpt-4v model.