Closed MeDott29 closed 4 days ago
Please add this so transcription can be local/free. What about the TTS (text-to-speech) part, though? It'd be cool to have a back-and-forth conversation inside aider. It'd be great to be able to configure that endpoint independently too. For instance, use any model/provider for the coding LLM, but allow an independent endpoint for TTS. Example: openedai-speech is a pretty good TTS server, and free. Or third-party libs.
I'd be in favour of this. https://github.com/Aider-AI/aider/issues/1225
voice control still feels like a party trick to me. is there a person who exists in the world that has interacted with a useful voice model?
Rather than thinking of voice control as a way of programming, what if you want to summarise the business case of this weird thing, and also quickly explain the deployment environment for a piece of documentation? You can probably speak that fairly fast, and an AI can write a neat version faster than you.
Ran this thread through ChatGPT. Results below: https://chatgpt.com/share/6712d1b7-bc50-8012-826a-c1cf6b1ac44b. I think ChatGPT wants us to know that the code is the documentation.
@tdobson I think I get you though, it is good to transform a problem into a plan of action.
Thanks for trying aider and filing this issue.
A very important consideration is making it easy for users to install aider across many different platforms and environments. It seems like a dependency on local whisper would be very difficult wrt portability?
Sorry, are you proposing we switch to Groq whisper?
I propose adding the ability to use (not switch to) groq whisper. Groq Whisper is currently free (yay) and also quite good.
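To make "use, not switch to" concrete, here is a minimal sketch of an opt-in provider table for transcription. It is not aider's actual code: `TranscriptionEndpoint`, `ENDPOINTS`, `resolve_endpoint` and `transcription_url` are hypothetical names, and the Groq base URL and model name are what I believe Groq's OpenAI-compatible API uses, so check them against the current docs before relying on them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TranscriptionEndpoint:
    """Where to send audio for transcription (hypothetical structure)."""
    base_url: str
    model: str
    api_key_env: str  # name of the environment variable holding the key


# Assumed values: both services expose an OpenAI-style transcription route.
ENDPOINTS = {
    "openai": TranscriptionEndpoint(
        "https://api.openai.com/v1", "whisper-1", "OPENAI_API_KEY"
    ),
    "groq": TranscriptionEndpoint(
        "https://api.groq.com/openai/v1", "whisper-large-v3", "GROQ_API_KEY"
    ),
}


def resolve_endpoint(provider: str = "openai") -> TranscriptionEndpoint:
    """Return the endpoint for `provider`; the default stays unchanged."""
    try:
        return ENDPOINTS[provider]
    except KeyError:
        known = ", ".join(sorted(ENDPOINTS))
        raise ValueError(
            f"unknown voice provider {provider!r}; expected one of: {known}"
        ) from None


def transcription_url(endpoint: TranscriptionEndpoint) -> str:
    """Build the full URL of the OpenAI-style transcription route."""
    return endpoint.base_url.rstrip("/") + "/audio/transcriptions"


if __name__ == "__main__":
    print(transcription_url(resolve_endpoint("groq")))
```

Because the default provider is untouched, nobody who is happy with the current behaviour would notice the change, and no new install-time dependency is needed since the request is still a plain HTTPS call.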
Y'all are getting hung up on the Groq part.
That is the point (I think) that the original poster was trying to make… and it’s a good point.
There are also some whisper solutions that can run locally on-machine.
If it can be powered by local whisper already, then perhaps the poster (and me) is not aware of that. If that’s the case, maybe it can be documented more prominently.
This looks interesting. Just saw it in passing. Written in python. https://github.com/juanmc2005/diart
Going to look into it when I have some spare time.
I'm labeling this issue as stale because it has been open for 2 weeks with no activity. If there are no additional comments, it will be closed in 7 days.
I'm closing this issue because it has been stalled for 3 weeks with no activity. Feel free to add a comment here and we can re-open it. Or feel free to file a new issue at any time. Note: A bot script made these updates to the issue.
I'm still looking for something that makes it easier for me to articulate my problems and transforms them into a plan of action and then executes on those actions. https://chatgpt.com/share/6712d1b7-bc50-8012-826a-c1cf6b1ac44b
upgrade to fast and free whisper
This is hard-coded to 'device_id=8'. Notes: Did I miss the config option to set up a default audio device? It would be nice to be presented with a list of available audio devices so we can select the one we want.
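A rough sketch of what listing and selecting devices could look like, in case it helps. The helper names (`input_devices`, `choose_device`, `list_system_input_devices`) are made up for illustration and are not aider functions. The only library call assumed is `sounddevice.query_devices()`, which returns one dict per device with `name` and `max_input_channels` keys; it is imported lazily so the selection logic works without audio hardware.

```python
def input_devices(devices):
    """Return (index, name) pairs for devices that can record audio."""
    return [
        (index, device["name"])
        for index, device in enumerate(devices)
        if device.get("max_input_channels", 0) > 0
    ]


def choose_device(devices, configured=None):
    """Validate a configured device index instead of hard-coding one.

    Returns None when nothing is configured, so the audio library can
    fall back to the system default input device.
    """
    if configured is None:
        return None
    available = input_devices(devices)
    if configured not in [index for index, _ in available]:
        listing = ", ".join(f"{index}: {name}" for index, name in available)
        raise ValueError(
            f"audio device {configured} is not an input device; "
            f"available: {listing or 'none'}"
        )
    return configured


def list_system_input_devices():
    """Query real hardware (needs the third-party sounddevice package)."""
    import sounddevice as sd  # imported lazily; requires PortAudio

    return input_devices(sd.query_devices())


if __name__ == "__main__":
    # Fake device list shaped like sounddevice's output, for a dry run.
    fake = [
        {"name": "Built-in Output", "max_input_channels": 0},
        {"name": "USB Microphone", "max_input_channels": 1},
    ]
    for index, name in input_devices(fake):
        print(f"{index}: {name}")
```

The error message doubles as the device list, so a wrong index tells you which ones are valid.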