MotazSabri / Hanami-release

Live translator that captures any audio that comes from a WINDOWS speaker or microphone and translates it to the desired language.
32 stars 7 forks source link

Option for transcribe only - transcribe to text? #5

Closed FlowZenMaster closed 1 month ago

FlowZenMaster commented 1 year ago

I've been looking for a simple tool that takes system audio and transcribes into a text file on a streaming basis. It seems like this tool can do it, but I can't figure out how to get it to NOT translate. Looking for english->english essentially, but written to a file. There is a lot of use for a free program that can accomplish this.

MotazSabri commented 1 year ago

I've been looking for a simple tool that takes system audio and transcribes into a text file on a streaming basis. It seems like this tool can do it, but I can't figure out how to get it to NOT translate. Looking for english->english essentially, but written to a file. There is a lot of use for a free program that can accomplish this.

Thanks for your remark. The app transcribes the audio into text as you kindly highlighted but we don't save it or keep any copy for it to maintain user privacy.

As we mentioned in the project Readme, we used Google API for the transcription. You can give it a try or try a more modern alternative such as OpenAI Whipser.

Hope this helps.

MotazSabri commented 5 months ago

@FlowZenMaster Appreciate your input! In response to your request, I've incorporated language transcription in the latest release. While it currently doesn't save text to a file, it can assist users with hearing difficulties by enabling them to read meeting/video content through the Transcribe service.

Looking ahead, if there's substantial user support, I might explore creating individual transcriptions with speaker identification. However, this could be a challenging endeavor, particularly if I aim to maintain the app as a free resource. Your support is invaluable!

SmolYui commented 3 months ago

expanding on this, a feature where it shows both the transcription and the translation would be useful for me.

MotazSabri commented 3 months ago

expanding on this, a feature where it shows both the transcription and the translation would be useful for me.

Thanks for your input, @SmolYui !

I've implemented a new feature called Mixture, which combines both translation and transcription functionalities. However, due to the fixed size of the application, selecting the Mixture service causes the font size to decrease. The font size returns to normal when reverting back to the original services (Translate and Transcribe).

SmolYui commented 3 months ago

@MotazSabri Excellent! Thank you for you quick implimentation! Although, the service is labelled "Mixtrue" instead of "Mixture". ^w^

MotazSabri commented 3 months ago

@MotazSabri Excellent! Thank you for you quick implimentation! Although, the service is labelled "Mixtrue" instead of "Mixture". ^w^

Thanks for your feedback, @SmolYui . I created a hotfix setup for this and pushed it as an intermediate release. You may get it from here.