speech-translator-ext / speech-translator-readme

This repository outlines the procedures and general information for the Speech Translator project.
https://st.kappaflow.dev
41 stars 2 forks source link

[feature request] Tweaking the layout + translation both ways ? #12

Open Loumona7 opened 9 months ago

Loumona7 commented 9 months ago

Hello. First of all, happy New year and THANK YOU for your amazing app.

I have multiple questions.

1) Would an app version be possible ? Or not ? I guess it's more logic to make it a chrome extension, so like... You don't need to explain why too much, I already assume it's better as an extension.

2) apparently you put a multiple-setup option for interviews ? I personally use your app as a way to put subtitles ON my twitch OBS layout. But does this option allows to translate back and forth between two languages ? This is the weird setup I did before using an extension in chrome and one in edge :

Screenshot_2024-01-08-02-40-13-155_tv orange

Basically on one side I had french to English with chrome On the other side English to french with edge. But of course, one side was ALWAYS gibberish when I talked in the other languages.

Or does this option only translate multiple languages SO one language ? Like, German and french TO English only ?...

3) Would it be possible to make the space between each sentence thinner ? :((( Like, this is hard to put in my OBS layout because of the huge space between each sentence/translation... I need to be able to make it smaller. 1) space between lines 2) space between sentences 3) space between translations

Also, do you think we could make each language a different color ? I am okay with the font, also you explained the reason for choosing this font. But the space between each translations is just too big o:

Thank you. Also maybe... Maybe options where it doesn't scroll ? But just goes down without scrolling down ? Do you see what I mean ?

Loumona7 commented 9 months ago

By the way huh... An issue. Sometimes it just doesn't work. I talk but no text is appearing at all. :( How can I solve this ?

kappaflow commented 9 months ago

Hello @Loumona7, I am going to try to answer all your questions. But in case it is not clear, feel free to ask for more details.

  1. Do you mean as an app for PC or for a mobile device? Currently, it is made as a web browser extension, because it makes it easier to find, install and use. You can use the speech recognition engine embedded in the web browser basically for free. At the same time, the Edge Browser allows you to use it basically as a stand-alone application. The extension has the instructions on how to do it.

  2. Currently, you have to specify the speech language to make it work correctly. It is difficult for the speech recognition engine to detect if your pronunciation is bad or you just switched the language you are speaking in. A possible workaround is to use a hotkey macro (maybe a stream deck) to hide one overlay and make another one visible instead. Sometimes, the auto-hide option may help, but most of the time it will try to detect some gibberish. A single extension instance can translate only from a single language to another one. The instruction has information on how to run multiple instances of the browser to translate from/to multiple languages.

  3. I have done many tests with the spacing with different languages... Basically, it has the smallest gap between lines and translations possible to make translation chunks differentiable by the spacing and both transcript and translation readable for any language, which is the main goal and the difference from some other similar translation overlays which are very inconvenient to use as a viewer. The issue with the spacing can be resolved by removing the transcript and leaving only the translation, as well as making the text size smaller and the translation chunk smaller. Also, you can display it only for a single language at a time, that will free space on the screen. Other workarounds have some cons, but still, you can disable the interim translation or/and use a subtitle-like displaying style.

  4. You can use the overlay mode to change the text color (and set the transparent background).

  5. I am no sure that I understand what you mean by the "scrolling" here and what behavior would you expect for the old text. It can do the final translation only after the interim speech chunk is finished. The interim translation is designed to avoid the delay. It is not recommended, but you can disable the interim translation and leave only the final translation to save some space. The old text gets "scrolled" down. You can leave only the last translation by changing the number of preserved translations to 1 in the expert options, but most likely you don't need it.

  6. Do you have the issue that it doesn't work in both Edge and Chrome? It is a known issue that sometimes Google speech recognition servers don't work properly... but it works stably in Edge where it uses Microsoft servers. The other possible issue is Google Translate limitations, you can check it on the instruction page while you are experiencing the issue. If the test shows the error, then you might need to use Google Translate (Script) translation method. You can read more here.

Happy New Year and thank you for using the extension and your questions/feedback!

Loumona7 commented 9 months ago

Hello. Thank you for your answer. PLEASE TO NOT CLOSE THIS CONVERSATION.

I want to answer you but am literally sick with anxiety and can't function to answer you.

If you have any way of direct contact (so real time messaging for example, not a forum like here) it would be appreciated. I feel as tho communicating here is like talking through a wall. Just text here doesn't allow me to explain stuff correctly. + I'm autistic + I'm french so right now, let's just say the anxiety is making sleeping impossible.

I would really like to be able to have a better conversation. Please just tell me what you think of that. Thank you for reading this.

Take care. I won't say anything more because I just want this message to be one matter "how can we continue this conversation better". That's why I'm not answering your previous message.

kappaflow commented 9 months ago

@Loumona7 sure, you can contact me by email and tell your discord id. The email you can find on this page: https://github.com/speech-translator-ext

Also, just keep in mind, I accept suggestions and feedback, but can't promise implementing any or a time frame for that.