cyberofficial / Synthalingua

Synthalingua - Real Time Translation
http://synthalingua.site/
GNU General Public License v3.0
188 stars 16 forks source link

Everything runs but.... #65

Closed ZeChArtiahSaher closed 7 months ago

ZeChArtiahSaher commented 7 months ago

I get something along the lines of

Audio stream detected...
Given groups=1, weight of size [1280, 128, 3], expected input[1, 80, 3000] to have 128 channels, but got 80 channels instead

Args used:

python "transcribe_audio.py" --ram 12gb --non_english --translate --energy_threshold 250

However it works with 4gb model. 3090 rtx so not a vram issue

cyberofficial commented 7 months ago

Thanks for the report, I'll look into it.

cyberofficial commented 7 months ago

I may have found the issue, and will experiment with some fixes.

Are you using the master branch or portable build?

ZeChArtiahSaher commented 7 months ago

Cloned master yesterday. Windows, virtualenv w/ python 3.10.11. Maybe I should be using 3.10.9? :D

ZeChArtiahSaher commented 7 months ago

yeah no , same thing on 3.10.9

cyberofficial commented 7 months ago

Are you using any sort of program or software for the microphone?

You'll need to make the script is designated to a microphone with the --set_microphone # command.

opera_80vfVJMddW

Example python "transcribe_audio.py" --ram 12gb --set_microphone 8 --non_english --translate will set it to nvidia broadcast best results are lower IDs of the device you want.

I'm unable to replicate the issue via code or force replicate via code, the given error seems to be that your microphone is out of range for the program to pick up.

ZeChArtiahSaher commented 7 months ago

I got a loopback off an ssl 2+ audio interface. I suspect it's not the problem cuz again, audio streaming works just fine with small/medium whisper models.

-------- Original Message -------- On Nov 19, 2023, 20:02, Joe K. wrote:

Are you using any sort of program or software for the microphone?

You'll need to make the script is designated to a microphone with the --set_microphone # command.

opera_80vfVJMddW

Example python "transcribe_audio.py" --ram 12gb --set_microphone 8 --non_english --translate will set it to nvidia broadcast best results are lower IDs of the device you want.

— Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: @.***>

cyberofficial commented 7 months ago

In your model's folder do you have large-v2.pt ?

ZeChArtiahSaher commented 7 months ago

Large-v3 actually so I'm suppose to be on v2?

cyberofficial commented 7 months ago

Could you download the latest dev build and see if this fixes the issue

ZeChArtiahSaher commented 7 months ago

yeah that fixes, also so much faster, idk if that's the model or just the branch

cyberofficial commented 7 months ago

Thanks for testing.