Closed RandomInternetPreson closed 5 months ago
This issue has been closed due to inactivity for 2 months. If you believe it is still relevant, please leave a comment below. You can tag a developer in your comment.
@oobabooga still not closed unfortunately
Describe the bug
I originally made this issue: https://github.com/oobabooga/text-generation-webui/issues/5850
Then I saw this commit: https://github.com/oobabooga/text-generation-webui/pull/5856#issuecomment-2073426890
I downloaded this commit of textgen: https://github.com/oobabooga/text-generation-webui/commits/main/
I tried whisper in firefox, it seemed to work.
Then I tried longer conversations and superboogav2 and started to encounter a lot of issues:
Issues I am having:
Is there an existing issue for this?
Reproduction
Download the latest repo: https://github.com/oobabooga/text-generation-webui/commits/main/
Install with start_linux.sh
Run update_wizard_linux.sh, select "B" to install extensions that come with textgen
Start textgen
In sessions tab, select whisper_stt Then superboogav2 OR superboogav2 Then whisper_stt (I realize that sometimes issues with loading multiple extensions can be alieviated by changing the load sequence)
Load your model
Go to the instruct Mode (occasionally get a disconnect error here in my browser, I randomly get them while loading models, using extensions, and doing nothing in particular; textgen does not crash and the UI elements look like they are working still without refreshing the browser page) I did not see this behavior prior to the major gradio update.
Load medium.en as the model; a note here: I have never experience transcription errors when using this model with the previous gradio version. I was constantly astonished that it could trasribe extreamly long ramblings with perfect precision, I say this to demonstrate that there is a stark constrast in the transcription quality when oobaboogav2 is being used at the same time. It's difficult for me to fully describe this issue with screenshots.
Spend some time talking to the model, yes the first go around will probably work, maybe even 2 or 3 but 5 and beyond glitches and errors keep occurring and you will notice a loss in transcription quality, say words that might rhyme like "pool" and it will transcribe as "fool"; this never has happened before. I have a set of tests I conduct on new instances of textgen and have never encountered these transcription questions before when trying to reference my "Radium Pool" document using superboogav2.
Screenshot
Logs
System Info