rmusser01 / tldw

tl/dw (Too Long, Didn't Watch): Your Personal Research Multi-Tool - a naive attempt at 'A Young Lady's Illustrated Primer'
Apache License 2.0
382 stars 12 forks source link

Unable to transcribe #436

Open adrian52999 opened 8 hours ago

adrian52999 commented 8 hours ago

Are You on the Latest version? You did a git pull and are running the latest version/build? Yes

Please describe the bug I am trying to transcribe audio to text but I can not get it to work. No matter what I do, it always crashes. I've tried with multiple files and what ends up happening is that I have bunch of directories in the temp folder with the audio files, and the script keeps crashing after attempting to initialize whisper.

Is the bug reproducable reliably? Yes, at least on my end.

Steps to to reproduce the issue I cannot get transcription to work at all.

Desktop (please complete the following information):

Any Additional context I managed to screenshot the cmd window the last millisecond before it autoclosed and saw this error:

INFO:root:speech-to-text: Starting transcription... INFO:root:Initializing new WhisperModel with size medium on device cuda INFO:faster whisper:Processing audio with duration 12:22.203 Could not locate cudnn_ops64_9.d11. Please make sure it is in your library path! Invalid handle. Cannot load symbol cudnnCreateTensorDescriptor

I have installed cudnn and CUDA drivers from NVIDIA's website, but the script keeps crashing. I tried updating it with no GPU support, and it still crashes. I tried using whisperXXL as stated in the readme, and then it cannot even start, throwing this error:

Default API Endpoint: openai Traceback (most recent call last): File "D:\tldw\summarize.py", line 16, in from App_Function_Libraries. Books. Book_Ingestion_Lib import ingest_folder, ingest text_file File "D:\tldw\App_Function_Libraries\Books\Book_Ingestion_Lib.py", line 31, in from App_Function_Libraries.DB.DB Manager import add_media_with_keywords, add_media to_database File "D:\tldw\App_Function_Libraries\DB\DB_Manager.py", line 15, in from App_Function_Libraries.DB.SQLite_DB import DatabaseError File "D:\tldw\App_Function_Libraries\DB\SQLite_DB.py", line 69, in from App_Function_Libraries.Chunk_Lib import chunk_options, chunk_text File "D:\tldw\App_Function_Libraries\Chunk_Lib.py", line 20, in from transformers import GPT2Tokenizer File "D:\tldw\venv\Lib\site-packages\transformers\ init .py", line 26, in from . import dependency_versions_check File "D:\tldw\venv\Lib\site-packages\transformers\dependency_versions_check.py", line 16, in from .utils.versions import require_version, require_version_core File "D:\tldw\venv\Lib\site-packages\transformers\utils__init .py", line 27, in from . chat_template_utils import DocstringParsingException, TypeHintParsingException, get_json_schema File "D:\tldw\venv\Lib\site-packages\transformers\utils\chat_template_utils.py", line 39, in from torch import Tensor File "D:\tldw\venv\Lib\site-packages\torch__init .py", line 130, in raise err 05Error: [WinError 127] The specified procedure could not be found. Error loading "D:\tldw\venv\Lib\site-packag es\torch\lib\cudnn_cnn_train64_8.d11" or one of its dependencies.

I installed with the curl -0 method and tried using medium, medium.en and small whisper models. None work. Please help, I've been trying to get it to work for many hours and I still can't.

rmusser01 commented 2 hours ago

Thanks for filling this, sorry you’re having issues.

problem is that your python env isn’t seeing the CUDA installation. There are two possible fixes, one is to change your system ENV path to include your cuda installation, and the other is to drop the two cuda DLL files into the tldw directory.

On the front page/README there is a section ‘Potential issues’, that section has information on how to resolve this issue. Please try following those and then let me know if that resolved your problem. Specifically:

https://github.com/Purfview/whisper-standalone-win/releases

download that, extract the two Dlls, and copy/paste them into the ‘tldw’ folder and the error should go away. Apologies if you already did so, if you did, did you copy two files, cudnnops and cudnninfer? I will rewrite the docs regarding this later today after work.