hayabhay / frogbase

Transform audio-visual content into navigable knowledge.
https://frogbase.dev
MIT License
781 stars 95 forks source link

Problem after update. #15

Closed Cate9021 closed 1 year ago

Cate9021 commented 1 year ago

filenotfound

hayabhay commented 1 year ago

Interesting. Could you provide more context around this.

  1. What was the format of the file uploaded?
  2. What is the Python Env?
Cate9021 commented 1 year ago

what i test for format is : youtube link, wav, mp4 so, I don't think is file format problem. I use anaconda, here is the environment.yml: `name: whisper channels:

Cate9021 commented 1 year ago

here is the log that occur, it may help to figure out what's wrong: 2023-02-08 20:02:01,326 INFO sqlalchemy.engine.Engine BEGIN (implicit) 2023-02-08 20:02:01.326 BEGIN (implicit) 2023-02-08 20:02:01,327 INFO sqlalchemy.engine.Engine PRAGMA main.table_info("media") 2023-02-08 20:02:01.327 PRAGMA main.table_info("media") 2023-02-08 20:02:01,327 INFO sqlalchemy.engine.Engine [raw sql] () 2023-02-08 20:02:01.327 [raw sql] () 2023-02-08 20:02:01,328 INFO sqlalchemy.engine.Engine PRAGMA main.table_info("transcript") 2023-02-08 20:02:01.328 PRAGMA main.table_info("transcript") 2023-02-08 20:02:01,328 INFO sqlalchemy.engine.Engine [raw sql] () 2023-02-08 20:02:01.328 [raw sql] () 2023-02-08 20:02:01,328 INFO sqlalchemy.engine.Engine PRAGMA main.table_info("segment") 2023-02-08 20:02:01.328 PRAGMA main.table_info("segment") 2023-02-08 20:02:01,328 INFO sqlalchemy.engine.Engine [raw sql] () 2023-02-08 20:02:01.328 [raw sql] () 2023-02-08 20:02:01,329 INFO sqlalchemy.engine.Engine COMMIT 2023-02-08 20:02:01.329 COMMIT 2023-02-08 20:02:01,736 INFO sqlalchemy.engine.Engine BEGIN (implicit) 2023-02-08 20:02:01.736 BEGIN (implicit) 2023-02-08 20:02:01,739 INFO sqlalchemy.engine.Engine SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:01.739 SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:01,740 INFO sqlalchemy.engine.Engine [generated in 0.00055s] (10, 0) 2023-02-08 20:02:01.740 [generated in 0.00055s] (10, 0) 2023-02-08 20:02:12,361 INFO sqlalchemy.engine.Engine SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:12.361 SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:12,363 INFO sqlalchemy.engine.Engine [cached since 10.62s ago] (10, 0) 2023-02-08 20:02:12.363 [cached since 10.62s ago] (10, 0) 2023-02-08 20:02:15,336 INFO sqlalchemy.engine.Engine SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:15.336 SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:15,336 INFO sqlalchemy.engine.Engine [cached since 13.6s ago] (10, 0) 2023-02-08 20:02:15.336 [cached since 13.6s ago] (10, 0) 2023-02-08 20:02:24,441 INFO sqlalchemy.engine.Engine SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:24.441 SELECT media.source_type AS media_source_type, media.source_name AS media_source_name, media.source_link AS media_source_link, media.filepath AS media_filepath, media.duration AS media_duration, media.id AS media_id, media.created AS media_created, media.updated AS media_updated FROM media JOIN transcript ON media.id = transcript.media_id ORDER BY media.created DESC LIMIT ? OFFSET ? 2023-02-08 20:02:24,442 INFO sqlalchemy.engine.Engine [cached since 22.7s ago] (10, 0) 2023-02-08 20:02:24.442 [cached since 22.7s ago] (10, 0) C:\Users\PCUL\.conda\envs\whisper\lib\site-packages\whisper\transcribe.py:78: UserWarning: FP16 is not supported on CPU; using FP32 instead warnings.warn("FP16 is not supported on CPU; using FP32 instead") 2023-02-08 20:02:49.533 Uncaught app exception

hayabhay commented 1 year ago

I added an environment.yml file. Please try creating a new conda environment with it to see if it helps. I couldn't reproduce the error on my machine (Ubuntu) and I wonder if it is a Windows quirk. There was another windows related issue that was fixed as well so they may be related.

Cate9021 commented 1 year ago

just tried it, turn out there have no windows ncurses. errorwhisper

hayabhay commented 1 year ago

ah! I expected a new conda to install ncurses as well. For context, I installed it off a miniconda installation on Ubuntu 22.04

Cate9021 commented 1 year ago

One more that can't be install to windows. would like to know do you have intent to make this work on windows? image then maybe I can help you figure out one by one when I'm free, however if you don't have the intent to make this work on window, then I don't want to bother you with this. I can totally understand the pain to make different platform work.

hayabhay commented 1 year ago

This does seem confusing since other Issues raised have this running on Windows successfully (see #16). So I'm unsure if this is Windows specifically or some issue with the environment that might be unique to what is installed in your system (perhaps windows version or other libraries?)

Since the issue is primarily with Whisper, one way is to see if you can get Whisper working directly from their repo. If this works, I'm guessing this repo should work too.

The code in this repo is very skinny and uses mostly platform independent code (there may be some bugs like with #16). Since I don't have a windows machine, it is hard for me to reproduce but I'm happy to accept PRs if there is any specific dependency that needs to be updated to support windows.

Cate9021 commented 1 year ago

Finally make it work! Problem solved.

hayabhay commented 1 year ago

Thanks for the PR! Was it ffmpeg or were there issues with the emoji in the filename?

Cate9021 commented 1 year ago

it was: ffmpeg, pip install setuptools-rust. from my testing, emoji do cause issues if not going through conda env, but it's okay in conda.