sign-language-processing / transcription

Text to pose model for sign language pose generation from a text sequence
35 stars 16 forks source link

ModuleNotFoundError: No module named 'pose_to_video.stylegan3' #29

Closed hacker009-sudo closed 7 months ago

hacker009-sudo commented 11 months ago

@AmitMY , please help me to resolve this issue

AmitMY commented 11 months ago

Please fill in a proper issue - specifically:

  1. What commands are you running?
  2. What is the output?

In general, the stylegan3 code is still work-in-progress, but if I understand what you are trying to do I might be able to help.

hacker009-sudo commented 11 months ago

@AmitMY , I am following the documentation to run the code for pose_to_video and running the below command on colab:

!python /content/transcription/pose_to_video/bin.py --model=stylegan3 --pose=sign.pose --video=sign.mp4

Issue:

Loading input pose ... Generating video ... Traceback (most recent call last): File "/content/transcription/pose_to_video/bin.py", line 58, in main() File "/content/transcription/pose_to_video/bin.py", line 33, in main module = importlib.import_module(f"pose_to_video.{args.model}") File "/usr/lib/python3.10/importlib/init.py", line 126, in import_module return _bootstrap._gcd_import(name[level:], package, level) File "", line 1050, in _gcd_import File "", line 1027, in _find_and_load File "", line 1004, in _find_and_load_unlocked ModuleNotFoundError: No module named 'pose_to_video.stylegan3'

I have even tried with pix_to_pix but same issue:

ModuleNotFoundError: No module named 'pose_to_video.pix_to_pix'

Request you to please help

AmitMY commented 11 months ago

Thank you for your explanation. It is now fixed, and the commands do run, for example:

pose_to_video --model=pix_to_pix --pose=sign.pose --video=sign.mp4 --upscale

Please understand however, that what you are running is under "Next steps" which is NOT ready. For example, this pix_to_pix model yields very bad results, and I plan to rework it in the next month or so, as well as the stylegan model.


EDIT: Sorry, this isn't working right now actually because of too-large model files. Instead of fixing those right now with LFS, since the output quality is anyway very very bad, I will fix them later differently.

hacker009-sudo commented 11 months ago

Thanks for the explanation. Will wait for the final implementation.

AmitMY commented 10 months ago

I had some time yesterday to re-train and fix some issues https://github.com/sign-language-processing/transcription/commit/a308b5868f16290623f3f0f408b6a151905e0e82

The output is now more reasonable. I'll let it train for another day or two, and then deploy it

https://github.com/sign-language-processing/transcription/assets/5757359/2080da8f-3498-4ce5-9627-09ee553b62fe

AmitMY commented 7 months ago

Pose to video moved to https://github.com/sign-language-processing/pose-to-video