JeremyCJM / DiffSHEG

[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
https://jeremycjm.github.io/proj/DiffSHEG/
BSD 3-Clause "New" or "Revised" License
112 stars 9 forks source link

Visualization results #2

Open yangdaowu opened 4 months ago

yangdaowu commented 4 months ago

Hello, I found that the video results visualized using the method you provided are different from the demonstration, with facial expressions and posture movements in different regions. Can you tell me how to run the training and testing code

JeremyCJM commented 4 months ago

Visualization

yangdaowu commented 4 months ago

Thank you for providing the training command. I encountered the following issues when using this command: No such file or directory: 'data/BEAT/beat_cache/beat_4english_15_141/weights/GesAxisAngle_Face_300.bin'

Is this because the dataset was not preprocessed, using the preprocessingipynb file provided in BEAT.

JeremyCJM commented 4 months ago

The "GesAxisAngle_Face_300.bin" is an autoencoder checkpoint for computing the Frechet Distance metrics. I will upload the autoencoder checkpoints later. You can temporally comment them out without computing Frechet Distance metrics during training.

yangdaowu commented 4 months ago

I encountered a new issue using ae_100.bin in Camn. Traceback (most recent call last): File "/home/ydw/anaconda3/envs/Talko/lib/python3.8/site-packages/torch/multiprocessing/spawn.py", line 74, in _wrap fn(i, *args) File "/media/ydw/sda/PycharmProjects/Diffsheg/runner.py", line 303, in main_worker train_dataset = import(f"datasets.{opt.dataset_name}", fromlist=["something"]).BeatDataset(opt, "train") File "/media/ydw/sda/PycharmProjects/Diffsheg/datasets/beat.py", line 116, in init self.aud_lmdb_env = lmdb.open(self.aud_feat_path, readonly=True, lock=False) lmdb.Error: data/BEAT/beat_cache/beat_4english_15_141/train/aud_feat_cache/hubert_large_ls960_ft: No such file or directory

JeremyCJM commented 4 months ago

This is the lmdb directory for precomputed HuBERT features. You can refer to the function below to create the HuBERT feature cache for training and testing audios. https://github.com/JeremyCJM/DiffSHEG/blob/3ebf3058f48cba3da9146afb7623e9ec1ab9e9a5/trainers/ddpm_beat_trainer.py#L1430

lovemino commented 2 months ago

Thank you for this awesome work! I encountered the following issues when using this command: No such file: ges_axis_angle_300.bin I noticed that the metrics in the experimental results you provided differ from those in Beat. I am wondering if you used different MotionAutoencoder for testing. If so, could you kindly provide the autoencoder checkpoint file? I would greatly appreciate it.

JeremyCJM commented 2 months ago

Hi @lovemino and @yangdaowu , you can find all the autoencoder weights here: https://drive.google.com/file/d/1Wm2WMlacwStFaciCh7UlhQeyA3E2yEnj/view?usp=sharing . Note that autoencoders are only computing features for Frechet Distances.

ylhua commented 1 month ago

Hi, could you please offer the code related to visualizing BEAT motion in UE?