KU-CVLAB / GaussianTalker

Official implementation of “GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting” by Kyusun Cho, Joungbin Lee, Heeji Yoon, Yeobin Hong, Jaehoon Ko, Sangjun Ahn and Seungryong Kim
Other
189 stars 21 forks source link

Question in data preparation #9

Closed xxSpencer closed 1 month ago

xxSpencer commented 2 months ago

This is a great work!I'm new to this field, so I lack a lot of experience. If you happen to have some time, could you help me? In the Data Preparation section, I tried to run python data_utils/process.py my_dataset/Obama/Obama.mp4, but it gave an error indicating that the track_params.pt file is missing.

After comparing with the directory structure you provided, I found that I'm missing

  1. au.csv
  2. track_params.pt
  3. transformers_train.json
  4. transformers_val.json

Could you please guide me in as much detail as possible on how to obtain the files that I am missing? Thanks a lot!

Here's a printout of my directory structure: (/data2/conda_envs/GaussianTalker) root@my_workspace/gaussiantalker/xq_dataset/Obama# ls -lh total 121M -- aud_ds.npy -- aud_novel.wav -- aud.npy ( seems not included in your directory structure given) -- aud_train.wav -- aud.wav --- bc.jpg -- gt_imgs -- Obama.mp4 -- ori_imgs -- torso_imgs

Wish you all the best

anujsinha72094 commented 2 months ago

To get au,csv you have to run feature extractor in OpenFace

xxSpencer commented 1 month ago

To get au,csv you have to run feature extractor in OpenFace

Thanks for your reply! respect