Hi @AmitMY , I'm getting errors when trying to run the script following the steps from the file.

I'm at a point where I have the and files for my custom dataset (which is just a single video and its corresponding .pose file for now). I then ran the script in the controlnet folder to prepare these as a Hugging Face dataset, and the script wrote all the necessary files to HF_DATASET_DIR. However, when I run ! accelerate launch diffusers/examples/controlnet/ ... with --training_data_dir="$HF_DATASET_DIR", I get the following error:

[rank0]: Traceback (most recent call last):
[rank0]:   File "/root/pose_to_video/pose-to-video/pose_to_video/conditional/controlnet/diffusers/examples/controlnet/", line 1187, in <module>
[rank0]:     main(args)
[rank0]:   File "/root/pose_to_video/pose-to-video/pose_to_video/conditional/controlnet/diffusers/examples/controlnet/", line 923, in main
[rank0]:     train_dataset = make_train_dataset(args, tokenizer, accelerator)
[rank0]:                     ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
[rank0]:   File "/root/pose_to_video/pose-to-video/pose_to_video/conditional/controlnet/diffusers/examples/controlnet/", line 639, in make_train_dataset
[rank0]:     raise ValueError(
[rank0]: ValueError: `--image_column` value 'image' not found in dataset columns. Dataset columns are: _data_files, _fingerprint, _format_columns, _format_kwargs, _format_type, _output_all_columns, _split

Would you be able to advise? Thanks!

AmitMY commented 3 months ago

I would try loading the dataset with a Hugging Face dataset loader and inspecting it to see what is going on. The dataset should have these columns, and not the ones listed in your error:
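
A minimal sketch of that inspection, assuming the preparation script wrote the directory with `Dataset.save_to_disk` (the thread does not confirm this). The column names in the error appear to match the keys of the `state.json` metadata file that `save_to_disk` writes, so one possibility is that the metadata file was parsed as data. The expected column names other than `image` and the helper names are assumptions for illustration.

```python
def missing_columns(column_names, expected):
    """Return the expected columns that are absent from the dataset."""
    return [name for name in expected if name not in column_names]


def inspect_dataset(dataset_dir, expected=("image", "conditioning_image", "text")):
    """Load a dataset directory and print its columns.

    Assumes the directory was written by `Dataset.save_to_disk`.
    The `expected` defaults are assumed column names, not confirmed ones.
    """
    # Imported lazily so the pure helper above works without the package.
    from datasets import DatasetDict, load_from_disk

    dataset = load_from_disk(dataset_dir)
    # A DatasetDict holds one Dataset per split; inspect the first split.
    if isinstance(dataset, DatasetDict):
        dataset = dataset[next(iter(dataset.keys()))]
    print("columns:", dataset.column_names)
    print("rows:", len(dataset))
    print("missing:", missing_columns(dataset.column_names, expected))
    return dataset


# Usage (path is a placeholder):
# inspect_dataset("path/to/HF_DATASET_DIR")
```

If `load_from_disk` shows the expected columns while the training script does not, the difference is likely in how the directory is being loaded rather than in its contents.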

sparkkid1234 commented 3 months ago

@AmitMY would you be able to share your datasets module version, and what the expected output files in HF_DATASET_DIR are after I run the script? Thanks!

AmitMY commented 3 months ago

The directory I processed was in scratch storage, and so it was removed a while back.

The environment I used:

sparkkid1234 commented 3 months ago

Thanks @AmitMY, let me check. Another quick question: is there a recommended way to run pose-to-video/data/BIU-MG/ for multiple videos, so that my custom dataset is larger than a single video? I'm assuming I can change the write mode for the zipfile to 'a' instead of 'w' as in the code, and run the script separately for each video while pointing to the same output zips? Thanks!
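
A rough sketch of the append-mode idea, using only the standard `zipfile` module. The function name, the entry naming scheme, and the `.png` extension are hypothetical and not taken from the repository's script. One thing to watch for: in append mode, writing an entry whose name already exists adds a duplicate entry (with a warning) rather than overwriting it, so the sketch prefixes each entry with the video name.

```python
import zipfile


def append_frames(zip_path, video_name, frames):
    """Append encoded frames to zip_path, creating the archive if needed.

    `frames` is an iterable of (frame_index, bytes) pairs. Entry names are
    prefixed with the video name so frames from different videos stay unique.
    """
    # Mode "a" appends to an existing archive and creates a missing one.
    with zipfile.ZipFile(zip_path, "a") as archive:
        for index, data in frames:
            archive.writestr(f"{video_name}_{index:06d}.png", data)
```

Calling `append_frames` once per video against the same `zip_path` accumulates all frames in one archive.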

AmitMY commented 3 months ago

I guess that could work. My recommended solution, though, would be to modify the code itself to take a directory of videos and a directory of poses with the same names, then iterate over them.
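
The directory-based approach could be sketched roughly as follows. It pairs files by their stem (the name without its extension). The function name and the `.mp4` default are assumptions; only the `.pose` extension comes from this thread.

```python
from pathlib import Path


def pair_videos_with_poses(video_dir, pose_dir, video_suffix=".mp4"):
    """Return sorted (video, pose) path pairs that share a file stem."""
    poses = {path.stem: path for path in Path(pose_dir).glob("*.pose")}
    pairs = []
    for video in sorted(Path(video_dir).glob(f"*{video_suffix}")):
        pose = poses.get(video.stem)
        if pose is None:
            # Skip videos without a matching pose file instead of failing.
            print(f"skipping {video.name}: no matching .pose file")
            continue
        pairs.append((video, pose))
    return pairs


# Usage (process_one is a hypothetical per-video function):
# for video, pose in pair_videos_with_poses("videos/", "poses/"):
#     process_one(video, pose)
```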

sparkkid1234 commented 3 months ago

Hey @AmitMY, I've got the code working by downgrading datasets to the same version as yours. It seems there was a breaking change.
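
For anyone comparing environments, a small sketch of checking the installed version against a pinned one with the standard library. The helper names are hypothetical, and the pinned version string must be supplied by the caller, since the exact version is not stated in this thread.

```python
from importlib import metadata


def installed_version(package):
    """Return the installed version string, or None if not installed."""
    try:
        return metadata.version(package)
    except metadata.PackageNotFoundError:
        return None


def matches_pin(package, pinned):
    """True when the installed version equals the pinned string exactly."""
    return installed_version(package) == pinned


# Usage (replace the placeholder with the version from the working setup):
# print(installed_version("datasets"))
# print(matches_pin("datasets", "<pinned version>"))
```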

One last question on this issue. The download link for the BIU-MG dataset is no longer valid, so can I ask how long the video used to train the controlnet model was? Or, even better, how many frames it had in total? Thank you, I'll close this issue after that.

AmitMY commented 3 months ago

that's great! wanna add a change to the controlnet README or setup?

The original video was 30 minutes at 30fps, if I am not mistaken. It was recorded with a green screen, which was then keyed out and replaced with a single green color (there were lighting differences on the green screen, and they are easier to key out than to learn).
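
If the recalled figures are right, the total frame count follows from simple arithmetic. A small sketch (the function name is hypothetical):

```python
def total_frames(minutes, fps):
    """Number of frames in a video of the given length and frame rate."""
    return int(minutes * 60 * fps)


print(total_frames(30, 30))  # 30 min * 60 s/min * 30 frames/s = 54000
```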

sparkkid1234 commented 3 months ago

will do!