Closed KihongK closed 4 weeks ago
Hello, I've run into some problems and would like to discuss them with you. Do you have WeChat?
Sorry, I don't have a WeChat account.
Were you able to train normally after reducing the batch_size as I suggested yesterday? Does your trained model produce normal inference results?
This issue was created before we communicated 😀
Another error has occurred since then, and I'm working on resolving it (I haven't trained the model yet 😇)
[2024-07-26 08:21:23] #training batch: 16, #training sample: 16, #non empty bucket: 7
[2024-07-26 08:21:23] Building models...
/home/ac01-kkhong/miniconda3/envs/opensora-inf/lib/python3.9/site-packages/huggingface_hub/file_download.py:1150: FutureWarning: `resume_download` is deprecated and will be removed in version 1.0.0. Downloads always resume when possible. If you want to force a new download, use `force_download=True`.
warnings.warn(
Loading checkpoint shards: 100%
This issue is stale because it has been open for 7 days with no activity.
This issue was closed because it has been inactive for 7 days since being marked as stale.
I tried to test training with sample data.
My dataset:
I downloaded videos from https://sample-videos.com/ and followed data_processing up to step 3.2, "Filter by aesthetic scores" (I stopped there because I ran into an issue with caption generation).
I just want to test training, so I wrote the captions myself.
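Since the captions were written by hand, a quick check of the CSV before training can rule out blank captions as a cause. The following is only a minimal sketch: the column names `path` and `text`, the helper name `find_rows_missing_caption`, and the sample rows are assumptions for illustration, not something confirmed in this thread.

```python
import csv
import io


def find_rows_missing_caption(csv_file, caption_column="text"):
    """Return 1-based data-row numbers whose caption is missing or blank.

    `caption_column` is an assumed column name; adjust it to match the CSV.
    """
    reader = csv.DictReader(csv_file)
    missing = []
    for row_number, row in enumerate(reader, start=1):
        # row.get() returns None when the column or value is absent
        caption = (row.get(caption_column) or "").strip()
        if not caption:
            missing.append(row_number)
    return missing


# Hypothetical two-row sample standing in for the real merged CSV.
sample = io.StringIO(
    "path,text\n"
    "/data/a.mp4,A dog runs on a beach\n"
    "/data/b.mp4,\n"
)
print(find_rows_missing_caption(sample))
```

To run the same check on the real file, open it with `open(csv_path, newline="", encoding="utf-8")` and pass the file object to the helper.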
Then I ran the training script:
torchrun --standalone --nproc_per_node 1 scripts/train.py configs/opensora-v1-2/train/stage1.py --data-path /home/hed/Open-Sora/merged_file.csv
I don't think it trained normally. Could you kindly advise me on how to fix this issue?