Kwai-Kolors / Kolors

Kolors Team
Apache License 2.0
3.92k stars 277 forks source link

pydantic.errors.PydanticUserError: If you use `@root_validator` with pre=False (the default) you MUST specify `skip_on_failure=True`. Note that `@root_validator` is deprecated and should be replaced with `@model_validator`. #150

Open lvsh2012 opened 2 months ago

lvsh2012 commented 2 months ago

/mnt/conda/envs/kolors/lib/python3.8/site-packages/pydantic/_internal/_config.py:341: UserWarning: Valid config keys have changed in V2:

You may be able to resolve this warning by setting model_config['protected_namespaces'] = (). warnings.warn( /mnt/conda/envs/kolors/lib/python3.8/site-packages/pydantic/_internal/_config.py:341: UserWarning: Valid config keys have changed in V2:

For further information visit https://errors.pydantic.dev/2.9/u/root-validator-pre-skip ERROR:torch.distributed.elastic.multiprocessing.api:failed (exitcode: 1) local_rank: 0 (pid: 87663) of binary: /mnt/conda/envs/kolors/bin/python Traceback (most recent call last): File "/mnt/conda/envs/kolors/bin/accelerate", line 8, in sys.exit(main()) File "/mnt/conda/envs/kolors/lib/python3.8/site-packages/accelerate/commands/accelerate_cli.py", line 47, in main args.func(args) File "/mnt/conda/envs/kolors/lib/python3.8/site-packages/accelerate/commands/launch.py", line 1008, in launch_command deepspeed_launcher(args) File "/mnt/conda/envs/kolors/lib/python3.8/site-packages/accelerate/commands/launch.py", line 724, in deepspeed_launcher distrib_run.run(args) File "/mnt/conda/envs/kolors/lib/python3.8/site-packages/torch/distributed/run.py", line 753, in run elastic_launch( File "/mnt/conda/envs/kolors/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 132, in call return launch_agent(self._config, self._entrypoint, list(args)) File "/mnt/conda/envs/kolors/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 246, in launch_agent raise ChildFailedError( torch.distributed.elastic.multiprocessing.errors.ChildFailedError:

dreambooth/train_dreambooth_lora.py FAILED

Failures:

------------------------------------------------------------ Root Cause (first observed failure): [0]: time : 2024-09-26_14:30:29 host : dl rank : 0 (local_rank: 0) exitcode : 1 (pid: 87663) error_file: traceback : To enable traceback see: https://pytorch.org/docs/stable/elastic/errors.html ============================================================
Deng-Xian-Sheng commented 2 weeks ago

update deepspeed

pip install --uxxxx deepspeed