Name FFHQBlindJointDataset is not found, use name: FFHQBlindJointDataset_basicsr!
rank0: Traceback (most recent call last):
rank0: File "basicsr/train.py", line 220, in
rank0: File "basicsr/train.py", line 140, in train_pipeline
rank0: result = create_train_val_dataloader(opt, logger)
rank0: File "basicsr/train.py", line 83, in create_train_val_dataloader
rank0: train_set = build_dataset(dataset_opt)
rank0: File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/basicsr/data/init.py", line 38, in build_dataset
rank0: dataset = DATASET_REGISTRY.get(dataset_opt['type'])(dataset_opt)
rank0: File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/basicsr/utils/registry.py", line 71, in get
rank0: raise KeyError(f"No object named '{name}' found in '{self._name}' registry!")
E0719 09:56:05.516689 139715360970560 torch/distributed/elastic/multiprocessing/api.py:826] failed (exitcode: 1) local_rank: 0 (pid: 1325241) of binary: /home/cvbl/miniconda3/envs/codeFormer/bin/python
Traceback (most recent call last):
File "/home/cvbl/miniconda3/envs/codeFormer/bin/torchrun", line 8, in
sys.exit(main())
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/elastic/multiprocessing/errors/init.py", line 347, in wrapper
return f(*args, **kwargs)
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 879, in main
run(args)
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 870, in run
elastic_launch(
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 132, in call
return launch_agent(self._config, self._entrypoint, list(args))
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 263, in launch_agent
raise ChildFailedError(
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
logger:[ print_freq: 100 save_checkpoint_freq: 5000.0 use_tb_logger: True wandb:[ project: None resume_id: None ] ] dist_params:[ backend: nccl port: 29413 ] find_unused_parameters: True root_path: /path/to/your/project/root is_train: True dist: True rank: 0 world_size: 1
Name FFHQBlindJointDataset is not found, use name: FFHQBlindJointDataset_basicsr! rank0: Traceback (most recent call last): rank0: File "basicsr/train.py", line 220, in
rank0: File "basicsr/train.py", line 140, in train_pipeline rank0: result = create_train_val_dataloader(opt, logger) rank0: File "basicsr/train.py", line 83, in create_train_val_dataloader rank0: train_set = build_dataset(dataset_opt) rank0: File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/basicsr/data/init.py", line 38, in build_dataset rank0: dataset = DATASET_REGISTRY.get(dataset_opt['type'])(dataset_opt) rank0: File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/basicsr/utils/registry.py", line 71, in get rank0: raise KeyError(f"No object named '{name}' found in '{self._name}' registry!")
E0719 09:56:05.516689 139715360970560 torch/distributed/elastic/multiprocessing/api.py:826] failed (exitcode: 1) local_rank: 0 (pid: 1325241) of binary: /home/cvbl/miniconda3/envs/codeFormer/bin/python Traceback (most recent call last): File "/home/cvbl/miniconda3/envs/codeFormer/bin/torchrun", line 8, in
sys.exit(main())
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/elastic/multiprocessing/errors/init.py", line 347, in wrapper
return f(*args, **kwargs)
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 879, in main
run(args)
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/run.py", line 870, in run
elastic_launch(
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 132, in call
return launch_agent(self._config, self._entrypoint, list(args))
File "/home/cvbl/miniconda3/envs/codeFormer/lib/python3.8/site-packages/torch/distributed/launcher/api.py", line 263, in launch_agent
raise ChildFailedError(
torch.distributed.elastic.multiprocessing.errors.ChildFailedError:
basicsr/train.py FAILED
Can Someone tell how to fix this ?