Training PPYOLOE+ on a custom dataset gets stuck on GPU #758

🐞 Describe the bug

On my custom dataset, YOLOv5 and RTMDet train without problems; only PPYOLOE+ fails to train. My config file:

_base_ = './configs/ppyoloe/'
data_root = '/data/'

# Path of train annotation file
train_ann_file = 'annotations/train.json'
train_data_prefix = 'images/'  # Prefix of train image path

# Path of val annotation file
val_ann_file = 'annotations/val.json'
val_data_prefix = 'images/'  # Prefix of val image path

class_name = ("a", "b")
num_classes = len(class_name)  # Number of classes for classification

# Batch size of a single GPU during training
train_batch_size_per_gpu = 16

# Worker to pre-fetch data for each single GPU during training
train_num_workers = 10

# persistent_workers must be False if num_workers is 0.
persistent_workers = True

metainfo = dict(classes=class_name, palette=[(220, 20, 60)])

model = dict(
    bbox_head=dict(head_module=dict(num_classes=num_classes)),
    train_cfg=dict(
        initial_assigner=dict(num_classes=num_classes),
        assigner=dict(num_classes=num_classes)))

train_dataloader = dict(
    batch_size=train_batch_size_per_gpu,
    num_workers=train_num_workers,
    persistent_workers=persistent_workers,
    dataset=dict(
        data_root=data_root,
        ann_file=train_ann_file,
        metainfo=metainfo,
        data_prefix=dict(img=train_data_prefix),
        filter_cfg=dict(filter_empty_gt=True, min_size=32)))

val_dataloader = dict(
    dataset=dict(
        metainfo=metainfo,
        data_root=data_root,
        ann_file=val_ann_file,
        data_prefix=dict(img=val_data_prefix),
        test_mode=True))

test_dataloader = val_dataloader

val_evaluator = dict(ann_file=data_root + 'annotations/test.json')
test_evaluator = val_evaluator
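One way to narrow this down before training might be to check that the config values agree with the annotation file. The following is only a minimal sketch: `check_dataset_config` is a hypothetical helper, not part of MMYOLO, and it assumes the annotation files are in COCO JSON format with `categories` and `annotations` lists. Nothing in this thread confirms that such a mismatch is what causes the hang.

```python
import json


def check_dataset_config(ann_path, class_names, palette=None):
    """Hypothetical helper: return a list of mismatches between config and annotations.

    An empty list means none of the checks below found a problem.
    Assumes ann_path points to a COCO-style JSON file.
    """
    with open(ann_path, "r", encoding="utf-8") as f:
        coco = json.load(f)

    problems = []
    categories = coco.get("categories", [])

    # Every class named in the config should exist in the annotation file.
    ann_names = {c["name"] for c in categories}
    missing = [n for n in class_names if n not in ann_names]
    if missing:
        problems.append(f"classes not found in annotation categories: {missing}")

    # The palette, if given, should provide one color per class.
    if palette is not None and len(palette) != len(class_names):
        problems.append(
            f"palette has {len(palette)} colors for {len(class_names)} classes"
        )

    # At least one image should carry a box with a known category id.
    known_ids = {c["id"] for c in categories}
    images_with_boxes = {
        a["image_id"]
        for a in coco.get("annotations", [])
        if a.get("category_id") in known_ids
    }
    if not images_with_boxes:
        problems.append("no image has an annotation with a known category id")

    return problems
```

With the values from the config above, it could be called as `check_dataset_config('/data/annotations/train.json', ("a", "b"), [(220, 20, 60)])`. For that input the palette check would report one color for two classes; whether that matters for training is not established here.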


Additional information

hhaAndroid commented 1 year ago

@wang002 Can you run properly? If so, please refer to this configuration for changes

Yuanyang-Zhu commented 1 year ago

@wang002 Can you run properly? If so, please refer to this configuration for changes
