[BUG] Autotrain Object Detection Error: KeyError: 'autotrain_label'

rileybolen commented 1 month ago

Prerequisites

[X] I have read the documentation.
[X] I have checked other issues for similar problems.

Backend

Hugging Face Space/Endpoints

Interface Used

UI

CLI Command

No response

UI Screenshots & Parameters

Screenshot 2024-05-22 at 8 01 39 AM

Error Logs

Downloading data: 0%| | 0/802 [00:00<?, ?files/s] Downloading data: 100%|██████████| 802/802 [00:00<00:00, 17761.12files/s]

Downloading data: 0%| | 0/203 [00:00<?, ?files/s] Downloading data: 100%|██████████| 203/203 [00:00<00:00, 21729.93files/s]

Generating train split: 0 examples [00:00, ? examples/s] Generating train split: 629 examples [00:00, 6266.56 examples/s] Generating train split: 799 examples [00:00, 6202.62 examples/s]

Generating validation split: 0 examples [00:00, ? examples/s] Generating validation split: 200 examples [00:00, 5944.98 examples/s]

Saving the dataset (0/1 shards): 0%| | 0/799 [00:00<?, ? examples/s] Saving the dataset (0/1 shards): 100%|██████████| 799/799 [00:00<00:00, 5077.60 examples/s] Saving the dataset (1/1 shards): 100%|██████████| 799/799 [00:00<00:00, 5077.60 examples/s] Saving the dataset (1/1 shards): 100%|██████████| 799/799 [00:00<00:00, 5050.56 examples/s]

Saving the dataset (0/1 shards): 0%| | 0/200 [00:00<?, ? examples/s] Saving the dataset (1/1 shards): 100%|██████████| 200/200 [00:00<00:00, 5317.32 examples/s] Saving the dataset (1/1 shards): 100%|██████████| 200/200 [00:00<00:00, 5287.99 examples/s] INFO | 2024-05-22 18:49:36 | autotrain.backends.local:create:8 - Starting local training... INFO | 2024-05-22 18:49:36 | autotrain.commands:launch_command:372 - ['accelerate', 'launch', '--num_machines', '1', '--num_processes', '1', '--mixed_precision', 'fp16', '-m', 'autotrain.trainers.object_detection', '--training_config', 'autotrain-717ma-3oxi0/training_params.json'] INFO | 2024-05-22 18:49:36 | autotrain.commands:launch_command:373 - {'data_path': 'autotrain-717ma-3oxi0/autotrain-data', 'model': 'facebook/detr-resnet-101', 'username': 'rileybol', 'lr': 5e-05, 'epochs': 3, 'batch_size': 8, 'warmup_ratio': 0.1, 'gradient_accumulation': 1, 'optimizer': 'adamw_torch', 'scheduler': 'linear', 'weight_decay': 0.0, 'max_grad_norm': 1.0, 'seed': 42, 'train_split': 'train', 'valid_split': 'validation', 'logging_steps': -1, 'project_name': 'autotrain-717ma-3oxi0', 'auto_find_batch_size': False, 'mixed_precision': 'fp16', 'save_total_limit': 1, 'token': '**', 'push_to_hub': True, 'evaluation_strategy': 'epoch', 'image_column': 'autotrain_image', 'objects_column': 'autotrain_label', 'log': 'tensorboard', 'image_square_size': 600, 'early_stopping_patience': 5, 'early_stopping_threshold': 0.01} INFO | 2024-05-22 18:49:36 | autotrain.backends.local:create:13 - Training PID: 154 INFO: 10.16.2.201:24345 - "POST /ui/create_project HTTP/1.1" 200 OK INFO: 10.16.41.118:23391 - "GET /ui/is_model_training HTTP/1.1" 200 OK The following values were not passed to accelerate launch and had defaults used instead: --dynamo_backend was set to a value of 'no' To avoid this warning pass in values for each of the problematic parameters or run accelerate config. INFO: 10.16.41.118:34820 - "GET /ui/is_model_training HTTP/1.1" 200 OK INFO: 10.16.41.118:24196 - "GET /ui/accelerators HTTP/1.1" 200 OK INFO:matplotlib.font_manager:generated new fontManager INFO: 10.16.15.199:26769 - "GET /ui/is_model_training HTTP/1.1" 200 OK INFO | 2024-05-22 18:49:45 | main:train:83 - Train data: Dataset({ features: ['autotrain_image', 'autotrain_objects'], num_rows: 799 }) INFO | 2024-05-22 18:49:45 | main:train:84 - Valid data: Dataset({ features: ['autotrain_image', 'autotrain_objects'], num_rows: 200 }) ERROR | 2024-05-22 18:49:45 | autotrain.trainers.common:wrapper:120 - train has failed due to an exception: Traceback (most recent call last): File "/app/env/lib/python3.10/site-packages/autotrain/trainers/common.py", line 117, in wrapper return func(args, kwargs) File "/app/env/lib/python3.10/site-packages/autotrain/trainers/object_detection/main.py", line 86, in train categories = train_data.features[config.objects_column].feature["category"].names KeyError: 'autotrain_label'

ERROR | 2024-05-22 18:49:45 | autotrain.trainers.common:wrapper:121 - 'autotrain_label'

Additional Information

It seems that my training process gets past the last error, but now I am running into this new error.