ultralytics / hub

Ultralytics HUB tutorials and support
https://hub.ultralytics.com
GNU Affero General Public License v3.0
125 stars 11 forks source link

Unexpected interruption in training process #761

Open Aq114 opened 2 months ago

Aq114 commented 2 months ago

Search before asking

HUB Component

No response

Bug

Using yolo detect train data=data/classify. yaml model=yolov8n. pt epochs=100 imgsz=640 to train a custom dataset on the local command line, an error termination occurs after approximately seven to eight iterations. The error message is as follows: Traceback (most recent call last): File "", line 198, in _run_module_as_main File "", line 88, in _run_code File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Scripts\yolo.exe__main.py", line 7, in File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\cfg__init__.py", line 591, in entrypoint getattr(model, mode)(*overrides) # default args from model ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\engine\model.py", line 650, in train self.trainer.train() File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\engine\trainer.py", line 204, in train self._do_train(world_size) File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\engine\trainer.py", line 429, in _do_train self.metrics, self.fitness = self.validate() ^^^^^^^^^^^^^^^ File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\engine\trainer.py", line 570, in validate metrics = self.validator(self) ^^^^^^^^^^^^^^^^^^^^ File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\torch\utils_contextlib.py", line 115, in decorate_context return func(args, **kwargs) ^^^^^^^^^^^^^^^^^^^^^ File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\engine\validator.py", line 195, in call__ stats = self.get_stats() ^^^^^^^^^^^^^^^^ File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\models\yolo\detect\val.py", line 172, in get_stats stats = {k: torch.cat(v, 0).cpu().numpy() for k, v in self.stats.items()} # to numpy ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^ File "C:\Users\Y\AppData\Local\anaconda3\envs\yolov5\Lib\site-packages\ultralytics\models\yolo\detect\val.py", line 172, in stats = {k: torch.cat(v, 0).cpu().numpy() for k, v in self.stats.items()} # to numpy ^^^^^^^^^^^^^^^ RuntimeError: torch.cat(): expected a non-empty list of Tensors

Environment

Ultralytics YOLOv8.2.48 🚀 Python-3.11.9 torch-2.3.1 CUDA:0 (NVIDIA GeForce GTX 1050 Ti, 4096MiB)

Minimal Reproducible Example

No response

Additional

The yolov5m6 model has been successfully trained on this dataset

github-actions[bot] commented 1 month ago

👋 Hello there! We wanted to give you a friendly reminder that this issue has not had any recent activity and may be closed soon, but don't worry - you can always reopen it if needed. If you still have any questions or concerns, please feel free to let us know how we can help.

For additional resources and information, please see the links below:

Feel free to inform us of any other issues you discover or feature requests that come to mind in the future. Pull Requests (PRs) are also always welcomed!

Thank you for your contributions to YOLO 🚀 and Vision AI ⭐