I am running train-yolov9-object-detection-on-custom-dataset.ipynb in Colab.
When I run the following cell:
!python train.py \
--batch 16 --epochs 25 --img 640 --device 0 --min-items 0 --close-mosaic 15 \
--data {dataset.location}/data.yaml \
--weights {HOME}/weights/gelan-c.pt \
--cfg models/detect/gelan-c.yaml \
--hyp hyp.scratch-high.yaml
It runs and completes 25 epochs, but many files are not generated. Their names are:
train_batch0.jpg
val_batch0_labels.jpg
val_batch0_pred.jpg
train_batch1.jpg
val_batch1_labels.jpg
val_batch1_pred.jpg
train_batch2.jpg
Running the training cell prints many messages. I have updated the repo to handle version and deprecation issues, but the two messages below alarm me.
Error 1
/usr/lib/python3.10/multiprocessing/popen_fork.py:66: RuntimeWarning: os.fork() was called. os.fork() is incompatible with multithreaded code, and JAX is multithreaded, so this will likely lead to a deadlock.
self.pid = os.fork()
Error 2
E external/local_xla/xla/stream_executor/cuda/cuda_fft.cc:485] Unable to register cuFFT factory: Attempting to register factory for plugin cuFFT when one has already been registered
2024-09-19 06:42:42.368354: E external/local_xla/xla/stream_executor/cuda/cuda_dnn.cc:8454] Unable to register cuDNN factory: Attempting to register factory for plugin cuDNN when one has already been registered
2024-09-19 06:42:42.374337: E external/local_xla/xla/stream_executor/cuda/cuda_blas.cc:1452] Unable to register cuBLAS factory: Attempting to register factory for plugin cuBLAS when one has already been registered
2024-09-19 06:42:42.389616: I tensorflow/core/platform/cpu_feature_guard.cc:210] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 AVX512F FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2024-09-19 06:42:43.588455: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT.
Note:
Which of the two messages above is causing the problem? I can see the plots and the confusion matrix, but not the actual detection results.
What should I do?
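For reference, this is a minimal sketch of how the missing files listed above can be checked after training. The helper name missing_outputs and the run directory runs/train/exp are assumptions for illustration, not part of the notebook; the actual run folder may differ.

```python
from pathlib import Path

# File names taken from the list of missing outputs in this post.
EXPECTED = [
    "train_batch0.jpg",
    "val_batch0_labels.jpg",
    "val_batch0_pred.jpg",
    "train_batch1.jpg",
    "val_batch1_labels.jpg",
    "val_batch1_pred.jpg",
    "train_batch2.jpg",
]


def missing_outputs(run_dir, expected=EXPECTED):
    """Return the expected file names that are not present in run_dir."""
    run_dir = Path(run_dir)
    return [name for name in expected if not (run_dir / name).is_file()]


if __name__ == "__main__":
    # Assumption: the run was written to runs/train/exp; adjust to the real folder.
    print(missing_outputs("runs/train/exp"))
```

An empty list means every expected image was written; any names printed are the ones still missing from that folder.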