TensorBoard was logging constantly while running a training-aware command for a yolov5-n (base) model trained on COCO, which made it difficult to see training progress or to tell whether training was running at all. This PR fixes that by suppressing the TensorBoard logs.
Changes include:
Set the TensorBoard logging level to warnings, which suppresses the constant stream of INFO-level TensorBoard logs
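For reviewers, the change can be sketched roughly as below. This is a minimal illustration rather than the exact diff: it assumes TensorBoard emits its records through the standard-library logger named "tensorboard", which is what the `INFO:tensorboard:` prefixes in the output below suggest, and the variable name is chosen only for this example.

```python
import logging

# Assumption: TensorBoard logs through the stdlib logger named "tensorboard",
# as suggested by the "INFO:tensorboard:" prefixes in the console output.
tensorboard_logger = logging.getLogger("tensorboard")

# Raise the threshold so INFO records (the reload messages) are dropped,
# while warnings and errors still reach the console.
tensorboard_logger.setLevel(logging.WARNING)
```

Setting the level on the named logger, rather than on the root logger, should leave Sparsify's own INFO output untouched.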
rahul at quad-mle-1 in ~/projects/sparsify (sparsify)
$ sparsify.run training-aware --model "zoo:cv/detection/yolov5-n/pytorch/ultralytics/coco/base-none" --data VOC.yaml --use-case cv-detection --optim-level 0.5
Checking for GPU...
GPU check completed successfully
/home/rahul/venvs/sparsify/lib/python3.8/site-packages/tensorboard_data_server/bin/server: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.33' not found (required by /home/rahul/venvs/sparsify/lib/python3.8/site-packages/tensorboard_data_server/bin/server)
/home/rahul/venvs/sparsify/lib/python3.8/site-packages/tensorboard_data_server/bin/server: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.34' not found (required by /home/rahul/venvs/sparsify/lib/python3.8/site-packages/tensorboard_data_server/bin/server)
/home/rahul/venvs/sparsify/lib/python3.8/site-packages/tensorboard_data_server/bin/server: /lib/x86_64-linux-gnu/libc.so.6: version `GLIBC_2.32' not found (required by /home/rahul/venvs/sparsify/lib/python3.8/site-packages/tensorboard_data_server/bin/server)
*************************SPARSIFY***************************
TensorBoard listening on http://localhost:6006/
************************************************************
INFO:auto_banner:TensorBoard listening on http://localhost:6006/
INFO:tensorboard:TensorBoard reload process beginning
TensorBoard reload process beginning
INFO:tensorboard:Starting AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
Starting AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
INFO:tensorboard:GetLogdirSubdirectories: Starting to list directories via walking.
GetLogdirSubdirectories: Starting to list directories via walking.
INFO:tensorboard:Done with AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
Done with AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
INFO:tensorboard:TensorBoard reload process: Reload the whole Multiplexer
TensorBoard reload process: Reload the whole Multiplexer
INFO:tensorboard:Beginning EventMultiplexer.Reload()
Beginning EventMultiplexer.Reload()
INFO:tensorboard:Reloading runs serially (one after another) on the main thread.
Reloading runs serially (one after another) on the main thread.
INFO:tensorboard:Finished with EventMultiplexer.Reload()
Finished with EventMultiplexer.Reload()
INFO:tensorboard:TensorBoard done reloading. Load took 0.001 secs
TensorBoard done reloading. Load took 0.001 secs
INFO:tensorboard:TensorBoard reload process beginning
TensorBoard reload process beginning
INFO:tensorboard:Starting AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
Starting AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
INFO:tensorboard:GetLogdirSubdirectories: Starting to list directories via walking.
GetLogdirSubdirectories: Starting to list directories via walking.
INFO:tensorboard:Done with AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
Done with AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
INFO:tensorboard:TensorBoard reload process: Reload the whole Multiplexer
TensorBoard reload process: Reload the whole Multiplexer
INFO:tensorboard:Beginning EventMultiplexer.Reload()
Beginning EventMultiplexer.Reload()
INFO:tensorboard:Reloading runs serially (one after another) on the main thread.
Reloading runs serially (one after another) on the main thread.
INFO:tensorboard:Finished with EventMultiplexer.Reload()
Finished with EventMultiplexer.Reload()
INFO:tensorboard:TensorBoard done reloading. Load took 0.001 secs
TensorBoard done reloading. Load took 0.001 secs
INFO:root:Using nproc_per_node=auto.
Using nproc_per_node=auto.
INFO:torch.distributed.elastic.rendezvous.static_tcp_rendezvous:Creating TCPStore as the c10d::Store implementation
Creating TCPStore as the c10d::Store implementation
INFO:tensorboard:TensorBoard reload process beginning
TensorBoard reload process beginning
INFO:tensorboard:Starting AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
Starting AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
INFO:tensorboard:GetLogdirSubdirectories: Starting to list directories via walking.
GetLogdirSubdirectories: Starting to list directories via walking.
INFO:tensorboard:Done with AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
Done with AddRunsFromDirectory: /home/rahul/projects/sparsify/training_aware_object_detection_2023_07_11_17_17_42/logs
INFO:tensorboard:TensorBoard reload process: Reload the whole Multiplexer
TensorBoard reload process: Reload the whole Multiplexer
INFO:tensorboard:Beginning EventMultiplexer.Reload()
Beginning EventMultiplexer.Reload()
INFO:tensorboard:Reloading runs serially (one after another) on the main thread.
Reloading runs serially (one after another) on the main thread.
INFO:tensorboard:Finished with EventMultiplexer.Reload()
Finished with EventMultiplexer.Reload()
INFO:tensorboard:TensorBoard done reloading. Load took 0.001 secs
TensorBoard done reloading. Load took 0.001 secs
Test Command:
Before this PR:
After This PR: