Closed pratapaprasanna closed 4 years ago
Hi all,
It seems the issue was with my NVLink.
If your training is taking too long, please check that the links are up.
You can check the link status with the following command:
$ nvidia-smi nvlink --status
GPU 0: GeForce RTX 2080 Ti (UUID: GPU-797d7153-ea28-d678-dc38-859b914d6dd7)
Link 0: 25.781 GB/s
Link 1: 25.781 GB/s
GPU 1: GeForce RTX 2080 Ti (UUID: GPU-8807c553-7571-582d-c2ee-02993527b0a6)
Link 0: 25.781 GB/s
Link 1: 25.781 GB/s
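In case it helps anyone who wants to run this check automatically before a long training job, here is a minimal Python sketch that runs the same command and flags links that do not report a speed. It assumes the output keeps the `GPU N:` / `Link N: ... GB/s` layout shown above and that a link without a `GB/s` value is down; I believe inactive links are printed as `<inactive>`, but that may vary by driver version. The function names `parse_nvlink_status` and `links_down` are made up for this example.

```python
import re
import subprocess

GPU_RE = re.compile(r"^GPU (\d+):")
LINK_RE = re.compile(r"^\s*Link (\d+):\s*(.+?)\s*$")


def parse_nvlink_status(text):
    """Map each GPU index to a list of (link index, status string)."""
    gpus = {}
    current = None
    for line in text.splitlines():
        gpu = GPU_RE.match(line.strip())
        link = LINK_RE.match(line)
        if gpu:
            current = int(gpu.group(1))
            gpus[current] = []
        elif link and current is not None:
            gpus[current].append((int(link.group(1)), link.group(2)))
    return gpus


def links_down(gpus):
    """Return (gpu, link) pairs whose status shows no GB/s rate.

    Assumption: a link that does not report a speed is not up.
    """
    return [
        (gpu, link)
        for gpu, links in gpus.items()
        for link, status in links
        if "GB/s" not in status
    ]


if __name__ == "__main__":
    # Same command as above; requires the NVIDIA driver tools on PATH.
    result = subprocess.run(
        ["nvidia-smi", "nvlink", "--status"],
        capture_output=True, text=True, check=True,
    )
    status = parse_nvlink_status(result.stdout)
    down = links_down(status)
    if down:
        print("Links without a reported speed:", down)
    else:
        print("All reported links show a speed.")
```

Note that a GPU with no `Link` lines at all ends up with an empty list in the parsed result, so it is worth looking at the parsed dictionary as well as the list of down links.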
Thanks
Hi all,
I have been trying to start a training run on OpenSeq2Seq, but the training does not start.
I installed the drivers freshly, and the training still does not start at all.
It has been stuck here for almost 12 hours.
Can anyone help me understand the issue?
When I run other training jobs I see that the GPU is being utilized, but I don't know why it is not working
with TensorFlow or OpenSeq2Seq.
I followed all the steps in the installation instructions.
Thank you.
Environment