after install DCGM, When I enter that statement, I get the following error
dcgmi discovery -l
Error: unable to establish a connection to the specified host: localhost
Error: Unable to connect to host engine. Host engine connection invalid/disconnected.
dcgmi discovery -v
Version : 2.4.5
Build ID : 9
Build Date : 2022-06-03
Build Type : Release
Commit ID : 82470ec91c4a20565182d65d2b8f0ea756c70285
Branch Name : rel_dcgm_2_4
CPU Arch : x86_64
Build Platform : Linux 4.15.0-180-generic #189-Ubuntu SMP Wed May 18 14:13:57 UTC 2022 x86_64
CRC : 54832e64be3a6a8ad586bcae022ca6cb
I'm using aws and my environment is:
and install DCGM refer to this link https://developer.nvidia.com/dcgm (The version was modified from ubuntu20 to 18 and installed)
after install DCGM, When I enter that statement, I get the following error
dcgmi discovery -l
dcgmi discovery -v