Closed amathews-amd closed 1 year ago
Hi, the issue is "/sys/fs/cgroup/cpuacct/cpuacct" missing in your current enviroment causing Superbench monitor failures. Could you please temporarily disable monitor feature by changing the config file "atoa_small_ndv4.yaml" from
# SuperBench Config
version: v0.4
superbench:
enable: null
monitor:
enable: true
sample_duration: 1
sample_interval: 10
to
# SuperBench Config
version: v0.4
superbench:
enable: null
monitor:
enable: false
sample_duration: 1
sample_interval: 10
Docker container: nvidia/cuda:11.6.1-cudnn8-devel-ubuntu20.04 GPU 0: NVIDIA A100 80GB PCIe
https://github.com/microsoft/superbenchmark/blob/6e357fb9d2038dabd4e2c07854c92ca7b0805cee/superbench/monitor/monitor.py#L83