issues
NVIDIA
/
go-dcgm
Golang bindings for Nvidia Datacenter GPU Manager (DCGM)
Apache License 2.0
95
stars
27
forks
Data Corruption on dcgm_fi_dev_gpu_util Metric
#75
TortoiseHam
closed
5 days ago
2
Compilation errors of dcgm-exporter and go-dcgm
#74
zoobab
opened
2 weeks ago
1
feat: expose function for listening to policy violations on a specific GPU group
#73
sanjams2
closed
2 months ago
1
DCGM Policy Violation Notification channel reporting too many PCIe violations on P5
#72
haardm
closed
3 months ago
1
Darwin ARM64 Compile issue after Upgrading from v0.0.0-20230816170901-d898cc7820fe to v0.0.0-20240118201113-3385e277e49f
#71
TamerSherif
closed
4 months ago
1
GPU Health API improvements
#70
nvvfedorov
closed
4 months ago
0
Returning dcgmRunDiagnostic results for a partial success
#69
EugenioSiciliano
opened
5 months ago
1
can't set -p parameter when using go-dcgm, but CLI can
#68
freelikeff
closed
5 months ago
1
Test injection events are not caught in embedded mode.
#67
bingiflash
closed
6 months ago
2
Error setting up dcgm with startHostEngine mode from a golang based container
#66
haardm
opened
6 months ago
1
#64 fix 'GPU' of ProcessInfo in GetProcessInfo
#65
berkaroad
closed
7 months ago
2
always return 0, when get GPU of process info by `dcgm.GetProcessInfo(XXX)`
#64
berkaroad
opened
7 months ago
1
Compile error, cannot use _Ctype_long(ts) (value of type _Ctype_long) as _Ctype_longlong
#63
prtsh
closed
7 months ago
6
Add ability to access GroupHandle and FieldHandle
#62
rohit-arora-dev
closed
8 months ago
0
Deadlock in ListenForPolicyViolations
#61
sanjams2
opened
9 months ago
1
feat: allow watch pid with pre-created group.
#60
rootfs
opened
9 months ago
1
provide more flexible WatchPidFields API
#59
rootfs
opened
9 months ago
2
feat: export policy condition types
#58
sanjams2
closed
8 months ago
10
The `dcgmGetValuesSince_v2` binding has been added to `go-dcgm`
#57
nvvfedorov
closed
10 months ago
0
compile go-dcgm statically
#56
287400117
closed
9 months ago
1
Sync and export DCGM_GROUP_MAX_ENTITIES
#55
glowkey
closed
10 months ago
0
Add new API ListenForPolicyViolations to replace Policy
#54
dran-dev
closed
10 months ago
0
Lost dcgm policy notifications
#53
nvvfedorov
closed
11 months ago
0
Diagnostic may lead to increased memory resident of the program.
#51
BetaZYN
opened
11 months ago
1
Fix darwin do not support `--export-dynamic`
#50
zwpaper
opened
11 months ago
5
Lost dcgm policy notifications
#49
sanjams-amzn
closed
8 months ago
3
Add support for CPU/CPU Core entity types and queries
#48
glowkey
closed
11 months ago
0
Added `--export-dynamic` linker flag for go 1.21 support
#47
nvvfedorov
closed
11 months ago
0
run topology sample error
#46
lengrongfu
closed
1 month ago
0
Updates for DCGM 3.3.0
#45
glowkey
closed
1 year ago
4
Fixed dcgm.WatchPidFields using wrong time unit
#44
helinfan
closed
1 year ago
1
Where can I get the sample output of various policy failures?
#43
vinayburugu
closed
1 year ago
1
Update headers for DCGM 3.2
#42
glowkey
closed
1 year ago
0
Issue 13 - fix Init() when errors are encountered
#41
glowkey
closed
1 year ago
1
How do i get the fields using golang?
#40
vickyvikas7988
opened
1 year ago
2
dyld[33398]: symbol not found in flat namespace '_DcgmFieldGetById'
#39
vickyvikas7988
closed
1 year ago
3
Occasional metric loss and hangs in DCGM Exporter
#38
zlseu-edu
opened
1 year ago
3
default makefile build failure
#36
emeraldbay
closed
1 year ago
2
Add support for dcgmRunDiagnostic
#35
vilkaspilkas
closed
1 year ago
2
Fix getSupportedMetricGroups to match actual DCGM API
#34
glowkey
closed
1 year ago
0
Update getSupportedMetricGroups to use handle (Issue #32)
#33
glowkey
closed
1 year ago
0
getSupportedMetricGroups function takes uint `grpid` and the value is not used
#32
LujieDuan
closed
1 year ago
2
Question related to GPU device attributes
#31
starry91
opened
1 year ago
3
Question regarding MIG UUID
#30
starry91
closed
1 year ago
1
Add new nvswitch/nvlink fields
#29
glowkey
closed
2 years ago
0
Refactor API errors to include DCGM code
#28
glowkey
closed
2 years ago
0
Remove DestroyGroup function in the GetProcessInfo()
#27
xigang
closed
2 years ago
2
Update to DCGM 3.0 headers, Add new switch and link APIs
#26
glowkey
closed
2 years ago
0
process's SM Utilization is always lower than the gpu's SM Utilization
#25
seanchen022
opened
2 years ago
0
concurrent calls to the dcgm.GetProcessInfo() function sometimes block
#24
xigang
closed
2 years ago
4