issues
search
AMDResearch
/
omnistat
https://amdresearch.github.io/omnistat/
MIT License
5
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
[Req] Partition info in "GPU Utilization by Node"
#131
coleramos425
opened
4 weeks ago
0
adding documentation badge
#130
koomie
closed
1 month ago
0
Update documentation for release tarball usage
#129
koomie
closed
1 month ago
0
Improve Grafana documentation
#128
jordap
closed
1 month ago
0
Add minimum rocm version check for rocm-smi variant.
#127
koomie
closed
1 month ago
0
Fix node dashboard with older Grafana versions
#126
jordap
closed
1 month ago
0
update default config file example to not have rms collector enabled
#125
koomie
closed
1 month ago
0
Improve automated dashboard generation
#124
jordap
closed
1 month ago
0
update squeue check during rms collector startup
#123
koomie
closed
1 month ago
0
Fix standalone node URL
#122
jordap
closed
1 month ago
0
exporter fails to start correctly in default config when slurm squeue is not available
#121
koomie
closed
1 month ago
0
Test different configurations
#120
jordap
opened
1 month ago
0
Fix temperature in job dashboard
#119
jordap
closed
1 month ago
0
Minor documentation updates and fixes
#118
jordap
closed
1 month ago
0
Fix temperatures in dashboards
#117
jordap
closed
1 month ago
0
Add overview of access restriction configuration in installation discussion
#116
koomie
closed
1 month ago
0
update name and query logic for memory temperature metric
#115
koomie
closed
1 month ago
2
Fix usermode options
#114
jordap
closed
1 month ago
0
edits to documentation landing page
#113
koomie
closed
1 month ago
0
Tweak inventory and network panels
#112
jordap
closed
1 month ago
0
Feature/rmsv2
#111
omri-amd
opened
1 month ago
5
Corebinding defaults
#110
jordap
closed
1 month ago
1
Document OMNISTAT_PROMSERVER_DATADIR
#109
jordap
opened
1 month ago
0
Consistent use of Omnistat port 8001
#108
jordap
closed
1 month ago
0
Update usermode documentation
#107
jordap
closed
1 month ago
0
Consistent use of Omnistat port
#106
jordap
closed
1 month ago
1
provide metric naming parity between rocm-smi and amd-smi collector variants
#105
koomie
closed
1 month ago
0
doc additions: add slurm integration discussion for system-mode
#104
koomie
closed
1 month ago
0
re-enable ansible example with updated variant
#103
koomie
closed
1 month ago
0
documentation updates/additions
#102
koomie
closed
1 month ago
0
Dashboard improvements
#101
jordap
closed
1 month ago
0
Remove unused Flask metrics from exporter
#100
jordap
closed
1 month ago
1
Update minimum version check for rocm 6.1
#99
koomie
closed
1 month ago
0
Features/m2m
#98
omri-amd
opened
1 month ago
0
Major dashboard update
#97
jordap
closed
1 month ago
0
update temperature_hbm_celsius metric - only register when hbm temperature is available
#96
koomie
closed
1 month ago
0
Increase number of samples
#95
jordap
closed
1 month ago
0
addition of optional collector for event detection
#94
koomie
closed
1 month ago
0
adding several new metrics
#93
koomie
closed
1 month ago
0
Throttling events and other dashboard updates
#92
jordap
closed
1 month ago
0
update psecs wait time to be based on size of job (user mode execution)
#91
koomie
closed
1 month ago
0
Update gpu index-mapping for amd-smi variant
#90
koomie
opened
1 month ago
0
Updates for gpu index mapping; use gpu_ids provided by newer API in rocm-smi based collector
#89
koomie
closed
1 month ago
0
Test usermode in package deployments
#88
jordap
closed
1 month ago
0
Node dashboard and dashboard links
#87
jordap
closed
2 months ago
0
Add job step sorting to local dashboard
#86
koomie
closed
2 months ago
0
with multiple node allocation on SLURM, #GPUs is showing only one instance
#85
GowriShankarEAAS
closed
2 months ago
4
Test multiple nodes and usermode
#84
jordap
closed
2 months ago
1
Issue while starting the omnistat-monitor
#83
GowriShankarEAAS
closed
2 months ago
1
minor cosmetic update for query utility
#82
koomie
closed
2 months ago
0
Next