issues
search
ml-energy
/
zeus
Deep Learning Energy Measurement and Optimization
https://ml.energy/zeus
Apache License 2.0
180
stars
24
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Fix CPU measurements so DRAM isn't included in package reading
#92
wbjin
closed
3 weeks ago
0
RAPL energy reading wrap around.
#91
wbjin
opened
1 month ago
0
Add CPU benchmarking to ZeusMonitor
#90
wbjin
closed
4 weeks ago
0
CPU socket detection for the current process
#89
jaywonchung
opened
1 month ago
1
[Zeusd] Better failure handling and testing
#88
jaywonchung
opened
1 month ago
1
`zeusd` debug outputs and doc comments
#87
jaywonchung
closed
1 month ago
0
Fix typo in GitHub Actions
#86
jaywonchung
closed
1 month ago
0
Integrate `zeusd` into `zeus.device.gpu`
#85
jaywonchung
closed
1 month ago
0
Add CPU in devices
#84
wbjin
closed
1 month ago
1
Reorg `zeus.device.gpu`
#83
jaywonchung
closed
1 month ago
0
Allow `zeusd` dev and testing on MacOS
#82
jaywonchung
closed
1 month ago
0
Zeus daemon
#81
jaywonchung
closed
1 month ago
0
replace `time.time()` calls by `time.perf_counter()`
#80
ImahnShekhzadeh
closed
1 month ago
1
[Usage] Question about Distributed Training
#79
ImahnShekhzadeh
closed
1 month ago
2
Better energy observability
#78
jaywonchung
opened
1 month ago
0
Training framework integration opportunities
#77
jaywonchung
opened
1 month ago
0
Fix: Pandas warnings from `PowerMonitor`
#75
jaywonchung
closed
2 months ago
0
Remove annoying warning messages in PowerMonitor
#72
Sunt-ing
closed
2 months ago
3
Detect and reject unofficial `pynvml` bindings
#71
jaywonchung
closed
2 months ago
0
improve the GPU energy monitoring demo
#70
Sunt-ing
closed
2 months ago
0
doc: fix typo
#69
Sunt-ing
closed
2 months ago
3
Docs: Add warnings about instantiating `ZeusMonitor` as a global variable.
#68
jaywonchung
closed
2 months ago
0
Bump to v0.9.1
#67
jaywonchung
closed
2 months ago
0
Chore: Fix CI and add back check doc build
#66
jaywonchung
closed
2 months ago
0
Use instant power draw explicitly when available
#65
jaywonchung
closed
2 months ago
0
Chore: Fix Zeus logo link in README
#64
jaywonchung
closed
2 months ago
0
Bump to v0.9.0
#63
jaywonchung
closed
2 months ago
0
Chore: Fix link in REAMDE.md
#62
jaywonchung
closed
2 months ago
0
Chore: Fix REAMDE.md typo
#61
jaywonchung
closed
2 months ago
0
Docs, READMEs, and examples big reorg
#60
jaywonchung
closed
2 months ago
0
Test and verify `nvmlDeviceSetAPIRestriction`
#59
jaywonchung
opened
2 months ago
0
Add Pyright type checking
#58
jaywonchung
closed
2 months ago
1
Added AMD GPU Support to Zeus
#57
parthraut
closed
2 months ago
3
CI: Fix doc build
#56
jaywonchung
closed
2 months ago
0
Stale API deprecations and example adjustments
#55
jaywonchung
closed
2 months ago
0
Testing bso examples on CloudLab and EKS
#54
show981111
closed
2 months ago
0
Carbon-aware Zeus (Chase) as an optimizer
#53
jaywonchung
opened
2 months ago
1
Fix: Test fixture and GPU abstraction in BSO
#52
jaywonchung
closed
2 months ago
0
Reorg dockerfiles and fix docs
#51
jaywonchung
closed
2 months ago
0
Fix: Align parameters of setGpuLockedClocks to NVML API
#49
FuryMartin
closed
3 months ago
0
Question about the missing of Graphics Clock Setting
#48
FuryMartin
closed
3 months ago
4
Doc Fix for zeus.device
#47
parthraut
closed
3 months ago
1
Abstracting away the gpu
#46
parthraut
closed
3 months ago
10
Distinguish instantaneous power vs. average power
#45
jaywonchung
closed
2 months ago
0
Docs: Test Vercel documentation preview
#44
jaywonchung
closed
3 months ago
1
`GlobalPowerLimitOptimizer` for distributed data parallel training
#43
jaywonchung
opened
3 months ago
0
Add `SFTTrainer` integration example
#42
jaywonchung
closed
3 months ago
0
Example for `SFTTrainer` + `HFGlobalPowerLimitOptimizer`
#41
jaywonchung
closed
3 months ago
0
Update `pyproject.toml` based on Ruff warning
#40
jaywonchung
closed
4 months ago
0
added base exception class
#39
parthraut
closed
4 months ago
0
Next