project-codeflare/codeflare-cli
Apache License 2.0
11 stars, 12 forks
issues
feat: bump to `@guidebooks/store@8`
#996
starpit
closed
1 year ago
1
fix: more EOF protection fixes
#995
starpit
closed
1 year ago
0
Adjust release steps to contain complete and detailed release description
#994
sutaakar
closed
1 year ago
2
fix: update s3fs pvc to prevent its caching mechanism from consuming …
#993
starpit
closed
1 year ago
0
fix: improve cpu utilization metrics, and retry on ray head initContainer
#992
starpit
closed
1 year ago
0
Enhance codeflare help to return other verbs
#991
yuanchi2807
opened
1 year ago
0
codeflare -y still prompted for Choice 10 Choose an S3 Bucket
#990
yuanchi2807
opened
1 year ago
0
ctrl+c should only prompt "do you want to kill the job"
#989
starpit
opened
1 year ago
0
if only one option, don't use multiselect ui
#988
starpit
opened
1 year ago
0
Allow specifying no/empty response to the "path to working directory" option
#987
starpit
opened
1 year ago
0
fix: multinic detection was broken; also was hard-wiring name of resource
#986
starpit
closed
1 year ago
0
fix: custodian pods can linger forever
#985
starpit
closed
1 year ago
0
fix: avoid downloading helm chart on every run (cache it)
#984
starpit
closed
1 year ago
0
fix: improve torchx support for running multiple gpus per pod
#983
starpit
closed
1 year ago
0
fix: codeflare top may fail due to Array(fractionalnumber)
#982
starpit
closed
1 year ago
0
feat: improve multinic and NCCL performance
#981
starpit
closed
1 year ago
0
fix: use initContainer to wait for ray workers
#980
starpit
closed
1 year ago
0
fix: increase ray gcs rpc timeout to 30s
#979
starpit
closed
1 year ago
0
fix: increase resilience to network disconnects for torchx
#978
starpit
closed
1 year ago
0
fix: wait for ray workers prior to server-side job submit
#977
starpit
closed
1 year ago
0
fix: increase resilience to network disconnects, restore helm delete in custodian
#976
starpit
closed
1 year ago
0
fix: add websocat to custodian to avoid having to wget it every time
#975
starpit
closed
1 year ago
0
fix: avoid helm delete in custodian for now
#974
starpit
closed
1 year ago
0
fix: avoid use of all-containers in ray log streamer
#973
starpit
closed
1 year ago
0
fix: increase memory for runtime-env custodian pod
#972
starpit
closed
1 year ago
0
fix: increase memory for ray head logs container
#971
starpit
closed
1 year ago
0
fix: torchx volume mount paths may have extra quotes
#970
starpit
closed
1 year ago
0
fix: remove reliance on wget in ray head container
#969
starpit
closed
1 year ago
0
fix: improve custodian memory requests for larger jobs
#968
starpit
closed
1 year ago
0
fix: add h5py to pyarrow base image
#967
starpit
closed
1 year ago
0
fix: another fix for loading yoga.wasm (codeflare top exits silently)
#966
starpit
closed
1 year ago
0
fix: fix for torchx+s3 and lightning+s3
#965
starpit
closed
1 year ago
0
fix: improve support for pytorch lightning's fsspec[s3] support
#964
starpit
closed
1 year ago
0
fix: add `fsspec[s3]` to lightning image, and `conda clean -afy`
#963
starpit
closed
1 year ago
0
fix: shorter uuids resulted in uppercase kubernetes resource names
#962
starpit
closed
1 year ago
0
fix: use shorter uuids, custodian trackers for runtime-env and worker status
#961
starpit
closed
1 year ago
0
fix: force yoga.wasm to be loaded as a file resource
#960
starpit
closed
1 year ago
0
chore: update github actions to cancel previous in same workflow+branch
#959
starpit
closed
1 year ago
0
test: improve kind tests on linux/arm
#958
starpit
closed
1 year ago
0
fix: update kind tests to delete new name of cleaner -> logs
#957
starpit
closed
1 year ago
0
feat: update custodian to track cpu, mem, and gpu utilization
#956
starpit
closed
1 year ago
0
fix: clear selection when changing context
#955
starpit
closed
1 year ago
0
fix: pick up wasm webpack fix from kui
#954
starpit
closed
1 year ago
0
fix: failures on kind against default namespace
#953
starpit
closed
1 year ago
0
fix: clean up custodian command, and rename container 'logs'
#952
starpit
closed
1 year ago
0
fix: namespace keyboard shortcuts were incorrect
#951
starpit
closed
1 year ago
0
fix: another tweak to try to address ink loading bugs
#950
starpit
closed
1 year ago
0
fix: add keyboard shortcut hints
#949
starpit
closed
1 year ago
0
fix: Top should allow pageup/pagedown to cycle through clusters
#948
starpit
closed
1 year ago
0
fix: Top should allow uparrow/downarrow to cycle through namespaces
#947
starpit
closed
1 year ago
0