run-house/runhouse
Dispatch and distribute your ML training to "serverless" clusters in Python, like PyTorch for ML infra. Iterable, debuggable, multi-cloud/on-prem, identical across research and production.
https://run.house
Apache License 2.0
965
stars
37
forks
issues
Newest
upgrade upload-artifact to v4 in ci build_docs
#1255
BelSasha
closed
2 weeks ago
1
status thread accessing deleted env bugfix
#1254
BelSasha
closed
1 week ago
1
Switch `multinode_cpu_cluster` to `multinode_cpu_docker_conda_cluster`.
#1253
rohinb2
closed
3 weeks ago
2
some buffering fix attempts
#1252
rohinb2
closed
3 weeks ago
1
Find logs dir correctly.
#1251
rohinb2
closed
3 weeks ago
2
Skip status teardown test for now
#1250
carolineechen
closed
3 weeks ago
1
Fix disk size attribute
#1249
carolineechen
closed
3 weeks ago
1
Handle failures in a_up
#1248
dongreenberg
closed
3 weeks ago
2
Introduce cluster.a_up
#1247
dongreenberg
closed
3 weeks ago
2
Add more sky resource properties to cluster launched properties
#1246
carolineechen
closed
3 weeks ago
2
Cluster list tests
#1245
BelSasha
closed
1 week ago
1
remove sagemaker github action
#1244
jlewitt1
closed
3 weeks ago
1
update env vars for den accounts in CI
#1243
jlewitt1
closed
2 weeks ago
1
update docs with new status output
#1242
BelSasha
closed
3 weeks ago
1
update release pre-check and drop py3.7 support
#1241
jlewitt1
closed
2 weeks ago
2
Install default_env on all nodes
#1240
dongreenberg
closed
3 weeks ago
2
Switch `ondemand_aws_cluster` to `ondemand_aws_docker_cluster`.
#1239
rohinb2
closed
3 weeks ago
2
Install from explicit `dest_path` instead of relative path.
#1238
rohinb2
closed
3 weeks ago
3
Temporary fix for pydantic version error with FastAPI.
#1237
rohinb2
closed
3 weeks ago
2
updated tokens for dev ci testing
#1236
jlewitt1
closed
2 weeks ago
3
Run everything via `ssh` + `docker exec` as opposed to SSH Proxy.
#1235
rohinb2
closed
3 weeks ago
3
Update version to 0.0.34.
#1234
rohinb2
closed
4 weeks ago
2
cluster list filters
#1233
BelSasha
closed
1 week ago
1
More docstrings cleanup
#1232
carolineechen
closed
4 weeks ago
0
add last_active info to cluster list
#1231
BelSasha
closed
1 week ago
1
Update docstrings
#1230
carolineechen
closed
4 weeks ago
1
add num ports to try for ssh tunnel as constant
#1229
jlewitt1
closed
4 weeks ago
3
load cluster token from Den instead of generating client side
#1228
jlewitt1
closed
2 weeks ago
2
By default, cluster list prints only running clusters
#1227
BelSasha
closed
1 week ago
1
status cmd prints cluster name as hyperlink to den
#1226
BelSasha
closed
3 weeks ago
1
Introduce cluster list cli cmd
#1225
BelSasha
closed
1 week ago
1
status cli prints active functions info
#1224
BelSasha
closed
3 weeks ago
1
instrument obj store methods for traces and spans
#1223
jlewitt1
opened
1 month ago
1
remove SageMaker cluster
#1222
jlewitt1
closed
4 weeks ago
2
Delete `provenance` and `run` related code.
#1221
rohinb2
closed
4 weeks ago
3
Teardown k8s cluster in nightly gha release test
#1220
carolineechen
closed
1 month ago
1
Add Multicloud Airflow example
#1219
py-rh
closed
3 weeks ago
2
Propagate `_ssh_mode`.
#1218
rohinb2
closed
1 month ago
2
Try fixing logs again
#1217
dongreenberg
closed
1 month ago
3
delete table
#1216
BelSasha
closed
1 month ago
1
remove unused `add_secrets` func
#1215
jlewitt1
closed
1 month ago
1
prevent duplicate logs locally
#1214
jlewitt1
closed
4 weeks ago
1
Deprecate file and blob
#1213
BelSasha
closed
4 weeks ago
2
Reintroduce Log Hierarchy
#1212
dongreenberg
closed
1 month ago
2
Stream logs in SSH setup commands
#1211
dongreenberg
closed
1 month ago
2
add local telemetry agent for clusters
#1210
jlewitt1
opened
1 month ago
1
print cloud info in cluster status
#1209
BelSasha
closed
3 weeks ago
1
Fix ondemand test.
#1208
rohinb2
closed
1 month ago
2
minor status cli bugfix
#1207
BelSasha
closed
1 month ago
1
Update client error message
#1206
carolineechen
closed
1 month ago
1