issues
search
Fluid-Dynamics-Group
/
distribute
easy to use distribtued computing
https://fluid-dynamics-group.github.io/distribute-docs
GNU General Public License v3.0
1
stars
0
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Paper revisions
#46
VanillaBrooks
opened
11 months ago
0
JOSS Review: Automated Testing Pipeline
#45
jeremylt
opened
11 months ago
0
Joss Review: Install Warning
#44
jeremylt
opened
11 months ago
1
Joss Review: Cargo Install
#43
jeremylt
opened
11 months ago
0
JOSS Review: Contributing Guidelines
#42
jeremylt
opened
11 months ago
0
JOSS Review: Contributions
#41
HaoZeke
opened
1 year ago
1
Documentation on using docker configurations with distribute
#40
VanillaBrooks
opened
1 year ago
0
Python bindings for generating docker configurations
#39
VanillaBrooks
opened
1 year ago
0
distribute kill leaves empty jobset
#38
VanillaBrooks
opened
1 year ago
1
Remove ansi escape codes from files, Update crate dependencies, use system time for logs
#37
VanillaBrooks
closed
1 year ago
0
Run containers with `--no-home` + documentation
#36
VanillaBrooks
opened
1 year ago
0
Add additional logging to debug random halting in simulations
#35
VanillaBrooks
closed
1 year ago
0
Update documentation book to remove fat and provide better examples
#34
VanillaBrooks
closed
1 year ago
0
user docs on exporting distribute specification to use on slurm
#33
VanillaBrooks
opened
1 year ago
0
dev docs on how to run full test suite
#32
VanillaBrooks
opened
1 year ago
0
Add paper writeup and cleanup documentation writeup
#31
VanillaBrooks
closed
1 year ago
0
Possible for node to appear online constantly if keep alives are timed properly
#30
VanillaBrooks
opened
1 year ago
0
Fix python CI to remove artifacts, gh-pages action now uploads to main repo
#29
VanillaBrooks
closed
1 year ago
0
Export distribute-jobs.yaml configuration files to execute on SLURM cluster
#28
VanillaBrooks
closed
1 year ago
0
Send large files with streaming APIs, reuse `send_files` state machine for compilation and job files.
#27
VanillaBrooks
closed
1 year ago
0
tracing logs that are sent to stdout and a file are unreadable in file format
#26
VanillaBrooks
opened
1 year ago
0
fix division by zero with % operator in server status output
#25
VanillaBrooks
closed
1 year ago
0
High memory usage for (possibly, but not for sure) large .sif files
#24
VanillaBrooks
closed
1 year ago
0
divide by zero for short job runtime
#23
VanillaBrooks
closed
1 year ago
0
use `tracing` to implement logs, verify that `apptainer` is in the client's $PATH to make debugging possible errors easier.
#22
VanillaBrooks
closed
1 year ago
0
Option to skip folder when using `distribute pull`
#21
VanillaBrooks
closed
1 year ago
0
bump dependencies, convert structopt cli to clap
#20
VanillaBrooks
closed
1 year ago
0
`distribute pull` flag to disable downloading folder structure
#19
VanillaBrooks
closed
1 year ago
0
Command to convert distribute-jobs.yaml file to SLURM files
#18
VanillaBrooks
closed
1 year ago
0
Store the names of the jobs that are running, how long it has been running, and the name of the node / IP that is running that job
#17
VanillaBrooks
closed
1 year ago
1
Better logs to user if runtime dependencies (like apptainer) are not correctly installed on a compute machine
#16
VanillaBrooks
closed
1 year ago
0
Decrement job running count after cancelling a job
#15
VanillaBrooks
closed
1 year ago
0
`distribute node-status` may hang indefinitely if the IP address of a node is not configured correctly - does not timeout
#14
VanillaBrooks
opened
2 years ago
0
Add python bindings to config generation
#13
VanillaBrooks
closed
2 years ago
0
update some points of documentation and ensure tests are passing
#12
VanillaBrooks
closed
2 years ago
0
Apptainer
#11
VanillaBrooks
closed
2 years ago
0
Update to state machine implementation
#10
VanillaBrooks
closed
2 years ago
0
Flag on `distribute status` to determine the version of each node
#9
VanillaBrooks
closed
2 years ago
0
test case for job being added to queue after it has been terminated / failed keepalive
#8
VanillaBrooks
opened
2 years ago
0
Kill command does not properly stop jobs on nodes
#7
VanillaBrooks
closed
2 years ago
1
Don't overwrite build logs from different nodes
#6
VanillaBrooks
opened
2 years ago
0
Better logging for distribute pull command
#5
VanillaBrooks
closed
1 year ago
0
Additional documentation, pull commands, run local commands, bunch of other useful stuff
#4
VanillaBrooks
closed
2 years ago
0
distribute pull command
#3
VanillaBrooks
closed
2 years ago
0
Ping nodes constantly to determine if they go offline
#2
VanillaBrooks
closed
1 year ago
0
Nodes / Job pool are not correctly communicating when a job has finished running
#1
VanillaBrooks
closed
2 years ago
0