-
I have encountered a cluster using the PBS/Torque scheduling system on which the `qstat` and `tracejob` commands give no information about job IDs that have finished running. Instead the `checkjob` co…
-
When I use `addprocs_qrsh()` I get an error message and no jobs are created (checked in `qstat`).
```
ClusterManagers.addprocs_qrsh(3,res_list="h_rt=2:00:00,h_data=4G,highp")
Error launching worker…
-
When you qsub a job that needs to wait for another job to finish, `starcluster loadbalance` treats that job as a waiting job and fires up new nodes. It should really only consider jobs that are just w…
-
If I lauch a PAF job in cluster mode, when it ends it is not automatically deleted from the list of my active jobs (the list I access to using qstat -u trevisanin).
This causes problems as I can launc…
-
Both SGE and LSF have qsub options that cause the qsub command to wait until the job has completed before returning. Some 3rd party pipeline applications make use of this feature, which makes porting…
-
Monkey jobs with lots of samples do not scale well at certain steps.
Here is a script that doesn't scale well because it needs to read in all the bams of all the files before creating a gene body c…
-
For example, if I have a job in the error state because it exceeded its h_vmem resource, I try to alter the job to change the resource request:
$ qalter -l h_vmem=2G 1132393
denied: can't change cons…
-
make sure that pbsnodes does not do a live query from the compute nodes, but uses a mongo query, e.g. pbsnodesneeds to fisrt be called, the contents put in mongo, and than we just display from mngo
…
-
ToDo:
* [x] build and package dependencies as nix flakes
* [x] build and package engine
* [x] build and package assets (I've given up on handling assets in nix)
* [x] seemless rebuild and restarts…
-
The following is a list of TODOs associated with PR#2767. These are items that would be good to revisit in the near future:
- [x] Create a new unit test for submit(), so that it doesn't have to rel…