FredHutch / motuz

Motuz - A web based infrastructure for large scale data movements between on-premise and cloud
MIT License
103 stars 12 forks source link

Run copy jobs on HPC cluster #408

Open dtenenba opened 2 years ago

dtenenba commented 2 years ago

Motuz currently does not scale well. When the number of jobs equals the number of CPUs on the (single) machine where it is running, it puts new jobs into a queue and they do not start until a running job completes.

This makes Motuz less useful when it is needed most.

Since here at Fred Hutch we have an HPC cluster, it would make sense to have the option to submit jobs to the cluster, then we could remove this limitation.

Since not everyone who may want to run Motuz has an HPC cluster, we should still support the Celery backend.

This issue is just to track progress on this work.

Some thoughts -

dtenenba commented 1 year ago

There is somewhat of a solution for the issue of the celery queue size.