pepkit / looper

A job submitter for Portable Encapsulated Projects
http://looper.databio.org
BSD 2-Clause "Simplified" License
20 stars 7 forks source link

Looper process streamlining #414

Closed nsheff closed 3 months ago

nsheff commented 1 year ago

A collection of problems I ran into when trying to use looper for a project:

donaldcampbelljr commented 11 months ago

looper variable namespaces need to be updated; in particular, does the looper namespace get what's in the .looper config file now? -> Q: looper namespace was relative to PEP is it now the looper config? Think everythign in the looper config is in the looper namespace. See #423

looper.pep_config -> command_template -> it works if passing pep that is local, if using a registry path it doesn't appear to work

how do to parallel-process files with looper locally -> originally a divvy idea. issue: 100 files -> divvy submits to cluster no problem, if local they will run serial. Could run in background process using ampersand. in command shell script with &. So can we lump 100 samples (in 10 background processes), new divvy template to accomplish that.

donaldcampbelljr commented 11 months ago

there was no clear way to pull from pephub into Python (I guess pephubclient can do this? Possibly just need docs)

For using looper with pephub looks like there is documentation: https://looper.databio.org/en/dev/hello-world-pephub/

However, it does appear that our pephubclient api docs could be expanded. There is currently only a light readme.

donaldcampbelljr commented 3 months ago

Because we've solved the majority of the issues here and the remaining two have child issues tracking them, I will close this parent issue.