aws / amazon-genomics-cli

https://aws.github.io/amazon-genomics-cli/
Apache License 2.0
147 stars 82 forks source link

User-specified dependencies in runtime environment #615

Open bballew opened 1 year ago

bballew commented 1 year ago

Description

It would be great if there was a way to have the user specify minimal dependencies to be installed in the runtime environment.

Use Case

Specifically for Snakemake, a very common paradigm is to use pandas to read in a tab-separated manifest file and then query the dataframe with lambda functions in rules. In fact, pandas is part of the dependency chain for installing Snakemake via conda. It appears that we don't have access to pandas in the Snakemake container being used by agc. Since Snakemake is based on Python, I could see users wanting access to other Python libraries at runtime, that might not currently be installed in the Snakemake container.

Proposed Solution

Thanks!