Open mertcelebi opened 1 year ago
debug
folders nuked!.h5ad
files are only 10s of Mbs so they should be easily downloadable from Zenodo and visualized using a lightweight instance; generating the compression and doing large data transformations is what requires large amounts of RAM, but for the purposes of visualization/ exploration the compressed files should workThis is awesome, thanks @mezarque. Binder is great if it's multiple notebooks. Though I've run into some rate-limiting issues, so FYI. Colab would be fine if, it's a single notebook with more compute available.
install_locs.py
called S3_BUCKET_ADDRESS
install_locs.py
and indicated in the README that users would need to modify these paths to reflect their local installation environment.glial_origins_tidy.yml
which seems to be working.
glial_origins_tidy.yml
conda environment should now have everything needed to broadly reproduce the analysis. Exact reproduction should be possible using the full glial_origins.yml
file, which I'm going to leave in there for now.a_
etc. and all should be described in the "Pipeline Organization" section of the main README.md
postBuild
set of calls to wget
a la this tutorial to download the .h5ad
compressed data files, which should be pretty small (~250Mb) and loadable.env.yml
file in the top of the directory to build a lightweight conda environment for the Binder.notebooks
to store the Binder-specific code. It likely won't use biofile_handling
Hi @mezarque, just went through the repo, here are some rough thoughts:
tutorial
folders to clean things up.debug
folders.Setup.md
to the main README (talking about what type of instance you ran things on and how you configured it etc). If you need inspiration for some of the level of detail around setting up Snakemake, the actin prediction repo has a great README.1_download/
or smth like that and within those you'd have download notebooks for each species etc?