neherlab / pangraph

A bioinformatic toolkit to align genome assemblies into pangenome graphs
https://neherlab.github.io/pangraph
MIT License

size of docker image #43

Closed mmolari closed 1 year ago

mmolari commented 1 year ago

The Docker image is ~1.5 GB and takes ~30 minutes to build. About half of that size comes from the ete3 Python library, which requires packaging a Python distribution with conda inside the image. ete3 is only used by the export command, specifically for the PanX-compatible export (file src/panX.jl).

A possible way to reduce both the build time and the Docker image size is to use TreeTools instead of ete3. In particular, we need functions to: