Closed selkamand closed 1 month ago
Reference signatures are still quite small except for c-elegans_sbs96_unfiltered_v1_0.csv & the other unfiltered experimental signatures. These might be worth compressing
Decided to use gzip not bgzip since its on basically all operating systems natively and requires no special dependencies / code changes to parse.
The R package is now getting quite large. We should consider compressing each signature dataset e.g. using bgzip to keep package install snappy and ensure we won't hit problems when going to CRAN.
note if signature data ends up too large, we may need sigstash to stream in the signature data from some sort of public data-store (github should be fine).