selkamand / sigstash

Easily load published mutational signatures into R
Other
0 stars 0 forks source link

bgzip signature data #14

Closed selkamand closed 1 month ago

selkamand commented 5 months ago

The R package is now getting quite large. We should consider compressing each signature dataset e.g. using bgzip to keep package install snappy and ensure we won't hit problems when going to CRAN.

note if signature data ends up too large, we may need sigstash to stream in the signature data from some sort of public data-store (github should be fine).

selkamand commented 2 months ago

Reference signatures are still quite small except for c-elegans_sbs96_unfiltered_v1_0.csv & the other unfiltered experimental signatures. These might be worth compressing

selkamand commented 1 month ago

Decided to use gzip not bgzip since its on basically all operating systems natively and requires no special dependencies / code changes to parse.