BUStools / bustools

Tools for working with BUS files
https://bustools.github.io/
BSD 2-Clause "Simplified" License
91 stars 23 forks source link

Consider output in loom file format #14

Closed slinnarsson closed 4 years ago

slinnarsson commented 5 years ago

Hi! Thanks for the great work on kallisto and bustools - it's impressively fast and looks accurate so far.

Since kallisto already uses HDF5, would you consider an option to write the output of kallisto+bustools to a loom file (which is based on HDF5)? Loom files are especially convenient for RNA velocity, since they support multiple layers. See http://loompy.org for docs and spec.

Ideally, bustools would write the count output to a new loom file (matrix plus row and column metadata), or add a layer to an existing loom file (e.g. for adding unspliced counts to a file with spliced counts, or for feature barcodes, or both at the same time).

The loom spec is simple enough that you can write the file directly using the HDF5 API in C/C++. I'll be happy to help if anything is unclear in the spec.

lakigigar commented 5 years ago

Thanks for the suggestion. We’ll definitely look into this. A bus2loom tool is a good idea.

lakigigar commented 4 years ago

The kb wrapper now supports loom output.