broadinstitute / CellBender

CellBender is a software package for eliminating technical artifacts from high-throughput single-cell RNA sequencing (scRNA-seq) data.
https://cellbender.rtfd.io
BSD 3-Clause "New" or "Revised" License
297 stars 54 forks source link

Feat: Using output from Kallisto/Bustools (or other non-CellRanger pipelines) #274

Closed yfarjoun closed 1 year ago

yfarjoun commented 1 year ago

CellRanger format is different than the native AnnData/scanpy format. and while AnnData knows how to import 10X files, it doesn't know how to export them....i.e. if I have data that wasn't generated by 10X I have a hard time running CellBender on it. It would be nice to have a way to input generic data by providing

The "normal" 10x run, would be invoking the generic pipeline with some set of inputs, but those of us who use non-CellRanger output could also have the benefit of using CellBender....

sjfleming commented 1 year ago

CellBender does now have the ability to take a variety of file formats as input. It can take a CellRanger h5, but it can also take an AnnData h5ad file, or a dropseq DGE file (not recommended though because the input parsing takes forever on large samples), or a loom file (same note as DGE), or a CellRanger MTX formatted tuple of files.

The AnnData format and the MTX format are both general enough that I was hoping that most use cases would fall into one of those bins. Is the AnnData option workable for you? (Raw counts should be in adata.X... or for complete details see here)

yfarjoun commented 1 year ago

ok. I'll give it a try.

yfarjoun commented 1 year ago

For the record, it seems to be working!! thanks!