Closed axelalmet closed 3 weeks ago
I haven’t looked at that dataset, but without the index files, how do you correspond reads to cells? I.e. how do you know which read belongs to which cell?
Hi @Yenaled,
I'm definitely not an expert on Smart-seq3 data by any means, but from what I can tell from the GEO accession, there's an associated metadata file that tells you which cell corresponds to which run, which matches the processed barcodes provided by the authors.
I'm not sure if that fully answers your question, but that's the most I know right now.
Best wishes, Axel.
In that case, your data is "demultiplexed" (i.e. each FASTQ file represents an individual cell). This is identical to the issue here: https://github.com/pachterlab/kb_python/issues/235 where a solution is presented.
Solution #235 worked a treat, thank you so much!
Hi kb-python team,
I am interested in processing some published Smart-seq3 data to get unspliced and spliced counts (mainly out of curiosity).
This was the command I tried to run with
kb count
for a single sample:This is the resulting error I get:
which all makes sense, as I understand that one has to provide the index I1 and I2 files for SmartSeq3 analysis.
However, when I reexamined uploaded data at SRA run browser here, I see that the authors didn't upload the index files.
Therefore, is it possible to still realign the data without these index files?
Thank you for your time!
Best wishes, Axel.