Correct SORT-seq data - Githubissues

Thanks for your interest!

Good question. Unfortunately, it seems that the count tables deposited are somewhat older than the ones used for the latest analysis. So thank you for bringing my attention to this.

I am currently in the middle of two busy weeks, so I will look into this next week.

Briefly,

The files you are talking about are from a previous mapping pipeline, that we didn't use in the end. The counts that are determined there are:
- ReadCounts: Raw counts, without UMI correction (integer number).
- BarcodeCounts: UMI counts (integer).
- TranscriptCounts: UMI counts with additional correction for the fact that there are only 4^6 = 4096 UMIs possible, which leads to UMI redundancy for genes with very high counts. This can be corrected for, which is done, which is why this parameter contains float numbers.
I will next week try to make sure (a) you get the right files (please send me your contact details at m.wehrens[AT]uva.nl), (b) to upload the correct files into a repository.

All the best, Martijn

vanrooij-lab / scRNAseq-HCM-human

Correct SORT-seq data #1