Closed alanocallaghan closed 3 years ago
There is a well-established processing pipeline for the Zeisel data in the simpleSingleCell workflow. I hope to deploy it into a package - possibly scRNAseq
- so that we don't have to keep on re-defining it.
Hi there,
This is a pretty minor issue, but I thought I would document it here in case anybody else runs into it, and for my own records in future. Thanks for providing a comprehensive repository!
I tried to replicate the preprocessing for the Zeisel data and I ran into a couple of difficulties with the metadata. When I run the code using the files on the Linarsson site, I run into some issues with the metadata, which messes up the selection of cells, ie:
https://github.com/MarioniLab/RegressionBASiCS2017/blob/fb1d833614e0469db51ec1677cabf66433f5e19e/Preprocessing/Data_preparation.R#L70
I can't reconcile this part of the code with the structure I get from the files, which seems for some reason to have been read in as some sort of nested list, and with some rows apparently being different in value. I have no idea if this is due to a change in R, a change in the file hosted on the lab website (modification date is July 17 2016 despite the filename suggesting 17 August 2014), a difference in operating system, or different system/R options (or a combination of these).
In case it's useful for anybody, the code below is self-contained and should work with the current data from the Linarsson lab site. Be aware, it creates and writes to the
data
directory in the current working directory by default. Cheers