Open methornton opened 5 years ago
Hi! I'm also trying/testing linseed and used CPMs (from edgeR), TPMs (from RSEM) and also FPKM (cufflinks) matrices.
Matrices looked like: transcript_id sample1 sample2 sample3 <--------header ENST000000000 5.456 7.876 4.194 <-------- transcript/gene id and it's expression values per sample in CPMs/TPMs/FPKMs
The expected cell type number entered by hand into R script. Idk, if linseed allows to add more than one number simultaneously. I just tried different expected numbers per each script run.
By now my results are not as beautiful as they could be.
Some more detailed tutorial is appreciated! :)
@methornton
You can just provide the expression matrix to a constructor of the Linseed Class (basically matrix objects) I would suggest using something like TPMs, any normalization that already took library size into an account.
Cheers and sorry for the slow replies, Konstantin
Hello!
I am working through the tutorial and I have my own RNA-seq data that I would like to process with linseed. Does the LinseedObject function require data be formatted exactly as "GSE19830_series_matrix.txt"? I have an RNA-seq data set that has annotation for genes , raw counts, and RPKM. I don't know how many cell types are present, but I expect at least 10 -12.
Can you tell me which of these fields must be supplied?
The header of my RNA-seq data looks like this:
I can get the 'normCounts' out from the R package 'edgeR', if this is necessary, how to format it? Any advice or assistance is greatly appreciated!! Thank you!