Closed npklein closed 7 years ago
@npklein Since you have used Taiji to analyze RNA-seq data before, you can just look at "outputdir/RNA-seq/*_TPM_by_names.tsv" for examples. You should not use gene id, but this is not the reason for the issue. Example:
STPG1 413.00
NIPAL3 3495.00
LAS1L 1133.00
ENPP4 292.00
SEMA3F 1186.00
CFTR 0.00
ANKIB1 6129.00
CYP51A1 4533.98
KRIT1 1442.31
What happen if you grep ENSG00000000003 your_input.txt
?
@kaizhang I didn't finish running it with the fastq files because it was taking too long, so decided to use the quantification instead, which is why I didn't have example data in outputdir/RNA-seq
I removed all my output, remade input files and did a fresh run. Now it doesn't crash so somewhere I had the wrong input, sorry for the trouble. Still have the problem that the Rank file is empty, but I will first change the gene IDs to gene names and see if that works.
Thanks for the help.
I'm trying to use gene quantification data (using https://gist.github.com/npklein/04d3d7f46d4ac683827aadf315be49ff as input.yml) but I get the following error
It seems that it tries to convert the gene name to Double instead of the quantification. Opening the expression file with
cat -vet expression.txt | head -n5
(so tab reads as ^I and end of line as $) I getThis seems to be according to specs from the example_input.yml comment
Incase it expects counts instead of normalized data I also tried to input ints instead of the floats, but got same error.
Could you upload an example quantification file so that I can check if I have the right format?