Closed BenvanderVeer closed 3 years ago
Any update on this? Because I tried everything - using different genome versions etc, but it keeps giving me the same error, but i don't understand where the error is coming from?
For human data it works fine, but for the mouse index it doesn't work?
best, Ben
I face the same issue as well. Please let me know if you found a fix.
I could fix the issue by fixing the gene names in the fragements2genes.txt, unspliced_fragments.txt and the fragments.fa used to build the kallisto index. For some reason the gene names are appended with their coordinates which does not have a corresponding match with the gene metadata file. I think this is because of some string parsing error in the code used in generating these files. For now i fixed the files post generation with pandas and sed and it works all the way. Hope this helps
Hello.
I found what causes the issue. I followed the pipeline here: https://github.com/linnarsson-lab/loompy/tree/master/kallisto And I faced the same type of error.
After some research, I found that an update in bedtools change its behavior. I fixed it based on this: https://github.com/arq5x/bedtools2/issues/805
If you followed the same pipeline (which is really great by the way), I suggest you to change the line 23 of the "mouse_download.sh" code as follow:
bedtools getfasta -name -fo gencode.vM23.unspliced.fa -fi GRCm38.primary_assembly.genome.fa -bed gencode.vM23.primary_assembly.annotation.sorted.bed
Becomes
bedtools getfasta -name -fi GRCm38.primary_assembly.genome.fa -bed gencode.vM23.primary_assembly.annotation.sorted.bed | sed 's/::.*//' > gencode.vM23.unspliced.fa
I just did this and rerun the pipeline and everything did as expected, and I could have a .loom file at the end.
I hope this helps. This is my first message on github, sorry if I did not followed some guidelines that I am unaware of. I was stuck with this same problem and I would be happy if someone already posted this answer.
I wish you all the best in your analyses.
Jérôme
Changing the line works for me indeed! Thank you for your help!
best, Ben
Hi, I made an index using the recent posted solution for the mouse index (with Kallisto), but when try to run loom fromfq on my fastq files, it get this error:
I think it has to do with the naming of the genes/fragments, but I do not understand enough of the code to be able to fix it.
best, Ben