Closed drneavin closed 3 years ago
Hi @drneavin ,
I'm not sure what the issue is exactly, and you are right it seems to be a scvi
issue. However, they are currently updating their code, so if its a true bug on their end thing might get tough. However they might have some suggestions about things to try.
Another option is you can try reading your file using scanpy
into python and then write an anndata
file. And then running solo
on that file. Sorry, I'm of more help.
taking a quick look at some files I have. the ones you posted seem normal.
nicholas@sci-pvm-nicholas:~$ head matrix.mtx
%%MatrixMarket matrix coordinate integer general
%metadata_json: {"format_version": 2, "software_version": "3.1.0"}
33646 4745 4759983
33574 1 30
33567 1 69
33566 1 28
33559 1 522
33558 1 50
33551 1 788
33509 1 45
nicholas@sci-pvm-nicholas:~$ head barcodes.tsv
AAACCCAGTAAGATCA-1
AAACCCATCAGAGCAG-1
AAACGAACACAAATAG-1
AAACGAACAGATTAAG-1
AAACGAAGTTGCCATA-1
AAACGAATCAGGTGTT-1
AAACGCTAGAGTCTTC-1
AAACGCTAGATGTAGT-1
AAACGCTGTCAAGTTC-1
AAACGCTGTGACTCGC-1
nicholas@sci-pvm-nicholas:~$ head features.tsv
ENSG00000243485 MIR1302-2HG Gene Expression
ENSG00000237613 FAM138A Gene Expression
ENSG00000186092 OR4F5 Gene Expression
ENSG00000238009 AL627309.1 Gene Expression
ENSG00000239945 AL627309.3 Gene Expression
ENSG00000239906 AL627309.2 Gene Expression
ENSG00000241599 AL627309.4 Gene Expression
ENSG00000236601 AL732372.1 Gene Expression
ENSG00000284733 OR4F29 Gene Expression
ENSG00000235146 AC114498.1 Gene Expression
Thanks for the recommendation @njbernstein!
I just finished checking on scanpy loading and that seems to be failing as well. However, some of my other codes that use just scipy to read the matrix in work fine so I think it probably has to do with some assumptions about the file structures that are built into these functions expected for 10x data.
I'll let you know if I find a good solution.
Solved per your recommendation to create an AnnData object saved as standard h5ad and used that as input for solo. Seems to be working now. Thanks for the help!
Hello,
I'm not sure the best location to put this issue - it arises when using the solo package but I'm fairly certain that the issue lies with the scvi package. I am getting the following output with the error:
I can replicate this error when in python after importing scvi and trying to load the dataset with Dataset10X but can't identify why I am receiving this error.
The data is a dataset where the barcodes are letters followed by a dash, a number, and sometimes another letter, ie:
and the files are produced with
umitools_to_mtx
from the Rscrunchy
package. I have a feeling the main problem is due somehow related to the fact that these files were not directly produced by the 10x cellranger pipeline. Here's the top of the matrix.mtx file:I'll do some more digging but would love if you have some recommendations or input.
Thanks!