gefujing closed this issue 2 weeks ago
Hello @gefujing, thank you for reporting the issue. I will try to figure it out. It would also be very helpful if you could provide a minimal example where it fails, for instance with two or three specific datasets.
Thank you very much! You can see the specific dataset when you run `artifact_to_remove = artifacts.get(uid="upR31puIm5bp3AC7Xy8m")`. The code works well after we remove this dataset with `collection.artifacts.remove(artifact_to_remove)`.
Thank you, I will check what is going on with the dataset.
OK, I see now that the dataset has `.X` in CSC sparse format. `MappedCollection` doesn't support CSC yet; I will track this in https://github.com/laminlabs/lamindb/issues/1873. I will also add a check for CSC matrices for now.
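For readers wondering why the storage format matters here, the following is a minimal pure-Python sketch, not the lamindb implementation. It assumes the usual compressed sparse layout of three arrays (`data`, `indices`, `indptr`); the helper names `read_row`, `read_row_csr` and `csc_to_csr` are hypothetical. Reading a cell (a row) is a single slice in CSR, whereas applying the same slice to CSC arrays silently returns wrong values, which is why a guard on the format is useful.

```python
def read_row_csr(data, indices, indptr, row, n_cols):
    """Return one dense row from CSR arrays (indptr is indexed by row)."""
    dense = [0] * n_cols
    for k in range(indptr[row], indptr[row + 1]):
        dense[indices[k]] = data[k]
    return dense


def read_row(fmt, data, indices, indptr, row, n_cols):
    """Hypothetical guard: refuse formats that cannot be sliced by row."""
    if fmt != "csr":
        raise ValueError(f"row access needs CSR, got {fmt!r}")
    return read_row_csr(data, indices, indptr, row, n_cols)


def csc_to_csr(data, indices, indptr, n_rows):
    """Re-encode CSC arrays as CSR arrays with a counting sort over rows."""
    n_cols = len(indptr) - 1
    # Count stored values per row to build the new row pointer.
    new_indptr = [0] * (n_rows + 1)
    for r in indices:
        new_indptr[r + 1] += 1
    for r in range(n_rows):
        new_indptr[r + 1] += new_indptr[r]
    next_slot = new_indptr[:-1]  # slicing copies; next free slot per row
    new_data = [0] * len(data)
    new_indices = [0] * len(data)
    # Walk the columns in order so column indices stay sorted within a row.
    for col in range(n_cols):
        for k in range(indptr[col], indptr[col + 1]):
            r = indices[k]
            new_data[next_slot[r]] = data[k]
            new_indices[next_slot[r]] = col
            next_slot[r] += 1
    return new_data, new_indices, new_indptr


# The 2 x 3 matrix [[1, 0, 2], [0, 3, 0]] stored column by column (CSC).
csc_data, csc_indices, csc_indptr = [1, 3, 2], [0, 1, 0], [0, 1, 2, 3]

# Treating the CSC arrays as CSR gives a wrong first row without any error.
wrong_row = read_row_csr(csc_data, csc_indices, csc_indptr, 0, 3)

# After conversion, row access returns the expected values.
csr_data, csr_indices, csr_indptr = csc_to_csr(csc_data, csc_indices, csc_indptr, 2)
first_row = read_row("csr", csr_data, csr_indices, csr_indptr, 0, 3)
second_row = read_row("csr", csr_data, csr_indices, csr_indptr, 1, 3)
```

In practice a dataset would more likely be converted once with an existing sparse-matrix library and saved again, rather than re-encoded by hand as above.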
Hello! We are testing large-scale data processing using the cellxgene database in lamindb. We followed the guide at https://docs.lamin.ai/scrna5.
The main code we run is as follows:
However, the code reports an error for unknown reasons:
After testing, we tentatively concluded that the error is independent of the number of cells (we tested many small datasets and they ran well). The source of the error appears to be the `MappedCollection` class defined by lamindb. In its `__getitem__` method, this class extracts the gene expression values and labels using indexes generated earlier. We suspect that, because multiple datasets are merged, something goes wrong when these indexes are generated.
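As a rough illustration of the indexing step described above, here is a generic sketch of how a global index over several merged datasets can be mapped back to a dataset and a local row. It is not lamindb's actual code; the class name `ConcatIndex` and the method `locate` are hypothetical, and only the Python standard library is used.

```python
from bisect import bisect_right
from itertools import accumulate


class ConcatIndex:
    """Map a global row index to (dataset number, row within that dataset)."""

    def __init__(self, sizes):
        # offsets[i] is the global index of the first row of dataset i.
        self.offsets = [0] + list(accumulate(sizes))

    def __len__(self):
        return self.offsets[-1]

    def locate(self, idx):
        if not 0 <= idx < len(self):
            raise IndexError(f"index {idx} out of range for {len(self)} rows")
        # The last offset that is <= idx identifies the owning dataset.
        dataset = bisect_right(self.offsets, idx) - 1
        return dataset, idx - self.offsets[dataset]


# Three datasets with 3, 2 and 4 cells give 9 rows in total.
index = ConcatIndex([3, 2, 4])
locations = [index.locate(i) for i in (0, 2, 3, 4, 8)]
```

With a mapping like this, a failure that depends on one specific dataset rather than on the total number of cells points at that dataset's contents and not at the index itself.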
Could you please help me check and fix this problem? error information.pptx