Closed HenriettaHolze closed 6 months ago
Hi,
When you say:
I applied the change suggested here https://github.com/epi2me-labs/wf-single-cell/issues/100#issuecomment-2118984118 to run the pipeline.
Do you mean that having applied this change you then hit the new error reported here? I think I would expect that because both errors are fundamentally related to the same underlying issue.
The immediate cause in the current case is that the code has selected all features (i.e. genes in this case) for removal as they do not pass the filters set. The filters include absolute values on the numbers of cells in which the feature appears.
I presume that your test data is a small dataset and so does not contain sufficient data to pass these defaults. When testing with small datasets we set the parameters --matrix_min_genes 1 --matrix_min_cells 1
in order to, in effect, bypass the filtering.
Yes, I applied the change and then hit this error. Setting those parameters worked, thanks!
I'm reopening this issue just to remind us to make the code throw a more helpful error message when all cells or features have been filtered away.
I got a similar error (j referenced before assignment) even when running on the whole dataset. I had to add the parameters again to make it work.
v2.0.3 will be available shortly and will provide a more reasonable error message.
Operating System
CentOS 7
Other Linux
No response
Workflow Version
wf-single-cell v2.0.2-ge9dac45 + bugfix
Workflow Execution
EPI2ME Desktop (Local)
Other workflow execution
No response
EPI2ME Version
No response
CLI command run
nextflow run epi2me-labs/wf-single-cell -profile singularity --expected_cells 50000 --fastq '/pipeline/Runs/Nanopore/20240514_1009_MN22007_FAY30355_68a76471/no_sample/20240514_1009_MN22007_FAY30355_68a76471/fastq_pass/FAY30355_pass_68a76471_936db429_24.fastq.gz' --kit_name '5prime' --kit_version 'v1' --ref_genome_dir '/data/reference/dawson_labs/genomes/cellranger_reference_GRCh38-2020-A/refdata-gex-GRCh38-2020-A' -w '/scratch/teams/dawson_genomics/Projects/PRC2_BE_screen/results/MF01_nanopore/epi2me_output/work/' --out_dir '/scratch/teams/dawson_genomics/Projects/PRC2_BE_screen/results/MF01_nanopore/epi2me_output/' --threads 16
Workflow Execution - CLI Execution Profile
singularity
What happened?
Hi, I got an error when running the pipeline on a test fastq file in the pipeline:process_bams:process_matrix process, related to following line: https://github.com/epi2me-labs/wf-single-cell/blob/e9dac45261530195a4efbf64ed1437b703ce86b4/bin/workflow_glue/expression_matrix.py#L312 I don't fully understand the logic of the code but seems like there should be a check for len(feat_mask) != 0 or something the like.
I applied the change suggested here https://github.com/epi2me-labs/wf-single-cell/issues/100#issuecomment-2118984118 to run the pipeline.
Relevant log output
Application activity log entry
No response
Were you able to successfully run the latest version of the workflow with the demo data?
yes
Other demo data information
No response