Open nick-youngblut opened 1 month ago
Adding mtx_matrices.view()
to MTX_CONVERSION
shows that all of the samples have the same file names, which is causing the name collision:
[
[id:20240925_ADI_batch5_flex_3, ...],
[
/path/to/sample1/barcodes.tsv.gz,
/path/to/sample1/features.tsv.gz,
/path/to/sample1/matrix.mtx.gz,
...
]
]
It might help to include info on how to handle multiple barcodes per sample in the sample barcode table: https://nf-co.re/scrnaseq/2.7.1/docs/usage ("Additional samplesheet for multiplexed samples").
For example:
sample,multiplexed_sample_id,probe_barcode_ids,cmo_ids,description
20240905_ADI_batch3_flex_1,20240905_ADI_batch3_flex_1,BC001|BC002|BC003|BC004,,
From the 10X docs:
If multiple Probe Barcodes were used for a sample, separate IDs with a pipe (e.g., BC001|BC002).
Description of the bug
Running CellRanger with all GEX samples, in which there are multiple barcodes per sample, but the all go to the same sample (see the samples & barcodes tables below). This results in a file name collision at the
MTX_TO_H5AD
step. I haven't been able to determine why, based on the pipeline code.Command used and terminal output
The command:
Relevant files
The samples table (full paths removed for clarity):
The sample barcodes table:
System information
Nextflow: 24.04.4.5917 Hardward: HPC Executor: SLURM Engine: Apptainer OS: Ubuntu Pipeline: 2.7.1