Open jesswhitts opened 2 months ago
Hi @jesswhitts
Hmm this should not be the case, not sure how this can happen. Would you be able to trace back where they got lost in the pycisTopic analysis?
Best,
Seppe
After reading in the cisTopic object: cisTopic_obj.region_data: [382839 rows x 8 columns]
After running the impute accessibility function: len(imputed_acc_obj.feature_names): 382167
After the 'subset for common annotations' step: cisTopic_obj.region_data: [382691 rows x 8 columns]; len(imputed_acc_obj.feature_names): 382167
It looks like when we 'check which annotations are common and if necessary subset', some regions are dropped from cisTopic_obj.region_data but not from the imputed accessibility object. Could this be the problem?
Best, Jess
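A minimal sketch of how the mismatch described above could be traced, assuming the region names are available as plain sequences of strings (for example list(cisTopic_obj.region_data.index) and imputed_acc_obj.feature_names). The helper name labels_missing_from and the toy region names are hypothetical and not part of scenicplus or pycisTopic.

```python
def labels_missing_from(wanted, available):
    """Return the labels in `wanted` that are absent from `available`, in order."""
    available_set = set(available)
    return [label for label in wanted if label not in available_set]


# Toy stand-ins (hypothetical region names) for
# imputed_acc_obj.feature_names and cisTopic_obj.region_data.index.
imputed_feature_names = ["chr1:100-600", "chr1:900-1400", "chr2:50-550"]
region_data_index = ["chr1:100-600", "chr2:50-550", "chr3:10-510"]

# Regions still present in the imputed object but no longer in region_data.
lost = labels_missing_from(imputed_feature_names, region_data_index)
print(f"{len(lost)} region(s) in the imputed object but not in region_data: {lost}")

# Keeping only the shared labels avoids looking up a label that no longer exists.
lost_set = set(lost)
shared = [name for name in imputed_feature_names if name not in lost_set]
```

Running the same comparison after each step (reading, imputing, subsetting) should show at which point the two sets of region names diverge.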
Hi @jesswhitts
What are the dimensions of cisTopic_obj.selected_model.region_topic?
S
Hi @SeppeDeWinter
[382839 rows x 100 columns] when first reading the file. After doing the subset, the 'selected_model' field becomes an empty list.
Still unsure about the cause of this error, but I've found a workaround in case anyone else comes across this. I select for common cell types in my GEX and ATAC datasets at the very beginning, and the pipeline now works fine. Thanks again for the interesting tool!
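The workaround above could look roughly like the following sketch. It assumes each dataset carries one cell-type label per cell. The label lists and the helper common_label_mask are hypothetical, and the resulting boolean masks would still need to be applied to the actual GEX and ATAC objects before running the pipeline.

```python
def common_label_mask(labels, other_labels):
    """Boolean mask over `labels`: True where the label also occurs in `other_labels`."""
    shared = set(labels) & set(other_labels)
    return [label in shared for label in labels]


# Hypothetical per-cell annotations for the two modalities.
gex_cell_types = ["HSC", "Monocyte", "B_cell", "HSC"]
atac_cell_types = ["HSC", "Monocyte", "T_cell"]

# One mask per dataset, each keeping only cells whose type exists in both.
gex_keep = common_label_mask(gex_cell_types, atac_cell_types)
atac_keep = common_label_mask(atac_cell_types, gex_cell_types)

print("Shared cell types:", sorted(set(gex_cell_types) & set(atac_cell_types)))
print("GEX cells kept:", sum(gex_keep), "of", len(gex_keep))
print("ATAC cells kept:", sum(atac_keep), "of", len(atac_keep))
```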
Hello,
When running the Snakemake pipeline, I get the following error:
[Wed Apr 24 18:08:33 2024]
localrule prepare_GEX_ACC_non_multiome:
    input: ../../scATAC/cistopic_obj.pkl, ../../Lambo_AML12DX.h5ad
    output: ACC_GEX.h5mu
    jobid: 2
    reason: Missing output files: ACC_GEX.h5mu
    resources: tmpdir=/tmp

[Wed Apr 24 18:10:06 2024]
Finished job 8.
2 of 14 steps (14%) done
Select jobs to execute...
Traceback (most recent call last):
  File "/data/stemcell/jwhittle/mambaforge/envs/scenic-plus/bin/scenicplus", line 8, in <module>
    sys.exit(main())
             ^^^^^^
  File "/data/stemcell/jwhittle/mambaforge/envs/scenic-plus/lib/python3.11/site-packages/scenicplus/cli/scenicplus.py", line 1137, in main
    args.func(args)
  File "/data/stemcell/jwhittle/mambaforge/envs/scenic-plus/lib/python3.11/site-packages/scenicplus/cli/scenicplus.py", line 44, in command_prepare_GEX_ACC
    prepare_GEX_ACC(
  File "/data/stemcell/jwhittle/mambaforge/envs/scenic-plus/lib/python3.11/site-packages/scenicplus/cli/commands.py", line 67, in prepare_GEX_ACC
    mdata = process_non_multiome_data(
            ^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/data/stemcell/jwhittle/mambaforge/envs/scenic-plus/lib/python3.11/site-packages/scenicplus/data_wrangling/adata_cistopic_wrangling.py", line 253, in process_non_multiome_data
    ACC_region_metadata_subset = ACC_region_metadata.loc[imputed_acc_obj.feature_names]