Open BoxWong opened 1 year ago
Hi Xin, the integration failed since the peaks are not unified for reference and query. We are working on a script for using quickATAC for scATAnno and will post it soon.
Hi Xin, we have uploaded a script to preprocess your PBMC scATAC-seq data (https://github.com/Yijia-Jiang/scATAnno-main/blob/main/prep_data/RunPBMC.sh).
To prepare for the PBMC peak matrix, you will need to use QuickATAC (https://github.com/AllenWLynch/QuickATAC/tree/main/quickatac). Then you can use the script RunPBMC.sh to get matrix for your query scATAC data.
Feel free to let me know if you have any questions.
Hi Yijia, I've got a problem trying to integrate the reference adata (PBMC data downloaded from your dropbox link) with my own PBMC scATAC-seq data processed with quickATAC while the query data that I downloaded from dropbox went just fine.
integrated_adata1 = scATAnno_assignment.scATAnno_integrate(reference_data, query_data1, variable_prefix = "Pub", sample_size = 25000)
and the error message was: _ValueError: Adatas have different variables. Please specify joinvars='inner' for intersection._ValueError: Adatas have different variables. Please specify joinvars='inner' for intersection.
sub_ref_data = reference_data[:, (query_data2.var.index & reference_data.var.index)]
and did the integration again:integrated_adata2_subref = scATAnno_assignment.scATAnno_integrate(sub_ref_data, query_data2, variable_prefix = "Pub", sample_size = 25000)
Now the 'Compute similarity matrix' step seems to be fine but the normalization step went wrong again and the error was different from the previous one: PanicException: called
Result::unwrap()
on anErr
value: TooSteepPS: I would also like to mention that I changed your script (scATAnno_integration.py) and added the parameter "join_vars = 'inner'" to line 44 which caused the problem. Doing this also gave me the same error: PanicException: called
Result::unwrap()
on anErr
value: TooSteepSorry for the lengthy text and looking forward to your input.
Best, Xin