I am running export pseudobulk with a cell_data pandas dataframe, that contains my variable for pseudobulk grouping, a barcode column corresponding to my fragment files, a sample_id_col corresponding to my fragment dictionary. The function runs with no problems, but I don't have any saved bw or bed files. I have read a few similar issues suggesting the barcode syntax is different from fragment files but I don't think this is the case because I can find intersections. Can you suggest a fix?
example of cell_data columns for sample id (orig.ident) and barcode:
Below is my Jupyter notebook log:
2024-01-13 23:21:27,591 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03313.tsv.gz
2024-01-13 23:22:07,609 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03314.tsv.gz
2024-01-13 23:23:09,939 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03315.tsv.gz
2024-01-13 23:23:56,134 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03316.tsv.gz
2024-01-13 23:24:56,093 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03328.tsv.gz
2024-01-13 23:25:26,070 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03329.tsv.gz
2024-01-13 23:26:06,107 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03330.tsv.gz
2024-01-13 23:27:14,705 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03331.tsv.gz
2024-01-13 23:28:20,442 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03447.tsv.gz
2024-01-13 23:28:50,178 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03448.tsv.gz
2024-01-13 23:29:04,993 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03449.tsv.gz
2024-01-13 23:29:40,523 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03450.tsv.gz
2024-01-13 23:29:40,798 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03459.tsv.gz
2024-01-13 23:29:49,566 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03460.tsv.gz
2024-01-13 23:30:48,727 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03461.tsv.gz
2024-01-13 23:30:52,853 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03462.tsv.gz
2024-01-13 23:30:52,983 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03471.tsv.gz
2024-01-13 23:31:02,134 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03472.tsv.gz
2024-01-13 23:31:56,579 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03473.tsv.gz
2024-01-13 23:32:02,520 cisTopic INFO Reading fragments from /projects/dscott_prj/amfong/Multiome_DZ/data/pycisTopic/atac_fragments_CLC03474.tsv.gz
2024-01-13 23:33:04,924 INFO worker.py:1664 -- Started a local Ray instance. View the dashboard at [1m[32m127.0.0.1:8265 [39m[22m
[36m(export_pseudobulk_ray pid=26703)[0m 2024-01-13 23:33:29,265 cisTopic INFO Creating pseudobulk for NEG
[36m(export_pseudobulk_ray pid=26688)[0m 2024-01-13 23:33:29,283 cisTopic INFO Creating pseudobulk for POS
[36m(export_pseudobulk_ray pid=26701)[0m 2024-01-13 23:33:33,689 cisTopic INFO Creating pseudobulk for UNCLASS
[36m(export_pseudobulk_ray pid=26701)[0m /home/amfong/pycisTopic/pycisTopic/pseudobulk_peak_calling.py:274: FutureWarning: The frame.append method is deprecated and will be removed from pandas in a future version. Use pandas.concat instead.
[36m(export_pseudobulk_ray pid=26701)[0m group_fragments = group_fragments_list[0].append(group_fragments_list[1:])
[36m(export_pseudobulk_ray pid=26703)[0m /home/amfong/pycisTopic/pycisTopic/pseudobulk_peak_calling.py:274: FutureWarning: The frame.append method is deprecated and will be removed from pandas in a future version. Use pandas.concat instead.[32m [repeated 2x across cluster] (Ray deduplicates logs by default. Set RAY_DEDUP_LOGS=0 to disable log deduplication, or see https://docs.ray.io/en/master/ray-observability/ray-logging.html#log-deduplication for more options.)[0m
[36m(export_pseudobulk_ray pid=26703)[0m group_fragments = group_fragments_list[0].append(group_fragments_list[1:])[32m [repeated 2x across cluster][0m
I am running export pseudobulk with a cell_data pandas dataframe, that contains my variable for pseudobulk grouping, a barcode column corresponding to my fragment files, a sample_id_col corresponding to my fragment dictionary. The function runs with no problems, but I don't have any saved bw or bed files. I have read a few similar issues suggesting the barcode syntax is different from fragment files but I don't think this is the case because I can find intersections. Can you suggest a fix?
Below is my code:
example of fragment file:
example of cell_data columns for sample id (orig.ident) and barcode:
Below is my Jupyter notebook log: