Open fafa92 opened 5 months ago
Hi @fafa92,
I've reviewed your files and noticed that the counts are not in the format required by CellPhoneDB. Please ensure that any normalisation procedure you use does not transform zeros into any other value. Therefore, you should apply log-normalisation only, without z-scaling.
Hope this helps!
Best, Batu
Hi @cakirb,
I appreciate your suggestion. That solved the problem. I have another question regarding my dataset. my ann data object is 372081 (cells) × 3000 (genes). When I run it with the command below, it takes up all my memory (about 300 GB) until it crashes.
from cellphonedb.src.core.methods import cpdb_statistical_analysis_method
# Define file paths
cpdb_file_path = '/content/cellphonedb.zip'
meta_file_path = '/content/metadata.txt'
counts_file_path = '/content/counts.txt'
output_path = 'cellphonedb_results'
# Run CellPhoneDB statistical analysis
cpdb_results = cpdb_statistical_analysis_method.call(
cpdb_file_path=cpdb_file_path,
meta_file_path=meta_file_path,
counts_file_path=counts_file_path,
counts_data='ensembl',
threshold=0.1,
output_path=output_path,
subsampling_num_cells=100)
When I lowered the number of genes to about 300, it worked but didn't show any significant connection. Is there any way around this problem? Can we run it with as many genes as possible without crashing the system?
If memory is a problem you can downsample the dataset either manually or by using the argument subsampling_num_cells
when running cpdb_statistical_analysis_method.call()
.
Hi,
Thanks for the great package. I'm trying to use cellphone db v5 to find a connection in my dataset. I'm using this code with the dataset that I provided in the link below. I'm getting "No CellphoneDB interactions found in this input" as the output. My input is already log-transformed. Any help would be appreciated.
counts: https://drive.google.com/file/d/1YE-G2LF05uCb7-Cl0IW-9kiKeaj4VWuB/view?usp=sharing metadata: https://drive.google.com/file/d/1GTsiAz9CcJKa1WhQ4ZvQ8wGCxmCaQ3Ml/view?usp=sharing
log output: