labomics / midas

MIT License
40 stars 4 forks source link

How many peaks should we select when using midas? #11

Closed Smilenone closed 1 month ago

Smilenone commented 1 month ago

Usually, for a scATAC-seq data, I have 1M peaks, and I wonder How many peaks should we select when using midas?

zhen-he commented 1 month ago

Please refer to Supplementary Table 13 for the number of peaks for MIDAS as $D^{ATAC}$. I recommend not exceeding 100K peaks to avoid overfitting due to an excessive number of model parameters, especially with fewer cells (e.g., less than 10K). You may consider merging peaks to reduce their number.

Smilenone commented 1 month ago

Thanks, and I met a new problem, I got stuck (for more than one hour) when using model.train(), plz see the picture, can you help? image

JiahaoChow commented 1 month ago

Hi, @Smilenone !

It seems difficult to obtain valid information from the image. Have you successfully run the demo data with the same device? Additionally, please comment out warnings.filterwarnings('ignore') to view the warning messages for further diagnosis of the problem.