Open GERMAN00VP opened 9 months ago
Hi, I guess the problem is the input matrix contains negative values. I guess you z-score scaled after the log-transformation. Why don't you try without z-score scaling so that the matrix contains only non-negative values. By the way, there is an upgrade version of MarkerCount; called HiCAT, https://github.com/combio-dku/hicat/. I suggest to try it as it performs better and is more stable.
If you want to try HiCAT to annotate pancreas tissue, you may use this one. (You can edit the list of markers with yours) cell_markers_rndsystems_with_pancreas_hs.tsv.txt
Hi, I tried to use the non scaled data but i'm still getting the same problem. I also tried the HiCAT package and I got a "LinAlgError: SVD did not converge in Linear Least Squares".
The X data I'm using is (it is a subsample) :
Cells_expression_matrix_subsample.tsv.txt
The cell_markers data I'm using is the one you gave me.
Thank you for your help!
Hi, I looked at your data and find that it is quite different from normal single-cell count matrix (even if it is normalized and log-transformed.) Most of all, all the entries in the data below are all non-zero. Since both hicat and markerCount uses binary information, either express or not, if all the expression value is non-zero (greater than 0), the number of expressed marker genes will be the same for all the cells. I think this was the problem why HiCAT and MarkerCount issued error.
Therefore, I tried as follows
As I said, HiCAT and MarkerCount uses binary information (either expressed or not). And some of expression values must be zeros for the tools to work properly. I hope this resolve your problem.
Yes it resolved my problem, thank you so much for your help!
Hi, I'm using the MarkerCount function to predict the cell type labels in my sc RNA-Seq data, but when I call the function like that:
df_res = MarkerCount( X=X, mkr_mat=marker_matrix, log_transformed = True, verbose = True )
It produces this error message:
ValueError: Input X contains NaN. GaussianMixture does not accept missing values encoded as NaN natively. (...)
I have checked both input data and didnt find NaN values so I dont know what else to do.
My X dataframe looks like this:
And my marker_matrix like this: