MSingerLab / COMETSC

COMET Single-Cell Marker Detection tool
BSD 3-Clause "New" or "Revised" License
31 stars 7 forks source link

Input matrix - raw or normalized? #2

Closed CodeInTheSkies closed 5 years ago

CodeInTheSkies commented 5 years ago

Hello there! Thanks for your tool, and I'm excited to try it out!

I took a look at your preprint as well as the manual, but I'm still confused as to whether the raw or normalized data is better as the input matrix. Both are accepted, I know. But my question is, is any one input type more preferred (normalized, log-normalized, or raw)?

I have 10X data, processed by Seurat. So, can I give as input to COMET the log-normalized data as pre-processed by Seurat v2?

Any advice would be very helpful!

Thanks!

Cnrdelaney commented 5 years ago

Hello! Thanks for reaching out.

I would say that, in general, the best input would depend on the context of your data. It is indeed true that the tool does not have a preference between the two so the extent to which a run is successful can be determined by the output. If the output with the normalized data does not have strict enough genes for example, count data can be used. To directly answer your question, you can absolutely give the log-normalized data as pre-processed by Seurat as long as the formatting fits the guidelines in the documentation!

Let us know if you run into any troubles.