saeyslab / CytoNorm

R library to normalize cytometry data
33 stars 6 forks source link

What type of data should be used? #32

Open mattvtcea opened 2 years ago

mattvtcea commented 2 years ago

Hi, I would like to apply CytoNorm to my mass-cytometry dataset but I would like to know more about the data used. Do you use raw FCS files (after beads normalization and debarcoding) or do you "clean" your data first (singlets, live cells etc...)?

Best

SofieVG commented 2 years ago

In general, I would recommend to work on clean data, but it is not an absolute requirement. The main things of importance are:

In my experience clustering gives better results on cleaned data, and doublets and dead cells might also differ a bit more between the control samples, resulting in different distributions.

On the other hand, sometimes you want to normalize first to ensure you can have a more robust preprocessing pipeline for cleaning the data. So as long as these main requirements are quite ok, you can also apply it on raw data.

It is important data is transformed for correct clustering and quantile alignment. Either you need to pass a transformList, or you can also transform upfront, save these preprocessed fcs files and set the transformList to NULL.

All the best, Sofie

On Thu, 19 May 2022, 10:45 mattvtcea, @.***> wrote:

Hi, I would like to apply CytoNorm to my mass-cytometry dataset but I would like to know more about the data used. Do you use raw FCS files (after beads normalization and debarcoding) or do you "clean" your data first (singlets, live cells etc...)?

Best

— Reply to this email directly, view it on GitHub https://github.com/saeyslab/CytoNorm/issues/32, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAOS725LUQGIRYIEUL3QEDLVKX5SFANCNFSM5WLNYILQ . You are receiving this because you are subscribed to this thread.Message ID: @.***>