cytomining / pycytominer

Python package for processing image-based profiling data
https://pycytominer.readthedocs.io
BSD 3-Clause "New" or "Revised" License
81 stars 36 forks source link

Why is the blocklisted feature list only for Nuclei_* ? #176

Open dlogan opened 3 years ago

dlogan commented 3 years ago

It seems to me that the analogous Cell* and Cytoplasm* features would also have the same issues. Of course, the specific compartments would depend upon which were measured in the particular pipeline. But in general, I doubt that across the channels, for the mostly Correlation and Granularity features, that there would be any biology specific to nuclei that wouldn't also apply to the Cytoplasm and Cell compartments. @gwaygenomics

gwaybio commented 3 years ago

Interesting note!! We derived that blocklisted feature list using features that have previously proven unreliable - maybe the other compartments are somehow more reliable?

Either way, a more in depth study on feature reliability is probably important!

dlogan commented 3 years ago

We're looking at this internally on non-JUMP, but similar data, and I can report in general when that's done.

gwaybio commented 3 years ago

that would be great. Once you determine this, we can make version 2 of: https://figshare.com/articles/dataset/Blacklist_Features_-_Cell_Profiler/10255811 and/or update/modify the blocklist features in this repo.

Happy to also discuss authorship on that resource at a later date.

Either way, I think documenting your approach/method will be important for version 2. I am generally unsatisfied by my current lack of detailed documentation in version 1 ☹️

AnneCarpenter commented 3 years ago

Can anyone report what features are captured by this filter (and which aren't, that would be expected to?)

dlogan commented 3 years ago

@AnneCarpenter They're listed in this link: https://figshare.com/articles/dataset/Blacklist_Features_-_Cell_Profiler/10255811

Basically, various Nuclei[Correlation|Granularity] features. But not Cell[Correlation|Granularity] or Cytoplasm[Correlation|Granularity]*