linnarsson-lab / loom-viewer

Tool for sharing, browsing and visualizing single-cell data stored in the Loom file format
BSD 2-Clause "Simplified" License

Data Binning question #69

JobLeonard closed this issue 7 years ago

JobLeonard commented 7 years ago

I need input on what a sensible data representation would be.

I'm trying to make the legend accompanying the sparklines/scatterplots a bit "smarter", basically having sensible defaults (that you can override, don't worry):

image

(aside: I've used an interpolation tool to expand the solar9 gradient into a solar256 gradient, and also added the YlGnBu scale as a Heatmap2 option. Since I'm protanomalous, the latter is easier for me to read)

The issue is that some metadata fields, like _LogCV or _tSNE1 & _tSNE2, tend to end up with (nearly) unique values for each column/row entry. The legend then gets one entry per value, which leads to these silly results:

image

image
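One way to detect this situation is to count the distinct values in a field before building the legend. The following is only a minimal sketch in Python, not code from the viewer: the function name `needs_binning` and the threshold `max_legend_entries` are hypothetical, and the default of 20 is an arbitrary placeholder.

```python
def needs_binning(values, max_legend_entries=20):
    """Return True when a numeric field has more distinct values than a
    per-value legend can reasonably show.

    `max_legend_entries` is a hypothetical threshold, not a value taken
    from the viewer.
    """
    return len(set(values)) > max_legend_entries


# A field with a (nearly) unique value per entry, such as a tSNE coordinate.
tsne_like = [i * 0.37 for i in range(100)]
# A field with only a handful of distinct values, such as a cluster label.
cluster_like = [0, 1, 2, 1, 0] * 20

print(needs_binning(tsne_like))     # many distinct values
print(needs_binning(cluster_like))  # three distinct values
```

Fields that pass this check would keep their per-value legend, while the others would be candidates for binning.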

(also, in the process of opening this issue I realised that gene data is no longer being sorted. Ignore that visual error for now, I'll fix that soon)

There's also the fact that zero values often represent missing data:

image

So I think that it would make sense to "bin" numerical data before creating a legend. Here are my thoughts so far:

Is this a valid, clear way of representing the data, or would you prefer something else? How many bins would you consider sensible? Do you need filtering capabilities for these bins?
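To make the question concrete, here is a minimal sketch of equal-width binning with zeros set aside as "missing". It is written in Python purely for illustration and is not the viewer's implementation. The function name `bin_for_legend`, the default of 8 bins, and the `zero_is_missing` flag are all assumptions; equal-width bins are just one option (quantile bins would be another).

```python
def bin_for_legend(values, n_bins=8, zero_is_missing=True):
    """Group numeric values into equal-width bins for a legend.

    Returns (missing_count, bins), where bins is a list of
    (low, high, count) tuples. All names and defaults are hypothetical.
    """
    # Optionally set zeros aside, since they often stand for missing data.
    present = [v for v in values if not (zero_is_missing and v == 0)]
    missing_count = len(values) - len(present)
    if not present:
        return missing_count, []

    lo, hi = min(present), max(present)
    if lo == hi:
        # Only one distinct value left: a single bin is enough.
        return missing_count, [(lo, hi, len(present))]

    width = (hi - lo) / n_bins
    counts = [0] * n_bins
    for v in present:
        # Clamp the index so the maximum value lands in the last bin.
        idx = min(int((v - lo) / width), n_bins - 1)
        counts[idx] += 1

    bins = [(lo + i * width, lo + (i + 1) * width, counts[i])
            for i in range(n_bins)]
    return missing_count, bins


missing, bins = bin_for_legend([0, 0, 1.0, 2.0, 3.0, 4.0, 5.0], n_bins=4)
print("missing:", missing)
for low, high, count in bins:
    print(f"{low:.2f} to {high:.2f}: {count}")
```

With this shape, the legend would show one entry per bin plus a separate "missing" entry, and filtering on a bin would reduce to a range check on `low` and `high`.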

Thanks for your feedback,

/Job

JobLeonard commented 7 years ago

Conclusion after quick chat with Gioele: "Eh, fixing this would be nice to have, but not high priority"