siddharthst / CoreFunctions

Repository containing common (read random) functions I use - updated rarely.
Mozilla Public License 2.0
0 stars 1 forks source link

Functional categorisation of functions. #2

Open aron0093 opened 5 years ago

aron0093 commented 5 years ago

Creating different files representing specific functional categories to store the functions will make them more accessible. I propose this be done by both analytical purpose aswell as the nature of the function itself.

For example, interactive_visualisation.py might contain functions that result in interactive plots.

@siddharthst What do you think?

siddharthst commented 5 years ago

I agree. However, I believe it should be based on the methods themselves instead of broad categories like interactive visualisation. An example of this would be PCA class - which will do the PCA given pandas data frame and along with that, will also plot the accompanying visualisations. Typical case: http://www.sthda.com/english/articles/31-principal-component-methods-in-r-practical-guide/112-pca-principal-component-analysis-essentials/

aron0093 commented 5 years ago

I agree that we must create classes corresponding to techniques like PCA which input data as pandas frames and the associated object has the processed data as well as data for plotting. It will also have the option to generate the plots.

However these building blocks also need to be combined in terms of the tasks that a biologist needs to perform. So we need also create a dashboard functionality that can take any of the building blocks. E.g. a dashboard that has differential expression analysis, pathway enrichment etc. could be a useful tool to characterise clusters in exploring single cell rnaseq data.

I think developing these pre-built and customisable dashboards might be the key differentiator compared to other packages like scanpy or more general libraries like seaborn.