VirtualFlyBrain / VFB_connect

A library for querying VFB servers (neo4j, owlery, solr)
GNU General Public License v3.0
4 stars 1 forks source link

Tutorial to show off transcriptomics queries and analysis #223

Open dosumis opened 2 months ago

dosumis commented 2 months ago

The problems:

  1. Comparing between datasets is potentially misleading. Differences may be due to differences in
    • sequencing depth - easy to report; easy to use.
    • quality of annotation/clustering - We have no real way to judge this at present. In future we could potentially look at comparisons to split bulk and some kind of Blast between cell types in different datasets (=> some measure of consistency). IN absence of these - enough to report paper
    • origin of cells - sex, tissue etc. report by dataset or cell set?
    • number of cells (can we give a ballpark for what would be a low number that might skew stats?
  2. Users need a quick way to visualise gene expression comparison when filtered down to small numbers of gene by function --> aggregate and sort
  1. Given a set of cell types, which datasets are available that have data on all cell types, at what granularity and how many cells for each type?
  2. If multiple dataset have data on the cell types of interest report: sequencing depth, tissue inputs, number of cells, pub
  3. Query to compare expression between specified cell types on chosen dataset - option to aggregate on