Teichlab / Pan_fetal_immune

Collection of scripts for analysis of pan fetal immune atlas
24 stars 6 forks source link

Divide&conquer for annotation #6

Closed emdann closed 3 years ago

emdann commented 3 years ago

The goal is to divide the full atlas into subsets for detailed annotation. We decided to go for a hierarchical approach to achieve higher resolution on the immune cell compartment.

Steps

  1. Make data subsets [notebook]
    • Cluster full atlas at high resolution (res=1.5) and label each cluster with most abundant annotated cell type
    • Group clusters into data "splits"
    • Save log-norm data for each split (so pre-scaling, feature selection, ridge regression etc): saved as /nfs/team205/ed6/data/Fetal_immune/PAN.A01.v01.entire_data_normalised_log.wGut.batchCorrected_20210118.SUBSETNAME.h5ad
  2. Preprocess + batch correct data subset [script]: output is saved as /nfs/team205/ed6/data/Fetal_immune/PAN.A01.v01.entire_data_normalised_log.wGut.batchCorrected_20210118.SUBSETNAME.batchCorrected.h5ad
  3. Visualize results and split more if necessary: see notebooks in notebooks/PFI_subset_EDA

What are the splits See slides illustrating splitting and output

Outstanding problems

@suochenqu @Issacgoh let me know what you think of the results

emdann commented 3 years ago

Notes from meeting: