Use case: human dataset, I have my disease samples, but no matched controls from my own study. The atlas contains n samples. Can I choose a group of these as my control samples?
Using dimensionality reduction on the samples: make a latent space Z of dimensions s x d, where s is the number of samples and d is the number of latent dimensions, to match each sample to the most similar samples in the atlas for comparison.
with PCA on nhood counts
with scVI on nhood counts: can we correct for technical effects?
How do we know when samples are significantly different? Diverging from the atlas background (distribution considering all the cells together). This should give an indication on when matched controls are needed: if the single samples do not agree with general distribution.
Use case: human dataset, I have my disease samples, but no matched controls from my own study. The atlas contains n samples. Can I choose a group of these as my control samples?