Closed pbalajiv closed 5 years ago
Dear pbalajiv, Thank you for your interest and question. And sorry for the delay, I had missed this question...
So to reply to your questions:
Indeed in PBMC samples we would except the "other cells" to be of much lower proportions as the reference profiles contain the most important cell types. Here some suggestions of what could be the cause of it:
The signature genes are important because they tell which genes need to be used when doing the deconvolution. EPIC is first using the full set of genes found in the intersection of the bulk samples and of the reference profiles to do an initial normalization based on the library size remaining from this genes' intersection. But then EPIC only keeps the subset of genes defined in the signature genes (both from the bulk samples and the reference profiles). It will then do a least-square optimization on this subset of signature genes in order to estimate the proportions of the various cell types.
As explained in response Nr. 2, EPIC will consider the intersection of the genes defined between the bulk and reference profiles to do a normalization based on this intersect.
I hope this helps and that you find why you have this large amount of other cells in your PBMC data.
Best wishes,
Julien
Hi Thank you for this fantastic work. I had several questions while using the package.
Thanks in advance for the answers.