Open noob000007 opened 4 months ago
Oh, do you mean it should be like equation 13 in the sense that there should be a subscript j? Yeah, I think that can be more accurate. Are you talking about the substraction in the first equation?
"First of all, thank you for your response. Secondly, I still have questions about the handling of 'unknown' and 'mask' in formulas 12 and 13 of the article. For 'unknown', does scGPT only input the embedding of gene tokens during processing, rather than the sum of embeddings for gene tokens and their corresponding expression tokens? For 'mask', is it inputting a special expression token called 'mask'?"
"The selected positions with high prediction confidence" is no longer mentioned in the paper.
What does "high prediction confidence" mean, and how does scGPT determine which positions have high prediction confidence?
[ \mathbf{h}^{(i)}_n
] Isn't the shape of ( \mathbf{h} ) ( M \times D )? ( M ) represents the gene input. After passing through the MLP, it should be ( M \times 1 ), right? So how is the formula in the "Learning objective for pretraining" calculated? The dimensions don't match, do they?