Closed agolicz closed 6 months ago
Hi,
Thank you for your interest :)
quote from our paper: We recommend prior preprocessing and normalization of input gene or transcript expression data, including any necessary batch correction.
The data in the tutorial is the TPM matrix and within the package, we usually would like to remove any low-expressed genes across all samples, and TPMcutoff
is the threshold to define low-expressed genes however it's up to the user to decide which input would work best so, in summary, you can pass any form of gene expression data as an input (csv, tsv or AnnData) and if you wish to not remove any low expressed genes you can put TPMcutoff to zero
Ok, thanks. So beyond TPMcutoff there is no assumptions of data being TPM and I can pass anything so long I set TPMcutoff=0. Did I understand correctly?
The only assumption is that you pass an expression data.
TPMcutoff
can also considered as an expression cutoff and you can assign any value that suits your data
Hi, Thanks for the package! I have one question. From the documentation it is not entirely clear to me in which form counts should be passed to PyWGCNA.WGCNA. Ideally I would like to use varianceStabilizingTransformation of DESeq2.
Can I pass vst.csv from code below directly to PyWGCNA.WGCNA as geneExp matrix? From TPMcutoff paramater it looks like it expects TPM data? If that is the case what is your recommended process to get TPM from raw counts?