liulab-dfci / TRUST4

TCR and BCR assembly from RNA-seq data
MIT License
272 stars 47 forks source link

trust-stats.py output description #158

Open itagaf opened 1 year ago

itagaf commented 1 year ago

Hi, Where can I find a documentation and/or description for the trust-stats.py output? i.e. what the meaning of each column in the output table. Thanks in advance and thank you for this great tool!

mourisl commented 1 year ago

"Abundance": the total number of CDR3 reads for the chain. "Richness": the number of distinct CDR3s "CPK": number of clonotypes per thousand of CDR3. ( richness / (abundance/1000) ) "Entropy": Shannon entropy of the clonotypes. "Clonality": 1-normalized_Shannon_entropy ( 1-entropy/log(Richness) )

Does this help?