Open ahy1221 opened 7 years ago
Hi, ahy1221
FYI, the consideration is the detection limit of micro arrays. The small values at the tale of the distribution (per sample) after normalization tends to be purely noise. The values 1.5 and 0.75 have thus been chosen based on the size of the lower-end peak. Drawing a violin plot for a reference sample would make things clear.
Huipeng
Hi I am looking at the source code of RCA to learn how the algorithm works. In the
featureConstruct()
function there is a line 38data3[data3<=(sysdata$at)[i]] = (sysdata$at)[i]
assigning the expression value in globalpanel which smaller than 1.5 or 0.75. I am wondering why we should do this cutoff, would that impact the clustering result ?Thanks