Closed TomKellyGenetics closed 6 years ago
May be resolved by updating to current version (with thresholds corresponding to internal use of log-p-values). Testing currently underway.
The outputs of predict
and convergence
appear to be consistent upon updating the package on the test server.
On an in-house test dataset:
213/2332 doublets are identified.
However 127/2332 doublets are counted by
convergence
plot.These do not overlap:
The
predict
method doublets are more consistent with those from the Python implementation (which identifies 97/2332 cells as doublets):The "convergence" plot doublets appear to be computed correctly. Running method
predict
as a function onall_log_p_values <- clf$all_log_p_values
returns the same result. Thus the issue appears to be due to passing variables to these functions as their outputs are the same with test inputs.Note: computing doublets from log-p-values (log-p <= log(0.01)) or directly from p-values (p < 0.01) returns the same results in R.