AlexsLemonade / OpenScPCA-analysis

An open, collaborative project to analyze data from the Single-cell Pediatric Cancer Atlas (ScPCA) Portal

Notebook comparing doublet results across methods #499

Open sjspielman opened 3 weeks ago

sjspielman commented 3 weeks ago

Purpose/implementation Section

Please link to the GitHub issue that this pull request addresses.

#446

What is the goal of this pull request?

This PR explores the overlap among each method's doublet calls for each dataset, and also assesses the performance of a "consensus caller."

Briefly describe the general approach you took to achieve this goal.

I wrote a single notebook to process all datasets with three main analysis sections, in addition to a conclusions section at the end:

  1. Upset plot comparing doublet calls
  2. PCA colored by consensus calls
  3. Confusion matrix and associated metric calculations

I also updated the overall module run script to render this notebook as the next step.
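For context, the consensus logic and metric calculations can be sketched roughly as follows. This is just an illustration, not the module's actual code: the module is an R notebook rendered via renv, the function and variable names here are hypothetical, and defining "consensus" as all-methods-agree is an assumption on my part.

```python
import numpy as np

def consensus_calls(calls_by_method):
    """Combine boolean doublet calls across methods.

    Hypothetical sketch: "consensus" here means a droplet is called a
    doublet by every method (one possible definition; the notebook may
    use a different rule).
    """
    stacked = np.vstack(list(calls_by_method.values()))
    return stacked.all(axis=0)

def confusion_metrics(truth, calls):
    """Basic confusion-matrix counts and metrics against ground truth."""
    tp = np.sum(truth & calls)
    fp = np.sum(~truth & calls)
    fn = np.sum(truth & ~calls)
    tn = np.sum(~truth & ~calls)
    precision = tp / (tp + fp) if (tp + fp) else float("nan")
    recall = tp / (tp + fn) if (tp + fn) else float("nan")
    return {"TP": tp, "FP": fp, "FN": fn, "TN": tn,
            "precision": precision, "recall": recall}
```

With an all-methods-agree rule, the consensus set can only shrink relative to any single method's calls, which is consistent with the small consensus sets reported below.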

There are a few other small changes here as well.

If known, do you anticipate filing additional pull requests to complete this analysis module?

Yep.

Results

What is the name of your results bucket on S3?

researcher-654654257431-us-east-2

What types of results does your code produce (e.g., table, figure)?

There are no additional result files, only the rendered notebook which contains all results from this analysis. I directly committed this notebook to the directory where I saved it in the module. Is this ok, or should I export it to results?

What is your summary of the results?

The methods' doublet calls show little agreement with one another, so the consensus call sets are small. The consensus calls also do not appear to be the most accurate.
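One way to make "not much in agreement" concrete is the pairwise Jaccard index over each pair of methods' doublet call sets. A Python sketch of that idea (the notebook is in R and may summarize agreement differently; names here are hypothetical):

```python
from itertools import combinations

def jaccard(a, b):
    """Jaccard index between two sets of barcodes called as doublets."""
    union = a | b
    return len(a & b) / len(union) if union else float("nan")

def pairwise_jaccard(calls):
    """Jaccard for every pair of methods.

    `calls` maps method name -> set of doublet barcodes; values near 0
    indicate little agreement between a pair of methods.
    """
    return {(m1, m2): jaccard(calls[m1], calls[m2])
            for m1, m2 in combinations(sorted(calls), 2)}
```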

Provide directions for reviewers

What are the software and computational requirements needed to be able to run the code in this PR?

The module's renv environment is needed to render this notebook.

Are there particular areas you'd like reviewers to have a close look at?

I suppose I could add more analysis or interpretation to the notebook, but since there really isn't "much of a there there" in these results, as it were, I wasn't sure what else would be useful and informative to include. Do you have any ideas?

Is there anything that you want to discuss further?

-

Author checklists

Check all those that apply. Note that you may find it easier to check off these items after the pull request is actually filed.

Analysis module and review

Reproducibility checklist

sjspielman commented 1 week ago

We're back! I updated code throughout the notebook in response to reviews, including using a 0.5 threshold for cxds and adding more PCAs. While looking at the PCAs, it actually looked to me like cxds was capturing a lot of the consensus false-negative droplets, and those points were missed by scDblFinder and scrublet. Therefore, for this first round back to you, I didn't re-run the analysis with just those two methods. Do you still think it's worth doing?
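For reference, the 0.5 threshold just binarizes the per-droplet cxds scores, along these lines. This is a Python sketch of the idea only; the actual code is R, and it assumes the scores are already scaled to [0, 1]:

```python
def call_doublets(scores, threshold=0.5):
    """Binarize per-droplet doublet scores at a fixed threshold.

    Hypothetical sketch: assumes scores are scaled to [0, 1]; droplets
    at or above the threshold are called doublets.
    """
    return [score >= threshold for score in scores]
```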

Edit: notebook for review convenience! 03_compare-benchmark-results.nb.html.zip

sjspielman commented 3 days ago

The next iteration has finally landed! I've incorporated the conceptual items brought up in review and rearranged the notebook accordingly. Note that I do think this could be more modular, since there is some repeated code between the different types of consensus analyses (all 3 methods vs. only 2 methods), but given where we anticipate this module heading overall, I wasn't sure that was really worth the effort.

Here's a rendered notebook: 03_compare-benchmark-results.nb.html.zip

sjspielman commented 3 days ago

One thought I had is that this could be an optional analysis module that can be used to run doublet detection with three different methods, but nothing more. Contributors can use it if they feel it's necessary for their analysis, but we don't go beyond that.

This actually seems pretty reasonable to me: the end goal of the module would be a utility for folks to run these three methods on an SCE, including the associated results and metadata.