Closed reubano closed 7 years ago
I'm reading through this one right now. Please correct me if I'm wrong, but it seems like the competition target variable was to predict the probability of the patient having cancer as a whole as opposed to the probability of each individual nodule being cancerous?
It looks like most of these go through the same z-slice normalization, lung segmentation, and nodule detection/isolation steps. But then it's the final feature generation steps that might need to be refactored quite a bit to tackle the new problem statement of predicting the probability of each nodule being cancerous. Is that a fair statement?
A feature such as total_number_of_nodules
(I made that up as an example) aggregated at the patient level might still be important at the nodule level, but it does seem to imply that the goal was slightly different.
Answer posted here.
@tjvananne Did @reubano's link answer your question? :)
@lamby same here - is something preventing this issue from getting closed? :)
Overview
Participants in the Data Science Bowl produced several algorithms that we would like to incorporate. To help facilitate this effort, we also want to add documentation so that contributors can make an educated decision when selecting an algorithm to incorporate.
Expected Behavior
This documentation should enable people to:
Design doc reference: Detect and select
Algorithm info
Technical details
docs folder
Acceptance criteria
NOTE: All PRs must follow the standard PR checklist.