`calculate_outcomes` is being run on the output of `collect_tabulated_molecules`, which means % novel and % valid are constant and always equal to 1.
We should either (1) separately provide as input a single sample of 500k "raw" SMILES (i.e., without removing invalid ones or canonicalizing), or (2) just remove these plots.
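To illustrate option (1), here is a minimal sketch of why the two metrics only carry information when computed on raw samples. Everything in it is hypothetical and not taken from the repository: `pct_valid_and_novel`, `toy_canonicalize`, and the toy data are stand-ins, and the "filtered" input assumes the tabulation step keeps only valid, canonical, non-training molecules (which is what a constant % novel of 1 would suggest). In practice the `canonicalize` callable would wrap a real SMILES parser.

```python
from typing import Callable, Iterable, Optional, Set, Tuple


def pct_valid_and_novel(
    raw_smiles: Iterable[str],
    train_canonical: Set[str],
    canonicalize: Callable[[str], Optional[str]],
) -> Tuple[float, float]:
    """Return (fraction valid, fraction of valid molecules absent from training)."""
    total = valid = novel = 0
    for smi in raw_smiles:
        total += 1
        canon = canonicalize(smi)  # None means the string could not be parsed
        if canon is None:
            continue
        valid += 1
        if canon not in train_canonical:
            novel += 1
    frac_valid = valid / total if total else 0.0
    frac_novel = novel / valid if valid else 0.0
    return frac_valid, frac_novel


# Toy stand-in for a real parser (assumption), so the sketch has no dependencies.
_TOY_CANONICAL = {"CCO": "CCO", "OCC": "CCO", "c1ccccc1": "c1ccccc1", "CC": "CC"}


def toy_canonicalize(smi: str) -> Optional[str]:
    return _TOY_CANONICAL.get(smi)


raw_sample = ["CCO", "OCC", "c1ccccc1", "C(C", "CC"]  # "C(C" is invalid
train = {"CCO"}

# Metrics on the raw sample vary with sample quality.
raw_metrics = pct_valid_and_novel(raw_sample, train, toy_canonicalize)

# Mimic a pre-filtered input: only valid, canonical, novel molecules remain.
filtered = sorted(
    {c for c in map(toy_canonicalize, raw_sample) if c is not None and c not in train}
)
filtered_metrics = pct_valid_and_novel(filtered, train, toy_canonicalize)

print(raw_metrics, filtered_metrics)
```

On the pre-filtered input both fractions are 1 by construction, so plotting them adds nothing; on the raw sample they reflect the actual share of invalid and training-set molecules.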