Open kjappelbaum opened 1 year ago
Need more clarity here. Are you talking about a featurizer that converts from one representation to another (e.g., `SELFIES` to `SMILES`), or about a featurizer that takes in two molecular representations and tells whether they represent the same molecule?
could also be the question: are X and Y the same molecule? Where X and Y are in different representations (or randomized SMILES)
I think the `Comparator` API may be of use here.
yeah, so the basic thing might be solved with a comparator. I think the question here is rather where the sampling will be implemented.
Could you expatiate more on this?
you will only get a meaningful signal if you run this on a set of molecules. And then you have the questions:
could we build something based on the `Comparator` API?
Yes, I'm sure we can. The Comparator exists already, so the harder part is done. All I'd need to figure out is the basis of the comparison.
I think I have a basic solution worked out for this. Expect a PR tomorrow.
needs a bit of thinking for the best design
but, in principle, we can make many pre-training tasks by translating between representations
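To make the sampling question concrete, here is a rough sketch of how "are X and Y the same molecule?" pairs could be drawn from a set of molecules. It is plain Python and does not use the `Comparator` API; the `MOLECULES` table and the `sample_pairs` helper are hypothetical names made up for illustration, and the representations are hand-written rather than generated by a featurizer.

```python
import random

# Hypothetical, hand-written data: each molecule maps to several string
# representations (a SMILES, a reordered SMILES, and in some cases SELFIES).
MOLECULES = {
    "ethanol": ["CCO", "OCC", "[C][C][O]"],
    "benzene": ["c1ccccc1", "C1=CC=CC=C1"],
    "propane": ["CCC", "C(C)C", "[C][C][C]"],
}


def sample_pairs(molecules, n_pairs, seed=0):
    """Return (rep_x, rep_y, is_same) tuples, alternating positives and negatives.

    Assumes every molecule has at least two representations and that
    there are at least two molecules.
    """
    rng = random.Random(seed)
    names = sorted(molecules)
    pairs = []
    for i in range(n_pairs):
        if i % 2 == 0:
            # Positive pair: two different representations of one molecule.
            name = rng.choice(names)
            rep_x, rep_y = rng.sample(molecules[name], 2)
            pairs.append((rep_x, rep_y, True))
        else:
            # Negative pair: one representation each from two different molecules.
            name_x, name_y = rng.sample(names, 2)
            rep_x = rng.choice(molecules[name_x])
            rep_y = rng.choice(molecules[name_y])
            pairs.append((rep_x, rep_y, False))
    return pairs


if __name__ == "__main__":
    for rep_x, rep_y, is_same in sample_pairs(MOLECULES, 4):
        print(f"Are {rep_x} and {rep_y} the same molecule? {is_same}")
```

Keeping the sampling outside the comparison step means the pair generator only needs to know which representations belong together; whether that logic should live in the featurizer or in a separate step is the open design question here.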