Open DavidLandup0 opened 1 year ago
Sure, let me know what you want to do... I'm currently preparing a draft for a working paper.
Awesome, thanks! What do you think would be the areas that currently need to be fleshed out more?
I figure that one of the most important things to tune here is the representation produced by SMILES+gzip. Testing augmentations and canonicalization schemes, or simply analyzing the produced representations, may lead to better metrics.
Besides that, a decent analysis section of the representations would be a nice addition, IMO.
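As a starting point for that kind of analysis, here is a minimal sketch of how two SMILES strings could be compared through gzip. It assumes the representation is based on the normalized compression distance (NCD) commonly used in gzip-based classifiers. The helper names `compressed_len` and `ncd` are illustrative, not this repo's API, and the molecules are just examples.

```python
import gzip


def compressed_len(text: str) -> int:
    """Length in bytes of the gzip-compressed UTF-8 encoding of `text`."""
    return len(gzip.compress(text.encode("utf-8")))


def ncd(a: str, b: str) -> float:
    """Normalized compression distance between two strings.

    Assumption: the pair is joined with a single space before compression;
    other separators are equally plausible.
    """
    ca, cb = compressed_len(a), compressed_len(b)
    cab = compressed_len(a + " " + b)
    return (cab - min(ca, cb)) / max(ca, cb)


if __name__ == "__main__":
    aspirin = "CC(=O)OC1=CC=CC=C1C(=O)O"
    caffeine = "CN1C=NC2=C1C(=O)N(C(=O)N2C)C"
    # Two valid SMILES spellings of ethanol: the strings differ even though
    # the molecule is the same, which is why canonicalization may matter.
    ethanol_a, ethanol_b = "CCO", "OCC"

    print("aspirin vs aspirin :", ncd(aspirin, aspirin))
    print("aspirin vs caffeine:", ncd(aspirin, caffeine))
    print("CCO vs OCC         :", ncd(ethanol_a, ethanol_b))
```

Since the distance operates on raw strings, comparing its values before and after canonicalization or SMILES augmentation would be one cheap way to see how sensitive the representation is to the chosen spelling.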
Hey! Could we perhaps collaborate on turning this into a small paper? :)