Closed violetguos closed 4 years ago
Unfortunately this code base only supports languages right now and it will certainly not be plug-n-chug for chemical reactions. This application area has been brought to our attention though and we are hoping to make the interface generalizable.
If you would like to plow ahead and implement this yourself, there are several moving parts that will need to be modified:
hoo/transformers
branch as this contains the most recent code.aligner
module will need to support the custom tokenizer for chemical reactionsspacy
, which is the library used to annotate language data (e.g., with part of speech/entity information), will need to be replaced with your own annotator for chemical strings, or you can choose to strip this feature out of the system entirely.These are all backend changes. You should only need to change the frontend if:
Hope this helps!
Thank you for the quick reply! I'll fork this and see what I can do!
Hi all,
I am training a transformer model to predict chemical reactions from chemical molecules in string representation.
Does your project support plug-n-chug for models not trained on languages?
If not, any pointers on how I should proceed?