ServiceNow / picard

PICARD - Parsing Incrementally for Constrained Auto-Regressive Decoding from Language Models. PICARD is a ServiceNow Research project that was started at Element AI.
https://arxiv.org/abs/2109.05093
Apache License 2.0
342 stars 122 forks source link

PICARD for Cross-lingual Parsing #97

Closed tomsherborne closed 2 years ago

tomsherborne commented 2 years ago

Hi @tscholak !

Thanks for the really great codebase for PICARD. I'm looking at how PICARD style decoding could help with a parser for multiple languages. I think I might need to re-train the base parser using mT5 instead of English T5. Did you try this?

I am assuming the process would be the same but replaces the pre-trained model in the config if this is for CSPIDER (same SQL, different input utterances). I'm happy to work on a PR to add Cspider as a dataset if I find that this works well. Let me know if this would be valuable.

tscholak commented 2 years ago

Hi @tomsherborne! I haven't tried mT5 or CSpider with this codebase. I think thought that this would be a cool thing to try. Contributions are very welcome!