Making extensible parsers

A bit of context. While working on enaml (https://github.com/nucleic/enaml), which over the time I have maintained it has supported Python 2, 3.4, 3.5, 3.6, 3.7, 3.8, and 3.9 and strives to parse any valid Python, I often had to marginally modify the parser to support either new syntax or changes to AST node creation. Since I was using ply, I would subclass the parser to add or modify rules, or override methods to handle AST node changes.

Using pegen means a new parser would need to be generated for each supported Python version. The grammar files would obviously share a lot of code, and I am wondering if there are ways to limit the duplication. If only the AST nodes change, one can probably alter the subheader to use a different base class for the parser and call a method in the affected rules, but this does not scale to changes in the grammar proper.
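To make the base-class idea concrete, here is a minimal sketch of what I have in mind. It does not use pegen itself: `BaseActions`, `LegacyActions`, and `make_true` are hypothetical names, and the grammar fragment in the comment is only illustrative and not checked against pegen's actual syntax. The assumption is that a rule action delegates node creation to a method, so a version-specific base class can override it.

```python
import ast


class BaseActions:
    """Hypothetical base class holding the AST-building helpers."""

    def make_true(self) -> ast.expr:
        # Recent Python versions represent the literal True as a Constant node.
        return ast.Constant(value=True)


class LegacyActions(BaseActions):
    """Hypothetical variant for a target where True is parsed as a plain name."""

    def make_true(self) -> ast.expr:
        return ast.Name(id="True", ctx=ast.Load())


# In the grammar, the affected rule would call the helper instead of building
# the node inline, for example (illustrative fragment only):
#     atom: 'True' { self.make_true() }
# Swapping the base class named in the subheader would then select the variant.
modern = BaseActions().make_true()
legacy = LegacyActions().make_true()
print(type(modern).__name__, type(legacy).__name__)
```

This only covers differences in the nodes being built. A rule that exists in one Python version but not in another would still need its own grammar text, which is the part I do not see how to share.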

