waxeye-org / waxeye

Waxeye is a parser generator based on parsing expression grammars (PEGs). It supports C, Java, JavaScript, Python, Racket, and Ruby.
https://waxeye-org.github.io/waxeye/index.html
Other
235 stars 38 forks source link

Create a grammar file with some common Unicode character classes #53

Open glebm opened 7 years ago

glebm commented 7 years ago

As Unicode character classes change over time, the grammar file should be automatically generated.

For the initial implementation, implement predicates and Unicode general categories supported by the racket standard library: https://docs.racket-lang.org/reference/characters.html

Here is some code that demonstrates iterating over all Unicode codepoints using (make-known-char-range-list).