ridiculousfish / regress

REGex in Rust with EcmaScript Syntax
Apache License 2.0
176 stars 11 forks source link

Correct first-byte computation for interval #74

Closed ridiculousfish closed 12 months ago

ridiculousfish commented 12 months ago

When computing a predicate to match the first byte, we incorrectly assumed that the set of first bytes of a closed interval (in UTF8) are contiguous. But of course this is false. Reimplement this algorithm and add tests.

Fixes #73