Closed sebbASF closed 2 years ago
interesting!
Is this fixing a parsing issue with a real PDF, or just a result of reviewing the codebase for regexp's without anchors?
It was found by looking for missing anchors, but then I did a test which showed at least one of the invalid PDFs matches without the anchor but not with it.
As it happens, that test still completes OK, presumably because of some other error.
Must not match e.g. a1a
Should also speed matching as no need to scan entire token if the first char is no a digit