Language detection buggy?

I just pushed a new tool to gh-pages: https://graphicore.github.io/mdlFontSpecimen/html/language-coverage-details.html

There we can see what the language coverage detection is based on. In lax mode (which we are using), Acme is mainly missing some punctuation: "…" U+2026, "†" U+2020, "‡" U+2021.

Lax mode uses a list of chars that we (I) decided are not important enough to break language support. It looks like this at the moment:

https://github.com/graphicore/specimenTools/blob/master/lib/services/FontsData.js#L155

    FontsData.DEFAULT_LAX_CHAR_LIST = new Set([
        0x2010 // HYPHEN -> we usually use/include HYPHEN-MINUS: 0x002D
      , 0x2032 // PRIME
      , 0x2033 // DOUBLE PRIME
      , 0x27e8 // MATHEMATICAL LEFT ANGLE BRACKET
      , 0x27e9 // MATHEMATICAL RIGHT ANGLE BRACKET
      , 0x2052 // COMMERCIAL MINUS SIGN
    ]);

The question is, of course, whether we can add all chars that break the fun to the lax list and still claim that the language is supported. E.g. I personally use the ellipsis regularly, but hardly ever the dagger and double dagger.
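To make the trade-off concrete, here is a rough sketch of how a lax list could feed into a coverage check. It is not the actual specimenTools code: `missingChars`, `formatCodePoint`, the sample `required` list and the sample `fontChars` set are all made up for illustration. Only the lax list entries are copied from above.

```javascript
// Hypothetical sketch, NOT the real FontsData implementation.
// Entries copied from FontsData.DEFAULT_LAX_CHAR_LIST as quoted above.
const LAX_CHAR_LIST = new Set([
    0x2010 // HYPHEN
  , 0x2032 // PRIME
  , 0x2033 // DOUBLE PRIME
  , 0x27e8 // MATHEMATICAL LEFT ANGLE BRACKET
  , 0x27e9 // MATHEMATICAL RIGHT ANGLE BRACKET
  , 0x2052 // COMMERCIAL MINUS SIGN
]);

// Return the code points a language requires that the font lacks,
// skipping anything on the lax list (pass null for strict mode).
function missingChars(requiredChars, fontChars, laxChars) {
    const missing = [];
    for (const codePoint of requiredChars) {
        if (fontChars.has(codePoint)) continue;
        if (laxChars && laxChars.has(codePoint)) continue;
        missing.push(codePoint);
    }
    return missing;
}

// Format a code point as e.g. "U+2026".
function formatCodePoint(codePoint) {
    return 'U+' + codePoint.toString(16).toUpperCase().padStart(4, '0');
}

// Made-up sample data: a language needing "A", HYPHEN, ellipsis and
// both daggers, and a font that only has "A" and HYPHEN-MINUS.
const required = [0x0041, 0x2010, 0x2026, 0x2020, 0x2021];
const fontChars = new Set([0x0041, 0x002d]);

const strictMissing = missingChars(required, fontChars, null);
const laxMissing = missingChars(required, fontChars, LAX_CHAR_LIST);

// What happens if the daggers are added to the lax list but the ellipsis is not.
const extendedLax = new Set([...LAX_CHAR_LIST, 0x2020, 0x2021]);
const extendedMissing = missingChars(required, fontChars, extendedLax);

console.log('strict:  ', strictMissing.map(formatCodePoint).join(', '));
console.log('lax:     ', laxMissing.map(formatCodePoint).join(', '));
console.log('extended:', extendedMissing.map(formatCodePoint).join(', '));
```

In this sketch a language would count as supported when the returned list is empty, so every char added to the lax list loosens the claim a bit further. Keeping the ellipsis out of the list while adding the daggers would match the usage pattern described above.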

Acme is now down to 4 languages, because "Root" was removed (screenshot: selection_119).
