how to build lexer supports uncode?

Genivia / RE-flex

A high-performance C++ regex library and lexical analyzer generator with Unicode support. Extends Flex++ with Unicode support, indent/dedent anchors, lazy quantifiers, functions for lex and syntax error reporting and more. Seamlessly integrates with Bison and other parsers.

BSD 3-Clause "New" or "Revised" License

523 stars 86 forks source link

Hello,

I made following change to build wc with unicode support, but when I tried some utf8 input, wc does not report correct number of characters, instead it just report the number of bytes, same as without the option '--unicode'. Is there anything I missed?

Thanks,

$ git diff
diff --git a/examples/Make b/examples/Make
index 4080e27c..474e1049 100644
--- a/examples/Make
+++ b/examples/Make
@@ -326,7 +326,7 @@ calc:               calc.l calc.y
                ./calc < calc.test

 wc:            wc.l
-               $(REFLEX) $(REFLAGS) --flex wc.l
+               $(REFLEX) $(REFLAGS) --flex --unicode wc.l
                $(CXX) $(CXXFLAGS) -o $@ lex.yy.cpp $(LIBREFLEX)

 wcu:           wcu.l

$ echo 好 | ./wc
       1       1       4

Genivia / RE-flex

how to build lexer supports uncode? #184