ThePhD / future_cxx

Work done today for the glory of tomorrow - or, C and C++ systems programming papers.
https://thephd.dev/portfolio/standard
46 stars 8 forks source link

cXXXX - Unicode Conversions, Encoding Registries, and More #54

Open ThePhD opened 2 years ago

ThePhD commented 2 years ago

@h-vetinari Take a look: https://thephd.dev/_vendor/future_cxx/papers/C%20-%20Restartable%20and%20Non-Restartable%20Character%20Functions%20for%20Efficient%20Conversions.html

It's got new sections to (hopefully) kill of the wchar_t requirement issues voiced last meeting. I'm getting to work on the benchmarks, and hopefully it'll be published in the Journal of Open Source Software (JOSS) before our next meeting in January.

ThePhD commented 2 years ago

Some initial benchmarks, since I'm gonna need these for the updated paper. I plan on improving cuneicode but, for now, it's easy enough to prove that cuneicode_registry_fast can compete (by a slight sleight-of-hand, but still works nevertheless).

Conversion Tests - UTF-16 to UTF-8 (Well-Formed) Conversion Tests - UTF-16 to UTF-32 (Well-Formed) Conversion Tests - UTF-32 to UTF-8 (Well-Formed) Conversion Tests - UTF-32 to UTF-16 (Well-Formed) Conversion Tests - UTF-8 to UTF-16 (Well-Formed) Conversion Tests - UTF-8 to UTF-32 (Well-Formed)