bodo-hugo-barwich / text-sanitizer-rs

Rust Library to convert rich UTF-8 Text into plain ASCII Text
https://crates.io/crates/text-sanitizer
1 stars 0 forks source link

ConversionMap and LanguageMap interface #15

Closed bodo-hugo-barwich closed 1 year ago

bodo-hugo-barwich commented 1 year ago

This is the next step of the implementation of configurability required for reusability as documented at: Configurable Conversion Map New tests extend the existing test suite to prove that the development produces the desired behaviour.

bodo-hugo-barwich commented 1 year ago

the new tests:

check the operativity of the new structures ConversionMap and LanguageMap

bodo-hugo-barwich commented 1 year ago

The profiling test result of \ "Sanitizing a 27KB text document":

real    0m0.002s
user    0m0.002s
sys 0m0.000s
processing time: '.002728892' s

shows that the performance is unaffected or even improved by this architectural change.