emres / turkish-deasciifier

Turkish deasciifier in Python based on Deniz Yüret's turkish-mode for Emacs
https://ileriseviye.wordpress.com/tag/turkish-deasciifier/

Slow performance #8

Closed setanarut closed 4 years ago

setanarut commented 4 years ago

Hardware

MacBookPro13,3, quad-core Intel Core i7 at 2.7 GHz, 16 GB memory

Benchmark results

Word count    Character count    Time (seconds)
10000         82236              5.3
20000         176226             23.1
40000         376746             94.3
80000         804532             438.6
100000        1025479            819.4
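
To make the growth rate in these numbers explicit, here is a small sketch that estimates the local scaling exponent k (assuming time grows like n**k) between each pair of adjacent rows. The function name `scaling_exponents` is only illustrative; the data is copied from the benchmark results.

```python
import math

# (word count, seconds) pairs copied from the benchmark results.
results = [
    (10000, 5.3),
    (20000, 23.1),
    (40000, 94.3),
    (80000, 438.6),
    (100000, 819.4),
]


def scaling_exponents(rows):
    """Return the exponent k in time ~ n**k for each pair of adjacent rows."""
    exponents = []
    for (n1, t1), (n2, t2) in zip(rows, rows[1:]):
        # If t = c * n**k, then k = log(t2 / t1) / log(n2 / n1).
        exponents.append(math.log(t2 / t1) / math.log(n2 / n1))
    return exponents


if __name__ == "__main__":
    for k in scaling_exponents(results):
        print(f"{k:.2f}")
```

Every exponent comes out at about 2 or higher, so the run time grows at least quadratically with input size rather than linearly.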

Summary

Run time roughly quadruples each time the input doubles. By these figures, converting a 1000-page book would take about 3 hours, and converting a large, old ASCII website's SQL database would take weeks.

So a progress bar and optimization are needed. A fast text-processing library could be used.
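
One possible workaround, sketched here under assumptions: since the cost per word rises with input size, feeding the text to the converter in small chunks may keep the total time closer to linear, and it gives a natural place to report progress. `convert_in_chunks`, `chunk_lines`, and `report` are hypothetical names, not part of this package; `convert` stands for any callable that takes a string and returns the converted string. Splitting on line boundaries could change the result for characters near a boundary if the converter looks at surrounding context, so the output should be spot-checked.

```python
def convert_in_chunks(text, convert, chunk_lines=200, report=None):
    """Apply `convert` to `text` a few lines at a time.

    `convert` is a placeholder for whatever function does the actual
    deasciification. `report`, if given, is called as
    report(lines_done, lines_total) after each chunk.
    """
    lines = text.splitlines(keepends=True)
    total = len(lines)
    converted = []
    for start in range(0, total, chunk_lines):
        chunk = "".join(lines[start:start + chunk_lines])
        converted.append(convert(chunk))
        if report is not None:
            report(min(start + chunk_lines, total), total)
    return "".join(converted)


if __name__ == "__main__":
    # str.upper is only a stand-in for a real converter.
    sample = "birinci satir\nikinci satir\nucuncu satir\n"
    result = convert_in_chunks(
        sample,
        str.upper,
        chunk_lines=2,
        report=lambda done, total: print(f"{done}/{total} lines"),
    )
    print(result, end="")
```

Whether this helps depends on where the time actually goes inside the converter, so it is worth re-running the benchmark with and without chunking.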