Open jasmina94 opened 7 years ago
What do you mean with validating the words? If they are spelled correctly or if it are real words?
If spelling is correct. But if there is something for checking if it is a real word, it will also be helpful. @Sicos1977
You can use nhunspell for that --> https://www.nuget.org/packages/NHunspell/
Thank you
From: Eric
Checkout Lucene.Net (.net implementation of Lucene) for fuzzy logic searching. Then perhaps compare the result against a dictionary file that has been preloaded into Lucene.Net.
There are other fuzzy logic services which you can use too like Elastic search. But you may not want to incur the time penalty of using API calls. If your program is in java or .net you can include Lucene library directly into your program.
Eric
Sent from my Galaxy S®III
Hi everyone! I'm doing some project about word recognition based on Tesseract engine. So far, everything work fine. Idea is to have some kind of snipping tool which will give user possibility to make image containing text. After processing image- text is recognized, but now I want to validate it somehow. Is there a way to validate text that engine returns? I know there is Levenstein distance algorithm, but I don't know how to use it. I would compare word by word which Tesseract returns but don't know with what to compare it. Please help me if you have any ideas for solving my problem.
Thank you :)